Developmental genes significantly afflicted by aberrant promoter methylation and somatic mutation predict overall survival of late-stage colorectal cancer

An, Ning; Yang, Xue; Cheng, Shujun; Wang, Guiqi; Zhang, Kaitai

doi:10.1038/srep18616

Article
Open access
Published: 22 December 2015

Scientific Reports volume 5, Article number: 18616 (2015)

Abstract

Carcinogenesis is an exceedingly complicated process involving dysregulation at multiple levels, including genomics (mainly caused by somatic mutation and copy number variation), DNA methylomics and transcriptomics. Examining only one molecular level of cancer is therefore not sufficient to uncover the intricate underlying mechanisms. Using the abundant publicly available data in The Cancer Genome Atlas (TCGA) database, we applied an integrative strategy to systematically analyze the aberrant patterns of colorectal cancer on the basis of DNA copy number, promoter methylation, somatic mutation and gene expression. Paired samples at each genomic level were retrieved to identify differentially expressed genes with corresponding genetic or epigenetic dysregulation. Notably, gene ontology enrichment analysis indicated that the differentially expressed genes with corresponding aberrant promoter methylation and those with somatic mutation were both functionally concentrated on developmental process, suggesting an intimate association between development and carcinogenesis. Thus, by means of random walk with restart, 37 significant development-related genes were retrieved from a biological network based on a priori knowledge. In five independent microarray datasets, Kaplan–Meier survival and Cox regression analyses both confirmed that the expression of these genes was significantly associated with overall survival of Stage III/IV colorectal cancer patients.

Introduction

Colorectal cancer (CRC) is the third most common cancer in men (746,000 cases, 10.0% of the total) and the second in women (614,000 cases, 9.2% of the total) worldwide, accounting for roughly 694,000 deaths per year¹. The initiation of CRC is a highly complicated biological process involving multiple genomic and epigenomic alterations that accumulate over an extended period, usually about a decade². Patient survival is strongly dependent on the tumor stage at the time of diagnosis, and reduced sensitivity to chemotherapy remains a major obstacle to effective treatment of advanced disease. Therefore, the discovery of novel molecules that promote CRC progression and indicate prognostic status is still urgently needed³.

It is widely accepted that carcinogenesis is caused by dysregulation at multiple levels, including genomics [mainly caused by somatic mutation and copy number variation (CNV)]^4,5, DNA methylomics^6,7 and transcriptomics^8,9. CNV plays a significant role in tumorigenesis in many cancers^{10,11,12,13,14}, and its accumulation during oncogenesis might result from preferential selection through which transforming cells gain evolutionary advantages¹⁵. Somatic mutation, together with CNV, could contribute to genomic instability⁴. It could also activate additional downstream pathways in many types of cancer, conferring proliferative advantages^16,17,18. DNA methylation plays an important role in embryonic development¹⁹, aging²⁰ and nearly all types of cancer^21,22,23,24 by influencing DNA and chromatin structure²⁵. Numerous investigations have indicated that dysregulation of the promoter region, especially promoter hypermethylation of tumor suppressor genes, is an essential epigenetic event in carcinogenesis and is relevant to prognostic marker discovery and therapeutic applications^26,27,28,29.

CNV, aberrant promoter methylation and somatic mutation could all influence gene activation or suppression, thereby influencing the process of carcinogenesis. CNVs may alter gene dosage by changing the number of copies of a gene present in the genome^30,31,32,33, which explains why, in most circumstances, CNV and the expression of the corresponding gene are positively correlated in CRC³⁴. Promoter hypomethylation might lead to gene activation and promoter hypermethylation might cause gene suppression³⁵. Somatic mutation of a gene could lead to the activation or suppression of downstream signaling pathways³⁶. For example, in thyroid cancer, somatic mutation of BRAF could activate the MAPK pathway, thereby causing widespread dysregulation of gene activity³⁷.

The multi-level genomic dysregulation observed during carcinogenesis indicates that, when examining the dysregulation of gene expression in cancer, the aberrant patterns at the other molecular levels should also receive considerable attention in order to shed light on the intricate mechanisms underlying cancer initiation and progression. Therefore, integrative analysis of cancer genomics, methylomics and transcriptomics is urgently needed to comprehensively dissect cancer etiology and provide clinical guidance.

The Cancer Genome Atlas (TCGA) project, launched in 2005, is a rich source of publicly available cancer genomic datasets³⁸. Based on its abundant resources of RNA sequencing (RNAseq), DNA sequencing (DNAseq), single nucleotide polymorphism (SNP) array and DNA methylation data, integrative analyses of cancer genomics have emerged rapidly, for instance, in breast cancer³⁹, ovarian cancer⁴⁰, glioma⁴¹, lung cancer⁴², renal cancer⁴³ and many other types of cancer. Multi-dimensional analysis (MDA) of the genome, epigenome and transcriptome has proven greatly beneficial in facilitating the rational deduction of aberrant genes and pathways, delineating subtypes of cancer and promoting the derivation of diagnostic and prognostic signatures that would otherwise be overlooked in investigations of a single genomic dimension⁴⁴. Thus, molecular abnormalities at multiple levels should be considered together to systematically identify genes or pathways critically important in carcinogenesis.

In this study, we first collected genes with significant dysregulation in DNA copy number, DNA promoter methylation, gene expression and somatic mutation from TCGA paired samples. Differentially expressed genes (DEGs) with consistent aberrant promoter methylation and DEGs with somatic mutation were both found to exhibit remarkable functional unity in developmental process. A gene-to-gene regulatory network was constructed by merging the Human Protein Reference Database (HPRD) and Kyoto Encyclopedia of Genes and Genomes (KEGG) networks. By combining multi-dimensional genomic data of CRC with this a priori knowledge network, we applied a computational strategy, i.e. random walk with restart, to obtain the genes that were most affected by aberrant promoter methylation or somatic mutation. Most of these significant genes were connected in the network and proved to hold substantial prognostic information in late stage (Stage III/IV) patients, which might be helpful for constructing prognosis prediction models and providing novel tools to guide clinical practice for this deadly disease.

Material and Methods

A schematic for the study is depicted in Fig. 1.

Data retrieval

The multi-dimensional data of CRC associated datasets were retrieved from The Cancer Genome Atlas (TCGA) database (https://tcga-data.nci.nih.gov/tcga/). Four levels of paired data (cancer and normal adjacent tissues from CRC patients) were downloaded, including 32 paired RNA sequencing level 3 data [raw counts and RNASeq by Expectation Maximization (RSEM) normalized read counts], 500 paired DNA copy number level 3 data [conducted with Affymetrix SNP 6.0 platform and segmented by circular binary segmentation (CBS) method⁴⁵], 45 paired DNA methylation level 3 data [using Illumina HumanMethylation450 chips and the methylation level of each CpG site was calculated as the ratio (β value) of the signal of methylated probes relative to the sum of methylated and unmethylated probes, which ranged continuously from 0 (unmethylated) to 1 (fully methylated)] and somatic mutation level 2 data of 300 patients (mutation information of 17,427 genes).

The raw data for five human CRC mRNA microarray studies with overall survival (OS) information (sample size >60 each, referred to as the Clinicinfo superset; Table 1) were downloaded from the National Center for Biotechnology Information Gene Expression Omnibus (GEO). The flowchart of Clinicinfo dataset retrieval is presented in Supplementary Figure S1. The combined data set contained a total of 940 samples (936 samples with clear OS information) hybridized to probe sets present on both the Affymetrix HG-U133A (GEO accession number GPL96) and HG-U133 Plus 2.0 (GPL570) platforms, and was composed of the data sets with accession numbers GSE39582, GSE17536, GSE29621, GSE39084 and GSE12945. In total, 22,277 probes were common to all data sets; their expression values were computed with the robust multi-array average (RMA) algorithm and further quantile normalized using the “affy” Bioconductor package. The ComBat algorithm was used to remove potential batch effects, and the expression levels of 12,500 genes were obtained as the median value of all the probes that could be mapped to each gene. All clinical information was extracted from the original publications.
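The probe-to-gene step described above (taking the median over all probes mapped to a gene) can be illustrated with a minimal Python sketch. The probe identifiers, gene names and expression values below are hypothetical, and the study itself worked in R/Bioconductor; this is only an illustration of the summarization rule, not the authors' code.

```python
from collections import defaultdict
from statistics import median


def collapse_probes_to_genes(probe_expr, probe_to_gene):
    """Return {gene: [median expression per sample]} from probe-level data.

    probe_expr maps a probe id to a list of expression values (one per sample);
    probe_to_gene maps a probe id to the gene it is annotated to.
    """
    by_gene = defaultdict(list)
    for probe, values in probe_expr.items():
        gene = probe_to_gene.get(probe)
        if gene is not None:  # skip probes without a gene annotation
            by_gene[gene].append(values)
    # zip(*rows) iterates sample by sample across all probes of one gene
    return {
        gene: [median(column) for column in zip(*rows)]
        for gene, rows in by_gene.items()
    }


# Hypothetical toy data: four probes, two samples
probe_expr = {
    "p1": [5.0, 6.0],
    "p2": [7.0, 8.0],
    "p3": [9.0, 4.0],
    "p4": [3.0, 3.5],
}
probe_to_gene = {"p1": "GENE_A", "p2": "GENE_A", "p3": "GENE_A", "p4": "GENE_B"}
gene_expr = collapse_probes_to_genes(probe_expr, probe_to_gene)
```

The median is less sensitive than the mean to a single poorly performing probe, which may be one reason for preferring it when several probes target the same gene.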

Table 1 Colorectal cancer microarray datasets included in survival analysis.

Circos plot of TCGA colorectal data in terms of DNA copy number, DNA methylation and somatic mutation

Colorectal primary tumor datasets in the TCGA database, including 617 DNA copy number profiles, 393 DNA methylation profiles and 300 somatic mutation profiles, were used to construct an integrative Circos plot with the Perl software “Circos” (Fig. 2). The Bioconductor package “cghMCR” was used to compute segment gain or loss (SGOL) scores, which quantify chromosome regions showing common gains or losses by summing the scores across patients. For DNA methylation, the whole genome was segmented into contiguous 500,000 base pair (bp) bins, and the median and 75th percentile of the methylation levels of the CpGs mapped to each bin were plotted. For somatic mutation data, genes with a mutation rate >5% were shown in a scatter plot.
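As an illustration of the methylation binning described above, the following Python sketch assigns CpG sites to contiguous 500,000 bp bins and reports the median and 75th percentile of β values per bin. The CpG coordinates and β values are hypothetical, and the percentile is computed by linear interpolation, which is an assumption; the text does not state which percentile definition was used.

```python
from collections import defaultdict

BIN_SIZE = 500_000  # bin width in base pairs, as stated in the text


def percentile(sorted_values, q):
    """Percentile by linear interpolation between order statistics (an assumption)."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] * (1 - fraction) + sorted_values[upper] * fraction


def summarize_bins(cpgs, bin_size=BIN_SIZE):
    """cpgs: iterable of (chromosome, position, beta).

    Returns {(chromosome, bin_index): (median_beta, percentile75_beta)}.
    """
    bins = defaultdict(list)
    for chrom, pos, beta in cpgs:
        bins[(chrom, pos // bin_size)].append(beta)
    summary = {}
    for key, values in bins.items():
        ordered = sorted(values)
        summary[key] = (percentile(ordered, 0.5), percentile(ordered, 0.75))
    return summary


# Hypothetical CpG sites: five fall in the first bin of chr1, one in the second
cpgs = [
    ("chr1", 10_000, 0.1),
    ("chr1", 250_000, 0.2),
    ("chr1", 499_999, 0.3),
    ("chr1", 120_000, 0.4),
    ("chr1", 300_000, 0.5),
    ("chr1", 500_000, 0.9),
]
bin_summary = summarize_bins(cpgs)
```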

Identification of candidate genes with significant alteration at multi-level

DEGs were identified using the edgeR algorithm⁴⁶ with RNA sequencing raw counts (FDR < 0.01, fold change >2). For DNA copy number data, the Bioconductor package “CNTools” was used to process the segmentation data and format them into a gene-level matrix based on the genomic locations of 26,863 genes. Genes with genomic amplification or deletion were identified with the paired t-test (FDR < 0.001, fold change >1.2). In the methylation analysis, the promoter region was defined as the region from 1,000 bp upstream to 300 bp downstream of the transcription start site (TSS). The β value of the probe mapped to a CpG site located in the promoter region of a given gene was used to quantify the methylation level of this gene. If more than one probe could be mapped to the promoter region of a given gene, the mean value was adopted. In this manner, the methylation levels of 16,996 genes were obtained from the DNA methylation data, and significantly hypermethylated and hypomethylated genes were identified with the paired t-test (FDR < 0.001, fold change >1.5).
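The promoter definition and the per-gene averaging of β values can be sketched in Python as follows. The strand handling (mirroring the window for genes on the minus strand) is an assumption that the text does not spell out, and all coordinates and β values are hypothetical.

```python
def promoter_interval(tss, strand, upstream=1000, downstream=300):
    """Return (start, end) of the promoter window around a TSS.

    Assumption: "upstream" is interpreted relative to the direction of
    transcription, so the window is mirrored for minus-strand genes.
    """
    if strand == "+":
        return tss - upstream, tss + downstream
    return tss - downstream, tss + upstream


def promoter_beta(tss, strand, probes):
    """Mean beta of all probes inside the promoter window, or None if there are none.

    probes: list of (position, beta) pairs on the same chromosome as the gene.
    """
    start, end = promoter_interval(tss, strand)
    inside = [beta for pos, beta in probes if start <= pos <= end]
    return sum(inside) / len(inside) if inside else None


# Hypothetical probes around a TSS at position 10,000
probes = [(9_200, 0.8), (10_100, 0.6), (10_500, 0.1), (8_000, 0.2)]
beta_plus = promoter_beta(10_000, "+", probes)   # window 9,000 to 10,300
beta_minus = promoter_beta(10_000, "-", probes)  # window 9,700 to 11,000
beta_none = promoter_beta(50_000, "+", probes)   # no probe in the window
```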

By virtue of dysregulation pattern at different levels, three groups of candidate genes of interest were collected: (i) genes with differential expression and corresponding copy number alteration (i.e. genes with overexpression and amplification and genes with underexpression and deletion); (ii) genes with differential expression and corresponding promoter methylation (i.e. genes with overexpression and promoter hypomethylation and genes with underexpression and promoter hypermethylation); and (iii) genes with differential expression and somatic mutation.
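The three candidate groups defined above reduce to set intersections and unions. A minimal Python sketch, using hypothetical gene names, is:

```python
def candidate_groups(up, down, amplified, deleted, hypo, hyper, mutated):
    """Build the three candidate gene groups from sets of gene names.

    (i)   expression change with concordant copy number change
    (ii)  expression change with concordant promoter methylation change
    (iii) differential expression together with somatic mutation
    """
    group_i = (up & amplified) | (down & deleted)
    group_ii = (up & hypo) | (down & hyper)
    group_iii = (up | down) & mutated
    return group_i, group_ii, group_iii


# Hypothetical gene sets for illustration only
up = {"G1", "G2", "G3"}
down = {"G4", "G5", "G6"}
amplified = {"G1", "G9"}
deleted = {"G4"}
hypo = {"G2"}
hyper = {"G5", "G6"}
mutated = {"G2", "G6", "G7"}

group_i, group_ii, group_iii = candidate_groups(
    up, down, amplified, deleted, hypo, hyper, mutated
)
shared_ii_iii = group_ii & group_iii  # genes carrying both kinds of dysregulation
```

Note that a gene such as "G9" (amplified but not differentially expressed) or "G7" (mutated but not differentially expressed) is excluded from every group, since all three definitions require differential expression.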

Identification of significant genes through random walk

Gene ontology (GO) enrichment analysis was conducted using the Bioconductor package “clusterProfiler”. The protein–protein interaction network was downloaded from the HPRD database and the KEGG network was constructed with the Bioconductor package “KEGGgraph”. A gene regulatory network was then established by merging the HPRD and KEGG networks; it comprised 10,479 nodes and 60,689 edges after eliminating self-loops and duplicated edges.

Taking advantage of the topology of this knowledge-based network, the random walk algorithm was used to identify the genes most affected by aberrant promoter methylation and somatic mutation⁴⁷. In the network, genes of interest were designated as the information source (i.e., source nodes) and the remaining genes as the information target (i.e., target nodes). The information flow originates from the source nodes and is iteratively and randomly transmitted to neighboring nodes with a probability determined by the network topology. At each step, the information can also flow back to the source nodes with a fixed restart probability. The final steady-state probability assigned to each gene reflects the integrated influence imposed by the source nodes, given the network topology. Formally, the random walk with restart is defined as:

$$p^{t+1} = (1 - r)\,W p^{t} + r\,p^{0}$$

where W is the column-normalized adjacency matrix of the network and p^t is a vector whose elements are the probabilities held by the genes in the network at step t of the iterative process. Source nodes were weighted through the initial probability vector p⁰ (the sum of its elements was equal to 1) and r represents the restart probability (r = 0.7 in this study). All the genes in the network were ranked according to their values in the steady-state probability vector p^∞, which was obtained by iterating until the difference between p^t and p^{t+1} (measured by the L1 norm) was lower than 10⁻¹⁰. To identify genes with a significantly high steady-state probability, 10,000 permutations of node labels (with the network topology kept unchanged) were conducted to calculate the null distribution of the final probability for each gene. The p value was defined as the proportion of permuted values that were greater than the observed final probability. Genes with p < 0.01 were regarded as significantly afflicted by these genetic or epigenetic abnormalities.
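The iteration and the permutation test can be sketched in plain Python on a toy network. The update rule used here, p^{t+1} = (1 − r) W p^t + r p⁰, is the standard form of random walk with restart; the four-node path graph, the number of permutations and the random seed are hypothetical, and shuffling the entries of p⁰ across nodes is one interpretation of "permuting node labels". This is a sketch under those assumptions, not the authors' implementation.

```python
import random


def column_normalize(adjacency):
    """Divide every column of a square adjacency matrix by its column sum."""
    n = len(adjacency)
    col_sums = [sum(adjacency[i][j] for i in range(n)) for j in range(n)]
    return [
        [adjacency[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
        for i in range(n)
    ]


def random_walk_with_restart(adjacency, p0, r=0.7, tol=1e-10, max_iter=10_000):
    """Iterate p <- (1 - r) * W p + r * p0 until the L1 change drops below tol."""
    w = column_normalize(adjacency)
    n = len(p0)
    p = list(p0)
    for _ in range(max_iter):
        nxt = [
            (1 - r) * sum(w[i][j] * p[j] for j in range(n)) + r * p0[i]
            for i in range(n)
        ]
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    raise RuntimeError("random walk did not converge")


def permutation_p_values(adjacency, p0, n_perm=1000, r=0.7, seed=0):
    """Fraction of permutations whose steady-state value exceeds the observed one."""
    rng = random.Random(seed)
    observed = random_walk_with_restart(adjacency, p0, r)
    exceed = [0] * len(p0)
    for _ in range(n_perm):
        shuffled = list(p0)
        rng.shuffle(shuffled)  # reassign the source weights to random nodes
        null = random_walk_with_restart(adjacency, shuffled, r)
        for i, value in enumerate(null):
            if value > observed[i]:
                exceed[i] += 1
    return [count / n_perm for count in exceed]


# Toy path graph 0 - 1 - 2 - 3 with node 0 as the only source node
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
p0 = [1.0, 0.0, 0.0, 0.0]
steady = random_walk_with_restart(adjacency, p0)
p_values = permutation_p_values(adjacency, p0, n_perm=200)
```

With a restart probability as high as 0.7, most of the probability mass stays on or near the source nodes, so the steady-state score decays quickly with network distance from the sources.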

Validation of gene signature’s prognostic value in Clinicinfo superset

To assess the prognostic value of the significant genes we obtained (suppose the signature contains n genes), a risk score formula for predicting OS was developed as a linear combination of the expression levels (x₁, x₂, …, x_n) of a given patient, weighted by the regression coefficients derived from Cox regression analysis. GSE17536 was used as the training cohort for Cox regression model construction and the remaining four Clinicinfo data sets were treated as test cohorts. The regression coefficients β were calculated in the training cohort and the same coefficients were then applied to the test cohorts. The risk score r for Patient j was calculated as follows:

$$r_j = \sum_{i=1}^{n} \beta_i x_{ij}$$

where β_i is the Cox regression coefficient of gene i and x_ij is the expression level of gene i in Patient j.

Five-fold cross validation was also conducted within the training cohort to strengthen the validity of the test. We then divided patients into high-risk and low-risk groups using the median risk score of the gene signature. If the gene signature is closely related to OS, patients with higher risk scores are expected to have significantly poorer OS. Kaplan–Meier survival analysis and the log-rank test were performed to evaluate the prognostic difference between the two risk groups.
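The risk score and the median split can be illustrated with a short Python sketch. The coefficients and expression values are hypothetical (in the study the coefficients come from the Cox model fitted on the training cohort), and assigning scores equal to the median to the low-risk group is an assumption, since the text does not specify how ties are handled.

```python
from statistics import median


def risk_score(coefficients, expression):
    """Linear combination of expression levels weighted by Cox coefficients."""
    return sum(beta * x for beta, x in zip(coefficients, expression))


def split_by_median(scores):
    """Label each patient 'high' or 'low' relative to the median risk score.

    Assumption: a score exactly equal to the median is labeled 'low'.
    """
    cutoff = median(scores.values())
    return {
        patient: ("high" if score > cutoff else "low")
        for patient, score in scores.items()
    }


# Hypothetical three-gene signature and four patients
coefficients = [0.5, -1.0, 2.0]
patients = {
    "P1": [1.0, 1.0, 1.0],
    "P2": [2.0, 0.0, 0.0],
    "P3": [0.0, 1.0, 0.0],
    "P4": [0.0, 0.0, 2.0],
}
scores = {pid: risk_score(coefficients, expr) for pid, expr in patients.items()}
risk_groups = split_by_median(scores)
```

Because the coefficients are fixed after training, the same `coefficients` list would be reused unchanged when scoring patients from a test cohort.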

Results

Collection of genes with somatic mutation, differential expression, DNA copy number and promoter methylation with paired TCGA samples

Owing to the abundant resources of the TCGA database, paired samples of CRC could be used to minimize the effect of inter-individual differences. DEGs, identified using the edgeR algorithm, comprised 1,457 up-regulated genes and 2,584 down-regulated genes (Fig. 3A). In addition, 1,057 genes were significantly amplified and 843 genes were significantly deleted (Fig. 3A). The integrative Circos plot indicated severe copy number alterations in chromosomes 7, 8, 13, 17, 18 and 20, highly consistent with previous investigations^{34,48,49,50,51,52} (Fig. 2). By means of the paired t-test, 1,464 genes with promoter hypermethylation and 498 genes with promoter hypomethylation were also identified (Fig. 3A), and 1,301 genes with a mutation rate >5% were regarded as mutated genes.

Identification of candidate gene groups associated with DNA copy number alterations, promoter methylation and somatic mutation

Three groups of DEGs with corresponding genetic or epigenetic dysregulation (Fig. 3B) were categorized as follows: (i) 104 genes with overexpression and copy number amplification and 95 genes with underexpression and copy number loss (altogether 199 genes, termed Group A); (ii) 46 genes with overexpression and promoter hypomethylation and 522 genes with underexpression and promoter hypermethylation (altogether 568 genes, termed Group B); (iii) 397 genes with somatic mutation and differential expression (115 overexpressed and 282 underexpressed, termed Group C). The genetic and epigenetic dysregulation of the DEGs is shown in Fig. 3C. Consistent with classic knowledge of gene regulation, promoter methylation exerted trans-regulation, while DNA copy number exerted cis-regulation upon gene expression, and the promoters of DEGs tended to be hypermethylated in CRC (Fig. 3C).

The overlap among these three gene groups was examined and the hypergeometric distribution was used to assess its statistical significance. The formula of the hypergeometric distribution is as follows:

$$P(X = x) = \frac{\binom{K}{x}\binom{N-K}{M-x}}{\binom{N}{M}}$$

where N is the number of all DEGs (N = 4041, the background gene number, since all candidate genes were DEGs); K is the number of genes in one target gene group; M is the number of genes in the other target gene group; and x is the number of common genes shared by the two groups. As shown in Supplementary Figure S2, the hypergeometric test indicated that Group A did not significantly overlap with Group B (p = 0.966) or Group C (p = 0.398), whereas Group B significantly overlapped with Group C (n = 107, p = 6.309e-13).
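A hypergeometric overlap test of this kind can be sketched in Python with exact integer binomial coefficients. Treating the reported p value as the upper-tail probability P(X ≥ x) is an assumption; the text gives only the symbols N, K, M and x. The group sizes used in the second call are those reported above for Group B and Group C.

```python
from math import comb


def hypergeom_pmf(x, N, K, M):
    """P(X = x): exactly x shared genes when groups of size K and M come from N genes."""
    return comb(K, x) * comb(N - K, M - x) / comb(N, M)


def overlap_p_value(x, N, K, M):
    """Upper-tail probability P(X >= x) of observing at least x shared genes."""
    return sum(hypergeom_pmf(i, N, K, M) for i in range(x, min(K, M) + 1))


# Small sanity check: two groups of 5 out of 10 genes sharing all 5 genes
p_small = hypergeom_pmf(5, 10, 5, 5)

# Group B (568 genes) versus Group C (397 genes) among 4,041 DEGs, 107 shared
p_bc = overlap_p_value(107, 4041, 568, 397)
```

Under random assignment, about 568 × 397 / 4041 ≈ 56 shared genes would be expected, so an observed overlap of 107 lies far in the upper tail.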

Random walk in developmental process related network

GO analysis of the three gene groups described above found no significantly enriched GO terms for Group A, whereas Group B (Fig. 3D, Supplementary Table S1) and Group C (Fig. 3E, Supplementary Table S2) were both significantly enriched for a variety of GO terms (Bonferroni adjusted FDR < 1e-07). The enriched GO terms were ordered by increasing FDR value and the top 30 GO terms are shown in Fig. 3D,E. All the offspring GO terms of “developmental process” are highlighted in red. Among the top 30 enriched GO terms, 76.67% (23/30) were offspring of “developmental process” for both Group B and Group C. Moreover, 48.33% (232/480, Supplementary Table S1) of Group B genes and 52.39% (186/355, Supplementary Table S2) of Group C genes belonged to this GO term (Fig. 3D,E). Among the 107 genes shared by Group B and Group C, 54.2% (58/107) belonged to the GO term “developmental process”.

Since DEGs with abnormal promoter methylation and DEGs with somatic mutation were both functionally concentrated on developmental process, developmental process related genes (DPRGs, n = 5,161) were extracted from the GO term “GO:0032502”. A developmental process related network (DPRN) was established by extracting the DPRGs and the edges between them from the merged network described above. The biggest connected component (BCC) of the DPRN, containing 3,271 DPRGs and 20,652 edges, was used as the walking graph for the random walk (Fig. 4A). Genes in Group B or Group C that were also present in the BCC were used as source nodes (n = 249). Genes afflicted with only dysregulated promoter methylation or only somatic mutation were scored as 1 and genes afflicted with both abnormalities were scored as 2. The initial probability vector p⁰ (the input of the random walk algorithm) was obtained by normalizing the score vector (n = 249) so that its elements summed to 1. When the steady state was reached, all the genes in the BCC (including the 249 source nodes) were scored with p^∞ (n = 3,271, the output of the random walk algorithm); genes with a significantly high score were thus those most affected by these two types of dysregulation. In this way, 37 genes with a significant steady-state probability were identified through 10,000 permutations (Fig. 4B); algorithmically, these genes received the most influence from source genes with severe genetic and epigenetic dysregulation.
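Two steps of this procedure, extracting the biggest connected component and building the initial probability vector from the 1/2 scores, can be sketched in Python. The edge list and the group memberships are hypothetical, and breadth-first search is simply one common way to find connected components.

```python
from collections import deque


def largest_connected_component(edges):
    """Return the node set of the largest connected component of an undirected graph."""
    neighbors = {}
    for a, b in edges:
        if a == b:  # ignore self-loops
            continue
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    seen, best = set(), set()
    for start in neighbors:
        if start in seen:
            continue
        component, queue = {start}, deque([start])
        while queue:  # breadth-first search from the start node
            node = queue.popleft()
            for nxt in neighbors[node]:
                if nxt not in component:
                    component.add(nxt)
                    queue.append(nxt)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


def initial_probabilities(nodes, group_b, group_c):
    """Score 1 for membership in one group, 2 for both, then normalize to sum to 1.

    Assumes at least one node belongs to a group (otherwise the total is zero).
    """
    scores = {n: int(n in group_b) + int(n in group_c) for n in nodes}
    total = sum(scores.values())
    return {n: s / total for n, s in scores.items()}


# Hypothetical network with two components and hypothetical group memberships
edges = [("G1", "G2"), ("G2", "G3"), ("G3", "G4"), ("G5", "G6")]
bcc = largest_connected_component(edges)
p0 = initial_probabilities(bcc, group_b={"G1", "G2", "G5"}, group_c={"G2", "G4"})
```

In this toy example "G5" belongs to Group B but lies outside the biggest connected component, so it does not become a source node, mirroring the restriction of source nodes to genes present in the BCC.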

Validation of significant genes’ prognostic value via survival analysis

We used GSE17536 in the Clinicinfo superset as the training cohort to fit a Cox regression model with the 37 significant genes and then used the fitted model to compute the risk scores of patients in the test cohorts. Patients in each test data set were further divided into high-risk and low-risk subgroups based on the median of their risk scores. Kaplan–Meier survival analysis was performed to evaluate the actual survival difference between the two risk groups in samples from all American Joint Committee on Cancer (AJCC) stages (Fig. 5A), Stage I/II (Fig. 5B) and Stage III/IV (Fig. 5C) in each data set, respectively. Risk scores calculated in all-stage and Stage I/II samples were not significantly or consistently associated with patients’ OS in either the cross validation of the training cohort or the four individual test cohorts (Fig. 5A,B). However, among Stage III/IV patients, those with higher risk scores tended to have significantly shorter survival than those with lower risk scores. The ability of the risk score to discriminate OS was quite satisfactory in Stage III/IV samples in each data set (Fig. 5C, GSE17536 cross validation, n = 96, p = 0.04; GSE39582, n = 264, p = 0.048; GSE29621, n = 36, p = 0.047; GSE39084, n = 38, p = 0.0093; GSE12945, n = 26, p = 0.18), suggesting that the genes most influenced by promoter methylation dysregulation and somatic mutation probably hold great prognostic value in late stage CRC patients.
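For readers unfamiliar with the Kaplan–Meier estimator used in this comparison, a minimal Python sketch of the product-limit calculation for a single risk group follows. The follow-up times and event indicators are hypothetical, and the log-rank test that compares the two groups is not shown.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times:  follow-up time of each patient
    events: 1 if death was observed at that time, 0 if the patient was censored
    Returns a list of (time, survival probability) at each distinct death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Collect everyone whose follow-up ends at time t
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed  # deaths and censored patients both leave the risk set
    return curve


# Hypothetical follow-up (months) for five patients; 0 marks censoring
times = [5, 8, 12, 12, 20]
events = [1, 0, 1, 1, 0]
km_curve = kaplan_meier(times, events)
```

Censored patients contribute to the number at risk up to their last follow-up but never trigger a drop in the curve, which is why the estimate changes only at the two death times in this example.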

Confirmation of the prognostic value of these 37 genes by means of meta-analysis and Cox regression analysis

Meta-analysis of the 37 significant genes and the risk score in the five Clinicinfo data sets (conducted with the R package “meta”) also confirmed the result of the survival analysis with both a fixed-effect model (Fig. 6A) and a random-effect model (Fig. 6B), corroborating the prognostic value of these significant genes in late stage patients. Fixed-effect and random-effect models are the most commonly used methods for meta-analysis. The two models differ in how they pool the effect sizes obtained from the individual studies into an overall effect size. The fixed-effect model assumes that all studies share a single true effect size, so that differences between studies arise from sampling error alone, whereas the random-effect model assumes that the true effect sizes of the individual studies are themselves “random” quantities drawn from a distribution^53,54. Additionally, an overall concordance index (C-index) analysis was conducted meta-analytically to evaluate the OS predictive ability of the signature⁵⁵, and the result indicated that these 37 genes could significantly predict OS of late stage CRC patients (Supplementary Fig. S3). The Cox proportional hazards regression model was used to evaluate the independence of the prognostic factors in a stepwise manner (Table 2). We collected 122 Stage III/IV samples in the Clinicinfo superset with definite information on OS, age, gender, stage and grade. Univariate Cox regression analysis indicated that stage [hazard ratio (HR): 4.384; 95% confidence interval (CI): 2.671 ~ 7.194; p = 7.894e-09] and the risk score generated by these 37 significant genes (HR: 2.225; 95% CI: 1.740 ~ 2.845; p = 4.047e-10) were significantly associated with patients’ OS. Multivariate Cox analysis indicated that the risk score was an independent prognostic factor (HR: 2.223; 95% CI: 1.739 ~ 2.842; p = 1.831e-10).
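The difference between the two pooling models can be made concrete with a small Python sketch of inverse-variance pooling. The between-study variance is estimated here with the DerSimonian–Laird method, which is an assumption; the text does not say which estimator was used. The effect sizes (for example log hazard ratios) and their variances are hypothetical.

```python
def pool_effects(effects, variances):
    """Inverse-variance pooling under fixed-effect and random-effect models.

    Returns (fixed_estimate, random_estimate, tau2), where tau2 is the
    DerSimonian-Laird estimate of the between-study variance.
    """
    weights = [1 / v for v in variances]
    total_w = sum(weights)
    fixed = sum(w * y for w, y in zip(weights, effects)) / total_w

    # Cochran's Q measures how much the studies disagree beyond sampling error
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, effects))
    c = total_w - sum(w ** 2 for w in weights) / total_w
    tau2 = max(0.0, (q - (len(effects) - 1)) / c)

    # Random-effect weights add tau2 to every within-study variance
    weights_re = [1 / (v + tau2) for v in variances]
    random_est = sum(w * y for w, y in zip(weights_re, effects)) / sum(weights_re)
    return fixed, random_est, tau2


# Two hypothetical studies: a precise one and a less precise one that disagree
effects = [0.2, 0.8]
variances = [0.01, 0.04]
fixed_est, random_est, tau2 = pool_effects(effects, variances)
```

In this example the fixed-effect estimate stays close to the more precise study, while the random-effect estimate moves toward the unweighted mean because the between-study variance makes the weights more equal.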

Table 2 Univariate and multivariate analyses of overall survival in late stage CRC patients.

Discussion

The rapidly growing amount of high-throughput, multi-dimensional genomic data ushers us into a new era in which the tremendously complicated molecular mechanisms of carcinogenesis can be perceived and dissected from a more integrative perspective. In this study, we systematically analyzed CRC genomic data, including CNV, somatic mutation, DNA promoter methylation and gene expression, to discover novel and important molecules and genomic dysregulations in a more comprehensive manner. Paired samples in the TCGA database were used to identify differential gene expression and genetic or epigenetic abnormalities, and three groups of candidate genes with a differential expression pattern and a corresponding upstream dysregulation were collected. GO analysis indicated that the functions of DEGs with abnormal promoter methylation (Group B) and DEGs with somatic mutation (Group C) were both mainly concentrated on developmental process, a biological process whose outcome is the progression of an anatomical structure (which may be a subcellular structure, cell, tissue or organ) or an organism over time from an initial condition to a later condition⁵⁶. Additionally, the DEGs with CNV did not significantly overlap with the other DEG groups, whereas the majority of the significantly overlapping DEGs between Group B and Group C belonged to the GO term “developmental process” (Supplementary Fig. S2). Several of the DEGs shared by Group B and Group C play a pivotal role in both development and carcinogenesis. For instance, germline gain-of-function mutation of ALK could disrupt the development of the central nervous system⁵⁷, and the same anomaly has also been identified in sporadic and familial neuroblastoma cases^58,59,60,61. TIAM1 is expressed in the base of intestinal crypts, where the Wnt-signaling pathway plays a fundamental role in the development and maintenance of normal intestinal physiology⁶². Its expression was greatly elevated in mouse intestinal tumors and human colon adenomas, and the cross-talk between TIAM1 and canonical Wnt-signaling pathways could significantly influence intestinal tumor formation and progression⁶³. Based on the GO and overlap analyses, it is quite plausible that DEGs with aberrant promoter methylation and DEGs with somatic mutation cooperate closely to facilitate the dysregulation of developmental process. DEGs with CNV, however, were not found to be functionally specific for any particular biological process.

More than 150 years have passed since Rudolf Virchow first proposed, in 1858, that neoplasms arise “in accordance with the same law, which regulates development”. Emerging evidence supports the similarity in cellular behavior between ontogenesis and oncogenesis, for instance, in epithelial-to-mesenchymal transition (EMT)⁶⁴, mesenchymal-to-epithelial transition (MET)⁶⁵ and evasion of immune surveillance⁶⁶. Molecular resemblances between certain malignant tumors and developing tissues have been documented on the basis of transcription factor activity⁶⁷, regulation of chromatin structure⁶⁸ and cellular signaling⁶⁹. Important molecules have been reported to play a substantial role in both development and carcinogenesis. For example, PTCH1 is a key regulator of development, and its overexpression could drive skin carcinogenesis⁷⁰. Developmental animal models have been used to uncover the complicated molecular mechanisms of carcinogenesis, leading to the discovery of a variety of novel and pivotal molecules, pathways and biomarkers^71,72,73. Many important signaling pathways activated during development, including the Notch1 signaling pathway, have been shown to be reactivated during carcinogenesis^74,75. In addition, pioneering studies found that the mRNA and microRNA expression profiles of cancer could recapitulate the expression patterns of development^{72,76,77,78,79}. The intimate association between developmental process and carcinogenesis, together with the striking co-occurrence of promoter methylation dysregulation and somatic mutation in developmental process related genes (DPRGs), led us to hypothesize that the DPRGs most affected by aberrant promoter methylation and somatic mutation probably help to explain the underlying mechanism of carcinogenesis and might be closely associated with clinicopathological characteristics such as OS.

In our study, we adopted a simple and effective computational strategy: a random walk starting from DPRGs with aberrant promoter methylation or somatic mutation in the merged HPRD and KEGG biological network. Random walk with restart has been used to decipher gene-disease associations in networks based on a priori knowledge, where its performance was shown to be superior to that of other methods, such as neighborhood approaches^80,81,82. The advantage of this strategy is that it combines observed multi-omic data with a knowledge-based regulatory network, tracing the information flow, which accumulates in the significant genes.

The majority of these significant genes were connected and formed a relatively compact biological module (Fig. 4B), implying close biological associations among them. Many of the significant genes obtained through the random walk algorithm are closely related to the initiation and progression of CRC. TGFBR1 is a central molecule in the TGF-β pathway, and its alteration could strikingly enhance susceptibility to CRC⁸³. High microsatellite instability together with loss of EP300 expression may be a feature of gastric and colorectal cancers⁸⁴. PRKCA and PRKCB are both members of the protein kinase C (PKC) family, which has a role in cell proliferation, differentiation, angiogenesis and apoptosis⁸⁵. PRKCB inhibition by enzastaurin could lead to mitotic missegregation and preferential cytotoxicity toward colorectal cancer cells with chromosomal instability; loss of PRKCA signaling is a general characteristic of colorectal tumors regardless of other underlying genetic defects, pointing to the importance of this pathway⁸⁶.

Since candidate genes were collected based on aberrant patterns in multi-omic level of TCGA genomic data, we used microarray data sets with OS information from GEO database instead of TCGA to test the prognostic value of these significant genes. Recent expression profiling datasets lack of consistent results between the studies due to different technological platforms and lab protocols^87,88 and the microarray expression value of a particular genes could only be calculated based on different type of probes, which could probably compromise the accuracy and robustness of the whole meta-analysis. In addition, the relatively small number of sample size and noisiness of microarray data could cause the inconsistency of biological conclusions. To address these challenges, we collected five Affymetrix microarray data sets (n = 940, each sample number >60) with 22,277 common probes to get robust result of their significant clinical relevance. The expression value of 37 significant genes was retrieved and the prognostic value was evaluated with Cox regression model. The result indicated these 37 genes were significantly associated with OS in late stage (Stage III/IV) patients, rather than early stage (Stage I/II). According to AJCC staging system (7th edition)⁸⁹, the lesion of early stage CRC (Stage I/II) is relatively contained with neither lymph node invasion nor distant metastasis; when tumor advances to late stage (Stage III/IV), the involved area is greatly increased, lymph node is invaded (Stage III/IV) and distant organs might be afflicted via distant metastasis (Stage IV). Because of the small size of tumor involvement, Stage I and Stage II patients only need to receive radical treatment to defuse the peril caused by molecularly chaotic tumors. 
However, as the disease deteriorates, Stage III patients should in principle be treated with neoadjuvant chemoradiation therapy followed by surgery, with or without adjuvant chemotherapy, and patients with Stage IV CRC are primarily treated with chemotherapy, although a selected group of patients can be cured with metastasectomy⁹⁰. Surgical resection of the primary tumor is not beneficial for most Stage IV patients^91,92. Prognostic genes can predict a patient’s OS status, probably by influencing or reflecting tumor encroachment in the patient. If the tumor were completely removed, the expression of this gene signature would probably not predict OS precisely, since the persistent influence of the tumor would end with the excision. Because of the massive tumor involvement and potential metastasis of Stage III/IV CRC, surgical excision in late-stage patients might not remove the tumor, with its extensive molecular dysregulation, as completely as in early-stage patients. The prognostic genes would therefore continue to reflect the interaction between residual neoplasms and the patient, which probably explains why these genes were significantly associated with OS only in late-stage CRC patients.
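
The stage-restricted survival comparison discussed above can be illustrated with a small Kaplan–Meier sketch. This is not the authors' pipeline (they report Kaplan–Meier and Cox regression analyses on the GEO data sets); it only shows the idea of restricting to Stage III/IV patients, splitting them by a signature score, and estimating survival in each group. The cohort, the field names (`stage`, `score`, `months`, `dead`) and the median split are all hypothetical choices made for illustration.

```python
from statistics import median


def kaplan_meier(times, events):
    """Kaplan-Meier estimate as [(time, survival)] at each observed death time.

    times  -- follow-up time for each patient
    events -- 1 if death was observed at that time, 0 if censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = leaving = 0
        # Group all patients who share the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


def split_by_median(patients, key="score"):
    """Split patients into (low, high) groups around the median of `key`."""
    cutoff = median(p[key] for p in patients)
    low = [p for p in patients if p[key] <= cutoff]
    high = [p for p in patients if p[key] > cutoff]
    return low, high


# Hypothetical toy cohort; the values are invented and carry no biological meaning.
cohort = [
    {"stage": "I", "score": 0.1, "months": 70, "dead": 0},
    {"stage": "II", "score": 0.9, "months": 65, "dead": 0},
    {"stage": "III", "score": 0.2, "months": 60, "dead": 0},
    {"stage": "III", "score": 0.3, "months": 48, "dead": 1},
    {"stage": "III", "score": 0.8, "months": 20, "dead": 1},
    {"stage": "IV", "score": 0.4, "months": 55, "dead": 0},
    {"stage": "IV", "score": 0.7, "months": 12, "dead": 1},
    {"stage": "IV", "score": 0.9, "months": 9, "dead": 1},
]

# Keep late-stage patients only, then compare low- and high-score groups.
late = [p for p in cohort if p["stage"] in ("III", "IV")]
low, high = split_by_median(late)
for label, group in (("low score", low), ("high score", high)):
    curve = kaplan_meier([p["months"] for p in group], [p["dead"] for p in group])
    print(label, curve)
```

In a real analysis the two curves would be compared with a log-rank test, and the score would enter a Cox model alongside clinical covariates; both steps are omitted here to keep the sketch dependency-free.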

In summary, taking advantage of the increasing availability of multidimensional genomic data, we collected genes with a high rate of somatic mutation, differential expression, promoter methylation dysregulation and significant CNV, using paired samples in the TCGA database. Three groups of DEGs with corresponding genetic or epigenetic abnormalities were obtained; the GO enrichment and overlap analyses suggested that DEGs with aberrant promoter methylation and DEGs with somatic mutation were both functionally centered on developmental processes. Random walk with restart was used to extract the significant developmental genes most affected by aberrant promoter methylation and somatic mutation in a merged regulatory network. In addition, the significant genes were closely related to the OS of late-stage patients. It is also tempting to speculate that identifying the functional regulators of these genes might greatly benefit the discovery of new drug targets for CRC treatment. We hope that this preliminary exploration will be helpful for further study of cancer etiology and treatment guidance.
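
For readers unfamiliar with random walk with restart, the following minimal sketch shows the standard iteration p(t+1) = (1 − r)·W·p(t) + r·p(0), where W is the column-normalized adjacency matrix of the network, p(0) spreads probability equally over the seed genes and r is the restart probability. It is an illustration under stated assumptions, not the authors' implementation: the toy network, the node names and the restart value of 0.7 are hypothetical.

```python
def random_walk_with_restart(adjacency, seeds, restart=0.7, tol=1e-10, max_iter=1000):
    """Return the steady-state visiting probability of every node.

    adjacency -- dict mapping each node to the set of its neighbours (undirected)
    seeds     -- nodes that share the initial probability equally
    restart   -- probability of jumping back to the seeds at each step (assumed value)
    """
    nodes = sorted(adjacency)
    seeds = set(seeds)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: r * p0.
        nxt = {n: restart * p0[n] for n in nodes}
        # Walk term: each node passes (1 - r) of its mass equally to its neighbours.
        # Nodes without neighbours pass nothing on, so any mass they hold is lost.
        for u in nodes:
            neighbours = adjacency[u]
            if not neighbours:
                continue
            share = (1.0 - restart) * p[u] / len(neighbours)
            for v in neighbours:
                nxt[v] += share
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p


# Hypothetical toy network: a simple path A - B - C - D with A as the only seed.
toy_network = {
    "A": {"B"},
    "B": {"A", "C"},
    "C": {"B", "D"},
    "D": {"C"},
}

scores = random_walk_with_restart(toy_network, seeds=["A"])
# Rank nodes by proximity to the seed set.
for node, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(node, round(score, 4))
```

Nodes closer to the seeds receive higher steady-state probabilities, which is what allows candidate genes to be ranked by their network proximity to a seed set.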

Additional Information

How to cite this article: An, N. et al. Developmental genes significantly afflicted by aberrant promoter methylation and somatic mutation predict overall survival of late-stage colorectal cancer. Sci. Rep. 5, 18616; doi: 10.1038/srep18616 (2015).

References

Ferlay, J. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 136, E359–86, 10.1002/ijc.29210 (2015).
Han, D. et al. Long noncoding RNAs: novel players in colorectal cancer. Cancer Lett. 361, 13–21, 10.1016/j.canlet.2015.03.002 (2015).
Gonzalez-Pons, M. & Cruz-Correa, M. Colorectal Cancer Biomarkers: Where Are We Now? Biomed Res Int, Artn 149014, 10.1155/2015/149014 (2015).
Ferguson, L. R. et al. Genomic instability in human cancer: Molecular insights and opportunities for therapeutic attack and prevention through diet and nutrition. Semin. Cancer Biol. 10.1016/j.semcancer.2015.03.005 (2015).
Taylor, B. S. et al. Integrative genomic profiling of human prostate cancer. Cancer Cell 18, 11–22, 10.1016/j.ccr.2010.05.026 (2010).
Shenker, N. & Flanagan, J. M. Intragenic DNA methylation: implications of this epigenetic mechanism for cancer research. Br. J. Cancer 106, 248–53, 10.1038/bjc.2011.550 (2012).
Akhavan-Niaki, H. & Samadani, A. A. DNA methylation and cancer development: molecular mechanism. Cell Biochem. Biophys. 67, 501–13, 10.1007/s12013-013-9555-2 (2013).
Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486, 346–52, 10.1038/nature10983 (2012).
Domany, E. Using High-Throughput Transcriptomic Data for Prognosis: A Critical Overview and Perspectives. Cancer Res. 74, 4612–4621, 10.1158/0008-5472.Can-13-3338 (2014).
Leary, R. J. et al. Integrated analysis of homozygous deletions, focal amplifications and sequence alterations in breast and colorectal cancers. Proc. Natl. Acad. Sci. USA 105, 16224–16229, 10.1073/pnas.0808041105 (2008).
Despierre, E. et al. Somatic copy number alterations predict response to platinum therapy in epithelial ovarian cancer. Gynecol. Oncol. 135, 415–422, 10.1016/j.ygyno.2014.09.014 (2014).
Xu, H. T. et al. Non-invasive Analysis of Genomic Copy Number Variation in Patients with Hepatocellular Carcinoma by Next Generation DNA Sequencing. Journal of Cancer 6, 247–253, 10.7150/Jca.10747 (2015).
Silveira, S. M. et al. Genomic screening of testicular germ cell tumors from monozygotic twins. Orphanet J. Rare Dis. 9, Artn 181, 10.1186/S13023-014-0181-X (2014).
Horpaopan, S. et al. Genome-wide CNV analysis in 221 unrelated patients and targeted high-throughput sequencing reveal novel causative candidate genes for colorectal adenomatous polyposis. Int. J. Cancer 136, E578–E589, 10.1002/Ijc.29215 (2015).
Liang, L., Fang, J. Y. & Xu, J. Gastric cancer and gene copy number variation: emerging cancer drivers for targeted therapy. Oncogene, 10.1038/onc.2015.209 (2015).
Davies, M. A. & Samuels, Y. Analysis of the genome to personalize therapy for melanoma. Oncogene 29, 5545–5555, 10.1038/Onc.2010.323 (2010).
Jiang, B. H. & Liu, L. Z. PI3K/PTEN signaling in tumorigenesis and angiogenesis. Biochim. Biophys. Acta 1784, 150–8, 10.1016/j.bbapap.2007.09.008 (2008).
Yuan, T. L. & Cantley, L. C. PI3K pathway alterations in cancer: variations on a theme. Oncogene 27, 5497–5510, 10.1038/Onc.2008.245 (2008).
Law, J. A. & Jacobsen, S. E. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet 11, 204–220, 10.1038/Nrg2719 (2010).
Yuan, T. et al. An integrative multi-scale analysis of the dynamic DNA methylation landscape in aging. PLoS Genet. 11, e1004996, 10.1371/journal.pgen.1004996 (2015).
Docherty, S. J., Davis, O. S. P., Haworth, C. M. A., Plomin, R. & Mill, J. DNA methylation profiling using bisulfite-based epityping of pooled genomic DNA. Methods 52, 255–258, 10.1016/j.ymeth.2010.06.017 (2010).
Laird, P. W. The power and the promise of DNA methylation markers. Nat Rev Cancer 3, 253–266, 10.1038/Nrc1045 (2003).
Costello, J. F. & Plass, C. Methylation matters. J. Med. Genet. 38, 285–303, 10.1136/Jmg.38.5.285 (2001).
Baylin, S. B. Tying it all together: Epigenetics, genetics, cell cycle and cancer. Science 277, 1948–1949, 10.1126/science.277.5334.1948 (1997).
Akhavan-Niaki, H. & Samadani, A. A. DNA Methylation and Cancer Development: Molecular Mechanism. Cell Biochem. Biophys. 67, 501–513, 10.1007/s12013-013-9555-2 (2013).
De Carvalho, D. D. et al. DNA Methylation Screening Identifies Driver Epigenetic Events of Cancer Cell Survival. Cancer Cell 21, 655–667, 10.1016/j.ccr.2012.03.045 (2012).
Deckers, I. A. et al. Promoter Methylation of CDO1 Identifies Clear-Cell Renal Cell Cancer Patients with Poor Survival Outcome. Clin. Cancer Res., 10.1158/1078-0432.CCR-14-2049 (2015).
Busche, S. et al. Integration of High-Resolution Methylome and Transcriptome Analyses to Dissect Epigenomic Changes in Childhood Acute Lymphoblastic Leukemia. Cancer Res. 73, 4323–4336, 10.1158/0008-5472.Can-12-4367 (2013).
Choudhury, J. H. & Ghosh, S. K. Promoter Hypermethylation Profiling Identifies Subtypes of Head and Neck Cancer with Distinct Viral, Environmental, Genetic and Survival Characteristics. PLoS ONE 10, e0129808, 10.1371/journal.pone.0129808 (2015).
Perry, G. H. et al. Diet and the evolution of human amylase gene copy number variation. Nat. Genet. 39, 1256–60, 10.1038/ng2123 (2007).
Mills, R. E. et al. Mapping copy number variation by population-scale genome sequencing. Nature 470, 59–65, 10.1038/nature09708 (2011).
Conrad, D. F. et al. Origins and functional impact of copy number variation in the human genome. Nature 464, 704–12, 10.1038/nature08516 (2010).
Redon, R. et al. Global variation in copy number in the human genome. Nature 444, 444–54, 10.1038/nature05329 (2006).
Ali Hassan, N. Z. et al. Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues. PLoS ONE 9, e92553, 10.1371/journal.pone.0092553 (2014).
Ouadid-Ahidouch, H., Rodat-Despoix, L., Matifat, F., Morin, G. & Ahidouch, A. DNA methylation of channel-related genes in cancers. Biochim. Biophys. Acta, 10.1016/j.bbamem.2015.02.015 (2015).
Hanahan, D. & Weinberg, R. A. Hallmarks of Cancer: The Next Generation. Cell 144, 646–674, 10.1016/j.cell.2011.02.013 (2011).
Xing, M. Molecular pathogenesis and mechanisms of thyroid cancer. Nat Rev Cancer 13, 184–99, 10.1038/nrc3431 (2013).
Tomczak, K., Czerwinska, P. & Wiznerowicz, M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp Oncol (Pozn) 19, A68–77, 10.5114/wo.2014.47136 (2015).
Cancer Genome Atlas Research Network. Comprehensive molecular portraits of human breast tumours. Nature 490, 61–70, 10.1038/nature11412 (2012).
Cancer Genome Atlas Research Network. Integrated genomic analyses of ovarian carcinoma. Nature 474, 609–15, 10.1038/nature10166 (2011).
Cancer Genome Atlas Research Network. Corrigendum: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 494, 506, 10.1038/nature11903 (2013).
Cancer Genome Atlas Research Network. Comprehensive genomic characterization of squamous cell lung cancers. Nature 489, 519–25, 10.1038/nature11404 (2012).
Cancer Genome Atlas Research Network. Comprehensive molecular characterization of clear cell renal cell carcinoma. Nature 499, 43–9, 10.1038/nature12222 (2013).
Chari, R., Coe, B. P., Vucic, E. A., Lockwood, W. W. & Lam, W. L. An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer. BMC Syst. Biol. 4, 67, 10.1186/1752-0509-4-67 (2010).
Olshen, A. B., Venkatraman, E. S., Lucito, R. & Wigler, M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5, 557–72, 10.1093/biostatistics/kxh008 (2004).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140, 10.1093/bioinformatics/btp616 (2010).
Kohler, S., Bauer, S., Horn, D. & Robinson, P. N. Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet. 82, 949–958, 10.1016/j.ajhg.2008.02.013 (2008).
Nakao, M. et al. Identification of DNA copy number aberrations associated with metastases of colorectal cancer using array CGH profiles. Cancer Genet. Cytogenet. 188, 70–6, 10.1016/j.cancergencyto.2008.09.013 (2009).
Lassmann, S. et al. Array CGH identifies distinct DNA copy number profiles of oncogenes and tumor suppressor genes in chromosomal- and microsatellite-unstable sporadic colorectal carcinomas. J Mol Med (Berl) 85, 293–304, 10.1007/s00109-006-0126-5 (2007).
Jones, A. M. et al. Array-CGH analysis of microsatellite-stable, near-diploid bowel cancers and comparison with other types of colorectal carcinoma. Oncogene 24, 118–29, 10.1038/sj.onc.1208194 (2005).
Alcock, H. E., Stephenson, T. J., Royds, J. A. & Hammond, D. W. Analysis of colorectal tumor progression by microdissection and comparative genomic hybridization. Gene Chromosome Canc 37, 369–80, 10.1002/gcc.10201 (2003).
Lipska, L. et al. Tumor markers in patients with relapse of colorectal carcinoma. Anticancer Res. 27, 1901–5 (2007).
Helfenstein, U. Data and models determine treatment proposals–an illustration from meta-analysis. Postgrad. Med. J. 78, 131–4 (2002).
Senn, S. Trying to be precise about vagueness. Stat. Med. 26, 1417–1430, 10.1002/sim.2639 (2007).
Pencina, M. J. & D’Agostino, R. B. Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat. Med. 23, 2109–2123, 10.1002/sim.1802 (2004).
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
de Pontual, L. et al. Germline gain-of-function mutations of ALK disrupt central nervous system development. Hum. Mutat. 32, 272–6, 10.1002/humu.21442 (2011).
Chen, Y. et al. Oncogenic mutations of ALK kinase in neuroblastoma. Nature 455, 971–4, 10.1038/nature07399 (2008).
Mosse, Y. P. et al. Identification of ALK as a major familial neuroblastoma predisposition gene. Nature 455, 930–5, 10.1038/nature07261 (2008).
Janoueix-Lerosey, I. et al. Somatic and germline activating mutations of the ALK kinase receptor in neuroblastoma. Nature 455, 967–70, 10.1038/nature07398 (2008).
George, R. E. et al. Activating mutations in ALK provide a therapeutic target in neuroblastoma. Nature 455, 975–8, 10.1038/nature07397 (2008).
Clarke, A. R. Wnt signalling in the mouse intestine. Oncogene 25, 7512–21, 10.1038/sj.onc.1210065 (2006).
Malliri, A. et al. The Rac activator Tiam1 is a Wnt-responsive gene that modifies intestinal tumor development. J. Biol. Chem. 281, 543–548, 10.1074/jbc.M507582200 (2006).
Nieto, M. A. Epithelial Plasticity: A Common Theme in Embryonic and Cancer Cells. Science 342, 708-+, 10.1126/science.1234850 (2013).
Eastham, A. M. et al. Epithelial-mesenchymal transition events during human embryonic stem cell differentiation. Cancer Res. 67, 11254–11262, 10.1158/0008-5472.Can-07-2253 (2007).
Ridolfi, L., Petrini, M., Fiammenghi, L., Riccobon, A. & Ridolfi, R. Human embryo immune escape mechanisms rediscovered by the tumor. Immunobiology 214, 61–76, 10.1016/j.imbio.2008.03.003 (2009).
Hartwell, K. A. et al. The Spemann organizer gene, Goosecoid, promotes tumor metastasis. Proc. Natl. Acad. Sci. USA 103, 18969–18974, 10.1073/pnas.0608636103 (2006).
Sparmann, A. & van Lohuizen, M. Polycomb silencers control cell fate, development and cancer. Nat Rev Cancer 6, 846–856, 10.1038/nrc1991 (2006).
Liu, S. L. et al. Hedgehog signaling and Bmi-1 regulate self-renewal of normal and malignant human mammary stem cells. Cancer Res. 66, 6063–6071, 10.1158/0008-5472.Can-06-0054 (2006).
Kang, H. C. et al. Ptch1 overexpression drives skin carcinogenesis and developmental defects in K14Ptch(FVB) mice. J. Invest. Dermatol. 133, 1311–20, 10.1038/jid.2012.419 (2013).
Kho, A. T. et al. Conserved mechanisms across development and tumorigenesis revealed by a mouse development perspective of human cancers. Genes Dev. 18, 629–640, 10.1101/Gad.1182504 (2004).
Liu, H. Y., Kho, A. T., Kohane, I. S. & Sun, Y. Predicting survival within the lung cancer histopathological hierarchy using a multi-scale genomic model of development. PLoS Med. 3, 1090–1102, Artn E232 10.1371/Journal.Pmed.0030232 (2006).
Kaiser, S. et al. Transcriptional recapitulation and subversion of embryonic colon development by mouse colon tumor models and human colon cancer. Genome Biol. 8, Artn R131, 10.1186/Gb-2007-8-7-R131 (2007).
Rhim, A. D. & Stanger, B. Z. Molecular Biology of Pancreatic Ductal Adenocarcinoma Progression Aberrant Activation of Developmental Pathways. Development, Differentiation and Disease of the Para-Alimentary Tract 97, 41–78 (2010).
Hu, H., Zhou, L., Awadallah, A. & Xin, W. Significance of Notch1-signaling pathway in human pancreatic development and carcinogenesis. Appl. Immunohistochem. Mol. Morphol. 21, 242–7, 10.1097/PAI.0b013e3182655ab7 (2013).
Hu, M. & Shivdasani, R. A. Overlapping gene expression in fetal mouse intestine development and human colorectal cancer. Cancer Res. 65, 8715–22, 10.1158/0008-5472.CAN-05-0700 (2005).
Borczuk, A. C. et al. Non-small-cell lung cancer molecular signatures recapitulate lung developmental pathways. Am. J. Pathol. 163, 1949–1960, 10.1016/S0002-9440(10)63553-5 (2003).
Kho, A. T. et al. Conserved mechanisms across development and tumorigenesis revealed by a mouse development perspective of human cancers. Genes Dev. 18, 629–40, 10.1101/gad.1182504 (2004).
Monzo, M. et al. Overlapping expression of microRNAs in human embryonic colon and colorectal cancer. Cell Res. 18, 823–833, 10.1038/Cr.2008.81 (2008).
Navlakha, S. & Kingsford, C. The power of protein interaction networks for associating genes with diseases. Bioinformatics 26, 1057–1063, 10.1093/bioinformatics/btq076 (2010).
Wang, X. J., Gulbahce, N. & Yu, H. Y. Network-based methods for human disease gene prediction. Brief Funct Genomics 10, 280–293, 10.1093/Bfgp/Elr024 (2011).
Zhang, C. L. et al. Identification of miRNA-Mediated Core Gene Module for Glioma Patient Prediction by Integrating High-Throughput miRNA, mRNA Expression and Pathway Structure. PLoS ONE 9, e96908, 10.1371/journal.pone.0096908 (2014).
Xu, Y. & Pasche, B. TGF-beta signaling alterations and susceptibility to colorectal cancer. Hum. Mol. Genet. 16 Spec No 1, R14–20, 10.1093/hmg/ddl486 (2007).
Kim, M. S., Lee, S. H. & Yoo, N. J. Frameshift mutations of tumor suppressor gene EP300 in gastric and colorectal cancers with high microsatellite instability. Hum. Pathol. 44, 2064–70, 10.1016/j.humpath.2012.11.027 (2013).
Ali, A. S., Ali, S., El-Rayes, B. F., Philip, P. A. & Sarkar, F. H. Exploitation of protein kinase C: a useful target for cancer therapy. Cancer Treat. Rev. 35, 1–8, 10.1016/j.ctrv.2008.07.006 (2009).
Hao, F. et al. Protein kinase Calpha signaling regulates inhibitor of DNA binding 1 in the intestinal epithelium. J. Biol. Chem. 286, 18104–17, 10.1074/jbc.M110.208488 (2011).
Chen, R. et al. A Meta-analysis of Lung Cancer Gene Expression Identifies PTK7 as a Survival Gene in Lung Adenocarcinoma. Cancer Res. 74, 2892–2902, 10.1158/0008-5472.Can-13-2775 (2014).
Goonesekere, N. C. W., Wang, X. S., Ludwig, L. & Guda, C. A Meta Analysis of Pancreatic Microarray Datasets Yields New Targets as Cancer Genes and Biomarkers. PLoS ONE 9, e93046, 10.1371/journal.pone.0093046 (2014).
Edge, S. B. & American Joint Committee on Cancer. AJCC cancer staging manual, xiv, 648 p. (Springer, New York, 2010).
Ahmed, S., Johnson, K., Ahmed, O. & Iqbal, N. Advances in the management of colorectal cancer: from biology to treatment. Int. J. Colorectal Dis. 29, 1031–42, 10.1007/s00384-014-1928-5 (2014).
Benoist, S. et al. Treatment strategy for patients with colorectal cancer and synchronous irresectable liver metastases. Br. J. Surg. 92, 1155–60, 10.1002/bjs.5060 (2005).
Galizia, G. et al. First-line chemotherapy vs bowel tumor resection plus chemotherapy for patients with unresectable synchronous colorectal hepatic metastases. Arch. Surg. 143, 352–8, discussion 358; 10.1001/archsurg.143.4.352 (2008).

Acknowledgements

We thank the families for their participation in this project. This work was supported by the National High Technology Research and Development Program of China (SS2014AA020801) and the Sci-Tech Development Program of Beijing (D121100004712002) received by K.Z.

Author information

An Ning and Yang Xue contributed equally to this work.

Authors and Affiliations

State Key Laboratory of Molecular Oncology, Department of Etiology and Carcinogenesis, Peking Union Medical College & Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, 100021, China
Ning An, Xue Yang, Shujun Cheng & Kaitai Zhang
Department of Endoscopy, Cancer Hospital, Chinese Academy of Medical Sciences, Beijing, 100021, China
Guiqi Wang

Authors

Ning An
Xue Yang
Shujun Cheng
Guiqi Wang
Kaitai Zhang

Contributions

All authors have made substantial contributions to the conception and design of the study. N.A. and X.Y. contributed to protocol design, search, data extraction, quality assessment, statistical analysis and writing the article. G.W., S.C. and K.Z. contributed to study design, interpretation of data and revision of the article. All authors have seen and approved the final version.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Figure S1-3

Supplementary Table S1-2

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 06 September 2015
Accepted: 19 November 2015
Published: 22 December 2015
DOI: https://doi.org/10.1038/srep18616
