Comprehensive transcriptome profiling of Taiwanese colorectal cancer implicates an ethnic basis for pathogenesis

Wu, Shao-Min; Tsai, Wen-Sy; Chiang, Sum-Fu; Lai, Yi-Hsuan; Ma, Chung-Pei; Wang, Jian-Hua; Lin, Jiarong; Lu, Pei-Shan; Yang, Chia-Yu; Tan, Bertrand Chin-Ming; Liu, Hsuan

doi:10.1038/s41598-020-61273-y

Download PDF

Article
Open access
Published: 11 March 2020

Comprehensive transcriptome profiling of Taiwanese colorectal cancer implicates an ethnic basis for pathogenesis

Shao-Min Wu ORCID: orcid.org/0000-0003-1611-9463¹^na1,
Wen-Sy Tsai²^na1,
Sum-Fu Chiang^2,3,
Yi-Hsuan Lai^1,4,
Chung-Pei Ma^1,4,
Jian-Hua Wang¹,
Jiarong Lin⁵,
Pei-Shan Lu⁵,
Chia-Yu Yang^1,5,6,7,
Bertrand Chin-Ming Tan^1,4,8,9 &
…
Hsuan Liu^1,2,5,10

Scientific Reports volume 10, Article number: 4526 (2020) Cite this article

4151 Accesses
8 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Colorectal cancer (CRC) is one of the most commonly diagnosed cancers worldwide. While both genetic and environmental factors have been linked to the incidence and mortality associated with CRC, an ethnic aspect of its etiology has also emerged. Since previous large-scale cancer genomics studies are mostly based on samples of European ancestry, the patterns of clinical events and associated mechanisms in other minority ethnic patients suffering from CRC are largely unexplored. We collected 104 paired and adjacent normal tissue and CRC tumor samples from Taiwanese patients and employed an integrated approach – paired expression profiles of mRNAs and microRNAs (miRNAs) combined with transcriptome-wide network analyses – to catalog the molecular signatures of this regional cohort. On the basis of this dataset, which is the largest ever reported for this type of systems analysis, we made the following key discoveries: (1) In comparison to the The Cancer Genome Atlas (TCGA) data, the Taiwanese CRC tumors show similar perturbations in expressed genes but a distinct enrichment in metastasis-associated pathways. (2) Recurrent as well as novel CRC-associated gene fusions were identified based on the sequencing data. (3) Cancer subtype classification using existing tools reveals a comparable distribution of tumor subtypes between Taiwanese cohort and TCGA datasets; however, this similarity in molecular attributes did not translate into the predicted subtype-related clinical outcomes (i.e., death event). (4) To further elucidate the molecular basis of CRC prognosis, we developed a new stratification strategy based on miRNA–mRNA-associated subtyping (MMAS) and consequently showed that repressed WNT signaling activity is associated with poor prognosis in Taiwanese CRC. In summary, our findings of distinct, hitherto unreported biosignatures underscore the heterogeneity of CRC tumorigenesis, support our hypothesis of an ethnic basis of disease, and provide prospects for translational medicine.

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Article 18 April 2024

Massively parallel screen uncovers many rare 3′ UTR variants regulating mRNA abundance of cancer driver genes

Article Open access 18 April 2024

Deep whole-genome analysis of 494 hepatocellular carcinomas

Article 14 February 2024

Introduction

Colorectal cancer (CRC) is one of the most prevalent types of malignancies worldwide¹. Despite surgical resection and advances in radiotherapy and chemotherapy, it remains the third leading cause of cancer mortality globally². The progression of CRC follows the adenoma–carcinoma sequence, developing from normal mucosa into adenoma and eventually into malignant adenocarcinoma³. The etiology of this malignancy can be classified as either hereditary, which represents about 10%–15% of the overall incidence and is attributable to mutation in APC or DNA mismatch repair (MMR) genes, or a more prevalent sporadic type, which is characterized by chromosomal instability (CIN), microsatellite instability (MSI), or CpG island methylator phenotype pathways^4,5. Interestingly, several other risk factors of CRC have been documented, such as alcohol intake, obesity, smoking, sedentary lifestyle, a diet low in fruit and vegetables, and consumption of red meat^6,7, suggesting an environmental and possibly an ethnic basis of pathogenesis.

MicroRNAs (miRNAs) are a class of small, noncoding single-stranded RNAs that are initially transcribed by RNA polymerase II and then subject to an elaborate maturation process. miRNAs are known to exert posttranscriptional gene silencing via sequence complementary targeting to the 3’ UTR of target mRNA, with the aid of the RISC complex^8,9,10. Several miRNAs are reportedly associated with CRC progression. These include the tumor-suppressive let-7 that targets the KRAS transcript¹¹; the APC-targeting miR-135 that triggers activation of WNT signaling and chromosomal instability¹²; the microsatellite instability-promoting miR-155 that downregulates the multiple mismatch repair genes MLH1, MSH2, and MSH6¹³; and the DNA methyltransferase-inhibiting miR-29, miR-143, and miR-342, which collectively alter overall DNA methylome profiles and consequently contribute to misexpression of key tumor suppressors and oncogenes^14,15,16. However, given that malignant transformation typically involves molecular perturbations at the system level, biological relevance of miRNAs is best understood in the context of miRNA–mRNA regulatory networks. To this end, paired expression profiles of miRNAs and mRNA in tumor specimens are desired to enable the construction and interrogation of regulatory networks implicated in tumorigenesis.

Genome instability is considered a cancer hallmark that causes genome rearrangements, such as inversion, translocation, deletion, and duplication. When these genomic alterations take place between two genes, it may result in the generation of fusion genes that potentially produce chimeric proteins^17,18. Some fusion partner genes harbor a tyrosine kinase domain and exhibit a consecutive kinase activity, and this may constitute an oncogenic property that further drives cancer progression (e.g., ALK, NTRK, EGFR, and RET)^19,20. Owing to their intrinsic kinase activity, these fusion genes can also be turned into a drug-actionable target using kinase inhibitor^{19,21,22,23,24}. Several prevalent fusion genes in CRC, such as ALK, RET, ROS1, RSPO3, TCF7L1, and TCF7L2, have been previously described to impact CRC progression^{25,26,27,28,29,30}. The number of fusion genes observed in CRC is comparatively lower than in other cancers¹⁹. However, these fusion genes may play a large role in cancer progression, thus holding great potential as a therapeutic target. Hence, further investigation in the prevalence, relevance, and characteristics of fusion genes in CRC is necessary.

To systematically categorize tumor stage and progression at the molecular level, several proposed cancer subtyping systems have centered on mutational or transcriptomic profiles^31,32. Given that cancer is a disease of genome instability, cancer mutation signatures represent an effective means to stratify patient samples. Such stratification has been known to impact the expectancy of survival^33,34,35. In addition to mutation signatures, distinct transcriptome alterations associated with tumorigenesis also constitute an informative reporter of clinical and biological characteristics, warranting the development of transcriptome-based subtyping systems^36,37. An indicator of the emerging prevalence and reliability of transcriptome-based subtyping systems is the increasing number of studies that have established and applied transcriptome-based subtyping systems to molecularly stratify and understand the clinical outcome of CRC^38,39,40,41. Among these schemes, the Colorectal Cancer Subtyping Consortium (CRCSC) utilized data from six CRC studies and developed a robust classification system with four consensus molecular subtypes (CMSs)³⁶. Each of the four subtypes is characterized by distinct expression profiles of oncogenic/tumor suppressive genes and/or pathways, mutation states of particular genes, and MSI. Importantly, these subtypes also correlate with patient survival – CMS1 exhibits poor survival after relapse, whereas CMS4 shows dismal prognosis – strengthening the translational potential of this system in prognosticating CRC. Interestingly, however, CMS subtyping of a Japanese cohort revealed a worse survival outcome in CMS1⁴², which is a different result from previous observations and thus reinforces the ethnic or regional basis of pathogenesis.

Both genetic and environmental factors have been attributed to the pathogenesis of colorectal cancer (CRC). Given the recent upsurge in CRC incidence in different countries, an ethnic aspect of its etiology has emerged but remains largely unexplored. To address this issue, we collected 104 paired and adjacent normal tissue and CRC tumor samples from Taiwanese patients and deciphered their transcriptomic signatures. Comprehensive transcriptome sequencing analysis was combined with the analyses of profiles of RNAs, microRNAs (miRNAs), miRNA–mRNA network, fusion genes, and cancer subtypes. The comparison of transcriptomic profiles between The Cancer Genome Atlas (TCGA) and Taiwanese CRC samples reveals a moderate extent of correlated expression. We also identified novel CRC-associated fusion genes. Intriguingly, in comparison with the TCGA dataset, the Taiwanese CRC patient cohort exhibited a similar distribution of subtype assignment but showed distinct clinical outcomes among subtypes. To further investigate the miRNA–mRNA regulation network underlying the poor outcome, we constructed patient groupings according to miRNA–mRNA-associated subtyping (MMAS), which was previously applied in other cancer types^43,44,45, We then revealed based on the identified regulons that subtype-specific poor outcome is associated with repressed WNT signaling. Viewed together, our study identified several molecular and clinical distinctions in the Taiwanese CRC samples compared with the samples from Western countries, providing mechanistic insights into the nature of the disease and a strong support for an ethnic basis in CRC tumorigenesis. In summary, our findings of distinct, hitherto unreported biosignatures support our hypothesis of an ethnic basis of CRC tumorigenesis and further provide mechanistic insights into the nature of the disease and translational medicine potential of the new system.

Results

Comprehensive profiling of Taiwanese CRC transcriptomes reveals cohort-specific alterations

To comprehensively catalog the dysregulated transcriptomic alterations underlying ethnically or regionally specific CRC, we first recruited ethnic Taiwanese patients admitted to the Chang Gung Memorial Hospital. The clinical characteristics and demographics of our cohort are outlined in Table 1, which were further analyzed for their association with the overall survival (OS) and disease-free survival (DFS) of the patients (Table 2). Univariate analysis then revealed that TNM staging (P value = 0.0362 and 0.0341 for DFS and OS, respectively) and metastasis lymph node number (P value = 0.00652 and 0.0114 for DFS and OS, respectively) were significantly associated with DFS and OS and that such histological grades could be a risk factor for OS. Next, matched pairs of CRC tumors and adjacent normal tissues from the same patients (n = 104) were collected and subjected to both RNA-seq and small RNA-seq analyses. Detailed statistics of sequencing data are summarized in Figure S1A–B, Additional file 1. We generated 36.3 ± 10.5 million and 22.2 ± 10.7 million mean mapped read counts for RNA-seq and small RNA-seq, respectively. To control for the quality of our deep sequencing results, we first called variants from all samples and analyzed them by BAM-matcher, which examines sample pairing on the basis of the fraction of common SNPs among variants⁴⁶ (Additional file 1: Figure S1C). The results subsequently showed that the samples were properly paired and suitable for further analysis.

Table 1 Clinical characteristics of CRC patients in this study.

Full size table

Table 2 Univariate analysis of disease free survival (DFS) or overall survival (OS).

Full size table

Next, principal component analyses (PCA) of the RNA-seq and small RNA-seq data were performed to comparatively characterize the overall transcriptome profiles. To this end, the PCA plots reveal distinct expression profiles corresponding to the disease states (Additional file 1: Figure S1D–E). Genes exhibiting CRC-associated differential expression patterns were identified using DESeq. 2, which yielded 7,394 (3,884 upregulated and 3,510 downregulated) mRNAs and 318 (210 upregulated and 108 downregulated) miRNAs differentially represented in the CRC tumor vs. normal tissues (|fold change | >2, FDR < 0.001, Additional file 2: Tables S1–2). The overall distributions of these transcriptome changes in relation to various clinical attributes were further characterized by hierarchical clustering and shown by a heatmap (Fig. 1A,B). Given that tumorigenic progression is typically associated with alterations in molecular pathways^47,48, we next sought to explore dysregulated pathways in Taiwanese CRC based on our RNA-seq data. Pathway analysis of the upregulated genes using Gene Set Enrichment analysis (GSEA) reveals significant enrichment in several pathways, such as the cell cycle, DNA replication, and WNT signaling pathway (Fig. 1C, Additional file 2: Table S3). Conversely, downregulated genes were enriched in factors associated with oxidative phosphorylation, Parkinson’s disease, and starch and sucrose metabolism pathways (Fig. 1D).

Concurrent profiling of expressed mRNAs and miRNAs constitutes a strong basis for the in-depth assessment of the regulatory relationships between miRNAs and mRNAs in Taiwanese CRC transcriptomes. Toward this end, we retrieved miRNA–target interactions (MTIs) from miRTarBase, which archives experimentally validated miRNA–target interaction networks⁴⁹; we also retrieved computational MTIs from TargetScan and integrated both experimental and computational data to generate a reliable miRNA–target prediction ref. ⁵⁰. By cross-referencing MTIs with DEMs and the targeted genes identified in our cohort (|fold change | >2, FDR < 0.001; r < 0, P value <0.05), a global representation of miRNA–target interaction networks in Taiwanese CRC was constructed, in which 67 miRNAs and 1,529 mRNAs interconnected to form 2,166 unique miRNA–mRNA regulatory pairs. To further understand the potential outcome of these coordinated transcriptome alterations, we performed gene set over-representation analysis on miRNA-targeted mRNAs in these regulatory hierarchies and further discovered that mRNAs targeted by upregulated DEMs in these MTIs are enriched in genes involved in the pathways of cancer, gastric acid secretion, and inflammatory mediator regulation of TRP channels (Fig. 1E, Additional file 2: Table S4). Conversely, mRNAs targeted by downregulated DEMs were enriched in the components of cancer microRNAs, cellular senescence, and EGFR tyrosine kinase inhibitor resistance (Fig. 1F, Additional file 2: Table S4).

Our miRNA–mRNA networks illustrate thousands of nodes and edges representing complex interactions. However, critical nodes and edges within the networks remained largely elusive. It has been reported that genes with higher degrees of connections within a network may potentially play important biological roles^51,52. Therefore, to identify the critical subnetworks underlying CRC progression, we next focused on genes targeted by more than two miRNAs. We first selected network nodes that were either DEGs or DEMs, based on the dysregulated miRNAs revealed by differential expression analysis (Fig. 1A,B). These nodes were constructed into miRNA–mRNA subnetworks, with each node size being assigned a betweenness centrality score (Fig. 1G–I), which weighs the cluster connectivity and potential impact of a node⁵². Interestingly, several drug actionable genes were identified in this subnetwork, such as CKD6, CCND1 (PALBOCICLIB and Ribociclib), and VEGFA (AFLIBERCEPT)⁵³. Taken together, these observations illustrate the extensive and distinct fluctuations in the miRNA–mRNA regulatory cascades associated with Taiwanese CRC, with potential impact on the expression and outcome of certain tumorigenesis-associated molecular pathways.

Comparison between Taiwanese CRC data and The Cancer Genome Atlas (TCGA) reveals regional differences

We next compared the overall transcriptomic profiles of our Taiwanese cohort with the TCGA dataset, which comprises data from patients primarily of European ancestry. For this purpose, RNA-seq data for the 50 pairs of normal tissue and matched tumors of colon adenocarcinoma and rectum adenocarcinoma were downloaded from TCGA for transcriptome profiling using the same pipeline. Differential expression analysis first uncovered 5,617 DEGs, of which 1,997 and 3,620 genes were upregulated and downregulated, respectively (Additional file 2: Table S5). While there was a moderate extent of expression correlation between the two datasets, as shown by the scatter plot of Fig. 2A, genes that were significantly expressed (FDR < 0.001) exhibited markedly different patterns between the TCGA and Taiwanese data (R² = 0.479, Fig. 2A). Further comparison at the gene level shows that a sizable fraction of the DEGs was uniquely detected in either dataset (Fig. 2B), indicating prominent levels of distinction at the gene level. Interestingly, GSEA analysis demonstrated a high level of similarity in the overall pathway enrichment between the two CRC datasets (Additional file 1: Figure S2A–B, Additional file 2: Table S3 and S6). In particular, components of the tumor- or CRC-associated molecular pathways⁵⁴, such as cell cycle, DNA replication, mismatch repair, starch and sucrose metabolism, drug metabolism, and cytochrome p450 pathways, were dysregulated in both datasets (Additional file 2: Table S3 and S6). Nonetheless, we noticed differential representation of pathways between cohorts, such as those related to ECM receptor interaction, gap junction, and focal adhesion; these pathways were comparatively upregulated in the Taiwanese dataset (Fig. 2C–D, Additional file 1: Figure S2C–E). Comparison of the disturbed genes and pathways of the TCGA and Taiwanese datasets showed a certain degree of similarity, suggesting that the overall progression of CRC could be similar for both groups. However, metastasis-associated pathways, ECM receptor interaction, gap junction, and focal adhesion showed distinct patterns between the two datasets^55,56,57. Furthermore, the proportion of stage 4 Taiwanese CRC patients is lower than that reflected in TCGA (16% and 12.5% of stage 4 patients for TCGA and Taiwanese datasets, respectively.). This suggests a possibility of distinct development of metastasis between the cohorts represented by these two datasets.

Novel gene fusion events in Taiwanese CRC

Tumor cells are prone to gene fusion events owing to their intrinsically unstable genomes. Mounting evidence indicates that these fusion genes are associated with oncogenic properties and are thus therapeutically actionable targets^17,20. Therefore, we next identified the fusion genes in Taiwanese CRC by analyzing the RNA-seq data using the STAR-Fusion tool⁵⁸. To this end, we filtered candidate fusion genes by the following criteria: junction read counts (greater than five), in-frame fusion, and absence in the matched normal tissue. A total of 22 fusion genes were identified in 16 of the patient samples (Additional file 2: Table S7). Given that clinical attributes, such as MSI, are potentially correlated with the incidence of fusion genes⁵⁹, we next examined this possibility. However, we did not identify any preponderance of clinical attributes (Table 3) among these fusion-positive individuals.

Table 3 Association between fusion genes and clinical attributes.

Full size table

We then focused on known gene fusion events, as their detection is essentially more reliable than those of novel fusion genes. By installing additional filters in our selection, we thus excluded events not annotated in the STAR-Fusion databases (Fig. 3A). In addition, we annotated fused genes encoding the kinase domain, owing to the importance of kinases-mediated signaling in tumorigenesis (Additional file 2: Table S7)⁶⁰. As positive controls for this analysis, we analyzed two CRC-associated recurrent fusions, PTPRK-RSPO3 and TPM3-NTRK1, in three of our patient samples (Fig. 3A)^27,29. Interestingly, our cohort did not express other known CRC-associated fusion events, such as those involving ALK, RET, ROS1, and TCF7L2^25,28,30, implying a cohort-specific genomic alteration. We also identified three fusion events, RPS19-CEACAM5, TBC1D15-RAB21, and TNIP1-ANXA6, in which one partner has been implicated in non-CRC tumorigenesis^61,62,63. Interestingly, we identified two novel CRC fusions, ERBB2-PPP1R1B and FGFR3-TACC3, comprising genes encoding the kinase domain (Fig. 3A). To further validate these structurally anomalous transcripts, we conducted RT-PCR assays and Sanger sequencing for the fusion pairs PTPRK-RSPO3, TMP2-NTRK, FGFR3-TACC3, RPS19-CEACAM5, TC1D15-RAB21, and ERBB2-PPP1R1B, and successfully confirmed their expression in the corresponding samples (Fig. 3B–N).

Transcriptome-based subtyping of Taiwanese CRC patients

During the progression of tumor growth, cellular heterogeneity arises as a result of diverse mutation signatures, expression profiles, and tumor malignancies, contributing to the differentiation of tumor subtypes among patients. Importantly, distinct subtypes of tumors are highly correlated with disease outcome and can potentially result in variable responses to therapies^37,38,42. Therefore, relating tumor subtypes to clinical relevance can help illuminate disease mechanisms and develop precision medicine for CRC. To address this issue, CRCSC previously developed a robust molecular signature approach for subtyping CRC, resulting in the identification of four clinically relevant CMS for CRC³⁶. Another approach, the CRCA, collected two datasets and used NMF to define a 786-gene classifier for assigning samples into five subtypes based on the cell types of the colon crypt³⁷. The distributions of patients, molecular signatures, and survival outcome for each subtype were well documented by these studies. However, since the profiling data were obtained from patients of predominantly European ancestry, whether this classification scheme could be similarly applied to patients of other ethnicities remains unknown.

To test this possibility, we first applied the CRCSC and CRCA^36,37 systems to assign subtypes among the Taiwanese CRC patients based on the nearest template prediction, resulting in 88.5% and 91.3% of patients being successfully assigned, respectively (Additional file 1: Figure S3A–B). These approaches have previously shown that CMS4 and stem-like subtypes express signatures of EMT and TGF-β signaling that correspond to worse OS or DFS survival^36,37. We then examined whether Taiwanese CRC patients exhibit similar subtype-specific outcomes by performing OS and DFS survival and Cox proportional hazards analyses on the CRCSC and CRCA assigned patients (Additional file 1: Figure S3C–F). However, the stratified survival analysis did not reveal analogous patterns of patient survival in our cohort; in fact, no significant difference in survival outcome was observed among subtypes. We further compared the overall distribution of CRCSC and CRCA assigned subtypes between Taiwanese and TCGA patients. Our results indicate that while the assigned distributions by CRCSC and CRCA are similar between the two datasets (SD = 3.5% and 2.9%, respectively; Additional file 2: Table S8), the prognosis of each subtype was largely different between the two datasets (SD = 13% and 10.3%, respectively; Additional file 2: Table S8). For instance, whereas the CMS4 and stem-like subtypes in the TCGA dataset are prone to death, such is not the case in the Taiwanese dataset. This observation implies that cohort-specific, distinct molecular pathways are associated with the subtype-specific outcomes. Nevertheless, it remains formally possible that other factors such as distributions of CRC stages might also contribute to the inter-cohort differences in subtype-associated disease outcome.

Because CRCSC and CRCA classification approaches did not show consistent subtype-specific outcomes for the Taiwanese CRC patients (Additional file 1: Figure S3C–F), we next devised an independent classification scheme based on our multidimensional transcriptome data, aiming toward a robust and biologically relevant discrimination in terms of miRNA–mRNA regulation. To this end, we used NMF to classify tumor samples on the basis of mRNA and miRNA expression profiles and screened for subtypes differing in terms of OS or DFS (see Methods) and concordant sample grouping in both RNA molecule data. The most optimal NMF classification results are based on a 400-mRNA panel and a 50-miRNA panel, each of which clustered samples into two distinct groups, subtypes 1 and 2 (Additional file 1: Figure S4A–B, Additional file 2: Tables S9, 10). Moreover, both of the mRNA and miRNA subtype classifiers show that subtype 1 is linked to worse OS outcome (Fig. 4A,C, mRNA and miRNA log-rank test P value = 0.0243 and 0.014, respectively), but not in DFS outcome (log-rank test P value = 0.291 and 0.94, respectively; Fig. 4B,D). Total of 33 and 32 patients were concordantly assigned by both datasets to subtypes 1 and 2, respectively, as shown by a contingency table of sample groupings from the NMF classification (Table 4). Fisher’s exact test (P value = 0.01815) further indicates a significant extent of correlation between the mRNA- and miRNA-based groupings. Based on this concordance, we regrouped the patients according to the new “miRNA–mRNA-associated subtypes” (or MMAS) 1 and 2, whereas the remaining patients with ambivalent classification were defined as non-MMAS. A stratified survival analysis on the new MMASs again showed the worst OS outcome in MMAS-1 (but not DFS) among the three groups (Fig. 4E–F). The Cox proportional hazards analysis also shows that MMAS-1 exhibits considerably poorer outcome than MMAS-2 (Fig. 4E). In summary, these results have uncovered clinically relevant transcriptome signatures that represent key determinants underlying survival outcome of CRC patients, further supporting the notion that miRNA–mRNA molecular interactions contribute to the survival outcome of CRC patients.

Table 4 Contingency table of samples groupings from NMF classification using miRNA and mRNA data.

Full size table

Characterization of NMF-assigned subtypes shows differential representation of signatures

To further characterize the differences between the subtypes, we examined the distribution of sample attributes among MMASs. We first discovered that histological grade was significantly associated with MMASs, but did not detect any association for tumor localization, sex, and TNM staging (Fig. 5A–D). Given the distinct clinical outcomes, we hypothesized that transcriptome profiles might also be altered between these MMASs. Therefore, we performed subtype-specific differential expression analysis, resulting in the identification of 5,387 DEGs and 331 DEMs (|fold change | >1.5, FDR < 0.05, Additional file 1: Figure S5A–B; Additional file 2: Table S11, 12). Next, we examined CRC-associated signatures among patients by using ssGSEA (Fig. 5E)⁶⁴. The PCA reveals MMAS-1 and MMAS-2 clustered into two definitive groups, in contrast to the scattered pattern of the “non-MMAS subtype,” suggesting that MMASs exhibit distinct CRC-associated signatures (Fig. 5E–F). The PC1 and PC2 derived from variable loadings of PCA could generate vectors, of which the direction delineates signatures for the given samples. Interrogating the variable loadings of PCA identified attributes, such as late TA, CDX2 up, WNT-repressed, CRC stem down, and gastrointestinal pathways as the major variable pathways expressed in the MMAS-1 patients (Fig. 5F, Additional file 1: Figure S5C). In contrast, the MMAS-2 patients are enriched in WNT-induced and CRC stem-up pathways (Fig. 5F, Additional file 1: Figure S5C). In line with the potentially perturbed WNT signaling, our analysis further shows that WNT signaling and the MYC and CRC stem-down pathways were differentially expressed to a significant extent between MMASs (Fig. 5G). To examine the robustness of our MMAS approach, we repeated the above analyses with resampling of 80% samples and event sampling ratio of 5 (OS). We found that profiles of resampled data consistently corresponded to repressed WNT signaling pathways and poor outcome for OS in MMAS-1 (Additional file 1: Figure S6). Viewed together, these results reinforce the clinical and molecular distinctions between MMASs and further demonstrated that WNT signaling pathway is the predominant pathway discriminating between the MMASs.

While the MMAS classification is based on both mRNA and miRNA panels, it is unclear whether this tumor subtyping involves any coordinated miRNA–mRNA regulation. To explore this possibility, we performed pathway analysis of the miRNA-regulated gene networks (see Methods) and looked for any overlap with the enriched pathways of mRNA genes. We found a high correlation between enriched pathways from both transcriptome signatures, in which 5/7 and 13/14 of miRNA upregulated and downregulated pathways (MMAS-1 vs. MMAS-2; DEMs: |fold change | >1.5, FDR < 0.05; miRNA-mRNA: r < 0, P value < 0.05) corresponded to the downregulated and upregulated gene pathways, respectively (Fig. 5H–I, Additional file 2: Table S13). These results illustrate that the subtyping and outcome of the MMASs are governed not only by distinct expression profiles but also by the two-tier coordination of miRNA–mRNA regulation.

Interestingly, in line with the poor outcome of MMAS-1, the MMAS-1-associated transcriptome was enriched in several poor prognosis-associated pathways, such as WNT signaling, EMT, TGF-β, and stem cell pathways. Given that several TFs have been implicated in the gene expression associated with tumorigenesis and survival outcome^65,66,67,68, we next assessed the contribution of TFs in the subtype/outcome-specific gene networks. To identify MMAS-associated risk TFs, we performed the reconstruction of transcriptional networks (RTN) analysis by examining TF regulon activity for each sample⁶⁹. The result shows that the SPIB regulon has the highest association with MMAS (Fig. 5J; Fisher’s exact test P value = 3.65e-13), whereby the activated regulon is correlated with the poor outcome (log-rank test P value = 0.025) of MMAS-1. Furthermore, we uncovered activated AES regulon activity in association with MMAS-1 (Fig. 5K; Fisher’s exact test P value = 7.3e-08; log-rank test P value = 0.031). Notably, given that AES suppresses WNT signaling^70,71, our analysis in this part strengthened the link of WNT signaling to the stratification and clinical outcome of MMASs. Finally, upon further examination, we noticed that some MMAS-1 patient samples exhibit upregulated CMS1 signature pathways (inflammatory response and interferon response) and CMS4 signature pathways (angiogenesis, EMT, extracellular matrix mCRC, and TGF-β up)³⁶. This distinct combination of signatures strongly suggests that the MMAS-1 subtypes can express multiple pathways in addition to the suppressed WNT signaling. Taken together, this molecular characterization of MMASs underscores the heterogeneity of and regional predisposition to CRC tumorigenesis, thus serving as an important basis for precision medicine of this tumor type.

Discussion

ALK, ROS1, and NTRK fusions are well-known CRC-associated fusions; they have been classified as a new subtype of metastatic CRC that exhibits poor prognosis⁷². It has also been reported that kinase inhibitor intervention can be applied for these fusions^28,73,74. In addition to such targeted therapy, fusion genes are translated into peptides, which can potentially act as a neoantigen and can thus be treated with cancer immunotherapy^75,76. These findings extend the treatment options of CRC patients to further increase their chances for survival. In the present study, we observed several fusion genes, and 7 out of 104 (6.7%) patients were validated with corresponding fusions. Of these, PTPRK-RSPO3 and TPM3-NTRK1 are CRC-associated recurrent fusions that are amenable to the porcupine inhibitor and kinase inhibitor interventions, respectively^73,77. Notably, the FGFR3-TACC3 fusions recurrent in urothelial bladder carcinoma, glioblastoma (GBM), head–neck squamous cell carcinoma (HNSC), and low-grade glioma show stable and minor response to FGFR inhibitor treatment^19,78. For fusions such as ERBB2-PPP1R1B, RPS19-CEACAM5, TBC1D15-RAB21, and TNIP1-ANXA6 without appropriate actionable targets for drug therapy, the neopeptides generated from these fusions can potentially be the target of immunotherapy. According to an National Comprehensive Cancer Network (NCCN) guideline, larotrectinib was recently added as a treatment for metastatic colorectal cancer patients positive for NTRK gene fusion^79,80. Therefore, these fusion genes have a great potential to be a therapeutic target in a precision medicine approach.

Some MMAS-1 patients who died were enriched in the CMS4 signature reflecting the EMT and TGF-β pathways (Fig. 5E). It is intriguing that these patients had not been assigned as CMS4. From this observation, some considerations arise: first, we used NTP to perform the classification for the patients. NTP classifies samples based on the most expressed gene of a given subtype; therefore, it is possible that both CMS4 and CMS1/2/3 signatures are highly expressed, but the CMS1/2/3 signature genes are expressed at a higher level than CMS4, resulting in CMS1/2/3 classification. Previously, Ma et al. suggested that the CRC transcriptome exhibits a continuous profile among subtypes, and no evidence can support the existence of discrete subtypes⁸¹. Therefore, in some cases, the expression data of patients is potentially enriched in several pathways among different subtypes, showing a profile of mixed CMS signatures. When we performed the CRCSC classification of the TCGA dataset, we did not observe such a large proportion of CMS1/2/3 patients exhibiting CMS4 signatures (data not shown). Therefore, MMAS-1 patients may be exhibiting a profile of mixed CMS signatures. Furthermore, the distribution of CRCSC assignments is similar, but the death events in Taiwanese CRC tend to be different from TCGA dataset, raising the question about whether the profile of mixed CMS signatures of MMAS-1 patients is relevant to CRC deaths. However, more samples are needed to demonstrate the significance in the death distributions between TCGA and Taiwanese datasets stratified by CRCSC or CRCA subtyping.

In the present study, we discovered that MMAS discriminates samples by the activity of WNT signaling and stratifies samples based on distinct outcomes. Furthermore, MMAS highlights the importance of the WNT signaling pathway underlying the miRNA–mRNA regulation axis to the clinical outcome. Several studies have reported that activated WNT signaling is correlated with poor outcome of CRC^82,83,84,85. Surprisingly, we observed a poor prognosis associated with a suppressed WNT signaling. Melo et al. suggested that WNT target genes are silenced by methylation and exhibit a poor prognosis in CRC, which is partially in line with our observations⁸⁶. In their study, they examined the prognosis in relapse-free survival from CRC and reported that repressed WNT signaling is associated with metastasis. In the present study, the poor prognosis is observed specifically in the OS, but not the DFS (Fig. 4E–F), and the number of stage 4 patients (with metastasis) tends to be higher in MMAS-1 (Fig. 5D, P value> 0.05). In support of this notion, Kim et al. also suggested that methylation of the WNT signaling target gene can predict poor prognosis of CRC.⁸⁷. Collectively, this information strengthens the correlation between repressed WNT signaling and poor prognosis and further raises the possibility that the regulation of the miRNA–mRNA network may be associated or act coordinately with methylation.

By applying various filters, such as parameters of fold change, low abundance, and input size, we generated hundreds of combinations for the NMF clustering input for both mRNA and miRNA datasets. Then, we screened for the following: (1) sample groupings exhibiting significant differences in survival outcome from both datasets and (2) concordant sample grouping of both RNA molecule data. The rationale behind our approach is the use of either miRNA or RNA data to stratify samples, followed by the testing of the association between two grouping results to explore the clinically associated networks or connections between two regulatory layers. During the screening step, we found that if sample groupings show difference in terms of survival outcome, then the sample groupings between two datasets may show association in some cases, suggesting that the composition of the input contains not only the information of survival discrimination but also the information that connects two regulatory layers. Furthermore, we did not observe any sample grouping showing significant differences in DFS (data not shown). Events of DFS include death, recurrence, and local/distant metastasis, each of which may express distinct molecular pathways, and therefore, different sets of key signature genes. Due to this molecular heterogeneity, a simple clustering approach may not be sufficient in deconvoluting several clinical events all at once. Therefore, focusing on one clinical event at a time when performing similar analyses will result in improved resolution or accuracy.

Conclusions

In the past decade, several transcriptomic CRC studies have been reported. Most of their outcomes, such as TCGA, are relevant to people of European ancestry, and little is known about the transcriptomic expression underlying the tumorigenesis of CRC in other ethnicities. In the present study, we collected CRC samples from Taiwanese patients and extended the knowledge of CRC through the following findings: (1) We elucidated the spectrum of CRC-associated dysregulated mRNA molecules and pathways. By integrating this with the miRNA dataset, we built an miRNA–mRNA network and identified the critical subnetworks potentially involved in CRC tumorigenesis. (2) By comparing with the TCGA dataset, we discovered differential enrichment of metastasis-associated pathways between datasets, despite moderate extent of correlated expression in gene patterns. (3) We identified and validated several fusion genes in our CRC cohort, including recurrent and novel events. (4) The subtype profile of Taiwanese CRC patients was revealed using the CRCSC and CRCA classifier tools. The resulting distribution of subtype assignments is similar to that from the TCGA dataset but shows distinct clinical outcomes. (5) We then proposed a new subtype classification approach, MMAS, which discriminates survival outcomes of our patients and links the repressed activity of the WNT signaling pathway. Our findings support our hypothesis of an ethnic basis for CRC tumorigenesis and, hence, provide prospects for translational medicine. Furthermore, to the best of our knowledge, this dataset provides the most comprehensive and integrated set of altered transcriptomic signatures in patients of Han-Chinese ethnicity.

Methods

Colorectal cancer samples and RNA extraction

A total of 104 matched tissues were obtained from colon adenocarcinoma/adenoma patients during surgery at Chang Gung Memorial Hospital, Linkuo, Taiwan. The tumor and adjacent normal tissues were preserved in RNAlater immediately after resection and stored at 4 °C. About 50~60 mg of tissues were used for RNA extraction using TRIzol Reagent according to the manufacturer’s instructions. The quality of RNA samples was evaluated by LabChip GX, and samples with scores of greater than 7 were further processed. The period of samples collection was from December of 2014 to June of 2016.

Next-generation sequencing and data pre-processing

For RNA-seq, we prepared the libraries by Agilent SureSelect Strand-Specific RNA Library Preparation Kit for Illumina (Agilent Technologies) according to the manufacturer’s instructions. Briefly, 2 μg purified total RNA was enriched by the ploy-A tail beads, fragmentized by heat treatment and then reverse transcribed to cDNA using dUTP approach. Adapters ligated libraries were then PCR amplified and purified. For the small RNA-seq, we prepared libraries using Illumina TruSeq small RNA library preparation kits according to manufacturer’s instructions. Briefly, 1 μg of purified total RNA was ligated with adapter and PCR amplified. The small RNA libraries were size selected with a target insert size of 15~30 nt in length using the 6% TBE PAGE gel. The yield and size distribution of the purified RNA and small RNA libraries were assessed using the Agilent 2100 Bioanalyzer instrument with the Agilent High Sensitivity DNA Kit (Agilent Technologies). Equal amounts of libraries were pooled in molecular ratio and consequently sequenced by the NextSeq. 500 sequencer.

Adapter sequences from both small RNA-Seq and RNA-Seq experiments were trimmed by Skewer⁸⁸. For small RNA-Seq, trimmed reads were mapped to hg38 reference by using Bowtie⁸⁹ with the parameter of “-n 1 -l 15 -k 1”. miRNA counts were calculated using featureCounts 1.5.1⁹⁰ with miRBase 21 with 10 bp padding. For RNA-Seq, we mapped RNA reads by using BaseSpace app: RNA-Seq Alignment v1.0 with STAR and hg38 reference. Aligned RNA-seq reads were quantified and annotated using featureCounts 1.5.1⁹⁰ with GENCODE release 25. In addition, aligned RNA-Seq reads were analyzed using BAM-matcher to ensure that samples were properly paired⁴⁶. Both RNA-Seq and small RNA-Seq raw read counts were normalized to reads per million (RPM) for quantitative representation. Principal component analysis (PCA) analysis was performed on both sequencing data sets using R package ‘ggbiplot’ and ‘stats’.

Differential expression analysis

R/BioConductor ‘DESeq. 2’ was used to perform the differential expression analysis. We defined differentially expressed genes (DEGs) and miRNAs (DEMs) in the tumor/normal comparison on the basis of adjusted P value <0.001 and |fold change | >2. Because the fold changes of low abundance genes may not be unequivocally determined, we excluded genes with per kilobase per million mapped reads (RPKM) values of less than 10. For the comparison among MMAS groups, differential expression analysis was performed for the tumor samples and DEGs were defined by adjusted P value <0.05 and |fold change | >1.5. For the TCGA dataset, we downloaded all the colon adenocarcinoma (COAD) and rectum adenocarcinoma (READ) controlled-access RNA-Seq bam files from Genomic Data Commons (GDC). The quantification of read counts and differential expression analysis were performed as above.

Pathway analysis

To explore cellular pathways perturbed between tumor and normal samples, we used R/BioConductor packages ‘clusterProfiler’ and ‘fgsea’ for the overrepresentation enrichment and Gene Set Enrichment analysis (GSEA), respectively⁹¹. The gene set annotations of Kyoto Encyclopaedia of Genes and Genomes (KEGG) was obtained from Molecular Signatures Database (MSigDB)⁹². The visualization of KEGG pathway was conducted by the R/BioConductor package ‘Pathview’. To investigate the dysregulated pathway for each patient sample, we used R package ‘CMScaller’ with curated CRC pathways and performed the single-sample Gene Set Enrichment analysis (ssGSEA) using R/BioConductor package ‘GSVA’^64,93. The highly variable pathways were examined by median absolute deviation (MAD). The top 35 highly variable pathways were represented with heatmap, while the top 20 variable pathways were subjected to PCA to inspect sample grouping by using R package ‘ggbiplot’ and ‘stats’.

Gene fusion prediction and validation

We used STAR-Fusion⁵⁸ 1.3.1 to uncover potential gene fusion events based on the Genome Resource Lib of GRCh38_v27_CTAT_lib_Feb092018. We executed the STAR-Fusion on the fastq files with the parameters of “–FusionInspector validate” and “–examine_coding_effect” to obtain in silico predicted fusion transcript sequences. We further performed Pfam⁹⁴ to extend the functional domains annotations. For identification criteria, we filtered for fusion genes that are (1) in-frame transcripts, (2) with junction reads count> 5, and (3) without analogous fusion detection in the matched normal sample. For a high-confidence fusion gene set, we further annotated fusions that were observed in either CCEL, Cosmic, FA_CancerSupp, Chimer, YOSHIHARA_TCGA and Klijn_CellLines (STARFusion built-in databases). For validation, we adopted RT-PCR and Sanger sequencing approaches. One μg of total RNA was reverse transcribed to cDNA using SuperScript III Reverse Transcriptase kit (Invitrogen) according to the manufacturer’s instructions. Primers designed to detect specific fusions were as follows: ERBB2-PPP1R1B-F: AGA CAC GTT TGA GTC CAT GC, ERBB2-PPP1R1B-R: CTG GCC ACA GGT TGT CTT T; FGFR3-TACC3-F: AGA GGC CCA CCT TCA AGC, FGFR3-TACC3-R: ACT GCC TGG ACA GCT TGT G; PTPRK-RSPO3-F: TCT TGC TCC TCT CTC CTT GG, PTPRK-RSPO3-R: CAT GTT GCA CAG CCT CCT T; RPS19-CEACAM5-F: GGT GGA TAC CGT CAA GCT G, RPS19-CEACAM5-R: CAG CTG AGA GAC CAG GAG AAG; TBC1D15-RAB21-F: AAG CAG AAT GGG ACA TGG TT, TBC1D15-RAB21-R: TGC ATA CGA CTC TGC TTC TTG; TPM3-NTRK1-F: GAG TTT GCT GAG AGA TCG GTA G and TPM3-NTRK1-R: GTG TTT CGT CCT TCT TCT CCA. cDNA was amplified by using KAPA HiFi PCR kit (KAPABIOSYSTEMS) (Figure S7), cloned into pHE Vector (BIOTOOLS) and subjected to Sanger sequencing.

miRNA-mRNA network analysis

To perform miRNA-mRNA network analysis, we first built a miRNA-target interactions (MTIs) database by collecting miRNA-mRNA pairs annotated by both TargetScan 7.1 and miRTarBase 7.0 as the basis for confident miRNA-target prediction^49,50. A total of 25,892 MTIs was curated by this database. For the miRNA-mRNA network analysis between normal and tumor tissues, we calculated Pearson’s correlation and considered inversely correlated interactions (r < 0, P value <0.05) as putative targets for downstream analyses. For the MMAS miRNA-mRNA network analysis, the same analysis pipeline was used on MMAS-1 and MMAS-2 assigned tumor samples. The visualization of constructed miRNA-mRNA network was conducted by the R package ‘visNetwork’. The betweenness scores were calculated by the R package ‘igraph’.

CRC transcriptome subtype analysis

To perform CRCSC classification, we used the R package ‘CMScaller’ implemented with the nearest template prediction algorithm (NTP) and the built-in classifier⁶⁴. To generate the CRC-assigner (CRCA) classifier, the CRCassigner-786 gene list was obtained from the report by Sadanandam et al.³⁷. In addition to the publicly available tools, we also intended to categorize patient subtypes based on our own transcriptome sequencing data, which included both mRNA and miRNA information. To this end, we utilized the non-negative matrix factorization (NMF) for subtype identification. In addition to the conventional NMF clustering approach, which uses highly variable genes as the input, we applied several filters to obtain the screening input genes/miRNA set: input size, fold change cutoff, and low abundance cutoff. For the input size filter, we calculated MAD for the mRNA and miRNA datasets. We then generated mRNA input sets with sizes ranging from 100 to 1500 of the most variable mRNAs based on MAD (with increment of 100). Similar criteria were applied for the miRNA input size, but the size intervals were changed to 25 ~ 600 with increment of 25. For the fold change cutoff, we used |fold change | >1.5 or 2 in the tumor versus normal comparison (with FDR < 0.05) or nil criterion. For the low abundance cutoff, we retained mRNAs/miRNAs with zero counts in <25%, 50%, 75%, or 100% of cohort size. Consequently, these filters generated 180 and 288 combinations for mRNA and miRNA inputs, respectively.

We then performed NMF clustering (by k-mer = 2, Brunet method)⁹⁵ based on these screening input sets, and applied the following two criteria: (1) survival analysis of the stratified groups showing significant differences (log-rank test, P value < 0.05); (2) significant association between sample groupings based on the mRNA and miRNA sets (Fisher’s exact test, P value < 0.05). For effective use of the computing resources, we first ran the NMF with nrun = 25 to screen for the overall or disease-free survival (OS or DFS; P value <0.075). Subsequently, initially passed input sets were subjected to NMF again with nrun = 500 to screen for input sets that meet the criteria described above. We chose the input set by the most significant log-rank test results. Consequently, the attributes of the mRNA input set were: gene size of 400, |fold change | >1.5 and genes with zero counts less than 25% of sample size, whereas the miRNA input is characterized by size of 50, |fold change | >2 and genes with zero counts in less than 25% of samples.

Reconstruction of transcriptional network analysis

In order to identify risk regulons associated with MMAS, we used R/BioConductor package ‘RTN’ to identify putative transcription factor (TF) regulons. Subsequently, R/BioConductor package ‘RTNsurvival’ was conducted to compute the individual regulon activity, which was further stratified for the survival analysis. Any association between MMAS and regulon activity was examined by using Fisher’s exact test.

Statistical analysis

To test the association between clinical attributes, such as sex, TNM stage, location, CEA, alcohol, family cancer history, histological grade and smoke, and presence of fusion or MMAS, Fisher’s exact test was used and P value < 0.05 was considered as a significant association. For patient survival analyses (both OS and DFS), Kaplan-Meier method with log-rank test and Cox proportional hazard models were performed by using the R package ‘survival’ and ‘survminer’. Stage-four patients were not included for the DFS analysis.

Ethics approval and consent to participate

This study was approved by the Chang Gung Memorial Hospital Institutional Review Board as a retrospective analysis (IRB 103-2529B) and was conducted within the guidelines of the Declaration of Helsinki. Patients/families were counseled in the context of the present study design, and all participants provided written informed consent to participate in the study.

Consent for publication

All individuals involved in this study provided consent for publication. We also obtained consent to publish the clinical information of all individuals presented in this study.

Data availability

The fastq format data of RNA-seq and small RNA-seq were deposited at NCBI Sequence Read Archive (SRA) database with the project accession number PRJNA387172.

References

Jemal, A. et al. Global cancer statistics. CA Cancer J. Clin. 61, 69–90, https://doi.org/10.3322/caac.20107 (2011).
Article PubMed Google Scholar
Edwards, B. K. et al. Annual report to the nation on the status of cancer, 1975-2006, featuring colorectal cancer trends and impact of interventions (risk factors, screening, and treatment) to reduce future rates. Cancer 116, 544–573, https://doi.org/10.1002/cncr.24760 (2010).
Article PubMed Google Scholar
Armaghany, T., Wilson, J. D., Chu, Q. & Mills, G. Genetic alterations in colorectal cancer. Gastrointest. Cancer Res. 5, 19–27 (2012).
PubMed PubMed Central Google Scholar
Al-Sohaily, S., Biankin, A., Leong, R., Kohonen-Corish, M. & Warusavitarne, J. Molecular pathways in colorectal cancer. J. Gastroenterol. Hepatol. 27, 1423–1431, https://doi.org/10.1111/j.1440-1746.2012.07200.x (2012).
Article CAS PubMed Google Scholar
Sancho, E., Batlle, E. & Clevers, H. Signaling pathways in intestinal development and cancer. Annu. Rev. Cell Dev. Biol. 20, 695–723, https://doi.org/10.1146/annurev.cellbio.20.010403.092805 (2004).
Article CAS PubMed Google Scholar
Gonzalez, C. A. & Riboli, E. Diet and cancer prevention: Contributions from the European Prospective Investigation into Cancer and Nutrition (EPIC) study. Eur. J. Cancer 46, 2555–2562, https://doi.org/10.1016/j.ejca.2010.07.025 (2010).
Article PubMed Google Scholar
Johnson, C. M. et al. Meta-analyses of colorectal cancer risk factors. Cancer Causes Control. 24, 1207–1222, https://doi.org/10.1007/s10552-013-0201-5 (2013).
Article PubMed PubMed Central Google Scholar
Winter, J., Jung, S., Keller, S., Gregory, R. I. & Diederichs, S. Many roads to maturity: microRNA biogenesis pathways and their regulation. Nat. Cell Biol. 11, 228–234, https://doi.org/10.1038/ncb0309-228 (2009).
Article CAS PubMed Google Scholar
Lee, Y. et al. MicroRNA genes are transcribed by RNA polymerase II. EMBO J. 23, 4051–4060, https://doi.org/10.1038/sj.emboj.7600385 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kozomara, A. & Griffiths-Jones, S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42, D68–73, https://doi.org/10.1093/nar/gkt1181 (2014).
Article CAS PubMed Google Scholar
Kjersem, J. B. et al. Let-7 miRNA-binding site polymorphism in the KRAS 3’UTR; colorectal cancer screening population prevalence and influence on clinical outcome in patients with metastatic colorectal cancer treated with 5-fluorouracil and oxaliplatin +/− cetuximab. BMC Cancer 12, 534, https://doi.org/10.1186/1471-2407-12-534 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nagel, R. et al. Regulation of the adenomatous polyposis coli gene by the miR-135 family in colorectal cancer. Cancer Res. 68, 5795–5802, https://doi.org/10.1158/0008-5472.CAN-08-0951 (2008).
Article CAS PubMed Google Scholar
Qu, Y. L. et al. Up-regulated miR-155-5p promotes cell proliferation, invasion and metastasis in colorectal carcinoma. Int. J. Clin. Exp. Pathol. 8, 6988–6994 (2015).
CAS PubMed PubMed Central Google Scholar
Ng, E. K. et al. MicroRNA-143 targets DNA methyltransferases 3A in colorectal cancer. Br. J. Cancer 101, 699–706, https://doi.org/10.1038/sj.bjc.6605195 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wang, H. et al. MicroRNA-342 inhibits colorectal cancer cell proliferation and invasion by directly targeting DNA methyltransferase 1. Carcinogenesis 32, 1033–1042, https://doi.org/10.1093/carcin/bgr081 (2011).
Article CAS PubMed Google Scholar
Morita, S. et al. miR-29 represses the activities of DNA methyltransferases and DNA demethylases. Int. J. Mol. Sci. 14, 14647–14658, https://doi.org/10.3390/ijms140714647 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mertens, F., Johansson, B., Fioretos, T. & Mitelman, F. The emerging complexity of gene fusions in cancer. Nat. Rev. Cancer 15, 371–381, https://doi.org/10.1038/nrc3947 (2015).
Article CAS PubMed Google Scholar
Negrini, S., Gorgoulis, V. G. & Halazonetis, T. D. Genomic instability–an evolving hallmark of cancer. Nat. Rev. Mol. Cell Biol. 11, 220–228, https://doi.org/10.1038/nrm2858 (2010).
Article CAS PubMed Google Scholar
Gao, Q. et al. Driver Fusions and Their Implications in the Development and Treatment of Human Cancers. Cell Rep. 23, 227–238 e223, https://doi.org/10.1016/j.celrep.2018.03.050 (2018).
Article CAS PubMed PubMed Central Google Scholar
Stransky, N., Cerami, E., Schalm, S., Kim, J. L. & Lengauer, C. The landscape of kinase fusions in cancer. Nat. Commun. 5, 4846, https://doi.org/10.1038/ncomms5846 (2014).
Article ADS CAS PubMed Google Scholar
Cocco, E., Scaltriti, M. & Drilon, A. NTRK fusion-positive cancers and TRK inhibitor therapy. Nat. Rev. Clin. Oncol. 15, 731–747, https://doi.org/10.1038/s41571-018-0113-0 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lin, J. J., Riely, G. J., Shaw, A. T. & Targeting, A. L. K. Precision Medicine Takes on Drug Resistance. Cancer Discov. 7, 137–155, https://doi.org/10.1158/2159-8290.CD-16-1123 (2017).
Article CAS PubMed PubMed Central Google Scholar
Watson, A. J. et al. Identification of selective inhibitors of RET and comparison with current clinical candidates through development and validation of a robust screening cascade. F1000Res 5, 1005, https://doi.org/10.12688/f1000research.8724.2 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sehgal, K., Patell, R., Rangachari, D. & Costa, D. B. Targeting ROS1 rearrangements in non-small cell lung cancer with crizotinib and other kinase inhibitors. Transl. Cancer Res. 7, S779–S786, https://doi.org/10.21037/tcr.2018.08.11 (2018).
Article PubMed Google Scholar
Bass, A. J. et al. Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion. Nat. Genet. 43, 964–968, https://doi.org/10.1038/ng.936 (2011).
Article CAS PubMed PubMed Central Google Scholar
Cancer Genome, Atlas, N. Comprehensive molecular characterization of human colon and rectal cancer. Nat. 487, 330–337, https://doi.org/10.1038/nature11252 (2012).
Article ADS CAS Google Scholar
Seshagiri, S. et al. Recurrent R-spondin fusions in colon cancer. Nat. 488, 660–664, https://doi.org/10.1038/nature11282 (2012).
Article ADS CAS Google Scholar
Aisner, D. L. et al. ROS1 and ALK fusions in colorectal cancer, with evidence of intratumoral heterogeneity for molecular drivers. Mol. Cancer Res. 12, 111–118, https://doi.org/10.1158/1541-7786.MCR-13-0479-T (2014).
Article CAS PubMed Google Scholar
Ardini, E. et al. The TPM3-NTRK1 rearrangement is a recurring event in colorectal carcinoma and is associated with tumor sensitivity to TRKA kinase inhibition. Mol. Oncol. 8, 1495–1507, https://doi.org/10.1016/j.molonc.2014.06.001 (2014).
Article CAS PubMed PubMed Central Google Scholar
Le Rolle, A. F. et al. Identification and characterization of RET fusions in advanced colorectal cancer. Oncotarget 6, 28929–28937, https://doi.org/10.18632/oncotarget.4325 (2015).
Article PubMed PubMed Central Google Scholar
Wang, W. et al. Molecular subtyping of colorectal cancer: Recent progress, new challenges and emerging opportunities. Semin. Cancer Biol. 55, 37–52, https://doi.org/10.1016/j.semcancer.2018.05.002 (2019).
Article CAS PubMed Google Scholar
Alexandrov, L. B. & Stratton, M. R. Mutational signatures: the patterns of somatic mutations hidden in cancer genomes. Curr. Opin. Genet. Dev. 24, 52–60, https://doi.org/10.1016/j.gde.2013.11.014 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chang, J. et al. Genomic analysis of oesophageal squamous-cell carcinoma identifies alcohol drinking-related mutation signature and genomic alterations. Nat. Commun. 8, 15290, https://doi.org/10.1038/ncomms15290 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, X. et al. Distinct Subtypes of Gastric Cancer Defined by Molecular Characterization Include Novel Mutational Signatures with Prognostic Capability. Cancer Res. 76, 1724–1732, https://doi.org/10.1158/0008-5472.CAN-15-2443 (2016).
Article CAS PubMed Google Scholar
Barras, D. et al. BRAF V600E Mutant Colorectal Cancer Subtypes Based on Gene Expression. Clin. Cancer Res. 23, 104–115, https://doi.org/10.1158/1078-0432.CCR-16-0140 (2017).
Article CAS PubMed Google Scholar
Guinney, J. et al. The consensus molecular subtypes of colorectal cancer. Nat. Med. 21, 1350–1356, https://doi.org/10.1038/nm.3967 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sadanandam, A. et al. A colorectal cancer classification system that associates cellular phenotype and responses to therapy. Nat. Med. 19, 619–625, https://doi.org/10.1038/nm.3175 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sveen, A. et al. Colorectal Cancer Consensus Molecular Subtypes Translated to Preclinical Models Uncover Potentially Targetable Cancer Cell Dependencies. Clin. Cancer Res. 24, 794–806, https://doi.org/10.1158/1078-0432.CCR-17-1234 (2018).
Article CAS PubMed Google Scholar
Dienstmann, R. et al. Consensus molecular subtypes and the evolution of precision medicine in colorectal cancer. Nat. Rev. Cancer 17, 79–92, https://doi.org/10.1038/nrc.2016.126 (2017).
Article CAS PubMed Google Scholar
Isella, C. et al. Stromal contribution to the colorectal cancer transcriptome. Nat. Genet. 47, 312–319, https://doi.org/10.1038/ng.3224 (2015).
Article CAS PubMed Google Scholar
Fessler, E. et al. TGFbeta signaling directs serrated adenomas to the mesenchymal colorectal cancer subtype. EMBO Mol. Med. 8, 745–760, https://doi.org/10.15252/emmm.201606184 (2016).
Article CAS PubMed PubMed Central Google Scholar
Okita, A. et al. Consensus molecular subtypes classification of colorectal cancer as a predictive factor for chemotherapeutic efficacy against metastatic colorectal cancer. Oncotarget 9, 18698–18711, https://doi.org/10.18632/oncotarget.24617 (2018).
Article PubMed PubMed Central Google Scholar
Cantini, L. et al. MicroRNA-mRNA interactions underlying colorectal cancer molecular subtypes. Nat. Commun. 6, 8878, https://doi.org/10.1038/ncomms9878 (2015).
Article ADS CAS PubMed Google Scholar
Hua, L., Zhou, P., Li, L., Liu, H. & Yang, Z. Prioritizing breast cancer subtype related miRNAs using miRNA-mRNA dysregulated relationships extracted from their dual expression profiling. J. Theor. Biol. 331, 1–11, https://doi.org/10.1016/j.jtbi.2013.04.008 (2013).
Article CAS PubMed Google Scholar
Xu, T. et al. Identifying Cancer Subtypes from miRNA-TF-mRNA Regulatory Networks and Expression Data. PLoS one 11, e0152792, https://doi.org/10.1371/journal.pone.0152792 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, P. P., Parker, W. T., Branford, S. & Schreiber, A. W. BAM-matcher: a tool for rapid NGS sample matching. Bioinforma. 32, 2699–2701, https://doi.org/10.1093/bioinformatics/btw239 (2016).
Article CAS Google Scholar
Markowitz, S. D. & Bertagnolli, M. M. Molecular origins of cancer: Molecular basis of colorectal cancer. N. Engl. J. Med. 361, 2449–2460, https://doi.org/10.1056/NEJMra0804588 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hagland, H. R., Berg, M., Jolma, I. W., Carlsen, A. & Soreide, K. Molecular pathways and cellular metabolism in colorectal cancer. Dig. Surg. 30, 12–25, https://doi.org/10.1159/000347166 (2013).
Article CAS PubMed Google Scholar
Chou, C. H. et al. miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 46, D296–D302, https://doi.org/10.1093/nar/gkx1067 (2018).
Article CAS PubMed Google Scholar
Agarwal, V., Bell, G. W., Nam, J. W. & Bartel, D. P. Predicting effective microRNA target sites in mammalian mRNAs. Elife 4, https://doi.org/10.7554/eLife.05005 (2015).
Wu, S., Wu, F. & Jiang, Z. Identification of hub genes, key miRNAs and potential molecular mechanisms of colorectal cancer. Oncol. Rep. 38, 2043–2050, https://doi.org/10.3892/or.2017.5930 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Fan, Y. et al. miRNet - dissecting miRNA-target interactions and functional associations through network-based visual analysis. Nucleic Acids Res. 44, W135–141, https://doi.org/10.1093/nar/gkw288 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Wagner, A. H. et al. DGIdb 2.0: mining clinically relevant drug-gene interactions. Nucleic Acids Res. 44, D1036–1044, https://doi.org/10.1093/nar/gkv1165 (2016).
Article ADS CAS PubMed Google Scholar
Colussi, D., Brandi, G., Bazzoli, F. & Ricciardiello, L. Molecular pathways involved in colorectal cancer: implications for disease behavior and prevention. Int. J. Mol. Sci. 14, 16365–16385, https://doi.org/10.3390/ijms140816365 (2013).
Article CAS PubMed PubMed Central Google Scholar
Holder, J. W., Elmore, E. & Barrett, J. C. Gap junction function and cancer. Cancer Res. 53, 3475–3485 (1993).
CAS PubMed Google Scholar
Lu, P., Weaver, V. M. & Werb, Z. The extracellular matrix: a dynamic niche in cancer progression. J. Cell Biol. 196, 395–406, https://doi.org/10.1083/jcb.201102147 (2012).
Article CAS PubMed PubMed Central Google Scholar
Okegawa, T., Pong, R. C., Li, Y. & Hsieh, J. T. The role of cell adhesion molecule in cancer progression and its application in cancer therapy. Acta Biochim Pol 51, 445–457, 035001445 (2004).
Haas, B. et al. STAR-Fusion: Fast and Accurate Fusion Transcript Detection from RNA-Seq. bioRxiv, https://doi.org/10.1101/120295 (2017).
Choi, Y. et al. Integrative analysis of oncogenic fusion genes and their functional impact in colorectal cancer. Br. J. Cancer 119, 230–240, https://doi.org/10.1038/s41416-018-0153-3 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krause, D. S. & Van Etten, R. A. Tyrosine kinases as targets for cancer therapy. N. Engl. J. Med. 353, 172–187, https://doi.org/10.1056/NEJMra044389 (2005).
Article CAS PubMed Google Scholar
Mai, A. et al. Competitive binding of Rab21 and p120RasGAP to integrins regulates receptor traffic and migration. J. Cell Biol. 194, 291–306, https://doi.org/10.1083/jcb.201012126 (2011).
Article CAS PubMed PubMed Central Google Scholar
Leca, J. et al. Cancer-associated fibroblast-derived annexin A6+ extracellular vesicles support pancreatic cancer aggressiveness. J. Clin. Invest. 126, 4140–4156, https://doi.org/10.1172/JCI87734 (2016).
Article PubMed PubMed Central Google Scholar
Arthurs, C. et al. Expression of ribosomal proteins in normal and cancerous human prostate tissue. PLoS One 12, e0186047, https://doi.org/10.1371/journal.pone.0186047 (2017).
Article CAS PubMed PubMed Central Google Scholar
Eide, P. W., Bruun, J., Lothe, R. A. & Sveen, A. CMScaller: an R package for consensus molecular subtyping of colorectal cancer pre-clinical models. Sci. Rep. 7, 16618, https://doi.org/10.1038/s41598-017-16747-x (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Candy, P. A. et al. Notch-induced transcription factors are predictive of survival and 5-fluorouracil response in colorectal cancer patients. Br. J. Cancer 109, 1023–1030, https://doi.org/10.1038/bjc.2013.431 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mullany, L. E. et al. Transcription factor-microRNA associations and their impact on colorectal cancer survival. Mol. Carcinog. 56, 2512–2526, https://doi.org/10.1002/mc.22698 (2017).
Article CAS PubMed PubMed Central Google Scholar
Virolle, T. et al. Egr1 promotes growth and survival of prostate cancer cells. Identification of novel Egr1 target genes. J. Biol. Chem. 278, 11802–11810, https://doi.org/10.1074/jbc.M210279200 (2003).
Article CAS PubMed Google Scholar
Anttila, M. A. et al. Expression of transcription factor AP-2alpha predicts survival in epithelial ovarian cancer. Br. J. Cancer 82, 1974–1983, https://doi.org/10.1054/bjoc.2000.1146 (2000).
Article CAS PubMed PubMed Central Google Scholar
Castro, M. A. et al. Regulators of genetic risk of breast cancer identified by integrative network analysis. Nat. Genet. 48, 12–21, https://doi.org/10.1038/ng.3458 (2016).
Article CAS PubMed Google Scholar
Costa, A. M. et al. GRG5/AES interacts with T-cell factor 4 (TCF4) and downregulates Wnt signaling in human cells and zebrafish embryos. PLoS One 8, e67694, https://doi.org/10.1371/journal.pone.0067694 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Chanoumidou, K. et al. Groucho related gene 5 (GRG5) is involved in embryonic and neural stem cell state decisions. Sci. Rep. 8, 13790, https://doi.org/10.1038/s41598-018-31696-9 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Pietrantonio, F. et al. ALK, ROS1, and NTRK Rearrangements in Metastatic Colorectal Cancer. J Natl Cancer Inst 109, https://doi.org/10.1093/jnci/djx089 (2017).
Park, D. Y. et al. NTRK1 fusions for the therapeutic intervention of Korean patients with colon cancer. Oncotarget 7, 8399–8412, https://doi.org/10.18632/oncotarget.6724 (2016).
Article PubMed Google Scholar
Amatu, A. et al. Novel CAD-ALK gene rearrangement is drugable by entrectinib in colorectal cancer. Br. J. Cancer 113, 1730–1734, https://doi.org/10.1038/bjc.2015.401 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hong, S. et al. Upregulation of PD-L1 by EML4-ALK fusion protein mediates the immune escape in ALK positive NSCLC: Implication for optional anti-PD-1/PD-L1 immune therapy for ALK-TKIs sensitive and resistant NSCLC patients. Oncoimmunology 5, e1094598, https://doi.org/10.1080/2162402X.2015.1094598 (2016).
Article CAS PubMed Google Scholar
Ota, K. et al. Induction of PD-L1 Expression by the EML4-ALK Oncoprotein and Downstream Signaling Pathways in Non-Small Cell Lung Cancer. Clin. Cancer Res. 21, 4014–4021, https://doi.org/10.1158/1078-0432.CCR-15-0016 (2015).
Article MathSciNet CAS PubMed Google Scholar
Li, C. et al. Identification of RSPO2 Fusion Mutations and Target Therapy Using a Porcupine Inhibitor. Sci. Rep. 8, 14244, https://doi.org/10.1038/s41598-018-32652-3 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Di Stefano, A. L. et al. Detection, Characterization, and Inhibition of FGFR-TACC Fusions in IDH Wild-type Glioma. Clin. Cancer Res. 21, 3307–3317, https://doi.org/10.1158/1078-0432.CCR-14-2199 (2015).
Article CAS PubMed PubMed Central Google Scholar
National Comprehensive Cancer Network. Rectal Cancer (Version 1.2019). (2019).
National Comprehensive Cancer Network. Colon Cancer (Version 1.2019). (2019).
Ma, S. et al. Continuity of transcriptomes among colorectal cancer subtypes based on meta-analysis. Genome Biol. 19, 142, https://doi.org/10.1186/s13059-018-1511-4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. CREPT facilitates colorectal cancer growth through inducing Wnt/beta-catenin pathway by enhancing p300-mediated beta-catenin acetylation. Oncogene 37, 3485–3500, https://doi.org/10.1038/s41388-018-0161-z (2018).
Article CAS PubMed PubMed Central Google Scholar
Kim, H. J., Moon, S. J., Kim, S. H., Heo, K. & Kim, J. H. DBC1 regulates Wnt/beta-catenin-mediated expression of MACC1, a key regulator of cancer progression, in colon cancer. Cell Death Dis. 9, 831, https://doi.org/10.1038/s41419-018-0899-9 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rapetti-Mauss, R. et al. Bidirectional KCNQ1:beta-catenin interaction drives colorectal cancer cell differentiation. Proc. Natl Acad. Sci. USA 114, 4159–4164, https://doi.org/10.1073/pnas.1702913114 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kumaradevan, S. et al. c-Cbl Expression Correlates with Human Colorectal Cancer Survival and Its Wnt/beta-Catenin Suppressor Function Is Regulated by Tyr371 Phosphorylation. Am. J. Pathol. 188, 1921–1933, https://doi.org/10.1016/j.ajpath.2018.05.007 (2018).
Article CAS PubMed PubMed Central Google Scholar
de Sousa, E. M. F. et al. Methylation of cancer-stem-cell-associated Wnt target genes predicts poor prognosis in colorectal cancer patients. Cell Stem Cell 9, 476–485, https://doi.org/10.1016/j.stem.2011.10.008 (2011).
Article CAS Google Scholar
Kim, S. H. et al. CpG Island Methylator Phenotype and Methylation of Wnt Pathway Genes Together Predict Survival in Patients with Colorectal Cancer. Yonsei Med. J. 59, 588–594, https://doi.org/10.3349/ymj.2018.59.5.588 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jiang, H., Lei, R., Ding, S. W. & Zhu, S. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinforma. 15, 182, https://doi.org/10.1186/1471-2105-15-182 (2014).
Article Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25, https://doi.org/10.1186/gb-2009-10-3-r25 (2009).
Article CAS PubMed PubMed Central Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinforma. 30, 923–930, https://doi.org/10.1093/bioinformatics/btt656 (2014).
Article CAS Google Scholar
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287, https://doi.org/10.1089/omi.2011.0118 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinforma. 27, 1739–1740, https://doi.org/10.1093/bioinformatics/btr260 (2011).
Article CAS Google Scholar
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 14, 7, https://doi.org/10.1186/1471-2105-14-7 (2013).
Article Google Scholar
Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–230, https://doi.org/10.1093/nar/gkt1223 (2014).
Article CAS PubMed Google Scholar
Brunet, J. P., Tamayo, P., Golub, T. R. & Mesirov, J. P. Metagenes and molecular pattern discovery using matrix factorization. Proc. Natl Acad. Sci. USA 101, 4164–4169, https://doi.org/10.1073/pnas.0308531101 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are grateful to Dr. Chi Yang at the Molecular Medicine Research Center for discussion on the data processing of MMAS. This work was supported by grants from the Ministry of Science and Technology of Taiwan (MOST106–2320-B-182–035-MY3 to H.L.; MOST107-2320-B-182-042-MY3 and MOST105–2314-B-182–061-MY4 to B.C.M.T), Chang Gung Memorial Hospital (CMRPD1F0573, CMRPD1H0372, and BMRPF45 to H.L.; CMRPG3D1513 and CMRPG3D1514 to W.S.T.; CMRPD1F0443, CMRPD1H0022, CMRPD1H0262, and BMRP960 to B.C.M.T.), the Ministry of Education of Taiwan (EMRPD1I0261). This work was also financially supported by the Research Center for Emerging Viral Infections from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan and the Ministry of Science and Technology (MOST), Taiwan (MOST108-3017-F-182-001).

Author information

These authors contributed equally: Shao-Min Wu and Wen-Sy Tsai.

Authors and Affiliations

Graduate Institute of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Shao-Min Wu, Yi-Hsuan Lai, Chung-Pei Ma, Jian-Hua Wang, Chia-Yu Yang, Bertrand Chin-Ming Tan & Hsuan Liu
Division of Colon and Rectal Surgery, Lin-Kou Medical Center, Chang Gung Memorial Hospital, Taoyuan, Taiwan
Wen-Sy Tsai, Sum-Fu Chiang & Hsuan Liu
Graduate Institute of Clinical Medical Sciences, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Sum-Fu Chiang
Department of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Yi-Hsuan Lai, Chung-Pei Ma & Bertrand Chin-Ming Tan
Molecular Medicine Research Center, Chang Gung University, Taoyuan, Taiwan
Jiarong Lin, Pei-Shan Lu, Chia-Yu Yang & Hsuan Liu
Department of Microbiology and Immunology, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Chia-Yu Yang
Department of Otolaryngology-Head & Neck Surgery, Chang Gung Memorial Hospital, Linkou, Taoyuan, Taiwan
Chia-Yu Yang
Department of Neurosurgery, Linkou Medical Center, Chang Gung Memorial Hospital, Linkou, Taiwan
Bertrand Chin-Ming Tan
Research Center for Emerging Viral Infections, Chang Gung University, Taoyuan, Taiwan
Bertrand Chin-Ming Tan
Department of Cell and Molecular Biology, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Hsuan Liu

Authors

Shao-Min Wu
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Sy Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Sum-Fu Chiang
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Hsuan Lai
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Pei Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Hua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiarong Lin
View author publications
You can also search for this author in PubMed Google Scholar
Pei-Shan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Yu Yang
View author publications
You can also search for this author in PubMed Google Scholar
Bertrand Chin-Ming Tan
View author publications
You can also search for this author in PubMed Google Scholar
Hsuan Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L. and W.T. contributed to initial study concept; W.T. and S.C. provided patients samples and clinical analysis; S.W., H.L. and B.C.T. contributed to experimental design, data interpretation, and manuscript writing; S.W. performed the sequencing experiments and bioinformatics analyses; Y.L., C.M., J.W., J.L., P.L. and C.Y. performed sequencing experiments and validation; H.L. and W.T. obtained funding; all authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to Bertrand Chin-Ming Tan or Hsuan Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, SM., Tsai, WS., Chiang, SF. et al. Comprehensive transcriptome profiling of Taiwanese colorectal cancer implicates an ethnic basis for pathogenesis. Sci Rep 10, 4526 (2020). https://doi.org/10.1038/s41598-020-61273-y

Download citation

Received: 22 October 2019
Accepted: 19 February 2020
Published: 11 March 2020
DOI: https://doi.org/10.1038/s41598-020-61273-y

This article is cited by

Dysregulation of SOX17/NRF2 axis confers chemoradiotherapy resistance and emerges as a novel therapeutic target in esophageal squamous cell carcinoma
- Chih-Hsiung Hsieh
- Wen-Hui Kuan
- Yi-Ching Wang
Journal of Biomedical Science (2022)
EV-miRome-wide profiling uncovers miR-320c for detecting metastatic colorectal cancer and monitoring the therapeutic response
- Chan-Keng Yang
- Hung-Chih Hsu
- Hsuan Liu
Cellular Oncology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Massively parallel screen uncovers many rare 3′ UTR variants regulating mRNA abundance of cancer driver genes

Deep whole-genome analysis of 494 hepatocellular carcinomas

Introduction

Results

Comprehensive profiling of Taiwanese CRC transcriptomes reveals cohort-specific alterations

Comparison between Taiwanese CRC data and The Cancer Genome Atlas (TCGA) reveals regional differences

Novel gene fusion events in Taiwanese CRC

Transcriptome-based subtyping of Taiwanese CRC patients

Characterization of NMF-assigned subtypes shows differential representation of signatures

Discussion

Conclusions

Methods

Colorectal cancer samples and RNA extraction

Next-generation sequencing and data pre-processing

Differential expression analysis

Pathway analysis

Gene fusion prediction and validation

miRNA-mRNA network analysis

CRC transcriptome subtype analysis

Reconstruction of transcriptional network analysis

Statistical analysis

Ethics approval and consent to participate

Consent for publication

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information.

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Dysregulation of SOX17/NRF2 axis confers chemoradiotherapy resistance and emerges as a novel therapeutic target in esophageal squamous cell carcinoma

EV-miRome-wide profiling uncovers miR-320c for detecting metastatic colorectal cancer and monitoring the therapeutic response

Comments

Search

Quick links