Colon cancer inter-tumour heterogeneity is installed on multiple levels, ranging from (epi)genetic driver events to signalling pathway rewiring reflected by differential gene expression patterns. Although the existence of heterogeneity in colon cancer has been recognised for a longer period of time, it is sparingly incorporated as a determining factor in current clinical practice. Here we describe how unsupervised gene expression-based classification efforts, amongst which the consensus molecular subtypes (CMS), can stratify patients in biological subgroups associated with distinct disease outcome and responses to therapy. We will discuss what is needed to extend these subtyping efforts to the clinic and we will argue that preclinical models recapitulate CMS subtypes and can be of vital use to increase our understanding of treatment response and resistance and to discover novel targets for therapy.
When diagnosed at early stages, colon cancer is associated with good overall survival rates, but these numbers rapidly decline in stage III or stage IV metastatic disease1. The high heterogeneity of colon cancer on the genetic and gene regulatory level contributes to differences in therapy response and, consequently, survival2. In an era where (semi-)individualised treatment can be more readily achieved due to advancements and cost reductions in various molecular biology techniques, scientists are striving to map colon cancer heterogeneity and to determine which factors can function as better prognostic and predictive markers for this disease. Molecular features are however sparsely assessed in current clinical practice, even though the histopathological parameters typically used do not suffice to recognise high risk patients that could benefit from alternative treatment strategies. This review will provide numerous examples of how molecular characteristics can aid in guiding clinical decision making and can improve outcome for colon cancer patients.
Over the last 10–15 years, the colon cancer research field has moved from mainly assessing mutations to measuring gene expression patterns to describe a broader spectrum of colon cancer heterogeneity. While initially used for dichotomisation into good and poor prognosis patients, unsupervised clustering methods have revealed that separation into strictly two groups does not reflect the biological diversity that exists within colon cancer. We will discuss how several layers of biological variance can be identified by distinct gene expression-based methods and will summarise how retrospective analysis of clinical trials has revealed that this variance has consequences for responses to currently used chemo- and targeted therapies. These findings can importantly be extended to preclinical models, thus this review aims to convey that these models capture colon cancer heterogeneity and can therefore be employed to optimise the efficiency of available therapies and to find novel targets for treatment.
Standard of care for colon cancer
Current colon cancer classification, prognosis prediction and therapy decision-making is mainly based on (histo)pathological features. The tumour, lymph node, metastasis (TNM) staging system functions as the backbone and uses anatomical information to stage a carcinoma, in which predicted prognosis worsens as the stage is increased3. Other factors known to influence prognosis include histological differentiation grade, tumour sidedness and BRAF mutations. Poor differentiation grade is associated with poor prognosis, as is the presence of a BRAF mutation3,4,5. Right-sidedness is also linked to poorer outcome and survival, but only for a specific subgroup of right-sided cancers6,7,8. Even though these negative prognostic markers are recognised in the clinic, their presence generally does not alter the treatment given. An exception can be made in stage II where factors such as a high T stage, poor differentiation grade and a low number of examined regional lymph nodes are considered as markers for high risk of recurrence, and patients can therefore be considered for more aggressive treatment compared to other stage II patients9.
Generally speaking, a one-size-fits-all treatment approach is used for each TNM stage of the disease. Surgical removal of the primary tumour is the main pillar of stage I–III colon cancer treatment. High-risk stage II and all stage III patients are additionally treated with adjuvant chemotherapy if the patient’s condition allows for it; age is the primary determinant. The regimen consists of folinic acid (leucovorin) and the chemotherapeutics 5-fluorouracil (5-FU) or capecitabine, a prodrug of 5-FU, and oxaliplatin (FOLFOX)9. Administration of chemotherapy to stage III patients increases overall survival, but the benefit for high-risk stage II patients has been called into question9,10. In stage IV disease a minority of metastatic lesions qualifies for surgery with curative intentions, but the main first line treatment in the metastatic setting currently consists of chemotherapy with FOLFOX or leucovorin, 5-FU or capecitabine and irinotecan (FOLFIRI) plus bevacizumab. The EGFR-targeting antibody cetuximab is also used under the prerequisite of wildtype KRAS or BRAF status, as mutations in these oncogenes render tumours insensitive to EGFR-targeted therapy9,11,12. Bevacizumab and cetuximab have been simultaneously administered in the metastatic setting, combined with chemotherapy, but this approach decreases quality of life and does not improve overall survival13.
The major issue of the standardised staging and treatment protocol is that it fails to identify the full cohort of poor prognosis colon cancer patients that should receive adjuvant therapy, and that it fails to predict which patients benefit. Some stage II patients, for example, are not categorised as high risk patients and yet eventually present with recurrent disease. Some stage III patients relapse even though they received adjuvant chemotherapy, whereas some never relapse even when no adjuvant chemotherapy was administered9. Consequently, some patients are inadequately treated and some are over-treated with chemotherapeutics that come with serious side effects. Inter-tumour heterogeneity, caused by the presence of distinct mutations and differential regulation of gene expression across colon carcinomas from different patients, is a contributor to these diverse responses as we will substantiate in the next sections.
Colon cancer inter-tumour heterogeneity
Genetic heterogeneity in hereditary and sporadic colon cancer
Part of colon cancer heterogeneity is installed at the premalignant stage, driven by mutations and epigenetic regulation affecting distinct biological pathways. They give rise to two major classes of adenomas and carcinomas that exist in both hereditary and sporadic cases of colon cancer.
The biggest class of sporadic colon cancer contains inactivating mutations of the APC gene, detected in ~80% of the cases14. Somatic mutations in this gene can already be found in early lesions such as dysplastic epithelium and small, benign tubular adenomas15. APC was initially identified as the gene that is heterozygously mutated in the germline of patients suffering from hereditary familial adenomatous polyposis, which is associated with high lifetime incidence of colon cancer16,17. Taken together, these observations fuel the notion that APC loss is the initial driver event for the development of adenomas and subsequent carcinomas14,15,18,19. This gatekeeper role of APC can be attributed to its function in Wnt signalling, the pathway that controls the self-renewal and proliferative capacity of intestinal stem cells (ISC). Inactivation of APC through mutations or loss of heterozygosity leads to aberrant activation of Wnt signalling18,20, but even in the presence of dysfunctional APC, some degree of regulation of Wnt pathway activity is maintained to optimally enable adenoma formation. This is genetically determined by the location and nature of truncating APC mutations on the two alleles, and microenvironmentally by stromal production of, for example, Wnt ligands, R-spondin and hepatocyte growth factor21,22,23,24,25,26. Further accumulation of mutations in well-known oncogenic pathways facilitate the progression from early to high grade adenoma and ultimately carcinoma. KRAS and TP53 mutations are prevalent in early stages, and the latter can contribute to the development of chromosomal instability (CIN)27. The PI3K and TGFβ pathways are frequently deregulated at later stages through mutation of the PIK3CA and PTEN, and SMAD2 and SMAD4 genes, respectively14,15,18,19.
The second, smaller (~15%), class of colon cancer, is characterised by high mutational load caused by defective DNA mismatch repair (MMR)14. Repetitive DNA sequences, such as microsatellites, are especially sensitive to mutation due to dysfunctional MMR and high abundance of microsatellite length alterations, a phenotype known as microsatellite instability (MSI), is therefore used as a surrogate read-out to establish MMR deficiency18. A small percentage of the MSI tumours arises through genetic predisposition; individuals affected by hereditary non-polyposis colorectal cancer (HNPCC) syndrome harbour germline-inactivating mutations in MMR genes18,28,29,30,31. MMR deficiency in sporadic MSI tumours is usually the result of CpG island promoter methylation and hence inactivation of the MLH1 gene14,18, which encodes a crucial player in the MMR pathway. This promoter methylation does not stand on its own, but is accompanied by widespread methylation of promoters throughout the genome, a phenomenon known as CpG island methylator phenotype (CIMP). CIMP-high MSI tumours are furthermore characterised by the frequent presence of mutations in the BRAF oncogene14,32,33,34,35.
Over the last 15 years it has become more widely accepted that BRAF mutation and CIMP-high phenotype are molecular features accumulated during a distinct neoplastic pathway, the serrated pathway (reviewed in ref. 36). These specific changes appear to be crucial first steps in the pathway, given that they can already be detected in sessile serrated adenomas, whereas high MSI is only detected at later stages34,37. The early presence of CIMP and the BRAFV600E oncogene might be functionally linked: it has been suggested that the BRAF oncogene directly contributes to the instalment of CIMP through direction of a repressor complex including DNA methyltransferase 3B towards CpG islands38. The tumour suppressor gene p16INK4a is epigenetically silenced in CIMP and this might be used to overcome BRAF-linked oncogene-induced senescence to drive progression from sessile serrated adenomas to hypermutated MSI carcinoma39,40.
MSI and non-hypermutated tumours can not only be distinguished using the mentioned (epi)genetic features, but also display divergent gene expression patterns, further corroborating that these carcinomas should be regarded as distinct entities41,42. Discrimination between the two is both prognostic and predictive for response to certain therapies. MSI is associated with a good prognos at stage I–III and these tumours do not recur or metastasise frequently. If they do, overall survival is significantly decreased compared to microsatellite stable tumours5,43,44. Clinical trials have revealed that MSI patients should not be treated with 5-FU as they do not benefit or as outcome is negatively affected in stage II disease45,46. These examples highlight that determining MSI status is clinically relevant, but unfortunately is not always standard of care. Additionally testing for the presence of the BRAFV600E mutation can further distinguish between hereditary and sporadic MSI cancer, as this mutation is rarely found in the hereditary form of the disease47.
Gene expression-based profiling of colon cancer
MSI status and a few mutations can be used as prognostic markers, but they fail to accurately select all poor prognosis patients. Over the last 10–15 years the focus has therefore shifted towards employing gene expression patterns as prognostic and predictive markers, an approach that captures a broader perspective on inter-tumour heterogeneity; it measures both differential activity of cell signalling pathways in cancer cells that is not necessarily regulated by mutations alone, and takes the make-up and influence of the tumour microenvironment into account.
Colon cancer outcome prediction
Two landmark studies utilised microarrays on primary breast cancer samples to deduce a 70-gene classifier to stratify patients into good and poor prognosis subsets48,49. A second approach uses a quantitative reverse transcriptase PCR gene expression panel as a prognostic tool for oestrogen receptor positive breast cancer50. Both platforms have been adapted to predict prognosis in stage II and III colon cancer and are known as ColoPrint® and Oncotype DX®, respectively. The latter can additionally predict benefit from treatment with 5-FU and leucovorin51,52,53. Neither of the two commercially available assays are however part of current standard clinical care for colon cancer patients9,54.
Over 20 additional gene signatures designed to identify poor prognosis patients have been published (summarised in ref. 55). Although these signatures may provide clinical use in terms of identification of patients that have higher risk of recurrence, they do not provide insight into the underlying reason. It is important to realise that recurrence for stage III patients may result from the aggressive nature of tumour cells, prompting a higher propensity for recurrence or early metastatic spread (poor prognosis), or from inferior response to treatment (poor prediction). Therefore, these signatures ignore gene regulatory information that might be relevant for understanding specific tumour biology that underlies the differential outcome.
Alternatively, more holistic molecular classification efforts have therefore been made to fully capture colon cancer heterogeneity, either using supervised approaches to study specific cell signalling pathway activity, or preferring unsupervised hierarchical clustering methods. These methods could provide insight into what biological processes fuel aggressive behaviour of cancer cells and what mechanisms are responsible for insensitivity to therapies.
Supervised, biology-driven clustering of colon cancer
The benefit of looking at gene expression signatures rather than mutations alone to study pathway regulation is exemplified by the observations that supervised analysis can be used to detect subsets of colon cancer in which the BRAF, KRAS and/or PIK3CA genes are not mutated, but gene expression patterns nevertheless match those of mutated tumours56,57. Hence, adherence to this pattern can point out patients in which oncogenic pathway activation is achieved through less frequent mutations or through non-genetic mechanisms. Importantly, these tumours behave similarly as compared to their mutated peers; for instance, BRAF-like tumours are equally associated with poor prognosis, but might be sensitive to microtubule-targeting chemotherapeutics56,58 and EGFR-activated cancers do not respond to cetuximab, but might be sensitive to drugs targeting downstream effectors in this pathway57.
Other supervised efforts trying to implement biologically relevant information dichotomised colon cancer based on low or high expression of gene signatures associated with normal ISC or cancer stem cells (CSC)59,60. The underlying notion of this approach was that CSC fuel tumour growth and give rise to different cancer cell types in the tumour, and that high CSC signature expression functions as a surrogate measure for augmented CSC presence20,59,60,61,62,63,64. As in other tumour types, ISC/CSC-like expression patterns in colon cancer were associated with low overall survival rates59,60,64. Surprisingly, high expression of Wnt signalling target genes was associated with good prognosis, even though activation of this pathway is a hallmark of the ISC and CSC state20,23,60. It has therefore been suggested that adherence to the ISC/CSC signatures reflects the relatively undifferentiated nature of the tumour, rather than the abundance of Wnt-active CSC60. Later unsupervised clustering approaches support this notion, as they, amongst others, identify a highly Wnt active cluster and in some cases a separate stem cell-like cluster65,66,67,68,69.
The consensus molecular subtypes of colorectal cancer
Although supervised clustering strategies were prognostic and predictive to a certain extent, forcing all tumours into two subtypes probably does not acknowledge the full heterogeneous biology of colon cancer; MSI cancers are for example not recognised as a separate group. Early unsupervised clustering methods revealed that more than two flavours of colon cancer most likely exist, although these studies did not discuss biological mechanisms associated with each cluster, nor did they link differential gene expression patterns to prognosis and therapy response14,35,41. Several research groups set out to integrate this information in unsupervised clustering, leading to the separation of colon cancer into 3–6 distinct subtypes, depending on the choice of gene expression analysis platforms, bioinformatics approach and statistical analyses14,65,66,67,68,69,70,71. Although the number of subtypes identified diverged, it soon became apparent that some key biological characteristics were shared between subtypes from different studies72. All approaches distinguished a cluster comprised of MSI/CIMP-high tumours, a mesenchymal/stem cell-like cluster and clusters in which gene expression patterns were observed that matched patterns associated with (subsets) of epithelial cells in the normal colon. An international consortium encompassing all the researchers originally reporting on the gene expression-based classification systems, consolidated separate findings into one overarching stratification system, the CMS (summarised in Fig. 1)73.
Four subtypes are described that, importantly, are linked to outcome: in stage I–III colon cancer, CMS4 patients present with the poorest overall survival, and CMS1 presents with the poorest outcome in the metastatic setting, whereas survival rates after relapse are higher in CMS2 tumours73. These findings were further corroborated in independent clinical trials retrospectively analysing CMS status of metastatic colorectal cancer patients74,75.
Differential outcome can be partially explained by the biological features of each subtype: CMS1 represents the MSI/CIMP-high cancers displaying high immune cell infiltration and harbouring the majority of BRAFV600E mutations present in the dataset. CMS2 and CMS3 are alike and share high expression of an epithelial signature, but differ in subtle ways. The CMS2 subtype is also termed the canonical subtype because it is characterised by high levels of CIN and high expression of Wnt and MYC target genes. KRAS mutations are enriched in CMS3. This subtype displays high activation levels of metabolic pathways and is therefore also described as the metabolic subtype. CMS4 expresses high levels of genes associated with a mesenchymal phenotype, illustrated by activation of the epithelial–mesenchymal transition (EMT) and TGFβ pathways. Mesenchymal features have been linked to poor prognosis in various other cancer types76. CMS4 is further characterised by high stromal content and infiltration of lymphocytic and monocytic immune cells77. It is important to note that while some mutations might be significantly more frequent in one subtype over the other, none are exclusively present in one CMS class. The differential gene expression signatures, linked to distinctive pathway activity between CMS subtypes, do however illustrate that the subtypes are biologically distinct and highlight that this is installed beyond the presence of genetic alterations. It furthermore makes it plausible that they can be targeted with specific classes of drugs as discussed later.
It has been suggested that biological differences between CMS subtypes are in part installed at the premalignant state. Expression patterns of SSA obtained from serrated polyposis syndrome patients have been connected to the CMS1 and CMS4 subtypes, whereas tubular adenomas from familial adenomatous polyposis patients bear resemblance to the epithelial subtypes66,78. Other classification efforts on precursor lesions have been described and categorise most hereditary and sporadic tubular adenomas into the epithelial subclasses CMS2 and CMS3, depending on the strategy employed. The majority of sporadic and serrated polyposis syndrome-derived sessile serrated adenomas and hyperplastic polyps classify as CMS1 in these analyses and CMS4-like adenomas are detected at very low incidence79,80. This discordance suggests that although mesenchymal features may be apparent in sessile serrated adenomas, they are not as pronounced as in CMS4 cancers. We therefore suggest that while tubular and sessile serrated adenomas may be predisposed to transform into a carcinoma of a particular CMS subtype, further oncogenic transformation and microenvironmental cues during progression are needed to definitively install the biological programs associated with the individual subtypes. One of those cues might be high levels of TGFβ that push sessile serrated adenomas towards a more CMS4-like rather than a CMS1-like state78.
Interestingly, the patterns in activity of specific biological pathways associated with each CMS subtype can be extrapolated to subtypes identified in carcinomas of other gastrointestinal origin. Elaborate discussion of this observation goes beyond the scope of this review, and it has been diligently reviewed elsewhere76. We would however like to point out an important message that can be distilled from this: these tumours originate from unique tissue types, each using organ-specific gene expression programs and each influenced by microenvironments composed of varying cell types. The overlap in subtype characteristics could point to epithelial cells using specific routes towards the development into carcinoma cells that lead to similar behaviour of these tumours. Additionally, it creates an opportunity to use the same treatment regimen for tumours originating from different tissues that nevertheless adhere to a similar subtype. An alternative explanation for the overlap in subtype characteristics across tumour types is that it does not represent distinct biological behaviour of the epithelial cancer cell compartment, but rather reflects the attraction of a unique microenvironment consisting of different cell types.
Caveats of the CMS and alternative classification strategies
It is evident that subdivision of colon cancer in CMS is not only driven by gene expression profiles derived from the tumour cells, but that it is highly influenced by the abundance and composition of the microenvironment. This is especially apparent in CMS1, in which relatively high numbers of immune cells infiltrate the tumour, and in CMS4 where the high expression of EMT and TGFβ target genes can in part be attributed to an ongoing desmoplastic reaction recruiting high amounts of fibroblasts73,81,82,83. The dependence of the classification on the microenvironment also implies that the sampling site of the tumour piece may impact on CMS classification, as the stromal make-up can differ between specimens. It has indeed been noted that CMS assignment is discordant between samples obtained from the same patient, which holds true when comparing multiple biopsies of the primary tumour, samples acquired from the invasive front and tumour core, or from primary and metastatic sites84,85,86,87.
Partial reliance on stromal gene expression for CMS classification is furthermore illustrated by the fact that stratification of preclinical models is complicated by the absence of stromal cells. The gene expression profile of cancer cell lines is solely derived from epithelial cells, while the stromal compartment of patient-derived xenografts (PDX) is derived from immune-deficient mice that lack B-, T- and often NK-cells. Depending on the platform used, the murine contribution can be partially detected (array-based analysis is optimised for human sequences) or can be analysed separately (RNAseq)88,89,90. Nevertheless, assignment of all subtypes, including the mesenchymal ones, to large cancer cell line panels has been achieved by multiple groups. Irrespective of the distinct bioinformatics analyses used, the majority of cell lines or primary cultures are concordantly subtyped, and switches in classification usually occur between the related CMS2/CMS3 and CMS1/CMS4 subtypes66,90,91,92. Taken together, this indicates that CMS-associated biology is still captured in these models. Similarly, assigning CMS to PDX models is feasible although this proved to require further adaptation of the bioinformatics method. It has been reported that if the patient-derived CMS random forest classifier is applied, PDX models are not classified as CMS4, specifically because murine stromal signals are not measured effectively81,82,89,93. Nevertheless, this can be overcome by rebuilding classifiers enriched for genes expressed in the epithelial fraction, and these adapted classifiers perform well on tumours from patients and on PDX models. In addition, such epithelial-enriched classifiers actually stratify cell lines more optimally compared to the CMS random forest classifier alone90,92,94. This implies that specific CMS features are locked in the tumour cells and that these cells are likely responsible for the manipulation of the microenvironment, yielding stromal-rich tumours in CMS4 or immune cell-rich tumours in CMS1. In agreement, the validity of these PDX classifications was further substantiated by the observation that tumours with a given subtype as defined for the patient’s specimen maintained their subtype in the PDX model in the majority of the cases.
Such a shift in focus towards cancer cell intrinsic expression patterns in the tumour setting has been proposed to resolve some of the issues associated with CMS classification. Gene expression data acquired from whole tumour samples however does not allow for distinction between cancer cell and stroma-derived signals. Single cell RNAseq can resolve this issue in the future, but researchers reasoned that the unique composition of PDX, with human cancer cells and mouse stroma, would in the meantime allow for identification of signals solely expressed in the tumour epithelium by employing two bioinformatical filtering strategies. Firstly, microarray probes cross-hybridising with murine transcripts were not included in further analyses as signals from such probes cannot be assigned faithfully to human or mouse RNA. To further enrich for transcripts of epithelial origin, RNA sequencing data was obtained for a number of PDX. If more than 50% of the total expression of a gene originated from mouse transcripts, it was excluded in the downstream analysis89. These two filtering strategies allowed for unsupervised stratification of colon cancer into five CRC intrinsic signature (CRIS) subtypes that capture epithelial-specific gene expression profiles driven by oncogenic mutations and pathways (summarised in Fig. 1).
CRIS-A consists of KRAS-mutated samples and MSI, BRAF-mutated samples and is characterised by glycolytic metabolism and inflammatory signals. CRIS-B tumours are poorly differentiated and display active TGFβ and EMT signalling. CRIS-C encompasses CIN tumours with wildtype KRAS status, MYC amplification and increased EGFR pathway activity. CRIS-D is associated with high Wnt, fibroblast growth factor receptor (FGFR) and insulin-like growth factor receptor (IGFR) pathway activity, of which the latter is in part achieved by IGF-2 amplification. CRIS-E is likewise characterised by high Wnt activity and TP53 mutations are prevalent in this subtype. Significant overlap exists between CRIS and CMS subtypes, although differences are described (Fig. 1). CRIS-A is mainly constituted of CMS1 and CMS3 tumours. CRIS-B incorporates the other CMS1 samples and a subset of CMS4 samples. The rest of the CMS4 tumours are equally distributed over CRIS-C, CRIS-D and CRIS-E, as are the CMS2 tumours89. Importantly, the CRIS classifier is better suited to concordantly classify multiple samples obtained from the same patient, irrespective of the sampling site, as its focus on the tumour cell intrinsic phenotype reduces the influence of stromal constitution85,86. Consequently, different preclinical models, such as cell lines or PDX models, can also be robustly classified without adapting the CRIS gene set89. As illustrated in Fig. 1, distribution of colon cancer cell lines over the respective CRIS and CMS classes follows the trends observed in patient samples, although small differences exist.
The CRIS classification system is thus fitted to capture oncogenic driver programs specifically active in the cancer cell compartment of a tumour. As will be discussed in the next section, this provides insight in therapy sensitivities, but filtering out stromal contribution to gene expression patterns in tumours comes with drawbacks as well. This is firstly illustrated by the observation that the prognostic value of CRIS is improved when information about the abundance of fibroblast infiltration is added, which to some extent results in a strategy reminiscent of the identification of the poor prognosis, stromal-rich CMS4 subgroup89. Furthermore, although the derivation of the CRIS classifier based on PDX allows for distinguishing cancer cell intrinsic features, it also introduces a bias due to two inherent limitations: (i) signalling incompatibility exists between murine and human ligands/receptors, and certain biological programs and feedback loops between the compartments are thus not completely recapitulated in PDX, and (ii) not all tumours successfully establish a PDX and it has been reported that some CMS classes are more capable of forming serially transplantable xenografts compared to others95. It must also be considered that predominant expression of a gene in the stroma does not mean it cannot be differentially expressed in the cancer cell compartment, and overall exclusion of such genes from further analysis can also neglect relevant biology.
Colon cancer heterogeneity is captured in preclinical model systems and is linked to differential therapy responses
Established cell lines and genetically engineered mouse models (GEMM) traditionally comprised the model systems available for preclinical research, but recent advances allow for generation of different patient-derived models. We have already alluded to direct PDX, in which a piece of human tumour is transplanted in immunocompromised mice96. Patient material can also be used to establish primary cell lines grown in suspension62, and seminal work has described protocols for in vitro 3D culture and expansion of both mouse and human organoids derived from healthy and tumour tissue97,98. Both culture types can be grafted into immune-deficient mice to procure indirect PDX. Furthermore, step-wise introduction of mutations in driver genes in organoids from healthy tissue can be used to model the full spectrum of adenoma-to-carcinoma sequence, including metastasis27,99,100,101,102,103,104.
Importantly, all these different types of preclinical models capture the various facets of colon cancer heterogeneity, including CRIS and CMS subtypes, illustrated by a vast range of published repositories65,66,90,91,92,93,95,100,101,105,106,107,108,109,110,111,112. Before we discuss patterns in therapy sensitivity and resistance that have been identified, we would like to point out that every preclinical model is associated with a certain level of bias in terms of the subgroups that it represents best (Fig. 2). GEMM for colon cancer were typically driven by APC loss and these mice develop tubular adenomas and epithelial-like tumours113,114. Novel conditional GEMM have emerged over the last years in which MMR genes are deleted to model MSI colon cancer115,116 or in which APC is not included as a driver gene. These non-canonical GEMM typically give rise to serrated-like adenomas and more invasive and aggressive colon cancer114,117,118,119,120. Traditional cell lines are enriched for MSI positivity, and primary cell line and organoid cultures adhere more to the CMS2 subtype90. Other patient-derived models are likewise associated with biases: the original organoid establishment protocol is less suitable for generating serrated polyp cultures98,112, whereas direct PDX appear to be more successfully derived from CMS1 and CMS4 tumours95. Fully individualised therapy selection based on an in vitro or in vivo model is therefore still challenging, as one cannot know up front what technique would establish a personal preclinical model for a particular patient. Studying the responses across panels of models has however yielded valuable insights.
Initial clinical trials using vemurafenib to target the BRAFV600E oncogene failed in colon cancer, despite its effectiveness in melanoma121. Preclinical data revealed that feedback activation of the MAPK pathway through EGFR was the underlying mechanism of resistance, and recent trials reported that combination of vemurafenib with MAPK pathway inhibitors overcomes resistance122,123,124. Various mechanisms leading to cetuximab insensitivity were elicited in PDX, amongst which mutations in various components of MAPK signalling (upstream receptor tyrosine kinases, PTEN, PI3K, BRAF, KRAS and NRAS)91,93,111. Pathway hyperactivation through amplification or upregulation of ligands and receptors is also frequently observed, such as human epidermal growth factor receptor 2, hepatocyte growth factor receptor and FGFR1 amplification and IGF2 overexpression110,111,125,126. Drugs to target these compensation mechanisms are available, and can indeed be successfully combined with cetuximab to generate better responses111,126. As with drug resistance, patterns can be observed in cell lines and PDX that predict sensitivity to cetuximab, including high Wnt and Myc pathway activity.
It should be noted that the described escape mechanisms and indicators for sensitivity signify specific CMS and CRIS subtypes, in patient samples and in preclinical models. CMS2-like and CRIS-C tumours display high Wnt and Myc activity and are associated with cetuximab response. A subgroup of CMS2 classifies as CRIS-D in which the FGFR and IGF signalling pathways are activated, and these samples do not respond to cetuximab66,73,89,90,91,92. CMS4-like cell lines do not respond to cetuximab either and are additionally associated with poorer responses to chemotherapy with 5-FU and oxaliplatin66,90,92.
The discussed examples focus on tumour cell intrinsic differences in pathway activation that are recapitulated in model systems. Increased stromal infiltration in the CMS4 subtype is similarly translated to PDX models90, suggesting that CMS4 cancer cells actively attract this microenvironment and that the tumour–stroma interaction could pose as a targeting possibility. Two recent studies using novel mouse models that give rise to primary tumours and spontaneous metastases support this notion114,127. The aggressive tumours were characterised by presence of reactive stroma, stimulated by TGFβ, and activation of CMS4 and CRIS-B-associated gene expression pathways. Critically, inhibition of TGFβ decreased metastatic outgrowth and caused a switch from an immunosuppressive microenvironment to one that allows for T-cell infiltration. The existence of an immunosuppressive environment has been noted before in CMS4 cancer specimens77,87 and these mouse studies suggest that TGFβ inhibitors can be combined with immune checkpoint inhibition to elicit an immune response114,127. They furthermore reveal that studying the tumour–stroma interaction is an avenue worth exploring to infer additional candidates for targeted therapy128. In vitro co-culture experiments, dissection of mouse and human RNA expression patterns in PDX models through RNA sequencing or single cell RNA sequencing of human tumours can be used to reveal these interactions.
Translation of CMS to the clinic
A number of criteria still need to be met to warrant implementation of CMS subtyping in clinical practice. First, the laboratory and informatics procedures currently employed for classification need to be improved and adapted to better suit the clinical reality. Second, the predictive value of CMS for therapy response needs to be corroborated by (prospective) clinical trials. In this section, we aim to shed light on the technical challenges faced to robustly classify patient samples and the avenues that are currently explored to resolve these issues. We will then give an overview of what we have learned so far about therapy outcome prediction by retrospective analysis of clinical trials.
Optimisation of CMS classification for clinical applicability
Gene expression profiling is readily influenced by the practical methods used to obtain and prepare RNA for analysis and is affected by the platform chosen to generate the expression profile. Normalisation issues additionally complicate CMS single sample prediction73,129,130. Furthermore, the majority of the RNA samples obtained for the CMS discovery datasets was extracted from fresh frozen tumour samples, whereas clinical specimens are normally formalin-fixed and paraffin-embedded (FFPE) before storage. The FFPE process generates low quality RNA due to fragmentation and degradation over time and these shorter fragments are more difficult to quantify using RNAseq, microarray or PCR-based techniques, but novel technology offers a solution to enable expression profiling on FFPE material. The NanoString platform uses short probes (35–50 nucleotides) to capture and quantify mRNA sequences and is therefore more compatible with fragmented RNA. Three recent studies have built customised NanoString probe sets and successfully classified panels of FFPE material according to CMS75,87,131. CMS classification between patient-matched FFPE and fresh frozen material was concordant, although sample size was limited75,87. Formal cross-validation of newly developed NanoString classifiers and the original consortium random forest classifier will have to be extended to a larger panel of matched samples to fully support the transformation towards a NanoString-based CMS classification protocol.
To aid the transition to the use of FFPE material, others have also assessed if immunohistochemistry for CMS-specific markers (CDX2, FRMD6, HTR2B and ZEB1) allows for classification. Combining these markers with pan-cytokeratin staining and MSI status facilitates stratification into CMS1, epithelial CMS2/CMS3 and mesenchymal CMS4 classes. The added benefit of immunohistochemistry is that the pathologist can specifically look at protein expression in the cancer cell fraction, thereby reducing the influence of stromal infiltration on CMS classification. Non-concordant CMS assignment of multiple biopsies from the same patient might thereby be overcome, but it then needs to be established if staining intensity for the CMS markers is similar in multiple regions of the same tumour86. Along that line of thinking, immunohistochemistry on FFPE material and review by a pathologist should be incorporated in the CMS classification allowing for a more consistent RNA isolation protocol. Preferably, CMS testing should be performed on sections that are routinely used for pathological diagnosis.
Therapy efficacy in subtypes: discovery in preclinical setting and retrospective analysis of trial samples
Over the last few years CMS(-affiliated) subtyping has been retrospectively applied to publicly available patient datasets and clinical trial cohorts to define predictive markers for therapy efficacy (summarised in Table 1)66,67,71,74,75,132,133,134,135,136,137. We are aware of the fact that the majority of the used cohorts consist of metastatic colon cancer patients and differences between datasets (patient characteristics, trial inclusion and exclusion criteria, treatment composition) and methods used for CMS classification exist. For an extensive discussion of the various intricacies that should be considered for the interpretation of these studies, we would like to point to refs. 138,139,140. Nevertheless, clinically relevant patterns in terms of therapy response can be distilled from these analyses, and we will highlight them in this section.
Not all stage II and III colon cancer patients respond to adjuvant chemotherapy, with recent data suggesting that CMS subtype-specific sensitivities can potentially explain this variation. Multiple studies have pointed out that 5-FU-based therapies, also when supplemented with oxaliplatin, only benefit epithelial CMS2-like patients and not those adhering to a CMS4-like subtype71,141,142. Given the poor prognosis that is already associated with the CMS4 subtype, these clinical studies call for consideration of other chemotherapeutics to increase overall survival. A potential successful alternative was suggested for the stem-like colon cancer subtype, which is reminiscent of the CMS4 subtype. At metastatic disease these tumours appeared more sensitive to FOLFIRI, in which oxaliplatin is replaced for irinotecan67. Although available sample sizes of patients were initially too small to fully support the potential of irinotecan use143,144, two independent clinical trials with larger patient cohorts have since substantiated the enhanced effect of FOLFIRI over FOLFOX in CMS4 in the metastatic setting133,145.
Metastatic patients are currently treated with FOLFIRI or FOLFOX supplemented with targeted therapies bevacizumab or cetuximab, depending on KRAS mutation status9. Clinical trials and preclinical studies have however described that further separation of patients or models according to CMS subtype is relevant for predicting cetuximab efficacy. Patients adhering to a mesenchymal subtype do not benefit from monotherapy with cetuximab regardless of the KRAS mutation status66,91,92,93,132. Combining cetuximab with FOLFOX does not provide better results; overall survival is not increased in CMS4-like patients75,137. Another trial however revealed opposing results, as CMS4 patients responded better to cetuximab addition than to bevacizumab, under the condition that FOLFIRI forms the chemotherapeutic backbone136. It remains to be elucidated whether the conflicting outcomes between trials are, in part, caused by the higher sensitivity of CMS4 to FOLFIRI compared to FOLFOX133,139,145. It however underlines that the best combination of chemotherapy plus bevacizumab or cetuximab can be different for each CMS subgroup. Preclinical models can help in defining the best combination for each subtype in order to guide the design of future clinical trials.
The benefit of bevacizumab over cetuximab in MSI/CMS1 classes of tumour is also not supported by all trials74,75,134, although the hypermutated status and abundant immune infiltration observed in these tumours has made them candidates for immune checkpoint inhibition. Treatment of metastatic MSI colorectal cancer patients has yielded good therapy responses in a subset of patients, and recent genetic analysis of tumours has revealed that a higher load of genomic insertions and deletions due to MMR deficiency can be used as a selective marker for better therapy response77,146,147.
Taken together these results demonstrate that CMS classification can be used to explain differences in response to the therapies currently used in the clinic and, critically, that poor prognosis CMS4 tumours tend not to respond.
To summarise, colon cancer is a heterogeneous disease marked by various molecular features such as mutations, CIN, MSI status and gene expression patterns. These factors are mostly disregarded in the clinic and we have pointed out that they should be incorporated in clinical decision making, as they are relevant for prognosis and therapeutic response.
The CMS and CRIS classification strategies integrate differential activity of biological programs between tumours beyond single gene mutations, and can importantly be linked to therapy sensitivity or resistance in specific subtypes, making them an attractive method to stratify colon cancer patients.
Continued efforts to devise a standardised CMS classification method that functions reliably on FFPE material are being made. Its completion would aid translatability to the clinic, although formal integration would only be warranted if clinical trials support the added value of CMS classification for patient prognosis or prediction of therapy response. To fulfil that need, CMS classification should be applied retrospectively to trials, but it should additionally be incorporated in prospective studies. In that regard the colon cancer field can learn from the i-SPY 2 trial that segregates breast cancer patients in 10 distinct molecular subgroups, adapts the treatment plan accordingly and that allows for rapid influx of experimental drugs into the trial148. Prospective studies taking CMS stratification into account are currently being set up. We have discussed that preclinical models capture the heterogeneous biology of colon carcinomas and we therefore believe that they should be employed to design novel treatment strategies that could yield better results than the current standards. The most promising findings should eventually be tested in these prospective trials. This will ultimately facilitate moving beyond the one-size-fits-all treatment currently used and may hopefully improve disease outcome for more colon cancer patients.
Adenomatous polyposis coli
CpG island methylator phenotype
Consensus molecular subtypes
CRC intrinsic subtypes
Cancer stem cell(s)
Epidermal growth factor receptor
Fibroblast growth factor receptor
Folinic acid (Leucovorin), 5-FU or capecitabine (5-FU prodrug) and irinotecan
Folinic acid (Leucovorin), 5-FU or capecitabine (5-FU prodrug) and oxaliplatin
Genetically engineered mouse model(s)
Insulin-like growth factor (receptor)
Intestinal stem cell(s)
Mitogen-activated protein kinase
DNA mismatch repair
Transforming growth factor beta
Tumour, lymph node, metastasis
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2020. CA: Cancer J. Clin. (2020).
Markowitz, S. D. & Bertagnolli, M. M. Molecular origins of cancer: Molecular basis of colorectal cancer. N. Engl. J. Med.361, 2449–2460 (2009).
Amin, M. B. et al. AJCC Cancer Staging Manual 8th edn (Springer, New York, 2017).
Tol, J., Nagtegaal, I. D. & Punt, C. J. BRAF mutation in metastatic colorectal cancer. N. Engl. J. Med.361, 98–99 (2009).
Gavin, P. G. et al. Mutation profiling and microsatellite instability in stage II and III colon cancer: an assessment of their prognostic and oxaliplatin predictive value. Clin. Cancer Res.18, 6531–6541 (2012).
Sugai, T. et al. Analysis of molecular alterations in left- and right-sided colorectal carcinomas reveals distinct pathways of carcinogenesis: proposal for new molecular profile of colorectal carcinomas. J. Mol. Diagn.8, 193–201 (2006).
Lee, G. H. et al. Is right-sided colon cancer different to left-sided colorectal cancer? A systematic review. Eur. J. Surg. Oncol.41, 300–308 (2015).
Phipps, A. I. et al. Colon and rectal cancer survival by tumor location and microsatellite instability: the Colon Cancer Family Registry. Dis. Colon Rectum56, 937–944 (2013).
Punt, C. J. A., Koopman, M. & Vermeulen, L. From tumour heterogeneity to advances in precision treatment of colorectal cancer. Nat. Rev. Clin. Oncol.14, 235–246 (2017).
André, T. et al. Adjuvant fluorouracil, leucovorin, and oxaliplatin in stage II to III colon cancer: updated 10-year survival and outcomes according to BRAF mutation and mismatch repair status of the MOSAIC study. J. Clin. Oncol.33, 4176–4187 (2015).
Karapetis, C. S. et al. K-ras mutations and benefit from cetuximab in advanced colorectal cancer. N. Engl. J. Med.359, 1757–1765 (2008).
Di Nicolantonio, F. et al. Wild-type BRAF is required for response to panitumumab or cetuximab in metastatic colorectal cancer. J. Clin. Oncol.26, 5705–5712 (2008).
Tol, J. et al. Chemotherapy, bevacizumab, and cetuximab in metastatic colorectal cancer. N. Engl. J. Med.360, 563–572 (2009).
The Cancer Genome Atlas Network. Network TCGA comprehensive molecular characterization of human colon and rectal cancer. Nature487, 330–337 (2012).
Fearon, E. R. & Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell61, 759–67 (1990).
Groden, J. et al. Identification and characterization of the familial adenomatous polyposis coli gene. Cell66, 589–600 (1991).
Kinzler, K. W. et al. Identification of FAP locus genes from chromosome 5q21. Science253, 661–665 (1991).
Fearon, E. R. Molecular genetics of colorectal cancer. Annu. Rev. Pathol.6, 479–507 (2011).
Gerstung, M. et al. The evolutionary history of 2,658 cancers. Nature578, 122–128 (2020).
de Sousa e Melo, F. & de Sauvage, F. J. Cellular plasticity in intestinal homeostasis and disease. Cell Stem Cell24, 54–64 (2019).
Albuquerque, C. et al. The ‘just-right’ signaling model: APC somatic mutations are selected based on a specific level of activation of the beta-catenin signaling cascade. Hum. Mol. Genet.11, 1549–1560 (2002).
Crabtree, M. et al. Refining the relation between ‘first hits’ and ‘second hits’ at the APC locus: the ‘loose fit’ model and evidence for differences in somatic mutation spectra among patients. Oncogene22, 4257–4265 (2003).
Vermeulen, L. et al. Wnt activity defines colon cancer stem cells and is regulated by the microenvironment. Nat. Cell Biol.12, 468–476 (2010).
Leedham, S. J. et al. A basal gradient of Wnt and stem-cell number influences regional tumour distribution in human and mouse intestinal tracts. Gut62, 83–93 (2013).
Yan, K. S. et al. Non-equivalence of Wnt and R-spondin ligands during Lgr5(+) intestinal stem-cell self-renewal. Nature545, 238–242 (2017).
Huels, D. J. et al. Wnt ligands influence tumour initiation by controlling the number of intestinal stem cells. Nat. Commun.9, 1132 (2018).
Drost, J. et al. Sequential cancer mutations in cultured human intestinal stem cells. Nature521, 43–47 (2015).
Lynch, H. T., Shaw, M. W., Magnuson, C. W., Larsen, A. L. & Krush, A. J. Hereditary factors in cancer. Study of two large midwestern kindreds. Arch. Intern. Med.117, 206–212 (1966).
Fishel, R. et al. The human mutator gene homolog MSH2 and its association with hereditary nonpolyposis colon cancer. Cell75, 1027–1038 (1993).
Papadopoulos, N. et al. Mutation of a mutL homolog in hereditary colon cancer. Science263, 1625–1629 (1994).
Miyaki, M. et al. Germline mutation of MSH6 as the cause of hereditary nonpolyposis colorectal cancer. Nat. Genet.17, 271–272 (1997).
Toyota, M., Ohe-Toyota, M., Ahuja, N. & Issa, J. P. Distinct genetic profiles in colorectal tumors with or without the CpG island methylator phenotype. Proc. Natl Acad. Sci. USA97, 710–715 (2000).
Rajagopalan, H. et al. Tumorigenesis: RAF/RAS oncogenes and mismatch-repair status. Nature418, 934 (2002).
Kambara, T. et al. BRAF mutation is associated with DNA methylation in serrated polyps and cancers of the colorectum. Gut53, 1137–1144 (2004).
Hinoue, T. et al. Genome-scale analysis of aberrant DNA methylation in colorectal cancer. Genome Res.22, 271–282 (2012).
IJspeert, J. E., Vermeulen, L., Meijer, G. A. & Dekker, E. Serrated neoplasia-role in colorectal carcinogenesis and clinical implications. Nat. Rev. Gastroenterol. Hepatol.12, 401–409 (2015).
O’Brien, M. J. et al. Comparison of microsatellite instability, CpG island methylation phenotype, BRAF and KRAS status in serrated polyps and traditional adenomas indicates separate pathways to distinct colorectal carcinoma end points. Am. J. Surg. Pathol.30, 1491–1501 (2006).
Fang, M., Ou, J., Hutchinson, L. & Green, M. R. The BRAF oncoprotein functions through the transcriptional repressor MAFG to mediate the CpG Island Methylator phenotype. Mol. Cell55, 904–915 (2014).
Toyota, M. et al. CpG island methylator phenotype in colorectal cancer. Proc. Natl Acad. Sci. USA96, 8681–8686 (1999).
Kriegl, L. et al. Up and downregulation of p16(Ink4a) expression in BRAF-mutated polyps/adenomas indicates a senescence barrier in the serrated route to colon cancer. Mod. Pathol.24, 1015–1022 (2011).
Shen, L. et al. Integrated genetic and epigenetic analysis identifies three different subclasses of colon cancer. Proc. Natl Acad. Sci. USA104, 18654–18659 (2007).
Jorissen, R. N. et al. DNA copy-number alterations underlie gene expression differences between microsatellite stable and unstable colorectal cancers. Clin. Cancer Res.14, 8061–8069 (2008).
Popat, S., Hubner, R. & Houlston, R. S. Systematic review of microsatellite instability and colorectal cancer prognosis. J. Clin. Oncol.23, 609–618 (2005).
Tran, B. et al. Impact of BRAF mutation and microsatellite instability on the pattern of metastatic spread and prognosis in metastatic colorectal cancer. Cancer117, 4623–4632 (2011).
Ribic, C. M. et al. Tumor microsatellite-instability status as a predictor of benefit from fluorouracil-based adjuvant chemotherapy for colon cancer. N. Engl. J. Med.349, 247–257 (2003).
Sargent, D. J. et al. Defective mismatch repair as a predictive marker for lack of efficacy of fluorouracil-based adjuvant therapy in colon cancer. J. Clin. Oncol.28, 3219–3226 (2010).
Funkhouser, W. K. Jr et al. Relevance, pathogenesis, and testing algorithm for mismatch repair-defective colorectal carcinomas: a report of the association for molecular pathology. J. Mol. Diagn.14, 91–103 (2012).
van ‘t Veer, L. J. et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature415, 530–536 (2002).
van de Vijver, M. J. et al. A gene-expression signature as a predictor of survival in breast cancer. N. Engl. J. Med.347, 1999–2009 (2002).
Paik, S. et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N. Engl. J. Med.351, 2817–2826 (2004).
Kerr, D. et al. A quantitative multigene RT-PCR assay for prediction of recurrence in stage II colon cancer: Selection of the genes in four large studies and results of the independent, prospectively designed QUASAR validation study. J. Clin. Oncol.27, 4000 (2009).
Salazar, R. et al. Gene expression signature to improve prognosis prediction of stage II and III colorectal cancer. J. Clin. Oncol.29, 17–24 (2011).
O’Connell, M. J. et al. Relationship between tumor gene expression and recurrence in four independent studies of patients with stage II/III colon cancer treated with surgery alone or surgery plus adjuvant fluorouracil plus leucovorin. J. Clin. Oncol.28, 3937–3944 (2010).
You, Y. N., Rustin, R. B. & Sullivan, J. D. Oncotype DX((R)) colon cancer assay for prediction of recurrence risk in patients with stage II and III colon cancer: a review of the evidence. Surg. Oncol.24, 61–66 (2015).
Lascorz, J., Chen, B., Hemminki, K. & Forsti, A. Consensus pathways implicated in prognosis of colorectal cancer identified through systematic enrichment analysis of gene expression profiling studies. PLoS ONE6, e18867 (2011).
Popovici, V. et al. Identification of a poor-prognosis BRAF-mutant-like population of patients with colon cancer. J. Clin. Oncol.30, 1288–1295 (2012).
Tian, S. et al. A combined oncogenic pathway signature of BRAF, KRAS and PI3KCA mutation improves colorectal cancer classification and cetuximab treatment prediction. Gut62, 540–549 (2013).
Vecchione, L. et al. A vulnerability of a subset of colon cancers with potential clinical utility. Cell165, 317–330 (2016).
Merlos-Suárez, A. et al. The intestinal stem cell signature identifies colorectal cancer stem cells and predicts disease relapse. Cell Stem Cell8, 511–524 (2011).
de Sousa e Melo, F. et al. Methylation of cancer-stem-cell-associated Wnt target genes predicts poor prognosis in colorectal cancer patients. Cell Stem Cell9, 476–485 (2011).
O’Brien, C. A., Pollett, A., Gallinger, S. & Dick, J. E. A human colon cancer cell capable of initiating tumour growth in immunodeficient mice. Nature445, 106–110 (2007).
Ricci-Vitiani, L. et al. Identification and expansion of human colon-cancer-initiating cells. Nature445, 111–115 (2007).
Vermeulen, L. et al. Single-cell cloning of colon cancer stem cells reveals a multi-lineage differentiation capacity. Proc. Natl Acad. Sci. USA105, 13427–13432 (2008).
Liu, R. et al. The prognostic role of a gene signature from tumorigenic breast-cancer cells. N. Engl. J. Med.356, 217–226 (2007).
Schlicker, A. et al. Subtypes of primary colorectal tumors correlate with response to targeted treatment in colorectal cell lines. BMC Med. Genomics.5, 66 (2012).
de Sousa e Melo, F. et al. Poor-prognosis colon cancer is defined by a molecularly distinct subtype and develops from serrated precursor lesions. Nat. Med.19, 614–618 (2013).
Sadanandam, A. et al. A colorectal cancer classification system that associates cellular phenotype and responses to therapy. Nat. Med.19, 619–625 (2013).
Marisa, L. et al. Gene expression classification of colon cancer into molecular subtypes: characterization, validation, and prognostic value. PLoS Med.10, e1001453 (2013).
Budinska, E. et al. Gene expression patterns unveil a new level of molecular heterogeneity in colorectal cancer. J. Pathol.231, 63–76 (2013).
Perez-Villamil, B. et al. Colon cancer molecular subtypes identified by expression profiling and associated to stroma, mucinous type and different clinical behavior. BMC Cancer12, 260 (2012).
Roepman, P. et al. Colorectal cancer intrinsic subtypes predict chemotherapy benefit, deficient mismatch repair and epithelial-to-mesenchymal transition. Int J. Cancer134, 552–562 (2014).
Sadanandam, A. et al. Reconciliation of classification systems defining molecular subtypes of colorectal cancer: interrelationships and clinical implications. Cell Cycle13, 353–357 (2014).
Guinney, J. et al. The consensus molecular subtypes of colorectal cancer. Nat. Med.21, 1350–1356 (2015).
Mooi, J. K. et al. The prognostic impact of consensus molecular subtypes (CMS) and its predictive effects for bevacizumab benefit in metastatic colorectal cancer: molecular analysis of the AGITG MAX clinical trial. Ann. Oncol.29, 2240–2246 (2018).
Lenz, H. J. et al. Impact of consensus molecular subtype on survival in patients with metastatic colorectal cancer: results from CALGB/SWOG 80405 (Alliance). J. Clin. Oncol.37, 1876–1885 (2019).
Bijlsma, M. F., Sadanandam, A., Tan, P. & Vermeulen, L. Molecular subtypes in cancers of the gastrointestinal tract. Nat. Rev. Gastroenterol. Hepatol.14, 333–342 (2017).
Becht, E. et al. Immune and stromal classification of colorectal cancer is associated with molecular subtypes and relevant for precision immunotherapy. Clin. Cancer Res.22, 4057–4066 (2016).
Fessler, E. et al. TGFbeta signaling directs serrated adenomas to the mesenchymal colorectal cancer subtype. EMBO Mol. Med.8, 745–760 (2016).
Chang, K. et al. Colorectal premalignancy is associated with consensus molecular subtypes 1 and 2. Ann. Oncol.29, 2061–2067 (2018).
Komor, M. A. et al. et al. Consensus molecular subtype classification of colorectal adenomas. J. Pathol.246, 266–276 (2018).
Calon, A. et al. Stromal gene expression defines poor-prognosis subtypes in colorectal cancer. Nat. Genet.47, 320–329 (2015).
Isella, C. et al. Stromal contribution to the colorectal cancer transcriptome. Nat. Genet.47, 312–319 (2015).
McCorry, A. M., Loughrey, M. B., Longley, D. B., Lawler, M. & Dunne, P. D. Epithelial-to-mesenchymal transition signature assessment in colorectal cancer quantifies tumour stromal content rather than true transition. J. Pathol.246, 422–426 (2018).
Dunne, P. D. et al. Challenging the cancer molecular stratification dogma: intratumoral heterogeneity undermines consensus molecular subtypes and potential diagnostic value in colorectal cancer. Clin. Cancer Res.22, 4095–4104 (2016).
Dunne, P. D. et al. Cancer-cell intrinsic gene expression signatures overcome intratumoural heterogeneity bias in colorectal cancer patient classification. Nat. Commun.8, 15657 (2017).
Alderdice, M. et al. Prospective patient stratification into robust cancer-cell intrinsic subtypes from colorectal cancer biopsies. J. Pathol.245, 19–28 (2018).
Piskol, R. et al. A clinically applicable gene-expression classifier reveals intrinsic and extrinsic contributions to consensus molecular subtypes in primary and metastatic colon cancer. Clin. Cancer Res.25, 4431–4442 (2019).
Chou, J. et al. Phenotypic and transcriptional fidelity of patient-derived colon cancer xenografts in immune-deficient mice. PLoS ONE8, e79874 (2013).
Isella, C. et al. Selective analysis of cancer-cell intrinsic transcriptional traits defines novel clinically relevant subtypes of colorectal cancer. Nat. Commun.8, 15107 (2017).
Linnekamp, J. F. et al. Consensus molecular subtypes of colorectal cancer are recapitulated in in vitro and in vivo models. Cell Death Differ.25, 616–633 (2018).
Medico, E. et al. The molecular landscape of colorectal cancer cell lines unveils clinically actionable kinase targets. Nat. Commun.6, 7002 (2015).
Sveen, A. et al. Colorectal cancer consensus molecular subtypes translated to preclinical models uncover potentially targetable cancer cell dependencies. Clin. Cancer Res.24, 794–806 (2018).
Schütte, M. et al. Molecular dissection of colorectal cancer in pre-clinical models identifies biomarkers predicting sensitivity to EGFR inhibitors. Nat. Commun.8, 14262 (2017).
Eide, P. W., Bruun, J., Lothe, R. A. & Sveen, A. CMScaller: an R package for consensus molecular subtyping of colorectal cancer pre-clinical models. Sci. Rep.7, 16618 (2017).
Prasetyanti, P. R. et al. Capturing colorectal cancer inter-tumor heterogeneity in patient-derived xenograft (PDX) models. Int. J. Cancer144, 366–371 (2019).
Bleijs, M., van de Wetering, M., Clevers, H. & Drost, J. Xenograft and organoid model systems in cancer research. EMBO J.38, e101654 (2019).
Sato, T. et al. Single Lgr5 stem cells build crypt-villus structures in vitro without a mesenchymal niche. Nature459, 262–265 (2009).
Sato, T. et al. Long-term expansion of epithelial organoids from human colon, adenoma, adenocarcinoma, and Barrett’s epithelium. Gastroenterology141, 1762–1772 (2011).
Matano, M. et al. Modeling colorectal cancer using CRISPR-Cas9-mediated engineering of human intestinal organoids. Nat. Med.21, 256–262 (2015).
O’Rourke, K. P. et al. Transplantation of engineered organoids enables rapid generation of metastatic mouse models of colorectal cancer. Nat. Biotechnol.35, 577–582 (2017).
Roper, J. et al. In vivo genome editing and organoid transplantation models of colorectal cancer and metastasis. Nat. Biotechnol.35, 569–576 (2017).
de Sousa e Melo, F. et al. A distinct role for Lgr5(+) stem cells in primary and metastatic colon cancer. Nature543, 676–680 (2017).
Lannagan, T. R. M. et al. Genetic editing of colonic organoids provides a molecularly distinct and orthotopic preclinical model of serrated carcinogenesis. Gut68, 684–692 (2019).
Kawasaki, K. et al. Chromosome engineering of human colon-derived organoids to develop a model of traditional serrated adenoma. Gastroenterology158, 638–651 (2019).
Ahmed, D. et al. Epigenetic and genetic features of 24 colon cancer cell lines. Oncogenesis2, e71 (2013).
Mouradov, D. et al. Colorectal cancer cell lines are representative models of the main molecular subtypes of primary cancer. Cancer Res.74, 3238–3247 (2014).
Berg, K. C. G. et al. Multi-omics of 34 colorectal cancer cell lines—a resource for biomedical studies. Mol. Cancer16, 116 (2017).
van de Wetering, M. et al. Prospective derivation of a living organoid biobank of colorectal cancer patients. Cell161, 933–945 (2015).
Gao, H. et al. High-throughput screening using patient-derived tumor xenografts to predict clinical trial drug response. Nat. Med.21, 1318–1325 (2015).
Bertotti, A. et al. A molecularly annotated platform of patient-derived xenografts (“xenopatients”) identifies HER2 as an effective therapeutic target in cetuximab-resistant colorectal cancer. Cancer Discov.1, 508–523 (2011).
Bertotti, A. et al. The genomic landscape of response to EGFR blockade in colorectal cancer. Nature526, 263–267 (2015).
Fujii, M. et al. A colorectal tumor organoid library demonstrates progressive loss of niche factor requirements during tumorigenesis. Cell Stem Cell18, 827–838 (2016).
Moser, A. R., Pitot, H. C. & Dove, W. F. A dominant mutation that predisposes to multiple intestinal neoplasia in the mouse. Science247, 322–324 (1990).
Jackstadt, R. et al. Epithelial NOTCH signaling rewires the tumor microenvironment of colorectal cancer to drive poor-prognosis subtypes and metastasis. Cancer Cell36, 319–336 e7 (2019).
Wojciechowicz, K. et al. Temozolomide increases the number of mismatch repair-deficient intestinal crypts and accelerates tumorigenesis in a mouse model of Lynch syndrome. Gastroenterology147, 1064–1072 e5 (2014).
Kucherlapati, M. H. et al. An Msh2 conditional knockout mouse for studying intestinal cancer and testing anticancer agents. Gastroenterology138, 993–1002 e1 (2010).
Bennecke, M. et al. Ink4a/Arf and oncogene-induced senescence prevent tumor progression during alternative colorectal tumorigenesis. Cancer Cell18, 135–146 (2010).
Carragher, L. A. et al. V600EBraf induces gastrointestinal crypt senescence and promotes tumour progression through enhanced CpG methylation of p16INK4a. EMBO Mol. Med.2, 458–471 (2010).
Leystra, A. A. et al. Mice expressing activated PI3K rapidly develop advanced colon cancer. Cancer Res.72, 2931–2936 (2012).
Rad, R. et al. A genetic progression model of Braf(V600E)-induced intestinal tumorigenesis reveals targets for therapeutic intervention. Cancer Cell24, 15–29 (2013).
Chapman, P. B. et al. Improved survival with vemurafenib in melanoma with BRAF V600E mutation. N. Engl. J. Med.364, 2507–2516 (2011).
Prahallad, A. et al. Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR. Nature483, 100–103 (2012).
Corcoran, R. B. et al. Combined BRAF, EGFR, and MEK inhibition in patients with BRAF(V600E)-mutant colorectal cancer. Cancer Discov.8, 428–443 (2018).
Kopetz, S. et al. Encorafenib, binimetinib, and cetuximab in BRAF V600E-mutated colorectal cancer. N. Engl. J. Med.381, 1632–1643 (2019).
Bardelli, A. et al. Amplification of the MET receptor drives resistance to anti-EGFR therapies in colorectal cancer. Cancer Discov.3, 658–673 (2013).
Zanella, E. R. et al. IGF2 is an actionable target that identifies a distinct subpopulation of colorectal cancer patients with marginal response to anti-EGFR therapies. Sci. Transl. Med.7, 272ra12 (2015).
Tauriello, D. V. F. et al. TGFbeta drives immune evasion in genetically reconstituted colon cancer metastasis. Nature554, 538–543 (2018).
Fridman, W. H., Miller, I., Sautes-Fridman, C. & Byrne, A. T. Therapeutic targeting of the colorectal tumor stroma. Gastroenterology158, 303–321 (2019).
Eilertsen, I. A. et al. Technical differences between sequencing and microarray platforms impact transcriptomic subtyping of colorectal cancer. Cancer Lett.469, 246–255 (2019).
Gao, F. et al. DeepCC: a novel deep learning-based framework for cancer molecular subtype classification. Oncogenesis8, 44 (2019).
Ragulan, C. et al. Analytical validation of multiplex biomarker assay to stratify colorectal cancer into molecular subtypes. Sci. Rep.9, 7665 (2019).
Khambata-Ford, S. et al. Expression of epiregulin and amphiregulin and K-ras mutation status predict disease control in metastatic colorectal cancer patients treated with cetuximab. J. Clin. Oncol.25, 3230–3237 (2007).
Del Rio, M. et al. Molecular subtypes of metastatic colorectal cancer are associated with patient response to irinotecan-based therapies. Eur. J. Cancer76, 68–75 (2017).
Smeets, D. et al. Copy number load predicts outcome of metastatic colorectal cancer patients receiving bevacizumab combination therapy. Nat. Commun.9, 4112 (2018).
Stintzing, S. et al. Consensus molecular subgroups (CMS) of colorectal cancer (CRC) and first-line efficacy of FOLFIRI plus cetuximab or bevacizumab in the FIRE3 (AIO KRK-0306) trial. J. Clin. Oncol.35, 3510 (2017).
Stintzing, S. et al. Consensus molecular subgroups (CMS) of colorectal cancer (CRC) and first-line efficacy of FOLFIRI plus cetuximab or bevacizumab in the FIRE3 (AIO KRK-0306) trial. Ann. Oncol.30, 1796–1803 (2019).
Trinh, A. et al. Practical and robust identification of molecular subtypes in colorectal cancer by immunohistochemistry. Clin. Cancer Res.23, 387–398 (2017).
Fontana, E., Eason, K., Cervantes, A., Salazar, R. & Sadanandam, A. Context matters—consensus molecular subtypes of colorectal cancer as biomarkers for clinical trials. Ann. Oncol.30, 520–527 (2019).
Aderka, D., Stintzing, S. & Heinemann, V. Explaining the unexplainable: discrepancies in results from the CALGB/SWOG 80405 and FIRE-3 studies. Lancet Oncol.20, e274–e83 (2019).
Sveen, A., Cremolini, C. & Dienstmann, R. Predictive modeling in colorectal cancer: time to move beyond consensus molecular subtypes. Ann. Oncol.30, 1682–1685 (2019).
Song, N. et al. Clinical outcome from oxaliplatin treatment in stage ii/iii colon cancer according to intrinsic subtypes: secondary analysis of NSABP C-07/NRG Oncology Randomized Clinical Trial. JAMA Oncol.2, 1162–1169 (2016).
Allen, W. L. et al. Transcriptional subtyping and CD8 immunohistochemistry identifies poor prognosis stage II/III colorectal cancer patients who benefit from adjuvant chemotherapy. JCO Precis. Oncol.2018, 1–15 (2018).
Martinez-Garcia, R., Lopez-Casas, P. P., Rico, D., Valencia, A. & Hidalgo, M. Colorectal cancer classification based on gene expression is not associated with FOLFIRI response. Nat. Med.20, 1230–1231 (2014).
Sadanandam, A., Gray, J. & Hanahan, D. Reply to colorectal cancer classification based on gene expression is not associated with FOLFIRI response. Nat. Med.20, 1231–1232 (2014).
Okita, A. et al. Consensus molecular subtypes classification of colorectal cancer as a predictive factor for chemotherapeutic efficacy against metastatic colorectal cancer. Oncotarget9, 18698–18711 (2018).
Le, D. T. et al. PD-1 blockade in tumors with mismatch-repair deficiency. N. Engl. J. Med.372, 2509–2520 (2015).
Mandal, R. et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science364, 485–491 (2019).
Barker, A. D. et al. I-SPY 2: an adaptive breast cancer trial design in the setting of neoadjuvant chemotherapy. Clin. Pharm. Ther.86, 97–100 (2009).
Koopman, M. et al. Sequential versus combination chemotherapy with capecitabine, irinotecan, and oxaliplatin in advanced colorectal cancer (CAIRO): a phase III randomised controlled trial. Lancet370, 135–142 (2007).
J.Y.B., A.T. and J.P.M. are supported by the Koningin Wilhelmina Fonds Dutch Cancer Society (grants UVA2014-7245 and 10150) and Oncode Institute.
Conflict of interest
The authors declare that they have no conflict of interest.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Buikhuisen, J.Y., Torang, A. & Medema, J.P. Exploring and modelling colon cancer inter-tumour heterogeneity: opportunities and challenges. Oncogenesis 9, 66 (2020). https://doi.org/10.1038/s41389-020-00250-6