A temporal shift of the evolutionary principle shaping intratumor heterogeneity in colorectal cancer

Saito, Tomoko; Niida, Atsushi; Uchi, Ryutaro; Hirata, Hidenari; Komatsu, Hisateru; Sakimura, Shotaro; Hayashi, Shuto; Nambara, Sho; Kuroda, Yosuke; Ito, Shuhei; Eguchi, Hidetoshi; Masuda, Takaaki; Sugimachi, Keishi; Tobo, Taro; Nishida, Haruto; Daa, Tsutomu; Chiba, Kenichi; Shiraishi, Yuichi; Yoshizato, Tetsuichi; Kodama, Masaaki; Okimoto, Tadayoshi; Mizukami, Kazuhiro; Ogawa, Ryo; Okamoto, Kazuhisa; Shuto, Mitsutaka; Fukuda, Kensuke; Matsui, Yusuke; Shimamura, Teppei; Hasegawa, Takanori; Doki, Yuichiro; Nagayama, Satoshi; Yamada, Kazutaka; Kato, Mamoru; Shibata, Tatsuhiro; Mori, Masaki; Aburatani, Hiroyuki; Murakami, Kazunari; Suzuki, Yutaka; Ogawa, Seishi; Miyano, Satoru; Mimori, Koshi

doi:10.1038/s41467-018-05226-0

Download PDF

Article
Open access
Published: 23 July 2018

A temporal shift of the evolutionary principle shaping intratumor heterogeneity in colorectal cancer

Tomoko Saito^1,2,
Atsushi Niida³,
Ryutaro Uchi¹,
Hidenari Hirata¹,
Hisateru Komatsu¹,
Shotaro Sakimura¹,
Shuto Hayashi ORCID: orcid.org/0000-0002-1623-0738⁴,
Sho Nambara¹,
Yosuke Kuroda¹,
Shuhei Ito¹,
Hidetoshi Eguchi¹,
Takaaki Masuda¹,
Keishi Sugimachi¹,
Taro Tobo⁵,
Haruto Nishida⁶,
Tsutomu Daa⁶,
Kenichi Chiba⁴,
Yuichi Shiraishi⁴,
Tetsuichi Yoshizato ORCID: orcid.org/0000-0003-4283-2983⁷,
Masaaki Kodama²,
Tadayoshi Okimoto²,
Kazuhiro Mizukami²,
Ryo Ogawa²,
Kazuhisa Okamoto²,
Mitsutaka Shuto²,
Kensuke Fukuda²,
Yusuke Matsui⁸,
Teppei Shimamura⁸,
Takanori Hasegawa⁹,
Yuichiro Doki¹⁰,
Satoshi Nagayama¹¹,
Kazutaka Yamada¹²,
Mamoru Kato¹³,
Tatsuhiro Shibata^14,15,
Masaki Mori¹⁰,
Hiroyuki Aburatani ORCID: orcid.org/0000-0003-0438-1544¹⁶,
Kazunari Murakami²,
Yutaka Suzuki¹⁷,
Seishi Ogawa⁷,
Satoru Miyano^3,4 &
…
Koshi Mimori¹

Nature Communications volume 9, Article number: 2884 (2018) Cite this article

8899 Accesses
68 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Advanced colorectal cancer harbors extensive intratumor heterogeneity shaped by neutral evolution; however, intratumor heterogeneity in colorectal precancerous lesions has been poorly studied. We perform multiregion whole-exome sequencing on ten early colorectal tumors, which contained adenoma and carcinoma in situ. By comparing with sequencing data from advanced colorectal tumors, we show that the early tumors accumulate a higher proportion of subclonal driver mutations than the advanced tumors, which is highlighted by subclonal mutations in KRAS and APC. We also demonstrate that variant allele frequencies of subclonal mutations tend to be higher in early tumors, suggesting that the subclonal mutations are subject to selective sweep in early tumorigenesis while neutral evolution is dominant in advanced ones. This study establishes that the evolutionary principle underlying intratumor heterogeneity shifts from Darwinian to neutral evolution during colorectal tumor progression.

Phenotypic plasticity and genetic control in colorectal cancer evolution

Article Open access 26 October 2022

Tumour heterogeneity and evolutionary dynamics in colorectal cancer

Article Open access 16 July 2021

Landscape of somatic single nucleotide variants and indels in colorectal cancer and impact on survival

Article Open access 20 July 2020

Introduction

Cancer evolution and intratumor heterogeneity (ITH) have attracted increasing attention in the cancer research field because ITH generated during cancer evolution presumably contributes to the therapeutic and diagnostic difficulties of cancer. With the advent of next-generation sequencing technology, the multiregion sequencing approach has been popularly used to understand ITH. Multiregion sequencing, in which multiple samples from physically separate regions of a single tumor are sequenced, typically identifies two categories of somatic mutations: “ubiquitous” and “heterogeneous” mutations, which are present in either all regions or a subset of regions, respectively. Ubiquitous mutations are assumed to accumulate in the early phase of cancer evolution. The parental clone that has acquired all the ubiquitous mutations then branches into subclones, which accumulate heterogeneous mutations and shape ITH. Multiregion sequencing has revealed the landscapes of ITH for renal^1,2, breast³, esophageal^4,5, lung^6,7, ovarian⁸, prostate^9,10, pancreatic¹¹, and other types of cancer. These studies have presented evidence that Darwinian evolution shapes at least part of ITH: there exist one or more subclonal driver events within distinct subclones of a tumor (hereafter, this evidence will be referred to simply as branched evolution). For a few types of tumors^1,2,3,5, more convincing evidence has been identified: multiple subclones harbor genetic alterations in the same gene or genes that work in the same pathway (hereafter, referred to as parallel evolution).

In the development of colorectal cancer (CRC), adenoma first forms a polyp and then partially progresses to early carcinoma, which subsequently grows beyond the muscularis mucosa to invade surrounding tissues and finally metastasize¹². To examine ITH in advanced CRC (ACRC), we previously performed multiregion sequencing of nine locally advanced or metastatic tumors¹³. While most of the known driver events represented by APC and KRAS mutations were observed as ubiquitous mutations, branched or parallel evolution was rarely observed in evolutionary histories of ACRC. By additionally performing a computational simulation of cancer evolution, we demonstrated the possibility that ITH in ACRC could be generated by neutral evolution. Other studies similarly combined multiregion analysis and mathematical modeling to report that neutral evolution could shape the majority of ITH in CRC as well as liver cancer^14,15. The neutral evolution model was also reported by analyzing the distribution of variant allele frequencies (VAFs) in single-region sequencing data^16,17.

Although differences in ITH across various cancer types have been well studied, little has been reported on the changes in ITH along the time course of tumorigenesis. To investigate ITH in the early process of colorectal tumorigenesis, we performed multiregion sequencing of ten colorectal tumors containing adenoma and early carcinoma. In contrast to our previous report about ACRC¹³, our multiregion analysis of the ten early colorectal tumors strongly suggests that Darwinian evolution plays a critical role in shaping ITH in the early phase of colorectal tumorigenesis.

Results

Multiregion sequencing of ten early colorectal tumor cases

To characterize ITH in the early phase of colorectal tumorigenesis, we performed multiregion whole-exome sequencing (WES) on ten early colorectal tumor cases, the details of which are provided in Supplementary Data 1. Although the samples subjected to our analysis contained colorectal adenoma and carcinoma in situ, we collectively refer to them as precancerous lesions of colorectal cancers (PCRCs) in this study. We selected tumors that were diagnosed as colorectal laterally spreading tumors (LSTs), which have suitable forms for multiregion sampling. For each case, we sequenced four to seven multiregion tumor samples and a paired normal mucosa sample as a control, which amounted to 53 tumor samples and 10 normal samples in total. Our WES, which had a median fold coverage of 132.0 (range: 75.5–200.1), detected a median of 150 (range: 82–244) mutations for each sample (Fig. 1, Supplementary Fig. 1a, Supplementary Data 2 and 3). From this, we estimated that each sample had a median mutation rate of 3.0 (range: 1.6–4.9) mutations per megabase. Considering that eight non-hypermutated ACRCs in our previous study¹³ harbored a median of 2.8 (range: 1.2–4.8) mutations per megabase (Supplementary Figs. 1b and 2), PCRCs and ACRCs have comparable somatic mutation rates. Our hierarchical Bayesian analysis, which removed the residuals associated with samples and cases, also confirmed that there were no clear differences in the distribution of the corrected mean numbers of somatic mutations between adenoma, early carcinoma, and ACRC (see Methods; Supplementary Fig. 1c). Based on multiregion mutation profiles (Fig. 1), mutations were categorized as either ubiquitous or heterogeneous mutations. In this study, heterogeneous mutations were further subcategorized into shared mutations, which existed in some of the samples, and private mutations, which were observed in a single sample. PCR-based deep sequencing of randomly sampled mutations validated 100%, 100%, and 94.2% of ubiquitous, shared, and private mutations, respectively. We also compared the number of ubiquitous and heterogeneous mutations between PCRC and ACRC after correcting for different number of samples across cases by downsampling (see Methods). PCRC tended to harbor fewer ubiquitous mutations and more heterogeneous mutations than ACRC; particularly, the number of shared mutations was significantly large in PCRC (Supplementary Fig. 1d–f; P = 0.011; Wilcoxon rank-sum test). We did not observe any significant differences in mutation spectra between ubiquitous and heterogeneous mutations across ten PCRCs (Supplementary Fig. 3a; Wilcoxon signed-rank test) or between PCRC and ACRC (Supplementary Fig. 3b; Fisher’s exact test).

Evolutionary histories of ten PCRCs

Ten PCRCs had already acquired many non-silent mutations in known CRC driver genes¹⁸ such as APC, KRAS, PIK3CA, FBXW7, SMAD4, and TP53 (observed in 8, 7, 3, 1, 1, and 1 patients, respectively; Fig. 1). Mutation rates of APC, KRAS, PIK3CA, FBXW7, and SMAD4 were consistent with previous reports on typical CRC^18,19 (Supplementary Tables 1 and 2), while the mutation rate of TP53 in PCRCs was less than that in the TCGA cohort^18,19 (Supplementary Table 2; 10% vs. 52.4%; P = 0.009; Fisher’s exact test). This was partly due to higher proportion of granular-type LST cases in our cohort, which was reported to harbor lower frequency of TP53 mutation compared to other CRC subtypes²⁰. We obtained evolutionary trees of the ten PCRCs by applying the Treeomics algorithm²¹ to our multiregion sequencing data (Fig. 2). While constructing an evolutionary tree, Treeomics corrects potential sequencing artifacts so that all mutations have mutation patterns compatible with the topologies of the evolutionary tree. Based on this property, Treeomics produces a new categorization of mutations based on parts of the inferred tree; namely, we obtained “trunk” and “branch” mutations, which were refined versions of ubiquitous and heterogeneous mutations, respectively. Similarly, shared and private mutations were mapped to “internal branch” and “external branch” mutations, respectively (hereafter, the two categorizations are referred to as the ubiquitous-heterogeneous and the trunk-branch categorizations; Supplementary Fig. 4). Treeomics also employs bootstrapping analysis, which demonstrated the robustness of our evolutionary tree inference (Supplementary Fig. 5). In each evolutionary tree, the length of the trunk and branches represented the number of trunk and branch mutations, respectively. Some of the PCRC trees had “palm tree-like” shapes that were composed of long trunks and short branches; such trees were typically observed in the ACRC trees, which were reconstructed from our previously published data using Treeomics (Supplementary Figs. 6, 7 and 8a). However, five PCRC trees had “forked tree-like” shapes, which were composed of short trunks and long branches and were not observed in ACRC cases (Supplementary Fig. 8c). These data are consistent with the observation that PCRC harbored more heterogeneous mutations than ACRC (Supplementary Fig. 1d–1f). To scrutinize the evolutionary history of each tumor, we mapped known driver genes with possible functional mutations along the evolutionary trees, which contained non-synonymous single-nucleotide variants (SNVs), stop-gain SNVs, splicing SNVs, or insertion/deletions (indels). For example, PCRC05 had two major branches, which appeared in the relatively early stage of evolution. The first APC mutation (R223X) was found in the trunk, while the second APC mutation (S1338X) was found only in the left major branch. We also found that both APC mutations in the left major branch had VAFs of ~0.4, while the first APC mutation (R223X) in the right major branch had an allele frequency of ~0.8. These observations suggest the two major subclones were subjected to two different processes leading to biallelic inactivation of APC; an additional mutation on the second allele was acquired in the left major branch, while LOH accompanying the first mutation occurred in the right major branch. Notably, the evolutionary tree of PCRC15 showed that two major branches accumulated multiple non-silent mutations in known driver genes; the right major branch had KRAS (A59G) and ARID2 (splice site), whereas the left major branch had KRAS (G12V) and ACVR1B (R474X). PCRC12 had an extremely short trunk containing double mutations in APC (Q1451X and R792fs) and long branches accumulating mutations on five different genes. In this case, an NRAS mutation (Q61K) was obtained as an internal branch mutation after the first branching point, while a KRAS mutation (Q61H) was obtained as an external branch mutation at the other side of the branching point. Comparisons between the evolutionary tree and physical positions of each sample suggest that subclonal branching generally proceeded in physically correlated ways. Treeomics optionally performs detection of subclonal mixing between evolutionally separated samples; our analysis detected subclonal mixing in seven of the ten cases (Supplementary Fig. 5).

Comparative analysis of ITH between PCRC and ACRC

These evolutionary trees suggest that branched evolution and parallel evolution are prominent in PCRC evolution, which is in contrast to the result from our ACRC study¹³. To consolidate this finding, we directly compared the clonal distribution of driver mutations between PCRC and ACRC; in the 10 PCRC cases, 25 of 51 driver mutations were branch mutations, while only 10 of 45 driver mutations were branch mutations in the 8 ACRC cases (Fig. 3a). Thus, compared with ACRC, PCRC had a stronger tendency to acquire driver mutations as branch mutations (Fig. 3b; P = 0.010; Fisher’s exact test). When examined on the ubiquitous-heterogeneous categorization, this tendency was more statistically significant (Supplementary Fig. 9a, b; P = 0.00090; Fisher’s exact test), which reflects the fact that several heterogeneous driver mutations were judged as trunk driver mutations by Treeomics. The contribution of natural selection to ITH can also be measured by the distribution of VAFs; if a set of subclonal mutations consisted of driver and associated passenger mutations, natural selection should have made their allele frequencies high, compared with those from a set without driver mutations²². Based on this idea, we compared the distribution of VAFs between PCRC and ACRC, for each type of mutation on the trunk-branch categorization. To correct the effects of tumor content and read depth, as well as to remove the residuals associated with individual mutations, samples and cases, we employed hierarchical Bayesian analysis, which demonstrated that PCRC harbored internal branch mutations at clearly higher VAFs than ACRC (see Methods; Fig. 3c). We confirmed the same tendency in cancer cell fractions (CCFs), which were obtained by removing effects of copy number alterations (CNAs) from VAFs (see Methods; Supplementary Fig. 10a). Analysis on the ubiquitous-heterogeneous categorization also reproduced the same result, although with less clearness (Supplementary Figs. 9c and 10b). Collectively, these results strongly suggest that evolutionary principles underlying ITH substantially differ between PCRC and ACRC; Darwinian evolution plays a more critical role in generating ITH in PCRC than in ACRC.

Multiregion analysis of CNAs

Finally, we estimated CNAs from WES data and comparatively analyzed multiregion CNA profiles between PCRC and ACRC (Supplementary Fig. 11a–11d). In contrast to single-nucleotide mutations, our hierarchical Bayesian analysis demonstrated that the number of CNAs increased during progression from adenoma through early carcinoma to ACRC (Supplementary Fig. 11e), consistent with previous studies^23,24,25. Tumor ploidy profiles estimated from WES data also suggest that polyploidization was prevailing in ACRC but not in PCRC (Supplementary Fig. 12). Similar to ITH of mutations, ITH of CNAs was observed in both PCRC and ACRC (Fig. 4a, Supplementary Figs. 13 and 14). By focusing on chromosomal arm-level CNAs, we compared the distributions of ubiquitous and heterogeneous CNAs between PCRC and ACRC. Overall, ACRC acquired significantly more ubiquitous CNAs than PCRC, while the numbers of heterogeneous CNAs were not significantly different (see Methods; Figs. 4b–d; P = 0.047 and 0.16, respectively; Wilcoxon rank-sum test). It should be noted that all samples were carcinoma in PCRC05, which harbored the maximum number of ubiquitous CNAs among PCRCs. By contrast, ACRC7, the ACRC case with a heterogeneous TP53 mutation, harbored no ubiquitous CNAs; heterogeneous CNAs were observed only in samples with the TP53 mutation (Supplementary Figs. 2, 5, and 13). We also found that ACRCs harbored an increased number of ubiquitous alterations for some of the chromosomal arms that are recurrently altered in the CRC population^{19,23,24,25,26}. Such CNAs contain 20q amplification, which is established as a driver event for CRC progression^26,27,28,29 (Fig. 4e). Collectively, our data suggest that CNAs act as a driver and are subject to selective sweep during progression from PCRC to ACRC.

Discussion

In this study, we thoroughly characterized and compared ITH in PCRC and ACRC. Although some studies^30,31,32 have examined ITH of PCRC, no conclusive view has been established. Similar to ITH in ACRC¹³, our multiregion sequencing unveiled extensive ITH in PCRC. In contrast to the neutral evolution model previously proposed for ACRC¹³, multiple lines of evidence indicate that at least a part of ITH of PCRC is shaped by Darwinian evolution. We found that multiple PCRC cases have evolutionary trees of forked tree-like shapes, which were not observed for ACRC. More direct evidence of Darwinian evolution was the observation that a significantly higher proportion of driver mutations accumulated as heterogeneous mutations in PCRC than in ACRC. (For simplicity, we discuss all results of the ubiquitous-heterogeneous categorization since analyses of the trunk-branch categorization also produced similar results.) In particular, heterogeneous mutations in APC and KRAS were noteworthy; these mutations were completely recognized as ubiquitous events in ACRC^13,14. Our observation is perfectly supported by a prescient study that focused on only well-known driver mutations and LOH and reported that early colorectal tumors harbored more subclonal alterations than advanced tumors³³. We also demonstrated that VAFs of shared mutations were higher in PCRC, additionally supporting the Darwinian evolution model²². We found that the number of somatic mutations per sample did not differ much between PCRC and ACRC. Although this finding may appear to contradict the fact that cancer genomes progressively accumulate mutations, it can be explained by the higher VAFs of PCRC subclonal mutations, which increased the sensitivity of mutation detection by WES. We also found that PCRC tended to harbor fewer ubiquitous mutations and more heterogeneous mutations than ACRC. However, the small cohort size limited the power of the statistical analysis (the ten PCRC cases vs. the eight ACRC cases); larger cohort size is necessary to confirm this tendency.

As for CNAs, their number progressively increased from adenoma through early carcinoma to ACRC; the increase in ubiquitous CNAs was especially prominent in ACRC. Together with the observation that mutations in well-known driver genes were already present in PCRC, these findings suggest that CNAs play more critical roles in the progression from PCRC to ACRC. This view is consistent with a recent report that CRISPR–Cas9-mediated engineering of canonical driver genes was not sufficient to confer invasive capacity to human intestinal organoids; the report also found that a chromosomal instability phenotype was necessary for metastatic behavior³⁴. A number of recent studies have demonstrated that genome-wide mutational events such as whole-genome duplication (WGD)^4,7,35,36 and chromothripsis³⁶ play essential roles in tumor progression. Although our WES-based tumor ploidy profiling identified signatures of polyploidization in ACRC but not in PCRC, copy number analysis with a higher resolution is required to prove that WGD is involved in the progression from PCRC to ACRC. It is also possible that chromothripsis delineates a boundary between PCRC and ACRC; we should explore this possibility in future studies employing whole-genome sequencing.

Finally, we propose a model of CRC evolution that can simply explain our data (Fig. 5). In our model, multiple subclones are generated by driver mutation acquisition and subsequent selective sweep in PCRC because early tumor growth is inevitably hampered by obstacles such as spatial and nutritional limitation³⁷ and immune attack³⁸. However, out of the multiple subclones generated by Darwinian evolution, the parental clone that can conquer the obstacles emerges. In addition to a sufficient set of driver mutations, such a clone possibly acquires driver CNAs that endow a tumor with malignant phenotypes such as invasion, angiogenesis, and immune escape, and then it dominantly regrows by overcoming the obstacles. This evolutionary bottleneck establishes all driver mutations composing the parental clone in PCRC as ubiquitous mutations in ACRC, and the parental clone then branches into numerous subclones by neutral evolution. This model is consistent with the well-established multi-step carcinogenesis model of CRC¹², in which mutations in major driver genes such as APC, KRAS, and TP53 are sequentially accumulated in adenoma and then additional CNAs are acquired during the progression from adenoma to carcinoma. The neutral evolution phase in our model is also consistent with the recently proposed Big Bang model¹⁴, where a tumor predominantly grows as a single expansion without selective sweep. The extensive ITH generated by neutral evolution definitively works as a rich source of therapy-resistant subclones. However, considering recent reports that certain subclones that have branched out from a primary tumor in the early evolutionary phase constitute recurrent lesions after chemotherapy or radiotherapy^39,40,41,42, it is also possible that subclones that appear in PCRC but that were weeded out by the evolutionary bottleneck remain as minimal residual clones, eventually contributing to therapeutic resistance. Further sequencing studies targeting recurrent lesions are necessary to elucidate more details of CRC evolutionary history.

For a long time after the establishment of the multi-step carcinogenesis model¹², CRC was assumed to be a clonal cell population originating from linear clonal evolution. However, this view has recently been revised by a series of studies proposing that neutral evolution shapes extensive ITH in ACRC^13,14,17. As an extension of these studies, this study provides a detailed view of ITH in PCRC and demonstrates that the evolutionary principle shaping ITH shifts from Darwinian to neutral evolution during CRC progression. We believe that our model of CRC evolution not only provides deep insights into the origin of ITH but also constitutes a foundation for conquering this malignancy.

Methods

Ethics statement

The study design was approved by the institutional review boards and ethics committees of the patients’ hospitals (Oita University Hospital Institutional Review Board: Protocol Number P-14-09, Kyushu University Institutional Review Board: Protocol Number 595-01). The study was conducted according to the principles expressed in the Declaration of Helsinki. We obtained written informed consent from all the patients in this study. There were no animal experiments in the study.

Sample collection and preparation

We obtained 53 samples of colorectal tumors from ten patients with colorectal LST who underwent endoscopic submucosal dissection or radical resection at Kyushu University Beppu Hospital (Beppu, Japan) or Oita University Hospital (Yufu, Japan). These samples were histologically diagnosed by two qualified pathologists as adenoma or carcinoma in situ. Cancer cells were only found in the epithelium or lamina propria without vessel invasion. Following the 2010 WHO classification of tumors of the digestive system, the pathologists evaluated low- and high-grade dysplasia of each sample, which corresponded to adenoma and carcinoma in situ, respectively. Detailed information about participants and samples is provided in Supplementary Data 1. To use high-purity tumor samples, we performed microdissection of all frozen multiregion tumor samples using a Leica Laser Microdissection System (Leica Microsystems, Wetzlar, Germany), distinguishing between adenoma and carcinoma based on the diagnosis of the pathologists. However, when the volume of samples that consisted of both adenoma and early carcinoma was not sufficient, we captured only one of them. We included the diagnostic information in the sample names: the last character of the sample name, “A” or “C”, meant “adenoma” or “carcinoma”, respectively. DNA was extracted from these captured tumor samples and adjacent normal intestinal mucosa with AllPrep DNA/RNA Mini Kit (Qiagen, Hiden, Germany)⁴³.

Whole-exome sequencing

Whole-exome capture was performed on all PCRC samples with the SureSelect Human All Exon V5 Kit (Agilent Technologies, Tokyo, Japan). The captured targets were subjected to sequencing using HiSeq 2500 (Illumina, San Diego, CA, USA) with the pair-end 100 bp read option. Information on read depth is provided in Supplementary Data 2. The sequence data were processed through an in-house pipeline⁴⁴. Briefly, the sequencing reads were aligned to the NCBI Human Reference Genome Build 37 hg19 with BWA version 0.7.8 using default parameters (http://bio-bwa.sourceforge.net/). PCR duplicate reads were removed with Picard (http://www.picard.sourceforge.net). Mutation calling was performed using the EBcall algorithm⁴⁵ with the following parameters: (i) mapping quality score ≥ 20, (ii) base quality score ≥ 15, (iii) both the tumor and normal depths ≥ 8, (iv) variant reads in tumors ≥ 4, (v) VAF in tumor samples ≥ 0.05, (vi) VAF in paired normal samples ≤ 0.1, (vii) minus logarithm of p value of Fisher’s exact test ≥ 1.3, and (viii) minus logarithm of p value of EBcall ≥ 5. The filtered mutations were annotated by ANNOVAR ver.2015Dec14 (http://www.openbioinformatics.org/annovar/). As for ACRC, the WES data obtained in our previous study¹³ were reanalyzed by the same pipeline as for PCRC.

Analysis of multiregion mutation profiles

For each case, we first obtained variants satisfying both the following criteria: (i) it was judged as a somatic mutation by EBcall in any sample and (ii) its position was covered by more than ten reads in all the samples. For each of the passed variants, we reexamined the presence of somatic mutations in each of the samples where EBcall did not judge the variant as a somatic mutation. In this step, which aimed to rescue false negatives missed by EBcall, we assumed the variant to be a somatic mutation if the variant satisfied all the following criteria: (i) VAF in the tumor sample ≥ 0.05, (ii) VAF ≤ 0.01 in the paired normal sample, and (iii) p value of Fisher’s exact test ≤ 0.05. This procedure was applied to each case to obtain a multiregion mutation profile, and then we defined mutations shared by all the samples in each case and other mutations as ubiquitous mutations and heterogeneous mutations, respectively. Heterogeneous mutations were further divided into shared mutations, which were shared by multiple samples, and private mutations, which uniquely existed in a single sample. Information for all the mutations is provided in Supplementary Data 3. The multiregion mutation profile obtained for each case was visualized as a heat map, in which intensities represented VAFs. In the heat map, ubiquitous mutations were ordered along chromosomal positions; shared mutations were ordered by a hierarchical clustering; private mutations were sorted for samples and VAFs. The list of the driver genes indicated in the heat maps was based on the significantly mutated genes that had been previously reported for CRC¹⁸ (Supplementary Table 1). Colors of PCRC sample labels were obtained in the same way as in our previous study¹³. Namely, from the multiregional mutation profile of each case, we also deduced a color-coding scheme to prepare color labels of samples. The multiregional mutation profile were regarded as an n × m matrix, whose n columns and m rows indexed n mutational positions and m samples, respectively. We applied principle component analysis to the multiregional mutation profile and obtained the first, second and third loading vectors. By multiplying these loading vectors, n-dimensional vectors representing mutational profiles of each sample were reduced into three-dimensional vectors. RGB colors used for sample labels are finally papered by mixing red, green, and blue proportionally to the three vector elements. In a color-coding scheme deduced by this approach, color similarity reflects similarity of mutation profiles between samples. For ACRC samples, we employed the same colors as used in our previous study¹³.

Mutation validation by targeted deep sequencing

We performed amplocon-sequencing of tumor DNA for 73 candidate mutations chosen randomly from ubiquitous, shared, and private mutations, based on a previously reported protocol⁴⁶. Briefly, regions containing candidate mutations were amplified from 10 ng of DNA using KOD plus neo (TOYOBO, Osaka, Japan) with primers that were attached by NotI sequences at the 5′ end. Successful amplification was confirmed by gel electrophoresis. Amplicons were pooled, purified using the FastGene Gel/PCR 5 Extraction Kit (Nippon Genetics, Tokyo, Japan), and digested with NotI restriction enzyme (Takara Bio, Shiga, Japan), according to the instruction manual. Samples were ligated using T4 DNA polymerase (Takara Bio) and re-purified; ligated DNA was sonicated into ~200 bp fragments using a Covaris sonicator (Covaris inc., Massachusetts, USA), and prepared for generation of sequencing libraries using NEBNext Ultra DNA Library Prep Kit for Illumina (New England BioLabs, Massachusetts, USA). Libraries were then subjected to deep sequencing on a Hiseq 2500 instrument. Candidate mutations were considered real if both of the following criteria were satisfied: (i) VAF in the tumor ≥ 0.01 and (ii) sequencing depth ≥ 500.

Construction of evolutionary trees

From the multiregion sequencing data for each case, an evolutionary tree was constructed using the Treeomics algorithm²¹ (https://github.com/johannesreiter/treeomics) with default parameters. For every mutation existing in the multiregion mutation profile, the numbers of variant reads, read depth, chromosomal coordinates, gene symbol, and substitution pattern were prepared as input data to Treeomics. Treeomics not only constructs an evolutionarily tree but also corrects potential sequencing artifacts so that all mutations have mutation patterns compatible with the topologies of the evolutionary tree. Based on the parts of the tree that the mutations constituted, we obtained trunk, branch, internal branch, and external branch mutations, which were refined versions of ubiquitous, heterogeneous, shared and private mutations, respectively. To remove potential sequencing artifacts, Treeomics also employs mutation filters, which filtered out 1.3% of our input mutations. Information about the trunk-branch categorization is also provided in Supplementary Data 3. We were unable to apply Treeomics to the ACRC3 data, which contained 21 samples, due to insufficient memory on our computer. To address this problem, we divided the ACRC3 data into two parts which corresponded to two apparent sample clusters in the multiregion profiles. After the divided data were subjected to Treeomics, an evolutionary tree was constructed by merging the results. Except for ACRC2 and ACRC3, the robustness of the evolutionary tree inference was examined on 1000 bootstrapping samples from the input mutations. For ACRC2 and ACRC3, only 50 bootstrapping samples were obtained due to the memory limitation. The inferred evolutionary trees were annotated with the same driver gene list as used for the heat maps of the multiregion mutation profiles (Supplementary Table 1). For detection of subclonal mixing, we reconstructed evolutionary trees with the “-u” option and the obtained information of subclonal mixing was added to the trees constructed without the “-u” option (Supplementary Figs. 6 and 7).

Analysis of CNAs

To detect CNAs from WES data, we used a software tool, EXCAVATOR⁴⁷ (http://sourceforge.net/projects/excavatortool/), that not only reports chromosomal segments subjected to CNAs but also outputs the log-transformed ratio of copy number intensities between tumor and normal samples (log₂R) for each locus. We used twice the median of the ubiquitous mutation allele frequencies for each sample as the cellularity parameters. CNAs whose length was larger than 50% of the chromosomal arm were classified as chromosomal arm-level CNAs, while the others were classified as focal CNAs. For each case, we made a multiregion arm-level CNA profile, which presented an average log₂R for each of the chromosomal arms subjected to CNAs. For each of the chromosomal arms that EXCAVATOR reported to have CNAs in any sample, we reexamined a presence of CNAs in each sample where EXCAVATOR did not report the chromosomal arm-level CNA; we assumed that a CNA existed if the absolute value of log₂R averaged along the chromosomal arm was greater than 0.15. We also prepared a multiregion focal CNA profile for each case, by focusing on candidate loci that were previously reported to be recurrently altered^19,26. For each candidate locus that had overlap with EXCAVATOR-deduced focal CNAs in any sample, we calculated log₂R averaged along the locus for each sample. We assumed that a CNA existed if the absolute value of the averaged log₂R was greater than 0.15.

Estimation of tumor ploidy

To estimate tumor ploidy from WES data, we used two software tools, FACETS⁴⁸ (https://github.com/mskcc/facets) and sequenza⁴⁹ (https://cran.r-project.org/web/packages/sequenza/index.html). In FACETS, we prepared germ line polymorphic sites cataloged in the Human Genetic Variation Database version 2.30 (http://www.hgvd.genome.med.kyoto-u.ac.jp) as a reference. Other parameters were set by default in both FACETS and sequenza.

Comparison of the numbers of ubiquitous and heterogeneous alterations between cases

We reanalyzed our previously published multiregion WES data sets of ACRC¹³ in the same way that PCRC data were analyzed for calling mutations and CNAs. Although the data set contained nine ACRCs, one hypermutated case was excluded in this study. The prefix “case” in the previous sample names was also replaced with “ACRC”. Each of the ten PCRC and eight ACRC cases had a different number of samples from 4 to 21, which led to an unfair comparison of the numbers of ubiquitous and heterogeneous alterations. We addressed this problem by employing a down-sampling approach, where the numbers of ubiquitous and heterogeneous alterations were estimated from randomly sampled sub-datasets of an equal number of samples across all cases, using the following steps: (i) for each case, we obtained every sub-dataset with four samples, which was the minimum number of samples in our data set; (ii) for each sub-dataset, the numbers of ubiquitous and heterogeneous alterations were calculated (here, since heterogeneous alterations were associated with each sample, we took the median across samples as the number of the heterogeneous alterations); and (iii) medians across all sub-datasets were assumed to be corrected numbers of ubiquitous and heterogeneous alterations. An explanatory example of our down-sampling approach is provided in Supplementary Fig. 15.

Comparison of the numbers of alterations between different tumor stages

To compare the number of mutations between different tumor stages, we employed hierarchical Bayesian analysis (Supplementary Fig. 16), which enabled us to estimate the mean number of mutations in each tumor stage, after removing the residuals associated with samples and cases in which the mutations were found. As the tumor stages, we assumed the following three categories: T^(SNV) = {adenoma, carcinoma, ACRC}. Let i and j denote the indices for samples and cases, respectively. From multiregion sequencing data of I cases, each of which contains J_i samples, we obtain the number of mutations in the j-th sample of the i case (hereafter, simply referred as to sample ij) as n_ij^(SNV) (i = 1, …, I and j = 1, …, J_i). Sample ij is associated with any of the three tumor stages: t_ij ∈ T^(SNV). We assume that n_ij^(SNV) is sampled from a Poisson distribution: n_ij^(SNV) ~ Poisson(μ_ij^(SNV)), where μ_ij^(SNV) is expressed by the main term associated with tumor stages and the residual term associated with samples and cases: μ_ij^(SNV) ~ exp(β_ij^(SNV) + r_ij^(SNV)). The main term β_ij^(SNV) is obtained by substituting the tumor stage of sample ij, t_ij, into variable β_t^(SNV), which represents the mean number of mutations in tumor stage t ∈ T^(SNV): β_ij^(SNV) ← β_tij^(SNV). To ensure the robustness of parameter estimation, we employed a Cauchy distribution as the prior distribution for β_t^(SNV): β_t^(SNV) ~ Cauchy(β^(SNV), τ^(SNV)). The two hyper-parameters are also sampled from Cauchy and half-Cauchy⁵⁰ hyper-priors, respectively: β^(SNV) ~ Cauchy(β₀^(SNV), τ₀^(SNV)) and τ^(SNV) ~ Half-Cauchy(λ₀^(SNV)), where we set β₀^(SNV) = 0, τ₀ ^(SNV) = 1, and λ₀^(SNV) = 1. The residual term r_ij^(SNV) is hierarchically sampled from two Cauchy distributions: r_i^(SNV) ~ Cauchy(r₀ ^(SNV), t^(SNV)) and r_ij^(SNV) ~ Cauchy(r_i^(SNV), t_i^(SNV)). We set r₀^(SNV) = 1 while the scale parameters are sampled from half-Cauchy hyper-priors: t^(SNV) ~ Half-Cauchy(l₀^(SNV)) and t_i^(SNV) ~ Half-Cauchy(l₁^(SNV)), where we set l₀^(SNV) = 1 and l₁^(SNV) = 1. For each of the 123 samples in the ten PCRCs and eight ACRCs, the number of all mutations and tumor stage was prepared as n_ij^(SNV) and t_ij, respectively. We estimated the posterior distribution of β_t^(SNV) by running MCMC on JAGS 4.2.0⁵¹ with the following parameter settings: number of chains = 20, number of burn-in iterations = 100,000, number of total iterations = 200,000 and thinning interval = 5. Convergence of Markov chains was confirmed by the Gelman-Rubin convergence diagnostic⁵². The density plot in Supplementary Fig. 1c shows the distribution of the MCMC samples of β_t^(SNV) for each tumor stage on the exponential scale. To compare of the number of CNAs, the numbers of CNAs in the 123 samples were prepared and processed in the same way, except for the MCMC parameter settings: number of chains = 20, number of burn-in iterations = 50,000, number of total iterations = 100,000 and thinning interval = 5 (Supplementary Fig. 11e).

Comparison of VAFs between different categories of mutations

Hierarchical Bayesian analysis was employed to compare VAFs between different categories of mutations, similarly to the comparison of the numbers of alterations between different tumor stages (Supplementary Fig. 17). We estimated the mean VAFs for each category of mutations, after correcting for the effects of tumor content and read depth, as well as removing the residuals associated with individual mutations, samples and cases in which the mutations were found. We assume that mutations are categorized into the following six categories: T^(VAF) = {PCRC, ACRC} × {trunk, internal branch, external branch} (or {PCRC, ACRC} × {ubiquitous, shared, private} on the ubiquitous-heterogeneous categorization). In addition to the case index i and sample index j, let k denote the index for mutations. We assume that K_ij (=n_ij^(SNV)) mutations are identified in sample ij and the k-th mutation (hereafter, simply referred as to mutation ijk) is categorized as t_ijk (k = 1, …, K_ij). For mutation ijk, we obtain the numbers of total and variant reads, which are represented as d_ijk and b_ijk, respectively. We assume that b_ijk is sampled from the binomial distribution with parameters d_ijk (the number of trials) and P_ijk^(m) (the success probability): b_ijk ~ Binomial(d_ijk, P_ijk^(m)). P_ijk^(m) is a modified VAF, which is obtained by multiplying the true VAF and the tumor content of sample ij: P_ijk^(m) ← P_ijk^(t) × TC_ij. As with μ_ij^(SNV), P_ijk^(t) is expressed by the main term associated with mutation categories and the residual terms associated with individual mutations, samples and cases: P_ijk^(t) ~ logistic(β_ijk^(VAF) + r_ijk^(VAF)). The main term β_ijk^(VAF) is obtained by substituting the category of mutation ijk, t_ijk, into variable β_t^(VAF), which represents the mean VAF of mutations of category t ∈ T^(VAF): β_ijk^(VAF) ← β_tijk^(VAF). β_t^(VAF) is hierarchically sampled from Cauchy and half-Cauchy distributions in the same way as β_t^(SNV). The residual term r_ijk^(VAF) is sampled from a Cauchy distribution: r_ijk^(VAF) ~ Cauchy(r_ij^(VAF), t_ij^(VAF)). r_ij^(VAF) is obtained in the same way as r_ij^(SNV), while t_ij^(VAF) is sampled from a half-Cauchy hyper-prior: t_ij^(VAF) ~ Half-Cauchy(l₂^(VAF)), where we set l₂^(VAF) = 1. For each of the mutations found in the 123 samples of the 10 PCRCs and 8 ACRCs, the numbers of total and variant reads and the mutation categories were prepared as d_ijk, b_ijk and t_ijk. For each sample, the tumor content estimated by Treeomics was used as TC_ij. The posterior distribution of β_t^(VAF) was estimated by MCMC on JAGS with the following parameter settings: number of chains = 20, number of burn-in iterations = 50,000, number of total iterations = 50,000 and thinning interval = 5. The density plot in Fig. 3c and Supplementary Fig. 9c show the distribution of the MCMC samples of β_t^(VAF) for each mutation category after logistic conversion.

Comparison of CCFs between different categories of mutations

To compare CCFs between different categories of mutations, we used a hierarchical Bayesian model similar to the one for the comparison of VAFs (Supplementary Fig. 18). Instead of P_ijk^(t)and P_ijk^(m), we introduced C_ijk and P_ijk, which represent CCF and VAF, respectively. We also added CN_ijk, which represents the absolution copy number of the locus where mutation ijk exists. VAF is then represented as follows; P_ijk ← TC_ij•C_ijk/{(1−TC_ij)•2 + TC_ij•CN_ijk}⁵³. Except for these points, the CCF model is the same as the VAF model. As input data, we prepared absolute copy numbers estimated by EXCAVATOR in addition to the input data used for the VAF model. Mutations on sex chromosomes were removed from the input to the CCF analysis. The posterior distribution of β_t^(CCF) was estimated by MCMC on JAGS with the following parameter settings: number of chains = 20, number of burn-in iterations = 200,000, number of total iterations = 200,000 and thinning interval = 5. The density plot in Supplementary Fig. 10 shows the distribution of the MCMC samples of β_t^(CCF) for each mutation category after logistic conversion.

Data availability

All WES data have been deposited in the Japanese Genotype-phenotype Archive with accession number JGAS00000000092 [https://humandbs.biosciencedbc.jp/en/hum0095-v1].

References

Gerlinger, M. et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N. Engl. J. Med. 366, 883–892 (2012).
Article PubMed PubMed Central CAS Google Scholar
Gerlinger, M. et al. Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing. Nat. Genet. 46, 225–233 (2014).
Article PubMed PubMed Central CAS Google Scholar
Yates, L. R. et al. Subclonal diversification of primary breast cancer revealed by multiregion sequencing. Nat. Med. 21, 751–759 (2015).
Article PubMed PubMed Central CAS Google Scholar
Stachler, M. D. et al. Paired exome analysis of Barrett’s esophagus and adenocarcinoma. Nat. Genet. 47, 1047–1055 (2015).
Article PubMed PubMed Central CAS Google Scholar
Murugaesu, N. et al. Tracking the genomic evolution of esophageal adenocarcinoma through neoadjuvant chemotherapy. Cancer Discov. 5, 821–831 (2015).
Article PubMed PubMed Central CAS Google Scholar
Zhang, J. et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science 346, 256–259 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
de Bruin, E. C. et al. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science 346, 251–256 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Schwarz, R. F. et al. Spatial and temporal heterogeneity in high-grade serous ovarian cancer: a phylogenetic analysis. PLoS Med. 12, e1001789 (2015).
Article PubMed PubMed Central CAS Google Scholar
Haffner, M. C. et al. Tracking the clonal origin of lethal prostate cancer. J. Clin. Invest. 123, 4918–4922 (2013).
Article PubMed PubMed Central CAS Google Scholar
Gundem, G. et al. The evolutionary history of lethal metastatic prostate cancer. Nature 520, 353–357 (2015).
Article PubMed PubMed Central CAS Google Scholar
Makohon-Moore, A. P. et al. Limited heterogeneity of known driver gene mutations among the metastases of individual patients with pancreatic cancer. Nat. Genet. 49, 358–366 (2017).
Article PubMed PubMed Central CAS Google Scholar
Fearon, E. R. & Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell 61, 759–767 (1990).
Article PubMed CAS Google Scholar
Uchi, R. et al. Integrated multiregional analysis proposing a new model of colorectal cancer evolution. PLoS Genet. 12, e1005778 (2016).
Article PubMed PubMed Central CAS Google Scholar
Sottoriva, A. et al. A Big Bang model of human colorectal tumor growth. Nat. Genet. 47, 209–216 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ling, S. et al. Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution. Proc. Natl Acad. Sci. USA 112, E6496–E6505 (2015).
Article PubMed CAS Google Scholar
Williams, M. J. et al. Identification of neutral tumor evolution across cancer types. Nat. Genet. 48, 238–244 (2016).
Article PubMed PubMed Central CAS Google Scholar
Bozic, I., Gerold, J. M. & Nowak, M. A. Quantifying clonal and subclonal passenger mutations in cancer evolution. PLoS Comput. Biol. 12, e1004731 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Lawrence, M. S. et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature 505, 495–501 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Cancer Genome Atlas, N. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).
Article ADS CAS Google Scholar
Sakai, E. et al. TP53 mutation at early stage of colorectal cancer progression from two types of laterally spreading tumors. Cancer Sci. 107, 820–827 (2016).
Article PubMed PubMed Central CAS Google Scholar
Reiter, J. G. et al. Reconstructing metastatic seeding patterns of human cancers. Nat. Commun. 8, 14114 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Graham, T. A. & Sottoriva, A. Measuring cancer evolution from the genome. J. Pathol. 241, 183–191 (2017).
Article PubMed Google Scholar
Meijer, G. A. et al. Progression from colorectal adenoma to carcinoma is associated with non-random chromosomal gains as detected by comparative genomic hybridisation. J. Clin. Pathol. 51, 901–909 (1998).
Article PubMed PubMed Central CAS Google Scholar
Diep, C. B. et al. The order of genetic events associated with colorectal cancer progression inferred from meta-analysis of copy number changes. Genes Chromosomes Cancer 45, 31–41 (2006).
Article PubMed CAS Google Scholar
Postma, C. et al. Chromosomal instability in flat adenomas and carcinomas of the colon. J. Pathol. 205, 514–521 (2005).
Article PubMed CAS Google Scholar
Wang, H., Liang, L., Fang, J. Y. & Xu, J. Somatic gene copy number alterations in colorectal cancer: new quest for cancer drivers and biomarkers. Oncogene 35, 2011–2019 (2016).
Article PubMed CAS Google Scholar
Carvalho, B. et al. Multiple putative oncogenes at the chromosome 20q amplicon contribute to colorectal adenoma to carcinoma progression. Gut 58, 79–89 (2009).
Article PubMed CAS Google Scholar
De Angelis, P. M., Clausen, O. P., Schjolberg, A. & Stokke, T. Chromosomal gains and losses in primary colorectal carcinomas detected by CGH and their associations with tumour DNA ploidy, genotypes and phenotypes. Br. J. Cancer 80, 526–535 (1999).
Article PubMed PubMed Central Google Scholar
Takahashi, Y. et al. The AURKA/TPX2 axis drives colon tumorigenesis cooperatively with MYC. Ann. Oncol. 26, 935–942 (2015).
Article PubMed CAS Google Scholar
Kim, T. M. et al. Clonal origins and parallel evolution of regionally synchronous colorectal adenoma and carcinoma. Oncotarget 6, 27725–27735 (2015).
PubMed PubMed Central Google Scholar
Kang, H. et al. Many private mutations originate from the first few divisions of a human colorectal adenoma. J. Pathol. 237, 355–362 (2015).
Article PubMed PubMed Central CAS Google Scholar
Sievers, C. K. et al. Subclonal diversity arises early even in small colorectal tumours and contributes to differential growth fates. Gut, https://doi.org/10.1136/gutjnl-2016-312232 (2016).
Losi, L. Evolution of intratumoral genetic heterogeneity during colorectal cancer progression. Carcinogenesis 26, 916–922 (2004).
Article CAS Google Scholar
Matano, M. et al. Modeling colorectal cancer using CRISPR-Cas9-mediated engineering of human intestinal organoids. Nat. Med. 21, 256–262 (2015).
Article PubMed CAS Google Scholar
Gao, R. et al. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nat. Genet. 48, 1119–1130 (2016).
Article PubMed PubMed Central CAS Google Scholar
Notta, F. et al. A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature 538, 378–382 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Aktipis, C. A. et al. Life history trade-offs in cancer evolution. Nat. Rev. Cancer 13, 883–892 (2013).
Article PubMed PubMed Central CAS Google Scholar
Khong, H. T. & Restifo, N. P. Natural selection of tumor variants in the generation of “tumor escape” phenotypes. Nat. Immunol. 3, 999–1005 (2002).
Article PubMed PubMed Central CAS Google Scholar
Faltas, B. M. et al. Clonal evolution of chemotherapy-resistant urothelial carcinoma. Nat. Genet. 48, 1490–1499 (2016).
Article PubMed PubMed Central CAS Google Scholar
Savas, P. et al. The subclonal architecture of metastatic breast cancer: results from a prospective community-based rapid autopsy program “CASCADE”. PLoS Med. 13, e1002204 (2016).
Article PubMed PubMed Central CAS Google Scholar
Morrissy, A. S. et al. Divergent clonal selection dominates medulloblastoma at recurrence. Nature 529, 351–357 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Melchardt, T. et al. Clonal evolution in relapsed and refractory diffuse large B-cell lymphoma is characterized by high dynamics of subclones. Oncotarget 7, 51494–51502 (2016).
Article PubMed PubMed Central Google Scholar
Takatsuno, Y. et al. The rs6983267 SNP is associated with MYC transcription efficiency, which promotes progression and worsens prognosis of colorectal cancer. Ann. Surg. Oncol. 20, 1395–1402 (2013).
Article PubMed Google Scholar
Shiraishi, Y. et al. A comprehensive characterization of cis-acting splicing-associated variants in human cancer. Preprint at https://www.biorxiv.org/content/early/2017/09/28/162560 (2017).
Shiraishi, Y. et al. An empirical Bayesian framework for somatic mutation detection from cancer genome sequencing data. Nucleic Acids Res. 41, e89 (2013).
Article PubMed PubMed Central CAS Google Scholar
Yoshizato, T. et al. Somatic mutations and clonal hematopoiesis in aplastic anemia. N. Engl. J. Med. 373, 35–47 (2015).
Article PubMed CAS Google Scholar
Magi, A. et al. EXCAVATOR: detecting copy number variants from whole-exome sequencing data. Genome Biol. 14, R120 (2013).
Article PubMed PubMed Central Google Scholar
Shen, R. & Seshan, V. E. FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing. Nucleic Acids Res. 44, e131 (2016).
Article PubMed PubMed Central CAS Google Scholar
Favero, F. et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 26, 64–70 (2015).
Article PubMed CAS Google Scholar
Gelman, A. Prior distributions for variance parameters in hierarchical models (Comment on an Article by Browne and Draper). Bayesian Anal. 1, 515–533 (2006).
Article MathSciNet MATH Google Scholar
Plummer, M. JAGS: a program for analysis of Bayesian graphical models using Gibbs sampling. Proc. 3rd International Workshop on Distributed Statistical Computing 124, 1–10 (2003).
Gelman, A. & Rubin, Donald B. Inference from iterative simulation using multiple sequences. Stat. Sci. 7, 457–472 (1992).
Article MATH Google Scholar
McGranahan, N. et al. Clonal status of actionable driver events and the timing of mutational processes in cancer evolution. Sci. Transl. Med. 7, 283ra254 (2015).
Article Google Scholar

Download references

Acknowledgements

This project was supported by JSPS KAKENHI (16K19107, 15H05707), Grant-in-Aid for Scientific Research on Innovative Areas (15H05912), Priority Issue on Post-K computer (hp170227, hp160219), and Research Grant of the Princess Takamatsu Cancer Research Fund, and partially supported by the Project for Cancer Research and Therapeutic Evolution (P-CREATE) and Practical Research for Innovative Cancer Control from Japan Agency for Medical Research and development, AMED. This research used the supercomputing resource provided by the Human Genome Center, Institute of Medical Science, University of Tokyo (http://sc.hgc.jp/shirokane.html). We thank the members of Department of Gastroenterology of Oita University and Department of Surgery of Kyushu University Beppu Hospital for sample collection and useful discussion; R. Yoshida for useful discussion; and K. Oda, M. Kasagi, S. Sakuma, N. Mishima, and T. Kawano for their assistance.

Author information

Authors and Affiliations

Department of Surgery, Kyushu University Beppu Hospital, 4546 Tsurumihara, Beppu, 874-0838, Japan
Tomoko Saito, Ryutaro Uchi, Hidenari Hirata, Hisateru Komatsu, Shotaro Sakimura, Sho Nambara, Yosuke Kuroda, Shuhei Ito, Hidetoshi Eguchi, Takaaki Masuda, Keishi Sugimachi & Koshi Mimori
Department of Gastroenterology, Oita University Hospital, 1-1 Idaigaoka, Yufu, 879-5593, Japan
Tomoko Saito, Masaaki Kodama, Tadayoshi Okimoto, Kazuhiro Mizukami, Ryo Ogawa, Kazuhisa Okamoto, Mitsutaka Shuto, Kensuke Fukuda & Kazunari Murakami
Division of Health Medical Computational Science, Health Intelligence Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
Atsushi Niida & Satoru Miyano
Laboratory of DNA Information Analysis, Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
Shuto Hayashi, Kenichi Chiba, Yuichi Shiraishi & Satoru Miyano
Department of Pathology, Kyushu University Beppu Hospital, 4546 Tsurumihara, Beppu, 874-0838, Japan
Taro Tobo
Department of Diagnostic Pathology, Oita University Hospital, 1-1 Idaigaoka, Yufu, 879-5593, Japan
Haruto Nishida & Tsutomu Daa
Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Yoshida-Konoe-cho, Kyoto-shi Sakyo-ku, Kyoto, 606-8501, Japan
Tetsuichi Yoshizato & Seishi Ogawa
Division of Systems Biology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, Nagoya, 466-8550, Japan
Yusuke Matsui & Teppei Shimamura
Division of Health Medical Data Science, Health Intelligence Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
Takanori Hasegawa
Department of Gastroenterological Surgery, Graduate School of Medicine, Osaka University, 2-2 Yamadaoka, Suita, 565-0871, Japan
Yuichiro Doki & Masaki Mori
Gastroenterological Center, Department of Gastroenterological Surgery, Cancer Institute Hospital, Japanese Foundation for Cancer Research, 3-8-31 Ariake, Koto, Tokyo, 135-8550, Japan
Satoshi Nagayama
Department of Surgery, Takano Hospital, 4-2-88 Obiyama, Chuo-ku, Kumamoto, 862-0924, Japan
Kazutaka Yamada
Department of Bioinformatics, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan
Mamoru Kato
Division of Cancer Genomics, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan
Tatsuhiro Shibata
Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan
Tatsuhiro Shibata
Genome Science Division, Research Center for Advanced Science and Technology (RCAST), The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 153-8904, Japan
Hiroyuki Aburatani
Laboratory of Systems Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba, 277-8561, Japan
Yutaka Suzuki

Authors

Tomoko Saito
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Niida
View author publications
You can also search for this author in PubMed Google Scholar
Ryutaro Uchi
View author publications
You can also search for this author in PubMed Google Scholar
Hidenari Hirata
View author publications
You can also search for this author in PubMed Google Scholar
Hisateru Komatsu
View author publications
You can also search for this author in PubMed Google Scholar
Shotaro Sakimura
View author publications
You can also search for this author in PubMed Google Scholar
Shuto Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Sho Nambara
View author publications
You can also search for this author in PubMed Google Scholar
Yosuke Kuroda
View author publications
You can also search for this author in PubMed Google Scholar
Shuhei Ito
View author publications
You can also search for this author in PubMed Google Scholar
Hidetoshi Eguchi
View author publications
You can also search for this author in PubMed Google Scholar
Takaaki Masuda
View author publications
You can also search for this author in PubMed Google Scholar
Keishi Sugimachi
View author publications
You can also search for this author in PubMed Google Scholar
Taro Tobo
View author publications
You can also search for this author in PubMed Google Scholar
Haruto Nishida
View author publications
You can also search for this author in PubMed Google Scholar
Tsutomu Daa
View author publications
You can also search for this author in PubMed Google Scholar
Kenichi Chiba
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Shiraishi
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuichi Yoshizato
View author publications
You can also search for this author in PubMed Google Scholar
Masaaki Kodama
View author publications
You can also search for this author in PubMed Google Scholar
Tadayoshi Okimoto
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Mizukami
View author publications
You can also search for this author in PubMed Google Scholar
Ryo Ogawa
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhisa Okamoto
View author publications
You can also search for this author in PubMed Google Scholar
Mitsutaka Shuto
View author publications
You can also search for this author in PubMed Google Scholar
Kensuke Fukuda
View author publications
You can also search for this author in PubMed Google Scholar
Yusuke Matsui
View author publications
You can also search for this author in PubMed Google Scholar
Teppei Shimamura
View author publications
You can also search for this author in PubMed Google Scholar
Takanori Hasegawa
View author publications
You can also search for this author in PubMed Google Scholar
Yuichiro Doki
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Nagayama
View author publications
You can also search for this author in PubMed Google Scholar
Kazutaka Yamada
View author publications
You can also search for this author in PubMed Google Scholar
Mamoru Kato
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuhiro Shibata
View author publications
You can also search for this author in PubMed Google Scholar
Masaki Mori
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Aburatani
View author publications
You can also search for this author in PubMed Google Scholar
Kazunari Murakami
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Seishi Ogawa
View author publications
You can also search for this author in PubMed Google Scholar
Satoru Miyano
View author publications
You can also search for this author in PubMed Google Scholar
Koshi Mimori
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T. Saito: study design, all sample preparation, data analysis, and manuscript writing. N.A.: study design, project management and manuscript writing. S. Nambara, Y.K., S.I., H.E., K.S., T.M., M. Kodama, T.O., K. Mizukami, R.O., K.O., M.S., and K.F.: sample collection. R.U., H.H., H.K., S.S., Y.M., T. Shimamura: data analysis. S.H. and T.H.: statistical modeling. T.T., H.N., and T.D.: histopathological diagnosis. K.C., Y. Shiraishi, and S.M.: software development. T.Y.: validation production. M. Kato, Y.D., S. Nagayama, K.Y., T. Shibata, M.M., H.A., and S.O.: study design. Y. Suzuki: sequence data assembly. K. Murakami: study design and sample collection. K. Mimori: project leading, study design, and final approval of the article.

Corresponding author

Correspondence to Koshi Mimori.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Saito, T., Niida, A., Uchi, R. et al. A temporal shift of the evolutionary principle shaping intratumor heterogeneity in colorectal cancer. Nat Commun 9, 2884 (2018). https://doi.org/10.1038/s41467-018-05226-0

Download citation

Received: 01 April 2018
Accepted: 22 June 2018
Published: 23 July 2018
DOI: https://doi.org/10.1038/s41467-018-05226-0

This article is cited by

Mutated genes on ctDNA detecting postoperative recurrence presented reduced neoantigens in primary tumors in colorectal cancer cases
- Satoshi Nagayama
- Yuta Kobayashi
- Koshi Mimori
Scientific Reports (2023)
Unraveling the Drivers of Tumorigenesis in the Context of Evolution: Theoretical Models and Bioinformatics Tools
- Xunuo Zhu
- Wenyi Zhao
- Xun Gu
Journal of Molecular Evolution (2023)
Convergent genomic diversity and novel BCAA metabolism in intrahepatic cholangiocarcinoma
- Akihiro Kitagawa
- Tsuyoshi Osawa
- Koshi Mimori
British Journal of Cancer (2023)
Subclonal accumulation of immune escape mechanisms in microsatellite instability-high colorectal cancers
- Yuta Kobayashi
- Atsushi Niida
- Koshi Mimori
British Journal of Cancer (2023)
Somatic targeted mutation profiling of colorectal cancer precursor lesions
- Wellington dos Santos
- Mariana Bisarro dos Reis
- Denise Peixoto Guimarães
BMC Medical Genomics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.