Mismatch repair deficiency is not sufficient to elicit tumor immunogenicity

Westcott, Peter M. K.; Muyas, Francesc; Hauck, Haley; Smith, Olivia C.; Sacks, Nathan J.; Ely, Zackery A.; Jaeger, Alex M.; Rideout, William M.; Zhang, Daniel; Bhutkar, Arjun; Beytagh, Mary C.; Canner, David A.; Jaramillo, Grissel C.; Bronson, Roderick T.; Naranjo, Santiago; Jin, Abbey; Patten, J. J.; Cruz, Amanda M.; Shanahan, Sean-Luc; Cortes-Ciriano, Isidro; Jacks, Tyler

doi:10.1038/s41588-023-01499-4

Download PDF

Article
Open access
Published: 14 September 2023

Mismatch repair deficiency is not sufficient to elicit tumor immunogenicity

Nature Genetics volume 55, pages 1686–1695 (2023)Cite this article

26k Accesses
14 Citations
431 Altmetric
Metrics details

Subjects

Abstract

DNA mismatch repair deficiency (MMRd) is associated with a high tumor mutational burden (TMB) and sensitivity to immune checkpoint blockade (ICB) therapy. Nevertheless, most MMRd tumors do not durably respond to ICB and critical questions remain about immunosurveillance and TMB in these tumors. In the present study, we developed autochthonous mouse models of MMRd lung and colon cancer. Surprisingly, these models did not display increased T cell infiltration or ICB response, which we showed to be the result of substantial intratumor heterogeneity of mutations. Furthermore, we found that immunosurveillance shapes the clonal architecture but not the overall burden of neoantigens, and T cell responses against subclonal neoantigens are blunted. Finally, we showed that clonal, but not subclonal, neoantigen burden predicts ICB response in clinical trials of MMRd gastric and colorectal cancer. These results provide important context for understanding immune evasion in cancers with a high TMB and have major implications for therapies aimed at increasing TMB.

FOXO1 is a master regulator of memory programming in CAR T cells

Article Open access 10 April 2024

Personalized neoantigen vaccine and pembrolizumab in advanced hepatocellular carcinoma: a phase 1/2 trial

Article Open access 07 April 2024

The present and future of bispecific antibodies for cancer therapy

Article 06 March 2024

Main

Immunotherapy has revolutionized the treatment landscape of many cancers, particularly those with a high tumor mutational burden (TMB)^1,2,3,4,5. Somatic mutations can generate neoantigens capable of eliciting tumor-specific T cell responses^6,7, and it is widely believed that increased TMB renders tumors susceptible to immune attack after ICB treatment. Indeed, multiple pan-cancer meta-analyses have shown that TMB is one of the strongest predictors of ICB response^8,9,10, leading to approval by the US Food and Drug Administration (FDA) of pembrolizumab (anti-PD-1) for all tumors based on high TMB alone. In particular, MMRd is associated with some of the highest TMBs observed in cancer (hypermutation)^{11,12,13,14,15} and remarkable response rates to pembrolizumab^2,5. However, more than half the patients with MMRd tumors do not durably respond and TMB does not stratify responders^2,5. This underscores a critical need to understand what factors beyond TMB mediate efficacy. There are also conflicting studies suggesting that TMB is an imperfect biomarker of immunotherapy response¹⁶ and argue that FDA approval of pembrolizumab based on TMB alone may be too broad¹⁷. Specifically, TMB provides limited to no additional predictive value within specific subsets of cancer known to have high rates of response to ICB, like those with MMRd^2,5.

One factor that may dilute the power of TMB as a biomarker of ICB response is intratumor heterogeneity (ITH) of mutations or, simply defined, the fraction of mutations that are subclonal—present in a minority of tumor cells. Subclonal neoantigens could be deleted with minimal impact on tumor fitness¹⁸ or fail to elicit productive T cell responses¹⁹. Indeed, it has been observed in human cancer that ITH is associated with decreased T cell infiltration^20,21 and poor survival^22,23, whereas clonal neoantigens are predictive of response to ICB^23,24,25. This concept has been exemplified in an experimental setting of ultraviolet light B (UVB)-induced hypermutation of melanoma cell lines²⁶. It is reasonable to hypothesize that similar mechanisms are operating in MMRd cancers given their constitutive mutational instability. Although previous studies showed that MMRd mutagenesis in vitro renders cell lines immunogenic^27,28, it is unclear what the impact is of MMR loss in vivo in the presence of immunosurveillance, a process that exquisitely shapes tumor immunogenicity^6,29,30. To address these questions, we developed autochthonous mouse models of sporadic MMRd lung and colon cancer via targeted ablation of key genes in the MMR complex, MutL homolog 1 (Mlh1) and MutS homologs 2, 3 and 6 (Msh2, Msh3 and Msh6), and performed preclinical trials to determine ICB sensitivity.

Modeling sporadic MMRd in cancer

We adapted the autochthonous mouse model of lung cancer developed in our laboratory³¹ by breeding in a Cas9-expressing allele (Kras^LSL-G12D; Trp53^flox/flox; R26^LSL-Cas9 (KPC)) or conditional Msh2 knockout allele³² (Kras^LSL-G12D;Trp53^flox/flox;Msh2^flox/flox (KPM)). Intratracheal delivery of lentivirus-expressing Cre and an Msh2-targeting single guide (sg)RNA (sgMsh2) into the former or alveolar type II, cell-specific, adenovirus-expressing Cre into the latter induced lung adenocarcinomas with efficient MSH2 knockout (Fig. 1a–c and Extended Data Fig. 1a–e). We also adapted an endoscope-guided submucosal injection technique³³ to deliver lentivirus with sgRNAs targeting the colon tumor suppressor, Apc, in tandem with Msh2, Mlh1, Msh3 or Msh6, into the distal colon of mice with constitutive Cas9 expression (Fig. 1b). This efficiently induced focal colon adenomas with MMR gene knockout (Fig. 1d and Extended Data Fig. 1f–i).

**Fig. 1: Development of flexible in vivo models of DNA MMRd lung and colon cancer.**

To confirm mutation of MMR genes and investigate the degree of TMB, we performed whole-exome sequencing (WES) on micro-dissected tumors at 16–20 weeks post-initiation, including 26 sgMsh2- and -3- control (sgCtl)-targeted KPC lung tumors, 15 and 6 SPC-Cre-targeted KPM and KP (Msh2 wild-type) lung tumors and 5 sgMsh2-, 6 sgMlh1-, 2 sgMsh6-, 6 sgMsh3- and 5 sgCtl-targeted colon tumors (Supplementary Table 1). Targeted sequencing of MMR genes in these tumors confirmed a preponderance of frame-shifting insertions/deletions (indels) (Supplementary Table 1). KPM, sgMsh2-, sgMlh1- and sgMsh6-targeted lung and colon tumors showed increased burden of somatic single-nucleotide variants (SNVs), indels and microsatellite instability (MSI) scores (MSIsensor³⁴), whereas sgMsh3-targeted colon tumors showed elevated levels of indels only (Fig. 1e,f and Extended Data Fig. 1j–m). Indels across all MMRd tumors were predominantly single nucleotide and enriched at homopolymer repeat microsatellites (Fig. 1f and Extended Data Fig. 1n). Mutational patterns were highly consistent with those observed in human MMRd colon cancer³⁵ (cosine similarity of 0.9 and 0.8), as determined by decomposition of mutational spectra into individual signatures from the Catalogue of Somatic Mutations in Cancer (COSMIC)^36,37 (Fig. 1g,h and Extended Data Fig. 1n–q). The MMRd signature (SBS-MMRd) comprised 92%, 78% and 75% of all SNVs in the mouse MMRd colon and lung tumor models and human MMRd colon cancer, respectively (Fig. 1f). It is interesting that one lung tumor had a Ser415Arg mutation in the exonuclease domain of DNA polymerase epsilon (Pole) and, consequently, a much higher TMB (Extended Data Fig. 1j).

We also performed WES on cell lines derived from sgMsh2-targeted lung tumors. Unexpectedly, these showed on average a tenfold greater TMB than sgMsh2-targeted lung tumors (Fig. 1e and Extended Data Fig. 1k), suggesting low tumor purity (unlikely, because Msh2^KO lung tumors were on average 72% pure), mutagenesis in culture or substantial ITH reduced by clonal selection on plastic. Arguing against extensive mutagenesis in vitro, WES of a single-cell clone revealed more than double the TMB of the parental line, despite a much smaller increase in TMB after 20 passages (Extended Data Fig. 1r). Consistent with high ITH, more sequencing reads supporting mutations private to the single clone were found in the sequencing data of the parental compared with the unrelated control line (1,685 versus 451), albeit at variant allele frequencies (VAFs) below the threshold of mutation calling (Extended Data Fig. 1s,t), as previously observed in hypermutated gliomas³⁸.

Altogether, these results establish the utility of our models to recapitulate fundamental mutational processes underlying hypermutation in MMRd human cancer. The lower clonal TMB that we observed is probably due to neutral evolution in the absence of major selective bottlenecks, a feature of ablating MMR concomitantly with tumor initiation in our models. Although these models cannot capture the accumulation of mutations or clonal evolution over decades in human cancer, they are uniquely suited to study the role of ITH in immune dysfunction of cancers with high and low prevalence of MMRd alike. Importantly, ITH is associated with aggressive disease and decreased ICB response in humans^22,23,24,25, but it remains unclear how ITH impacts the immune response in MMRd cancers specifically.

Sporadic MMRd in the mouse is not immunogenic

Next, we sought to determine the effects of sporadic MMRd on tumorigenesis in our models. Neither KPC nor KPM Msh2^KO models showed a significant difference in overall tumor burden or grade at either timepoint (Fig. 2a–d and Extended Data Fig. 2b,c). Notably, there was no difference in tumor infiltration by T cells (CD3⁺) in the sgMsh2-targeted model (Extended Data Fig. 2d,e) or infiltration by cytotoxic (CD8⁺), helper (CD4⁺) or regulatory (CD4⁺FOXP3⁺) T cells within tumors (Fig. 2e–g) or whole lungs (Extended Data Fig. 2g,h) in the KPM model at either timepoint. Similar to the lung, tumors in the colon model showed no difference in growth kinetics with Msh2 targeting (Extended Data Fig. 2i).

**Fig. 2: MMRd models of lung and colon cancer are not immunogenic.**

To determine the sensitivity of these models to immunotherapy, we first performed preclinical trials with ICB (anti-CTLA-4/anti-PD-1) in the KPM model. We followed similar dosing as established in seminal preclinical studies that preceded the first human clinical trials of anti-CTLA-4 (ref. ³⁹) and anti-PD-1 (ref. ⁴⁰), although we continued treatment for longer (4 weeks). In addition, we included treatment with the chemotherapeutic combination of oxaliplatin and low-dose cyclophosphamide (Oxa/Cyc) alone and in combination with ICB (Fig. 2h–i), because Oxa/Cyc has been shown to synergize with ICB^41,42. To our surprise, no significant differences were observed between KPM and KP mice across all treatment arms, in both longitudinal change (Fig. 2j) and final tumor burden at necropsy (Fig. 2k and Extended Data Fig. 2j). Consistent with a lack of immunogenicity, we observed no differences in tumor grade or burden at 16 weeks between KPM and KP mice treated continuously with CD4⁺ and CD8⁺ T cell-depleting antibodies (αCD4/8) (Extended Data Fig. 2k–p). These results were not unique to the lung, because ICB treatment failed to induce any responses in sgMsh2-targeted colon tumors (Fig. 2l,m and Extended Data Fig. 2q,r). Likewise, there was no significant difference in endpoint size of sgCtl- versus sgMsh2-targeted colon tumors after ICB or continuous αCD4/8 treatment (Fig. 2n). Altogether, these data demonstrate that MMRd in these models is not sufficient to increase immunogenicity or sensitivity of tumors to ICB, alone or in combination with chemotherapy, in stark contrast to previous reports in cell-line transplant models^27,28.

MMRd drives mutational heterogeneity

To assess the clonal composition of mutations in our models, we estimated cancer cell fractions (CCFs)⁴³. TMB was predominantly subclonal and most mutations were present in less than a quarter of cells in MMRd tumors from lung and colon (Fig. 3a,b and Extended Data Fig. 3a–c). These mutations also adhered perfectly to a theoretical model of neutral evolution of subclonal mutations in cancer⁴⁴ (Fig. 3c,d), consistent with the absence of selective events after tumor initiation in our models. To investigate ITH more deeply, we performed WES on eight single-cell clones derived from an sgMsh2-targeted lung tumor cell line (09-2). Importantly, before subcloning, we restored MMR by re-expressing Msh2 on a bicistronic lentivirus conferring puromycin resistance (Fig. 3e). Of note, these clones (M1–8) maintained MSH2 expression and showed stable mutational and clonal architecture after 20 passages in the presence of puromycin (Extended Data Fig. 3d–f). Nevertheless, significantly more somatic mutations were called in all clones than in the parental line (Fig. 3f) and these mutations were not broadly shared across clones (Extended Data Fig. 3g), supporting the notion that ITH is underestimated by bulk sequencing methods³⁸. Phylogenetic analysis of clones further confirmed a considerable level of ITH (Fig. 3g).

**Fig. 3: MMRd models are defined by extensive ITH.**

Given that ITH in our models arose in vivo, we sought to understand the role of immunoediting^6,29,30 in shaping this process. We performed WES on micro-dissected Msh2^KO lung tumors (weeks 16–20) from animals continuously depleted of T cells (n = 34) or treated with ICB (n = 12). Continuous T cell depletion had no significant effect on overall TMB or tumor neoantigen burden (TNB), consistent with its lack of effect on tumor progression (Extended Data Fig. 2k–p). However, stratifying mutations by CCF revealed a significant increase in clonal (CCF ≥ 0.75), but not subclonal (CCF ≤ 0.5), TMB and TNB (Fig. 3h,i and Extended Data Fig. 3h,i). T cell-depleted tumors also showed a striking difference in CCF distribution and median CCF of expressed neoantigens, with significant enrichment of clonal neoantigens (Fig. 3j,k and Extended Data Fig. 3j). We observed similar results in the colon Msh2^KO model (Extended Data Fig. 3k–n). It is interesting that tumors from ICB-treated animals showed significantly lower subclonal, but not clonal, TMB and TNB, suggesting possible simultaneous loss of subclonal populations and selective expansion of others in this context of ICB nonresponse (Fig. 3h,i and Extended Data Fig. 3h,i). Altogether, these results argue that neoantigens with high, but not low, clonal fraction are negatively selected by the adaptive immune system during tumor progression. By extension, immunosurveillance in mutationally unstable tumors promotes ITH by selectively pruning clonal neoantigens and thereby increasing the relative fraction of subclonal neoantigens. ICB treatment may lower the CCF threshold for T cell-mediated elimination, but probably also drives expansion of less immunogenic subclones that maintain an overall state of ITH in nonresponders.

Intratumor heterogeneity enables immune evasion

To evaluate the impact of ITH, specifically tumor cell clonality, on immunogenicity in our models, we assessed survival of animals after orthotopic transplantation of the lung tumor cell lines and clones described above. The parental MSH2 knockout lines and low TMB control line were similarly nonimmunogenic, showing no difference in disease progression with and without continuous T cell depletion (Fig. 4a). This is consistent with the lack of immunogenicity of tumors induced by chemical carcinogens in immunocompetent animals and the principle of immunoediting^6,29. Unlike MMRd in the autochthonous model, however, mice transplanted with the parental Msh2^KO line (09-2) or an equal mixture of 09-2-derived Msh2^KO clones (M1–8) and treated with ICB showed 20–30% durable responses and reduced hazard ratios (HRs) (0.34–0.50) over 30 weeks (Fig. 4b). Mice transplanted individually with five otherwise nonimmunogenic Msh2^KO clones (M1, M2, M4, M5 and M6) and treated with ICB showed 30–75% durable responses and even further reduction in HRs (0.05–0.32) over 30 weeks (Fig. 4b). It is interesting that the other three Msh2^KO clones (M3, M7 and M8) were strongly immunogenic even without ICB treatment, efficiently driving disease only with T cell depletion (Fig. 4b). These differences were not the result of tumor intrinsic growth rates or loss of antigen presentation, because all clones showed similar in vitro growth kinetics and readily expressed major histocompatibility complex class I (MHC-I) (H-2K^b, H-2D^b) and programmed death-ligand 1 (PD-L1) on stimulation with interferon-γ (IFN-γ; Extended Data Fig. 4a–e). Although M3, M7 and M8 did not show increased TMB/TNB relative to the other clones, they did express on average higher H-2K^b/H-2D^b with IFN-γ stimulation, particularly the most immunogenic clone, M3, although this was not significant (Extended Data Fig. 4b,c). It is therefore possible that the higher baseline immunogenicity of these clones is due in part to greater surface presentation of neoantigens. Consistent with MMRd-derived neoantigens underlying the immunogenicity and ICB responsiveness of the clones, mice transplanted individually with single-cell clones derived from the low TMB control line (13-1) were uniformly and completely unresponsive to ICB (Extended Data Fig. 4f). Altogether, these results establish a model of MMRd wherein increasing mutational clonality (autochthonous tumors < cell lines < mixture of clones < clones) correlates with immunogenicity and ICB response.

**Fig. 4: ITH enables immune evasion of MMRd tumors.**

Given that the parental cell line 09-2 arose in an immunocompetent host yet contained highly immunogenic subclones (M3, M7 and M8), we reasoned that tumors evade immunosurveillance not only by selective outgrowth of nonimmunogenic subclones but passively through the failure of immunity to eliminate otherwise immunogenic tumor cells present at a low clonal fraction. To test this hypothesis, we collected lung tumors and metastases from mice orthotopically transplanted with an equal mixture of clones M1–8 and reconstructed their clonal makeup by ultradeep, targeted amplicon sequencing of unique clone-defining SNVs (Methods, Fig. 4c and Extended Data Fig. 4g,h). Despite the immunogenicity of M3, M7 and M8, these clones were detected in 12 of 20 tumors analyzed from immunocompetent animals at clonal fractions ≥1%. Of these 12 animals 5 also developed liver metastases, 3 of which were clonal outgrowths of an immunogenic clone (M8) (Extended Data Fig. 4h). In contrast, none of the immunocompetent animals transplanted with M3, M7 or M8 alone formed metastases. This pattern was not substantially different to that of transplants in continuously T cell-depleted or ICB-treated mice. However, tumors from T cell-depleted mice on average showed significantly lower clonal diversity with greater dominance by individual clones (Fig. 4c,d). In addition, tumors from immunocompetent animals harboring one or more immunogenic clones (M3, M7 or M8) at ≥1% fraction were significantly more heterogeneous than tumors without these clones present (Fig. 4e). These results are in agreement with experiments in the autochthonous models (Fig. 3h–k and Extended Data Fig. 3h–n) and further support a revised understanding of immunosurveillance as a process that strongly selects against clonal, but not subclonal, neoantigens.

Neoantigen clonality tunes the T cell response

To gain further mechanistic understanding of how T cells shape clonal architecture, we first profiled the MHC-I immunopeptidome⁴⁵ of M1–8 clones (Supplementary Table 2). Using quantitative tandem mass spectrometry (MS–MS), we identified seven neoepitopes with relative abundance patterns consistent with the associated mutations in the clones. Of these, five neoepitopes (71%) exhibited immunogenicity in normal mice after dendritic cell prime/boost/boost vaccination, by IFN-γ enzyme-linked immune adsorbent spot (ELISpot) and/or MHC-I:epitope tetramer staining of splenocytes (Extended Data Fig. 5a–e). One neoepitope unique to the M5 clone, QAYAFLQHL, elicited a higher-magnitude, antigen-specific CD8⁺ T cell response than the highly immunogenic model neoantigen SIINFEKL, from chicken ovalbumin⁴⁶ (Extended Data Fig. 5d). Although MS immunopeptidomics is far from exhaustive⁴⁷, our identification of five bona fide immunogenic neoepitopes supports the notion that neoantigens underlie the enhanced immunogenicity of Msh2^KO clones (Fig. 4b and Extended Data Fig. 4f).

Next, we transplanted the M5 clone into the lungs of syngeneic mice at different clonal fractions, diluted with the other M1–8 clones, and analyzed the M5-specific CD8⁺ T cell response using flow cytometry with MHC-I:QAYAFLQHL dextramers (Fig. 5a). Consistent with sensitivity of the M5 clone to ICB (Fig. 4b), a robust QAYAFLQHL-specific T cell response was induced in lungs and mediastinal draining lymph nodes (mLNs) of clonally transplanted mice after 2 weeks of ICB treatment. As the clonal fraction and total number of M5 cells were decreased, however, the magnitude of this response also decreased (Fig. 5b–d and Extended Data Fig. 5f–h). Similar results were obtained in analogous experiments with the M2 clone using MHC-I:AALQNAVTF tetramers to analyze M2-specific CD8⁺ T cells (Extended Data Fig. 5i–k). Surprisingly, the quality of the QAYAFLQHL-specific T cell response also decreased with lower clonal fraction, with a significantly smaller percentage of QAYAFLQHL-specific T cells expressing the major effector protease of cytotoxic T cells, granzyme B (GZMB) and a significantly greater percentage expressing TCF1 (Fig. 5e–i and Extended Data Fig. 5l–p). Expressed highly in naive and early activated T cells, the transcription factor TCF1 is lost during effector differentiation⁴⁸. We have previously shown that TCF1⁺GZMB⁻ tumor-specific T cells are enriched and characteristic of tolerogenic dysfunction in a model of microsatellite stable colon cancer⁴⁹. It is interesting that transplantation of an equal number of M5 cells (100,000) at clonality versus CCF of 0.5 (with 100,000 M1–4,6–8 cells) resulted in no significant difference in magnitude, GZMB⁺ or TCF⁺ percentage of the T cell response (Fig. 5d,g,i and Extended Data Fig. 5h,n,p), suggesting that total mass of neoantigen-expressing tumor cells is the primary determinant of quality of the neoantigen-specific T cell response. Altogether, these results provide a compelling rationale for why immunosurveillance fails to delete tumor cells bearing subclonal neoantigens (Fig. 5j).

**Fig. 5: The quality of tumor-specific T cell response is tuned by neoantigen clonality.**

Clonal TNB predicts ICB response in human MMRd cancer

To explore the translational relevance of our findings, we reanalyzed sequencing data from two clinical trials of anti-PD-1 treatment in advanced MMRd colorectal cancer (CRC; Bortolomeazzi et al.⁵⁰) and gastric cancer (Kwon et al.³⁵), including 16 and 13 patients, respectively. Predicting neoantigens and the associated CCFs (Supplementary Table 3), we asked whether clonal TNB and ITH are associated with the response to anti-PD-1. Clonal (CCF ≥ 0.75), but not subclonal (CCF ≤ 0.5), TNB was significantly associated with objective response (OR), whereas high ITH index (subclonal to clonal neoantigen ratio) was significantly associated with nonresponse (NR) (Fig. 6a,b and Extended Data Fig. 6a–d). Total TNB was also significantly associated with OR (Extended Data Fig. 6e–g), probably because tumors in these studies had generally more clonal than subclonal neoantigens. Although clonal TNB was generally correlated with total TNB, two notable outliers, both nonresponders, had high total TNB but very low clonal TNB (Extended Data Fig. 6h).

**Fig. 6: Clonal neoantigen burden is predictive of ICB response in human MMRd cancer.**

Clonal, but not subclonal, TNB was also significantly associated with longer progression-free survival (PFS) in combined analysis of the trials, whereas the ITH index was associated with shorter PFS (Fig. 6c–e). Total TNB also correlated with longer PFS, but did not reach the same level of significance as clonal TNB (Extended Data Fig. 6i). Importantly, there was no significant difference in PFS between the two trials (Extended Data Fig. 6j), justifying their combination in these analyses. Overall, these results support the major conclusions from our mouse models and suggest that clonal TNB is a more accurate predictor of ICB response than overall TMB in MMRd cancers. Although the trials that we reanalyzed in the present study are limited by small sample size and will require validation in larger prospective investigations in MMRd cancers, which are currently lacking, our results align with a growing body of literature that supports the superior predictive power of clonal versus total TMB/TNB as a biomarker of ICB response across other cancers^{20,21,22,23,24,25,26}.

Discussion

In this Article, we report sporadic MMRd mouse models that recapitulate critical features of human lung and colon cancer, including genetics, histopathology and in situ initiation in the relevant tissue microenvironment. Unlike previous studies that demonstrated a role for MMRd and TMB in immunotherapy response, in which mutagenesis occurred in vitro^26,27,28, our models enable study of mutations continuously acquired in vivo from tumor initiation through advanced disease. Other studies showed immunotherapy efficacy in autochthonous cancer models employing tissue-specific knockout of Msh2 or activation of mutant Pole (Pole^P286R) during embryogenesis^51,52. These previous models recapitulate familial cancers like Lynch syndrome in the accumulation of mutations in normal parenchyma preceding transformation (<5% of colorectal cancers (CRCs)), but not sporadic loss of MMR in established tumors (10–15% of CRCs)^14,15. In contrast, our models, which more closely resemble sporadic MMRd, followed a model of neutral evolutionary dynamics⁴⁴ and did not display increased baseline immunogenicity or response to ICB, probably owing to timing of MMR inactivation and the resulting patterns of clonal expansion⁵³. We induced MMRd concomitantly with tumor initiation, resulting in mutation accumulation during exponential cellular expansion that is reminiscent of so-called ‘born to be bad’ colon tumors that follow an early explosive growth trajectory⁵⁴. It is interesting that it was recently shown that MMRd occurring either de novo or induced by temozolomide treatment in advanced glioma led to extensive ITH and poor ICB response³⁸.

Tumors in our models developed extensive ITH and a high burden of subclonal mutations that was not detected by bulk sequencing analysis, highlighting the importance of standardization of clinical pipelines to estimate TMB. Strategies to robustly assay ITH, such as multi-region or single-cell DNA-sequencing, may enhance the predictive utility of TMB. Overall, the results from our models strongly support a potential role of ITH in the failure of ICB in some patients with MMRd cancer. However, studies in the mouse cannot be generalized to humans and these results will require clinical validation. Our reanalysis of MMRd cancer clinical trials^35,50 showed significant association of clonal neoantigen burden and ITH index with ICB response and lends credibility to our models, but is limited by small sample size and not powered to differentiate any predictive value of subclonal neoantigen burden or ITH index beyond their association with clonal neoantigen burden. Larger prospective clinical studies will be required to definitively establish the role of ITH and its potential utility as a biomarker of ICB response in human MMRd cancers^5,6.

Similar to other mechanistic studies of ITH^19,26, we found that experimental reconstitution of ITH potentiated immune evasion. Recently, a preclinical study showed that genetic or pharmacological enrichment of MMRd in the context of mixed MMRd/MMR-proficient (MMRp) cell-line transplants potentiated rejection of the MMRp fraction⁵⁵. This must be interpreted with care, however, because both MMRd and MMRp fractions were derived from the same carcinogen (N-nitroso-N-methylurethane)-induced colon carcinoma line, CT26, and probably share a high burden of clonal neoantigens that may underlie rejection of the MMRp fraction. What distinguishes our study from these prior cell line-based studies is that mutagenesis occurred spontaneously entirely in vivo. That some of the subclones we isolated were highly immunogenic on re-transplantation at clonal, but not subclonal, fraction suggests that they were protected from deletion by high ITH in the original tumor. This probably occurs passively due to low cellularity precluding efficient crosspriming and driving early T cell dysfunction or ignorance^19,49. Indeed, our high-resolution analysis of neoantigen-specific T cells after ICB treatment showed that the magnitude and effector potential of the response are attenuated with decreasing neoantigen clonality. This is in agreement with an orthogonal study that used retroviral neoantigen libraries to manipulate clonal fraction¹⁹. However, although that study concluded that neoantigen clonal fraction, not total cellularity, is the major determinant of the response, our experiments suggest the converse. Additional studies are needed to resolve these and other complexities, including the interplay of ITH with T cell interclonal dynamics, where the immune response may be deployed against a limited subset of ‘dominant’ neoantigens⁵⁶, at the expense of recognition of lower affinity, poorly expressed or subclonal neoantigens.

Paradoxically, we found that immunosurveillance may exacerbate ITH by shaping the clonal architecture of tumors while failing to delete most neoantigens. Therefore, we conclude that ITH is shaped by the interplay of tumor intrinsic, positively selective and immunogenic, negatively selective evolutionary pressures. These results provide nuance to our understanding of immunoediting and highlight the power of our models to capture a hallmark of human cancer that is lacking in carcinogen-induced and genetically engineered models alike⁵⁷—the gradual accumulation of mutations over time³⁶.

Our results raise important questions related to therapies aimed at deliberately increasing TMB to enhance tumor immunogenicity^27,58. These strategies will probably fail to elicit meaningful immune engagement. More concerning, collateral mutagenesis may drive more aggressive cancer, therapy resistance or secondary malignancies. Future studies with models that enable temporal control of cooperating tumorigenic events will be helpful in determining the impact of cancer clonal selection on immunosurveillance and immunotherapy response.

Methods

Mice

All animal use was approved by the Department of Comparative Medicine at the Massachusetts Institute of Technology (MIT) and the Institutional Animal Care and Use Committee under protocol no. 0714-076-17. Mice were housed with a 12-h light/12-h dark cycle with temperatures in the range 20–22 °C and 30–70% humidity. Kras^LSL-G12D (ref. ⁵⁹); Trp53^flox/flox (ref. ⁶⁰); R26^LSL-Cas9 (ref. ⁶¹) (KP; R26^LSL-Cas9) mice were maintained on an F1 (C57BL/6 × 129/SvJ) background. Kras^LSL-G12D; Trp53^flox/flox; Msh2^flox/flox (ref. ³²) (JAX, stock no. 016231) and R26^Cas9 (ref. ⁶²) (JAX, stock no. 028555) mice were maintained on a pure C57BL/6 background. Lung cell lines were isolated from tumors induced in albino C57BL/6 hosts chimeric for tissue derived from blastocyst injection of a KP; R26^LSL-Cas9 embryonic stem (ES) cell line (12A2) of mixed C57BL/6 and 129/SvJ background and male sex, as previously described⁶¹. In orthotopic lung studies, cell lines were transplanted into male chimeras generated from the same 12A2 ES cell line at 10–16 weeks of age. These chimeras are tolerized to C57BL/6 and 129/SvJ tissues, potential antigens in the R26^LSL-Cas9 allele and PuroR introduced into cell lines with Msh2 re-expression (unrecombined Kras^LSL-G12D expresses PuroR). Autochthonous tumors in lung and colon were induced in approximately equal numbers of male and female mice at 8–16 weeks of age.

Tumor models

Tumor burden, where measurable, was not allowed to exceed 1 cm² and animals showing discomfort or distress were humanely euthanized following the recommendations of the American Veterinary Medical Association. Autochthonous lung tumors in Kras^LSL-G12D; Trp53^flox/flox; R26^LSL-Cas9 and Kras^LSL-G12D; Trp53^flox/flox; Msh2^flox/flox mice were induced by intratracheal instillation of 2 × 10⁴ transduction units (TU) of lentivirus and 2 × 10⁸ plaque-forming units of adenovirus-expressing Cre driven by the alveolar type II cell-specific surfactant protein C promoter (SPC-Cre), respectively, as previously described³¹. Autochthonous colon tumors in R26^Cas9 mice were induced by endoscope-guided submucosal injection in the distal colon, as previously described^33,63. Two injections at 1.5 × 10⁶ TU of lentivirus in 50 μl of Opti-MEM were delivered per mouse. Lentivirus was produced in HEK293 cells (American Type Culture Collection) and concentrated as previously described³¹, and functional titers (Cre activity, mScarlet fluorescence) measured as previously described⁶⁴. Cell lines were orthotopically transplanted by intratracheal instillation of 1 × 10⁵ cells in 50 μl of Spinner Modification of Minimal essential Eagle’s Medium (SMEM)/5 mM EDTA, followed by a 30-μl rinse with the same medium. Cell lines were established from autochthonous lung tumors by microdissection and mechanical mincing in digestion buffer (Hanks’ balanced salt solution with 1 M Hepes, 125 units ml⁻¹ of collagenase type IV (Worthington) and 20 μg ml⁻¹ of DNase (Sigma-Aldrich)), followed by incubation at 37 °C with gentle agitation for 30 min and plating in RPMI + 10% fetal bovine serum (FBS). Lines were plated into 50:50 RPMI/Dulbecco’s modified Eagle’s medium (DMEM) + 10% FBS at first passage and DMEM + 10% FBS at second passage and thereafter. Cells were taken for WES at the third passage. Msh2-expressing lentivirus was produced as above. Cells were incubated with lentiviral supernatant and 3 d later selected and maintained thereafter on medium with 6 μg ml⁻¹ of puromycin (Thermo Fisher Scientific).

WES and mutation calling

Whole-exome libraries were generated using the SureSelect XT Mouse All Exon (Agilent) target enrichment kit; 100-bp paired-end sequencing of samples was performed on the Illumina HiSeq 4000 platform, with the exception of M1–8 passage of 20 single-cell clones, which were 150-bp paired-end sequenced on the Illumina NovaSeq 6000 S4 platform. Library preparation and sequencing to 100× on-target coverage were performed by Psomagen. Raw sequencing reads were mapped to the GRCm38 build of the mouse reference genome using BWA-MEM v.0.7.17-r1188 (ref. ⁶⁵). Aligned reads in BAM format were processed following the Genome Analysis Toolkit (GATK) v.4.1.8.0 Best Practices workflow to remove duplicates and recalibrate base quality scores⁶⁶. Median coverage was 98 (25th quartile = 86; 75th quartile = 112), 90 (25th quartile = 82; 75th quartile = 108) and 103 (25th quartile = 100; 75th quartile = 107) for normal tails, autochthonous tumors and cell lines, respectively.

Somatic SNVs and indels were detected using Mutect2, MuSE v.1.0rc⁶⁷, VarDict v.1.8.2 (ref. ⁶⁸) and Strelka2 v.2.9.2 (ref. ⁶⁹) against matched normal tails. Mutect2 was run using a panel of normals compiled from the 18 tails analyzed in the present study. Each caller was run independently on each tumor-normal pair and calls were integrated using SomaticCombiner v.1.03 (ref. ⁷⁰). For colon tumors, a panel of four normal tails was used to generate the matched normal control for all samples, because these mice were of pure background. We considered SNV mapping to only exonic regions that were detected by Mutect2 and at least one of the other algorithms. To increase accuracy of indel detection, only indels detected by at least two algorithms were considered. Variants mapping to dbSNP (build ID 150) positions were discarded. Mutations identified in tumors from two or more animals or in at least 50% of tumors from the same animal were discarded. No VAF filter was applied. Microsatellites were annotated using SciRoKo v.3.4 (ref. ⁷¹), with minimum score = 8, seed length = 8, repeats = 2 and mismatch penalty = 1.

CCF estimation

Somatic copy-number aberrations were detected by integrating output of GATK and FreeBayes v.1.3 (ref. ⁷²) using PureCN v.1.16.0 (ref. ⁷³). Briefly, GATK4 Somatic CNV workflow was utilized for normalization of read counts and genome segmentation using the panel of normals from all tails. FreeBayes was used to obtain B-allele frequencies for dbSNP variant sites. PureCN was used to integrate output from GATK and FreeBayes to estimate allele-specific consensus copy-number profile, purity and ploidy of each sample. The ploidy of cell lines was determined experimentally by metaphase spreads and input into PureCN. Finally, the CCF for each SNV and indel was computed using the R (v.4.0.2) package cDriver⁴³.

Mutational signature and MSI analyses

Mutational signatures were extracted with the R (v.4.0.2) package MutationalPatterns⁷⁴ (v.3.2.0) using the COSMIC Mutational Signatures catalog v.3 (ref. ⁷⁵). We used the function fit_to_signatures with default parameters and included only those mutational processes known to operate in human colon and/or lung cancer (excluding tobacco smoking)⁷⁵: SBS1, SBS5, SBS6, SBS10a, SBS10b, SBS14, SBS15, SBS17a, SBS17b, SBS18, SBS21, SBS26, SBS28, SBS37, SBS40 and SBS44. For visualization, we collapsed signatures of MMRd (SBS6, SBS14, SBS15, SBS21, SBS26 and SBS44; labeled as MMRd) and POLE deficiency (SBS10a, SBS10b, SBS28 and SBS17b; labeled as POLE). Goodness of fit was determined by computing cosine similarity between observed and reconstructed mutational spectra using estimated signature contributions. To estimate the contribution of each mutational process to the human MSI CRC mutational spectrum, we analyzed somatic mutations in the Kwon et al. cohort (non-formalin-fixed paraffin-embedded samples)³⁵ following the methodology described above. Sample MSI score was calculated using MSIsensor-pro (v.1.2.0)³⁴ against matched normal tails.

Clonal deconvolution by targeted amplicon sequencing

To identify private somatic SNVs for distinguishing individual clones in the M1–8 mixed clone tumors (Fig. 4c and Extended Data Fig. 4g,h), we compiled all clonal SNVs in copy-number-neutral regions (four copies, as all lines were tetraploid by metaphase spreads). We then checked the BAM files across all other samples for the complete absence of reads supporting the alternative allele (base quality >20, mapping quality >30) using an in-house Python script relying on the Pysam library. Four private SNVs for each clone and four common SNVs were validated by PCR amplification and Sanger sequencing before proceeding. The 200- to 250-bp regions spanning these SNVs were either individually PCR amplified from samples, gel purified and combined, or amplified in parallel using a multiplexed PCR panel with primers carrying unique molecular indices (CleanPlex UMI Custom Panel, Paragon Genomics). Amplicon libraries were sequenced on the Illumina NovaSeq 6000 S4 platform with150-bp paired-end chemistry.

Reads were aligned to a fasta reference file of all targets (±250 nt upstream/downstream of SNV, GRCm38) using BWA-MEM (v.0.7.17-r1188)⁶⁵, following the GATK Best Practices workflow. Pileups were generated using the mpileup function of bcftools v.1.10.2 (ref. ⁷⁶) with --min-BQ 30 and a bed file of all SNV coordinates. For CleanPlex UMI libraries, the following functions in fgbio (v.2.0.1) (https://github.com/fulcrumgenomics/fgbio) were called to extract unique molecular identifiers (UMIs) and call consensus reads: ExtractUmisFromBam, GroupReadsByUmi and CallMolecularConsensusReads. Using a customized R (v.4.0.2) script, total and SNV-specific depths at all locations were extracted. All SNVs were supported by more reads than other alternative alleles in the M1–8 clone-equal mixture control, except for M6_2, which was excluded from subsequent analysis. Background PCR/sequencing error for each SNV was estimated using the median observed frequencies of SNVs in all metastases of different clones, which represented truly clonal controls. SNV frequencies were adjusted by subtracting background values. Clonal percentages in ex vivo tumors were estimated by taking the median of private SNV frequencies, multiplying by 4 (SNVs are 1/4n) and dividing by tumor purity—estimated as the median observed/expected ratio of frequencies of the four common SNVs (present in all clones).

Neoantigen prediction and expression

Variant consequence was annotated using Ensembl Variant Effect Predictor (VEP) v.99 (ref. ⁷⁷) with Wildtype and Downstream plugins, the VEP cache and reference genome for GRCm38, and the following parameters: --symbol, --terms=SO, --cache, --offline, --transcript_version and --pick. The --pick parameter was reordered from default to report the transcript with most extreme consequence for each variant: rank, canonical, appris, tsl, biotype, ccds, length and mane. Neoepitopes were predicted with C57BL/6 mouse MHC-I alleles, H2-K1 (H-2K^b) and H2-D1 (H-2D^b) and variant effect predictions using pVACtools v.1.5.7 (ref. ⁷⁸). Mutant peptides were generated for peptides that were 8–11 amino acids and MHC:peptide binding affinity was predicted for all peptide:MHC allele pairs with NetMHC-4.0, NetMHCpan-4.0, SMM v.1.0 and SMMPMBEC v.1.0 (refs. ^79,80,81,82). The median value across all affinity predictions was taken as the final measure of binding affinity. Neoantigens were subset to those with median predicted H-2K^b/H-2D^b affinity ≤500 nM. Where multiple neoantigens were predicted for the same SNV, only that with highest predicted affinity was retained.

To assess allele-specific expression of neoantigens, RNA-sequencing (RNA-seq) was performed on autochthonous lung tumors (10 sgMsh2 and 10 sgMsh2 with αCD4/8 treatment) and M1–8 clones. Complementary DNA libraries were prepared using Kapa mRNA Hyperprep and 150-bp paired-end sequencing was performed on the Illumina NextSeq platform. Reads were aligned to the reference genome (GRCm38) using STAR v.2.7.1a⁸³ with outFilterMultimapNmax = 20, alignSJoverhangMin = 8, alignSJDBoverhangMin = 1, outFilterMismatchNmax = 999, outFilterMismatchNoverLmax = 0.1, alignIntronMin = 20, alignIntronMax = 1,000,000, alignMatesGapMax = 1,000,000, outFilterScoreMinOverLread = 0.33, outFilterMatchNminOverLread = 0.33 and limitSjdbInsertNsj = 1,200,000. PCR duplicates were removed using Picard v.2.23.4 (ref. ⁸⁴). Considering somatic variants identified by WES, we used a customized Python script to interrogate the presence of these variants in the RNA-seq BAM files. Only nonduplicate reads with mapping quality ≥255 and bases with base quality ≥20 were considered to compute VAFs.

Histology and immunohistochemistry

Quantification of lung tumor burden by grade was performed on scans of hematoxylin and eosin (H&E)-stained sections by automated convolutional neural network (CNN)—developed in collaboration with Aiforia Technologies Oy in consultation with veterinarian pathologist R. Bronson. Using semantic multi-class segmentation, the CNN was trained to classify lung parenchyma and adenocarcinoma grades 1–4. For supervised training, selected areas from 93 slides were chosen. The algorithm performed consistently and with high correlation with human graders across multiple validation datasets independent of the training dataset. Algorithm v.NSCLC_v25 was used. Triple staining (CD8a, CD4 and FOXP3) immunohistochemistry (IHC) and CNN quantification (Aiforia) were performed as previously described⁴⁹. CD3 infiltration in single-stain slide scans was measured as percentage of pixels positive for stain (diaminobenzidine) in Aperio ImageScope. The area of positive and negative MSH2 staining was quantified by manual annotation in QuPath v.0.1.2 (ref. ⁸⁵).

Western blotting

Cells were lysed in radioimmunoprecipitation (RIPA) buffer (Thermo Fisher Scientific), protein concentration determined using BCA Protein Assay (Thermo Fisher Scientific) and equal protein quantities (20–40 μg) run on NuPage 4–12% Bis–Tris gradient gels (Thermo Fisher Scientific) by sodium dodecylsulfate–polyacrylamide gel electrophoresis and transferred to poly(vinylidene fluoride) membranes. Western blotting was performed against MSH2 (catalog no. D24B5, Cell Signaling Technology) at 1:1,000, MLH1 (catalog no. ab92312, Abcam) at 1:1,000, glyceraldehyde 3-phosphate dehydrogenase (catalog no. 6C5, Santa Cruz) at 1:5,000 and β-actin (catalog no. 13E5, Cell Signaling Technology) at 1:5,000. Blots were stained with horseradish peroxidase (HRP) anti-rabbit immunoglobulin G and developed with Western Lightning Plus-ECL (Perkin Elmer) on X-ray film.

In vivo antibody and chemotherapy dosing

Antibodies were delivered intraperitoneally in 100 μl of phosphate-buffered saline (PBS). αCD4 (catalog no. GK1.5, BioXCell) and αCD8 (catalog no. 2.43, BioXCell) were administered at 200 μg every 4 d. αPD-1 (catalog no. 29F.1A12, BioXCell) was administered at 200 μg 3× a week. αCTLA (catalog no. 9H10, BioXCell) was administered at an initial dose of 200 μg, with subsequent doses at 100 μg, 3× a week. Oxaliplatin (Sigma-Aldrich) and cyclophosphamide (Sigma-Aldrich) (Oxa/Cyc) were co-delivered intraperitoneally in 100 μl of PBS at 2.5 mg per kg body weight and 50 mg per kg body weight, respectively, once a week for 3 weeks, as previously described⁴².

In vivo tumor imaging and quantification

Lung tumor progression was monitored longitudinally by X-ray microcomputed tomography (μCT) using a GE eXplore CT 120 system, as previously described⁸⁶. Solid lung volume (tumor burden) was quantified using a customized MATLAB (MathWorks) script, as previously described⁸⁶. Colon tumor progression was monitored longitudinally using a Karl Storz colonoscopy system with white light and red fluorescent protein fluorescence and biopsy forceps serving as a landmark for objective positioning, as previously described⁴⁹.

Lentiviral constructs

The U6::sgRNA-EFS::Cre (pUSEC) lentiviral construct⁸⁶ was digested with BsmBI and sgRNAs cloned as previously described⁸⁷. H1::sgApc-U6::sgRNA-EFS::mScarlet was generated by Gibson assembly using an H1::sgApc-scaffold gBlock synthetic gene fragment (IDT), PCR amplicons of U6::BsmBI-filler-BsmBI-scaffold, elongation factor-1 (EFS) promoter and mScarlet⁸⁸, and a lentiviral backbone from the Trono laboratory (Addgene). This was then digested with BsmBI and a second sgRNA cloned as above. The sgRNA sequences, including previously published sgApc³³ and sgCtl (mScarlet targeting)⁶⁴ are provided in Supplementary Table 4. The sgRNA controls against Olfr102 and mScarlet were used interchangeably with no observed differences in tumorigenesis. PGK::Msh2-EFS::PuroR was generated by Gibson assembly using multiple gBlocks spanning murine Msh2 (C57BL/6), PCR amplicons of PGK (3-phosphoglycerate kinase) promoter, EF-1 Alpha Short (EFS) promoter and PuroR, and the Trono lentiviral backbone. All primers were ordered from Sigma-Aldrich.

Validation of CRISPR–Cas9 editing and estimation of tumor purity

To validate efficiency of gene editing by clustered regularly interspaced short palindromic repeats (CRISPR)–Cas9, 200- to 250-bp regions spanning sgRNA sites in the genome were amplified and deep sequenced (Massachusetts General Hospital DNA Core). Colon tumor purity was estimated using a non-wild-type allele fraction at the sgRNA-targeted site in Apc. Loss of Apc is prerequisite for tumorigenesis in the model and thus an assumption was made that all tumor cells harbor loss-of-function edits at this locus. Tumor purity in sgMsh2-targeted lung tumors was estimated using WES BAM coverage spanning exons of the Trp53^flox allele and flanking genes (Wrap53, Atp1b2), which were retrieved using the bedcov function of SAMtools v.1.10. The ratio of median coverage in flanking exons (Wrap53 exons 0–9, Trp53 exon 11 and Atp1b2 exons 0–6) versus Trp53 exons flanked by Cre loxP sites (exons 2–10) was calculated in tumors and normal tails. This ratio in tumors, representing the extent of Trp53^flox recombination, was then normalized to the median ratio across matched normal tails to estimate tumor purity, with the assumption that all tumor, but not normal, cells underwent complete recombination of Trp53^flox alleles. Efficiency of Msh2 knockout in KP; Msh2^flox/flox lung tumors was similarly estimated by taking the ratio of reads at the exon flanked by loxP sites (exon 12) and surrounding exons, and adjusting this by tumor purity.

In vitro cell-line assays

Serial live cell imaging of cell lines grown in 96-well plates (Corning) and quantification of confluence were performed with an IncuCyte S3 (Sartorius). Eight replicate wells were seeded with 100 cells and imaged every 3 h for ~6 d. Murine IFN-γ (PeproTech) was used for in vitro stimulation of cell lines for 24 h, followed by live/dead staining (ghost ef780 (Corning), 1:500) in PBS and surface staining in 1 mM EDTA, 25 mM Hepes, 0.5% heat-inactivated FBS in PBS with anti-H-2K^b allophycocyanin (APC) (catalog no. AF6-88.5.5.3, Thermo Fisher Scientific, 1:200), anti-H-2D^b FITC (catalog no. 28-14-8, Thermo Fisher Scientific, 1:200) and anti-PD-L1 phycoerythrin (PE)-Cy7 (catalog no. 10F.9G2, BioLegend, 1:200). Samples were run on a BD LSRFortessa using BD FACSDiva v.8.0 software. Results were analyzed in FlowJo v.10.4.2, excluding dead (ef780-positive) cells.

Phylogenetic tree analysis

All somatic SNVs and indels called by the WES analysis pipeline in M1–8 clones and the 09-2 parental cell line were considered in constructing a phylogenetic tree. The R (v.4.0.2) Bioconductor package phangorn (v.2.7.0) was used to construct a tree from a binary presence/absence matrix of mutations across clones and 09-2_par. Specifically, the function prachet was used to calculate the tree using the parsimonious ratchet method and the function acctran was used to calculate branch lengths.

MHC-I immunopeptidomics

MHC-I (H-2K^b and H-2D^b) peptide isolation was performed on 10⁸ cells per triplicate for each M1–8 clone as we have previously described⁴⁹. Cells were grown to confluence before stimulation with 10 ng ml⁻¹ of murine IFN-γ (PeproTech) for 18 h before collection. Pulldowns were performed with 40 μl (bed volume) of rProtein A Sepharose beads (GE Healthcare) preloaded with 1 mg of anti-H-2K^b antibody (Y3, BioXCell) and 1 mg of anti-H-2D^b antibody (catalog no. 28-14-8S, purified from HB-27 hybridoma), performed sequentially. Peptides were eluted in 500 μl of 10% acetic acid and purified with 10-kDa MWCO spin filters (PALL Life Science).

MS–MS was performed on eluted peptides as we have previously described⁴⁹. Tandem mass spectra were searched with Sequest (Thermo Fisher Scientific, v.IseNode in Proteome Discoverer 2.5.0.400). Sequest was set to search the mouse Uniprot database (3 July 2020 version) with 55,650 entries, including common contaminants and green fluorescent protein, Cas9, puromycin and P2A (present in the cell lines) assuming no digestion enzyme, with fragment and parent ion mass tolerances of 0.02 Da and 10.0 p.p.m., respectively. TMTpro was added as a fixed modification on the carboxy and amino termini of peptides. Oxidation of methionine was specified in Sequest as a variable modification. The resulting peptides were filtered to exclude peptides with an isolation interference >30% and p.p.m. error >±3 of the median p.p.m. error of all peptide-spectrum matches. Peptides were further prioritized based on concordance of relative abundance across clones with presence/absence of the associated somatic mutation.

Dendritic cell vaccination, ELISpot and MHC-I multimer staining

Bone marrow-derived dendritic cells were prepared, activated, loaded with putative neoepitopes and injected intradermally at the base of the tail of healthy C57BL/6 mice, followed by two heterologous boosts, as previously described⁴⁵. A week after the second boost, spleen and peripheral blood were collected for IFN-γ ELISpot and MHC-I:epitope tetramer flow cytometric assays. Red blood cells were first lysed with ACK lysis buffer. IFN-γ ELISpot was performed following the manufacturer’s recommendations (ImmunoSpot, Cellular Technology Limited) using 750,000 splenocytes per well. H-2K^b tetramers were custom generated as previously described⁴⁵ and used at 1:200 dilution. H-2D^b tetramers were generated using UV-labile monomers (UVX Flex-T, BioLegend) following the manufacturer’s recommendations and used at 1:50 dilution. H-2K^b:QAYAFLQHL dextramers were generated using the U-Load Dextramer Kit (Immudex) following the manufacturer’s recommendations and used at 1:10 dilution.

Tissue preparation and flow cytometry

Then 2 min before euthanasia, mice were injected retro-orbitally with anti-CD45 APC-eFluor786 (catalog no. 30-F11, BioLegend, 1:50) to stain intravascular immune cells. Mediastinal lymph nodes and whole tumor-bearing lungs were collected and mechanically dissociated in RPMI-1640 (Corning) with 5% heat-inactivated (HI)-FBS (collection medium). Lungs were placed in digestion buffer containing 500 U ml⁻¹ of collagenase type IV and 20 μg ml⁻¹ of DNase (Sigma-Aldrich) in collection medium, lightly minced and digested at 37 °C for 30 min with gentle agitation, and further dissociated with a gentleMACS Octo Dissociator (Miltenyi Biotec) on the tumor_imp1.1 setting and passed through a 100-μm filter. Live/dead staining (Ghost Dye Red 780, Corning, 1:500 dilution) was performed in PBS and surface stains in FACS buffer (1 mM EDTA, 25 mM Hepes and 0.5% HI-FBS in PBS). For assessment of T cell depletion (Extended Data Fig. 2m), the following antibodies were used for surface staining: CD45 BV785 (catalog no. 30-F11, BioLegend, 1:200), CD3 BV421 (catalog no. 17A2, BioLegend, 1:400), CD8a BUV395 (catalog no. 53-6.7, BioLegend, 1:400) and CD4 AF647 (catalog no. RM4-5, BioLegend, 1:400). For analysis of neoantigen-specific T cells (Extended Data Fig. 5e), the following antibodies were used for surface staining: CD8a BUV395 (as above), CD4 BV711 (catalog no. RM4-5, BioLegend, 1:200), CD44 BV785 (catalog no. IM7, BioLegend, 1:200) and GZMB PE-CF594 (catalog no. GB11, BD Biosciences, 1:250); and intracellular staining: TCF1 AF647 (catalog no. C63D9, CST, 1:200). Cells were fixed for 1 h at room temperature in Fixation/Permeabilization Concentrate (Thermo Fisher Scientific) diluted 1:3 in Fixation/Permeabilization diluent (Thermo Fisher Scientific) and washed in permeabilization buffer (Thermo Fisher Scientific). Intracellular staining was performed in permeabilization buffer overnight at 4 °C. Cells were washed and resuspended in FACS buffer for analysis on a BD LSRFortessa four-laser flow cytometer running BD FACSDiva v.8.0 software. Results were analyzed in FlowJo v.10.4.2. Single lymphocytes were gated first on forward versus side scatter (FSC-A versus SSC-A) and then FSC-A versus FSC-H. Then, live CD8⁺ T cells were gated on positive CD8α and negative Ghost Red Dye 780 staining. Antigen-specific CD8⁺ T cells were further gated on CD44 positivity and tetramer/dextramer positivity in two channels (PE/APC). Expression of additional markers was analyzed specifically in this neoantigen-specific CD8⁺ T cell population.

Analysis of human MMRd cancer clinical trials

Raw WES reads from Bortolomeazzi et al.⁵⁰ and Kwon et al.³⁵ trials (ClinicalTrial.gov identifiers: NCT02563002 and NCT02589496) were mapped to the reference human genome (GRCh38) using BWA-MEM v.0.7.17-r1188 (ref. ⁶⁵). Aligned reads were processed as BAMs following the GATK v.4.1.8.0 Best Practices workflow to remove duplicates and recalibrate base quality scores⁶⁶^. Somatic SNVs and indels were detected using the same pipeline and callers described above for mouse tumors. CCF values were estimated as described above.

Raw RNA-seq reads were mapped to the human reference genome (GRCh38) using STAR v.2.7.1a⁸³. STAR was run using the same parameters as described above in the mouse analysis. The function Htseq-count from the Python library HTSeq (v.2.0.1)⁸⁹ was used to compute read counts for each gene (Ensembl release GRCh38.90), which were normalized to transcripts per million. Neoantigens were predicted and prioritized as described above in the mouse analysis. Clonal and subclonal neoantigens were classified as CCF ≥ 0.75 and <0.5, respectively.

Clinical responses were binned into two groups: OR, including partial and complete responders and NR, including patients with stable and progressive disease. PFS analysis was performed on a combination of both trials^35,50 with the trial study as a covariate. Importantly, there was no significant difference in PFS between patients from the two trials. Cox’s regression was performed in R (v.4.0.2) using the package survival (v.3.4-0)⁹⁰ with comparisons of patients in the upper versus lower quartiles of each variable tested.

Statistics and reproducibility

Statistical analyses and plotting were performed in R (v.4.2.1) using built-in functions and ggplot2 (v.3.4.1), beeswarm (v.0.4.0), corrplot (v.0.88), eulerr (v.6.1.0), gplots (v.3.1.3), survival (v.3.4.0), survminer (v.0.4.9) and RColorBrewer (v.1.1.3). To assess statistical significance, Fisher’s exact 2 × 2 test was used on categorical variables and two-tailed Wilcoxon’s rank-sum test or Student’s t-test (where the assumption of normality was met) was used on continuous variables. HRs were calculated and compared using Cox’s proportional hazards regression. Multiple-comparison corrections were performed using Holm’s method. No statistical method was used to predetermine sample size. In preclinical trials of lung and colon models, only those animals with apparent tumors by μCT or colonoscopy were recruited. No other data were excluded from analyses. Preclinical trials were randomized and investigators blinded to allocation during dosing, imaging and quantification. No experiments failed to replicate.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Raw exome sequencing and RNA-seq data from Bortolomeazzi et al.⁵⁰ are available through controlled-access application via the European Genome-Phenome Archive (hosted by the EMBL-EBI and the CRG) under accession no. EGAD00001006165. Raw sequencing data from Kwon et al.³⁵ were downloaded from the European Nucleotide Archive (ENA) database under primary accession no. PRJEB40416. The sequencing data generated in the present study are available at ENA under primary accession no. PRJEB56609. Raw MS data generated in the present study are available at MassIVE under accession no. MSV000092096. Source data are provided with this paper.

Code availability

Customized scripts used in the analysis of sequencing data in the present study are available at: https://github.com/cortes-ciriano-lab/MMRd_immunogenicity.

References

Larkin, J. et al. Five-year survival with combined nivolumab and ipilimumab in advanced melanoma. N. Engl. J. Med. 381, 1535–1546 (2019).
CAS PubMed Google Scholar
Le, D. T. et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science 357, 409–413 (2017).
CAS PubMed PubMed Central Google Scholar
Fradet, Y. et al. Randomized phase III KEYNOTE-045 trial of pembrolizumab versus paclitaxel, docetaxel, or vinflunine in recurrent advanced urothelial cancer: results of >2 years of follow-up. Ann. Oncol. 30, 970–976 (2019).
CAS PubMed PubMed Central Google Scholar
Reck, M. et al. Pembrolizumab versus chemotherapy for PD-L1–positive non–small-cell lung cancer. N. Engl. J. Med. 375, 1823–1833 (2016).
CAS PubMed Google Scholar
Diaz, L. A. et al. Pembrolizumab versus chemotherapy for microsatellite instability-high or mismatch repair-deficient metastatic colorectal cancer (KEYNOTE-177): final analysis of a randomised, open-label, phase 3 study. Lancet Oncol. 23, 659–670 (2022).
CAS PubMed PubMed Central Google Scholar
Matsushita, H. et al. Cancer exome analysis reveals a T-cell-dependent mechanism of cancer immunoediting. Nature 482, 400–404 (2012).
CAS PubMed PubMed Central Google Scholar
Blass, E. & Ott, P. A. Advances in the development of personalized neoantigen-based therapeutic cancer vaccines. Nat. Rev. Clin. Oncol. 18, 215–229 (2021).
Samstein, R. M. et al. Tumor mutational load predicts survival after immunotherapy across multiple cancer types. Nat. Genet. 51, 202–206 (2019).
CAS PubMed PubMed Central Google Scholar
Marabelle, A. et al. Association of tumour mutational burden with outcomes in patients with advanced solid tumours treated with pembrolizumab: prospective biomarker analysis of the multicohort, open-label, phase 2 KEYNOTE-158 study. Lancet Oncol. 21, 1353–1365 (2020).
CAS PubMed Google Scholar
Hellmann, M. D. et al. Tumor mutational burden and efficacy of nivolumab monotherapy and in combination with ipilimumab in small-cell lung cancer. Cancer Cell 33, 853–861.e4 (2018).
CAS PubMed PubMed Central Google Scholar
The Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).
Kim, T.-M., Laird, P. W. & Park, P. J. The landscape of microsatellite instability in colorectal and endometrial cancer genomes. Cell 155, 858–868 (2013).
CAS PubMed Google Scholar
Chalmers, Z. R. et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 9, 34 (2017).
PubMed PubMed Central Google Scholar
Cortes-Ciriano, I., Lee, S., Park, W.-Y., Kim, T.-M. & Park, P. J. A molecular portrait of microsatellite instability across multiple cancers. Nat. Commun. 8, 59 (2017).
Google Scholar
Hause, R. J., Pritchard, C. C., Shendure, J. & Salipante, S. J. Classification and characterization of microsatellite instability across 18 cancer types. Nat. Med. 22, 1342–1350 (2016).
CAS PubMed Google Scholar
Gurjao, C., Tsukrov, D., Imakaev, M., Luquette, L. J. & Mirny, L. A. Limited evidence of tumour mutational burden as a biomarker of response to immunotherapy. Preprint at bioRxiv https://doi.org/10.1101/2020.09.03.260265 (2020).
Rousseau, B. et al. The spectrum of benefit from checkpoint blockade in hypermutated tumors. N. Engl. J. Med. 384, 1168–1170 (2021).
PubMed PubMed Central Google Scholar
McGranahan, N. & Swanton, C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613–628 (2017).
Gejman, R. S. et al. Rejection of immunogenic tumor clones is limited by clonal fraction. eLife 7, e41090 (2018).
PubMed PubMed Central Google Scholar
Morris, L. G. T. et al. Pan-cancer analysis of intratumor heterogeneity as a prognostic determinant of survival. Oncotarget 7, 10051–10063 (2016).
PubMed PubMed Central Google Scholar
Lin, Z. et al. Intratumor heterogeneity correlates with reduced immune activity and worse survival in melanoma patients. Front. Oncol. 10, 596493 (2020).
PubMed PubMed Central Google Scholar
Andor, N. et al. Pan-cancer analysis of the extent and consequences of intratumor heterogeneity. Nat. Med. 22, 105–113 (2016).
CAS PubMed Google Scholar
Miao, D. et al. Genomic correlates of response to immune checkpoint blockade in microsatellite-stable solid tumors. Nat. Genet. 50, 1271–1281 (2018).
CAS PubMed PubMed Central Google Scholar
McGranahan, N. et al. Clonal neoantigens elicit T cell immunoreactivity and sensitivity to immune checkpoint blockade. Science 351, 1463–1469 (2016).
CAS PubMed PubMed Central Google Scholar
Litchfield, K. et al. Meta-analysis of tumor- and T cell-intrinsic mechanisms of sensitization to checkpoint inhibition. Cell 184, 596–614.e14 (2021).
CAS PubMed PubMed Central Google Scholar
Wolf, Y. et al. UVB-induced tumor heterogeneity diminishes immune response in melanoma. Cell 179, 219–235.e21 (2019).
CAS PubMed PubMed Central Google Scholar
Germano, G. et al. Inactivation of DNA repair triggers neoantigen generation and impairs tumour growth. Nature 552, 116–120 (2017).
CAS PubMed Google Scholar
Mandal, R. et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science 364, 485–491 (2019).
CAS PubMed PubMed Central Google Scholar
Shankaran, V. et al. IFNgamma and lymphocytes prevent primary tumour development and shape tumour immunogenicity. Nature 410, 1107–1111 (2001).
CAS PubMed Google Scholar
DuPage, M., Mazumdar, C., Schmidt, L. M., Cheung, A. F. & Jacks, T. Expression of tumour-specific antigens underlies cancer immunoediting. Nature 482, 405–409 (2012).
CAS PubMed PubMed Central Google Scholar
DuPage, M., Dooley, A. L. & Jacks, T. Conditional mouse lung cancer models using adenoviral or lentiviral delivery of Cre recombinase. Nat. Protoc. 4, 1064–1072 (2009).
CAS PubMed PubMed Central Google Scholar
Kucherlapati, M. H. et al. An Msh2 conditional knockout mouse for studying intestinal cancer and testing anticancer agents. Gastroenterology 138, 993–1002.e1 (2010).
PubMed Google Scholar
Roper, J. et al. In vivo genome editing and organoid transplantation models of colorectal cancer and metastasis. Nat. Biotechnol. 35, 569–576 (2017).
CAS PubMed PubMed Central Google Scholar
Jia, P. et al. MSIsensor-pro: fast, accurate, and matched-normal-sample-free detection of microsatellite instability. Genom. Proteom. Bioinform. 18, 65–71 (2020).
Google Scholar
Kwon, M. et al. Determinants of response and intrinsic resistance to pd-1 blockade in microsatellite instability–high gastric cancer. Cancer Discov. 11, 2168–2185 (2021).
CAS PubMed Google Scholar
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
CAS PubMed PubMed Central Google Scholar
Tate, J. G. et al. COSMIC: the catalogue of somatic mutations In cancer. Nucleic Acids Res. 47, D941–D947 (2019).
CAS PubMed Google Scholar
Touat, M. et al. Mechanisms and therapeutic implications of hypermutation in gliomas. Nature 580, 517–523 (2020).
CAS PubMed PubMed Central Google Scholar
Leach, D. R., Krummel, M. F. & Allison, J. P. Enhancement of antitumor immunity by CTLA-4 blockade. Science 271, 1734–1736 (1996).
CAS PubMed Google Scholar
Iwai, Y. et al. Involvement of PD-L1 on tumor cells in the escape from host immune system and tumor immunotherapy by PD-L1 blockade. Proc. Natl Acad. Sci. USA 99, 12293–12297 (2002).
CAS PubMed PubMed Central Google Scholar
DuPage, M. et al. Endogenous T cell responses to antigens expressed in lung adenocarcinomas delay malignant tumor progression. Cancer Cell 19, 72–85 (2011).
CAS PubMed PubMed Central Google Scholar
Pfirschke, C. et al. Immunogenic chemotherapy sensitizes tumors to checkpoint blockade therapy. Immunity 44, 343–354 (2016).
CAS PubMed PubMed Central Google Scholar
Zapata, L. et al. Signatures of positive selection reveal a universal role of chromatin modifiers as cancer driver genes. Sci. Rep. 7, 13124 (2017).
PubMed PubMed Central Google Scholar
Williams, M. J., Werner, B., Barnes, C. P., Graham, T. A. & Sottoriva, A. Identification of neutral tumor evolution across cancer types. Nat. Genet. 48, 238–244 (2016).
CAS PubMed PubMed Central Google Scholar
Jaeger, A. M. et al. Deciphering the immunopeptidome in vivo reveals new tumour antigens. Nature 607, 149–155 (2022).
CAS PubMed PubMed Central Google Scholar
Hogquist, K. A. et al. T cell receptor antagonist peptides induce positive selection. Cell 76, 17–27 (1994).
CAS PubMed Google Scholar
Ghosh, M. et al. Guidance document: Validation of a high-performance liquid chromatography-tandem mass spectrometry immunopeptidomics assay for the identification of HLA class I ligands suitable for pharmaceutical therapies. Mol. Cell. Proteom. 19, 432–443 (2020).
Lin, W.-H. W. et al. CD8+ T lymphocyte self-renewal during effector cell determination. Cell Rep. 17, 1773–1782 (2016).
CAS PubMed PubMed Central Google Scholar
Westcott, P. M. K. et al. Low neoantigen expression and poor T-cell priming underlie early immune escape in colorectal cancer. Nat. Cancer 2, 1071–1085 (2021).
CAS PubMed PubMed Central Google Scholar
Bortolomeazzi, M. et al. Immunogenomics of colorectal cancer response to checkpoint blockade: analysis of the KEYNOTE 177 trial and validation cohorts. Gastroenterology 161, 1179–1193 (2021).
CAS PubMed Google Scholar
Gebert, J. et al. Recurrent frameshift neoantigen vaccine elicits protective immunity with reduced tumor burden and improved overall survival in a lynch syndrome mouse model. Gastroenterology https://doi.org/10.1053/J.GASTRO.2021.06.073 (2021).
Article PubMed Google Scholar
Li, H.-D. et al. A PoleP286R mouse model of endometrial cancer recapitulates high mutational burden and immunotherapy response. JCI Insight 5, e138829 (2020).
PubMed PubMed Central Google Scholar
Lee, B. C. H. et al. Mutational landscape of normal epithelial cells in Lynch syndrome patients. Nat. Commun. 13, 2710 (2022).
CAS PubMed PubMed Central Google Scholar
Sottoriva, A. et al. A big bang model of human colorectal tumor growth. Nat. Genet 47, 209–216 (2015).
CAS PubMed PubMed Central Google Scholar
Amodio, V. et al. Genetic and pharmacological modulation of DNA mismatch repair heterogeneous tumors promotes immune surveillance. Cancer Cell 41, 196–209.e5 (2023).
CAS PubMed PubMed Central Google Scholar
Burger, M. L. et al. Antigen dominance hierarchies shape TCF1+ progenitor CD8 T cell phenotypes in tumors. Cell 184, 4996–5014.e26 (2021).
CAS PubMed PubMed Central Google Scholar
Westcott, P. M. K. et al. The mutational landscapes of genetic and chemical models of Kras-driven lung cancer. Nature 517, 489–492 (2015).
CAS PubMed Google Scholar
Helleday, T. Making immunotherapy ‘cold’ tumours ‘hot’ by chemotherapy-induced mutations—a misconception. Ann. Oncol. 30, 360–361 (2019).
CAS PubMed Google Scholar
Johnson, L. et al. Somatic activation of the K-ras oncogene causes early onset lung cancer in mice. Nature 410, 1111–1116 (2001).
CAS PubMed Google Scholar
Marino, S., Vooijs, M., van Der Gulden, H., Jonkers, J. & Berns, A. Induction of medulloblastomas in p53-null mutant mice by somatic inactivation of Rb in the external granular layer cells of the cerebellum. Genes Dev. 14, 994–1004 (2000).
CAS PubMed PubMed Central Google Scholar
Li, L. et al. Identification of DHODH as a therapeutic target in small cell lung cancer. Sci. Transl. Med. 11, 7852 (2019).
Google Scholar
Chu, V. T. et al. Efficient generation of Rosa26 knock-in mice using CRISPR/Cas9 in C57BL/6 zygotes. BMC Biotechnol. 16, 4 (2016).
PubMed PubMed Central Google Scholar
Roper, J. et al. Colonoscopy-based colorectal cancer modeling in mice with CRISPR-Cas9 genome editing and organoid transplantation. Nat. Protoc. 13, 217–234 (2018).
CAS PubMed PubMed Central Google Scholar
Sánchez-Rivera, F. J. et al. Rapid modelling of cooperating genetic events in cancer through somatic genome editing. Nature 516, 428–431 (2014).
PubMed PubMed Central Google Scholar
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://doi.org/10.48550/arXiv.1303.3997 (2013).
Van der Auwera, G. A. & O’Connor, B. D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (O’Reilly Media, Inc., 2020).
Fan, Y. et al. MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data. Genome Biol. 17, 1–11 (2016).
Google Scholar
Lai, Z. et al. VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research. Nucleic Acids Res. 44, e108–e108 (2016).
PubMed PubMed Central Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
CAS PubMed Google Scholar
Wang, M. et al. SomaticCombiner: improving the performance of somatic variant calling based on evaluation tests and a consensus approach. Sci. Rep. 10, 12898 (2020).
CAS PubMed PubMed Central Google Scholar
Kofler, R., Schlötterer, C. & Lelley, T. SciRoKo: a new tool for whole genome microsatellite search and investigation. Bioinformatics 23, 1683–1685 (2007).
CAS PubMed Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. Preprint at https://doi.org/10.48550/arXiv.1207.3907 (2012).
Riester, M. et al. PureCN: copy number calling and SNV classification using targeted short read sequencing. Source Code Biol. Med. 11, 13 (2016).
PubMed PubMed Central Google Scholar
Blokzijl, F., Janssen, R., van Boxtel, R. & Cuppen, E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med. 10, 28 (2018).
Google Scholar
Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
PubMed PubMed Central Google Scholar
McLaren, W. et al. The Ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
PubMed PubMed Central Google Scholar
Hundal, J. et al. PVACtools: A computational toolkit to identify and visualize cancer neoantigens. Cancer Immunol. Res. 8, 409–420 (2020).
CAS PubMed PubMed Central Google Scholar
Jurtz, V. et al. NetMHCpan-4.0: improved peptide–MHC class I interaction predictions integrating eluted ligand and peptide binding affinity data. J. Immunol. 199, 3360–3368 (2017).
CAS PubMed Google Scholar
Andreatta, M. & Nielsen, M. Gapped sequence alignment using artificial neural networks: spplication to the MHC class I system. Bioinformatics 32, 511–517 (2016).
CAS PubMed Google Scholar
Peters, B. & Sette, A. Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method. BMC Bioinform. 6, 132 (2005).
Google Scholar
Kim, Y., Sidney, J., Pinilla, C., Sette, A. & Peters, B. Derivation of an amino acid similarity matrix for peptide:MHC binding and its application as a Bayesian prior. BMC Bioinform. 10, 394 (2009).
Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
CAS PubMed Google Scholar
Picard Tools—By Broad Institute. GitHub http://broadinstitute.github.io/picard.
Bankhead, P. et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep. 7, 16878 (2017).
PubMed PubMed Central Google Scholar
Tammela, T. et al. A Wnt-producing niche drives proliferative potential and progression in lung adenocarcinoma. Nature 545, 355–359 (2017).
CAS PubMed PubMed Central Google Scholar
Ran, F. A. et al. Genome engineering using the CRISPR–Cas9 system. Nat. Protoc. 8, 2281–2308 (2013).
CAS PubMed PubMed Central Google Scholar
Bindels, D. S. et al. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nat. Methods 14, 53–56 (2016).
PubMed Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166 (2015).
CAS PubMed Google Scholar
Therneau, T. M. & Grambsch, P. M. Modeling Survival Data: Extending the Cox Model (Springer, 2000).

Download references

Acknowledgements

This work was supported by the National Cancer Institute’s Cancer Center Support Grant (nos. P30-CA14051 and R01 CA233983 to T.J.) and the Howard Hughes Medical Institute (to T.J.) P.M.K.W. was supported by a Damon Runyon Fellowship Award. The funders had no role in the study design, data collection and analysis, decision to publish or preparation of the paper. We thank K. Yee, J. Teixeira, K. Anderson and M. Magendantz for administrative support, and our colleagues in the Jacks laboratory and the broader community at the Koch Institute of MIT for thoughtful discussions and technical advice. We thank the Koch Institute Swanson Biotechnology Center for core support from the BioMicro Center and Histology, Proteomics, and Flow Cytometry facilities, with particular thanks to S. Levine for high-throughput sequencing support, K. Cormier and C. Condon for histology support, A. Koller for help with design and analysis of MS experiments, and M. Griffin, M. Jennings and G. Paradis for flow cytometry support. F.M. and I.C.C. thank EMBL for funding. We are also grateful for collaboration with T. Westerling and Aiforia in developing automated CNNs for lung tumor grade and IHC quantification.

Funding

Open access funding provided by European Molecular Biology Laboratory (EMBL).

Author information

Peter M. K. Westcott
Present address: Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
These authors contributed equally: Haley Hauck, Olivia C. Smith.

Authors and Affiliations

David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
Peter M. K. Westcott, Haley Hauck, Olivia C. Smith, Nathan J. Sacks, Zackery A. Ely, Alex M. Jaeger, William M. Rideout III, Daniel Zhang, Arjun Bhutkar, Mary C. Beytagh, David A. Canner, Grissel C. Jaramillo, Santiago Naranjo, Abbey Jin, J. J. Patten, Amanda M. Cruz, Sean-Luc Shanahan & Tyler Jacks
European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
Francesc Muyas & Isidro Cortes-Ciriano
Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
Zackery A. Ely
Rodent Histopathology Core, Harvard Medical School, Boston, MA, USA
Roderick T. Bronson & Tyler Jacks

Authors

Peter M. K. Westcott
View author publications
You can also search for this author in PubMed Google Scholar
Francesc Muyas
View author publications
You can also search for this author in PubMed Google Scholar
Haley Hauck
View author publications
You can also search for this author in PubMed Google Scholar
Olivia C. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Nathan J. Sacks
View author publications
You can also search for this author in PubMed Google Scholar
Zackery A. Ely
View author publications
You can also search for this author in PubMed Google Scholar
Alex M. Jaeger
View author publications
You can also search for this author in PubMed Google Scholar
William M. Rideout III
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Arjun Bhutkar
View author publications
You can also search for this author in PubMed Google Scholar
Mary C. Beytagh
View author publications
You can also search for this author in PubMed Google Scholar
David A. Canner
View author publications
You can also search for this author in PubMed Google Scholar
Grissel C. Jaramillo
View author publications
You can also search for this author in PubMed Google Scholar
Roderick T. Bronson
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Naranjo
View author publications
You can also search for this author in PubMed Google Scholar
Abbey Jin
View author publications
You can also search for this author in PubMed Google Scholar
J. J. Patten
View author publications
You can also search for this author in PubMed Google Scholar
Amanda M. Cruz
View author publications
You can also search for this author in PubMed Google Scholar
Sean-Luc Shanahan
View author publications
You can also search for this author in PubMed Google Scholar
Isidro Cortes-Ciriano
View author publications
You can also search for this author in PubMed Google Scholar
Tyler Jacks
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.M.K.W. and T.J. conceived the study. P.M.K.W., T.J. and I.C.C. supervised the study. P.M.K.W., O.C.S., H.H., N.J.S., A.M.J., W.M.R., D.A.C., G.C.J. and S.L.S. carried out all aspects of animal care and experimentation. F.M. and I.C.C designed sequencing analysis pipelines and carried out analysis with P.M.K.W. Z.A.E. designed and executed the neoantigen prediction pipeline. All other data analysis was carried out by P.M.K.W. A.B. provided conceptual and technical guidance in sequencing analysis. S.N. designed dual sgRNA lentiviral constructs. D.Z., M.C.B., A.J., J.J.P. and A.M.C. produced and validated critical reagents. R.T.B. provided validation of ground truth for training the Aiforia lung tumor grading CNN. P.M.K.W., I.C.C. and T.J. wrote the paper, with feedback from all authors.

Corresponding authors

Correspondence to Peter M. K. Westcott, Isidro Cortes-Ciriano or Tyler Jacks.

Ethics declarations

Competing interests

T.J. is a member of the Board of Directors of Amgen and Thermo Fisher Scientific, and a co-founder of Dragonfly Therapeutics and T2 Biosystems. He serves on the Scientific Advisory Board of Dragonfly Therapeutics, SQZ Biotech and Skyhawk Therapeutics, and is the President of Break Through Cancer. None of these affiliations represents a conflict of interest with respect to the design or execution of the present study or interpretation of data presented in this manuscript. The Jacks laboratory also currently receives funding from the Johnson & Johnson Lung Cancer Initiative and the Lustgarten Foundation for Pancreatic Cancer Research, but this did not support the research described in the present study. All other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Charles Swanton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Validation of in vivo DNA mismatch repair gene knockout.

(a) Schematic of Kras^LSL-G12D; Trp53^flox/flox; Msh2^floxl/flox (KPM) lung tumor model. (b-c) Percent lung tumor area negative for MSH2 by IHC at 5- and 16-weeks post-initiation of Kras^LSL-G12D; Trp53^flox/flox; R26^LSL-Cas9 (KPC) animals with sgMsh2 lentivirus (N = 7 5-week and 5 16-week animals) (b) and 12- and 16-weeks post-initiation of KPM animals (N = 4 12-week and 6 16-week animals) (c). (d) Deep sequencing of sgMsh2-targeted locus in cell lines derived from 16–20-week sgCtl- and sgMsh2-targeted lung tumors. WT = wild-type. (e) Western blot of MSH2 expression in lung tumor cell lines, experimentally replicated three times. Clone = 09-2 single cell clone. (f-g) Deep sequencing of Msh2, Apc (f) and Mlh1 (g) loci targeted in autochthonous colon tumors, representative of 5 sgMsh2- and 6 sgMlh1-targeted tumors. (h-i) Western blots of MSH2 (h) and MLH1 (i) expression in one organoid line each derived from sgCtl-, sgMsh2-, and sgMlh1-targeted colon tumors, experimentally replicated twice. (j-l) Total somatic single nucleotide variants (SNVs) and insertions/deletions (indels) within exome of autochthonous lung tumors (j), cell lines (k), and autochthonous colon tumors (l). Pole* = Pole S415R mutation. (m) MSIsensor-pro scores for tumors in (j-l). (n) Frequency of indels across DNA microsatellite contexts, including exonic and intronic mutations. Samples were ordered by total indels. Homo = homopolymer repeats ≥ 4 bases, 2-5 + = microsatellites with motifs of 2-5+ bases, and multi = microsatellites with multiple repetitive motifs. (o-p) COSMIC mutational signature decomposition of 15 KPM and 26 KPC (sgMsh2) lung tumors (o) based on frequencies of the 96 possible SNVs classified by trinucleotide context (p). SBS_MRD = DNA mismatch repair deficiency (MMRd) signature. (q) Pearson correlation of the fraction of mutations detected in each trinucleotide context between human and mouse MMRd datasets. (r) Total somatic SNVs/indels in exome of 09-2 lung tumor cell line and early- and 20-passage 09-2 single cell clone. (s-t) Total mutations in early-passage clone that are also supported in sequencing reads of 09-2 parental line (s) and an unrelated MMRp control line, 13-1 (t), with variant allele fraction on x-axis. Grey = official mutation calls; gold = not called. Significance in (b-c) was assessed by Wilcoxon Rank Sum test.

Source data

Extended Data Fig. 2 Tumor kinetics and immunogenicity are unaffected by MMRd.

(a-b) Percent lung area occupied by tumor grades 1-4 (G1-4) in sgMsh2- and sgCtl-targeted KPC mice at 5- (a) and 16-weeks (b) post-initiation, representative of 7 and 9 5-week and 5 and 3 16-week animals, respectively. (c-d) Average lung tumor area of animals in (a-b) positive for CD3 (T cells) by immunohistochemistry (IHC) (c), with representative CD3 IHC of an sgMsh2-targeted lung tumor (d). Scale bar = 200 μM. (e-h) IHC staining and Aiforia convoluted neural network (CNN) quantification of KPM and KP tumor-bearing lungs of animals in Fig. 2a,b. Stained 15-week KPM (e) and KP (f) tumor-bearing lungs (left panel) with Aiforia CNN annotations (right panel) for CD4⁺ (red), CD8⁺ (blue), and CD4⁺FOXP3⁺ Tregs (yellow), with quantification across whole lungs in KPM and KP models at 5- (g) and 15-weeks (h) post-initiation. Tumors in (e-f) are outlined in black; scale bar = 1 mm. (i) Change in focal colon tumor area by longitudinal colonoscopy. N = 7 sgMsh2- and 7 sgCtl-targeted animals with 10 and 9 tumors, respectively. (j-l) Percent lung area occupied by G1-4 tumors in KPM and KP mice (from Fig. 2i) after 4 weeks of ICB treatment (j), at 16-weeks post-initiation with continuous αCD4/8 treatment (k), and overall lung tumor burden with no treatment versus continuous αCD4/8 at 14- and 16-weeks post-initiation, respectively (l). (m-o) Flow cytometric analysis of CD4⁺ and CD8⁺ T cells in peripheral blood of experimental animals following αCD4/8 treatment. Gating strategy (m). Percent of CD4⁺ (n) and CD8⁺ (o) T cells relative to total CD45⁺ cells. (p) CD4, CD8, and FOXP3 IHC of 16-week tumor-bearing lungs from animals treated continuously with αCD4/8. Animals in (n-p) are the same as αCD4/8-treated animals in (k) and (q). (q-r) Colon preclinical trial arms (q) and schematic (r). Boxplots display median, interquartile range (box bounds), whiskers extending to most extreme points (≤1.5X interquartile range), and all datapoints. Significance in (a-c, g-h, j-l) was assessed by Wilcoxon Rank Sum test with multiple test corrections.

Source data

Extended Data Fig. 3 Immunoediting exacerbates intratumoral heterogeneity by pruning clonal but not subclonal neoantigens.

(a-b) Total consequential mutations (nonsynonymous SNVs and indels) per megabase (Mb) DNA for autochthonous lung tumors and cell lines and autochthonous colon tumors, separated by clonal (cancer cell fraction (CCF) ≥ 0.75) (a) and subclonal (CCF ≤ 0.75) (b), with fold-change shown for each comparison. (c) Histograms of total mutations by cancer cell fraction in a representative sgMsh2-targeted lung tumor, cell line, and colon tumor from Fig. 3a,b. (d) Western blot of MSH2 expression in single cell clones with Msh2 re-expression, after 20 passages with puromycin selection, experimentally replicated three times. WT = sgCtl-targeted cell line; MSH2^KO = parental sgMsh2-targeted cell line (09-2). (e) Venn diagrams of mutation overlap between M1-8 clones sequenced at early passage (called passage 0 for convenience) and 20 passages later. (f) Histograms of total mutations by cancer cell fraction in M1-8 clones at passage 0 and 20. (g) Pairwise intersection map of mutations across M1-8 clones. Scale represents fraction of total mutations shared between each pair. (h-i) Total clonal (CCF ≥ 0.75) (h) and subclonal (CCF ≤ 0.5) (i) predicted neoantigens in lung tumors from Fig. 3g–j. (j) Distribution of cancer cell fraction estimates from Fig. 3i with sgMsh2-targeted Pole S415R mutant lung tumor removed. (k-l) Total mutations / Mb (k) and predicted neoantigens (l) in 16–20-week autochthonous MMRd colon tumors from animals with no treatment (blue shades, N = 5 Msh2^KO, 6 Mlh1^KO, 6 Msh3^KO, 2 Msh6^KO) and continuous antibody-mediated T cell depletion (αCD4/8, magenta, N = 7 Msh2^KO). (m-n) CCF distribution (m) and per tumor median (n) of all SNV-derived neoantigens (no expression filter) in colon tumors from (j-k). Significance and smoothing in (i, l) were assessed by two-sided Kolmogorov-Smirnov test and Gaussian kernel density estimation, respectively. Significance in (a-b, h-i, k-l, n) was assessed by Wilcoxon Rank Sum test, with Holm correction for five comparisons in (a-b). P values in (h-j) are uncorrected.

Source data

Extended Data Fig. 4 Clones re-expressing Msh2 grow similarly in vitro and are IFNγ responsive.

(a) In vitro growth kinetics of parental MSH2 knockout cell lines (09-2, 22-1, 22-2) and Msh2 re-expressing clones generated from 09-2 (M1-8), measured by live cell imaging with an IncuCyte S3. Bars represent standard deviation of eight replicates (wells). (b-e) Flow cytometric analysis of surface expression of MHC-I alleles H-2K^b (b) and H-2D^b (c) and IFNγ-response gene PD-L1 (d) in cell lines from (a) following overnight stimulation with 0, 0.1, and 1.0 ng/mL IFNγ. MFI = mean fluorescence intensity. (e) Representative histograms of H-2K^b, H-2D^b and PD-L1 expression in a clone (M3) from the experiment in (b-d). (f) Survival Hazard Ratios (HR) of syngeneic mice orthotopically transplanted via intratracheal instillation with clones (C1-5) derived from parental sgCtl-targeted line 13-1 (Msh2 WT), with and without ICB treatment. N = number of animals, P = P-value, Norm.HR = normalized HR. Bars represent upper and lower 95% confidence intervals. (g-h) Estimation of clonal frequencies of M1-8 clones in equimolar mixture of DNA (g) and lung tumors and associated metastases (denoted by arrows) from animals transplanted with an equal mixture of all clones and receiving no treatment, continuous αCD4/8, or ICB (h), as determined by targeted deep amplicon sequencing of 4 private SNVs per clone. ICB treatment in (f, h) was started 2-weeks post-transplant and continued for 4 weeks. Significance in (b-d) was assessed by Wilcoxon Rank Sum test with correction for 3 tests. Significance in (f) was assessed by Cox proportional hazards regression with Holm correction for 12 tests (other 7 tests shown in Fig. 4b).

Source data

Extended Data Fig. 5 Identification and validation of bona fide MMRd-derived neoantigens.

(a) Somatic mutation-derived epitopes (neoepitopes) found to be presented on surface MHC-I (H2-K^b and H2-D^b) in M1-8 clones by Tandem Mass Tag Mass Spectrometry. Immunogenicity was validated by IFNγ ELISpot and MHC-I tetramer staining. (b) Vaccine regimen to assess in vivo immunogenicity of neoepitopes. Conventional cross-presenting dendritic cells (cDC1s) were in vitro differentiated from bone marrow, activated, loaded with neoepitope peptide and injected intradermally into mice, which were boosted with two different adjuvants delivered with peptide over the course of the experiment (see Methods). (c-d) IFNγ ELISpot (c) and flow cytometric staining with neoepitope-loaded MHC-I tetramers (d) of splenocytes isolated from mice vaccinated with the indicated peptides. SIINFEKL = immunogenic peptide positive control; “- controls” = no peptide vaccination. (e) Flow cytometric gating strategy for analysis of neoantigen-specific CD8⁺ T cells in Fig. 5. (f-h) Representative flow plot (f) and total QAYAFLQHL-specific CD8⁺ T cells (g-h) in lungs from animals in Fig. 5 as determined by MHC-I dextramer staining in two channels (PE and APC). (i-k) Flow cytometric analyses of M2 clone neoantigen (AALQNAVTF)-specific T cells isolated from mLNs and lungs of syngeneic mice intratracheally transplanted with M2 at CCF = 1 (N = 7), 0.5 (N = 8), and 0.125 (N = 7), and treated with ICB for 2 weeks starting 2-weeks post-transplant. Representative flow plot (i) and total AALQNAVTF-specific CD8⁺ T cells in mLNs (j) and lungs (k) as determined by MHC-I tetramer staining in two channels (PE and APC). (l-p) Representative flow plot (l) and quantification of percent of QAYAFLQHL-specific CD8⁺ T cells positive for TCF1 (m-n) and GZMB (o-p) in lungs from animals in Fig. 5. Significance in (g, j-k, m, o) was assessed by both Spearman Rank Correlation with a numeric x-axis (CCF) and Wilcoxon Rank Sum test (CCF = 1 versus 0.125 groups). Significance in (h, n, p) was assessed by Wilcoxon Rank Sum test. Samples with < 10 QAYAFLQHL-specific CD8⁺ T cells were excluded from analysis in (m-p).

Source data

Extended Data Fig. 6 Overall tumor neoantigen burden is imperfectly correlated with clonal burden and ICB response in human MMRd cancer.

(a-d) Total clonal (CCF = 0.75 to 1.0) and subclonal (CCF = 0 to 0.5) tumor neoantigen burden (TNB) (a-b) and ITH index (subclonal to clonal neoantigen ratio) (c-d) in patients with objective response (OR) versus nonresponse (NR) in separated analyses of Bortolomeazzi trial⁵⁰ (a,c) and Kwon trial³⁵ (b,d). (e-g) TNB, regardless of clonality, in combined analysis (e) and separated analyses of Bortolomeazzi (f) and Kwon (g) trials. (h) Pearson correlation of overall versus clonal TNB across both studies, with correlation outliers circled in red. (i-j) Progression free survival of patients separated by upper versus lower quartiles of overall TNB (i) and clinical study (j). Number of patients from each study is indicated under plots. Significance in (a-g) was assessed by Wilcoxon Rank Sum test, and in (i-j) was assessed by Cox proportional hazards regression, with clinical trial study as a covariate in (i).

Source data

Supplementary information

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Table 1. WES/RNA-seq detailed sample information, including summary of major edits in CRISPR–Cas9-targeted tumors. Supplementary Table 2. M1–8 single-cell clone MHC-I MS peptide calls of high, medium and low confidence. Supplementary Table 3. Predicted neoantigens and CCFs from reanalysis of WES data in Bortolomeazzi et al.⁵⁰ and Kwon et al.³⁵ clinical trials. Supplementary Table 4. The sgRNA sequences used for in vivo CRISPR–Cas9 editing.

Source data

Source Data Fig. 1

Statistical source data for Fig. 1.

Source Data Fig. 2

Statistical source data for Fig. 2.

Source Data Fig. 3

Statistical source data for Fig. 3.

Source Data Fig. 4

Statistical source data for Fig. 4.

Source Data Fig. 5

Statistical source data for Fig. 5.

Source Data Fig. 6

Statistical source data for Fig. 6.

Source Data Extended Data Fig. 1

Statistical source data for Extended Data Fig. 1.

Source Data Extended Data Fig. 2

Statistical source data for Extended Data Fig. 2.

Source Data Extended Data Fig. 3

Statistical source data for Extended Data Fig. 3.

Source Data Extended Data Fig. 4

Statistical source data for Extended Data Fig. 4.

Source Data Extended Data Fig. 5

Statistical source data for Extended Data Fig. 5.

Source Data Extended Data Fig. 6

Statistical source data for Extended Data Fig. 6.

Source Data Extended Data Figs 1 and 3

Unprocessed western blots for Extended Data Figs. 1e,h,i and 3d.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Westcott, P.M.K., Muyas, F., Hauck, H. et al. Mismatch repair deficiency is not sufficient to elicit tumor immunogenicity. Nat Genet 55, 1686–1695 (2023). https://doi.org/10.1038/s41588-023-01499-4

Download citation

Received: 09 March 2023
Accepted: 07 August 2023
Published: 14 September 2023
Issue Date: October 2023
DOI: https://doi.org/10.1038/s41588-023-01499-4

This article is cited by

Somatic mouse models of gastric cancer reveal genotype-specific features of metastatic disease
- Josef Leibold
- Kaloyan M. Tsanov
- Scott W. Lowe
Nature Cancer (2024)
Neoantigen-targeted TCR-engineered T cell immunotherapy: current advances and challenges
- Zhi Pang
- Man-man Lu
- Wei-zhong Wu
Biomarker Research (2023)
Tumor heterogeneity impairs immunogenicity in mismatch repair deficient tumors
- James L. Reading
- Deborah R. Caswell
- Charles Swanton
Nature Genetics (2023)
Neoplasia risk in patients with Lynch syndrome treated with immune checkpoint blockade
- Emily C. Harrold
- Michael B. Foote
- Zsofia K. Stadler
Nature Medicine (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Modeling sporadic MMRd in cancer

Sporadic MMRd in the mouse is not immunogenic

MMRd drives mutational heterogeneity

Intratumor heterogeneity enables immune evasion

Neoantigen clonality tunes the T cell response

Clonal TNB predicts ICB response in human MMRd cancer

Discussion

Methods

Mice

Tumor models

WES and mutation calling

CCF estimation

Mutational signature and MSI analyses

Clonal deconvolution by targeted amplicon sequencing

Neoantigen prediction and expression

Histology and immunohistochemistry

Western blotting

In vivo antibody and chemotherapy dosing

In vivo tumor imaging and quantification

Lentiviral constructs

Validation of CRISPR–Cas9 editing and estimation of tumor purity

In vitro cell-line assays

Phylogenetic tree analysis

MHC-I immunopeptidomics

Dendritic cell vaccination, ELISpot and MHC-I multimer staining

Tissue preparation and flow cytometry

Analysis of human MMRd cancer clinical trials

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links