Abstract
Neoantigen burden is regarded as a fundamental determinant of response to immunotherapy. However, its predictive value remains in question because some tumours with high neoantigen load show resistance. Here, we investigate our patient cohort together with a public cohort by our algorithms for the modelling of peptide-MHC binding and inter-cohort genomic prediction of therapeutic resistance. We first attempt to predict MHC-binding peptides at high accuracy with convolutional neural networks. Our prediction outperforms previous methods inβ>β70% of test cases. We then develop a classifier that can predict resistance from functional mutations. The predictive genes are involved in immune response and EGFR signalling, whereas their mutation patterns reflect positive selection. When integrated with our neoantigen profiling, these anti-immunogenic mutations reveal higher predictive power than known resistance factors. Our results suggest that the clinical benefit of immunotherapy can be determined by neoantigens that induce immunity and functional mutations that facilitate immune evasion.
Similar content being viewed by others
Introduction
Cancer immunotherapy has become remarkably effective in a range of human cancers. In particular, checkpoint blockade therapies, such as anti-CTLA-4 (ipilimumab) and anti-PD-1 (nivolumab and pembrolizumab), are able to reverse tumour-induced immunosuppression and induce durable clinical responses1. Tumour cells produce neoantigens or antigens that the immune system never encountered without cancer. The epitopes of neoantigens, which are displayed with major histocompatibility complexes (MHCs) on the surface of cancer cells, provoke immune response when recognised by T cells. Tumours loaded with more neoantigens therefore are more likely to be responsive to the anti-immunosuppressive strategies2. However, a sizeable fraction of those tumours resist to checkpoint blockade3.
Tumour neoantigens are generated by somatic mutations producing novel peptides that can be recognised as foreign, thereby conferring immunogenicity to cancer cells. Neoantigen burden is therefore regarded as a fundamental determinant of response to immunotherapy including checkpoint blockade. Neoantigen burden has been estimated by several computational tools that predict peptide binding to MHC class I4. However, the current tools fail to capture the nonlinear high-order features of interactions among different amino acids5. In this regard, a recent study has shown that amino acids distal to contact interfaces can exert significant effects on the interactions between MHC-peptide complexes and T cell receptors (TCRs)6. Therefore, we are in need of a prediction method that captures the spatial features amino acid interactions. Convolutional neural networks (CNNs) have been applied successfully for the identification of local sequence patterns in protein-nucleic acid interactions7 or functional effects of noncoding variants8. Hence, in this work, we attempted to develop a CNN-based algorithmic framework for the prediction of peptide binding to MHC class I molecules.
There are additional putative biomarkers that have been reported to predict response to checkpoint blockade together with neoantigen burden. Tumour heterogeneity9, copy number alteration10, aneuploidy11, and genetic alteration of specific genes12 or pathways13 have been identified as resistance markers. Pre-existing T cell infiltration also may impinge on response to checkpoint blockade14,15. Since all of these markers or factors differ greatly between individual cancer patients, a framework for describing diverse predictive factors has been proposed to enable personalised cancer immunotherapy16. However, resistance mechanisms involving alterations of individual genes have not been explored at the genomic level. Especially, tumours loaded with neoantigens should carry a large number of functional mutations due to high mutation rates. The functional mutations that facilitate immune evasion may be subject to positive selection. In this work, we examine whether mutation profiles can explain the therapeutic resistance of tumours loaded with neoantigens. We first use our clinical data together with public cohort data and then perform further tests using molecular immune evasion markers in other tumour types.
Results
Accurate prediction of peptide-MHC class I binding
We constructed a convolutional neural network (CNN) model to predict the binding of MHC class I molecules and peptides. About 50,000 binding data obtained from the immune epitope database (IEDB) 3.0. (http://www.iedb.org/)17 were used for the model training. The CNN architecture enables to capture specific local properties of input data such as in images. To build an amino acid interaction map for our CNN model, we inferred the binding preference of each pair of amino acids using interaction energy estimated from the frequency of neighbouring amino acids in native protein structures18. The interaction map between peptides and HLA sequences were scanned by a number of kernels to detect particular binding motifs that are critical for peptide-MHC I binding (Fig.Β 1a and Supplementary Fig.Β 1). According to the receiver operating characteristic (ROC) curves, the area under the curve (AUC) for the training data was 0.93 for HLA-A and 0.94 for HLA-B, respectively. In the test data, the AUC was 0.89 for HLA-A and 0.86 for HLA-B.
Comparison with an interaction map consisting of randomly permuted or null values made it clear that the amino acid interaction map we used for our model played a critical role in predicting the MHC-peptide binding (Supplementary Fig.Β 2). This implies that the amino acid interaction preferences derived from native protein structures18 can serve as interaction parameters for the modelling of the binding of MHCs and peptides. We also tested whether the mere sum of the amino acid binding preferences in the interaction map can be used as a predictor. If this is the case, the sum of the preferences should be lowest for true positives because the binding preferences were encoded as the energy levels of interactions. However, we could rule out this possibility (Supplementary Fig.Β 4), indicating that our CNN model was trained on the pattern rather than the aggregate of binding preferences.
Next, we tested the prediction accuracy of our CNN model using the IEDB test datasets that are weekly updated along with performance evaluations of current prediction tools including the most widely used one named NetMHCpan19 (now updated to NetMHCpan 4.020). Our model showed higher performance than the compared prediction tools. In terms of AUC, our method was superior to SMMPMBEC, ANN, NetMHCcon, and NetMHCpan for 100%, 100%, 90%, and 70% of the test cases. With regards to the F1 score, our CNN was superior to all the methods for 80% of the test cases (Fig.Β 1b). Highest performance was achieved for binding of HLA-As and 9-mer peptides. In general, the training dataset was largest for this class of binding, implying that higher prediction accuracy can be expected as in vitro binding data grow. In contrast to the size of training data, prediction accuracy was maintained irrespective of the size of test data (Supplementary Fig.Β 3).
Clinical and molecular relevance of neoantigen load estimated by CNN
We applied our CNN model to predict MHC class I-binding neoantigens for clinical samples that were used in four cohorts of melanoma10,21,22,23 and three cohorts of lung cancer24,25, including our clinical cohort referred to as SMC (Samsung Medical Center) cohort (TableΒ 1 and Supplementary DataΒ 1). The samples were divided by curated clinical response, and the extent of neoantigen load was assessed on the basis of our prediction model. Neoantigen load predicted by the CNN method showed a significant correlation with clinical benefit in most cohorts (Fig.Β 2a). In particular, the P values of our association were significant except two cohorts, for which marginal or lower associations were observed when NetMHCpan was used for neoantigen prediction. We compared patient survival between the high- and low-neoantigen groups for the cohorts showing the significant associations. Higher neoantigen burden was significantly associated with longer disease free time consistently in all cohorts (Fig.Β 2b). This correlation was not seen when NetMHCpan was used (Supplementary Fig.Β 5).
We wanted to further validate the relevance of the CNN predictions using samples from The Cancer Genome Atlas (TCGA) although not in the setting of checkpoint blockade therapy. To this end, we collected exome and transcriptome data for skin cutaneous melanoma, performed HLA typing for MHC class I, and then predicted neoantigens that were capable of binding the relevant MHC I molecules by using our CNN. We also computed the immune score and TCR diversity for the same samples. The diversity of tumour-reactive T cell clonotypes represented by TCR repertoire can reflect the load of immunogenic antigens that stimulate T cell infiltration26. Similarly, the immune score, defined as the geometric mean of the expression levels of immunologically relevant genes, has been reported to correlate with the clinical benefit of checkpoint blockade immunotherapy10. According to our analyses, neoantigen load estimated by the CNN model showed a significant association with the immune score and TCR diversity (Fig.Β 2c). For these samples, our estimate of neoantigen load (left of Fig.Β 2d) than mutation burden (right of Fig.Β 2d) showed a better correlation with patient survival. However, this should be interpreted with caution given no statistically significance, which may be attributed to the fact that we simply compared patient survival but not in the setting of checkpoint therapy.
Functional immune mutations explain poor therapeutic response
A large amount of neoantigens reflects a high mutation frequency. If some of protein-changing mutations perturb immune reaction, tumours carrying these mutations will not be responsive to immunotherapy despite high neoantigen load. However, there was no statistical significance when we directly compared the responsive and resistant groups in mutation burden on genes involved in the immune system, immune response, antigen presentation, or antigen processing (Supplementary Fig.Β 6). Mutation burden was generally higher in the resistant tumours for genes involved in antigen presentation or processing but only marginally (Supplementary Fig.Β 6). This suggests a role for genes that can affect immune response indirectly, for example, by interacting with genes that are formally annotated as immune related.
We therefore developed a classifier that uses the exomic profiles of functional mutations to predict samples that will be resistant to checkpoint blockade. Our MHC-binding prediction was performed for all the cohorts (TableΒ 1) to select samples with high neoantigen burden and define the resistant group according to the clinical outcome (Supplementary Fig.Β 7). For each cohort, we used the other cohorts of the same tumour type as training data. For example, we trained random forests with Hellman and Rizvi cohorts and tested performance on SMC cohort. As a negative control, we trained the classifier on synonymous mutations. As a result, only the models trained with functional mutations achieved reasonable accuracies for all the test datasets (Fig.Β 3a).
We next investigated which genes were critical for predicting therapeutic response when mutated. For this, we retrieved genes with a high βvariable importanceβ of random forests and their interacting partners in the protein interactome27,28,29 (Supplementary DataΒ 2 and 3). These genes were related to the adaptive immune system, cytokine signalling, and epidermal growth factor receptor (EGFR) signalling (Fig.Β 3b, c). Tumours produce cytokines that alter tumour immunogenicity and the antitumour immune response of the host. For example, interferon-Ξ³ pathway inactivation in tumours was implicated in resistance to immunotherapy13. The EGFR pathway was implicated in immune escape and clinical response to immunotherapy30,31.
We then examined whether the predictive genes identified from our classifier carry the signatures of positive selection. Recent studies investigated selection patterns at the gene level based on the ratio of nonsynonymous to synonymous mutations across a large number of tumour samples32,33. Positive selection on mutations will lead to the excess of nonsynonymous mutations for given background mutation rates estimated by the frequency of synonymous mutations. In other words, a gene under positive selection will carry an extra complement of driver mutations in addition to passenger mutations. We employed the scores that were previously calculated for each gene by the Bayesian inference32 and statistical model for covariates (dNdScv)33 based on the mutation patterns observed in the TCGA data. Using these scores, we compared the degree of positive selection on the predictive genes with that on genes categorised as antigen-presentation or immune-related pathway. The score distributions for all genes were also considered. As a result, significantly higher positive selection scores, in particular those from the Bayesian inference32, were observed for the predictive genes than for the other groups of genes (Fig.Β 3d), indicating that our prediction model was based on functional mutations that are subject to positive selection because of their contribution to immune evasion.
Profiling functional mutations together with neoantigens accurately predicts therapeutic response
Up to this point, we sought to resolve the contradiction between clinical observations and the βneoantigen rouletteβ theory3 by a more accurate estimation of neoantigen load as a predictive marker of clinical response and by the identification of functional mutations in immune-related genes for prediction of therapeutic resistance. We next wanted to evaluate the clinical utility of combining these two approaches on the basis of the melanoma samples in Roh cohort and lung cancer samples in SMC cohort that came with information on therapeutic response to checkpoint inhibitor and known resistance parameters such as copy number alteration (gain or loss) and tumour heterogeneity9,10,11.
We performed regression of therapeutic resistance on the exomic prediction score from our classifier (Fig.Β 4a, d), on previously known resistance parameters (Fig.Β 4b, e), and on all the variables together (Fig.Β 4c, f). Overall, our classifier performed markedly better when tumours with high neoantigen load were defined by using our CNN than NetMHCpan (Fig.Β 4aβc for SMC cohort and Fig.Β 4dβf for Roh cohort). Most importantly, our exomic prediction score was the most significant contributing factor regardless of the regression methods in both cohorts (Fig.Β 4c, f).
Functional immune mutations are associated with immune evasion signatures
To test whether this approach is applicable to other cancer types, we used exome and transcriptome data from TCGA. For training of mutational patterns, we selected four tumour types for which a sufficient number of samples were available in the database: bladder cancer (BLCA, nβ=β152), oesophagus cancer (ESCA, nβ=β36), head and neck cancer (HNSC, nβ=β242), and lung cancer (LUSC, nβ=β78). We performed HLA typing and our CNN prediction to obtain the estimate of neoantigen load. Among the genes that are used for calculating the immune score, we chose those that showed the largest positive correlation with neoantigen load in each tumour type. Then, we identified the samples with immune evasion by looking for the cases that deviated from the correlation by showing a low expression level of those genes despite high neoantigen load. These samples can be regarded as the equivalent of the resistant tumours in the melanoma cohorts.
Using these data, we repeated the same analyses as we did with melanoma. For each tumour type, we trained random forest on the samples showing the immune evasion signatures by using as features the genes that harbour deleterious or damaging mutations in >β5% of all samples. The performance of our classifier was evaluated by comparing the functional mutation model with the control model trained with synonymous mutations (Fig.Β 5a). We next investigated which genes exerted explanatory power in predicting immune escape when mutated. In general, the functional mutation model resulted in genes with high variable importance (Fig.Β 5b). We retrieved these genes in each tumour type and collected their interacting partners in the protein interactome27,28,29. Similar to the melanoma cases, these genes tended to be involved in cytokine signalling pathway, adaptive immune system, and EGFR signalling in cancer (Fig.Β 5c and Supplementary Fig.Β 8).
We finally assessed the contribution of the mutation features in explaining the molecular signatures of immune evasion across different tumour types. There was a recent study that reported the correlation of extensive somatic copy number alterations (SCNAs) with the immune score11. While focal SCNAs mainly correlated with proliferation markers, arm- and chromosome-level SCNAs were negatively associated with the immune score. Multiple regression of immune evasion status on tumour heterogeneity, focal SCNA, arm/chromosome-level SCNA, and mutation-based prediction revealed the mutation features as the most significant contributor (Fig.Β 5d). Arm- and whole-chromosome SCNAs generally stood out as the second important parameter in predicting immune evasion (Fig.Β 5d).
Discussion
In this work, we first developed a prediction model for neoantigen identification. This model was trained on two-dimensional interaction patterns for peptide-MHC class I binding in contrast to one-dimensional sequential modelling of current prediction tools. We then examined potential determinants of resistance to immunotherapy. While mutational patterns stood out, tumour heterogeneity also showed considerable explanatory power when our method was used for neoantigen prediction. A recent study presented the relevance of neoantigen heterogeneity to resistance of tumours with high mutation load9. Thus, tumour heterogeneity may confer resistance especially when neoantigen load is high.
Our methods focused on resistance of high-neoantigen tumours. However, there are samples that show sensitivity to checkpoint blockade despite a low estimate of neoantigen load. Hypersensitivity to the immune response can be a harmful trait for tumour growth. In contrast to mutations that promote immune evasion, such somatic variations that increase immune reaction will be eliminated by negative selection during tumour evolution. Therefore, this trait can be explained more likely by the genetic makeup of the donor rather than by somatic mutations. For example, a genetic polymorphism in the CTLA-4 gene was reported to alter the CTLA-4-driven regulation of T cell activation in the context of autoimmune disease34. A possible approach thus is to use the genotypes of the hypersensitive samples as predictive features for the classification of clinical response to immunotherapy. In this manner, the cases that contradict the neoantigen roulette3 by showing resistance despite high neoantigen load or hypersensitivity despite low neoantigen levels could be predicted only based on their somatic or germline genetic profile. It needs to be emphasised that the success of this approach will hinge on the accurate estimation of neoantigen burden.
The recently proposed concept of βcancer immunogramβ16 included seven parameters that could characterise aspects of cancer-immune interactions to determine treatment options in a more refined and personalised manner. Among those seven parameters, our study conferred immediate clinical utility to βtumour foreignnessβ and brought βtumour sensitivity to immune effectorsβ up to the systems level where potentially all available genes could be examined. There have been attempts to discover individual factors such as a particular gene12 or pathway13. However, this is the first attempt to profile mutations at the whole-exome level. In doing so, we discovered that the EGFR pathway could be a primary target of somatic disruption for immune evasion. Our results indicate that a large fraction of variation in the clinical benefit of immunotherapy can be explained by contrasting effects of antigenic versus functional mutations on tumour immunogenicity. This approach offers practical advantage as well because calling single nucleotide variants is straightforward and less technically challenging than determining other parameters such as copy number alterations and tumour heterogeneity.
Methods
IEDB peptide-MHC binding data sets
All training data for the prediction of peptide-MHC class I binding were obtained from IEDB 3.0. (http://www.iedb.org/)17. This database provided 57,173 data points consisting of binding affinity in terms of IC50/EC50 nM for 14,234 true (binding) and 42,879 false (non-binding) experiments. We used the affinity threshold that is commonly used to determine peptide-MHC binding to classify binding (IC50/EC50β<β500βnM) and no binding (IC50/EC50ββ₯β500βnM). The major subset of the data, composed of two classes (HLA-A and HLA-B) of MHC class I against 9-mer and 10-mer peptides, was used for the development of our prediction method. To evaluate prediction performance, we employed weekly updated test datasets used for an automated benchmarking of selected peptide-MHC class I binding prediction tools4. This enabled us to compare the performance of our prediction model with the reported performance of the tools used for the benchmarking. We used the weekly updated datasets that consisted of >10 binding affinity experiments.
Peptide-MHC class I binding prediction
We used CNN architecture to perform prediction of peptide-MHC class I binding. The schematic diagram of our model is shown in Fig.Β 1a. First, two-dimensional interaction maps for amino acid sequence pairs of peptides and MHC class I molecules were constructed as input matrices. We employed amino-acid interaction preferences computed based on contacts between CΞ± atoms or between any atoms in native protein structures18. For the calculation of the interaction map, the dataset of 1,654 proteins from the PISCES server35 was curated, and structural information from the PDB36 was obtained. The connectivity matrix based on the CΞ±-CΞ± distance was generated for each protein based on the distance cutoff of 6.5βΓ between CΞ±-CΞ± atoms of amino acids with the exclusion of nearest neighbours along the sequence. Atom-atom contacts between two amino acid residues were also used such that residues i and j were considered to be in contact if any atom of the residue i is within a distance of 4.5βΓ with any atom of the residue j. In this case, nearest neighbours (iβΒ±β2) along the sequence are not considered. We also prepared interaction maps consisting of randomly permuted or null values. The interaction parameters based on the CΞ±-CΞ± contacts showed highest validation performance (Supplementary Fig.Β 2) and thus were used as input. Our CNN model consists of sequential alternating convolution layers that extract interaction features at different spatial scales, a fully connected layer that integrates information from a full-length sequence, and a sigmoid output layer that computes the probability for binding of the given peptide and MHC protein. Each layer of the CNN model executes a linear transformation of the output from the previous layer by multiplying a weight matrix, the output of which is subjected to a nonlinear transformation by ReLU activation as described below. The weight matrices are learned during training in the process of minimising predictive errors.
A variety of kernels were tested to achieve as high performance as possible. The two convolution layers of our model performed 2D convolution operation after optimisation of settings with 50 kernels for the first layer, 10 kernels for the second layer, 1000 batches, the stride size of 1, and the kernel size of 5βΓβ183 for each layer. All convolution outputs were transformed by a rectified linear activation function (ReLU) lifting negative values to 0. The first convolution layer was designed to detect binding patterns with a moving window with step size 1 on the interaction parameters of amino acid pairs. In the higher-level convolution layer, each convolution kernel served as a binding pattern detector over the output of the previous layer. More formally, the convolution layer computed
where X is the input, i is the index of the output position, and k is the index of kernels. Each convolution kernel Wk is an MβΓβN weight matrix with M being the window size and N being the number of input channels. A pooling layer was not used in our CNN model because all values of the output of the convolution layer were informative in our prediction. In this regard, a recent genetics analysis reported that amino acids distal to contact interfaces can exert significant effects on the interactions between MHC-peptide complexes and TCRs6, implying that to detect precise binding patterns, all values in the output of the convolution layer should be taken into account in the pooling process.
To the second convolution layer we attached a fully connected layer in which all neurons receive inputs from all outputs of the previous layer for integration of information. This fully connected layer performed ReLU(WX), where X is the input and W is the weight matrix for the fully connected layer. The last layer, the sigmoid output layer, performed classification between binding and non-binding with the prediction scaling in the range of 0 ~ 1 based on the sigmoid function. In other words, the sigmoid output layer performed Sigmoid(WX), where X is the input and W is the weight matrix for the sigmoid output layer.
We trained our model in the direction toward the minimisation of the objective function, which was defined as the sum of negative log likelihood (NLL) and regularisation terms intended for overfitting control. Specifically,
and s indicates the index of training samples and t indicates the index of interaction features. \(Y_t^s\) is a 0 or 1 label for sample s and interaction feature t. \(f_t(X^s)\) represents the predicted probability output of the model for interaction feature t given input Xs. We used a combination of multiple regularisation techniques typically used for training deep neural networks. L2 regularisation term \(|\left| W \right||_2^2\) was defined to be the sum of the squares of all weight matrix entries. \(|\left| {H^{ - 1}} \right||_1\) was defined to be the L1 norm of all output values of the last layer (fully connected layer) preceding the output layer. Additionally, the optimisation was subjected to regularisation constraints that for any layer m and neuron n, \(|\left| {W_m^n} \right||_2 \le {\uplambda}_3\) or that the L2 norm of weights for any neuron must not be larger than a specified value. Hyperparameters we used in the model included the learning rate [0.001, 0.01, 0.1], number of kernels for the first and second layer [10, 30, 50], L1 and L2 regularisation parameter [0.0001, 0.001, 0.01], and momentum [0,1, 0.5, 0.9]. Especially, various filter sizes (1~5βbp for peptides and 1/2, 2/3, and full length of HLAs) were used to extract interaction features in the convolution layers.
Derivatives of the objective function with respect to the model parameters were computed by the standard backpropagation algorithm. We optimised the objective function by using stochastic gradient descent with momentum. We did not use dropout training because it could cause a decrease in training performance. Our model was implemented using the Theano library (https://github.com/Theano/Theano/) on Tesla K40x GPU.
We ran HLAminer37 for the samples of SMC cohort. We used HLA allele information provided by the authors for the published cohort samples. Amino acid sequences flanking nonsynonymous mutations were retrieved from RefSeq database38 by using the idfetch programme. The HLA sequences and mutant peptide sequences were subjected to our CNN prediction model.
SMC cohort for checkpoint blockade in lung cancer
A total of 122 advanced non-small cell lung carcinoma (NSCLC) patients who were treated with anti-PD-1/PD-L1 from 2014 to 2017 at Samsung Medical Center were enroled for this study. The clinical response was evaluated by the Response Evaluation Criteria in Solid Tumours (RECIST) version 1.1 with a minimum 6-month follow-up. The response to immunotherapy was classified into durable clinical benefit (DCB, responder) or non-durable benefit (NDB, non-responder)24. Partial response (PR) or stable disease (SD) or that lasted more than 6 months was considered as DCB/responder. Progressive disease (PD) or SD that lasted less than 6 months was considered as NDB/non-responder. Progression-free survival (PFS) was calculated from the start date of therapy to the date of progression or death, whichever is earlier. Patients were censored at the date of the last follow-up for PFS if they were not progressed and alive. This study was approved by the institutional review board of Samsung Medical Center (2018-03-130 and 2013-10-112). Informed written consent was obtained from all patients enroled in the study.
Tumour samples were obtained before anti-PD1/PD-L1 treatment, and then were embedded in paraffin after formalin fixation or kept fresh. DNA was prepared using AllPrep DNA/RNA Mini Kit (Qiagen, 80204), AllPrep DNA/RNA Micro Kit (Qiagen, 80284), or QIAamp DNA FFPE Tissue Kit (Qiagen, 56404) for library preparation for whole exome sequencing. Library preparation was performed by using SureSelectXT Human All Exon V5 (Agilent, 5190β6209) according to the instructions39. Briefly, 200β300βng of tumour and normal genomic DNA was sheared, and 150β200βbp of the sheared DNA fragments were further processed for end-repairing, phosphorylation, and ligation to adaptors. Ligated DNA was hybridised using whole-exome baits from SureSelectXT Human All Exon V5. The libraries were quantified by Qubit and 2200 Tapestation, and sequenced on an Illumina HiSeq 2500 platform with 2βΓβ100βbp paired ends. Target coverage for normal samples was 50 x and tumour sample was 100Γ. The sequencing reads were aligned to the hg19 reference genome by using Burrows-Wheeler Aligner (BWA) (http://bio-bwa.sourceforge.net)40 version 0.7.5a. Genome Analysis Toolkit (GATK) (https://software.broadinstitute.org/gatk)41 version 3.5 was applied for base quality score recalibration, indel realignment, and duplicate removal. The BAM files produced after these processes were subjected to MuTect (https://software.broadinstitute.org/cancer/cga/mutect)42 version 1.1.4 for the calling of single nucleotide variants (SNVs), and small insertions and deletions (indels). The dbSNP and COSMIC databases were used as references. Data processing and analysis were done with default parameters. The calling results for SNVs and indels are provided in Supplementary DataΒ 4. To estimate tumour purity, Clonal Heterogeneity Analysis Tool (CHAT) was used43. CNVkit44, a copy number detection tool specific for whole-exome and short-read sequencing platforms, was used to detect copy number alterations between matched normal and tumour samples. The log2 coverage depth was defined as copy number variation (CNV) value. This indicates the ratio of the mean coverage depths, which is excluding extreme outliers and is observed at the corresponding bin in each sample. The CNV results for our cohort samples are provided in Supplementary DataΒ 5. The results of neoantigen calling for the SNVs and indels from our SMC cohort samples are provided in Supplementary DataΒ 6.
Other cohorts for checkpoint blockade in lung cancer and melanoma
Somatic mutation calls for samples in three independent melanoma cohorts with anti-CTLA-4, namely, Van allen et al.βs dataset21, Snyder et al.βs dataset22, and Roh et al.βs dataset10, one melanoma cohort with anti-PD-1, namely, Riaz et al.βs dataset23, two NSCLC cohorts with anti-PD-1, namely Rizvi et al.βs dataset24, Hellmann et al.βs dataset25 were collected. For each cohort, we used the other cohorts of the same tumour type as training data. For example, we trained random forests with Hellman and Rizvi cohorts and tested performance on SMC cohort. The AUCs were 0.81 ~ 0.97 for the training data and 0.76 ~ 0.95 for the test data. We used HLA allele information provided for each sample. Amino acid sequences flanking nonsynonymous mutations were retrieved from RefSeq database38 by using the idfetch programme. The HLA sequences and mutant peptide sequences were subjected to our CNN prediction model. Resistant samples were defined as having >70 predicted neoantigens9 while no clinical benefit was reported from each respective study. All remaining samples were defined as non-resistant samples and trained along with the resistant samples.
TCGA data processing
We downloaded TCGA RNA-Seq data from dbGaP (https://www.ncbi.nlm.nih.gov/gap). HLA typing was performed by applying Seq2HLA tool45 to the RNA-Seq data. Somatic mutation calls were obtained from the UCSC Xena Browser (http://xena.ucsc.edu). Amino acid sequences flanking nonsynonymous mutations were retrieved from RefSeq database38 by using the idfetch programme. The HLA sequences and mutant peptide sequences were subjected to our CNN prediction model. We also profiled the TCR repertoire and calculated the immune score for each TCGA sample. The TCR repertoire was obtained by applying the TRUST tool26 to the RNA-Seq data. The immune score was calculated as the geometric mean of the expression level of cytolytic markers (GZMA, GZMB, PRF1, and GNLY), HLA molecules (HLA-A, HLA-B, HLA-C, HLA-E, HLA-F, HLA-G, HLA-H, HLA-DMA, HLA-DMB, HLA-DOA, HLA-DOB, HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQA2, HLA-DQB1, HLA-DRA, and HLA-DRB1), IFN-Ξ³ pathway genes (IFNG, IFNGR1, IFNGR2, IRF1, STAT1, and PSMB9), chemokines (CCR5, CCL3, CCL4, CCL5, CXCL9, CXCL10, and CXCL11), and adhesion molecules (ICAM1, ICAM2, ICAM3, ICAM4, ICAM5, and VCAM1)10. We used log2-transformed the normalised RNA-seq read counts. We also selected the molecular markers that were best correlated with neoantigen load in each tumour type: CCL5 and IFNG for BLCA; CD247, ICAM2, and IFNGR2 for ESCA; GZMB, GZMH, and PRF1 for HNSC; CCL4, CCR5, CXCL11, CXCL9, and GZMH for LUSC. For samples with potential immune evasion, we detected tumours with high (higher then 70th percentile) neoantigen load and low (lower than 30th percentile) immune signature defined by the average of the selected immune markers.
Patient survival analysis
We utilised the number of neoantigens or mutations as a predictor to perform patient survival analysis. For the clinical trial data, the cases in which the number of neoantigens or mutations was equal to or greater than 70 were classified to the high neoantigen or mutation group as suggested in a previous study9. The melanoma samples of the two clinical trials of anti-CTLA-421,22 and the lung cancer samples of three clinical trials anti-PD-1 or anti-CTLA-424,25 were subjected to survival analysis. The patients who died for reasons other than tumour were excluded from the analysis. The skin cutaneous melanoma samples from TCGA were divided into two groups based on whether the number of neoantigens or mutations was higher or lower than the median level. P values from the Wald test were used to determine the significance of differences between two groups.
Mutational analysis of therapeutic resistance
For each of all cohorts used in this study (TableΒ 1), resistant samples were defined as havingβ>β70 predicted neoantigens9 while no clinical benefit was reported from each respective study. All remaining samples were used as control. Functional mutations were called in each sample. Mutation functionality was defined based on SIFT (http://sift.bii.a-star.edu.sg)46 and PROVEAN (http://provean.jcvi.org)47. The mutations that were called simultaneously as damaging by SIFT and as deleterious by PROVEAN were defined as functional mutations. We then constructed a matrix for mutation status of genes for which the mutation frequency was >5% in the given population of samples. A random forest predictor consisting of 1000 decision trees was trained for tumours with therapeutic resistance. We implemented random forests by running the R package randomForest48 with ten-repeat 5-fold cross validation. We used the status of synonymous mutations on the same set of genes as a negative control training model. For each training, the number of features (mutated genes) was set to identical between the original and negative control model. The threshold of the mutation frequency was adjusted so that the same number of features were used.
The βvariable importanceβ of each feature (mutated gene) was evaluated on the basis of the mean decrease in accuracy as implemented in the randomForest R package. Specifically, the importance of the kth feature was measured as the degree of decrease in prediction accuracy upon random permutation of all values in the kth feature of the training dataset. We then performed a pathway analysis for the genes whose variable importance was greater than 3 and their interaction partners in the protein interactome. We used an integrative interactome map encompassing an integrated physical interaction network, referred to as Interactome, created by merging yeast two-hybrid-based proteome-scale interacting pairs27, integrated literature-based protein-protein interactions27, binary interactions identified from Stitch-seq interactome mapping29, and interactions from the high-quality protein interactome from the HINT database28. Enrichr49 was used to analyse the extended gene on the basis of pathways and functional terms retrieved from Reactome50, Panther51, and Gene Ontology52. Panther51 was performed with its original pathway analysis algorithm. Additional gene ontology terms that were not found in Enrichr49 were retrieved by using DAVID53 GOTERM_BP_DIRECT.
Regression analysis for resistance or immune evasion
We used SMC cohort and Roh cohort for the quantitative analysis of the explanatory power of our exomic prediction in comparison with previously known resistance parameters. The resistance parameters reported in the literature were obtained from the data of SMC cohort. Roh cohort was used because copy number alteration (gain or loss) and tumour heterogeneity were available. For the TCGA samples set apart as testing data, we employed the SCNA level calculated in a previous work11. We performed regression analysis for resistance or immune evasion on these resistance parameters together with our mutation based-prediction scores. The raw values of the resistance parameters were scaled from 0 to 1 as
where ei indicates the ith value of variable E. Emin and Emax indicate the minimum and maximum value for variable E, respectively. Our mutation based-prediction score was given as the vote ratio ranging from 0 to 1 that was provided by the random forest predictor. Therefore, this score was used without scaling. We then performed multiple linear regression by the simple, lasso, elastic net, and ridge regression method with 5-fold cross validation implemented in the glmnet R package54. The ROCR55 R package was used to calculate the AUC to assess the accuracy of each regression model. The variable importance of each feature in the regression model was evaluated on the basis of generalised cross-validation (GCV) implemented by the glmnet R package.
Reporting summary
Further information on research design is available in theΒ Nature Research Reporting Summary linked to this article.
Code availability
Codes for implementing the CNN model to predict peptide-MHC class I binding were made available at the authorsβ webpage (http://omics.kaist.ac.kr/resources).
References
Sharma, P. & Allison, J. P. The future of immune checkpoint therapy. Science 348, 56β61 (2014).
Schumacher, T. N. & Schreiber, R. D. Neoantigens in cancer immunotherapy. Science 348, 69β74 (2015).
Gubin, M. M. & Schreiber, R. D. The odds of immunotherapy success. Science 350, 158β159 (2015).
Trolle, T. et al. Automated benchmarking of peptide-MHC class i binding predictions. Bioinformatics 31, 2174β2181 (2015).
The problem with neoantigen prediction. Nat. Biotechnol. 35, 97β97 (2017). https://www.ncbi.nlm.nih.gov/pubmed/28178261.
Sharon, E. et al. Genetic variation in MHC proteins is associated with T cell receptor expression biases. Nat. Genet. 48, 995β1002 (2016).
Alipanahi, B., Delong, A., Weirauch, M. T. & Frey, B. J. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 33, 831β838 (2015).
Zhou, J. & Troyanskaya, O. G. Predicting effects of noncoding variants with deep learningβbased sequence model. Nat. Methods 12, 931β934 (2015).
McGranahan, N. et al. Clonal neoantigens elicit T cell immunoreactivity and sensitivity to immune checkpoint blockade. Science 351, 1463β1469 (2016).
Roh, W. et al. Integrated molecular analysis of tumor biopsies on sequential CTLA-4 and PD-1 blockade reveals markers of response and resistance. Sci. Transl. Med. 9, eaah3560 (2017).
Davoli, T., Uno, H., Wooten, E. C. & Elledge, S. J. Tumor aneuploidy correlates with markers of immune evasion and with reduced response to immunotherapy. Science 355, 8399 (2017).
Riaz, N. et al. Recurrent SERPINB3 and SERPINB4 mutations in patients who respond to anti-CTLA4 immunotherapy. Nat. Genet. 48, 1327β1330 (2016).
Gao, J. et al. Loss of IFN-Ξ³ pathway genes in tumor cells as a mechanism of resistance to anti-CTLA-4 therapy. Cell 167, 397β404 (2016).
Tumeh, P. C. et al. PD-1 blockade induces responses by inhibiting adaptive immune resistance. Nature 515, 568β571 (2014).
Herbst, R. S. et al. Predictive correlates of response to the anti-PD-L1 antibody MPDL3280A in cancer patients. Nature 515, 563β567 (2014).
Angeles, L., Khersonsky, O. & Fleishman, S. J. The βcancer immunogramβ. Science 352, 658β660 (2016).
Vita, R. et al. The immune epitope database (IEDB) 3.0. Nucleic Acids Res. 43, D405βD412 (2015).
Jha, A. N., Vishveshwara, S. & Banavar, J. R. Amino acid interaction preferences in proteins. Protein Sci. 19, 603β616 (2010).
Nielsen, M. et al. NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence. PLoS ONE 2, e796 (2007).
Jurtz, V. et al. NetMHCpan-4.0: improved peptideβMHC class I interaction predictions integrating eluted ligand and peptide binding affinity data. J. Immunol. 199, 3360β3368 (2017).
Van Allen, E. M. et al. Genomic correlates of response to CTLA-4 blockade in metastatic melanoma. Science 350, 207β211 (2015).
Snyder, A. et al. Genetic basis for clinical response to CTLA-4 blockade in melanoma. N. Engl. J. Med. 371, 2189β2199 (2014).
Riaz, N. et al. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell 171, 934β949 (2017).
Rizvi, N. A. et al. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science 348, 124β128 (2015).
Hellmann, M. D. et al. Genomic features of response to combination immunotherapy in patients with advanced non-small-cell lung cancer. Cancer Cell 33, 843β852 (2018).
Li, B. et al. Landscape of tumor-infiltrating T cell repertoire of human cancers. Nat. Genet. 48, 725β732 (2016).
Rolland, T. et al. A proteome-scale map of the human interactome network. Cell 159, 1212β1226 (2014).
Das, J. & Yu, H. HINT: High-quality protein interactomes and their applications in understanding human disease. BMC Syst. Biol. 6, 92 (2012).
Yu, H. et al. Next-generation sequencing to generate interactome datasets. Nat. Methods 8, 478β480 (2011).
Kato, S. et al. Hyperprogressors after immunotherapy: analysis of genomic alterations associated with accelerated growth rate. Clin. Cancer Res. 23, 4242β4250 (2017).
Akbay, E. A. et al. Activation of the PD-1 pathway contributes to immune escape in EGFR-driven lung tumors. Cancer Discov. 3, 1355β1363 (2013).
Weghorn, D. & Sunyaev, S. Bayesian inference of negative and positive selection in human cancers. Nat. Genet. 49, 1785β1788 (2017).
Martincorena, I. et al. Universal patterns of selection in cancer and somatic tissues. Cell 171, 1029β1041 (2017). e21.
MΓ€urer, M. et al. A polymorphism in the human cytotoxic T-lymphocyte antigen 4 (CTLA4) gene (exon 1 +49) alters T-cell activation. Immunogenetics 54, 1β8 (2002).
Wang, G. & Dunbrack, R. L. PISCES: A protein sequence culling server. Bioinformatics 19, 1589β1591 (2003).
Berman, H. M. et al. The protein data bank. Acta Crystallogr. Sect. D Biol. Crystallogr. 58, 899β907 (2002).
Warren, R. L. et al. Derivation of HLA types from shotgun sequence datasets. Genome Med. 4, 95 (2012).
OβLeary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733βD745 (2016).
Chen, R., Im, H. & Snyder, M. Whole-exome enrichment with the agilent sureselect human all exon platform. Cold Spring Harb. Protoc. 2015, 626β633 (2015).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754β1760 (2009).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297β1303 (2010).
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213β219 (2013).
Li, B. & Li, J. Z. A general framework for analyzing tumor subclonality using SNP array and DNA sequencing data. Genome Biol. 15, 473 (2014).
Talevich, E., Shain, A. H., Botton, T. & Bastian, B. C. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLOS Comput. Biol. 12, e1004873 (2016).
Boegel, S. et al. HLA typing from RNA-Seq sequence reads. Genome Med. 4, 102 (2012).
Ng, P. C. & Henikoff, S. Predicting deleterious amino acid substitutions predicting deleterious amino acid substitutions. Genome Res. 11, 863β874 (2001).
Choi, Y. & Chan, A. P. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics 31, 2745β2747 (2015).
Liaw, A. & Wiener, M. Classification and regression by randomForest. R. News 2, 18β22 (2002).
Chen, E. Y. et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinforma. 14, 128 (2013).
Fabregat, A. et al. The reactome pathway knowledgebase. Nucleic Acids Res. 44, D481βD487 (2016).
Thomas, P. D., Campbell, M. J., Kejariwal, A., Mi, H. & Karlak, B. PANTHER: a library of protein families and subfamilies indexed by function. Genome Res. 13, 2129β2141 (2003).
Consortium, T. G. O. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25β29 (2000).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Bioinformatics enrichment tools: Paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1β13 (2009).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1β22 (2010).
Sing, T., Sander, O., Beerenwinkel, N. & Lengauer, T. ROCR: visualizing classifier performance in R. Bioinformatics 21, 3940β3941 (2005).
Acknowledgements
This work was supported by the Post-Genome Technology Development Program [10067758] funded by the Ministry of Trade, Industry and Energy, by the Bio & Medical Technology Development Program (NRF-2019M3A9B6064688) funded by the Ministry of Science and ICT, and by the Basic Science Research Program [2018R1D1A1B07047485] funded by the Ministry of Education .
Author information
Authors and Affiliations
Contributions
K.K. performed all data analyses and wrote the manuscript. H.S.K. generated cohort data and helped with manuscript writing. J.Y.K. and H.J. participated in data analyses. J.-M.S., J.S.A., M.-J.A., and K.P. helped with cohort data generation and analyses. S.-H.L. and J.K.C. conceived and supervised the study.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Zlatko Trajanoski and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisherβs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleβs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleβs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kim, K., Kim, H.S., Kim, J.Y. et al. Predicting clinical benefit of immunotherapy by antigenic or functional mutations affecting tumour immunogenicity. Nat Commun 11, 951 (2020). https://doi.org/10.1038/s41467-020-14562-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-020-14562-z
This article is cited by
-
Advances in pediatric gliomas: from molecular characterization to personalized treatments
European Journal of Pediatrics (2024)
-
Prognostic significance of the CRAFITY score in hepatocellular carcinoma treated with immunotherapy: a systematic review and meta-analysis
BMC Cancer (2023)
-
MHC II immunogenicity shapes the neoepitope landscape in human tumors
Nature Genetics (2023)
-
Integrated analysis of single-cell and bulk RNA sequencing identifies a signature based on macrophage marker genes involved in prostate cancer prognosis and treatment responsiveness
Functional & Integrative Genomics (2023)
-
Relationship between ATOH1 and tumor microenvironment in colon adenocarcinoma patients with different microsatellite instability status
Cancer Cell International (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.