Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Structure-based prediction of T cell receptor recognition of unseen epitopes using TCRen

A preprint version of the article is available at bioRxiv.

Abstract

T cell receptor (TCR) recognition of foreign peptides presented by major histocompatibility complex protein is a major event in triggering the adaptive immune response to pathogens or cancer. The prediction of TCR–peptide interactions has great importance for therapy of cancer as well as infectious and autoimmune diseases but remains a major challenge, particularly for novel (unseen) peptide epitopes. Here we present TCRen, a structure-based method for ranking candidate unseen epitopes for a given TCR. The first stage of the TCRen pipeline is modeling of the TCR–peptide–major histocompatibility complex structure. Then a TCR–peptide residue contact map is extracted from this structure and used to rank all candidate epitopes on the basis of an interaction score with the target TCR. Scoring is performed using an energy potential derived from the statistics of TCR–peptide contact preferences in existing crystal structures. We show that TCRen has high performance in discriminating cognate versus unrelated peptides and can facilitate the identification of cancer neoepitopes recognized by tumor-infiltrating lymphocytes.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Description of the TCRen method.
Fig. 2: The performance of TCRen in distinguishing cognate TCR epitopes from unrelated peptides.
Fig. 3: A comparison of TCRen with structure-based methods for the prediction of general protein interactions.
Fig. 4: The performance of TCRen when homology models are used as input.
Fig. 5: TCRen for the prediction of cancer neoepitopes recognized by TILs.

Similar content being viewed by others

Data availability

All crystal structures of TCR–peptide–MHC complexes from the PDB used to derive the TCRen statistical potential and the datasets from previously published studies used to validate the performance of TCRen are available via GitHub at https://github.com/antigenomics/tcren-ms. Data used for benchmarking was taken from previously published studies; references are given in Table 1. Source data are provided with this paper.

Code availability

All the code and data required to reproduce the analysis performed in the study, as well as a script and tutorial for running TCRen on new data, are available via GitHub at https://github.com/antigenomics/tcren-ms. Code for the TCRen pipeline is also available via Zenodo at https://doi.org/10.5281/zenodo.11129800 (ref. 41). All analysis was performed using R version 4.2.0, homology modeling was performed using TCRpMHCmodels version 1.0 and NetMHCIIpan-4.0 software was used to predict peptide binding to MHC class II.

References

  1. Qi, Q. et al. Diversity and clonal selection in the human T-cell repertoire. Proc. Natl Acad. Sci. USA 111, 13139–13144 (2014).

    Article  Google Scholar 

  2. Mora, T. & Walczak, A. M. How many different clonotypes do immune repertoires contain? Curr. Opin. Syst. Biol. 18, 104–110 (2019).

    Article  Google Scholar 

  3. Valkiers, S. et al. Recent advances in T-cell receptor repertoire analysis: bridging the gap with multimodal single-cell RNA sequencing. ImmunoInformatics 5, 100009 (2022).

    Article  Google Scholar 

  4. Rosati, E. et al. Overview of methodologies for T-cell receptor repertoire analysis. BMC Biotechnol. 17, 1–16 (2017).

    Article  Google Scholar 

  5. Joglekar, A. V. & Li, G. T cell antigen discovery. Nat. Methods 18, 873–880 (2020).

    Article  Google Scholar 

  6. Lin, X. et al. Rapid assessment of T-cell receptor specificity of the immune repertoire. Nat. Comput. Sci. 1, 362–373 (2021).

    Article  Google Scholar 

  7. Singh, N. K. et al. Emerging concepts in TCR specificity: rationalizing and (maybe) predicting outcomes. J. Immunol. 199, 2203–2213 (2017).

    Article  Google Scholar 

  8. Hudson, D., Fernandes, R. A., Basham, M., Ogg, G. & Koohy, H. Can we predict T cell specificity with digital biology and machine learning? Nat. Rev. Immunol. 23, 511–521 (2023).

    Article  Google Scholar 

  9. Gielis, S. et al. Detection of enriched T cell epitope specificity in full T cell receptor sequence repertoires. Front. Immunol. 10, 2820 (2019).

    Article  Google Scholar 

  10. Montemurro, A. et al. NetTCR-2.0 enables accurate prediction of TCR–peptide binding by using paired TCRα and β sequence data. Commun. Biol. 4, 1060 (2021).

    Article  Google Scholar 

  11. Mayer-Blackwell, K. et al. TCR meta-clonotypes for biomarker discovery with tcrdist3 enabled identification of public, HLA-restricted clusters of SARS-CoV-2 TCRs. eLife 10, e68605 (2021).

    Article  Google Scholar 

  12. Weber, A., Born, J. & Rodriguez Martínez, M. TITAN: T-cell receptor specificity prediction with bimodal attention networks. Bioinformatics 37, i237–i244 (2021).

    Article  Google Scholar 

  13. Springer, I., Tickotsky, N. & Louzoun, Y. Contribution of T cell receptor alpha and beta CDR3, MHC typing, V and J genes to peptide binding prediction. Front. Immunol. 12, 664514 (2021).

    Article  Google Scholar 

  14. Bagaev, D. V. et al. VDJdb in 2019: database extension, new analysis infrastructure and a T-cell receptor motif compendium. Nucleic Acids Res. 48, D1057–D1062 (2020).

    Article  Google Scholar 

  15. Tickotsky, N., Sagiv, T., Prilusky, J., Shifrut, E. & Friedman, N. McPAS-TCR: a manually curated catalogue of pathology-associated T cell receptor sequences. Bioinformatics 33, 2924–2929 (2017).

    Article  Google Scholar 

  16. Berman, H. M. et al. The protein data bank. Nucleic Acids Res. 28, 235–242 (2000).

    Article  Google Scholar 

  17. Jensen, K. K. et al. TCRpMHCmodels: structural modelling of TCR–pMHC class I complexes. Sci. Rep. 9, 14530 (2019).

    Article  Google Scholar 

  18. Meysman, P. et al. Benchmarking solutions to the T-cell receptor epitope prediction problem: IMMREP22 workshop report. ImmunoInformatics 9, 100024 (2023).

    Article  Google Scholar 

  19. Jiang, Y., Huo, M. & Cheng Li, S. TEINet: a deep learning framework for prediction of TCR-epitope binding specificity. Brief. Bioinform. 24, bbad086 (2023).

    Article  Google Scholar 

  20. Cai, M., Bang, S., Zhang, P. & Lee, H. ATM-TCR: TCR–epitope binding affinity prediction using a multi-head self-attention model. Front. Immunol. 13, 893247 (2022).

    Article  Google Scholar 

  21. Gao, Y. et al. Pan-Peptide Meta Learning for T-cell receptor–antigen binding recognition. Nat. Mach. Intell. 5, 236–249 (2023).

    Article  Google Scholar 

  22. Keskin, O., Bahar, I., Badretdinov, A. Y., Ptitsyn, O. B. & Jernigan, R. L. Empirical solvent-mediated potentials hold for both intra-molecular and inter-molecular inter-residue interactions. Protein Sci. 7, 2578–2586 (1998).

    Article  Google Scholar 

  23. Miyazawa, S. & Jernigan, R. L. Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 18, 534–552 (1985).

    Article  Google Scholar 

  24. Vita, R. et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47, D339–D343 (2019).

    Article  Google Scholar 

  25. Birnbaum, M. E. et al. Deconstructing the peptide-MHC specificity of T cell recognition. Cell 157, 1073–1087 (2014).

    Article  Google Scholar 

  26. Alford, R. F. et al. The rosetta all-atom energy function for macromolecular modeling and design. J. Chem. Theory Comput. 13, 3031–3048 (2017).

    Article  Google Scholar 

  27. Kumari, R. & Kumar, R. Open source drug discovery consortium, A. Lynn, g_mmpbsa–a GROMACS tool for high-throughput MM-PBSA calculations. J. Chem. Inf. Model. 54, 1951–1962 (2014).

    Article  Google Scholar 

  28. Gee, M. H. et al. Antigen identification for orphan T cell receptors expressed on tumor-infiltrating lymphocytes. Cell 172, 549–563.e16 (2018).

    Article  Google Scholar 

  29. Atchley, W. R., Zhao, J., Fernandes, A. D. & Drüke, T. Solving the protein sequence metric problem. Proc. Natl Acad. Sci. USA 102, 6395–6400 (2005).

    Article  Google Scholar 

  30. Kosmrlj, A., Jha, A. K., Huseby, E. S., Kardar, M. & Chakraborty, A. K. How the thymus designs antigen-specific and self-tolerant T cell receptor sequences. Proc. Natl Acad. Sci. USA 105, 16671–16676 (2008).

    Article  Google Scholar 

  31. Bobisse, S. et al. Sensitive and frequent identification of high avidity neo-epitope specific CD8+ T cells in immunotherapy-naive ovarian cancer. Nat. Commun. 9, 1092 (2018).

    Article  Google Scholar 

  32. Devlin, J. R. et al. Structural dissimilarity from self drives neoepitope escape from immune tolerance. Nat. Chem. Biol. 16, 1269–1276 (2020).

    Article  Google Scholar 

  33. Bigot, J. et al. Splicing patterns in SF3B1-mutated uveal melanoma generate shared immunogenic tumor-specific neoepitopes. Cancer Discov. 11, 1938–1951 (2021).

    Article  Google Scholar 

  34. Bradley, P. Structure-based prediction of T cell receptor:peptide-MHC interactions. eLife 12, e82813 (2023).

    Article  Google Scholar 

  35. Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).

    Article  Google Scholar 

  36. Yin, R. et al. TCRmodel2: high-resolution modeling of T cell receptor recognition using deep learning. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad356 (2023).

    Article  Google Scholar 

  37. Tubiana, J., Schneidman-Duhovny, D. & Wolfson, H. J. ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat. Methods 19, 730–739 (2022).

    Article  Google Scholar 

  38. Gainza, P. et al. Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning. Nat. Methods 17, 184–192 (2020).

    Article  Google Scholar 

  39. Reynisson, B., Alvarez, B., Paul, S., Peters, B. & Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 48, W449–W454 (2020).

    Article  Google Scholar 

  40. Riley, T. P. et al. T cell receptor cross-reactivity expanded by dramatic peptide-MHC adaptability. Nat. Chem. Biol. 14, 934–942 (2018).

    Article  Google Scholar 

  41. Karnaukhov, V. Structure-based prediction of T-cell receptor recognition of unseen epitopes using residue-level pairwise statistical potential TCRen. Zenodo https://doi.org/10.5281/zenodo.11129800 (2024).

Download references

Acknowledgements

MD simulations were carried out with the use of computational facilities of the Supercomputer Center ‘Polytechnical’ at the St. Petersburg Polytechnic University. We thank S. Bobisse and A. Harari for providing data for 302TIL candidate neoepitopes. The study was supported by a grant from the Ministry of Science and Higher Education of Russian Federation (075-15-2019-1789). MD simulations were supported by the HSE University Basic Research Program.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization was carried out by V.K.K. and M.S. Methodology was planned by V.K.K. and M.S. Validation was performed by V.K.K. Curation of the database of TCR–peptide–MHC structures from the PDB was carried out by D.S.S. Supervison was performed by M.S., I.V.Z. and D.M.C. Supervison of MD simulations was performed by A.O.C. and R.G.E. Writing of the original draft was done by V.K.K. Writing, review and editing was done by V.K.K., M.S., I.V.Z., D.M.C., A.O.C. and R.G.E.

Corresponding authors

Correspondence to Vadim K. Karnaukhov, Dmitriy M. Chudakov or Mikhail Shugay.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks Shuai Cheng and Andrew Fiore-Gartland for their contributions to the peer review of this work. Primary Handling Editor: Ananya Rastogi, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–16, Tables 1–2 and Notes 1–2.

Reporting Summary

Peer Review File

Supplementary Data 1.

The nonredundant set of crystal structures of TCR–peptide–MHC complexes from the PDB that was used to derive the TCRen potential.

Source data

Source Data Fig. 2.

Statistical source data.

Source Data Fig. 3

Statistical source data.

Source Data Fig. 4

Statistical source data.

Source Data Fig. 5

Statistical source data.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Karnaukhov, V.K., Shcherbinin, D.S., Chugunov, A.O. et al. Structure-based prediction of T cell receptor recognition of unseen epitopes using TCRen. Nat Comput Sci 4, 510–521 (2024). https://doi.org/10.1038/s43588-024-00653-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s43588-024-00653-0

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing