Letter

Quantifiable predictive features define epitope-specific T cell receptor repertoires

Nature volume 547, pages 89–93 (06 July 2017)


T cells are defined by a heterodimeric surface receptor, the T cell receptor (TCR), that mediates recognition of pathogen-associated epitopes through interactions with peptide and major histocompatibility complexes (pMHCs). TCRs are generated by genomic rearrangement of the germline TCR locus, a process termed V(D)J recombination, that has the potential to generate marked diversity of TCRs (estimated to range from 10^15 (ref. 1) to as high as 10^61 (ref. 2) possible receptors). Despite this potential diversity, TCRs from T cells that recognize the same pMHC epitope often share conserved sequence features, suggesting that it may be possible to predictively model epitope specificity. Here we report the in-depth characterization of ten epitope-specific TCR repertoires of CD8+ T cells from mice and humans, representing over 4,600 in-frame single-cell-derived TCRαβ sequence pairs from 110 subjects. We developed analytical tools to characterize these epitope-specific repertoires: a distance measure on the space of TCRs that permits clustering and visualization, a robust repertoire diversity metric that accommodates the low number of paired public receptors observed when compared to single-chain analyses, and a distance-based classifier that can assign previously unobserved TCRs to characterized repertoires with robust sensitivity and specificity. Our analyses demonstrate that each epitope-specific repertoire contains a clustered group of receptors that share core sequence similarities, together with a dispersed set of diverse ‘outlier’ sequences. By identifying shared motifs in core sequences, we were able to highlight key conserved residues driving essential elements of TCR recognition. These analyses provide insights into the generalizable, underlying features of epitope-specific repertoires and adaptive immune recognition.
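The idea of a distance-based classifier mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the distance measure or classifier developed in the paper: the scoring scheme (`toy_tcr_distance`), its penalty values, the gene labels, the CDR3 strings and the epitope names are all hypothetical placeholders chosen for illustration. The sketch only shows the general pattern of assigning an unseen receptor to the repertoire that contains its nearest neighbour.

```python
from itertools import zip_longest


def toy_tcr_distance(a, b, mismatch_penalty=1, gap_penalty=4, v_gene_penalty=3):
    """Toy distance between two receptor chains given as (v_gene, cdr3) tuples.

    Assumption: a simple position-by-position comparison with made-up
    penalties. It is NOT the distance measure described in the paper.
    """
    v_a, cdr3_a = a
    v_b, cdr3_b = b
    dist = 0 if v_a == v_b else v_gene_penalty
    for x, y in zip_longest(cdr3_a, cdr3_b):
        if x is None or y is None:
            dist += gap_penalty       # one sequence is longer than the other
        elif x != y:
            dist += mismatch_penalty  # residues differ at this position
    return dist


def classify(query, repertoires):
    """Return (label, distance) of the repertoire holding the nearest receptor."""
    best_label, best_dist = None, float("inf")
    for label, receptors in repertoires.items():
        nearest = min(toy_tcr_distance(query, r) for r in receptors)
        if nearest < best_dist:
            best_label, best_dist = label, nearest
    return best_label, best_dist


# Hypothetical example data (invented sequences and labels).
repertoires = {
    "epitope_A": [("TRBV1", "CASSGGAYEQYF"), ("TRBV1", "CASSGGSYEQYF")],
    "epitope_B": [("TRBV2", "CASSLDRGNTLYF"), ("TRBV2", "CASSLDRANTLYF")],
}
query = ("TRBV1", "CASSGGTYEQYF")
result = classify(query, repertoires)
```

With this toy data the query differs from an `epitope_A` receptor at a single position, so it is assigned to `epitope_A`. A realistic implementation would need a biologically motivated distance and a way to reject receptors that are far from every characterized repertoire.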



References

  1. T-cell antigen receptor genes and T-cell recognition. Nature 334, 395–402 (1988)

  2. Quantifying lymphocyte receptor diversity. bioRxiv 046870 (2016)

  3. Fast multiclonal clusterization of V(D)J recombinations from high-throughput sequencing. BMC Genomics 15, 409 (2014)

  4. IMGT/HighV-QUEST: the IMGT® web portal for immunoglobulin (IG) or antibody and T cell receptor (TR) analysis from NGS high throughput and deep sequencing. Immunomethods 882, 569–604 (2012)

  5. MiTCR: software for T-cell receptor sequencing data analysis. Nat. Methods 10, 813–814 (2013)

  6. RTCR: a pipeline for complete and accurate recovery of T cell repertoires from high throughput sequencing data. Bioinformatics 32, 3098–3106 (2016)

  7. Structural determinants of T-cell receptor bias in immunity. Nat. Rev. Immunol. 6, 883–894 (2006)

  8. Recombinatorial biases and convergent recombination determine interindividual TCRβ sharing in murine thymocytes. J. Immunol. 189, 2404–2413 (2012)

  9. Sharing of T cell receptors in antigen-specific responses is driven by convergent recombination. Proc. Natl Acad. Sci. USA 103, 18691–18696 (2006)

  10. Highly diverse TCRα chain repertoire of pre-immune CD8+ T cells reveals new insights in gene recombination. EMBO J. 31, 1666–1678 (2012)

  11. High-resolution analysis of the human T-cell receptor repertoire. Nat. Commun. 6, 8081 (2015)

  12. Chromatin conformation governs T-cell receptor Jβ gene segment usage. Proc. Natl Acad. Sci. USA 109, 15865–15870 (2012)

  13. High-throughput pairing of T cell receptor α and β sequences. Sci. Transl. Med. 7, 301ra131 (2015)

  14. Feature selection using a one dimensional naive Bayes’ classifier increases the accuracy of support vector machine classification of CDR3 repertoires. Bioinformatics 33, 951–955 (2017)

  15. Tracking global changes induced in the CD4 T-cell receptor repertoire by immunization with a complex antigen using short stretches of CDR3 protein sequence. Bioinformatics 30, 3181–3188 (2014)

  16. Structural basis for enabling T-cell receptor diversity within biased virus-specific CD8+ T-cell responses. Proc. Natl Acad. Sci. USA 108, 9536–9541 (2011)

  17. Genetic and structural basis for selection of a ubiquitous T cell receptor deployed in Epstein–Barr virus. PLoS Pathog. 6, e1001198 (2011)

  18. A structural basis for immunodominant human T cell receptor recognition. Nat. Immunol. 4, 657–663 (2003)

  19. The structural dynamics and energetics of an immunodominant T cell receptor are programmed by its Vβ domain. Immunity 28, 171–182 (2008)

  20. Epitope-specific TCRβ repertoire diversity imparts no functional advantage on the CD8+ T cell response to cognate viral peptides. Proc. Natl Acad. Sci. USA 105, 2034–2039 (2008)

  21. Evolution of the antigen-specific CD8+ TCR repertoire across the life span: evidence for clonal homogenization of the old TCR repertoire. J. Immunol. 186, 2056–2064 (2011)

  22. Methods for comparing the diversity of samples of the T cell receptor repertoire. J. Immunol. Methods 321, 182–195 (2007)

  23. Landscape of tumor-infiltrating T cell repertoire of human cancers. Nat. Genet. 48, 725–732 (2016)

  24. Isolation of T cell receptors specifically reactive with mutated tumor associated antigens from tumor infiltrating lymphocytes based on CD137 expression. Clin. Cancer Res. 23, 2491–2505 (2016)

  25. Tumor- and neoantigen-reactive T-cell receptors can be identified based on their frequency in fresh tumor. Cancer Immunol. Res. 4, 734–743 (2016)

  26. Cancer immunotherapy based on mutation-specific CD4+ T cells in a patient with epithelial cancer. Science 344, 641–645 (2014)

  27. Paired analysis of TCRα and TCRβ chains at the single-cell level in mice. J. Clin. Invest. 121, 288–295 (2011)

  28. T cell receptor αβ diversity inversely correlates with pathogen-specific antibody levels in human cytomegalovirus infection. Sci. Transl. Med. 4, 128ra42 (2012)

  29. Single-cell analysis of T-cell receptor αβ repertoire. Methods Mol. Biol. 1343, 181–197 (2015)

  30. Rapid cloning, expression, and functional characterization of paired αβ and γδ T-cell receptor chains from single-cell analysis. Mol. Ther. Methods Clin. Dev. 3, 15054 (2016)

  31. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)

  32. IMGT, the international ImMunoGeneTics information system. Nucleic Acids Res. 37, D1006–D1012 (2009)

  33. Mother and child T cell receptor repertoires: deep profiling study. Front. Immunol. 4, 463 (2013)

  34. On Information and Sufficiency. Ann. Math. Stat. 22, 79–86 (1951)

  35. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 37, 145–151 (1991)

  36. Information theoretic measures for clusterings comparison. In Proceedings of the 26th Annual International Conference on Machine Learning - ICML ’09 (2009). doi:10.1145/1553374.1553511

  37. Amino acid substitution matrices from protein blocks. Proc. Natl Acad. Sci. USA 89, 10915–10919 (1992)

  38. In Data Mining and Knowledge Discovery Handbook 321–352 (2005)

  39. Paired TCRαβ analysis of virus-specific CD8+ T cells exposes diversity in a previously defined ‘narrow’ repertoire. Immunol. Cell Biol. 93, 804–814 (2015)


We thank the staff of the St Jude Children’s Research Hospital Animal Resource Center for their support and excellent animal care. We thank G. Lennon for help with single-cell sorting. We thank the Hartwell Center at St Jude for sequencing support. We also thank M. Morris, L. McLaren, T. H. Oguin III, W. Awad, A. Zamora, D. Boyd, X. Guo, S. Valkenburg, E. Grant, N. Bird and N. Mifsud for their help in conducting experiments and preparing the manuscript. The work was supported by NIH grant AI107625 and ALSAC (to P.G.T.), FHCRC internal development funding to P.B., and an NHMRC Program Grant (1071916) to K.K. and N.L.L. N.L.L. is the recipient of a Sylvia and Charles Viertel Senior Medical Research Fellowship. E.B.C. is an NHMRC Peter Doherty Fellow and K.K. is an NHMRC SRF Level B Fellow. G.C.W. was the recipient of National Institute on Aging (NIA) K23 AG033113, NIA P30 AG021334, the John A. Hartford Foundation’s Center of Excellence in Geriatric Medicine Scholars Award, and the Johns Hopkins Biology of Healthy Aging Program.

Author information


  1. Department of Immunology, St Jude Children’s Research Hospital, Memphis, Tennessee 38105, USA

    • Pradyot Dash, Aisha Souquette, Jeremy Chase Crawford & Paul G. Thomas
  2. Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA

    • Andrew J. Fiore-Gartland & Tomer Hertz
  3. The Shraga Segal Department of Microbiology, Immunology and Genetics, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel

    • Tomer Hertz
  4. Division of Geriatric Medicine and Gerontology, Biology of Healthy Aging Program, Johns Hopkins University School of Medicine, Baltimore, Maryland 21224, USA

    • George C. Wang
  5. Department of Veterinary Physiology and Biochemistry, Lala Lajpat Rai University of Veterinary and Animal Sciences, Hisar, Haryana 125004, India

    • Shalini Sharma
  6. Department of Microbiology and Immunology, University of Melbourne, Peter Doherty Institute for Infection and Immunity, Parkville, Victoria 3010, Australia

    • E. Bridie Clemens, Thi H. O. Nguyen, Katherine Kedzierska & Nicole L. La Gruta
  7. Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, Victoria 3800, Australia

    • Nicole L. La Gruta
  8. Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA

    • Philip Bradley
  9. Institute for Protein Design, University of Washington, Seattle, Washington 98195, USA

    • Philip Bradley



P.D., A.G., T.H., P.B. and P.G.T. wrote the manuscript and designed figures. P.D., G.C.W., S.S. and P.G.T. designed experiments. P.D., G.C.W., A.S. and S.S. conducted experiments. P.D., G.C.W., S.S. and A.S. acquired data. P.D., G.C.W., S.S., A.S., J.C.C., B.C., T.H.O.N., K.K., N.L.L., P.B. and P.G.T. analysed data. P.D., A.G., T.H., P.B. and P.G.T. interpreted data. P.D., A.G., T.H., A.S., P.B., G.C.W., K.K., N.L.L., P.G.T. and J.C.C. edited the manuscript. All authors approved the final manuscript.

Competing interests

The authors declare no competing financial interests.

Corresponding authors

Correspondence to Philip Bradley or Paul G. Thomas.

Reviewer Information Nature thanks B. Chain and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

  1. Supplementary Information (PDF): This file contains Supplementary Text and References.
