VDJdb in the pandemic era: a compendium of T cell receptors specific for SARS-CoV-2

Goncharov, Mikhail; Bagaev, Dmitry; Shcherbinin, Dmitrii; Zvyagin, Ivan; Bolotin, Dmitry; Thomas, Paul G.; Minervina, Anastasia A.; Pogorelyy, Mikhail V.; Ladell, Kristin; McLaren, James E.; Price, David A.; Nguyen, Thi H. O.; Rowntree, Louise C.; Clemens, E. Bridie; Kedzierska, Katherine; Dolton, Garry; Rius, Cristina Rafael; Sewell, Andrew; Samir, Jerome; Luciani, Fabio; Zornikova, Ksenia V.; Khmelevskaya, Alexandra A.; Sheetikov, Saveliy A.; Efimov, Grigory A.; Chudakov, Dmitry; Shugay, Mikhail

doi:10.1038/s41592-022-01578-0

Correspondence
Published: 15 August 2022

Nature Methods volume 19, pages 1017–1019 (2022)

To the Editor — Here, we report the VDJdb database (https://vdjdb.cdr3.net) update prepared between 2019 and 2022, marked by the emergence of SARS-CoV-2, the causative agent of COVID-19.

In 2016, we started a community effort to gather and curate publicly available sequence data acquired from T cell receptors (TCRs) with defined antigen specificities, together with datasets communicated by our colleagues, by developing the VDJdb database. VDJdb has since been extended with a web interface that allows batch querying of adaptive immune receptor repertoire sequencing (AIRR-seq) datasets and the identification of TCR sequence motifs linked with specific epitopes¹.

In the current pandemic era, a large majority of recent T cell repertoire profiling and antigen-specificity studies have focused on TCR variants that target the SARS-CoV-2 coronavirus²,³,⁴. As a consequence, millions of TCR sequences have now been isolated from donors with COVID-19. To complement these efforts, in the latest release of VDJdb, we incorporated TCR specificity data from various studies of COVID-19. We collected data from an international network of laboratories focused on assaying antigen-specific T cell responses in COVID-19 (Fig. 1a). Data acquired from multiple laboratories across the world feature over 3,000 TCR α and β chain sequences recognizing dozens of SARS-CoV-2 epitopes. These analyses revealed a set of reproducible TCR motifs that could find utility in large-scale clinical and experimental studies focused on COVID-19. We showed consistency and reproducibility of TCR specificity data across laboratories. Inferred TCR motifs will facilitate the tracking of SARS-CoV-2-specific T cells and the discovery of immune signatures associated with protection against COVID-19. T cell antigen specificity is encoded by somatically rearranged TCRs. Current techniques allow the comprehensive profiling of TCR repertoires via high-throughput sequencing, which is compatible with various methods for elucidating the antigen specificity of T cell populations⁵.

**Fig. 1: Overview of COVID-19 data compendium stored in VDJdb.**

The first set of TCR repertoires with known specificity for SARS-CoV-2 epitopes was acquired from the Efimov laboratory⁴. This work prioritized the HLA-A*02-restricted YLQ and RLQ epitopes, producing 573 VDJdb records (unpaired TCR α and β chains), which were subsequently detected in other studies and served as a template for the first SARS-CoV-2-specific TCR–peptide–MHC crystal structures⁶. This submission was followed by a number of studies from different laboratories performed in 2021. One dataset reported multiple TCR sequences specific for SARS-CoV-2 epitopes restricted by HLA-A*24⁷, a prominent HLA class I allotype among indigenous Asian populations. A report from the Kedzierska laboratory complemented these data with the addition of TCR sequences specific for SARS-CoV-2 epitopes restricted by HLA-A*02, HLA-A*24 and HLA-B*07³. A large set of paired TCRαβ sequences specific for a range of SARS-CoV-2 epitopes was acquired from the Thomas laboratory⁸. Smaller datasets were also imported from other published works and private communications (all listed in the issues section of the VDJdb GitHub repository), including one notable study that reported TCR sequences specific for SARS-CoV-2 epitopes restricted by HLA class II allotypes⁹. In total, the current VDJdb release features 3,187 unique TCR specificity records spanning 46 distinct SARS-CoV-2 epitopes (Fig. 1b and Supplementary Table 1).
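Totals of this kind (unique specificity records and distinct epitopes) can be recomputed from a tabular export of the database. The following is a minimal sketch under stated assumptions: the column names (`gene`, `cdr3`, `antigen.epitope`, `antigen.species`, `mhc.a`) are assumed rather than taken from this article and should be checked against the actual export, and the CDR3 strings in the toy table are invented placeholders, not real database records.

```python
import csv
import io
from collections import Counter

# Toy stand-in for a tab-separated database export. The column names are
# assumptions and the CDR3 sequences are invented for illustration only.
TOY_TSV = (
    "gene\tcdr3\tantigen.epitope\tantigen.species\tmhc.a\n"
    "TRB\tCASSAAAAEQYF\tYLQPRTFLL\tSARS-CoV-2\tHLA-A*02\n"
    "TRA\tCAVAAAAKLTF\tYLQPRTFLL\tSARS-CoV-2\tHLA-A*02\n"
    "TRB\tCASSCCCCEQYF\tRLQSLQTYV\tSARS-CoV-2\tHLA-A*02\n"
    "TRB\tCASSAAAAEQYF\tYLQPRTFLL\tSARS-CoV-2\tHLA-A*02\n"  # duplicate row
    "TRB\tCASSDDDDEQYF\tGILGFVFTL\tInfluenzaA\tHLA-A*02\n"
)


def summarize(tsv_text, species="SARS-CoV-2"):
    """Count unique (chain, CDR3, epitope, MHC) records for one antigen species.

    Returns the number of unique records and a Counter of records per epitope.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    unique = set()
    for row in reader:
        if row["antigen.species"] == species:
            unique.add(
                (row["gene"], row["cdr3"], row["antigen.epitope"], row["mhc.a"])
            )
    per_epitope = Counter(record[2] for record in unique)
    return len(unique), per_epitope


total, per_epitope = summarize(TOY_TSV)
print(f"{total} unique records across {len(per_epitope)} epitopes")
for epitope, n in per_epitope.most_common():
    print(epitope, n)
```

How a "unique record" is defined here (chain, CDR3, epitope and MHC allele) is a choice made for this sketch; the database may deduplicate on a different key.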

An important test of consistency for any biological dataset is independent reproducibility, and TCR repertoire sequencing in particular is prone to methodological and operator-dependent biases. To explore potential biases in the SARS-CoV-2-related VDJdb dataset, we performed a comparative analysis of TCR α and β chain specificity records for the most widely studied epitope, YLQ-HLA-A*02. No preferential clustering of these specificity records was observed across laboratories (Fig. 1c, top), while the overall structure of the TCR similarity map was preserved, suggesting that different laboratories sampled uniformly from the same space of epitope-specific TCR sequences.

Moreover, the independently generated data validated a set of TCR complementarity-determining region 3 (CDR3) sequences, which clustered as clearly defined motifs across different laboratories (Fig. 1c). Of note, the most commonly obtained CDR3 sequences were used successfully in crystallographic studies to generate ternary structures⁶, providing new insights into the molecular mechanisms that underpin TCR recognition of the YLQ epitope in complex with HLA-A*02.
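One simple way to check whether similar CDR3 sequences recur across laboratories is to link sequences that differ by at most one substitution and then list the laboratories contributing to each connected cluster. The sketch below illustrates that idea only; the text does not specify how the similarity map in Fig. 1c was built, so this is a substituted, deliberately simple technique, and the CDR3 strings and laboratory labels are invented.

```python
from itertools import combinations


def hamming(a, b):
    """Mismatch count between equal-length strings; None if lengths differ."""
    if len(a) != len(b):
        return None
    return sum(x != y for x, y in zip(a, b))


def cluster_cdr3(records, max_mismatches=1):
    """Group (cdr3, lab) records into connected components of the graph whose
    edges join CDR3s that differ by at most `max_mismatches` substitutions."""
    cdr3s = sorted({cdr3 for cdr3, _ in records})
    parent = {s: s for s in cdr3s}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in combinations(cdr3s, 2):
        d = hamming(a, b)
        if d is not None and d <= max_mismatches:
            parent[find(a)] = find(b)

    clusters = {}
    for cdr3, lab in records:
        entry = clusters.setdefault(find(cdr3), {"cdr3": set(), "labs": set()})
        entry["cdr3"].add(cdr3)
        entry["labs"].add(lab)
    return sorted(clusters.values(), key=lambda c: sorted(c["cdr3"]))


# Invented example records: (CDR3 sequence, hypothetical laboratory label).
records = [
    ("CASSAAAAEQYF", "lab_A"),
    ("CASSAAATEQYF", "lab_B"),
    ("CASSAAATEQFF", "lab_C"),
    ("CASSWWWWWWEQYF", "lab_A"),
]

clusters = cluster_cdr3(records)
for c in clusters:
    print(sorted(c["cdr3"]), "from", sorted(c["labs"]))
```

In this toy input, a cluster drawing on several laboratory labels stands in for a motif reproduced independently, whereas a cluster confined to one label would warrant a closer look for laboratory-specific bias.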

Imprints of common infections can be detected in TCR repertoire sequencing datasets¹⁰, which in turn can be used to predict immune responses and stratify patients with COVID-19⁵. VDJdb has been used successfully in the past for similar purposes and currently serves as a benchmark standard for testing TCR-specificity prediction algorithms². In this work, we showed that the COVID-19 TCR-specificity compendium displays no evident inter-laboratory bias and can therefore be employed as a reference in TCR repertoire annotation. These precedents suggest that VDJdb can be used in the future to build classifiers trained to identify biologically relevant T cell responses in patients with COVID-19. Overall, we anticipate that the present release will enhance the versatility of VDJdb in the pandemic era, supporting the development of more effective vaccines and addressing future challenges associated with viral evolution and the emergence of new pathogens beyond SARS-CoV-2.
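As an illustration of reference-based repertoire annotation, the sketch below labels clonotypes in a repertoire by matching their CDR3 sequences against a small reference set, allowing a single substitution, and reports the fraction of the repertoire assigned to each epitope. The matching rule, the function names and all sequences are assumptions made for this example; it is not the annotation procedure implemented in VDJdb.

```python
def within_one_substitution(a, b):
    """True if two CDR3 strings have equal length and at most one mismatch."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) <= 1


def annotate(repertoire, reference):
    """Estimate the fraction of a repertoire matching each reference epitope.

    repertoire: list of (cdr3, count) clonotypes.
    reference:  dict mapping reference CDR3 -> epitope label.
    Returns a dict mapping epitope label -> fraction of the total count.
    """
    total = sum(count for _, count in repertoire)
    if total == 0:
        return {}
    matched = {}
    for cdr3, count in repertoire:
        # Collect each epitope once per clonotype, even if several
        # reference sequences for that epitope match.
        epitopes = {
            epitope
            for ref_cdr3, epitope in reference.items()
            if within_one_substitution(cdr3, ref_cdr3)
        }
        for epitope in epitopes:
            matched[epitope] = matched.get(epitope, 0) + count
    return {epitope: n / total for epitope, n in matched.items()}


# Invented reference entries and repertoire clonotypes.
reference = {"CASSAAAAEQYF": "YLQ", "CASSCCCCEQYF": "RLQ"}
repertoire = [
    ("CASSAAAAEQYF", 10),    # exact match to a YLQ reference entry
    ("CASSAAATEQYF", 5),     # one substitution away from the same entry
    ("CASSGGGGGGEQYF", 85),  # no match
]

fractions = annotate(repertoire, reference)
print(fractions)
```

Per-epitope fractions of this kind could serve as input features for a classifier, although a practical pipeline would typically also match V and J genes and correct for sequencing depth.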

Data availability

All code and data are available at https://github.com/antigenomics/vdjdb-db, https://github.com/antigenomics/vdjdb-motifs and https://github.com/antigenomics/vdjdb-web, released under open-source Apache 2.0 and CC BY-ND 4.0 licenses.

References

1. Dolton, G. et al. Front. Immunol. 9, 1378 (2018).
2. Nguyen, T. H. O. et al. Immunity 54, 1066–1082.e5 (2021).
3. Shomuradova, A. S. et al. Immunity 53, 1245–1257.e5 (2020).
4. Shoukat, M. S. et al. Cell Rep. Med. 2, 100192 (2021).
5. Bagaev, D. V. et al. Nucleic Acids Res. 48, D1057–D1062 (2020).
6. Chaurasia, P. et al. J. Biol. Chem. 297, 101065 (2021).
7. Rowntree, L. C. et al. Immunol. Cell Biol. https://doi.org/10.1111/imcb.12482 (2021).
8. Minervina, A. A. et al. Nat. Immunol. 23, 781–790 (2022).
9. Verhagen, J. et al. Clin. Exp. Immunol. 205, 363–378 (2021).
10. Pogorelyy, M. V. et al. Genome Med. 10, 68 (2018).

Acknowledgements

This work was supported by a grant from the Ministry of Science and Higher Education of the Russian Federation (075-15-2019-1789). Additional funds were provided by the National Health and Medical Research Council (NHMRC; Australia) via a Leadership Investigator Grant (no. 1173871 to K.K.), the Research Grants Council of the Hong Kong Special Administrative Region, China (no. T11-712/19-N to K.K.) and the Medical Research Future Fund (Australia; no. 2005544 to K.K.). T.H.O.N. was supported by an NHMRC Emerging Leadership Level 1 Investigator Grant (no. 1194036). E.B.C. was supported by an NHMRC Peter Doherty Fellowship (no. 1091516). D.A.P. was supported by a Wellcome Trust Senior Investigator Award (UK; 100326/Z/12/Z). G.A.E. was supported by a Russian Science Foundation grant (20-15-00395).

Author information

These authors contributed equally: Mikhail Goncharov, Dmitry Bagaev.

Authors and Affiliations

Institute of Bioorganic Chemistry of Russian Academy of Sciences, Moscow, Russia
Mikhail Goncharov, Dmitry Bagaev, Ivan Zvyagin, Dmitry Bolotin, Dmitry Chudakov & Mikhail Shugay
Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
Mikhail Goncharov & Dmitry Chudakov
Signal Processing Group, Eindhoven University of Technology, Eindhoven, the Netherlands
Dmitry Bagaev
Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Translational Medicine, Pirogov Russian National Research Medical University, Moscow, Russia
Dmitrii Shcherbinin, Ivan Zvyagin, Dmitry Bolotin, Dmitry Chudakov & Mikhail Shugay
Department of Immunology, St. Jude Children’s Research Hospital, Memphis, Tennessee, USA
Paul G. Thomas, Anastasia A. Minervina & Mikhail V. Pogorelyy
Division of Infection and Immunity, Cardiff University School of Medicine, Cardiff, UK
Kristin Ladell, James E. McLaren & David A. Price
Systems Immunity Research Institute, Cardiff University School of Medicine, Cardiff, UK
David A. Price & Andrew Sewell
Department of Microbiology and Immunology, University of Melbourne, Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
Thi H. O. Nguyen, Louise C. Rowntree, E. Bridie Clemens & Katherine Kedzierska
T-Cell Modulation Group, Division of Infection and Immunity, Cardiff University School of Medicine, Cardiff, UK
Garry Dolton, Cristina Rafael Rius & Andrew Sewell
Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
Jerome Samir & Fabio Luciani
National Research Center for Hematology, Moscow, Russia
Ksenia V. Zornikova, Alexandra A. Khmelevskaya, Saveliy A. Sheetikov & Grigory A. Efimov
Biological Faculty, Lomonosov Moscow State University, Moscow, Russia
Ksenia V. Zornikova & Saveliy A. Sheetikov

Contributions

M.G., M.S., D.S. and I.Z. proofread and incorporated sequencing data into the database and performed statistical analysis. D. Bagaev and D. Bolotin implemented, hosted and supported the web interface for the database. P.G.T., A.A.M., M.V.P., K.L., J.E.M., D.A.P., T.H.O.N., L.C.R., E.B.C., K.K., G.D., C.R.R., A.S., J.S., F.L., K.V.Z., A.A.K., S.A.S. and G.A.E. gathered, formatted and submitted sequencing data to the database. M.S., I.Z. and D.C. designed and curated the study. M.S., D.C., D.A.P., P.G.T., K.K., F.L., G.A.E. and A.S. wrote and edited the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Mikhail Shugay.

Ethics declarations

Competing interests

The authors declare no conflicts of interest.

Peer review

Peer review information

Nature Methods thanks Sam Darko, Baojun Zhang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Supplementary information

Supplementary Information

Supplementary Table 1, Supplementary Methods

About this article

Cite this article

Goncharov, M., Bagaev, D., Shcherbinin, D. et al. VDJdb in the pandemic era: a compendium of T cell receptors specific for SARS-CoV-2. Nat Methods 19, 1017–1019 (2022). https://doi.org/10.1038/s41592-022-01578-0

Published: 15 August 2022
Issue Date: September 2022
DOI: https://doi.org/10.1038/s41592-022-01578-0
