SARS-CoV-2 infection establishes a stable and age-independent CD8+ T cell response against a dominant nucleocapsid epitope using restricted T cell receptors

Choy, Cecily; Chen, Joseph; Li, Jiangyuan; Gallagher, D. Travis; Lu, Jian; Wu, Daichao; Zou, Ainslee; Hemani, Humza; Baptiste, Beverly A.; Wichmann, Emily; Yang, Qian; Ciffelo, Jeffrey; Yin, Rui; McKelvy, Julia; Melvin, Denise; Wallace, Tonya; Dunn, Christopher; Nguyen, Cuong; Chia, Chee W.; Fan, Jinshui; Ruffolo, Jeannie; Zukley, Linda; Shi, Guixin; Amano, Tomokazu; An, Yang; Meirelles, Osorio; Wu, Wells W.; Chou, Chao-Kai; Shen, Rong-Fong; Willis, Richard A.; Ko, Minoru S. H.; Liu, Yu-Tsueng; De, Supriyo; Pierce, Brian G.; Ferrucci, Luigi; Egan, Josephine; Mariuzza, Roy; Weng, Nan-Ping

doi:10.1038/s41467-023-42430-z

Article
Open access
Published: 23 October 2023

Nature Communications volume 14, Article number: 6725 (2023)

Abstract

The resolution of SARS-CoV-2 replication hinges on cell-mediated immunity, wherein CD8⁺ T cells play a vital role. Nonetheless, the characterization of the specificity and TCR composition of CD8⁺ T cells targeting non-spike proteins of SARS-CoV-2 before and after infection remains incomplete. Here, we analyzed CD8⁺ T cells recognizing six epitopes from the SARS-CoV-2 nucleocapsid (N) protein and found that SARS-CoV-2 infection slightly increased the frequencies of N-recognizing CD8⁺ T cells but significantly enhanced activation-induced proliferation compared to that of the uninfected donors. The frequencies of N-specific CD8⁺ T cells and their proliferative response to stimulation did not decrease over one year. We identified the N_222-230 peptide (LLLDRLNQL, referred to as LLL hereafter) as a dominant epitope that elicited the greatest proliferative response from both convalescent and uninfected donors. Single-cell sequencing of T cell receptors (TCR) from LLL-specific CD8⁺ T cells revealed highly restricted Vα gene usage (TRAV12-2) with limited CDR3α motifs, supported by structural characterization of the TCR–LLL–HLA-A2 complex. Lastly, transcriptome analysis of LLL-specific CD8⁺ T cells from donors who had expansion (expanders) or no expansion (non-expanders) after in vitro stimulation identified increased chromatin modification and innate immune functions of CD8⁺ T cells in non-expanders. These results suggest that SARS-CoV-2 infection induces LLL-specific CD8⁺ T cell responses with a restricted TCR repertoire.

Introduction

CD8⁺ T cells play a vital role in combatting SARS-CoV-2 and forming long-term memory responses to this coronavirus^1,2,3. Unlike the viral epitopes recognized by antibodies, which are sensitive to mutations causing viral escape by new variants, CD8⁺ T cells recognize epitopes from both mutable and highly conserved viral proteins, offering longer immune protection^4,5. Due to the complex nature of antigen recognition by T cell receptors (TCR), which involves the presentation of many epitopes by highly polymorphic human leukocyte antigen (HLA) molecules, TCR repertoires for defined SARS-CoV-2 epitopes have not been as fully characterized as antibody repertoires.

Activation of CD8⁺ T cells is observed in the blood of COVID-19 patients^6,7, and low CD8⁺ T cell counts are associated with severity of COVID-19 symptoms and poor outcomes^8,9,10. Analysis of the targets recognized by CD8⁺ T cells after in vitro stimulation with pooled peptides of SARS-CoV-2 proteins has shown recognition of both highly conserved structural proteins, such as nucleocapsid (N) and membrane proteins, as well as the highly mutable spike (S) protein of SARS-CoV-2^{11,12,13,14,15,16}. Furthermore, CD8⁺ T cells recognizing SARS-CoV-2 are not only found in COVID-19 patients and vaccinated donors but also in uninfected donors^7,12,17,18. Phenotypically, both naïve and memory subsets exist in SARS-CoV-2-recognizing CD8⁺ T cells of COVID-19 patients, vaccinated donors, and uninfected individuals. Acute SARS-CoV-2 infection generates memory T cells^19,20,21, but the exact functional changes in these memory T cells remain to be determined. The presence of differentiated CD8⁺ T cells recognizing epitopes from the N protein in donors without any known prior SARS-CoV-2 infection suggests that these CD8⁺ T cells are likely cross-reactive to other common coronaviruses.

Previous studies of CD8⁺ T cell responses to SARS-CoV-2 have focused mainly on epitopes derived from the S protein. For example, S_269-277 (YLQPRTFLL, referred to as YLQ) is a dominant yet variable spike epitope that elicits a polyfunctional CD8⁺ T cell response in COVID-19 recovered patients^13,15,22. Sequence analysis of the YLQ-specific TCR repertoire revealed public TCRs with highly biased usage of the TRAV12-1 and TRAV12-2 gene segments²³. Crystal structures of TCR–YLQ–HLA-A2 complexes provided insights into the selection of particular TRAV and TRBV genes and the effects of viral variants on TCR recognition^23,24,25,26. Less is known about the TCR repertoires elicited by nucleocapsid epitopes^14,27. N_222-230 (LLLDRLNQL, referred to as LLL) is presented by HLA-A2 and is broadly recognized by CD8⁺ T cells, as shown by peptide stimulation and tetramer staining¹¹. Of note, LLL is one of six SARS-CoV-2 T cell epitopes included in a recent peptide-based vaccine against COVID-19 (CoVac-1)^28,29. This vaccine induced T cell responses in a Phase I/II clinical trial that were unaffected by current SARS-CoV-2 variants of concern. The LLL peptide is also a component of a T cell-directed mRNA vaccine (BNT162b4) that protected hamsters against severe disease³⁰.

Aging is associated with changes in CD8⁺ T cell homeostasis and functions^31,32, including a reduction in circulating naïve CD8⁺ T cells and an increase in differentiated memory CD8⁺ T cells due to thymic atrophy and lifelong stimulation by environmental and intrinsic insults^33,34,35,36. This leads to reduced TCR repertoire diversity in older adults^36,37 and decreased immune response to various infections and vaccines^38,39,40. Despite the high mortality rate of COVID-19^41,42, older adults tend to have fairly robust antibody responses to the mRNA-based COVID-19 vaccines^43,44. Analysis of CD8⁺ T cells in patients with acute COVID-19 has shown that reduced naïve T cells and reduced antigen-specific T cell responses are observed in older patients with severe COVID-19^42,45. The mechanism behind reduced T cell function with age has recently been analyzed using high-dimensional flow cytometry and multi-omics data³⁴, but it is still unclear what changes in CD8⁺ T cells determine their activation-induced proliferation and function.

In this study, we analyzed the frequency, differentiation status, and in vitro expansion of circulating CD8⁺ T cells recognizing six epitopes from the SARS-CoV-2 N protein in uninfected and convalescent COVID-19 donors. We found that the frequencies of CD8⁺ T cells recognizing the LLL epitope were significantly higher in recovered patients than in uninfected donors and remained stable over one year of follow-up. In vitro antigenic challenge showed that LLL-specific CD8⁺ T cells from convalescent donors had a significantly higher percentage of expanders and a more robust proliferative response than those from uninfected donors. Further scTCRseq analysis of LLL-specific CD8⁺ T cells showed highly restricted Vα gene usage (TRAV12-2) with limited CDR3α motifs, supported by the crystal structure of a TCR–LLL–HLA-A2 complex. Lastly, single-cell transcriptome analysis of LLL-specific CD8⁺ T cells from donors who had expansion or no expansion after in vitro stimulation identified increased chromatin modification and innate immune functions of CD8⁺ T cells from non-expanders in a TCR-independent manner, suggesting that these transcriptome changes may regulate activation-induced CD8⁺ T cell proliferation and expansion.

Results

Increased frequencies of SARS-CoV-2 nucleocapsid-specific CD8⁺ T cells post-infection

To understand CD8⁺ T cell immunity against the highly conserved SARS-CoV-2 nucleocapsid protein, we analyzed the frequencies of N-recognizing circulating CD8⁺ T cells from COVID-19 convalescent donors (mild clinical presentations of the disease and not hospitalized, n = 75, F = 56, M = 19, age range 18–89 years old) and uninfected controls (n = 138, F = 74, M = 64, age 17–92 years old) (Fig. 1a, Supplementary Data 1). Convalescent donors had a positive PCR test and detectable levels of anti-N IgG antibodies (Fig. 1b) or detectable anti-S IgG antibodies. The uninfected controls had undetectable levels of blood anti-N IgG antibodies (Fig. 1b). Utilizing multi-color flow cytometry, we measured three differentiation markers (CD127, CD28, and CD27) within the general CD8⁺ T cell population in the peripheral blood of convalescent donors and uninfected controls. Convalescent donors had significantly increased percentages of CD8⁺ T cells expressing CD127⁺ (IL7R) and CD28⁺/CD27⁺ and reduced CD28⁻/CD27⁺ subsets compared to the uninfected controls (Fig. 1c, Supplementary Fig. 1a). These findings suggest that SARS-CoV-2 infection alters the composition and status of CD8⁺ T cells through an enrichment of cells that are not fully differentiated.

**Fig. 1: Experimental scheme and frequencies of CD8⁺ T cells recognizing six epitopes of nucleocapsid protein of SARS-CoV-2.**

We analyzed CD8⁺ T cells recognizing six previously reported epitopes (presented by HLA-A2) of the highly conserved N protein of SARS-CoV^46,47 in convalescent donors (n = 34–35) and uninfected controls (n = 21–73) who are HLA-A2 positive. To measure the frequency of circulating N-specific CD8⁺ T cells, we created six MHC class I tetramers (HLA-A2) bearing these six epitopes (Fig. 1d). In agreement with previous reports^11,16,22,48, we found that both convalescent and uninfected donors had less than 1% of CD8⁺ T cells positive for each tetramer; however, compared to their uninfected counterparts, COVID-19 convalescent patients had significantly higher frequencies of CD8⁺ T cells specific for the epitope LLL and for the sum of all six epitopes (Fig. 1e). Specifically, central memory (T_CM) CD8⁺ T cells specific for LLL were increased in convalescent patients (Fig. 1f). Although there is no substantial sequence similarity between these epitopes of SARS-CoV-2 and other common coronaviruses (Supplementary Table 1)⁴⁹, these memory-phenotype, N-epitope-recognizing CD8⁺ T cells may derive from TCRs that cross-react with other antigens. Collectively, we observed that SARS-CoV-2 infection is associated with significantly higher frequencies of both LLL-specific CD8⁺ T cells and CD8⁺ T cells specific for the sum of the six N epitopes compared to uninfected controls.

Stable frequency of nucleocapsid-specific CD8⁺ T cells over time

To examine changes in the frequency and function of CD8⁺ T cells over time, we collected samples from convalescent donors across three visits over a year (Fig. 2a). As previously reported⁵⁰, we observed a significant decline of anti-nucleocapsid protein antibody titers in the blood, with a decay rate of −0.012 AU/mL per day (p = 0.001) (Fig. 2b). In contrast, the frequency of CD8⁺ T cells recognizing the six epitopes of nucleocapsid protein did not reduce over the course of a year (Fig. 2c–e, Supplementary Fig. 2a). We further analyzed changes in the subsets of LLL-specific CD8⁺ T cells and found that T_CM percentage significantly increased over time, but the other four subsets (T_N, T_SCM, T_EM, and T_EMRA) were not significantly changed (Supplementary Fig. 2b). These findings revealed that, in contrast to the decline of anti-nucleocapsid IgG titer over the course of a year, the overall frequencies and proportions of different subsets of nucleocapsid-specific CD8⁺ T cells remain stable.

**Fig. 2: Longitudinal analysis of LLL-specific CD8⁺ T cells in the blood of convalescent subjects.**

SARS-CoV-2 infection is associated with enhanced activation and proliferation of N-specific CD8⁺ T cells in vitro

To study how N-recognizing CD8⁺ T cells respond to antigenic challenge, we performed overnight stimulation with a pool of peptides from S and N protein (ALN, LQL, LLL, GMS, ILL, and LAL) and checked for any upregulation of activation-induced markers (Fig. 3a). We observed a mild increase in CD69⁺ CD8⁺ T cells in convalescent donors compared to uninfected controls (Supplementary Fig. 3a). Activation marker expression, however, does not necessarily predict if downstream expansion of CD8⁺ T cells will occur, so to measure this, we performed a long-term in vitro stimulation to observe the expansion of CD8⁺ T cells specific to these six nucleocapsid epitopes (Fig. 3a). We found that convalescent patients were able to expand in response to all six nucleocapsid epitopes, while uninfected donors could only respond to four epitopes (LLL, ALN, LQL, and ILL) (Fig. 3b, Supplementary Fig. 3b, and Supplementary Table 2). Among the six epitopes, we observed that LLL induced CD8⁺ T cell expansion in 94% of convalescent patients, compared to 50% of uninfected donors. To quantify the degree of expansion, we set the classifications as mild (1 < x < 3 cell divisions) or robust (≥3 cell divisions). We found that the average LLL-induced CD8⁺ T cell expansion in convalescent donors was four divisions higher (cell count 16-fold higher) compared to uninfected donors (Fig. 3c). These findings suggest that convalescent donors have a greater ability to expand CD8⁺ T cells in response to nucleocapsid epitopes, particularly to the dominant LLL epitope, compared to uninfected donors.
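The correspondence between cell divisions and fold expansion quoted above (four divisions, 16-fold) can be checked with a short sketch. It assumes idealized doubling at every division with no cell death, which is a simplification and not the authors' analysis. The function names are hypothetical, and the "none" label for one division or fewer is an assumption, since the text defines only the mild and robust classes.

```python
import math


def fold_from_divisions(divisions: float) -> float:
    """Fold increase in cell number, assuming each division doubles the population."""
    return 2.0 ** divisions


def divisions_from_fold(fold: float) -> float:
    """Inverse relation: number of idealized divisions behind a fold increase."""
    return math.log2(fold)


def classify_expansion(divisions: float) -> str:
    """Thresholds from the text: mild is 1 < x < 3 divisions, robust is >= 3.

    The 'none' label for x <= 1 is an assumption made for this sketch.
    """
    if divisions >= 3:
        return "robust"
    if divisions > 1:
        return "mild"
    return "none"


print(fold_from_divisions(4))   # four extra divisions correspond to a 16-fold cell count
print(classify_expansion(4))
```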

**Fig. 3: Functional assessment of the ability for in vitro expansion of SARS-CoV-2 nucleocapsid-specific CD8⁺ T cells.**

To determine whether the initial number of LLL-recognizing CD8⁺ T cells and their differentiation status influence activation-induced LLL-specific cell expansion, we compared the number of seeded naive and memory epitope-recognizing CD8⁺ T cells and the magnitude of expansion after in vitro stimulation. We found that the magnitude of expansion of LLL⁺ CD8⁺ T cells was positively correlated with the number of initially seeded T_EM LLL⁺ CD8⁺ T cells (p = 0.009) (Fig. 3d). The number of seeded LLL-specific CD8⁺ T cells of T_N, T_SCM, T_CM, and T_EMRA differentiation status had no effect on the magnitude of LLL⁺ CD8⁺ T cell expansion (Supplementary Fig. 3c). To further determine whether activation-induced expansion changes over time, we compared the magnitude of in vitro expansion of LLL⁺ CD8⁺ T cells across multiple visits of convalescent donors. We found that expansion of LLL⁺ CD8⁺ T cells in the majority of donors (65%) was unchanged or increased (Fig. 3e). Taken together, these findings identified a dominant nucleocapsid epitope (LLL) for CD8⁺ T cells in both convalescent and uninfected donors and demonstrated that SARS-CoV-2 infection primes LLL⁺ CD8⁺ T cells to have improved long-lasting expansion capabilities in response to subsequent antigenic challenge.

Stable frequency and expansion of LLL-recognizing CD8⁺ T cells with age

Since COVID-19 disproportionately affects the elderly population, we sought to determine whether age alters the frequency and expansion of CD8⁺ T cells against SARS-CoV-2 N epitopes. We found that age does not affect the levels of anti-N IgG titer in convalescent donors (Fig. 4a), nor does it affect the frequencies of CD8⁺ T cells specific for six N epitopes in either convalescent or uninfected controls (Fig. 4b and Supplementary Fig. 4a). We further analyzed the different subsets of LLL⁺ CD8⁺ T cells and did not observe significant change in the frequency of LLL⁺ CD8⁺ T cell subsets (T_N, T_CM, T_SCM, T_EM, and T_EMRA) in convalescent donors with age (Fig. 4c and Supplementary Fig. 4b). We also did not find that the age of convalescent donors impacts the activation-induced expansion of CD8⁺ T cells against LLL or any of the other five epitopes of the N protein (Fig. 4d and Supplementary Fig. 4c). Overall, age does not have a significant impact on anti-N IgG titer or LLL⁺ CD8⁺ T cell frequency and expansion; however, we acknowledge that the older convalescent donors in our study presented only mild symptoms of COVID-19 and thus may not reflect the immune status of the general elderly population.

**Fig. 4: Age-associated changes in titer of anti-nucleocapsid antibody and LLL-specific CD8⁺ T cells in uninfected and convalescent donors.**

Highly restricted Vα gene usage by LLL-specific CD8⁺ TCRs

To investigate the α and β chain sequences of LLL-recognizing TCRs, we isolated LLL tetramer⁺ CD8⁺ T cells from 15 donors (6 convalescent and 7 uninfected donors) before stimulation and from 14 donors (13 convalescent and 1 uninfected donors) with LLL⁺ CD8⁺ T cells which expanded after in vitro stimulation. After sorting LLL⁺ CD8⁺ T cells using flow cytometry, we determined the TCR sequences of these isolated cells through scTCR-seq of a total of 26,269 LLL⁺ CD8⁺ T cells (24,795 with a primary α chain and 1,474 additional TCRs containing a second unique functional α chain) consisting of 6,695 unique αβTCR sequences (Fig. 5a). Initial V gene usage analysis revealed a dominant Vα gene (TRAV12-2 = 50%) with relatively diverse Vβ genes (the most abundant: TRBV9 = 21%) in LLL⁺ CD8⁺ T cells (Fig. 5b). To confirm binding specificity, we selected 23 LLL-recognizing TCRs and expressed them in a Jurkat T cell line (Fig. 5c, Supplementary Fig. 5a). We tested each TCR-expressing cell’s ability to bind to the LLL-tetramer and checked for expression of Nur77, an early activation-induced gene expressed post TCR signaling, after LLL-HLA-A2 stimulation in vitro. We found that ten of the eleven TCRs from the TRAV12-2 family (91%) displayed strong binding to the LLL-tetramer as well as a strong GFP signal, but TCRs from the other five Vα gene families (TRAV12-1, 17, 19, 21, and 34) showed neither a high percentage of tetramer binding nor substantial activation-induced Nur77 reporter expression (Fig. 5c). We further analyzed LLL-specific TCRs that used the TRAV12-2 gene by TCRDist classification⁵¹ and identified four clusters containing experimentally confirmed LLL-binding TCRs (N = 516 representing 13,037 cells) and one cluster containing a TCR that did not bind LLL (N = 49 representing 91 cells) (Fig. 5d). We then analyzed the CDR3 motifs within each cluster of LLL-binding TCRs and found that CDR3α had a limited number of motifs compared to CDR3β. CDR3 motifs of different lengths can be interchangeably paired, and the combination of five CDR3α motifs and five CDR3β motifs of Cluster 2 (C2) accounted for 23% of LLL⁺TCR-TRAV12-2 (Fig. 5e). Clusters 1 and 3 (C1 and C3) had similar motif combinations and accounted for 22% and 10% of LLL⁺TCR-TRAV12-2, respectively (Supplementary Fig. 5b). Our findings revealed highly restricted Vα gene usage by LLL-specific TCRs and highly interchangeable pairing of TCRα and TCRβ within the TCR cluster.

**Fig. 5: Characteristics of TCRs and their predictability of binding to a dominant nucleocapsid LLL epitope.**

Since approximately 43% (10/23) of the tested LLL-TCRs from the sorted LLL tetramer⁺ CD8⁺ T cells bound to LLL-HLA-A2 and delivered signals post-activation, we sought to develop a method to identify TCRs that bind to the LLL-HLA-A2 complex by using a random forest (RF) algorithm to score the TCRs based on their CDR3α and CDR3β sequences. We selected the positive TCRs from TRAV12-2⁺ LLL tetramer⁺ CD8⁺ T cells that were clustered by TCRDist and were confirmed to bind to LLL-HLA-A2 in vitro. In parallel, we used TCRDist to cluster TCRs specific to other SARS-CoV-2 epitopes besides LLL and selected the TCRs with no LLL-HLA-A2 binding as negative TCRs (Supplementary Data 2). The amino acid sequences of CDR3α and CDR3β of both positive and negative TCRs were then broken down into 3-mers, each assigned one of five positional encodings (left end, left, center, right, and right end), and used to train an RF model as described⁵² (Supplementary Fig. 5c). The RF algorithm showed good accuracy in predicting unseen data, with an AUC (area under the curve) of 92.2% (Fig. 5f). Further analysis identified the k-mers that had the most impact on the RF model (Supplementary Fig. 5d). When we applied this RF algorithm to score all unique TRAV12-2⁺ TCRs from LLL tetramer⁺ sorted CD8⁺ T cells, we found that 76.5% of the TCRs had a score greater than 0.8, and this fraction of TCRs accounted for 91.1% of all TRAV12-2⁺ TCR-expressing CD8⁺ T cells (Fig. 5g). These results show that LLL-HLA-A2 binding TCRs preferentially use the TRAV12-2 gene and consist of a limited number of CDR3α motifs that are interchangeably paired with CDR3β motifs. Lastly, our machine learning (ML) algorithm demonstrates accurate prediction of LLL-HLA-A2 binding TCRs.
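The featurization step described above, 3-mers tagged with one of five positional labels, might look roughly like the following sketch. The binning rule is an assumption made for illustration (first k-mer is the left end, last k-mer is the right end, interior k-mers are split into thirds); the published method (ref. 52) may assign positions differently. The function name and the example CDR3 sequence are hypothetical.

```python
from collections import Counter

INTERIOR_BINS = ("left", "center", "right")


def positional_kmers(cdr3: str, k: int = 3) -> Counter:
    """Count (k-mer, position label) features for one CDR3 amino acid sequence.

    Assumed binning: the first k-mer is 'left_end', the last is 'right_end',
    and interior k-mers are divided into three equal position bins.
    """
    n = len(cdr3) - k + 1  # number of k-mers in the sequence
    features = Counter()
    for i in range(n):
        kmer = cdr3[i:i + k]
        if i == 0:
            label = "left_end"
        elif i == n - 1:
            label = "right_end"
        else:
            # Integer arithmetic maps interior index 1..n-2 onto bins 0..2.
            label = INTERIOR_BINS[(i - 1) * 3 // (n - 2)]
        features[(kmer, label)] += 1
    return features


# Hypothetical CDR3 sequence, used only to show the output shape.
for feature, count in positional_kmers("CAVGAQKLVF").items():
    print(feature, count)
```

Feature counts of this kind, computed for CDR3α and CDR3β and combined into one vector per TCR, are the sort of input a random forest classifier can be trained on.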

Affinity and overall structure of an LLL-specific TCR bound to LLL–HLA-A2

The above findings led us to investigate the structural basis for dominant usage of the TRAV12-2 gene segment by LLL-specific TCRs. TCR LLL8, which utilizes TRAV12-2 and TRAJ54 for the α chain and TRBV7-2 and TRBJ2-1 for the β chain, was selected for further characterization. We used surface plasmon resonance (SPR) to measure the affinity of TCR LLL8 for HLA-A2 loaded with LLL peptide (Fig. 6a). TCR LLL8 bound LLL–HLA-A2 with a dissociation constant (K_D) of 19.2 ± 1.2 μM. This affinity is within the range of TCRs specific for microbial antigens (1–50 μM)⁵³, including TCRs specific for SARS-CoV-2 spike epitopes^25,54. Kinetic parameters (on- and off-rates) for the binding of LLL8 to LLL–HLA-A2 were k_on = 1.7 × 10⁴ M^–1s^–1 and k_off = 0.34 s^–1, corresponding to a K_D of 20.4 μM (Fig. 6a), in close agreement with the K_D from equilibrium analysis (19.2 μM).
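The kinetic estimate of the dissociation constant follows from the standard 1:1 binding relation K_D = k_off / k_on. The sketch below applies it to the rounded rate constants quoted above; the function name is hypothetical. With these rounded inputs the result is about 20 μM, so the reported 20.4 μM presumably reflects the unrounded fitted values, which is an assumption here.

```python
def kd_from_rates(k_on: float, k_off: float) -> float:
    """Dissociation constant in molar from k_on (M^-1 s^-1) and k_off (s^-1)."""
    return k_off / k_on


# Rounded rate constants as quoted in the text.
kd_micromolar = kd_from_rates(k_on=1.7e4, k_off=0.34) * 1e6
print(f"K_D from rounded rates: {kd_micromolar:.1f} uM")
```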

**Fig. 6: Affinity and structure of the TCR LLL8–LLL–HLA-A2 complex.**

To understand how LLL-specific TCRs isolated from COVID-19 convalescent patients recognize the LLL epitope, we determined the structure of the LLL8–LLL–HLA-A2 complex to 3.18 Å resolution (Fig. 6b, Supplementary Table 3). The interface between TCR and pMHC was in unambiguous electron density for each of the four complex molecules in the asymmetric unit of the crystal (Supplementary Fig. 6a). The root-mean-square difference (r.m.s.d.) in α-carbon positions for the TCR VαVβ and MHC α1α2 modules, including the LLL peptide, ranged from 0.5 Å to 1.0 Å for the four LLL8–LLL–HLA-A2 complexes, indicating close similarity. Therefore, the following description of TCR–pMHC interactions applies to all molecules in the asymmetric unit of the crystal unless noted otherwise. TCR LLL8 docks over LLL–HLA-A2 in a canonical diagonal orientation, with Vα over the α2 helix of HLA-A2 and Vβ over the α1 helix. The crossing angle of TCR to pMHC⁵⁵ is 31° (Fig. 6c). The incident angle⁵⁶, which corresponds to the degree of tilt of TCR over pMHC, is 3°. As depicted by the footprint of TCR LLL8 on the pMHC surface (Fig. 6d), LLL8 establishes contacts with the N-terminal half of the peptide mainly through the CDR1α and CDR3α loops, whereas the CDR3β loop mostly contacts the C-terminal half.

Interaction of TCR LLL8 with HLA-A2

Of the total number of contacts (84) that TCR LLL8 makes with HLA-A2, excluding the LLL peptide, CDR1α, CDR2α, and CDR3α contribute 25%, 6%, and 33%, respectively, compared with 0%, 31%, and 5% for CDR1β, CDR2β, and CDR3β, respectively (Table 1) (Fig. 6g). Hence, Vα dominates the interactions of LLL8 with MHC (54 of 84 contacts: 64%), with the somatically generated CDR3α loop contributing more than any other CDR to MHC recognition (28 contacts). TCR LLL8 makes many more interactions with the HLA-A2 α1 helix than the α2 helix (Fig. 6e, f), mainly through CDR3α and CDR2β. These include a dense network of six hydrogen bonds linking Gln96α, Tyr48β, and Gln50β to Arg65H and Gln72H of helix α1 (Supplementary Table 4). In addition, Arg28α forms two hydrogen bonds with Glu166H at the C-terminus of helix α2 that further anchor LLL8 to HLA-A2. In agreement with this analysis, computational alanine scanning mutagenesis with Rosetta⁵⁷ of MHC residues in the interface with TCR identified Arg65H, Gln72H, and Glu166H as the three most energetically important HLA-A2 residues for engaging LLL8 (Supplementary Table 5).

Table 1 LLL8 TCR atomic contacts with the LLL peptide and HLA-A2

Based on the TCR3d database of experimentally determined TCR–pMHC structures⁵⁸, there are >40 structures containing TCRs that possess the TRAV12-2 germline gene and that bind HLA-A2, collectively representing at least 10 unique human TCRs. Several of these, including the TCR A6–Tax–HLA-A2 complex (PDB code 1AO7)⁵⁹ and TCR DMF5–MART-1–HLA-A2 complex (3QDG)⁶⁰, have α chain interactions with MHC, as well as with peptide backbone, that are highly similar to those of TCR LLL8 (Supplementary Fig. 6b–e). These conserved interactions, which occur between germline-encoded CDR1 and CDR2 loops and pMHC, appear to support the hypothesis that the canonical diagonal docking orientation of TCR on MHC, which is maintained in the LLL8–LLL–HLA-A2 complex, is the result of coevolution of TCR and MHC molecules^61,62. However, there are several HLA-A2-binding TCRs that possess the TRAV12-2 germline gene but whose α chains engage pMHC through different sets of interactions, as seen in TCR–pMHC complex structures RD1–MART-1–HLA-A2 (5E9D)⁶³, 868–SL9–HLA-A2 (5NME)⁶⁴, NYE-S1–NY-ESO-1–HLA-A2 (6RPB)⁶⁵, and YLQ7–YLQ–HLA-A2 (7N1F)²⁵, which contains a TCR bound to a SARS-CoV-2 spike epitope (see Discussion). Thus, convergent or preferred germline interaction motifs, as observed for LLL8 and other TRAV12-2 TCRs, are not always observed and are dependent on the TCR context (CDR3, TRBV gene) and/or epitope target.

Vα dominates LLL peptide recognition

A remarkable feature of LLL-specific TCRs isolated from COVID-19 convalescent patients is the almost exclusive use of members of the TRAV12 gene family (TRAV12-2 in the case of LLL8). Coincidentally, the large majority (~85%) of HLA-A*02:01-restricted TCRs specific for the YLQ spike epitope, which is unrelated in sequence to LLL, also use TRAV12-2 or TRAV12-1 gene segments^15,26. The TRAV12-2 chain of LLL-specific TCRs can pair with multiple Vβs, including TRBV9, 2, 7-2, 6-6, 18, and 14. TRBV gene usage appears to be widely distributed, with TRBV9 the most frequent (8.6%) out of 510 unique LLL-specific TCRs. The structure of the LLL8–LLL–HLA-A2 complex revealed the basis for this combinatorial diversity. Of the 56 total contacts that LLL8 establishes with the LLL peptide, the bulk (41; 73%) are mediated by Vα (Table 1). This Vα dominance allows pairing with multiple Vβs, which, like TRBV7-2 of LLL8, are expected to make comparatively few interactions with the peptide, as well as MHC (see above). CDR1α, CDR2α, and CDR3α account for 38%, 9%, and 27% of contacts with LLL, respectively, compared to 2%, 2%, and 23% for CDR1β, CDR2β, and CDR3β, respectively (Fig. 6j). Of note, the germline-encoded CDR1α loop contributes more than any other CDR to peptide recognition, with Gln31α and Ser32α forming a cluster of four hydrogen bonds with LLL: Gln31α Nε2–O P2 Leu, Gln31α Oε1–Nη2 P5 Arg, Ser32α N–Oδ2 P4 Asp, and Ser32α Oγ–Oδ2 P4 Asp (Fig. 6h, i) (Supplementary Table 6). It appears that the TRAV12-2 sequence is uniquely suited to providing this configuration of hydrogen bonds for specific binding with the ionic P4 Asp-P5 Arg core of the LLL peptide.

Both TRAV12-1 and TRAV12-2 encode CDR1α residues Gln31α and Ser32α, whereas TRAV12-3 encodes CDR1α residues Gln31α and Tyr32α. Computational mutagenesis of Ser32α to Tyr in the LLL8–LLL–HLA-A2 complex using Rosetta shows a highly unfavorable ΔΔG (17 kcal/mol), indicating that the TRAV12-3-encoded CDR1α Tyr residue would be incompatible or much less compatible with the LLL8 mode of LLL–HLA-A2 engagement. The TRAV12-1 CDR2α loop has a different length than TRAV12-2: 8 residues (TRAV12-1) vs. 9 residues (TRAV12-2) based on TCR3d CDR loop definitions⁵⁸. This difference in length leads to a preferred backbone conformation observed in most structurally characterized TRAV12-1 TCRs (e.g., PDB codes 6VRM, 7N6E, 7PBE, 7EA6) that is distinct from TRAV12-2 TCRs, including LLL8, suggesting that TRAV12-1 is incompatible with LLL8-like recognition of LLL-HLA-A2 (which includes two CDR2α residues as binding hotspots; Supplementary Table 6). Thus, residue and length features of the CDR1α and CDR2α loops of TRAV12-3 and TRAV12-1, respectively, may be responsible for the observed lack of those germline genes in LLL–HLA-A2-specific TCRs.

TCR LLL8 engages six residues of the LLL peptide, burying 353 Å² of peptide surface (Fig. 6h; Supplementary Table 7). However, most interactions involve central residues P4 Asp and P5 Arg (36 of 54 van der Waals contacts) (Fig. 6i), whose protruding side chains pack against the CDR1α and CDR3α loops. Based on SARS-CoV-2 sequences in the GISAID database⁶⁶, the LLL epitope is highly conserved, and there are only two polymorphisms with >0.1% frequency: Q229H and L230F. Analysis of the LLL8–LLL–HLA-A2 structure using Rosetta⁵⁷ predicts that the Q229H substitution at TCR-contacting position P8 will lead to maintained or improved LLL8 binding, whereas the L230F substitution at MHC anchor position P9 will prevent epitope presentation by HLA-A2. LLL8 represents a pan-sarbecovirus reactive TCR due to the conservation of the LLL epitope within that group, while in other coronaviruses the epitope varies (e.g., DLLNRLQAL in MERS-CoV N; four substitutions from the SARS-CoV-2 LLL sequence), and we anticipate no LLL8 cross-reactivity due to substitutions in three TCR-contacting peptide residues.
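The count of four substitutions between the SARS-CoV-2 LLL epitope and the MERS-CoV sequence can be reproduced by a position-by-position comparison, as in the sketch below. Both peptide sequences are taken from the text; the function names are hypothetical, and the numbering helper simply assumes that the epitope spans N residues 222 to 230.

```python
from typing import List, Tuple

LLL_SARS2 = "LLLDRLNQL"  # N_222-230 of SARS-CoV-2
LLL_MERS = "DLLNRLQAL"   # corresponding MERS-CoV N sequence quoted in the text


def substitutions(a: str, b: str) -> List[Tuple[int, str, str]]:
    """Return (peptide position, residue in a, residue in b) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("peptides must have equal length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


def peptide_position(protein_residue: int, epitope_start: int = 222) -> int:
    """Convert N protein numbering to a 1-based peptide position (P1, P2, ...)."""
    return protein_residue - epitope_start + 1


diffs = substitutions(LLL_SARS2, LLL_MERS)
print(len(diffs), diffs)       # four mismatches, at P1, P4, P7 and P8
print(peptide_position(229))   # Q229 maps to P8
```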

The key CDR3 residues in the LLL8–LLL–HLA-A2 complex provide insights into the observed CDR3 motifs, particularly for CDR3α. TCR LLL8 exemplifies the CDR3α motif (G/N)(G/A)(Q/N)K with its subsequence GAQK, which includes residues Ala95α and Gln96α that have key contacts with pMHC; both of those residues are identified as binding hotspots based on Rosetta (Supplementary Table 7). The specific CDR3α subsequence GAQK was observed in 27 out of 516 (5%) of LLL–HLA-A2-binding TCRs. Due in part to the apparent diversity of CDR3β sequences in LLL-specific TCRs and resultant lack of pronounced motifs, it is not clear whether the one CDR3β hotspot residue that was identified by Rosetta (Asp97β) based on the LLL8–LLL–HLA-A2 structure corresponds to a motif position and residue that is structurally conserved among LLL–HLA-A2-binding TCRs. The LLL8 CDR3β may be one example of a highly variable array of CDR3β recognition strategies in the context of restricted TCRα sequences and variable TRBV germline genes.
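The four-position CDR3α motif described above can be written as a regular expression, which gives a quick way to screen sequences for it. This is a minimal sketch rather than the authors' motif analysis: the function name and the full-length example sequences are hypothetical, and only the motif itself and the GAQK subsequence come from the text.

```python
import re

# One residue from each bracketed set, followed by a lysine.
CDR3A_MOTIF = re.compile(r"[GN][GA][QN]K")


def has_cdr3a_motif(cdr3a: str) -> bool:
    """True if the motif occurs anywhere in the CDR3 alpha sequence."""
    return CDR3A_MOTIF.search(cdr3a) is not None


print(has_cdr3a_motif("GAQK"))        # subsequence reported for TCR LLL8
print(has_cdr3a_motif("CAVGAQKLVF"))  # hypothetical sequence containing GAQK
print(has_cdr3a_motif("CAVSDKLIF"))   # hypothetical sequence without the motif

# Share of binding TCRs carrying GAQK, from the counts in the text.
print(round(27 / 516 * 100, 1))
```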

Superposition of the MHC α1α2 domains of unbound LLL–HLA-A2 (7KGQ)⁴⁹ onto those of LLL–HLA-A2 in complex with TCR LLL8 showed small yet relevant differences in peptide conformation, corresponding to r.m.s.d. of 0.87 Å for main-chain atoms of LLL. The largest displacement by far is for P5 Arg, whose α-carbon position shifts 2.7 Å. It appears that several residues of the LLL8 TCR α chain impinge on P5 Arg and cause it to bend from its erect posture above the peptide in unbound LLL–HLA-A2, downward and toward the HLA-A2 α2 helix in the LLL8–LLL–HLA-A2 complex.

Transcriptome alteration associated with the LLL-specific CD8⁺ T cell response

In vitro stimulation with LLL-HLA-A2 separated donors into two groups: expanders who had clear expansion of LLL-recognizing CD8⁺ T cells and non-expanders who did not have expansion of LLL-recognizing CD8⁺ T cells despite the presence of detectable levels of LLL tetramer⁺ CD8⁺ T cells. To understand what regulates activation-induced proliferation, we analyzed LLL tetramer⁺ unstimulated CD8⁺ T cells from both expanders and non-expanders by scRNAseq and identified six subsets (T_N, T_SCM, T_CM, T_EM, T_EMRA, and activated) of CD8⁺ T cells (Fig. 7a, b, Supplementary Fig. 7a) based on their characteristic gene expression features. Next, we compared the transcriptome of CD8⁺ T cells and of each subset between expanders and non-expanders using GSEA and found that CD8⁺ T cells (T_N, T_SCM, T_EM, and T_EMRA) from expanders expressed enriched genes involved in negative regulation of chemotaxis, cytokine activity, and responses to calcium ions (Supplementary Fig. 7b), whereas CD8⁺ T cells (T_N, T_SCM, T_EM, and T_EMRA) from non-expanders had enriched genes involved in chromatin modification, histone binding, positive regulation of cytokine production, and regulation of innate immune response (Fig. 7c). To rule out the possibility that differences in TCR quality between the two groups contributed to the activation-induced CD8⁺ T cell expansion, we selected LLL⁺ CD8⁺ T cells with high LLL-binding TCRs based on RF scores (>0.8) and compared the transcriptomes of the same subsets between expanders and non-expanders using GSEA. We found that the enriched functional groups presented in Figs. c, d remained the same between these two groups, suggesting that transcriptome changes identified between expanders and non-expanders are not due to differences in TCR quality. Furthermore, the enriched gene functional groups were highly shared among different CD8⁺ T cell subsets and interact closely with each other, as revealed by the gene network/pathway analysis (Fig. 7d, Supplementary Data 3). Together, these findings suggest a common underlying mechanism that regulates activation-induced CD8⁺ T cell proliferation and expansion.

**Fig. 7: Altered transcriptomes of CD8⁺ T cells against nucleocapsid LLL epitope in non-expanders.**

Discussion

The importance of CD8⁺ T cells in combatting SARS-CoV-2 infection is increasingly being recognized. In this study, we show that SARS-CoV-2 infection augmented CD8⁺ T cell immunity against epitopes derived from the conserved N protein. The improved CD8⁺ T cell response includes (1) a mild increase in circulating epitope-recognizing CD8⁺ T cells but substantially more expansion in response to stimulation in vitro, (2) long-lasting activity over one year after infection without obvious change with age, (3) restricted Vα gene usage by TCRs recognizing LLL, and (4) shared transcriptome features associated with weaker activation-induced proliferation. These findings identified LLL as a dominant nucleocapsid epitope, characterized LLL-specific TCRs in structural terms, and revealed CD8⁺ T cell transcriptome features associated with expanders and non-expanders. Such information will be valuable for further evaluation of the CD8⁺ T cell response to SARS-CoV-2 and for designing better SARS-CoV-2 vaccines that contain dominant epitopes not only from the S protein but also from other proteins such as the N protein. Indeed, LLL is one of six SARS-CoV-2 T cell epitopes included in a COVID-19 peptide-based vaccine (CoVac-1), which induces T cell immunity that is not affected by current SARS-CoV-2 variants²⁸.

Analysis of CD8⁺ T cells that recognize six epitopes from the N protein of SARS-CoV-2 in COVID-19 convalescent and unexposed HLA-A2⁺ individuals revealed several key features of CD8⁺ T cell immunity against this virus. First, epitope-specific CD8⁺ T cells with both naïve and memory phenotypes exist at low frequencies in unexposed individuals, which suggests that these epitope-specific memory CD8⁺ T cells may have been activated by common coronaviruses or other viruses that share similar sequences. Second, infection with SARS-CoV-2 results in only a slight increase in the frequencies of CD8⁺ T cells but significantly enhances proliferation in response to stimulation. While their enhanced proliferation is beneficial for containing initial infection, it remains to be determined whether they also contribute to unintended consequences such as long COVID⁶⁷. Third, over a one-year period, N-recognizing CD8⁺ T cells from convalescent donors have stable frequencies and in vitro responses to activation, which is strikingly different from IgG titers against N proteins. These findings suggest that SARS-CoV-2 infection induced better and longer-lasting CD8⁺ T cell immunity in convalescent than in unexposed donors. COVID-19 vaccination also induces protective CD8⁺ T cell immunity⁶⁸. It is unknown whether vaccines induce CD8⁺ T cell immunity against SARS-CoV-2 that is as robust and long-lasting as that induced by infection^28,69. A better understanding of CD8⁺ T cell immunity against SARS-CoV-2 could serve as a basis for developing efficacious T cell-based vaccines⁷⁰.

Knowledge of the diversity and size of antigen-specific TCR repertoires and the nature of TCR–pMHC interactions is essential to inform us about the status of T cell immunity to SARS-CoV-2. Combining tetramer staining/cell sorting and scTCRseq, we analyzed 22,727 LLL⁺ CD8⁺ T cells and found, strikingly, that LLL-binding TCRs used TRAV12-2, accounting for nearly 50% of all LLL⁺ TCRs. The other half of LLL⁺ TCRs used different TRAV genes, and none of their representative TCRs showed substantial binding or signaling. Even with tight gating on tetramer⁺ cells during sorting, substantial numbers of false-positive TCRs remain in the scTCR dataset. This suggests that the low-frequency antigen-specific CD8⁺ T cells identified by positive tetramer staining contained a high proportion (50% in LLL⁺ CD8⁺ T cells) of false positives. This problem was overcome by our development of an ML algorithm (RF model) that is able to identify true LLL-recognizing TCRs with good accuracy, paving the way to curate high-quality TCRs from the pool of undefined TCRs. Like all predictive ML algorithms, its accuracy relies on the quality of positive and negative training data. By using experimentally confirmed LLL-binding and non-binding TCRs as the seed in the same cluster of TCRs classified by the TCRDist program based on CDR3 amino acid sequences⁷¹, we were able to select an adequate number of TCRs for ML training and testing. The quality of this ML algorithm will be further tested when more LLL-binding TCRs and their crystal structures become available, which will improve its accuracy even more.

Crystal structures of several TCRs from COVID-19 convalescent patients bound to two spike epitopes (YLQ and RLQ) presented by HLA-A2 have been reported^23,24,25,26. These structures include: (1) TCR YLQ7–YLQ–HLA-A2²⁵, (2) TCR YLQ36–YLQ–HLA-A2²⁶, (3) TCR NR1C–YLQ–HLA-A2²⁴, (4) TCR RLQ3–RLQ–HLA-A2²⁵, and (5) TCR RLQ7–RLQ–HLA-A2⁷². Notably, TCRs LLL8 and YLQ7 use the same Vα gene segment, TRAV12-2, which is closely related to the TRAV12-1 gene segment used by TCRs YLQ36 and NR1C. The α chains of the three YLQ-specific TCRs (YLQ7, YLQ36, and NR1C) dock similarly atop HLA-A2, as the result of partly or fully conserved interactions between germline-encoded CDR1α and CDR2α loops and the α1 and α2 helices of HLA-A2 (Supplementary Fig. 8a–d). However, the α chain of LLL8 is displaced by ~4.5 Å towards the N-terminus of the LLL peptide compared to its position in the YLQ7–YLQ–HLA-A2 and other complexes (Supplementary Fig. 8e), resulting in a different set of interactions between the CDR1α and CDR2α loops and HLA-A2. This displacement is probably dictated by the LLL peptide, which is unrelated to the YLQ peptide.

The obvious discrepancy between the high mortality of COVID-19 and the robust immune response to COVID-19 vaccines in older adults remains a puzzle. Here, we did not find significant age-related changes in (1) plasma IgG titer against the N protein, (2) frequencies and in vitro expansion of CD8⁺ T cells recognizing N epitopes, and (3) longevity of CD8⁺ T cells recognizing N epitopes over one year after infection. Due to the lack of severely ill COVID-19 patients in our study cohort, it is possible that we missed age-associated immune defects. In an attempt to understand the mechanisms underpinning robust and poor CD8⁺ T cell responses, we compared the transcriptome of LLL-recognizing CD8⁺ T cells between expanders and non-expanders and found that CD8⁺ T cells from non-expanders have enhanced expression of genes related to histone modifications (KMT2A, PSMB9, LBH), differentiation (BCL6, RORA, PDCD4), and lymphocyte-mediated immunity (GZMB, PRF1, LYST). These changes appear to be shared among different memory subsets. These findings suggest that advanced differentiation within the defined memory subsets is associated with a poor proliferative response to peptide stimulation. In contrast, CD8⁺ T cells from expanders have enhanced expression of genes related to response to calcium ions (JUN, JUNB, DUSP1), response to glucocorticoid (FOS, FOSB, AIF1), and structural molecule activity (ACTB, ACTG1, TUBA1A). As in non-expanders, these changes are also generally shared among different CD8⁺ T cell subsets. The link between these enriched genes, how they collectively facilitate better stimulation-induced proliferation, and whether such changes are associated with aging remain to be determined.

The fine specificity of the TCR and the cellular competence for activation-induced proliferation and differentiation are two key elements that determine the quality of CD8⁺ T cell immunity against SARS-CoV-2 and potential clinical outcomes. Empowered by single-cell technology and ML algorithms, analysis of the antigen-specific CD8⁺ T cell response will reveal essential details of the pre-existing, post-vaccine, and post-infection status of CD8⁺ T cells and will offer guidance for vaccine development and administration.

Methods

Human donors

Seventy-five convalescent and 138 uninfected donors were recruited under an NIH IRB-approved protocol (000140), and all donors provided written informed consent regarding their participation in the study. All convalescent patients had proof of a positive COVID-19 PCR test, positive anti-SARS-CoV-2 spike protein IgG, detectable levels of anti-SARS-CoV-2 nucleocapsid IgG on the date of blood draw, and self-reported mild COVID-19 symptoms. Uninfected healthy donors were either unvaccinated or vaccinated against SARS-CoV-2 (received the Pfizer-BioNTech or Moderna COVID-19 vaccines). All uninfected donors had undetectable levels of anti-SARS-CoV-2 nucleocapsid IgG on the date of blood draw. Seventy-eight donors had not received the COVID-19 vaccine and had no detectable blood anti-spike and anti-nucleocapsid IgG antibodies. The other 60 donors had been vaccinated with either Pfizer-BioNTech or Moderna COVID-19 vaccines but displayed no detectable levels of anti-nucleocapsid antibodies.

Blood processing and PBMC isolation

Blood was collected in EDTA-containing tubes. Sample processing and PBMC isolation were carried out within 1–24 h of sample collection. To obtain EDTA plasma, 1.5 mL of blood was centrifuged at 438 g and the resulting plasma supernatant was collected and stored at −80 °C for antibody testing at a later date. PBMCs were isolated by diluting blood samples with Hank’s Balanced Salt Solution (1X solution without calcium and magnesium), layering on Ficoll-Paque, and centrifuging at 894 g for 25 min. Cells at the interface were collected and washed twice with HBSS buffer before further processing or cryopreservation. A fraction of the isolated PBMCs was used for lymphocyte staining. The HLA-A2 genotype of each donor was determined by flow cytometry using HLA-A2-FITC antibody (BioLegend). PBMCs of HLA-A2⁺ donors were used for the AIM assay and tetramer staining (Table 2). CD8⁺ T cells were positively selected using the EasySep™ Direct Human CD8⁺ T Cell Isolation Kit (STEMCELL Technologies) according to the manufacturer’s instructions and used for 14-day culture with peptide stimulation.

Table 2 Source of reagents


Detection of anti-SARS-CoV-2 antibody using ELISA

All donors were tested for the presence of antibodies (IgG) against SARS-CoV-2 nucleocapsid and spike proteins. Plasma samples stored at −80 °C were prepared following the manufacturer’s protocols for the LEGEND MAX™ SARS-CoV-2 Nucleocapsid Human IgG ELISA Kit and the LEGEND MAX™ SARS-CoV-2 Spike S1 Human IgG ELISA Kit. The samples were analyzed by microplate reader (SpectraMax M2, Molecular Devices) and a four-parameter logistic curve was fitted using the plate standards. All samples were titrated appropriately so that the OD value ≥ 1; a sample was considered undetectable if the OD value < 1 at a 1:50 dilution.
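The four-parameter logistic (4PL) standard curve mentioned above can be sketched as follows. This is only an illustration of the general 4PL form and its inverse, not the plate reader's or the kit's software; the parameter names (`bottom`, `top`, `ec50`, `hill`) are assumptions introduced here, and fitted values would come from the plate standards.

```python
def four_pl(conc, bottom, top, ec50, hill):
    """OD predicted by a 4PL curve at a standard concentration `conc` (> 0)."""
    return bottom + (top - bottom) / (1.0 + (ec50 / conc) ** hill)


def inverse_four_pl(od, bottom, top, ec50, hill):
    """Concentration whose predicted OD equals `od` (requires bottom < od < top)."""
    if not bottom < od < top:
        raise ValueError("OD must lie strictly between the curve asymptotes")
    ratio = (top - bottom) / (od - bottom) - 1.0
    return ec50 / ratio ** (1.0 / hill)
```

Once the four parameters have been fitted to the standards, `inverse_four_pl` interpolates a sample concentration from its measured OD, which is then multiplied by the dilution factor.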

Flow cytometry analysis of CD8⁺ T cells

For all donors, 2 M freshly isolated PBMCs were used to analyze overall immune markers within lymphocyte populations. Cells were stained with Fixable Viability Stain 780 (BD Biosciences) at a 1:100 dilution for 5 min at room temperature, washed, and resuspended in Brilliant Stain Buffer (BD Biosciences). A surface stain cocktail mix including CD95-PE/Cy5, CD45RA-BUV805, HLA-DR-BUV737, CD8-BUV496, CD27-BUV395, CD28-BV786, CD127-BV711, CD69-BV650, CD137-BV605, CD3-V500, CD38-PerCP-eFluor 710, CD62L-FITC, and CD4-BUV661 was added (Table 2). After 30 min of incubation at 4 °C, cells were washed and fixed in 4% paraformaldehyde overnight. The next day, cells were washed using 1X Perm/Wash buffer (BD Biosciences), stained with an intracellular antibody cocktail made up of Granzyme B-PE/Cy7 and Perforin-PE/Dazzle 594, and incubated at 4 °C for 30 min. Cells were washed, resuspended in 1% paraformaldehyde, and analyzed by flow cytometry (FACSymphony, BD Biosciences). All collected flow cytometry data were analyzed with FlowJo 10.5.

Tetramer staining

All MHC class I tetramers were made by the NIH Tetramer Core Facility. Up to five tetramer-peptide reagents with contrasting fluorescence were used in a given staining cocktail. The amount of each tetramer was titrated (0.1–1 μL) to obtain the optimal concentration for usage. Freshly isolated and 14-day cultured cells were washed with PBS, stained with tetramer cocktail in PBS + 2% FBS, and incubated at 4 °C for 30 min. An antibody cocktail comprising CD8-PerCP/Cy5.5, CD45RA-BV510, CD62L-PE/Cy7, and CD95-PE/Dazzle 594 (BioLegend) was added and samples were incubated for an additional 30 min at 4 °C. Samples were then washed with FACS buffer and analyzed by flow cytometry (CytoFLEX, Beckman Coulter).

AIM assay

PBMCs (1 × 10⁶ per well) were cultured for 24 h in the presence of SARS-CoV-2 S1-specific peptides (1 µg/mL) (JPT Peptides) and six peptides of N protein (AlanScientific.com), 0.5% DMSO (equimolar amount), or 2 µg/mL phytohemagglutinin (PHA) in 96-well U-bottom plates. After stimulation, cells were collected and resuspended in 50 µL BSM. Human TruStain FcX™ Fc (2.5 µl) was added and incubated for 10 min at room temperature. Antibodies (CD8-BV510, CD38-APC, CD69-PECY7, CD137-PE, and HLA-DR-FITC, 2.5 µl each) were added (Table 2) and incubated for 30 min at 4 °C. Cells were then washed once with 2 mL FACS buffer, resuspended in 250 µL FACS buffer, and collected using a BD FACSCanto II.

In vitro stimulation and culture

Positively selected CD8⁺ T cells from HLA-A2⁺ donors were used to determine their antigen-specific activation-induced expansion in vitro as previously described⁷³. A mixture of 0.2 million CD8⁺ T cells, 2 million PBMCs, and 10 μg of each of the six nucleocapsid peptides was prepared and transferred to a 96-well round-bottom plate at 100 μL medium per well. Each donor had three plates set up with the same peptide mix. Cells remained in culture at 37 °C for seven days, with 40 μL of additional medium added on day 3 to replenish depleted nutrients.

On day 7, cells were harvested and counted, and 1 M cells were used for tetramer staining. Out of the remaining harvested cells, 6 M (2 M/plate) were restimulated using HLA-A2/anti-CD28 microbubbles loaded with our SARS-CoV-2 nucleocapsid peptides. To create a microbubble for each of our six epitopes, we combined 5 μg of peptide with 100 μL of HLA-A2/anti-CD28 microbubble, rotated the mixture at 4 °C for 20 min, then left it to sit at 4 °C overnight before use. The 6 M cells were combined with 10 μL of the microbubble mixture for each of the six epitopes and spun at room temperature for 20 min before being plated onto a 96-well round-bottom plate (100 μL/well). Cells remained in culture at 37 °C for seven more days, with 40 μL of additional medium added on day 10 to replenish depleted nutrients.

On day 14, cells were counted and 1 M cells were taken for tetramer staining. Samples displaying antigen-specific CD8⁺ T cell expansion were suspended in freezing medium and stored in a liquid nitrogen freezer. The degree of expansion was calculated as the fold change over the course of the 14-day culture (the absolute cell count of epitope-specific CD8⁺ T cells on day 14 divided by the absolute cell count of epitope-specific CD8⁺ T cells on day 0).
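The fold-change calculation described above can be written as a short sketch. It assumes, as an illustration only, that the absolute count of epitope-specific cells is obtained by multiplying a counted number of CD8⁺ T cells by the tetramer⁺ fraction measured by flow cytometry; the function names and example numbers are hypothetical.

```python
def epitope_specific_count(cd8_cell_count, tetramer_pos_fraction):
    """Absolute number of tetramer-positive CD8 T cells (assumed derivation)."""
    if not 0.0 <= tetramer_pos_fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return cd8_cell_count * tetramer_pos_fraction


def fold_expansion(count_day0, count_day14):
    """Day-14 epitope-specific count divided by the day-0 count."""
    if count_day0 <= 0:
        raise ValueError("day-0 count must be positive to compute a fold change")
    return count_day14 / count_day0


# Hypothetical donor: 0.05% tetramer+ of 200,000 CD8 T cells on day 0,
# 2% tetramer+ of 3,000,000 CD8 T cells on day 14.
day0 = epitope_specific_count(200_000, 0.0005)
day14 = epitope_specific_count(3_000_000, 0.02)
example_fold = fold_expansion(day0, day14)
```

A donor with no detectable tetramer⁺ cells on day 0 has an undefined fold change under this definition, which is why the sketch raises an error rather than returning a value.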

scRNAseq and scTCRseq

Single-cell library generation and sequencing analysis

Single-cell RNA-seq (scRNA-seq) and single-cell TCR-seq (scTCR-seq) libraries were prepared following the protocol for the 10X Genomics Chromium Next GEM Single Cell 5’ Reagent Kits v2 (Dual Index). Prior to generation of Gel Beads-in-emulsion (GEMs), cells were stained with LLL-tetramer (BV421 or APC) and sorted by Molflow sorter. Up to 10,000 single cells were used for each library. Some libraries were also stained with a hashtag oligo antibody to allow pooling of multiple samples in one library. In brief, each cell was captured in a GEM, followed by reverse transcription, cleanup, and cDNA amplification. After purification of the amplified cDNA, 50 ng of the purified cDNA sample was used for the GEX library and 50 ng was used to generate scTCR-seq libraries. V(D)J amplification was carried out and scTCR-seq libraries were prepared by fragmenting V(D)J segments, repairing the ends, and attaching sample indexes. Both GEX and TCR libraries were fragmented, size selected, and indexed, and the libraries were pooled for sequencing (Illumina NovaSeq).

scRNA-seq FASTQ files were generated from an Illumina NovaSeq Sequencer. Read 1, Read 2, and the sample index were sequenced to 28, 91, and 8 base pairs (single index) or 10 and 10 bases (dual index), respectively. Filtered gene expression reads were mapped to the human reference genome, GRCh38 2020-A, via the Cellranger 7.0.0 count pipeline to obtain unique molecular identifier (UMI) counts for each individual sample. Filtered V(D)J reads were mapped to the vdj GRCh38 alts ensembl 5.0.0 reference via the Cellranger 7.0.0 vdj pipeline, which generated contiguous V(D)J sequences per single cell. To further separate samples, hashtag oligo libraries matching the gene expression libraries were generated by using the Cellranger 7.0.0 count pipeline. UMIs corresponding to the specific hashtag oligo sequence designating each sample were counted, and cells with at least 5 UMIs and a significantly higher UMI count for one particular sequence were labeled as the sample specified by that hashtag oligo. This information was included in the metadata of the gene expression libraries.
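The hashtag assignment rule described above can be sketched as follows. The 5-UMI minimum comes from the text; how "significantly higher" was decided is not specified there, so the ratio threshold `min_ratio` below is a hypothetical stand-in, and the function name and hashtag labels are illustrative.

```python
def assign_hashtag(umi_counts, min_umis=5, min_ratio=3.0):
    """Return the hashtag label for one cell, or None if it is ambiguous.

    umi_counts: dict mapping hashtag name -> UMI count for this cell.
    min_ratio:  ASSUMED dominance criterion (top count vs. runner-up).
    """
    if not umi_counts:
        return None
    ranked = sorted(umi_counts.items(), key=lambda kv: kv[1], reverse=True)
    top_tag, top_count = ranked[0]
    second_count = ranked[1][1] if len(ranked) > 1 else 0
    if top_count < min_umis:
        return None  # too few UMIs to trust any label
    if top_count < min_ratio * max(second_count, 1):
        return None  # no single hashtag clearly dominates
    return top_tag
```

Cells returning `None` would be left unlabeled; the resulting labels could then be stored as a metadata column alongside the gene expression data.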

Data integration and clustering

Individual expression matrices were loaded through Read10X in Seurat 4.0 and used for filtering, normalization, clustering, and visualization. Cells were excluded if they expressed fewer than 500 genes, more than 10,000 genes, or more than 20% mitochondrial genes. Expression was log-2-normalized via the Seurat function NormalizeData, and individual libraries were saved for batch correction. All samples were merged and visualized via UMAP at 2000 features and 20 principal components to compare sample similarity. Libraries with fewer than 30 cells were merged with the samples they most closely aligned with in the initial clustered UMAP. Batch correction to remove potential sample-to-sample biases was carried out via IntegrateData, with 25 principal components and 1500 features. The batch-corrected libraries were then visualized via UMAP at 25 principal components and 1500 features. Clustering was carried out via FindNeighbors and FindClusters at a resolution of 0.3, and markers defining each cluster were found via FindAllMarkers and compared with canonical single-cell markers of CD8⁺ T cells for labeling.
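The cell-level quality filter stated above can be restated in a few lines. The analysis itself was done in Seurat; this Python sketch only mirrors the three thresholds from the text, and the record layout (`n_genes`, `pct_mito`) is a hypothetical choice made for illustration.

```python
def passes_qc(n_genes, pct_mito, min_genes=500, max_genes=10_000, max_pct_mito=20.0):
    """True if a cell is kept: 500-10,000 detected genes and <= 20% mitochondrial reads."""
    return min_genes <= n_genes <= max_genes and pct_mito <= max_pct_mito


def filter_cells(cells):
    """Keep only the cell records (dicts with 'n_genes' and 'pct_mito') that pass QC."""
    return [c for c in cells if passes_qc(c["n_genes"], c["pct_mito"])]


# Hypothetical cells: one good, one with too few genes, one with high mito content.
example_cells = [
    {"barcode": "cell_a", "n_genes": 2300, "pct_mito": 4.1},
    {"barcode": "cell_b", "n_genes": 310, "pct_mito": 3.0},
    {"barcode": "cell_c", "n_genes": 1800, "pct_mito": 27.5},
]
kept = filter_cells(example_cells)
```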

TCR cluster and CDR3 motif analysis

Clustering of 565 unique LLL-specific TCR sequences containing the TRAV12-2 gene was performed using TCRDist clustering via CoNGA^71,74. Unique V(D)J sequences were first collected from single-cell V(D)J libraries and sorted by frequency of appearance. V(D)J sequences were grouped by similarity in the amino acid sequences of the alpha V gene, CDR3A, alpha J gene, beta V gene, CDR3B, and beta J gene, and each was assigned an index with a TCR distance based on similarity to other TCRs. The resulting distances were then Louvain clustered via the R package igraph at a resolution of 1, generating 5 clusters of unique TCRs. Unique CDR3 amino acid sequences were derived from LLL-TCRs based on the usage of TRAV12-2 and membership in clusters containing experimentally confirmed LLL-HLA-A2 binding TCRs. CDR3 regions of these specific TCRs were further sorted by alpha and beta CDR3 length within the TCRDist cluster and submitted for motif analysis. CDR3 motifs were generated via the MEME Suite 5.5.

Machine learning analysis

To establish the machine learning model, 575 TCRs that used TRAV12-2 (including both first and second functional alpha chains) were selected from the LLL TCRs identified by scTCRseq. These TCRs were then grouped using TCRDist clustering, and four TCR clusters containing experimentally confirmed LLL-HLA-A2 binding TCRs were combined, resulting in a set of 524 positive LLL-HLA-A2 binding TCRs. In parallel, 7719 non-LLL tetramer⁺ TCRs isolated from HLA-A2⁺ donors were also grouped by TCRDist, and five clusters containing TCRs experimentally confirmed not to bind LLL-HLA-A2 were combined, resulting in a set of 5355 negative TCRs. Ten percent of the positive and negative TCRs were withheld for testing. The remaining 90% of TCRs were used to generate a dictionary of 3-amino-acid-long sequences (kmers) representing 5 different regions of CDR3 (the leftmost and rightmost kmers as L-end and R-end, followed by the left-side, right-side, and center kmers). Ten random forest models were created using 471 positive TCRs and 471 negative TCRs drawn from the total negative set. Through unguided machine learning, the models generated 15 decision trees selecting kmers that accurately separated the defined positive and negative sets. TCRs were assigned a score based on the presence of these kmers, with a score greater than 0.8 being considered a positive TCR. The validity of the models was determined by their ability to accurately separate the positive and negative sets in the withheld TCRs. The analysis was conducted using Python code (via scikit-learn and its random forest classifier).
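The feature construction and balanced sampling described above can be sketched as follows. This is not the deposited code: the exact positions of the five kmer regions are an assumption made here (first 3 residues, next 3, middle 3, 3 before the last 3, and last 3), the function names are hypothetical, and the random forest fitting itself is omitted.

```python
import random


def cdr3_region_kmers(cdr3, k=3):
    """Extract one kmer per region of a CDR3 amino acid string (assumed layout)."""
    n = len(cdr3)
    if n < k:
        raise ValueError("CDR3 is shorter than the kmer length")

    def clamp(start):
        # Keep the window inside the sequence for short CDR3s.
        return max(0, min(start, n - k))

    starts = {
        "L_end": 0,
        "left": clamp(k),
        "center": (n - k) // 2,
        "right": clamp(n - 2 * k),
        "R_end": n - k,
    }
    return {region: cdr3[s:s + k] for region, s in starts.items()}


def balanced_training_sets(positives, negatives, n_models=10, seed=0):
    """For each model, pair all positives with an equal-sized random draw of negatives."""
    rng = random.Random(seed)
    return [
        (list(positives), rng.sample(list(negatives), len(positives)))
        for _ in range(n_models)
    ]


# Arbitrary illustrative CDR3 string, not taken from the dataset.
example_kmers = cdr3_region_kmers("CAVNTGNQFYF")
```

Each (region, kmer) pair could then be one-hot encoded as a feature column, and each balanced set passed to a classifier with 15 trees; scoring a TCR as positive above 0.8 follows the threshold given in the text.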

TCR expression and binding validation

Generation of pHAGE-TCRα/β plasmid

The full-length coding sequences of the TCRα and β chains were joined by the P2A “self-cleaving” site, which can terminate translation at the final codon (Pro) of the 2A sequence and reinitiate translation of the following sequence. The entire TCRβ-P2A-TCRα sequence was synthesized in the pHAGE vector by Twist Bioscience.

Lentivirus transduction of GIL-specific TCR into NJ76 cell line

HEK293 cells were plated the day before transfection at a density of 1.5 × 10⁶ cells per well of a 100 mm dish in 10 ml of complete growth medium (DMEM + 10% Fetal Bovine Serum). pHAGE-TCR, pCMV-dR8.2, and pCMV-VSV-G plasmids (17 μg) were added to 800 µl of OptiMEM together with 50 µl of FuGENE® HD reagent. The mixture was incubated for 10 min at room temperature and then added to one plate of HEK293T cells. SFFV-CD8 virus (Twist Bioscience) [ref.] was collected at 48 h/72 h from the supernatant of transfected HEK293T cells. NJ76 cells were diluted in complete medium to a final concentration of 1 × 10⁶ cells/mL with polybrene at a concentration of 5 μg/ml. Lentiviral solution was added to 6 mL of Jurkat cells and incubated at room temperature for 20 min. The cells were centrifuged at 800 × g for 30 min at 22–32 °C and the virus-containing medium was removed. The cell pellet was resuspended in 6 ml of medium, transferred to a T25 tissue culture flask, and returned to the tissue culture incubator for 2–3 days. After 3 days of culture, the NJ76 cells were stained with anti-human TCR antibody and sorted for TCR⁺RFP⁺ double-positive cells. NJ76 cells possessing GIL-specific TCRs were thus generated for later experiments.

Stimulation in NJ76 cell line with Nur77 GFP reporter system

NJ76-TCR cells were cultured with influenza GIL peptide-loaded (10⁻⁸ M) artificial antigen-presenting cells at 37 °C for 4/24 h. Anti-CD3/CD28-conjugated HLA-A2 microbubbles (100–200 M/ml) were used to stimulate another aliquot of NJ76-TCR cells simultaneously in order to compare common MHC-TCR activation and GIL-specific TCR activation. GFP expression in the TCRαβ⁺ NJ76-TCR cell population was quantified by Beckman CytoFLEX flow cytometry and results were analyzed with FlowJo (10.5). We later used streptavidin-MB conjugated with biotinylated HLA-A2/GIL and biotin-anti-CD28 for stimulation.

Crystallographic analysis

Protein preparation

Soluble TCR LLL8 for affinity measurement and structure determination was produced by in vitro folding from inclusion bodies expressed in Escherichia coli. Codon-optimized genes encoding the TCRα (residues 1–203) and β (1–243) chains were synthesized and cloned into the expression vector pET22b (GenScript). An interchain disulfide (CαCys157–CβCys171) was engineered to increase the folding yield of TCR αβ heterodimer. The TCR α and β chains were expressed separately as inclusion bodies in BL21(DE3) E. coli cells (Agilent Technologies). Bacteria were grown at 37 °C in LB medium to OD₆₀₀ = 0.6–0.8 and induced with 1 mM isopropyl-β-D-thiogalactoside for 3 h. The bacteria were harvested by centrifugation and resuspended in 50 mM Tris-HCl (pH 8.0) containing 0.1 M NaCl and 2 mM EDTA. After sonication, inclusion bodies were washed three times with 50 mM Tris-HCl (pH 8.0) and 5% (v/v) Triton X-100, then dissolved in 8 M urea, 50 mM Tris-HCl (pH 8.0), 10 mM EDTA, and 10 mM DTT. For in vitro folding, the TCRα (45 mg) and TCRβ (35 mg) chains of dissolved inclusion bodies were mixed and diluted into 1-liter folding buffer containing 100 mM Tris-HCl (pH 8.0), 5 M urea, 0.4 M L-arginine-HCl, 3.7 mM cystamine, and 6.6 mM cysteamine. After dialysis at 4 °C against distilled water and 10 mM Tris-HCl (pH 8.0) for 24 and 48 h, respectively, the folding mixture was concentrated 20-fold and dialyzed overnight against 50 mM MES buffer (pH 6.0). After removal of the precipitate by centrifugation, the folding mixture was dialyzed overnight at 4 °C against 20 mM Tris-HCl (pH 8.0) and 20 mM NaCl. Disulfide-linked TCR LLL8 was purified using consecutive Superdex 200 (20 mM Tris-HCl (pH 8.0), 20 mM NaCl) and Mono Q (10 mM Tris-HCl (pH 8.0), 0–1.0 M NaCl gradient) FPLC columns (GE Healthcare).

Soluble HLA-A2 loaded with LLL peptide (LLLDRLNQL) was prepared by in vitro folding of E. coli inclusion bodies as described⁷⁵. Correctly folded LLL–HLA-A2 complexes were purified using sequential Superdex 200 (20 mM Tris-HCl (pH 8.0), 20 mM NaCl) and Mono Q columns (10 mM Tris-HCl (pH 8.0), 0–1.0 M NaCl gradient). To produce biotinylated HLA-A2, a C-terminal tag (GGGLNDIFEAQKIEWHE) was attached to the HLA-A*0201 heavy chain. Biotinylation was carried out with BirA biotin ligase (Avidity).

Crystallization and data collection

For crystallization of the LLL8–LLL–HLA-A2 complex, TCR LLL8 was mixed with LLL–HLA-A2 in a 1:1 ratio and concentrated to 13 mg/ml. Crystals were obtained at room temperature by vapor diffusion in hanging drops. The LLL8–LLL–HLA-A2 complex crystallized in 0.1 M Tris-HCl (pH 8.5) and 13.5% (w/v) PEG 20 K. Before data collection, crystals were cryoprotected with 20% (w/v) glycerol and flash cooled. X-ray diffraction data were collected at beamline 23-ID-B of the Advanced Photon Source, Argonne National Laboratory. Diffraction data were indexed, integrated, and scaled using the program AIMLESS⁷⁶. Data collection statistics are shown in Table S6.

Structure determination and refinement

The LLL8–LLL–HLA-A2 structure was determined using the molecular replacement program PHASER⁷⁶ within the CCP4i suite of crystallographic software⁷⁷ after synchrotron diffraction screening of ~100 crystals and molecular replacement searches in four of the best datasets. The successful searches used probes derived from published PDB structures 2UWE⁷⁸ and 6VRM⁷⁵. An additional key probe, a sequence-based model of the VαVβ component of the TCR, was generated by the TCR structure prediction resource TCRmodel⁷⁹. With four TCR–pMHC complexes (~3300 amino acids) per asymmetric unit and 3.18 Å resolution data, molecular replacement was a process of building up the solution domain-wise, first locating the MHC components, then placing the Vα and Vβ domains of the TCRs, and finally the Cα and Cβ domains. Molecular replacement outputs were evaluated for their capacity to be reproduced subject to variations in probe, dataset, and resolution shell, as well as their structural reasonableness (e.g., polypeptide continuity at domain interfaces and avoidance of steric clashes). When all MHC and TCR components had been correctly placed, refinement using REFMAC⁸⁰ lowered the R-free metric from 0.45 to 0.40, and difference maps showed the remaining domains, thus demonstrating that the structure was solved. From that point, maps also guided the placement of about 200 residues that differed structurally or sequence-wise from the probes or had been omitted. The last parts to be built were the LLL peptides and CDR loops in the four complexes. Final electron density was unambiguous for all the main chain, but a few side chains in the CDRs retained weak density and were confirmed by residue-specific omit-refine maps. The four final complexes in the asymmetric unit are very similar, superposing with all six pairwise root-mean-square difference (r.m.s.d.) values under 1.5 Å for α-carbon positions. The LLL8–LLL–HLA-A2 complex with the clearest maps, which also has the lowest r.m.s.d. from the other three, has been assigned chain identifiers ABDEF, is designated the biological unit, and is described in Results. Refinement statistics are summarized in Table S6. Contact residues were identified with the CONTACT program in CCP4i⁷⁷ and were defined as residues containing an atom 4.0 Å or less from a residue of the binding partner. The PyMOL program (https://pymol.org/) was used for r.m.s.d. calculations, graphical map interpretation and model building, and to prepare figures.
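The 4.0 Å contact definition given above can be illustrated with a small distance check. This is not the CCP4 CONTACT program; it is a brute-force sketch over hypothetical atom records, each a residue label paired with Cartesian coordinates in Å.

```python
import math


def contact_residues(atoms_a, atoms_b, cutoff=4.0):
    """Residues of each partner having an atom within `cutoff` Å of the other partner.

    atoms_a, atoms_b: lists of (residue_label, (x, y, z)) tuples.
    Returns two sets: contact residues in partner A and in partner B.
    """
    contacts_a, contacts_b = set(), set()
    for res_a, xyz_a in atoms_a:
        for res_b, xyz_b in atoms_b:
            if math.dist(xyz_a, xyz_b) <= cutoff:
                contacts_a.add(res_a)
                contacts_b.add(res_b)
    return contacts_a, contacts_b


# Toy coordinates (not from the deposited structure).
tcr_atoms = [("TCR:Y31", (0.0, 0.0, 0.0)), ("TCR:S32", (10.0, 0.0, 0.0))]
peptide_atoms = [
    ("PEP:L1", (3.0, 0.0, 0.0)),
    ("PEP:L2", (0.0, 4.0, 0.0)),
    ("PEP:D4", (20.0, 0.0, 0.0)),
]
tcr_contacts, peptide_contacts = contact_residues(tcr_atoms, peptide_atoms)
```

For a full complex the pairwise loop grows with the product of the two atom counts, so a spatial grid or neighbor search would be the usual refinement; the brute-force form is kept here for clarity.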

Surface plasmon resonance analysis

The interaction of TCR LLL8 with LLL–HLA-A2 was assessed by surface plasmon resonance (SPR) using a BIAcore T100 biosensor at 25 °C. Biotinylated LLL–HLA-A2 was immobilized on a streptavidin-coated BIAcore SA chip (GE Healthcare) at around 1000 resonance units (RU). The remaining streptavidin sites were blocked with 20 μM biotin solution. An additional flow cell was injected with free biotin alone to serve as a blank control. For analysis of TCR binding, solutions containing different concentrations of LLL8 were flowed sequentially (50 μl/min, 600 s for dissociation) over chips immobilized with LLL–HLA-A2 or the blank. Dissociation constants (K_Ds) were calculated by fitting equilibrium and kinetic data to a 1:1 binding model using BIA evaluation 3.1 software.
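The 1:1 binding model referred to above has a simple closed form, sketched here for orientation. This is the standard Langmuir relationship rather than the BIAevaluation implementation; the function names and the numeric values in the example are hypothetical.

```python
def equilibrium_response(conc, r_max, k_d):
    """Steady-state SPR response for 1:1 binding: R_eq = R_max * C / (K_D + C)."""
    if conc < 0 or k_d <= 0:
        raise ValueError("concentration must be >= 0 and K_D must be > 0")
    return r_max * conc / (k_d + conc)


def kd_from_kinetics(k_on, k_off):
    """Dissociation constant from rate constants: K_D = k_off / k_on."""
    if k_on <= 0:
        raise ValueError("k_on must be positive")
    return k_off / k_on


# Hypothetical values: at an analyte concentration equal to K_D,
# the response is half of R_max.
half_max = equilibrium_response(conc=5e-6, r_max=200.0, k_d=5e-6)
example_kd = kd_from_kinetics(k_on=2.0e4, k_off=0.1)
```

Fitting equilibrium data means choosing `r_max` and `k_d` so that `equilibrium_response` best matches the plateau responses across the injected concentrations; agreement between that value and `k_off / k_on` from the kinetic fit is a common consistency check.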

Computational sequence and structural analysis

Computational mutagenesis was performed using the “interface” mode of Rosetta (v. 2.3)⁸¹ as described previously⁸², which models the mutant residue and calculates predicted energy change (ΔΔG) of TCR–pMHC binding using an optimized energy function. For mutations to amino acids other than alanine or glycine, minimization of proximal residues was permitted (“-min_interface -int_chi” flags in Rosetta) to allow for local side chain movements to accommodate the side chain substitution. Prior to computational mutagenesis calculations, the LLL8–LLL–HLA-A2 complex structure was pre-processed using the FastRelax protocol in Rosetta 3 (weekly release 2021.38)⁸², to perform constrained minimization to remove minor structural aberrations that would potentially bias subsequent Rosetta calculations. The flags used for FastRelax minimization, run with the “relax” executable in Rosetta, are noted below:

-relax:constrain_relax_to_start_coords

-relax:ramp_constraints false

-ex1

-ex2

-use_input_sc

-no_his_his_pairE

-no_optH false

-flip_HNQ

Statistical analysis

Group differences between convalescent and uninfected donors for total CD8⁺ T cells and each subpopulation were compared using separate linear regression models with each T cell subpopulation as the outcome. The main predictor of the model was group, with covariates of age and sex. Statistical trends with time since diagnosis for convalescent donors were analyzed via a mixed-effect model accounting for multiple visits per donor, with covariates of age and sex. P values less than 0.05 were considered significant. Two-tailed t tests comparing differences between convalescent and uninfected donors were normalized for age and sex. All analyses were performed using R version 4.1.0 through the stats package.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data are provided with this paper. The scRNAseq data have been deposited in the NCBI Gene Expression Omnibus (GEO) under accession code GSE227971. Atomic coordinates and structure factors for the LLL8–LLL–HLA-A2 complex have been deposited in the Protein Data Bank under accession code 8DNT.

Code availability

The scripts for the machine learning (ML) model used for LLL-TCR determination are available on GitHub (https://github.com/Weng-lab-NIH/RF_models).

References

Moss, P. The T cell immune response against SARS-CoV-2. Nat. Immunol. 23, 186–193 (2022).
Vardhana, S., Baldo, L., Morice, W. G. 2nd & Wherry, E. J. Understanding T cell responses to COVID-19 is essential for informing public health strategies. Sci. Immunol. 7, eabo1303 (2022).
Kent, S. J. et al. Disentangling the relative importance of T cell responses in COVID-19: leading actors or supporting cast? Nat. Rev. Immunol. 22, 387–397 (2022).
Naranbhai, V. et al. T cell reactivity to the SARS-CoV-2 Omicron variant is preserved in most but not all individuals. Cell 185, 1259 (2022).
Ng, O. W. et al. Memory T cell responses targeting the SARS coronavirus persist up to 11 years post-infection. Vaccine 34, 2008–2014 (2016).
Mathew, D. et al. Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications. Science 369 https://doi.org/10.1126/science.abc8511 (2020).
Grifoni, A. et al. Targets of T cell responses to SARS-CoV-2 Coronavirus in humans with COVID-19 disease and unexposed individuals. Cell https://doi.org/10.1016/j.cell.2020.05.015 (2020).
Luo, M. et al. IL-6 and CD8+ T cell counts combined are an early predictor of in-hospital mortality of patients with COVID-19. JCI Insight 5 https://doi.org/10.1172/jci.insight.139024 (2020).
Diao, B. et al. Reduction and functional exhaustion of T cells in patients with Coronavirus disease 2019 (COVID-19). Front Immunol. 11, 827 (2020).
Bange, E. M. et al. CD8(+) T cells contribute to survival in patients with COVID-19 and hematologic cancer. Nat. Med. 27, 1280–1289 (2021).
Ferretti, A. P. et al. Unbiased screens show CD8(+) T cells of COVID-19 patients recognize shared epitopes in SARS-CoV-2 that largely reside outside the spike protein. Immunity 53, 1095–1107.e1093 (2020).
Le Bert, N. et al. SARS-CoV-2-specific T cell immunity in cases of COVID-19 and SARS, and uninfected controls. Nature 584, 457–462 (2020).
Kared, H. et al. SARS-CoV-2-specific CD8+ T cell responses in convalescent COVID-19 individuals. J. Clin. Invest. 131 https://doi.org/10.1172/JCI145476 (2021).
Nguyen, T. H. O. et al. CD8(+) T cells specific for an immunodominant SARS-CoV-2 nucleocapsid epitope display high naive precursor frequency and TCR promiscuity. Immunity 54, 1066–1082.e1065 (2021).
Shomuradova, A. S. et al. SARS-CoV-2 epitopes are recognized by a public and diverse repertoire of human T cell receptors. Immunity 53, 1245–1257.e1245 (2020).
Habel, J. R. et al. Suboptimal SARS-CoV-2-specific CD8(+) T cell response associated with the prominent HLA-A*02:01 phenotype. Proc. Natl. Acad. Sci. USA 117, 24384–24391 (2020).
Quiros-Fernandez, I., Poorebrahim, M., Fakhr, E. & Cid-Arregui, A. Immunogenic T cell epitopes of SARS-CoV-2 are recognized by circulating memory and naive CD8 T cells of unexposed individuals. EBioMedicine 72, 103610 (2021).
Mateus, J. et al. Selective and cross-reactive SARS-CoV-2 T cell epitopes in unexposed humans. Science 370, 89–94 (2020).
Rodda, L. B. et al. Functional SARS-CoV-2-specific immune memory persists after mild COVID-19. Cell 184, 169–183.e117 (2021).
Poon, M. M. L. et al. SARS-CoV-2 infection generates tissue-localized immunological memory in humans. Sci. Immunol. 6, eabl9105 (2021).
Adamo, S. et al. Signature of long-lived memory CD8(+) T cells in acute SARS-CoV-2 infection. Nature 602, 148–155 (2022).
Gangaev, A. et al. Identification and characterization of a SARS-CoV-2 specific CD8(+) T cell response with immunodominant features. Nat. Commun. 12, 2593 (2021).
Szeto, C. et al. Molecular basis of a dominant SARS-CoV-2 spike-derived epitope presented by HLA-A*02:01 recognised by a public TCR. Cells 10 https://doi.org/10.3390/cells10102646 (2021).
Chaurasia, P. et al. Structural basis of biased T cell receptor recognition of an immunodominant HLA-A2 epitope of the SARS-CoV-2 spike protein. J. Biol. Chem. 297, 101065 (2021).
Wu, D. et al. Structural assessment of HLA-A2-restricted SARS-CoV-2 spike epitopes recognized by public and private T-cell receptors. Nat. Commun. 13, 19 (2022).
Dolton, G. et al. Emergence of immune escape at dominant SARS-CoV-2 killer T cell epitope. Cell 185, 2936–2951.e2919 (2022).
Lineburg, K. E. et al. CD8(+) T cells specific for an immunodominant SARS-CoV-2 nucleocapsid epitope cross-react with selective seasonal coronaviruses. Immunity 54, 1055–1065.e1055 (2021).
Heitmann, J. S. et al. A COVID-19 peptide vaccine for the induction of SARS-CoV-2 T cell immunity. Nature 601, 617–622 (2022).
Heitmann, J. S. et al. Phase I/II trial of a peptide-based COVID-19 T-cell activator in patients with B-cell deficiency. Nat. Commun. 14, 5032 (2023).
Arieta, C. M. et al. The T-cell-directed vaccine BNT162b4 encoding conserved non-spike antigens protects animals from severe SARS-CoV-2 infection. Cell 186, 2392–2409.e2321 (2023).
Goronzy, J. J. & Weyand, C. M. Mechanisms underlying T cell ageing. Nat. Rev. Immunol. 19, 573–583 (2019).
Mittelbrunn, M. & Kroemer, G. Hallmarks of T cell aging. Nat Immunol https://doi.org/10.1038/s41590-021-00927-z (2021).
Lin, Y. et al. Changes in blood lymphocyte numbers with age in vivo and their association with the levels of cytokines/cytokine receptors. Immun. Ageing 13, 24 (2016).
Alpert, A. et al. A clinically meaningful metric of immune age derived from high-dimensional longitudinal monitoring. Nat. Med. 25, 487–495 (2019).
Grassmann, S. et al. Early emergence of T central memory precursors programs clonal dominance during chronic viral infection. Nat. Immunol. 21, 1563–1573 (2020).
Sun, X. et al. Longitudinal analysis reveals age-related changes in the T cell receptor repertoire of human T cell subsets. J. Clin. Invest. https://doi.org/10.1172/JCI158122 (2022).
Qi, Q. et al. Diversity and clonal selection in the human T-cell repertoire. Proc. Natl. Acad. Sci. USA 111, 13139–13144 (2014).
McElhaney, J. E. et al. The immune response to influenza in older humans: beyond immune senescence. Immun. Ageing 17, 10 (2020).
Dugan, H. L., Henry, C. & Wilson, P. C. Aging and influenza vaccine-induced immunity. Cell Immunol. 348, 103998 (2020).
Carrasco, E. et al. The role of T cells in age-related diseases. Nat. Rev. Immunol. 22, 97–111 (2022).
Koff, W. C. & Williams, M. A. Covid-19 and immunity in aging populations - A new research agenda. N. Engl. J. Med. https://doi.org/10.1056/NEJMp2006761 (2020).
Rydyznski Moderbacher, C. et al. Antigen-specific adaptive immunity to SARS-CoV-2 in acute COVID-19 and associations with age and disease severity. Cell 183, 996–1012.e1019 (2020).
Pawelec, G. & McElhaney, J. Unanticipated efficacy of SARS-CoV-2 vaccination in older adults. Immun. Ageing 18, 7 (2021).
Weng, N. P. & Pawelec, G. Validation of the effectiveness of SARS-CoV-2 vaccines in older adults in “real-world” settings. Immun. Ageing 18, 36 (2021).
Lewis, S. A. et al. Differential dynamics of peripheral immune responses to acute SARS-CoV-2 infection in older adults. Nat. Aging 1, 1038–1052 (2021).
Hyun-Jung Lee, C. & Koohy, H. In silico identification of vaccine targets for 2019-nCoV. F1000Res 9, 145 (2020).
Ahmed, S. F., Quadeer, A. A. & McKay, M. R. Preliminary identification of potential vaccine targets for the COVID-19 Coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies. Viruses 12 https://doi.org/10.3390/v12030254 (2020).
Grifoni, A. et al. A sequence homology and bioinformatic approach can predict candidate targets for immune responses to SARS-CoV-2. Cell Host Microbe 27, 671–680.e672 (2020).
Szeto, C. et al. The presentation of SARS-CoV-2 peptides by the common HLA-A(*)02:01 molecule. iScience 24, 102096 (2021).
Seow, J. et al. Longitudinal observation and decline of neutralizing antibody responses in the three months following SARS-CoV-2 infection in humans. Nat. Microbiol 5, 1598–1607 (2020).
Dash, P. et al. Quantifiable predictive features define epitope-specific T cell receptor repertoires. Nature 547, 89–93 (2017).
Li, H. M. et al. TCRbeta repertoire of CD4+ and CD8+ T cells is distinct in richness, distribution, and CDR3 amino acid composition. J. Leukoc. Biol. 99, 505–513 (2016).
Yin, Y., Li, Y. & Mariuzza, R. A. Structural basis for self-recognition by autoimmune T-cell receptors. Immunol. Rev. 250, 32–48 (2012).
Wu, D., Gowthaman, R., Pierce, B. G. & Mariuzza, R. A. T cell receptors employ diverse strategies to target a p53 cancer neoantigen. J. Biol. Chem. 298, 101684 (2022).
Rudolph, M. G., Stanfield, R. L. & Wilson, I. A. How TCRs bind MHCs, peptides, and coreceptors. Annu Rev. Immunol. 24, 419–466 (2006).
Pierce, B. G. & Weng, Z. A flexible docking approach for prediction of T cell receptor-peptide-MHC complexes. Protein Sci. 22, 35–46 (2013).
Kortemme, T., Kim, D. E. & Baker, D. Computational alanine scanning of protein-protein interfaces. Sci. STKE 2004, pl2 (2004).
Gowthaman, R. & Pierce, B. G. TCR3d: The T cell receptor structural repertoire database. Bioinformatics 35, 5323–5325 (2019).
Garboczi, D. N. et al. Structure of the complex between human T-cell receptor, viral peptide and HLA-A2. Nature 384, 134–141 (1996).
Borbulevych, O. Y., Santhanagopolan, S. M., Hossain, M. & Baker, B. M. TCRs used in cancer gene therapy cross-react with MART-1/Melan-A tumor antigens via distinct mechanisms. J. Immunol. 187, 2453–2463 (2011).
Feng, D., Bond, C. J., Ely, L. K., Maynard, J. & Garcia, K. C. Structural evidence for a germline-encoded T cell receptor-major histocompatibility complex interaction ‘codon’. Nat. Immunol. 8, 975–983 (2007).
Marrack, P., Scott-Browne, J. P., Dai, S., Gapin, L. & Kappler, J. W. Evolutionarily conserved amino acids that control TCR-MHC interaction. Annu. Rev. Immunol. 26, 171–203 (2008).
Harris, D. T. et al. An engineered switch in T cell receptor specificity leads to an unusual but functional binding geometry. Structure 24, 1142–1154 (2016).
Cole, D. K. et al. Dual molecular mechanisms govern escape at immunodominant HLA A2-restricted HIV epitope. Front Immunol. 8, 1503 (2017).
Coles, C. H. et al. TCRs with distinct specificity profiles use different binding modes to engage an identical peptide-HLA complex. J. Immunol. 204, 1943–1953 (2020).
Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob. Chall. 1, 33–46 (2017).
Paniskaki, K. et al. Low avidity circulating SARS-CoV-2 reactive CD8+ T cells with proinflammatory TEMRA phenotype are associated with post-acute sequelae of COVID-19. Front Microbiol 14, 1196721 (2023).
Oberhardt, V. et al. Rapid and stable mobilization of CD8(+) T cells by SARS-CoV-2 mRNA vaccine. Nature 597, 268–273 (2021).
Neidleman, J. et al. mRNA vaccine-induced SARS-CoV-2-specific T cells recognize B.1.1.7 and B.1.351 variants but differ in longevity and homing properties depending on prior infection status. bioRxiv https://doi.org/10.1101/2021.05.12.443888 (2021).
Yu, E. D. et al. Development of a T cell-based immunodiagnostic system to effectively distinguish SARS-CoV-2 infection and COVID-19 vaccination status. Cell Host Microbe 30, 388–399.e383 (2022).
Glanville, J. et al. Identifying specificity groups in the T cell receptor repertoire. Nature 547, 94–98 (2017).
Wu, D., Efimov, G. A., Bogolyubova, A. V., Pierce, B. G. & Mariuzza, R. A. Structural insights into protection against a SARS-CoV-2 spike variant by T cell receptor diversity. J. Biol. Chem. 299, 103035 (2023).
Ndhlovu, Z. M. et al. Development of an artificial-antigen-presenting-cell-based assay for the detection of low-frequency virus-specific CD8(+) T cells in whole blood, with application for measles virus. Clin. Vaccin. Immunol. 16, 1066–1073 (2009).
Schattgen, S. A. et al. Integrating T cell receptor sequences and transcriptional profiles by clonotype neighbor graph analysis (CoNGA). Nat. Biotechnol. 40, 54–63 (2022).
Wu, D., Gallagher, D. T., Gowthaman, R., Pierce, B. G. & Mariuzza, R. A. Structural basis for oligoclonal T cell recognition of a shared p53 cancer neoantigen. Nat. Commun. 11, 2908 (2020).
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr D. Biol. Crystallogr 69, 1204–1214 (2013).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl Crystallogr 40, 658–674 (2007).
Miller, P. J. et al. Single MHC mutation eliminates enthalpy associated with T cell receptor binding. J. Mol. Biol. 373, 315–327 (2007).
Gowthaman, R. & Pierce, B. G. TCRmodel: high resolution modeling of T cell receptors from sequence. Nucleic Acids Res. 46, W396–W401 (2018).
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr D. Biol. Crystallogr 53, 240–255 (1997).
Kortemme, T. & Baker, D. A simple physical model for binding energy hot spots in protein-protein complexes. Proc. Natl. Acad. Sci. USA 99, 14116–14121 (2002).
Khatib, F. et al. Algorithm discovery by protein folding game players. Proc. Natl. Acad. Sci. USA 108, 18949–18953 (2011).

Acknowledgements

This work was supported by the Intramural Research Program of the National Institutes of Health, National Institute on Aging (to N-P. Weng), by National Institutes of Health Grants GM126299 and GM144083 (to B.G. Pierce), AI129893 (to R.A. Mariuzza), and R44CA265468 (to G. Shi), and by National Natural Science Foundation of China Grant 32100985 (to D. Wu). Structure results in this report are based on work performed at the GM/CA beamline at the Advanced Photon Source of Argonne National Laboratory, which is funded by the National Cancer Institute (ACB-12002) and the National Institute of General Medical Sciences (AGM-12006, P30GM138396). This work utilized computational resources of the NIH HPC Biowulf cluster (http://hpc.nih.gov) and the University of Maryland Institute for Bioscience and Biotechnology Research High Performance Computing Cluster. We acknowledge the NIAID Tetramer Core Facility for providing HLA-A2 tetramers. Identification of commercial materials and equipment does not imply recommendation or endorsement by the National Institute of Standards and Technology.

Funding

Open Access funding provided by the National Institutes of Health (NIH).

Author information

These authors contributed equally: Cecily Choy, Joseph Chen, Nan-Ping Weng.

Authors and Affiliations

Laboratory of Molecular Biology and Immunology, National Institute on Aging, NIH, Baltimore, MD, USA
Cecily Choy, Joseph Chen, Jiangyuan Li, Jian Lu, Ainslee Zou, Humza Hemani, Beverly A. Baptiste, Emily Wichmann, Qian Yang, Jeffrey Ciffelo, Tonya Wallace, Christopher Dunn, Cuong Nguyen & Nan-Ping Weng
National Institute of Standards and Technology (NIST), Gaithersburg, MD, USA
D. Travis Gallagher
W.M. Keck Laboratory for Structural Biology, University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
Daichao Wu, Rui Yin, Brian G. Pierce & Roy Mariuzza
Laboratory of Clinical Investigation, National Institute on Aging, NIH, Baltimore, MD, USA
Julia McKelvy, Denise Melvin, Chee W. Chia & Josephine Egan
Computational Biology and Genomics Core, Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
Jinshui Fan & Supriyo De
Translational Gerontology Branch, National Institute on Aging, NIH, Baltimore, MD, USA
Jeannie Ruffolo, Linda Zukley & Luigi Ferrucci
Diagnologix LLC, San Diego, CA, USA
Guixin Shi & Yu-Tsueng Liu
Elixirgen Therapeutics, Inc, Baltimore, MD, USA
Tomokazu Amano & Minoru S. H. Ko
Laboratory of Behavioral Neuroscience, National Institute on Aging, NIH, Baltimore, MD, USA
Yang An
Laboratory of Epidemiology & Population Sciences, National Institute on Aging, NIH, Baltimore, MD, USA
Osorio Meirelles
Facility for Biotechnology Resources, CBER, Food and Drug Administration, Silver Spring, MD, USA
Wells W. Wu, Chao-Kai Chou & Rong-Fong Shen
NIH Tetramer Core Facility at Emory University, Atlanta, GA, USA
Richard A. Willis

Contributions

C.C., J.L. and J.L. carried out most wet-lab experiments. J.C. conducted most of the scRNAseq, scTCRseq and statistical analyses. H.H. and J.C. assisted with computational analysis. A.Z., Q.Y., B.B. and E.W. assisted with cell isolation and/or flow analysis. J.M., D.M., J.R. and L.Z. were responsible for recruiting donors and obtaining blood samples. T.W., C.D. and C.N. assisted with flow cytometry analysis and cell sorting. C.W.C., L.F. and J.M.E. assisted with clinical data, protocols and advice. G.S. and Y-T.L. assisted with microbubbles for in vitro stimulation. T.A. and M.S.H.K. designed the peptide pools of SARS-CoV-2 proteins. A.Y. and O.M. helped with statistical analysis. J.S.F., W.W.W., C-K.C., R-F.S. and S.D. helped with sequencing. R.W. provided technical assistance with making tetramers. D.T.G. and D.W. performed the crystallography and structural analyses, and R.Y. performed structural analysis and modeling. B.G.P. and R.A.M. conceived and supervised the structure project. J.M.E., L.F. and N-P.W. conceived and supervised the project. C.C., J.C., D.T.G., R.A.M. and N-P.W. wrote the manuscript. All authors prepared the paper.

Corresponding author

Correspondence to Nan-Ping Weng.

Ethics declarations

Competing interests

Y-T.L. is the founder of Diagnologix LLC and G.S. works for Diagnologix LLC. M.S.H.K. is the founder of Elixirgen Therapeutics, Inc and T.A. works for Elixirgen Therapeutics, Inc. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jia-huai Wang, Paul Moss and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Choy, C., Chen, J., Li, J. et al. SARS-CoV-2 infection establishes a stable and age-independent CD8⁺ T cell response against a dominant nucleocapsid epitope using restricted T cell receptors. Nat Commun 14, 6725 (2023). https://doi.org/10.1038/s41467-023-42430-z

Received: 04 April 2023
Accepted: 11 October 2023
Published: 23 October 2023
DOI: https://doi.org/10.1038/s41467-023-42430-z
