The T cell receptor (TCR), a protein complex expressed on the surface of T cells, plays a critical role in the adaptive immune system by recognizing and binding to specific antigen peptides1. TCRs are highly diverse, allowing T cells to recognize and attack a wide range of antigens presented by pathogen-infected cells and cancer cells. A given antigen peptide triggers a specific set of TCRs, resulting in a targeted immune response; however, peptides differ widely in their motifs and in the number of TCRs known to bind them. Predicting which TCRs can bind to a specific antigen peptide has broad clinical applications, such as large-scale screening of potential TCR targets for cancer neoantigen therapy2. In this issue of Nature Machine Intelligence, Gao et al. propose PanPep3, a meta-learning-based framework that addresses the TCR–antigen binding recognition problem for any type of antigen peptide. In particular, the method can predict binding to antigens that have never been seen by the immune system. The study could also motivate the development of meta-learning approaches for other small-data bioinformatics problems.

The prediction of peptide-specific TCR binding, like many other bioinformatics problems, is challenged by the long-tail distribution of TCR binding data (Fig. 1a): a small number of peptides have many known binding TCRs, while many peptides have only a few. Predictions may therefore be highly biased toward the few peptides that dominate the training data, with low utility for the majority of peptides for which insufficient or no training data are available. As a result, existing tools perform poorly for peptides in the long-tail region — antigens with only a few known binding TCRs and previously unseen antigens — in what are known as the few-shot and zero-shot learning problems, respectively. In recent years, various machine learning approaches have been developed to address long-tail problems, such as transfer learning4, domain adaptation methods5 and, most notably, meta-learning.
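To make the setting concrete, the short Python sketch below buckets peptides into the majority and few-shot regimes by simply counting their known binding TCRs; the (peptide, CDR3) pairs are hypothetical placeholders, and the threshold of ten follows the few-shot definition used later in this piece.

```python
# A minimal sketch of how the long-tail structure shows up in training data:
# count the known binding TCRs per peptide and bucket peptides into the regimes
# discussed in this piece. The (peptide, CDR3) pairs below are hypothetical
# placeholders; the threshold of 10 follows the few-shot definition used later.
from collections import Counter

known_pairs = [
    ("GILGFVFTL", "CASSIRSSYEQYF"),
    ("GILGFVFTL", "CASSSRSSYEQYF"),
    ("NLVPMVATV", "CASSLAPGATNEKLFF"),
    # ... thousands more (peptide, CDR3) pairs in a real training set
]

tcrs_per_peptide = Counter(peptide for peptide, _ in known_pairs)

majority = [p for p, n in tcrs_per_peptide.items() if n >= 10]     # many known TCRs
few_shot = [p for p, n in tcrs_per_peptide.items() if 0 < n < 10]  # few known TCRs
# Zero-shot peptides never appear in the training data, so they fall in neither bucket.

print(f"{len(majority)} majority peptides, {len(few_shot)} few-shot peptides")
```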

Fig. 1: The PanPep workflow of meta-learning augmented with a neural Turing machine.

a, Definition of the long-tail distribution problem and potential bioinformatics applications using meta-learning. b, Model workflow. For peptide-specific TCR binding prediction, the input comprises a peptide sequence and the CDR3 region of a TCR protein sequence that binds to antigens. The output indicates whether this peptide–TCR pair represents a bona fide biological interaction. In the meta-training stage, a model is trained on a series of peptide-specific TCR binding tasks to obtain peptide-specific learners and optimize a meta-learner (with model-agnostic meta-learning, MAML). In the meta-testing stage, the meta-learner is fine-tuned on new binding recognition tasks for peptides with a large number of supporting TCRs (majority setting) or with a small number of supporting TCRs (few-shot setting). The neural Turing machine (NTM) maps a peptide embedding to an embedding space using the read head and extracts the peptide-specific learners using the write head. The NTM memory stores the mapping between the peptide embeddings and the disentangled learners, and can generate new peptide-specific learners through memory retrieval for unseen peptides in the zero-shot setting.

The concept of meta-learning has evolved over time, and this continues to be an active area of machine learning research. Initially, the meta-learning approach aimed to improve algorithm performance by sharing information across tasks and learning how best to apply existing learning algorithms to new problems. Over time, the focus has shifted toward developing models that can quickly adapt to new tasks with limited data, as is the case in PanPep. Training a meta-learner typically requires two stages: meta-training and meta-testing. During meta-training, the model is exposed to a variety of different tasks to learn a general problem-solving strategy that can be applied to new tasks. During meta-testing, the model is presented with a new task and uses the knowledge gained during meta-training to quickly adapt to it and solve it.
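The sketch below illustrates this two-stage loop on toy data, using a first-order approximation of optimization-based meta-learning for brevity; the model architecture, data encodings and hyperparameters are placeholders and do not reflect PanPep's actual implementation.

```python
# A toy illustration of the meta-training/meta-testing loop described above,
# using a first-order approximation of optimization-based meta-learning (the
# inner/outer-loop structure of MAML without second-order gradients). The model,
# data encodings and hyperparameters are placeholders, not PanPep's architecture.
import copy
import torch
import torch.nn as nn

loss_fn = nn.BCEWithLogitsLoss()

def make_model(dim=32):
    # Toy binary classifier over a fixed-length numeric encoding of a peptide-TCR pair.
    return nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

def inner_adapt(meta_model, x, y, steps=5, lr=1e-2):
    """Clone the meta-learner and fine-tune it on one peptide-specific task."""
    learner = copy.deepcopy(meta_model)
    opt = torch.optim.SGD(learner.parameters(), lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(learner(x).squeeze(-1), y).backward()
        opt.step()
    return learner

def meta_train(meta_model, tasks, meta_lr=1e-3, epochs=50):
    """tasks: list of (support_x, support_y, query_x, query_y), one per peptide."""
    meta_opt = torch.optim.Adam(meta_model.parameters(), lr=meta_lr)
    for _ in range(epochs):
        meta_opt.zero_grad()
        for sx, sy, qx, qy in tasks:
            learner = inner_adapt(meta_model, sx, sy)          # adapt on the support set
            query_loss = loss_fn(learner(qx).squeeze(-1), qy)  # evaluate on the query set
            grads = torch.autograd.grad(query_loss, learner.parameters())
            # First-order shortcut: apply the adapted learner's gradients to the
            # meta-parameters, accumulating across tasks before the meta-update.
            for p, g in zip(meta_model.parameters(), grads):
                p.grad = g.clone() if p.grad is None else p.grad + g
        meta_opt.step()
    return meta_model

# Meta-training on toy peptide-specific tasks with random encodings and labels.
def toy_task(n_support=15, n_query=5, dim=32):
    n = n_support + n_query
    x, y = torch.randn(n, dim), torch.randint(0, 2, (n,)).float()
    return x[:n_support], y[:n_support], x[n_support:], y[n_support:]

meta_model = meta_train(make_model(), [toy_task() for _ in range(4)])
# Meta-testing: adapt to a new peptide from only a few supporting TCRs (few-shot setting).
support_x, support_y = torch.randn(8, 32), torch.randint(0, 2, (8,)).float()
task_learner = inner_adapt(meta_model, support_x, support_y)
```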

To tackle the long-tail distribution problem, PanPep employs meta-learning in three settings: (i) the majority setting, for peptides with a large number of known binding TCRs, (ii) few-shot learning, for peptides with a small number (<10) of known binding TCRs and (iii) zero-shot learning, for peptides not present in the training data. PanPep applies a widely used optimization-based adaptation method, model-agnostic meta-learning (MAML6), to the majority and few-shot settings. Specifically, in the meta-training stage, the model is trained on a set of peptide-specific TCR binding tasks to obtain a series of peptide-specific learners and optimize the meta-learner. Then, in the meta-testing stage, the meta-learner is fine-tuned on a new peptide-specific binding recognition task. To handle the zero-shot setting, PanPep introduces a disentanglement distillation module. A mapping between peptide embeddings and the peptide-specific learners is constructed based on a neural Turing machine (NTM)7 (Fig. 1b). The read head of the NTM maps a peptide embedding to a new embedding space called the peptide-specific learner generation space (PLGS), and the write head extracts the peptide-specific learners. The NTM's memory stores the mapping between the peptide embeddings and the extracted peptide-specific learners. This NTM-based module is trained on all the peptide-specific learners through knowledge distillation. In this way, PanPep can extend the few-shot setting to the zero-shot setting. When an unseen peptide arises, the trained read head maps it into the PLGS, and the resulting embedding is used to retrieve from memory and generate a new peptide-specific learner for inference on the unseen peptide. This innovative design makes PanPep a powerful tool for predicting TCR binding specificity for antigens with few known binding TCRs or none at all.
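The following sketch conveys the flavour of the content-based memory retrieval that an NTM read head performs for an unseen peptide; the shapes, the softmax sharpening and the idea of storing flattened learner parameters as memory values are illustrative assumptions rather than PanPep's exact design.

```python
# A minimal sketch of content-based memory retrieval in the spirit of an NTM read
# head: an unseen peptide's embedding is compared against stored peptide keys and
# the associated "learner" vectors are blended by similarity. The shapes, the
# softmax sharpening and the idea of storing flattened learner parameters as
# memory values are illustrative assumptions rather than PanPep's exact design.
import torch
import torch.nn.functional as F

def read_memory(query, keys, values, beta=10.0):
    """query: (d,) peptide embedding mapped into the learner-generation space;
    keys: (m, d) stored peptide embeddings; values: (m, k) stored learner vectors."""
    sims = F.cosine_similarity(query.unsqueeze(0), keys, dim=-1)  # (m,) similarities
    weights = F.softmax(beta * sims, dim=0)                       # sharpened attention
    return weights @ values                                       # (k,) blended learner

# Toy usage: 8 memory slots, 16-dimensional peptide keys, 32-dimensional learner vectors.
keys, values = torch.randn(8, 16), torch.randn(8, 32)
unseen_peptide_embedding = torch.randn(16)
new_learner = read_memory(unseen_peptide_embedding, keys, values)
```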

In evaluations on curated independent data, PanPep achieved excellent performance on various peptide-specific tasks, especially for unseen peptides. Furthermore, Gao et al. demonstrate PanPep's utility in several clinical applications. PanPep's output scores showed a relatively high correlation with clonal T cell expansion ratios, suggesting its potential to provide accurate binding identification for clonal T cells. In neoantigen therapy, PanPep effectively identified immune-responsive T cells and detected neoantigen-reactive T cell signatures, which may help improve adoptive cell transfer (ACT)-based tumour immunotherapy. In a COVID-19 study, PanPep showed a substantial improvement over three other tools in recognizing peptide-specific TCRs. Moreover, it provided interpretability by unveiling the nature of peptide–TCR interactions through protein structure modelling. Finally, PanPep displayed high computational efficiency.

It is often perceived that deep learning requires massive datasets of labelled training samples to be effective. However, labelled data may be sparse in many real-world applications, especially in biological and medical areas. One class of ‘small data’ cases is the long-tail problem. The work by Gao et al. represents a promising application of meta-learning in addressing long-tail distribution problems in bioinformatics. In particular, PanPep fills the gap in handling the zero-shot setting for TCR binding specificity prediction by integrating the meta-learning modules with disentanglement distillation. Although one could build a task-blind model that takes the peptide and TCR CDR3 sequences as input for all tasks, such an approach assumes that the data across tasks are independent and identically distributed (i.i.d.), which is not the case for peptide-specific tasks. The meta-learning and NTM-based disentanglement distillation proposed in PanPep take advantage of the peptide-specific data distributions to adapt well to new tasks. These methods may be further improved using newer machine learning methods, such as the NTM's successor, the differentiable neural computer8. It may also be beneficial to use graph neural networks to represent all the peptides and TCRs by leveraging the global relationships among them.

PanPep can potentially serve as a general framework for many new bioinformatics applications. It may be extended to tackle other peptide binding prediction tasks that are subject to long-tail distribution problems, such as peptide–HLA binding prediction and kinase-specific phosphorylation-site prediction9. The few-shot meta-learning methods may also be applicable to protein function prediction, such as protein localization prediction10 and Enzyme Commission number prediction. PanPep also has some limitations and faces new challenges. One limitation is that the proposed method did not outperform existing methods in the majority setting, that is, for predictions where ample training data are available. This may be because the meta-learner must balance across all tasks to ensure that the model generalizes well to new ones. Therefore, additional regularization or hyperparameter selection techniques may be needed to ensure optimal training in the majority setting. It is also noteworthy that even though the peptides are ‘unseen’ during training, the TCRs may have been ‘seen’ in other peptide–TCR binding pairs during training. It would be interesting to assess the performance difference between the unknown-peptide–known-TCR scenario and the unknown–unknown scenario. In summary, PanPep demonstrates the great promise of using meta-learning to address long-tail distribution problems in bioinformatics. We anticipate that many new meta-learning methods will be developed for a wide range of bioinformatics applications.