General Prediction of Peptide-MHC Binding Modes Using Incremental Docking: A Proof of Concept

Antunes, Dinler A.; Devaurs, Didier; Moll, Mark; Lizée, Gregory; Kavraki, Lydia E.

doi:10.1038/s41598-018-22173-4

Download PDF

Article
Open access
Published: 12 March 2018

General Prediction of Peptide-MHC Binding Modes Using Incremental Docking: A Proof of Concept

Scientific Reports volume 8, Article number: 4327 (2018) Cite this article

13k Accesses
32 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The class I major histocompatibility complex (MHC) is capable of binding peptides derived from intracellular proteins and displaying them at the cell surface. The recognition of these peptide-MHC (pMHC) complexes by T-cells is the cornerstone of cellular immunity, enabling the elimination of infected or tumoral cells. T-cell-based immunotherapies against cancer, which leverage this mechanism, can greatly benefit from structural analyses of pMHC complexes. Several attempts have been made to use molecular docking for such analyses, but pMHC structure remains too challenging for even state-of-the-art docking tools. To overcome these limitations, we describe the use of an incremental meta-docking approach for structural prediction of pMHC complexes. Previous methods applied in this context used specific constraints to reduce the complexity of this prediction problem, at the expense of generality. Our strategy makes no assumption and can potentially be used to predict binding modes for any pMHC complex. Our method has been tested in a re-docking experiment, reproducing the binding modes of 25 pMHC complexes whose crystal structures are available. This study is a proof of concept that incremental docking strategies can lead to general geometry prediction of pMHC complexes, with potential applications for immunotherapy against cancer or infectious diseases.

Opening opportunities for Kd determination and screening of MHC peptide complexes

Article Open access 23 May 2022

HLA3DB: comprehensive annotation of peptide/HLA complexes enables blind structure prediction of T cell epitopes

Article Open access 10 October 2023

TCRpMHCmodels: Structural modelling of TCR-pMHC class I complexes

Article Open access 10 October 2019

Introduction

The so-called cellular immune response¹ is based on a specific recognition system that is present in virtually every nucleated cell in the organism. As part of regular intracellular protein synthesis, some proteins are marked for degradation and proteolytically cleaved into smaller fragments (called peptides) that are then displayed at the cell surface². The key molecules in this process are specialized protein-receptors known as class I major histocompatibility complexes (MHCs)^1,2, which bind and display these intracellular peptides. Thanks to this peptide-presenting pathway, T-cell lymphocytes that circulate throughout the body scanning cell surfaces can monitor the intracellular content in almost every tissue. This allows for immune recognition of diseased cells (e.g., infected or tumoral). These peptide-MHC (pMHC) complexes (Fig. 1) can then be recognized by direct interaction with T-cell receptors (TCRs), activating T-cell cytolysis and triggering the elimination of the diseased cell³. Note that class II MHC molecules have a distinct structure², are limited in expression to specialized immune cells, and are involved in a different pathway; they will not be discussed here.

Since a given class I MHC molecule can only bind a subset of existing peptides⁴, and since viral proteins have high mutation rates (yielding ever-changing peptide pools), MHC diversity became essential for the survival of the host population¹. In fact, the MHC region is the most variable segment of the entire human genome: there are more than 8,000 known protein variants (or allotypes) of class I MHCs in the human population⁵, with up to 6 different allotypes per individual¹. Besides its importance for anti-viral immunity and vaccine development, the recognition of pMHC complexes is a key factor in autoimmunity, response to tissue transplantation, and immunity against tumors^6,7. In recent years, analyses of tumor-derived peptides capable of binding to patient-specific MHCs have played an essential role in the development of personalized immunotherapies against cancer⁷. Although many other molecules are involved in intercellular interactions between T-cells and tumor cells, the structural and biochemical properties of a given pMHC complex represent the central recognition feature and the most important information provided to the T-cell^6,8 (Fig. 1). In the context of immunotherapy, this information will define the chances that activated T-cells find and eliminate cancer cells throughout the body⁹; it will also determine the occurrence of potentially lethal off-target toxicities against healthy tissues (referred to as T-cell cross-reactivity)^9,10,11.

Given the importance of pMHC structural information in driving the cellular immune response, and the limitations of experimental methods for structural analyses of proteins, the structural prediction of pMHC complexes has been a desired goal in bioinformatics for over a decade^{12,13,14,15,16,17}. Computational methods such as molecular docking are the most promising tools for this task, given their efficiency and broad use for virtual screening of drug-like ligands^18,19,20. Molecular docking allows for the computational prediction of the three-dimensional structure of protein-ligand complexes (i.e., their binding mode)²¹. Since ligands are flexible molecules that adopt alternative conformations (i.e., different “shapes”), docking tools must consider a ligand’s rotatable bonds (i.e., its internal degrees of freedom, or DoFs), in addition to its position and orientation.

The high-dimensionality of the docking problem prevents an exhaustive exploration of all the DoFs of a ligand at once. Therefore, docking methods implement various heuristics to efficiently explore the ligand’s conformational space and quickly find a low-energy docked conformation of the ligand. For that, binding mode prediction is guided by approximated binding energy calculations (through what is called a scoring function)²¹. Ideally, a docking tool should also be general, in the sense that the accuracy of the predictions should not be impacted by the type of protein receptor or the class of ligands. However, docking methods are known to be much less reliable when applied to larger ligands (e.g., ligands with more than 10 internal DoFs)^22,23. For instance, peptides are known to be very flexible ligands²⁴; binding mode prediction of even small peptides, composed of up to 5 amino acids (which means around 24 internal DoFs), can be particularly challenging for available docking methods^25,26. This limitation makes the structural prediction of pMHC complexes an impossible task for most docking tools, since a typical MHC-binder is a peptide composed of 8 to 11 amino acids (which translates to more than 30 internal DoFs).

It is worth noting that molecular docking can be used with two distinct objectives: (i) structure-based binding affinity estimation, or (ii) geometry prediction (also referred to as geometry optimization)^14,27. For instance, recent publications use molecular docking or other structural analyses as part of broader strategies to identify and select MHC-binders (which is known as epitope prediction)^28,29 or to estimate MHC binding affinity^30,31,32. Although they also involve some level of structural prediction, these applications are focused on affinity estimations or approximated ranking of peptides, and are not primarily concerned with providing an accurate 3D model of the pMHC complex. Having a tool for accurate geometry prediction of pMHC complexes would improve the results of structure-based binding affinity predictions, and would enable a number of biomedically-relevant analyses that are not currently available (e.g., pMHC complex stability assessment, structure-based cross-reactivity prediction, etc).

As reviewed in previous publications^14,17, early attempts at predicting the geometry of pMHC structures usually divided the problem into smaller tasks, as a way to circumvent the limitations of docking tools. For instance, some methods focus on docking the two terminal residues of the peptide, resolving the central portion later^13,14. Others first approximate the backbone conformation, and then predict the side chains of the peptide^12,33,34,35. To make these divisions and approximations, these methods require ad-hoc constraints and are usually tailored towards specific MHC allotypes. There also exist methods relying on steps of molecular dynamics (MD)^36,37. MD can be useful to explore near native conformations of side-chains, but its higher computational cost makes it a less attractive solution for efficient exploration of the entire peptide conformational space³⁸. An attempt at a more efficient and yet general solution made use of grid potentials and a biased-probability Monte Carlo method¹⁴, implemented in the Internal Coordinate Mechanics (ICM) docking tool³⁹. This method uses an MHC-specific scoring function trained on available crystal structures via statistical learning. Despite promising results on a small number of known pMHC complexes, the choice of a more specific scoring function and the assumptions on the location of the peptide’s terminal amino acid residues raise questions about the generality of the method towards less prevalent MHC allotypes.

The combination of ICM docking and biased Monte Carlo optimization was also later implemented in pDOCK¹⁶, and validated on a larger dataset¹⁶. However, this validation focused on describing the average error of the peptide backbone only, without a broader discussion on the accuracy of side chain predictions. The latest published tool for pMHC structural prediction is DockTope¹⁷, which uses a protocol based on molecular docking and energy minimization³⁵. DockTope was validated on a large dataset of pMHC structures and was the first docking-based method for pMHC prediction to be made available as a webserver. However, it currently provides predictions for only 4 MHC allotypes because it approximates the backbone conformation using allotype-specific patterns from available crystal structures.

An approach similar to that of DockTope was described using the Rosetta FlexPepDock refinement protocol⁴⁰. The authors used available crystal structures of pMHCs as template for peptide backbone conformations, manually positioning the side chains of anchor residues in the expected locations within the binding site. This is justified by the fact that some peptide amino acid residues are known to stabilize the binding by interacting with deeper pockets in the MHC cleft⁴¹. Then, they conducted a backbone optimization step, followed by side chain prediction. Interestingly, good results were obtained for 5 selected allotypes, even when the template was from a different allotype. However, all reported examples involved complexes that had similar backbone conformations. In addition, the use of backbone templates and assumptions on the position of anchor residues represent important limitations of this method, since they might differ across different groups of MHC allotypes^35,42. Even for a given MHC allotype, the peptide backbone changes significantly depending on its length. There is also evidence of peptides presenting alternative anchors^42,43 or unusual binding modes⁴⁴. The FlexPepDock refinement protocol was only applied to 9-mer peptides and a limited number of MHC allotypes. Therefore, it is not yet clear how general this method can be, with respect to these limitations.

Note that most of the aforementioned methods aimed specifically at pMHC predictions. Increasing computational power and growing biomedical interest in peptide ligands and peptide-based inhibitors^45,46 have fueled the development of new tools for protein-peptide docking in general^47,48. Some of these tools have been applied to pMHC complexes^49,50, or validated on datasets including pMHC complexes^51,52,53. However, available results are insufficient to make claims on the accuracy and generality of these methods for pMHC structural prediction. For instance, many of these tools were tested using PeptiDB⁵⁴, a dataset of protein-peptide complexes, which, although diverse, includes only one class I MHC-restricted complex. Finally, there are promising tools for de novo prediction of protein-peptide complexes⁴⁷, such as Rosetta FlexPepDock ab-initio⁵⁵, and HADDOCK peptide docking⁵⁶, which could be applied to pMHC modeling. However, no evaluation of these tools on pMHC complexes has yet been published.

To sum up, to the best of our knowledge, there is yet no general tool for the reliable geometry prediction of pMHC complexes, considering different peptide lengths and MHC allotypes. With the aim of developing a general tool for de novo pMHC geometry prediction, in this paper we describe a proof of concept study using an incremental meta-docking approach referred to as DINC (Docking INCrementally)²³. DINC was previously developed by our group to predict the binding modes of peptidomimetic inhibitors⁵⁷, based on a divide and conquer approach. In contrast to the methods described above, DINC makes no assumption on the location of particular amino acid residues or the shape of the peptide backbone. To evaluate DINC’s applicability and generality in the context of prevalent human MHC allotypes, we performed a re-docking experiment on a diverse dataset of 25 pMHC complexes with available crystal structures. DINC was able to reproduce the binding modes of these complexes with an average error of 1.92 Å. Our results also show the ability of this incremental method to reproduce non-standard binding modes. Finally, we discuss the benefits of having a general tool for pMHC geometry prediction in the growing field of cancer immunotherapy.

Methods

Dataset selection

A total of 25 crystal structures of pMHC complexes restricted to human MHC allotypes were selected from the Protein Data Bank (PDB), as listed in the Supplementary Table S1. In humans, MHC receptors are also referred to as Human Leukocyte Antigens (HLAs)¹. When defining our dataset we prioritized (i) the diversity of peptide sequence and length, (ii) the high prevalence of the HLA allotype in the human population and (iii) the high resolution of the crystal structure. In addition, to analyze an example of T-cell cross-reactivity¹⁰, we included the recently-determined complexes MAGEA3/HLA-A*01:01 (PDB code 5BRZ) and Titin/HLA-A*01:01 (5BSO). The HLA-B allotypes HLA-B*57:01 and HLA-B*57:03 were also included in our dataset, given their great interest for biomedical purposes (e.g., their role in natural immunity against HIV and HCV)^58,59 and their different binding modes (as compared to the more prevalent HLA-A*02:01). Finally, an HLA-C complex was included to highlight the generality of the method across the three types of class I HLA.

Re-docking experiment

This dataset was used for a re-docking experiment, in which we tried to reproduce the binding modes observed in the crystal structures. As a first pre-processing step, all crystal structures were visually inspected and revised as needed²⁰. For instance, water molecules were removed since they are not accounted for in our docking method. Also, in cases of duplicated side chains (i.e., conformational heterogeneity), the subset with lower occupancy was removed. Finally, in cases of multiple molecules per asymmetric unit only the first subset was kept (e.g., chains A, B and C). Revised structures were then submitted to a three steps energy minimization with GROMACS v4.6.5⁶⁰, using the steepest-descent and conjugate gradient methods. The GROMOS96 (53a6) force field was used with the SPC water model; a cutoff value of 1.2 nm was used for both van der Waals and Coulomb interactions, with Fast Particle-Mesh Ewald electrostatics (PME). After this procedure, water molecules were removed from the output files and the coordinates of the minimized complexes were saved into PDB format files. These minimized crystal structures will be hereafter referred to as “reference structures”: they are the structures we aim to reproduce in this re-docking experiment. For that, the ligand and receptor in each complex are saved into independent PDB format files, and a docking software is used to reconstruct the original complex. The relevance of re-docking experiments lies in that the conformation and position of an input ligand are systematically randomized by the docking software.

In a re-docking experiment the accuracy of the results is evaluated by assessing the goodness-of-fit between the predicted complexes and the reference structure, usually in terms of Root Mean Square Deviation (RMSD). When computing the RMSD for all atoms of a peptide, between a predicted complex and its reference structure, the two pMHC complexes are first aligned based on the MHC structure. Therefore, the all-atom RMSD captures not only differences in conformation between the two binding modes, but also differences in position of the peptide inside the MHC cleft. While the all-atom RMSD captures changes in both the main chain and the side chains of the peptide, the Cα RMSD (i.e., RMSD for alpha carbons only) captures only changes in the main chain. Another goodness-of-fit measure is the Least Root Mean Square Deviation (LRMSD), computed after aligning the two peptide structures (as opposed to aligning the two MHC structures), either for all atoms or alpha carbons only. The LRMSD is a more precise evaluation of differences between two conformations of a peptide, irrespective of its position in the MHC cleft.

DINC algorithm

DINC is a parallelized meta-docking method for incremental docking of large ligands, described in detail in previous publications^23,61. Briefly, instead of docking the entire peptide at once, DINC starts by docking only a small fragment of the peptide (Fig. 2). The best conformation for this “initial fragment” is selected using a scoring function, and expanded through the addition of another subset of atoms from the original peptide. This new expanded fragment is then docked, and this process is incrementally repeated until the entire peptide is reconstructed and docked. Note that the word “fragment” is used here with a different meaning than that of fragment-based drug discovery (FBDD). FBDD uses libraries of fragments, which are very small molecules with no more than two functional groups⁶², to create new drugs or drug-like ligands. DINC was inspired by these methods, but does not use any library of small molecules, and does not dock independent fragments that are later connected. In the context of DINC, the fragments are overlapping sections of the input ligand, docked sequentially to grow the ligand incrementally⁶¹.

DINC currently uses the standard docking software AutoDock 4^20,63; a free online version of DINC is available as a webserver (http://dinc.kavrakilab.org/)⁶¹. While DINC manages the fragment selection and expansion, as well as the parallelization of the search, AutoDock 4 performs the sampling and scoring of individual fragments. More specifically, AutoDock 4 uses a Lamarckian genetic algorithm⁶³. Genetic algorithms are a type of evolutionary technique that is commonly used for the stochastic sampling of ligand conformations in molecular docking⁶⁴. For scoring, AutoDock 4 uses a semi-empirical free energy force field, including terms for dispersion/repulsion, hydrogen bonding, electrostatics and desolvation^20,63. In this paper we used a custom version of DINC, to explore different parameters (e.g., number of DoFs at each round) and heuristics (e.g., fragment expansion method).

Experimental setup

DINC is a customizable approach, allowing for the use of different combinations of parameters and heuristics. In this context, one set of parameters chosen for a particular job (i.e., one execution of DINC) is referred to as a DINC protocol. From previous experience, we define a default protocol for DINC (Fig. 3). This protocol represents a reasonable selection of parameter values for a standard job, but it is not expected to provide the best results in all situations. Therefore, in our re-docking experiment, four alternative protocols were defined, exploring other combinations of parameter values. The specific parameter values in these five protocols (including the default protocol) are presented in Supplementary Table S2. It is important to highlight that we do not exhaustively evaluate all possible protocols. For example, DINC protocols also comprise the specific parameters of the underlying docking software, in this case AutoDock 4 (Fig. 3). In our experiment, default values were used for AutoDock 4 parameters (e.g., ga_run = 50, pop_size = 150, num_evals = 250000, num_gen = 27000, elitism = 1, mutation_rate = 0.02, crossover_rate = 0.8, etc).

The five protocols are used in the following way: First, a total of 20 DINC jobs is executed for each of the 25 complexes in the dataset (Supplementary Table S1), using the default protocol (Fig. 3). The output conformation with the lowest binding energy for each complex is compared to the corresponding reference structure. If the all-atom RMSD between them is lower than 2 Å, this is considered a good reproduction of this complex. If not, a new batch of 20 DINC jobs is executed using the next alternative protocol. This process is repeated until a good reproduction is obtained, or the five protocols have been used (which corresponds to 100 jobs).

Finally, it is important to remember that the docking search is not biased by the reference structure. Additionally, each new DINC job is completely independent from all previous jobs. All 20 jobs of a given batch are executed in parallel. Our cluster contains 80 dual processor HP SL230s computing nodes, each one equipped with two Intel E5-2650v2 Ivy Bridge EP processors (for a total of 16 cores per node). The typical running time for a DINC job on our cluster is 30 minutes (for a CPU time of about 8 hours).

Visualization

Cartoon representations of the MHCs, cross-section views of the binding clefts, and side view images of the peptides were obtained with the UCSF Chimera⁶⁵ and UCSF ChimeraX packages. These packages are developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco (supported by NIGMS P41-GM103311). Molecular surfaces of pMHC complexes were computed with GRASP2⁶⁶, which was also used to obtain top view images of these complexes. Electrostatic potentials were computed with Delphi for a range of −5 kT/e (red) to +5 kT/e (blue), using the GRASP2 interface.

Data availability

The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.

Results and Discussion

Re-docking of a diverse dataset of pMHC complexes

We performed a re-docking experiment with a diverse dataset of pMHC complexes (Supplementary Table S1). Our dataset includes 10 of the most prevalent human MHC allotypes, bound to peptides with different lengths (8 to 10 amino acids) and a varying number of DoFs (29 to 41). The goodness-of-fit between predicted binding modes and the corresponding reference structures (i.e., the minimized crystal structures) is estimated in terms of RMSD. A “near native” reproduction of an experimentally-observed binding mode usually corresponds to an all-atom RMSD lower than 2.0–2.5 Å^17,47. The results of our re-docking experiment show good reproductions of the 25 pMHC complexes (Fig. 4), with an average all-atom RMSD of only 1.92 Å (±0.41 Å). The all-atom RMSD is less than 2.2 Å in 68% of the cases, and less than 2.5 Å in 98%; the highest all-atom RMSD is 2.61 Å. Note that most previous work in the field was reported using backbone (or Cα) RMSD only^{13,14,16,36,50}. This means capturing the overall “shape” of reproduced peptides, but not necessarily the precise position of their side-chains. On the other hand, related work reporting all-atom RMSD was usually performed in less diverse datasets (e.g., only selected MHC allotypes or peptide lengths)^17,34,37,40. We report both Cα RMSD and all-atom RMSD for our re-docking experiment, which was conducted on a structurally diverse dataset of pMHC complexes (Supplementary Table S1).

As seen from Fig. 4, we do not observe any correlation between the all-atom RMSD and the number of DoFs (R = 0.39). For instance, the result for complex 4NT6 (9-mer, 31 DoFs) is worse than for complex 3VXS (10-mer, 38 DoFs). When considering all the variables listed in Supplementary Table S1, the strongest correlation (R = 0.5) is observed between peptide length and Cα LRMSD. By computing the Cα LRMSD we capture the accuracy of the backbone prediction. This is very meaningful information because a small error in the backbone has a bigger impact on the binding mode than a similar error in a side chain.

DINC makes no initial assumption on the backbone conformation, and has no constraint related to templates or expert-knowledge on the expected conformation (Fig. 2). In spite of that, our average Cα LRMSD is only 0.99 Å (±0.36 Å). As further discussed in the next section, a good reproduction of the backbone is obtained even when considering peptides with different lengths, or “non-standard” binding modes. Besides, similar levels of accuracy are obtained for very different MHC allotypes, highlighting the potential of DINC as a general method for pMHC structural prediction.

Accurate prediction of diverse binding modes

The allotype HLA-A*02:01 is one of the most extensively studied HLA variants^17,41. It is the second most prevalent allotype in humans^67,68 and arguably the HLA variant for which the most detailed and comprehensive data is available: more than 42,000 binding assays deposited in the Immune Epitope Database⁶⁹. As reported in previous studies, HLA-A*02:01 binds mostly 9-mers, but also larger peptides. It was also reported that the HLA-A*02:01 binding cleft is fairly constrained, with little conformational variation across available crystal structures. It also presents a clear pattern of preferred anchor residues⁴¹ at both peptide termini: usually positions 2 (p2) and 9 (p9) of the 9-mer ligands (Fig. 5). Comparing available crystal structures, a shared conformational pattern was observed for the backbone of 9-mer peptides bound to HLA-A*02:01³⁵. For example, this typical backbone pattern is observed in the crystal structure 3MRG, involving a virus-derived peptide (Fig. 5A). Some cancer-related peptides, however, are known to present unusual binding modes¹⁷. For instance, in the modified melanoma-associated antigen MART1-A27L the amino-terminal anchor to the HLA-A*02:01 binding cleft is p1 instead of p2. This alternative anchoring pattern creates a sideways deviation of the backbone in the middle of the peptide¹⁷, resulting in an unusual binding mode (Fig. 5B, 2GTW). In addition, larger peptides are known to present bulging conformations of the backbone, to accommodate a longer chain using the same anchoring pockets (Fig. 5A, 1I4F). DINC was able to reproduce each one of these 3 alternative backbone conformations, with sub-angstrom accuracy (Fig. 6).

The most prevalent HLA allotype is HLA-A*24:02^67,68, which is known to bind 8-mers, 9-mers and 10-mers. As expected, the conformation of 8-mers is more linear, with almost no bulged region between the two peptide termini (Fig. 5C, 4F7T). Currently, there are only two crystal structures of 9-mers bound to HLA-A*24:02, and they present different binding modes (Fig. 5C). According to the researchers who described these structures, the virus-derived peptide resolved in 3I6L shows an unusual binding mode, with a much more exposed p4 as compared to a regular self-derived peptide (2BCK). Further analysis of other crystal structures shows that the backbone conformation seen in 3I6L is very similar to that of 10-mer peptides bound to the same HLA (data not shown). In our re-docking experiment, DINC was able to reproduce all these binding modes (Fig. 6), as well as other 10-mers bound to HLA-A*24:02 (Supplementary Table S1). In the case of 3I6L, the authors claim that the observed binding mode is stabilized by an internal hydrogen bond established by p3, whose side chain is pointing towards the center of the binding cleft. The binding mode predicted by DINC for this complex does not feature this specific hydrogen bond (as determined by UCSF Chimera), but a similar orientation of p3 is observed (Fig. 6F).

The peptide’s binding mode is greatly influenced by the shape and properties of the HLA cleft; a given peptide might bind differently to different HLA allotypes (e.g., using different anchor residues or having different side chains exposed for TCR interaction)^35,70. These structural differences are key for recognition by T-cells, and contribute to the diversity of cellular responses observed among individuals with different subsets of HLAs¹. Although our dataset includes peptides bound to four different HLA-A allotypes, five HLA-B allotypes and one HLA-C allotype (Supplementary Table S1), we can reproduce the conformational differences imposed by these different binding clefts (Fig. 6).

The results in this paper show that it is possible to develop a general pMHC geometry prediction method. In addition to the good reproduction of peptides’ backbone, the average all-atoms LRMSD of 1.73 Å (±0.33 Å) demonstrates high accuracy reproduction of peptides’ side chains (Supplementary Table S1). Therefore, not only the buried side chains (i.e., those facing the MHC cleft) were correctly predicted, but the overall geometry of the ligand was closely reproduced (including side-chains of the bulging portion of the peptide, which are more exposed for TCR interaction). Obtaining a good approximation of the pMHC complex geometry is essential for the use of predicted models as input for other structure-based analyses.

Significance for T-cell-based immunotherapy

Thanks to the rapid technical developments of the last decade and our growing understanding of the mechanisms involved in cellular immunity, T-cell-based immunotherapy has emerged as one of the most promising approaches for cancer treatment^7,71,72. Significant anti-tumor activity has been reported in a number of clinical trials, involving different cancer types⁷³. Two melanoma-associated antigens, MAGEA3 and MART1, stand out among the leading tumor-derived peptides targeted by these immunotherapies (Supplementary Table S1).

The peptide-antigens derived from MAGEA3 and MART1 can be expressed by multiple tumor types, but are not expressed by most normal tissues, therefore allowing for the development of antigen-specific T-cell-based therapeutics^74,75,76. Unfortunately, unexpected off-target toxicities against healthy tissues have been reported^74,75,76, raising serious safety concerns. For instance, lethal cardiac toxicity was observed in two patients undergoing treatment with T-cells specific to the MAGEA3 antigen^74,75. Later investigation showed that the therapeutic T-cells used in these patients were also recognizing an unrelated Titin-derived peptide, displayed by HLA-A*01:01 molecules in healthy cardiac cells^74,75.

Although we usually refer to the peptides as being the targets recognized by the cytotoxic T-cells, TCRs actually recognize the combined surface of the peptide and MHC receptor, also referred to as the “TCR-interacting surface” of the pMHC complex (Fig. 1). Each TCR is thought to be specific for a given pMHC complex, but structural similarity between unrelated complexes can be responsible for off-target activation of T-cells^8,77; also known as T-cell cross-reactivity⁶. Using x-ray crystallography, Raman and colleagues¹⁰ have confirmed the structural similarity between the pMHC complexes involved in the MAGEA3-Titin cross-reactivity (Fig. 7A,B). Both of these pMHC complexes were included in our dataset (5BRZ and 5BS0). DINC was able to correctly predict the geometry of both peptides, bound to HLA-A*01:01, and reproduce the structural similarity of the resulting TCR-interacting surfaces (Fig. 7D,E).

The clinically relevant example described above highlights the significance of pMHC structural prediction in the context of T-cell-based immunotherapy. In this case, the two peptides have a sequence identity of 55%, which is already challenging for sequence-based cross-reactivity prediction. However, T-cell cross-reactivity can be triggered even by peptides with no sequence identity and low biochemical similarity⁷⁸, and might be driven by specific structural similarities in hot-spots over the TCR-interacting surface⁷⁹. In this context, structure-based methods for cross-reactivity prediction have been proposed, either clustering pMHCs of interest based on structural similarity^78,80, or integrating structural information and protein expression levels into sequence-based proteomic searches^81,82. This field will spawn significant developments in the coming years, particularly considering the importance of cross-reactivity prediction for T-cell-based immunotherapy¹¹. Moreover, considering the costs and practical limitations of experimental methods for protein structural analysis, fast and reliable computational methods for geometry prediction of pMHC complexes should play an important role in this process.

Current challenges and future work

The high-dimensionality of the search space is a challenge inherent to molecular docking. Algorithmic solutions to address this challenge are usually non-deterministic, introducing variability, which affects reproducibility. For instance, a single run of AutoDock 4 starts with a random conformation of the ligand, which is then randomly modified by the Lamarckian genetic algorithm to create new conformations^20,63. Therefore, the chances of obtaining different results in independent runs of AutoDock 4 increase with ligand size. A similar variability is observed across independent DINC jobs. However, this is not a problem here, as our goal was only to determine if DINC could predict different binding modes of pMHC complexes within a reasonable time. By providing a proof of concept that this goal is indeed attainable, we can now open new avenues for developing even better algorithms inspired by the meta-docking incremental approach. In fact, since only a few protocols were used in the context of this study, there is great potential for further improvement. As future work, we will perform a thorough evaluation of the parameters and heuristics in DINC in order to improve its efficiency and achieve fast, accurate and reproducible geometry prediction of pMHC complexes.

The general structural prediction of pMHC complexes requires addressing a combination of challenges, including peptide-docking, receptor flexibility and accurate scoring. Here, we focused on the peptide-docking problem, and showed how a simple incremental approach allows predicting binding modes of peptides with different lengths and bound to different MHCs. For that, we limited our analysis to a re-docking experiment involving a diverse dataset of human pMHCs. Receptor flexibility is another important challenge²⁷ and should be taken into consideration when predicting pMHC complexes⁸³. In fact, MHC flexibility can affect peptide loading, and peptide binding can induce local changes in the MHC receptor^84,85. However, the folding of MHC receptors is highly-conserved. Therefore, a docking protocol accounting for receptor flexibility could be combined with homology modeling to predict the binding modes of peptides to MHC allotypes for which no structural information is available^14,16. However, the high-dimensionality of the resulting search space requires even better algorithms, which may involve dimensionality reduction approaches⁸⁶.

Recent reviews indicate that other docking software can outperform AutoDock 4 in scoring predicted binding modes²⁶. In fact, scoring is one of the bottlenecks when trying to achieve greater accuracy in docking-based methods²⁶. In terms of sampling, our method could certainly provide results with sub-angstrom accuracy, but this would require a scoring function capable of discriminating between conformations with sub-angstrom differences. Therefore, our method could benefit from using consensus scoring⁸⁷, peptide-specific scoring⁸⁸ or HLA-specific scoring⁸⁹. Being a meta-docking application, DINC can integrate alternative sampling strategies or scoring methods. We plan to further investigate these issues in a future study, performing cross-docking and benchmarking on a much larger pMHC dataset.

Finally, a version of DINC with improved scoring could also provide a more general tool for epitope prediction and virtual screening of MHC binders. Gold standard tools for these tasks usually rely on machine learning methods trained on available datasets of previously tested peptide sequences⁹⁰, which are limited or inexistent for less prevalent MHC allotypes^91,92. DINC do not require ad-hoc knowledge on the typical binders for a given MHC allotype, or its preferred primary anchors. Therefore, once the aforementioned challenges are addressed, DINC could potentially complement sequence-based methods in epitope prediction projects, by providing structure-based ranking of peptide-ligands for any MHC of interest.

Conclusion

In this paper, we demonstrate that an incremental meta-docking approach can predict the binding modes of large peptide ligands bound to MHC receptors. Standard docking software can provide general solutions (i.e., solutions that are not restricted to a particular protein receptor), but cannot handle large ligands. On the other hand, methods focused on pMHC structural prediction lack generality because they often use expert-knowledge or frequent patterns as constraints. We argue that the use of incremental docking offers a new strategy to overcome these limitations. Our work shows that incremental docking allows handling different MHC allotypes, predicting unusual binding modes, and obtaining accurate structural prediction for peptides with up to 41 rotatable bonds. In addition, being a meta-docking approach, our method avoids the need for new docking software. We postulate that a similar incremental process could be implemented using different docking software, achieving similar or even better results. As a proof of concept, our study represents a landmark in the advancement of methods for geometry prediction of pMHC complexes. Future developments of these methods are expected to have a positive impact in many fields related to human health, including vaccine development and tissue transplantation. In particular, fast and accurate prediction of patient-specific pMHC complexes will be key for the development of safe and effective T-cell-based immunotherapies against cancer.

References

Vandiedonck, C. & Knight, J. C. The human Major Histocompatibility Complex as a paradigm in genomics research. Brief. Funct. Genomic Proteomic 8, 379–394, https://doi.org/10.1093/bfgp/elp010 (2009).
Article CAS PubMed PubMed Central Google Scholar
Neefjes, J., Jongsma, M. L., Paul, P. & Bakke, O. Towards a systems understanding of MHC class I and MHC class II antigen presentation. Nat. Rev. Immunol. 11, 823–836, https://doi.org/10.1038/nri3084 (2011).
Article CAS PubMed Google Scholar
Welsh, R. M., Che, J. W., Brehm, M. A. & Selin, L. K. Heterologous immunity between viruses. Immunol. Rev. 235, 244–266, https://doi.org/10.1111/j.0105-2896.2010.00897.x (2010).
Article CAS PubMed PubMed Central Google Scholar
Paul, S. et al. HLA class I alleles are associated with peptide-binding repertoires of different size, affinity, and immunogenicity. J. Immunol. 191, 5831–5839, https://doi.org/10.4049/jimmunol.1302101 (2013).
Article CAS PubMed Google Scholar
Robinson, J. et al. The IPD and IMGT/HLAdatabase: allele variant databases. Nucleic Acids Res. 43, D423–431, https://doi.org/10.1093/nar/gku1161 (2015).
Article CAS PubMed Google Scholar
Degauque, N., Brouard, S. & Soulillou, J. P. Cross-reactivity of TCR repertoire: Current concepts, challenges, and implication for allotransplantation. Front. Immunol. 7, 89, https://doi.org/10.3389/fimmu.2016.00089 (2016).
Article PubMed PubMed Central Google Scholar
Lizée, G. et al. Harnessing the power of the immune system to target cancer. Annu. Rev. Med. 64, 71–90, https://doi.org/10.1146/annurev-med-112311-083918 (2013).
Article PubMed Google Scholar
Birnbaum, M. E. et al. Deconstructing the peptide-MHC specificity of T cell recognition. Cell 157, 1073–1087, https://doi.org/10.1016/j.cell.2014.03.047 (2014).
Article CAS PubMed PubMed Central Google Scholar
Stone, J. D., Harris, D. T. & Kranz, D. M. TCR affinity for p/MHC formed by tumor antigens that are self-proteins: impact on efficacy and toxicity. Curr. Opin. Immunol. 33, 16–22, https://doi.org/10.1016/j.coi.2015.01.003 (2015).
Article CAS PubMed PubMed Central Google Scholar
Raman, M. C. et al. Direct molecular mimicry enables off-target cardiovascular toxicity by an enhanced affinity TCR designed for cancer immunotherapy. Sci. Rep. 6, 18851, https://doi.org/10.1038/srep18851 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Antunes, D. A. et al. Interpreting T-Cell cross-reactivity through structure: Implications for TCR-based cancer immunotherapy. Front Immunol 8, 1210, https://doi.org/10.3389/fimmu.2017.01210 (2017).
Article PubMed PubMed Central Google Scholar
Sezerman, U., Vajda, S. & DeLisi, C. Free energy mapping of class I MHC molecules and structural determination of bound peptides. Protein Sci. 5, 1272–1281, https://doi.org/10.1002/pro.5560050706 (1996).
Article CAS PubMed PubMed Central Google Scholar
Tong, J. C., Tan, T. W. & Ranganathan, S. Modeling the structure of bound peptide ligands to major histocompatibility complex. Protein Sci. 13, 2523–2532, https://doi.org/10.1110/ps.04631204 (2004).
Article CAS PubMed PubMed Central Google Scholar
Bordner, A. J. & Abagyan, R. Ab initio prediction of peptide-MHC binding geometry for diverse class I MHC allotypes. Proteins 63, 512–526, https://doi.org/10.1002/prot.20831 (2006).
Article CAS PubMed Google Scholar
Todman, S. J. et al. Toward the atomistic simulation of T cell epitopes automated construction of MHC: peptide structures for free energy calculations. J. Mol. Graph. Model. 26, 957–961, https://doi.org/10.1016/j.jmgm.2007.07.005 (2008).
Article CAS PubMed Google Scholar
Khan, J. M. & Ranganathan, S. pDOCK: a new technique for rapid and accurate docking of peptide ligands to Major Histocompatibility Complexes. Immunome Res. 6, S2, https://doi.org/10.1186/1745-7580-6-S1-S2 (2010).
Article PubMed PubMed Central Google Scholar
Rigo, M. M. et al. DockTope: a Web-based tool for automated pMHC-I modelling. Sci. Rep. 5, 18413, https://doi.org/10.1038/srep18413 (2015).
Article ADS CAS PubMed Google Scholar
Sousa, S. F. et al. Protein-ligand docking in the new millennium–a retrospective of 10 years in the field. Curr. Med. Chem. 20, 2296–2314, https://doi.org/10.2174/0929867311320180002 (2013).
Article CAS PubMed Google Scholar
Sliwoski, G., Kothiwale, S., Meiler, J. & Lowe, E. W. Computational methods in drug discovery. Pharmacol. Rev. 66, 334–395, https://doi.org/10.1124/pr.112.007336 (2014).
Article PubMed PubMed Central Google Scholar
Forli, S. et al. Computational protein-ligand docking and virtual drug screening with the AutoDock suite. Nat. Protoc. 11, 905–919, https://doi.org/10.1038/nprot.2016.051 (2016).
Article CAS PubMed PubMed Central Google Scholar
Guedes, I. A., de Magalhães, C. S. & Dardenne, L. E. Receptor-ligand molecular docking. Biophys. Rev. 6, 75–87, https://doi.org/10.1007/s12551-013-0130-2 (2014).
Article CAS PubMed Google Scholar
Chang, M. W., Ayeni, C., Breuer, S. & Torbett, B. E. Virtual screening for HIV protease inhibitors: a comparison of AutoDock 4 and Vina. PLoS ONE 5, e11955, https://doi.org/10.1371/journal.pone.0011955 (2010).
Article ADS PubMed PubMed Central Google Scholar
Dhanik, A., McMurray, J. S. & Kavraki, L. E. DINC: a new AutoDock-based protocol for docking large ligands. BMC Struct. Biol. 13(Suppl 1), S11, https://doi.org/10.1186/1472-6807-13-S1-S11 (2013).
Article PubMed PubMed Central Google Scholar
Devaurs, D. et al. Characterizing energy landscapes of peptides using a combination of stochastic algorithms. IEEE Trans. Nanobioscience 14, 545–552, https://doi.org/10.1109/TNB.2015.2424597 (2015).
Article PubMed Google Scholar
Rentzsch, R. & Renard, B. Y. Docking small peptides remains a great challenge: an assessment using AutoDock Vina. Brief. Bioinformatics 16, 1045–1056, https://doi.org/10.1093/bib/bbv008 (2015).
Article PubMed Google Scholar
Wang, Z. et al. Comprehensive evaluation of ten docking programs on a diverse set of protein-ligand complexes: the prediction accuracy of sampling power and scoring power. Phys. Chem. Chem. Phys. 18, 12964–12975, https://doi.org/10.1039/c6cp01555g (2016).
Article CAS PubMed Google Scholar
Antunes, D. A., Devaurs, D. & Kavraki, L. E. Understanding the challenges of protein flexibility in drug design. Expert Opin. Drug. Discov. 10, 1301–1313, https://doi.org/10.1517/17460441.2015.1094458 (2015).
Article CAS PubMed Google Scholar
E Silva, R. D. E. F. et al. Combination of In Silico methods in the search for potential CD4(+) and CD8(+) T cell epitopes in the proteome of Leishmania braziliensis. Front. Immunol. 7, 327, https://doi.org/10.3389/fimmu.2016.00327 (2016).
Article PubMed Central Google Scholar
Mahdavi, M. & Moreau, V. In Silico designing breast cancer peptide vaccine for binding to MHC class I and II: A molecular docking study. Comput. Biol. Chem. 65, 110–116, https://doi.org/10.1016/j.compbiolchem.2016.10.007 (2016).
Article CAS PubMed Google Scholar
Mukherjee, S., Bhattacharyya, C. & Chandra, N. HLaffy: estimating peptide affinities for Class-1 HLA molecules by learning position-specific pair potentials. Bioinformatics 32, 2297–2305, https://doi.org/10.1093/bioinformatics/btw156 (2016).
Article CAS PubMed Google Scholar
Ishikawa, T. Prediction of peptide binding to a major histocompatibility complex class I molecule based on docking simulation. J. Comput. Aided Mol. Des. 30, 875–887, https://doi.org/10.1007/s10822-016-9967-3 (2016).
Article ADS CAS PubMed Google Scholar
Borrman, T. et al. ATLAS: A database linking binding affinities with structures for wild-type and mutant TCR-pMHC complexes. Proteins https://doi.org/10.1002/prot.25260 (2017).
Schueler-Furman, O., Elber, R. & Margalit, H. Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes. Fold. Des. 3, 549–564, https://doi.org/10.1016/S1359-0278(98)00070-4 (1998).
Article CAS PubMed Google Scholar
Bui, H. H., Schiewe, A. J., von Grafenstein, H. & Haworth, I. S. Structural prediction of peptides binding to MHC class I molecules. Proteins 63, 43–52, https://doi.org/10.1002/prot.20870 (2006).
Article CAS PubMed Google Scholar
Antunes, D. A. et al. Structural allele-specific patterns adopted by epitopes in the MHC-I cleft and reconstruction of MHC:peptide complexes to cross-reactivity assessment. PLoS ONE 5, e10353, https://doi.org/10.1371/journal.pone.0010353 (2010).
Article ADS PubMed PubMed Central Google Scholar
Antes, I., Siu, S. W. & Lengauer, T. DynaPred: a structure and sequence based method for the prediction of MHC class I binding peptide sequences and conformations. Bioinformatics 22, 16–24, https://doi.org/10.1093/bioinformatics/btl216 (2006).
Article Google Scholar
Fagerberg, T., Cerottini, J. C. & Michielin, O. Structural prediction of peptides bound to MHC class I. J. Mol. Biol. 356, 521–546, https://doi.org/10.1016/j.jmb.2005.11.059 (2006).
Article CAS PubMed Google Scholar
Knapp, B., Demharter, S., Deane, C. M. & Minary, P. Exploring peptide/MHC detachment processes using hierarchical natural move Monte Carlo. Bioinformatics 32, 181–186, https://doi.org/10.1093/bioinformatics/btv502 (2016).
CAS PubMed Google Scholar
Abagyan, R., Totrov, M. & Kuznetsov, D. ICM-A new method for protein modeling and design: applications to docking and structure prediction from the distorted native conformation. J. Comput. Chem. 15, 488–506, https://doi.org/10.1002/jcc.540150503 (1994).
Article CAS Google Scholar
Liu, T. et al. Subangstrom accuracy in pHLA-I modeling by Rosetta FlexPepDock refinement protocol. J. Chem. Inf. Model. 54, 2233–2242, https://doi.org/10.1021/ci500393h (2014).
Article CAS PubMed Google Scholar
Sidney, J., Peters, B., Frahm, N., Brander, C. & Sette, A. HLA class I supertypes: a revised and updated classification. BMC Immunol. 9, 1, https://doi.org/10.1186/1471-2172-9-1 (2008).
Article PubMed PubMed Central Google Scholar
Chappell, P. et al. Expression levels of MHC class I molecules are inversely correlated with promiscuity of peptide binding. Elife 4, e05345, https://doi.org/10.7554/eLife.05345 (2015).
Article PubMed PubMed Central Google Scholar
Bassani-Sternberg, M. & Gfeller, D. Unsupervised HLA peptidome deconvolution improves ligand prediction accuracy and predicts cooperative effects in peptide-HLA interactions. J. Immunol. 197, 2492–2499, https://doi.org/10.4049/jimmunol.1600808 (2016).
Article CAS PubMed Google Scholar
Liu, J. et al. Novel immunodominant peptide presentation strategy: a featured HLA-A*2402-restricted cytotoxic T-lymphocyte epitope stabilized by intrachain hydrogen bonds from severe acute respiratory syndrome coronavirus nucleocapsid protein. J. Virol. 84, 11849–11857, https://doi.org/10.1128/JVI.01464-10 (2010).
Article CAS PubMed PubMed Central Google Scholar
Craik, D. J., Fairlie, D. P., Liras, S. & Price, D. The future of peptide-based drugs. Chem. Biol. Drug. Des. 81, 136–147, https://doi.org/10.1111/cbdd.12055 (2013).
Article CAS PubMed Google Scholar
Du, Q. S., Xie, N. Z. & Huang, R. B. Recent development of peptide drugs and advance on theory and methodology of peptide inhibitor design. Med. Chem. 11, 235–247, https://doi.org/10.2174/1573406411666141229163355 (2015).
Article CAS PubMed Google Scholar
London, N., Raveh, B. & Schueler-Furman, O. Peptide docking and structure-based characterization of peptide binding: from knowledge to know-how. Curr. Opin. Struct. Biol. 23, 894–902, https://doi.org/10.1016/j.sbi.2013.07.006 (2013).
Article CAS PubMed Google Scholar
Kilburg, D. & Gallicchio, E. Recent advances in computational models for the study of protein-peptide interactions. Adv. Protein Chem. Struct. Biol. 105, 27–57, https://doi.org/10.1016/bs.apcsb.2016.06.002 (2016).
Article CAS PubMed Google Scholar
Liu, Z., Dominy, B. N. & Shakhnovich, E. I. Structural mining: self-consistent design on flexible protein-peptide docking and transferable binding affinity potential. J. Am. Chem. Soc. 126, 8515–8528, https://doi.org/10.1021/ja032018q (2004).
Article CAS PubMed Google Scholar
Donsky, E. & Wolfson, H. J. PepCrawler: a fast RRT-based algorithm for high-resolution refinement and binding affinity estimation of peptide inhibitors. Bioinformatics 27, 2836–2842, https://doi.org/10.1093/bioinformatics/btr498 (2011).
Article CAS PubMed Google Scholar
Lee, H., Heo, L., Lee, M. S. & Seok, C. GalaxyPepDock: a protein-peptide docking tool based on interaction similarity and energy optimization. Nucleic Acids Res. 43, W431–435, https://doi.org/10.1093/nar/gkv495 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yan, C., Xu, X. & Zou, X. Fully blind docking at the atomic level for protein-peptide complex structure prediction. Structure 24, 1842–1853, https://doi.org/10.1016/j.str.2016.07.021 (2016).
Article CAS PubMed PubMed Central Google Scholar
de Vries, S. J., Rey, J., Schindler, C. E. M., Zacharias, M. & Tuffery, P. The pepATTRACT web server for blind, large-scale peptide-protein docking. Nucleic Acids Res, https://doi.org/10.1093/nar/gkx335 (2017).
London, N., Movshovitz-Attias, D. & Schueler-Furman, O. The structural basis of peptide-protein binding strategies. Structure 18, 188–199, https://doi.org/10.1016/j.str.2009.11.012 (2010).
Article CAS PubMed Google Scholar
Raveh, B., London, N., Zimmerman, L. & Schueler-Furman, O. Rosetta FlexPepDock ab-initio: simultaneous folding, docking and refinement of peptides onto their receptors. PLoS ONE 6, e18934, https://doi.org/10.1371/journal.pone.0018934 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Trellet, M., Melquiond, A. S. & Bonvin, A. M. A unified conformational selection and induced fit approach to protein-peptide docking. PLoS ONE 8, e58769, https://doi.org/10.1371/journal.pone.0058769 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Dhanik, A., McMurray, J. S. & Kavraki, L. E. Binding modes of peptidomimetics designed to inhibit STAT3. PLoS ONE 7, e51603, https://doi.org/10.1371/journal.pone.0051603 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pereyra, F. et al. The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science 330, 1551–1557, https://doi.org/10.1126/science.1195271 (2010).
Article PubMed PubMed Central Google Scholar
Kim, A. Y. et al. Spontaneous control of HCV is associated with expression of HLA-B 57 and preservation of targeted epitopes. Gastroenterology 140, 686–696, https://doi.org/10.1053/j.gastro.2010.09.042 (2011).
Article CAS PubMed Google Scholar
Pronk, S. et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics 29, 845–854, https://doi.org/10.1093/bioinformatics/btt055 (2013).
Article CAS PubMed PubMed Central Google Scholar
Antunes, D. A. et al. DINC 2.0: A New Protein-Peptide Docking Webserver Using an Incremental Approach. Cancer Res. 77, e55–e57, https://doi.org/10.1158/0008-5472.CAN-17-0511 (2017).
Article CAS PubMed Google Scholar
Erlanson, D. A., Fesik, S. W., Hubbard, R. E., Jahnke, W. & Jhoti, H. Twenty years on: the impact of fragments on drug discovery. Nat. Rev. Drug. Discov. 15, 605–619, https://doi.org/10.1038/nrd.2016.109 (2016).
Article CAS PubMed Google Scholar
Morris, G. M. et al. AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem. 30, 2785–2791, https://doi.org/10.1002/jcc.21256 (2009).
Article CAS PubMed PubMed Central Google Scholar
Bello, M., Martinez-Archundia, M. & Correa-Basurto, J. Automated docking for novel drug discovery. Expert Opin Drug Discov 8, 821–834, https://doi.org/10.1517/17460441.2013.794780 (2013).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612, https://doi.org/10.1002/jcc.20084 (2004).
Article CAS PubMed Google Scholar
Petrey, D. & Honig, B. GRASP2: visualization, surface properties, and electrostatics of macromolecular structures and sequences. Meth. Enzymol. 374, 492–509, https://doi.org/10.1016/S0076-6879(03)74021-X (2003).
Article CAS PubMed Google Scholar
Gonzalez-Galarza, F. F. et al. Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res. 43, D784–788, https://doi.org/10.1093/nar/gku1166 (2015).
Article CAS PubMed Google Scholar
Solberg, O. D. et al. Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies. Hum. Immunol. 69, 443–464, https://doi.org/10.1016/j.humimm.2008.05.001 (2008).
Article CAS PubMed PubMed Central Google Scholar
Vita, R. et al. The immune epitope database (IEDB) 3.0. Nucleic Acids Res. 43, D405–412, https://doi.org/10.1093/nar/gku938 (2015).
Article CAS PubMed Google Scholar
Maenaka, K. et al. Nonstandard peptide binding revealed by crystal structures of HLA-B*5101 complexed with HIV immunodominant epitopes. J. Immunol. 165, 3260–3267, https://doi.org/10.4049/jimmunol.165.6.3260 (2000).
Article CAS PubMed Google Scholar
Perica, K., Varela, J. C., Oelke, M. & Schneck, J. Adoptive T cell immunotherapy for cancer. Rambam Maimonides Med. J. 6, e0004, https://doi.org/10.5041/RMMJ.10179 (2015).
Article PubMed PubMed Central Google Scholar
June, C. H., Riddell, S. R. & Schumacher, T. N. Adoptive cellular therapy: a race to the finish line. Sci. Transl. Med. 7, 280ps7, https://doi.org/10.1126/scitranslmed.aaa3643 (2015).
Article PubMed Google Scholar
Robbins, P. F. et al. A pilot trial using lymphocytes genetically engineered with an NY-ESO-1-reactive T-cell receptor: long-term follow-up and correlates with response. Clin. Cancer Res. 21, 1019–1027, https://doi.org/10.1158/1078-0432.CCR-14-2708 (2015).
Article CAS PubMed Google Scholar
Cameron, B. J. et al. Identification of a Titin-derived HLA-A1-presented peptide as a cross-reactive target for engineered MAGE A3-directed Tcells. Sci. Transl. Med. 5, 197ra103, https://doi.org/10.1126/scitranslmed.3006034 (2013).
Article PubMed Google Scholar
Linette, G. P. et al. Cardiovascular toxicity and titin cross-reactivity of affinity-enhanced T cells in myeloma and melanoma. Blood 122, 863–871, https://doi.org/10.1182/blood-2013-03-490565 (2013).
Article CAS PubMed PubMed Central Google Scholar
van den Berg, J. H. et al. Case report of a fatal serious adverse event upon administration of T cells transduced with a MART-1-specific T-cell receptor. Mol. Ther. 23, 1541–1550, https://doi.org/10.1038/mt.2015.60 (2015).
Article PubMed PubMed Central Google Scholar
Antunes, D. A. et al. Structural In Silico analysis of cross-genotype-reactivity among naturally occurring HCV NS3-1073-variants in the context of HLA-A*02:01 allele. Mol. Immunol. 48, 1461–1467, https://doi.org/10.1016/j.molimm.2011.03.019 (2011).
Article CAS PubMed Google Scholar
Zhang, S. et al. Frequency, private specificity, and cross-reactivity of preexisting hepatitis C virus (HCV)-specific CD8+ T cells in HCV-seronegative individuals: implications for vaccine responses. J. Virol. 89, 8304–8317, https://doi.org/10.1128/JVI.00539-15 (2015).
Article CAS PubMed PubMed Central Google Scholar
Adams, J. J. et al. Structural interplay between germline interactions and adaptive recognition determines the bandwidth of TCR-peptide-MHC cross-reactivity. Nat. Immunol. 17, 87–94, https://doi.org/10.1038/ni.3310 (2016).
Article CAS PubMed Google Scholar
Mendes, M. F., Antunes, D. A., Rigo, M. M., Sinigaglia, M. & Vieira, G. F. Improved structural method for T-cell cross-reactivity prediction. Mol. Immunol. 67, 303–310, https://doi.org/10.1016/j.molimm.2015.06.017 (2015).
Article CAS PubMed Google Scholar
Dhanik, A. et al. In-silico discovery of cancer-specific peptide-HLA complexes for targeted therapy. BMC Bioinformatics 17, 286, https://doi.org/10.1186/s12859-016-1150-2 (2016).
Article PubMed PubMed Central Google Scholar
Jaravine, V., Raffegerst, S., Schendel, D. J. & Frishman, D. Assessment of cancer and virus antigens for cross-reactivity in human tissues. Bioinformatics 33, 104–111, https://doi.org/10.1093/bioinformatics/btw567 (2017).
Article PubMed Google Scholar
Hawse, W. F. et al. Peptide modulation of class I major histocompatibility complex protein molecular flexibility and the implications for immune recognition. J. Biol. Chem. 288, 24372–24381, https://doi.org/10.1074/jbc.M113.490664 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kurimoto, E. et al. Structural and functional mosaic nature of MHC class I molecules in their peptide-free form. Mol. Immunol. 55, 393–399 (2013).
Article CAS PubMed Google Scholar
Yanaka, S. & Sugase, K. Exploration of the conformational dynamics of major histocompatibility complex molecules. Front. Immunol. 8, 632, https://doi.org/10.3389/fimmu.2017.00632 (2017).
Article PubMed PubMed Central Google Scholar
Novinskaya, A., Devaurs, D., Moll, M. & Kavraki, L. E. Defining low-dimensional projections to guide protein conformational sampling. J. Comput. Biol. 24, 79–89, https://doi.org/10.1089/cmb.2016.0144 (2017).
Article CAS PubMed Google Scholar
Kukol, A. Consensus virtual screening approaches to predict protein ligands. Eur. J. Med. Chem. 46, 4661–4664, https://doi.org/10.1016/j.ejmech.2011.05.026 (2011).
Article CAS PubMed Google Scholar
Schymkowitz, J. et al. The FoldX web server: an online force field. Nucleic Acids Res. 33, W382–388, https://doi.org/10.1093/nar/gki387 (2005).
Article CAS PubMed PubMed Central Google Scholar
Bordner, A. J. Towards universal structure-based prediction of class II MHC epitopes for diverse allotypes. PLoS ONE 5, e14383, https://doi.org/10.1371/journal.pone.0014383 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Lundegaard, C., Lund, O. & Nielsen, M. Prediction of epitopes using neural network based methods. J. Immunol. Methods 374, 26–34, https://doi.org/10.1016/j.jim.2010.10.011 (2011).
Article CAS PubMed Google Scholar
Wang, S. et al. Improving the prediction of HLA class I-binding peptides using a supertype-based method. J. Immunol. Methods 405, 109–120, https://doi.org/10.1016/j.jim.2014.01.015 (2014).
Article CAS PubMed Google Scholar
Han, Y. & Kim, D. Deep convolutional neural networks for pan-specific peptide-MHC class I binding prediction. BMC Bioinformatics 18, 585, https://doi.org/10.1186/s12859-017-1997-x (2017).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by NIH (grant number 1R21CA209941-01), through the Informatics Technology for Cancer Research (ITCR) initiative of the National Cancer Institute (NCI), and by a training fellowship from the Gulf Coast Consortia, on the Computational Biology Training Program (CPRIT Grant No. RP170593). Additionally, this work was partially supported by a fellowship (202186/2014-8) from CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnólgico), under the Brazilian Scientific Mobility Program. Finally, this work was also partially supported by the Big-Data Private-Cloud Research Cyberinfrastructure MRI-award funded by NSF under grant CNS-1338099 and by Rice University.

Author information

Authors and Affiliations

Department of Computer Science, Rice University, Houston, TX, 77005, USA
Dinler A. Antunes, Didier Devaurs, Mark Moll & Lydia E. Kavraki
Department of Melanoma Medical Oncology - Research, The University of Texas MD Anderson Cancer Center, Houston, TX, 77054, USA
Gregory Lizée

Authors

Dinler A. Antunes
View author publications
You can also search for this author in PubMed Google Scholar
Didier Devaurs
View author publications
You can also search for this author in PubMed Google Scholar
Mark Moll
View author publications
You can also search for this author in PubMed Google Scholar
Gregory Lizée
View author publications
You can also search for this author in PubMed Google Scholar
Lydia E. Kavraki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.A.A., L.E.K. and G.L. suggested the initial idea behind this work. D.A.A. and D.D. conceived the experiments. D.A.A., L.E.K. and M.M. worked on the protocols used. D.A.A. selected the pMHC targets and conducted the experiments. M.M. supervised computational choices and software development. D.A.A. and D.D. analyzed the results. D.A.A. wrote the paper. All authors reviewed the manuscript.

Corresponding author

Correspondence to Lydia E. Kavraki.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Antunes, D.A., Devaurs, D., Moll, M. et al. General Prediction of Peptide-MHC Binding Modes Using Incremental Docking: A Proof of Concept. Sci Rep 8, 4327 (2018). https://doi.org/10.1038/s41598-018-22173-4

Download citation

Received: 12 May 2017
Accepted: 16 February 2018
Published: 12 March 2018
DOI: https://doi.org/10.1038/s41598-018-22173-4

This article is cited by

Using parallelized incremental meta-docking can solve the conformational sampling issue when docking large ligands to proteins
- Didier Devaurs
- Dinler A Antunes
- Lydia E Kavraki
BMC Molecular and Cell Biology (2019)
Highly conserved hemagglutinin peptides of H1N1 influenza virus elicit immune response
- Neha Lohia
- Manoj Baranwal
3 Biotech (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.