HLA-B57 micropolymorphism defines the sequence and conformational breadth of the immunopeptidome

Illing, Patricia T.; Pymm, Phillip; Croft, Nathan P.; Hilton, Hugo G.; Jojic, Vladimir; Han, Alex S.; Mendoza, Juan L.; Mifsud, Nicole A.; Dudek, Nadine L.; McCluskey, James; Parham, Peter; Rossjohn, Jamie; Vivian, Julian P.; Purcell, Anthony W.

doi:10.1038/s41467-018-07109-w

Download PDF

Article
Open access
Published: 08 November 2018

HLA-B57 micropolymorphism defines the sequence and conformational breadth of the immunopeptidome

Nature Communications volume 9, Article number: 4693 (2018) Cite this article

6555 Accesses
29 Citations
57 Altmetric
Metrics details

Subjects

Abstract

Immunophenotypic differences between closely related human leukocyte antigen (HLA) alleles have been associated with divergent clinical outcomes in infection, autoimmunity, transplantation and drug hypersensitivity. Here we explore the impact of micropolymorphism on peptide antigen presentation by three closely related HLA molecules, HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01, that are differentially associated with the HIV elite controller phenotype and adverse drug reactions. For each allotype, we mine HLA ligand data sets derived from the same parental cell proteome to define qualitative differences in peptide presentation using classical peptide binding motifs and an unbiased statistical approach. The peptide repertoires show marked qualitative overlap, with 982 peptides presented by all allomorphs. However, differences in peptide abundance, HLA-peptide stability, and HLA-bound conformation demonstrate that HLA micropolymorphism impacts more than simply the range of peptide ligands. These differences provide grounds for distinct immune reactivity and insights into the capacity of micropolymorphism to diversify immune outcomes.

An autoantibody signature predictive for multiple sclerosis

Article 19 April 2024

Colin R. Zamecnik, Gavin M. Sowa, … Michael R. Wilson

The present and future of bispecific antibodies for cancer therapy

Article 06 March 2024

Christian Klein, Ulrich Brinkmann, … Roland E. Kontermann

A lipid atlas of human and mouse immune cells provides insights into ferroptosis susceptibility

Article 08 April 2024

Pooranee K. Morgan, Gerard Pernes, … Andrew J. Murphy

Introduction

The human leucocyte antigen (HLA) molecules, encoded by the major histocompatibility complex (MHC) region of the genome, are cell surface glycoproteins responsible for the presentation of both endogenous and exogenously derived peptide antigens for immune surveillance. The introduction of novel complexes into this array, such as those containing peptides derived from invading pathogens, stimulates immune responses against infected cells. The genes encoding the HLA molecules (HLA-A, -B and -C for the classical HLA class I molecules, and HLA-DP, -DQ and -DR for the HLA class II molecules) are the most polymorphic of the human genome, with HLA-B alone possessing over 3000 functional allomorphs¹. Sequence diversity in HLA class I molecules ranges from micropolymorphisms, which comprise just a few amino acids, to differences of more than 30 amino acids in more distantly related allomorphs. Peptides bind to HLA molecules via interactions between the side chains of anchor residues of the peptide and pockets within the antigen-binding cleft. In the HLA class I molecules, these pockets are denoted A–F, and a large part of their landscape is determined by polymorphic amino acid residues. These polymorphisms alter the stereo- and electrochemical environment of the pockets, dictating their ability to accommodate different peptide side chains thereby influencing the nature and quantity of peptides that are bound by a given HLA allomorph^2,3,4. The nature of the peptide anchor residues accommodated by a particular HLA molecule is often referred to as the peptide-binding motif. Polymorphism further shapes the peptide array via impacting interactions with chaperones such as tapasin, which modulates peptide selection during peptide loading in the endoplasmic reticulum, biasing the peptide repertoire towards more stable ligands^5,6.

Strikingly, polymorphism at a single amino acid in the antigen-binding cleft can cause divergent immune reactivity in many clinical scenarios. For example, ankylosing spondylitis is associated with some, but not all, HLA-B27 family members; for instance, HLA-B*27:02, -B*27:03, -B*27:04 and -B*27:05 confer risk, whilst micropolymorphic family members HLA-B*27:06 and -B*27:09 do not (reviewed⁷). Although the differential association of HLA-B27 allomorphs with ankylosing spondylitis has long been thought to be directly related to differences in ligand-binding characteristics, our recent studies have challenged this hypothesis, and show a more quantitative impact of micropolymorphism on the immunopeptidome rather than merely effects on ligand binding^8,9. Similarly, abacavir hypersensitivity syndrome (AHS), a severe systemic hypersensitivity reaction to the antiretroviral drug abacavir and drug-induced liver injury mediated by the antibiotic flucloxacillin, are associated with HLA-B*57:01^10,11, whilst the closely related HLA-B*57:03, containing two amino acid substitutions in the antigen-binding cleft shows no association. Similarly, HLA-B*58:01, possessing four substitutions in the antigen-binding cleft, shows no association with AHS¹², but is instead strongly associated with allopurinol hypersensitivity¹³. It has been proposed that associations with adverse drug reactions are due to the unique ability of the associated HLA class I allomorph to present antigenic ligands, whether they be self-peptides, drug-modified peptides, or directly presented small-molecule drugs/metabolites¹⁴. However, whilst this view stands true for abacavir, which is uniquely accommodated within the antigen-binding cleft of HLA-B*57:01 in the vicinity of residues that are polymorphic between HLA-B*57:03 and HLA-B*58:01^15,16, it may be too simplistic in the context of peptide presentation. For example, altered presentation of the same peptides by micropolymorphic allomorphs has been reported to impact immunogenicity and immunodominance hierarchies in the HLA-B35 family through altered plasticity and binding kinetics^17,18, whilst distinct conformations of identical ligands presented by members of the HLA-B7 family are proposed to favour distinct escape mutations in human immunodeficiency virus (HIV)¹⁹. Equally, a single residue can delineate tapasin dependence for peptide loading and define the susceptibility of an HLA molecule to viral interference with the peptide loading pathway^5,20.

HLA-B57 family members are renowned for their association with the elite controller phenotype of HIV-infected individuals^21,22. The protective effect of HLA-B57 is hypothesised to be due to more efficient presentation of immunogenic HIV peptides to antiviral cytotoxic T lymphocytes than by non-protective HLA variants (possessing disparate peptide-binding properties). However, despite the near-identical nature of the previously described HLA-ligand-binding motif within this family, a protective hierarchy is still evident and distinctions in T-cell response to HIV epitopes are manifest²³. Moreover, despite similar modes of presentation by HLA-B*57:01 and HLA-B*57:03, the immunodominant HIV-Gag-derived peptide KAFSPEVIPMF stimulates divergent T-cell responses in the context of these two allomorphs, with HLA-B*57:01 presentation generating a T-cell response able to recognise escape variants^24,25.

These examples show the capacity for minor changes of the antigen-binding cleft to have marked effects on immune response and generate an impetus to understand more broadly the complexities of antigen presentation within the HLA-B57 family. HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 are micropolymorphic HLA allotypes of the HLA-B17 serotype. Polymorphism within these allotypes is focussed on regions of the antigen-binding cleft (cleft polymorphisms in comparison to HLA-B*57:01: Asp114Asn and Ser116Tyr in HLA-B*57:03, and Met45Thr, Ala46Glu, Val97Arg and Val103Leu in HLA-B*58:01). Notably, residues 97, 114 and 116 contribute to the E pocket of the antigen-binding cleft, whilst residue 116 also contributes to the F pocket. These are key locations of interaction between peptide ligands and the HLA heavy chain, with the F pocket accommodating the C-terminal anchor residue (PΩ), and the E pocket interacting with PΩ-2. The use of mass spectrometry to resolve binding preferences of single HLA allotypes is well established, and allows comparison of both micropolymorphic and distantly related molecules^8,26.

Here we utilised a large database of constitutive peptide ligands isolated from HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01¹⁵, to map micropolymorphism-dependent changes in the HLA peptide repertoire. Although speculated upon in many studies^27,28, our study formally shows that subtle differences in primary and secondary anchor preferences of each allomorph correlate with altered stability of the respective HLA-peptide (pHLA) complexes and altered conformation of common peptides within the antigen-binding cleft. The ability to resolve these key differences is crucial to understanding altered disease outcomes between individuals having closely related HLA molecules that can represent ‘taboo mismatches’ in clinical transplantation^28,29.

Results

Qualitative resolution of the HLA peptide-binding motifs

To measure the impact of micropolymorphism on peptide presentation by HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 (cleft polymorphism locations depicted in Fig. 1a, b, described as changes in comparison to HLA-B*57:01 for B*57:03/B*58:01 throughout the manuscript) we interrogated large monoallelic ligand data sets (>2500 non-redundant peptide sequences) generated by isolation of naturally processed and presented HLA class I-bound peptides from individually transfected class I reduced (C1R) cells¹⁵. Of note, derivation from the same parental cell line ensured differences in peptide repertoire were attributable to differences in determinant selection by the HLA allotypes and not the source proteome or polymorphisms within antigen-processing machinery.

Consistent with previously reported HLA class I ligands, peptides bound to the three HLA-B allotypes were predominantly 9–11 residues in length with a preference for nonamers (Fig. 1c, Supplementary Data 1). Sequence motifs, generated from non-redundant lists of all 9–11mers depict specific amino acid preferences at each position of the peptide ligand. These preferences are very similar for the three allotypes, which display bias for Ser (S), Thr (T), Ala (A) and to a lesser extent Val (V) at P2, and aromatic residues at the C terminus (PΩ) (Fig. 1d–f, Supplementary Fig. 1a–c and 2a–c). Notably, whilst Trp (W) was the most prevalent PΩ residue for HLA-B*57:01 and HLA-B*58:01 ligands (62–80% of 9–11mers), HLA-B*57:03-bound peptides showed a higher prevalence of Phe (F) (43–50% Phe compared to 22–34% Trp across 9–11mers) (Supplementary Table 1, Fig. 1d–f, Supplementary Fig. 1a–c and 2a–c). In addition to traditional motif analysis, we also performed an unbiased statistical analysis of the allomorph-specific peptidomes using co-variation analysis (Fig. 1d–f). The P2 and PΩ preferences were observed to exist as conserved pairings within peptides with Ser at P2 strongly paired with PΩ Trp (HLA-B*57:01 and HLA-B*58:01) or Phe (HLA-B*57:03) in 9mers consistent with their nature as primary anchors (Supplementary Fig. 3).

In order to more deeply probe differences in ligand-binding characteristics, the physicochemical properties of each amino acid residue were used to independently compare the source of variation between the peptide repertoires. Each bound peptide was defined by a set of four parameters (in addition to 20 amino acid identity parameters) at each amino acid position: molecular weight, surface area, hydropathy index and isoelectric point. Consideration of broader physicochemical properties can capture similarities among amino acids that may be missed using traditional motif analysis³⁰. Peptides of 9–11 residues in length that formed ligands of the three allotypes were then independently subjected to principal component analysis (PCA). For the three peptide lengths, the HLA ligands of each allotype were distributed asymmetrically across two distinct clusters defined by PC1 and PC2 (although further substructure was also observed for 9 and 11mers [Supplementary Fig. 4 and 5]). Although ligands mapped to both clusters for all variants, suggesting the potential to sample peptides of similar physicochemical properties, the bias of peptides between the clusters was inverted for HLA-B*57:03 relative to HLA-B*57:01 and -B*58:01 (Fig. 1g–i, Supplementary Fig. 1d–f and 2d–f). Parameters distinguishing these clusters include features of PΩ, reflected by the enrichment of Phe in cluster 1 (c₁) and Trp in cluster 2 (c₂) of the 9mer PCA plots, demonstrating this location is the main point of difference among the repertoires (Supplementary Fig. 6 and 7). Features of P1 were major contributors to PC2 (and PC1 for 10 and 11mers), and distributed peptides within these major clusters in a similar fashion for all allotypes. In addition to the amino acid identity, the PCA provides additional insights into ligand selection by the different allomorphs. Importantly, a major driver of the PCA was the amino acid surface area at PΩ, which was not the case at P2 where hydropathy index was the most important property. This pattern was true for each of the three allotypes and was independent of peptide length.

In addition to the main anchor residues, HLA-B*57:01 ligands showed a higher Arg (R), and to a lesser extent Lys (K), prevalence at PΩ-2 (i.e. P7, P8 and P9 of 9mer, 10mer and 11mer peptides, respectively), whilst HLA-B*58:01 ligands displayed greater PΩ-2 Glu (E) (Fig. 1d–f, Supplementary Fig. 1a–c and 2a–c, Supplementary Table 2). We therefore hypothesised that PΩ-2 is a secondary anchor site, shaped by polymorphic residues of the E pocket (Fig. 1a, b). Consequently, to probe the nature of the anchor sites more stringently we assessed enrichment of amino acid use at a particular location relative to global prevalence in the human proteome using iceLogo software³¹ (iceLogo v1.2, static reference method, Homo sapiens Swiss-prot means, p < 0.05). Enrichment was depicted as a fold change (FC) in prevalence at primary anchor locations P2 and PΩ, and the potential secondary anchor site PΩ-2, for 9mer peptides (Fig. 2). At both P2 and PΩ, strong enrichment of a small subset of amino acids (Ala, Ser, Thr and Val for P2 and Phe, Trp and Tyr for PΩ) was displayed whilst other amino acids were disfavoured (i.e. less prevalent than in the human proteome, depicted as converted FC [FC_con]). This was particularly evident at PΩ, although HLA-B*57:03 showed some enrichment of Leu, Ile and Met in addition to aromatic residues (Fig. 2c). In contrast with the primary anchor sites, no amino acids were strongly disfavoured at PΩ-2. However, enrichment of Arg was a distinct feature of HLA-B*57:01 (FC 1.73), compared to HLA-B*57:03 (FC_con −4.65) and HLA-B*58:01 (FC_con −4.92) (Fig. 2b, red box). Lys at PΩ-2, though not significantly enriched for HLA-B*57:01 (FC 1.07, p > 0.05), was disfavoured by HLA-B*57:03 (FC_con −2.55) and B*58:01 (FC_con −2.20). Although most prevalent in the repertoire of HLA-B*58:01, Glu was enriched at PΩ-2 for all allotypes (FC 1.36–1.65). Pro showed enrichment at PΩ-2 for HLA-B*57:03 alone (FC 1.71), and was present in >10% of 9mer ligands of this allomorph, however this was not the case for 10 and 11mer peptides (Figs. 1d–f and 2b, Supplementary Fig. 1, 2). Collectively, these data resolve three distinct peptide-binding motifs based on the physicochemical properties and amino acid occupancy at different positions of the bound peptide ligand.

Quantitative differences in presentation of common peptides

As anticipated there is a large overlap in peptide ligands presented with 982 peptides (by sequence alone) common to all three HLA-B allotypes (26–33% of the allotypic repertoire), increasing to 1361–1546 (38–51%) between allotype pairs (Supplementary Data 2). During tandem mass spectrometry (MS/MS) analysis, high quality of sequencing data was achieved by limiting fragmentation to the 30 most abundant species observed per second¹⁵. This selection resulted in the potential to miss less-abundant ions, especially in chromatographic regions of high complexity. Thus, it is likely that some of the peptides identified as unique to a particular allotype by liquid chromatography (LC)-MS/MS-based peptide identification were present at low concentrations in eluates from the other allotypes and as such failed to be selected for MS/MS.

Targeted LC-MS techniques such as multiple reaction monitoring (MRM) have been used to detect both high and low abundance ligands in highly complex samples eluted from the MHC and allow comparison of abundance^32,33. Therefore, we designed an LC-MRM-MS approach to assess whether peptides were identified as unique to a particular allotype due to sampling issues rather than binding specificity. To this purpose, we generated a representative list containing 60 native peptides from those identified by MS/MS in biological replicate experiments for a given allotype and could be identified in eluates from 10⁸ cells of at least one allotype. These peptides were selected from each of the following categories: unique to an allotype, common to all allotypes or common to two of the three allotypes. Three new independent biological replicates were analysed per allotype by LC-MRM-MS and individual peptide relative abundance determined across samples.

Using this sampling independent approach 5/17, 7/28 and 6/25 (3/17, 7/28 and 5/25 in multiple replicates) of the peptides not previously identified in eluates from HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 respectively were detected, generally at considerably lower relative abundance than for the allotypes for which they were initially described as ligands (Fig. 3), supporting the rationale for this targeted approach. Indeed, although only identified with a deaminated Arg at P9 in the immunopeptidome of HLA-B*57:01 by LC-MS/MS (confidence 95, >5% false discovery rate (FDR), Supplementary Data 2), native SAAADETLRLW contributed most to the immunopeptidome of this allotype. Quantitative differences across the allotypes were observed for all peptides, even those that were consistently isolated from all three allotypes in the original non-targeted LC-MS/MS experiments, implying quantitative, as much as qualitative, differences distinguish the immunopeptidomes of these closely related allotypes.

PΩ and PΩ-2 preferences correlate with pHLA stability

Given the differences in PΩ and PΩ-2 anchor preferences, we examined the impact of these residues on pHLA complex stability. To do so we chose two 9mers containing Arg/Lys at P7 (PΩ-2) that formed a natural part of the HLA-B*57:01 peptide repertoire, LSSPVTKSF and LTVQVARVY, and designed P7/9 variants, to utilise in thermal stability experiments. LSSPVTKSF was also identified in the repertoires of HLA-B*57:03 and HLA-B*58:01 by LC-MS/MS, and the structure of LSSPVTKSF in complex with HLA-B*57:01 has been published previously, showing a salt-bridge between P7Lys and Asp114 of the HLA-B*57:01 heavy chain (PDB 2RFX)³⁴. LTVQVARVY was not detected in the repertoire of HLA-B*57:03 or HLA-B*58:01 (Fig. 3).

LTVQVARVY complexes were markedly more stable in the context of HLA-B*57:01 than HLA-B*57:03 (~9 °C difference in temperature for 50% unfold [T_m], Table 1). Consistent with the enrichment of P7Arg by HLA-B*57:01 alone, the P7Gln mutation (chosen due to similar enrichment of this residue at P7 by all allotypes, Fig. 2b) reduced this difference to ~4 °C by increasing the stability of HLA-B*57:03 complexes and reducing the stability of HLA-B*57:01 complexes. In contrast, the substitution of the P9Tyr for Trp, which is more enriched at this location in ligands of both allotypes, improved the stability of both HLA-B*57:01 (+2.8 °C) and HLA-B*57:03 (+8.6 °C) complexes. The P7Gln mutation had less impact in the context of P9Trp (<2 °C) for both HLA-B*57:01 and HLA-B*57:03 but increased the stability of HLA-B*58:01 complexes (+4.6 °C), consistent with disfavoured P7Arg by HLA-B*58:01 (Fig. 2b).

Table 1 The impact of PΩ-2 and PΩ peptide residue substitutions on HLA-peptide complex thermal stability

Full size table

For pHLA containing LSSPVTKSF, the impacts of P7/P9 substitutions were less pronounced. For all allotypes mutation of the P7Lys to Gln had marginal impact on the T_m (<1 °C), although the P9Phe to Trp mutation showed a trend towards stabilising all complexes (+1.8–2.5 °C). Introduction of the P7Gln mutation also had minimal impact in the context of P9Trp mutation. The reduced influence of the P7Lys to Gln mutation is consistent with the weaker enrichment/diminution of P7Lys as compared to P7Arg displayed by each allotype (Fig. 2b). Similarly, all allotypes show less selective discrimination between Phe and Trp at P9 than between Tyr and Trp (Fig. 2c), consistent with a smaller impact of mutation of P9 to Trp in the context of LSSPVTKSF compared to LTVQVARVY.

SAAADETLRLW, detected in the repertoire of HLA-B*58:01 (MS/MS and MRM) and HLA-B*57:01 (MRM only) as described above, was also subject to thermal stability assays. The trend in thermal stability across the allomorphs (HLA-B*57:01 [67.5 °C] > HLA-B*58:01 [64.4 °C] > HLA-B*57:03 [61.3 °C], Table 1) correlated with the relative abundance detected in their repertoires by MRM (HLA-B*57:01 > HLA-B*58:01 > HLA-B*57:03, Fig. 3).

Conformations of common ligands differ between allomorphs

To understand the structural effects of micropolymorphism between HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 and its relationship to peptide association, crystal structures in complex with the peptides LTVQVARVW, LTVQVARVY and LSSPVTKSW were determined to resolutions of 1.6–2.0 Å (data collection and refinement statistics summarised in Table 2). The high quality of the resultant models allowed for direct and reliable comparison of the structures (Supplementary Fig. 8). The three HLA molecules had similar overall tertiary structures for each peptide complex (root mean square deviation values ranging from 0.16 to 0.51 Å over Cα positions for residues 1–175) and there were no significant deviations in the secondary structure elements of the peptide-binding cleft (Supplementary Fig. 9). As such, the differential peptide-binding preferences were not due to any gross structural differences but rather to subtle differences in the architecture of the peptide-binding pockets.

Table 2 Data collection and refinement statistics

Full size table

The three HLA molecules are differentiated by polymorphisms distributed across the length of the peptide-binding groove (Fig. 1a, b). As a consequence of these substitutions the C, D, E and F pockets of HLA-B*57:01 were the deepest and most negatively charged of the three allomorphs (Fig. 4a). HLA-B*58:01 had the most shallow C and D pockets and B*57:03 the shallowest E and F pockets (Fig. 4b, c). Peripheral to the peptide-binding groove are further substitutions at positions Ala46Glu and Val103Leu (Fig. 1b) that subtly affect the structure of the β3-β4 and β5-β6 loops respectively (Supplementary Fig. 9). However, it should be noted that the location of these peripheral polymorphisms suggests that they are unlikely to directly influence T-cell receptor (TCR) recognition. These observations are consistent with the shared P2 anchor preference across the allomorphs, whilst the shallower F pocket of HLA-B*57:03 correlates with greater permissiveness for Phe and smaller non-aromatic anchors at PΩ (Fig. 2c) and the reduction in preference for high amino acid surface area at PΩ in the PCA. Whilst differences in F pocket architecture did not impact upon the conformation of Phe and Trp residues at PΩ, Tyr116 of HLA-B*57:03 restricted the accommodation of the hydroxyl group of PΩ Tyr, illustrated by a 1.4 Å shift of this group observed between HLA-B*57:01-LTVQVARVY and HLA-B*57:03-LTVQVARVY structures (Fig. 4f).

Notwithstanding the differences at the F pocket, the more striking differences in peptide conformation appear to be engendered by the respective D and E pocket environments. The depth and negative charge of the D and E pockets of HLA-B*57:01 accommodated the Arg and Lys residues at the PΩ-2 position of LTVQVARVW and LSSPVTKSW with the guanidino-head group of the Arg bound deeply within the E pocket, interacting with Asp114 (Fig. 4d, e, g). HLA-B*58:01 was similarly able to accommodate Arg and Lys residues within the E pocket, although the manner in which the Arg at PΩ-2 interacted with Asp114 differed. That is, in HLA-B*57:01-LTVQVARVW the PΩ-2 Arg formed a bi-dentate salt-bridge interaction with Asp114 (Fig. 4g), whilst in HLA-B*58:01-LTVQVARVW the PΩ-2 Arg side chain was twisted by 60° and the Asp114 side chain rotated 90° to accommodate the Val97Arg micropolymorphism (Fig. 4h). This altered the salt-bridge between the PΩ-2 Arg and Asp114 to a less favourable conformation in HLA-B*58:01, consistent with the reduced thermal stability of the complex (Fig. 4h, Table 1). In contrast, Arg and Lys were unable to be accommodated in the E pocket of HLA-B*57:03. Instead, due to the reduced E pocket volume caused by the Ser116Tyr substitution, these residues deviated by 9.7 Å and pointed out of the groove (Fig. 4d–f, i). The deviation at PΩ-2 was concomitant with a 5.3 Å shift and 180° rotation of the PΩ-3 residue to fill the C-pocket of HLA-B*57:03 (Fig. 4d–f). Overall, these structures showed that the buried polymorphisms generated minimal differences to the surface of the HLA molecule available to TCRs whilst engendering marked differences in the peptide surface landscape available for T-cell interaction.

Reciprocal T-cell alloreactivity occurs between allotypes

Vigorous T-cell alloresponses can be generated by a high degree of HLA class I mismatching between allogeneic individuals or as little as a single amino acid mismatch (e.g. across HLA-B44 allotypes)^28,35,36,37. Here we examined whether the closely related alleles HLA-B*57:01 and HLA-B*58:01 are capable of eliciting either anti-HLA-B*58:01 (B*57:01 responder vs B*58:01 stimulator) or anti-HLA-B*57:01 (B*58:01 responder vs B*57:01 stimulator) CD8⁺ T-cell alloreactivity, which would support distinct presentation of the immunopeptidome. Experiments did not include HLA-B*57:03 due to a lack of availability of HLA-B*57:03⁺ donors who are rare in Caucasian populations^38,39. A total of 16 unidirectional mixed lymphocyte reactions (MLRs) were performed utilising a combinatorial matrix incorporating six healthy individuals (Fig. 5a, d, Supplementary Table 3). Alloreactive T cells were expanded for 13 days, after which these bulk T-cell cultures were restimulated with a panel of B-lymphoblastoid cell lines (B-LCLs) expressing the mismatched stimulator HLA-A and -B alloantigens (Supplementary Table 3) to dissect their individual contribution (measured by interferon-gamma (IFNγ) production) to the overall alloresponse.

The first set of MLRs (1–4, 11, 12, 15 and 16; Fig. 5a, Supplementary Table 3) were designed to evaluate anti-HLA-B*58:01 T-cell alloreactivity between responders expressing HLA-B57 (DHS011), HLA-B*57:01 (DHS006 and DHS009) or both HLA-B*57:01 and HLA-B*58:01 (heterozygote, AP012), and stimulators expressing HLA-B*58:01 (AP013 and AP015). All HLA-A and -B mismatched alloantigens expressed by the stimulators, including HLA-B*58:01, generated alloreactive CD8⁺ T-cell responses. In contrast, no allo-specific CD8⁺ T cells were directed against HLA-B*58:01 in the heterozygote responder (Fig. 5b, c). The second set of MLRs (5–10, 13 and 14; Fig. 5d, Supplementary Table 3) were designed to evaluate anti-HLA-B*57:01 T-cell alloreactivity between responders expressing HLA-B*58:01 (AP013 and AP015), HLA-B*57:01 (matched, DHS006) or both HLA-B*57:01 and HLA-B*58:01 (heterozygote, AP012) and stimulators expressing HLA-B57 (DHS011) or HLA-B*57:01 (DHS009). Similar to the first set of MLRs, all HLA-A and -B mismatched alloantigens expressed by the stimulators generated alloreactive CD8⁺ T-cell responses, whilst no allo-specific CD8⁺ T cells were directed against HLA-B*57:01 by either the HLA-B*57:01/B*58:01 heterozygote or the HLA-B*57:01 matched responder (Fig. 5e, f). Thus, differences in self-peptide presentation by HLA-B*57:01 and HLA-B*58:01, not background proteome, generate alloreactivity between these related molecules.

Discussion

In order to resolve the impacts of micropolymorphism on the peptide repertoire, we comprehensively analysed data sets comprising 2673 HLA-B*57:01-bound peptides, 3168 HLA-B*57:03-bound peptides and 2526 HLA-B*58:01-bound peptides isolated from monoallelic C1R transfectants¹⁵. Due to differences in the ionisation efficiencies of individual peptides, which precludes absolute peptide quantitation without the introduction of sequence-matched isotope-labelled standards, distribution of specific sequence features across the population of peptides was used to define and compare the peptide-binding motif of each allotype. The analysis was performed at three levels; first, at the level of amino acid preference individually for 9, 10 or 11mer peptide ligands; second, at the level of amino acid physical chemistry using a recently established PCA-based statistical analysis of the data; and finally, at the level of proteome enrichment. The majority of peptide ligands identified were 9–11 residues in length and possessed Ser, Thr or Ala at P2, and aromatic residues at their C terminus, consistent with investigations of HLA-B*57:01 and HLA-B*58:01 from other groups^26,40,41. Of the three allotypes, the previously underexplored HLA-B*57:03 had the most distinctive binding preferences. Although length and P2 preferences were equivalent to the other allomorphs, preferences at PΩ differed, showing greater enrichment of Phe and greater sampling of ligands with smaller PΩ residues. Despite this, HLA-B*57:03, like HLA-B*57:01 and HLA-B*58:01, was more stable when in complex with peptides containing the bulky PΩTrp (compared to PΩTyr or Phe). The incongruity between the stabilisation effects conferred by Trp vs Phe at P9 and their prevalence in the repertoire of HLA-B*57:03 suggests an interplay between ligand availability (Trp and Phe constitute approximately 1 and 4% of the human proteome respectively) and complex stability in shaping the resultant peptide repertoire, which may be further influenced by interactions with the peptide-loading complex. Indeed, evidence of a hierarchy of tapasin dependence between these allomorphs (HLA-B*57:01 > HLA-B*58:01 > HLA-B*57:03)⁴² may indicate that HLA-B*57:03 can more readily escape the benefits of peptide editing within the peptide-loading complex, as suggested for other alleles⁴³.

We further defined a secondary anchor site that modulated peptide affinity for the HLA molecule and distinguished the HLA-B*57:01 peptide-binding motif. Arg appeared at P7 of 9mers (and PΩ-2 of longer peptides) almost exclusively in the HLA-B*57:01 data set and correlated with stabilisation of HLA-B*57:01 complexes by PΩ-2 Arg. In contrast, PΩ-2 Arg negatively impacted the stability of HLA-B*57:03 and HLA-B*58:01 complexes. Structural analysis of bound peptide conformation showed a pronounced change in orientation of PΩ-2 Arg/Lys conferred by the Ser116Tyr polymorphism of HLA-B*57:03 that would markedly change the surface presented to T cells. More subtle changes were induced by the Val97Arg polymorphism in HLA-B*58:01. These observations strongly parallel differences in peptide presentation between micropolymorphic allotypes HLA-B*35:01 and HLA-B*35:08, in which a Leu156Arg substitution generates a secondary anchor site that improves the binding kinetics of peptides containing a negatively charged residue at P5. Of note, this results in distinct immunodominance hierarchies for human cytomegalovirus pp65 T-cell epitopes in HLA-B*35:01⁺ and HLA-B*35:08⁺ individuals¹⁸. In addition, the Leu156Arg polymorphism can alter peptide conformation and the plasticity of bound ligands, resulting in divergent T-cell responses to several Epstein Barr virus epitopes. Changes in T-cell responses were attributed to the adoption of different conformations within the antigen-binding cleft and/or the ability of the HLA-peptide complex to accommodate conformational change on TCR engagement^17,44,45. Although HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 all associate with long-term non-progression of HIV-1 infection to acquired immunodeficiency syndrome^21,22, differences in viral load between patients possessing different HLA-B57/58 alleles correlate with differential immunogenicity of identical peptide ligands²³. The differences in peptide presentation described here provide grounds for this differential T-cell recognition of B57/B58-bound ligands. Indeed, HLA-B*57:01 and HLA-B*57:03 restricted presentation of the 11mer KAFSPEVIPMF HIV-Gag162-172 epitope induce distinct T-cell responses²⁴. Although presented in similar conformations by both molecules, Tyr116 of HLA-B*57:03 reduces the space available to accommodate changes in KAFSPEVIPMF conformation on TCR ligation, requiring re-orientation of Tyr116, and impacting TCR selection through altered TCR-pHLA affinity²⁵. In contrast, our structural analyses encompass peptides of optimal length to be contained within the antigen-binding cleft (9mers). These peptides occupy the cleft without marked bulging or overhang^{25,45,46,47,48}, however polymorphic residues of the cleft cause distinct amino acid residue orientations. Our data suggest that 9-10mer HLA-B57/58 HIV-1 epitopes possessing Arg or Lys at PΩ-2 such as QATQDVKNW (Gag308-316), and its escape variants, and AVRHFPRIW (Vpr30-38)^21,23 may adopt distinct conformations across the B57 family and generate structurally distinct targets for T-cell responses, which, in conjunction with quantitative differences in contribution to the immunopeptidome, may in turn explain the clinical differences in patients with these allotypes.

The observed alloreactivity between HLA-B*57:01 and HLA-B*58:01 further indicates that the differences in presentation of the self-proteome described are sufficient to alter recognition reminiscent of alloresponses between HLA-B*44:02 and HLA-B*44:03 molecules, which differ by a single residue buried within the antigen-binding cleft (Asp156Leu) and induce alloreactivity when mismatched in transplant scenarios^49,50. Although the residue 156 polymorphism does not impact the primary anchor pockets of these allotypes, resulting in highly similar peptide-binding motifs and immunopeptidomes, differences are sufficient to stimulate alloresponses and are augmented by the ability of these allotypes to present identical peptides in structurally distinct conformations²⁸. Thus, caution may be necessary when embarking on transplants between individuals bearing HLA-B57/58 mismatches.

In summary, we present the first comprehensive investigation of the impact of micropolymorphism on the immunopeptidome of HLA class I molecules. This has involved detailed analysis of ligand-binding specificity, qualitative and quantitative analysis of the immunopeptidomes of three clinically important HLA-B57 family members, and structural and functional characterisation of these differences. We show that micropolymorphism influences the immunopeptidome at several interlinked levels: (i) the repertoire of displayed peptides; (ii) quantity of displayed peptides; (iii) stability of pHLA, which will impact on the dynamics of the immunopeptidome; and (iv) conformation of pHLA. Importantly, such differences may amplify the responding T-cell repertoire against pathogens in heterozygous individuals but restrict transplantation options when considering micropolymorphic mismatches between donor-recipient pairings. Moreover, these findings suggest a need to look beyond qualitative analysis of the peptide repertoire when trying to unravel the nature of HLA-peptide presentation that dictates susceptibility to viral infection, autoimmunity, transplant rejection and drug hypersensitivity.

Methods

Ethics

Healthy individuals (n = 6) expressing either HLA-B*57:01, HLA-B*58:01 or both were recruited for the study. Ethics was granted from both Monash University (DHS numbers) and the Australian Bone Marrow Donor Registry (AP numbers) human ethics committees. Informed consent was obtained from all participants and research was performed in compliance with ethical regulations for the use of human samples.

Peripheral blood mononuclear cell isolation

Peripheral blood samples were collected in heparinised vacutainer tubes and peripheral blood mononuclear cells (PBMCs) were isolated by Ficoll–Paque (GE Healthcare, Sweden) and density gradient centrifugation and cryopreserved until required.

Cell lines and culture

C1R.B*57:01, C1R.B*57:03 and C1R.B*58:01 are B-LCLs, derived from the C1R cell line that expresses reduced amounts of HLA class I (reduced HLA-A2, reduced HLA-B35 and normal HLA-Cw4^51,52), and transfected with HLA-B*57:01, HLA-B*57:03 or HLA-B*58:01 cDNA cloned into the pcDNA3.1(−) vector (Invitrogen, USA)¹⁵.

T-cell alloreactivity assays included the following B-LCLs (9053⁵³: A*33:03, B*44:03 and C*14:03; T241: A*23:01, B*07:02, B*41:01, C*07:02 and C*08:02; A21: A2 and B40) and transfected cell lines (C1R.parental/A*01:01/A*02:01/A*03:01/B*07:02/B*08:01/B*15:01/B*44:02/B*44:03/B*57:01/B*58:01). C1R transfectants were produced within the McCluskey laboratory (Peter Doherty Institute, University of Melbourne, Victoria); T241 and A21 were provided by the Victorian Transplantation and Immunogenetics Service (West Melbourne, Victoria).

All cell lines were cultured in RF10 [RPMI 1640 (Life Technologies, USA) supplemented with 10% foetal calf serum (Sigma, St Louis, USA), 7.5 mM HEPES (MP Biomedicals, Germany), 100 U mL⁻¹ Pen-Strep (benzyl-penicillin/streptomycin, Life Technologies, USA), 2 mM l-glutamine (MP Biomedicals, Germany), 76 μM β-mercaptoethanolamine (Sigma-Aldrich, USA) and 150 μM non-essential amino acids (Life Technologies, USA)] at 37 °C, 5% CO₂. Maintenance of transfected HLA expression during long-term culture was facilitated by addition of Geneticin, 0.4–0.5 mg mL⁻¹ (G418; Life Technologies, USA), or Hygromycin B, 0.2–0.3 mg mL⁻¹ (Life Technologies, USA). Increased HLA class I expression (as compared to C1R parental) was confirmed via flow cytometry after staining with the HLA class I pan-specific monoclonal antibody W6/32⁵⁴ (produced in house from the W6/32 hybridoma) and Goat F(ab′)2 Anti-Mouse IgG(H + L), Human ads-PE (1:500, catalogue number 1032-09, Southern Biotech, USA). All cell lines were tested for mycoplasma contamination.

Motif characterisation and data set comparisons

We had previously isolated and sequenced peptide ligands from HLA class I of 10⁹ C1R-B*57:01, C1R-B*57:03 and C1R-B*58:01 cells by LC-MS/MS using an information-dependent acquisition (IDA) strategy¹⁵. Spectra were assigned with ProteinPilot^TM software version 5.0 (SCIEX, USA) searching against the reviewed Swiss-Prot human proteome (accessed November 2017) and peptide identities determined subject to strict bioinformatic criteria, assigning confidence values to each peptide and including the use of a decoy database to calculate the FDR. Peptides known to bind the endogenous HLA class I of C1R cells (HLA-C*04:01 and HLA-B*35:03)⁵⁵ were removed before subsequent analysis. A further list of peptide contaminants, generated by comparison of a large number of similar elution experiments for MHC I and MHCII were also disregarded, in addition to peptides of the HLA proteins (Supplementary Data 3). To characterise the peptide-binding motif of each HLA allotype, distinct peptides identified within three biological replicate experiments were filtered using a confidence cut-off for a 5% local FDR (95.2–97) and pooled to generate a single data set for analysis. The frequency of peptides (non-redundant by sequence) of specific lengths and/or possessing a particular amino acid at a specified position within the peptide was then calculated and sequence motifs generated for 9–11 residue peptides. Heat maps of the inter-position coupling matrices were generated for each of the 9mer, 10mer and 11mer peptides. Statistical coupling of two sites in the peptide was defined as the degree to which amino acid frequencies at one site change in response to a perturbation of frequencies at a second site⁵⁶. Coupling matrices were processed and analysed with custom Perl and MATLAB (The MathWorks Inc., Natick, MA) scripts⁵⁷. Scripts are available (https://github.com/jlmendozabio/covariation_stats).

For PCA based on amino acid physicochemical properties, 4 quantitative biophysical properties (molecular weight, hydropathy index, surface area and isoelectric point) were determined for each position of the peptide. We also incorporated 20 additional parameters at each amino acid position describing the identity of the amino acid present. From the points generated by the PCA, a two-dimensional kernel density plot was used to more clearly display large numbers of peptides. These variables were processed for peptides of 9–11 amino acids in length from each HLA allotype. For each different combination of PC scores, we performed clustering by k-means clustering. Silhouette analysis was used to provide a quantitative assessment of cluster similarity. On the basis of peaks in silhouette coefficient across the number of clusters, peptides were assigned into one of the two distinctive clusters present in all allotypes and for all peptide lengths by using k-means clustering (k = 2) on the first two principal components. To visualise the sequence motifs present in each cluster, peptide sequences were extracted from each cluster and their motifs generated based on residue frequency as described above. These analyses were performed using a custom R script^58,59,60,61 (available at https://github.com/ParhamLab/PeptidePCA/tree/master/R).

Amino acid enrichment/regulation over prevalence in the human proteome was determined using the icelogo v1.2 stand-alone software via the static reference method (reference Homo sapiens Swiss-Prot means), and is depicted as FC (FC = prevalence in data set/prevalence in human proteome) for enriched amino acids, and converted FC (FC_con = −1/FC) for negatively regulated residues³¹. FC or FC_con was only depicted where the Z-score fell outside the confidence interval for a p-value of 0.05. Post-translational modifications of peptides were not considered during motif analysis.

To perform sequence-based comparison of data sets for overlap within the peptide repertoire, all peptides in a data set identified with a confidence ≥ 95 were included. Peptides identified with a confidence > 20 were also included if, and only if, they appeared in a compared data set with a confidence ≥ 95. Modifications were considered in overlap analysis.

Purification of HLA-peptide complexes

C1R transfectants were grown to high density in 100 mL RF10 containing 0.5 mg mL⁻¹ G418 in T175 tissue culture flasks (Greiner Bio-One International AG, Austria). Cells were harvested in batches of 10⁸ cells by centrifugation (1200 × g, 20 min, 4 °C), washed twice in chilled phosphate-buffered saline and frozen on dry ice for 15 min or by submersion in liquid nitrogen. Pelleted cells were stored at −80 °C until time of use. Detergent-based lysis was performed by resuspending cell pellets in 5 mL lysis buffer [0.5% IGEPAL (Sigma-Aldrich, USA), 50 mM Tris, pH 8, 150 mM NaCl (Merck-Millipore, Germany) and protease inhibitors (Complete Protease Inhibitor Cocktail Tablet [1 tablet per 50 mL solution]; Roche Molecular Biochemicals, Switzerland)] and incubating for 45 min at 4 °C with slow end-over-end mixing. Lysates were cleared by centrifugation at 16 000 × g for 20 min at 4 °C.

HLA-peptide complexes were immunoaffinity purified from cell lysates using 1 mg W6/32 monoclonal antibody crosslinked to protein A sepharose⁶. Bound complexes were eluted with 2 mL 10% acetic acid. The eluted mixture of peptides, class I heavy chain and β₂-microglobulin (β₂m) was fractionated on a 4.6 mm internal diameter × 50 mm long monolithic reversed-phase (RP) C₁₈ high-performance liquid chromatography (HPLC) column (Chromolith Speed Rod, Merck-Millipore, Germany) utilising an ÄKTAmicro™ HPLC system (GE Healthcare, UK; Unicorn v5.11 software) and using a mobile phase consisting of buffer A (0.1% trifluoroacetic acid (TFA) [Thermo Scientific, USA]) and buffer B (80% acetonitrile (ACN) [Fisher Scientific, USA] and 0.1 % TFA), running at 1 mL min⁻¹ with a gradient of B of 2–40% over 4 min, 40–45% over 4 min and 45–99% over 2 min, collecting 500 μL fractions. Three fraction pools were generated and vacuum concentrated for MS analysis. Ultraviolet absorbance of eluted material was monitored at 215 nm. The relative amount of HLA purified was measured as the area under the curve for the β₂m.

MRM quantification of HLA-bound peptides

Fraction pools from the RP-HPLC purification were concentrated using a speed vacuum concentration system (LABCONCO, USA). MRM detection was performed using an AB SCIEX QTRAP 5500 mass spectrometer, equipped with a Tempo nanoLC (Eksigent) autosampler and cHiPLC nanoflex (Eksigent) and utilising Analyst 1.6 (SCIEX) software. Samples were injected and loaded onto a trap column (200 µm × 0.5 mm ChromXP C₁₈-CL packed with 3 µm particles, nominal pore size 120 Å) at a flow rate of 5 µL min⁻¹ in 98% buffer A (0.1% formic acid in water), 2% buffer B (95% ACN and 0.1% formic acid in water) for 10 min. Samples were eluted from the trap column and over a cHiPLC column (75 µm × 15 cm ChromXP C₁₈- packed with 3 µm particles, nominal pore size 120 Å) at 300 nL min⁻¹ using the following gradient conditions: 0–3 min 2–10% B, 3–62 min 10–50% B, 62–65 min 40–80% B, 65–70 min hold at 80% B, 70–73 min 80–2% B, followed by equilibration at 2% B for 7 min. The QTRAP 5500 was operated in MRM mode in unit resolution for Q1 and Q3, coupled to an IDA criterion set to trigger an EPI scan (10 000 Da s⁻¹; rolling CE; unit resolution) following any MRM transition exceeding 600 counts. Triggering MRM transitions were ignored for the subsequent 6 s.

The detection of all three to four transitions overlapping at a particular retention time, accompanied by MRM triggered-MS/MS fragmentation in at least one experiment, was used as an indicator of peptide presence. Fragment ion intensity rankings were compared to those in initial IDA-based discovery experiments using a spectral library generated from data for the three HLA allotypes using Skyline 64 bit 3.5.0.9319 (MacCoss Laboratory⁶²) and calculated as a dot product value. Peptides detected in a sample without MS/MS validation were considered valid if the retention time (RT) was ±1.5 min of the average RT for MS/MS validated appearances of that peptide and the dot product value was >0.7. Relative peptide abundance was calculated as the total area under the curve for the detected transitions using Skyline software, normalised to the amount of purified HLA from which the sample was derived, allowing comparison between samples in the absence of absolute quantitation.

Allogeneic T-cell stimulation

T-cell cultures were generated from 5 × 10⁶ responder PBMCs stimulated with 2.5 × 10⁶ irradiated allogeneic PBMCs. Culture medium was supplemented with 20 U mL⁻¹ recombinant human IL-2 (Cetus) and changed every 2–3 days to maintain saturating levels of nutrients and growth factors. On day 13, 2 × 10⁵ responders from the T-cell culture were restimulated with 10⁵ B-LCLs expressing allo-HLA. After 2 h of coincubation (37 °C, 5% CO₂), 10 µg mL⁻¹ Brefeldin A (Sigma-Aldrich, USA) was added for a further 4 h. Responder CD8⁺ T cells were stained with anti-CD8 PerCP-Cy5.5 (1:20, clone SK1, catalogue number 341051, Becton Dickinson [BD] Biosciences, USA), anti-CD4 PE (1:20, clone RPA-T4, catalogue number 555347, BD Biosciences) and a viability dye (1:750, LIVE/DEAD™ Fixable Aqua Dead Cell Stain, 405 nm excitation, catalogue number L34957, Thermo Fisher), fixed with 1% paraformaldehyde (ProSciTech, Australia) and permeabilised with 0.3% Saponin (Sigma-Aldrich, USA) containing anti-IFNγ PE-Cy7 (1:250, clone B27, catalogue number 557643, BD Biosciences, USA) and acquired on a LSRII flow cytometer (BD, USA) utilising BD FACSDIVA™ software. The percentage of allo-specific CD8⁺ T cells producing IFNγ was analysed using FlowJo software (Tree Star Inc., USA)³⁶, utilising the gating strategy shown in Supplementary Fig. 10. Sample numbers were dictated by availability of HLA-B*57:01/HLA-B*58:01 PBMC.

Recombinant HLA-peptide complex generation

The HLA-B*57:01, HLA-B*57:03, HLA-B*58:01 and β₂m genes were sub-cloned into the pET-30 expression vector and were expressed into inclusion bodies separately in Escherichia coli. The HLA complexes were refolded in the presence of the peptides listed in Table 1 and purified as described previously⁶³. Briefly, 90 mg HLA heavy chain was refolded by rapid dilution in a solution containing 3 M urea (Sigma-Aldrich, USA), 100 mM Tris-HCl, pH 8.0 (Sigma-Aldrich, USA), 400 mM l-arginine-HCl, 5 mM reduced glutathione (Sigma-Aldrich, USA) and 0.5 mM oxidised glutathione (Sigma-Aldrich, USA) in the presence of 30 mg β₂m and 10 mg of the appropriate peptide for 48 h. The refolded HLA-peptide complexes were dialysed into 10 mM Tris, pH 8.0, and purified by size-exclusion chromatography using HiLoad 16/60 Superdex 200 pg (GE Healthcare, USA) columns on an AKTA Purifier (GE Healthcare, USA) FPLC chromatography systems in 10 mM Tris, pH 8.0, and 150 mM NaCl buffer. Final purification was by anion exchange using a HiTrap Q Fast Flow column (GE Healthcare, USA) on the same AKTA system in 10 mM Tris pH 8.0 buffer with a NaCl gradient from 0 to 500 mM over 45 min.

Thermal melt experiments

Thermal stability assays were performed at 0.5 and 1 mg mL⁻¹ HLA-peptide complex in 10 mM Tris and 150 mM NaCl, pH 8.0 in a reaction volume of 25 μL in duplicate except where otherwise indicated. Protein unfolding was monitored by the addition of the fluorescent dye SYPRO^® Orange (Sigma-Aldrich, USA) at 10× concentration. Refolded complexes were heated from 35 to 90 °C at a heating rate of 1 °C min⁻¹ in the Real Time Detection system (Rotor-Gene^® Q, QIAGEN) and fluorescence intensity was measured using an excitation wavelength of 530 nm and emission at 555 nm.

X-ray crystallography

The peptide sequences crystallised in complex with HLA-B*57:01, HLA-B*57:03 and HLA-B*58:01 are noted in Table 1. HLA-peptide complexes were concentrated to ~10 mg mL⁻¹ and crystallised at 294 K by the hanging-drop vapour-diffusion method from a solution comprising 12–20% PEG 4000, 0.2 M ammonium acetate and 0.1 M tri-sodium citrate pH 5.4–5.6. Prior to data collection, crystals were equilibrated in reservoir solution with 10% glycerol added as a cryoprotectant and then flash-cooled in a stream of liquid nitrogen at 100 K. Data sets were collected at the MX2 beamline (Australian Synchrotron, Victoria). The data were recorded on a Quantum-315 CCD detector and were integrated and scaled using MOSFLM and SCALA from the CCP4 programme suite^64,65,66. Details of the data processing statistics are summarised in Table 2. Phases for the structures were determined by molecular replacement as implemented in PHASER⁶⁷ with HLA-B*57:01-LF9 used as the search model (Protein Data Bank accession number: 2RFX³⁴). Refinement of the models proceeded with iterative rounds of manual building in COOT⁶⁸, refinement in PHENIX⁶⁹ and validation with MOLPROBITY⁶⁹. Refinement statistics are summarised in Table 2.

Code availability

Scripts for co-variation analysis and PCA are available at https://github.com/jlmendozabio/covariation_stats and https://github.com/ParhamLab/PeptidePCA/tree/master/R.

Data availability

Proteomics data sets analysed during this study have been deposited to the ProteomeXchange Consortium via the PRIDE⁷⁰ partner repository with the data set identifiers PXD008570 (C1R.B*57:01 LC-MS/MS), PXD008571 (C1R.B*57:03 LC-MS/MS), PXD008572 (C1R-B*58:01 LC-MS/MS) and PXD009850 (LC-MRM). Coordinates and structure factors were deposited in the PDB with the following codes: B5701-LSSPVTKSW 5VUD; B5701-LTVQVARVW 5VUE; B5701-LTVQVARVY 5VUF; B5703-LSSPVTKSW 5VVP; B5703-LTVQVARVW 5VWD; B5703-LTVQVARVY 5VWF; B5801-LSSPVTKSW 5VWH; and B5801-LTVQVARVW 5VWJ. All other data are available from the corresponding author on reasonable request.

References

Robinson, J. et al. The IPD and IMGT/HLA database: allele variant databases. Nucleic Acids Res. 43, D423–D431 (2015).
Article CAS Google Scholar
Reche, P. A. & Reinherz, E. L. Sequence variability analysis of human class I and class II MHC molecules: functional and structural correlates of amino acid polymorphisms. J. Mol. Biol. 331, 623–641 (2003).
Article CAS Google Scholar
Adams, E. J. & Luoma, A. M. The adaptable major histocompatibility complex (MHC) fold: structure and function of nonclassical and MHC class I–like molecules. Annu. Rev. Immunol. 31, 529–561 (2013).
Article CAS Google Scholar
Madden, D. R. The three-dimensional structure of peptide-MHC complexes. Annu. Rev. Immunol. 13, 587–622 (1995).
Article CAS Google Scholar
Williams, A. P., Peh, C. A., Purcell, A. W., McCluskey, J. & Elliott, T. Optimization of the MHC class I peptide cargo is dependent on tapasin. Immunity 16, 509–520 (2002).
Article CAS Google Scholar
Purcell, A. W. et al. Quantitative and qualitative influences of tapasin on the class I peptide repertoire. J. Immunol. 166, 1016–1027 (2001).
Article CAS Google Scholar
Bowness, P. HLA-B27. Annu. Rev. Immunol. 33, 29–48 (2015).
Article CAS Google Scholar
Schittenhelm, R. B., Sian, T. C., Wilmann, P. G., Dudek, N. L. & Purcell, A. W. Revisiting the arthritogenic peptide theory: quantitative not qualitative changes in the peptide repertoire of HLA-B27 allotypes. Arthritis Rheumatol. 67, 702–713 (2015).
Article CAS Google Scholar
Schittenhelm, R. B., Sivaneswaran, S., Lim Kam Sian, T. C., Croft, N. P. & Purcell, A. W. Human leukocyte antigen (HLA) B27 allotype-specific binding and candidate arthritogenic peptides revealed through heuristic clustering of data-independent acquisition mass spectrometry (DIA-MS) data. Mol. Cell. Proteomics 15, 1867–1876 (2016).
Article CAS Google Scholar
Saag, M. et al. High sensitivity of human leukocyte antigen-B*5701 as a marker for immunologically confirmed abacavir hypersensitivity in white and black patients. Clin. Infect. Dis. 46, 1111–1118 (2008).
Article CAS Google Scholar
Daly, A. K. et al. HLA-B*5701 genotype is a major determinant of drug-induced liver injury due to flucloxacillin. Nat. Genet. 41, 816–819 (2009).
Article CAS Google Scholar
Mallal, S. et al. Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir. Lancet 359, 727–732 (2002).
Article CAS Google Scholar
Hung, S.-I. et al. HLA-B*5801 allele as a genetic marker for severe cutaneous adverse reactions caused by allopurinol. Proc. Natl Acad. Sci. USA 102, 4134–4139 (2005).
Article ADS CAS Google Scholar
Bharadwaj, M. et al. Drug hypersensitivity and human leukocyte antigens of the major histocompatibility complex. Annu. Rev. Pharmacol. Toxicol. 52, 401–431 (2012).
Article CAS Google Scholar
Illing, P. T. et al. Immune self-reactivity triggered by drug-modified HLA-peptide repertoire. Nature 486, 554–558 (2012).
Article ADS CAS Google Scholar
Ostrov, D. A. et al. Drug hypersensitivity caused by alteration of the MHC-presented self-peptide repertoire. Proc. Natl Acad. Sci. USA 109, 9959–9964 (2012).
Article ADS CAS Google Scholar
Tynan, F. E. et al. A T cell receptor flattens a bulged antigenic peptide presented by a major histocompatibility complex class I molecule. Nat. Immunol. 8, 268–276 (2007).
Article CAS Google Scholar
Burrows, J. M. et al. The impact of HLA-B micropolymorphism outside primary peptide anchor pockets on the CTL response to CMV. Eur. J. Immunol. 37, 946–953 (2007).
Article CAS Google Scholar
Kloverpris, H. N. et al. A molecular switch in immunodominant HIV-1-specific CD8 T-cell epitopes shapes differential HLA-restricted escape. Retrovirology 12, 20 (2015).
Article Google Scholar
Park, B., Lee, S., Kim, E. & Ahn, K. A single polymorphic residue within the peptide-binding cleft of MHC class I molecules determines spectrum of tapasin dependence. J. Immunol. 170, 961–968 (2003).
Article CAS Google Scholar
Altfeld, M. et al. Influence of HLA-B57 on clinical presentation and viral control during acute HIV-1 infection. AIDS 17, 2581–2591 (2003).
Article CAS Google Scholar
Kaslow, R. A. et al. Influence of combinations of human major histocompatibility complex genes on the course of HIV-1 infection. Nat. Med. 2, 405–411 (1996).
Article CAS Google Scholar
Kloverpris, H. N. et al. HLA-B*57 Micropolymorphism shapes HLA allele-specific epitope immunogenicity, selection pressure, and HIV immune control. J. Virol. 86, 919–929 (2012).
Article CAS Google Scholar
Yu, X. G. et al. Mutually exclusive T-cell receptor induction and differential susceptibility to human immunodeficiency virus type 1 mutational escape associated with a two-amino-acid difference between HLA class I subtypes. J. Virol. 81, 1619–1631 (2007).
Article CAS Google Scholar
Stewart-Jones, G. B. et al. Structural features underlying T-cell receptor sensitivity to concealed MHC class I micropolymorphisms. Proc. Natl Acad. Sci. USA 109, E3483–E3492 (2012).
Article CAS Google Scholar
Abelin, J. G. et al. Mass spectrometry profiling of HLA-associated peptidomes in mono-allelic cells enables more accurate epitope prediction. Immunity 46, 315–326 (2017).
Article CAS Google Scholar
Zernich, D. et al. Natural HLA class I polymorphism controls the pathway of antigen presentation and susceptibility to viral evasion. J. Exp. Med. 200, 13–24 (2004).
Article CAS Google Scholar
Macdonald, W. A. et al. A naturally selected dimorphism within the HLA-B44 supertype alters class I structure, peptide repertoire, and T cell recognition. J. Exp. Med. 198, 679–691 (2003).
Article CAS Google Scholar
Doxiadis, I. I. N. et al. Association between specific HLA combinations and probability of kidney allograft loss: the taboo concept. Lancet 348, 850–853 (1996).
Article CAS Google Scholar
Hilton, H. G. et al. The intergenic recombinant HLA-B*46:01 has a distinctive peptidome that includes KIR2DL3 ligands. Cell Rep. 19, 1394–1405 (2017).
Article CAS Google Scholar
Colaert, N., Helsens, K., Martens, L., Vandekerckhove, J. & Gevaert, K. Improved visualization of protein consensus sequences by iceLogo. Nat. Methods 6, 786–787 (2009).
Article CAS Google Scholar
Dudek, N. L. et al. Constitutive and inflammatory immunopeptidome of pancreatic β-cells. Diabetes 61, 3018–3025 (2012).
Article CAS Google Scholar
Tan, C. T., Croft, N. P., Dudek, N. L., Williamson, N. A. & Purcell, A. W. Direct quantitation of MHC-bound peptide epitopes by selected reaction monitoring. Proteomics 11, 2336–2340 (2011).
Article CAS Google Scholar
Chessman, D. et al. Human leukocyte antigen class I-restricted activation of CD8 + T cells provides the immunogenetic basis of a systemic drug hypersensitivity. Immunity 28, 822–832 (2008).
Article CAS Google Scholar
Macdonald, W. A. et al. T cell allorecognition via molecular mimicry. Immunity 31, 897–908 (2009).
Article CAS Google Scholar
Mifsud, N. A. et al. Immunodominance hierarchies and gender bias in direct T-CD8-cell alloreactivity. Am. J. Transplant. 8, 121–132 (2008).
Article CAS Google Scholar
Bettens, F., Buhler, S. & Tiercy, J.-M. Allorecognition of HLA-C mismatches by CD8(+) T cells in hematopoietic stem cell transplantation is a complex interplay between mismatched peptide-binding region residues, HLA-C expression, and HLA-DPB1 disparities. Front. Immunol 7, 584 (2016).
Article Google Scholar
Gonzalez-Galarza, F. F., Christmas, S., Middleton, D. & Jones, A. R. Allele frequency net: a database and online repository for immune gene frequencies in worldwide populations. Nucleic Acids Res. 39, D913–D919 (2011).
Article CAS Google Scholar
Gonzalez-Galarza, F. F. et al. Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res. 43, D784–D788 (2015).
Article CAS Google Scholar
Barber, L. D. et al. Polymorphism in the alpha(1) helix of the HLA-B heavy chain can have an overriding influence on peptide-binding specificity. J. Immunol. 158, 1660–1669 (1997).
CAS PubMed Google Scholar
Falk, K. et al. Peptide motifs of HLA-B58, B60, B61 and B62 molecules. Immunogenetics 41, 165–168 (1995).
Article CAS Google Scholar
Rizvi, S. M. et al. Distinct assembly profiles of HLA-B molecules. J. Immunol. 192, 4967–4976 (2014).
Article CAS Google Scholar
Bailey, A. et al. Selector function of MHC I molecules is determined by protein plasticity. Sci. Rep. 5, 14928 (2015).
Article ADS CAS Google Scholar
Tynan, F. E. et al. The immunogenicity of a viral cytotoxic T cell epitope is controlled by its MHC-bound conformation. J. Exp. Med. 202, 1249–1260 (2005).
Article CAS Google Scholar
Tynan, F. E. et al. High resolution structures of highly bulged viral Epitopes bound to major histocompatibility complex class I—implications for T-cell receptor engagement and T-cell immunodominance. J. Biol. Chem. 280, 23900–23909 (2005).
Article CAS Google Scholar
Pymm, P. et al. MHC-I peptides get out of the groove and enable a novel mechanism of HIV-1 escape. Nat. Struct. Mol. Biol. 24, 387–394 (2017).
Article CAS Google Scholar
McMurtrey, C. et al. Toxoplasma gondii peptide ligands open the gate of the HLA class I binding groove. eLife 5, e12556 (2016).
Article Google Scholar
Collins, E. J., Garboczi, D. N. & Wiley, D. C. Three-dimensional structure of a peptide extending from one end of a class I MHC binding site. Nature 371, 626–629 (1994).
Article ADS CAS Google Scholar
Fleischhauer, K., Kernan, N. A., O’Reilly, R. J., Dupont, B. & Yang, S. Y. Bone marrow-allograft rejection by T lymphocytes recognizing a single amino acid difference in HLA-B44. N. Engl. J. Med. 323, 1818–1822 (1990).
Article CAS Google Scholar
Keever, C. A. et al. HLA-B44-directed cytotoxic T cells associated with acute graft-versus-host disease following unrelated bone marrow transplantation. Bone Marrow Transplant. 14, 137–145 (1994).
CAS PubMed Google Scholar
Storkus, W. J., Howell, D. N., Salter, R. D., Dawson, J. R. & Cresswell, P. NK susceptibility varies inversely with target cell class I HLA antigen expression. J. Immunol. 138, 1657–1659 (1987).
CAS PubMed Google Scholar
Zemmour, J., Little, A. M., Schendel, D. J. & Parham, P. The HLA-A,B negative mutant cell line C1R expresses a novel HLA-B35 allele, which also has a point mutation in the translation initiation codon. J. Immunol. 148, 1941–1948 (1992).
CAS PubMed Google Scholar
Degli-Esposti, M. A. et al. Characterization of 4AOHW cell line panel including new data for the 10IHW panel. Hum. Immunol. 38, 3–16 (1993).
Article CAS Google Scholar
Barnstable, C. J. et al. Production of monoclonal antibodies to group A erythrocytes, HLA and other human cell surface antigens-new tools for genetic analysis. Cell 14, 9–20 (1978).
Article CAS Google Scholar
Schittenhelm, R. B., Dudek, N. L., Croft, N. P., Ramarathinam, S. H. & Purcell, A. W. A comprehensive analysis of constitutive naturally processed and presented HLA-C*04:01 (Cw4)-specific peptides. Tissue Antigens 83, 174–179 (2014).
Article CAS Google Scholar
Lockless, S. W. & Ranganathan, R. Evolutionarily conserved pathways of energetic connectivity in protein families. Science 286, 295–299 (1999).
Article CAS Google Scholar
Mendoza, J. L. et al. Requirements for efficient correction of DeltaF508 CFTR revealed by analyses of evolved sequences. Cell 148, 164–174 (2012).
Article CAS Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag, New York, 2009).
Book Google Scholar
Lê, S., Josse, J. & Husson, F. FactoMineR: an R package for multivariate analysis. J. Stat. Softw. 25, 1–18 (2008).
RCoreTeam., R. A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, Austria, 2016).
Google Scholar
Schloerke, B. et al. GGally: Extension to ‘ggplot2’ (The Comprehensive R Archive Network, 2016) https://github.com/ggobi/ggally.
MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 (2010).
Article CAS Google Scholar
Clements, C. S. et al. The production, purification and crystallization of a soluble heterodimeric form of a highly selected T-cell receptor in its unliganded and liganded state. Acta Crystallogr. D Biol. Crystallogr. 58, 2131–2134 (2002).
Article Google Scholar
Collaborative. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D Biol. Crystallogr. 50, 760–763 (1994).
Article Google Scholar
Evans, P. Scaling and assessment of data quality. Acta Crystallogr. D Biol. Crystallogr. 62, 72–82 (2006).
Article Google Scholar
Leslie, A. G. W. Recent changes to the MOSFLM package for processing film and image plate data. Joint CCP4 + ESF-EAMCB Newsletter on Protein Crystallography 26, https://www.ccp4.ac.uk/newsletters/No26.pdf, (1992).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Article CAS Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Article Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
Article CAS Google Scholar
Vizcaino, J. A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 (2016).
Article CAS Google Scholar

Download references

Acknowledgements

Thank you to Tracy Josephs for assistance with protocols for the thermal stability experiments. This work was funded by a NHMRC Project grant (1063829) to A.W.P. and J.P.V., and by NIH grant AI22039 to P.P. A.W.P. is supported by a NHMRC Senior Research Fellowship (1044215). P.T.I. was supported by a NHMRC Early Career Fellowship (1072159). J.R. is supported by an Australian Research Council Australian Laureate Fellowship. J.L.M. is supported by NIH award K01CA175127. J.M. was supported by a NHMRC Project Grant (1120467) and Program Grant (1113293).

Author information

These authors contributed equally: Patricia T. Illing, Phillip Pymm.
These authors jointly supervised this work: Julian P. Vivian, Anthony W. Purcell.

Authors and Affiliations

Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Monash Biomedicine Discovery Institute, Monash University, Clayton, VIC, 3800, Australia
Patricia T. Illing, Phillip Pymm, Nathan P. Croft, Nicole A. Mifsud, Nadine L. Dudek, Jamie Rossjohn, Julian P. Vivian & Anthony W. Purcell
Australian Research Council Centre of Excellence for Advanced Molecular Imaging, Monash University, Clayton, VIC, 3800, Australia
Phillip Pymm, Jamie Rossjohn & Julian P. Vivian
Departments of Structural Biology and Microbiology & Immunology, School of Medicine, Stanford University, Stanford, 94305, CA, USA
Hugo G. Hilton & Peter Parham
Calico Life Sciences LLC, South San Francisco, 94080, CA, USA
Hugo G. Hilton & Vladimir Jojic
Department of Genetics, School of Medicine, Stanford University, Stanford, 94305, CA, USA
Alex S. Han
Department of Molecular and Cellular Physiology, School of Medicine, Stanford University, Stanford, 94305, CA, USA
Juan L. Mendoza
Institute for Molecular Engineering and Department of Biochemistry & Molecular Biology, University of Chicago, Chicago, 60637, IL, USA
Juan L. Mendoza
Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, University of Melbourne, Parkville, VIC, 3010, Australia
James McCluskey
Institute of Infection and Immunity, Cardiff University School of Medicine, Heath Park, Cardiff, CF14 4XN, UK
Jamie Rossjohn

Authors

Patricia T. Illing
View author publications
You can also search for this author in PubMed Google Scholar
Phillip Pymm
View author publications
You can also search for this author in PubMed Google Scholar
Nathan P. Croft
View author publications
You can also search for this author in PubMed Google Scholar
Hugo G. Hilton
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir Jojic
View author publications
You can also search for this author in PubMed Google Scholar
Alex S. Han
View author publications
You can also search for this author in PubMed Google Scholar
Juan L. Mendoza
View author publications
You can also search for this author in PubMed Google Scholar
Nicole A. Mifsud
View author publications
You can also search for this author in PubMed Google Scholar
Nadine L. Dudek
View author publications
You can also search for this author in PubMed Google Scholar
James McCluskey
View author publications
You can also search for this author in PubMed Google Scholar
Peter Parham
View author publications
You can also search for this author in PubMed Google Scholar
Jamie Rossjohn
View author publications
You can also search for this author in PubMed Google Scholar
Julian P. Vivian
View author publications
You can also search for this author in PubMed Google Scholar
Anthony W. Purcell
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.T.I., A.W.P. and J.P.V. contributed to study design, data collection, data analysis and writing of the manuscript. Ph.P., N.A.M., N.P.C., H.G.H., V.J., A.S.H., J.L.M., N.L.D., P.P., J.M. and J.R. contributed to data collection, data analysis and writing of the manuscript.

Corresponding authors

Correspondence to Julian P. Vivian or Anthony W. Purcell.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Illing, P.T., Pymm, P., Croft, N.P. et al. HLA-B57 micropolymorphism defines the sequence and conformational breadth of the immunopeptidome. Nat Commun 9, 4693 (2018). https://doi.org/10.1038/s41467-018-07109-w

Download citation

Received: 08 August 2017
Accepted: 12 October 2018
Published: 08 November 2018
DOI: https://doi.org/10.1038/s41467-018-07109-w

This article is cited by

Magnesium deficiency and its interaction with the musculoskeletal system, exercise, and connective tissue: an evidence synthesis
- Maria V. Sankova
- Vladimir N. Nikolenko
- Yury O. Zharikov
Sport Sciences for Health (2024)
A transformer-based model to predict peptide–HLA class I binding and optimize mutated peptides for vaccine design
- Yanyi Chu
- Yan Zhang
- Dong-Qing Wei
Nature Machine Intelligence (2022)
Antigen presentation in cancer: insights into tumour immunogenicity and immune evasion
- Suchit Jhunjhunwala
- Christian Hammer
- Lélia Delamarre
Nature Reviews Cancer (2021)
Thermostability profiling of MHC-bound peptides: a new dimension in immunopeptidomics and aid for immunotherapy design
- Emma C. Jappe
- Christian Garde
- Anthony W. Purcell
Nature Communications (2020)
Genetic T-cell receptor diversity at 1 year following allogeneic hematopoietic stem cell transplantation
- Stéphane Buhler
- Florence Bettens
- Jean Villard
Leukemia (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.