SARS-CoV-2 antibodies recognize 23 distinct epitopic sites on the receptor binding domain

Jiang, Jiansheng; Boughter, Christopher T.; Ahmad, Javeed; Natarajan, Kannan; Boyd, Lisa F.; Meier-Schellersheim, Martin; Margulies, David H.

doi:10.1038/s42003-023-05332-w

Article
Open access
Published: 19 September 2023

Communications Biology volume 6, Article number: 953 (2023)

Abstract

The COVID-19 pandemic and SARS-CoV-2 variants have dramatically illustrated the need for a better understanding of antigen (epitope)-antibody (paratope) interactions. To gain insight into the immunogenic characteristics of epitopic sites (ES), we systematically investigated the structures of 340 Abs and 83 nanobodies (Nbs) complexed with the Receptor Binding Domain (RBD) of the SARS-CoV-2 spike protein. We identified 23 distinct ES on the RBD surface and determined the frequencies of amino acid usage in the corresponding CDR paratopes. We describe a clustering method for analysis of ES similarities that reveals binding motifs of the paratopes and that provides insights for vaccine design and therapies for SARS-CoV-2, as well as a broader understanding of the structural basis of Ab-protein antigen (Ag) interactions.

Introduction

Our ability to predict protein interactions is still very limited despite great progress in the application of computational methods for determining protein structures from amino acid sequence alone^1,2. This limitation is even more evident with regard to the interactions among highly variable immune receptor surfaces as dictated by Ab complementarity determining region (CDR) loops and the antigenic structures they bind. Accordingly, efforts directed toward providing systematic analyses or rational design strategies for Ab-Ag interactions need to incorporate experimentally determined structural data on specific Abs. Recent efforts in Ab design take advantage of segmental approaches³ or extensive computational resources^4,5. Such hindrances emphasize the importance of incorporating as much information on naturally occurring specific Ab-Ag structures as possible. Here, we report a systematic structural analysis, taking advantage of the thousands of structures of SARS-CoV-2-derived proteins, including spike and various Ab complexes that have been determined to further our understanding of the fundamental mechanisms of the pathogenesis and neutralization of SARS-CoV-2 in the context of the human immune system. Many Abs have been reported to have potent neutralizing activity, preventing spike interaction with the cellular receptor, angiotensin converting enzyme (ACE) 2. Several Abs have been developed as therapeutics and have variable efficacy against variants of concern (VOC). Our analysis of available structures may aid in understanding which Abs may be of value for emerging variants and contribute to evolving strategies for prophylaxis, treatment, and immunization.

Ab-protein antigen (Ab-Ag) interfaces have been a focus of immunologists and protein chemists for more than 80 years⁶, not only because of the important role of Abs in defense against infection⁷, but also due to the general interest in understanding protein-protein interactions⁸. High resolution structural analysis of protein-protein complexes, based initially on X-ray crystallography and more recently on cryogenic electron microscopy (cryo-EM), provides an objective basis for understanding not only the biophysical principles that determine affinity and specificity, but also for elucidating biological and evolutionary rules that govern immunological molecular recognition of foreign molecules and pathogens^9,10. With an ever-expanding database of detailed Ab-Ag structures, great attention has been directed to the characterization of such molecular interfaces, particularly as an understanding of the rules of engagement might permit rationalization of the reactivity of existing Abs, the design of Abs with new binding activities, and strategies for design of immunogens that might elicit more broadly neutralizing Abs^11,12,13.

The widespread infectivity, variance, and molecular characterization of the SARS-CoV-2 virus have provided a wealth of information concerning the functional and structural biology of the immune response. At the beginning of the SARS-CoV-2 pandemic, many laboratories accomplished detailed structural characterization of anti-RBD Abs and nanobodies (Nbs, single domain antibodies), leading to a classification of Abs based on the location of their footprints on the RBD surface. Initially, four classes of Ab were categorized, based on the orientation of the RBD bound and whether the Ab blocks infectivity or binding to the cellular receptor, ACE2¹⁴ (Supplementary Table 1). A receptor binding motif (RBM) has been defined as those RBD residues that specifically interact with ACE2¹⁵. Binding analysis of Nbs and human mAbs derived from patients along with a limited number of protein structures assigned five surface regions of the RBD reflecting its antigenic anatomy¹⁶. Epitopic analysis was further extended by the definition of seven “communities” of Abs that bind to the RBD surface¹⁷. Recent analysis of anti-RBD Ab and Nb as well as molecular dynamics analysis in the context of evolving escape mutations has taken advantage of these earlier classification schemes^{18,19,20,21,22,23}. Others have analyzed a number of anti-spike Nb in terms of their affinity and neutralization capacity²⁴. The functional classification of RBD epitopes, i.e. those that block infectivity of SARS-CoV-2, is valuable in identifying Abs likely to be of immediate therapeutic benefit during a rapidly spreading pandemic. The structure-based, function-agnostic, approach described here captures a broader set of RBD epitopes and is aimed primarily towards understanding the physico-chemical basis of epitope-paratope interactions. Such an understanding can enable predictions of antibody reactivities of new RBD variants based solely on RBD amino acid sequences.

Although these classification schemes have been valuable and adopted widely in the analysis of Abs as to how they bind to RBD and spike, particular Abs and Nbs may not be unambiguously classified (Supplementary Fig. 1). The previous summaries were based on a relatively small number of available structures and focused on the relative superposition of the Abs in the complexes, rather than on a comparison of the epitopic contacts of the RBD surface. In particular, the original distinction between Class 1 and Class 2 seemed clear based on the initial structures. However, as more structural models became available, apparent inconsistencies arose. For example, Ahmad et al.²⁵ determined that synthetic Nbs Sb16 and Sb45 contacted both Class 1 and Class 2 epitopic surfaces and approached the RBD from different angles. As more structures of Ab and Nb complexes are determined, it is apparent that an expansion of the initial classification scheme is warranted.

In this work, we focus on complexes of Abs and Nbs bound to the RBD of the spike protein to generate a comprehensive structural framework to further our understanding of Ab- and Nb-RBD recognition. Using a large database, we offer a structure-based classification exploiting quantitatively defined contacting amino acid residues on the RBD as well as a clustering analysis. These analyses reveal common characteristics of some 23 frequently contacted ES and the structural nature of the surfaces of the RBD that interact with Ab/Nb. We also systematically analyze the molecular features that define these antibodies and, by applying a rigorous evaluation of the surface features of the RBD that are seen by Abs and Nbs, generate general insights into the fundamental nature of Ab-Ag recognition. This analysis should facilitate the characterization of new anti-RBD antibodies as they arise.

Results

Identification of epitopic sites (ES)

To identify common features of ES of the RBD, we systematically investigated structures of Abs (as Fabs and Fvs, Ab fragments that confer antigen binding activity) and of Nbs (as VHH or synthetic library-derived sybodies) in complex with the spike protein or its RBD as collected in the CovAbDab²⁶ and the protein data bank (PDB)^27,28. Abs and Nbs that bind the SARS-CoV-2 RBD are summarized in Table 1. As of 12/22/2022, a total of 6746 Ab and 620 Nb sequences have been collected in the CovAbDab. Of the Abs, 6321 are human, including those from vaccinees, and 390 derive from humanized mouse or phage display Ab libraries. For Nbs, 620 sequences derive from camelids (alpaca/camel/llama), of which 276 are from camelid-derived phage display libraries, some naïve, some immunized. Among these sequences, structural coordinates for only ~5% of the Abs and ~10% of Nbs were available in the PDB, and we compiled a non-redundant list of 340 Ab and of 83 Nb X-ray or cryo-EM structures (Supplementary Data 1 & 2) which serve as the basis of our structural analysis.

Table 1 Summary of sequences and structures of anti-SARS-CoV-2 antibodies and nanobodies.

Evaluation of the biophysical properties that contribute to protein-protein interactions may be based on different criteria, including calculation of free energy terms of interacting residues²⁹, measurement of shape complementarity (Sc³⁰), and calculation of buried or accessible surface area^{31,32,33,34,35}. We elected to simplify this analysis first by calculating interatomic contacts between Ab (paratopic) and Ag (epitopic) residues at the interface because the biophysical basis of binding (due to charge, hydrophobicity, hydrogen bonding and van der Waals interactions) is reflected in such contacts. For hydrogen bond interactions, the distance cut-off is usually about 3.8–4.0 angstroms (Å), but for non-bonded or van der Waals interactions it may be as large as 5–8 Å. Generally, distances within 4–6 Å are considered indicative of direct contacts between interacting proteins. Computational approaches may use cut-offs that range from 5 to 10 Å to account for the dynamic nature of proteins. Vangone and Bonvin³⁶ studied the correlation between contact distance and binding affinity, and found that the approximate distance range is 4.0 to 5.5 Å. Viloria et al.³⁷ determined an optimal distance cut-off of 5.0 Å for contact-based protein networks. Krawczyk et al.³⁸ used 4.5 Å to define epitope contacts. We adopted a contact distance cut-off of 5.0 Å for our Ab-Ag interactions, based on comparison of different cut-offs from 4.0 to 5.5 Å (see Methods). We plotted the numbers of Ab (paratope) contacts as hits versus the residue number of the RBD (epitope) for the Ab heavy (H) (Fig. 1a) and light (L) (Supplementary Fig. 2a) chains individually, and also overall for both H and L chains together (Supplementary Fig. 2b). We also plotted the number of hits of the 83 Nbs to each RBD residue (Fig. 1c). For 340 Abs, H chains contribute 5623 contacts and L chains 3107 (Supplementary Table 2). By comparison, for 83 Nbs, 1836 contacts are observed. 
Thus, the number of contacts is ~25 per Ab and ~22 per Nb. Although the RBD residues bound by either Ab H chains or Nbs are, by and large, the same, the relative distribution of hits varies for several regions. In particular, the region from RBD residue 368 to 386 is recognized more frequently by Nbs, while other contiguous surfaces are seen equivalently (Fig. 1a, c). The numbers of hits for Ab H chains are represented graphically as a heat map on the RBD surface in Fig. 1b, and the heat maps for the Nbs are shown in Fig. 1d.
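
The contact tabulation described above can be illustrated with a minimal sketch. It assumes that a paratope residue and an epitope residue are counted as one contact when any pair of their atoms lies within the 5.0 Å cut-off; the exact counting convention of the authors' tools is given in their Methods and may differ. The function names, residue labels, and coordinates are hypothetical toy values, not data from any PDB entry.

```python
import math
from collections import Counter

CUTOFF = 5.0  # Å, the contact distance cut-off adopted in the text


def residue_contacts(paratope_atoms, epitope_atoms, cutoff=CUTOFF):
    """Return the set of (paratope residue, epitope residue) pairs in contact.

    Each atom is a tuple (residue_label, (x, y, z)). Two residues are taken
    to be in contact if any pair of their atoms is within `cutoff`
    (an assumption made for this sketch).
    """
    pairs = set()
    for p_res, p_xyz in paratope_atoms:
        for e_res, e_xyz in epitope_atoms:
            if math.dist(p_xyz, e_xyz) <= cutoff:
                pairs.add((p_res, e_res))
    return pairs


def hits_per_epitope_residue(pairs):
    """Count how many paratope residues contact each epitope residue."""
    return Counter(e_res for _, e_res in pairs)


# Toy coordinates (hypothetical), in Å.
paratope = [("H:Y33", (0.0, 0.0, 0.0)), ("H:Y33", (1.5, 0.0, 0.0)),
            ("H:S56", (10.0, 0.0, 0.0))]
epitope = [(417, (3.0, 0.0, 0.0)), (417, (4.0, 0.0, 0.0)),
           (455, (13.0, 0.0, 0.0)), (486, (30.0, 0.0, 0.0))]

pairs = residue_contacts(paratope, epitope)
hits = hits_per_epitope_residue(pairs)
```

Summing such hits over all complexes and plotting them against RBD residue number would give a profile of the kind shown in Fig. 1a and 1c. The nested loop is quadratic in the number of atoms, which is adequate for a single interface but would normally be replaced by a spatial index for large datasets.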

**Fig. 1: Number of contacts to RBD by Abs and Nbs.**

Several contiguous stretches of amino acids of the RBD that make Ab contact were apparent, although the frequency of hits varied considerably for different regions on the surface of the RBD. A fine-grained tabulation of regions of the RBD, each consisting of three to nine residues, defines the individual ES as shown in Table 2. Each of these ES may be assigned to one of the four major classes identified earlier or to the RBM recognized by the ACE2 receptor (Table 3). These regions include distinct secondary structural features such as strands, loops, turns, and helices (Supplementary Movie 1), and represent contacts seen by few (<0.3%) to many (>10%) Abs. Consideration of the secondary structural features (loops, turns, or short β strands) and the accessible surface area prompts the identification of 23 distinct contiguous sites, including regions encompassing residues 404 to 421 that had been overlooked in previous studies. The hit numbers are not evenly distributed over the RBD surface, and it is difficult to distinguish which binding sites belong to the previously defined Class 1 or Class 2 due to overlaps generated by the reduction of the three-dimensional surface to a two-dimensional plot. Figure 2a, b displays these ES on the RBD surface with the ES numbers for Abs (magenta) and Nbs (blue) respectively. Greater thickness of the putty cartoon indicates greater hit numbers. The computed accessible surface area (ASA) (see Methods) for each individual ES (Table 2) ranged from ~100 Å² to more than 500 Å². The total buried surface area (BSA) was also computed for each of 340 Abs (for H chain, L chain and H plus L chain) and for 83 Nbs as in Supplementary Data 1 (for Abs) and 2 (for Nbs). The values of BSA range from 64 Å² to 1112 Å² for Ab H, from 0 Å² to 912 Å² for Ab L, and from 264 Å² to 1824 Å² for H plus L of the 340 Abs. BSA for the 83 Nbs ranges from 437 Å² to 1412 Å².
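
As a rough illustration of how per-residue hits can be grouped into contiguous sites and expressed as a share of all contacts, consider the sketch below. The site names and residue ranges are placeholders chosen for the example; they are not the ES definitions of Table 2, and the hit counts are invented toy numbers.

```python
# Placeholder site definitions: names and residue ranges are hypothetical
# and do NOT reproduce the ES definitions of Table 2.
SITES = {
    "site_A": range(368, 375),  # residues 368-374
    "site_B": range(404, 411),  # residues 404-410
}


def bin_hits(hits_by_residue, sites=SITES):
    """Sum per-residue hit counts into each contiguous site."""
    totals = {name: 0 for name in sites}
    for residue, n_hits in hits_by_residue.items():
        for name, residues in sites.items():
            if residue in residues:
                totals[name] += n_hits
    return totals


def percent_share(totals, total_hits):
    """Express each site's hits as a percentage of all hits on the antigen."""
    if total_hits == 0:
        return {name: 0.0 for name in totals}
    return {name: 100.0 * n / total_hits for name, n in totals.items()}


# Toy hit counts keyed by RBD residue number (hypothetical values).
hits_by_residue = {370: 6, 372: 4, 405: 10, 500: 20}
totals = bin_hits(hits_by_residue)
share = percent_share(totals, sum(hits_by_residue.values()))
```

Residue 500 falls outside both placeholder sites, so it contributes to the denominator but to neither bin; whether hits outside any ES should count toward the total is a choice this sketch makes, not one stated in the text.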

Table 2 Definitions of epitopic sites (ES) seen by Abs and Nbs.

Table 3 Correlation of ES with class definitions.

**Fig. 2: Distribution of Abs and Nbs on RBD surface.**

As an indication of the relative immunogenicity of each of the 23 ES, we tabulated the proportion of Abs and Nbs that recognized each site (Fig. 2c). Approximately 7 to 11% of Ab H chains recognized ES11, 13, 16, 18, and 20, which represent ES contained within the previously defined Class 1 and Class 2 regions. In general, Nb recognition of specific ES was similar to that of Ab H chains, with the predominant recognition representing from 7 to about 10% of Nbs (see Table 2 and Fig. 2c) and falling within Class 2 and Class 4. Notable differences in the predominant ES recognized by Abs and Nbs are that ES8, 13, 16, and 18 are more frequently seen by Abs while ES4, 5, 6, 7, 11, and 20 are more frequently identified by Nbs. For example, ES16 was recognized by 10% of Abs and by 0.16% of Nbs. This difference may arise because ES16 forms a solvent-exposed convex structure which may not be conducive to recognition by Nbs. By contrast, ES4, 5, and 6 form a contiguous patch, recognized more frequently by Nbs, that is not exposed to solvent in the complete spike when the RBD is in the down position. Thus, Nbs may be better able to access such hidden surfaces, perhaps because of their relatively small size (12 kD compared to ~25 or 50 kD for Fv and Fab respectively or ~150 kD for complete bivalent IgG, with corresponding three-dimensional volumes)³⁹. Alternatively, since many Nbs were identified based on binding to isolated RBD, some epitopes identified from such screens may be partially hidden in the complete spike protein. In comparing L chains with H chains, as shown in Fig. 2d, L chains generally contribute less to these ES. Nevertheless, L chains seem to preferentially contact ES7, 20 and 21. We note that some ES (e.g. ES7, 8, 9, and 23) could not be placed into the previous classification schemes and some sites overlap Class 1 and Class 2 (i.e. ES12, 19, and 20). However, most of the 23 ES may be viewed within the four classes described by Barnes et al. (Table 3)¹⁴. 
In addition, the RBM of the RBD¹⁵ may be defined in terms of the ES that overlap the ACE2-RBD interface (i.e. ES8, 11, 12, 13, 16, 18, 19, 20, 21, and 22 (Table 3)). With these 23 fine-grained ES, we extend the prior classification for Class 1 to now include ES8 and 9 (Table 3). Each ES surface area or footprint is illustrated by a color map of the RBD surface (Fig. 2e, Supplementary Movie 2). The sum of these 23 ES covers as much as 70% of the total accessible surface area (ASA) of the isolated RBD, illustrating the breadth of the human antibody response to RBD.

Analysis of CDR loop contributions and epitope-paratope interactions

The CDRs in the hypervariable region of Abs play critical roles in recognizing antigens^9,40,41, and their variability in sequence and length facilitates interaction with distinct antigenic epitopes⁴². We tabulated the number of contacts for each CDR loop or non-CDR residues of 340 H chains and L chains and 83 Nbs to each of the 23 ES. The contact percentages are summarized in Fig. 3a, b, c respectively. The corresponding statistics are listed in Supplementary Table 2a, b, c. For Ab H chains (Fig. 3a), CDR loops account for 82% of the contacts to ES (CDR1 = 16%, CDR2 = 21%, CDR3 = 45%), while only 18% of the contacts are from non-CDR residues. Interestingly, CDR1 of H chains plays a major role in binding to ES16. For Ab L chains (Fig. 3b), CDR1 loops play a major role (40%) in binding to RBD while CDR3 loops represent only 25% of the contacts. One explanation for the reduced role of the CDR3 loop of L chains might be that their average length (10 aa for 340 Abs) is generally shorter than that of H chain CDR3 (15 aa for 340 Abs), see Fig. 3d. For Nbs (Fig. 3c), CDRs represent 73% (CDR1 = 13%, CDR2 = 14%, CDR3 = 46%) of the contacts to the RBD surface, while 27% involve non-CDR residues. The average length of Nb CDR3 is 16 aa. Thus, for both Ab H chains and Nbs, CDR3 contributes the greater proportion of those residues that interact with the RBD, reflecting a major role for CDR3 in RBD recognition. (Illustrations of CDR1, CDR2, and CDR3 contacts are shown in Fig. 5c).

**Fig. 3: Distribution of CDR loops of contacts to RBD surface over ES.**

We plotted the frequency of particular amino acids used by Abs and Nbs (paratopic residues) that interact with particular ES of the RBD for Ab H chains (Fig. 4a) and for Nbs (Fig. 4b). These are shown as heat maps. The residues listed on the top of the panel represent the most frequently contacting amino acids for the specific ES. The frequency of usage of each amino acid for Abs (pink) and Nbs (blue) is compared in Fig. 4c. Tyrosine (Y), serine (S), and arginine (R) are the three amino acids most preferred for binding any ES of RBD (Fig. 4c). Previous analyses of paratopic preferences for a wide range of Abs recognized a high frequency of tyrosine usage⁴³. We also observed that tryptophan is more frequently used in Nbs as compared with Abs (Fig. 4c). The usage of CDR3, CDR2 and CDR1 amino acids is plotted in Fig. 4d, e, f respectively. To illustrate the predominance of particular paratopic residues of the Ab H chains that contact specific ES, we also grouped these as WebLogo plots⁴⁴ (Supplementary Fig. 3).
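
The amino acid usage tabulation can be sketched as a simple frequency count over the paratope residues that contact a given ES. This is only an illustration of the idea behind Fig. 4; the input list is a hypothetical toy sample, and the function name is not taken from the authors' software.

```python
from collections import Counter


def residue_usage(paratope_residues):
    """Return (amino acid, fraction) pairs, most frequent first.

    `paratope_residues` is a list of one-letter amino acid codes, one entry
    per contacting paratope residue.
    """
    counts = Counter(paratope_residues)
    total = sum(counts.values())
    if total == 0:
        return []
    return [(aa, n / total) for aa, n in counts.most_common()]


# Toy sample of contacting paratope residues (hypothetical).
sample = ["Y", "S", "Y", "R", "G", "Y", "S", "W"]
usage = residue_usage(sample)
```

Running the same count separately for each ES and each CDR would give the rows of a heat map of the kind described above.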

**Fig. 4: Distribution of amino acids of Abs/Nbs over ES.**

Cluster analysis of epitopic sites and binding motifs

Having identified the sets of ES bound by each Ab and Nb (see Supplementary Data 1, 2), we then grouped the Abs and Nbs by computation of the similarity of the ES recognized (see Methods). Similarity of a pair of ES sets is a value between 0 and 1 reflecting recognition of completely different (0) or identical (1) sites. This clustering method compares ES sets on the RBD without visualization of graphic models. Assigning a similarity threshold of 0.85 (see Methods) results in the identification of 33 clusters for Abs, designated A1 to A33 (Supplementary Table 3a) and 10 clusters for Nbs, N1 to N10 (Supplementary Table 3b). Although Abs within a single cluster bind the same subset of ES, they may or may not approach the RBD from the same angle or utilize CDRs of the same length or composition. These differences are illustrated in Fig. 5a for clusters A1, A3, and A11 for H chains and in Fig. 5b for clusters N1, N3, and N4 for Nbs. The members of nanobody cluster N4 reveal a similar orientation because they have the same conformation and length of CDR loops. Abs or Nbs within the same cluster recognize the same contiguous RBD surface and are expected to compete sterically.
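
To make the idea of threshold-based grouping of ES sets concrete, the sketch below uses the Jaccard index as the similarity measure and a simple greedy assignment to clusters. Both choices are assumptions made for illustration: the similarity score and clustering procedure actually used are defined in the Methods and may differ. The binder names are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two ES sets: 0 for disjoint, 1 for identical."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def greedy_clusters(es_sets, threshold=0.85):
    """Group binders whose ES set is similar to a cluster's first member.

    `es_sets` maps a binder name to its set of ES numbers. Each binder joins
    the first existing cluster whose representative (its first member) has
    similarity >= threshold; otherwise it starts a new cluster.
    """
    clusters = []  # list of (representative ES set, member names)
    for name, es in es_sets.items():
        for representative, members in clusters:
            if jaccard(representative, es) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((frozenset(es), [name]))
    return [members for _, members in clusters]


# Hypothetical binders with toy ES sets.
es_sets = {
    "ab1": {8, 9, 12, 13, 16, 18, 19},
    "ab2": {8, 9, 12, 13, 16, 18, 19},
    "ab3": {8, 9, 13, 16, 18, 19},   # Jaccard with ab1 = 6/7, about 0.857
    "ab4": {2, 10, 11, 12},
}
clusters = greedy_clusters(es_sets)
```

With this measure, raising the threshold to 0.9 would separate `ab3` from `ab1` and `ab2`, which mirrors how a more stringent cut-off yields smaller, more homogeneous groups. A greedy scheme depends on input order; hierarchical clustering would be a more robust alternative.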

**Fig. 5: Identity of the similarity of the ES and clustering of Abs/Nbs (see Supplementary Table 3).**

CDR loops contain sequence motifs for epitope recognition^45,46,47,48. To identify such motifs we analyzed a subset of interfaces from cluster A1, designated A1S1, that recognized ES with a similarity of ≥0.9. A1S1 consists of 28 members (cluster A1 has 56 members of similarity ≥0.85). All the members of A1S1 recognize the same ES set (ES8, 9, 12, 13, 16, 18, and 19) (Fig. 5c), utilize the same CDR loops, and superpose well. Analysis of the residues of CDR1, 2, and 3 that contact the RBD indicated those residues that are preferentially utilized by this stringently selected cluster of Abs. For the binding motifs of CDR1, 2, and 3 of A1S1, the favored residues are summarized in a WebLogo plot (Fig. 5c). Remarkably, Y, S, G, and T predominate for all CDR except CDR3 which exploits R in most instances. Thus, application of a more stringent ES similarity score helps to identify the preferred binding motif utilized by the Ab of the same subgroup. This stringent grouping of Abs and Nbs, based on high similarity score of their respective ES, may prove a useful adjunct in structure prediction based on amino acid sequence and antibody competition. Previous work identified the over-represented public class of mAbs encoded by IGHV3-53 and IGHV3-66 that neutralize the spike^45,49,50. We also investigated the V(D)J gene combinations representing those mAb structures (Supplementary Fig. 6a). Among 6316 mAb sequences in the CovAbDab, the top three IGHV genes are 3–30, 1–69 and 3–53, and IGHJ genes are 4, 6 and 3. However, the top IGHV genes for the structural representatives are 3–53 and 3–58 and IGHJ gene 4, 6, and 3 combined (see blue heat map). A large cluster based on the gene combination similarity, GA1 (IGHV3-53/IGHJ6), as shown in Supplementary Fig. 6b, has an ES set of (8,9,13,16,18,19) which is related to the cluster of A1S1 (Fig. 5c). However, GA1 is a subset of the cluster A1S1 (17 vs 28 members).

To extend the utility of our ES definitions, we set out to determine broad biophysical trends common among the Abs that cluster to each ES region. Using the automated immune molecule separator (AIMS) software⁵¹, a tool which characterizes immune molecules without structural knowledge, we analyzed similar SARS-CoV-2-specific Abs. With this we identified 11 clusters which are designated as AIMS1, AIMS2, etc (Fig. 5d). Not all Abs in a single AIMS cluster bind the same ES. However, AIMS6 and AIMS7 overlap as subsets of cluster A1 and have a similarity score of 0.85.

Relation of ES and SARS-CoV-2 escape mutations

SARS-CoV-2 variants have evolved rapidly through Alpha, Beta, Delta, and Omicron, with multiple mutations and deletions. The development of the latest Omicron subvariants can be traced from BA.1, BA.1.1, BA.2, BA.3, and BA.4/5 through XBB.1 to XBB.1.5, and they incorporate as many as 30 mutations and deletions in their RBDs^52,53,54. Table 4 lists the mutations in these variants and the ES to which they map. Subvariants marked “X” have different substitutions at a given position. Table 5 lists the major Omicron subvariants and their associated ES. For example, XBB.1.5 has substitutions of P and S for V445 and G446, respectively, which are contained in ES11, and substitutions of S and Q for F490 and R493, respectively, which are in ES19. Similarly, XBB.4 preserves the same substitutions, but also substitutes R for L452 in ES12. Figure 6a–d illustrates the locations of these mutations on the RBD surface for Omicron, with the mutation sites matched to one or more of the 23 ES. Strikingly, Omicron escape mutations are distributed throughout several distinct ES of the RBD (Table 4, Fig. 6a–d), posing a formidable challenge in the design of new vaccines and therapeutic antibodies. Notably, mutations in ES3, 6, 9, 14, 15, and 23 have not yet been reported. These ES are illustrated in Fig. 6e.

Table 4 Relation of ES to SARS-CoV-2 escape mutations.

Table 5 Latest mutations in the major (PANGO) lineage of subvariants of Omicron and corresponding ES site.

**Fig. 6: Illustration of location of variant mutations and associated ES on RBD surface.**

Our comprehensive analysis of RBD epitopes and their corresponding Ab paratopes offers the possibility of identifying currently approved SARS-CoV-2 therapeutic Abs that may be used to neutralize emerging SARS-CoV-2 variants and Omicron subvariants. The latest reported structures^47,55,56,57 describe some Abs that bind these subvariants. We can identify a number of Abs or Nbs that target particular ES sets that are either mutated or preserved in emerging variants. Those Abs/Nbs exhibiting multiple contacts to contiguous ES sites with concomitantly large buried surface area and high binding affinity deserve the greatest attention. Thus, using Ab/Nb structures already determined that target particular ES, we can model the effects of the variant mutations on antibody recognition.

Two examples illustrate this approach. The first is the R346T RBD mutation in the subvariants BA.4, BA.5, BF.7 and XBB.1.5, which lies within ES2 (Table 2, Table 4, Fig. 6d); those Abs that recognize ES2 may be further evaluated for their ability to bind the mutants that harbor the R→T substitution. Supplementary Table 4a lists a number of Abs and Nbs of known structure that interact with ES2, and Supplementary Fig. 4a shows an analysis of several Abs that may potentially resist the escape mutation. Specifically, the emergency use authorized (EUA) mAb S309 (sotrovimab; one of three Fabs modeled in PDB 7JX3) may have neutralizing potency against the BA.1.1.529, BA.1, and BA.2.75 subvariants when combined with other antibodies^58,59. The second is the F486 mutation found in XBB.1 (F486S) and XBB.1.5 (F486P), which is located in ES18 and 19 (F490 & R493). We identified a number of Abs and Nbs (Supplementary Table 4b) that have multiple contacts with ES17, 18, and 19, such as COVOX-45, which preserves contacts to P486 from the main chain of its CDR3 loop. Also, the nanobody Nb-2-67 makes multiple hydrogen bonds to maintain contact with ES18 (Supplementary Fig. 4b).

Our analysis of ES recognized by Abs and Nbs and the identification of specific ES affected by mutations in VOC provide an explanation for the ineffectiveness of some Abs that have been tested therapeutically. One example, Evusheld™, which consists of two Abs, tixagevimab (AZD 8895) and cilgavimab (AZD 1061), illustrates this point. These Abs have been studied by X-ray crystallography (tixagevimab, PDB 7L7D; both tixagevimab and cilgavimab, PDB 7L7E⁶⁰) and by cryo-EM⁶¹. By our analysis, tixagevimab interacts with ES13, 16, 18, 19, and 20 and cilgavimab with ES2, 10, 11, and 12. As shown in Table 4, residues in every one of these ES are mutated in the Omicron variant. This then explains the lack of beneficial effect of Evusheld™ and supports a molecular basis for the recent revision of its EUA by the FDA (https://www.fda.gov/drugs/drug-safety-and-availability/fda-announces-evusheld-not-currently-authorized-emergency-use-us). In particular, Omicron variant XBB.1.5 (see Table 3b) harbors mutations that reduce efficient recognition by the Ab products (bamlanivimab plus etesevimab, casirivimab plus imdevimab, sotrovimab, and bebtelovimab), which are no longer authorized for use in the United States (https://www.covid19treatmentguidelines.nih.gov/therapies/antivirals-including-antibody-products/anti-sars-cov-2-monoclonal-antibodies/#:~:text=Four%20anti%2DSARS%2DCoV%2D,mild%20to%20moderate%20COVID%2D19).
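
The reasoning in this section, matching mutated residues to ES and then to the antibodies that contact those ES, can be sketched as a set intersection. The residue-to-ES assignments below are limited to those stated unambiguously in the text (R346 in ES2, V445 and G446 in ES11, L452 in ES12, F490 and R493 in ES19); they are a partial map, not a reproduction of Table 4, and the function names are hypothetical. An overlap only flags an antibody for closer evaluation; it does not by itself establish loss of binding.

```python
# Partial residue-to-ES map, restricted to assignments quoted in the text.
RESIDUE_TO_ES = {346: 2, 445: 11, 446: 11, 452: 12, 490: 19, 493: 19}

# ES sets reported in the text for the two components of Evusheld.
AB_TO_ES = {
    "tixagevimab": {13, 16, 18, 19, 20},
    "cilgavimab": {2, 10, 11, 12},
}


def affected_sites(mutated_residues, residue_to_es=RESIDUE_TO_ES):
    """Return the ES that contain at least one mutated residue."""
    return {residue_to_es[r] for r in mutated_residues if r in residue_to_es}


def antibodies_to_review(mutated_residues, ab_to_es=AB_TO_ES):
    """Map each antibody to the sorted list of its ES that carry a mutation."""
    hit = affected_sites(mutated_residues)
    return {ab: sorted(es & hit) for ab, es in ab_to_es.items() if es & hit}


# RBD positions named in the text as substituted in XBB.1.5
# (R346T, V445P, G446S, F490S, R493Q).
xbb15_positions = [346, 445, 446, 490, 493]
sites = affected_sites(xbb15_positions)
review = antibodies_to_review(xbb15_positions)
```

With the full Table 4 map and the ES sets from Supplementary Data 1 and 2, the same intersection could be run over every structurally characterized Ab or Nb to shortlist those whose ES remain unmutated in a new variant.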

Discussion

The enormous world-wide effort to elucidate the mechanistic underpinnings of the immune response to SARS-CoV-2 has provided deep insight into aspects of the B cell and T cell responses to infection and immunization and has contributed to ongoing strategies for therapy and prevention. Here, we have taken advantage of the ever-increasing structural database of anti-SARS-CoV-2 Abs and Nbs to analyze the three-dimensional features that are described by X-ray and cryo-EM structures of Ab and Nb complexes with the RBD of the virus, either alone or in the context of the full spike protein. We have developed several analytical computational tools, described in detail in the Methods, that allow the tabulation and analysis of molecular contacts and ES between the Abs/Nbs and the RBD. These provide a convenient avenue for querying and comparing the binding sites and interactions of particular Abs/Nbs and will support additional queries as the CovAbDab and PDB entries increase. This has permitted the categorization of the epitope-paratope interactions and molecular surface characteristics that lend themselves to recognition by Abs and the recurrent structural motifs of the CDR residues of the Abs/Nbs. This identification of 23 ES derives from evaluation of a large number of Ab/Nb-RBD and Ab/Nb-spike structures and their interface contacts, and thus surpasses analyses based on amino acid sequence or gross structural comparison alone. Our method of clustering ES with various stringencies, independently of the antibodies that recognize them, offers an additional tool towards the goal of prediction of CDR sequences that recognize particular epitopic sites. This analysis is focused on the RBD alone, and does not take into account potential contacts with the glycan moiety linked at N343. Only 27 of 340 Abs make any contact with residue N343.

Of some 340 Abs and 83 Nbs, our analysis indicates that the 23 ES on the RBD, characterized in part by secondary structural features, may be recognized at different frequencies. This fine-grained analysis of the RBD surface reveals that as many as 10% of Abs may recognize common features such as those of ES16 as seen by Abs, or of ES11 as seen by Nbs. Our findings and definition of 23 ES are not dependent on the distance cut-off parameter used in computing the Ag-Ab contacts. Although the total numbers of contacts may increase slightly with a longer distance cut-off, the percentages of contacts in each ES bin are almost the same (as shown in Supplementary Fig. 5) with the distance cut-offs at 4.0 Å (gray), 4.5 Å (blue), 5.0 Å (red) and 5.5 Å (green) respectively (H chain only).

Understanding the biophysical or structural characteristics of antigenic or immunogenic sites on protein antigens has been a subject of considerable interest for many years, beginning with efforts to understand common sites seen by heterogeneous Abs and further refined as monoclonal Abs have been studied^{6,40,42,43,62}. Recent efforts have identified common motifs that human antibodies exploit to bind similar epitopes⁶³. Consistent features of antigenic sites include hydrophobicity, accessibility, and segmental mobility as well as sequence dissimilarity to the Ab-producing organism (tolerance). Here we have taken the opportunity to investigate a large number of Abs and Nbs for which the antigenic site of a single protein is defined at high resolution by structural criteria. We took advantage of currently available structures of complexes from a database of antibodies that derive largely from patients or vaccinees but also from mice, and that include single domain antibodies (nanobodies) derived from immune or naïve libraries (see Table 1). Of course, analysis of structures compiled in any antibody database may be biased by a variety of factors including the biological source(s) of the antibodies (from natural infection or immunization; or from naïve or immune based libraries), whether they could be engineered effectively to produce adequate amounts of protein for X-ray or cryo-EM analysis, whether the proteins crystallized well, or how well they bind variant viral proteins. Despite such potentially confounding factors, several important consistent conclusions may be drawn: (1) common sites are recognized by both Abs and Nbs; (2) several major surfaces of the RBD have not been addressed by either Abs or Nbs; and (3) some sites are favored by either Abs (e.g., ES16 and ES18) or by Nbs (e.g., ES4 and ES5). 
This latter phenomenon may reflect germline VH gene preferences in the human (as suggested⁶⁴) or the well-recognized characteristic of Nbs, whose relatively long CDR3 loops are capable of exploring concave surfaces⁶⁵.

Our analysis suggests that several regions of the RBD that are recognized by a higher proportion of Abs (such as ES11, 13, 16, 18, and 20) may be particularly important to incorporate into peptide-based immunogens, and that future generations of vaccines might pay particular attention to new viral variants that affect these sites. Alternatively, Ab therapies may benefit from a focus on reagents that recognize both common antigenic sites and those that are rarely identified. Although our analysis here has been confined to Abs/Nbs that recognize the RBD of the spike protein of SARS-CoV-2, this approach may, in principle, be applied to a variety of Abs/Nbs directed against proteins of pathogenic organisms.

Methods

Datasets

SARS-CoV-2 Ab and Nb sequences were culled from the Coronavirus Antibody Database, CoV-AbDab (http://opig.stats.ox.ac.uk/webapps/covabdab/)²⁶, and coordinates of three-dimensional models were taken from the Protein Data Bank (PDB) (https://www.rcsb.org/ and https://rcsb.org/covid19/)^27,28. Using the CoV-AbDab list as of 20 December 2022, we downloaded from the PDB all complexes of Ab/spike or Ab/RBD structures determined by X-ray crystallography or cryo-EM. The total number of downloaded PDB entries is about 595 (see the “Related Structures” column in Supplementary Data 1). We manually curated this structure dataset as follows: (a) we removed structures with resolution worse than 5.0 Å; (b) when the same Ab appeared in two or more different structures, we selected the one of highest resolution; (c) if an Ab had both X-ray and cryo-EM structures, we removed the one of worse resolution, regardless of the method; (d) since the spike is a trimer, if two or more copies of the same Ab bound to different chains of the same spike, we considered only one complex with this Ab; (e) in the case of a bispecific or bivalent Ab, or of multiple Abs/Nbs in the same PDB structure, we evaluated two or more such “unique” Ab/RBD complexes; and (f) CDR designations were taken from the CoV-AbDab database, which follows the IMGT numbering system. The sequence designations of CDRH3 and CDRL3 were taken directly from the CoV-AbDab download. CDRH1, CDRH2, CDRL1, and CDRL2 were identified by sequence alignment.
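Curation rules (a) to (c) above amount to a filter followed by a per-antibody selection, which can be sketched in a few lines of Python. This is only an illustrative sketch and not the authors' curation, which was done manually: the `Entry` class, the `curate` function, and the antibody names and PDB ids in the example are all hypothetical, and rules (d) and (e), which require inspecting chains within a structure, are not covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    antibody: str      # antibody name as listed in the database (hypothetical here)
    pdb_id: str        # placeholder identifiers, not real PDB entries
    method: str        # "X-ray" or "cryo-EM"
    resolution: float  # in angstroms; a lower value means better resolution


def curate(entries, max_resolution=5.0):
    """Keep one entry per antibody: the best-resolved one at or below the cut-off."""
    best = {}
    for e in entries:
        if e.resolution > max_resolution:
            continue  # rule (a): drop structures with resolution worse than 5.0 A
        kept = best.get(e.antibody)
        # rules (b) and (c): keep the highest-resolution structure, whatever the method
        if kept is None or e.resolution < kept.resolution:
            best[e.antibody] = e
    return sorted(best.values(), key=lambda e: e.antibody)


# Invented example records.
entries = [
    Entry("AbA", "1AAA", "X-ray", 2.8),
    Entry("AbA", "1AAB", "cryo-EM", 3.4),
    Entry("AbB", "1BBB", "cryo-EM", 6.1),
    Entry("AbC", "1CCC", "cryo-EM", 3.0),
]
curated = curate(entries)
```

In this toy input, "AbA" keeps its better-resolved structure, "AbB" is dropped because its only structure is worse than 5.0 Å, and "AbC" is kept unchanged.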

This procedure resulted in 340 unique “non-redundant” neutralizing antibodies from 595 PDB entries (see Supplementary Data 1). We performed the same analysis for Nbs (a list of 124 structures reduced to 83 “non-redundant” Nbs; see Supplementary Data 2). We did not rectify any missing residues or atoms and kept the original models from the downloaded PDB structures. We used the CNS 1.3 script “contact.inp” with a distance cut-off of 5.0 Å to compute the interacting contacts between Abs and the RBD for these 340 structures. If two or more residues from an Ab contacted the same residue on the RBD within 5.0 Å, we listed them as multiple contacts. This procedure produced the “raw” epitope-paratope contact dataset (see github.com/jiangj-niaid/RBD-SARS2/340absH-contact-dis-0207.txt) for the following analysis step. Of the curated structures, 59 of the 340 Ab and 4 of the 83 Nb structures were determined with variant RBDs. Our analysis included all variants, but eliminating those variants from the analysis showed little difference in the residue/hit plots (see Supplementary Data 1 and 2).
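The contact criterion described above can be illustrated with a short, dependency-free Python sketch. It is not the CNS “contact.inp” script, whose exact atom selection is not described here; the sketch assumes that a residue pair is in contact when any pair of their atoms lies within the cut-off, and the `Atom` tuple, the `residue_contacts` function, and the coordinates are invented for illustration.

```python
import math
from collections import namedtuple

# Minimal atom record: chain id, residue name, residue number, and coordinates in angstroms.
Atom = namedtuple("Atom", "chain resname resid x y z")


def residue_contacts(epitope_atoms, paratope_atoms, cutoff=5.0):
    """Return {(epitope residue, paratope residue): shortest atom-atom distance}.

    Assumption: two residues are in contact if any atom pair is within `cutoff`.
    """
    contacts = {}
    for a in epitope_atoms:
        for b in paratope_atoms:
            d = math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))
            if d <= cutoff:
                key = ((a.chain, a.resname, a.resid), (b.chain, b.resname, b.resid))
                # Keep the shortest distance seen for this residue pair.
                contacts[key] = min(d, contacts.get(key, d))
    return contacts


# Invented coordinates: one atom per residue to keep the example small.
rbd = [Atom("A", "TYR", 449, 0.0, 0.0, 0.0), Atom("A", "GLN", 498, 10.0, 0.0, 0.0)]
ab = [
    Atom("H", "SER", 31, 3.0, 0.0, 0.0),
    Atom("H", "TYR", 33, 0.0, 4.0, 0.0),
    Atom("H", "ASP", 100, 20.0, 0.0, 0.0),
]
contacts = residue_contacts(rbd, ab)
```

Here two different Ab residues fall within 5.0 Å of the same RBD residue, so they appear as two separate entries, mirroring the "multiple contacts" convention in the text.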

Buried surface area (BSA) is an important structural characteristic and a quantitative measure of interaction at the interface. BSA values correlate with the sum of individual contacts and relate directly to binding affinity or neutralizing potency. We also calculated the BSA values (using the PISA program in CCP4) for each chain (H and L) of an antibody or nanobody.

Based on these curations and evaluations, we created annotation files for the Abs and Nbs (see Supplementary Data 1 and 2, and github.com/jiangj-niaid/RBD-SARS2/). The Data 1 file contains the following data items: Name of Abs, PDB id, epitope-chain-id, paratope-chain-ids, R/S (RBD alone or spike), X/E (X-ray or cryo-EM), Resolution, BSA H chain, BSA L chain, BSA H + L chains, ES H chain, ES L chain, Related Structures, Variants, and list of Mutations (on RBD). The Data 2 file contains the following data items: Name of Nbs, PDB id, epitope-chain-id, paratope-chain-id, R/S (RBD alone or spike), X/E (X-ray or cryo-EM), Resolution, BSA, ES, Related Structures, Variants, and list of Mutations (on RBD).

The resulting “EPI contact datasets” are composed of the following data items for each contact: PDB id and name of Ab or Nb, Chain-ids (epitope-paratope), Residue-name of RBD, Residue-id of RBD, ES-id, Residue-name of Ab or Nb, Residue-id of Ab or Nb, Distance, and CDR-id (see github.com/jiangj-niaid/RBD-SARS2/).
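One way to picture a contact record with the fields listed above, and how such records could be tallied into per-ES percentages at a chosen distance cut-off, is the following Python sketch. It does not reproduce the file layout in the repository: the `Contact` class, the `es_percentages` function, and every value in the example (including the ES numbers, which are not taken from Table 2) are assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Contact:
    pdb_id: str
    name: str         # name of the Ab or Nb
    chains: str       # "epitope-paratope" chain ids, e.g. "A-H"
    rbd_resname: str
    rbd_resid: int
    es_id: int        # epitopic site number, 1-23
    ab_resname: str
    ab_resid: int
    distance: float   # in angstroms
    cdr_id: str


def es_percentages(contacts, cutoff):
    """Percentage of contacts in each ES bin, counting only contacts within `cutoff`."""
    counts = Counter(c.es_id for c in contacts if c.distance <= cutoff)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {es: 100.0 * n / total for es, n in sorted(counts.items())}


# Invented records; the ES assignments are placeholders, not values from Table 2.
records = [
    Contact("1AAA", "AbA", "A-H", "TYR", 449, 16, "SER", 31, 3.5, "CDRH1"),
    Contact("1AAA", "AbA", "A-H", "TYR", 449, 16, "TYR", 33, 4.6, "CDRH1"),
    Contact("1AAA", "AbA", "A-H", "GLN", 498, 20, "ASP", 100, 3.9, "CDRH3"),
    Contact("1AAA", "AbA", "A-H", "GLN", 498, 20, "GLY", 101, 4.8, "CDRH3"),
]
at_4 = es_percentages(records, 4.0)
at_5 = es_percentages(records, 5.0)
```

In this toy dataset the number of contacts doubles between the 4.0 Å and 5.0 Å cut-offs while the share per ES bin stays the same, which is the kind of behavior reported for the cut-off comparison in the Discussion.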

Software

All analyses were performed with our EPI (Epitope-Paratope Interaction) software package of mixed scripts in C-shell, Perl, and Python. EPI software is available at https://github.com/jiangj-niaid/EPI/. Supplementary Fig. 7 shows the flowchart of this package. We downloaded the sequences, including the extracted CDR sequences, from CoV-AbDab, downloaded all structures of Abs/Nbs in complex with spike or RBD from the PDB, and derived a “non-redundant” structural dataset (see “Datasets” above). The input files and parameters include: (a) a list of PDB IDs and names of Abs/Nbs along with the epitope-chain and paratope-chain designations; (b) the predefined ES residue ranges (in Table 2); (c) the predefined CDR loops; (d) a list of known variants and mutations; and (e) the contact distance cut-off (default is 5.0 Å). The program generates “EPI contact datasets” according to the input files and PDB id list. Separate sub-datasets for the Ab H chain, the Ab L chain, or Nbs may be designated. Based on this EPI dataset, the analysis scripts then perform a structural alignment and evaluate the statistics of the CDR loops and amino acid usage. Additional variants and mutants may be detected.

Contact distances were calculated based on scripts taken from CNS 1.3 (http://cns-online.org/v1.3/)⁶⁶. Buried surface area (BSA)^34,67,68 was calculated with PISA (Proteins, Interfaces, Structures and Assemblies³⁴), and accessible surface area (ASA)^{35,69,70,71,72} was calculated with CNS 1.3.

The clustering method used in EPI is based on the ES (i.e., the RBD binding sites), not on the amino acid sequences of the Abs or Nbs. The ES numbers (1–23) are converted to the corresponding 23 letters from “a” to “w”, and the similarity between sets of ES is computed using the Normalized Edit Distance, which was developed from the Hamming⁷³ and Levenshtein⁷⁴ distances. A similarity of 1 indicates that the two strings or two ES sets are identical; a similarity of 0 indicates that the two strings or ES sets are completely different. The similarity is then calculated for pairwise combinations of all Abs or Nbs based on their ES sets. Abs or Nbs can be clustered by imposing a similarity threshold. For the 340 Abs we empirically evaluated similarity thresholds from 0.50 to 0.99 at 0.05 intervals and found that a similarity threshold of 0.85 yielded 33 clusters. For the 83 Nbs a similarity threshold of 0.85 yielded 10 clusters.
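The encoding and similarity steps can be sketched in Python as follows. Several details are assumptions rather than the EPI implementation: each ES set is encoded as the letters of the sites it contains, in ascending order; similarity is taken as one minus the Levenshtein distance divided by the length of the longer string; and clusters are formed greedily by comparing each item with the first member of each existing cluster. The function names and the example Ab names are hypothetical.

```python
def es_to_string(es_set):
    """Encode a set of ES numbers (1-23) as letters 'a'-'w', in ascending order."""
    return "".join(chr(ord("a") + es - 1) for es in sorted(es_set))


def levenshtein(s, t):
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        cur = [i]
        for j, ct in enumerate(t, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (cs != ct)))
        prev = cur
    return prev[-1]


def similarity(s, t):
    """Assumed normalization: 1 for identical strings, 0 for completely different ones."""
    longest = max(len(s), len(t))
    return 1.0 if longest == 0 else 1.0 - levenshtein(s, t) / longest


def cluster(names_to_es, threshold=0.85):
    """Greedy clustering (an assumption): join the first cluster whose
    representative string is at least `threshold` similar, else start a new one."""
    clusters = []  # list of (representative string, member names)
    for name, es_set in names_to_es.items():
        s = es_to_string(es_set)
        for rep, members in clusters:
            if similarity(rep, s) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((s, [name]))
    return [members for _, members in clusters]


# Invented ES sets for three hypothetical antibodies.
example = {"Ab1": {11, 13, 16}, "Ab2": {11, 13, 16}, "Ab3": {4, 5}}
groups = cluster(example)
```

With these inputs, "Ab1" and "Ab2" share an identical ES string and fall into one cluster, while "Ab3" shares no sites with them and forms its own. A different linkage rule (for example, single or complete linkage) would be an equally reasonable reading of "imposing a similarity threshold" and could give different cluster counts.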

The AIMS analysis package⁵¹ used for biophysical clustering of antibody sequences can be found at https://github.com/ctboughter/AIMS, including generalized Jupyter Notebooks and a Python-based GUI for the replication of the results presented herein or for the application of this analysis to novel datasets. Detailed descriptions of the foundational concepts critical for this analysis and the instructions for use can be found at https://aims-doc.readthedocs.io.

Figures for structural models were generated using PyMOL⁷⁵ (https://pymol.org/2/). Sequence logo figures were generated with WebLogo (https://weblogo.berkeley.edu/)⁴⁴. Sequence alignments were made with Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/)⁷⁶. Graphic plots were generated with Prism 9 (https://GraphPad.com).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data generated for analysis in this study have been made available on ZENODO at https://zenodo.org/record/8241951 (ref: doi/10.5281/zenodo.8241951).

Code availability

All code generated for analysis in this study, comprising our EPI (Epitope-Paratope Interaction) software package of mixed scripts in C-shell, Perl, and Python, has been made available on GitHub at https://github.com/jiangj-niaid/EPI/.

References

Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Yin, R., Feng, B. Y., Varshney, A. & Pierce, B. G. Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants. Protein Sci. 31, e4379 (2022).
Aguilar Rangel, M. et al. Fragment-based computational design of antibodies targeting structured epitopes. Sci. Adv. 8, eabp9540 (2022).
Fischman, S. & Ofran, Y. Computational design of antibodies. Curr. Opin. Struct. Biol. 51, 156–162 (2018).
Chidyausiku, T. M. et al. De novo design of immunoglobulin-like domains. Nat. Commun. 13, 5661 (2022).
Tiselius, A. & Kabat, E. A. An electrophoretic study of immune sera and purified antibody preparations. J. Exp. Med. 69, 119–131 (1939).
Lu, L. L., Suscovich, T. J., Fortune, S. M. & Alter, G. Beyond binding: antibody effector functions in infectious diseases. Nat. Rev. Immunol. 18, 46–61 (2018).
Jones, S. & Thornton, J. M. Principles of protein-protein interactions. Proc. Natl Acad. Sci. USA 93, 13–20 (1996).
Davies, D. R. & Cohen, G. H. Interactions of protein antigens with antibodies. Proc. Natl Acad. Sci. USA 93, 7–12 (1996).
Burley, S. K. et al. Electron microscopy holdings of the Protein Data Bank: the impact of the resolution revolution, new validation tools, and implications for the future. Biophys. Rev., 1–21 https://doi.org/10.1007/s12551-022-01013-w (2022).
Mendoza, P., Lorenzi, J. C. C. & Gaebler, C. COVID-19 antibody development fueled by HIV-1 broadly neutralizing antibody research. Curr. Opin. HIV AIDS 16, 25–35 (2021).
Uversky, V. N. & Van Regenmortel, M. H. V. Mobility and disorder in antibody and antigen binding sites do not prevent immunochemical recognition. Crit. Rev. Biochem. Mol. Biol. 56, 149–156 (2021).
Regep, C., Georges, G., Shi, J., Popovic, B. & Deane, C. M. The H3 loop of antibodies shows unique structural characteristics. Proteins 85, 1311–1318 (2017).
Barnes, C. O. et al. SARS-CoV-2 neutralizing antibody structures inform therapeutic strategies. Nature 588, 682–687 (2020).
Shang, J. et al. Structural basis of receptor recognition by SARS-CoV-2. Nature 581, 221–224 (2020).
Dejnirattisai, W. et al. The antigenic anatomy of SARS-CoV-2 receptor binding domain. Cell 184, 2183–2200 (2021). e2122.
Hastie, K. M. et al. Defining variant-resistant epitopes targeted by SARS-CoV-2 antibodies: a global consortium study. Science 374, 472–478 (2021).
Greaney, A. J. et al. The SARS-CoV-2 delta variant induces an antibody response largely focused on class 1 and 2 antibody epitopes. PLoS Pathog. 18, e1010592 (2022).
Greaney, A. J. et al. Mapping mutations to the SARS-CoV-2 RBD that escape binding by different classes of antibodies. Nat. Commun. 12, 4196 (2021).
Starr, T. N. et al. SARS-CoV-2 RBD antibodies that maximize breadth and resistance to escape. Nature 597, 97–102 (2021).
Lubin, J. H. et al. Modeling of ACE2 and antibodies bound to SARS-CoV-2 provides insights into infectivity and immune evasion. JCI Insight 8, e168296 (2023).
Deshpande, A., Harris, B. D., Martinez-Sobrido, L., Kobie, J. J. & Walter, M. R. Epitope classification and RBD binding properties of neutralizing antibodies against SARS-CoV-2 variants of concern. Front. Immunol. 12, 691715 (2021).
Di Rienzo, L. et al. Dynamical changes of SARS-CoV-2 spike variants in the highly immunogenic regions impact the viral antibodies escaping. Proteins https://doi.org/10.1002/prot.26497 (2023).
Rossotti, M. A. et al. Arsenal of nanobodies shows broad-spectrum neutralization against SARS-CoV-2 variants of concern in vitro and in vivo in hamster models. Commun. Biol. 5, 933 (2022).
Ahmad, J. et al. Structures of synthetic nanobody-SARS-CoV-2 receptor-binding domain complexes reveal distinct sites of interaction. J. Biol. Chem. 297, 101202 (2021).
Raybould, M. I. J., Kovaltsuk, A., Marks, C. & Deane, C. M. CoV-AbDab: the coronavirus antibody database. Bioinformatics 37, 734–735 (2021).
Sussman, J. L. et al. Protein Data Bank (PDB): database of three-dimensional structural information of biological macromolecules. Acta Crystallogr. D Biol. Crystallogr. 54, 1078–1084 (1998).
Berman, H. M. et al. The Protein Data Bank. Acta Crystallogr. D Biol. Crystallogr. 58, 899–907 (2002).
Kastritis, P. L., Rodrigues, J. P. & Bonvin, A. M. HADDOCK(2P2I): a biophysical model for predicting the binding affinity of protein-protein interaction inhibitors. J. Chem. Inf. Model. 54, 826–836 (2014).
Lawrence, M. C. & Colman, P. M. Shape complementarity at protein/protein interfaces. J. Mol. Biol. 234, 946–950 (1993).
Thornton, J. M., Singh, J., Campbell, S. & Blundell, T. L. Protein-protein recognition via side-chain interactions. Biochem. Soc. Trans. 16, 927–930 (1988).
Laskowski, R. A. et al. PDBsum: a Web-based database of summaries and analyses of all PDB structures. Trends Biochem. Sci. 22, 488–490 (1997).
Laskowski, R. A., Chistyakov, V. V. & Thornton, J. M. PDBsum more: new summaries and analyses of the known 3D structures of proteins and nucleic acids. Nucleic Acids Res. 33, D266–D268 (2005).
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 (2007).
Lee, B. & Richards, F. M. The interpretation of protein structures: estimation of static accessibility. J. Mol. Biol. 55, 379–400 (1971).
Vangone, A. & Bonvin, A. M. Contacts-based prediction of binding affinity in protein-protein complexes. Elife 4, e07454 (2015).
Salamanca Viloria, J., Allega, M. F., Lambrughi, M. & Papaleo, E. An optimal distance cutoff for contact-based protein structure networks using side-chain centers of mass. Sci. Rep. 7, 2838 (2017).
Krawczyk, K., Liu, X., Baker, T., Shi, J. & Deane, C. M. Improving B-cell epitope prediction and its application to global antibody-antigen docking. Bioinformatics 30, 2288–2294 (2014).
Ingram, J. R. et al. Anti-CTLA-4 therapy requires an Fc domain for efficacy. Proc. Natl Acad. Sci. USA 115, 3912–3917 (2018).
Kabat, E. A., Wu, T. T. & Bilofsky, H. Attempts to locate residues in complementarity-determining regions of antibody combining sites that make contact with antigen. Proc. Natl Acad. Sci. USA 73, 617–619 (1976).
Chothia, C. et al. Conformations of immunoglobulin hypervariable regions. Nature 342, 877–883 (1989).
Alzari, P. M., Lascombe, M. B. & Poljak, R. J. Three-dimensional structure of antibodies. Annu. Rev. Immunol. 6, 555–580 (1988).
Padlan, E. A. Structural basis for the specificity of antibody-antigen reactions and structural mechanisms for the diversification of antigen-binding specificities. Q Rev. Biophys. 10, 35–65 (1977).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Robinson, S. A. et al. Epitope profiling using computational structural modelling demonstrated on coronavirus-binding antibodies. PLoS Comput. Biol. 17, e1009675 (2021).
Wang, Y. et al. A large-scale systematic survey reveals recurring molecular features of public antibody responses to SARS-CoV-2. Immunity 55, 1105–1117 (2022).
Liu, L. et al. An antibody class with a common CDRH3 motif broadly neutralizes sarbecoviruses. Sci. Transl. Med. 14, eabn6859 (2022).
Tan, T. J. C. et al. Sequence signatures of two public antibody clonotypes that bind SARS-CoV-2 receptor binding domain. Nat. Commun. 12, 3815 (2021).
Yuan, M. et al. Structural basis of a shared antibody response to SARS-CoV-2. Science 369, 1119–1123 (2020).
Zhang, Q. et al. Potent and protective IGHV3-53/3-66 public antibodies and their shared escape mutant on the spike of SARS-CoV-2. Nat. Commun. 12, 4210 (2021).
Boughter, C. T. et al. Biochemical patterns of antibody polyreactivity revealed through a bioinformatics-based analysis of CDR loops. Elife 9, e61393 (2020).
Wang, Q. et al. Antibody evasion by SARS-CoV-2 Omicron subvariants BA.2.12.1, BA.4 and BA.5. Nature 608, 603–608 (2022).
Tuekprakhon, A. et al. Antibody escape of SARS-CoV-2 Omicron BA.4 and BA.5 from vaccine and BA.1 serum. Cell 185, 2422–2433 (2022).
Nutalai, R. et al. Potent cross-reactive antibodies following Omicron breakthrough in vaccinees. Cell 185, 2116–2131 (2022).
Cao, Y. et al. Rational identification of potent and broad sarbecovirus-neutralizing antibody cocktails from SARS convalescents. Cell Rep. 41, 111845 (2022).
Cao, Y. et al. Imprinted SARS-CoV-2 humoral immunity induces convergent Omicron RBD evolution. Nature https://doi.org/10.1038/s41586-022-05644-7 (2022).
Mannar, D. et al. SARS-CoV-2 Omicron variant: antibody evasion and cryo-EM structure of spike protein-ACE2 complex. Science 375, 760–764 (2022).
McCallum, M. et al. Structural basis of SARS-CoV-2 Omicron immune evasion and receptor engagement. Science 375, 864–868 (2022).
Wu, Y. et al. Lineage-mosaic and mutation-patched spike proteins for broad-spectrum COVID-19 vaccine. Cell Host Microbe 30, 1732–1744 (2022).
Dong, J. et al. Genetic and structural basis for SARS-CoV-2 variant neutralization by a two-antibody cocktail. Nat. Microbiol. 6, 1233–1244 (2021).
Parzych, E. M. et al. DNA-delivered antibody cocktail exhibits improved pharmacokinetics and confers prophylactic protection against SARS-CoV-2. Nat. Commun. 13, 5886 (2022).
Benjamin, D. C. et al. The antigenic structure of proteins: a reappraisal. Annu Rev. Immunol. 2, 67–101 (1984).
Shrock, E. L. et al. Germline-encoded amino acid-binding motifs drive immunodominant public antibody responses. Science 380, eadc9498 (2023).
Barnes, C. O. et al. Structures of human antibodies bound to SARS-CoV-2 spike reveal common epitopes and recurrent features of antibodies. Cell 182, 828–842 (2020).
Arbabi-Ghahroudi, M. Camelid single-domain antibodies: promises and challenges as lifesaving treatments. Int. J. Mol. Sci. 23 https://doi.org/10.3390/ijms23095009 (2022).
Brunger, A. T. et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54, 905–921 (1998).
Chothia, C. & Janin, J. Principles of protein-protein recognition. Nature 256, 705–708 (1975).
Chen, J., Sawyer, N. & Regan, L. Protein-protein interactions: general trends in the relationship between binding affinity and interfacial buried surface area. Protein Sci. 22, 510–515 (2013).
‘NACCESS’, computer program (Department of Biochemistry and Molecular Biology, University College, 1993).
Jones, S. & Thornton, J. M. Analysis of protein-protein interaction sites using surface patches. J. Mol. Biol. 272, 121–132 (1997).
Fraczkiewicz, R. & Braun, W. Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules. J. Comp. Chem. 19, 319–333 (1998).
Ribeiro, J., Rios-Vera, C., Melo, F. & Schuller, A. Calculation of accurate interatomic contact surface areas for the quantitative analysis of non-bonded molecular interactions. Bioinformatics 35, 3499–3501 (2019).
Hamming, R. W. Error detecting and error correcting codes. Bell Syst. Tech. J. 29, 147–160 (1950).
Levenshtein, V. I. Binary codes capable of correcting deletions, insertions, and reversals. Sov. Phys. Dokl. 10, 707–710 (1966).
The PyMOL Molecular Graphics System, Version 2.5.4 (Schrödinger, LLC, 2023).
Madeira, F. et al. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Res. 50, W276–W279 (2022).
Xu, J. et al. Nanobodies from camelid mice and llamas neutralize SARS-CoV-2 variants. Nature 595, 278–282 (2021).
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).

Acknowledgements

This research was supported by the Intramural Research Program of the National Institute of Allergy and Infectious Diseases, NIH.

Funding

Open Access funding provided by the National Institutes of Health (NIH).

Author information

Authors and Affiliations

Molecular Biology Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD, 20892, USA
Jiansheng Jiang, Javeed Ahmad, Kannan Natarajan, Lisa F. Boyd & David H. Margulies
Computational Biology Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD, 20892, USA
Christopher T. Boughter & Martin Meier-Schellersheim

Contributions

J.J. conceived the project, wrote programs, analyzed and discussed data, and prepared figures. C.T.B. contributed program scripts, analyzed and discussed data, and prepared figures. J.J., C.T.B., J.A., K.N., L.F.B., M.M.-S. and D.H.M. analyzed and discussed data, and wrote and revised the paper.

Corresponding authors

Correspondence to Jiansheng Jiang or David H. Margulies.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Brandon Havranek and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Gene Chong. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Material

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Movie 1

Supplementary Movie 2

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Jiang, J., Boughter, C.T., Ahmad, J. et al. SARS-CoV-2 antibodies recognize 23 distinct epitopic sites on the receptor binding domain. Commun Biol 6, 953 (2023). https://doi.org/10.1038/s42003-023-05332-w

Received: 04 May 2023
Accepted: 07 September 2023
Published: 19 September 2023
DOI: https://doi.org/10.1038/s42003-023-05332-w
