Predominant structural configuration of natural antibody repertoires enables potent antibody responses against protein antigens

Chen, Hong-Sen; Hou, Shin-Chen; Jian, Jhih-Wei; Goh, King-Siang; Shen, San-Tai; Lee, Yu-Ching; You, Jhong-Jhe; Peng, Hung-Pin; Kuo, Wen-Chih; Chen, Shui-Tsung; Peng, Ming-Chi; Wang, Andrew H.-J.; Yu, Chung-Ming; Chen, Ing-Chien; Tung, Chao-Ping; Chen, Tzu-Han; Ping Chiu, Kuo; Ma, Che; Yuan Wu, Chih; Lin, Sheng-Wei; Yang, An-Suei

doi:10.1038/srep12411

Download PDF

Article
Open access
Published: 23 July 2015

Predominant structural configuration of natural antibody repertoires enables potent antibody responses against protein antigens

Hong-Sen Chen¹^na1,
Shin-Chen Hou¹^na1,
Jhih-Wei Jian^1,2,3^na1,
King-Siang Goh¹^na1,
San-Tai Shen¹^na1,
Yu-Ching Lee¹^na1,
Jhong-Jhe You¹^na1,
Hung-Pin Peng¹^na1,
Wen-Chih Kuo⁴^na1,
Shui-Tsung Chen⁴^na1,
Ming-Chi Peng⁴^na1,
Andrew H.-J. Wang⁴^na1,
Chung-Ming Yu¹^na1,
Ing-Chien Chen¹^na1,
Chao-Ping Tung¹^na1,
Tzu-Han Chen¹^na1,
Kuo Ping Chiu¹^na1,
Che Ma¹^na1,
Chih Yuan Wu¹^na1,
Sheng-Wei Lin⁴^na1 &
…
An-Suei Yang¹^na1

Scientific Reports volume 5, Article number: 12411 (2015) Cite this article

5543 Accesses
16 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Humoral immunity against diverse pathogens is rapidly elicited from natural antibody repertoires of limited complexity. But the organizing principles underlying the antibody repertoires that facilitate this immunity are not well-understood. We used HER2 as a model immunogen and reverse-engineered murine antibody response through constructing an artificial antibody library encoded with rudimentary sequence and structural characteristics learned from high throughput sequencing of antibody variable domains. Antibodies selected in vitro from the phage-displayed synthetic antibody library bound to the model immunogen with high affinity and specificities, which reproduced the specificities of natural antibody responses. We conclude that natural antibody structural repertoires are shaped to allow functional antibodies to be encoded efficiently, within the complexity limit of an individual antibody repertoire, to bind to diverse protein antigens with high specificity and affinity. Phage-displayed synthetic antibody libraries, in conjunction with high-throughput sequencing, can thus be designed to replicate natural antibody responses and to generate novel antibodies against diverse antigens.

Depleting myeloid-biased haematopoietic stem cells rejuvenates aged immunity

Article 27 March 2024

Jason B. Ross, Lara M. Myers, … Irving L. Weissman

DNA glycosylases provide antiviral defence in prokaryotes

Article Open access 17 April 2024

Amer A. Hossain, Ying Z. Pigli, … Luciano A. Marraffini

An autoantibody signature predictive for multiple sclerosis

Article 19 April 2024

Colin R. Zamecnik, Gavin M. Sowa, … Michael R. Wilson

Introduction

Antibodies binding to different epitopes on an antigen result in different biological consequences. Pertuzumab binds to the dimerization domain (domain II) of HER2 (human epidermal growth factor receptor 2) and inhibits the dimerization of HER2 with other HER receptors, in turn inhibiting the down-stream transduction of tumor survival signals¹; trastuzumab targeting at the fourth domain (domain IV) of HER2² inhibits HER2-positive breast cancer growth by down-regulating the HER2 expression, inhibiting the releasing of the HER2 ectodomain and inducing antibody-dependent cell-mediated cytotoxicity (ADCC).

Epitopes on an immunogen that are recognized by antibodies in an individual are largely determined by the individual’s antibody repertoire. While the entire accessible surface of a protein immunogen can be antigenic, i.e., no inherent property of the protein molecule could restrict epitope locations on the protein surface^3,4,5, individual antibody response against the immunogen elicits antibodies binding to only a few predominant epitopes³. The natural antibody responses can be partially understood by the large number of B cells (on the order of 10¹¹ in a human body⁶), each of which expresses a unique B cell receptor (BCR) through antibody gene segment recombination and segment junction diversification⁷. When exposed to a novel immunogen, the immune system searches for suitable BCRs in the B cell repertoire, followed by clonal expansion and somatic hyper-mutation⁸ (SHM) leading to BCR affinity maturation in the germinal centers of the peripheral lymphoid tissues^9,10. Meanwhile, the epitopes of the BCRs on the immunogen remain largely unchanged throughout the affinity maturation process¹¹; the initial selection of the naïve BCRs in the repertoire defines the epitopes of the antibody response. This notion has been supported by a large body of evidence: For example, twelve monoclonal antibodies derived independently from mice immunized with different form of amyloid β peptide are found to bind to the same immunodominant epitope using the same VH/VL germline sequence pairing with the key antigen-contacting residues already encoded in the germline sequence¹². Also, a large number of broadly neutralizing antibodies found in diverse human individuals target the immunodominant stem region epitope on hemagglutinin of influenza virus with IGHV1-69 germline heavy chain variable domain sequence affinity-matured with only a few SHMs^13,14.

Well-configured antibody repertoires are key to the protective humoral immunity, but the working principles underlying the BCR populations of the antibody repertoires are not known. Searching for functional antibodies from the vast sequence space of antibody variants to counter infinite diversity of potential immunogens can be intractable without a well-configured antibody repertoire, which need to provide near-hit solutions each time encountering a novel immunogen. The remarkable functionality of the natural antibody repertoires must reside in the repertoire composition, where nature solves the intractable sequence space search problem, after billions of years of evolution, by shaping the antibody repertoires as such that adequate antibodies are readily available in the repertoire to respond to almost any immunogen challenges without extensively relying on SHM to explore the vast sequence space. This expectation has been supported by the evidence where antibodies selected in vitro from phage-displayed human antibody libraries¹⁵ or phage-displayed synthetic antibody libraries mimicking natural antibody repertoires¹⁶ are able to recognize random protein antigens with high affinity and specificity without affinity maturation in vivo. But what the nature has learned about configuring a functional antibody repertoire is far from being clearly understood. The goal of this work is to elucidate the working principles underlying the functionality of natural antibody repertoires where antibodies with high affinity and specificity against almost any protein immunogens can frequently be elicited within a few weeks in an individual.

Investigations of natural antibody responses are becoming technologically feasible to understand humoral immunity and inform vaccine development. Two technical barriers hamper the understanding of natural antibody repertoires: first, large complexity of the antibody repertoire in an individual host (10¹¹ in humans and 10⁹ in mouse) has hindered the exhaustive enumeration of the antibody members in the repertoires; second, experimental systems to test hypotheses on antibody repertoires are limited by the capability to synthesize a large number of recombinant antibodies. These barriers are increasingly being overcome by recent technical advancement in next generation sequencing (NGS) and synthetic antibody libraries. The results have enhanced the understanding towards the natural antibody responses protecting the extracellular spaces in animals from pathogen invasion and in informing vaccine development to elicit antibody responses on targeted antigens to confer humoral immune protection against the specific infectious pathogens in advance¹⁷.

Experimental validations of the hypotheses derived from high throughput sequencing data on antibody repertoires remain challenging. Recent NGS technologies have enabled high throughput parallel sequencing capacity of determining on the order of 10⁷ antibody variable domain sequences on a single microfluidic chip¹⁸. The NGS-driven discoveries on BCR repertoires are generating a large volume of sequence data, forming a basis for global and unbiased view of the composition and evolution of antibody repertoires as well as the antibody responses towards infectious diseases and vaccination^17,18,19,20. High throughput DNA sequencing of immunized mouse antibody repertories has led to discovery of monoclonal antibodies with high specificity and affinity without conventional screening processes²¹. While unprecedented insights into the evolution and functionality of antibody repertoires have been attained from computational inference of the vast antibody sequence database, experimental examination of the bioinformatics insights remains challenging because of the technical difficulties in reproducing the natural antibody repertoires in vitro²².

Synthetic antibody libraries can be used as hypothesis-driven tools to investigate the determinants of functional antibody repertoires. One approach is to construct phage-displayed synthetic antibody libraries to mimic natural antibody repertoires^{16,23,24,25,26,27,28,29,30}. It has been argued that functional antibodies recognize protein surfaces mainly through aromatic side chains on the complementarity determining regions (CDRs)⁵. The binding of the paratope aromatic side chains to epitope backbone atoms and side chain carbons, which are ubiquitous on protein surfaces, contributes the major portion of the antibody-protein binding energy. The implication is that a synthetic antibody library bearing paratopes with diverse structural contours enriched with aromatic residues among short chain hydrophilic residues can recognize all sorts of proteins through binding to the common physicochemical features on the protein surfaces⁵. Insights from a natural antibody repertoire can be encoded into the structural diversities of CDRs and the distributions of amino acid types in the key CDR residue positions, leading to the artificial antibody library bearing the characteristics of the natural antibody repertoire. Once the antibody library is synthesized and displayed on phage particles^31,32,33, the characteristics of the natural antibody repertoires can be queried with the phage display system.

In this work, the antigen recognition capabilities of the mouse antibody repertoires were validated by testing the functionality of a corresponding synthetic phage-displayed antibody library encoded with the CDR structural and sequence characteristics resembling those of the antibody repertoires from naïve and immunized mice. NGS was applied to sample the antibody repertoire sequences obtained from the naïve and immunized mice with extracellular domain (ECD) of human epidermal growth factor receptor 2 (HER2) as a model antigen. Computational analysis of the NGS data indicated that the antibody repertoires from naïve mice and mice immunized with various immunization protocols are largely skewed toward a CDR canonical structure (CS) combination. A synthetic antibody library was constructed based on the structural and sequence characteristics of the mouse antibody repertoires and tested for HER2-specific antibody discovery. The synthetic antibody library was able to generate antibodies with equally high affinity and specificity as the affinity-matured antibodies from the mouse antibody repertoires. Moreover, the synthetic antibody library was also able to generate binders to all sorts of protein surface in addition to that of HER2/ECD. These results suggested that the predominant CDR canonical structure combination in mouse antibody repertoires is a robust framework capable of encoding highly functional antibodies against diverse protein antigens. The results also demonstrate the utilities of phage-displayed synthetic antibody libraries in testing insights into natural antibody repertoires and in developing artificial antibody repertoires to discover highly functional antibodies targeting at various antigens on diverse epitopes.

Results

Naïve and immunized mouse antibody variable domain structural repertoires are predominantly biased to a single combination of CDR canonical structures

The antibody gene segment usages were assessed with high throughput sequencing of the phage-displayed antibody repertoires from the splenocytes of mice with different immunization history. One naïve mouse (m0) and 3 mice immunized with different HER2/ECD immunization protocols (m3, m4 and m6) (Supplementary Figure S1) were sacrificed to harvest the splenocytes, from which the antibody variable domain cDNA of each of the 4 antibody repertoires was PCR-amplified (Fig. 1a) with well-established primer sets³⁴, which encompass most of the known mouse germline gene sequence families of antibody variable domains (see Methods). Three cDNA libraries were constructed for each mouse: VH domain libraries encompass V_H-D_H-J_H gene segment combinations; VL(κ) and VL(λ) domain libraries encompass Vκ-Jκ and Vλ-Jλ gene segment combinations respectively. These cDNA libraries were inserted into phagemids to construct a scFv (single chain variable fragment) antibody library with respect to each mouse (Fig. 1a). Two experimental tasks were carried out for each of the antibody libraries: (1) the VH and VL sequences of the antibody libraries were respectively sampled with NGS (Supplementary Table S1); (2) phagemids harboring the gene of the antibody repertoires were expressed as phage-displayed scFv antibody libraries^34,35 (Fig. 1a). Each of the phage-displayed antibody libraries contained more than 10⁹ variants.

The NGS data indicated that the gene segment usages for both VH and VL of the 4 antibody repertoires were substantially skewed, with the top 20 out of 101 V_H gene segments⁷ and the top 15 out of 93 Vκ gene segments⁷ account for ~60% of VH and VL(κ) sequences respectively (Supplementary Figure S2a and S2b). Vλ1 gene segment was predominantly used in VL(λ) (Supplementary Figure S2c). Note that the relative ratio of the VL(κ) and VL(λ) was lost during PCR amplification (Fig. 1a), but previous works have established that overall, VL(κ) dominates the light chain variable domains in mouse antibody repertoires (~95%)^7,36. The relative distributions of the gene segment usages for both VH and VL(κ and λ) domains were all similar among the 4 antibody repertoires and were largely insensitive to the immunization protocols (Supplementary Figure S2a~c).

The distributions of the CDR canonical structure (CS) combinations in the antibody repertoires (i.e. the structural repertoires) of the V_H gene segments and Vκ gene segments were biased further towards only one predominant CS combination respectively. The mouse V_H gene segments encompass CDR-H1 and CDR-H2 of the VH domain. The work by Chothia et al.³⁷ indicated that the CDR-H1 has 3 different CS types and the CDR-H2 has 5 different CS types and that almost all V_H gene segments belong to one of only 7 different CS combinations. The NGS data of the mouse antibody repertoires indicated that only one combination (type 1 CS for CDR-H1 and type 2 CS for CDR-H2, or 1–2 combination in short) of the 7 CS combinations was predominantly used (>60%) in all 4 antibody repertoires (Fig. 1b and Supplementary Table S1). Similarly, the mouse Vκ gene segments encompass CDR-L1 and CDR-L2 of the VL domain. The CDR-L1 has 4 different CS types and the CDR-L2 has only one CS type, indicating that almost all Vκ gene segments belong to one of the 4 different CS combinations³⁸. Again, only the 2-1 combination (type 2 CS for CDR-L1 and type 1 CS for CDR-L2) of the 4 CS combinations was predominantly used (>70%) in all 4 antibody repertoires (Fig. 1c and Supplementary Table S1). Although the CSs of the Vλ gene types are diverse³⁹, only one CS combination was observed in the antibody repertoires (Fig. 1d). Differences of the immunization protocol had only minor effect on the distributions of the structural repertoires (Fig. 1b~d and Supplementary Table S1). Although the pairing information of the VH and VL was lost during the scFv library construction, it has been demonstrated that the pairings of VH and VL are random in mature B cells^40,41, which survived the clonal selection. As a first approximation based on the empirical VL-VH pairing, it could be anticipated that the mouse antibody structural repertoires were largely skewed to the antibody main chain fold characterized by the 1-2-2-1 CS combination for CDR H1-H2-L1-L2. The CDR-L3 distributions were predominantly centered at the length of 9 residues (pie charts in Fig. 1c), with the CDR-L3s predominantly belonging to type 1 CS³⁸. CDR-H3s were more diverse in length due to the V_H-D_H and D_H-J_H junction diversity and usage of different D_H gene segments. The distributions range from 3 to 22 residues, with the maximum at 11 residues in the mouse antibody repertoires (pie charts in Fig. 1b). Together, although more than 10⁷ combinations of the genomic gene segments are likely to contribute to the antibody diversity in the repertoires⁷, the mouse antibody structural repertoires were all skewed towards one predominant main chain structure characterize by 1-2-2-1-1 for CDR H1-H2-L1-L2-L3 CS combination with CDR-H3 length distribution centered at 11 residues.

The majority of the antigen-binding antibodies from the mouse antibody repertoires had the predominant CDR canonical structure combination

The functional scFv HER2/ECD-binders isolated from the phage-displayed mouse antibody libraries belonged to the predominant antibody structure (1-2-2-1-1) and had been affinity-matured to sub-nanomolar affinity. A total of 316 positive scFvs (S316 set) were obtained, after 2 ~ 3 rounds of selection/amplification cycles, from the phage-displayed antibody libraries respectively constructed with the splenocytes of the 3 immunized mice (m3, m4 and m6) (Fig. 1a). The V_H gene segment usage in S316 was mainly attributed to two germline gene segments, with type 1–2 CS type for CDR-H1 and CDR-H2 and with CDR-H3 of 11 and 12 residues respectively (Fig. 2a). The V_H gene segments of m3 and m4 scFvs in S316 came from germline sequence IGHV14-1 and IGHV1-47 respectively; the m6 V_H gene segments in S316 were more diverse (Fig. 2a). Six Vκ gene segments predominated the usage of light chain gene segments in S316 (Fig. 2b), where CDR-L1 and CDR-L2 have CS type of 2-1 and almost all of the CDR-L3s belonged to type 1 CS (Fig. 2b). Six representative recombinant IgGs (M32, M41, M61, M62, M63, M64 selected from different mice and with different gene segments, see Fig. 2a,b) were expressed and purified and the K_D’s binding to HER2/ECD were in the sub-nanomolar range (measured with BIAcore, Supplementary Table S4), indicating that these IgGs were affinity-matured to the affinity ceiling (~10⁻¹⁰ M) in vivo⁹. All the 6 representative antibodies have the predominant 1-2-2-1-1 CS combination (CDR sequences shown in Supplementary Table S5).

The S316 antibodies were elicited from clonal expansion due to immunization and only the heavy chain variable domain sequences, especially those of CDR-H3s, were preferentially selected during the clonal expansion. As expected, the IGHV14-1 CDR sequence profiles of m3 derived from NGS were identical to the S316 antibody CDR sequence profiles from m3 (Fig. 2a and Supplementary Figure S3a), indicating that the IGHV14-1 antibodies in m3 were expanded clones in responding to the immunization. Similarly, the IGHV1-47 CDR sequence profiles of m4 and m6 derived from NGS resemble to those of the S316 antibodies from m4 and m6 (Supplementary Figure S3b), suggesting that IGHV1-47 germline gene was predominantly selected to respond to the antigen in m4 and m6. Both families of IGHV14-1 and IGHV1-47-related clones were not found in NGS antibody sequences from naïve mouse m0 (Supplementary Figure S3); the differences were particularly evident in the CDR-H3 region, suggesting that the clonal expansion of the IGHV14-1 and IGHV1-47 related antibodies was induced by the immunization. By contrast, comparisons of the light chain variable CDR sequence profiles derived from the NGS of the control and immunized mice with those of S316 antibodies revealed no substantial difference, indicating that light chain variable domains were not particularly selective in responding to immunization-induced clonal expansion. Together, the common theme emerging from the antibodies elicited in the immunized mice was that the expanded clones mostly had the same CS combination – 1-2-2-1-1 for CDR H1-H2-L1-L2-L3, although the germline gene usage could be different and the path of affinity maturation could be diverse in the antibody responses.

An artificial antibody library mimicking the predominant structural subgroup of the mouse antibody repertoires was constructed with three key principles

The configurations of the mouse antibody structural repertoires were highly skewed and did not change by different immunization history and the affinity-matured antibodies, which were elicited within weeks after immunization, belonged to the predominant antibody CDR structure types of the antibody repertoires (Figures 1~2). The implication was two-fold: the predominant antibody CDR structures could be shaped by natural evolution so as to be capable of responding to most antigens with binders of high affinity and specificity; diverse antigens might be recognizable by the predominant antibody CDR structures in the antibody repertoire. To test these hypotheses, we constructed an artificial scFv antibody library to reflect the most predominant VL-VH main chain fold in the mouse antibody repertoires. If the hypotheses were true and the artificial antibody library were reasonably constructed to encode the essence of the predominant characteristics of the natural antibody repertoires, the artificial antibody library should encompass high affinity antibodies for HER2/ECD as well as for many other random protein antigens.

Three key principles are known to be rudimentary in designing the artificial antibody library: first, conservation of residues types at key CDR positions underlying the CSs; second, distribution of sufficient number of aromatic residues over the CDR positions to ensure adequate binding energetics in antigen recognition; third, incorporation of short-chain hydrophilic residues to enhance antibody-antigen interaction specificity.

Conformation-determining CDR positions are conserved in amino acid type. The scFv template for the artificial antibody library has an invariable framework based on the human variable domain sequence combination of V_H3-23-J_H4 for the VH domain and Vκ1-Jκ1 for the VL domain^31,32. This artificial antibody library is dubbed GH2 (generic human, version 2) antibody library. The GH2 scFv template (Av1, Supplementary Figure S4, as used in previous works^31,32) has the 1-2-2-1-1 CS combination for CDR-H1, H2, L1, L2, L3 and the length of CDR-H3 is fixed at 11 residues. Hence, the GH2 antibody library strictly mimicked the most predominant subgroup in the mouse antibody repertoires (Fig. 1b,c). In addition, we have used the Av1 scFv as a template to map the V_L-V_H interdomain interface residues and concluded that the interface residues are directly coupled with antigen binding and should be treated as an integral part of the antigen-binding site³¹. As such, the previously identified interface residues remained as invariable residues. As the first key principle, amino acid types in the key CDR positions⁴² critical to the conformations of the CSs or critical to the V_L-V_H interdomain interface were fixed as in the template Av1; only the CDR residue positions shown in Supplementary Table S2 were diversified.

CDR positions for binding are populated with aromatic residues. 30 CDR residue positions (Supplementary Table S2) are not critical for CS conservation or to the interdomain interface and are known to be responsible for antigen binding to various extent⁴³. Residue type distributions (Supplementary Table S2) for these residue positions are diversified under the constraint to allow for enough antigen-binding hot spot residues distributed on the CDR surface⁵. A previous work has established the correlation between the predicted hot spot residue distributions and the distributions of the aromatic residues in the CDR structures⁵. As such, the second key principle for the GH2 library design is to mimic the distribution of the aromatic residues in similar CDR structures in natural antibody repertoires: the GH2 antibody library was encoded with the aromatic residue distributions (Fig. 3a,b, right hand side histograms) resembling the aromatic residue distributions observed in the mouse antibody repertoires derived with NGS as well as in identical CSs from known antibody structures (dataset S584) (Fig. 3a,b, left hand side histograms); the aromatic residue distributions were similar both in residue positions and in distribution amplitudes (Fig. 3a,b).

Short-chain hydrophilic CDR residues are critical for specific antigen binding. It has been demonstrated that short-chain hydrophilic residues distributed among aromatic residues in CDRs mediate antigen recognition specificity through short range electrostatic interaction and direct hydrogen bonding across antibody-antigen interfaces^5,33. The third key principle for the GH2 library design is to incorporate short chain hydrophilic residues among the aromatic residues in the 30 CDR residue positions that are responsible for antigen binding (Supplementary Table S2).

It is important to note that the GH2 antibody library design is not to imitate any natural naïve antibody repertoire. Rather, the GH2 library is designed to circumvent the limitations of natural naïve phage-displayed antibody libraries. As demonstrated in Supplementary Figure S2, antibody population in the natural naïve antibody repertoire of mouse m0 was strongly biased by the predominant variable domain germline gene usage, leading to major clusters of antibody sequences originated from a few predominant germline genes. Consequently, the scope of the nature antibody sequence space is relatively limited in comparison to that of the GH2 antibody library. Moreover, although specific antibodies against a random antigen with strong affinity could exist in the natural naïve antibody repertoire, these naïve antibodies were not expected to be discovered with phage display because these clones were mostly not reproduced in phage-displayed antibody libraries (Fig. 1a) due to the random VL-VH pairing nature of the phage-displayed antibody libraries (see Methods). As there is no guarantee that antibodies for a random antigen would be discovered from a naïve natural phage-displayed antibody library due to the two aforementioned reasons, the goal for the GH2 library design was to test the feasibility of the predominant CS configuration in recognizing the model antigen HER2/ECD and other random antigens given that the amino acid distributions are not limited by germline gene usage and the antibody discovery is carried out after the VL-VH pairing is settled in the library construction.

Specificities of antibodies from the artificial antibody library replicated the specificities of natural antibody responses

GH2 scFv library was constructed to the complexity on the order of 10⁹ well-folded scFv variants, close to the complexity of the expressed BCR repertoire in a mouse⁶. The construction of the GH2 library is depicted in Fig. 4 and the experimental details are described in Methods. 30 residue positions distributed over the 6 CDRs of the template scFv were diversified with a total of 48 DNA segments (Supplementary Table S3), each of which contains degenerate codons to encode the amino acid type distributions (Supplementary Table S2) for the GH2 antibody library.

Antibodies from the synthetic GH2 antibody library bound to the same epitope areas on HER2/ECD as the antibodies from natural mouse antibody repertoires. Competition of 90 mouse scFv randomly selected from S316 (labeled by black dots in the phylogenetic trees of Fig. 2a,b) with the six representative mouse anti-HER2/ECD IgGs (M32, M41, M61, M62, M63, M64; Fig. 2, Supplementary Table S4 and S5) in binding to HER2/ECD confirmed that the six mouse IgGs were comprehensive in representing the three epitope groups of the S316 scFvs: M32-M62, M41-M61 and M63-M64 (Fig. 5a). By comparison, 90 non-redundant GH2 scFvs (dataset S90) that bound specifically to HER2/ECD were randomly selected and screened after two selection/amplification cycles (Fig. 4). Competitions of the scFvs in S90 with the six mouse anti-HER2/ECD IgGs in binding to HER2/ECD showed that the S90 scFvs bound to the antigen HER2/ECD also on the same set of epitope groups (Fig. 5b), with three S90 scFvs binding to epitopes outside the three major mouse epitope groups.

The upper-end affinity of the synthetic antibodies was comparable to that of the affinity-matured natural antibodies

IgG antibodies derived from the GH2 scFvs had binding affinity comparable to that of the affinity matured mouse IgGs. 30 scFvs randomly selected from S90 were expressed in IgG1 format (S30 set; see Supplementary Table S4 and S5). Only one IgG failed to express in large quantity; two IgGs (GH2-78, GH2-87) were found to have a glycosylation site in the light chain CDR-L1 (Supplementary Figure S5). The EC₅₀’s of the S30 IgGs were compared with those of the six mouse IgGs (Fig. 5c). The lower limit of EC₅₀ of the S30 IgGs were comparable with that of the affinity matured mouse IgGs; 12 of the S30 IgGs have EC₅₀ lower than that of trastuzumab (Fig. 5c, Supplementary Table S4). BIAcore measurements of the K_D’s for the S30 IgGs binding to immobilized HER2/ECD indicated that the lower limit of the K_D approached sub-nanomolar, similar to that of the affinity-matured mouse IgGs (Supplementary Table S4). Together, the S30 IgGs derived from the GH2 scFv binders were as robust as the affinity-matured mouse IgGs in both folding and binding.

The majority of the epitopes of the antigen-binding antibodies from the artificial antibody library are novel

Only one of the three major groups of epitopes of the S90 scFvs overlapped with those of known anti-HER2/ECD antibodies. Competitions on HER2/ECD binding for the S90 scFvs against 4 positive control antibodies (Fig. 6a), where the x-ray structures of the antibody-HER/ECD complexes are known (A21⁴⁴, Fab37⁴⁵, pertuzumab¹ and trastuzumab²; see Fig. 6b), indicated that the prevalent M32-M62 epitope group was situated on the domain I of HER2/ECD and was near to – but not overlapped with – the epitope of A21, as demonstrated in the epitope mapping with HDX-MS (site E1 in Fig. 6b,c). A portion of epitopes in this group overlapped with that of A21 (Fig. 6a), suggesting that the intra-group epitopes are heterogeneous (sites E1 and E2 in Fig. 6b,c). The M63-M64 epitope group overlapped with the epitope of Fab37 (Fig. 6a), which binds to the domain III of HER2/ECD (Fig. 6b,c). The M41-M61 epitope group did not overlap with any known epitopes (Fig. 6a). Binding of the M41 and M61 antibodies to the recombinant fragment encompassing domain III-IV of HER2/ECD suggested that the M41-M61 epitope group was situated on the domain III-IV but overlapped neither with the epitope of Fab37 on domain III nor with the epitope of trastuzumab on domain IV (Fig. 6a). Epitope mapping with HDX-MS indicated that the M41-M61 epitope group was situated on domain IV (site E3 in Fig. 6b,c), on a surface patch distal from the epitope of trastuzumab (Fig. 6b,c). The sequences of the S90 scFvs were diverse and not correlated with the epitope groups (Fig. 6d), suggesting that completely different CDR sequences could bind to the same epitope area. Together, only one (pertuzumab) of the four previously known epitopes was not covered by the epitopes of the S90 scFvs and only the M63-M64 epitope group of the S90 scFvs overlapped with previously known epitopes (Fab37).

Synthetic antibodies binding to different epitopes of HER2/ECD on HER2-overexpressed cells result in different biological consequences

Antibodies binding to different epitopes on HER2/ECD had different biological functions. Unlike trastuzumab and pertuzumab, M32 bound to a novel epitope on domain I of HER2/ECD and caused internalization of HER2 and depletion of the receptor on HER2 overexpressed cell (SKBR3) surface (Fig. 7a,b). M62 shared similar epitope with M32 and had similar effect as M32 on cell surface HER2 depletion (Fig. 7a,b). M63 and M41 bound to cell surface HER2 on different epitopes (domain III and IV respectively) and did not cause HER2 depletion (Fig. 7a,b). IgGs GH2-42 and GH2-75 bound to cell surface HER2 on epitopes similar to that of M32 as determined by HDX-MS epitope mapping and epitope-binding competition (Figs 5b, 6a~c) with similar affinity (Supplementary Table S4) but did not cause HER2 depletion (Fig. 7a,b). However, combination of GH2-42 with trastuzumab or GH2-18, which shared similar epitope with trastuzumab (Fig. 6a), resulted in HER2 depletion (Fig. 7a,b). Combination of GH2-75 with GH2-18 also caused HER2 internalization (Fig. 7a,b). Binding of the antibodies to different epitopes on HER2/ECD also inhibited down-stream signaling involving HER2 to various extent (Fig. 7b). Together, IgGs binding to domain I of HER2/ECD might or might not deplete the surface receptor; combination of domain I and domain IV IgG binders always depleted HER2, but combination of domain II and domain IV binders had no effect on HER2 depletion. The depletion of cell surface HER2 resulted in reduction of downstream signal transduction (Fig. 7b), which was particularly evident in HER2 overexpressed cells treated with combination of domain I and domain IV IgG binders. The results demonstrated that antibodies binding to different epitopes of HER2/ECD affected differently the distribution and signal transduction of the receptor.

Antibodies binding to other random protein antigens can also be found from the artificial antibody library

The GH2 antibody library has been applied to discover antibodies against 14 other random protein antigens. Antibodies binding to 12 out of the total 14 proteins have been discovered (Supplementary Table S6) with the same selection and screening procedure and criteria (see Methods). The results indicated that antibodies with high affinity and specificity for these proteins can be found in the GH2 library without the need to further affinity maturation.

Discussion

The notion that highly specific affinity-matured antibodies against infinitely diverse immunogens can be elicited from natural antibody repertoires with limited complexity can be understood by the structural configuration of the antibody repertoires. In this work, immunogen (HER2/ECD)-specific affinity-matured antibodies were elicited, a few weeks after immunizations, from the immunized mice, for which the distributions of antibody structural configurations in the antibody repertoires were revealed by NGS data. These data suggested that the repertoires were all biased toward the same predominant variable domain main chain structure regardless of the immunogen exposure history. The variable domain structures of the affinity-matured antibodies were consistent with the predominant structure in the antibody repertoires, although the germline lineages of the matured antibody variable domains were diverse. By comparison, the GH2 artificial antibody repertoire was constructed to encode minimal but essential characteristics key to the competence of the natural antibody repertoires: all GH2 antibody variants were derived from a single variable domain template with all six CDR structures resembling the most prevalent CDR structures respectively as revealed by the NGS data of the mouse antibody repertoires; the amino acid type diversity and preference at each residue position of the 6 CDRs in the artificial repertoire were encoded with the rudimentary principles in (a) conserving amino acid types at structurally critical residue positions, (b) mimicking the aromatic amino acid distributions in position and magnitude over the CDRs in natural antibodies and (c) distributing short chain hydrophilic residues among the aromatic residues. Synthetic antibodies derived from phage display selection and screening of the GH2 antibody library bound specifically to the HER2/ECD with affinities approaching to those of the affinity-matured antibodies elicited from HER2/ECD-immunized mice, indicating that the combination of CS configuration and proper CDR amino acid distribution are sufficient conditions to reach affinity-matured antibodies. The epitopes of the mouse antibodies were fully reproduced by the antibodies from the GH2 library, suggesting that the synthetic antibody library had been constructed to capture the essence of the predominant characteristics of the mouse antibody repertoires. Moreover, antibodies with high specificity and affinity against diverse random protein antigens were also discovered from the GH2 antibody library with the same phage display selection and screening process in vitro, indicating that antibodies with high affinity and specificity from the natural antibody repertoires biased to the predominant antibody CDR structures were readily available to counter diverse protein immunogens. It seems that natural antibody repertoires are shaped through evolution to converge to only few predominant antibody structural classes that can be readily encoded, within the complexity limit, with many functional antibodies capable of binding to varieties of protein antigens with high specificity and affinity without the need to extensively relying on SHM to explore the antibody sequence space. The rudimentary principles encoded in the GH2 antibody library suggest that the configuration of the variable domain structures and the associated diversity of the CDR sequences underlie the functionality of the natural antibody repertoires. Phage-displayed synthetic antibody libraries can be further used to decipher the natural antibody responses and to develop novel antibodies against diverse antigens.

Methods

Construction of phage-displayed mouse antibody libraries

Splenocyte harvesting

Splenocyte harvests from immunized female BalbC/j mice bred and kept under approved SPF conditions were carried out in accordance with the approved guidelines; all experimental protocols had been approved by the Institutional Animal Care and Use Committee of Academia Sinica protocol ID: IACUC_13-03-545. scFv antibody libraries were constructed by overlap extension PCR with an 18-amino acid linker (SSGGGGSGGGGGGSSRSS). The experimental procedures were adapted and modified from Cold Spring Harbor Laboratory Manual for Phage Display³⁴. In brief, immunized mouse was sacrificed and the spleen was harvested into 2mL of TRI reagent (Invitrogen). Immediately, the sample was homogenized and dispensed into 1.5 mL microtubes (0.5 mL/tube) to be stored at −80 °C. RNA extraction from thawed sample using QIAGEN RNeasy Plus Mini Kit was carried out to obtain 60–80 μg of total RNA from 1/4 spleen.

cDNA library construction

Reverse transcription (RT) with the RNA extract was performed with SuperScript III First-Strand Synthesis System (Invitrogen) following the manufacturer’s protocol. The reaction was carried out as follows: 10 μg of total RNA, 1 μL of 10 μM primer Oligo(dT)₂₀ and 1 μL of 10 mM dNTP mix were added to each 0.2 mL tube and the total volume was adjusted to 10 μL with DEPC H₂O (0.1% diethylpyrocarbonate-treated H₂O). The mixture was incubated at 65 °C for 5 min and immediately chilled on ice. 10 μL of cDNA synthesis mix was added to each tube as follows: 2 μL of 10X RT buffer, 4 μL of 20 mM MgCl₂, 2 μL of 0.1 M dithiothreitol (DTT), 1 μL of RNaseOut (40 U/μL) and 1 μL of SuperScript III RT (200 U/μL). The mixture was incubated at 50 °C for 50 min to allow the synthesis of first strand of cDNA. The reactions was terminated by incubating at 85 °C for 5 min and then kept the tubes at 4 °C. 1 μL of RNase H was added to the sample and incubated for 20 min at 37 °C to remove residual RNA. After quantitating the concentration at OD₂₆₀, the samples were stored at 20 °C until used for PCR.

Phage-displayed scFv library construction

In the first round of PCR, two variable domains of light chain Vκ and Vλ and one of heavy chain VH were amplified separately from cDNA using the primer mixes according to the protocol³⁴, where the primer set contains 19 sense primers paired with 3 reverse primers for VH, 17 sense primers paired with 3 reverse primers for Vκ and 1 sense primer paired with 1 reverse primer for Vλ. PCR reactions were carried out in a volume of 50 μL with MyTaq Hot Start polymerase (Bioline), 0.5 μg cDNA template and 0.3 μM of each primer mix for 25 cycles (30 sec 95 °C, 30 sec 65 °C, 1 min 72 °C) followed by a 10 min final synthesis step. The PCR products were checked and then purified by agarose gel electrophoresis. In the second round of PCR, two variable domains of light chain Vκ and Vλ were assembled separately with heavy chain VH and the overlapping primers: 100 ng of the recovered Vκ or Vλ and VH PCR fragments from the first PCR were added to total volume of 50 μL containing MyTaq Hot Start polymerase (Bioline) and 0.3 μM of each primer for 30 PCR cycles (30 sec 95 °C, 30 sec 65 °C, 1 min 30 sec 72 °C) followed by a 10 min final synthesis step. The assembled Vκ-VH or Vλ-VH fragments were doubly digested with SfiI and NotI (New England BioLabs) and cloned into pCANTAB5E phagemid vector. 10 ~ 15 μg ligation product was electroporated into Escherichia coli ER2738 at 3000 V with an electroporator. The phage-displayed scFv library should reach complexity above 10⁹ CFU.

Synthetic antibody library construction

scFv template preparation

The framework sequence of GH2 scFv library was derived from G6 anti-VEGF Fab (Protein Bank Code 2FJG) and cloned into pCANTAB5E (GE Healthcare) phagemid via SfiI and NotI restriction sites (dubbed Av1, Supplementary Figure S4). TAA stop codons were introduced in CDRs to ensure that only the phagemids carrying the mutagenic oligonucleotides would produce pIII fusion scFv on phage surface (Supplementary Figure S4).

Primer design and heavy chain/light chain variable domain library construction

Two separate phage display libraries of the GH2 light and heavy chain respectively were constructed based on the oligonucleotide-directed mutagenesis procedure⁴⁶. Positions were mutagenized using synthesized oligonucleotides with the following degenerate codons to produce equal molar ratio of designed amino acids: Trp/Gly ([T/G]GG), Phe/Ser/Tyr (T[T/C/A][C/T], Gly/Asp/Ser/Gln ([G/A][G/A][C/T]), Gly/Ala/Ser/Thr/Arg/Pro ([G/A/C][G/C][T/C]), Ala/Thr/Pro/Ser ([A/G/T/C]C[A/G/T/C]), Phe/Tyr/Asp/Val/Asn/Ile/His/Leu ([A/G/T/C][A/T][T/C]) and Leu/Ile/Val/Phe/Met ([A/G/T/C]T[A/G/T/C]). For the light chain repertoire, CDR-L1, -L2 and -L3 were diversified with 21 mutagenic oligonucleotides (Supplementary Table S3) on the basis of the template V3a-LC TAA (Supplementary Figure S4). For heavy chain repertoire, CDR-H1, -H2 and -H3 were diversified with 27 mutagenic oligonucleotides (Supplementary Table S3) on the basis of the template V3c-HC TAA (Supplementary Figure S4). In brief, mutagenic oligonucleotides for each CDR were mixed and phosphorylated by T4 polynucleotide kinase (New England BioLabs) in 70 mM Tris–HCl (pH 7.6), 10 mM MgCl₂, 1 mM ATP and 5 mM dithiothreitol (DTT) at 37 °C for 1 h. The phosphorylated oligonucleotides were then annealed to uracilated single-stranded DNA template, at a molar ratio of 3:1 (oligonucleotide:ssDNA), by heating the mixture at 90 °C for 2 min, followed by a temperature decrease of 1 °C/min to 20 °C in a thermal cycler. Subsequently, the template-primer annealing mixture was incubated in 0.32 mM ATP, 0.8 mM dNTPs, 5 mM DTT, 600 units of T4 DNA ligase and 75 units of T7 DNA polymerase (New England BioLabs) to prime in vitro DNA synthesis. After overnight incubation at 20 °C, the synthesized dsDNA was desalted and concentrated by a centrifugal filter (Amicon^® Ultra 0.5 mL 30 K device), then electroporated into Escherichia coli ER2738 at 3000 V with an electroporator. Typically, 1 μg of dU-ssDNA produced about 10⁷–10⁸ recombinant phage variants and 75–90% of the phage variants carried mutagenic oligonucleotides at three CDR regions simultaneously.

Protein A/L selection of functional scFv variants

The rescued phage libraries of light- and heavy-chain were precipitated with 20% PEG/NaCl and resuspended in phosphate-buffered saline (PBS) for the following protein A/L selection process. First, NUNC 96-well Maxisorb immunoplates were coated overnight at 4 °C with Protein A (for selection of heavy chain-diversified libraries) or Protein L (for selection of light chain-diversified libraries) (1 μg/100 μL PBS per well) and blocked with 5% skim milk in PBST for 1 h. After blocking, 100 μL of resuspended phage library (10¹³ cfu/mL) was added to each well for 1 h under gentle shaking. The plate was washed 12 times with 200 μL PBST [0.05% (v/v) Tween 20] and 2 times with 200 μL PBS. The bound phages were eluted with 100 μL of 0.1 M HCl/glycine (pH 2.2) per well, followed by neutralization with 8 μL of 2 M Tris-base buffer (pH 9.1). The eluted phages were mixed with 1 mL of E. coli strand ER2738 (A_{600 nm} = 0.6) for 15 min at 37 °C. Infected E. coli was titered and amplified with 50 mL of 2 X YT containing 100 μg/mL ampicillin at 37 °C overnight. After centrifugation, the bacterial pellet was resuspended and its phagemid DNA was extracted.

Combination of functional scFv variants into the generic human (GH) antibody libraries

GH2 library was assembled in scFv format as previously described with some modification³⁵. In the first PCR, two variable domains VL and VH were amplified separately from light- and heavy-chain library after selection for binding to Protein A/L by using the primers V_Lfor (5’-GGGCCCAGCCGGCCATGGCCGATATTCAAATGACCCAGAGCCCGAGC-3’) with V_Lrev (5’-GGAAGATCTAGAGGAACCACCGCGTTTGATTTCCACTTTGGTGCCTTGACC-3’) and V_Hfor (5’- GGTGGTTCCTCTAGATCTTCCTCCTCTGGTGGCGGTGGCTCGGGCGGTGGTGGGGAAGTGCAGCTGGTGGAATCGGG -3’) with V_Hrev (5’- CCTGCCTGCGGCCGCTGACGCCGAGC -3’), respectively (linker sequence is underlined). PCR reactions were performed in a volume of 50 μL using KOD Hot Start polymerase (Novagen), 100 ng DNA template and 0.3 μM of each primer for 25 cycles (30 sec 95 °C, 30 sec 65 °C, 1 min 72 °C) followed a 10 min final synthesis step. The PCR products were digested with EcoRI and then purified by agarose gel electrophoresis. In the second PCR, two variable domains were assembled using the overlapping primers (SfiI and NotI restriction sites are underlined): Overlapfor (5’- GAGGAGGAGGAGGAGGAGGCGGGGCCCAGCCGGCCATGGCCGATATTC -3’) with Overlaprev (5’- GAGGAGGAGGAGGAGGAGCCTGCCTGCGGCCGCTGACGCC -3’). 100 ng of the purified VL and VH PCR products of the first PCR were used in a a volume of 50 μL using MyTaq Hot Start polymerase (Bioline) and 0.3 μM of each primer for 30 cycles (30 sec 95 °C, 30 sections 65 °C, 1 min 30 sec 72 °C) followed by a 10 min final synthesis step. The assembled VL-VH fragments were doubly digested with SfiI and NotI (New England BioLabs) and cloned into pCANTAB5E phagemid vector. The resulting ligation product was electroporated into Escherichia coli ER2738 at 3000 V with an electroporator.

Selection-amplification cycles to identify antigen-specific scFv binders

Selection-amplification cycles

Antigen (2 ~ 5 μg per well) was coated in PBS buffer (pH 7.4) in NUNC 96-well Maxisorb immunoplates overnight at 4 °C and then blocked with 5% skim milk in PBST for 1 h. After blocking, 100 μL of resuspended polyethylene glycol/NaCl-precipitated phage library (10¹³ cfu/mL in blocking buffer) was added to each well for 1 h under gentle shaking. The plate was washed 12 times with 200 μL PBST [0.05% (v/v) Tween 20] and 2 times with 200 μL PBS. The bound phages were eluted with 100 μL of 0.1 M HCl/glycine (pH 2.2) per well, immediately neutralized with 8 μL of 2 M Tris-base buffer (pH 9.1). The eluted phages were mixed with 1 mL of E. coli ER2738 (A_{600 nm} = 0.6) for 30 min at 37 °C; uninfected bacteria were eliminated by adding ampicillin. After ampicillin treatment for 30 minutes, the bacterial culture was infected with 100 μL M13KO7 helper phage (~10¹¹ CFU total) at 37 °C for 1 h and then added to 50 mL of 2X YT medium containing kanamycin 50 μg/mL and ampicillin 100 μg/mL overnight at 37 °C with vigorous shaking. The rescued phage library was precipitated with 20% polyethylene glycol/NaCl and resuspended in PBS. The concentrated phage solution was used for the next round of panning.

Picking antigen-specific scFv binders

After 2–3 rounds of selection-amplification cycle, single colonies were randomly selected into deep 96 well culture plate (plate A; phage form); each well contained 950 μL 2YT (100 μg/mL ampicillin). After 3 h incubation at 37 °C with shaking, 100 μL of bacterial culture was transferred to the corresponding well of a fresh deep 96-well plate (plate B; secreted scFv); each well contained 0.8 mL 2YT with 100 μg/mL ampicillin. In the meantime, 50 μL M13KO7 (~5 × 10¹⁰ CFU total) was added to each well of plate A. After 1 h incubation, 100 μL 2YT containing kanamycin (500 μg/mL) was added to each well of plate A; 100 μL 2YT containing IPTG (10 mM) was added to each well of plate B. After overnight incubation at 37 °C with vigorous shaking, the cultures were centrifuged at 3000 g for 10 min at 4 °C. The plate A was stored for further use. For secreted scFv culture plate (plate B), 50 μL culture medium and 50 μL 5% PBST milk was added to a corresponding well of three 96-well Maxisorb immunoplates pre-coated with protein L (0.1 μg/well), antigen (0.25~1 μg/well) and Maltose-Binding Protein (MBP)(1 μg/well), respectively and blocked with 5% PBST milk. After 1 h incubation at room temperature, the plates were washed six times with PBST. 100 μL Protein A-HRP (1:3000, Thermo Scientific) was added to each well of Protein L-coated immunoplate; 100 μL anti E-tag-HRP(1:3000, Abcam^®) was added to each well of antigen-coated and MBP-coated immunoplates. After 1 h incubation, the plates were washed six times with PBST buffer and twice with PBS, developed for 3 min with 3,3’,5,5’-tetramethyl-benzidine peroxidase substrate (Kirkegaard & Perry Laboratories), quenched with 1.0 M HCl and read spectrophotometrically at 450 nm. Positive clones were selected by the following criteria: ELISA OD₄₅₀ > 0.2 for the antigen-coated well (antigen binding positive); OD₄₅₀ < 0.05 in MBP-coated well (non-specific binding negative); OD₄₅₀ > 0.5 for the Protein L-coated well (soluble scFv binding to both Protein L and Protein A to ensure proper folding in solution). Empirically, the IgG converted from a positive clone of scFv selected with the above criteria had K_D < 10⁻⁷ M. Unique clones were determined by sequencing the scFv DNA harbored in the phagemid.

Other experimental procedures in supplemental information

The details of mouse immunization, NGS of phage-displayed antibody libraries, computational analysis of the NGS data, competition of antibody-HER2/ECD interaction, antibody-antigen interaction affinity and kinetics measurements by surface plasmon resonance, epitope mapping with hydrogen-deuterium exchange measured with LC-tandem mass spectroscopy (HDX-MS), EC₅₀ for antibody-antigen interactions, transient expression of IgG with HEK293-F cells, cell line and reagents, immunofluorescence microscopy and Western blotting are described in Supplemental Experimental Procedures.

Additional Information

How to cite this article: Chen, H.-S. et al. Predominant structural configuration of natural antibody repertoires enables potent antibody responses against protein antigens. Sci. Rep. 5, 12411; doi: 10.1038/srep12411 (2015).

References

Franklin, M. C. et al. Insights into ErbB signaling from the structure of the ErbB2-pertuzumab complex. Cancer Cell 5, 317–328 (2004).
CAS PubMed Google Scholar
Cho, H. S. et al. Structure of the extracellular region of HER2 alone and in complex with the Herceptin Fab. Nature 421, 756–760 (2003).
ADS CAS PubMed Google Scholar
Berzofsky, J. A. Intrinsic and extrinsic factors in protein antigenic structure. Science 229, 932–940 (1985).
ADS CAS PubMed Google Scholar
Benjamin, D. C. et al. The antigenic structure of proteins: a reappraisal. Annu Rev Immunol 2, 67–101 (1984).
CAS PubMed Google Scholar
Peng, H. P., Lee, K. H., Jian, J. W. & Yang, A. S. Origins of specificity and affinity in antibody-protein interactions. Proc Natl Acad Sci USA 111, E2656–2665 (2014).
CAS PubMed PubMed Central Google Scholar
Weinstein, J. A., Jiang, N., White, R. A., 3rd, Fisher, D. S. & Quake, S. R. High-throughput sequencing of the zebrafish antibody repertoire. Science 324, 807–810 (2009).
ADS CAS PubMed PubMed Central Google Scholar
Schroeder, H. W., Jr. Similarity and divergence in the development and expression of the mouse and human antibody repertoires. Dev Comp Immunol 30, 119–135 (2006).
CAS PubMed Google Scholar
Di Noia, J. M. & Neuberger, M. S. Molecular mechanisms of antibody somatic hypermutation. Annu Rev Biochem 76, 1–22 (2007).
CAS PubMed Google Scholar
Eisen, H. N. Affinity enhancement of antibodies: how low-affinity antibodies produced early in immune responses are followed by high-affinity antibodies later and in memory B-cell responses. Cancer Immunol Res 2, 381–392 (2014).
CAS PubMed Google Scholar
Victora, G. D. & Nussenzweig, M. C. Germinal centers. Annu Rev Immunol 30, 429–457 (2012).
CAS PubMed Google Scholar
Wu, X. et al. Focused evolution of HIV-1 neutralizing antibodies revealed by structures and deep sequencing. Science 333, 1593–1602 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Robert, R. et al. Restricted V gene usage and VH/VL pairing of mouse humoral response against the N-terminal immunodominant epitope of the amyloid beta peptide. Mol Immunol 48, 59–72 (2010).
CAS PubMed PubMed Central Google Scholar
Pappas, L. et al. Rapid development of broadly influenza neutralizing antibodies through redundant mutations. Nature 516, 418–422 (2014).
ADS CAS PubMed Google Scholar
Avnir, Y. et al. Molecular signatures of hemagglutinin stem-directed heterosubtypic human neutralizing antibodies against influenza A viruses. PLoS Pathog 10, e1004103 (2014).
PubMed PubMed Central Google Scholar
Lloyd, C. et al. Modelling the human immune response: performance of a 1011 human antibody repertoire against a broad panel of therapeutically relevant antigens. Protein Eng Des Sel 22, 159–168 (2009).
CAS PubMed Google Scholar
Zhai, W. et al. Synthetic antibodies designed on natural sequence landscapes. J Mol Biol 412, 55–71 (2011).
ADS CAS PubMed Google Scholar
Georgiou, G. et al. The promise and challenge of high-throughput sequencing of the antibody repertoire. Nat Biotechnol 32, 158–168 (2014).
CAS PubMed PubMed Central Google Scholar
Finn, J. A. & Crowe, J. E., Jr. Impact of new sequencing technologies on studies of the human B cell repertoire. Curr Opin Immunol 25, 613–618 (2013).
CAS PubMed Google Scholar
Galson, J. D., Pollard, A. J., Truck, J. & Kelly, D. F. Studying the antibody repertoire after vaccination: practical applications. Trends Immunol 35, 319–331 (2014).
CAS PubMed Google Scholar
Mathonet, P. & Ullman, C. G. The application of next generation sequencing to the understanding of antibody repertoires. Front Immunol 4, 265 (2013).
PubMed PubMed Central Google Scholar
Reddy, S. T. et al. Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells. Nat Biotechnol 28, 965–969 (2010).
CAS PubMed Google Scholar
Laserson, U. et al. High-resolution antibody dynamics of vaccine-induced immune responses. Proc Natl Acad Sci USA 111, 4928–4933 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Adams, J. J., Nelson, B. & Sidhu, S. S. Recombinant genetic libraries and human monoclonal antibodies. Methods Mol Biol 1060, 149–170 (2014).
PubMed Google Scholar
Tiller, T. et al. A fully synthetic human Fab antibody library based on fixed VH/VL framework pairings with favorable biophysical properties. MAbs 5, 445–470 (2013).
PubMed PubMed Central Google Scholar
Mahon, C. M. et al. Comprehensive Interrogation of a Minimalist Synthetic CDR-H3 Library and Its Ability to Generate Antibodies with Therapeutic Potential. J Mol Biol 425, 1712–1730 (2013).
CAS PubMed Google Scholar
Miersch, S. & Sidhu, S. S. Synthetic antibodies: concepts, potential and practical considerations. Methods 57, 486–498 (2012).
CAS PubMed Google Scholar
Larman, H. B., Jing Xu, G., Pavlova, N. N. & Elledge, S. J. Construction of a rationally designed antibody platform for sequencing-assisted selection. Proc Natl Acad Sci USA 109, 18523–18528 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Finlay, W. J. & Almagro, J. C. Natural and man-made V-gene repertoires for antibody discovery. Front Immunol 3, 342 (2012).
PubMed PubMed Central Google Scholar
Prassler, J. et al. HuCAL PLATINUM, a synthetic Fab library optimized for sequence diversity and superior performance in mammalian expression systems. J Mol Biol 413, 261–278 (2011).
CAS PubMed Google Scholar
Ponsel, D., Neugebauer, J., Ladetzki-Baehs, K. & Tissot, K. High affinity, developability and functional size: the holy grail of combinatorial antibody library generation. Molecules 16, 3675–3700 (2011).
CAS PubMed PubMed Central Google Scholar
Hsu, H. J. et al. Antibody variable domain interface and framework sequence requirements for stability and function by high-throughput experiments. Structure 22, 22–34 (2014).
CAS PubMed Google Scholar
Chang, H. J. et al. Loop-sequence features and stability determinants in antibody variable domains by high-throughput experiments. Structure 22, 9–21 (2014).
CAS PubMed Google Scholar
Yu, C. M. et al. Rationalization and design of the complementarity determining region sequences in an antibody-antigen recognition interface. PLoS One 7, e33340 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Barbas, C. F., 3rd, Burton, D. R., Scott, J. K. & Silverman, G. J. in Phage Display A Laboratory Manual. Ch. 9, 9.1-9.113 (Cold Spring Harbor Press, Cold Spring Harbor, New York; 2001).
Andris-Widhopf, J., Steinberger, P., Fuller, R., Rader, C. & Barbas, C. F., 3rd Generation of human scFv antibody libraries: PCR amplification and assembly of light- and heavy-chain coding sequences. Cold Spring Harb Protoc 2011 (2011), 10.1101/pdb.prot065573.
Almagro, J. C., Hernandez, I., del Carmen Ramirez, M. & Vargas-Madrazo, E. The differences between the structural repertoires of VH germ-line gene segments of mice and humans: implication for the molecular mechanism of the immune response. Mol Immunol 34, 1199–1214 (1997).
CAS PubMed Google Scholar
Chothia, C. et al. Structural repertoire of the human VH segments. J Mol Biol 227, 799–817 (1992).
CAS PubMed Google Scholar
Tomlinson, I. M., Cox, J. P., Gherardi, E., Lesk, A. M. & Chothia, C. The structural repertoire of the human V kappa domain. EMBO J 14, 4628–4638 (1995).
CAS PubMed PubMed Central Google Scholar
Chailyan, A., Marcatili, P., Cirillo, D. & Tramontano, A. Structural repertoire of immunoglobulin lambda light chains. Proteins 79, 1513–1524 (2011).
CAS PubMed Google Scholar
DeKosky, B. J. et al. High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire. Nat Biotechnol 31, 166–169 (2013).
CAS PubMed PubMed Central Google Scholar
de Wildt, R. M., Hoet, R. M., van Venrooij, W. J., Tomlinson, I. M. & Winter, G. Analysis of heavy and light chain pairings indicates that receptor editing shapes the human antibody repertoire. J Mol Biol 285, 895–901 (1999).
CAS PubMed Google Scholar
Martin, A. C. & Thornton, J. M. Structural families in loops of homologous proteins: automatic classification, modelling and application to antibodies. J Mol Biol 263, 800–815 (1996).
CAS PubMed Google Scholar
Raghunathan, G., Smart, J., Williams, J. & Almagro, J. C. Antigen-binding site anatomy and somatic mutations in antibodies that recognize different types of antigens. J Mol Recognit 25, 103–113 (2012).
CAS PubMed Google Scholar
Zhou, H. et al. Structural insights into the down-regulation of overexpressed p185(her2/neu) protein of transformed cells by the antibody chA21. J Biol Chem 286, 31676–31683 (2011).
CAS PubMed PubMed Central Google Scholar
Fisher, R. D. et al. Structure of the complex between HER2 and an antibody paratope formed by side chains from tryptophan and serine. J Mol Biol 402, 217–229 (2010).
CAS PubMed Google Scholar
Sidhu, S. S. & Weiss, G. A. in Phage Display, Edn. 1st. (eds. T. Clackson & H. B. Lowman ) Ch. 2, 27–41 (Oxford University Press, New York; 2004).
Chothia, C. et al. Conformations of immunoglobulin hypervariable regions. Nature 342, 877–883 (1989).
ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the support of the research projects 103-2622-B-001-001-CC2 and 103-2325-B-001-023 from Ministry of Science and Technology (MoST) in Taiwan and support of postdoctoral position (S-T Shen) under the project 103-2811-B-001-006 from MoST. This research was also supported by Academia Sinica and MoST [104-0210-01-09-02]. We would like to thank Dr. Peter Kwong of NIH for helpful discussion and Dr. Todd Juan of JHL biotech Inc. for collaboration in IgG production and characterization.

Author information

Chen Hong-Sen and Hou Shin-Chen contributed equally to this work.

Authors and Affiliations

Genomics Research Center, Academia Sinica, Taipei, 115, Taiwan
Hong-Sen Chen, Shin-Chen Hou, Jhih-Wei Jian, King-Siang Goh, San-Tai Shen, Yu-Ching Lee, Jhong-Jhe You, Hung-Pin Peng, Chung-Ming Yu, Ing-Chien Chen, Chao-Ping Tung, Tzu-Han Chen, Kuo Ping Chiu, Che Ma, Chih Yuan Wu & An-Suei Yang
Institute of Biomedical Informatics, National Yang-Ming University, Taipei, 112, Taiwan
Jhih-Wei Jian
Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan
Jhih-Wei Jian
Institute of Biological Chemistry, Academia Sinica, Taipei, 115, Taiwan
Wen-Chih Kuo, Shui-Tsung Chen, Ming-Chi Peng, Andrew H.-J. Wang & Sheng-Wei Lin

Authors

Hong-Sen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shin-Chen Hou
View author publications
You can also search for this author in PubMed Google Scholar
Jhih-Wei Jian
View author publications
You can also search for this author in PubMed Google Scholar
King-Siang Goh
View author publications
You can also search for this author in PubMed Google Scholar
San-Tai Shen
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Ching Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jhong-Jhe You
View author publications
You can also search for this author in PubMed Google Scholar
Hung-Pin Peng
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Chih Kuo
View author publications
You can also search for this author in PubMed Google Scholar
Shui-Tsung Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Chi Peng
View author publications
You can also search for this author in PubMed Google Scholar
Andrew H.-J. Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Ming Yu
View author publications
You can also search for this author in PubMed Google Scholar
Ing-Chien Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chao-Ping Tung
View author publications
You can also search for this author in PubMed Google Scholar
Tzu-Han Chen
View author publications
You can also search for this author in PubMed Google Scholar
Kuo Ping Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Che Ma
View author publications
You can also search for this author in PubMed Google Scholar
Chih Yuan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Sheng-Wei Lin
View author publications
You can also search for this author in PubMed Google Scholar
An-Suei Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.-S.Y., H.-S.C., S.-C.H., J.-W.J., H.-P.P., K.-S.G. and S.-T.S. designed the research; H.-S.C., S.-C.H., J.-W.J., H.-P.P., K.-S.G., S.-T.S., Y.-C.L., J.-J.Y., W.-C.K., S.-T.C., M.-C.P, H.-J.W., C.-M.Y., I.-C.C., C.-P.T., T.-H.C., K.-P.C., C.M., C.-Y.W. and S.-W.L. performed the research; A.-S.Y., H.-S.C., S.-C.H., J.-W.J., H.-P.P. and K.-S.G. wrote the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Chen, HS., Hou, SC., Jian, JW. et al. Predominant structural configuration of natural antibody repertoires enables potent antibody responses against protein antigens. Sci Rep 5, 12411 (2015). https://doi.org/10.1038/srep12411

Download citation

Received: 09 February 2015
Accepted: 11 June 2015
Published: 23 July 2015
DOI: https://doi.org/10.1038/srep12411

This article is cited by

Eradicating mesothelin-positive human gastric and pancreatic tumors in xenograft models with optimized anti-mesothelin antibody–drug conjugates from synthetic antibody libraries
- Hung-Ju Hsu
- Chao-Ping Tung
- An-Suei Yang
Scientific Reports (2021)
A panel of anti-influenza virus nucleoprotein antibodies selected from phage-displayed synthetic antibody libraries with rapid diagnostic capability to distinguish diverse influenza virus subtypes
- Chung-Ming Yu
- Ing-Chien Chen
- An-Suei Yang
Scientific Reports (2020)
Noninvasive assessment of characteristics of novel anti-HER2 antibodies by molecular imaging in a human gastric cancer xenograft-bearing mouse model
- Wei-Ying Kuo
- Jia-Jia Lin
- Chun-Yi Wu
Scientific Reports (2018)
High throughput discovery of influenza virus neutralizing antibodies from phage-displayed synthetic antibody libraries
- Ing-Chien Chen
- Yi-Kai Chiu
- An-Suei Yang
Scientific Reports (2017)
High throughput cytotoxicity screening of anti-HER2 immunotoxins conjugated with antibody fragments from phage-displayed synthetic antibody libraries
- Shin-Chen Hou
- Hong-Sen Chen
- An-Suei Yang
Scientific Reports (2016)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.