Main

The initial immune response to a new carbohydrate epitope arises largely from antibodies constructed from germline gene segments; however, the molecular basis underlying the mechanism by which a germline antibody recognizes a range of epitopes has been a subject of debate for decades. Germline gene segments have evolved to confer immunity to common pathogens, and the introduction of a new pathogen has the potential to decimate a population. Additionally, the ability of germline antibodies to adapt to new pathogens has been a central aspect of antibody theory, from the template instruction theory1,2 to the clonal selection theory3,4,5,6,7. To promote the survival of the organism, the portion of the germline dedicated to humoral response must balance between antibody segments that show the greatest adaptability to new pathogens with those segments that convey hereditary immunity to common pathogens.

The repertoire of germline gene segments is a critical consideration for carbohydrate-specific antibodies (which generally do not undergo class switching and affinity maturation) in immune surveillance for microbial pathogenesis and diseased tissue. The observed low affinity of antibodies to oligosaccharides has limited the study of the three-dimensional mechanisms of specificity of antibodies to carbohydrates, and there exist just a few reports of the structures of antibody fragments in complex with carbohydrate antigen8,9,10,11,12. Progress has been further limited because these reports consist largely of unrelated antibodies that are specific for markedly different oligosaccharide epitopes. This paucity in structural data has prevented an assessment of the relative contributions of germline gene segments to the recognition of bacterial carbohydrate epitopes. Here we have selected from an extensive library of monoclonal antibodies (mAbs) two highly homologous antibodies constructed largely from germline gene segments that recognize several distinct highly conserved epitopes of bacterial LPS.

Bacterial LPS is a major antigenic surface feature for all Gram- negative bacteria and generally consists of a lipid A moiety to which an inner core oligosaccharide consisting of 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo), an outer core region of heptoses and hexoses, and the polysaccharide O-antigen are successively attached. Chlamydial LPS and rough Re-type LPS found in enterobacterial genera including Salmonella, Escherichia and Proteus are truncated, lacking both the outer core and the O-antigen13,14,15. The inner core oligosaccharide of Re-type LPS features a Kdo disaccharide epitope αKdo(2→4)-αKdo (Fig. 1a), the smallest LPS structure required to maintain bacterial membrane barrier functions13,14. A unique Kdo trisaccharide epitope, αKdo(2→8)-αKdo(2→4)-αKdo (Fig. 1a), is present in all species of the Chlamydiaceae family but has not been observed in any other bacterium, and is thus an important taxonomic and diagnostic marker. One species, Chlamydophila psittaci, also shows the trisaccharide epitope αKdo(2→4)-αKdo(2→4)-αKdo in addition to the family- specific epitope16. Members of the bacterial family Chlamydiaceae are obligatory intracellular pathogens. Chlamydia trachomatis is a leading cause of preventable blindness17, sexually transmitted infection of the urogenital tract and acquired infertility18, and is also known to cause pneumonitis and urogenital tract infections in mice19. Chlamydophila pneumoniae causes infections of the respiratory tract20 and has been linked to bronchial asthma21, heart disease22,23,24, multiple sclerosis and Alzheimer's disease25. The ubiquitous nature of these LPS epitopes has made them high-profile targets for vaccine development26,27,28.

Figure 1: Chemical structures of antigens and antibodies.
figure 1

(a) α(2→8) antigens αKdo(2→8)–αKdo(2→O-allyl) disaccharide (top left), αKdo(2→8)-αKdo(2→4)-αKdo(2→O-allyl) trisaccharide antigens (top right) and αKdo(2→4)-αKdo(2→O-allyl) disaccharide (bottom right) were crystallized with S25-2; and pentasaccharide antigen αKdo(2→4)-αKdo(2→4)-αKdo(2→6)-βGlcN-4P(1→6)-αGlcN-1P (bottom left) was crystallized with S45-18. Figure produced using ChemSketch (Advanced Chemistry Developments). (b) Amino acid sequence alignment of S25-2 and S45-18 variable domains with the CDRs shaded. The numbering is based on studies by Chothia42,43,46 and the online Kabat database (http://www.bioinf.org.uk/abs/simkab.html)58. The residues in bold above the sequence alignment shows where S25-2 differs from germline in the regions encoded by the V and J immunoglobulin genes. The sequences encoded by the D minigenes are highlighted in green.

S25-2 is a mAb with affinity for the chlamydial family-specific terminal α(2→8)-linked Kdo residues and binds the trisaccharide αKdo(2→8)-αKdo(2→4)-αKdo against which it was raised29. Notably, S25-2 also binds the partial chlamydial epitope αKdo(2→8)-αKdo and the Re-type epitope αKdo(2→4)-αKdo disaccharides30. The mAb S45-18 is specific for the trisaccharide αKdo(2→4)-αKdo(2→4)-αKdo and does not bind to the αKdo(2→8)-αKdo(2→4)-αKdo trisaccharide epitope16.

A comparison of the polypeptide sequences of S25-2 with known murine germline sequences shows that S25-2 corresponds to the germline genes IGKV8-21 Y15982, IGKJ2 V00777, IGHV7S3 J00525 and IGHJ3 V00770 (IMGT, the international ImMunoGeneTics database, http://imgt.cines.fr; initiator and coordinator, Marie-Paule Lefranc, Montpellier, France) with only three mutations: Y95R (light chain), W100bR (heavy chain, Fig. 1b) and A100dS (heavy chain, Fig. 1b). S45-18 has additional mutations away from germline, including six residues in the light chain and seven in the heavy chain, not including the D region. CDRs L3 and H1 are common between S25-2 and S45-18, whereas CDRs L1, L2 and H2 have one or two changes in residue identity with no change of CDR length. The most important differences between S45-18 and S25-2 lie in CDRs H3 (ref. 31), which are based on different germline gene segments such as IGHJ4 V00770 in S45-18 (Fig. 1a), and the 'D' region, which is prone to mutation. Specifically, the sequence of S45-18 has more mutations away from germline, and the 'JH' region is encoded by a different germline gene. Unlike other CDRs, H3 is formed by two joining events involving the D (diversity) minigenes. During each rearrangement the flanking regions of the genes involved can undergo addition and deletion of nucleotides, and duplication of the D gene can also occur32. Mutation of germline sequences and the structural basis for improvement of affinity for protein antigen have been studied recently33; however, no corresponding model for carbohydrate recognition has been reported.

We have determined the structures of Fabs from both antibodies to high resolution in the presence and absence of several epitopes in order to establish the three-dimensional nature of a hereditary murine germline response to a carbohydrate epitope of a bacterial pathogen, the manner by which such a site can accommodate several distinct epitopes and the pathway by which the substitution of a few residues can transform the low-specificity S25-2 into the highly specific S45-18.

Results

The Fab from mAb S25-2 has been crystallized in complex with 2→O-allyl derivatives containing the α(2→8)-α(2→4) Kdo tri- saccharide, α(2→8) Kdo disaccharide, α(2→4) Kdo disaccharide epitopes and the Kdo monosaccharide, and unliganded in two different crystal forms. The Fab from mAb S45-18 has been crystallized in complex with the deacylated LPS pentasaccharide antigen αKdo(2→4)-αKdo(2→4)-αKdo(2→6)-βGlcN-4P(1→6)-αGlcN-1P containing the α(2→4)-α(2→4) Kdo trisaccharide epitope as well as in the unliganded form. Data have been collected from 1.49 to 2.2 Å resolution (Table 1). All of the CDRs and most of the polypeptide chains show excellent electron density, although CDR H3 of S25-2 shows high thermal motion in the unliganded structures and in the α(2→4) disaccharide and monosaccharide complexes. Well-defined electron density is observed for all antigen Kdo residues (Fig. 2a–d). The pentasaccharide antigen cocrystallized with S45-18 (Fig. 1a) contains two glucosamine phosphate residues that lie in excellent electron density in one of the two Fab molecules in the asymmetric unit of the liganded structure. These residues do not form part of the epitope and are not observed to contact the parent Fab, although they form a hydrogen bond to a water molecule that bridges to an adjacent Fab. The Kdo residues of the monosaccharide complex and α(2→8) disaccharide complex with S25-2 lie in the same position and conformation as the corresponding Kdo residues of the α(2→8)-α(2→4) trisaccharide complex. In general, antibody recognition of carbohydrate antigens has been observed to include a large number of stacking interactions9,11; however, antigen recognition in each S25-2 complex is dominated by hydrogen bonds, with only one residue (PheH97 (H, heavy chain)) forming substantial stacking interactions in the S45-18 complex.

Table 1 Data refinement and model statistics for unliganded and liganded forms of S25-2 Fab and S48-18 Fab
Figure 2: Observed 2Fo − Fc σA-weighted electron density maps for selected liganded antigens, contoured at 1.0 σ.
figure 2

(a) α(2→8)-α(2→4) Kdo trisaccharide bound by S25-2. (b) α(2→4) Kdo disaccharide bound by S25-2. (c) The Kdo monosaccharide bound by S25-2. (d) The pentasaccharide bound by S45-18. Figures 2,3,4 were produced using SETOR57.

Inherited terminal Kdo recognition site

All liganded S25-2 and S45-18 structures show a similar terminal Kdo (Kdo3) binding site made from CDRs L3, H1, H2 and H3 (Fig. 3a). The residues contacting Kdo3 in mAb S25-2 correspond to those found in known germline segments with the exception of ArgL95 (L, light chain), which is tyrosine in the germline; however, residue L95 occurs at the junction of two germline segments and is thus prone to mutation32. To allow the antibodies to bind several distinct epitopes, the Kdo3 binding site itself shows some flexibility in antigen positioning, because there is among the various complexes a 0.7-Å shift of Kdo3 which is accomplished while maintaining largely the same set of hydrogen bonds. The homologous antibody S45-18 recognizes Kdo3 of the α(2→4)-α(2→4) Kdo epitope using many of the corresponding residues (Fig 3a, bottom); however, S45-18 prevents this shift (and so decreases the ability to recognize the α(2→8) disaccharide) by the mutation of GluH100a to aspartate (Fig. 1b) and the insertion of a buried bridging water molecule (Fig. 3a). This bridging water molecule is also present in the unliganded S45-18 structure, though exposed to the solvent.

Figure 3: Binding environments for representative liganded antigens.
figure 3

The α(2→8)-α(2→4) trisaccharide bound to S25-2 is yellow, the α(2→4) disaccharide bound to S25-2 is magenta and the α(2→4)-α(2→4) trisaccharide of the pentasaccharide antigen bound to S45-18 is green. (a) All complexes share a related terminal Kdo binding pocket. (b) S25-2 shows flexibility in binding the remaining Kdo residues in range of distinct epitopes, whereas S45-18 uses a different CDR H3 to specifically recognize one antigen.

The flexibility of the common Kdo3 binding site in the S25-2 and S45-18 complexes is further illustrated by the conservation of two water molecules that bridge identical contacts between antibody and antigen yet are observed to shift position by 1.8 Å between the complexes.

S25-2 uses the same residues to recognize different epitopes

Although all complexes show Kdo3 as occupying similar positions in the binding site, the corresponding Kdo2 residues lie in markedly different conformations, and several corresponding amino acid residues show multiple functionalities (Fig. 3b). Specifically, TyrH33 forms a hydrogen bond to O5 of Kdo2 in the α(2→4) disaccharide but forms a hydrogen bond to O7 in the α(2→8)-α(2→4) trisaccharide complex. AsnH52d forms a hydrogen bond with O7 of Kdo2 in both complexes, even though Kdo2 lies in considerably different conformations in each complex. ArgL30c forms a bidentate salt bridge with the carboxylate of Kdo2 in the α(2→4) disaccharide complex but forms a hydrogen bond to Kdo1 O5 in the α(2→8)-α(2→4) trisaccharide (Fig. 3b). Finally, although water molecules are observed bridging the protein and Kdo2 in the α(2→8) complex, no ordered water molecules are observed about the α(2→4) disaccharide.

Swapping CDR H3 markedly changes specificity

Antibody S45-18 was raised against the α(2→4)-α(2→4) Kdo trisaccharide and is virtually homologous to S25-2 with the exception of CDR H3, which arises from a different germline gene (Fig. 1b). However, unlike S25-2, S45-18 shows high specificity and is not observed to bind to any other epitope. The ability of a related set of antigens to elicit an antibody response from a restricted set of germline genes is common34,35,36,37,38 and demonstrates the potential to combine the inherited terminal Kdo binding site with different CDR H3 loops to change and constrain the specificity toward a particular epitope.

The higher specificity of S45-18 stems from several factors. In addition to the change in Kdo3 binding caused by the E100aD (heavy chain, Fig. 1b) mutation, the longer length and altered sequence results in the projection of the bulk of PheH97 into the binding site (Fig. 3a,b), where it forms van der Waals contacts (so-called stacking interactions) with all three hexose rings of the α(2→4)-α(2→4) trisaccharide. The presence of PheH97 also precludes the binding of an α(2→8)-linked Kdo2 residue and forces the rearrangement of the conformation of Kdo2 from that observed in the S25-2 α(2→4) disaccharide complex. The mutation N52dK (heavy chain, Fig. 1) in S45-18 allows hydrogen bonds to form between the lysine and O6 and O7 of Kdo2, respectively. AsnL30a and ArgL30c show the same diverse functionality observed in complexes of S25-2 by forming several hydrogen bonds to Kdo1 (Fig. 3b). In fact, ArgL30c forms hydrogen bonds with a different antigen moiety in each of the complex structures in which it participates.

These are the first reported examples of an antigen-binding fragment crystallized with a carbohydrate antigen having charged groups, and notably, almost all the antigen's carboxylic acid moieties form salt bridges in the liganded structures. The charged glucosamine phosphate residues linked to the α(2→4)-α(2→4) trisaccharide in S45-18 are not observed to form interactions with any protein atoms.

Discussion

The most marked aspect of these complexes is the facility of mAb S25-2 to recognize the distinct carbohydrate epitopes formed by different glycosidic linkages. S25-2 and S45-18 both use one portion of the combining site to recognize the terminal Kdo (Kdo3) in each epitope in the same way and with high specificity, whereas the rest of the combining site adapts to or excludes the remainder of the different epitopes. Because the sequence of S25-2, and to a lesser extent that of S45-18, corresponds to germline segments, these structures offer insight into the balance found in inherited germline gene segments between the maintenance of immunity to common pathogens and the flexibility to recognize new pathogens.

Heredity versus adaptability

In addition to the use of many amino acid residues in binding to ligand, mAb S25-2 shows substantial flexibility in the conformation of CDR H3 (and to a lesser extent L1), between the liganded and unliganded forms (Fig. 4a). The four liganded forms have approximately the same conformation, which is different from that of the two unliganded forms, and shows that S25-2 undergoes a significant induced fit upon antigen binding. In contrast, the more specific S45-18 shows little induced fit (Fig. 4b).

Figure 4: Conformational variation observed among liganded and unliganded forms of S25-2 and S45-18.
figure 4

(a) Overlap of unliganded S25-2 (crystal form 1, dark blue; crystal form 2, white), S25-2 bound to α(2→8)-α(2→4) trisaccharide (yellow), S25-2 bound to α(2→4) disaccharide (magenta) and S25-2 bound to the Kdo monosaccharide (cyan). The trace of the α(2→8) disaccharide complex is similar to the α(2→8)-α(2→4) trisaccharide complex and is not shown. (b) Overlap of unliganded (white) and liganded (green) S45-18.

The observation of an antibody corresponding to germline sequence showing substantial flexibility has fundamental implications when considering the competing aspects in antigen recognition by germline antibodies. The inherited germline antibody response can be critical, and animals typically carry hereditary resistance to common pathogens. This is important for carbohydrate antigens, which usually cannot induce T-cell help and stimulate the production of antibodies with higher affinities. In addition, germline antibodies in general must also exhibit some flexibility to cope with new or changing pathogen surface structures. However, any advantage of flexibility can be offset by the large entropic penalty incurred by the immobilization of labile antibody CDR loops.

S25-2 shows how a germline antibody could achieve this balance by coding for the recognition of the terminal Kdo residue using CDRs with stable conformations while leaving the rest of the combining site to either exhibit flexibility or impose specificity. Such a strategy allows this antibody to recognize several epitopes showing the terminal Kdo residue without the entropic penalties of a more flexible combining site. Antibody S45-18 shows how removing this adaptability can lead to an antibody of higher specificity.

The original concept of the highly adaptable germline antibody of the template instruction theory1,2 was abandoned when clonal selection was found to be the basis of antibody response3,4,5,6,7; however, it is becoming increasingly apparent that flexibility is a key aspect of germline antibody recognition39,40,41 when coupled to hereditary recognition of pathogen epitopes.

Canonical forms, inherited binding sites and CDR H3

H3 has a unique genetic origin among the murine CDRs. Whereas L1, L2, L3, H1 and H2 correspond closely to a few germline sequences, H3 displays a substantially higher level of sequence variation due to the insertion of D minigenes (reviewed in ref. 32; Fig. 1b). These D genes comprise 11–23 base pairs and during genetic rearrangement can undergo duplication with the flanking regions of the genes subject to high levels of mutation. This nonencoded sequence variation contributes to the special properties observed for H3.

For example, one of the seminal advances in the prediction of protein structure was the characterization of the canonical conformations of most of the CDRs by Chothia and Lesk42,43,44,45,46,47; however, a successful methodology for the prediction of the conformation of CDR H3 remained elusive for many years. This became increasingly frustrating as reports mounted of the importance of H3 in determining antigen specificity40,48,49,50. Important headway in predicting H3 conformation was reported by Nakamura in 1996 (ref. 49); however, S25-2 exemplifies the reason that H3 remains difficult. Using Nakamura's method, CDR H3 of S45-18 corresponds to one of the new canonical forms, as does H3 of the structure of the liganded S25-2 (ref. 49); however, the conformation of H3 in the unliganded form of S25-2 is still not predicted from its sequence. Notably, the single hydrogen bond formed between GluH100a and Kdo O4 in the structure of S25-2 bound to Kdo monosaccharide is sufficient to convert CDR H3 from an unpredicted form to a predicted form.

Finally, the structures of S25-2 and S45-18 offer insight into an evolutionary strategy behind the observed high sequence variation in H3, illustrating the requirement that antibody germline sequences possess a balance between inherited immunity and the ability to respond to new pathogens. Both of these antibodies reveal in this case that inherited immunity is provided largely by CDR loops that are known to have greater genetic and conformational stability, whereas the determination of specificity (or lack thereof) is made by the more variable and flexible H3. It seems probable that this is a general strategy for other classes of epitopes, because the combination of the inherited terminal Kdo binding site in these mAbs provided by L3, H1 and H2 and with different H3 loops gives the ability to produce efficiently both antibodies that are specific for a range of related Kdo epitopes as well as antibodies that have high specificity.

Methods

Purification and crystallization of Fab.

The production of IgG S25-2 and IgG S45-18 has been described16,29, as has the isolation and purification of the corresponding Fabs31. S25-2 Fab was crystallized without ligand, in complex with a synthetic trisaccharide analog of the family-specific epitope αKdo(2→8)-αKdo(2→4)-αKdo(2→O-allyl), and also with two synthetic disaccharides, αKdo(2→8)-αKdo(2→O-allyl) and αKdo(2→4)-αKdo (2→O-allyl) and Kdo monosaccharide (Kdo ammonium salt, K2755 from Sigma) corresponding to its partial epitopes. The chemical identities of the carbohydrate LPS components were deduced from earlier studies51,52, and the synthetic analogs were shown to mimic their natural counterparts29. S45-18 Fab has been crystallized without ligand and in complex with the deacylated LPS pentasaccharide bisphosphate antigen αKdo(2→4) -αKdo(2→4)-αKdo(2→6)-βGlcN-4P(1→6)-αGlcN-1P. The crystals were grown by the hanging-drop vapor diffusion method in the presence of 80-fold molar excess of antigen to Fab under the conditions reported31, with the exception of S25-2 Fab in complex with the αKdo(2→4)-αKdo(2→O-allyl) disaccharide (18 mg ml−1 Fab, 100-fold molar excess antigen, 40 mM MgCl2, 30 mM ZnCl2, 8% (w/v) PEG 4000, 11% (w/v) ethylene glycol, 20 mM Tris, pH 8, over a reservoir containing 200 mM MgCl2, 25% (w/v) PEG 4000, 100 mM Tris, pH 8.5) and Kdo monosaccharide (15 mg ml−1 Fab, 150-fold molar excess antigen, 40 mM MgCl2, 20 mM ZnCl2, 7% (w/v) PEG 4000, 9% (w/v) ethylene glycol, 20 mM Tris, pH 8, over a reservoir of 200 mM MgCl2, 25% (w/v) PEG 4000, 100 mM Tris, pH 8.5).

Data collection.

Data was collected at wavelength 1.15 Å at Brookhaven National Laboratories (Upton, New York, USA) at the National Synchrotron Light Source (NSLS) beamline X8C (Area Detector Systems Corporation Quantum 4Q CCD detector; Oxford Cryosystems Model 600 low-temperature system). DENZO and SCALEPACK were used as part of the HKL 2000 package53 for data integration and scaling.

Structure determination.

Molecular replacement (MR) and refinement were conducted using CNS54 with the variable domain of the unliganded YsT9.1 Fab (PDB entry 1MAM)55,56 initially as the search model for the unliganded S45-18 Fab. Rigid-body, simulated annealing, positional and B-factor refinements were carried out using CNS. SETOR57 and PyMol (DeLano Scientific; http://www.pymol.org) were used for manual fitting of σA-weighted 2Fo − Fc and Fo − Fc electron density maps. The refined structure of unliganded S45-18 Fab was then used as a search model for the remaining S45-18 and S25-2 liganded structures. Unambiguous MR solutions were found for each data set using the CNS scripts provided for rotation, translation and translation-dimer searches. The diffraction data and model statistics are given in Table 1.

Coordinates.

Atomic coordinates have been deposited in the Protein Data Bank S25-2 unliganded forms 1 and 2 (accession codes 1Q9K and 1Q9L, respectively); S25-2 in complex with α(2→8)-α(2→4) Kdo trisaccharide, α(2→8) Kdo disaccharide, α(2→4) Kdo disaccharide and Kdo monosaccharide (accession codes 1Q9Q, 1Q9R, 1Q9T and 1Q9V, respectively); and S45-18 unliganded and in complex with the pentasaccharide containing the α(2→4)-α(2→4) Kdo trisaccharide (accession codes 1Q9O and 1Q9W, respectively).