Main

The third variable (V3) loop region of the HIV–1 envelope glycoprotein gp120 plays a significant role in HIV infection and AIDS. As the principal neutralizing determinant of the virus1, the V3 loop elicits potently neutralizing antibodies2,3 and has been a target for AIDS vaccine development4. The V3 loop is also required for viral entry into target T cells and macrophages5, interacts with chemokine co–receptors on the surfaces of these cells6,7,8,9 and is a determinant of viral tropism6,10. The highly variable amino acid sequence of the V3 loop contributes to the ability of HIV to escape the host immune response, but it contains a highly conserved11 central Gly–Pro–Gly–Arg (GPGR) motif that has been proposed to have structural and biological significance5,11,12. The GPGR motif has been predicted to adopt a type II β–turn conformation11,13. Crystal structures of V3 peptides bound to Fab fragments of neutralizing monoclonal antibodies12,14,15, in which the peptides derive from gp120 of HIV–1 strain MN and the antibodies were elicited with a disulfide–linked peptide representing the entire V3 loop of strain MN, have revealed a more complex, double β–turn conformation. A similar GPGR conformation has been obtained from liquid–state NMR measurements on a bacteriophage coat protein–V3 peptide fusion protein solubilized in micelles16. NMR studies of unbound V3 peptides in solution indicate fractional population of type I and/or type II β–turns by the GPGR motif17,18,19,20,21,22. To promote crystallization, the V3 loop was deleted from the gp120 construct in a recent crystal structure of a ternary CD4–gp120–antibody Fab complex23. Significant questions remain12,15 regarding the uniqueness of the GPGR conformation in gp120–antibody and gp120–chemokine receptor complexes, as well as in unbound gp120.

In this letter, we report a structural study of the V3 loop peptide RP1351,24, which comprises the central 24 residues of the 40–residue V3 loop of HIV–1 strain IIIB (residues 308–331, sequence Asn–Asn–Thr–Arg–Lys–Ser–Ile–Arg–Ile–Gln–Arg–Gly–Pro– Gly–Arg–Ala–Phe–Val–Thr–Ile–Gly–Lys–Ile–Gly), bound to the neutralizing monoclonal anti–gp120 antibody 0.5β (ref. 2), which was elicited with full–length gp120 of strain IIIB (gp120IIIB). We apply two recently developed 13C solid–state NMR techniques25,26,27,28 to frozen solutions of the peptide–antibody Fab complex in which the peptide is doubly 13C–labeled. The solid–state NMR measurements provide information about the peptide backbone conformation in the GPGR motif. We find that the GPGR conformation in the RP135–0.5β complex differs significantly from the previously reported and predicted GPGR conformations. Our results indicate that the conformation in V3 loop–antibody complexes is antibody dependent and suggest that the GPGR motif may be conformationally heterogeneous in unbound gp120. This work also demonstrates the capability of the solid–state NMR methods to provide atomic–level local structural information in high–molecular–weight, noncrystalline systems.

Solid–state NMR measurements

Solid–state 13C–NMR measurements were carried out at –125 °C in a 9.39 T field on frozen solutions prepared from nearly equimolar mixtures of 0.5β Fab fragments and 13C–labeled RP135. Samples were prepared with three different doubly 13C–labeled RP135 peptides, labeled at carbonyl sites of Gly 319 and Pro 320, Pro 320 and Gly 321, and Gly 321 and Arg 322 in the GPGR motif (samples 1, 2 and 3, respectively). Examples of one–dimensional 13C magic–angle spinning (MAS) NMR spectra of sample 2 are shown in Fig. 1 . In the absence of Fab (Fig. 1a), we observe a single carbonyl NMR line at 171.2 p.p.m. with a 6 p.p.m. linewidth typical of structurally heterogeneous peptides in frozen solution. The observed heterogeneity is consistent with liquid–state NMR studies of free V3 peptides17,18,19,20,21,22. In the presence of Fab (Fig. 1b), 13C–NMR signals of the labeled peptide are partially obscured by natural–abundance 13C signals of the high–molecular–weight (>50,000 Mr) complex. Double–quantum filtering suppresses these natural–abundance signals and results in a clean spectrum of the labeled carbonyl sites (Fig. 1c) with lines 1.5 p.p.m. wide at 168.9 p.p.m. (Gly 321) and at 175.8 p.p.m. (Pro 320). The narrow carbonyl linewidths in the frozen RP135–0.5β Fab complex, which are comparable to those in crystalline peptides25,26,27, and the 6.9 p.p.m. chemical–shift difference, which exceeds the random–coil shift difference of 2.3 p.p.m.29, provide compelling evidence that the GPGR motif is well structured in the complex in frozen solution.

Figure 1: 13C–NMR spectra of the gp120 V3 loop peptide RP135, with 13C labels at the carbonyl sites of Pro 320 and Gly 321 (sample 2).
figure 1

a, The free peptide in frozen solution (1:1 v/v glycerol/H2O, pH 7.2, –125 °C). Glycerol was added to suppress crystallization. b, c, The complex of the peptide with Fab fragments of the anti–gp120 monoclonal antibody 0.5β (pH 7.0, no glycerol, –125 °C, 4 mM peptide and Fab concentrations). All spectra were acquired at a carrier frequency of 100.4 MHz, with magic–angle spinning at νR = 3.94 kHz. The frequency scale indicates chemical shifts relative to tetramethylsilane, giving isotropic shifts near 175 p.p.m. for carbonyl carbons. Asterisks indicate spinning sideband lines, which are separated from the isotropic shifts by integral multiples of νR. Large signals between 55 and 80 p.p.m. in (a) from natural–abundance 13C in glycerol have been cropped. In (b), large natural–abundance 13C–NMR signals, primarily from the Fab, appear in both the carbonyl and the aliphatic regions of the spectrum. In (c), natural–abundance signals are suppressed by double–quantum filtering, resulting in a clean spectrum of the labeled carbonyl sites of the antibody–bound peptide. This spectrum corresponds to the data point at 8.12 ms in Fig. 2b.

Two solid–state 13C–NMR methods, developed specifically for investigations of backbone conformations in doubly carbonyl–labeled peptides and proteins25,26,27,28, were applied to samples 1, 2 and 3 to constrain the φ and Ψ dihedral angles that define the backbone conformation at Pro 320, Gly 321 and Arg 322, respectively. Data from 2D MAS exchange spectroscopy (Fig. 2a) take the form of the intensities of crosspeaks that connect MAS sideband lines of the two labeled sites in a 2D spectrum. These intensities depend on the relative orientations of the two carbonyl groups (more precisely, the two carbonyl chemical–shift anisotropy (CSA) tensors), so that the structural information in the data is purely angular in nature, with strong dependences on both φ and Ψ25,26,27,28. Data from constant–time double–quantum–filtered dipolar dephasing (CTDQFD) measurements27 (Fig. 2b) take the form of the build–up and decay of carbonyl signal intensity with increasing duration of nuclear spin evolution under an effective magnetic dipole–dipole coupling created by a radiofrequency–driven recoupling (RFDR) pulse sequence27,30,31. Because the coupling scales as R–3, where R is the carbon–carbon internuclear distance, CTDQFD curves depend principally on the angle φ. Dependence of the effective coupling on the CSA tensor orientations imparts a weaker dependence on Ψ (ref. 27). Application of both solid–state NMR methods to the same doubly 13C–labeled samples increases the precision of the structural constraints and minimizes ambiguities that arise from CSA symmetry properties26 and signal–to–noise limitations. Interference from natural–abundance 13C signals is minimized by double–quantum filtering27,32 in CTDQFD measurements and by the fact that isolated 13C nuclei contribute only to diagonal and weak intrasite crosspeaks25,26 in 2D MAS exchange measurements.

Figure 2: Examples of structural data from solid–state NMR measurements on the RP135–0.5β Fab complex in frozen solution.
figure 2

a, A 2D MAS exchange spectrum of the complex with 13C labels at carbonyl sites of Pro 320 and Gly 321 of RP135 (sample 2), at νR = 2.5 kHz. Intersite crosspeaks, which connect spinning sideband lines of the two labeled carbonyl sites and appear as doublets oriented perpendicular to the diagonal of the 2D spectrum, contain the structural information. b, Experimental CTDQFD data for sample 2 (circles) and sample 3 (triangles), at νR = 3.94 kHz. Sample 3 contains the complex with 13C labels at Gly 321 and Arg 322 of RP135. CTDQFD simulations, using φ,Ψ values that correspond to minima in the ν2 plots in Fig. 3, are also shown for samples 2 (dashed line, simulated with φ,Ψ = –62°, 71°) and 3 (dotted line, simulated with φ,Ψ = –141°, 76°). Details of the measurements are given in the Methods.

Previous experiments on model peptides of known structure25,26,27,28, both in polycrystalline form25,26,27 and in noncrystalline frozen solution27,28, have shown that the 2D MAS exchange and CTDQFD methods employed in this work yield best–fit φ and Ψ values with an accuracy better than ±10°. The accuracy of related RFDR measurements of internuclear distances has also been convincingly demonstrated on model systems30.

Analysis of structural constraints

2D MAS exchange spectra were recorded for all three samples. CTDQFD data were acquired for samples 2 and 3. The data were analyzed by comparison with numerical simulations, as described25,26,27,28. For each data set, the deviation χ2 (φ,Ψ) between experimental and simulated data was evaluated as a function of the φ and Ψ values assumed in the simulations, with the definition:

where Ei and Si(φ,Ψ) are the experimental and simulated data points, respectively, σ2 is the mean–squared uncertainty per data point, λ(φ,Ψ) is a scaling factor calculated to minimize χ2 at each φ,Ψ pair, and N is the number of data points. For samples 2 and 3, the χ2 values for 2D MAS exchange and CTDQFD data sets were summed to give the total deviation between experiments and simulations. The resulting functions, which display the structural information in the data in an explicit manner, are shown in Fig. 3. Only negative values of φ are shown because, for these NMR measurements, Si(φ,Ψ) = Si(-φ,-Ψ), so that χ2 (φ,Ψ) has inversion symmetry26. For Pro 320 (Fig. 3a), minima in χ2 that represent good fits to the NMR data occur at (φ,Ψ) = (–68°,146°), (–163°,86°) and (–111°,–103°). Of these, only the minimum at (–68°,146°) is physically reasonable because of the well–known restriction of φ to the range –70° ± 30° for prolines33. For Arg 322 ( Fig. 3c), a single minimum in χ2 occurs at (–141°,76°). The symmetry–related minimum at (141°,–76°) is ruled out because this area of the φ,Ψ plane is not populated by arginines33. For Gly 321 (Fig. 3b), minima that represent good fits to the data occur at (–62°,71°) and (–62°,151°). The symmetry–related minima at (62°,–71°) and (62°,–151°) also represent allowed glycine conformations.

Figure 3: Total deviation χ2 between simulated and experimental solid–state NMR data, as a function of the dihedral angles φ and Ψ assumed in the simulations, for three successive residues in the GPGR motif of the RP135–0.5β Fab complex.
figure 3

Contour levels increase in units of 10, with red regions, which represent the best–fit φ and Ψ values for each residue, corresponding to χ2< 21 (a), χ2 < 66 (b) and χ2 < 97 (c). Black regions correspond to χ2 > 61 (a), χ2 > 106 ( b) and χ2 > 137 (c). The precision of the solid–state NMR structural constraints is indicated approximately by the sizes of the yellow regions. Crosses indicate φ and Ψ values for the corresponding residues in the RP142–59.1 Fab complex. These values were extracted from coordinates determined by X–ray crystallography12, as retrieved from the Protein Data Bank. Cross sizes do not represent uncertainties in the X–ray structure.

To determine if any of the four Gly 321 conformations that are consistent with the solid–state NMR data are inconsistent with steric constraints or other energetic considerations, Langevin dynamics (LD) simulations were carried out for the peptide Ac–GPGRA–NH2 (Gly 319 through Ala 323, with acetyl and amide capping groups) in vacuo at 300 K, using the CHARMM simulation program and standard CHARMM bond, angle, improper, van der Waals and electrostatic (but with neutralized arginine side chains) potentials. The χ2(φ,Ψ) surfaces for each data set were included in the LD simulations as pseudopotential energies after multiplication by kT(2(N–1))–1/2, where the factor of (2(N–1))–1/2 represents the approximate standard deviation in χ2 for N – 1 degrees of freedom. The NMR–based pseudopotentials effectively pinned the peptide backbone conformation close to the best–fit φ,Ψ pairs for Pro 320 and Arg 322 and to any one of the four best–fit φ,Ψ pairs for Gly 321 listed above, depending on the initial conformation chosen for the simulation run. No significant dependence of the average total CHARMM potential energy on the φ,Ψ pair adopted by Gly 321 was observed, implying that none of the four Gly 321 conformations can be ruled out on energetic grounds.

Additional constraints on the antibody–bound conformation of RP135 come from liquid–state nuclear Overhauser effect (NOE) data34, extracted from NOESY difference experiments on RP135–0.5β Fab complexes with selective deuteration, mutation and truncation of the peptide. These constraints define an antiparallel β–sheet structure for residues 311–316 and 324–327 of RP135, which flank the GPGR motif. The GPGR conformation is not constrained by the NOE data34. LD simulations on residues 311–327 of RP135, incorporating both the solid–state NMR pseudopotentials and 61 interresidue NOE distance constraints, indicate that all four best–fit φ,Ψ pairs for Gly 321 are consistent with the β–sheet structure proposed34.

Implications for V3 loop structure and function

Because the monoclonal antibody 0.5β was elicited with intact gp120IIIB (ref. 2), we expect the GPGR conformation in the RP135–0.5β complex to reflect the GPGR conformation in the complex of gp120IIIB with 0.5β. Our NMR data on the RP135–0.5β complex therefore provide atomic–resolution constraints on the structure of a region of gp120 that is of particular biological relevance, in antibody–bound form. As summarized in Table 1, the conformation of the GPGR motif in the RP135–0.5β complex is clearly different from the conformation determined in earlier crystallographic studies of the V3 loop peptide RP142 bound to Fab fragments of monoclonal antibody 59.112. The backbone dihedral angles in the GPGR motif in the RP135–0.5β complex also differ significantly from those of the type II β–turn predicted by sequence analyses11,13,35 and supported in part by the RP142–59.112 and MP1–50.112,14 peptide–antibody Fab crystal structures and by liquid–state NMR studies of unbound V3 peptides16,17,18,19,20,21,22. The GPGR conformation in RP135–0.5β is also different from all other standard β–turns35. Although the Pro 320 conformation is similar in all structures in Table 1, all possible φ,Ψ pairs for Gly 321 and Arg 322 in RP135–0.5β are inconsistent with the type II/type III double β–turn conformation of the GPGR motif in RP142–59.1, as depicted in Fig. 4. The extended conformation of Arg 322 in RP135–0.5β is in marked contrast to the 310–helix–like conformation in RP142–59.1.

Table 1 Comparison of backbone dihedral angles (φ,Ψ) in the GPGR motif of the V3 loop
Figure 4: Backbone conformations for the GPGR motif of the RP135–0.5β complex that are consistent with solid–state NMR data, corresponding to the minima in χ2 in Fig. 3.
figure 4

Segments shown include Cα of Gly 319 through Cα of Ala 323, which is the portion of the RP135 backbone constrained by the NMR data. Side chains are not shown, with the exception of Pro 320. For comparison, the GPGR backbone conformation in the RP142–59.1 complex12 is shown in thinner lines. Proline residues in RP135 and RP142 are optimally aligned, and overall orientations are chosen to make clear the conformational differences. φ,Ψ values for Gly 321 are (a) –62°,71°, (b) 62°,–71°, (c) –62°,151° and (d) 62°,–151°. Figure created with the program MOLMOL44.

The 59.1 and 0.5β antibodies differ in that the former was elicited with RP70, a 40–residue disulfide–linked V3 loop peptide from HIV–1 strain MN (ref. 3), while the latter was elicited with full–length gp120IIIB ( ref. 2). Although both RP135 and RP142 (which represents a central portion of RP70) contain the conserved GPGR motif in their epitopes2,3, the QR insert immediately preceding GPGR in RP135 is absent in RP142, and the two peptides otherwise differ in 9 of 22 positions. It is therefore not surprising that the GPGR conformations differ in the RP135–0.5β and RP142–59.1 complexes. More specifically, at least three possibilities may account for the difference: (i) the GPGR motif is conformationally heterogeneous in unbound RP70, unbound gp120IIIB or both; (ii) the GPGR conformation is well defined in both unbound RP70 and unbound gp120IIIB but different in the two cases because of sequence differences and the loss of interactions with other regions of gp120 in RP70; (iii) the GPGR conformation is well defined and the same in the unbound states of RP70 and gp120IIIB, but is significantly distorted by binding to 59.1, 0.5β or both. The third possibility is at variance with crystallographic studies of other peptide–antibody and protein–antibody complexes36,37, which generally show little distortion of well–structured protein epitopes upon interaction with antibodies. The second possibility seems unlikely because both 0.5β and 59.1 neutralize the HIV–1 IIIB strain2,3. In both cases, neutralization presumably involves binding the GPGR region of gp120IIIB, because both 0.5β and 59.1 have been shown to bind the GPGR region of RP1352,3. Thus, the GPGR motif in gp120IIIB appears to have sufficient conformational flexibility to adopt both the RP135–0.5β and the RP142–59.1 conformations, in accordance with the first possibility.

There is strong experimental evidence that the V3 loop is a site of interaction between gp120 and the chemokine co–receptors of human T and macrophage cells5,6,7,8,9,10 during HIV infection. As in its interactions with neutralizing antibodies, the V3 loop may adopt a well–defined conformation upon interacting with co–receptors. Some viral strains can bind to several distinct co–receptors6,38,39, an adaptive advantage that may be enhanced by the flexibility of the V3 loop. The conserved nature of the GPGR motif may arise out of structural constraints specific to the co–receptor–bound state.

Generality of the methods

The solid–state NMR methods described here are a new general approach to structural investigations of protein–bound peptides, which often serve as useful models for protein–protein interactions. This approach provides local structural constraints that are particularly useful in the testing and development of structural models and when questions of specific structural details are at issue. Crystals are not required. Large (100,000 Mr) complexes can be studied. The molecular–weight limit is dictated principally by signal–to–noise considerations, not by dynamic correlation times. Association and dissociation rates of the complexes are not important, and binding constants as low as 104–105 M–1 are feasible. Conformations of peptides in complexes with membrane–bound proteins, about which little data are otherwise available, may also be studied with these methods. In contrast to other solid–state NMR methods that provide internuclear distance40,41 or dihedral–angle42,43 constraints, the methods described here allow both φ and ψ to be determined or constrained by measurements on a single doubly labeled sample.

Methods

Samples for solid–state NMR were prepared by first mixing buffered solutions of labeled peptide and Fab, with typically 0.9 μmol of peptide and 1.0 μmol of Fab in 1 ml of 10 mM phosphate buffer, pH 7.2. Solutions also contained 0.05% NaN3 as a preservative and 1 mM CuNa2EDTA to reduce proton spin–lattice relaxation times to roughly 1 s. Sample volumes were reduced to 200 μl by vacuum centrifugation, with care taken to avoid freezing of the sample at this stage. Samples were then pipetted into 6–mm–diameter MAS rotors (240 μl volume), frozen by immersion in liquid nitrogen and then transferred into the precooled MAS NMR probe without thawing.

2D MAS exchange data were obtained with the radiofrequency (rf) pulse sequence CP – t1 – P – τ – P – t2, where CP represents cross–polarization from protons to 13C nuclei, t1 is the evolution period, P is a 13C 90° pulse, τ is the exchange period (500 ms in these measurements) and t2 is the detection period. Proton decoupling was applied during t1 and t2 but not τ. Pulses were synchronized with sample spinning as described25,26. Elongated crosspeaks running parallel to the diagonal in Fig. 2a are intrasite crosspeaks primarily from natural–abundance 13C nuclei, due to 14N nuclear spin–lattice relaxation25,26. CTDQFD data were obtained with the pulse sequence [CPy – (RFDR)L – Px]ξ – Py – (RFDR)M – Py – Px – (RFDR)N – t2 where (RFDR)M represents application of a rf–driven dipolar recoupling sequence27,30,31 for M sample rotation periods, x and y indicate rf pulse phases, and ξ represents overall phase shifts for double–quantum filtering27,32. In Fig. 2b, the total double–quantum–filtered 13C–NMR signal from the labeled carbonyl sites is plotted as a function of the dipolar evolution time, defined as (M – N)/νR. M + N is kept constant at 96 for sample 2 and 112 for sample 3. L is 32 for sample 2 and 48 for sample 3. Total acquisition times were 10 days for each 2D MAS exchange spectrum and 3 days for each CTDQFD data set. Intensities of intersite crosspeaks between spinning sidebands of order –3 to +3 were measured from 2D MAS exchange spectra for samples 2 and 3, and of order –3 to +2 for sample 1. Intensities of all significant spinning sidebands (order –1 to +1) were summed in CTDQFD spectra.

Simulations of NMR data assumed the CSA tensor orientations and chemical–bond geometries described26. For sample 1, 2D MAS exchange simulations were also carried out assuming a cis Gly–Pro peptide bond. The global minimum of χ2 for the cis simulations was 19.1, occurring at φ,Ψ = –70°,150°, so that the possibility of a cis Gly–Pro peptide bond in the GPGR motif cannot be ruled out from our data. Carbonyl 13C CSA principal values and resonance assignments were determined from double–quantum–filtered MAS spectra with long double–quantum preparation and mixing periods as follows: (ξ11, ξ22, ξ33) = (242.8, 197.6, 87.0) p.p.m. for Pro 320, (243.0, 171.7, 92.0) p.p.m. for Gly 321, (228.6, 189.2, 94.6) p.p.m. for Arg 322. The principal values (234.9, 179.5, 91.1) p.p.m. for Gly 319 were estimated from MAS spectra of the free peptide. N (number of data points) was 30 for 2D MAS exchange data on sample 1, 42 for 2D MAS exchange data on samples 2 and 3, 6 for CTDQFD data on sample 2, and 7 for CTDQFD data on sample 3. χ2 is the sum of the mean–squared experimental noise and the estimated mean–squared imprecision of the simulations.