Main

The gp120 and gp41 glycoproteins of the HIV envelope (Env) are targets of antibodies that inhibit virus infectivity, and attempts to induce these antibodies with a prophylactic HIV vaccine have used various approaches. The failure of early gp120 protein-based vaccines to induce neutralizing antibodies1 redirected attention to vaccine-mediated induction of the cellular arm of the immune response. The subsequent failure of a T cell-based vaccine2 left the HIV-1 vaccine field at a crossroads, with uncertainty as to the way forward. Recently, a dual-component HIV-1 vaccine, in which priming with a recombinant canarypox vector was followed by boosts with two recombinant gp120 proteins, imparted a measure of protection from infection3. Protection in this trial may have been due to the induction of an antibody response4, but the nature and specificity of the protective antibodies, and how to design immunogens that induce higher and more persistent levels of such antibodies, remain unresolved issues.

The difficulties of HIV-1 vaccine research are, in part, a result of the extreme antigenic variation of HIV-1. Conventional wisdom suggests that 'constant' rather than 'variable' regions of Env should be targeted by the immune system to elicit broad responses against antigenically diverse HIV strains. However, these regions were classified in early studies on the basis of the sequences of only a few virus strains5. Immunological and three-dimensional (3D) structural studies of Env from diverse strains of HIV-1 have now revealed the presence of higher-order structural features that would alter these early classifications and explain how antibodies specific for some variable regions have neutralizing activity against diverse viruses. Although several epitopes that induce neutralizing antibodies have been identified in conserved regions of both the gp120 and gp41 portions of Env, inducing neutralizing antibodies that target these epitopes using rationally designed immunogens has, so far, been difficult6,7,8,9. Moreover, although a vigorous antibody response is induced by natural infection with HIV-1, only a minority of individual HIV-positive patient sera neutralize a broad range of HIV strains10,11.

We think that targeting conserved elements within the immunogenic sequence-variable Env regions (Fig. 1) using rationally designed immunogens is a promising approach for vaccine design, but inducing broad, cross-strain neutralizing antibodies specific for these regions is a challenge. Nevertheless, current evidence indicates that 3D visualization might identify additional invariant epitopes that are capable of inducing protective antibodies and that are hidden within the sequence-variable Env regions. Indeed, this view provides a rational basis for understanding the documented immunological cross-reactivity of many monoclonal antibodies targeting the second (V2) and third (V3) variable loops of gp120, and the quaternary neutralization epitopes (QNEs) formed by V2 and V3 (Box 1; Table 1). In hindsight, this perspective is a logical outgrowth of classic immunochemical studies12 showing that cross-reactive antibodies recognize antigens that are related but not identical in sequence (as is the case for V2-specific monoclonal antibodies that recognize gp120 monomers derived from diverse primary isolates of HIV-1 (Ref. 13)). These old, but seminal, studies support a new paradigm for HIV-1 immunogen design.

Figure 1: Ribbon diagram of the crystallographic structure of the gp120 monomer as bound to CD4.
figure 1

a | This gp120 monomer structure represents the truncated, deglycosylated conserved core of gp120 without the variable loops. The locations of the variable region stems and bridging sheet are shown. b | Ribbon diagram of the crystallographic structure of the V3 loop of gp120 in situ from the crystal structure, indicating the base, stem and crown regions of the V3 loop. c | Zones of sequence variability mapped onto the β-hairpin conformation of the V3 crown. The crystallographic structure of the V3 crown is shown as a ribbon diagram, with commonly occurring side chains shown in stick depiction. A multicoloured illustrative cylinder is superimposed on the structure to highlight the four zones of the V3 crown: the arch (green), the hydrophilic face (red) and hydrophobic face (blue) of the circlet, and the band (yellow). Only the hydrophilic face has high sequence variability, but the amino acids that constitute the region of variability are distributed widely in the linear protein sequence, which obscures the existence of such a region unless the 3D structural context is appreciated. The side chains of the V3 crown that are bound by neutralizing monoclonal antibodies are annotated with coloured lines: monoclonal antibody 268 neutralizes only a few HIV-1 strains and engages various side chains, including one in the variable hydrophilic face of V3; monoclonal antibodies 2219 and 3074 neutralize viruses from several HIV-1 subtypes37 and avoid engagement of the hydrophilic variable zone. The image in part a was drawn using data from Ref. 17. The image in part b was drawn using data from Ref. 80. The image in part c is adapted, with permission, from Ref. 40 © (2010) Mary Ann Liebert, Inc.

Table 1 HIV-1 envelope-specific neutralizing human monoclonal antibodies

In this article, we bring together data from both the biological arena (viral and immunological studies) and the physical arena (structural and bioinformatics studies) to show how structure–function relationships of Env proteins can inform the design of an effective HIV-1 vaccine. With special emphasis on the variable regions of gp120, we highlight the relationship between 3D molecular structure, biological function and immunological cross-reactivity. This approach shows the potential utility of targeting sequence-variable regions and QNEs that are formed by the interaction of sequence-variable regions in the context of the Env trimers. Because of their sequence variability, these regions were previously dismissed as vaccine targets, but we now think that their potential use should be reassessed.

Biological functions of the gp120 regions

Antibodies that interfere with key biological functions of HIV-1 can 'neutralize' virus infectivity. Therefore, understanding the function of various parts of Env assists in determining the antibody targets that should be induced by protective vaccines.

Receptor-binding sites. It is now well established that to trigger exposure of the HIV-1 gp41 fusion domain that initiates virus–host cell fusion, gp120 must first bind to two cell-surface receptors: CD4 and CXC-chemokine receptor 4 (CXCR4) or CC-chemokine receptor 5 (CCR5)14,15. The receptor-binding surfaces on gp120 form distinct structural and antigenic regions that are highly conformational and are formed by discontinuous amino acids in several regions of gp120 (Refs 16, 17 (Box 1). The CD4-binding site consists of many residues in the constant regions of gp120 but, until recently, only one of many CD4-binding site-specific monoclonal antibodies had been described with broad neutralizing activity18. Three more have recently been selected and characterized19,20(Table 1). This suggests that the crucial region within the large CD4-binding site may be difficult to target with vaccine constructs, a fact that is supported by the disappointing results from attempts to induce neutralizing CD4-binding site-specific antibodies with various vaccine constructs6,7,9.

The chemokine receptor-binding site of gp120 consists of the invariant β2 and β3 strands in the V1–V2 double loop, the β20 and β21 strands in the fourth conserved region (C4), and residues in the V3 loop16,17. This highly conformational, discontinuous region of gp120 is often known as the CD4-induced region because, as its name implies, it is present after gp120 has bound CD4 with high affinity. Many human monoclonal antibodies specific for the CD4-induced region have been described21,22,23,24, indicating the strong immunogenicity of this region. However, when exposed on the virus particle, the CD4-induced region is sterically protected from most specific antibodies during the process of infection owing presumably to its close proximity to the cell membrane25. Consequently, antibodies specific for the CD4-induced region may not protect against infection by most HIV-1 strains.

The V3 loop. The V3 loop of gp120 is a component of the chemokine receptor-binding region, and it is an important determinant of co-receptor usage26,27,28,29,30,31. The length of the V3 loop is essentially constant at 34–35 amino acids. Although its amino acid sequence is highly variable — except in clade C viruses where the V3 sequence is generally conserved32 — sequence variation is restricted to only 20% of the amino acid positions and is located mainly in the two β-strands in the crown of the V3 loop (Figs 1b, 2d). The functional importance of V3 is shown by the fact that deletion of V3 completely abrogates virus infectivity33. Nonetheless, the efficacy of V3-specific antibodies in blocking virus infection has been controversial because some V3-specific antibodies, particularly those elicited early after infection, have limited breadth of neutralization in terms of different strains of HIV-1 (Ref. 34): many V3-specific antibodies neutralize only the most neutralization-sensitive 'Tier 1' viruses19,35, no single V3-specific monoclonal antibody neutralizes more than 10–25% of the more resistant 'Tier 2' viruses19,36,37 and the accessibility of V3 is poor on many viruses38,39. However, given the essential function of V3 in HIV infectivity, its structural conservation40,41, its strong immunogenicity42, the existence of many V3-specific monoclonal antibodies with cross-clade neutralizing activity19,36,37,43 and the demonstration that polyclonal V3-specific antibodies can be induced that neutralize viruses from diverse HIV-1 subtypes44, the V3 region remains a potentially important target for vaccine development.

Figure 2: Conserved and variable residues in the V1, V2 and V3 loops of gp120.
figure 2

a | Sequence logo depicting the amino acid conservation pattern across a multiple alignment of many V1 loops, each of which was selected because it has the most common V1 length depicted in Fig. 3a. Data used to derive the sequence logo were obtained from the Los Alamos National Laboratory (LANL), New Mexico, USA, using one sequence per patient, and all HIV-1 subtypes were included. The height of the letter indicates the degree of conservation of the most common amino acid at that position. Amino acids are coloured according to their chemical properties: small amino acids (G, P, A, S and T) are green; strongly polar, charged basic or charged acidic amino acids (K, R, D, E, Q and N) are black and hydrophobic amino acids (V, L, I, Y, W, F and M) are blue. The few amino- and carboxy-terminal amino acids of the V1 loop are reasonably conserved, but most of the loop fades to small letters, which indicate no conservation except for a run of asparagine (N) and threonine (T) residues near to the centre of the loop, suggesting a region of putative glycosylation. b | Sequence logo depicting the amino acid conservation pattern across a multiple alignment of many V2 loops, each of which was selected because it has the most common V2 length (as depicted in Fig. 3b). Sequence data were obtained from LANL (one V2 sequence per patient, with sequences from all HIV-1 subtypes included). Although the V2 loop has many more insertions and deletions than the V3 loop, it has a degree of amino acid conservation similar to that of the V3 loop when one controls for loop length as explained above. c | The amino acid positions and sites of putative glycosylation that are implicated in binding of the V2-specific monoclonal antibody 697-D and the QNE-specific monoclonal antibodies PG9, PG16 and 2909 are mapped onto a schematic illustration of the V1–V2 loop structure. Square brackets indicate more than one commonly occurring amino acid at a single position. The individual sites associated with binding of each single antibody are distributed throughout the V1–V2 primary sequence linearly but, in 3D space, must cluster into one or a few overlapping epitopes. Note that the amino acid numbering in Fig. 2c does not exactly match that in Figs 2a and 2b owing to the variation in V1 and V2 lengths. d | Sequence logo produced in the same manner as Figs 2a and 2b depicting the amino acid conservation pattern across a multiple alignment of all V3 loops from LANL sequence data (one V3 sequence per patient, with sequences from all HIV-1 subtypes included).

The V2 loop. By contrast, V2 is not essential for virus infectivity33, which suggests that the evolutionary constraints to preserve an invariant V2 structure are less pronounced than for V3. However, V2 participates in several Env functions: by providing a motif that binds to α4β7 integrin (the receptor potentially involved in the homing of the virus to gut mucosal cells)45, by contributing to Env trimer formation46 and by participating as part of the chemokine receptor-binding surface. V2 also has a crucial role in masking neutralizing epitopes of gp120: changes in V2 length and glycosylation patterns allow viruses to escape from neutralization mediated by antibodies specific for V3 and the CD4-binding site of gp120 (Refs 47, 48, 49, 50). Given these functions of the V2 loop, its newly recognized role as a component of QNEs51,52,53,54 and its partial structural conservation (see below), V2 becomes an additional potential target for vaccine development.

V1, V4 and V5. The functions of the other variable regions of gp120 are poorly defined. V1 might have functional significance as it can be a neutralizing epitope, and several potent neutralizing V1-specific monoclonal antibodies have been developed in transgenic mice55. The immunogenicity of V1 is affected by glycosylation56, and mutations in V1 can affect virus-induced syncytium formation of host cells57; however, little more than this is known about the function of V1. V4, by contrast, has no well-defined function, although it is a target for early autologous neutralizing antibodies58 and V4-specific neutralizing antibodies have been described in immunized rabbits59. V4 is involved in neutralization escape60, and it undergoes extreme variation in early infection61. The V5 region, the only variable region that does not form a disulphide-linked loop, participates in forming the CD4-binding site surface, but no polyclonal or monoclonal neutralizing V5-specific antibodies have been described; nonetheless, V5 is involved in neutralization escape, although V5 peptides could not block the activity of neutralizing antibodies in sera62. Current data, therefore, do not support the use of these variable regions as targets for HIV-1-specific antibody-based vaccines.

Trimeric Env spikes. Crystal structures of gp120 monomers have been available for more than a decade17, but the crystal structure of the trimeric Env spike is not available, although it has been modelled using monomer structures and cryo-electron tomography63,64,65. There are important consequences to the oligomerization of the HIV-1 Env protein: trimer formation is a requirement for infectivity, results in the burial of neutralizing epitopes within oligomeric interfaces and allows conformational epitope masking66,67,68. The QNEs that form after trimerization of gp120 have been defined by various human and macaque monoclonal antibodies, which are exceptionally potent51,52,53,54, a fact that reflects a crucial function of the QNEs that has not yet been defined.

Variable loop conservation

Recently, several 3D structural studies have provided important insights into the conserved structural elements in the variable loops of gp120, which have indicated potential windows of opportunity for designing immunogens expressing these conserved structures.

The V3 loop. Although the V3 loop is the least variable of the HIV-1 Env variable regions because, in part, it is the only one that is essentially constant in length, its sequence varies considerably across strains, and this variability is most pronounced in its 'crown', 14 amino acids at the tip of the V3 loop (Fig. 1b). Although the specificity of some V3-specific antibodies restricts their immunochemical and biological functions69,70, some V3-specific monoclonal and polyclonal antibodies from infected individuals can neutralize diverse HIV-1 strains19,36,43,71,72,73,74. Furthermore, some individual V3-specific monoclonal antibodies can neutralize 25–50% of Tier 1 and Tier 2 HIV-1 strains in which the correct epitope is present19,37,43,72,75; polyclonal V3-specific responses to infection can neutralize Tier 1 and Tier 2 viruses from diverse clades74. Vaccine-induced immune responses targeted to V3 can also elicit broader neutralizing activity than that seen in many human HIV-positive sera or with mixtures of human monoclonal antibodies44.

Interestingly, crystallographic structures of the gp120 V3 loop crown from different virus strains in complex with various human monoclonal antibodies41,59,76,77,78,79, and structures of the V3 loop freely arranged in situ on the core of gp120 (Ref. 80), all have a V3 crown that forms variations of a common β-hairpin tertiary structure. Furthermore, the sequence variation of the V3 loop tends to cluster into a single continuous small zone when viewed in 3D space40 (Fig. 1c). We hypothesize that the crown of the V3 loop might be organized into a folded (albeit highly flexible) globular domain, with most of its surface being structurally, and therefore potentially antigenically, invariant. This structural perspective explains both the cross-reactivity of many V3-specific antibodies, which bind the conserved surfaces of V3, and the narrow specificity of some V3-specific monoclonal antibodies, which bind the small sequence-variable zone40,41 (Fig. 1c). The discovery of conserved 3D structures in the sequence-variable V3 loop is consistent with the function of the V3 loop as a chemokine receptor-binding element: the receptor-binding surface of gp120 must be conserved to preserve virus infectivity, and these constraints apparently involve more of the amino acid positions of the V3 loop than they leave free to vary. This view of the V3 loop indicates which regions of V3 should be targeted with immunogens to elicit broadly neutralizing antibodies; it also indicates the region of V3 that induces only strain-specific antibody responses.

The V1–V2 double loop. V2 varies in both sequence and length (Figs 2, 3) and, as a result, is a much more variable region than is V3. V2 is immunogenic in 20–45% of HIV-1-infected humans13 in contrast to V3, which induces antibodies in essentially all infected individuals42. Several V2-specific human monoclonal antibodies have been generated that cross-react with gp120 molecules from diverse HIV-1 isolates, and V2-specific monoclonal antibodies have also been produced from cells of infected chimpanzees and immunized rats81,82,83; this indicates that V2 might also contain some structurally conserved elements. Interestingly, many V2-specific monoclonal antibodies interact with the same amino acid residues or with overlapping regions of V2 (Refs 13, 52, 81, 83, 84) (Fig. 2c). Published studies document the neutralizing activity of human and chimpanzee V2-specific monoclonal antibodies; although many of these antibodies have little or weak neutralizing activity, some have notable potency83,85,86.

Figure 3: Histograms showing the length distributions of the V1 and V2 loops of gp120 from recorded HIV-1 sequences in the Los Alamos National Laboratory (LANL) database.
figure 3

The histograms show the number (y axis) of recorded viruses having each V1 or V2 length plotted against the number of amino acids of the V1 or V2 loop (x axis). Loop length is plotted from zero to the maximal recorded length; so, for example, the origin shows that no viruses are recorded to have entirely deleted V1 or V2 loops (length of loop = 0).

Little is known about the 3D structure of the V2 loop, but the biological advantage conferred by the integrin α4β7-binding motif in V2 (Ref. 45) might constrain at least part of the V2 structure. This receptor-binding motif occurs in a region of V2 containing alternating and periodic conserved charged and hydrophobic amino acid residues that are typical of a folded domain. In addition, V2 contains a well-conserved disulphide bond; such bonds are common in small, loosely folded domains with high-level sequence variation in their 3D structural folds (such as the thioredoxin fold found in many proteins with known 3D structures)87. It is also notable that the carboxy-terminal portion of the stem of the V2 loop is located directly adjacent to the CD4-binding site in crystal structures of gp120 bound to CD417,80 and that some data indicate that the presence or absence of V2 might affect recognition and neutralization by the CD4-binding site-specific antibody IgG1b12 (Refs 66, 88). Further structural information is derived from a comparison of the length distributions of V1 and V2 (Fig. 3a,b). The normal distribution of V1 lengths is typical of random sampling from several optional structures and thereby suggests that V1 is likely to be a dynamic, unfolded and disordered flexible loop that alternates between many conformations. The only other structural clue available for V1 comes from the presence in the mid-region sequence of V1 (as observed in the most common length) of repetitive sites of putative glycosylation (Asn-X-Thr/Ser shown as V1 residues 136–144; see Fig. 2a).

The different patterns of length distribution of V1 and V2 reflect biological functions that suggest 3D structural order within V2 but, as noted, structural disorder within V1. Specifically, the V2 length distribution pattern has a sharp drop-off in length from the most frequent length to shorter lengths, but a more gradual drop-off as the V2 length increases, indicating a viral mechanism that has little tolerance of V2 lengths shorter than 40 amino acids. This pattern of length variation is typical of either a folded globular domain, which requires a crucial length below which the fold cannot assemble, or inter-monomer contacts, which require a crucial length below which the loop cannot reach over from one monomer to another. In both cases, the observed length distribution requires some V2 structural order. The available structural data — together with the observed cross-reactive immunological patterns of V2-specific antibodies — suggest that a similar, but more challenging, approach to vaccine targeting as envisioned for V3 might also be successful for V2, but the data also suggest that V1 is probably not a suitable vaccine target.

V2–V3 complexes in the envelope trimer. There is now evidence that V2 and V3 can, together, form complex epitopes comprised of contributions from both variable loops. These complex epitopes exist preferentially or exclusively in the trimeric form of the HIV Env spike, which defines them as quaternary epitopes (Box 1). Antibodies specific for quaternary epitopes were first described in simian–HIV89.6P (SHIV89.6P)-infected macaques; although these antibodies were difficult to characterize in polyclonal sera, their activity seemed to map to a discontinuous epitope formed by V2 and V3 present on the Env spikes of the virus89. Similar antibodies were found in an HIV-infected chimpanzee90,91. Remarkably, these antibodies were extremely potent neutralizing reagents. The first human QNE-specific monoclonal antibody to be described, 2909, is extremely potent (IC50 < 1.0 ng/ml) and targets both the V2 and V3 regions of gp120 (Ref. 51). The epitope to which monoclonal antibody 2909 binds is found only on the surface of virions and on env-transfected cells and does not exist on the gp120 monomer or recombinant soluble Env trimers52,68, which indicates that this epitope is formed only when gp120 monomers form trimers in lipid membranes. Monoclonal antibody 2909 is highly strain specific although it tolerates extensive amino acid sequence changes in V3 (Ref. 52). The strain specificity of 2909 is the result of binding to a lysine, rather than the more common asparagine, at residue 160 in V2 (Ref 52, 54) (Fig. 2b,c). So, although 2909 is narrow in its reactivity, this monoclonal antibody defined the first quaternary epitope in the HIV-1 Env protein and revealed both tolerance for extensive variation in V3 and specificity constraints imposed by a single position in V2.

Recently, several QNE-specific monoclonal antibodies have been generated from macaques infected with SHIVSF162P4 (Ref. 54). Similar to monoclonal antibody 2909, these antibodies have potent but strain-specific neutralizing activity, target V2 and V3, and are limited in terms of their breadth of reactivity by residues in V2. So, macaques and humans both make similar potent QNE-specific antibodies.

Notably, two clonally related human monoclonal antibodies (PG9 and PG16) have now been described that also target V2 and V3 and have potent neutralizing activity (IC50 < 1.0 ng/ml for many strains)53. Both of these monoclonal antibodies can be considered QNE-specific antibodies although PG9 reacts weakly with monomeric gp120 as well as with trimeric gp140 and seems to target mainly discontinuous residues in gp120 V2. Monoclonal antibody PG16, which is a true QNE-specific antibody, recognizes only the trimeric form of gp120 and interacts with discontinuous residues found in V2 and the crown of V3. Unlike human monoclonal antibody 2909 and the macaque QNE-specific monoclonal antibodies, PG9 and PG16 have broad neutralizing activity, neutralizing 73% and 79% of 162 diverse pseudoviruses, respectively. The crucial difference that distinguishes PG9 and PG16 from monoclonal antibody 2909, and accounts for the neutralization breadth of PG9 and PG16, is their dependence on the asparagine-linked carbohydrate moiety that is present in most HIV-1 isolates at position 160 in V2 (Fig. 2b,c). The neutralization breadth of PG9 and PG16 indicates that the epitopes they target are conserved and not masked in most strains of HIV-1.

The immunological reactivity and neutralizing activity of the human QNE-specific monoclonal antibodies show that they can readily tolerate extensive amino acid changes in V3 (Refs 52, 53), and that broad or narrow neutralization activity depends on V2 variation. Furthermore, the broad neutralizing activity of PG9 and PG16 shows that the quaternary epitopes they recognize are composed of conserved elements in the V2 and V3 sequence-variable regions and are some of the most accessible of all neutralizing epitopes present on the CD4-unliganded form of the trimeric Env spike. The modelled Env trimers63,64,65 provide clues for visualizing the 3D structures of V2 and V3 contacts, which probably occur within the many possible unbound forms of the Env trimer. The resolved 3D structure (or structures) of Env trimers in QNE-bound conformations ultimately should be highly informative for vaccine design.

In summary, recent data reveal conserved targets for broadly reactive antibodies in the sequence-variable V2 and V3 loops of gp120 and on quaternary structures formed by these loops in the trimeric Env. These data explain the broad immunological and functional cross-reactivity of some of the monoclonal antibodies specific for these regions and provide a framework for conceptualizing the value of such regions in vaccine design.

Structural view of epitope mapping

Although masking is often described in terms of the ability of glycans and other structural features to prevent or decrease the neutralizing activity of V3-specific antibodies19,34,38,47,92, epitope masking is a phenomenon that is common to all HIV-1 neutralizing epitopes and is the result of many factors67. For example, the 3D structure of a particular epitope can assemble by adopting the 'correct' conformation that clusters the key antibody-binding amino acid side-chains together into a single structure, but the protein may be designed to easily 'flicker' out of that conformation, which would be observed as masking of the epitope from its cognate antibody. Similarly, other protein loops or glycans might 'flicker' in to cover an epitope, which would also be observed immunologically as partial or complete masking. The result is that, although antibodies might be elicited by immunization strategies targeting epitopes that are broadly present in circulating viruses, the effective neutralization mediated by these antibodies in vivo is decreased to an uncertain extent by different types of masking.

Masking can sometimes be overcome when particular epitope conformations are stabilized by various molecular ligands. For example, in some strains of HIV-1, V3 epitope accessibility on gp120 is markedly increased by binding to CD4 and certain gp120-specific monoclonal antibodies that induce and stabilize conformational changes in gp120 (Refs 39, 93, 94). These data suggest that exposure of 'masked' epitopes could be accomplished, and effective neutralizing activity thereby achieved, by inducing 'unmasking' antibodies together with antibodies specific for the 'masked' epitope (or epitopes). Another view of masking is illustrated by some monoclonal antibodies that have different neutralizing activities despite the fact that they target similar regions of Env (Fig. 2c). Binding of V2-, V3- and QNE-specific antibodies to overlapping regions of V2 and V3 has different effects that might depend on the molecular dynamics of V2 and V3 loops in different conformations of the Env trimer and/or on the kinetics of V2 and V3 movement as they flicker between low- and high-energy configurations95.

Masking might, therefore, be a result of the functional, as well as the physical, characteristics of an epitope in any of several molecular conformations. Hypothetically, residues in V2 and V3 could form the epitope (or epitopes) recognized by QNE-specific antibodies only when V2 and V3 interact in a configuration that leads to virus infectivity. By contrast, some individual V2- and V3-specific antibodies might only recognize the epitopes on their respective loops when these loops occupy different space — that is, are not in contact with one another — as in trimeric Env spikes that are not in an infectious conformation. The epitopes that are not in contact with one another would seem to be 'masked', as these V2- and V3-specific antibodies cannot neutralize in this case.

Attempts to understand masking fully probably require interdisciplinary studies that include structural, as well as immunological and virological, elements.

Conclusions and the way forward

Although the HIV-1 Env glycoproteins use variation — in terms of both sequence and conformation — to evade neutralization, there are nevertheless regions of structural conservation that can be targeted for vaccine design. This view is supported by data showing that sequence-variable loops of HIV-1 gp120 can elicit a range of antibodies, some with narrow specificities for only one or a few strains of HIV-1, and others with broad immunological and neutralizing activities against diverse HIV-1 strains. This is particularly true of antibodies specific for V2, V3 and the QNEs formed by these two regions of gp120. The immunological cross-reactivity of many of the monoclonal antibodies specific for these epitopes can now be explained in terms of the structural conservation of 3D shapes within these sequence-variable loops and in the context of the functional requirements that limit the quality and quantity of sequence variability. This synthesis of the structural, functional and antigenic conservation of sequence-variable regions of gp120 provides a paradigm for the rational design of recombinant immunogens: it shows how form must follow function, it recalls how form relates to antigenicity and it suggests how immunogens can be rationally designed to focus the immune response towards the portions of structures that elicit broadly reactive antibodies and away from those regions that elicit antibodies with narrow reactivity. Such rationally designed immunogens will be modelled on the detailed 3D structures of epitopes that are recognized by families of cross-clade and broadly neutralizing antibodies, targeting conserved structural features and eliminating or replacing those elements that narrow the antibody specificity.

New techniques have led to a large increase in the number and specificities of monoclonal antibodies that can be derived from patient specimens19,20,51,53,73. Other newly discovered epitopes and as yet unidentified epitopes recognized by serum antibodies with HIV-1 neutralizing activity10,73,74,96,97,98 will provide additional vaccine targets beyond those that have been described here. Using structural and computational biology to rationally design recombinant epitope–scaffold immunogens based on neutralizing epitopes in the variable regions of Env has the potential of developing immunogens that elicit antibodies with greater specificity and potency than are currently achieved when various forms of gp120 and/or gp41 are used. Indeed, until recently, there has been limited success in inducing broadly reactive polyclonal neutralizing antibody responses that target selected epitopes in gp120 and gp41 (Refs 6, 7, 8, 9 and summarized in Ref. 99). However, focusing the immune response on neutralizing epitopes through the use of epitope–scaffold vaccines designed to present conserved structural elements to the immune system and to eliminate those epitopes that narrow antibody specificity is beginning to yield modest successes44. These techniques hold the promise of constructing immunogens that elicit polyclonal antibody responses specific for particular chosen epitopes of HIV-1 with finer specificity and greater potency than is currently achieved.