Existing vaccines have been developed mostly against pathogens that show no or limited antigenic variation and that can be controlled by neutralizing serum antibodies. In contrast, the conquest of pathogens that display more variable antigens and require T-cell immunity remains elusive, requiring several gaps in our knowledge to be bridged. The advent of genomics is allowing us to address the problem of antigen variability by increasing the number of characterized antigens by several orders of magnitude and making possible the selection of antigens that are conserved and that induce antibody-mediated protection. But filling the remaining knowledge gaps will require greater understanding of the molecular nature of epitopes and the mechanisms of T-cell and mucosal immunity. These problems can be best addressed by systematic studies of antigen three-dimensional structure and further progress in our ability to program and measure the immune system's ability to protect the host.

Technological breakthroughs

For more than a century, vaccine development has followed Pasteur's principles: “isolate, inactivate and inject” the causative microorganism1. Bacterial and viral vaccines have been produced that consist of whole killed pathogens, live attenuated pathogens or purified components from pathogens (Fig. 1). Together with the wider availability of potable water, the provision of vaccines has had the most profound positive effect on the quality of public health of any measure: during the past century, these products essentially eliminated most infectious diseases causing mortality in infants and children. Vaccines against diphtheria, tetanus, polio, measles, mumps, rubella, pneumococcus, hepatitis B and meningitis (Haemophilus influenzae and serogroup C meningococcus) have reduced the incidence and mortality of these diseases by > 97–99% (ref. 2).

Figure 1: Vaccines licensed so far have all been developed by following Pasteur's principles: “isolate, inactivate and inject” the causative microorganism.
figure 1

This led to the development of killed, live attenuated and subunit vaccines. Recent technological breakthroughs have amplified the potential of the vaccines developed by Pasteur's principles by making it easier to manufacture structures mimicking killed viral vaccines (virus-like particles or VLPs) by engineering replicating and nonreplicating recombinant vectors that mimic live vaccines and by providing new forms of subunit vaccine such as conjugate and recombinant vaccines.

Technological breakthroughs during the past three decades have extended considerably the power of the vaccines produced according to Pasteur's principles, especially subunit vaccines. The first breakthrough was the advent of gene-splicing technology and the production of a recombinant hepatitis B virus (HBV) vaccine3. In this case, although a vaccine made by inactivated viral surface antigen (HBsAg) was shown to be effective in the seventies, vaccine manufacture was limited since the antigen could only be isolated from the plasma of infected people, because HBV could not be cultivated outside patients. Recombinant DNA technology allowed the production of unlimited amounts of HBsAg of the vaccine in yeast, thus solving forever the vaccine-manufacturing problem.

The second technological breakthrough was conjugate vaccines. In this case, polysaccharide subunit vaccines were known to be able to induce protective immunity, but they were not widely used because they could not engage a T-cell response and failed to work in infants. But conjugating polysaccharide antigens to proteins created T-cell epitopes that made these vaccines very effective in infants, opening the way to the development of several safe and effective vaccines against H. influenzae, Neisseria meningitidis serogroup A, C, Y, W135 and Streptococcus pneumoniae4.

A third breakthrough was the engineering of Bordetella pertussis to produce a mutant pertussis toxin that had lost all the toxic activities of pertussis toxin5,6. In this case, the recombinant DNA technologies produced a clean, nontoxic and highly immunogenic antigen. Previously, such antigens could only be derived from the pathogenic microorganism directly and detoxified by harsh chemical treatment, which often compromised the protective epitopes of the subunit vaccine.

Finally, a recent breakthrough has been the ability to produce nonreplicating, recombinant virus-like particles of many viruses7. This innovation has allowed the development of a vaccine against human papillomavirus and is likely to provide new and improved vaccines against many other viruses, including influenza and Dengue virus. Even live attenuated vaccines are being extended by a new family of replicating and nonreplicating viral vectors8.

The first gap: antigen variability

Conventional vaccinology has been very successful in developing vaccines against pathogens that do not change the vaccine-targeted antigens over time (Fig. 2). Classical vaccines such as diphtheria and tetanus, for instance, target toxin antigens that have not had antigenic drift during the past century. Other antigens with little or no antigenic drift include the surface antigens of poliovirus (where the three known serotypes have been stable for the past 60 years), the envelope antigens of measles, mumps and rubella, the HBV and hepatitis A virus surface antigens, and the polysaccharide antigen of H. influenzae type B.

Figure 2: Schematic representation showing that the success rate of vaccine development decreases with the increasing variability of the surface antigens of pathogens.
figure 2

Vaccines mentioned in the figure are selected examples of each category and not a complete list. MMR, measles, mumps and rubella; HBV, hepatitis B virus; HAV, hepatitis A virus; HIB, H. influenzae type B.

To a certain extent, conventional vaccinology has had success in targeting particular pathogens that have many variants of target-vaccine antigens and change them over time. A classical example is the polysaccharide antigens of pneumococci. Although this bacterium has >90 chemically different polysaccharide forms, most human disease is associated with only 23 variants. This made it possible to produce a very complex, but practical, 23-valent polysaccharide vaccine against the disease9,10. This approach reached its limits when conjugate vaccines were developed. In this case, it was practically impossible to manufacture a 23-valent conjugate vaccine, and thus lower valency conjugate vaccines were initially developed, primarily to cover the serotypes responsible for invasive disease in one region of the world. Thus, a vaccine might offer good coverage in, for example, the United States, but offer poorer coverage in other regions of the world.

In the long term, this solution also has its drawbacks. This is because even in countries where the seven-valent vaccine is originally protective, serotypes not covered by the vaccine slowly colonize the vaccinated population through selective pressure. In the long term, therefore, it is possible that existing serotypes in the seven-valent vaccine may have to be replaced with new serotypes occurring in the target population11,12. This is not necessarily an insurmountable problem because vaccines do effectively control the disease during the first few years after introduction. Moreover, we also have the tools to follow the epidemiology of pathogens and to design and license new vaccine formulations before they are actually needed to address the emergence of new pathogenic variants. Indeed, the development of 11-valent and 13-valent pneumococcal vaccines, which are in the late stage of development, moves in this direction. For pneumococcus, therefore, global control of the disease is possible simply by designing regionally targeted vaccines and updating their formulation every 10 years.

Vaccination against meningococcus, a bacterium with five chemically different capsular polysaccharides known as serogroup A, B, C, Y and W135, also follows a similar paradigm. When the first conjugate vaccines were developed in the early nineties, the disease in Europe and the United States was mainly caused by serogroup C and B. A decade later, however, serogroup Y became prevalent in the United States. Thus, today, although vaccines containing only serogroup C are still useful in Europe, in the United States these are insufficient and only C- and Y-containing vaccines are protective13,14.

Conventional vaccinology has also been relatively successful in developing vaccines against those pathogens that change their surface antigens every year, such as influenza. In this case, to face the antigenic variability of the influenza viruses, the formulation of the vaccine is altered every year and the population is vaccinated every year with the new formulation15,16.

It is all too clear, however, that pathogens changing their antigens faster than once a year are beyond the reach of conventional vaccinology. RNA viruses, such as HIV, are an extreme example of how in the absence of proof-editing of the genetic code a virus can generate antigenically different forms every day and bypass any attempt to make a conventional vaccine17,18.

The second gap: T cell immunity

Most vaccines licensed so far induce antibodies, which are responsible for the protection induced by the vaccine (Fig. 3). We have been successful in developing vaccines against diphtheria, tetanus, polio, H. influenzae type B, hepatitis A, hepatitis B, rabies, measles, mumps, rubella, influenza, papillomavirus, pneumococcus and meningococcus because these pathogens can be entirely prevented by appropriate amounts of immunoglobulin G antibodies in the serum19,20 (Fig. 3). In some instances, we also know that the vaccine protection seems to go beyond the protection mediated by the antibodies alone; however, we have no way to quantify it. We know, for example, that the Sabin oral polio vaccine induces concentrations of serum antibodies that are lower than those induced by the killed Salk vaccine. We suspect that the Sabin vaccine, which multiplies in the gut, is likely to induce a protective immunity that goes beyond immune protection mediated solely by serum antibodies; however, we do not know how to measure the extra protection21,22,23.

Figure 3: Schematic representation showing that the success rate of vaccine development decreases with the decreasing ability of antibodies to confer protective immunity.
figure 3

Vaccines mentioned in the figure are selected examples of each category and not a complete list. IPV, inactivated poliovirus; OPV, oral poliovirus; TB, tuberculosis; other abbreviations as in Figure 2.

Other examples where T cells are likely to contribute to immune protection are live attenuated vaccines such as varicella zoster virus24. The recently developed live attenuated vaccine against influenza provides a similar paradigm. One dose of vaccine has been shown to induce a protection superior to that induced by two doses of inactivated influenza vaccine, despite lower recorded serum antibodies25. The literature suggests that the extra protection induced by the live vaccine must be due to the cytotoxic T cells induced by the replicating virus, and perhaps to mucosal immunity at the portal of pathogen entry. Even so, no one has yet been able to provide an assay that can correlate the observed protection with an in vitro measure of T-cell or mucosal immunity. As a consequence, although regulatory agencies normally approve vaccines based on serum antibody measurements, they are not in the position to use mucosal or T-cell immunity to license vaccines.

A large body of literature suggests that cytotoxic T cells are important in protection from infectious diseases and cancer, and many vaccines are being tested to see whether T-cell–based vaccines can actually protect on their own. Despite the scientifically sound theory behind T-cell based immunity, and many clinical trials trying to prevent infection and disease by eliciting T cells alone, today we have indications and trends, but not one single solid example of an effective T-cell based vaccine. Therefore, the requirement of T cell immunity for protection is still an open question in vaccine development (Fig. 3).

In this field, the evidence that CD8+ T cells can control viral infections comes largely from HIV. Given the failure to protect using the antibody-based gp120 vaccines26,27, T-cell–based vaccines have been widely tested in nonhuman primates. The consensus emerging from these studies is summarized in Figure 4 (refs. 17,28). T-cell vaccination has never been able to prevent infection; however, it has been consistently able to reduce the peak viremia after infection and to reduce 100-fold the viral load for a prolonged period. Eventually, in most cases, variants escaping T-cell control emerge, and viral load returns to that seen in unvaccinated controls. In contrast, passive immunization with antibodies or the administration of antibody-inducing vaccines is able to prevent infection against a challenge with the virus strain used to make the vaccine, but unfortunately not against other viral strains. Although there is a lot of debate as to whether the initial reduction in viremia induced by T-cell vaccines results in a clinical benefit, the pattern of protection (Fig. 4) seems to suggest the following: antibody-based vaccines are excellent in providing protection from infection; in contrast, T-cell–based vaccines are unable to prevent infection, but may be important in controlling an established infection. Comparable data from several other pathogens, including hepatitis C virus29, back up this picture.

Figure 4: Schematic diagram of the HIV infection cycle and the predicted outcome of a T-cell and B-cell vaccination as deduced from vaccination of nonhuman primates.
figure 4

Although T-cell vaccination is not predicted to prevent infection, it is predicted to control the viral load until escape mutants arise. Vaccines inducing neutralizing antibodies are able to prevent infection but cannot cope with the antigenic variability.

Indeed, a similar conclusion can be reached for vaccines against cancer and therapeutic vaccines in general, which are based on inducing cytotoxic T cells. To create T-cell–based vaccines, several systems in animal models and humans have been recently developed to specifically induce cytotoxic T cells, including DNA vaccines, replicating and nonreplicating viral vectors (for example, adenovirus and pox-viruses), mixed regimes based on DNA priming and viral-vector boosts, bacterial vectors (for example, Listeria), and adjuvants18,30. Despite all of these technical developments, proof-of-principle that a pure T-cell vaccine is protective on its own is still lacking. And recently, the premature termination for lack of clinical efficacy of Merck's (Whitehouse Station, NJ, USA) V520 HIV prime/boost vaccine (a trivalent vaccine of gag, pol and nef) has not only cast more serious doubts on whether 'T-cell only' vaccines can be used in prevention but also challenged the validity of the results obtained in nonhuman primates shown in Figure 4. In fact, the vaccine based on adenovirus vector to deliver gag HIV T-cell epitopes not only failed to protect patients from disease, but also had little impact on viral load—a result that was somewhat expected from the studies in nonhuman primates (Fig. 4; ref. 31). It is possible that this result may represent a proof-of-concept that pure T-cell vaccines are not a solution for preventive vaccines.

In contrast, for those vaccines that are capable of inducing local mucosal immunity, we do have proof-of-concept that they can be effective and that mucosal immunity contributes to protection by neutralizing pathogens at their portal of entry. Even so, despite many successful vaccines developed in mice—and several adjuvants and delivery systems developed to specifically induce local immunity—a live oral vaccine against salmonella (T21A), a killed oral vaccine against cholera and a live attenuated intranasal vaccine against influenza represent the only mucosal vaccines currently on the market22,23,25. Although these mucosal vaccines have been shown to be efficacious in protecting from disease, we do not yet understand their mechanism of action and we do not have an assay that correlates with protection.

The first bridge: reverse vaccinology

Figure 5 illustrates vaccine development in the form of a graph where the two gaps for vaccine development (antigen variability and dependence on serum antibodies) are used as the horizontal and vertical axes, respectively. The vaccines available today, based on conserved antigens and serum antibodies, are mostly in the upper right red quadrant and extend to the left quadrant to capture some of the variable antigens such as pneumococcus and influenza. Although existing vaccines include an impressive number of diseases and have improved the quality of life, they still cover only a small portion of the space that could be addressed by vaccination.

Figure 5: Known facts and knowledge gaps in vaccine development.
figure 5

The graph is built using antigen variability and type of immunity from Figures 2 and 3 as horizontal and vertical axes, respectively. Existing vaccines are mostly in the upper right red quadrant. Moving down and left we exit the comfortable zone of the known, move into the unknown, and face increasingly difficult challenges. Reverse vaccinology, which has already shown to have the potential to address many diseases by mining the genomes to find relatively conserved antigens able to induce protective antibodies, may extend considerably the area covered by vaccines (orange). The most difficult challenges are in the lower and left part of the graph, where antigens are extremely variable and protection relies only on T cells. Addressing these diseases and expanding vaccine coverage to the area shown in yellow will require filling the gaps in knowledge by structural vaccinology, developing new adjuvants and learning how to rationally program the immune system. MenB, Meningococcus serogroup B; GBS, group B streptococcus; other abbreviations as in Figures 2 and 3.

How can we tackle those diseases that have not been addressed by conventional vaccinology? The availability of genomic sequences has allowed us to use computers to search the entire genetic repertoire for protective antigens, thus increasing by several orders of magnitude the number of antigens available for vaccine development. With many more antigens available for each pathogen, it is now possible to select those antigens that respond to validated principles, such as limited or absent antigenic variability and antibody-mediated protection. Because this approach to vaccine discovery starts with the analysis of the information contained in a computer instead of with growing pathogens, it has been named 'reverse vaccinology'32.

A classic example of genome-based antigen discovery is meningococcus B. In this case, the chemical composition of the capsular polysaccharide, which is a polysialic acid identical in structure to a self-antigen present in many human glycoproteins, prevented use of a conjugate vaccine, an approach that had been very successful against meningococcus serogroups A, C, Y and W135. Searching for alternative approaches, during the past several decades, conventional vaccinology had come up with vaccines based on detergent extracts of whole bacteria (outer membrane vesicles or OMVs). These are excellent vaccines, which can be very effective in eliminating epidemics caused by single strains (an example is the recent elimination of the New Zealand epidemic caused by meningococcus serosubtype P1. 4 (ref. 33 and http://www.moh.govt.nz/moh.nsf). But such vaccines are not effective against endemic disease, which is typically caused by strains that have huge antigenic variability in the PorA protein—the protective antigen in OMV vaccines. Until very recently, OMV vaccines were the only option and conventional approaches provided little chance of identifying protective antigens against all strains. Here, the availability of genomic sequence was truly revolutionary because it made available hundreds of new vaccine candidates and facilitated the identification within a short period of time of 29 new validated vaccine targets. The 29 antigens were then prioritized to identify those that were most effective and showed less antigenic variability, and a new vaccine was designed that is able to induce protection against the majority of disease-causing strains34,35. The vaccine is now being tested in phase 2 clinical studies.

Reverse vaccinology is now being applied to many pathogens, and thus far it has been successful in every case in providing new, valid antigens for vaccine design. A variation on the initial concept of reverse vaccinology, the pan-genome approach, is that today several genome sequences of the same pathogen may be used to capture the genomic variability of the species36,37. Other variations on the theme are the identification of new vaccine targets by screening of genome-wide libraries using antibodies from convalescent subjects38, or the direct identification of antigens by mass spectroscopy, which is now possible thanks to the knowledge of the genomic sequence39. Among the pathogens that have been tackled by reverse vaccinology are group B streptococcus, group A streptococcus, pneumococcus, chlamydia and staphylococcus. Even so, the approach is applicable to many more pathogens, including parasites, which so far have been recalcitrant targets for vaccine development40. A systematic approach to bacteria, parasites and viruses with large genomes for which a vaccine is not yet available is thus likely to provide many new vaccines and considerably expand the space covered by vaccines (Fig. 5).

The second bridge: addressing antigenic variability

Despite great progress, we still face some very tough pathogens that so far have eluded vaccine design efforts. These pathogens usually have the common property of showing to the immune system antigens of high variability, which allows escape from immune recognition. A typical example is HIV, which changes its antigenic surface daily, making it impossible for us to design a conventional vaccine. An improved vaccine against influenza with broader protection that would not require repeated immunization every year would also be desirable. These pathogens usually have two types of surface epitopes: those that are highly visible by the immune system, conventionally named immunodominant epitopes (which also have the property of being highly variable), and those that do not mutate and are common to all strains. Unfortunately, the latter are usually completely invisible or poorly visible to the immune system, and immunity against them cannot be elicited by conventional vaccinology. For example, in the case of HIV, the immune response is almost exclusively directed against variable epitopes on the surface of the gp120 envelope protein. At the same time, we do know that monoclonal antibodies against epitopes common to all HIV strains have been identified and that these antibodies (for example, 2F5, 2G12, 4E10 and b12) neutralize most HIV viruses in vitro17 and provide complete protection from infection in animal models41,42,43 (Fig. 4). Therefore, why are we unable to elicit an immune response against these epitopes held in common?

It is obvious that if we were able to generate a broadly neutralizing antibody response, we would bring the HIV field into the top right quadrant of Figure 5, and we would have the potential to solve the HIV problem immediately. The problem is that we do not understand the molecular nature of 'immunodominant' or 'immunosilent' epitopes. To our eyes, both types of epitopes are similar; nevertheless, they must look extremely different to the immune system. To solve this problem, we need to go back to basic science and try to understand the molecular nature of immunodominance. One way of addressing this problem is to determine the three-dimensional structure of many immunodominant and silent epitopes and try to understand the basic structural rules underlying them. Once we understand the rules, we should be in the position to engineer new types of immunogens capable of inducing an immune response against conserved epitopes and conquer those diseases that are still beyond our reach today. I thus believe we need to start a project for high-throughput determination of the three-dimensional structure of antigens and antigen-antibody complexes, and this project and its technology should become an intrinsic part of vaccinology. An example in this direction is the recent determination of the three-dimensional structure of the complex between gp120 and the broadly neutralizing antibody b12, which shows that the antibody recognizes seven noncontiguous conserved amino acid stretches in the CD4 receptor–binding region44.

Hopefully, for structural vaccinology45, we will see a trend similar to the one we have seen for reverse vaccinology. We started using genomes for vaccine development only 10 years ago; today, we do not start a vaccine project without knowing the genome. In 10 years' time, we should not be starting a vaccine project without knowing the structure of the relevant antigens.

The third bridge: harnessing T-cell and mucosal immunity

Vaccines that are licensed or are in late-stage clinical trials today make use of adjuvanted formulations based on aluminum salts or oil-in-water emulsions. We use them because they work and are safe; however, our understanding of their mechanism of action is basically the same as that of Glenny46, who introduced alum in 1926, or Freund, who introduced oil emulsions more than 70 years ago47. In other words, the mechanism of action of adjuvants is still a black box. Today, for the first time, we have the opportunity to address the mechanisms by which adjuvants work in vivo and to use them to program the immune response48. The combined availability of a variety of animal models with single or multiple knockout mutations in key components of the immune response and of molecular adjuvants acting as agonists of toll-like receptors or natural killer T cells should help address some fundamental questions that we need to answer to design the vaccines of the future. Typical questions are the following: first, is the function of CD4+ T cells only to help and educate B and CD8+ T cells or do such cells actually have a direct role in protection; second, are cytotoxic T lymphocytes needed at all in preventive vaccines; third, is there a role in protective immunity for B and T cells that home to mucosal tissues; and fourth, can vaccines do anything after infections have been established?

In a few instances, we have begun to get preliminary answers to these questions, but they are only hints of the avalanche of new information that we are likely to see in the near future. For example, it has been shown that in the case of Helicobacter pylori, protection can be achieved in the absence of antibody-producing B cells, suggesting a direct role of CD4+ T cells in protection. Even so, we do not know what the effector mechanisms of CD4+ T cells are and whether protection is mediated by an unknown subset of T cells49. The recent discovery of T-helper-17 (TH17) T cells indicates that there are more than just TH1 and TH2 T-cell subsets, and that to fully exploit their potential we must understand the role of each subset and learn how we can stimulate or inhibit the development of each subset by vaccination50,51. Recent findings showing that an established chronic viral infection can be eradicated by the existing CD8+ T cells simply by removing immunosuppression using an antibody to interleukin-10 that restores the natural antiviral function of CD8+ T cells52, or that herpes simplex virus reactivation is effectively controlled by CD8+ T cells that are localized to the surface of sensory nerves53, illustrate that we are starting to understand more of how T-cell–mediated immunity works and can be manipulated. Another example of how the immune system can be programmed to develop CD8+ or CD4+ T cells with defined specificity to control and eliminate established tumors is engineering lentiviral vectors to transduce hematopoietic stem cells with genes for the alpha and beta chains of T-cell receptor54,55.

This example of 'instructive immunotherapy' shows that the rules of programming the immune system are within our reach and that it is likely that during the next decade we will see a paradigm shift in vaccinology, moving from the empirical use of what works by happenstance to the rational design of an appropriate immune response56,57. This may lead to the conquest of the most difficult quadrant, shown in yellow in Figure 5. It will also not only extend vaccination to the prevention of the most challenging infections agents that we currently face—AIDS, tuberculosis, malaria—but also enable us to develop therapeutic vaccines to treat cancer, chronic infections and metabolic and neurodegenerative diseases.