The phylogenomics of evolving virus virulence


How virulence evolves after a virus jumps to a new host species is central to disease emergence. Our current understanding of virulence evolution is based on insights drawn from two perspectives that have developed largely independently: long-standing evolutionary theory based on limited real data examples that often lack a genomic basis, and experimental studies of virulence-determining mutations using cell culture or animal models. A more comprehensive understanding of virulence mutations and their evolution can be achieved by bridging the gap between these two research pathways through the phylogenomic analysis of virus genome sequence data as a guide to experimental study.


Despite many advances in our understanding of virus evolution, how virulence evolves in a virus, particularly following a jump to a new host species, continues to be contentious. Will a virus become more or less virulent in a new host? What level of virulence is optimized by natural selection and why? Is there any consistent association between host-jumping and virulence such that predictions can be made about how virulence might evolve following emergence? Not only will the answers to these questions reveal fundamental aspects of virus biology, but they also may assist in infectious disease management and mitigation, particularly as humans, other animals and plants face a continual threat from emerging viruses.

The term ‘virulence’ has different meanings depending on context, can be assessed in a variety of ways and is often only an operational measure1. To be as general as possible, we assume a simple working definition of virulence: the harm caused by pathogen infection, particularly in terms of host morbidity and mortality. Virulence is also a complex trait determined by a combination of pathogen, host and environmental factors. Although it is obviously necessary to understand all three, we focus on the pathogen (virus) component as this is the most tractable, with the small genomes and rapid replication and evolution of viruses facilitating comparative and experimental studies, and because there is strong evidence for heritable virus genetic variation for virulence2. Although we purposely focus on the analysis of virus genomes, genome-wide association studies (GWAS) promise to open up new ways to explore the evolutionary impact of viral infections on host genomes3 and hence the intimate interaction between host and virus4.

Another complexity in studies of virulence evolution is that metagenomics is increasingly showing that mixed (that is, polymicrobial) infections are commonplace5, and even seemingly healthy hosts can carry multiple microorganisms of the kind often thought to be pathogenic6,7. Determining which microorganism is the cause of a particular disease syndrome can be troublesome, and it is possible that overt illness might result from synergistic interactions between multiple microorganisms that overwhelm the host. Hence, the model of one pathogen–one disease that has dominated studies of human infections, and implicitly models of virulence evolution, may be overly simplistic. It is also possible that measurements of relative virulence vary between humans and wildlife populations. For example, whereas dengue is considered an important infectious disease of humans, the virulence of dengue virus is low in terms of overall mortality in humans, and infections of equivalent severity may go unnoticed in wildlife, particularly as there is a strong sampling bias towards the most virulent presentations.

Virulence evolution in viruses has traditionally been studied from one of two separate research paths — the theoretical and the empirical — that have largely been pursued independently. Although theory and empiricism have each generated important and parallel insights, they have each been able to paint only a partial picture of virulence evolution. Few attempts have been made to bridge this divide8,9.

There is now a large body of long-standing evolutionary theory that considers what level of virulence maximizes pathogen fitness under variable conditions, such as differing modes of transmission, levels of co-infection, selection pressures, and both within and between hosts10. Although of great value, a drawback is that this work is unavoidably based on a small number of case studies, the most famous of which is the co-evolution of myxoma virus (MYXV) and European rabbits following the release of MYXV as a biological control11. However, the insights from examples such as MYXV may be insufficient to adequately inform theoretical models confronted with novel, real-world emergence events.

By contrast, empirical studies involve laboratory-based methods to identify the mutations that affect virulence (that is, virulence determinants), usually on the basis of a combination of reverse genetics and cell culture and/or animal models8,9,12,13,14,15,16. These studies are often very successful in pinpointing causal mutations (see Table 1 for illustrative examples) and are commonplace following the emergence of a new disease. However, because the mutations identified through experimental studies are not considered in an evolutionary context, their relevance for general theories of virulence evolution is usually ignored. In addition, in vitro methods may not reflect real-world selection pressures, there may be little consideration about how virulence mutations affect inter-host transmission, and animal models commonly differ from the species infected in the field. For example, virulence determinants in MYXV identified on the basis of in vitro studies and mouse models have often not been upheld in reverse genetic experiments using the natural rabbit host17. Similarly, despite the regularity of their use, there has been a long-standing debate over the validity of ferrets as accurate models for human influenza18.

Table 1 Examples of virulence determinants in viruses

Bridging the gap between the theoretical and empirical approaches would bring a new impetus to studies of virulence evolution. In this Review, we outline how this can be achieved within a phylogenomics framework. We show that virus phylogenies are being increasingly used to help identify virulence determinants and that the data obtained can be used to test general theories of virulence evolution. Such a phylogenomic approach to studying virulence evolution is timely because of the rapidity with which virus genome sequence data are now being generated, including during ongoing disease outbreaks of emerging viruses19,20,21, and because of the development of new phylogeny-based methods for studying and visualizing genomic data22,23,24. However, the success of this approach also requires that phylogenomic data are combined with relevant clinical, epidemiological and experimental metadata so that a direct link can be made between virulence, virus genotype and phenotype, and population fitness.

Disease emergence and virulence evolution

Arguably the most interesting context of virulence evolution is following a host jump as this sits at the heart of virus emergence, and the question of how virulence will evolve is commonly asked following the appearance of a new virus or the emergence of an existing virus with altered host range (Box 1). To make meaningful inferences, it is important to compare virulence in both the reservoir (that is, donor) and novel (that is, recipient) host species. Although this may sound straightforward and is tractable in some cases25, in reality it faces a number of difficulties. In many cases, including common infections such as hepatitis C virus26, as well as emerging infectious diseases such as Zika virus (ZIKV), the reservoir species is unknown or is at best uncertain. Even if a reservoir species is known, we generally know little, if anything, about virulence in that species, and there is likely to be an ascertainment bias towards the most virulent cases. For example, although species of fruit bats appear to be reservoirs for Ebola virus (EBOV)27, little is known about its virulence in these animals28, as is true of many wildlife infections. The identification of host species may also change with better sampling, which is invariably poor in wildlife. For example, it was long thought that the canine parvovirus (CPV) that emerged in dogs in the late 1970s had jumped from cats infected by a closely related virus29. However, more recent sampling of wild carnivore species has shown this to be incorrect, such that the true reservoir species for CPV is unclear30. Therefore, it is crucial to understand disease processes, including virulence, in reservoir hosts under natural conditions, which will require more detailed studies of animal ecology.

Although we currently know little about virulence in reservoir species, comparative data tell us that, on average, low-virulence infections have a greater chance of successfully establishing transmission cycles in humans than viruses with higher mortality31. This greater chance is presumably because high virulence requires a greater supply of susceptible hosts during the early stages of emergence.

Theories of virulence evolution

Evolutionary biologists have had a long fascination with virulence32,33,34,35,36,37. Because there is a very large literature base on this subject, we necessarily provide only a brief overview here. A straightforward interpretation of virulence evolution is that natural selection will optimize the level of virulence that maximizes pathogen fitness, expressed as the basic reproductive number (R0)1, although in reality fitness is shaped by a complex set of host–pathogen interactions38,39. Current evolutionary theory tells us that when a virus jumps to a new species, its initial virulence can vary from asymptomatic to highly pathogenic, and precisely where it lies on this virulence spectrum is difficult to predict. However, it is possible that the direction of virulence evolution can be anticipated, at least in part, if the key relationship between virulence and transmissibility, and hence fitness, is understood. Importantly, there is also evidence from insect viruses that host phylogeny is able to predict some aspects of virulence evolution following species jumps, with related host species tending to have similar levels of virulence25.

A commonly stated idea is that there is often an evolutionary trade-off between virulence and transmissibility because intra-host virus replication is necessary to facilitate inter-host transmission but may also lead to disease, and it is impossible for natural selection to optimize all traits simultaneously. In the case of MYXV, this trade-off is thought to lead to ‘intermediate’ virulence grades being selectively advantageous: higher virulence may mean that the rabbit host dies before inter-host transmission, whereas lower virulence is selected against because it does not increase virus transmission rates. A similar trade-off model has been proposed to explain the evolution of HIV virulence40. However, many doubts have been raised about the general applicability of the trade-off model35,41,42,43, virus fitness will be affected by traits other than virulence and transmissibility39,41,44, contrary results have been observed in experimental studies45 and relatively little is known about evolutionary trade-offs in nature. For example, in the case of the second virus released as a biocontrol against European rabbits in Australia — rabbit haemorrhagic disease virus (RHDV) — there is evidence that virulence has increased through time, probably because virus transmission often occurs through blow flies that feed on animal carcasses, making host death selectively favourable46. Similarly, experimental studies of plant RNA viruses have shown that high virulence does not necessarily impede host adaptation47 and, in the case of malaria, higher virulence was shown to provide the Plasmodium parasites with a competitive advantage within hosts48.

Other factors in addition to evolutionary trade-offs can shape the level of virulence in an emerging virus. For example, ‘short-sighted’ virulence evolution within a single host may be detrimental for inter-host transmission49, and newly emerged ‘spillover’ infections that have experienced only a limited number of transmission events are likely to have virulence levels that have not yet been optimized for transmissibility by natural selection50. Accordingly, for spillover infections, ongoing transmission may be largely at the mercy of random drift effects, including the severe population bottlenecks that routinely accompany such events51. Finally, it is possible that virulence may sometimes simply be a coincidental by-product of selection for another trait or selection for transmission in another species.

Theory therefore tells us that natural selection can increase or decrease pathogen virulence, depending on the particular combination between host, virus and environment1,32,33,37,41,52,53. Although providing a useful framework, theory can provide only useful generalities because the relevant factors vary substantially and need to be assessed on a case-by-case basis. Virulence evolution could, however, be better understood if its genomic basis were known.

Phylogenomics for assessing virulence evolution

Phylogenetic studies of viruses, including those that consider whole-genome sequences, are commonplace and are often used to understand a variety of aspects of virus evolution. In particular, virus phylogenies are being increasingly used to understand the evolution of key phenotypic traits such as virulence9 (and see the examples described below). Phylogenomics provides an informative way to help understand virulence evolution and establishes a set of hypotheses that can be tested using appropriate experimental assays8,9. We also believe that phylogenomics provides valuable information on how natural selection acts on virulence and can be used to test general models of virulence evolution, thereby providing a key link between theoretical and empirical approaches to studying virulence evolution. The crux of this approach involves mapping mutations onto phylogenetic trees of viruses sampled within and/or between disease outbreaks and from reservoir and novel hosts. The phylogenetic location of these changes — whether they fall on shallow or deep nodes (branches) and/or singularly or in parallel — makes it possible to infer, at least in broad terms, the selection pressures acting on virulence mutations and from this infer important aspects of virulence evolution (Box 1; Fig. 1).

Fig. 1: Phylogenomics of virulence evolution.

a | A model phylogeny with virulence determinants mapped to a fairly deep node suggesting that higher virulence has increased virus fitness. b | A model phylogeny with virulence traits mapped to shallow nodes suggesting that higher virulence reduced pathogen fitness so that viruses with these mutations are purged from the population or require compensatory mutations. c | A model phylogeny with a high-virulence mutation arising multiple times independently owing to parallel or convergent evolution. The occurrence of parallel/convergent mutations that occur more frequently than by chance8 is likely to reflect adaptive evolution (Fig. 2). d | The relationship between virulence, fitness and host jumps. A virus is assumed to be at a fitness peak (high R0), in this case high virulence, in the reservoir host, so that the mutations determining both virulence and host range are expected to be subject to strong purifying selection (for example, a low value of dN/dS). As the virus emerges in the new recipient host, it will initially be maladapted (that is, reside in a fitness valley) and subject to genetic drift as the population is small. As it adapts to the new host, virulence will be selectively optimized (in this case declining), increasing R0 and resulting in positive selection (for example, dN/dS > 1, although other measures of selection pressure are available). Once the virus becomes adapted to the new host, the virulence determinants are again subject to purifying selection.

The greater the fitness of a virulence determinant, the more rapidly it will spread through the virus population and the deeper it will fall on a virus phylogeny (that is, closer to the root of the tree), including on the branch linking reservoir and novel hosts. Of particular importance are repeated occurrences of the same mutation falling on deep branches across multiple outbreaks, or multiple cross-species transmission events, as both parallel evolution and convergent evolution can be signatures of adaptive evolution8,54,55,56,57 (Fig. 1). For example, in the case of West Nile virus (WNV), a single mutation in the virus helicase protein repeatedly evolved in high-mortality outbreaks in birds, which is indicative of a selective advantage58. Similarly, the reversion to virulence in oral polio vaccine (OPV) strains of poliovirus has been associated with extensive parallel evolution8, and parallel evolution was also associated with host-specific adaptation in experimental studies of cross-species transmission involving Drosophila virus C59. Because adaptive evolution has been at play, phylogenies of the sequence in question may have a characteristic shape24, and the sequences associated with selected branches may also contain genomic signatures indicative of positive selection, such as the rapid fixation of amino acid changes or an increased rate of nonsynonymous to synonymous substitutions per site (ratio dN/dS)8,60. In the case of frequent parallel or convergent evolution for specific virulence mutations, it is also possible that the amino acid sites involved will have signatures of positive selection, such as an elevated dN/dS (as was the case in WNV; see below).

Following the same logic, mutations that fall on shallow branches in virus phylogenies (that is, closer to the tips) are present in a smaller proportion of the population and are therefore more likely to be of lower fitness such that they may be removed by purifying selection. Hence, virulence-determining mutations that repeatedly fall on tip branches alone are likely to inhibit some other aspect of pathogen fitness, thereby reducing R0 at the population scale.

Although this approach has a solid theoretical and empirical basis61,62, a complicating factor is that a virulence-determining mutation that has very recently emerged will necessarily fall towards the tips rather than on an internal branch even if it is selectively advantageous. Similarly, although popular, dN/dS measures are less robust over short timescales, such as during outbreaks, because mutations may not have reached fixation by positive selection or had time to be purged by purifying selection, and it can be difficult to detect selected mutations that occur only once63,64. Approaches to detect positive selection that do not rely on dN/dS, such as those based on tree shape24, or tracking mutations that are increasing in frequency compared with those thought to be evolving neutrally64,65, may therefore add analytical power.

The phylogenetic mapping of virulence mutations can proceed in two ways depending on the extent of a priori knowledge. In a ‘top-down’ approach, in which virulence determinants are unknown, a virus phylogeny is inferred, mutations are mapped onto this phylogeny and the mutations on key branches are then identified. Such ‘key branches’ include those directly associated with cross-species transmission events, invasions of new geographic areas, increases in rates of transmission, spikes in morbidity and/or mortality or clear instances of positive selection. The mutations identified in this way are candidates for virulence determinants that can be tested in an appropriate experimental framework8. An example of this approach is shown in Fig. 2. The second, ‘bottom-up’, approach utilizes existing knowledge of virulence determinants, such as that determined by an experimental study. The putative virulence determinant is then mapped onto the phylogeny, and its phylogenetic location (that is, deep or shallow branch, singular or parallel/convergent evolution) is used to infer how it affects virulence evolution, whether it is associated with reciprocal mutations that reflect evolutionary trade-offs and the selection pressures it faces.

Fig. 2: Example of how phylogenomics can guide the experimental analysis of virulence determinants.

The evolution of virulence in strains of oral polio vaccine (OPV)8. OPV is an attenuated form of poliovirus that can occasionally revert to a virulent form and cause outbreaks of poliomyelitis. A | Phylogenetic analysis of OPV strains in nature reveals that some mutations associated with high virulence have experienced more frequent parallel evolution than expected by chance (and occupy well supported nodes) and hence are likely to be seletively favoured8. B | Computational evolutionary analysis then reveals that this parallel evolution for high virulence is associated with a hypothetical threonine-to-proline (T-to-P) amino acid change that is subject to significant adaptive evolution (which can be detected in a variety of ways)60,63. C | The virulence impact of these mutations is then confirmed in both in vitro (cell culture; part Ca) and in vivo (mouse; part Cb) experimental studies. In all cases, the red shading signifies increased virulence.

Although this phylogenomic approach is being increasingly used to identify virulence determinants, and we discuss a number of real data examples below, it can be used to make general statements about the nature of virulence evolution. Specifically, a virulence mutation that falls deep in the phylogeny such that it is inherited in all subsequent branches, and one evolving in parallel or with evidence of positive selection, necessarily implies that virulence is selectively advantageous. Conversely, a virulence determinant that occurs sporadically on shallow branches and is subject to strong purifying (negative) selection suggests that virulence is not directly beneficial, probably because it inhibits some other component of overall fitness. In such cases, each instance of high virulence may represent an independent and transient evolutionary event. If only a single mutation is associated with a change in virus virulence, as in the case of WNV, then this change in virulence is likely to be selectively advantageous without an evolutionary trade-off with transmissibility, as a reduction in transmissibility would probably need to be compensated for by additional reciprocal mutations located elsewhere in the genome. Hence, if multiple mutations fall on a branch associated with a change in virulence, it is possible that some are virulence determinants and the others are associated with evolutionary trade-offs on other traits.

Although we have described it in terms of emerging viruses, this phylogenomic approach can, in theory, be applied to any system in which a phylogeny can be inferred and in which it is possible to experimentally assess the impact of individual mutations on virulence. Similarly, it can be used to study other virological traits associated with disease emergence, particularly host range. For example, the repeated evolution of the same amino acid changes following the cross-species transmission of avian influenza virus to humans strongly suggests that they directly affect host range66, and a similar approach has been used to elucidate the nature of the evolutionary arms race between viruses and their hosts67,68.

Critically, however, the approach described here should also be considered an idealized one that works best when a limited number of genomic mutations act independently to shape virulence. Virulence determinants may be harder to identify when there are more complex interactions between mutations9, which appears to be true of MYXV (Box 2). Although epistasis is likely to be commonplace in RNA viruses69, little is currently known about whether virulence mutations interact epistatically70. Similarly, this approach may work best for RNA viruses because their constrained genome sizes mean that there are probably a limited number of virulence determinants, increasing the likelihood that they are subject to parallel and/or convergent evolution, and rates of recombination (which complicate phylogenetic relationships) are often fairly low within species71.

Examples of virulence evolution in nature

To illustrate how a phylogenomic approach can shed light on the evolution of virus virulence, we now briefly outline a number of cases in which it can be or has been applied. We begin by considering cases in which virulence determinants have been successfully mapped (WNV and avian influenza A virus (AIV)), move on to those in which revealing the mutations that underpin changes in virulence has been more complex (MYXV, Marek’s disease virus (MDV) and HIV) and end by examining virulence evolution in two recent disease outbreaks (EBOV and ZIKV). When possible, we also outline what the phylogenomic analysis in each case has told us about the evolution of virulence in general.

West Nile virus

In 1999, a new lineage of WNV became the leading cause of arthropod-borne viral encephalitis in humans and horses in North America, spreading from east to west across the continent72 and causing severe mortality in many bird species, particularly the American crow73. Phylogenomic analysis revealed that a single Thr249Pro (T249P) amino acid substitution in the virus NS3 helicase protein was associated with high-virulence WNV outbreaks in corvids on multiple continents. Experimental analysis in captive crows then showed that this mutation was sufficient to explain the high fatality rates in American crows, perhaps because it increased the rate of virus replication58. WNV therefore provides an important example of where a single genetic switch controls virulence, which is obviously the easiest scenario to detect using a phylogenomic approach. Of more general importance was that T249P evolved in parallel and experienced an elevated rate of nonsynonymous change, suggesting that high virulence was selectively favoured in the absence of an evolutionary trade-off as no reciprocal mutations were observed elsewhere in the viral genome58. However, because WNV infects a variety of bird species, it is possible that the repeated appearance of T249P in fact reflects aspects of viral evolution in different hosts. In particular, American robins may have been responsible for a substantial proportion for the cross-continent spread of the virus74, in which case T249P may have been selected to increase replication (transmissibility) in that species, a coincidental by-product of which was heightened virulence in crows.

Avian influenza A virus

Ever since the emergence of the highly pathogenic H5N1 subtype of influenza virus, there has been concern over whether this influenza virus could establish sustained transmission in humans, in which it causes only sporadic spillover infections at present75,76. More recently, highly pathogenic H7N9 has sporadically infected humans77 and continues to spread through poultry populations in China78, evolving from a low-virulence ancestor79. Although the true number of human cases, and hence accurate mortality, is difficult to ascertain, it is clear that both H5N1 and H7N9 cause fairly high mortality in humans and could have serious consequences were they to trigger a large-scale human epidemic. This concern has led to attempts to use genomic data to help in pandemic risk assessment80.

At the virus subtype level, the presence of a run of polybasic amino acids in the hinge region between the HA1 and HA2 subunits that make up the haemagglutinin (HA) protein of influenza virus helps it establish a systemic, and subsequently more serious, infection and thereby acts as a useful marker of high-virulence strains of the H5 and H7 AIV subtypes79,81,82. This marker makes it relatively easy to distinguish between potentially low-virulence and high-virulence AIVs, although what triggers the evolution of the high-pathogenicity variants in these subtypes is unclear83. Other individual amino acid changes, affecting a variety of gene functions, have also been proposed as specific virulence determinants for H5N1 (refs84,85,86) as well as in those viruses that circulate in human populations such as seasonal H3N2 (ref.87), the H1N1 virus responsible for the global pandemic of 1918–1919 (ref.88) (in which host inflammatory and cell death responses to infection appear to play a key role)89, and influenza B virus90 (Table 1).

The key unresolved question is how natural selection will shape both virulence and transmissibility if an AIV-like H5N1 or H7N9 virus is eventually able to develop sustained transmission in humans. An added complexity is that phylogenomic analyses reveal a consistent set of mutations that distinguish human and avian influenza viruses, although whether these affect host range alone, or both host range and virulence, is unclear66,84.

Myxoma virus

The canonical study of virulence evolution following a species jump is MYXV in European rabbits, with a body of classic work undertaken by Fenner and colleagues91,92,93 (Box 2). In both Australia and Europe, highly virulent strains of MYXV were used as a biological control against the European rabbit population, with releases beginning in the early 1950s. In both continents, the same trajectory of virulence evolution was observed: virulence declined from the highly virulent (that is, grade I) release strains to encompass a far wider range of virulence grades, including the most attenuated grade V strains, with strains of ‘intermediate’ virulence the most commonly sampled in the field. This pattern, reflecting a combination of the virus evolving more attenuated strains and the host developing resistance, fuelled the idea of a trade-off between virulence and transmissibility.

Sixty years after the initial release of MYXV, the first large-scale genomic studies of its spread were performed (Box 2). Phylogenomic analysis revealed that the virulence phenotype has changed on a regular basis94. However, a major surprise was that each change in virulence was associated with a different set of mutations across multiple genes94,95. Although which mutations had the greatest impact on virulence is still unclear and requires further experimental analysis, such a phylogenomic pattern indicates that there are multiple routes to achieving the same levels of virulence, including attenuation, such that there has been convergent evolution for phenotype but not genotype. It is likely that this evolutionary flexibility in part reflects the fairly large genome size of MYXV (a double-stranded DNA virus of ~160,000 bp), which may mean that there is a large number of potential virulence determinants that can interact through epistasis96, in turn complicating any phylogenomic analysis.

Marek’s disease virus

Whether ‘imperfect’ (that is, ‘leaky’) vaccination against infectious disease, in which disease symptoms are reduced but there is less impact on virus replication and transmission, will change the selection pressures acting on the pathogen and affect virulence evolution has been the source of debate97,98. Although still contentious, particularly in the case of human disease, there is good evidence that imperfect vaccination has increased virulence in the case of MDV, a DNA herpesvirus that poses a major problem to the poultry industry99. In the 1960s, the appearance of virulent MDV strains forced the development of the first generation of Marek’s disease vaccines. However, because these vaccines were imperfect, ‘very virulent’ MDV began to appear within 10 years, necessitating a second-generation vaccine. This very virulent MDV was followed, more rapidly, by the appearance of ‘very virulent plus’ MDV, requiring a third-generation vaccine (Fig. 3). Imperfect MDV vaccines enhance virulence by elongating the infectious periods and hence transmission potential of virulent strains that would have been removed by natural selection before transmission in the absence of vaccination99. Although the genomic basis to MDV virulence evolution is currently uncertain, with some causative amino acid changes proposed100, initial phylogenomic studies suggest that, as in the case of MYXV, there are multiple genetic pathways to high virulence101 (and which again may reflect the fairly large size of the viral genome). Not only does virulence evolution in MDV have important implications for vaccination strategies against other diseases in which vaccine efficacy is fairly low102, but it also shows that in some circumstances increased virulence can be selectively advantageous.

Fig. 3: Evolution of virulence in the context of imperfect vaccination.

In the 1960s, a vaccine was developed for Marek’s disease virus (MDV) of chickens present on poultry farms. This imperfect vaccine reduced disease symptoms but did not prevent virus replication, thereby extending the infectious periods, and hence potential for transmission, of virulent strains that would have been removed by natural selection before transmission to a new host in the pre-vaccine era99. Because of this, ‘very virulent’ MDV began to appear within 10 years, necessitating the development of a second-generation vaccine that was also imperfect. This was followed, in an even shorter period, by the appearance of ‘very virulent plus’ MDV, requiring a third-generation vaccine. Although the genomic basis of MDV virulence is currently unknown, the phylogenies at the bottom of the figure hypothetically assign virulence to multiple causative mutations (as in the case of myxoma virus). The dashed arrows indicate the evolution of viruses to the next virulence grade.


Given the importance of HIV to human health and that it ignited much of the research on disease emergence, it is no surprise that there has been considerable discussion on the evolution of HIV virulence103,104,105,106,107. Indeed, it is striking that HIV in humans is markedly more virulent than the closely related viruses that naturally infect non-human primates in Africa (Box 1). Although there have been suggestions that HIV has begun to evolve reduced virulence108, discussions of the trajectory of virulence evolution are necessarily complicated by the fact that antiviral therapy has greatly extended life expectancy.

HIV virulence is often approximated as the degree of variation in the set point viral load (SPVL) that is established soon after initial infection104. The higher the SPVL, reflecting greater levels of virus replication, then the more rapidly the patient will progress to AIDS in the absence of antiviral therapy, although other studies have suggested that the replicative capacity of the virus itself is a more informative marker of virulence and is also a direct measure of virus fitness109. Indeed, some ‘controller’ individuals are able to control levels of HIV in the absence of antiviral therapy, and it has been shown that this is in part due to infection with viruses of reduced replicative capacity110.

Importantly, viral genetic variation may play a more important role in shaping HIV virulence than host factors, with approximately one-third of the observed variability in SPVL assigned to virus factors111 and only ~13% seemingly due to the host112. This observation also implies that SPVL, and hence virulence, can be selectively optimized113. In support is evidence that SPVL, and hence virulence, has declined in some African HIV subtypes, even accounting for the use of antiviral therapy, and that this reflects a trade-off between virulence and transmissibility114. Importantly, however, despite many studies into the determinants of HIV virulence, the virus genomic mutations responsible for determining SPVL are still uncertain and multiple genes may be involved104. The difficulty in assigning the genetic determinants of SPVL may be in part due to genetic variation across viral populations111. For example, heritability in SPVL was highest (~60%) between individuals in the Swiss HIV cohort, which also represents the most homogenous viral population113.

Ebola virus

The 2013–2016 outbreak of EBOV (Makona variant) in West Africa was the largest and longest described in humans since the first description of the disease in 1976, with approximately 29,000 cases and some 11,000 deaths. In addition to hindering attempts at disease control, this elongated period of transmission in humans may have resulted in different selection pressures from those faced in the animal reservoir. This outbreak also raised key questions about virulence evolution, particularly whether natural selection would have favoured EBOV variants causing higher or lower human case fatality rates had the virus not been stamped out by public health intervention115.

Phylogenetic analysis of EBOV during the 2013–2016 outbreak revealed an Ala82Val (A82V) substitution in the virus glycoprotein to be of particular importance70,116. A82V is notable as it falls on a deep internal branch of the EBOV phylogeny, compatible with adaptive evolution, whereas other amino acid changes are associated with individual or only small clusters of sequences. Moreover, A82V improves binding to the human NPC1 receptor utilized by EBOV117, which would increase infectivity in humans, at the same time reducing infectivity in cells from the bat reservoir species70,116. Intriguingly, the appearance of A82V on the EBOV phylogeny is associated with two key epidemiological features: an increase in case numbers and an increase in mortality (Fig. 4). If there was indeed an increase in both EBOV transmissibility and virulence, then higher virulence is likely to have directly increased viral fitness, and in the absence of an evolutionary trade-off as only a single substitution was identified. However, these apparent changes in phenotype also coincided with the movement of the virus from Guinea to Sierra Leone, such that any change in case numbers and mortality could in fact be due to a change in epidemiological factors (such as access to health care or differing human demographics and/or transmission networks), and recent studies using animal models suggest that A82V has no direct impact on virulence118.

Fig. 4: The relationship between host adaptation and the evolution of virulence in Ebola virus.

A model Ebola virus (EBOV; Makona variant) phylogeny illustrates the evolution of a single amino acid substitution (glycoprotein, A82V) that is associated with viral adaptation to the human host during the West African EBOV outbreak of 2013–2016. A82V improves binding to the human NPC1 receptor utilized by EBOV, increasing infectivity in humans (red) while simultaneously reducing infectivity in cells from the bat reservoir species (blue)70,116. Maps above the phylogeny show the spread of EBOV over the timeline of the outbreak in the three affected countries in West Africa, where blue-shaded regions correspond to the wild-type virus variant (A82) and red-shaded areas correspond to mutated virus variant (V82). It is possible that A82V was also associated with an increase in both EBOV case numbers and mortality (that is, virulence) as the outbreak progressed, such that increased virulence is directly selectively advantageous, although this is confounded by epidemiological factors.

Zika virus

ZIKV is the most recent emerging virus to lead to a major public health scare and is puzzling because a seemingly benign virus suddenly increased in virulence, causing severe neurological disease in humans. Before 2007, there were fewer than 20 human cases of ZIKV reported and all were mild infections restricted to Africa and Asia9,119. Consequently, neither the disease caused by ZIKV nor the molecular determinants of ZIKV virulence were well characterized, and it is likely that there was systematic under-reporting of infections, including those associated with severe disease. In 2007, the Pacific Islands reported the first major outbreaks of ZIKV before the virus spread to the Americas in 2014. Although the majority of human infections range from asymptomatic to mild, the virus was associated with the neurological Guillain–Barré syndrome in French Polynesia in 2013 (ref.120), and those cases from the Americas, particularly Brazil, were linked to more severe diseases, including congenital abnormalities such as microcephaly121. Phylogenetic analysis revealed that the most recent Zika epidemics are due to the Asian lineage of ZIKV, rather than the African lineage122, and that the virus spread cryptically in Brazil for at least a year before its detection20. Although there are multiple amino acid differences between the African and Asian lineages122,123, it has been claimed that those in the Asian lineage that spread through the Americas may be directly linked to both increased infectivity in Aedes aegypti mosquitoes124 and microcephaly, most notably a Ser139Asn (S139N) amino acid change in the PrM protein125. However, there is still considerable uncertainty in this area, with others arguing that viruses from both lineages can cause neurovirulence but that cases often go unreported9,126. Hence, the case of ZIKV highlights the difficulty in assessing virulence evolution within a background of sparse and biased sampling even with phylogenomic data and shows the importance of collecting reliable, real-time epidemiological data even in low-incidence situations.

Future directions and recommendations

Virulence evolution has been one of the longest-standing issues in evolutionary biology. Although a strong body of theory has been developed, there are few cases in which we understand the forces that have shaped particular instances of virulence evolution and even fewer in which we have successfully linked evolutionary theory with individual genomic changes. We believe that a synthesis of experimental studies of virulence determinants and long-standing theory of virulence evolution set within a phylogenomic framework will generate a more comprehensive understanding of virulence evolution. In particular, not only does a phylogenomic approach enable potential virulence determinants to be identified, which is being increasingly used in the case of emerging viruses, but this analysis also sheds light on the models of virulence evolution that have occupied theoreticians for decades.

Recent advances in real-time genomics during disease outbreaks127,128 and the increased demand for precision in public health interventions may help in the development of a new understanding of the evolution of pathogen virulence. We contend that this can be achieved within a phylogenomic framework as long as relevant data are available and strong links are made between genomics, phylogenetics, epidemiology, and experimental studies of virus virulence and fitness. Therefore, it is critically important to collect clinical (that is, disease symptoms and severity) and epidemiological (that is, time and place of sampling) metadata concurrently with the sequencing of virus genomes and to sample across a range of clinical syndromes, not just those associated with severe disease. We also stress the value of gathering concurrent and historical data from likely reservoir species as these will provide a more complete insight into virulence evolution and determining the full range of microorganisms that infect a particular species, as well as their interactions, as assigning disease syndromes to individual pathogens may often be difficult. Thankfully, advances in metagenomics now make the latter task feasible6,7,129. Similarly, there is a marked lack of good virulence grading schemes among viral infections. Although such schemes can sometimes be simplistic, assuming discrete virulence categories that may not exist in nature and incorporating degrees of subjectivity, the case of MYXV shows that they are key to considering the relationship between genotype and phenotype that is essential to understanding virulence evolution.

Finally, it is possible that an increased understanding of virulence evolution drawn from a phylogenomic approach may contribute to new strategies for pathogen control and eradication, and there is a clear potential for this framework to inform and improve the fields of disease management and the biological control of invasive pests. Although predicting where and when a new disease might emerge is clearly unfeasible because of the immense complexities involved54,130, predicting the overall trajectory of the virulence evolution of a virus in a novel host may be more achievable. Once again, biocontrol presents a compelling example. Although controversial131,132,133, the proposed release of cyprinid herpesvirus 3 (CyHV-3) as a biological control against invasive common carp (Cyprinus carpio L.) in Australia may present a unique opportunity to follow, in real time, the co-evolution between host and virus at both the genotypic and phenotypic scales. Both theory and virus natural history predict that CyHV-3 virulence will decline with time134, and it will be interesting and informative to see how any such virulence evolution is manifest in phylogenomic data.


  1. 1.

    Bull, J. J. & Lauring, A. S. Theory and empiricism in virulence evolution. PLOS Pathog. 10, e1004387 (2014). This review provides insight into how virulence theory might be united with empirical data and what is needed to better understand the relationship between virulence and transmissibility.

  2. 2.

    McMahon, D. P. et al. Elevated virulence of an emerging viral genotype as a driver of honeybee loss. Proc. Biol. Sci. 283, 20160811 (2016).

  3. 3.

    Power, R. A., Parkhill, J. & de Oliveira, T. Microbial genome-wide association studies: lessons from human GWAS. Nat. Rev. Genet 18, 41–50 (2017).

  4. 4.

    Ansari, M. A. et al. Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus. Nat. Genet. 49, 666 (2017).

  5. 5.

    Yang, J. et al. Unbiased parallel detection of viral pathogens in clinical samples by use of a metagenomic approach. J. Clin. Microbiol. 49, 3463–3469 (2011).

  6. 6.

    Shi, M. et al. Redefining the invertebrate virosphere. Nature 540, 539–543 (2016).

  7. 7.

    Shi, M. et al. The evolutionary history of vertebrate RNA viruses. Nature 556, 197–202 (2018).

  8. 8.

    Stern, A. et al. The evolutionary pathway to virulence of an RNA virus. Cell 169, 35–46 (2017). This study is a powerful example of the analytical approach outlined here. The authors use phylogenetics and experimental study to describe the evolutionary pathways by which OPV strains become pathogenic and propose a framework for vaccine design.

  9. 9.

    Rossi, S. L., Ebel, G. D., Shan, C., Shi, P.-Y. & Vasilakis, N. Did Zika virus mutate to cause severe outbreaks? Trends Microbiol. (2018).

  10. 10.

    Alizon, S. & Lion, S. Within-host parasite cooperation and the evolution of virulence. Proc. Biol. Sci. 278, 3738–3747 (2011).

  11. 11.

    Kerr, P. J. et al. Myxoma virus and the leporipoxviruses: an evolutionary paradigm. Viruses 7, 1029–1061 (2015).

  12. 12.

    Ebihara, H. et al. Molecular determinants of Ebola virus virulence in mice. PLOS Pathog. 2, e73 (2006).

  13. 13.

    Jimenez-Guardeño, J. M. et al. The PDZ-binding motif of severe acute respiratory syndrome coronavirus envelope protein is a determinant of viral pathogenesis. PLOS Pathog. 10, e1004320 (2014).

  14. 14.

    Kataoka, C. et al. The role of VP1 amino acid residue 145 of enterovirus 71 in viral fitness and pathogenesis in a cynomolgus monkey model. PLOS Pathog. 11, e1005033 (2015).

  15. 15.

    Song, H., Santi, N., Evensen, O. & Vakharia, V. N. Molecular determinants of infectious pancreatic necrosis virus virulence and cell culture adaptation. J. Virol. 79, 10289–10299 (2005).

  16. 16.

    Uraki, R. et al. Virulence determinants of pandemic A(H1N1)2009 influenza virus in a mouse model. J. Virol. 87, 2226–2233 (2013).

  17. 17.

    Liu, J. et al. Reverse engineering field isolates of myxoma virus demonstrates that some gene disruptions or loss of function do not explain virulence changes observed in the field. J. Virol. 91, e01289–17 (2017).

  18. 18.

    Oh, D. Y. & Hurt, A. C. Using ferrets as an animal model for investigating influenza antiviral effectiveness. Front. Microbiol. 7, 80 (2016).

  19. 19.

    Dudas, G. et al. Virus genomes reveal factors that spread and sustained the Ebola epidemic. Nature 544, 309–315 (2017).

  20. 20.

    Faria, N. R. et al. Epidemic establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature 546, 406–410 (2017).

  21. 21.

    Holmes, E. C., Dudas, G., Rambaut, A. & Andersen, K. G. The evolution of Ebola virus: insights from the 2013–2016 epidemic. Nature 358, 193–200 (2016).

  22. 22.

    Hadfield, J. et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics (2018).

  23. 23.

    Lässig, M. & Łuksza, M. Can we read the future from a tree? eLife 3, e05060 (2014).

  24. 24.

    Neher, R. A., Russell, C. A. & Shraiman, B. I. Predicting evolution from the shape of genealogical trees. eLife 3, e03568 (2014).

  25. 25.

    Longdon, B. et al. The causes and consequences of changes in virulence following pathogen host shifts. PLOS Pathog. 11, e1004728 (2015). This study combines experimental cross-species transmission and phylogenetics to reveal the nature of virulence evolution in a Drosophila virus. The results show that virulence levels were predictable from the host phylogeny.

  26. 26.

    Walter, S. et al. Differential infection patterns and recent evolutionary origins of equine hepaciviruses in donkeys. J. Virol. 91, e01711–e01716 (2016).

  27. 27.

    Leroy, E. M. et al. Fruit bats as reservoirs of Ebola virus. Nature 438, 575–566 (2005).

  28. 28.

    Allocati, N. et al. Bat-man disease transmission: zoonotic pathogens from wildlife reservoirs to human populations. Cell Death Dis. 2, 16048 (2016).

  29. 29.

    Truyen, U., Evermann, J. F., Vieler, E. & Parrish, C. R. Evolution of canine parvovirus involved loss and gain of feline host range. Virology 215, 186–189 (1996).

  30. 30.

    Allison, A. B. et al. Host-specific parvovirus evolution in nature is recapitulated by in vitro adaptation to different carnivore species. PLOS Pathog. 10, e1004475 (2014).

  31. 31.

    Geoghegan, J. L., Senior, A. M., Di Giallonardo, F. & Holmes, E. C. Virological factors that increase the transmissibility of emerging human viruses. Proc. Natl Acad. Sci. USA 113, 4170–4175 (2016).

  32. 32.

    Anderson, R. M. & May, R. M. Coevolution of hosts and parasites. Parasitology 85, 411–426 (1982).

  33. 33.

    Bull, J. J. Virulence. Evolution 48, 1423–1437 (1994).

  34. 34.

    Ebert, D. in Evolution in Health and Disease (ed. Stearns, S. C.) 161–172 (Oxford Univ. Press, Oxford, 1999).

  35. 35.

    Ebert, D. & Bull, J. J. Challenging the trade-off model for the evolution of virulence: is virulence management feasible? Trends Microbiol. 11, 15–20 (2003).

  36. 36.

    Ewald, P. W. Evolution of Infectious Diseases. (Oxford Univ. Press, Oxford, 1994).

  37. 37.

    Read, A. F. The evolution of virulence. Trends Microbiol. 2, 73–76 (1994).

  38. 38.

    Furió, V. et al. Relationship between within-host fitness and virulence in the vesicular stomatitis virus: correlation with partial decoupling. J. Virol. 86, 12228–12236 (2012).

  39. 39.

    Lion, S. & Metz, J. A. J. Beyond R0 maximisation: on pathogen evolution and environmental dimensions. Trends Ecol. Evol. 33, 458–473 (2018). This article argues that current formulations of R 0 are too simplistic to understand pathogen evolution and that a more complex, multidimensional approach is needed.

  40. 40.

    Fraser, C., Hollingsworth, T. D., Chapman, R., de Wolf, F. & Hanage, W. P. Variation in HIV-1 set-point viral load: epidemiological analysis and an evolutionary hypothesis. Proc. Natl Acad. Sci. USA 104, 17441–17446 (2007).

  41. 41.

    Alizon, S., Hurford, A., Mideo, N. & Van Baalen, M. Virulence evolution and the trade-off hypothesis: history, current state of affairs & the future. J. Evol. Biol. 22, 245–259 (2009). This paper provides an overview of the theory that there is a trade-off between virulence and transmissibility, highlighting areas in which the theory has been challenged and extended over the past 20 years.

  42. 42.

    Alizon, S. & Michalakis, Y. Adaptive virulence evolution: the good old fitness-based approach. Trends Ecol. Evol. 30, 248–254 (2015).

  43. 43.

    Lipsitch, M. & Moxon, E. R. Virulence and transmissibility of pathogens: what is the relationship? Trends Microbiol. 5, 31–37 (1997).

  44. 44.

    Novella, I. S., Elena, S. F., Moya, A., Domingo, E. & Holland, J. J. Size of genetic bottlenecks leading to virus fitness loss is determined by mean initial population fitness. J. Virol. 69, 2869–2872 (1995).

  45. 45.

    Betancourt, M., Escriu, F., Fraile, A. & García-Arenal, F. Virulence evolution of a generalist plant virus in a heterogeneous host system. Evol. Appl. 6, 875–890 (2013).

  46. 46.

    Di Giallonardo, F. & Holmes, E. C. Virus biocontrol: grand experiments in disease emergence and evolution. Trends Microbiol. 23, 83–90 (2015).

  47. 47.

    Willemsen, A., Zwart, M. P. & Elena, S. F. High virulence does not necessarily impede viral adaptation to a new host: a case study using a plant RNA virus. BMC Evol. Biol 17, 25 (2017).

  48. 48.

    De Roode, J. C. et al. Virulence and competitive ability in genetically diverse malaria infections. Proc. Natl Acad. Sci. USA 102, 7624–7628 (2005).

  49. 49.

    Levin, B. R. & Bull, J. J. Short-sighted evolution and the virulence of pathogenic microorganisms. Trends Microbiol. 2, 76–81 (1994).

  50. 50.

    Lloyd-Smith, J. O. et al. Epidemic dynamics at the human-animal interface. Science 326, 1362–1367 (2009).

  51. 51.

    Bergstrom, C. T., McElhany, P. & Real, L. A. Transmission bottlenecks as determinants of virulence in rapidly evolving pathogens. Proc. Natl Acad. Sci. USA 96, 5095–5100 (1999).

  52. 52.

    Bolker, B. M., Nanda, A. & Shah, D. Transient virulence of emerging pathogens. J. R. Soc. Interface 7, 811–822 (2010).

  53. 53.

    Brown, S. P., Cornforth, D. M. & Mideo, N. Evolution of virulence in opportunistic pathogens: generalism, plasticity, and control. Trends Microbiol. 20, 336–342 (2012).

  54. 54.

    Dolan, P. T., Whitfield, Z. J. & Andino, R. Mapping the evolutionary potential of RNA viruses. Cell Host Microbe 23, 435–446 (2018). This article discusses what aspects of virus evolution might be predictable, including virulence, and how this can be tested and highlights the importance of parallel evolution as a marker of adaptation.

  55. 55.

    Sackman, A. M. et al. Mutation driven parallel evolution during viral adaptation. Mol. Biol. Evol. 34, 3243–3253 (2017).

  56. 56.

    Stewart, C.-B., Schilling, J. W. & Wilson, A. C. Convergent evolution of lysozyme sequences? Nature 332, 788 (1988).

  57. 57.

    Frickel, J., Feulner, P. G. D., Karakoc, E. & Becks, L. Population size changes and selection drive patterns of parallel evolution in a host-virus system. Nat. Commun. 9, 1706 (2018).

  58. 58.

    Brault, A. C. et al. A single positively selected West Nile viral mutation confers increased avian virogenesis in American crows. Nat. Genet. 39, 1162–1166 (2007). This study demonstrates that positive selection on a single amino acid substitution within the genome of WNV was responsible for increased virulence in the American crow.

  59. 59.

    Longdon, B. et al. Host shifts result in parallel genetic changes when viruses evolve in closely related species. PLOS Pathog. 14, e1006951 (2018).

  60. 60.

    Weaver, S. et al. Datamonkey 2.0: a modern web application for characterizing selective and other evolutionary processes. Mol. Biol. Evol. 35, 773–777 (2018).

  61. 61.

    Kryazhimskiy, S. & Plotkin, J. B. The population genetics of dN/dS. PLOS Genet. 4, e1000304 (2008).

  62. 62.

    Pybus, O. G. et al. Phylogenetic evidence for deleterious mutation load in RNA viruses and its contribution to viral evolution. Mol. Biol. Evol. 24, 845–852 (2007).

  63. 63.

    Bhatt, S., Holmes, E. C. & Pybus, O. G. The genomic rate of molecular adaptation of the human influenza A virus. Mol. Biol. Evol. 28, 2443–2451 (2011).

  64. 64.

    Geoghegan, J. L., Senior, A. M. & Holmes, E. C. Pathogen population bottlenecks and adaptive landscapes: overcoming the barriers to disease emergence. Proc. Biol. Sci. 283, 1837 (2016).

  65. 65.

    Morley, V. J. & Turner, P. E. Dynamics of molecular evolution in RNA virus populations depend on sudden versus gradual environmental change. Evolution 71, 872–883 (2017).

  66. 66.

    Taubenberger, J. K. et al. Characterization of the 1918 influenza virus polymerase genes. Nature 437, 889–893 (2005).

  67. 67.

    Sawyer, S. L., Emerman, M. & Malik, H. S. Ancient adaptive evolution of the primate antiviral DNA-editing enzyme APOBEC3G. PLOS Biol. 2, e275 (2004).

  68. 68.

    Sawyer, S. L., Wu, L. I., Emerman, M. & Malik, H. S. Positive selection of primate TRIM5alpha identifies a critical species-specific retroviral restriction domain. Proc. Natl Acad. Sci. USA 102, 2832–2837 (2005).

  69. 69.

    Bedhomme, S., Hillung, J. & Elena, S. F. Emerging viruses: why they are not jacks of all trades? Curr. Opin. Virol. 10, 1–6 (2015).

  70. 70.

    Urbanowicz, R. A. et al. Human adaptation of Ebola virus during the West African outbreak. Cell 167, 1079–1087 (2016).

  71. 71.

    Simon-Loriere, E. & Holmes, E. C. Why do RNA viruses recombine? Nat. Rev. Microbiol. 9, 617–626 (2011).

  72. 72.

    Di Giallonardo, F. et al. Fluid spatial dynamics of West Nile virus in the USA: rapid spread in a permissive host environment. J. Virol. 90, 862–872 (2016).

  73. 73.

    LaDeau, S. L., Kilpatrick, A. M. & Marra, P. P. West Nile virus emergence and large-scale declines of North American bird populations. Nature 447, 710–713 (2007).

  74. 74.

    Kilpatrick, A. M. Globalization, land use, and the invasion of West Nile virus. Science 334, 323–327 (2011).

  75. 75.

    Guan, Y. et al. H5N1 influenza: a protean pandemic threat. Proc. Natl Acad. Sci. USA 101, 8156–8161 (2004).

  76. 76.

    Webby, R. J. & Webster, R. G. Are we ready for pandemic influenza? Science 302, 1519–1522 (2003).

  77. 77.

    Lam, T. T.-Y. et al. Dissemination, divergence and establishment of H7N9 influenza viruses in China. Nature 522, 102–105 (2015).

  78. 78.

    Yang, L. et al. Genesis and spread of newly emerged highly pathogenic H7N9 avian viruses in mainland China. J. Virol. 91, e01277–e01217 (2017).

  79. 79.

    Qi, W. et al. Emergence and adaptation of a novel highly pathogenic H7N9 influenza virus in birds and humans from a 2013 human-infecting low pathogenic ancestor. J. Virol. 92, e00921–17 (2018).

  80. 80.

    Lipsitch, M. et al. Viral factors in influenza pandemic risk assessment. eLife 5, e18491 (2016).

  81. 81.

    Baigent, S. J. & McCauley, J. W. Influenza type A in humans, mammals and birds: determinants of virus virulence, host-range and interspecies transmission. Bioessays 25, 657–671 (2003).

  82. 82.

    Horimoto, T. & Kawaoka, Y. Influenza: lessons from past pandemics, warnings from current incidents. Nat. Rev. Microbiol. 3, 591–600 (2005).

  83. 83.

    Monne, I. et al. Emergence of a highly pathogenic avian influenza virus from a low pathogenic progenitor. J. Virol. 88, 4375–4388 (2014).

  84. 84.

    Russell, C. A. et al. The potential for respiratory droplet-transmissible A/H5N1 influenza virus to evolve in a mammalian host. Science 336, 1541–1547 (2012).

  85. 85.

    Hatta, M., Gao, P., Halfmann, P. & Kawaoka, Y. Molecular basis for high virulence of Hong Kong H5N1 influenza A viruses. Science 293, 1840–1842 (2001).

  86. 86.

    Pearce, M. B. et al. Enhanced virulence of clade highly pathogenic avian influenza A H5N1 viruses in ferrets. Virology 502, 114–122 (2017).

  87. 87.

    Nogales, A., Martinez-Sobrido, L., Topham, D. J. & DeDiego, M. L. NS1 protein amino acid changes D189N and V194I affect interferon responses, thermosensitivity, and virulence of circulating H3N2 human influenza A viruses. J.Virol. 91, e01930–16 (2017).

  88. 88.

    Jackson, D., Hossain, M. J., Hickman, D., Perez, D. R. & Lamb, R. A. A new influenza virus virulence determinant: the NS1 protein four C-terminal residues modulate pathogenicity. Proc. Natl Acad. Sci. USA 105, 4381–4386 (2008).

  89. 89.

    Kash, J. C. et al. Genomic analysis of increased host immune and cell death responses induced by 1918 influenza virus. Nature 443, 578–581 (2006). This paper presents a comprehensive study of the virulence of the 1918 influenza virus in a mouse model, revealing the importance of virus–host interactions involving a wide set of interacting virus genes.

  90. 90.

    Bae, J.-Y. et al. A single amino acid in the polymerase acidic protein determines the pathogenicity of influenza B viruses. J. Virol. 92, e00259–e00218 (2018).

  91. 91.

    Fenner, F. Biological control as exemplified by smallpox eradication and myxomatosis. Proc. R. Soc. B. 218, 259–285 (1983).

  92. 92.

    Fenner, F. & Marshall, I. D. A comparison of the virulence for European rabbits (Oryctolagus cuniculus) of strains of myxoma virus recovered in the field in Australia, Europe and America. J. Hyg. 55, 149–191 (1957).

  93. 93.

    Fenner, F. & Ratcliffe, F. N. Myxomatosis (Cambridge Univ. Press, Cambridge, 1965).

  94. 94.

    Kerr, P. J. et al. Evolutionary history and attenuation of myxoma virus on two continents. PLOS Pathog. 8, e1002950 (2012). This article presents the first large-scale phylogenomic analysis of MYXV and shows that, despite parallel evolution at the phenotypic (that is, virulence grade) level, different virus mutations were responsible for evolution of virulence in both Australia and Europe.

  95. 95.

    Kerr, P. J. et al. Genome scale evolution of myxoma virus (MYXV) reveals host-pathogen adaptation and rapid geographic spread. J. Virol. 87, 12900–12915 (2013).

  96. 96.

    Elena, S. F., Carrasco, P., Daròs, J. A. & Sanjuán, R. Mechanisms of genetic robustness in RNA viruses. EMBO Rep. 7, 168–173 (2006).

  97. 97.

    Gandon, S., Mackinnon, M. J., Nee, S. & Read, A. F. Imperfect vaccines and the evolution of pathogen virulence. Nature 414, 751–756 (2001).

  98. 98.

    Smith, T. Imperfect vaccines and imperfect models. Trends Ecol. Evol. 17, 154–156 (2002).

  99. 99.

    Read, A. F. et al. Imperfect vaccination can enhance the transmission of highly virulent pathogens. PLOS Biol. 13, e1002198 (2015). This paper presents a clear demonstration that imperfect vaccination was responsible for increased virulence in the case of the MDV of chickens and considers the evolutionary implications of imperfect vaccination in general.

  100. 100.

    Kamil, J. P. et al. vLIP, a viral lipase homologue, is a virulence factor of Marek’s disease virus. J. Virol. 79, 6984–6996 (2005).

  101. 101.

    Trimpert, J. et al. A phylogenomic analysis of Marek’s disease virus reveals independent paths to virulence in Eurasia and North America. Evol. Appl. 10, 1091–1101 (2017).

  102. 102.

    Mackinnon, M. J. & Read, A. F. Immunity promotes virulence evolution in a malaria model. PLOS Biol. 2, e230 (2004).

  103. 103.

    Carre, N. et al. Has the rate of progression to AIDS changed in recent years? AIDS 11, 1611–1618 (1997).

  104. 104.

    Fraser, C. et al. Virulence and pathogenesis of HIV-1 infection: an evolutionary perspective. Science 343, 1243727 (2014). This key study of the evolution of HIV virulence focuses on the factors that determine the SPVL and shows the importance of virus genetic traits in shaping SPVL and hence disease severity.

  105. 105.

    Galai, N. et al. Temporal trends of initial CD4 cell counts following human immunodeficiency virus seroconversion in Italy, 1985-1992. The human immunodeficiency virus Italian seroconversion study. Am. J. Epidemiol. 143, 278–282 (1996).

  106. 106.

    Herbeck, J. T. et al. Is the virulence of HIV changing? A meta-analysis of trends in prognostic markers of HIV disease progression and transmission. AIDS 26, 193–205 (2012).

  107. 107.

    Vanhems, P., Lambert, J., Guerra, M., Hirschel, B. & Allard, R. Association between the rate of CD4+ T cell decrease and the year of human immunodeficeny virus (HIV) type 1 seroconversion among persons enrolled in the Swiss HIV cohort study. J. Infect. Dis. 180, 1803–1808 (1999).

  108. 108.

    Arien, K. K., Vanham, G. & Arts, E. J. Is HIV-1 evolving to a less virulent form in humans? Nat. Rev. Microbiol. 5, 141–151 (2007).

  109. 109.

    Claiborne, D. T. et al. Replicative fitness of transmitted HIV-1 drives acute immune activation, proviral load in memory CD4+ T cells, and disease progression. Proc. Natl Acad. Sci. USA 112, E1480–E1489 (2015).

  110. 110.

    Miura, T. et al. Impaired replication capacity of acute/early viruses in persons who become HIV controllers. J. Virol. 84, 7581–7591 (2010).

  111. 111.

    Blanquart, F. et al. Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe. PLOS Biol. 15, e2001855 (2017).

  112. 112.

    Fellay, J. et al. Common genetic variation and the control of HIV-1 in humans. PLOS Genet. 5, e1000791 (2009).

  113. 113.

    Alizon, S. et al. Phylogenetic approach reveals that virus genotype largely determines HIV set-point viral load. PLOS Pathog. 6, e1001123 (2010).

  114. 114.

    Blanquart, F. et al. A transmission-virulence evolutionary trade-off explains attenuation of HIV-1 in Uganda. eLife 5, e20492 (2016).

  115. 115.

    Sofonea, M. T., Aldakak, L., Boullosa, L. F. V. V. & Alizon, S. Can Ebola virus evolve to be less virulent in humans? J. Evol. Biol. 31, 382–392 (2018).

  116. 116.

    Diehl, W. E. et al. Ebola virus glycoprotein with increased infectivity dominated the 2013-2016 epidemic. Cell 167, 1088–1098 (2016). This study uses a combination of phylogenetics and experimental study to demonstrate the importance of the A82V substitution in the EBOV glycoprotein to viral infectivity and suggests that A82V has increased virus virulence.

  117. 117.

    Ng, M. et al. Filovirus receptor NPC1 contributes to species-specific patterns of ebolavirus susceptibility in bats. eLife 4, e11785 (2015).

  118. 118.

    Marzi, A. et al. Recently identified mutations in the Ebola virus-Makona genome do not alter pathogenicity in animal models. Cell Rep. 23, 1806–1816 (2018).

  119. 119.

    Chan, J. F., Choi, G. K., Yip, C. C., Cheng, V. C. & Yuen, K. Y. Zika fever and congenital Zika syndrome: an unexpected emerging arboviral disease. J. Infect. 72, 507–524 (2016).

  120. 120.

    Cao-Lormeau, V. M. et al. Guillain–Barré syndrome outbreak associated with Zika virus infection in French Polynesia: a case–control study. Lancet 387, 1531–1539 (2016).

  121. 121.

    Fauci, A. S. & Morens, D. M. Zika virus in the Americas — yet another arbovirus threat. N. Engl. J. Med. 374, 601–604 (2016).

  122. 122.

    Simonin, Y., van Riel, D., Van de Perre, P., Rockx, B. & Salinas, S. Differential virulence between Asian and African lineages of Zika virus. PLOS Negl. Trop. Dis. 11, e0005821 (2017).

  123. 123.

    Grubaugh, N. D., Faria, N. R., Andersen, K. G. & Pybus, O. G. Genomic insights into Zika virus emergence and spread. Cell 172, 1160–1162 (2018).

  124. 124.

    Liu, Y. et al. Evolutionary enhancement of Zika virus infectivity in Aedes aegypti mosquitoes. Nature 545, 482–486 (2017). The results of this study suggest that a mutation in the NS1 protein in the ZIKV associated with the recent outbreak in the Americas causes increased infectivity in Aedes aegypti mosquitoes, in turn elevating epidemic potential.

  125. 125.

    Yuan, L. et al. A single mutation in the prM protein of Zika virus contributes to fetal microcephaly. Science 358, 933–936 (2017).

  126. 126.

    Rosenfeld, A. B., Doobin, D. J., Warren, A. L., Racaniello, V. R. & Vallee, R. B. Replication of early and recent Zika virus isolates throughout mouse brain development. Proc. Natl Acad. Sci. USA 114, 12273–12278 (2017).

  127. 127.

    Firth, C. & Lipkin, W. I. The genomics of emerging pathogens. Annu. Rev. Genom. Hum. Genet. 14, 281–300 (2013).

  128. 128.

    Gardy, J. L. & Loman, N. J. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat. Rev. Genet. 19, 9–20 (2018).

  129. 129.

    Shi, M., Zhang, Y.-Z. & Holmes, E. C. Meta-transcriptomics and the evolutionary biology of RNA viruses. Virus Res. 243, 83–90 (2018).

  130. 130.

    Geoghegan, J. L. & Holmes, E. C. Predicting virus emergence amidst evolutionary noise. Open Biol. 7, 170189 (2017).

  131. 131.

    Lighten, J. & van Oosterhout, C. Biocontrol of common carp in Australia poses risks to biosecurity. Nat. Ecol. Evol. 1, 87 (2017).

  132. 132.

    Marshall, J. et al. Biocontrol of invasive carp: risks abound. Science 359, 877 (2018).

  133. 133.

    McColl, K. A., Sheppard, A. W. & Barwick, M. Safe and effective biocontrol of common carp. Nat. Ecol. Evol. 1, 134 (2017).

  134. 134.

    McColl, K. A., Sunarto, A. & Holmes, E. C. Cyprinid herpesvirus 3 and its evolutionary future as a biological control agent for carp in Australia. Virol. J. 13, 206 (2016).

  135. 135.

    Andre, J. B. & Hochberg, M. E. Virulence evolution in emerging infectious diseases. Evolution 59, 1406–1412 (2005).

  136. 136.

    Chahroudi, A., Bosinger, S. E., Vanderford, T. H., Paiardini, M. & Silvestri, G. Natural SIV hosts: Showing AIDS the door. Science 335, 1188–1193 (2012).

  137. 137.

    Silvestri, G., Paiardini, M., Pandrea, I., Lederman, M. M. & Sodora, D. L. Understanding the benign nature of SIV infection in natural hosts. J. Clin. Invest. 117, 3148–3154 (2007).

  138. 138.

    Keele, B. F. et al. Increased mortality and AIDS-like immunopathology in wild chimpanzees infected with SIVcpz. Nature 460, 515–519 (2009). The results of this study suggest that simian immunodeficiency virus in chimpanzees, which is the ancestor of HIV, increases mortality in these animals, although with less overt disease than in humans.

  139. 139.

    Crawford, P. C. et al. Transmission of equine influenza virus to dogs. Science 310, 482–485 (2005).

  140. 140.

    Parrish, C. R., Murcia, P. R. & Holmes, E. C. Influenza virus reservoirs and intermediate hosts: dogs, horses, and new possibilities for influenza virus exposure of humans. J. Virol. 89, 2990–2994 (2015).

  141. 141.

    Feng, K. H. et al. Equine and canine influenza viruses H3N8 viruses show minimal biological differences despite phylogenetic divergence. J. Virol. 89, 6860–6873 (2015).

  142. 142.

    Bourhy, H., Dautry-Varsat, A., Hotez, P. J. & Salomon, J. Rabies, still neglected after 125 years of vaccination. PLOS Negl Trop. Dis. 4, e839 (2010).

  143. 143.

    Garver, K. A., Batts, W. N. & Kurath, G. Virulence comparisons of infectious hematopoietic necrosis virus U and M genogroups in sockeye slamon and rainbow trout. J. Aquat. Anim. Health 18, 232–243 (2006).

  144. 144.

    Penaranda, M. M., Purcell, M. K. & Kurath, G. Differential virulence mechanisms of infectious hematopoietic necrosis virus in rainbow trout (Oncorhynchus mykiss) include host entry and virus replication kinetics. Gen. J. Virol. 90, 2172–2182 (2009).

  145. 145.

    Kerr, P. J. et al. The evolution of myxoma virus: genomic and phenotypic characterization of isolates from Great Britain reveals multiple successful evolutionary pathways distinct from those in Australia. PLOS Pathog. 13, e1006252 (2017).

  146. 146.

    Kerr, P. J. et al. Next step in the ongoing arms race between myxoma virus and wild rabbits in Australia is a novel disease phenotype. Proc. Natl Acad. Sci. USA 114, 9397–9402 (2017).

  147. 147.

    Jiao, P. et al. A single-amino-acid substitution in the NS1 protein changes the pathogenicity of H5N1 avian influenza viruses in mice. J. Virol. 82, 1146–1154 (2008).

  148. 148.

    Cotter, C. R., Jin, H. & Chen, Z. A single amino acid in the stalk region of the H1N1pdm influenza virus HA protein affects viral fusion, stability and infectivity. PLOS Pathog. 10, e1003831 (2014).

  149. 149.

    Gromowski, G. D., Firestone, C. Y. & Whitehead, S. S. Genetic determinants of Japanese encephalitis virus vaccine strain SA14-14-2 that govern attenuation of virulence in mice. J. Virol. 89, 6328–6337 (2015).

  150. 150.

    He, W. et al. Effect of an 88-amino-acid deletion in nsp2 of porcine reproductive and respiratory syndrome virus on virus replication and cytokine responses in vitro. Arch. Virol. 163, 1489–1501 (2018).

  151. 151.

    Yu, X. et al. The glutamic residue at position 402 in the C-terminus of Newcastle disease virus nucleoprotein is critical for the virus. Sci. Rep. 7, 17471 (2017).

  152. 152.

    Panzarin, V. et al. Low evolutionary rate of infectious pancreatic necrosis virus (IPNV) in Italy is associated with reduced virulence in trout. Virus Evol. 4, vey019 (2018).

  153. 153.

    Dietzschold, B. et al. Characterization of an antigenic determinant of the glycoprotein that correlates with pathogenicity of rabies virus. Proc. Natl Acad. Sci. USA 80, 70–74 (1983).

  154. 154.

    Blanie, S., Mortier, J., Delverdier, M., Bertagnoli, S. & Camus-Bouclainville, C. M148R and M149R are two virulence factors for myxoma virus pathogenesis in the European rabbit. Vet. Res. 40, 11 (2009).

  155. 155.

    Bauer, P. H. et al. Genetic and structural analysis of a virulence determinant in polyomavirus VP1. J. Virol. 69, 7925–7931 (1995).

  156. 156.

    Balinsky, C. A. et al. Sheeppox virus kelch-like gene SPPV-019 affects virus virulence. J. Virol. 81, 11392–11401 (2007).

Download references


E.C.H. is funded by an Australian Research Council Australian Laureate Fellowship (FL170100022). The authors thank J. Bull for helpful discussions.

Reviewer information

Nature Reviews Genetics thanks S. Alizon and the other, anonymous reviewer(s) for their contribution to the peer review of this work.

Author information




Both authors contributed to all aspects of the manuscript.

Corresponding author

Correspondence to Edward C. Holmes.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Related links

National Geographic: leaky vaccines enhance spread of deadlier chicken viruses:

The Scientist Infographic: evolving virulence:



Although virulence can be defined in a range of case-specific ways, the simplest definition is the severity or harmfulness of a pathogen.


The study of genetic material recovered from microbial (including viral) communities in which genome sequence data are simultaneously generated for all the microorganisms present.

Biological control

The reduction in numbers or elimination of pest organisms through the introduction of a pathogen.

Virulence determinants

Mutations in pathogen genomes that directly affect virulence.

Reverse genetics

Experimental method used to identify gene function by modifying the sequence of a target gene and analysing its phenotypic consequences.


The inference of evolutionary patterns and/or processes through the comparison of whole-genome sequences.


Depiction of the evolutionary history of a genetically related group of organisms. Contains branches and nodes.

Emerging infectious diseases

Infectious diseases that have recently appeared in populations or known diseases that are rapidly increasing in incidence or geographic range.

Basic reproductive number

(R0). Represents the number of secondary infections caused by an infectious host in an entirely susceptible population.

Evolutionary trade-off

Occurs when a change in one trait increases fitness but simultaneously reduces fitness because of its impact on another trait, thereby preventing the organism from optimizing both traits.


The initial and sometimes transient appearance of a pathogen in a new species following a host jump.

Population bottlenecks

Reductions, sometimes drastic, in the size of populations. They often accompany inter-host virus transmission.

Cross-species transmission

The transmission of a pathogen from one host species to another. Also called host-jumping or host-switching.

Parallel evolution

An evolutionary process by which two or more separate lineages develop identical characteristics independently.

Convergent evolution

The descendants of unrelated ancestors that have evolved similar traits independently.

Positive selection

Natural selection that leads to advantageous mutations spreading through a population. Mutations in coding regions can be either synonymous (which do not change the amino acid; measured as dS) or non-synonymous (which change the amino acid; measured as dN). The dN/dS ratio >1 is sometimes used to infer the occurrence of positive selection, although the accuracy of this measure depends on various factors, including the timescale of sampling.

Purifying selection

Natural selection that acts to remove low-fitness (including deleterious) mutations from populations. It is the most common form of natural selection and gives a dN/dS < 1. Also called negative selection.


The interaction among genes at different loci. The function and evolution of one gene may be dependent on the presence of one or more other genes.

Subtype of influenza virus

Influenza A viruses are categorized into subtypes on the basis of the diversity in two proteins present in the viral envelope: haemagglutinin (H or HA) and neuraminidase (N or NA).


The amount of phenotypic variation in a population that is attributable to individual genetic differences.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Geoghegan, J.L., Holmes, E.C. The phylogenomics of evolving virus virulence. Nat Rev Genet 19, 756–769 (2018).

Download citation

Further reading