Next-generation sequencing technology has facilitated the discovery of millions of genetic variants in human genomes. A sizeable fraction of these variants are predicted to be deleterious. Here, we review the pattern of deleterious alleles as ascertained in genome sequencing data sets and ask whether human populations differ in their predicted burden of deleterious alleles — a phenomenon known as mutation load. We discuss three demographic models that are predicted to affect mutation load and relate these models to the evidence (or the lack thereof) for variation in the efficacy of purifying selection in diverse human genomes. We also emphasize why accurate estimation of mutation load depends on assumptions regarding the distribution of dominance and selection coefficients — quantities that remain poorly characterized for current genomic data sets.
At a glance
- Development of neutral and nearly neutral theories. Theor. Popul. Biol. 49, 128–142 (1996).
This paper reviews the development of the neutral and nearly neutral theories by key contributors to the field of population genetics.
- The mutation load in small populations. Genetics 48, 1303–1312 (1963).
This is a foundational paper on the effect of drift on mutation load in finite populations, demonstrating that mildly deleterious alleles can contribute more to load than strongly deleterious alleles.
- Non-Darwinian evolution. Science 164, 788–798 (1969). &
- The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics 166, 351–372 (2004). , , &
- Formulating a historical and demographic model of recent human evolution based on resequencing data from noncoding regions. PLoS ONE 5, e10284 (2010). , , &
- Bayesian inference of ancient human demography from individual genome sequences. Nature Genet. 43, 1031–1034 (2011). , , , &
- An early divergence of KhoeSan ancestors from those of other modern humans is supported by an ABC-based analysis of autosomal resequencing data. Mol. Bio Evol. 29, 617–630 (2012). et al.
- Evolutionary rate at the molecular level. Nature 217, 624–626 (1968).
- 1985). The Neutral Theory of Molecular Evolution (Cambridge Univ. Press,
- Slightly deleterious mutant substitutions in evolution. Nature 246, 96–98 (1973).
- Genetic loads and the cost of natural selection. Math. Top. Popul. Genet. 1, 128–177 (1970).
- Mutation load: the fitness of individuals in populations where deleterious alleles are abundant. Annu. Rev. Ecol. Evol. Syst. 43, 115–135 (2012). &
- The concept of genetic load: a reply. Am. J. Hum. Genet. 15, 310–315 (1963).
- Fundamental concepts in genetics: the genetics of inbreeding depression. Nature Rev. Genet. 10, 783–796 (2009).
This is a broad review of inbreeding depression and heterosis, fitness phenomena that are caused by the presence of deleterious recessive mutations in populations.
- Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100–1104 (2008). et al.
- The great human expansion. Proc. Natl Acad. Sci. USA 109, 17758–17764 (2012). , &
- Inferences about the distribution of dominance drawn from yeast gene knockout data. Genetics 187, 553–566 (2011).
The distribution of dominance coefficients is directly measured from yeast knockout experiments, showing that large-effect mutations tend to be more recessive than weak-effect mutations.
- Mutation rate and dominance of genes affecting viability in Drosophila melanogaster. Genetics 72, 335–355 (1972). , , &
- The effects of spontaneous mutation on quantitative traits. II. Dominance of mutations with effects on life-history traits. Genet. Res. 70, 27–34 (1997). , , &
- Fitness landscapes: an alternative theory for the dominance of mutation. Genetics 189, 923–937 (2011). , &
- An estimate of the mutational damage in man from data on consanguienous marriages. Proc. Natl Acad. Sci. USA 42, 855–863 (1956).
This is among the earliest work to empirically measure the mutation load in humans by considering the reduction in fitness due to recessive mutations in consanguineous unions.
- On the allelic spectrum of human disease. Trends Genet. 17, 502–510 (2001). &
- Consanguinity, human evolution, and complex diseases. Proc. Natl Acad. Sci. USA 107, 1779–1786 (2010). &
- Long runs of homozygosity are enriched for deleterious variation. Am. J. Hum. Genet. 93, 90–102 (2013). et al.
- Evidence of inbreeding depression on human height. PLoS Genet. 8, e1002655 (2012). et al.
- Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results. Am. J. Hum. Genet. 95, 183–193 (2014).
Based on analysis of the exome sequences of >6,500 individuals, this study shows that nearly 45% of individuals carry a known variant associated with severe Mendelian diseases.
- Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing. Am. J. Hum. Genet. 91, 1022–1032 (2012). et al.
- Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants. Nature Genet. 42, 969–972 (2010). et al.
- The low frequency of recessive disease: insights from ENU mutagenesis, severity of disease phenotype, GWAS associations, and demography: an analytical review. J. Appl. Genet. 55, 319–327 (2014). &
- Population structure at different minor allele frequency levels. BMC Proc. 8, S55 (2014). &
- Fine-scale population structure and the era of next-generation sequencing. Hum. Mol. Genet. 19, R221–R226 (2010). , , , &
- Demography and the age of rare variants. PLoS Genet. 10, e1004528 (2014). &
- A serial founder effect model for human settlement out of Africa. Proc. Biol. Sci. 276, 291–300 (2009). , , &
- 1000 Genomes Project Consortium et al. An integrated map of genetic variation from 1,092 human genomes. Nature 490, 56–65 (2013).
- Explaining worldwide patterns of human genetic variation using a coalescent-based serial founder model of migration outward from Africa. Proc. Natl Acad. Sci. USA 106, 16057–16062 (2009). , &
- An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people. Science 337, 100–104 (2012). et al.
- Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 337, 64–69 (2012). et al.
- Evolutionary constraint facilitates interpretation of genetic variation in resequenced human genomes. Genome Res. 20, 301–310 (2010). et al.
- A systematic survey of loss-of-function variants in human protein-coding genes. Science 335, 823–828 (2012). et al.
- Are rare variants responsible for susceptibility to complex diseases? Am. J. Hum. Genet. 69, 124–137 (2001).
- Evaluating empirical bounds on complex disease genetic architecture. Nature Genet. 45, 1418–1427 (2013). , , , GoT2D Consortium &
- Rare and common variants: twenty arguments. Nature Rev. Genet. 13, 135–145 (2012).
- Population genetics of rare variants and complex diseases. Hum. Hered. 74, 118–128 (2012). , , &
- The fate of mutations surfing on the wave of a range expansion. Mol. Bio. Evol. 23, 482–490 (2005).
- The functional spectrum of low-frequency coding variation. Genome Biol. 12, R84 (2011). et al.
- Recent explosive human population growth has resulted in an excess of rare genetic variants. Science 336, 740–743 (2012). &
- Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet. 4, e1000083 (2008).
This paper estimates selection coefficients for alleles with different predicted deleterious effects in humans and includes a discussion of methods to infer the DFE via site frequency spectra.
- Proportionally more deleterious genetic variation in European than in African populations. Nature 451, 994–997 (2008).
This is a formative paper considering the proportion of deleterious mutations in European-Americans compared to African-Americans based on analysis of an early genome sequencing data set. The higher proportion of deleterious variants in European-Americans was ascribed to increased genetic drift during the Out-of-Africa bottleneck.
- The deleterious mutation load is insensitive to recent population history. Nature Genet. 46, 220–224 (2014).
This paper challenges the earlier studies (for example, reference 48) by demonstrating, via simulation, that the average number of deleterious mutations per individual under an additive model should be the same across populations for different human demographic histories.
, , &
- Whole-exome sequencing reveals a rapid change in the frequency of rare functional variants in a founding population of humans. PLoS Genet. 9, e1003815 (2013). et al.
- No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans. Nature Genet. 47, 126–131 (2015). et al.
- Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants. Nature 493, 216–220 (2013). et al.
- Characteristics of neutral and deleterious protein-coding variation among individuals and populations. Am. J. Hum. Genet. 95, 421–436 (2014).
This paper shows that European-American individuals carry slightly more deleterious derived alleles in their genome sequences, on average, than African-Americans under a conservation-based framework to predict variant function; this is consistent with Out-of-Africa bottleneck simulations.
, , &
- When is selection effective? bioRXiv http://dx.doi.org/10.1101/010934 (2014).
- Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet. 10, e1004494 (2014). et al.
- Paternal and maternal DNA lineages reveal a bottleneck in the founding of the Finnish population. Proc. Natl Acad. Sci. USA 93, 12035–12039 (1996). et al.
- Demographic history and rare allele sharing among human populations. Proc. Natl Acad. Sci. USA 108, 11983–11988 (2011). et al.
- Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011). &
- A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012). et al.
- Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet. 5, e1000695 (2009). , , &
- Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proc. Natl Acad. Sci. 108, 5154–5162 (2011). et al.
- The impact of population demography and selection on the genetic architecture of complex traits. PLoS Genet. 10, e1004379 (2014).
Reprising his earlier work in reference 48, this paper focuses on patterns of deleterious variants over time, given different demographic scenarios of expansion, bottleneck and combinations of demographic events.
- Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc. Natl Acad. Sci. USA 102, 15942–15947 (2005). et al.
- Impact of range expansions on current human genomic diversity. Curr. Opin. Genet. Dev. 29, 22–30 (2014). , &
- Deep human genealogies reveal a selective advantage to be on an expanding wave front. Science 334, 1148–1150 (2011). et al.
- On the accumulation of deleterious mutations during range expansions. Mol. Ecol. 22, 5972–5982 (2013).
This is a complex simulation study showing that range expansion can result in expansion load from deleterious mutations that rise to a high frequency on a geographical wave front.
, , &
- Surfing downhill: when should population range expansion be characterized by reductions in fitness? Mol. Ecol. 22, 5963–5965 (2013).
- Deep resequencing reveals excess rare recent variants consistent with explosive population growth. Nature Commun. 1, 131–136 (2010). et al.
- Rapid, global demographic expansions after the origins of agriculture. Proc. Natl Acad. Sci. USA 108, 6044–6049 (2011). , &
- MtDNA analysis of global populations support that major population expansions began before Neolithic time. Sci. Rep. 2, 745 (2012). , , &
- Ice ages and the mitochondrial DNA chronology of human dispersals: a review. Phil. Trans. R. Soc. Lond. B. 359, 255–264 (2004).
- Population growth inflates the per-individual number of deleterious mutations and reduces their mean effect. Genetics 195, 969–978 (2013). , , &
- Modeling recent human evolution in mice by expression of a selected EDAR variant. Cell 152, 691–702 (2013). et al.
- Classic selective sweeps were rare in recent human evolution. Science 331, 920–924 (2011). et al.
- Childhood anemia at high altitude: risk factors for poor outcomes in severe pneumonia. Pediatrics 132, e1156–e1162 (2013). et al.
- Factors affecting the genetic load in Drosophila: synergistic epistasis and correlations among fitness components. Evolution 54, 1654–1660 (2000). &
- On the rate and linearity of viability declines in Drosophila mutation-accumulation experiments: genomic mutation rates and synergistic epistasis revisited. Genetics 166, 797–806 (2004).
- The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl Acad. Sci. USA 109, 1193–1198 (2012). , , &
- Genome-wide inference of natural selection on human transcription factor binding sites. Nature Genet. 45, 723–729 (2013). et al.
- The distribution of deleterious genetic variation in human populations. Curr. Opin. Genet. Dev. 29, 139–146 (2014).
- Supplementary information S1 (box) (171 KB)
Variant Annotation Algorithms
- Supplementary information S2 (figure) (249 KB)
Demographic history based on the site frequency spectrum and sharing of rare alleles.
- Supplementary information S3 (figure) (238 KB)
Allele sharing versus allele frequency among European populations.