Abstract
Whole-genome sequences provide a rich source of information about human evolution. Here we describe an effort to estimate key evolutionary parameters based on the whole-genome sequences of six individuals from diverse human populations. We used a Bayesian, coalescent-based approach to obtain information about ancestral population sizes, divergence times and migration rates from inferred genealogies at many neutrally evolving loci across the genome. We introduce new methods for accommodating gene flow between populations and integrating over possible phasings of diploid genotypes. We also describe a custom pipeline for genotype inference to mitigate biases from heterogeneous sequencing technologies and coverage levels. Our analysis indicates that the San population of southern Africa diverged from other human populations approximately 108–157 thousand years ago, that Eurasians diverged from an ancestral African population 38–64 thousand years ago, and that the effective population size of the ancestors of all modern humans was ∼9,000.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Cavalli-Sforza, L.L. & Feldman, M.W. The application of molecular genetic approaches to the study of human evolution. Nat. Genet. 33 (suppl.) 266–275 (2003).
Green, R.E. et al. A draft sequence of the Neandertal genome. Science 328, 710–722 (2010).
Reich, D. et al. Genetic history of an archaic hominin group from Denisova Cave in Siberia. Nature 468, 1053–1060 (2010).
Cann, R.L., Stoneking, M. & Wilson, A.C. Mitochondrial DNA and human evolution. Nature 325, 31–36 (1987).
Gonder, M.K., Mortensen, H.M., Reed, F.A., de Sousa, A. & Tishkoff, S.A. Whole-mtDNA genome sequence analysis of ancient African lineages. Mol. Biol. Evol. 24, 757–768 (2007).
Zhivotovsky, L.A., Rosenberg, N.A. & Feldman, M.W. Features of evolution and expansion of modern humans, inferred from genomewide microsatellite markers. Am. J. Hum. Genet. 72, 1171–1186 (2003).
Liu, H., Prugnolle, F., Manica, A. & Balloux, F. A geographically explicit genetic model of worldwide human-settlement history. Am. J. Hum. Genet. 79, 230–237 (2006).
Voight, B.F. et al. Interrogating multiple aspects of variation in a full resequencing data set to infer human population size changes. Proc. Natl. Acad. Sci. USA 102, 18508–18513 (2005).
Fagundes, N.J. et al. Statistical evaluation of alternative models of human evolution. Proc. Natl. Acad. Sci. USA 104, 17614–17619 (2007).
Wall, J.D., Lohmueller, K.E. & Plagnol, V. Detecting ancient admixture and estimating demographic parameters in multiple human populations. Mol. Biol. Evol. 26, 1823–1827 (2009).
Gutenkunst, R.N., Hernandez, R.D., Williamson, S.H. & Bustamante, C.D. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet. 5, e1000695 (2009).
Keinan, A., Mullikin, J.C., Patterson, N. & Reich, D. Measurement of the human allele frequency spectrum demonstrates greater genetic drift in east Asians than in Europeans. Nat. Genet. 39, 1251–1255 (2007).
Levy, S. et al. The diploid genome sequence of an individual human. PLoS Biol. 5, e254 (2007).
Bentley, D.R. et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456, 53–59 (2008).
Wang, J. et al. The diploid genome sequence of an Asian individual. Nature 456, 60–65 (2008).
Ahn, S.M. et al. The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group. Genome Res. 19, 1622–1629 (2009).
Schuster, S.C. et al. Complete Khoisan and Bantu genomes from southern Africa. Nature 463, 943–947 (2010).
Garrigan, D. et al. Inferring human population sizes, divergence times and rates of gene flow from mitochondrial, X and Y chromosome resequencing data. Genetics 177, 2195–2207 (2007).
Tishkoff, S.A. et al. The genetic structure and history of Africans and African Americans. Science 324, 1035–1044 (2009).
Rannala, B. & Yang, Z. Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. Genetics 164, 1645–1656 (2003).
Burgess, R. & Yang, Z. Estimation of hominoid ancestral population sizes under Bayesian coalescent models incorporating mutation rate variation and sequencing errors. Mol. Biol. Evol. 25, 1979–1994 (2008).
Nielsen, R. & Wakeley, J. Distinguishing migration from isolation: a Markov chain Monte Carlo approach. Genetics 158, 885–896 (2001).
Hey, J. Isolation with migration models for more than two populations. Mol. Biol. Evol. 27, 905–920 (2010).
Patterson, N., Richter, D.J., Gnerre, S., Lander, E.S. & Reich, D. Genetic evidence for complex speciation of humans and chimpanzees. Nature 441, 1103–1108 (2006).
Kondrashov, A.S. Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases. Hum. Mutat. 21, 12–27 (2003).
Lewontin, R.C. The apportionment of human diversity. in Evolutionary Biology (eds. Dobzhansky, T.H., Hecht, M.K. & Steere, W.C.) 6 (Appleton-Century-Crofts, New York, New York, USA, 1972).
Beaumont, M.A., Zhang, W. & Balding, D.J. Approximate Bayesian computation in population genetics. Genetics 162, 2025–2035 (2002).
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Hobolth, A., Christensen, O.F., Mailund, T. & Schierup, M.H. Genomic relationships and speciation times of human, chimpanzee, and gorilla inferred from a coalescent hidden Markov model. PLoS Genet. 3, e7 (2007).
Paul, J.S., Steinrucken, M. & Song, Y.S. An accurate sequentially markov conditional sampling distribution for the coalescent with recombination. Genetics 187, 1115–1128 (2011).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).
Li, H., Ruan, J. & Durbin, R. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 18, 1851–1858 (2008).
Frazer, K.A. et al. A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851–861 (2007).
Acknowledgements
This research was supported by a Packard Fellowship (to A.S.), National Science Foundation grant DBI-0644111 and National Institutes of Health training grant T32HD052471 (to C.G.D.). We thank S. Schuster, W. Miller, D. Reich, G. Coop, J. Hey, J. Wall, R.S. Wells, A. Keinan, A.G. Clark, S.C. Choi, C.D. Bustamante, B. Henn and others for helpful discussions and feedback.
Author information
Authors and Affiliations
Contributions
A.S. conceived of and designed the study. I.G. implemented G-PhoCS and applied it to both simulated and real data. B.G. implemented BSNP and applied it to the individual genomes. I.G., M.J.H., B.G., C.G.D. and A.S. performed additional statistical analyses. I.G. and A.S. wrote the paper with review and contributions by all authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–13, Supplementary Tables 1–7 and Supplementary Note. (PDF 3609 kb)
Rights and permissions
About this article
Cite this article
Gronau, I., Hubisz, M., Gulko, B. et al. Bayesian inference of ancient human demography from individual genome sequences. Nat Genet 43, 1031–1034 (2011). https://doi.org/10.1038/ng.937
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/ng.937
This article is cited by
-
Genomic Evidence for the Complex Evolutionary History of Macaques (Genus Macaca)
Journal of Molecular Evolution (2024)
-
Testing assertions of widespread introgressive hybridization in a clade of neotropical toads with low mate selectivity (Rhinella granulosa species group)
Heredity (2023)
-
Ancient DNA reveals genetic admixture in China during tiger evolution
Nature Ecology & Evolution (2023)
-
Comparative genomic analysis of Babesia duncani responsible for human babesiosis
BMC Biology (2022)
-
Revised time estimation of the ancestral human chromosome 2 fusion
BMC Genomics (2022)