Research involving ancient DNA (aDNA) has experienced a true technological revolution in recent years through advances in the recovery of aDNA and, particularly, through applications of high-throughput sequencing. Formerly restricted to the analysis of only limited amounts of genetic information, aDNA studies have now progressed to whole-genome sequencing for an increasing number of ancient individuals and extinct species, as well as to epigenomic characterization. Such advances have enabled the sequencing of specimens of up to 1 million years old, which, owing to their extensive DNA damage and contamination, were previously not amenable to genetic analyses. In this Review, we discuss these varied technical challenges and solutions for sequencing ancient genomes and epigenomes.
At a glance
- DNA sequences from the quagga, an extinct member of the horse family. Nature 312, 282–284 (1984). , , , &
- Ancient bone DNA amplified. Nature 342, 485 (1989). , &
- Ancient DNA: extraction, characterization, molecular cloning, and enzymatic amplification. Proc. Natl Acad. Sci. USA 86, 1939–1943 (1989).
- Ancient human genome sequence of an extinct Paleo-Eskimo. Nature 463, 757–762 (2010).
This study takes advantage of the relative absence of environmental microorganisms within ancient hairs to characterize the first high-quality ancient human genome.
- Sequencing the nuclear genome of the extinct woolly mammoth. Nature 456, 387–390 (2008). et al.
- A draft sequence of the Neandertal genome. Science 328, 710–722 (2010).
This paper reports the first draft genome of an archaic hominin and many methodological developments that are still commonly used for characterizing and analysing ancient genomes.
- A draft genome of Yersinia pestis from victims of the Black Death. Nature 478, 506–510 (2011).
This paper reports the first genome isolated from an ancient pathogenic bacterium, confirming the Black Death as a plague epidemic. It revealed that no derived variant is unique to the medieval strain, suggesting that non-genetic factors enhanced the virulence of the pathogen.
- Reconstructing genome evolution in historic samples of the Irish potato famine pathogen. Nat. Commun. 4, 2172 (2013). et al.
- Genome-wide comparison of medieval and modern Mycobacterium leprae. Science 341, 179–183 (2013). et al.
- Pre-Columbian mycobacterial genomes reveal seals as a source of New World human tuberculosis. Nature 514, 494–497 (2014). et al.
- Second-pandemic strain of Vibrio cholera from the Philadelphia cholera outbreak. N. Engl. J. Med. 370, 334–340 (2014). et al.
- Ancient pathogen DNA in archaeological samples detected with a microbial detection array. Sci. Rep. 4, 4245 (2014). et al.
- Yersinia pestis and the Plague of Justinian 541–543 AD: a genomic analysis. Lancet Infect. Dis. 14, 319–326 (2014). et al.
- An Aboriginal Australian genome reveals separate human dispersals into Asia. Science 334, 94–98 (2011). et al.
- New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing. Nat. Commun. 3, 698 (2012). et al.
- A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012).
This paper describes a novel method for constructing aDNA libraries using ssDNA templates, which enabled the characterization of the Denisovan genome at a quality rivalling that of modern genomes, starting from only minute amounts of DNA extracts.
- Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 499, 74–78 (2013).
This study takes advantage of both second-generation (high-throughput, and library- and amplification-dependent) and third-generation (high-throughput, and library- and amplification-independent) sequencing technologies to present the oldest genome sequence hitherto characterized: that of an ~700,000-year-old horse.
- Genome flux and stasis in a five millennium transect of European prehistory. Nat. Commun. 5, 5257 (2014). et al.
- Speciation with gene flow in equids despite extensive chromosomal plasticity. Proc. Natl Acad. Sci. USA 111, 18655–18660 (2014). et al.
- Two ancient human genomes reveal Polynesian ancestry among the indigenous Botocudos of Brazil. Curr. Biol. 24, R1035–R1037 (2014). et al.
- Derived immune and ancestral pigmentation alleles in a 7,000-year-old Mesolithic European. Nature 507, 225–228 (2014). et al.
- The complete genome sequence of a Neanderthal from the Altai Mountains. Nature 505, 43–49 (2014). et al.
- The genetic prehistory of the New World Arctic. Science 345, 1255832 (2014). et al.
- Upper Paleolithic Siberian genome reveals dual ancestry of Native Americans. Nature 505, 87–91 (2014). et al.
- The genome of a Late Pleistocene human from a Clovis burial site in western Montana. Nature 506, 225–229 (2014). et al.
- Prehistoric genomes reveal the genetic foundation and cost of horse domestication. Proc. Natl Acad. Sci. USA 111, E5661–E5669 (2014). et al.
- Genomic structure in Europeans dating back at least 36,200 years. Science 346, 1113–1118 (2014). et al.
- Genome data from a sixteenth century pig illuminate modern breed relationships. Heredity 114, 175–184 (2015). et al.
- Genome-wide ancestry of 17th century enslaved Africans from the Caribbean. Proc. Natl Acad. Sci. USA 112, 3669–3673 (2015). et al.
- Sequencing technologies — the next generation. Nat. Rev. Genet. 11, 31–46 (2010).
- Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl Acad. Sci. USA 110, 15758–15763 (2013). et al.
- A mitochondrial genome sequence of a hominin from Sima de los Huesos. Nature 505, 403–406 (2014). et al.
- Reconstructing the DNA methylation maps of the Neandertal and the Denisovan. Science 344, 523–527 (2014). et al.
- Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome. Genome Res. 24, 454–466 (2014).
This study exploits DNA degradation patterns in HTS data sets to characterize, for the first time, genome-wide nucleosome and methylation maps from an ancient human and infer ancient gene expression levels and the age at death of the individual.
- Major transitions in human evolution revisited: a tribute to ancient DNA. J. Hum. Evol. 79, 4–20 (2015). , , &
- A paleogenomic perspective on evolution and gene function: new insights from ancient DNA. Science 343, 1236573 (2014). &
- Using ancient DNA to understand evolutionary and ecological processes. Ann. Rev. Ecol. Evol. Syst. 45, 573–598 (2014). &
- An epigenetic window into the past? Science 345, 511–512 (2014). &
- DNA damage and DNA sequence retrieval from ancient tissues. Nucleic Acids Res. 24, 1304–1307 (1996). , , , &
- Statistical evidence for miscoding lesions in ancient DNA templates. Mol. Biol. Evol. 18, 262–265 (2001). , , , &
- DNA sequences from multiple amplifications reveal artifacts induced by cytosine deamination in ancient DNA. Nucleic Acids Res. 29, 4793–4799 (2001). , , , &
- Patterns of nucleotide misincorporations during enzymatic amplification and direct large-scale sequencing of ancient DNA. Proc. Natl Acad. Sci. USA 103, 13578–13584 (2006). et al.
- Recharacterization of ancient DNA miscoding lesions: insights in the era of sequencing-by-synthesis. Nucleic Acids Res. 35, 1–10 (2007). et al.
- Patterns of damage in genomic DNA sequences from a Neandertal. Proc. Natl Acad. Sci. USA 104, 14616–14621 (2007).
This study characterizes typical nucleotide misincorporation and fragmentation patterns using HTS data from aDNA extracts, which have been subsequently used as essential authentication criteria.
- True single-molecule DNA sequencing of a Pleistocene horse bone. Genome Res. 21, 1705–1719 (2011). et al.
- Temporal patterns of nucleotide misincorporations and DNA fragmentation in ancient DNA. PLoS ONE 7, e34131 (2012). et al.
- Next-generation sequencing offers new insights into DNA degradation. Trends Biotechnol. 30, 364–368 (2012). , &
- mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013). et al.
- Crosslinks rather than strand breaks determine access to ancient DNA sequences from frozen sediments. Genet. 173, 1175–1179 (2006). et al.
- Road blocks on paleogenomes — polymerase extension profiling reveals the frequency of blocking lesions in ancient DNA. Nucleic Acids Res. 38, e161 (2010). et al.
- Nuclear gene sequences from a late Pleistocene sloth coprolithe. Curr. Biol. 13, 1150–1152 (2003). , , , &
- Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science 311, 393–394 (2006).
This study reports the first genetic analysis of ancient specimens based on a HTS technology, paving the way for whole-genome sequencing from ancient specimens.
- The half-life of DNA in bone: measuring decay kinetics in 158 dated fossils. Proc. Biol. Sci. 279, 4724–4733 (2012). et al.
- The thermal history of human fossils and the likelihood of successful DNA amplification. J. Hum. Evol. 45, 203–217 (2003). , , , &
- New insights from old bones: DNA preservation and degradation in permafrost preserved mammoth remains. Nucleic Acids Res. 37, 3215–2129 (2009). et al.
- Improving the performance of true single molecule sequencing for ancient DNA. BMC Genomics 13, 177 (2012). et al.
- Shotgun microbial profiling of fossil remains. Mol. Ecol. 23, 1780–1798 (2014). et al.
- Improving access to endogenous DNA in ancient bones and teeth. BioRxiv http://dx.doi.org/10.1101/014985 (2015). et al.
- Relatively well preserved DNA is present in the crystal aggregates of fossil bones. Proc. Natl Acad. Sci. USA 102, 13783–13788 (2005). , , &
- Survival and recovery of DNA from ancient teeth and bones. J. Archaeol. Sci. 38, 956–964 (2011). , , &
- Ligation bias in illumina next-generation DNA libraries: implications for sequencing ancient genomes. PLoS ONE 8, e78575 (2013). et al.
- Length and GC-biases during sequencing library amplification: a comparison of various polymerase-buffer systems with ancient and modern DNA sequencing libraries. Biotechniques 87–94 (2012). &
- A new strategy for genome assembly using short sequence reads and reduced representation libraries. Genome Res. 20, 249–256 (2010). et al.
- Amplification of TruSeq ancient DNA libraries with AccuPrime Pfx: consequences on nucleotide misincorporation and methylation patterns. STAR 1, STAR2015112054892315Y.0000000005 (2015). et al.
- Palindromic sequence artifacts generated during next generation sequencing library preparation from historic and ancient DNA. PLoS ONE 9, e89676 (2014). et al.
- Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA. Nat. Protoc. 8, 737–748 (2013). &
- Whole-genome shotgun sequencing of mitochondria from ancient hair shafts. Science 317, 1927–1930 (2007). et al.
- Selective enrichment of damaged DNA molecules for ancient genome sequencing. Genome Res. 24, 1543–1549 (2014). &
- Targeted retrieval and analysis of five Neandertal mtDNA genomes. Science 325, 318–321 (2009). et al.
- Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PLoS ONE 5, e14004 (2010). , &
- Massive migration from the steppe was a source for Indo-European languages in Europe. Nature http://dx.doi.org/10.1038/nature14317 (2015). et al.
- Partial uracil–DNA–glycosylase treatment for screening of ancient DNA. Phil. Trans. R. Soc. B 370, 20130624 (2015). , , , &
- Targeted investigation of the Neandertal genome by array-based sequence capture. Science 328, 723–725 (2010).
This paper reports the first characterization of an ancient exome using target enrichment approaches on microarrays.
- A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 23, 553–559 (2013). et al.
- Mitochondrial phylogenomics of modern and ancient equids. PLoS ONE 8, e55950 (2013). et al.
- Patterns of coding variation in the complete exomes of three Neandertals. Proc. Natl Acad. Sci. USA 111, 6666–6671 (2014). et al.
- DNA analysis of an early modern human form Tianyuan Cave, China. Proc. Natl Acad. Sci. USA 110, 2223–2227 (2013).
This paper describes a target enrichment procedure exploiting millions of DNA probes cleaved from user-designed DNA microarrays to characterize the almost complete sequence of the non-repetitive fraction of chromosome 21 for an ~40,000-year-old human.
- Pulling out the 1%: whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. Am. J. Hum. Genet. 93, 852–864 (2013).
This paper reports the first whole-genome target enrichment method, which makes use of self-generated RNA probes. The method substantially reduces the operational cost of target enrichment and allows genetic analyses of specimens with only minute amounts of aDNA templates.
- Ancient whole genome enrichment using baits built from modern DNA. Mol. Biol. Evol. 31, 1292–1295 (2014). et al.
- Comparative performance of two whole-genome capture methodologies on ancient DNA Illumina libraries. Methods Ecol. Evol. http://dx.doi.org/10.1111/2041-210X.12353 (2015). et al.
- Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic Acids Res. 38, e87 (2010).
This paper presents an enzymatic procedure based on the treatment of DNA extracts with USER mix, which can considerably reduce the sequencing error rate of ancient genomes by limiting the effect of nucleotide misincorporations at damaged sites.
- Efficient cross-species capture hybridization and next-generation sequencing of mitochondrial genomes from noninvasively sampled museum specimens. Genome Res. 21, 1695–1704 (2011). , , &
- Morphological and genetic evidence for early Holocene cattle management in northeastern China. Nat. Commun. 4, 2755 (2013). et al.
- Rodents of the Caribbean: origin and diversification of hutias unraveled by next-generation museomics. Biol. Lett. http://dx.doi.org/10.1098/rsbl.2014.0266 (2014). et al.
- Tracking niche variation over millennial timescales in sympatric killer whale lineages. Proc. Biol. Sci. 280, 20131481 (2013). et al.
- Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc. Natl Acad. Sci. USA 108, E746–E452 (2011). et al.
- Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012). , &
- Application and comparison of large-scale solution-based DNA capture-enrichment methods on ancient DNA. Sci. Rep. 1, 73 (2011). et al.
- Hybrid selection of discrete genomic intervals on custom-designed microarrays for massively parallel sequencing. Nat. Protoc. 4, 960–974 (2009). et al.
- Parallel detection of ancient pathogens via array-based DNA capture. Phil. Trans. R. Soc. B 370, 20130375 (2015). et al.
- Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nat. Protoc. 9, 1056–1082 (2013).
This paper presents a fully automated pipeline performing all sequence analyses associated with re-sequencing genomic projects, phylogenomic inference and metagenomic profiling. It is applicable to both modern and ancient sequence data sets.
- AdapterRemoval: easy cleaning of next-generation sequencing reads. BMC Res. Notes 5, 337 (2012).
- Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009). &
- Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012). &
- The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010). et al.
- ExaML version 3: a tool for phylogenomic analyses on supercomputers. Bioinformatics http://dx.doi.org/10.1093/bioinformatics/btv184 (2015). , &
- Metagenomic microbial community profiling using unique clade-specific marker genes. Nat. Methods 9, 811–814 (2012). et al.
- Improving ancient DNA read mapping against modern reference genomes. BMC Genomics 13, 178 (2012). et al.
- Adaptable probabilistic mapping of short reads using position specific scoring matrices. BMC Bioinformatics 15, 100 (2014). , , &
- mapDamage: testing for damage patterns in ancient DNA sequences. Bioinformatics 27, 2153–2155 (2011). , , , &
- Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal. Proc. Natl Acad. Sci. USA 111, 2229–2234 (2014). et al.
- SNPest: a probabilistic graphical model for estimating genotypes. BMC Res. Notes 7, 698 (2014). , &
- Origins and genetic legacy of Neolithic farmers and hunter-gatherers in Europe. Science 336, 466–469 (2012). et al.
- Fragmentation of contaminant and endogenous DNA in ancient samples determined by shotgun sequencing; prospects for human paleogenomics. PLoS ONE 6, e24161 (2011). et al.
- Genomic affinities of two 7,000-year-old Iberian hunter-gatherers. Curr. Biol. 22, 1494–1499 (2012). et al.
- The Neandertal genome and ancient DNA authenticity. EMBO J. 28, 2494–2502 (2009). et al.
- Genetic history of an archaic hominin group from Denisova Cave in Siberia. Nature 468, 1053–1060 (2010). et al.
- High-resolution analysis of cytosine methylation in ancient DNA. PLoS ONE 7, e30226 (2012). et al.
- Genomic methylation patterns in archaeological barley show de-methylation as a time-dependent diagenetic process. Sci. Rep. 4, 5559 (2014). et al.
- Molecular breeding of polymerases for amplification of ancient DNA. Nat. Biotech. 25, 939–943 (2007). et al.
- The origin and evolution of maize in the American Southwest. Nat. Plants 1, 14003 (2015). et al.
- Toward a new history and geography of human genes informed by ancient DNA. Trends Genet. 30, 377–389 (2014). &
- Deep sequencing of RNA from ancient maize kernels. PLoS ONE 8, e50961 (2013). et al.
- Using archaeogenomic and computational approaches to unravel the history of local adaptation in crops. Phil. Trans. R. Soc. B 370, 20130377 (2015). et al.
- Proteomic analysis of a Pleistocene mammoth femur reveals more than one hundred ancient bone proteins. 11, 917–926 (2012). et al.
- Unlocking ancient protein palimpsests. Science 343, 1320–1322 (2014). , &
- Direct evidence of milk consumption from ancient human dental calculus. Sci. Rep. 4, 7104 (2014). et al.
- Sequencing ancient calcified dental plaque shows changes in oral microbiota with dietary shifts of the Neolithic and Industrial revolutions. Nat. Genet. 45, 450–455 (2013). et al.
- Pathogens and host immunity in the ancient human oral cavity. Nat. Genet. 46, 336–344 (2014). et al.
- 2014). How to Clone a Mammoth — the Science of De-extinction (Princeton University Press,
- Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014). et al.
- Genome sequence of a 45,000-year-old modern human from western Siberia. Nature 514, 445–449 (2014). et al.
- The complete mitochondrial DNA genome of an unknown hominin from southern Siberia. Nature 464, 894–897 (2010). et al.
- 400,000-year-old mitochondrial genome questions phylogenetic relationships amongst archaic hominins. Bioessays 36, 598–605 (2014).
- Revising the human mutation rate: implications for understanding human evolution. Nat. Rev. Genet. 13, 745–753 (2012). &
- Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009). , &
- bammds: a tools for assessing the ancestry of low-depth whole-genome data using multidimensional scaling (MDS). Bioinformatics 30, 2962–2964 (2014). et al.
- Investigating population history using temporal genetic differentiation. Mol. Biol. Evol. 31, 2516–2527 (2014). , , , &
- Testing for ancient admixture between closely related populations. Mol. Biol. Evol. 28, 2239–2252 (2011). , , &
- Ancient admixture in human history. Genetics. 192, 1065–1093 (2012). et al.
- Effect of ancient population structure on the degree of polymorphism shared between modern human populations and ancient hominins. Proc. Natl Acad. Sci. USA 109, 13956–13960 (2012). &
- The date of interbreeding between Neandertals and modern humans. PLoS Genet. 8, e1002947 (2012). , , , &