The efficacy of high-throughput sequencing and target enrichment on charred archaeobotanical remains

Nistelberger, H. M.; Smith, O.; Wales, N.; Star, B.; Boessenkool, S.

doi:10.1038/srep37347

Download PDF

Article
Open access
Published: 24 November 2016

The efficacy of high-throughput sequencing and target enrichment on charred archaeobotanical remains

H. M. Nistelberger¹,
O. Smith²,
N. Wales³,
B. Star¹ &
…
S. Boessenkool¹

Scientific Reports volume 6, Article number: 37347 (2016) Cite this article

3717 Accesses
26 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The majority of archaeological plant material is preserved in a charred state. Obtaining reliable ancient DNA data from these remains has presented challenges due to high rates of nucleotide damage, short DNA fragment lengths, low endogenous DNA content and the potential for modern contamination. It has been suggested that high-throughput sequencing (HTS) technologies coupled with DNA enrichment techniques may overcome some of these limitations. Here we report the findings of HTS and target enrichment on four important archaeological crops (barley, grape, maize and rice) performed in three different laboratories, presenting the largest HTS assessment of charred archaeobotanical specimens to date. Rigorous analysis of our data – excluding false-positives due to background contamination or incorrect index assignments – indicated a lack of endogenous DNA in nearly all samples, except for one lightly-charred maize cob. Even with target enrichment, this sample failed to yield adequate data required to address fundamental questions in archaeology and biology. We further reanalysed part of an existing dataset on charred plant material, and found all purported endogenous DNA sequences were likely to be spurious. We suggest these technologies are not suitable for use with charred archaeobotanicals and urge great caution when interpreting data obtained by HTS of these remains.

The Persian plateau served as hub for Homo sapiens after the main out of Africa dispersal

Article Open access 25 March 2024

Leonardo Vallini, Carlo Zampieri, … Luca Pagani

A remarkable assemblage of petroglyphs and dinosaur footprints in Northeast Brazil

Article Open access 19 March 2024

Leonardo P. Troiano, Heloísa B. dos Santos, … Aline M. Ghilardi

Unveiling unique microbial nitrogen cycling and nitrification driver in coastal Antarctica

Article Open access 12 April 2024

Ping Han, Xiufeng Tang, … Guitao Shi

Introduction

Advances in DNA extraction methodology and sequencing technology have allowed for the field of plant archaeogenetics – DNA analysis of archaeological plant remains – to flourish over the last decade^1,2. This has increased our ability to taxonomically identify specimens, examine genetic relatedness to contemporary varieties, infer various functional and phenotypic characteristics of ancient specimens and study the history of plant domestication². Nevertheless, finding suitable sources of ancient DNA (aDNA) in plant species has been problematic due to the rapid decomposition of most plant material and the presence of PCR inhibitors in preserved material such as wood and seeds^3,4.

The most abundant sources of plant archaeological material available are charred^5,6,7,8. Thousands of charred seeds have been found in numerous archaeological sites under different states of preservation, with some deposits as old as the Stone Age⁹. The utility of charred material in archaeogenetics is questionable, with studies reporting variable success¹. Experimental studies on modern material have shown the extent of damage in charred material is due to a combination of temperature, time and oxidation^10,11,12. Ancient remains have the added disadvantage of DNA degradation accumulating over time¹³. Despite such damage, several studies have reported successful extraction and amplification of DNA from charred plant material from a range of species including peas^3,14, wheat^6,15,16, rice¹⁷, grapes¹⁸, maize¹⁹ and radish²⁰. Yet other studies reported failure to amplify authentic DNA from charred material^21,22,23,24, suggesting a high degree of stochasticity in successful experiments, compounded by the likelihood of bias toward publishing positive results²⁵ and the absence of a formal method to assess the extent of charring.

A major hurdle when working with charred plant DNA, both modern and ancient, has been that the short DNA fragment lengths, typically <60 bp, are difficult to amplify via PCR^2,26. High-throughput sequencing (HTS) overcomes this limitation, allowing for sequencing of very short DNA fragments²⁷. Moreover, techniques such as target enrichment now allow for the preferential sequencing of DNA sequences of interest, regardless of fragment length^{28,29,30,31,32}. Owing to these benefits, a combination of these techniques has been suggested as the future method of choice when working with charred, archaeobotanical material^2,8. To date, the only study to examine the use of HTS on charred material describes successful recovery of barley, wheat and millet sequences from a 3300-year-old charred cereal assemblage, and discusses the potential for techniques such as target enrichment to enable sequencing of specific genes or regions of interest⁸.

Here we have combined the results of independent studies of four domesticated plant species Hordeum vulgare L (barley), Vitis Vinifera L (grape), Zea mays L (maize) and Oryza sativa L (rice) using a combination of shotgun sequencing (all species) and target enrichment (barley, maize and rice) in order to assess the utility of HTS in aDNA studies of charred plant material. The specimens used range in age from 4450 calibrated years before present (YBP) to 550 YBP and represent a range of preservation states commonly encountered at archaeological sites. Our aim was to determine whether we could generate sufficient, authentic data from charred material using HTS that allows further downstream analyses relevant to the fields of archaeology and biology. We further re-analysed a study that has reported endogenous DNA from charred cereal grains over 3000 years old⁸.

Results

Read characteristics

The number of raw DNA sequencing reads (Illumina technologies) obtained ranged from 135 982 in Maize3 to 43 339 302 in Blank_maize1 for shotgun libraries (Table 1). Read lengths were constrained by the sequencing mode, although were generally short, with averages ranging from 63 bp to 228 bp (Table 1). The percentage of mapped reads identified as duplicates was highly variable across both species and samples, ranging from 0% (Barley3a, Barley8, Rice9) to 94.5% (Rice15) (Table S1). On average, duplicates were nearly five times more prevalent in enrichment libraries (average of 40.8% of reads mapped) than in samples that were shotgun sequenced (average of 7.9%).

Table 1 Sample details and characteristics including: sequencing platform, paired-end (pe) or single-end (se), maximum read length (bp); whether libraries were shotgun sequenced or subjected to target enrichment, whole genome (WG) or solid-state(SS); double stranded (ds) or single stranded (ss) library build; average read length (bp); total number of raw reads, or read pairs if paired-end; percentage of reads that mapped to sample genomes after duplicate removal.

Full size table

Reads mapped to the reference genomes

Post-filtering, the percentage of reads that mapped to sample genomes ranged from 0% (Rice 10 and Rice12a) to 0.12% (Barley5) (Table 1). There was no significant difference between the number of reads mapped to the respective genomes in libraries that had been enriched (independent of enrichment method) as opposed to shotgun sequenced when standardised for sequencing effort (Mann-Whitney U = 193, n1 = 24 n2 = 20, P > 0.05 two tailed). Although statistical testing was not possible due to small sample sizes, when comparing the different enrichment methods applied to the rice libraries, we observed more reads mapping in the target enriched libraries (avg. 0.01%), followed by whole genome (WG) enriched libraries (avg. 0.004%) and solid-state (SS) enriched libraries (avg. 0.0008%) (Table 1).

When mapping reads to the genomes of the other taxa included in the study, a higher percentage of reads were found to map to other, non-target genomes compared to the percentage of reads mapping to the target genome in the majority of cases (Table 2). In particular, on average, a greater proportion of reads mapped to the barley genome with 35 of our 51 samples mapping better to barley than to the other genomes (Table S2). The least amount of reads mapped to the rice genome (Table 2).

Table 2 Average percentage of reads from all barley, grape, maize, rice and blank samples that map to the four reference genomes.

Full size table

aDNA damage patterns and sample bleeding

The majority of samples did not yield reads with the typical fragmentation and mis-incorporation patterns associated with aDNA, though for samples with few aligned reads there was insufficient data to obtain meaningful distributions from mapDamage. Nonetheless, for those with sufficient data, a total of 15 libraries yielded reads that showed typical aDNA damage patterns after aligning these reads to the grape genome (Supplementary Information, Table S2 and Fig. S2). These reads were sequenced from libraries of three grape samples, seven maize samples (including both libraries from Maize8), one maize extraction blank and three barley extraction blanks. Irrespective of the sample origin, aDNA damage patterns were only observed when reads were mapped to the grape genome, and not when mapped to any of the other three genomes included in this study. Furthermore, all libraries in which we observed these aDNA damage patterns were sequenced at the Danish National High-throughput DNA Sequencing Centre, and were sequenced in pools that contained additional libraries from non-charred ancient grape samples with high (10% to 69%) endogenous DNA that were not part of the present study (as well as libraries from non-charred maize with low endogenous DNA and libraries from other taxa). Given that aDNA damage patterns were only observed in those samples that were sequenced together with ancient grape samples (with high endogenous DNA content), we investigated if these damage patterns could originate from reads that were falsely assigned to the charred samples, i.e. due to “sample bleeding”. Sample bleeding is a recognized, but arguably underappreciated technical error caused by Illumina hardware and software, leading to a very small proportion of reads erroneously being assigned to another sample in a multiplexed run³³ (see discussion and methods). By directly observing the index of short mapped reads (Supplementary Information Fig. S3)³³, we indeed found a significant increase of incorrect-index assignments of non-charred grape samples in those reads that mapped to the grape genome in the 15 libraries with aDNA damage patterns. We could trace 4% to 39% of these reads back to non-charred grape samples with high endogenous DNA sequenced in the same pool. Such levels of false-index assignment are orders of magnitude higher compared to the background levels of grape sample bleeding (between 0.002% to 0.09%) observed in the non-mapping read data (Wilcoxon Signed Rank, W = 0, N = 16, p < 0.05) (Table S3). In other words, the grape aligned read data in these charred samples and these extraction blanks are significantly more likely to have originated from erroneous aDNA sources compared to other reads in these libraries.

Metagenomic analysis

The majority of reads generated from the four species libraries were either bacterial in origin (43–72%) or unassignable (21–55%) according to analysis with BLASTn and MEGAN (Table 3). This was followed by hits to eukaryote genomes (2–8%), other plants (0.4–2%) and the target species (0.01–0.16%). An average of 84.7% of reads generated from the extraction blanks were unassignable, with the rest determined as mostly bacterial in origin. Detailed assessments of each sample are presented in the supplementary information (Supplementary Information Table S4).

Table 3 Percentage taxonomic assignments of reads averaged across each species and the series of blanks determined using MEGAN.

Full size table

PIA filtering

All metagenomic BLASTn analyses produced hits on the target species, but following Phylogenetic Intersection Analysis (PIA³⁴) and filtering for low taxonomic diversity within the larger landscape of BLAST hits plus further filtering for coverage length (95% and 99%), only eight samples retained positive hits. These samples were Barley1a, Barley5, Grape3, Grape4, Grape5, Maize5 (all 1–3 hits) and Maize8a (54 and 37 hits for 95% and 99% coverage respectively) and Maize8b (114 and 82 hits for 95% and 99% coverage respectively; Table 4). The Maize8 library that had been subjected to target enrichment (Maize8b) produced twice as many hits as the library that was shotgun sequenced with similar sequencing depth (Table 4). Of the PIA filtered reads from Maize 8a (shotgun library), 11 of the 54 (95% coverage) and 1 of the 37 (99% coverage) were non-duplicate reads that mapped back to the maize genome. For Maize 8b (capture library), 15 of the 114 (95% coverage) and 1 of the 82 (99% coverage) were non-duplicate reads that mapped back to the maize genome.

Table 4 Results of BLASTn and Phylogenetic Intersect Analysis (PIA) showing the number of reads BLASTED after duplicate removal using Prinseq, the number of metagenomic BLAST hits on the sample species, Post PIA filtering hits at 95% coverage (>0.2 taxon diversity) and 99% coverage (>0.2 taxon diversity).

Full size table

BLASTn analyses of the 496 purportedly endogenous reads from Bunning et al.⁸ resulted in a majority of reads producing hits to Mus musculus (domestic mouse; Supplementary Information Table S5). Only 0.2% were assigned to one of the listed taxa, Hordeum vulgare, but none of these reads remained following PIA filtering. Using RepeatMasker we further identified 195 of the 496 sequences as containing regions of low complexity or simple repeats (Supplementary Information Table S6).

Discussion

PCR-based aDNA studies have highlighted the difficulties of working with charred archaeobotanical remains, showing that endogenous DNA is often inaccessible, and highly damaged when it is recovered⁸. While HTS has opened new doors to paleogenomic approaches for many species and tissue types, HTS of charred archaeobotanical specimens remains relatively unexplored. We have evaluated shotgun HTS and target enrichment in independent studies of 38 archaeological remains from four species, and failed to retrieve sufficient authentic DNA data to address basic archaeological and biological questions from our specimens. Given that most ancient plant remains are preserved via charring, this is an especially disappointing revelation. Below we discuss the problems associated with the low or absent endogenous DNA content present in charred specimens and argue the need for thorough analytical approaches to avoid spurious conclusions on DNA authenticity.

Using laboratory and analytical pipelines optimized for aDNA, we found exceptionally low percentages of reads (from 0 to 0.12%) mapping to the target genomes in all 38 samples. Low endogenous DNA content is a common characteristic of ancient DNA specimens²⁷ with values often falling below 1%³⁵. This in itself does not preclude the presence of authentic reads in samples, although it does require an often prohibitory amount of sequencing in order to yield sufficient data for meaningful downstream analyses³⁵.

In order to evaluate the authenticity of our data we mapped all sample reads to the genomes of the three other taxa used in this study. Short reads that may contain sequence errors or mutations are notoriously difficult to align and can result in their mapping to multiple locations within a genome or even to multiple genomes^36,37. We observed similar, and at times greater numbers of reads mapping to the non-target genomes. Moreover, in every case the extraction blanks, on average, had a higher percentage of reads mapping to all four genomes than the respective specimen samples. Preferential mapping of short reads to certain genomes may depend on factors such as genome size and complexity. In our study, the average highest percentage of reads (regardless of sample origin) was found to map to the large barley genome (5.3 Gbp), whereas the lowest percentage mapped to the smaller rice genome (0.4 Gbp). The mapping of short reads to multiple genomes does not necessarily preclude the presence of authentic DNA³⁶, yet when reads map equally well or better to a number of unrelated genomes, the authenticity of the reads is questionable. The reads mapping to the target genomes may therefore not represent endogenous DNA and instead may be an artefact of spurious mapping of short reads.

Analysis of DNA damage can also be used to support the authenticity of the reads obtained. In our case the use of mapDamage served to highlight another issue that can arise when working with very small numbers of reads–that of the potential for sample bleeding to occur between samples run on the same sequencing lane³³. Sample bleeding can occur via two processes, the introduction of errors during PCR or sequencing, or over-clustering/mixed clusters on the flow cell^33,38. The former issue can be mitigated by using indexes that are dissimilar, for example differing by at least 3 nucleotides, a strategy employed in this study. The latter issue is caused by incorrect index assignment on the Illumina flowcell, leading one read cluster from a sample being assigned the index of a neighbouring read cluster from another sample. This problem is not detectable in most cases, and should have negligible impact when high coverage filters are implemented in downstream analyses. The identification of what appeared to be authentic ancient grape reads based on damage patterns in 15 of our samples (three charred grape, seven charred maize and four extraction blanks), was found to be associated with significant increases in false-index assignments, linking these damage patterns to reads from uncharred ancient grape samples with high endogenous DNA content sequenced in the same pool, rather than to the samples themselves. False-index assignments (sample-bleeding) occurs in an estimated ~0.3% of reads when using single-indexed libraries³⁸, and in most circumstances sample-specific, endogenous DNA reads greatly outnumber such erroneously assigned reads. However, false-index pairings can be particularly problematic in studies that assess rare variants or when emphasis is placed on a highly limited number of reads³⁸. In our case, we observed less than 0.3% of reads aligning to any of the respective reference genomes, which makes our data particularly vulnerable to the confounding effects of false-index assignments. We caution researchers in the aDNA field against making conclusions based on very low read numbers particularly when samples have been pooled together with high endogenous ancient libraries of the same taxa.

Analysis of the metagenome has begun to receive more attention in recent aDNA studies allowing for additional information to be obtained from samples^39,40,41,42. When HTS output of libraries from charred material indicates low endogenous content, metagenomic analysis may, for example, reveal misidentification of the archaeological specimen. Charring can impact seed morphology, hampering specimen identification, particularly when working with mixed charred assemblages^1,43. In these cases, if endogenous DNA remains, the metagenome may reveal the true specimen identity. Metagenomic analysis of our samples indicated the majority of reads were microbial in origin or un-assignable, with the remainder identified as either eukaryotic or plant in origin. This contrasts to another published metagenome analysis of charred grains which identified the majority of reads as metazoan in origin followed by green plants⁸ (but see below). Very few reads produced BLAST hits to the target species in our study (less than 0.2%) and these were filtered using PIA to validate authenticity. PIA filtering is robust at identifying or dismissing sequence data as probabilistically ‘genuine’ from BLAST outputs, due to its consideration for database bias in favour of model organisms and its ability to identify threshold-scoring yet spuriously-assigned reads with only superficial similarity to their closest database match (see ref. 34 for further details). The PIA algorithm also gives a probabilistic assignment to a given sequence recursively at descending taxonomic ranks (i.e. where a read matching barley can be dismissed as being conclusively Hordeum sp., it might be confidently assigned at a higher taxonomic rank). PIA filtering indicated that in our data only eight of the 44 samples contained sequences that passed justifiably stringent levels of filtering. Of these, six of the samples (two barley, three grape and one maize) were left with so few remaining reads (≤3) that we would consider them inconsequential, particularly given the possibility of contamination (see below). Two of the maize libraries – represented by one shotgun and one target enrichment library of the same specimen - retained higher numbers of reads post-filtering with 95% coverage, which although less stringent than the 99% filter may better account for near-terminal cytosine deamination. However, further investigation revealed that of the reads that could be mapped back to the maize genome, many were duplicates leaving just 11 non-duplicate reads from the shotgun library and 15 non-duplicate reads from the capture library (26 reads total from Maize8 specimen). The presence of duplicates in the filtered data was surprising given these samples had previously been run through duplicate removal software and highlights the importance of cross checking all data post-analysis. The 26 non-duplicate reads were derived from a portion of a cob dated to circa 3960 YBP. This specimen had been identified as lightly charred and it could be that less charred portions of the cob retained endogenous DNA, warranting further investigation (Fig. S1).

By mapping our data to other genomes, using PIA filtering of our BLASTn results, and further analysing these results, we have rigorously tested the authenticity of all putatively endogenous reads in our dataset, showing extremely low success rates in our samples. Bunning et al.⁸ reported 496 reads from barley, einkorn, emmer and broomcorn millet from a 3300-year-old charred grain assemblage using SOLiD 5500 sequencing. Our reanalysis of these reads using the current NCBI database does not support this conclusion, with the majority of reads receiving either no hits or producing hits to Mus musculus. Although 0.2% of reads produced hits to barley, none of these remained following PIA filtering. Further analysis showed over 40% of the reads were identified as either low complexity or as containing simple repeats, which can produce spurious results in BLAST database searches⁴⁴. Our reanalysis of these data differed from the published work in that we BLASTed against the entire NCBI database as opposed to using a specific cereals database. This discrepancy emphasizes the great impact of reference database selection in aDNA analyses and the corresponding repercussions on reliable taxonomic assessments.

Metagenomic profiles of extraction blanks provide possibilities to identify the level and nature of contamination present in lab reagents used⁴⁵. Metagenomic analysis of the extraction blanks used in our study indicated fewer microbial reads than the charred sample libraries. The extraction blanks were in fact relatively low in bacterial DNA in comparison to control samples sequenced from other laboratories, as assessed in Salter et al.⁴⁵. That study showed the composition of common laboratory contaminants to consist of over 90% bacterial reads⁴⁵. The relatively low bacterial presence in our extraction blanks may reflect the effect of the stringent precautions taken in aDNA laboratories compared to laboratories where contemporary samples are analysed. Alternatively the unassigned hits may also be bacterial in origin but are underrepresented in the sequence database.

Contamination of ancient samples with modern DNA remains a serious concern, necessitating adherence to strict precautions when working with aDNA⁴⁶. Despite the most stringent efforts, however, contamination can still occur and may be more likely when working with ubiquitous commercial crops, such as members of the Triticeae^47,48. This is particularly problematic given many charred archaeobotanical studies aim to investigate aspects of historical domestication and the spread of agriculture in these species²⁶. Even if the aim of analysing charred material is to taxonomically identify the charred specimen, the risks of underlying contamination and hence false positives needs to be considered. To better understand this issue, future studies that examine the background levels of contamination by modern crops in deep-sequenced datasets would provide a useful baseline for understanding and potentially quantifying this risk.

Although charring and carbonization is known to destroy DNA, the process by which this occurs is less well understood^1,7,10. Fully carbonized material, where remains have been completely converted to inorganic material, is expected to be devoid of endogenous DNA². Assessing the degree of charring in archaeological specimens prior to processing is difficult, but recent insights into the changing carbon and nitrogen isotope values over different charring conditions may help archaeologists assess the degree of charring prior to use in aDNA studies⁴⁹. Larger plant structures may not always be charred throughout, allowing for the persistence of small amounts of endogenous DNA. Accordingly, the one sample in our study from which we may have retrieved authentic target DNA was a maize cob, which is larger than rice, barley or grape seeds. Nevertheless, the number of reads retrieved from this sample is extremely low and too little for any further analyses on functional traits or demography, which would require several orders of magnitude more data.

Although we used the most optimal protocol currently known for extracting DNA from botanical remains⁵⁰ for three of the taxa (barley, maize and grape), advances in aDNA extraction methodology are continually improving the volume and size distribution of DNA data obtained from archaeological material⁵¹. Future studies of charred material may benefit from testing protocols specifically aimed at recovering ultra short DNA fragments (<50 bp, e.g. ref. 52).

Conclusion

Charring is the most common form of archaeobotanical preservation, yet such remains have long vexed aDNA researchers due to inconsistent success, beyond what is commonly observed in most aDNA studies¹. HTS and target enrichment have been suggested as promising solutions that would enable the recovery of aDNA from charred remains, ultimately providing the technology to investigate a range of archaeological and biological questions. Regrettably, we report that based on our independent studies of four plant species and the reanalysis of an earlier dataset, charred plant material appears to be largely incompatible with these technologies. For a combined cost of over 16 000 € (laboratory and sequencing costs only), our four studies have yielded a total of 26 potentially authentic sequences from one lightly charred specimen, out of a total of more than 200 million reads and 38 unique samples. Coupled with the substantial investment in time and money required to process charred samples we expect most HTS experiments of charred material will not yield sufficient reliable genetic data. We urge a great degree of caution to future researchers who would invest in charred material for archaeogenetic purposes and suggest all data be carefully scrutinized for false-positives resulting from non-stringent analyses or originating from contamination. Future studies that develop a cost effective means of evaluating the degree of charring present in archaobotanicals prior to their processing may provide useful developments in the field.

Materials and Methods

Samples

Barley

Eight archaeological barley seeds, four excavated from Quoygrew, Orkney islands (950–850 calibrated YBP, from a well-stratified midden deposit⁵³) and four from Kaupang, southern Norway (ca. 1150 YBP from waterlogged pitfalls⁵⁴) were provided by the University of Cambridge and the Museum of Cultural History, University of Oslo, respectively. Seeds were light, fragile and appeared to be partially or fully carbonised based on colour and composition (Fig. 1).

Grape

Five archaeological grape seeds originated from Tell Tayinat, in southern Turkey. Two of the seeds were fully carbonised and dated based on stratigraphy and association with diagnostic artefacts to the Early Bronze Age, ca. 4450–3950 calendar YBP, and the remaining three seeds were less carbonised and dated to the Iron Age ca. 3050–2500 YBP (pers. com. Doga Karakaya) (Fig. 1).

Maize

A total of eight archaeological maize samples were tested within this project. One sample was excavated from the Montoya Site in the Cañada Alamosa, New Mexico and provided by Human Systems Research. A portion of the partially charred cob has been directly AMS dated to 3925 calibrated YBP. The seven other maize samples were excavated and provided by Arizona State University. Three heavily carbonised specimens consisting of cobs with attached kernels come from Barton Creek Cave, a Maya site from the Late to Terminal Classic Era (ca. 1350–950 YBP) in the Cayo District of Western Belize. Three cobs come from Non-Grid 4, an Epiclassic human sacrifice shrine site in the Northern Basin of Mexico (ca. 1350–1050 YBP). An additional heavily charred cob originates from a chinampa canal at Xaltocan, a Postclassic site (ca. 750–550 YBP) in the Northern Basin of Mexico (Fig. 1).

Rice

Grains from a total of seven archaeological accessions from sites across the Indian subcontinent, Thailand, and the Comoros Islands were excavated and provided by the University College London and Oxford University. The carbon-14 calibrated dated sites included Sima (Comoros; ca. 1265–965 YBP), Ter, Balathal (India; ca. 2145–1990 YBP and 2345–2155 YBP), Ban Non Wat (two contexts; ca. 2655–2185 YBP), Noen Ul Loek (ca. 1695–1535 YBP and Non Ban Jak (Thailand; Iron Age). All samples were light, porous, fragile and heavily carbonised (Fig. 1).

DNA extractions

All DNA extractions and library builds were carried out in dedicated ancient DNA laboratories at the University of Oslo (barley), the University of Copenhagen (grape and maize) and University of Warwick (rice and three barley seeds); all of which adhere to the highest standards of aDNA quality control⁴⁶. Originally, these were four independent experiments not intended for publication together and as a result methods vary amongst species and laboratories.

The barley, grape and maize samples were extracted using the methodology of Wales et al.^50,55. Treatment of the charred material prior to extraction and minor modifications made are detailed in the Supplementary Information. The rice was extracted using a modified DNEasy protocol (Qiagen) (see Supplementary Information for details). All extraction experiments included negative controls.

Library Preparation

DNA libraries of barley extracts were built using both a single stranded (ss) DNA library preparation protocol⁵⁶ and a double stranded (ds) DNA library preparation protocol⁵⁷. For both library builds, sample-specific seven bp indexes in the P7 primer were used⁵⁷. Details on the library preparation are provided in the Supplementary Information.

DNA libraries for grape and maize were also constructed using dsDNA library preparation protocol⁵⁷. The method was similar to that used for the barley samples, but with a few differences in reaction volumes and purification strategies described in the Supplementary Information.

Rice libraries were constructed using Illumina TruSeq Nano kits, following the manufacturer’s instructions. Modifications are listed in the Supplementary Information.

Target Enrichment

Six barley libraries (four built using the ss protocol and two using the ds protocol) were subjected to target enrichment using a custom-designed MYbaits kit (MYcroarray, Ann Arbor, Michigan) consisting of 25029 biotinylated RNA probes (80-mer length, 4 x flexible tiling density). For rice, three separate enrichment approaches were applied: whole-genome in-solution, targeted in-solution, and solid-state targeted that utilises an array chip for hybridization (Table 1). All formats were supplied by MYcroarray. One maize specimen (Montoya) was enriched for 348 genes using a targeted in-solution hybridization MYbaits kit (MYcroarrary). Details of the target enrichment design can be found in the Supplementary Information.

Sequencing

HTS platforms used for each library are provided in Table 1. Sequencing of the libraries was carried out at the Norwegian Sequencing Centre (barley), Danish National High-throughput DNA Sequencing Centre (grape, maize and barley extraction blanks) and the University of Warwick (rice). See Supplementary Information for more details on quantification and pooling.

Data Filtering and Analysis

Raw reads from all four species were collapsed (when paired-end sequenced), trimmed of adapters and truncated where necessary using AdapterRemoval v. 2.1.2⁵⁸ with the following settings: –qualitybase 33 –minlength 25 –mm 3 –trimns –trimqualities. Reads from each sample were mapped against the following four reference genomes: Hordeum vulgare 082214v1.29, Zea mays AGPv3.30, Vitis Vinifera IGGP_12 × 30, Oryza sativa IRGSP-1.0.30 downloaded from the ENSEMBL database. To evaluate the authenticity of the mapped reads, all reads were mapped to all four genomes used in the study. Reads were mapped using the bwa aln and samse algorithms, with seeding disabled and -o 1 and -n 0.03, following recommendations in Schubert et al.⁵⁹. SAM files were converted to BAM files and sorted using Samtools v1.1, keeping only those reads with a minimum mapping quality score (MapQ) of 25. Duplicates were removed with MarkDuplicates from Picard Tools v.1.96 (http://picard.sourceforge.net/). Finally, we obtained aDNA damage patterns using mapDamage v.2.0.6^60,61 for reads from all libraries mapped to each of the four genomes (i.e. from four BAM files per library). After observing aDNA damage patterns in several libraries when mapped to the grape genome only (see results) we investigated the potential for false-index assignment (i.e. sample bleeding)³³ from non-charred grape samples that were sequenced in the same pool. The Illumina platform uses a separate set of index cycles to read the sample-specific index, and computationally assigns sequencing reads to their respective sample based on that data. Nonetheless, the sequencing cycles may directly observe the index in those cases where the insert is short, leaving sufficient cycles to pass the Illumina specific P7 adapter and the index itself (our data required 38 cycles to cross the P7 adapter and the index, see also Supplementary Information and Fig. S3). It was therefore possible to compare the index used during demultiplexing (i.e. read by the index cycles) to the one in the actual sequencing data (read by the sequencing cycles when inserts are sufficiently short). Hence, we observed the indexes generated by the sequencing cycles in all libraries showing aDNA damage when mapped to grape. In these libraries, we calculated the fraction of correct indexes and the fraction of indexes that belonged to the high endogenous, non-charred grape samples that had been sequenced in the same pool. We did this analysis for both the unaligned sequencing data and the grape aligned BAM files. For this analysis, we only used reads that were short enough for the P7 adapter and the index to be fully sequenced (i.e. read-length minus 40 bp), and we used simple, exact pattern matching (unix; grep) to identify indexes in these sequencing reads. Because HiSeq sequencing data typically has increased levels of sequencing error at read-ends⁶², requesting an exact match of the entire adapter sequence including the index would fail to identify many instances of the adapter and the index. We therefore counted all reads that contained an exact match of the first 12 bp of the P7 adapter (confirming the presence of the adapter in the sequence reads), followed by an exact 6 bp match of the specific index under investigation. This approach allowed for sequence variation in the adapter sequence between the first 12 bp of the adapter and the 6 bp index (a 22 bp stretch).

MEGAN and PIA

Exact duplicates were removed from raw fastq data using Prinseq Lite v. 0.20.4 (-derep 1 -derep_min 2)⁶³. Files were subsequently converted into fasta format and subjected to metagenomic BLAST using the complete NCBI nucleotide database (downloaded 19/02/2015) on a standalone server. To avoid over-sensitivity or over-stringency, default values for seed size and the scoring matrix were used for the BLASTn algorithm. BLAST output was tabulated with taxon IDs appended for downstream analysis. Complete BLAST outputs for each sample were imported into MEGAN 5⁶⁴ using the default parameters. For each sample, reads from the terminal node were exported for the species in question (i.e. barley, grape, maize or rice). Sequence data for each read ID was then recovered from the original data files using the Unix grep function and run through the BLAST program again, using default output format to obtain read length data for downstream analysis. These reads were further analysed using phylogenetic intersection analysis (PIA³⁴) to obtain taxon diversity information and further filtered according to read length coverage³⁴ disregarding reads with less than 95% or 99% coverage to their closest database match. We included results from both 95% and 99% coverage filtering as although the 99% filter is more stringent, the 95% filter may better account for near-terminal cytosine deamination, typical of aDNA. After filtering with PIA we double-checked the quality of the remaining reads in Maize8a and Maize 8b by mapping these back to the maize genome and removing duplicates using the methodology described above.

In addition, we accessed the 496 reads of Bunning et al.⁸ that were reported as reads from barley, einkorn, emmer and broomcorn millet and we reanalysed these data using BLASTn and PIA. The sequences were also run through RepeatMasker⁶⁵ using default settings to assess whether reads could be classified as containing either simple repeats or low complexity.

Additional Information

Accession codes: All individual read data are available at the European Nucleotide Archive (ENA, www.ebi.ac.uk/ena) under study accession number PRJEB15180. http://www.nature.com/srep

How to cite this article: Nistelberger, H. M. et al. The efficacy of high-throughput sequencing and target enrichment on charred archaeobotanical remains. Sci. Rep. 6, 37347; doi: 10.1038/srep37347 (2016).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Palmer, S. A., Smith, O. & Allaby, R. G. The blossoming of plant archaeogenetics. Ann. Anat . 194, 146–156 (2012).
Article CAS PubMed Google Scholar
Brown, T. A. et al. Recent advances in ancient DNA research and their implications for archaeobotany. Veg. Hist. Archaeobot. 24, 207–214 (2015).
Article Google Scholar
Smýkal, P. et al. A comparative study of ancient DNA isolated from charred pea (Pisum sativum L.) seeds from an Early Iron Age settlement in southeast Serbia: inference for pea domestication. Genet. Resour. Crop Evol. 61, 1533–1544 (2014).
Article CAS Google Scholar
Gugerli, F., Parducci, L. & Petit, R. J. Ancient plant DNA: Review and prospects. New Phytol. 166, 409–418 (2005).
Article CAS PubMed Google Scholar
Zohary, D. & Hopf, M. Domestication of plants in the Old World. (Oxford Universiy Press, 2000).
Allaby, R. G., O´Donoghue, K., Sallares, R., Jones, M. K. & Brown, T. A. Evidence for the survival of ancient DNA in charred wheat seeds from European archaeological sites. Anc. Biomol. 1, 119–129 (1997).
CAS Google Scholar
Schlumbaum, A., Tensen, M. & Jaenicke-Després, V. Ancient plant DNA in archaeobotany. Veg. Hist. Archaeobot. 17, 233–244 (2008).
Article Google Scholar
Bunning, S. L., Jones, G. & Brown, T. A. Next generation sequencing of DNA in 3300-year-old charred cereal grains. J. Archaeol. Sci. 39, 2780–2784 (2012).
Article CAS Google Scholar
Nadel, D. et al. Stone Age hut in Israel yields world’s oldest evidence of bedding. Proc. Natl. Acad. Sci. USA 101, 6821–6826 (2004).
Article CAS ADS PubMed PubMed Central Google Scholar
Threadgold, J. & Brown, T. A. Degradation of DNA in artificially charred wheat seeds. J. Archaeol. Sci. 30, 1067–1076 (2003).
Article Google Scholar
Chalfoun, D. J. & Tuross, N. In Ancient Biomolecules 3, 67–79 (1999).
CAS Google Scholar
Boardman, S. & Jones, G. Experiments on the Effect of Charring on Cereal Plant Components. J. Archaeol. Sci. 17, 1–11 (1990).
Article Google Scholar
Pääbo, S. et al. Genetic analyses from ancient DNA. Annu. Rev. Genet. 38, 645–679 (2004).
Article PubMed CAS Google Scholar
Mikić, A. M. The first attested extraction of ancient DNA in legumes (Fabaceae). Front. Plant Sci. 6, 1–4 (2015).
Article ADS Google Scholar
Schlumbaum, A. & Jacomet, S. Coexistence of Tetraploid and Hexaploid Naked Wheat in a Neolithic Lake Dwelling of Central Europe:Evidence from Morphology and Ancient DNA. J. Archaeol. Sci. 25, 1111–1118 (1998).
Article Google Scholar
Bilgic, H., Hakki, E. E., Pandey, A., Khan, M. K. & Akkaya, M. S. Ancient DNA from 8400 Year-Old Çatalhöyük Wheat: Implications for the Origin of Neolithic Agriculture. PLoS One 11, e0151974 (2016).
Article PubMed PubMed Central CAS Google Scholar
Castillo, C. C. et al. Archaeogenetic study of prehistoric rice remains from Thailand and India: evidence of early japonica in South and Southeast Asia. Archaeol. Anthropol. Sci. (2015).
Manen, J.-F. et al. Microsatellites from archaeological Vitis vinifera seeds allow a tentative assignment of the geographical origin of ancient cultivars. J. Archaeol. Sci. 30, 721–729 (2003).
Article Google Scholar
Goloubinoff, P., Paabo, S. & Wilson, A. C. Evolution of maize inferred from sequence diversity of an Adh2 gene segment from arhaeological specimens. Proc. Natl. Acad. Sci. USA 90, 1997–2001 (1993).
Article CAS ADS PubMed PubMed Central Google Scholar
O´Donoghue, K., Clapham, A., Evershed, R. P. & Brown, T. A. Remarkable preservation of biomolecules in ancient radish seeds. Proc. R. Soc. London B 263, 541–547 (1996).
Article ADS Google Scholar
Blatter, R. H. E., Jacomet, S. & Schlumbaum, A. Little evidence for the preservation of a single-copy gene in charred archaeological wheat. Anc. Biomol. 4, 65–77 (2002).
CAS Google Scholar
Brown, T. A., Allaby, R. G., Sallares, R. & Jones, G. Ancient DNA in charred wheats: Taxonomic identification of mixed and single grains. Anc. Biomol. 2, 185–193 (1998).
CAS Google Scholar
Oliveira, H. R. et al. Ancient DNA in archaeological wheat grains: preservation conditions and the study of pre-Hispanic agriculture on the island of Gran Canaria (Spain). J. Archaeol. Sci. 39, 828–835 (2012).
Article CAS Google Scholar
Fernández, E. et al. DNA analysis in charred grains of naked wheat from several archaeological sites in Spain. J. Archaeol. Sci. 40, 659–670 (2013).
Article CAS Google Scholar
Møller, A. P. & Jennions, M. D. Tetsing and adjusting for publication bias. Trends Ecol. Evol. 16, 580–586 (2001).
Article Google Scholar
Brown, T. A. How ancient DNA may help in understanding the origin and spread of agriculture. Philos. Trans. R. Soc. B Biol. Sci. 354, 89–98 (1999).
Article CAS Google Scholar
Knapp, M. & Hofreiter, M. Next Generation Sequencing of Ancient DNA: Requirements, Strategies and Perspectives. Genes. 1, 227–243 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ávila-Arcos, M. C. et al. Application and comparison of large-scale solution-based DNA capture-enrichment methods on ancient DNA. Sci. Rep. 1, 1–5 (2011).
Article CAS Google Scholar
Burbano, H. A. et al. Timing of human protein evolution as revealed by massively parallel capture of Neandertal nuclear DNA sequences. Science. 328, 723–725 (2010).
Article CAS ADS PubMed PubMed Central Google Scholar
Bos, K. I. et al. A draft genome of Yersinia pestis from victims of the Black Death. Nature 478, 506–510 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Kistler, L. et al. Gourds and squashes (Cucurbita spp.) adapted to megafaunal extinction and ecological anachronism through domestication. Proc. Natl. Acad. Sci. USA 112, 15107–15112 (2015).
Article CAS ADS PubMed PubMed Central Google Scholar
Kistler, L. et al. Transoceanic drift and the domestication of African bottle gourds in the Americas. Proc. Natl. Acad. Sci. USA 111, 2937–2941 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Mitra, A., Skrzypczak, M., Ginalski, K. & Rowicka, M. Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using Illumina platform. PLoS One 10, 1–21 (2015).
Google Scholar
Smith, O. et al. Sedimentary DNA from a submerged site reveals wheat in the British Isles 8000 years ago. Science 347, 998–1001 (2015).
Article CAS ADS PubMed Google Scholar
Carpenter, M. L. et al. Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries. Am. J. Hum. Genet. 93, 852–864 (2013).
Article CAS PubMed PubMed Central Google Scholar
Li, H., Ruan, J. & Durbin, R. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 18, 1851–1858 (2008).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. & Salzberg, S. L. How to map billions of short reads onto genomes. Nat. Biotechnol. 27, 455–457 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, 1–8 (2012).
Article CAS Google Scholar
Orlando, L. et al. True single-molecule DNA sequencing of a pleistocene horse bone. Genome Res. 21, 1705–1719 (2011).
Article CAS PubMed PubMed Central Google Scholar
Palmer, S. A. et al. Archaeogenomic evidence of punctuated genome evolution in Gossypium. Mol. Biol. Evol. 29, 2031–2038 (2012).
Article CAS PubMed Google Scholar
Poinar, H. N. et al. Metagenomics to Paleogenomics: Large scale sequencing of Mammoth DNA. Science. 311, 392–394 (2006).
Article CAS ADS PubMed Google Scholar
Schubert, M. et al. Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nat. Protoc. 9, 1056–1082 (2014).
Article CAS PubMed Google Scholar
Smith, H. & Jones, G. Experiments on the Effect of Charring on Cereal Plant Components. J. Archaeol. Sci. 17, 317–327 (1990).
Article Google Scholar
Altschul, S. F., Boguski, M. S., Gish, W. & Wootton, J. C. Issues in searching molecular databases. Nat. Genet. 7, 362–369 (1994).
Article Google Scholar
Salter, S. J. et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 12, 87 (2014).
Article PubMed PubMed Central CAS Google Scholar
Cooper, A. & Poinar, H. N. Ancient DNA: Do it Right or Not at All. Science. 289, 1139 (2000).
Article CAS PubMed Google Scholar
Boessenkool, S. et al. Use of ancient sedimentary DNA as a novel conservation tool for high-altitude tropical biodiversity. Conserv. Biol. 28, 446–455 (2014).
Article PubMed Google Scholar
Epp, L. S. et al. Lake sediment multi-taxon DNA from North Greenland records early post-glacial appearance of vascular plants and accurately tracks environmental changes. Quat. Sci. Rev. 117, 152–163 (2015).
Article ADS Google Scholar
Nitsch, E. K., Charles, M. & Bogaard, A. Calculating a statistically robust δ13C and δ15N offset for charred cereal and pulse seeds. Sci. Technol. Archaeol. Res. 1, 1–8 (2015).
Google Scholar
Wales, N., Andersen, K., Cappellini, E., Avila-Arcos, M. C. & Gilbert, M. T. P. Optimization of DNA recovery and amplification from non-carbonized archaeobotanical remains. PLoS One 9, e86827 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Gamba, C. et al. Comparing the performance of three ancient DNA extraction methods for high-throughput sequencing. Mol. Ecol. Resour. 16, 459–469 (2016).
Article CAS PubMed Google Scholar
Dabney, J. et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl. Acad. Sci. 110, 15758–15763 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Adams, C. T., Poaps, S. L. & Huntley, J. P. In Being an Islander (ed. Barrett, J. H. ) 161–197 (McDonald Institute for Archaeological Research, 2012).
Barrett, J. et al. In Kaupang in Skiringssal. Kaupang Excavation Project Publication Series 283–319 (Aarhus University Press, 2007).
Wales, N., Romero-Navarro, J. A., Cappellini, E. & Gilbert, M. T. P. Choosing the Best Plant for the Job: A Cost-Effective Assay to Prescreen Ancient Plant Remains Destined for Shotgun Sequencing. PLoS One 7, e45644 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Gansauge, M.-T. & Meyer, M. Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA. Nat. Protoc. 8, 737–748 (2013).
Article PubMed CAS Google Scholar
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 5 (2010).
Schubert, M., Lindgreen, S. & Orlando, L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes 9, 88 (2016).
Article PubMed PubMed Central Google Scholar
Schubert, M. et al. Improving ancient DNA read mapping against modern reference genomes. BMC Genomics 13, 178 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ginolhac, A., Rasmussen, M., Gilbert, M. T. P., Willerslev, E. & Orlando, L. mapDamage: Testing for damage patterns in ancient DNA sequences. Bioinformatics 27, 2153–2155 (2011).
Article CAS PubMed Google Scholar
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. & Orlando, L. MapDamage2.0: Fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013).
Article PubMed PubMed Central CAS Google Scholar
Minoche, A. E., Dohm, J. C. & Himmelbauer, H. Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems. Genome Biol. 12, R112 (2011).
Article CAS PubMed PubMed Central Google Scholar
Schmieder, R. & Edwards, R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27, 863–864 (2011).
Article CAS PubMed PubMed Central Google Scholar
Huson, D., Mitra, S. & Ruscheweyh, H. Integrative analysis of environmental sequences using MEGAN4. Genome Res. 21, 1552–1560 (2011).
Article CAS PubMed PubMed Central Google Scholar
Smit, A., Hubley, R. & Green, P. RepeatMasker Open-3.0. at http://www.repeatmasker.org (2015).

Download references

Acknowledgements

We thank James Barrett, Rachel Ballantyne, Dagfinn Skre, Kerstin Griffin, Anneleen Kool (barley), Doga Karakaya (grape), Christopher Morehart, Karl Laumbach, Linda Cordell, Maxine McBrinn, Bruce Smith (maize), Dorian Fuller, Cristina Castillo, Nicole Boivin and Alison Crowther (rice) for the provision of archaeological material and sampling assistance. Thanks to Terry Brown and Sandra Kennedy for the data from Bunning et al. We further thank Matti Leino for advice on barley target enrichment design; Lisbeth Thorbek, Agata Gondek and Anne Kathrine Wiborg Runge for technical assistance in the laboratory; Jazmín Ramos Madrigal for advice on the damage analyses. Alison Devault from MYcroarray for advice regarding target enrichment; Tom Gilbert, Robin Allaby and Logan Kistler for support and feedback. This work was supported by the Research Council of Norway (grant no. 230821/F20 to SB), the Danish Council for Independent Research (10-081390), the Danish National Research Foundation (DNRF94), and the Natural Environment Research Council (UK; NE/L006847/1). Computational analyses were done on the Abel computing cluster owned by the UIO and the Norwegian Metacenter for Computational Science (NOTUR) and the Warwick Archaeogenomics Group dedicated server.

Author information

Authors and Affiliations

Department of Biosciences, Centre for Ecological and Evolutionary Synthesis, University of Oslo, P.O. Box 1066, Blindern, NO-0316, Oslo, Norway
H. M. Nistelberger, B. Star & S. Boessenkool
School of Life Sciences, Gibbet Hill Campus, University of Warwick, Coventry, CV4 7AL, United Kingdom
O. Smith
Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Øster Voldgade 5-7, Copenhagen, 1350, Denmark
N. Wales

Authors

H. M. Nistelberger
View author publications
You can also search for this author in PubMed Google Scholar
O. Smith
View author publications
You can also search for this author in PubMed Google Scholar
N. Wales
View author publications
You can also search for this author in PubMed Google Scholar
B. Star
View author publications
You can also search for this author in PubMed Google Scholar
S. Boessenkool
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.N., O.S., N.W. and S.B. contributed to the project design and laboratory work. O.S., N.W., B.S. and S.B. conducted analyses. All authors contributed to the interpretation of the data and H.N. led the writing.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Nistelberger, H., Smith, O., Wales, N. et al. The efficacy of high-throughput sequencing and target enrichment on charred archaeobotanical remains. Sci Rep 6, 37347 (2016). https://doi.org/10.1038/srep37347

Download citation

Received: 25 May 2016
Accepted: 25 October 2016
Published: 24 November 2016
DOI: https://doi.org/10.1038/srep37347

This article is cited by

Accurate classification of fresh and charred grape seeds to the varietal level, using machine learning based classification method
- Vlad Landa
- Yekaterina Shapira
- Elyashiv Drori
Scientific Reports (2021)
Tracking the history of grapevine cultivation in Georgia by combining geometric morphometrics and ancient DNA
- Laurent Bouby
- Nathan Wales
- David Maghradze
Vegetation History and Archaeobotany (2021)
Classification of archaic rice grains excavated at the Mojiaoshan site within the Liangzhu site complex reveals an Indica and Japonica chloroplast complex
- Katsunori Tanaka
- Chunfang Zhao
- Cailin Wang
Food Production, Processing and Nutrition (2020)
Paleogenomics: reconstruction of plant evolutionary trajectories from modern and ancient DNA
- Caroline Pont
- Stefanie Wagner
- Jerome Salse
Genome Biology (2019)
Ancient DNA (aDNA) extraction and amplification from 3500-year-old charred economic crop seeds from Kaymakçı in Western Turkey: comparative sequence analysis using the 26S rDNA gene
- Asiye Ciftci
- Funda O. Değirmenci
- Zeki Kaya
Genetic Resources and Crop Evolution (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Read characteristics

Reads mapped to the reference genomes

aDNA damage patterns and sample bleeding

Metagenomic analysis

PIA filtering

Discussion

Conclusion

Materials and Methods

Samples

Barley

Grape

Maize

Rice

DNA extractions

Library Preparation

Target Enrichment

Sequencing

Data Filtering and Analysis

MEGAN and PIA

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links