Contribution of historical herbarium small RNAs to the reconstruction of a cassava mosaic geminivirus evolutionary history

Rieux, Adrien; Campos, Paola; Duvermy, Arnaud; Scussel, Sarah; Martin, Darren; Gaudeul, Myriam; Lefeuvre, Pierre; Becker, Nathalie; Lett, Jean-Michel

doi:10.1038/s41598-021-00518-w

Download PDF

Article
Open access
Published: 28 October 2021

Contribution of historical herbarium small RNAs to the reconstruction of a cassava mosaic geminivirus evolutionary history

Adrien Rieux¹,
Paola Campos^1,2,
Arnaud Duvermy¹,
Sarah Scussel¹,
Darren Martin³,
Myriam Gaudeul^2,4,
Pierre Lefeuvre¹,
Nathalie Becker² &
…
Jean-Michel Lett¹

Scientific Reports volume 11, Article number: 21280 (2021) Cite this article

1893 Accesses
6 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Emerging viral diseases of plants are recognised as a growing threat to global food security. However, little is known about the evolutionary processes and ecological factors underlying the emergence and success of viruses that have caused past epidemics. With technological advances in the field of ancient genomics, it is now possible to sequence historical genomes to provide a better understanding of viral plant disease emergence and pathogen evolutionary history. In this context, herbarium specimens represent a valuable source of dated and preserved material. We report here the first historical genome of a crop pathogen DNA virus, a 90-year-old African cassava mosaic virus (ACMV), reconstructed from small RNA sequences bearing hallmarks of small interfering RNAs. Relative to tip-calibrated dating inferences using only modern data, those performed with the historical genome yielded both molecular evolution rate estimates that were significantly lower, and lineage divergence times that were significantly older. Crucially, divergence times estimated without the historical genome appeared in discordance with both historical disease reports and the existence of the historical genome itself. In conclusion, our study reports an updated time-frame for the history and evolution of ACMV and illustrates how the study of crop viral diseases could benefit from natural history collections.

Herbarium specimen sequencing allows precise dating of Xanthomonas citri pv. citri diversification history

Article Open access 20 July 2023

A taxonomic monograph of Ipomoea integrated across phylogenetic scales

Article 11 November 2019

The genomic basis of the plant island syndrome in Darwin’s giant daisies

Article Open access 28 June 2022

Introduction

Crop pests and diseases have plagued farmers since the dawn of agriculture¹. Today they continue to be major threats to agro-ecosystems worldwide, significantly reducing yields, incurring economic losses and threatening food security^2,3. Amongst the different taxonomic groups of crop pathogens, viruses account for almost half of emerging infectious diseases⁴ and, as such, are a major focus of ongoing scientific investigation⁵.

The effective management of infectious viral crop diseases requires understanding the factors underlying virus emergence, adaptation and spread⁶. Elucidating the history of a pathogen’s emergence is a prerequisite to inferring the evolutionary, ecological and anthropogenic factors that have driven the past epidemiological dynamics of the pathogen; inferences which could in turn be used to design efficient future disease control strategies⁷. As sequencing technologies have become more accessible, pathogen genomic analyses have played an increasingly important role in infectious disease research⁸. Concomitantly, recent methodological developments in molecular phylogeography can now be applied to study the emergence and evolution of viral pathogens in space and time with an unprecedented degree of accuracy⁹. Examples of such recent inferences performed on field-sampled viruses include the reconstruction of the spread and evolution of tomato yellow leaf curl virus (TYLCV) ¹⁰, maize streak virus (MSV)¹¹ or rice yellow mottle virus (RYMV)^12,13. Interestingly, analyses of ancient DNA and RNA virus genomic sequence data obtained from herbaria or archaeological material have demonstrated that historical samples can be leveraged to substantially improve phylogenetic based molecular dating studies^14,15,16. By countering the molecular-clock calibration biases that occur when using modern genomes to infer ancient lineage divergence times, the addition of ancient genomes with known sampling dates commonly yields estimates of viral lineage divergence times that are older and more in accordance with historical reports than when the ancient sequences are not included in molecular dating studies^17,18. In this context, the oldest historical crop-associated virus genome published to date is a member of the Chrysovirus genus isolated from a 1,000 year old maize cob¹⁹.

High throughput sequencing (HTS) and bioinformatic analyses have already contributed to a paradigm shift in the fields of virus discovery and diagnosis^20,21,22. Among various possible targets, such as virion-associated nucleic acids, double-stranded RNAs, total RNAs, ribosomal-RNA-depleted RNAs or messenger RNAs, the sequencing of small RNAs (sRNA) offers several advantages²³. First, since plant viruses are targeted by host silencing mechanisms, the sequencing of small interfering RNAs (siRNAs) should enable the identification of all types of plant viral agents, whatever the nature or structure of their genomes (DNA or RNA, single or double stranded). In this context, the pioneering work of Kreuze et al.²⁴ demonstrated the universal power of targeting, sequencing and analysing sRNAs for the comprehensive reconstruction of viral genomes from fresh material of cultivated and non-cultivated plants (as reviewed in²⁵). Moreover, viral sRNAs were reported as more stable than long RNA and DNA molecules, and proved to be suitable for deep sequencing, including paleovirology applications for several plant RNA phytoviruses^17,26. As an illustration, Smith et al.¹⁷ have reported the identification and reconstruction of an ancient isolate of barley stripe mosaic virus (Genus Hordeivirus, family Virgaviridae) by sequencing sRNAs extracted from 700 years-old barley seeds, with 99.4% of the contemporary virus reference genome being covered by sRNA contigs. In a recent study reconstructing RNA phytovirus genomes, a detailed characterisation (using size distribution and coverage data) underscored the preservation of siRNAs among viral sRNAs from dried, modern samples, yet to be shown from historical samples²⁷.

Cassava cultivation is associated with a wide range of diseases that seriously undermine the food and economic security in sub-Saharan African countries, the most notable of which is cassava mosaic disease (CMD), caused by a complex of cassava mosaic geminiviruses (CMGs, genus Begomovirus, family Geminiviridae)²⁸. CMD is currently the most damaging plant virus disease in the world (estimates of US$1.9–2.7 billion annual loss) and was associated with an East African famine in the late 1990s that likely caused the deaths of thousands of people²⁹. As an expanding global threat, CMD is currently under surveillance in Southeast Asia since its first description in Eastern Cambodia in 2016^30,31. CMGs are transmitted by whiteflies of the Bemisia tabaci species complex or by the use of infected cuttings (for review see²⁸). In sub-Saharan Africa cassava growing areas, several native species of the B. tabaci species complex referred as sub-Saharan African species (SSA) have been reported as the prevalent whiteflies associated with the spread of viruses that cause cassava mosaic disease (CMD)³². However, several cassava surveys suggest that the use of infected cassava cuttings for the establishment of new plantations appears to be largely responsible for the high incidence of CMD in sub-Saharan Africa^33,34. CMGs possess bipartite genomes, with genome components, called DNA-A and DNA-B, comprising 2.7 kb circular single-stranded DNA molecules. Both components are necessary for successful infection of cassava. While DNA-A encodes proteins and regulatory elements responsible for replication, encapsidation functions and the control of gene expression, DNA-B encodes proteins enabling viral movement³⁵. In plant cells infected by geminiviruses, bidirectional read-through transcription of the circular viral dsDNA generates sense and antisense transcripts²⁶. These dsRNA overlapping transcripts are processed by Dicer-like (DCL) family proteins from the RNA interference machinery, generating 21, 22 and 24 nt siRNAs and covering the entire circular virus genome (including coding sequences, as well as the intergenic region that contains the promoter^36,37).

Interestingly, whereas cassava originates from South America³⁸, the African CMGs are endemic to Africa and are likely recent descendants of geminiviruses adapted to infect indigenous uncultivated African plant species³⁹. Therefore the adaptation of CMGs to cassava could have only started, either after cassava was introduced to West Africa in the Gulf of Guinea during the 16th century, or after it was introduced to East Africa and the South West Indian Ocean islands in the 18th century. Since the initial characterisation in the early 1980s of the first known CMG species, African cassava mosaic virus (ACMV), several others have been described in sub-Saharan Africa, surrounding islands and the Indian subcontinent⁴⁰. The distribution of ACMV on the African continent has enabled the use of phylogeographic studies to investigate its evolutionary and epidemiological dynamics. Based on time-scaled phylogeographic analyses of modern ACMV isolates sampled between 1982 and 2012, it has been inferred that ACMV-driven CMD began disseminating in the 1980s only, with a single discreet movement of the virus from East Africa to Madagascar between 1996 and 2003⁴¹.

Here we report the genome of a 90-year-old ACMV isolate reconstructed from sRNAs, characteristic of bona fide siRNAs and whose damage patterns prove its authenticity. Using tip-calibrated phylogenetic inferences, we estimate both rates of molecular evolution and divergence times, underscoring the contribution of the historical genome in this calculation. Finally, we demonstrate how this single genome significantly improves our understanding of the history of ACMV in Africa.

Results and discussion

Nucleic acids isolation and high-throughput sequencing

From a small leaf fragment of a Manihot glaziovii (cassava) herbarium leaf specimen (Fig. 1) collected in the Central African Republic in 1928 and displaying typical symptoms of CMD, 185 ng of total DNA and 215 ng of total RNA were carefully extracted in a bleach-cleaned hospital laboratory with no prior exposure to plant material. Our first attempt to amplify and sequence viral DNA following Rolling Circle Amplification (RCA) failed (data not shown), likely due to substantial fragmentation of DNA, as previously described for herbarium specimens of similar age⁴² . Hence, based on the pioneering work of Kreuze et al.²⁴ and Smith et al.¹⁷, we decided to target sRNAs. After library construction, high throughput sequencing of the sRNA fraction on an Illumina Hi-Seq High Output platform generated 8.6 M single-end reads with a base call accuracy of 99.90 to 99.96%. Following adaptor trimming and quality checking, reads ranging from 18 to 26 nt in length were selected for further analyses (Fig. 2a).

Detection of genuine historical ACMV in herbarium cassava specimen

The analysis of sRNA reads with VirusDetect software revealed the presence of both DNA-A and DNA-B ACMV segments within the historical cassava specimen, with one contig covering 99.3% of the reference DNA-A sequence and fourteen contigs covering 88.7 % of the reference DNA-B sequence (Figure S1). We hence attempted to PCR amplify ACMV-specific DNA but no amplicons were successfully generated (data not shown). This result further highlights the promising potential underlying sRNAs sequencing to reconstruct historical viral pathogen genomes, as compared to classical approaches targeting DNA. Both DNA-A and DNA-B contigs harboured all eight typical open reading frames (ORFs) and inverted repeats (IRs) described for bipartite cassava geminiviruses (as depicted in⁴³). No other viruses were detected by VirusDetect from this sample. Running BWA-aln, a dedicated tool optimised for small read mapping, 1.45% of reads aligned to ACMV reference sequences and 87.55% of reads mapped to the M.glaziovii (cassava) reference sequence (Fig. 2b). Interestingly, among the 18–26 nt sRNAs mapping to ACMV or cassava, a predominance of ACMV-mapping sRNAs was observed for 21 and 22 nt sRNAs (Fig. 2a). These viral sRNAs may represent siRNAs, among the 21, 22 and 24 nt siRNA size classes described for geminiviruses²⁵.

To authenticate the historical nature of the ACMV siRNA reads and rule out the possibility of them being derived from lab contaminations, they were assessed for the presence of postmortem RNA damage. We found a clear pattern of C to U deamination reaching maximum values (±4%) at read extremities and declining exponentially inwards (Fig. 3C), as expected and previously shown for historical RNA^17,44. In addition, the examined modern control displayed no such pattern. We found no difference in deamination patterns between DNA-A and DNA-B segments (not shown). The historical consensus sequences of ACMV DNA-A and DNA-B were reconstructed and covered 97.2% and 82.7% of the reference sequences (at 1X-fold) with a mean depth of 787.8 and 21.7 fold, respectively (Fig. 3A, B). Importantly, our mapping strategy aiming to reconstruct ACMV DNA-A and DNA-B consensus sequences was shown robust to (i) the choice of the short-read aligner, (ii) the presence of shared genomic regions between DNA-A & DNA-B segments and (iii) the choice of the reference sequences (Figure S2). The difference in sequencing depth between DNA-A and DNA-B could be explained by a difference in the abundance of these components in the plant tissues, and/or by higher host plant's RNAi-based antiviral defences targeting the DNA-A. In line with this latter observation, analyses of siRNA in ACMV-infected plants (Nicotiana benthiamana and cassava)^36,43 showed a majority of siRNAs derived from the DNA-A component. A more detailed analysis of sRNA read coverage (Figure S3) locates a hotspot on ACMV-A, corresponding to overlapping transcripts coding for AC1, AC2 and AC3, consistent with previous siRNA analyses derived from ACMV^36,43.

Recent large-scale surveys have revealed pervasiveness of transcriptionally active endogenous geminiviral sequences (EGSs) in several plant genomes^45,46. The hypothetical presence of sRNAs deriving from EGVs and their use in our analyses could potentially impact the reconstruction of our ancient viral sequences. However, for all the arguments developed below, we are convinced that the sRNAs sequenced in this study are from episomal viral DNA rather than EGS. First, to date only small portions of endogenous geminiviral sequences were proposed to be transcribed (homologous to ren and rep genes^45,46) while we were able to reconstruct a nearly complete ACMV genome from sRNA sequences. Second, in their recent study, Sharma et al.⁴⁶ did not find any trace of EGSs within the genome of Manihot esculenta. In this work, we analysed the currently publicly available genomic resources of Manihot glaziovii⁴⁷. Importantly, of the contigs that displayed similarities with geminiviruses (of length ranging between 163 and 2929 nt), all the hits covered 99 to 100% of the contigs. No chimeric contigs (containing both viruses and cassava sequences, that would indicate the presence of EGS), were detected (Table S3). This observation suggests that the analysed M. glaziovii genomes were generated from plants contaminated with episomal geminiviruses. Third, the herbarium specimen analysed displayed typical symptoms of Cassava Mosaic Disease. Although symptoms promoted by integrated viral sequences are theoretically possible, they wouldn't be expected for endogenous virus sequences, whose partial integration is unlikely to promote any infection⁴⁶, even for the longest integrated EGSs described so far⁴⁵. In addition, geminiviral endogenous elements have not so far been reported to give rise to episomal viruses^25,48. Finally, our reconstructed ACMV genomes showed a very high pairwise genetic identity (>99%) with their modern counterparts, a value that we would predict to be smaller in case of non-functional geminivirus sequences integrated in plant genomes for long periods⁴⁹.

Phylogenetic inferences and dating using both historical and modern sequences

In order to investigate the phylogenetic relationship of our historical sequences to those already available from recent samples, we built nucleotide alignments of our historical genome along with 134 and 99 public modern African ACMV DNA-A and DNA-B sequences, respectively. The historical and modern sequences displayed an average nucleotide divergence of 2.3% for DNA-A and 2.9% for DNA-B. Two recombinant events were detected in the ACMV sequences analysed in this study (Table S1). Recombinant ACMV regions (positions 631-781 & 1901-1933 relative to AY211884 sequence for ACMV DNA-A) were identified with RDP4⁵⁰ and removed from further inferences to avoid the potentially confounding effects these could have on the accuracy of inferred phylogenies. Note that as a precaution, recombinant region 2 was removed from the analysis, despite being detected in the historical DNA-A sequence with a single method only. The 1081 and 850 non-recombining SNPs obtained for ACMV DNA-A and DNA-B respectively were used to build Maximum-Likelihood (ML) phylogenies, using a cassava mosaic Madagascar virus (CMMGV) isolate (belonging to another species of CMG) as outgroup (Figure S4). The resulting ML trees were globally well supported (most bootstrap values >0.7) and appeared to be geographically structured. Interestingly, the historical ACMV genome sampled in 1928 in the Central African Republic clustered within a clade composed of modern isolates from the same country in both the ACMV DNA-A and DNA-B trees.

In order to date the evolutionary history of ACMV, we used the ACMV DNA-A dataset, as the historical DNA-A sequence displayed a much higher depth and coverage than the ACMV DNA-B. As a prerequisite to perform tip-based calibration, we tested the presence of temporal signal in our tree with both a linear regression between sample ages and root-to-tip distance, and a date-randomisation test. Both statistical tests revealed the presence of a temporal signal (i.e. progressive accumulation of substitutions over time) within the ACMV DNA-A tree. The linear regression test displayed a significant positive slope (slope value= 0.00017, adjusted R² = 0.0136 with a p-value = 0.038) and the date-randomisation test of the inferred root age of the real versus date-randomised dataset showed no overlap (Fig. 4). Additionally, our results showed no evidence of confounding between temporal and genetic structures (Mantel test: r = 0.001, p-value = 0.481), suggesting that the temporal signal detected is reliable and robust⁵¹. We therefore built a time-calibrated tree with BEAST⁵², which was globally congruent with the ML tree (similar topology and node supports; Figure 4). As in the ML-tree, the historical ACMV DNA-A sequence clustered within a clade composed of modern isolates sampled in the Central African Republic. This observation emphasises the value of historical samples in improving our understanding of the epidemiology of crop pathogens⁵³. Indeed, our historical ACMV genome constitutes “fossil” evidence that CMD has occurred in the Central African Republic since at least 1928, consistently with the very first historical report of a disease resembling CMD that was made in this country in 1924⁵⁴.

We inferred that the most recent common ancestor (MRCA) of all the analysed African ACMV DNA-A isolates most likely existed in 1849 [95% HPD: 1810–1880], a date that predates by more than 100 years the estimate of 1980 [95% HPD: 1990–1975] obtained by De Bruyn et al.⁴¹. The earlier estimate of the ACMV MRCA is more consistent with historical descriptions of the disease. Indeed, the earliest report of CMD-like symptoms in Africa was made in 1894 in what is now Tanzania⁴⁰. Subsequent reports were made in the 1920s in relation to CMD epidemics in Sierra Leone, Ivory Coast, Ghana, Nigeria, Madagascar and Uganda⁴⁰. By the end of the 1930s, CMD was reported from virtually all cassava-growing regions of the African mainland and surrounding islands.

We estimated a mean ACMV DNA-A substitution rate of 1.27x10^-4 [95% HPD: 0.8 $\times $ 10⁻⁴–1.7 $\times $ 10⁻⁴] per site per year, with a standard deviation for the uncorrelated log-normal relaxed clock of 0.26 [95% HPD: 0.18–0.33], suggesting low substitution-rate heterogeneity amongst branches. This rate estimate is ~20 $\times $ and ~12.5 $\times $ lower than that the ones previously obtained using modern isolates only of ACMV⁴¹ and EACMV⁵⁵, respectively.

Although our reconstructed evolutionary history of ACMV appears broadly inconsistent with the latter study using only modern isolates, the two analyses are not directly comparable because of differences in dataset composition and other methodological choices. To specifically evaluate the contribution of the historical ACMV DNA-A sequence to ACMV DNA-A MRCA date and substitution rate estimates, we reanalysed our dataset after removing the historical sequence. As anticipated, this reanalysis under the exact same parameters still yielded significantly different results, while belonging to the same order of magnitude. Excluding the historical sequence yielded a five times higher substitution rate estimate (Fig. 5A). The standard deviation of substitution rates amongst branches for the uncorrelated log-normal relaxed clock did not change significantly from the analysis including the historical sequence (not shown). Excluding the historical sequence also yielded a significantly later estimate date for the MRCA of the analysed ACMV DNA-A sequences (1957 [95% HPD: 1934–1976], Fig. 5B). Similarly, the MRCA age for Malagasy island isolates (believed to have arisen from a single introduction) was estimated to 1936 [95% HPD: 1900 – 1964] and 1990 [95% HPD: 1983–1998] when including or excluding it, respectively (Fig. 5C).

The timeline of ACMV DNA-A evolution that we have inferred when including the historical sequence is likely to be more accurate than that determined without this sequence for two main reasons. First, this estimated timeline fits better with historical reports of CMD disease, dating back to 1894 in Africa and to the 1930s in Madagascar⁴⁰. Second, the 95% credibility intervals of the estimated date of the ACMV DNA-A MRCA that was inferred without the historical sequence excludes 1928 and it therefore cannot be reconciled with the fact that a sequence sampled in 1928 clusters within the ACMV tree (i.e. it is not an outgroup) (Figure S5). Such striking lower substitution rate and hence higher divergence time estimates, when including ancient viral genome sequences, have been previously described in molecular dating studies focusing on different virus group representatives: barley stripe mosaic virus (BSMV)¹⁷, Human immunodeficiency virus⁵⁶, hepatitis B virus⁵⁷, as well as parvovirus B19⁵⁸ (a ssDNA Baltimore group II virus to whom ACMV belongs), as recently reviewed in¹⁵.

In summary, our results illustrate that high-quality historical genomes of DNA viruses can be both reconstructed by sequencing the small RNA fraction of a plant herbarium specimen, harbouring siRNA characteristics and authenticated by analysing post-mortem RNA damage patterns. Such historical genomes represent “fossil” records of past viral diversity that have the potential to shed light on the spatiotemporal history of plant diseases. Indeed, our results demonstrate that CMD-causing ACMV variants were already present in the Central African Republic in 1928, supporting the accuracy of the description of a historical record of CMD made in 1924 from visual inspection of cassava leaves. Second, phylogenetic inferences performed with the inclusion of our historical ACMV DNA-A sequence significantly altered the inferred date at which the MRCA of all currently sampled ACMV variants likely existed, providing a better fit with historical reports than previous estimates and yielding a lower rate of ACMV DNA-A molecular evolution. Future studies including additional historical ACMV genome sequences that are more geographically/temporally dispersed will help us to refine the evolutionary parameters inferred herein. The presence of ACMV should also be tested in other herbarium plant species/families if one aims to investigate possible host-switching events that may have led to the emergence of CMD in cassava. More generally, similar investigations on other important viral crop pathogens will improve disease monitoring and sustainable control, while highlighting the importance of natural history collections.

Material and methods

Herbarium sampling

In 2014, the historical collection of cassava specimens of the National Herbarium of the Muséum National d'Histoire Naturelle, Paris (https://www.mnhn.fr/en) was searched for in 2014 for leaves displaying symptoms of CMD. Sample P04808771 (Fig. 1), a Manihot glaziovii specimen collected by C. Tisserant at Bambari, Central African Republic in 1928, displayed chlorotic mosaic and leaf distortion, two typical symptoms of CMD. A small leaf fragment (≈1cm² / 12mg of dry material) was excised from this specimen using a disinfected blade and gloves, sealed in a clean envelope, transported to Reunion Island and stored in a vacuum-sealed box at 14°C until use. Permission to sample and perform destructive analysis on historical specimen P04808771 was obtained from the Muséum national d'Histoire naturelle (Paris, France). Collection of any plant material used in this study complies with institutional, national, and international guidelines.

DNA extraction, amplification and sequencing

DNA isolation was performed in a bleach-cleaned molecular biology laboratory at the Centre Hospitalier Universitaire Sud Réunion that met the authenticity criteria for the extraction of ancient biomolecules⁵⁹: a laboratory in which no plant samples had been manipulated before. Total DNA was extracted from the herbarium sample following manufacturer’s instructions of the Qiagen DN easy plant kit. We attempted to detect both viral and ACMV specific DNA using the standard RCA-Cloning-Sanger sequencing protocol⁶⁰ and amplification of overlapping ACMV-specific PCR amplicons (ranging from 54 to 381nt), using validated primers (Harimalala, personal communication) listed in Table S2, respectively.

RNA extraction, library preparation, sequencing and quality control

RNA isolation was also performed at the Centre Hospitalier Universitaire Sud Réunion. Total RNA was extracted from the herbarium sample using a PureLink Plant RNA Reagent kit (Ambion) and quantified using an Agilent 2200 Tapestation system (Agilent, France). Purification of siRNA, library preparation and sequencing were carried out by Fasteris NGS service team in Geneva, Switzerland. Using polyacrylamide gel electrophoresis, fragments of 18-30nt long were selected and converted into sequencing library using the Illumina TruSeq Small RNA Library Preparation kit. Sequencing was performed in a 1×50 cycle mode on a HiSeq instrument. Adaptors were trimmed from raw reads using the Illuminaclip option in Trimmomatic 0.36⁶¹. Additional quality-trimming was performed using the same tool to remove low Illumina quality score-associated bases (SLIDINGWINDOW:5:20) and reads shorter than 15nt (MINLEN:15). Those of size 18-30nt were retained as clean reads.

Virus detection and taxonomic classification

To identify viruses from our historical sample, we first used VirusDetect⁶², a bioinformatic pipeline built to efficiently analyse large-scale small RNA (sRNA) datasets. We fixed all parameters to their default values and used the Sept 2019 GenBank reference virus genome database. In a second step, we used the dedicated short read aligner BWA-aln⁶³ (with the following optimised options fixed as in VirusDetect pipeline: -n 1 -o 1 -e 1 -i 0 -l 15 -k 1) to map quality-trimmed reads to both viral (i.e. the species detected by VirusDetect) and host plant (Manihot glaziovii specimen GIShi—SRA: SRS597345) reference genomes. Our reads-mapping strategy was further assessed for the three following aspects. First, we evaluated the performance of another short-read aligner, Bowtie⁶⁴, allowing one mismatch. Second, we compared the effect of mapping reads either independently or simultaneously to both ACMV DNA-A and DNA-B segments, in order to evaluate the influence of shared genomic regions. Finally, we assessed the effect of reference choice on mapping statistics and variant calling/filtering. To this aim, reads were mapped to three supplementary reference sequences (selected for their close, intermediate and distant phylogenetic proximity with the historical genome).

Historical viral genome authentication and reconstruction

We examined the sequences for cytosine deamination patterns—a typical proxy of postmortem RNA damage–to authenticate the historical nature of the siRNA ACMV sequences obtained. Distributions of C to U vs other transitions along the siRNA reads were assessed from raw untrimmed reads using the dedicated mapDamage2 tool⁶⁵. Postmortem RNA damage was compared between the historical specimen and RNA isolated from an ACMV infected Manihot esculenta leaf sample collected in Madagascar in 2017. The modern RNA sample was obtained using the exact same wet-lab protocol used to obtain RNA from the 1928 sample. Quality scores of post-mortem damaged bases were downscaled using the rescale parameter to correct for the effect of deamination and avoid generating artifactual SNPs in subsequent analyses. Historical ACMV DNA-A and DNA-B sequences were reconstructed from rescaled-BWA-aln generated BAM files for both DNA-A (JX658682) and DNA-B (KJ887590) GeneBank segment references. In brief, PCR duplicates were removed using picardtools 2.7.0 MarkDuplicates⁶⁶ and depth statistics were computed with the genomecov option of BEDTools 2.24.0⁶⁷, which were then graphically represented with CIRCOS 0.69.9⁶⁸. SNPs were called with GATK UnifiedGenotyper⁶⁹ and filtered out when their sequenced depth was <10 or their allelic frequency was < 0.6. Consensus historical sequences were then reconstructed by editing the reference DNA-A and DNA-B sequences with the remaining high-quality SNPs while replacing both filtered-out variants and unsequenced nucleotide sites (i.e. sites with a sequencing depth= 0) with “Ns”. Genes coding for AC3 and AC4 were deduced from other known ACMV sequences; all sequences were checked for open reading frame features.

In order to investigate the persistence of endogenous geminiviral sequences (EGSs) within Manihot glaziovii genomes, we downloaded raw reads of the two only available African M. glaziovii samples⁴⁷ at the date of search (01/08/2021) within the SRA database (SRR2847420 & SRR2847424). After de novo assembly of the reads into contigs with SPAdes V3.15.2⁷⁰ using default parameters, all reconstructed contigs were blasted (using BLASTN) on a custom-built database containing all described species of cassava mosaic geminiviruses. We predicted that the identification of chimeric contigs (composed of both cassava and virus sequences) would indicate the presence of EGSs. Instead, the detection of contigs displaying hits with virus sequences on their whole length would suggest plant infection by episomal viruses. Finally, the absence of any hits would reveal the absence of viral DNA, both from episomal and integrated forms, within M. glaziovii genomes.

Phylogenetic inferences using both historical and modern sequences

Alignments of our historical ACMV DNA-A and DNA-B components with 134 (for DNA-A) and 99 (for DNA-B) publicly available ACMV genome component sequences sampled between 1978 and 2014 (Table S4) were constructed with MAFFT⁷¹ for phylogenetic analyses. Each of these alignments also included a CMMGV sequence as an outgroup (accession number HE617299 and HE617300 for DNA-A and DNA-B, respectively). Regions acquired via recombination were identified with RDP4⁵⁰ with default settings. Events that were detected by three or more methods with P-values <0.05 were accepted as credible and removed to avoid the potentially biasing impacts of recombination on phylogenetic reconstruction. Note that the historical sequence was analysed with particular scrutiny and recombination events detected with a single method were taken into account. Maximum likelihood trees for each of these alignments were constructed using RAxML 8.2.4⁷² using a rapid bootstrap test and the GTR+G+I model of nucleotide substitution was chosen as best-fitted model based on the Bayesian Information Criterion (BIC) computed with JModelTest2.0⁷³.

The existence of a temporal signal in this dataset was investigated using two different tests. First, a linear regression was fitted between sample age and root-to-tip distance using the distRoot function of the adephylo R package⁷⁴. Temporal signal was considered present if a significant positive correlation was observed. Secondly, we performed a date-randomisation test (DRT)⁷⁵ with 20 independent date-randomised datasets using the R package, TipDatingBeast⁷⁶. Temporal signal was considered present when there was no overlap between the inferred root height 95% highest posterior density (95% HPD) of the initial dataset and that of 20 date-randomised datasets. Finally, we also investigated whether our dataset showed confounding effects between temporal and genetic structures using a Mantel confounding test which investigate whether closely related sequences were more likely to have been sampled at similar times. This additional test is important because both the root-to-tip regression and the DRT can be confounded in such a situation⁵¹.

Tip-dating was performed with BEAST 1.8.4⁵² considering a GTR substitution model with a Γ distribution and invariant sites (GTR+G+I) along with an uncorrelated log-normal relaxed (UCLNR) clock to account for minor variations between lineages. Bayes factors calculated from the marginal likelihoods using both path and stepping-stone sampling methods shown “very strong” support (BF>10⁷⁷) for UCLNR over strict (S) and random local (RL) clocks. To minimise prior assumptions about demographic history, an extended Bayesian skyline plot (EBSP) approach was adopted to integrate data over different coalescent histories⁷⁸. Three independent chains were run for 25 million steps and sampled every 2500 steps with a burn-in of the first 2500 steps. Convergence to the stationary distribution and sufficient sampling and mixing were checked by inspection of posterior samples (effective sample size >200) in Tracer 1.7.1⁷⁹. Parameter estimation was based on the samples combined from the different chains. The best-supported tree was estimated from the combined samples using the maximum clade credibility method implemented in TreeAnnotator. In order to specifically assess the effect of including our historical genome in the dating calibration, we computed the same inferences on a dataset where the 1928 DNA-A sequence was excluded (i.e. using only sequences sampled after 1977). Wilcoxon rank sum tests with continuity correction were performed to compare the means of the posterior estimates obtained from both datasets.

Data availability

Raw reads were deposited to the Sequence Read Archive (SRR13608699). Consensus historical genome reconstructed for ACMV DNA-A and DNA-B molecules have also been deposited on GenBank database (MW788219 & MW788220). The modern genomes used in this study have previously been published in the NCBI GenBank repository under accession numbers listed in Table S4.

References

Stukenbrock, E. H. & McDonald, B. A. The origins of plant pathogens in agro-ecosystems. Annu. Rev. Phytopathol. https://doi.org/10.1146/annurev.phyto.010708.154114 (2008).
Article PubMed Google Scholar
Savary, S., Ficke, A., Aubertot, J. N. & Hollier, C. Crop losses due to diseases and their implications for global food production losses and food security. Food Secur. https://doi.org/10.1007/s12571-012-0200-5 (2012).
Article Google Scholar
Strange, R. N. & Scott, P. R. Plant disease: a threat to global food security. Annu. Rev. Phytopathol. https://doi.org/10.1146/annurev.phyto.43.113004.133839 (2005).
Article PubMed Google Scholar
Anderson, P. K. et al. Emerging infectious diseases of plants: pathogen pollution, climate change and agrotechnology drivers. Trends Ecol. Evol. https://doi.org/10.1016/j.tree.2004.07.021 (2004).
Article PubMed Google Scholar
Scholthof, K. B. G. et al. Top 10 plant viruses in molecular plant pathology. Mol. Plant Pathol. https://doi.org/10.1111/j.1364-3703.2011.00752.x (2011).
Article PubMed PubMed Central Google Scholar
Stukenbrock, E. H. & Bataillon, T. A population genomics perspective on the emergence and adaptation of new plant pathogens in agro-ecosystems. PLoS Pathog. https://doi.org/10.1371/journal.ppat.1002893 (2012).
Article PubMed PubMed Central Google Scholar
Gilligan, C. A. Sustainable agriculture and plant diseases: an epidemiological perspective. Philos. Trans. R. Soc. B: Biol. Sci. https://doi.org/10.1098/rstb.2007.2181 (2008).
Article Google Scholar
Li, L. M., Grassly, N. C. & Fraser, C. Genomic analysis of emerging pathogens: methods, application and future trends. Genome Biol.ogy https://doi.org/10.1186/s13059-014-0541-9 (2014).
Article Google Scholar
Lemey, P., Rambaut, A., Drummond, A. J. & Suchard, M. A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. https://doi.org/10.1371/journal.pcbi.1000520 (2009).
Article MathSciNet PubMed PubMed Central Google Scholar
Lefeuvre, P. et al. The spread of tomato yellow leaf curl virus from the middle east to the world. PLoS Pathog. https://doi.org/10.1371/journal.ppat.1001164 (2010).
Article PubMed PubMed Central Google Scholar
Monjane, A. L. et al. Reconstructing the history of maize streak virus strain A dispersal tor reveal diversification hot spots and its origin in southern Africa. J. Virol. https://doi.org/10.1128/jvi.00640-11 (2011).
Article PubMed PubMed Central Google Scholar
Trovao, N. S. et al. Host ecology determines the dispersal patterns of a plant virus. Virus Evol. https://doi.org/10.1093/ve/vev016 (2015).
Article PubMed PubMed Central Google Scholar
Rakotomalala, M. et al. Comparing patterns and scales of plant virus phylogeography: rice yellow mottle virus in Madagascar and in continental Africa. Virus Evol. https://doi.org/10.1093/ve/vez023 (2019).
Article PubMed PubMed Central Google Scholar
Gibbs, A. J., Fargette, D., García-Arenal, F. & Gibbs, M. J. Time - The emerging dimension of plant virus studies. J General Virol. https://doi.org/10.1099/vir.0.015925-0 (2010).
Article Google Scholar
Simmonds, P., Aiewsakun, P. & Katzourakis, A. Prisoners of war: host adaptation and its constraints on virus evolution. Nat. Rev. Microbiol. https://doi.org/10.1038/s41579-018-0120-2 (2019).
Article PubMed Google Scholar
Jones, R. A. C., Boonham, N., Adams, I. P. & Fox, A. Historical virus isolate collections: an invaluable resource connecting plant virology’s pre-sequencing and post-sequencing eras. Plant Pathol. 70, 235–248 (2021).
Article Google Scholar
Smith, O. et al. A complete ancient RNA genome: Identification, reconstruction and evolutionary history of archaeological Barley Stripe Mosaic Virus. Sci. Rep. https://doi.org/10.1038/srep04003 (2014).
Article PubMed PubMed Central Google Scholar
Malmstrom, C. M., Shu, R., Linton, E. W., Newton, L. A. & Cook, M. A. Barley yellow dwarf viruses (BYDVs) preserved in herbarium specimens illuminate historical disease ecology of invasive and native grasses. J. Ecol. https://doi.org/10.1111/j.1365-2745.2007.01307.x (2007).
Article Google Scholar
Peyambari, M., Warner, S., Stoler, N., Rainer, D. & Roossinck, M. J. A 1000-Year-old RNA virus. J. Virol. 93, e01188-18 (2019).
Article CAS PubMed Google Scholar
Adams, I. P. et al. Next-generation sequencing and metagenomic analysis: a universal diagnostic tool in plant virology. Mol. Plant Pathol. https://doi.org/10.1111/j.1364-3703.2009.00545.x (2009).
Article PubMed PubMed Central Google Scholar
Vayssier-Taussat, M. et al. Shifting the paradigm from pathogens to pathobiome new concepts in the light of meta-omics. Front. Cell. Infect. Microbiol. https://doi.org/10.3389/fcimb.2014.00029 (2014).
Article PubMed PubMed Central Google Scholar
Massart, S., Olmos, A., Jijakli, H. & Candresse, T. Current impact and future directions of high throughput sequencing in plant virus diagnostics. Virus Res. https://doi.org/10.1016/j.virusres.2014.03.029 (2014).
Article PubMed Google Scholar
Roossinck, M. J., Martin, D. P. & Roumagnac, P. Plant virus metagenomics: advances in virus discovery. Phytopathology https://doi.org/10.1094/PHYTO-12-14-0356-RVW (2015).
Article PubMed Google Scholar
Kreuze, J. F. et al. Complete viral genome sequence and discovery of novel viruses by deep sequencing of small RNAs: a generic method for diagnosis, discovery and sequencing of viruses. Virology https://doi.org/10.1016/j.virol.2009.03.024 (2009).
Article PubMed Google Scholar
Pooggin, M. M. Small RNA-omics for plant virus identification, virome reconstruction, and antiviral defense characterization. Front. Microbiol. https://doi.org/10.3389/fmicb.2018.02779 (2018).
Article PubMed PubMed Central Google Scholar
Hartung, J. S. et al. History and diversity of Citrus Leprosis virus recorded in herbarium specimens. Phytopathology https://doi.org/10.1094/PHYTO-03-15-0064-R (2015).
Article PubMed Google Scholar
Golyaev, V., Candresse, T., Rabenstein, F. & Pooggin, M. M. Plant virome reconstruction and antiviral RNAi characterization by deep sequencing of small RNAs from dried leaves. Sci. Rep. https://doi.org/10.1038/s41598-019-55547-3 (2019).
Article PubMed PubMed Central Google Scholar
Patil, B. L. & Fauquet, C. M. Cassava mosaic geminiviruses: actual knowledge and perspectives. Mol. Plant Pathol. https://doi.org/10.1111/j.1364-3703.2009.00559.x (2009).
Article PubMed PubMed Central Google Scholar
Legg, J. P., Owor, B., Sseruwagi, P. & Ndunguru, J. Cassava mosaic virus disease in east and central Africa: epidemiology and management of a regional pandemic. Adv. Virus Res. https://doi.org/10.1016/S0065-3527(06)67010-3 (2006).
Article PubMed Google Scholar
Wang, H. L. et al. First report of Sri Lankan cassava mosaic virus infecting cassava in Cambodia. Plant Dis. https://doi.org/10.1094/PDIS-10-15-1228-PDN (2016).
Article PubMed Google Scholar
Minato, N. et al. Surveillance for sri lankan cassava mosaic virus (SLCMV) in Cambodia and Vietnam one year after its initial detection in a single plantation in 2015. PLoS One https://doi.org/10.1371/journal.pone.0212780 (2019).
Article PubMed PubMed Central Google Scholar
Mugerwa, H., Wang, H. L., Sseruwagi, P., Seal, S. & Colvin, J. Whole-genome single nucleotide polymorphism and mating compatibility studies reveal the presence of distinct species in sub-Saharan Africa Bemisia tabaci whiteflies. Insect Sci. https://doi.org/10.1111/1744-7917.12881 (2020).
Article PubMed Google Scholar
Ntawuruhunga, P. et al. Incidence and severity of cassava mosaic disease in the Republic of Congo. African Crop Sci. J. https://doi.org/10.4314/acsj.v15i1.54405 (2010).
Article Google Scholar
Zinga, I. et al. Epidemiological assessment of cassava mosaic disease in Central African Republic reveals the importance of mixed viral infection and poor health of plant cuttings. Crop Prot. https://doi.org/10.1016/j.cropro.2012.10.010 (2013).
Article Google Scholar
Jeske, H. Geminiviruses. Curr. Topics Microbiol. Immunol. https://doi.org/10.1007/978-3-540-70972-5_11 (2009).
Article Google Scholar
Vanitharani, R., Chellappan, P. & Fauquet, C. M. Geminiviruses and RNA silencing. Trends Plant Sci. https://doi.org/10.1016/j.tplants.2005.01.005 (2005).
Article PubMed Google Scholar
Aregger, M. et al. Primary and secondary siRNAs in geminivirus-induced gene silencing. PLoS Pathog. https://doi.org/10.1371/journal.ppat.1002941 (2012).
Article PubMed PubMed Central Google Scholar
Olsen, K. M. & Schaal, B. A. Evidence on the origin of cassava: Phylogeography of Manihot esculenta. Proc. Natl. Acad. Sci. USA https://doi.org/10.1073/pnas.96.10.5586 (1999).
Article PubMed PubMed Central Google Scholar
Fauquet, C. African cassava mosaic virus: etiology, epidemiology, and control. Plant Dis. https://doi.org/10.1094/pd-74-0404 (1990).
Article Google Scholar
Legg, J. P. & Fauquet, C. M. Cassava mosaic geminiviruses in Africa. Plant Mol. Biol. https://doi.org/10.1007/s11103-004-1651-7 (2004).
Article PubMed Google Scholar
De Bruyn, A. et al. Divergent evolutionary and epidemiological dynamics of cassava mosaic geminiviruses in Madagascar. BMC Evol. Biol. https://doi.org/10.1186/s12862-016-0749-2 (2016).
Article PubMed PubMed Central Google Scholar
Weiß, C. L. et al. Temporal patterns of damage and decay kinetics of dna retrieved from plant herbarium specimens. R. Soc. Open Sci. https://doi.org/10.1098/rsos.160239 (2016).
Article PubMed PubMed Central Google Scholar
Chellappan, P., Vanitharani, R., Ogbe, F. & Fauquet, C. M. Effect of temperature on geminivirus-induced RNA silencing in plants. Plant Physiol. https://doi.org/10.1104/pp.105.066563 (2005).
Article PubMed PubMed Central Google Scholar
Smith, O. & Gilbert, M. T. P. Ancient RNA. in (2018). doi:https://doi.org/10.1007/13836_2018_17.
Filloux, D. et al. The genomes of many yam species contain transcriptionally active endogenous geminiviral sequences that may be functionally expressed. Virus Evol. https://doi.org/10.1093/ve/vev002 (2015).
Article PubMed PubMed Central Google Scholar
Sharma, V. et al. Large-scale survey reveals pervasiveness and potential function of endogenous geminiviral sequences in plants. Virus Evol. https://doi.org/10.1093/ve/veaa071 (2020).
Article PubMed PubMed Central Google Scholar
Bredeson, J. V. et al. Sequencing wild and cultivated cassava and related species reveals extensive interspecific hybridization and genetic diversity. Nat. Biotechnol. https://doi.org/10.1038/nbt.3535 (2016).
Article PubMed Google Scholar
Serfraz, S. et al. Insertion of Badnaviral DNA in the Late Blight Resistance Gene (R1a) of Brinjal Eggplant (Solanum melongena). Front. Plant Sci. https://doi.org/10.3389/fpls.2021.683681 (2021).
Article PubMed PubMed Central Google Scholar
Lefeuvre, P. et al. Evolutionary time-scale of the begomoviruses: evidence from integrated sequences in the Nicotiana genome. PLoS One https://doi.org/10.1371/journal.pone.0019193 (2011).
Article PubMed PubMed Central Google Scholar
Martin, D. P., Murrell, B., Golden, M., Khoosal, A. & Muhire, B. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. https://doi.org/10.1093/ve/vev003 (2015).
Article PubMed PubMed Central Google Scholar
Murray, G. G. R. et al. The effect of genetic structure on molecular dating and tests for temporal signal. Methods Ecol. Evol. 7, 80–89 (2016).
Article PubMed Google Scholar
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. https://doi.org/10.1186/1471-2148-7-214 (2007).
Article PubMed PubMed Central Google Scholar
Yoshida, K. et al. Mining herbaria for plant pathogen genomes: back to the future. PLoS Pathog. https://doi.org/10.1371/journal.ppat.1004028 (2014).
Article PubMed PubMed Central Google Scholar
Dufrénoy, J. & Hédin, L. . La. Mosaïque des feuilles du Manioc au Cameroun. J. d’agriculture Tradit. Bot. appliquée 94, 361–365 (1929).
Google Scholar
Duffy, S. & Holmes, E. C. Validation of high rates of nucleotide substitution in geminiviruses: phylogenetic evidence from East African cassava mosaic viruses. J. Gen. Virol. 90, 1539–1547 (2009).
Article CAS PubMed PubMed Central Google Scholar
Worobey, M. et al. Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960. Nature https://doi.org/10.1038/nature07390 (2008).
Article PubMed PubMed Central Google Scholar
Mühlemann, B. et al. Ancient hepatitis B viruses from the Bronze Age to the Medieval period. Nature https://doi.org/10.1038/s41586-018-0097-z (2018).
Article PubMed Google Scholar
Toppinen, M. et al. Bones hold the key to DNA virus history and epidemiology. Sci. Rep. https://doi.org/10.1038/srep17226 (2015).
Article PubMed PubMed Central Google Scholar
Gilbert, M. T. P., Bandelt, H. J., Hofreiter, M. & Barnes, I. Assessing ancient DNA studies. Trends Ecol. Evol. https://doi.org/10.1016/j.tree.2005.07.005 (2005).
Article PubMed Google Scholar
Inoue-Nagata, A. K., Albuquerque, L. C., Rocha, W. B. & Nagata, T. A simple method for cloning the complete begomovirus genome using the bacteriophage φ29 DNA polymerase. J. Virol. Methods https://doi.org/10.1016/j.jviromet.2003.11.015 (2004).
Article PubMed Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics https://doi.org/10.1093/bioinformatics/btu170 (2014).
Article PubMed PubMed Central Google Scholar
Zheng, Y. et al. VirusDetect: An automated pipeline for efficient virus discovery using deep sequencing of small RNAs. Virology https://doi.org/10.1016/j.virol.2016.10.017 (2017).
Article PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics https://doi.org/10.1093/bioinformatics/btp324 (2009).
Article PubMed PubMed Central Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. https://doi.org/10.1186/gb-2009-10-3-r25 (2009).
Article PubMed PubMed Central Google Scholar
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. & Orlando, L. MapDamage2.0: Fast approximate Bayesian estimates of ancient DNA damage parameters. in Bioinformatics (2013). doi:https://doi.org/10.1093/bioinformatics/btt193.
Broad Institute. Picard Tools - By Broad Institute. Github (2009).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics https://doi.org/10.1093/bioinformatics/btq033 (2010).
Article PubMed PubMed Central Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. https://doi.org/10.1101/gr.092759.109 (2009).
Article PubMed PubMed Central Google Scholar
Depristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. https://doi.org/10.1038/ng.806 (2011).
Article PubMed PubMed Central Google Scholar
Bankevich, A. et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. https://doi.org/10.1089/cmb.2012.0021 (2012).
Article MathSciNet PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. https://doi.org/10.1093/molbev/mst010 (2013).
Article PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. JModelTest 2: More models, new heuristics and parallel computing. Nat. Methods https://doi.org/10.1038/nmeth.2109 (2012).
Article PubMed PubMed Central Google Scholar
Jombart, T. & Dray, S. Adephylo: Exploratory analyses for the phylogenetic comparative method. Bioinformatics (2010).
Duchêne, S., Duchêne, D., Holmes, E. C. & Ho, S. Y. W. The performance of the date-randomization test in phylogenetic analyses of time-structured virus data. Mol. Biol. Evol. 32, 1895–1906 (2015).
Article PubMed Google Scholar
Rieux, A. & Khatchikian, C. E. Tipdatingbeast: an r package to assist the implementation of phylogenetic tip-dating tests using beast. Mol. Ecol. Resour. https://doi.org/10.1111/1755-0998.12603 (2017).
Article PubMed Google Scholar
Raftery, A. E. Approximate Bayes factors and accounting for model uncertainty in generalised linear models. Biometrika https://doi.org/10.1093/biomet/83.2.251 (1996).
Article MathSciNet MATH Google Scholar
Ho, S. Y. W. & Shapiro, B. Skyline-plot methods for estimating demographic history from nucleotide sequences. Mol. Ecol. Resour. https://doi.org/10.1111/j.1755-0998.2011.02988.x (2011).
Article PubMed Google Scholar
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. (2018) doi:https://doi.org/10.1093/sysbio/syy032.

Download references

Acknowledgements

We thank the Herbarium of the Muséum national d'Histoire naturelle (Paris, France) for allowing us to sample and perform destructive analysis on the Manihot glaziovii historical specimen P04808771. Collection of any plant material used in this study complies with institutional, national, and international guidelines. This work was financially supported by l’Agence Nationale pour la Recherche (JCJC MUSEOBACT contrat ANR-17-CE35-0009-01), the European Regional Development Fund (ERDF contract GURDT I2016‐1731‐0006632), Région Réunion, the French Agropolis Foundation (Labex Agro – Montpellier, E-SPACE Project Number 1504-004, MUSEOVIR project number 1600-004), the SYNTHESYS Project http://www.synthesys.info/ (Grants GB-TAF-6437 and GB-TAF-7130) which is financed by European Community Research Infrastructure Action under the FP7 "Capacities" Program & CIRAD/AI-CRESI- 3/2016. PhD of P.C. was co-funded by ED 227, Museum national d'Histoire naturelle et Sorbonne Université, French Ministry of Higher Education, Research and Innovation, France. Computational work was performed on the CIRAD - UMR AGAP HPC data center of the south green bioinformatics platform (http://www.southgreen.fr/). This work was conducted on the Plant Protection Platform (3P, IBISA). The authors thank the Centre Hospitalier Sud Réunion and Dr Julien Jaubert for hosting us in their laboratory, Denis Filloux, Philippe Roumagnac, Mikhail Pooggin, François Balloux, Violaine Llaurens, Regis Debruyne for fruitful discussions during this study and Dr. James Legg for his assistance with the history of the cassava mosaic disease in Africa.

Author information

Authors and Affiliations

CIRAD, UMR PVBMT, 97410, St Pierre, La Réunion, France
Adrien Rieux, Paola Campos, Arnaud Duvermy, Sarah Scussel, Pierre Lefeuvre & Jean-Michel Lett
Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 57 Rue Cuvier, CP 50, 75005, Paris, France
Paola Campos, Myriam Gaudeul & Nathalie Becker
Computational Biology Division, Department of Integrative Biomedical Sciences, Institute of Infectious Diseases and Molecular Medicine, University of Cape Town, Observatory, Cape Town, South Africa
Darren Martin
Herbier national (P), Muséum national d’Histoire Naturelle, CP39, 57 Rue Cuvier, 75005, Paris, France
Myriam Gaudeul

Authors

Adrien Rieux
View author publications
You can also search for this author in PubMed Google Scholar
Paola Campos
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Duvermy
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Scussel
View author publications
You can also search for this author in PubMed Google Scholar
Darren Martin
View author publications
You can also search for this author in PubMed Google Scholar
Myriam Gaudeul
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Lefeuvre
View author publications
You can also search for this author in PubMed Google Scholar
Nathalie Becker
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Michel Lett
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

This project was globally led by J-M.L, N.B & A.R. M.G provided historical material and insights on herbarium specimen sampling. S.S performed the wetlab processing of the historic sample under the supervision of N.B, P.L & J-M.L. A.R, P.C, A.D, S.S, D.M, N.B & J-M.L analyzed the data and performed genetic analyses. A.R & J-M.L wrote the first draft and all authors contributed to the final version.

Corresponding authors

Correspondence to Adrien Rieux or Jean-Michel Lett.

Ethics declarations

Competing interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rieux, A., Campos, P., Duvermy, A. et al. Contribution of historical herbarium small RNAs to the reconstruction of a cassava mosaic geminivirus evolutionary history. Sci Rep 11, 21280 (2021). https://doi.org/10.1038/s41598-021-00518-w

Download citation

Received: 08 March 2021
Accepted: 13 October 2021
Published: 28 October 2021
DOI: https://doi.org/10.1038/s41598-021-00518-w

This article is cited by

Herbarium specimen sequencing allows precise dating of Xanthomonas citri pv. citri diversification history
- Paola E. Campos
- Olivier Pruvost
- Lionel Gagnevin
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.