Discovery and genetic characterization of diverse smacoviruses in Zambian non-human primates

Anindita, Paulina D.; Sasaki, Michihito; Gonzalez, Gabriel; Phongphaew, Wallaya; Carr, Michael; Hang’ombe, Bernard M.; Mweene, Aaron S.; Ito, Kimihito; Orba, Yasuko; Sawa, Hirofumi

doi:10.1038/s41598-019-41358-z

Download PDF

Article
Open access
Published: 08 April 2019

Discovery and genetic characterization of diverse smacoviruses in Zambian non-human primates

Paulina D. Anindita¹,
Michihito Sasaki¹,
Gabriel Gonzalez ORCID: orcid.org/0000-0002-2180-2120²,
Wallaya Phongphaew¹,
Michael Carr^3,4,
Bernard M. Hang’ombe^5,7,8,
Aaron S. Mweene^6,7,8,
Kimihito Ito^2,3,
Yasuko Orba¹ &
…
Hirofumi Sawa^1,3,7,8,9

Scientific Reports volume 9, Article number: 5045 (2019) Cite this article

1607 Accesses
7 Citations
4 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 06 June 2019

This article has been updated

Abstract

The Smacoviridae has recently been classified as a family of small circular single-stranded DNA viruses. An increasing number of smacovirus genomes have been identified exclusively in faecal matter of various vertebrate species and from insect body parts. However, the genetic diversity and host range of smacoviruses remains to be fully elucidated. Herein, we report the genetic characterization of eleven circular replication-associated protein (Rep) encoding single-stranded (CRESS) DNA viruses detected in the faeces of Zambian non-human primates. Based on pairwise genome-wide and amino acid identities with reference smacovirus species, ten of the identified CRESS DNA viruses are assigned to the genera Porprismacovirus and Huchismacovirus of the family Smacoviridae, which bidirectionally encode two major open reading frames (ORFs): Rep and capsid protein (CP) characteristic of a type IV genome organization. The remaining unclassified CRESS DNA virus was related to smacoviruses but possessed a genome harbouring a unidirectionally oriented CP and Rep, assigned as a type V genome organization. Moreover, phylogenetic and recombination analyses provided evidence for recombination events encompassing the 3′-end of the Rep ORF in the unclassified CRESS DNA virus. Our findings increase the knowledge of the known genetic diversity of smacoviruses and highlight African non-human primates as carrier animals.

Complexity of avian evolution revealed by family-level genomes

Article 01 April 2024

Josefin Stiller, Shaohong Feng, … Guojie Zhang

Mechanisms of SARS-CoV-2 entry into cells

Article 05 October 2021

Cody B. Jackson, Michael Farzan, … Hyeryun Choe

The evolutionary drivers and correlates of viral host jumps

Article Open access 25 March 2024

Cedric C. S. Tan, Lucy van Dorp & Francois Balloux

Introduction

Recent advances in high-throughput sequencing technologies have allowed metagenomic analyses to discover an ever-increasing genetic diversity of viral genomes from vertebrates, invertebrates, prokaryotic and environmental samples¹. Small circular replication-associated protein (Rep) encoding single-stranded (CRESS) DNA viruses have been discovered in a diversity of prokaryotes and eukaryotes². With the increasing diversity of CRESS DNA viruses, they have been classified into six viral families: Genomoviridae, Geminiviridae, Nanoviridae, Circoviridae, Bacilladnaviridae and Smacoviridae. The family Smacoviridae was recently assigned as a new viral family by the International Committee on Taxonomy of Viruses (ICTV)³, which is further classified into six genera. Smacoviruses have 2.3–2.9 kb genomes, containing two major open reading frames (ORFs), encoding Rep and the capsid protein (CP). Rep possesses DNA helicase activity and initiates viral replication by a rolling circle replication (RCR)⁴. In comparison with Rep, the CP ORF is more divergent among smacoviruses. The genetic diversity observed within smacoviruses might be due to high mutation rates⁵ and intra-familial recombination events in their genomes⁶.

Smacoviruses, previously known as “stool-associated circular viruses”, have been detected in faecal samples obtained from healthy and diarrheic animal species, including cattle⁷, sheep⁷, pigs^8,9, rats¹⁰, chickens¹¹, camels¹², non-human primates¹³ and humans^13,14,15, as well as insect species such as dragonflies¹⁶ and blow flies¹⁷ but not from environmental samples. Despite the lack of evidence for a direct causal relationship, smacoviruses were identified in the faecal virome derived from human patients with diarrhea in France as well as in central and south American children with unexplained gastrointestinal disease negative for known pathogens^13,15. It remains, however, to be established whether smacovirus infect human cells, causes overt disease or not in humans and animals.

In this study, sequence reads related to CRESS DNA virus genomes were initially discovered in faecal samples of Zambian NHPs through viral metagenomic analysis. We subsequently determined whole genome sequences of eleven CRESS DNA viruses and characterized ten of them as new smacovirus species. This study extends the known genetic diversity of smacoviruses and the species range of NHPs which harbour these ssDNA viruses.

Results

Identification of CRESS DNA viruses in Zambian NHP species

Fifty faecal samples from NHPs consisting of 25 malbroucks (Chlorocebus cynosuros) and 25 baboons (Papio spp.) in Zambia were suspended, pooled and subjected to metagenomic analysis. Among a total of 63,587,648 sequence reads generated, 1,381,545 reads were assigned to ssDNA viruses by BLASTx by comparison of the translated nucleotide sequences from the samples with the viral protein database¹⁸. Six contigs, ranging in length from 0.8–2.0 kb, related to members of Smacoviridae were generated by de novo assembly.

To examine the prevalence of the smacovirus-like genomes, six different pairs of primer sets were designed and used to screen fifty faecal samples from NHPs (Supplementary Table S1). Ten (20%) of the NHP faecal samples were positive for smacovirus-like genomes. Nine faecal samples were positive for a single smacovirus-like genome, whereas one malbrouck faecal sample (ZM09#96) harboured two different smacovirus-like genomes (Table 1). Speciation of NHPs was confirmed by sequencing of mitochondrial cytochrome b (cytb) (Table 1).

Table 1 Sample information and results of PCR screening.

Full size table

The complete, circular genomes from eleven smacovirus-like genomes were then amplified by inverse PCR, cloned into plasmid vectors and then sequenced bidirectionally by a primer walking strategy employing Sanger sequencing. As a result, we found that the complete circular genome sizes ranged from 2488 to 2766 nucleotides, which is within the known range of previously reported CRESS DNA virus genomes. BLASTx analysis showed that these CRESS DNA virus genomes were related to known viruses from the family Smacoviridae.

Classification and genome organization of Zambian NHP CRESS DNA viruses

The smacovirus-like CRESS DNA viruses from Zambian NHPs each contained two large ORFs which showed sequence similarity to CP and Rep of previously described smacoviruses. Following the CRESS DNA virus classification scheme proposed by Rosario et al.⁴, the genomes of ten CRESS DNA viruses belonged to the ambisense type IV organization whereas the genome of one CRESS DNA virus (isolate PkSmV1-ZM09–64) contained a unisense type V organization in which the CP and Rep ORFs were in the same orientation similar to the previously described porcine smacovirus-related CRESS DNA virus, PigSCV (JQ023166)^13,19 (Fig. 1). The predicted stem loop structures were located near the 3′-end of the Rep ORF with homology to the degenerate NAGTNTTAC nonanucleotide sequence motif which are also shared by all other reported smacoviruses⁶ (Table 2). This motif has been identified as the putative origin of RCR of smacoviruses during the replication cycle^7,13.

Table 2 Genome features of Zambian non-human primate CRESS DNA viruses.

Full size table

To further characterize the identified viruses as CRESS DNA viruses, we also searched for amino acid motifs within the encoded Rep that play important functional roles in viral genome replication. All Rep proteins of the identified CRESS DNA viruses harboured an RCR domain (motifs I, II and III) and a helicase super family 3 domain (Walker A, B and C) as illustrated in Table 2. Interestingly, a variation of amino acid residues within the RCR motifs I and II was found throughout the Rep proteins of the identified CRESS DNA viruses compared to the previously described consensus motifs⁷. Notably, ten of eleven CRESS DNA viruses encoded Rep proteins possessing 5 amino acid residues within RCR motif I whereas the Rep of isolate PkSmV1-ZM09-64 had 6 amino acid residues (Table 2). In addition, the Rep ORF of PkSmV1-ZM09-64 encoded a leucine residue at the beginning of the Walker B motif which is unseen in smacoviruses, which possess isoleucine, valine or tryptophan at the first residue (Table 2).

The pairwise nucleotide sequence identities were calculated to determine the genetic distances between the identified CRESS DNA virus genomes and previously described smacoviruses (Table 3). The isolate PkSmV1-ZM09-64, carrying a unisense type V genome, was not assigned to a virus family and excluded from this analysis due to the inversion of the replication-associated protein ORF (Fig. 1). All of the identified genomes except PkSmV1-ZM09-64 had <77% genome-wide pairwise nucleotide sequence identity. Based on the smacovirus species demarcation threshold of 77% genome-wide pairwise nucleotide identity⁶, we grouped these CRESS DNA viruses into five smacovirus species (species 2–6 in Table 3). Species 3 and 6 were only identified from C. cynosuros, whereas species 4 and 5 were identified from 2 different NHP species.

Table 3 Pairwise sequence identity among Zambian non-human primate CRESS DNA viruses and known smacoviruses.

Full size table

Pairwise amino acid sequence identity was calculated for the Rep proteins of all smacoviruses from Zambian NHPs (Zm-SmVs) and that of known smacoviruses (Table 3). Among the Zm-SmVs species, four species belonged to the genus Porprismacovirus while a single species was most closely related to the genus Huchismacovirus following the smacovirus genus demarcation threshold of 40% pairwise amino acid sequence identity of Rep⁶. PcSmV6-ZM09-72 and CcSmV6-ZM09-96 were assigned to the genus Porprismacovirus as they were classified as closely-related species to other Porprismacoviruses (CcSmV4-ZM09-95 and CcSmV5-ZM09-83, respectively).

Phylogenetic relationships among Zambian CRESS DNA viruses and known smacoviruses

Phylogenetic trees were constructed based on the analyses of the whole genome sequences (Fig. 2), and, separately, of the amino acid sequences of CP (Fig. 3) and Rep (Fig. 4) using a maximum likelihood (ML) estimation coupled with a Bayesian inference. The genome-wide phylogenetic tree revealed that ZM-SmVs, shown in red color in the tree, segregated into a cluster of previously reported smacoviruses (Fig. 2). The phylogenetic tree of Rep supported the genus assignment of the ZM-SmVs: PcSmV3-ZM09-74, PkSmV3-ZM09-76, PcSmV3-ZM09-51, CcSmV1-ZM09-86 and CcSmV1-ZM09-96 formed a cluster with members of Porprismacovirus and PcSmV2-ZM09-71 shared a common ancestor with members of Huchismacovirus (Fig. 4). CcSmV4-ZM09-95, CcSmV5-ZM09-83, PcSmV6-ZM09-72 and CcSmV6-ZM09-96 were distinct from known smacoviruses with low amino acid identities for their Rep proteins (Table 3).

Even though a clear and unambiguous congruence between the CP and Rep phylogenies was not apparent, PcSmV6-ZM09-72, CcSmV5-ZM09-83, CcSmV4-ZM09-95, and CcSmV6-ZM09-96 were clustered together in both CP and Rep trees, forming their own group (Figs 3 and 4) suggesting that they likely shared a common ancestor. PcSmV3-ZM09-51, PcSmV3-ZM09-74, and PkSmV3-ZM09-76 also consistently clustered together with porcine stool associated circular virus 1 (DP2, KJ577810) throughout the CP and Rep trees (Figs 3 and 4). In contrast, PkSmV1-ZM09-64 clustered together with CcSmV1-ZM09-86, CcSmV1-ZM09-96, human feces smacovirus 2 (SmaCV2, KT600068) and chimpanzee stool associated circular ssDNA virus (DP152, GQ351272) in the CP phylogeny (Fig. 3), whereas PkSmV1-ZM09-64 was located outside of the cluster formed by these smacoviruses in the Rep tree and was most closely related to a previously described bovine smacovirus (Fec59973, KT862223) and more distantly related to human and avian smacoviruses (Fig. 4). This phylogenetic incongruence revealed that the CP ORFs of PkSmV1-ZM09-64, CcSmV1-ZM09-86 and CcSmV1-ZM09-96 were derived from a common ancestor; however, the ancestor of the Rep gene of PkSmV1-ZM09-64 was different from that of CcSmV1-ZM09-86 and CcSmV1-ZM09-96. These findings suggested the possibility of recombination event(s) that may account for the discordant phylogenies. In addition, PcSmV2-ZM09-71 did not cluster with other ZM-SmVs throughout the constructed trees (Figs 2, 3 and 4). Taken together, these results indicated that all ZM-SmVs discovered in the study have distinct evolutionary histories and PkSmV1-ZM09-64 may have arisen from recombination.

Recombination analysis of smacovirus genomes

Detection of the phylogenetic incongruence between the CP and Rep phylogenies prompted us to investigate whether potential recombination sites existed in the ZM-SmV genomes. This recombination analysis revealed a region with multiple recombination breakpoints (i.e. a recombination hot spot) adjacent to the 3′-end of the Rep ORF (Fig. 5), which, interestingly, has also been inferred by another study¹³. These results corroborate prior studies indicating that smacoviruses increase their genetic diversity through recombination events. Two cold spots, where a recombination event is less likely to occur, were observed at the 5′-end of the CP ORF and the 3′-end of the Rep ORF. The presence of these cold spots implies functional conservation which is noteworthy in viruses as diverse as the ssDNA smacoviruses and suggests importance for these regions in the viral life cycle.

Discussion

In the present study, sub-genomic fragments of CRESS DNA virus were initially detected in the faeces of NHPs in Zambia by metagenomic analysis. The complete genomes of eleven CRESS DNA viruses were then subsequently recovered by inverse PCR in 20% of faecal samples indicative of a high prevalence in Zambian NHPs. Although, a single CRESS DNA virus was found in nine individuals, two were found in a single Zambian malbrouck suggestive of a co-infection event (Table 1), a necessary precondition for viral recombination and emergence of new strains. Based on the genome-wide pairwise sequence identity analysis and degree of sequence divergence, these newly identified viruses could be tentatively classified as novel smacovirus species and await formal classification by the ICTV⁶. ZM-SmVs from both malbroucks and baboons formed distinct clusters within the genus Porprismacovirus on the phylogenetic trees, suggesting that they evolved from different ancestral progenitors further exemplifying the extent of the viral diversity.

Despite the high prevalence in Zambian NHPs, the detected ZM-SmVs showed phylogenetic divergence and there was no evidence for spread of specific ZM-SmV strains in the species of monkey and baboon NHPs we studied, which raises the question whether ZM-SmVs infect and transmit among NHPs and potentially also other species. Detection of these viruses in the NHP faecal matter suggests at least two distinct hypotheses with respect to their origin. First is that they might productively infect the NHPs; however, smacoviruses have not been identified in animal tissues⁶. Second is that they may represent ssDNA viruses ultimately derived from plants, insects or mammals, which comprise the NHP diet or from a resident microorganism of the NHP gut^11,13,20,21. Indeed, a recent study has described high sequence similarities between smacoviral genomes and spacer sequences of a faecal archaeon, Candidatus Methanomassiliicoccus intestinalis, indicating a tropism of smacoviruses for archaea²². To date, and in common with a growing number of ssDNA and other uncultured viruses, isolates of infectious smacoviruses have not been reported. Taken together, the precise origins of the ZM-SmVs reported here remain to be established and further studies including attempts at isolation of smacoviruses are needed to characterize smacovirus infection in detail.

There was no clear congruence between the CP and Rep phylogenies for the identified CRESS DNA viruses. Specifically, PkSmV1-ZM09-64 showed clearly different phylogenetic relationships in both the CP and Rep trees. We also detected a potential recombination hot spot of breakpoints in the genome of smacovirus at the 3′-end of the Rep ORF providing further evidence of the importance of recombination events during the evolution of smacoviruses^13,23,24. A recent study has also reported that the Rep of these viruses is chimeric and likely derives from recombination events that lead to intra-host lineage diversification²⁴. Interestingly, the recombination analysis showed the breakpoint hot spot extended into the intergenic region between the CP and Rep ORFs. This observation has been seen in diverse ssRNA²⁵ and dsDNA viruses²⁶ and supposed the existence of “functionally interchangeable modules”, i.e. shuffling of the CP ORF by recombination may conceivably impact on virus tropism of recombinants. Our results are in agreement with the notion of recombination patterns including a mechanistic predisposition to recombination in virion-strand replication origin and recombination breakpoints which significantly tend to occur in intergenic regions or at 5′ and 3′ termini of genes rather than within the genes of ssDNA viruses^23,27. Recombination breakpoints are known to be disfavoured within coding regions, as observed in the CP. Therefore, genes in ssDNA viruses preferentially move as modules which contain >50% of the coding region and natural selection disfavours viruses harbouring recombinant proteins which leads to the observed nonrandom distribution of breakpoint observed²⁷. The modular genetic exchange by recombination within non-coding regions have also been previously implicated in the emergence of new viral strains^28,29. Whether these, or related phenomena, exist for recombinant smarcoviruses warrants further study.

PkSmV1-ZM09-64 showed a unisense genome organization of CP and Rep ORF similar to previously reported CRESS DNA virus, PigSCV (JQ023166) (Fig. 1)⁸. The precise reasons underlying this ORF organization by these CRESS DNA viruses remain unclear. It is possible that this genome organization may have arisen from errors during recombination event between ancestral viruses which led to a unidirectional ORF organization instead of the more common ambisense bidirectional genome organization evident in the majority of known smacoviruses.

In conclusion, our studies indicate the presence of previously unrecognized CRESS DNA viruses in the NHP virome and provide further evidence of the extent of the genetic diversity of DNA viruses in primates.

Materials and Methods

Ethical statement and sample collection

All animal experiments were approved by the then Zambia Wildlife Authority (ZAWA), now the Department of National Parks and Wildlife, Ministry of Tourism and Arts and performed in accordance with the relevant guidelines and regulations (certificate no. 2604). Tissue and faecal samples were collected from NHPs (Chlorocebus cynosuros, n = 25; Papio spp., n = 25) in Mfuwe District in 2009 and used for different research projects as reported previously^30,31. For NHP species typing, the mitochondrial cytb gene was amplified and sequenced from genomic DNA extracted from spleen tissues of the NHPs, as described elsewhere³¹.

Metagenomic analysis using high-throughput sequencing

Viral nucleic acids were extracted from the pooled faecal suspensions as described previously³², and double-stranded cDNA was synthesized with the PrimeScript Double Strand cDNA Synthesis kit (Takara BIO, Shiga, Japan). Sequencing libraries were prepared with Nextera XT DNA Sample Preparation kit (Illumina, San Diego, CA) and sequenced on the Illumina MiSeq platform (Illumina). The obtained reads were compared against NCBI NT/NR database as described previously³³. The sequence reads related to smacoviruses were de novo assembled to contigs using CLC Genomics Workbench software (CLC bio, Aarhus, Denmark).

PCR screening, whole genome sequencing, and genome annotation

Based on the nucleotide sequences of the generated contigs, six different pairs of primer sets were designed for PCR screening with GENETYX software (GENETYX, Tokyo, Japan) (Supplementary Table S1). DNA was extracted from faecal samples for each individual NHP with the High Pure Viral Nucleic Acid kit (Roche Diagnostics, Mannheim, Germany) and PCR screened for putative smacoviruses with Tks Gflex DNA Polymerase (TAKARA BIO). PCR products were sequenced and the sequences were used to design additional primers for the complete genome amplification of CRESS DNA viruses by inverse PCR. The amplicons were subsequently cloned into a pCR4-Blunt-TOPO vector (Invitrogen; Thermo Fischer Scientific, Waltham, MA) and sequenced by a primer walking strategy. The whole circular genome of each CRESS DNA virus was assembled with Phred and Phrap³⁴ with quality scores >30 in all assembled nucleotide positions and annotated using Geneious³⁵. The pairwise identity among sequences was calculated with Sequence Demarcation Tool (SDT) v1.2³⁶.

Phylogenetic analysis

The complete genome nucleotide sequences and predicted amino acid protein sequences were aligned with MAFFT using the algorithm FFT-NS-i³⁷. To infer the phylogenetic relation between sequenced and available samples maximum likelihood (ML) approaches with IQ-TREE v1.6.5³⁸ were used to determine the best substitution model, infer the topology and the branch support with a bootstrap of 1,000 repetitions. Additionally, Bayesian inference approaches with MrBayes v3.2.6³⁹ were used to search for the best substitution model and estimate the posterior probability of the inferred branches with chains of one million states. Three phylogenetic trees were inferred in this study for the whole genome nucleotide sequences (Fig. 2), and the amino acid sequences of CP (Fig. 3) and Rep (Fig. 4). The tree topology of the ML approach was used and annotated with the support of the posterior probability from the Bayesian approach.

Recombination analysis

The genome multiple sequence alignment was assessed for evidence of recombination events by the suite of methods in the recombination detection program (RDP) v.4.58^40,41. Detected recombination events required statistical support p < 0.01 and the distribution of recombination breakpoints were analyzed with a sliding window of 400 nucleotides and one nucleotide step, with 1,000 permutations for estimating the statistical support of the breakpoint distribution. To assess the effects of the recombination events on the phylogenetic relationships among sequences, a compatibility matrix was built²⁵, where the compatibility of two windows with 300 nucleotides from a sliding window and 100 nucleotides per step is defined as the normalized Robinson-Foulds distance⁴² between the corresponding neighbor-joining phylogenetic trees under Tamura-Nei substitution model. The compatibility reflects how similar are the inferred phylogenies for any two genome windows ranging between 0 (identical topologies) to 1 (completely dissimilar topologies).

Data Availability

The whole genomes of the identified viruses in this study were submitted to the GenBank/EMBL/DDBJ database under accession numbers of LC386195-LC386205.

Change history

06 June 2019
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has been fixed in the paper.

References

Simmonds, P. et al. Consensus statement: Virus taxonomy in the age of metagenomics. Nat Rev Microbiol 15, 161–168, https://doi.org/10.1038/nrmicro.2016.177 (2017).
Article CAS PubMed Google Scholar
Zhao, L., Rosario, K., Breitbart, M. & Duffy, S. Eukaryotic Circular Rep-Encoding Single-Stranded DNA (CRESS DNA) Viruses: Ubiquitous Viruses With Small Genomes and a Diverse Host Range. Adv Virus Res 103, 71–133, https://doi.org/10.1016/bs.aivir.2018.10.001 (2019).
Article PubMed Google Scholar
King, A. M. Q. et al. Changes to taxonomy and the International Code of Virus Classification and Nomenclature ratified by the International Committee on Taxonomy of Viruses (2018). Arch Virol 163, 2601–2631, https://doi.org/10.1007/s00705-018-3847-1 (2018).
Article CAS PubMed Google Scholar
Rosario, K., Duffy, S. & Breitbart, M. A field guide to eukaryotic circular single-stranded DNA viruses: insights gained from metagenomics. Arch Virol 157, 1851–1871, https://doi.org/10.1007/s00705-012-1391-y (2012).
Article CAS PubMed Google Scholar
Duffy, S., Shackelton, L. A. & Holmes, E. C. Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet 9, 267–276, https://doi.org/10.1038/nrg2323 (2008).
Article CAS PubMed Google Scholar
Varsani, A. & Krupovic, M. Smacoviridae: a new family of animal-associated single-stranded DNA viruses. Arch Virol 163, 2005–2015, https://doi.org/10.1007/s00705-018-3820-z (2018).
Article CAS PubMed Google Scholar
Steel, O. et al. Circular replication-associated protein encoding DNA viruses identified in the faecal matter of various animals in New Zealand. Infect Genet Evol 43, 151–164, https://doi.org/10.1016/j.meegid.2016.05.008 (2016).
Article CAS PubMed Google Scholar
Cheung, A. K. et al. A divergent clade of circular single-stranded DNA viruses from pig feces. Arch Virol 158, 2157–2162, https://doi.org/10.1007/s00705-013-1701-z (2013).
Article CAS PubMed PubMed Central Google Scholar
Cheung, A. K. et al. Identification of several clades of novel single-stranded circular DNA viruses with conserved stem-loop structures in pig feces. Arch Virol 160, 353–358, https://doi.org/10.1007/s00705-014-2234-9 (2015).
Article CAS PubMed Google Scholar
Sachsenröder, J. et al. Metagenomic identification of novel enteric viruses in urban wild rats and genome characterization of a group A rotavirus. J Gen Virol 95, 2734–2747, https://doi.org/10.1099/vir.0.070029-0 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lima, D. A. et al. Faecal virome of healthy chickens reveals a large diversity of the eukaryote viral community, including novel circular ssDNA viruses. J Gen Virol 98, 690–703, https://doi.org/10.1099/jgv.0.000711 (2017).
Article CAS PubMed Google Scholar
Woo, P. C. et al. Metagenomic analysis of viromes of dromedary camel fecal samples reveals large number and high diversity of circoviruses and picobirnaviruses. Virology 471–473, 117–125, https://doi.org/10.1016/j.virol.2014.09.020 (2014).
Article CAS PubMed Google Scholar
Ng, T. F. et al. A diverse group of small circular ssDNA viral genomes in human and non-human primate stools. Virus Evol 1, vev017, https://doi.org/10.1093/ve/vev017 (2015).
Article PubMed PubMed Central Google Scholar
Blinkova, O. et al. Novel circular DNA viruses in stool samples of wild-living chimpanzees. J Gen Virol 91, 74–86, https://doi.org/10.1099/vir.0.015446-0 (2010).
Article CAS PubMed PubMed Central Google Scholar
Phan, T. G. et al. Small circular single stranded DNA viral genomes in unexplained cases of human encephalitis, diarrhea, and in untreated sewage. Virology 482, 98–104, https://doi.org/10.1016/j.virol.2015.03.011 (2015).
Article CAS PubMed Google Scholar
Dayaram, A. et al. Identification of diverse circular single-stranded DNA viruses in adult dragonflies and damselflies (Insecta: Odonata) of Arizona and Oklahoma, USA. Infect Genet Evol 30, 278–287, https://doi.org/10.1016/j.meegid.2014.12.037 (2015).
Article CAS PubMed Google Scholar
Rosario, K. et al. Virus discovery in all three major lineages of terrestrial arthropods highlights the diversity of single-stranded DNA viruses associated with invertebrates. PeerJ 6, e5761, https://doi.org/10.7717/peerj.5761 (2018).
Article PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J Mol Biol 215, 403–410, https://doi.org/10.1016/S0022-2836(05)80360-2 (1990).
Article CAS PubMed Google Scholar
Sachsenröder, J. et al. Simultaneous identification of DNA and RNA viruses present in pig faeces using process-controlled deep sequencing. PLoS One 7, e34631, https://doi.org/10.1371/journal.pone.0034631 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Kim, H. K. et al. Identification of a novel single-stranded, circular DNA virus from bovine stool. J Gen Virol 93, 635–639, https://doi.org/10.1099/vir.0.037838-0 (2012).
Article CAS PubMed Google Scholar
Phan, T. G. et al. The fecal virome of South and Central American children with diarrhea includes small circular DNA viral genomes of unknown origin. Arch Virol 161, 959–966, https://doi.org/10.1007/s00705-016-2756-4 (2016).
Article CAS PubMed PubMed Central Google Scholar
Díez-Villaseñor, C. & Rodriguez-Valera, F. CRISPR analysis suggests that small circular single-stranded DNA smacoviruses infect Archaea instead of humans. Nat Commun 10, 294, https://doi.org/10.1038/s41467-018-08167-w (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Martin, D. P. et al. Recombination in eukaryotic single stranded DNA viruses. Viruses 3, 1699–1738, https://doi.org/10.3390/v3091699 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kazlauskas, D., Varsani, A. & Krupovic, M. Pervasive Chimerism in the Replication-Associated Proteins of Uncultured Single-Stranded DNA Viruses. Viruses 10, 187, https://doi.org/10.3390/v10040187 (2018).
Article CAS PubMed Central Google Scholar
Heath, L., van der Walt, E., Varsani, A. & Martin, D. P. Recombination patterns in aphthoviruses mirror those found in other picornaviruses. J Virol 80, 11827–11832, https://doi.org/10.1128/JVI.01100-06 (2006).
Article CAS PubMed PubMed Central Google Scholar
Carr, M. et al. Discovery of African bat polyomaviruses and infrequent recombination in the large T antigen in the Polyomaviridae. J Gen Virol 98, 726–738, https://doi.org/10.1099/jgv.0.000737 (2017).
Article CAS PubMed Google Scholar
Lefeuvre, P., Lett, J. M., Varsani, A. & Martin, D. P. Widely conserved recombination patterns among single-stranded DNA viruses. J Virol 83, 2697–2707, https://doi.org/10.1128/JVI.02152-08 (2009).
Article CAS PubMed Google Scholar
Gonzalez, G., Koyanagi, K. O., Aoki, K. & Watanabe, H. Interregional Coevolution Analysis Revealing Functional and Structural Interrelatedness between Different Genomic Regions in Human Mastadenovirus D. J Virol 89, 6209–6217, https://doi.org/10.1128/JVI.00515-15 (2015).
Article CAS PubMed PubMed Central Google Scholar
Muslin, C., Joffret, M. L., Pelletier, I., Blondel, B. & Delpeyroux, F. Evolution and Emergence of Enteroviruses through Intra- and Inter-species Recombination: Plasticity and Phenotypic Impact of Modular Genetic Exchanges in the 5′ Untranslated Region. PLoS Pathog 11, e1005266, https://doi.org/10.1371/journal.ppat.1005266 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sasaki, M. et al. Distinct Lineages of Bufavirus in Wild Shrews and Nonhuman Primates. Emerg Infect Dis 21, 1230–1233, https://doi.org/10.3201/eid2107.141969 (2015).
Article CAS PubMed PubMed Central Google Scholar
Carr, M. et al. Isolation of a simian immunodeficiency virus from a malbrouck (Chlorocebus cynosuros). Arch Virol 162, 543–548, https://doi.org/10.1007/s00705-016-3129-8 (2017).
Article CAS PubMed Google Scholar
Sasaki, M. et al. Metagenomic analysis of the shrew enteric virome reveals novel viruses related to human stool-associated viruses. J Gen Virol 96, 440–452, https://doi.org/10.1099/vir.0.071209-0 (2015).
Article CAS PubMed Google Scholar
Gonzalez, G. et al. An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus. Sci Rep 7, 40447, https://doi.org/10.1038/srep40447 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Research 8, 186–194, https://doi.org/10.1101/gr.8.3.186 (1998).
Article CAS PubMed Google Scholar
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649, https://doi.org/10.1093/bioinformatics/bts199 (2012).
Article PubMed PubMed Central Google Scholar
Muhire, B. M., Varsani, A. & Martin, D. P. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS One 9, e108277, https://doi.org/10.1371/journal.pone.0108277 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Katoh, K. & Standley, D. M. A simple method to control over-alignment in the MAFFT multiple sequence alignment program. Bioinformatics 32, 1933–1942, https://doi.org/10.1093/bioinformatics/btw108 (2016).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, L. T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol Biol Evol 32, 268–274, https://doi.org/10.1093/molbev/msu300 (2015).
Article CAS PubMed Google Scholar
Ronquist, F. et al. MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space. Syst Biol 61, 539–542, https://doi.org/10.1093/sysbio/sys029 (2012).
Article PubMed PubMed Central Google Scholar
Martin, D. P., Murrell, B., Golden, M., Khoosal, A. & Muhire, B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol 1, vev003, https://doi.org/10.1093/ve/vev003 (2015).
Article PubMed PubMed Central Google Scholar
Martin, D. P., Murrell, B., Khoosal, A. & Muhire, B. Detecting and Analyzing Genetic Recombination Using RDP4. Methods Mol Biol 1525, 433–460, https://doi.org/10.1007/978-1-4939-6622-6_17 (2017).
Article CAS PubMed Google Scholar
Steel, M. A. & Penny, D. Distributions of tree comparison metrics—some new results. Syst Biol 42, 126–141 (1993).
Google Scholar

Download references

Acknowledgements

We thank the Department of National Parks and Wildlife of the Ministry of Tourism and Arts (formerly ZAWA), Government of the Republic of Zambia, for assistance with sample collection in Zambia. This work was supported by the Japan Initiative for Global Research Network of Infectious Diseases (J-GRID) from Japan Agency for Medical Research and Development (AMED) (JP18fm0108008); Grants-in-Aid for Scientific Research on Innovative Areas from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan (16H06429, 16H06431, 16K21723); and Japan Society for the Promotion of Science (JSPS) KAKENHI (16H05805).

Author information

Authors and Affiliations

Division of Molecular Pathobiology, Research Center for Zoonosis Control, Hokkaido University, Sapporo, 001-0020, Japan
Paulina D. Anindita, Michihito Sasaki, Wallaya Phongphaew, Yasuko Orba & Hirofumi Sawa
Division of Bioinformatics, Research Center for Zoonosis Control, Hokkaido University, Sapporo, 001-0020, Japan
Gabriel Gonzalez & Kimihito Ito
Global Institution for Collaborative Research and Education (GI-CoRE), Hokkaido University, Sapporo, 001-0020, Japan
Michael Carr, Kimihito Ito & Hirofumi Sawa
National Virus Reference Laboratory, School of Medicine, University College Dublin, Belfield, Dublin, 4, Ireland
Michael Carr
Department of Paraclinical Studies, School of Veterinary Medicine, University of Zambia, PO Box 32379, Lusaka, 10101, Zambia
Bernard M. Hang’ombe
Department of Disease Control, School of Veterinary Medicine, University of Zambia, PO Box 32379, Lusaka, 10101, Zambia
Aaron S. Mweene
Africa Centre of Excellence for Infectious Diseases of Humans and Animals, University of Zambia, P.O. Box 32379, Lusaka, 10101, Zambia
Bernard M. Hang’ombe, Aaron S. Mweene & Hirofumi Sawa
Global Virus Network Affiliate, University of Zambia, P.O. Box 32379, Lusaka, 10101, Zambia
Bernard M. Hang’ombe, Aaron S. Mweene & Hirofumi Sawa
Global Virus Network, 801 W. Baltimore St., Baltimore, MD, 21201, USA
Hirofumi Sawa

Authors

Paulina D. Anindita
View author publications
You can also search for this author in PubMed Google Scholar
Michihito Sasaki
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Wallaya Phongphaew
View author publications
You can also search for this author in PubMed Google Scholar
Michael Carr
View author publications
You can also search for this author in PubMed Google Scholar
Bernard M. Hang’ombe
View author publications
You can also search for this author in PubMed Google Scholar
Aaron S. Mweene
View author publications
You can also search for this author in PubMed Google Scholar
Kimihito Ito
View author publications
You can also search for this author in PubMed Google Scholar
Yasuko Orba
View author publications
You can also search for this author in PubMed Google Scholar
Hirofumi Sawa
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S. and H.S. conceived the research, P.D.A., M.S., G.G. and W.P. conducted the experiments and analyzed the data. M.S., B.M.H., A.S.M., K.I., Y.O. and H.S. contributed samples/reagents/analysis tools. P.D.A., M.S., G.G., M.C., A.S.M. and H.S. wrote the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Michihito Sasaki.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Anindita, P.D., Sasaki, M., Gonzalez, G. et al. Discovery and genetic characterization of diverse smacoviruses in Zambian non-human primates. Sci Rep 9, 5045 (2019). https://doi.org/10.1038/s41598-019-41358-z

Download citation

Received: 24 January 2019
Accepted: 07 March 2019
Published: 08 April 2019
DOI: https://doi.org/10.1038/s41598-019-41358-z

This article is cited by

Bacterial and Viral Diversity of Didelphid Opossums from Brazil
- Leonardo Cardia Caserta
- Gabriela Mansano do Nascimento
- Clarice Weis Arns
EcoHealth (2023)
Multiple novel smaco-like viruses identified in chicken cloaca swabs
- Shixing Yang
- Dianqi Zhang
- Wen Zhang
Archives of Virology (2022)
The virome of the white-winged vampire bat Diaemus youngi is rich in circular DNA viruses
- André Alberto Witt
- Raquel Silva Alves
- Renata da Fontoura Budaszewski
Virus Genes (2022)
A 2021 taxonomy update for the family Smacoviridae
- Mart Krupovic
- Arvind Varsani
Archives of Virology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Identification of CRESS DNA viruses in Zambian NHP species

Classification and genome organization of Zambian NHP CRESS DNA viruses

Phylogenetic relationships among Zambian CRESS DNA viruses and known smacoviruses

Recombination analysis of smacovirus genomes

Discussion

Materials and Methods

Ethical statement and sample collection

Metagenomic analysis using high-throughput sequencing

PCR screening, whole genome sequencing, and genome annotation

Phylogenetic analysis

Recombination analysis

Data Availability

Change history

06 June 2019

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links