Abstract
The diversity of pathogens associated with acute respiratory infection (ARI) makes diagnosis challenging. Traditional pathogen screening tests have a limited detection range and provide little additional information. We used total RNA sequencing (“meta-transcriptomics”) to reveal the full spectrum of microbes associated with paediatric ARI. Throat swabs were collected from 48 paediatric ARI patients and 7 healthy controls. Samples were subjected to meta-transcriptomics to determine the presence and abundance of viral, bacterial, and eukaryotic pathogens, and to reveal mixed infections, pathogen genotypes/subtypes, evolutionary origins, epidemiological history, and antimicrobial resistance. We identified 11 RNA viruses, 4 DNA viruses, 4 species of bacteria, and 1 fungus. While most are known to cause ARIs, others, such as echovirus 6, are rarely associated with respiratory disease. Co-infection of viruses and bacteria and of multiple viruses were commonplace (9/48), with one patient harboring 5 different pathogens, and genome sequence data revealed large intra-species diversity. Expressed resistance against eight classes of antibiotic was detected, with those for MLS, Bla, Tet, Phe at relatively high abundance. In summary, we used a simple total RNA sequencing approach to reveal the complex polymicrobial infectome in ARI. This provided comprehensive and clinically informative information relevant to understanding respiratory disease.
Similar content being viewed by others
Introduction
Acute respiratory infections (ARI) are a leading cause of morbidity and mortality in newborns and young children, who experience an average of 3 to 6 ARIs annually1,2,3. Identifying the diversity of pathogens responsible for ARIs remains challenging because they involve a diverse set of viruses, bacteria, and fungal pathogens, with co-infection among them commonplace4,5. Traditional testing methods such as PCR, serological typing, bacterial culture and antibody detection, are regarded as the “gold standard” and widely used in ARI diagnosis6,7. However, despite an ongoing effort to include multiple pathogens in a single assay8,9, it remains difficult to simultaneously identify all potential ARI pathogens and capture new or uncommon respiratory pathogens10.
Metagenomic next-generation sequencing (mNGS) is an unbiased way of discovering a broad range of infectious agents11,12,13, and has been recently introduced into clinical research to investigate the microbial cause of unusual disease cases14, perform broad-scale surveys for pathogens in undiagnosed diseases15,16, and understand the role of opportunistic infections17,18. For example, a study of severe pneumonia revealed that mNGS is both efficient and reliable19,20. Importantly, the utility of mNGS goes beyond pathogen identification. In particular, total RNA sequencing (“meta-transcriptomics”) has successfully revealed the entire “infectome” (viruses, bacteria and eukaryotes) present within an organism and provided relevant data on genome sequence, gene expression, and pathogen abundance21,22,23,24,25,26. In addition, the focus on expressed RNA means that the sequence data generated is not dominated by those from the host genome, in turn simplifying pathogen discovery.
Herein, we used meta-transcriptomics to investigate the infectious causes of ARI in 48 children attending a single hospital in China over a defined time-period. Our aim was to characterize the complex infectome of ARIs using a straightforward yet powerful approach that provides plentiful information in addition to identifying the likely pathogen, establishing a bench-mark for future studies in this area. Accordingly, the total spectrum of microbes present in patient throat swabs samples were identified, which involved both complex and diagnostically challenging cases.
Results
Cohort characteristics
In total, throat swab samples of 48 paediatric patients and seven controls were subjected to meta-transcriptomics analysis (Fig. 1). The sex ratio of the patients was 2:1 male to female, with age ranging from neonates to 6 years (medium, 2.9 years). These patients all had final diagnoses of bronchopneumonia (37/48), pneumonia (3/48), acute upper respiratory tract infection (4/48), acute bronchitis (3/48), acute tonsillitis (2/48), or influenza (4/48) (Fig. 1). The corresponding symptoms included fever, cough, phlegm, snivel, wheeze, and nasal obstruction. Rash, oral herpes, pharynx hyperemia, and diarrhea were observed in some cases (Fig. 1).
Characterizing total infectomes
All 55 samples (48 patients, seven controls) were examined individually using total RNA sequencing, which generated between 13.7–98.1 million reads per sample (Table S1). Downstream analyses based on reads and assembled contigs revealed a broad range of microbes, including potential pathogens likely responsible for ARI, that comprised RNA viruses, DNA viruses, bacteria, and fungi.
Presence and abundance of viruses
Blastx comparisons against the nr database identified at least 15 virus species associated with human infection: 11 RNA viruses and four DNA viruses (Fig. 2). The total virus positive rate was 73% (35/48). Other than picobirnaviruses that are present in both case and control samples, all the other viruses are known to be disease associated, although some (such as human rhinovirus A and Epstein–Barr virus) are opportunistic pathogens frequently found in healthy populations. Among the clinically relevant viruses, influenza B virus (InfB) had the highest positivity rate (n = 10), followed by human metapneumovirus (HMPV, n = 9), human cytomegalovirus (HCMV, n = 9), human respiratory syncytial virus (HRSV, n = 4), human parainfluenza virus 3 (HPIV3, n = 3), human coronavirus HKU1 (HCoV HKU1, n = 3), rhinoviruses A (HRVA, n = 2) and C (HRVC, n = 3), and influenza A virus (H1N1, n = 2). Measles virus was detected in two of the samples, although neither of these patients exhibited symptoms compatible with measles infection, such as rash or Koplik’s spots. In addition, echovirus 6, an enterovirus not commonly with respiratory infections, was detected in two patients.
We quantified virus expression levels by estimating their relative abundance (RPM, reads per million). Accordingly, the highest abundance was 63,005 RPM (Human parainfluenza virus 3), while those lower than 1 RPM were not considered as true positives (Fig. 2). Viruses with greater than 10,000 RPM (>1% of total reads) included echovirus E6 (Case 22), HRSV (Case 4), and HPIV-3 (Case 8). Such high levels of abundance suggest active replication. In comparison, picobirnaviruses, herpes simplex virus 1, and papillomaviruses were all at very low abundance (Fig. 2). Importantly, although RNA sequencing was performed, the abundance of DNA viruses can also be quantified by estimating the abundance of the RNA transcripts they produce. Although no DNA viruses were highly abundant, in Cases 1, 10, 15, 31 the abundance levels of herpesviruses (i.e. HCMV and EBV) were greater than 100 RPM (Fig. 2).
Virus characterization
The meta-transcriptomic data generated here also provided information on the specific viral genotypes/lineages present in this population and their epidemiological origins (Fig. 3). Notably, multiple genotypes or subtypes were identified in a number of cases, including both the Yamagata and Victoria lineages of influenza B virus, and the A and B genotypes of HMPV (Fig. 3A,D). Conversely, phylogenetic analysis revealed distinct sequence clusters containing several very closely related genomes, indicating that these viruses might originate from the same outbreak in the Chinese population (Fig. 3A,D). To exclude contamination, all metagenomic hits to influenza B viruses and HMPV were confirmed by PCR. Interestingly, the MeV sequence in patient 8 exhibited a very close phylogenetic relationship to a Chinese vaccine strain (Shanghai-191, 99.97% nucleotide identity) (Fig. 3F): this suggests a vaccine origin, although one that has clearly replicated to high levels in this patient (702 RPM) (Fig. 2). In contrast, the echovirus 6 virus discovered here (EchoE6/22/ZJ/CHN/2018), was relatively distant to known viruses (<89.78% nt identity), such that it likely represents a new variant of this virus (Fig. 3G). The only influenza A virus identified belonged to the H1N1/09 subtype (Fig. 3C).
Bacteria and fungi
The dominant bacterial genera identified in these patients were Neisseria (20.19%, measured in RPM), Prevotella (19.89%), Veillonella (19.76%), Leptotrichiaceae unclassified (16.01%), Capnocytophaga (8.76%), Haemophilus (5.76%), and Streptococcus (2.66%) (Fig. S1). However, since relatively high bacterial abundance was observed in both patient and control samples it is difficult to distinguish commensal from potentially pathogenic bacteria at the genus level (Fig. S2). A further species level identification based on both MetaPhlAn and multiple reference gene mapping identified several disease-causing bacterial species, including Haemophilus influenzae (three cases), Klebsiella pnemoniae, Moraxella catarrhalis, and Streptococcus pneumoniae that may be responsible for some of the ARIs observed (Fig. 4 and Table 1).
Notably, a fungal infection was identified in one patient (Case 12), for which the initial culturing of 10 samples identified a yeast-like organism (Table 1). This was validated in our metagenomic study, with the sequencing reads mapping to fungi of the genus Candida. Subsequently, we retrieved the cytochrome c oxidase subunit 1 (COX 1) gene and an internal transcribed spacer region (ITS, 375 bp)27 from the assembled contigs of the fungus species and utilized these in phylogenetic analyses: this revealed that the newly identified fungus was closely related to C. Africana and C. albicans (Fig. 5). However, its position in the COX1 phylogeny and its divergence (93.33%) suggests that it represents a new fungal species.
Microbial co-infections
In total, 28 of the 48 cases harboured at least one potentially pathogenic microbe that are likely responsible for the ARIs observed (Table 1). Notably, among these, nine cases were co-infected with more than two pathogens, five had virus and bacteria co-infections, seven had multiple virus infections, and in one case there was evidence for the presence of at least five different viral species. Microbial co-infections were therefore commonplace in these patients. The virus and bacteria species involved in co-infections were diverse, although it is notable that HMPV and Haemophilus influenzae co-appeared in three cases (Table 1).
Comparison with other diagnostic methods
As well as our meta-transcriptomic analysis, all the patients studied here were subjected to other pathogen screening protocols performed at the Hangzhou Children’s Hospital. These diagnostic tests included bacterial/fungal culture, PCR, real-time PCR, and antibody-based testing, using either the same (i.e. oral swaps) or different (e.g. sputum, blood) samples as the metagenomics. Notably, a large proportion of the viruses, especially RNA viruses, were not identified in the diagnostic laboratory tests, and sometimes the tests were unavailable or not performed. Consequently, many cases with high viral loads (e.g. Cases 1–4, 8, and 30) were diagnosed as bacterial infections (Table 1). In the case of bacteria and fungi the meta-transcriptomics and standard diagnostic tests returned both consistent (Cases 4 and 12) and inconsistent (Cases 1–3, 5–8, 11, 15, 20, and 40) results, although meta-transcriptomics had a lower bacterial detection rate (Table 1).
Diversity and abundance of antibiotic resistance genes (ARGs)
We used the meta-transcriptomic data generated here to screen for the presence and relative abundance of ARGs. After removing those ARGs likely associated with cloning vectors (e.g. TEM-116 and TetC), our analyses revealed the expression of resistance genes against eight classes of antibiotics: aminoglycosides (AGly), beta-lactamases (Bla), fluoroquinolones (Flq), macrolide-lincosamide-streptogramin (MLS), phenicols (Phe), sulfonamides (Sul), tetracyclines (Tet), trimethoprim (Tmt) (Fig. 6). Strikingly, there was a significant difference (P < 0.05, Fig. S3) in the diversity and abundance of ARGs between cases and controls. The number of ARGs in each case varied from 1–25, with the total abundance level 0.11–263.16 RPM, whereas for the controls the numbers varied from 5–13, with abundance ranging from 0.11–13.78 (Fig. 6).
Individual cases
We now discuss a number of the individual cases in more detail, each highlighting different aspects of the complex microbial basis of ARI.
Multiple virus infections
Case 8 was a 9-month boy who presented with persistent cough of seven days duration followed by fever. He was twice admitted to hospital with bronchitis before sampling, and was diagnosed with bronchopneumonia on his second visit. A blood examination revealed a WBC count of 18.89/ul, including 33.1% neutrophils, 55.8% lymphocytes, a HGB 121 g/l, Plt 332ul and CRP 85 mg/L. The initial laboratory test suggested that Staphylococcus aureus, Viridans streptococci and Haemophilus influenzae were present in this patient, none of which were confirmed with our meta-transcriptomics analysis. Instead, five different viruses were detected in this patient, including three that are likely pathogenic (HRV-C, HPIV3, and HMPV), one vaccine related (measles virus), as well as one opportunistic virus (HCMV). These viruses all had medium to high abundance (48.07–63005.37 RPM) (Figs. 2 and S4), indicative of active replication. HPIV3 had the highest abundance and is commonly associated with bronchiolitis and pneumonia. The disease manifestation was not severe, no measles specific syndromes were recorded, and the patient was discharged after five days.
Fungal infection
Case 12 was 2-year-old girl with seven days of fever (39 °C) and one day of cough who presented with a typical hand, foot and mouth disease (HFMD) syndrome. However, a PCR and antibody test against EV71, Coxsackievirus A16 and enteroviruses (universal) were negative. A throat culture followed by microscopy suggested the presence of a yeast-like agent, which was identified as a member of the genus Candida by phylogenetic analyses (see above; Fig. 5). The species identified is closely related to C. albicans and C. Africana, often associated with a condition called “thrush”.
Enterovirus infection
Case 20 was 5-year-old boy who experienced seven days paroxysmal cough and two of days fever, as well as paroxysmal attacks of frontal pain. Blood and throat swab cultures for bacterial and fungal pathogens were all negative, as were PCR and antibody tests for common respiratory viruses. Strikingly, our meta-transcriptomic analyses identified a divergent variant of echovirus 6 present at high abundance (11382.76 RPM) (Fig. 2). Echovirus 6 is a member of enterovirus group B28, and associated with aseptic meningitis, herpangina, HFMD, and sometimes respiratory disease: hence, this is the likely pathogen in this particular case.
Coronavirus infection
Case 36 was an 11-month-old boy who presented with seven days of fever (>38 °C) and cough. PCR and antibody tests against common respiratory viruses were all negative, and a throat swab culture found no evidence of bacterial or fungal pathogens. In contrast, meta-transcriptomics identified human coronavirus HKU1, which is the likely pathogen in this case (Fig. 2 and Table 1). Importantly, this virus is often excluded from the PCR panel used for respiratory disease, thereby illustrating the utility of meta-transcriptomics.
Discussion
We used an expansive meta-transcriptomics approach to characterize the microbial pathogens associated with paediatric ARI. By revealing a diverse array of pathogens that are often not included in standard laboratory diagnostics we demonstrated that total RNA sequencing is an efficient, accurate and powerful means to characterize the infectome of a target tissue in clinical cases.
Importantly, since NGS-based pathogen discovery involves complicated multistep protocols, contamination can be introduced from a variety of sources such as the laboratory environment, the reagents used, and from multiplexing samples in a single sequencing run29,30. To reduce the likelihood of contamination in this study, we utilised (i) case controls, (ii) detailed genetic analyses involving full genome or multiple genes, (iii) PCR confirmation in the case of closely related sequences, and (iv) additional evidence whenever a new species was identified, such as the use of microscopy in the case of the novel fungal pathogen.
Not only did our meta-transcriptomic analysis reveal a diversity of RNA viruses, but also DNA viruses, bacteria and fungi through their expressed RNA. In addition to those pathogens routinely screened, we were able to identify a novel fungal pathogen, a divergent variant of an atypical respiratory virus (echovirus 6), a replicating vaccine strain of measles virus, and an array of pathogens not commonly included in the PCR panels routinely used for respiratory pathogens, such as coronaviruses, rhinoviruses, and herpesviruses. Meta-transcriptomics also provided substantial additional biological information, including pathogen abundance (a marker of replication activity and infectivity), genome sequence data that informs on epidemiological and evolutionary origins, and information on antimicrobial resistance genes that will assist treatment plans.
Also striking was that ARI was often associated with multiple viruses or the co-occurrence of both viral and bacterial pathogens within a single patient. Synergistic interactions have been frequently described between bacteria and viruses, although their importance to disease manifestation requires further examination5,31. As a case in point, the patients studied here that harboured more than two likely pathogens did not experience an elevated disease severity.
A notable result was the marked difference between the pathogens detected by meta-transcriptomics and those determined by more routine laboratory diagnostics. As might be expected given the focus on RNA26, meta-transcriptomics generated a much larger diversity of viruses, especially those with RNA genomes. In addition, the assays for RNA viruses are costly and time-consuming for most hospital laboratories, and there are insufficient diagnostical tests to cover the entire diversity of respiratory viruses. Conversely, more bacteria were discovered using traditional culture methods, which is also likely true of shotgun DNA sequencing-based metagenomics32. This suggests that RNA-based meta-transcriptomics can be subject to false-negative results in some instances. This likely occurs because metagenomic bacterial identification was based on non-rRNA genes whose RNA expression level is often relatively low25. However, because they may be indicative of more elevated replication, those bacteria identified by meta-transcriptomics are perhaps more likely relevant to disease manifestation.
In sum, our study described a straightforward and powerful way to investigate the full infectome of paediatric ARIs, establishing an bench-mark for this important disease syndrome. Despite this, it is clear that studies based on larger sample sizes are necessarily required for a more complete understanding of the microbial basis of ARIs in different patient populations.
Methods
Ethics statement
This project was approved by the ethics committee of the Zhejiang Provincial Centre for Disease Control and Prevention, China. The need for informed consent from parents/guardians was waived by the ethics committee of the Zhejiang Provincial Centre for Disease Control and Prevention because the analyses were performed retrospectively following standard diagnostic tests, posing no extra patient burden. In addition, all data were de-identified and anonymous. All human-related sample processing and sequencing were performed in accordance with relevant guidelines and regulations of the Zhejiang Provincial CDC.
Respiratory sample collection
We studied children with ARIs admitted to Hangzhou Children’s Hospital (Zhejiang, China) between March 2017 and January 2018. The inclusion criteria were inpatient age between 0 and 10 years who presented with at least two of the following: cough, fever, snivel, sneeze, pharyngeal discomfort, and nasal obstruction, and who were diagnosed with either bronchopneumonia, pneumonia, acute bronchitis, or acute upper respiratory tract infection. Throat swabs were collected as part of the diagnostic routine protocol, and among these 48 samples were selected for meta-transcriptomics. For comparison, samples of 7 healthy subjects were collected between October 2017 - February 2018 and utilized as controls. All specimens were collected by sterile flocked swab and were immediately sent to the hospital laboratory where they were stored at −20 °C for less than 24 hours. They were later transferred on dry ice to a −80 °C freezer.
RNA extraction, library construction and sequencing
Samples were subjected to RNA extraction using TRlzol LS reagent (Invitrogen, USA). After obtaining total RNA, purification steps were performed using the RNeasy Plus Mini Kit (Qiagen, USA) according to the manufacturer’s instructions. The concentration and quality of final extractions were examined using a NanoVue plus spectrophotometer (GE Healthcare, UK). In all cases library preparation kits that target low-concentration RNA samples (as low as 500 pg), including the SMARTer® Stranded Total RNA-Seq Kit v2 - Pico Input Mammalian (Takara Bio, USA) and the Trio RNA-Seq kit (NuGEN Technologies, USA) were used. Paired-end (150 bp) sequencing of these RNA library was performed using the Illumina HiSeq platform, generating between 4.12–29.44 Gbp data for each library/case (Table S1).
Pathogen characterization
For each library/case, we performed quality control and removed adaptor sequences and low-quality/low-complex reads. Human reads were removed by mapping to the human genome. All non-human sequence reads generated here have been deposited on the NCBI Sequence Read Achieve (SRA; BioProject accession PRJNA540900). These reads were then compared to the non-redundant protein database using Diamond33, as well as to a reference virus database downloaded from GenBank using blastn. Blast hits were sorted taxonomically to the species level by matching the accession number with the taxonomy database. Virus reads were then de novo assembled using Megahit34,35, with virus identified based on the blast procedure described above. In cases with low genome coverage, reads were directly mapped to the sequence of a close relative, and a consensus genome was obtained from the mapped reads. The resulting complete or partial virus genomes (Table S2) were aligned with related viruses from GenBank using MAFFT version 736, and subjected to phylogenetic analysis using the maximum likelihood method in PhyML 3.037, employing the General Time Reversible (GTR) model of nucleotide substitution with a gamma distribution of among-site rate variation, and 1000 bootstrap replicates.
To estimate virus abundance, reads were mapped back to each virus genome, and the relation “mapped reads/total reads * one million” was used to calculate the number of Reads Per Million (RPM). To exclude contamination due to index hopping, for each virus only those present at >0.1% of the highest viral abundance were considered true positives. For viruses in which highly similar sequences appeared in multiple libraries (i.e. influenza B virus, human metapneumovirus), RT-PCR and Sanger sequencing were performed to confirm their presence.
Bacterial and fungal pathogens were initially identified using MetaPhlAn238, which mapped the non-human sequence reads to a set of ∼1 million marker genes from >7500 microbial species. These results were confirmed using phylogenetic analyses based on multiple gene sets (bacteria, with reference sequences downloaded from https://pubmlst.org/; Table S3) and the COX1 gene (fungi). All sequences were obtained from either the assembled contigs (blastx) or by direct mapping to a reference gene set. All genes associated with antibiotic resistance (ARGs) were first identified by Short Read Sequencing Typing (SRST2)39. To confirm that ARGs did not originate from cloning vectors, the assembled contigs were blasted against the nt database. The diversity and abundance of ARGs among the cases and controls were compared using a t-test.
Data availability
All non-human sequence reads generated have been deposited on the NCBI Sequence Read Achieve (SRA) database (BioProject accession PRJNA540900). And all sequence alignments used for phylogenetic analyses have been included as Supplementary Data.
References
Monto, A. S. & Ullman, B. M. Acute respiratory illness in an American community: The Tecumseh study. JAMA 227, 164–169 (1974).
Simoes, E. A. F. et al. Acute respiratory infections in children. In: Disease Control Priorities in Developing Countries (2 nd Edition) 483–499 (Oxford University Press 2016).
Liu, L. et al. Global, regional, and national causes of child mortality: an updated systematic analysis for 2010 with time trends since 2000. The Lancet 379, 2151–2161 (2012).
Tregoning, J. S. & Schwarze, J. Respiratory viral infections in infants: causes, clinical symptoms, virology, and immunology. Clin. Microbiol. Rev. 23, 74–98 (2010).
Bosch, A. A., Biesbroek, G., Trzcinski, K., Sanders, E. A. & Bogaert, D. Viral and bacterial interactions in the upper respiratory tract. PLoS Pathog. 9, e1003057 (2013).
Das, S., Dunbar, S. & Tang, Y. W. Laboratory diagnosis of respiratory tract infections in children - the state of the art. Front. Microbiol. 9, 2478 (2018).
Mahony, J. B. Detection of respiratory viruses by molecular methods. Clin. Microbiol. Rev. 21, 716–747 (2008).
Arens, M. Q. et al. Comparison of the Eragen multi-code respiratory virus panel with conventional viral testing and real-time multiplex PCR assays for detection of respiratory viruses. J. Clin. Microbiol. 48, 2387–1395 (2010).
Tiveljung-Lindell, A. et al. Development and implementation of a molecular diagnostic platform for daily rapid detection of 15 respiratory viruses. J Med Virol 81, 167–75 (2009).
Rajapaksha, P. et al. A review of methods for the detection of pathogenic microorganisms. Analyst 144, 396–411 (2019).
Gu, W., Miller, S. & Chiu, C. Y. Clinical metagenomic next-generation sequencing for pathogen detection. Annu. Rev. Pathol. 14, 319–338 (2019).
Wilson, M. R. et al. Clinical metagenomic sequencing for diagnosis of meningitis and encephalitis. N. Engl. J. Med. 380, 2327–2340 (2019).
Chiu, C. Y. & Miller, S. A. Clinical metagenomics. Nat. Rev. Genet. 20, 341–355 (2019).
Palacios, G. et al. A new arenavirus in a cluster of fatal transplant-associated diseases. N. Engl. J. Med. 358, 991–998 (2008).
Wilson, M. R. et al. Chronic meningitis investigated via metagenomic next-generation sequencing. JAMA Neurol. 75, 947–955 (2018).
Brown, J. R., Bharucha, T. & Breuer, J. Encephalitis diagnosis using metagenomics: application of next generation sequencing for undiagnosed cases. J. Infect. 76, 225–240 (2018).
Tirosh, O. et al. Expanded skin virome in DOCK8-deficient patients. Nat. Med. 24, 1815–1821 (2018).
Suzuki, T. et al. Comprehensive detection of viruses in pediatric patients with acute liver failure using next-generation sequencing. J. Clin. Virol. 96, 67–72 (2017).
Xie, Y. et al. Next generation sequencing for diagnosis of severe pneumonia: China, 2010–2018. J. Infect. 78, 158–69 (2019).
Liu, X. et al. Exploration of high-throughput sequencing method in severe pneumonia pathogens detection. Chinese Journal of Laboratory Medicine 40, 609–613 (2017).
Eden, J.-S. et al. Francisella tularensis ssp. holarctica in ringtail possums, Australia. Emerg. Infect. Dis. 23, 1198–1201 (2017).
Li, C. X. et al. Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses. eLife 4, e05378 (2015).
Shi, M. et al. Redefining the invertebrate RNA virosphere. Nature 540, 539–543 (2016).
Shi, M. et al. The evolutionary history of vertebrate RNA viruses. Nature 556, 197–202 (2018).
Shi, M. et al. No detectable effect of Wolbachia wMel on the prevalence and abundance of the RNA virome of Drosophila melanogaster. Proc. Biol. Sci. 285, 20181165 (2018).
Shi, M., Zhang, Y. Z. & Holmes, E. C. Meta-transcriptomics and the evolutionary biology of RNA viruses. Virus Res. 243, 83–90 (2018).
Giosa, D. et al. Whole RNA-Sequencing and transcriptome assembly of Candida albicans and Candida africana under chlamydospore-inducing conditions. Genome Biol. Evol. 9, 1971–1977 (2017).
Tebruegge, M. & Curtis, N. Enterovirus infections in neonates. Sem. Fetal Neonatal Med. 14, 222–227 (2009).
Head, S. R. et al. Library construction for next-generation sequencing: overviews and challenges. Biotechniques 56, 61–77 (2014).
Asplund, M. et al. Contaminating viral sequences in high-throughput sequencing viromics: a linkage study of 700 sequencing libraries. Clin. Microbiol. Infect. 25, 1277–1285 (2019).
Brealey, J. C., Sly, P. D., Young, P. R. & Chappell, K. J. Viral bacterial co-infection of the respiratory tract during early childhood. FEMS Microbiol. Lett. 362, fnv062 (2015).
Hilton, S. K. et al. Metataxonomic and Metagenomic Approaches vs. Culture-Based Techniques for Clinical Pathology. Front. Microbiol. 7, 484 (2016).
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).
Li, D. et al. MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 102, 3–11 (2016).
Li, D., Liu, C. M., Luo, R., Sadakane, K. & Lam, T. W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31, 1674–1676 (2015).
Nakamura, T., Yamada, K. D., Tomii, K. & Katoh, K. Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 34, 2490–2492 (2018).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Truong, D. T. et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat. Methods 12, 902–903 (2015).
Inouye, M. et al. SRST2: Rapid genomic surveillance for public health and hospital microbiology labs. Genome Med. 6, 90 (2014).
Acknowledgements
This work was supported by the 60th China postdoctoral Science Foundation (grant No. 2016M601972) and by a Zhejiang CDC research support grant. ECH is supported by an ARC Australian Laureate Fellowship (FL170100022).
Author information
Authors and Affiliations
Contributions
All authors agree to be accountable for all aspects of the work and read and approved the final manuscript. C.X.L. E.C.H. and M.S. contributed to the study concept and design, data review and comments, interpretation of data, study supervision, and manuscript preparation. C.X.L., W.L., J.Z., B.Z., Y.F., C.P.X., and Y.Y.L. contributed to the data acquisition, data preparation, and data analyses. C.X.L., W.L., J.Z., B.Z., Y.F., C.P.X., Y.Y.L., E.C.H. and M.S. contributed to the organization, data preparation, and manuscript review and critique.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, CX., Li, W., Zhou, J. et al. High resolution metagenomic characterization of complex infectomes in paediatric acute respiratory infection. Sci Rep 10, 3963 (2020). https://doi.org/10.1038/s41598-020-60992-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-60992-6
This article is cited by
-
Cont-ID: detection of sample cross-contamination in viral metagenomic data
BMC Biology (2023)
-
Transcript host-RNA signatures to discriminate bacterial and viral infections in febrile children
Pediatric Research (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.