A universal primer-independent next-generation sequencing approach for investigations of norovirus outbreaks and novel variants

Fonager, Jannik; Stegger, Marc; Rasmussen, Lasse Dam; Poulsen, Mille Weismann; Rønn, Jesper; Andersen, Paal Skytt; Fischer, Thea Kølsen

doi:10.1038/s41598-017-00926-x

Download PDF

Article
Open access
Published: 11 April 2017

A universal primer-independent next-generation sequencing approach for investigations of norovirus outbreaks and novel variants

Jannik Fonager¹,
Marc Stegger²,
Lasse Dam Rasmussen¹,
Mille Weismann Poulsen¹,
Jesper Rønn¹,
Paal Skytt Andersen ORCID: orcid.org/0000-0001-5656-0427^2,4 &
…
Thea Kølsen Fischer^1,3

Scientific Reports volume 7, Article number: 813 (2017) Cite this article

3463 Accesses
18 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Norovirus (NoV) is the most common cause of non-bacterial gastroenteritis and is a major agent associated with outbreaks of gastroenteritis. Conventional molecular genotyping analysis of NoV, used for the identification of transmission routes, relies on standard typing methods (STM) by Sanger-sequencing of only a limited part of the NoV genome, which could lead to wrong conclusions. Here, we combined a NoV capture method with next generation sequencing (NGS), which increased the proportion of norovirus reads by ~40 fold compared to NGS without prior capture. Of 15 NoV samples from 6 single-genotype outbreaks, near full-genome coverage (>90%) was obtained from 9 samples. Fourteen polymerase (RdRp) and 15 capsid (cap) genotypes were identified compared to 12 and 13 for the STM, respectively. Analysis of 9 samples from two mixed-genotype outbreaks identified 6 RdRp and 6 cap genotypes (two at >90% NoV genome coverage) compared to 4 and 2 for the STM, respectively. Furthermore, complete or partial sequences from the P2 hypervariable region were obtained from 7 of 8 outbreaks and a new NoV recombinant was identified. This approach could therefore strengthen outbreak investigations and could be applied to other important viruses in stool samples such as hepatitis A and enterovirus.

Minimal requirements for ISO15189 validation and accreditation of three next generation sequencing procedures for SARS-CoV-2 surveillance in clinical setting

Article Open access 28 April 2023

Phylogenomics reveals viral sources, transmission, and potential superinfection in early-stage COVID-19 patients in Ontario, Canada

Article Open access 12 February 2021

Sample descriptors linked to metagenomic sequencing data from human and animal enteric samples from Vietnam

Article Open access 15 October 2019

Introduction

Norovirus (NoV) is a positive-sense single-stranded RNA virus in the Caliciviridae family, and at least 40 genotypes divided into seven genogroups have been identified^1,2,3. NoV infection is the most widespread cause of non-bacterial gastroenteritis, responsible for up to one fifth of all cases of gastroenteritis globally⁴. Despite some progress^{5, 6}, no vaccine or therapeutic intervention is available, and interceptive strategies mainly aim to rapidly identify the source of infection, increase hygiene measures, and isolate infected patients^{7, 8}. Conventional molecular analysis of NoV transmission routes relies mainly on standard typing methods (STM) based on Sanger-sequencing of partial RNA-dependent polymerase and capsid genes (RdRp and cap)^{9, 10}. More recently, sequencing of larger parts of the capsid gene containing the hypervariable P2 region has increased the discriminatory power to resolve outbreaks with higher accuracy^{11, 12}. Although new recombinant or dominant NoV strains has routinely been reported^{13, 14}, commonly used gene-specific primers may have limited the ability to rapidly detect emerging strains due to sequence differences in the primer binding regions. In such situations, it is necessary to amplify and sequence these regions using other primers and subsequently to make these sequences publically available to enable other research groups to redesign their primers accordingly. Therefore, the current STM for analysis could lead to incorrect conclusions about possible transmission chains, underestimation of the genetic diversity of NoV, and delay early identification of new emerging strains. Recent methodological approaches including NGS to achieve full norovirus genome coverage have been published. However, these methods mostly rely on genotype specific primers^15,16,17 why they, are time consuming and need frequent updating due to the high natural mutation rates of NoV¹⁸.

Cultivation of pathogens is commonly used to ensure a pure and high concentration for further investigation and has combined with NGS considerably improved the ability to identify transmission chains and resistance genes for cultivated bacterial infections in particular^{19, 20}. Although human NoV to some degree can replicate in animal models, no cell culture system exists²¹. Therefore investigation of NoV suspected cases is usually limited to direct analysis of viral RNA in stool samples, in which viral genomes only constitute a minor proportion of the nucleic acids present^22,23,24.

Recent studies have addressed several of these problems by using both random sequencing^{24, 25} strategies and different methods for virus enrichment such as: virion isolation and enzymatic removal of host/bacterial nucleic acids²⁶, capture-based^27,28,29 methods or PCR activated cell-sorting methods³⁰. Despite these recent advances, such methods are still time-consuming, laborious and potentially costly and/or rely on approaches that will require extensive periodic updates in primers or probes to reflect the current knowledge on viral diversity.

In this study, we have evaluated an easy-to-use laboratory method that allows for a ~40-fold enrichment of all NoV genotypes in stool samples. Furthermore, we have used bioinformatics approaches to accurately screen for NoV in highly complex samples. Analysis of NoV positive samples from eight foodborne outbreaks yielded sufficient NoV read counts to allow the assembly of several complete or nearly complete genomes for molecular comparisons. Furthermore, this approach allowed for the identification of an additional genotype, missed by STM, as well as discovery of a new recombinant NoV.

Results

Using NGS directly on samples

Despite a large sequencing depth allocated to each sample (1.5 to 5.5 million reads), only a relatively small proportion of the obtained reads were of NoV origin (on average: 0.25%; corresponding to ~700 to ~22,000 reads).

Evaluation of the poly(A)-capture technique

To specifically enrich for NoV RNA and reduce the amount of non-polyadenylated bacterial RNA, a poly(A)-capture method was employed after nucleic acid extraction. To evaluate this enrichment strategy, NoV viral load was measured in 6 GGI and 3 GGII quantitated survey samples were (called QS1 to QS9, See Materials and methods and Table 1) along with five non-quantified survey samples (called S1 to S5, See Table 1 and Materials and Methods). All samples were split after RNA extraction with only one part subjected to poly(A)-capture. SMARTer libraries were constructed from both extracted parts and subjected to MiSeq sequencing simultaneously. The efficiency (Table 1) was evaluated by measuring the proportion of reads mapping to full genome sequences from the common human gut bacterial species Bacterioides uniformis and Ruminococcus bromii L2 + 63³¹ or from a set of 16 sRNA sequences identified in human microbiome studies^{32, 33}. Poly(A)-capture increased the proportion of obtained NoV reads over the entire range of NoV input RNA copies (Log₁₀ 1,89 to 6,82; see Table 1 and Fig. 1), despite some variation for especially samples with low numbers of input NoV RNA copies. While the proportion of bacterial reads was reduced by 0.28 to 0.41 fold, the number of NoV reads increased by on average 45.1 ± 27.77 -fold. Although the average Ct value decreased by 0.96 ± 0.07-fold after poly(A)-capture, the poly(A)-captured NoV was also eluted in only one fifth of the suspension volume used before poly(A)-capture. The average percentages of reads from the non poly(A)-captured survey samples mapping to three approximately equally- sized parts of the NoV reference genome sequences were: 1^st part (genome-position: 1–2499): 32.2% (±17.7%), 2^nd part (genome-position: 2500–4997): 53.3% (±12.8%) and 3^rd part (genome-position: 4998–7496): 14.5% (±8.9%), while the average percentages of NoV reads from the poly(A)-captured survey samples mapping to these regions were: 1^st part: 10.1% (±3.2%), 2^nd part: 43.5% (±8%) and 3^rd part: 46.4% (±8.9%).

Table 1 Summary of quantification, sequencing, mapping and genotyping results of outbreak samples before and after poly(A)-capture.

Full size table

Outbreak analysis

Samples from all eight outbreaks were subjected to the poly(A)-capture method and SMARTer library construction. A general linear trend was observed between the Ct values measured after poly(A)-capture and the number of reads obtained (Fig. 2), although a few samples deviated from this trend by containing a higher than expected number of NoV reads per million reads. Although full-genome coverage (>99%) was observed at ~4,800 NoV reads in total, equivalent to an average coverage of ~80 per sample (Fig. 3), sufficient sequence quality along the entire genome was only observed above ~11,000 reads with an average coverage of ~260.

Assigning genotypes to outbreak samples

The first level of sequence comparison in an outbreak is the comparison of genotypes obtained from different persons in the outbreak. Complete NoV genotyping relies on sufficient sequence coverage in two regions: ORF1 (RdBp/pol) and ORF2 (Cap) for complete genotyping. Using the NGS approach, 14 complete and one partial genotype were detected in 15 samples from 6 of the 8 outbreaks (see Table 2) containing a single NoV genotype compared with 10 complete and five partial genotypes detected with the STM approach.

Table 2 Summary of sequencing, mapping and genotyping results of outbreak samples.

Full size table

Since it had been demonstrated by Real time PCR and STM that two NoV genogroups and several genotypes were involved in Ob-4 and Ob-6, HMM searches for additional genotypes was performed on de novo assembled contigs (See Materials and Methods). From sample Ob-4-1, 10,765 and from sample Ob-4-2 15,391 de novo assemblies were generated, of which 11 and 107 were identified as norovirus assemblies by the HMM search respectively. The candidate NoV contigs were further investigated by BLASTN and genotyping of the contigs, with subsequent reference based mapping which confirmed the presence of the following genotypes in the two samples: Ob-4-1: GII.Pg_GII.1 and GII.4_Sydney, Ob-4-2: GI.Pb_GI.6 and GII.7P_GII.6 (Table 2). Due to insufficient reads mapping to the GI.Pb_GI.6 reference in sample Ob-4-1, a valid phylogenetic comparison could not be performed, although a BLASTN of the consensus sequence generated from the 22 mapped reads indicated this to be GI.Pb_GI.6 as well. The HMM analysis also detected genotype GII.Pg_GII.1 in the Ob-4-1 sample which was not identified by STM.

From Outbreak 6, the following numbers of HMM hits were obtained out of the total number of de novo assembled contigs: Ob-6-1: 0 of 18,361, Ob-6-2: 2 of 1,928, Ob-6-3: 11 of 18,973, Ob-6-4: 6 of 34,959, Ob-6-5: 5 of 37,507, Ob-6-6: 4 of 14,250 and Ob-6-7: 1 of 43,308. Following the same procedure as described for Ob-4 lead to the identification of GII.P7_GII.6 and GI.P3_GI.3 in all seven samples. The GII.P7_GII.6 genotype was supported by a large number of reads in all samples except Ob-6-3 and could be compared phylogenetically, whereas the GI.P3_GI.3 genotype was only supported by a low number of reads in all samples. This also suggested that the two NoV genotypes were present in all the samples at variable concentrations.

Overall, support for the hypothesis of a common infection source by shared pol and cap genotypes for at least one genotype and in at least two different persons from the outbreak was obtained for seven of eight outbreaks with the NGS method and for five of eight outbreaks with the STM method. Similar genotypes were observed in several of the outbreaks, all of which were found to be different (identities: Ob-1/Ob-6: 84.4%, Ob-2/Ob-3: 99.8%, Ob-4/Ob-6: 84.2%, Ob-4/Ob-7/S-4: 97.6% to 98.8%, S-5/Ob-5: 97.3%).

Phylogenetic analysis of outbreak samples

Phylogenetic analysis was performed for all outbreaks, except for Ob-4 due to the absence of shared well-covered genotype reference sequences. Consensus sequences with the following maximum lengths were generated in CLCbio: Ob-1: 7666 nt, Ob-2: 1828 nt, Ob-3: 1226 nt, Ob-4: not analyzed, Ob-5: 1593 nt, Ob-6:7344 nt, Ob-7: 7697 nt, Ob-8: 6228 nt and used in a phylogenetic comparison. The genome coverage is shown in Fig. 4 for the individual outbreaks. The comparison included either the complete hypervariable P2 region (P2 region is 456 to 483 nt depending on genotype) for Ob-1, Ob-6, Ob-7 and Ob-8, and partial P2 region comparisons for Ob-2 (471 nt), Ob-3 (243 nt) and Ob-5 (291 nt). The phylogenetic analysis revealed that the NoV involved in Ob-3, Ob-5 and Ob-7 were 100% identical (Fig. 4f,h and l), whereas differences in the NoV genomes (Fig. 4b,d,j and o) were observed for: Ob-1 (1 nt difference; two samples had an A-residue at reference sequence position 3321 while three samples had a G-residue), Ob-2 (3 nt differences in the P2 region), Ob-6 (1 nt difference in ORF3 and several differences in the 3′ non-coding A rich part of the genome), Ob-8 (1 nt difference in the P2 region).

Identifying a new recombinant

When reads from the two samples from Ob-8 were mapped to the two reference sequences known to be present from the initial partial genotyping, a mutually exclusive distribution of reads was observed (Fig. 5a and b). In addition, reads that spanned the ORF1/ORF2 junction of an in silico generated reference sequence were observed (Fig. 5c), confirming that both these samples harbored a novel GII.P16_GII.4_Sydney recombinant.

Discussion

The use of poly(A)-capture significantly enhanced the number of norovirus reads obtained from stool samples, allowing comparisons of full or near full (>85%) genome sequences from 4 outbreaks and partial genome comparisons in 3 outbreaks. In total, 14 complete and one partial genotype were detected in the 15 samples from the 6 outbreaks containing a single NoV genotype compared with 10 complete and five partial genotypes detected with the STM approach. In addition, additional genotypes (partial or complete) were identified with the NGS approach in the two mixed-genotype outbreaks samples (Ob4 and Ob6). STM generated more genotype information than NGS in four cases. In these cases, either none or a low number of NoV reads of mapped to the specific genotype, however none of the reads mapped to the ORF1 (pol) or ORF2 (cap) genotyping-regions. This showed that although the NGS method overall improved the genotyping results, some samples might be challenging due to low amounts of available virus RNA in combination with the random distribution of reads obtained. One way to reduce this problem would be to allocate a larger sequencing depth for especially samples with low amounts of virus.

The NGS derived consensus sequences used for phylogenetic comparison ranged from 1226 to 7692 nt (average 4800 nt) and included either the complete or a substantial proportion of the hypervariable P2 region. In comparison, STM only covers ~9% of the genome and does not include the P2 region. Therefore, even in the three outbreaks, in which only partial genomes were recovered, the data were found to significantly improve the molecular resolution of outbreaks.

Interestingly, minor nucleotide variations between sequences from different samples from three of the outbreaks were observed. Two of these differences were mapped to the P2 region, known to be highly variable^{12, 34,35,36} and a single nucleotide difference was observed between two groups of samples from a single epidemiologically linked outbreak (Ob-2). This challenge the 100% identity-paradigm used in general NoV outbreak investigations^{12, 36} that normally distinguishes only between identical and non-identical strains. Other studies have also questioned if these strict criteria should be maintained³⁷, when comparing larger parts of the NoV genome.

NoV bioaccumulation in or adhesion to food items such as oysters and lettuce generates complex outbreak profiles including several genotypes^{38, 39}, which require separate RT-PCR amplification steps if STM are used³⁹. In this study, six NoV genotypes were identified in samples from two mixed outbreaks, three of which was supported by high genomic coverages (66% to 99% of the entire NoV genome). HMM improved the detection of genotypes by identifying a genotype (GII.Pg_GII.1) missed using STM. Although phylogenetic comparisons could not be performed for all genotypes due to varied sequence coverage of some genotypes in the samples, greater sequencing depth may circumvent this problem in future analysis. Interestingly, a mutually exclusive presence of genotypes was observed for three of the four genotypes identified in the two samples from Ob-4 and different relative abundances of the two genotypes found in Ob-6 was found for sample Ob-6-3 compared with the other samples. This could indicate differences in host exposure and/or susceptibility to different NoV genotypes in complex outbreaks.

A near-complete genome sequence of (>90%) a new GII.P16_GII.4_Sydney recombinant NoV was directly confirmed from the NGS data by using reads spanning the ORF1/ORF2 junction of the two different genotypes, showing that NGS can be used to distinguish between co-infection with different genotypes and new emergent recombinants.

This study was performed retrospectively on samples stored at −20 °C and previously analyzed by STM where samples had all been freeze-thawed at least twice, which may have resulted in some degree of degradation of the NoV. Five samples were excluded after poly(A)-capture, as a large increase (>5) in Ct values were observed, indicating fragmentation of NoV RNA. Therefore, for future applications of the present method, it will be of great importance to retain NoV RNA integrity until library preparation.

We have introduced a novel NoV enrichment NGS-based approach to investigate foodborne outbreaks without discriminating between genotypes. This method can be used directly to enrich other clinically important viruses in stool such as enteroviruses, or other positive-sense RNA viruses with a polyadenylated 3′ tail. Although the poly (A)-capture lead to a 3′ bias in sequencing depth, it allowed for a significant enrichment of NoV reads obtained from the samples. Future studies are required to test the efficiency of enrichment from other specimen types. Although the likelihood of obtaining complete NoV genomes is strongly dependent on NoV concentration in the sample, deeper sequencing would likely allow for retrieval of more NoV reads even in more scarce NoV samples. With common access to benchtop sequencers, we anticipate that NGS will soon become a definitive, non-discriminatory tool for viral infection control and serve to monitor both the evolution and spread of genotypes and enhance viral outbreak investigations.

Materials and Methods

Ethics statement

According to the “Danish Act on Research Ethics Review of Health Research Projects” this study does not require approval by the ethics committees, as it is considered a quality development/control project and does not analyze human sequences. This was confirmed by the Committees on Health Research Ethics for the Capital Region of Denmark in a specific waiver of approval (H-16019654).

Sample material

Twenty-four NoV positive samples from eight different foodborne outbreaks (termed Ob1 to Ob8) were analyzed (Table 3). Five survey samples (termed S1 to S5) and 9 quantitative survey samples (termed QS1 to QS9) were analyzed with or without poly(A)-capture to assess the efficiency of this method. Five samples where Ct values increased >5 after poly(A)-capture vs. before were excluded from NGS analysis as they were considered to be too degraded.

Table 3 Line-list of eight norovirus-associated outbreaks in Denmark 2013–2015.

Full size table

Extraction of nucleic acids, poly(A) capture, real-time RT-PCR and norovirus typing

Nucleic acids were extracted from 10% stool suspensions (kept at −20 °C) using the MagNA Pure LC (Roche Diagnostics); poly(A)-capture was performed using a Dynabeads mRNA Purification Kit (Ambion Cat. No. 61006) according to the manufacturer’s instructions with modifications to use 100 µL input material and 26 µL Dynabeads. The concentration of nucleic acids was measured using 1 µL extract on a NanoDrop 1000 Spectrophotometer (NanoDrop Technologies). The presence of NoV Genogroup I and II was assessed using real-time multiplex PCR⁴⁰ and genotyping was performed as described previously^{40, 41}.

Quantification of NoV RNA

A quantitative NoV GGI standard was obtained from ATCC (Quantitative Synthetic Norovirus G1 (I) RNA (ATCC® VR3234SD™; specification range (log₁₀) 5–6 RNA copies/µL, of which the lower end range was used for the calculations. In addition, a previously published NoV GGII standard⁴² was obtained from collaborators at the Danish Technical University at a confirmed concentration of 5.19 (log₁₀) ± 4.80 (log₁₀) RNA copies/µL. Both standards were diluted in a fivefold 1:10 dilution series and analyzed in triplicates in the real time multiplex PCR (described above) alongside 9 NoV Quantiative Survey samples (QS1 to QS9; all both with and without poly(A)-capture). Analysis of real time data was performed in MxPro Mx3005 P v4.10, resulting in the following standard curves for GGI and GGII respectively: Y = −3.047xLOG(X) + 41.74; R²: 0.994 and Y = −3.090xLOG(X) + 41.18; R²: 0.971. Calculations of the amount of NoV genomes used as input in the extraction/capture and NGS analyses were also performed in MxPro.

Preparation of samples for Illumina MiSeq sequencing

Single-indexed cDNA libraries were generated using the SMARTer Stranded RNA-Seq Kit (Clontech Inc.) in accordance with the manufacturer’s instructions. Fluorescent measurement of DNA concentrations in each library was performed using Qubit dsDNA BR and ssDNA assay kit (Thermo Fischer Scientific).

Quality trimming and filtering

Sequences were imported into CLCbio’s Genomics Workbench (v. 8.5) with the removal of failed reads. Quality trimming within the workbench was performed using both a modified Mott trimming algorithm implemented (limit = 0.5) and by trimming reads containing more than two ambiguous nucleotides. Human sequence reads were removed by alignment to the homo sapiens hg19 reference genome (similarity fraction = 0.8).

Reference based mapping

Quality-trimmed reads were mapped to reference sequences using the Mapping tool in CLCbio’s Genomics Workbench with default settings. NoV reads from all samples loaded on the same MiSeq run were mapped to all expected reference sequences. The following reference sequences were used for mapping of Miseq reads: JQ388274 (GI.Pb_GI.6), JX459908 (GII.Pe_GII.4_Sydney), KJ685411 (GII.P4_New_Orleans_GII.4_Sydney), DQ456824 (GII.P2_GII.2), EU921389 (GII.P21_GII.3), HCU07611 (GII.Pg_GII.1), JQ388274 (GI.Pb_GI.6), KJ196292.1 (GI.P3_GI.3), KM198484 (GII.P21_GII.3), KM198534 (GII.P7_GII.6) and the following reference sequences were used in the phylogenetic analysis: AB187514.1 (GI.P3_GI.3), KM198484 (GII.P21_GII.3), KM198500.1 (GII.P21_GII.3), EU921389 (GII.P21_GII.3), KJ685411 (GII.P4_New_Orleans_2009_GII.4_Sydney_2012), JX459908 (GII.Pe_GII.4_Sydney), LN854568 (GII.P7_GII.6), JQ388274 (GI.Pb_GI.6), LN854564.1 (GI.Pb_GI.6), KM036380 (GII.P16_GII.13), JX459908 (GII.Pe_GII.4_Sydney). In cases where no appropriate full-length reference sequence was available for mapping of reads, a de novo assembled sequence (see below) or a consensus sequence generated from the most similar full-length reference sequence available was used instead.

De novo assembly

Reads from three outbreaks (Ob-1, Ob-6 and Ob-8) were mapped to de novo assembled reference sequences, as no well-matching and/or full-length reference sequences were identified in public databases. De novo assembled reads were generated using CLCbio’s assembler at default settings and with the fast mapping mode and a minimum contig length of 200 bases.

Generation of consensus sequences

Consensus sequences were generated from mapped reads using the majority vote option and inserting N in places of ambiguity or missing data. Depth thresholds at >0, >2, or ≥5 reads were evaluated for sequence quality, and only high quality consensus sequences (average quality score ≥30, as calculated in CLCbio) were used for sequence comparison and phylogenetic analysis. The P2 region on the consensus sequence was defined as previously described¹¹.

Confirmation of genotypes

Genotypes from all mappings were confirmed by submission of consensus sequences to analysis at the Dutch National Institute for Public Health and the Environment (RIVM)’s NoV typing tool (http://www.rivm.nl/mpf/norovirus/typingtool) and/or BLASTN followed by genotyping of the best hits at RIVM.

Hidden Markov model (HMM) building and searches

In total, 858 sequences matching the terms “norovirus” and “complete” at NCBI (accessed on April 25^th, 2015) were downloaded and genotypes confirmed using the Dutch National Institute for Public Health and the Environment (RIVM)’s NoV typing tool (http://www.rivm.nl/mpf/norovirus/typingtool). From this set, 112 representative sequences were selected for hidden Markov model (HMM) building. Sequences were aligned in MAFFT v.7 (http://www.ebi.ac.uk/Tools/msa/mafft/) and a NoV HMM was built using HMMer 3.0⁴³. HMM searches were performed among de novo assembled reads at default settings and identified assemblies evaluated by BLASTN and NoV typing at RIVM.

Multiple alignment and phylogenetic analyses

Consensus and reference sequences were aligned in MAFFT and phylogenetic analyses were performed by maximum-likelihood with a generalized time-reversible (GTR) substitution model and a 1000 bootstrap replicates in MEGA 6.06⁴⁴.

References

Kroneman, A. et al. Proposal for a unified norovirus nomenclature and genotyping. Arch. Virol. 158, 2059–2068, doi:10.1007/s00705-013-1708-5 (2013).
Article CAS PubMed Google Scholar
Mesquita, J. R., Barclay, L., Nascimento, M. S. & Vinje, J. Novel norovirus in dogs with diarrhea. Emerg. Infect. Dis. 16, 980–982, doi:10.3201/eid1606.091861 (2010).
Article PubMed PubMed Central Google Scholar
Vinje, J. Advances in laboratory methods for detection and typing of norovirus. J. Clin. Microbiol. 53, 373–381, doi:10.1128/JCM.01535-14 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ahmed, S. M. et al. Global prevalence of norovirus in cases of gastroenteritis: a systematic review and meta-analysis. Lancet Infect. Dis. 14, 725–730, doi:10.1016/S1473-3099(14)70767-4 (2014).
Article PubMed Google Scholar
Debbink, K., Lindesmith, L. C. & Baric, R. S. The state of norovirus vaccines. Clin. Infect. Dis. 58, 1746–1752, doi:10.1093/cid/ciu120 (2014).
Article PubMed PubMed Central Google Scholar
Kim, Y., Galasiti Kankanamalage, A. C., Chang, K. O. & Groutas, W. C. Recent Advances in the Discovery of Norovirus Therapeutics. J. Med. Chem. 58, 9438–9450, doi:10.1021/acs.jmedchem.5b00762 (2015).
Article CAS PubMed PubMed Central Google Scholar
Greig, J. D. & Lee, M. B. A review of nosocomial norovirus outbreaks: infection control interventions found effective. Epidemiol. Infect. 140, 1151–1160 (2012).
Article CAS PubMed Google Scholar
Vinnard, C., Lee, I. & Linkin, D. Successful control of a norovirus outbreak among attendees of a hospital teaching conference. Am. J. Infect. Control 40, 73–74, doi:10.1016/j.ajic.2011.03.033 (2012).
Article PubMed Google Scholar
Kojima, S. et al. Genogroup-specific PCR primers for detection of Norwalk-like viruses. J. Virol. Methods 100, 107–114, doi:10.1016/S0166-0934(01)00404-9 (2002).
Article CAS PubMed Google Scholar
Vennema, H., de, B. E. & Koopmans, M. Rational optimization of generic primers used for Norwalk-like virus detection by reverse transcriptase polymerase chain reaction. J. Clin. Virol. 25, 233–235, doi:10.1016/S1386-6532(02)00126-9 (2002).
Article CAS PubMed Google Scholar
Holzknecht, B. J. et al. Sequence analysis of the capsid gene during a genotype II.4 dominated norovirus season in one university hospital: identification of possible transmission routes. PLoS. One. 10, e0115331, doi:10.1371/journal.pone.0115331 (2015).
Article PubMed PubMed Central Google Scholar
Xerry, J., Gallimore, C. I., Iturriza-Gomara, M., Allen, D. J. & Gray, J. J. Transmission events within outbreaks of gastroenteritis determined through analysis of nucleotide sequences of the P2 domain of genogroup II noroviruses. J. Clin. Microbiol. 46, 947–953, doi:10.1128/JCM.02240-07 (2008).
Article CAS PubMed PubMed Central Google Scholar
de, G.M. et al. Emergence of a novel GII.17 norovirus - End of the GII.4 era? Euro. Surveill 20 (2015).
van, B. J. et al. Indications for worldwide increased norovirus activity associated with emergence of a new variant of genotype II.4, late 2012. Euro. Surveill 18, 8–9 (2013).
Google Scholar
Cotten, M. et al. Deep sequencing of norovirus genomes defines evolutionary patterns in an urban tropical setting. J. Virol. 88, 11056–11069, doi:10.1128/JVI.01333-14 (2014).
Article PubMed PubMed Central Google Scholar
Kundu, S. et al. Next-generation whole genome sequencing identifies the direction of norovirus transmission in linked patients. Clin. Infect. Dis. 57, 407–414, doi:10.1093/cid/cit287 (2013).
Article CAS PubMed PubMed Central Google Scholar
Park, J. S. et al. Complete nucleotide sequence analysis of the norovirus GII.4 Sydney variant in South Korea. Biomed. Res. Int 2015, 374637, doi:10.1155/2015/374637 (2015).
PubMed PubMed Central Google Scholar
Victoria, M. et al. Bayesian coalescent inference reveals high evolutionary rates and expansion of Norovirus populations. Infect. Genet. Evol. 9, 927–932, doi:10.1016/j.meegid.2009.06.014 (2009).
Article CAS PubMed Google Scholar
Didelot, X., Bowden, R., Wilson, D. J., Peto, T. E. & Crook, D. W. Transforming clinical microbiology with bacterial genome sequencing. Nat. Rev. Genet. 13, 601–612, doi:10.1038/nrg3226 (2012).
Article CAS PubMed PubMed Central Google Scholar
Long, S. W. et al. A genomic day in the life of a clinical microbiology laboratory. J. Clin. Microbiol. 51, 1272–1277, doi:10.1128/JCM.03237-12 (2013).
Article PubMed PubMed Central Google Scholar
Rocha-Pereira, J., Neyts, J. & Jochmans, D. Norovirus: targets and tools in antiviral drug discovery. Biochem. Pharmacol. 91, 1–11, doi:10.1016/j.bcp.2014.05.021 (2014).
Article CAS PubMed Google Scholar
Shimada, S. et al. Use of S1 nuclease in deep sequencing for detection of double-stranded RNA viruses. J. Vet. Med. Sci. 77, 1163–1166, doi:10.1292/jvms.14-0607 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nakamura, S. et al. Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach. PLoS. One. 4, e4219, doi:10.1371/journal.pone.0004219 (2009).
Article ADS PubMed PubMed Central Google Scholar
Wylie, K. M. et al. Genome sequence of enterovirus D68 from St. Louis, Missouri, USA. Emerg. Infect. Dis. 21, 184–186, doi:10.3201/eid2101.141605 (2015).
Article CAS PubMed PubMed Central Google Scholar
Batty, E. M. et al. A modified RNA-Seq approach for whole genome sequencing of RNA viruses from faecal and blood samples. PLoS. One. 8, e66129, doi:10.1371/journal.pone.0066129 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Hall, R. J. et al. Evaluation of rapid and simple techniques for the enrichment of viruses prior to metagenomic virus discovery. J. Virol. Methods 195, 194–204, doi:10.1016/j.jviromet.2013.08.035 (2014).
Article CAS PubMed Google Scholar
Wylie, T. N., Wylie, K. M., Herter, B. N. & Storch, G. A. Enhanced virome sequencing through solution-based capture enrichment. Genome Res (2015).
Brown, J. R. et al. Norovirus Whole-Genome Sequencing by SureSelect Target Enrichment: a Robust and Sensitive Method. J. Clin. Microbiol. 54, 2530–2537, doi:10.1128/JCM.01052-16 (2016).
Article PubMed PubMed Central Google Scholar
Vinner, L. et al. Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing. Sci. Rep 5, 13201, doi:10.1038/srep13201 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Lim, S. W., Lance, S. T., Stedman, K. M. & Abate, A. R. PCR-activated cell sorting as a general, cultivation-free method for high-throughput identification and enrichment of virus hosts. J. Virol. Methods 242, 14–21, doi:10.1016/j.jviromet.2016.12.009 (2016).
Article PubMed Google Scholar
Qin, J. et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464, 59–65, doi:10.1038/nature08821 (2010).
Article CAS PubMed PubMed Central Google Scholar
A framework for human microbiome research. A framework for human microbiome research. Nature 486, 215–221, doi:10.1038/nature11209 (2012).
Article ADS Google Scholar
Structure, function and diversity of the healthy human microbiome. Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214, doi:10.1038/nature11234 (2012).
Article ADS Google Scholar
Allen, D. J., Gray, J. J., Gallimore, C. I., Xerry, J. & Iturriza-Gomara, M. Analysis of amino acid variation in the P2 domain of the GII-4 norovirus VP1 protein reveals putative variant-specific epitopes. PLoS. One. 3, e1485, doi:10.1371/journal.pone.0001485 (2008).
Article ADS PubMed PubMed Central Google Scholar
Sukhrie, F. H. et al. P2 domain profiles and shedding dynamics in prospectively monitored norovirus outbreaks. J. Clin. Virol. 56, 286–292, doi:10.1016/j.jcv.2012.12.006 (2013).
Article PubMed Google Scholar
Xerry, J., Gallimore, C. I., Iturriza-Gomara, M. & Gray, J. J. Tracking the transmission routes of genogroup II noroviruses in suspected food-borne or environmental outbreaks of gastroenteritis through sequence analysis of the P2 domain. J. Med. Virol. 81, 1298–1304, doi:10.1002/jmv.v81:7 (2009).
Article CAS PubMed Google Scholar
Sukhrie, F. H. et al. Using molecular epidemiology to trace transmission of nosocomial norovirus infection. J. Clin. Microbiol. 49, 602–606, doi:10.1128/JCM.01443-10 (2011).
Article PubMed PubMed Central Google Scholar
Gandhi, K. M., Mandrell, R. E. & Tian, P. Binding of virus-like particles of Norwalk virus to romaine lettuce veins. Appl. Environ. Microbiol. 76, 7997–8003, doi:10.1128/AEM.01566-10 (2010).
Article CAS PubMed PubMed Central Google Scholar
Symes, S. J., Gunesekere, I. C., Marshall, J. A. & Wright, P. J. Norovirus mixed infection in an oyster-associated outbreak: an opportunity for recombination. Arch. Virol. 152, 1075–1086, doi:10.1007/s00705-007-0938-9 (2007).
Article CAS PubMed Google Scholar
van Alphen, L. B. et al. The application of new molecular methods in the investigation of a waterborne outbreak of norovirus in Denmark, 2012. PLoS. One. 9, e105053, doi:10.1371/journal.pone.0105053 (2014).
Article ADS PubMed PubMed Central Google Scholar
Franck, K. T., Fonager, J., Ersboll, A. K. & Bottiger, B. Norovirus epidemiology in community and health care settings and association with patient age, Denmark. Emerg. Infect. Dis. 20, 1123–1131, doi:10.3201/eid2007.130781 (2014).
Article PubMed PubMed Central Google Scholar
Gentry, J., Vinje, J. & Lipp, E. K. A rapid and efficient method for quantitation of genogroups I and II norovirus from oysters and application in other complex environmental samples. J. Virol. Methods 156, 59–65, doi:10.1016/j.jviromet.2008.11.001 (2009).
Article CAS PubMed Google Scholar
Eddy, S. R. Accelerated Profile HMM Searches. PLoS. Comput. Biol. 7, e1002195, doi:10.1371/journal.pcbi.1002195 (2011).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729, doi:10.1093/molbev/mst197 (2013).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the Danish Veterinary and Food administration for obtaining and providing samples from outbreaks used in in this study and to Dr. Anna Charlotte Schultz, Danish Technical University for providing us with the NoV GGII quantitative standard.

Author information

Authors and Affiliations

Virology Surveillance and Research Section, Department of Microbiological diagnostics and Virology, Statens Serum Institut, Copenhagen, Denmark
Jannik Fonager, Lasse Dam Rasmussen, Mille Weismann Poulsen, Jesper Rønn & Thea Kølsen Fischer
Department of Microbiology and Infection Control, Statens Serum Institut, Copenhagen, Denmark
Marc Stegger & Paal Skytt Andersen
Department of Infectious Diseases and Centre for Global health, Clinical Unit, University of Southern Denmark, Odense, Denmark
Thea Kølsen Fischer
Department of Veterinary Disease Biology, University of Copenhagen, Copenhagen, Denmark
Paal Skytt Andersen

Authors

Jannik Fonager
View author publications
You can also search for this author in PubMed Google Scholar
Marc Stegger
View author publications
You can also search for this author in PubMed Google Scholar
Lasse Dam Rasmussen
View author publications
You can also search for this author in PubMed Google Scholar
Mille Weismann Poulsen
View author publications
You can also search for this author in PubMed Google Scholar
Jesper Rønn
View author publications
You can also search for this author in PubMed Google Scholar
Paal Skytt Andersen
View author publications
You can also search for this author in PubMed Google Scholar
Thea Kølsen Fischer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived concept and designed experiments J.F., M.S., L.D.R., T.K.F., P.S.A. Performed the experiments: M.W.P., J.R., M.S. Analysed the data: J.F., M.S. Contributed reagents/materials/analysis tools: T.K.F., P.S. Prepared the manuscript: J.F. Critical revision and approval of manuscript: All.

Corresponding author

Correspondence to Jannik Fonager.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Accession codes: All mapped virus reads were submitted to the European Nucleotide Archive (ENA) under the study accession number PRJEB14339 (http://www.ebi.ac.uk/ena/data/view/PRJEB14339).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fonager, J., Stegger, M., Rasmussen, L.D. et al. A universal primer-independent next-generation sequencing approach for investigations of norovirus outbreaks and novel variants. Sci Rep 7, 813 (2017). https://doi.org/10.1038/s41598-017-00926-x

Download citation

Received: 16 June 2016
Accepted: 21 March 2017
Published: 11 April 2017
DOI: https://doi.org/10.1038/s41598-017-00926-x

This article is cited by

Using molecular methods to delineate norovirus outbreaks: a systematic review
- Luqiu Tao
- Xinyang Zhang
- Jie Ding
Archives of Virology (2024)
Detection and Sequencing of Multiple Human Norovirus Genotypes from Imported Frozen Raspberries Linked to Outbreaks in the Province of Quebec, Canada, in 2017
- Philippe Raymond
- Sylvianne Paul
- Hugues Charest
Food and Environmental Virology (2022)
Novel opportunities for NGS-based one health surveillance of foodborne viruses
- Marion Desdouits
- Miranda de Graaf
- Marion P. G. Koopmans
One Health Outlook (2020)
Unrevealed genetic diversity of GII Norovirus in the swine population of North East Italy
- L. Cavicchio
- L. Tassoni
- M. S. Beato
Scientific Reports (2020)
Genetic characterization of norovirus GII.4 variants circulating in Canada using a metagenomic technique
- Nicholas Petronella
- Jennifer Ronholm
- Neda Nasheri
BMC Infectious Diseases (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Using NGS directly on samples

Evaluation of the poly(A)-capture technique

Outbreak analysis

Assigning genotypes to outbreak samples

Phylogenetic analysis of outbreak samples

Identifying a new recombinant

Discussion

Materials and Methods

Ethics statement

Sample material

Extraction of nucleic acids, poly(A) capture, real-time RT-PCR and norovirus typing

Quantification of NoV RNA

Preparation of samples for Illumina MiSeq sequencing

Quality trimming and filtering

Reference based mapping

De novo assembly

Generation of consensus sequences

Confirmation of genotypes

Hidden Markov model (HMM) building and searches

Multiple alignment and phylogenetic analyses

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links