Norovirus (NoV) is the most common cause of non-bacterial gastroenteritis and is a major agent associated with outbreaks of gastroenteritis. Conventional molecular genotyping analysis of NoV, used for the identification of transmission routes, relies on standard typing methods (STM) by Sanger-sequencing of only a limited part of the NoV genome, which could lead to wrong conclusions. Here, we combined a NoV capture method with next generation sequencing (NGS), which increased the proportion of norovirus reads by ~40 fold compared to NGS without prior capture. Of 15 NoV samples from 6 single-genotype outbreaks, near full-genome coverage (>90%) was obtained from 9 samples. Fourteen polymerase (RdRp) and 15 capsid (cap) genotypes were identified compared to 12 and 13 for the STM, respectively. Analysis of 9 samples from two mixed-genotype outbreaks identified 6 RdRp and 6 cap genotypes (two at >90% NoV genome coverage) compared to 4 and 2 for the STM, respectively. Furthermore, complete or partial sequences from the P2 hypervariable region were obtained from 7 of 8 outbreaks and a new NoV recombinant was identified. This approach could therefore strengthen outbreak investigations and could be applied to other important viruses in stool samples such as hepatitis A and enterovirus.
Norovirus (NoV) is a positive-sense single-stranded RNA virus in the Caliciviridae family, and at least 40 genotypes divided into seven genogroups have been identified1,2,3. NoV infection is the most widespread cause of non-bacterial gastroenteritis, responsible for up to one fifth of all cases of gastroenteritis globally4. Despite some progress5, 6, no vaccine or therapeutic intervention is available, and interceptive strategies mainly aim to rapidly identify the source of infection, increase hygiene measures, and isolate infected patients7, 8. Conventional molecular analysis of NoV transmission routes relies mainly on standard typing methods (STM) based on Sanger-sequencing of partial RNA-dependent polymerase and capsid genes (RdRp and cap)9, 10. More recently, sequencing of larger parts of the capsid gene containing the hypervariable P2 region has increased the discriminatory power to resolve outbreaks with higher accuracy11, 12. Although new recombinant or dominant NoV strains has routinely been reported13, 14, commonly used gene-specific primers may have limited the ability to rapidly detect emerging strains due to sequence differences in the primer binding regions. In such situations, it is necessary to amplify and sequence these regions using other primers and subsequently to make these sequences publically available to enable other research groups to redesign their primers accordingly. Therefore, the current STM for analysis could lead to incorrect conclusions about possible transmission chains, underestimation of the genetic diversity of NoV, and delay early identification of new emerging strains. Recent methodological approaches including NGS to achieve full norovirus genome coverage have been published. However, these methods mostly rely on genotype specific primers15,16,17 why they, are time consuming and need frequent updating due to the high natural mutation rates of NoV18.
Cultivation of pathogens is commonly used to ensure a pure and high concentration for further investigation and has combined with NGS considerably improved the ability to identify transmission chains and resistance genes for cultivated bacterial infections in particular19, 20. Although human NoV to some degree can replicate in animal models, no cell culture system exists21. Therefore investigation of NoV suspected cases is usually limited to direct analysis of viral RNA in stool samples, in which viral genomes only constitute a minor proportion of the nucleic acids present22,23,24.
Recent studies have addressed several of these problems by using both random sequencing24, 25 strategies and different methods for virus enrichment such as: virion isolation and enzymatic removal of host/bacterial nucleic acids26, capture-based27,28,29 methods or PCR activated cell-sorting methods30. Despite these recent advances, such methods are still time-consuming, laborious and potentially costly and/or rely on approaches that will require extensive periodic updates in primers or probes to reflect the current knowledge on viral diversity.
In this study, we have evaluated an easy-to-use laboratory method that allows for a ~40-fold enrichment of all NoV genotypes in stool samples. Furthermore, we have used bioinformatics approaches to accurately screen for NoV in highly complex samples. Analysis of NoV positive samples from eight foodborne outbreaks yielded sufficient NoV read counts to allow the assembly of several complete or nearly complete genomes for molecular comparisons. Furthermore, this approach allowed for the identification of an additional genotype, missed by STM, as well as discovery of a new recombinant NoV.
Using NGS directly on samples
Despite a large sequencing depth allocated to each sample (1.5 to 5.5 million reads), only a relatively small proportion of the obtained reads were of NoV origin (on average: 0.25%; corresponding to ~700 to ~22,000 reads).
Evaluation of the poly(A)-capture technique
To specifically enrich for NoV RNA and reduce the amount of non-polyadenylated bacterial RNA, a poly(A)-capture method was employed after nucleic acid extraction. To evaluate this enrichment strategy, NoV viral load was measured in 6 GGI and 3 GGII quantitated survey samples were (called QS1 to QS9, See Materials and methods and Table 1) along with five non-quantified survey samples (called S1 to S5, See Table 1 and Materials and Methods). All samples were split after RNA extraction with only one part subjected to poly(A)-capture. SMARTer libraries were constructed from both extracted parts and subjected to MiSeq sequencing simultaneously. The efficiency (Table 1) was evaluated by measuring the proportion of reads mapping to full genome sequences from the common human gut bacterial species Bacterioides uniformis and Ruminococcus bromii L2 + 6331 or from a set of 16 sRNA sequences identified in human microbiome studies32, 33. Poly(A)-capture increased the proportion of obtained NoV reads over the entire range of NoV input RNA copies (Log10 1,89 to 6,82; see Table 1 and Fig. 1), despite some variation for especially samples with low numbers of input NoV RNA copies. While the proportion of bacterial reads was reduced by 0.28 to 0.41 fold, the number of NoV reads increased by on average 45.1 ± 27.77 -fold. Although the average Ct value decreased by 0.96 ± 0.07-fold after poly(A)-capture, the poly(A)-captured NoV was also eluted in only one fifth of the suspension volume used before poly(A)-capture. The average percentages of reads from the non poly(A)-captured survey samples mapping to three approximately equally- sized parts of the NoV reference genome sequences were: 1st part (genome-position: 1–2499): 32.2% (±17.7%), 2nd part (genome-position: 2500–4997): 53.3% (±12.8%) and 3rd part (genome-position: 4998–7496): 14.5% (±8.9%), while the average percentages of NoV reads from the poly(A)-captured survey samples mapping to these regions were: 1st part: 10.1% (±3.2%), 2nd part: 43.5% (±8%) and 3rd part: 46.4% (±8.9%).
Samples from all eight outbreaks were subjected to the poly(A)-capture method and SMARTer library construction. A general linear trend was observed between the Ct values measured after poly(A)-capture and the number of reads obtained (Fig. 2), although a few samples deviated from this trend by containing a higher than expected number of NoV reads per million reads. Although full-genome coverage (>99%) was observed at ~4,800 NoV reads in total, equivalent to an average coverage of ~80 per sample (Fig. 3), sufficient sequence quality along the entire genome was only observed above ~11,000 reads with an average coverage of ~260.
Assigning genotypes to outbreak samples
The first level of sequence comparison in an outbreak is the comparison of genotypes obtained from different persons in the outbreak. Complete NoV genotyping relies on sufficient sequence coverage in two regions: ORF1 (RdBp/pol) and ORF2 (Cap) for complete genotyping. Using the NGS approach, 14 complete and one partial genotype were detected in 15 samples from 6 of the 8 outbreaks (see Table 2) containing a single NoV genotype compared with 10 complete and five partial genotypes detected with the STM approach.
Since it had been demonstrated by Real time PCR and STM that two NoV genogroups and several genotypes were involved in Ob-4 and Ob-6, HMM searches for additional genotypes was performed on de novo assembled contigs (See Materials and Methods). From sample Ob-4-1, 10,765 and from sample Ob-4-2 15,391 de novo assemblies were generated, of which 11 and 107 were identified as norovirus assemblies by the HMM search respectively. The candidate NoV contigs were further investigated by BLASTN and genotyping of the contigs, with subsequent reference based mapping which confirmed the presence of the following genotypes in the two samples: Ob-4-1: GII.Pg_GII.1 and GII.4_Sydney, Ob-4-2: GI.Pb_GI.6 and GII.7P_GII.6 (Table 2). Due to insufficient reads mapping to the GI.Pb_GI.6 reference in sample Ob-4-1, a valid phylogenetic comparison could not be performed, although a BLASTN of the consensus sequence generated from the 22 mapped reads indicated this to be GI.Pb_GI.6 as well. The HMM analysis also detected genotype GII.Pg_GII.1 in the Ob-4-1 sample which was not identified by STM.
From Outbreak 6, the following numbers of HMM hits were obtained out of the total number of de novo assembled contigs: Ob-6-1: 0 of 18,361, Ob-6-2: 2 of 1,928, Ob-6-3: 11 of 18,973, Ob-6-4: 6 of 34,959, Ob-6-5: 5 of 37,507, Ob-6-6: 4 of 14,250 and Ob-6-7: 1 of 43,308. Following the same procedure as described for Ob-4 lead to the identification of GII.P7_GII.6 and GI.P3_GI.3 in all seven samples. The GII.P7_GII.6 genotype was supported by a large number of reads in all samples except Ob-6-3 and could be compared phylogenetically, whereas the GI.P3_GI.3 genotype was only supported by a low number of reads in all samples. This also suggested that the two NoV genotypes were present in all the samples at variable concentrations.
Overall, support for the hypothesis of a common infection source by shared pol and cap genotypes for at least one genotype and in at least two different persons from the outbreak was obtained for seven of eight outbreaks with the NGS method and for five of eight outbreaks with the STM method. Similar genotypes were observed in several of the outbreaks, all of which were found to be different (identities: Ob-1/Ob-6: 84.4%, Ob-2/Ob-3: 99.8%, Ob-4/Ob-6: 84.2%, Ob-4/Ob-7/S-4: 97.6% to 98.8%, S-5/Ob-5: 97.3%).
Phylogenetic analysis of outbreak samples
Phylogenetic analysis was performed for all outbreaks, except for Ob-4 due to the absence of shared well-covered genotype reference sequences. Consensus sequences with the following maximum lengths were generated in CLCbio: Ob-1: 7666 nt, Ob-2: 1828 nt, Ob-3: 1226 nt, Ob-4: not analyzed, Ob-5: 1593 nt, Ob-6:7344 nt, Ob-7: 7697 nt, Ob-8: 6228 nt and used in a phylogenetic comparison. The genome coverage is shown in Fig. 4 for the individual outbreaks. The comparison included either the complete hypervariable P2 region (P2 region is 456 to 483 nt depending on genotype) for Ob-1, Ob-6, Ob-7 and Ob-8, and partial P2 region comparisons for Ob-2 (471 nt), Ob-3 (243 nt) and Ob-5 (291 nt). The phylogenetic analysis revealed that the NoV involved in Ob-3, Ob-5 and Ob-7 were 100% identical (Fig. 4f,h and l), whereas differences in the NoV genomes (Fig. 4b,d,j and o) were observed for: Ob-1 (1 nt difference; two samples had an A-residue at reference sequence position 3321 while three samples had a G-residue), Ob-2 (3 nt differences in the P2 region), Ob-6 (1 nt difference in ORF3 and several differences in the 3′ non-coding A rich part of the genome), Ob-8 (1 nt difference in the P2 region).
Identifying a new recombinant
When reads from the two samples from Ob-8 were mapped to the two reference sequences known to be present from the initial partial genotyping, a mutually exclusive distribution of reads was observed (Fig. 5a and b). In addition, reads that spanned the ORF1/ORF2 junction of an in silico generated reference sequence were observed (Fig. 5c), confirming that both these samples harbored a novel GII.P16_GII.4_Sydney recombinant.
The use of poly(A)-capture significantly enhanced the number of norovirus reads obtained from stool samples, allowing comparisons of full or near full (>85%) genome sequences from 4 outbreaks and partial genome comparisons in 3 outbreaks. In total, 14 complete and one partial genotype were detected in the 15 samples from the 6 outbreaks containing a single NoV genotype compared with 10 complete and five partial genotypes detected with the STM approach. In addition, additional genotypes (partial or complete) were identified with the NGS approach in the two mixed-genotype outbreaks samples (Ob4 and Ob6). STM generated more genotype information than NGS in four cases. In these cases, either none or a low number of NoV reads of mapped to the specific genotype, however none of the reads mapped to the ORF1 (pol) or ORF2 (cap) genotyping-regions. This showed that although the NGS method overall improved the genotyping results, some samples might be challenging due to low amounts of available virus RNA in combination with the random distribution of reads obtained. One way to reduce this problem would be to allocate a larger sequencing depth for especially samples with low amounts of virus.
The NGS derived consensus sequences used for phylogenetic comparison ranged from 1226 to 7692 nt (average 4800 nt) and included either the complete or a substantial proportion of the hypervariable P2 region. In comparison, STM only covers ~9% of the genome and does not include the P2 region. Therefore, even in the three outbreaks, in which only partial genomes were recovered, the data were found to significantly improve the molecular resolution of outbreaks.
Interestingly, minor nucleotide variations between sequences from different samples from three of the outbreaks were observed. Two of these differences were mapped to the P2 region, known to be highly variable12, 34,35,36 and a single nucleotide difference was observed between two groups of samples from a single epidemiologically linked outbreak (Ob-2). This challenge the 100% identity-paradigm used in general NoV outbreak investigations12, 36 that normally distinguishes only between identical and non-identical strains. Other studies have also questioned if these strict criteria should be maintained37, when comparing larger parts of the NoV genome.
NoV bioaccumulation in or adhesion to food items such as oysters and lettuce generates complex outbreak profiles including several genotypes38, 39, which require separate RT-PCR amplification steps if STM are used39. In this study, six NoV genotypes were identified in samples from two mixed outbreaks, three of which was supported by high genomic coverages (66% to 99% of the entire NoV genome). HMM improved the detection of genotypes by identifying a genotype (GII.Pg_GII.1) missed using STM. Although phylogenetic comparisons could not be performed for all genotypes due to varied sequence coverage of some genotypes in the samples, greater sequencing depth may circumvent this problem in future analysis. Interestingly, a mutually exclusive presence of genotypes was observed for three of the four genotypes identified in the two samples from Ob-4 and different relative abundances of the two genotypes found in Ob-6 was found for sample Ob-6-3 compared with the other samples. This could indicate differences in host exposure and/or susceptibility to different NoV genotypes in complex outbreaks.
A near-complete genome sequence of (>90%) a new GII.P16_GII.4_Sydney recombinant NoV was directly confirmed from the NGS data by using reads spanning the ORF1/ORF2 junction of the two different genotypes, showing that NGS can be used to distinguish between co-infection with different genotypes and new emergent recombinants.
This study was performed retrospectively on samples stored at −20 °C and previously analyzed by STM where samples had all been freeze-thawed at least twice, which may have resulted in some degree of degradation of the NoV. Five samples were excluded after poly(A)-capture, as a large increase (>5) in Ct values were observed, indicating fragmentation of NoV RNA. Therefore, for future applications of the present method, it will be of great importance to retain NoV RNA integrity until library preparation.
We have introduced a novel NoV enrichment NGS-based approach to investigate foodborne outbreaks without discriminating between genotypes. This method can be used directly to enrich other clinically important viruses in stool such as enteroviruses, or other positive-sense RNA viruses with a polyadenylated 3′ tail. Although the poly (A)-capture lead to a 3′ bias in sequencing depth, it allowed for a significant enrichment of NoV reads obtained from the samples. Future studies are required to test the efficiency of enrichment from other specimen types. Although the likelihood of obtaining complete NoV genomes is strongly dependent on NoV concentration in the sample, deeper sequencing would likely allow for retrieval of more NoV reads even in more scarce NoV samples. With common access to benchtop sequencers, we anticipate that NGS will soon become a definitive, non-discriminatory tool for viral infection control and serve to monitor both the evolution and spread of genotypes and enhance viral outbreak investigations.
Materials and Methods
According to the “Danish Act on Research Ethics Review of Health Research Projects” this study does not require approval by the ethics committees, as it is considered a quality development/control project and does not analyze human sequences. This was confirmed by the Committees on Health Research Ethics for the Capital Region of Denmark in a specific waiver of approval (H-16019654).
Twenty-four NoV positive samples from eight different foodborne outbreaks (termed Ob1 to Ob8) were analyzed (Table 3). Five survey samples (termed S1 to S5) and 9 quantitative survey samples (termed QS1 to QS9) were analyzed with or without poly(A)-capture to assess the efficiency of this method. Five samples where Ct values increased >5 after poly(A)-capture vs. before were excluded from NGS analysis as they were considered to be too degraded.
Extraction of nucleic acids, poly(A) capture, real-time RT-PCR and norovirus typing
Nucleic acids were extracted from 10% stool suspensions (kept at −20 °C) using the MagNA Pure LC (Roche Diagnostics); poly(A)-capture was performed using a Dynabeads mRNA Purification Kit (Ambion Cat. No. 61006) according to the manufacturer’s instructions with modifications to use 100 µL input material and 26 µL Dynabeads. The concentration of nucleic acids was measured using 1 µL extract on a NanoDrop 1000 Spectrophotometer (NanoDrop Technologies). The presence of NoV Genogroup I and II was assessed using real-time multiplex PCR40 and genotyping was performed as described previously40, 41.
Quantification of NoV RNA
A quantitative NoV GGI standard was obtained from ATCC (Quantitative Synthetic Norovirus G1 (I) RNA (ATCC® VR3234SD™; specification range (log10) 5–6 RNA copies/µL, of which the lower end range was used for the calculations. In addition, a previously published NoV GGII standard42 was obtained from collaborators at the Danish Technical University at a confirmed concentration of 5.19 (log10) ± 4.80 (log10) RNA copies/µL. Both standards were diluted in a fivefold 1:10 dilution series and analyzed in triplicates in the real time multiplex PCR (described above) alongside 9 NoV Quantiative Survey samples (QS1 to QS9; all both with and without poly(A)-capture). Analysis of real time data was performed in MxPro Mx3005 P v4.10, resulting in the following standard curves for GGI and GGII respectively: Y = −3.047xLOG(X) + 41.74; R2: 0.994 and Y = −3.090xLOG(X) + 41.18; R2: 0.971. Calculations of the amount of NoV genomes used as input in the extraction/capture and NGS analyses were also performed in MxPro.
Preparation of samples for Illumina MiSeq sequencing
Single-indexed cDNA libraries were generated using the SMARTer Stranded RNA-Seq Kit (Clontech Inc.) in accordance with the manufacturer’s instructions. Fluorescent measurement of DNA concentrations in each library was performed using Qubit dsDNA BR and ssDNA assay kit (Thermo Fischer Scientific).
Quality trimming and filtering
Sequences were imported into CLCbio’s Genomics Workbench (v. 8.5) with the removal of failed reads. Quality trimming within the workbench was performed using both a modified Mott trimming algorithm implemented (limit = 0.5) and by trimming reads containing more than two ambiguous nucleotides. Human sequence reads were removed by alignment to the homo sapiens hg19 reference genome (similarity fraction = 0.8).
Reference based mapping
Quality-trimmed reads were mapped to reference sequences using the Mapping tool in CLCbio’s Genomics Workbench with default settings. NoV reads from all samples loaded on the same MiSeq run were mapped to all expected reference sequences. The following reference sequences were used for mapping of Miseq reads: JQ388274 (GI.Pb_GI.6), JX459908 (GII.Pe_GII.4_Sydney), KJ685411 (GII.P4_New_Orleans_GII.4_Sydney), DQ456824 (GII.P2_GII.2), EU921389 (GII.P21_GII.3), HCU07611 (GII.Pg_GII.1), JQ388274 (GI.Pb_GI.6), KJ196292.1 (GI.P3_GI.3), KM198484 (GII.P21_GII.3), KM198534 (GII.P7_GII.6) and the following reference sequences were used in the phylogenetic analysis: AB187514.1 (GI.P3_GI.3), KM198484 (GII.P21_GII.3), KM198500.1 (GII.P21_GII.3), EU921389 (GII.P21_GII.3), KJ685411 (GII.P4_New_Orleans_2009_GII.4_Sydney_2012), JX459908 (GII.Pe_GII.4_Sydney), LN854568 (GII.P7_GII.6), JQ388274 (GI.Pb_GI.6), LN854564.1 (GI.Pb_GI.6), KM036380 (GII.P16_GII.13), JX459908 (GII.Pe_GII.4_Sydney). In cases where no appropriate full-length reference sequence was available for mapping of reads, a de novo assembled sequence (see below) or a consensus sequence generated from the most similar full-length reference sequence available was used instead.
De novo assembly
Reads from three outbreaks (Ob-1, Ob-6 and Ob-8) were mapped to de novo assembled reference sequences, as no well-matching and/or full-length reference sequences were identified in public databases. De novo assembled reads were generated using CLCbio’s assembler at default settings and with the fast mapping mode and a minimum contig length of 200 bases.
Generation of consensus sequences
Consensus sequences were generated from mapped reads using the majority vote option and inserting N in places of ambiguity or missing data. Depth thresholds at >0, >2, or ≥5 reads were evaluated for sequence quality, and only high quality consensus sequences (average quality score ≥30, as calculated in CLCbio) were used for sequence comparison and phylogenetic analysis. The P2 region on the consensus sequence was defined as previously described11.
Confirmation of genotypes
Genotypes from all mappings were confirmed by submission of consensus sequences to analysis at the Dutch National Institute for Public Health and the Environment (RIVM)’s NoV typing tool (http://www.rivm.nl/mpf/norovirus/typingtool) and/or BLASTN followed by genotyping of the best hits at RIVM.
Hidden Markov model (HMM) building and searches
In total, 858 sequences matching the terms “norovirus” and “complete” at NCBI (accessed on April 25th, 2015) were downloaded and genotypes confirmed using the Dutch National Institute for Public Health and the Environment (RIVM)’s NoV typing tool (http://www.rivm.nl/mpf/norovirus/typingtool). From this set, 112 representative sequences were selected for hidden Markov model (HMM) building. Sequences were aligned in MAFFT v.7 (http://www.ebi.ac.uk/Tools/msa/mafft/) and a NoV HMM was built using HMMer 3.043. HMM searches were performed among de novo assembled reads at default settings and identified assemblies evaluated by BLASTN and NoV typing at RIVM.
Multiple alignment and phylogenetic analyses
Consensus and reference sequences were aligned in MAFFT and phylogenetic analyses were performed by maximum-likelihood with a generalized time-reversible (GTR) substitution model and a 1000 bootstrap replicates in MEGA 6.0644.
Kroneman, A. et al. Proposal for a unified norovirus nomenclature and genotyping. Arch. Virol. 158, 2059–2068, doi:10.1007/s00705-013-1708-5 (2013).
Mesquita, J. R., Barclay, L., Nascimento, M. S. & Vinje, J. Novel norovirus in dogs with diarrhea. Emerg. Infect. Dis. 16, 980–982, doi:10.3201/eid1606.091861 (2010).
Vinje, J. Advances in laboratory methods for detection and typing of norovirus. J. Clin. Microbiol. 53, 373–381, doi:10.1128/JCM.01535-14 (2015).
Ahmed, S. M. et al. Global prevalence of norovirus in cases of gastroenteritis: a systematic review and meta-analysis. Lancet Infect. Dis. 14, 725–730, doi:10.1016/S1473-3099(14)70767-4 (2014).
Debbink, K., Lindesmith, L. C. & Baric, R. S. The state of norovirus vaccines. Clin. Infect. Dis. 58, 1746–1752, doi:10.1093/cid/ciu120 (2014).
Kim, Y., Galasiti Kankanamalage, A. C., Chang, K. O. & Groutas, W. C. Recent Advances in the Discovery of Norovirus Therapeutics. J. Med. Chem. 58, 9438–9450, doi:10.1021/acs.jmedchem.5b00762 (2015).
Greig, J. D. & Lee, M. B. A review of nosocomial norovirus outbreaks: infection control interventions found effective. Epidemiol. Infect. 140, 1151–1160 (2012).
Vinnard, C., Lee, I. & Linkin, D. Successful control of a norovirus outbreak among attendees of a hospital teaching conference. Am. J. Infect. Control 40, 73–74, doi:10.1016/j.ajic.2011.03.033 (2012).
Kojima, S. et al. Genogroup-specific PCR primers for detection of Norwalk-like viruses. J. Virol. Methods 100, 107–114, doi:10.1016/S0166-0934(01)00404-9 (2002).
Vennema, H., de, B. E. & Koopmans, M. Rational optimization of generic primers used for Norwalk-like virus detection by reverse transcriptase polymerase chain reaction. J. Clin. Virol. 25, 233–235, doi:10.1016/S1386-6532(02)00126-9 (2002).
Holzknecht, B. J. et al. Sequence analysis of the capsid gene during a genotype II.4 dominated norovirus season in one university hospital: identification of possible transmission routes. PLoS. One. 10, e0115331, doi:10.1371/journal.pone.0115331 (2015).
Xerry, J., Gallimore, C. I., Iturriza-Gomara, M., Allen, D. J. & Gray, J. J. Transmission events within outbreaks of gastroenteritis determined through analysis of nucleotide sequences of the P2 domain of genogroup II noroviruses. J. Clin. Microbiol. 46, 947–953, doi:10.1128/JCM.02240-07 (2008).
de, G.M. et al. Emergence of a novel GII.17 norovirus - End of the GII.4 era? Euro. Surveill 20 (2015).
van, B. J. et al. Indications for worldwide increased norovirus activity associated with emergence of a new variant of genotype II.4, late 2012. Euro. Surveill 18, 8–9 (2013).
Cotten, M. et al. Deep sequencing of norovirus genomes defines evolutionary patterns in an urban tropical setting. J. Virol. 88, 11056–11069, doi:10.1128/JVI.01333-14 (2014).
Kundu, S. et al. Next-generation whole genome sequencing identifies the direction of norovirus transmission in linked patients. Clin. Infect. Dis. 57, 407–414, doi:10.1093/cid/cit287 (2013).
Park, J. S. et al. Complete nucleotide sequence analysis of the norovirus GII.4 Sydney variant in South Korea. Biomed. Res. Int 2015, 374637, doi:10.1155/2015/374637 (2015).
Victoria, M. et al. Bayesian coalescent inference reveals high evolutionary rates and expansion of Norovirus populations. Infect. Genet. Evol. 9, 927–932, doi:10.1016/j.meegid.2009.06.014 (2009).
Didelot, X., Bowden, R., Wilson, D. J., Peto, T. E. & Crook, D. W. Transforming clinical microbiology with bacterial genome sequencing. Nat. Rev. Genet. 13, 601–612, doi:10.1038/nrg3226 (2012).
Long, S. W. et al. A genomic day in the life of a clinical microbiology laboratory. J. Clin. Microbiol. 51, 1272–1277, doi:10.1128/JCM.03237-12 (2013).
Rocha-Pereira, J., Neyts, J. & Jochmans, D. Norovirus: targets and tools in antiviral drug discovery. Biochem. Pharmacol. 91, 1–11, doi:10.1016/j.bcp.2014.05.021 (2014).
Shimada, S. et al. Use of S1 nuclease in deep sequencing for detection of double-stranded RNA viruses. J. Vet. Med. Sci. 77, 1163–1166, doi:10.1292/jvms.14-0607 (2015).
Nakamura, S. et al. Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach. PLoS. One. 4, e4219, doi:10.1371/journal.pone.0004219 (2009).
Wylie, K. M. et al. Genome sequence of enterovirus D68 from St. Louis, Missouri, USA. Emerg. Infect. Dis. 21, 184–186, doi:10.3201/eid2101.141605 (2015).
Batty, E. M. et al. A modified RNA-Seq approach for whole genome sequencing of RNA viruses from faecal and blood samples. PLoS. One. 8, e66129, doi:10.1371/journal.pone.0066129 (2013).
Hall, R. J. et al. Evaluation of rapid and simple techniques for the enrichment of viruses prior to metagenomic virus discovery. J. Virol. Methods 195, 194–204, doi:10.1016/j.jviromet.2013.08.035 (2014).
Wylie, T. N., Wylie, K. M., Herter, B. N. & Storch, G. A. Enhanced virome sequencing through solution-based capture enrichment. Genome Res (2015).
Brown, J. R. et al. Norovirus Whole-Genome Sequencing by SureSelect Target Enrichment: a Robust and Sensitive Method. J. Clin. Microbiol. 54, 2530–2537, doi:10.1128/JCM.01052-16 (2016).
Vinner, L. et al. Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing. Sci. Rep 5, 13201, doi:10.1038/srep13201 (2015).
Lim, S. W., Lance, S. T., Stedman, K. M. & Abate, A. R. PCR-activated cell sorting as a general, cultivation-free method for high-throughput identification and enrichment of virus hosts. J. Virol. Methods 242, 14–21, doi:10.1016/j.jviromet.2016.12.009 (2016).
Qin, J. et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464, 59–65, doi:10.1038/nature08821 (2010).
A framework for human microbiome research. A framework for human microbiome research. Nature 486, 215–221, doi:10.1038/nature11209 (2012).
Structure, function and diversity of the healthy human microbiome. Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214, doi:10.1038/nature11234 (2012).
Allen, D. J., Gray, J. J., Gallimore, C. I., Xerry, J. & Iturriza-Gomara, M. Analysis of amino acid variation in the P2 domain of the GII-4 norovirus VP1 protein reveals putative variant-specific epitopes. PLoS. One. 3, e1485, doi:10.1371/journal.pone.0001485 (2008).
Sukhrie, F. H. et al. P2 domain profiles and shedding dynamics in prospectively monitored norovirus outbreaks. J. Clin. Virol. 56, 286–292, doi:10.1016/j.jcv.2012.12.006 (2013).
Xerry, J., Gallimore, C. I., Iturriza-Gomara, M. & Gray, J. J. Tracking the transmission routes of genogroup II noroviruses in suspected food-borne or environmental outbreaks of gastroenteritis through sequence analysis of the P2 domain. J. Med. Virol. 81, 1298–1304, doi:10.1002/jmv.v81:7 (2009).
Sukhrie, F. H. et al. Using molecular epidemiology to trace transmission of nosocomial norovirus infection. J. Clin. Microbiol. 49, 602–606, doi:10.1128/JCM.01443-10 (2011).
Gandhi, K. M., Mandrell, R. E. & Tian, P. Binding of virus-like particles of Norwalk virus to romaine lettuce veins. Appl. Environ. Microbiol. 76, 7997–8003, doi:10.1128/AEM.01566-10 (2010).
Symes, S. J., Gunesekere, I. C., Marshall, J. A. & Wright, P. J. Norovirus mixed infection in an oyster-associated outbreak: an opportunity for recombination. Arch. Virol. 152, 1075–1086, doi:10.1007/s00705-007-0938-9 (2007).
van Alphen, L. B. et al. The application of new molecular methods in the investigation of a waterborne outbreak of norovirus in Denmark, 2012. PLoS. One. 9, e105053, doi:10.1371/journal.pone.0105053 (2014).
Franck, K. T., Fonager, J., Ersboll, A. K. & Bottiger, B. Norovirus epidemiology in community and health care settings and association with patient age, Denmark. Emerg. Infect. Dis. 20, 1123–1131, doi:10.3201/eid2007.130781 (2014).
Gentry, J., Vinje, J. & Lipp, E. K. A rapid and efficient method for quantitation of genogroups I and II norovirus from oysters and application in other complex environmental samples. J. Virol. Methods 156, 59–65, doi:10.1016/j.jviromet.2008.11.001 (2009).
Eddy, S. R. Accelerated Profile HMM Searches. PLoS. Comput. Biol. 7, e1002195, doi:10.1371/journal.pcbi.1002195 (2011).
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729, doi:10.1093/molbev/mst197 (2013).
We thank the Danish Veterinary and Food administration for obtaining and providing samples from outbreaks used in in this study and to Dr. Anna Charlotte Schultz, Danish Technical University for providing us with the NoV GGII quantitative standard.
The authors declare that they have no competing interests.
Accession codes: All mapped virus reads were submitted to the European Nucleotide Archive (ENA) under the study accession number PRJEB14339 (http://www.ebi.ac.uk/ena/data/view/PRJEB14339).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Fonager, J., Stegger, M., Rasmussen, L.D. et al. A universal primer-independent next-generation sequencing approach for investigations of norovirus outbreaks and novel variants. Sci Rep 7, 813 (2017). https://doi.org/10.1038/s41598-017-00926-x
This article is cited by
Detection and Sequencing of Multiple Human Norovirus Genotypes from Imported Frozen Raspberries Linked to Outbreaks in the Province of Quebec, Canada, in 2017
Food and Environmental Virology (2022)
One Health Outlook (2020)
Scientific Reports (2020)
Genetic characterization of norovirus GII.4 variants circulating in Canada using a metagenomic technique
BMC Infectious Diseases (2018)