Unbiased Strain-Typing of Arbovirus Directly from Mosquitoes Using Nanopore Sequencing: A Field-forward Biosurveillance Protocol

Russell, Joseph A.; Campos, Brittany; Stone, Jennifer; Blosser, Erik M.; Burkett-Cadena, Nathan; Jacobs, Jonathan L.

doi:10.1038/s41598-018-23641-7

Download PDF

Article
Open access
Published: 03 April 2018

Unbiased Strain-Typing of Arbovirus Directly from Mosquitoes Using Nanopore Sequencing: A Field-forward Biosurveillance Protocol

Scientific Reports volume 8, Article number: 5417 (2018) Cite this article

8198 Accesses
37 Citations
37 Altmetric
Metrics details

Subjects

Abstract

The future of infectious disease surveillance and outbreak response is trending towards smaller hand-held solutions for point-of-need pathogen detection. Here, samples of Culex cedecei mosquitoes collected in Southern Florida, USA were tested for Venezuelan Equine Encephalitis Virus (VEEV), a previously-weaponized arthropod-borne RNA-virus capable of causing acute and fatal encephalitis in animal and human hosts. A single 20-mosquito pool tested positive for VEEV by quantitative reverse transcription polymerase chain reaction (RT-qPCR) on the Biomeme two3. The virus-positive sample was subjected to unbiased metatranscriptome sequencing on the Oxford Nanopore MinION and shown to contain Everglades Virus (EVEV), an alphavirus in the VEEV serocomplex. Our results demonstrate, for the first time, the use of unbiased sequence-based detection and subtyping of a high-consequence biothreat pathogen directly from an environmental sample using field-forward protocols. The development and validation of methods designed for field-based diagnostic metagenomics and pathogen discovery, such as those suitable for use in mobile “pocket laboratories”, will address a growing demand for public health teams to carry out their mission where it is most urgent: at the point-of-need.

A cost-effective RNA extraction and RT-qPCR approach to detect California serogroup viruses from pooled mosquito samples

Article Open access 29 January 2024

Sensitivity and specificity of metatranscriptomics as an arbovirus surveillance tool

Article Open access 18 December 2019

Small RNA sequencing of field Culex mosquitoes identifies patterns of viral infection and the mosquito immune response

Article Open access 30 June 2023

Introduction

With increasing accessibility of metagenomics- and metatranscriptomics-based analyses (meta-omics), clinicians and researchers have begun to embrace the technology as a means of detection for unknown etiological agents of disease^{1,2,3,4,5,6,7,8,9}. In addition, metagenomics has an emerging role in environmental biosurveillance across multiple mission contexts including bioterrorism defense¹⁰, epidemiological public health^11,12,13, water-quality monitoring¹⁴, and agriculture/food safety^15,16,17. In comparison to PCR-based amplicon assays, metagenomics has the added value of not requiring a priori knowledge of a target (i.e., unbiased), delivers functional genomic information of constituent organisms in a sample (in addition to detection), and provides an estimate of their relative abundance. However, the benefits of this information are inextricably dependent on the quality of sample extraction and sequencing reads, the depth of sequencing and titer-level of the etiological agent, the comprehensiveness of reference databases, and the power and suitability of back-end computational equipment and bioinformatics analysis. Additionally, metagenomic sequencing on second-generation sequencing technology typically requires more than a 24 hour time investment on non-portable machines. Consequently, field-forward biosurveillance has been largely limited to PCR-based assays^18,19,20 or antibody hybridization technologies^21,22,23,24.

Recent development in nanopore technology, pioneered by Oxford Nanopore Technologies, Inc. (ONT) with their MinION sequencing device, has opened the possibility of bringing the power of metagenomics to virtually any environment in the world. The MinION is a pocket-sized, USB-powered nanopore sequencing platform, weighing less than 100 grams, yet capable of up to 20 GB of ultra-long read (>100 kb) sequence data^25,26. The device’s ultra-portability has been leveraged to perform in-field metagenomic characterization of environments ranging from the deep subsurface²⁷ to the Antarctic Dry Valleys²⁸. But perhaps the most compelling application of the MinION platform is the improvement of pathogen surveillance and diagnostics, and subsequently, health outcomes, for the world’s most disadvantaged populations. The small footprint of the MinION, and other hand-held molecular biology hardware, is particularly important for austere settings with limited access to the critical infrastructure often required for traditional diagnostics and biosurveillance assays. Routine, point-of-sampling detection, phylogeny, and genomic characterization of microbial and viral pathogens from clinical and environmental samples stands to fundamentally change public health practices^29,30,31,32. Critically, nanopore sequencing has the added benefit of real-time analysis³³, allowing sample-to-answer intervals that match clinically relevant timeframes. Recent work has demonstrated the efficacy of nanopore sequencing in RNA-based metatranscriptomic detection of viral pathogens from human blood samples^34,35. More recently, single-nucleotide polymorphism (SNP) detection was demonstrated on the MinION, using PCR amplicons of short tandem repeats, for the purposes of forensic genotyping³⁶. During the Zika Virus (ZIKV) outbreak of 2015–2016 in Brazil, several groups used nanopore sequencing of RT-qPCR amplicons from mosquito samples to track incidence of ZIKV infection and study ZIKV vector dynamics^37,38. And recently, an Australian group demonstrated the use of nanopore sequencing for whole genome sequencing of Ross River Virus, directly from a single mosquito under laboratory control conditions³⁹. However, to date, there have been no reports of unbiased (non-PCR) strain-level detection of specific organisms-of-interest directly from environmental sample matrices (e.g., non-clinical, non-sterile, non-laboratory derived) using nanopore sequencing. This is likely due to the lower sequencing depth of nanopore data relative to second-generation sequencing machines, and subsequent detection of predominantly host genomic material. Overcoming this challenge will enable genome-based biosurveillance without the constraint of PCR primer design and optimization. This would be particularly useful for monitoring arbovirus and other viral hemorrhagic fever (VHF) vectors in hot-spot regions throughout the world where frequent epizootic events threaten the health of human populations. Often, the pathogens responsible for these events are RNA viruses with small genomes and high mutation rates, rendering the maintenance of high-fidelity primer sets an ongoing challenge.

An example of such a pathogen can be found in the Americas. Venezuelan Equine Encephalitis Virus (VEEV) is a positive-sense single-stranded RNA virus with an approximately 11.4 kilobase (KB) genome. An important human and equine pathogen that has previously been weaponized, VEEV is categorized as an overlap Select Agent by the U.S. government due to its pathogenicity to both humans and livestock. VEEV is responsible for the most persistent recurrent outbreaks of New World alphaviruses in the Togaviridae family⁴⁰. In humans, VEEV causes a non-specific febrile illness, with onset of symptoms (fever/chills, malaise, tachycardia) after a 2 to 5-day incubation period. More severe cases (<1% in humans) will result in encephalitis, and eventually, death 5 to 10 days after infection⁴¹. It has been determined that some enzootic equine-avirulent VEEV strains can alter their serotype, and range of both mosquito vector and vertebrate host, through mutations in the genes encoding the E2 envelope glycoprotein⁴². Adaptation to equines results in extremely high viremia (>10⁷ PFU/ml), leading to a greater chance of human disease, and highlighting the role of genome-based strain tracking for public health purposes. An enzootic, sylvatic strain of VEEV (subtype II) circulates in and around the Everglades region of Southern Florida. Commonly known as Everglades virus (EVEV), this VEEV subtype is exclusively transmitted by the mosquito species Culex (Melanoconion) cedecei, with cotton rats and cotton mice as its primary vertebrate host^40,43,44. Surveys in the 1960’s and 1970’s indicated high seroprevalence of EVEV antibodies in humans residing in Southern Florida (>50% amongst Seminole Native Americans living north of Everglades National Park)^45,46,47, and it has been suggested that EVEV may be an important, unrecognized cause of human illness in the region⁴³.

In this study, we successfully demonstrate field-ready protocols for sample collection, RNA extraction, reverse transcription quantitative PCR amplification, eukaryote host genome depletion, and nanopore sequencing of a mosquito sample metatranscriptome for the purposes of arbovirus biosurveillance (Fig. 1). We report the first use of nanopore sequencing to detect, and strain-type, an arbovirus directly from field-trapped mosquitoes using a metatranscriptome approach. The EVEV-positive sample was processed using current “gold standard” platforms (e.g., CFX-96, Illumina MiSeq) to benchmark differences in results with more conventional methods. This work demonstrates the practical utility of hand-held thermocyclers and nanopore sequencing devices for unbiased strain-level detection of arboviruses from complex, environmental sample matrices suitable for use in field-based “pocket laboratories”⁴⁸.

Results

RT-QPCR Arbovirus Surveillance

A single mosquito pool (Sample 4.1) tested positive for VEEV on both in the field with the Biomeme two3 (C_t 33.92) and later again on CFX96 system (C_t 30.63). The results are shown in Fig. 2. The Biomeme two3 device proved to be an effective, ultra-portable platform for initial triaging of mosquito samples in the field. While it could benefit from a higher throughput capacity, its small size and intuitive user interface render it a very capable field-forward molecular biosurveillance tool. It can also perform as a field-able heat-block and thermocycler for the steps in nanopore library generation that require such items.

Nanopore Sequencing (REPLI-g Single Cell WTA)

426,580 reads were successfully basecalled for the REPLI-g processed Sample 4.1; the average read length was 1,403 bp and the maximum read length was 21,258 bp. 106,040 reads were successfully basecalled for the REPLI-g processed Sample 1.1; the average read length was 2,038 bp and the maximum read length was 61,951 bp (Table 1). Detection of VEEV in 4.1 varied across several metagenomic taxonomy callers (Kraken, Kaiju, Centrifuge) that assign read-derived kmers to comprehensive genome databases (e.g., RefSeq). BWA-MEM, a full-length read-mapping alignment tool, also varied in reported VEEV signal.

Table 1 Sequencing library information and VEEV/EVEV detection information across various analytical tools from both virus-positive (4.1) and virus-negative (1.1) samples. Numbers in the rows corresponding to taxonomic analysis tools indicate the number of VEEV/EVEV reads detected by that tool from the particular dataset. Asterisks (*) indicate analysis against a curated database of 144 VEEV and EVEV genomes, rather than the full RefSeq-sized database used by kmer tools (Kraken, Kaiju, Centrifuge).

Full size table

Kraken assigned a single nanopore read from sample 4.1 to VEEV, Centrifuge assigned 2 reads, and Kaiju assigned up to 10 reads. Kraken and Centrifuge offer less flexibility in parameter adjustment/loosening and were run with defaults as they were deemed acceptable for nanopore classification (i.e., Kraken’s–min-hits and Centrifuge’s-min-hitlen and-min-totallen). Kaiju allows greater flexibility in parameter adjustment. In our Kaiju submission script, we leveraged the ‘greedy mode’ and set the number of allowed mismatches to 10. We also lowered the minimum match score to 35 (from a default of 65). Running Kaiju with default parameters detected 9 VEEV reads in the 4.1 nanopore data. Loosening Kaiju’s parameters further than described above did not yield more than 10 VEEV reads. Of the direct read-mapping tools, BWA-MEM (with ‘-x ont2d’ flag passed) identified 33 VEEV reads from 4.1 nanopore data (see Supplementary Material). With default settings, BWA-MEM identified 27 VEEV reads in sample 4.1. The remaining tools tested associated no reads with VEEV in sample 1.1 (Table 1).

For each read that was mapped to the VEEV database by BWA-MEM, the highest quality alignment was overwhelmingly one of two strains; EVG3-95 (KR260737) and Fe3-7c (AF075251). Both strains are the only Everglades Virus strains in the database of 144 VEEV genomes. The 33 reads aligned to VEEV by BWA-MEM were associated with 3 strains total; 19 reads to EVG3-95 (KR260737), 13 reads to Fe3-7c (AF075251), and 1 read to AG80-663 (AF075258, isolated in Argentina 1998). The 19 EVG3-95 reads covered 16% of the reference genome. The 13 Fe3-7c reads covered 10% of the reference genome (Table 2).

Table 2 Read mapping statistics for Illumina (A) and nanopore libraries prepared with REPLI-g (B) and Sigma (C) WTA protocols. Reads were mapped using BWA-MEM with default settings (Illumina) or nanopore-specific (-x ont2d) settings. Columns are, respectively, NCBI accession numbers for each VEEV reference genome shown; total number of reads mapped to each reference; number of non-specific matches, number of perfect matches, average depth of coverage of non-zero coverage regions, percentage of reference genome covered by at least 1 read; number of regions in the reference with zero coverage; and total length of zero coverage regions.

Full size table

All REPLI-g nanopore reads mapping to Everglades Virus strain EVG3–95 via BWA-MEM aligned to the final ~4,000 basepairs of the 3′-region of the genome (Fig. 3). This region encodes a sub-genomic 26 S rRNA that is translated into a structural polyprotein which undergoes proteolytic cleaving to generate the viral capsid and the E2 and E1 envelope glycoproteins⁴⁹. A particularly high abundance of Illumina reads mapping to this region, and exclusive mapping of REPLI-g reads, is a likely indicator of an actively replicating viral infection since the 26 S rRNA can only be transcribed from a full-length, negative sense RNA intermediate that itself can only be produced from the nsP1/nsP4 enzyme complex required for replication⁵⁰. RNA sequencing studies have recently shown that this region of the VEEV genome is also transcribed at significantly higher levels relative to the full length genomic RNA at the initial stages of infection. Thus, one can expect that a sample sequenced at this stage would have a high abundance of reads recruited to the 3′-region of the genome. This is observed in the alignment dynamics of both Illumina and REPLI-g nanopore reads. However, it should also be noted that the majority of VEEV-aligning REPLI-g generated nanopore reads were chimeric in nature. This can be visualized in the shade of green of REPLI-g nanopore reads aligning to Everglades Virus strain EVG3-95 (Fig. 3). Darker green regions align to the reference, whereas lighter green regions do not.

Nanopore Sequencing (Sigma WTA2)

212,192 reads were successfully basecalled for the Sigma WTA2 processed Sample 4.1; the average read length was 957 bp and the maximum read length was 13,207 bp. 71,355 reads were successfully basecalled for the REPLI-g processed Sample 1.1; the average read length was 448 bp and the maximum read length was 60,895 bp (Table 1). The same taxonomy-calling tools tested on the REPLI-g nanopore data were also tested on the Sigma WTA2 nanopore data, using the same settings. In Sample 4.1; Kraken detected 2 VEEV reads, Kaiju detected 17 VEEV reads, Centrifuge detected 2 VEEV reads, and BWA-MEM detected 21 VEEV reads with default settings and 75 with the ‘-x ont2d’ flag passed. In Sample 1.1; Kraken detected 0 VEEV reads, Kaiju detected 1 VEEV read, Centrifuge detected 0 VEEV reads, and BWA-MEM detected 6 VEEV reads with default settings and 21 with the ‘-x ont2d’ flag passed (Table 1). It is presumed that the higher incidence in VEEV reads detected in Sample 1.1 from Sigma WTA2 generated nanopore data is due to carry-over from insufficient washing and re-use of the flow cell after sequencing Sample 4.1. During REPLI-g testing, a fresh flow cell was used for each sample.

In contrast to the REPLI-g generated nanopore reads, Sigma WTA2 reads aligned to all coding regions of the EVG3–95 genome. Additionally, Sigma WTA2 generated nanopore reads showed lower rates of chimeric reads (Fig. 3). This may be primarily due to the lack of the ligation step in the Sigma WTA2 protocol. However, various other key differences between the kits (e.g., SensiPhi vs. WTA2 polymerase activity, sequence composition of universal primers, etc.) are likely to contribute to observed differences in alignment dynamics for VEEV-associated reads. The 74 Sigma reads aligned to VEEV by BWA-MEM were associated with 3 strains; 45 reads to KR260736 (VEEV strain COAN5506, a 1967 equine isolate from Colombia), 25 reads to KR260737, and 4 reads to AF075251. While the numerical majority of Sigma WTA2 reads aligned to the Colombian strain, the coverage of this genome was much lower (2%) than that of the Everglades Virus strains (KR260737-55%, AF075251-11%) (Table 2). The total length of strain COAN5506 with zero read coverage is 11,276 basepairs out of a 11,495 bp genome, indicating stacking of approximately 219 bp reads at one location (Table 2).

Illumina Sequencing

6,429,832 and 4,876,006 paired-end (2 × 151 bp) reads respectively were generated for Sample 4.1 and Sample 1.1 using an Illumina MiSeq. The average quality-trimmed read length was 143 bp (Table 1). The taxonomy-calling tools used on the nanopore datasets were also used on the Illumina data, however, default settings of each tool were used rather than nanopore-specific parameters (See Methods). Kraken identified 796 VEEV reads, Kaiju identified 2,420, Centrifuge identified 1,042, default BWA-MEM identified 5,269, and CLC identified 12,680 VEEV reads under default settings (Table 1). The highest represented VEEV strain in the Illumina data were the two Everglades virus (EVEV) strains; EVG3-95 (KR260737), followed by Fe3-7c (AF075251). These two strains accounted for 99.7% of all VEEV-associated reads as mapped by BWA-MEM (Table 2). Complete family-level taxonomy classification of reads from both samples, as done by Kraken, is provided in the Supplemental Materials.

We observed a notable increase of Illumina reads mapping to the 26S sub-genomic RNA region, in the final 4.0 kb of the EVG3–95 genome, as was observed in the REPLI-g generated nanopore reads (Fig. 3). While we predicted an active viral infection based solely on a limited number of nanopore reads, the higher density of Illumina reads mapping to the 26S region provides evidence of active EVEV replication in the 4.1 mosquito pool sample, rather than a latent infection or trace detection.

Variant Detection in Nanopore and Illumina Data

From Illumina sequencing data, we observed 16 high-quality single nucleotide variants (SNVs) across the strain EVG3-95 genome (Table 3, Fig. 3). 10 of these variants (~62%) were also detected in a nanopore sequencing read, regardless of which WTA-method was used. 10 SNVs were located in the 26S sub-genomic RNA region. Of these 10 variants, 7 were detected in a MinION nanopore read, 6 of the 7 were detected by a REPLI-g generated MinION read, and 3 of the 7 were detected by a Sigma WTA2 generated MinION read. Of the 6 SNVs in the first ~7 kb of the reference genome, 3 were only detected by Sigma WTA2 MinION reads. This data highlights the potential of the MinION nanopore sequencer to be leveraged for real-time, unbiased, SNV-level strain-tracking of arbovirus targets, directly from complex environmental samples.

Table 3 Metrics of high-quality single nucleotide variants (SNVs) detected across Everglades Virus EVG3–95 genome via Illumina sequencing of virus-positive mosquito pool sample 4.1. NT Position (column 1) based on NCBI accession #KR260737. All SNV presented here were 100% penetrant at indicated reference position. 10 out of 16 SNVs (~62%) were detected via MinION reads.

Full size table

Phylogeny of EVEV-2016_CxCdci_4_1

Everglades virus strains belong to the Type-2 VEEV serogroup; a distinct phylogenetic group within the VEEV serocomplex. A consensus genome of the suspected strain of EVEV present in sample 4.1 was generated from the EVEV strain EVG3–95 genome, EVEV strain Fe3–7c genome, and Illumina reads mapping to these genomes in CLC bio. We label this consensus genome scaffold ‘EVG-2016_CxCdci_4_1’ (Fig. 4). This name denotes the strain’s detection just outside of Everglades National Park in the autumn of 2016, the vector mosquito species (Culex cedecei), and the mosquito pool sample from this study that contained the virus (4.1). Phylogenetic analysis using 26S subgenomic RNA regions of the 144 VEEV strains clustered EVG-2016_CxCdci_4_1 distinctly with the other VEEV Type II EVEV strains, and it appears more closely related to the EVG3-95 strain (KR260737, isolated in 2013) than the older Fe3-7c strain (AF075251, isolated in 1963) (Fig. 4).

Discussion

Unbiased meta-omics approaches offer the ability to monitor the presence of nearly all potential pathogens in a single test. In geographic regions where several distinct pathogens can cause nearly identical febrile illness symptoms, the elimination of the need for multiple individual tests translates to reduced time for appropriate clinical or public health decisions to be made. Often, these same regions have limited capacity and infrastructure requirements to fully support brick-and-mortar laboratories and second-generation sequencing machines. Consequently, the prospect of unbiased meta-omics pathogen surveillance on devices as portable and low-maintenance as the MinION is a critical advantage that stands to fundamentally change the fight against emerging infectious diseases worldwide. However, challenges to the full realization of this potential remain.

When using unbiased meta-omics techniques, signal from the organism-of-interest is generally a small fraction of the total data output. Indeed, over 99% of nanopore reads from both sample 4.1 and 1.1 were annotated as ‘unclassified’ by Kraken, Kaiju, and Centrifuge. The reference databases used by the metagenomics classifiers used are focused on microbial and viral species, so this result indicates that over 99% of the nanopore signal was (not surprisingly) from the eukaryote host (Culex cedecei). This was confirmed through BLAST analysis of several of the longest reads (Fig. S1A,B and data not shown). The use of the GeneRead rRNA Depletion Kit was critical in this context, enabling sufficient host depletion for detection of EVEV RNA.

Interestingly, although the goal this project was not to demonstrate it’s utility as a method for pathogen discovery, overall microbial and viral profiles of the two samples (4.1 and 1.1) were nontheless obtained (see Supplementary Material). Data such as this could potentially be an entry point into identifying additional, potentially novel, microbial or viral members of the mosquito microbiome. For example, in our Illumina data for Sample 1.1, Kaiju (but not the other tools) identified 709 reads similar to Mercadeo Virus (MECDV) – a recently identified insect-specific flavivirus (ISF)⁵¹. Furthermore, our read mapping results show 752 Illumina reads from Sample 1.1 aligned to MECDV, yet zero reads from Sample 4.1 were aligned to the same virus. Although interesting, upon closer inspection these reads were revealed to map to a few narrowly defined regions with relatively low sequence identity (see Figure S3). Although metagenomics is potentially a promising approach to as a discovery tool, these results highlight that the results can often be tool and platform specific and research groups need to use multiple pipelines, NGS platforms, and laboratory methods to verify their results.

EVEV was detected in sample 4.1 using kmer-based taxonomy callers that leveraged RefSeq-sized databases. A more robust signal was, however, observed when using read-mapping tools with target-specific databases (Table 1). While the use of targeted databases may preclude the reporting of other organisms that the MinION reads may map equally well to, it should not falsely inflate the presence of the organism-of-interest since read-mapping settings are fixed and each read is given an equal chance at mapping to each reference genome. Thus, we should expect the same number of reads associated with the organism-of-interest whether we are using all of RefSeq or a streamlined, targeted database. The key advantage in using streamlined databases targeting specific organisms-of-interest is that it enables read-mapping tools to be deployed on portable commodity computing systems (i.e., Intel NUC, Macbook Pro, etc.), further supporting the field-forward position of these types of analytical approaches. Importantly, field-forward researchers do not need to “choose” one or the other. Kmer-tools with comprehensive databases (i.e., Kaiju, Centrifuge) and read-mapping tools that query streamlined, targeted databases (i.e., BWA-MEM) can both be utilized effectively on portable computing systems. Therefore, there’s an advantage to installing Kaiju or Centrifuge, and BWA-MEM, onto any computing system meant for agnostic nanopore sequencing in the field, and using them in tandem with appropriate corresponding databases to conduct surveillance of broad groups of organisms.

REPLI-g Cell & Single Cell WTA Challenges for Nanopore Sequencing

Inspection of the alignments of REPLI-g generated nanopore reads that mapped to VEEV showed a high proportion of chimeric reads that only had a fraction of the read length aligning with any VEEV reference genomes. BLAST analysis of the remainder of the reads often hit to various mosquito species’ genomes (Supplementary Figure 1A,B), indicating combined vector/pathogen chimeric reads. Review of the specific chemistry of REPLI-g’s WTA process highlighted a step that is likely to be problematic for long-read sequencing technologies: namely, the ligation step. After complementary DNA (cDNA) is generated from RNA templates, the cDNA fragments are randomly ligated together to create longer molecules. This enhances the efficiency of the REPLI-g SensiPhi DNA polymerase during the multiple displacement amplification (MDA) reaction and, if used for populations of single cells from singular organisms, the impact on downstream quantification of transcripts is negligible. However, given the high efficiency and fidelity of the SensiPhi DNA polymerase, these kits have been attractive for meta-omics studies, with populations of multiple species, from low diversity, low biomass environments or investigations with minimal biological sample material^52,53,54. For short-read sequencing technologies (i.e., Illumina), the confounding effects of the ligation step, and subsequent chimeric cDNAs, are negligible, or an acceptable trade-off for the efficacy of SensiPhi DNA polymerase. This is due to the low fraction of short reads that will, by chance, span a chimeric junction. With long read sequencing technologies, such as MinION or PacBio, this fraction is more likely to pose a challenge as long reads have a higher likelihood of spanning chimeric junctions. This is not necessarily problematic, depending on the use-case. For example, if the goal is simply detection of organisms of interest from a particular sample matrix in a biosurveillance context, then bioinformatics precautions can be set such that any existing signal will be recovered, despite the ligated fragments (e.g., reducing required fraction of reads that must align, reducing seed lengths, etc.). Indeed, informative recovery of Everglades Virus reads was observed with REPLI-g generated nanopore data (Table 1, Fig. 3). However, in other analyses requiring high quality alignments (e.g., epidemiological strain mapping, genome finishing), these chimeras will present more of a problem. Ideally, no analyses are precluded from the generated data, so we selected another WTA kit (WTA2 kit from Sigma-Aldrich) to test that does not include the random ligation of cDNA fragments inherent to the REPLI-g kits.

Total analysis time for EVEV-positive sample 4.1, from sample collection through data analysis, was approximately 26 hours with MinION sequencing of the Sigma WTA2 product, compared to more than 30 hours with sequencing on the Illumina MiSeq (Fig. 1). BWA-MEM, Centrifuge, and Kaiju were tested on a compact, portable commodity computing system (hyper-threaded quad-core, 32GB RAM Intel NUC Skull Canyon) running the Ubuntu 16.04 LTS operating system. These tools had rapid processing times for the nanopore data (≤~20 mins). The time-to-result for nanopore sequencing could have been reduced to between 3 and 6 hours if the original VEEV amplicon had been used as the sequencing material, and an internet connection was available for real-time taxonomy calling³³. However, an agnostic approach demands extra time investment not required of amplicon sequencing due to the increased sequencing depth required to detect ultra-low abundance signals. We did not monitor the nanopore data in real-time, so it is not possible to determine exactly how long it took to detect an EVEV signal. Nonetheless, we were able to generate actionable biosurveillance data within a time frame amenable to enacting rapid response measures from public health entities (~1 day).

One aspect of ONT’s workflow that is critical for effective field deployment is their flow cell wash procedure. Minimizing the required amount of consumables that must be carried to remote field sites is still one of the primary challenges for mobilized deployment of nanopore sequencing, and so, the washing/flushing and re-use of the flow cells is an important feature of the MinION platform. In addition, when attempting to distinguish problematic or infectious samples from benign samples using agnostic sequencing, inter-run cross-contamination will be a confounding issue. We washed and re-used a R9.4 flowcell to sequence EVEV-negative, Sigma WTA2-processed sample 1.1 after we sequenced EVEV-positive, Sigma processed sample 4.1.

We found low-level cross contamination of EVEV reads in sample 1.1 that we suspect originated from sample 4.1, despite following the ONT wash kit protocol exactly (Table 1). It is not suspected that this was trace signal of EVEV in sample 1.1 that was only detected with Sigma WTA2 processing since sample 1.1 was negative for VEEV/EVEV in the RT-qPCR assay (Fig. 2). When a fresh R9.4 flowcell was used for each REPLI-g processed sample, no cross contamination was widely reported across tested taxonomy classification tools.

A list specific items (hardware, software, reagents, consumables, etc.) used to complete the work discussed here is given in the Supplementary Material. Taken together, these items can fit within a single, medium sized (40 L) expedition-style backpack. This has not been lost on the research community and efforts to push nanopore-based molecular biosurveillance as far afield as possible have been prodigous^{27,28,37,55,56}, including Low-Earth orbit⁵⁷ and beyond^58,59. However, while carrying the items that are physically handled during sample processing is trivial, transporting the accompanying power and cold-chain logistical equipment has been more challenging and likely a primary factor preventing wider adoption of the technology in austere public health settings. The incredibly small footprint of the MinION is not fully empowered when one must also transport diesel generators, fuel, and mini-freezers as well. Development of intuitively designed, logistics-integrated, single-person portable laboratories will facilitate the future that the MinION’s form-factor inspires.

Future work will determine whether agnostic nanopore sequencing will be an effective biosurveillance tool on lower-titer pathogens – such as contaminated food samples or blood samples taken from sentinel wildlife populations. Our work may have benefited from a potentially high-titer, which has previously been reported to be above 4 p.f.u./ml in C. cedecei for EVEV and is often characteristic of other strains of VEEV as well⁶⁰. However, the chemistry of ONT’s MinION flowcells and library preparation reagents remain under active development and improvements in both data yield and sequencing read quality are being released regularly⁵⁵. We expect this to translate to unbiased strain-level detection of a wider array of organisms from even more challenging samples in the near future.

Conclusions

Previous unbiased, meta-omics nanopore sequencing approaches to strain-specific target classification and SNV-calling have been limited to human blood, unknown isolates, or mixed culture sample matrices^34,56. In this study, we’ve pushed this capability to include complex biological sample matrices collected in the field – namely crushed mosquito pools collected from field traps. We describe a protocol that leverages ultra-compact hardware (e.g. the Biomeme two3, Intel NUC, and ONT MinION) to enable field-forward use of unbiased nanopore sequencing for the purposes of arbovirus biosurveillance. This work demonstrates the utility of nanopore sequencing for a wide array of public health and basic research use-cases in environmental biosurveillance. It is our hope that this work will further encourage the adoption of field-forward sequencing and bioinformatics to routinely bring the laboratory to the sample.

Methods

Sample Collection

Twelve mosquito traps (CO₂/light-baited) were set approximately 50 m apart for overnight collection near carbonate dissolution pools in a forested environment on October 17^th, 2016. The transect of the traps ran adjacent to Canal 111E in Homestead, FL, USA. The first clinical case of Everglades virus in humans was likely acquired while fishing along C-111 canal⁶¹. The sampling site was located at 25.4078, −80.5237, approximately 3.8 miles from the Ingraham Highway entrance to Everglades National Park, FL, USA. Several thousand mosquitoes were collected. Female Culex cedecei individuals were visually sorted and separated via light microscopy inspection into their own sample pools of 20 individuals per 1.5 ml Eppendorf tubes. Twenty-five (25) sample pools were sorted, for a total of 500 female Culex cedecei mosquitoes.

Sample Extraction

Bulk nucleic acids were extracted from each mosquito pool individually using the Bulk Nucleic Acids Field Extraction kit from Biomeme, Inc. (Philadelphia, PA, USA). The manufacturer’s protocol was followed, with slight modifications. Each 20-mosquito pool was mashed in 1.5 ml tube with kit-provided pestle for 1 minute. 50 µl of Biomeme Lysis Buffer (BLB) was added to the tube and mashing continued for an additional minute. 450 µl of BLB was added and mashing continued for an additional 30 seconds. The tube was then vortexed for 1 minute, then centrifuged for 1 minute at 5,000 × g to pellet course debris. Subsequently, 500 µl of supernatant was transferred to 1000 µl aliquot of BLB. The supernatant/BLB mix was briefly vortexed to mix. The Biomeme syringe extraction column was assembled and the entire supernatant/BLB mix was drawn up through the column, and then expelled slowly three times. Next, the entire volume of a 500 µl aliquot of Biomeme Protein Wash solution (BPW) was drawn up through the column and expelled slowly. Then, the entire volume of a 750 µl aliquot of Biomeme Wash Buffer (BWB) was drawn up through the column and expelled slowly. After expelling the BWB, the column was pumped continuously (air-dried) without any reagents until no buffer was spraying from the tip into the collection vial and there were minimal droplets in the column’s tubing. Finally, 200 µl of Biomeme Elution Buffer (BEB) was drawn into the column and allowed to incubate for 1 minute at room temperature. The BEB containing eluted total nucleic acids (TNA) was then expelled into a fresh 1.5 ml tube.

RT-qPCR

Each mosquito pool RNA extract was queried with a quantitative real-time reverse transcription PCR assay specific for Venezuelan Equine Encephalitis Virus (VEEV), using the SuperScript™ III Platinum^® One-Step Quantitative RT-PCR System from Invitrogen (Waltham, MA, USA). The master mix contained, per 25 µl reaction; 5.25 µl dH₂O, 12.5 µl 2 × reaction mix, 0.5 µl RNaseOUT™ ribonuclease inhibitor, 0.5 µl Superscript III™ RT/Platinum^® Taq polymerase, 0.5 µl VEEV forward primer, 0.5 µl VEEV reverse primer, and 0.25 µl of VEEV taq-man probe. 5 µl sample RNA was added to each reaction. The RT-qPCR reactions for all 25 samples, plus a positive control and a no-template negative control, were run on both the Biomeme two3 hand-held qPCR machine and the BioRad CFX96 Touch™ Real-time PCR detection system with the following cycling conditions; 50 °C for 15 minutes, 95 °C for 2 minutes, then 50 cycles of 95 °C for 15 seconds and 60 °C for 1 minute. A single mosquito pool (sample 4.1) was positive for VEEV on both the Biomeme two3 machine and the CFX96. Primer and probe sequences available upon request.

Generation of Metatranscriptomes

We processed two samples for metatranscriptome sequencing; the single sample that tested positive for EVEV (4.1) and a sample that was negative for EVEV (1.1). Following the manufacturer’s protocol, the GeneRead rRNA Depletion Kit (Qiagen, Inc., Hilden, Germany) was used to reduce the burden of C. cedecei vector DNA and RNA. Depleted samples were then processed with the REPLI-g Single Cell Whole Transcriptome Amplification (WTA) kit (Qiagen, Inc.), according to the manufacturer’s protocol. After review of the REPLI-g nanopore data, it was determined that a comparison to another WTA method for nanopore metatranscriptome sequencing was prudent (additional details in Results). Following the manufacturer’s protocol, we also processed the raw TNA sample with the WTA2 Complete Whole Transcriptome Amplification kit from Sigma-Aldrich Inc. (St. Louis, MO, USA). WTA products (from either kit) were purified with Agencourt (Beverly, MA, USA) AMPure® beads as follows; 1.8x the eluted WTA product volume (54 µl) of AMPure beads was added to the WTA-product (30 µl) and pipette-mixed 10 times. The reaction was placed on a magnetic stand for 10 minutes and the cleared solution was aspirated away. The cDNA-bound magnetic beads were washed 2x with 200 µl 70% ethanol and allowed to air dry for 5 minutes. 40 µl of dH₂O was added to the washed beads and pipette-mixed 10 times. The sample was placed back on the magnetic stand for 10 minutes. The purified, eluted cDNA was transferred to a fresh 1.5 ml tube and quantified with a Qubit flourometer (ThermoFisher Sci., Waltham, MA, USA) according to the manufacturer’s protocol.

Nanopore sequencing of Metatranscriptomes

The WTA products from virus-positive sample 4.1 and virus-negative sample 1.1 were prepared for nanopore sequencing using ONT’s 1D Ligation Sequencing library preparation kit (SQK-LSK108), following the manufacturer’s protocol. The library was loaded onto an R9.4 flow cell. Two separate flow cells were used for each sample for sequencing of the REPLI-g generated samples. For the Sigma WTA2 generated samples, the same flow cell was re-used for sample 1.1 (virus-negative sample) after sequencing sample 4.1 (virus-positive sample) and flushing/washing with the ONT Flowcell Wash Kit (EXP-WSH002). For all WTA products, the NC_48Hr_Sequencing_Run_FLO-MIN106_SQK-LSK108_plus_Basecaller.py script was used for collecting data. The REPLI-g 4.1 sample was run for approximately 26.5 hours, with a total of 1142 channels with active pores detected during the pre-run mux scan. The REPLI-g 1.1 sample was run for approximately 12 hours, with a total of 1465 channels with active pores detected during the pre-run mux scan. The Sigma WTA2 4.1 sample was run for approximately 20 hours, with a total of 1168 channels with active pores detected during the pre-run mix scan. The Sigma WTA2 1.1 sample was run for approximately 7.5 hours, with a total of 551 channels with active pores detected during the pre-run mux scan.

Illumina sequencing of Metatranscriptomes

The REPLI-g WTA product from virus-positive sample 4.1 was prepared for Illumina sequencing using Illumina’s Nextera XT library prep kit (FC-131–1024), following the manufacturer’s protocol through the library clean-up step. Manual normalization was performed following DNA quantitation of the CAN product using the Qubit fluorometer to ensure a sufficient quantity of library was generated. The library was then diluted using a conversion factor of 2 to 2 nM and pooled with other libraries. The libraries were added to a cartridge at a final loading concentration of 12pM using a MiSeq Reagent Kit V2 (MS-102–2002). A 2 × 151 paired-end run was performed on the Illumina MiSeq system (SY-410–1003) using the FASTQ only workflow.

Bioinformatics and Data Analysis

Nanopore Data

Nanopore reads were basecalled using the local basecalling algorithm in MinKNOW version 1.4.3. FAST5 files of basecalled reads were converted to FASTA files using poretools⁶². The FASTA files from each sequenced sample (4.1 and 1.1) were queried for EVEV/VEEV using several kmer-based metagenomics taxonomy callers, including Kraken⁶³, Kaiju⁶⁴, and Centrifuge⁶⁵. Family-level profiles, as classified by Kraken, are available in the Supplemental Materials (see Figure S2 and Supplemental Table 2). A full-length read-mapping alignment tool, BWA-MEM⁶⁶, was also tested. For computational resource and analysis time considerations, these read-mapping tools were deployed with a custom database of 144 VEEV genomes (rather than the larger RefSeq-sized databases of the kmer tools). See Supplementary Materials for specific parameters called for each tool, as well as a list of accession numbers for all VEEV references in the custom database. The SAM alignment file generated via BWA-MEM mapping (with ‘-x ont2d’ flag called) to the custom VEEV database was imported into CLC-Genomics Workbench version 10.0.1 (CLC) and converted to tracks for visualization purposes and exploratory analysis in comparison to Illumina MiSeq reads mapping to the same VEEV genomes.

Illumina Data

Sequencing reads from Sample 4.1 and 1.1 were trimmed to a Q = 30 quality score in CLC and analyzed for total taxonomic composition using Kraken, Kaiju, and Centrifuge (See Supplementary Material for specific parameters for each tool, and family-level profiles). No reads in Sample 1.1 were classified as Alphavirus by any of the above tools. For Sample 4.1, however, reads were mapped to the custom database of VEEV genomes with BWA-MEM and in CLC. Variants were called in CLC using the Basic Variant Detection Tool, which makes no assumptions about the underlying data. Sixteen (16) high frequency (HF) variants were called using this tool such that non-specific matches were ignored, a minimum of 30x coverage was required, and the variant called was required to be 100% penetrant (homozygous) with a minimum Q30 quality score. Three HF variants were predicted to result in amino acid changes. Low frequency (LF) variants, 60 in total including HF variants, were also called in a similar manner but had a 30x minimum coverage, with the variant allele being a minimum 10% frequency and 10x coverage at Q30 or above. A total of 18 LF variants were predicted to result in amino acid changes. A consensus genome was generated from the top two VEEV genomes recruiting the most reads, and the Illumina reads themselves. The consensus genome, called ‘EVG-2016_CxCdci_4_1’, was included in a multiple sequence alignment (MSA) with the database of 144 VEEV genomes. A pruned phylogenetic tree was constructed using the 30 closest relatives. The chosen tree construction was the neighbor-joining method⁶⁷ and the nucleotide distance measure used was Jukes-Cantor⁶⁸. The tree was validated with 1,000 bootstrap replicates.

Data availability

All raw sequence data from this work can be found at NCBI under BioProject PRJNA399278 (https://www.ncbi.nlm.nih.gov/bioproject/399278).

References

Afshinnekoo, E. et al. Precision Metagenomics: Rapid Metagenomic Analyses for Infectious Disease Diagnostics and Public Health Surveillance. J. Biomol. Tech. 28, 40–45 (2017).
Article PubMed PubMed Central Google Scholar
Inglis, T. J. J. Adapting the mobile laboratory to the changing needs of the Ebolavirus epidemic. J. Med. Microbiol. 64, 587–91 (2015).
Article PubMed Google Scholar
Wölfel, R. et al. Mobile diagnostics in outbreak response, not only for Ebola: a blueprint for a modular and robust field laboratory. Euro Surveill. 20, (2015).
Gardy, J., Loman, N. J. & Rambaut, A. Real-time digital pathogen surveillance — the time is now. Genome Biol. 16, 155 (2015).
Article PubMed PubMed Central Google Scholar
Mulcahy-O’Grady, H. & Workentine, M. L. The Challenge and Potential of Metagenomics in the Clinic. Front. Immunol. 7, 29 (2016).
PubMed PubMed Central Google Scholar
Miller, R. R., Montoya, V., Gardy, J. L., Patrick, D. M. & Tang, P. Metagenomics for pathogen detection in public health. Genome Med. 5, 81 (2013).
Article PubMed PubMed Central Google Scholar
Lim, Y. W. et al. Clinical insights from metagenomic analysis of sputum samples from patients with cystic fibrosis. J. Clin. Microbiol. 52, 425–437 (2014).
Article PubMed PubMed Central Google Scholar
Huang, A. D. et al. Metagenomics of Two Severe Foodborne Outbreaks Provides Diagnostic Signatures and Signs of Coinfection Not Attainable by Traditional Methods. Appl. Environ. Microbiol. 83 (2017).
Doggett, N. A. et al. Culture-Independent Diagnostics for Health Security. Heal. Secur. 14, 122–42 (2016).
Article Google Scholar
Valdivia-Granda, W. A. Biodefense Oriented Genomic-Based Pathogen Classification Systems: Challenges and Opportunities. J. Bioterror. Biodef. 3, 1–9 (2012).
Google Scholar
Epstein, J. H. et al. Identification of GBV-D, a novel GB-like flavivirus from old world frugivorous bats (Pteropus giganteus) in Bangladesh. PLoS Pathog. 6, e1000972 (2010).
Article PubMed PubMed Central Google Scholar
Coffey, L. L. et al. Enhanced arbovirus surveillance with deep sequencing: Identification of novel rhabdoviruses and bunyaviruses in Australian mosquitoes. Virology 448, 146–158 (2014).
Article CAS PubMed Google Scholar
Temmam, S., Davoust, B., Berenger, J.-M., Raoult, D. & Desnues, C. Viral metagenomics on animals as a tool for the detection of zoonoses prior to human infection? Int. J. Mol. Sci. 15, 10377–97 (2014).
Article CAS PubMed PubMed Central Google Scholar
Port, J. A., Cullen, A. C., Wallace, J. C., Smith, M. N. & Faustman, E. M. Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments. Environ. Health Perspect. 122, 222–8 (2014).
Article PubMed Google Scholar
Bergholz, T. M., Moreno Switt, A. I. & Wiedmann, M. Omics approaches in food safety: fulfilling the promise? Trends Microbiol. 22, 275–81 (2014).
Article CAS PubMed PubMed Central Google Scholar
Diaz-Sanchez, S., Hanning, I., Pendleton, S. & D’Souza, D. Next-generation sequencing: the future of molecular genetics in poultry production and food safety. Poult. Sci. 92, 562–72 (2013).
Article CAS PubMed Google Scholar
Ottesen, A. R. et al. Baseline survey of the anatomical microbial ecology of an important food plant: Solanum lycopersicum (tomato). BMC Microbiol. 13, 114 (2013).
Article PubMed PubMed Central Google Scholar
Ozanich, R. M. et al. Evaluation of PCR Systems for Field Screening of Bacillus anthracis. Heal. Secur. 15, 70–80 (2017).
Article Google Scholar
Harrison, G. F., Scheirer, J. L. & Melanson, V. R. Development and validation of an arthropod maceration protocol for zoonotic pathogen detection in mosquitoes and fleas. J. Vector Ecol. 40, 83–9 (2015).
Article PubMed Google Scholar
Meagher, R. et al. Real-time, Autonomous Biosurveillance for Vector-borne Viral Pathogens (SMART Traps). Assessing Risk for Emerging Arboviral Disease. https://www.osti.gov/servlets/purl/1366891 (2016).
Laing, E., Yan, L., Sterling, S. & Broder, C. A Luminex-based multiplex assay for the simultaneous detection of glycoprotein specific antibodies to ebolaviruses, marburgviruses, and henipaviruses. Int. J. Infect. Dis. 53, 108–109 (2016).
Article Google Scholar
Lee, W. Review and analysis of bioidentification systems for mobile laboratory and field use (2016).
Shukla, S., Hong, S.-Y., Chung, S. H. & Kim, M. Rapid Detection Strategies for the Global Threat of Zika Virus: Current State, New Hypotheses, and Limitations. Front. Microbiol. 7, 1–15 (2016).
Article CAS Google Scholar
Bartholomew, R. A. et al. Evaluation of Immunoassays and General Biological Indicator Tests for Field Screening of Bacillus anthracis and Ricin. Heal. Secur. 15, 81–96 (2017).
Article Google Scholar
Urban, J. M., Bliss, J., Lawrence, C. E. & Gerbi, S. A. Sequencing ultra-long DNA molecules with the Oxford Nanopore MinION. bioRxiv 19281, (2015).
Giordano, F. et al. De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms. Sci. Rep. 7, 3935 (2017).
Article ADS PubMed PubMed Central Google Scholar
Edwards, A. et al. Deep Sequencing: Intra-Terrestrial Metagenomics Illustrates The Potential Of Off-Grid Nanopore DNA Sequencing. bioRxiv, https://doi.org/10.1101/133413 (2017).
Johnson, S. S., Zaikova, E., Goerlitz, D. S., Bai, Y. & Tighe, S. W. Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer. J. Biomol. Tech. 28, 2–7 (2017).
PubMed PubMed Central Google Scholar
Branton, D. et al. The potential and challenges of nanopore sequencing. Nat. Biotechnol. 26, 1146–53 (2008).
Article CAS PubMed PubMed Central Google Scholar
Köser, C. U. et al. Routine use of microbial whole genome sequencing in diagnostic and public health microbiology. PLoS Pathog. 8, e1002824 (2012).
Article PubMed PubMed Central Google Scholar
Quick, J. et al. Rapid draft sequencing and real-time nanopore sequencing in a hospital outbreak of Salmonella. Genome Biol. 16, 114 (2015).
Article PubMed PubMed Central Google Scholar
Hoenen, T. et al. Nanopore Sequencing as a Rapidly Deployable Ebola Outbreak Tool. Emerg. Infect. Dis. 22, 331–4 (2016).
Article CAS PubMed PubMed Central Google Scholar
Juul, S. et al. What’s in my pot? Real-time species identification on the MinION. bioRxiv 30742, (2015).
Greninger, A. L. et al. Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med. 7, 99 (2015).
Article PubMed PubMed Central Google Scholar
Hewitt, F. C., Guertin, S. L., Ternus, K. L., Schulte, K. & Kadavy, D. R. Toward Rapid Sequenced-Based Detection and Characterization of Causative Agents of Bacteremia. bioRxiv 162735 (2017).
Cornelis, S., Gansemans, Y., Deleye, L., Deforce, D. & Van Nieuwerburgh, F. Forensic SNP Genotyping using Nanopore MinION Sequencing. Sci. Rep. 7, 41759 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Faria, N. R. et al. Mobile real-time surveillance of Zika virus in Brazil. Genome Med. 8, 97 (2016).
Article PubMed PubMed Central Google Scholar
Guedes, D. R. et al. Zika virus replication in the mosquito Culex quinquefasciatus in Brazil. Emerg. Microbes Infect. 6, e69 (2017).
Article PubMed PubMed Central Google Scholar
Batovska, J., Lynch, S. E., Rodoni, B. C., Sawbridge, T. I. & Cogan, N. O. Metagenomic arbovirus detection using MinION nanopore sequencing. J. Virol. Methods, https://doi.org/10.1016/j.jviromet.2017.08.019 (2017).
Weaver, S. C., Winegar, R., Manger, I. D. & Forrester, N. L. Alphaviruses: population genetics and determinants of emergence. Antiviral Res. 94, 242–57 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zacks, M. A. & Paessler, S. Encephalitic Alphaviruses. Vet. Microbiol. 140, 281 (2010).
Article CAS PubMed Google Scholar
Brault, A. C., Powers, A. M. & Weaver, S. C. Vector infection determinants of Venezuelan equine encephalitis virus reside within the E2 envelope glycoprotein. J. Virol. 76, 6387–92 (2002).
Article CAS PubMed PubMed Central Google Scholar
Coffey, L. L. et al. Serologic evidence of widespread everglades virus activity in dogs, Florida. Emerg. Infect. Dis. 12, 1873–9 (2006).
Article PubMed PubMed Central Google Scholar
Carrara, A. et al. Venezuelan equine encephalitis virus infection of cotton rats. Emerg. Infect. Dis. 13, 1158–65 (2007).
Article PubMed PubMed Central Google Scholar
Chamberlain, R. W. et al. Arbovirus studies in south Florida, with emphasis on Venezuelan equine encephalomyelitis virus. Am. J. Epidemiol. 89, 197–210 (1969).
Article CAS PubMed Google Scholar
Ventura, A. K., Buff, E. E. & Ehrenkranz, N. J. Human Venezuelan equine encephalitis virus infection in Florida. Am. J. Trop. Med. Hyg. 23, 507–12 (1974).
Article CAS PubMed Google Scholar
Bigler, W. J., Lassing, E., Buff, E., Lewis, A. L. & Hoff, G. L. Arbovirus surveillance in Florida: wild vertebrate studies 1965–1974. J. Wildl. Dis. 11, 348–56 (1975).
Article CAS PubMed Google Scholar
Perkel, J. M. Pocket laboratories. Nature 545, 119–121 (2017).
Article ADS CAS PubMed Google Scholar
Guerbois, M. et al. IRES-driven expression of the capsid protein of the Venezuelan equine encephalitis virus TC-83 vaccine strain increases its attenuation and safety. PLoS Negl. Trop. Dis. 7, e2197 (2013).
Article CAS PubMed PubMed Central Google Scholar
Baer, A. et al. Venezuelan Equine Encephalitis Virus Induces Apoptosis through the Unfolded Protein Response Activation of EGR1. J. Virol. 90, 3558–72 (2016).
Article CAS PubMed PubMed Central Google Scholar
Carrera, J.-P. et al. Mercadeo Virus: A Novel Mosquito-Specific Flavivirus from Panama. Am. J. Trop. Med. Hyg. 93, 1014–9 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zoll, J. et al. Direct multiplexed whole genome sequencing of respiratory tract samples reveals full viral genomic information. J. Clin. Virol. 66, 6–11 (2015).
Article CAS PubMed Google Scholar
Yao, G. et al. A Perspective Study of Koumiss Microbiome by Metagenomics Analysis Based on Single-Cell Amplification Technique. Front. Microbiol. 8, 165 (2017).
PubMed PubMed Central Google Scholar
Tong, X. et al. High diversity of airborne fungi in the hospital environment as revealed by meta-sequencing-based microbiome analysis. Sci. Rep. 7, 39606 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Leggett, R. M. & Clark, M. D. A world of opportunities with nanopore sequencing. J. Exp. Bot. https://doi.org/10.1093/jxb/erx289 (2017).
Walter, M. C. et al. MinION as part of a biomedical rapidly deployable laboratory. J. Biotechnol. 250, 16–22 (2017).
Article CAS PubMed Google Scholar
Castro-Wallace, S. L. et al. Nanopore DNA Sequencing and Genome Assembly on the International Space Station. bioRxiv 77651, (2016).
Rezzonico, F. Nanopore-based instruments as biosensors for future planetary missions. Astrobiology 14, 344–51 (2014).
Article ADS CAS PubMed Google Scholar
Karouia, F., Peyvan, K. & Pohorille, A. Toward biotechnology in space: High-throughput instruments for in situ biological research beyond Earth. Biotechnol. Adv. https://doi.org/10.1016/j.biotechadv.2017.04.003 (2017).
Coffey, L. L. & Weaver, S. C. Susceptibility of Ochlerotatus taeniorhynchus and Culex nigripalpus for Everglades virus. Am. J. Trop. Med. Hyg. 73, 11–6 (2005).
Article PubMed Google Scholar
Ehrenkranz, N. J., Sinclair, M. C., Buff, E. & Lyman, D. O. The natural occurrence of Venezuelan equine encephalitis in the United States. N. Engl. J. Med. 282, 298–302 (1970).
Article CAS PubMed Google Scholar
Loman, N. J. & Quinlan, A. R. Poretools: a toolkit for analyzing nanopore sequence data. Bioinformatics 30, 3399–401 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wood, D. E. & Salzberg, S. L. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 15, R46 (2014).
Article PubMed PubMed Central Google Scholar
Menzel, P., Lee N, K. & Krogh, A. Kaiju: Fast and sensitive taxonomic classification for metagenomics. bioRxiv, https://doi.org/10.1101/031229 (2015).
Kim, D., Song, L., Breitwieser, F. P. & Salzberg, S. L. Centrifuge: rapid and sensitive classification of metagenomic\nsequences. Genome Res. gr. 210641, 116, https://doi.org/10.1101/gr.210641.116 (2016).
Google Scholar
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–95 (2010).
Article PubMed PubMed Central Google Scholar
Saitou, N. & Nei, M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–25 (1987).
CAS PubMed Google Scholar
Jukes, T. H. & Cantor, C. R. Evolution of Protein Molecules. In Mammalian ProteinMetabolism 21–132, https://doi.org/10.1016/B978-1-4832-3211-9.50009-7 (Elsevier, 1969).

Download references

Acknowledgements

This work was supported by an internal research & development grant from MRIGlobal.

Author information

Authors and Affiliations

MRIGlobal, 65 West Watkins Mill Road, Gaithersburg, MD, 20878, USA
Joseph A. Russell & Jonathan L. Jacobs
MRIGlobal, 1470 Treeland Blvd. SE, Palm Bay, FL, 32909, USA
Brittany Campos & Jennifer Stone
University of Florida - Florida Medical Entomology Laboratory, 200 9th St. SE, Vero Beach, FL, 32962, USA
Erik M. Blosser & Nathan Burkett-Cadena

Authors

Joseph A. Russell
View author publications
You can also search for this author in PubMed Google Scholar
Brittany Campos
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Stone
View author publications
You can also search for this author in PubMed Google Scholar
Erik M. Blosser
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Burkett-Cadena
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan L. Jacobs
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.A.R., E.M.B., N.B.C. and J.L.J. designed the experiments. J.A.R., B.C., J.S., E.M.B., and N.B.C. carried out the experiments and conducted the data analysis. J.A.R. and J.L.J. wrote the manuscript.

Corresponding author

Correspondence to Jonathan L. Jacobs.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Russell, J.A., Campos, B., Stone, J. et al. Unbiased Strain-Typing of Arbovirus Directly from Mosquitoes Using Nanopore Sequencing: A Field-forward Biosurveillance Protocol. Sci Rep 8, 5417 (2018). https://doi.org/10.1038/s41598-018-23641-7

Download citation

Received: 02 November 2017
Accepted: 16 March 2018
Published: 03 April 2018
DOI: https://doi.org/10.1038/s41598-018-23641-7

This article is cited by

Metagenomic surveillance for bacterial tick-borne pathogens using nanopore adaptive sampling
- Evan J. Kipp
- Laramie L. Lindsey
- Peter A. Larsen
Scientific Reports (2023)
Nanopore-Based Metagenomic Sequencing in Respiratory Tract Infection: A Developing Diagnostic Platform
- Robert Chapman
- Luke Jones
- Stefan Bagby
Lung (2023)
Technical comparison of MinIon and Illumina technologies for genotyping Chikungunya virus in clinical samples
- Leandro Menezes de Souza
- Isabelle Dias de Oliveira
- Leonardo José Tadeu de Araújo
Journal of Genetic Engineering and Biotechnology (2023)
Rapid, in-field deployable, avian influenza virus haemagglutinin characterisation tool using MinION technology
- Ellen M. de Vries
- Noel O. I Cogan
- Stacey E. Lynch
Scientific Reports (2022)
Evaluating the lettuce metatranscriptome with MinION sequencing for future spaceflight food production applications
- Natasha J. Haveman
- Christina L. M. Khodadad
- Jamie S. Foster
npj Microgravity (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

RT-QPCR Arbovirus Surveillance

Nanopore Sequencing (REPLI-g Single Cell WTA)

Nanopore Sequencing (Sigma WTA2)

Illumina Sequencing

Variant Detection in Nanopore and Illumina Data

Phylogeny of EVEV-2016_CxCdci_4_1

Discussion

REPLI-g Cell & Single Cell WTA Challenges for Nanopore Sequencing

Conclusions

Methods

Sample Collection

Sample Extraction

RT-qPCR

Generation of Metatranscriptomes

Nanopore sequencing of Metatranscriptomes

Illumina sequencing of Metatranscriptomes

Bioinformatics and Data Analysis

Nanopore Data

Illumina Data

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links