Direct RNA Sequencing of the Coding Complete Influenza A Virus Genome

Keller, Matthew W.; Rambo-Martin, Benjamin L.; Wilson, Malania M.; Ridenour, Callie A.; Shepard, Samuel S.; Stark, Thomas J.; Neuhaus, Elizabeth B.; Dugan, Vivien G.; Wentworth, David E.; Barnes, John R.

doi:10.1038/s41598-018-32615-8

Published: 26 September 2018

Matthew W. Keller ORCID: orcid.org/0000-0002-5850-1698¹^na1,
Benjamin L. Rambo-Martin²^na1,
Malania M. Wilson²^na1,
Callie A. Ridenour²,
Samuel S. Shepard³,
Thomas J. Stark³,
Elizabeth B. Neuhaus³,
Vivien G. Dugan³,
David E. Wentworth³ &
John R. Barnes³

Scientific Reports volume 8, Article number: 14408 (2018)

An Author Correction to this article was published on 19 October 2018

This article has been updated

Abstract

For the first time, a coding complete genome of an RNA virus has been sequenced in its original form. Previously, RNA was sequenced by the chemical degradation of radiolabeled RNA, a difficult method that produced only short sequences. Instead, RNA has usually been sequenced indirectly by copying it into cDNA, which is often amplified to dsDNA by PCR and subsequently analyzed using a variety of DNA sequencing methods. We designed an adapter to short highly conserved termini of the influenza A virus genome to target the (-) sense RNA into a protein nanopore on the Oxford Nanopore MinION sequencing platform. Utilizing this method with total RNA extracted from the allantoic fluid of influenza rA/Puerto Rico/8/1934 (H1N1) virus infected chicken eggs (EID₅₀ 6.8 × 10⁹), we demonstrate successful sequencing of the coding complete influenza A virus genome with 100% nucleotide coverage, 99% consensus identity, and 99% of reads mapped to influenza A virus. By utilizing the same methodology one can redesign the adapter in order to expand the targets to include viral mRNA and (+) sense cRNA, which are essential to the viral life cycle, or other pathogens. This approach also has the potential to identify and quantify splice variants and base modifications, which are not practically measurable with current methods.

Introduction

Decades ago, a method was published describing the use of base-specific chemical degradation with chromatographic and autoradiographic resolution as a way of directly sequencing short stretches of RNA¹. Since then, little progress has been made on directly sequencing RNA. Instead, the elucidation of RNA sequences is typically indirect and primarily requires methods that synthesize cDNA from RNA templates. While these methods are powerful², they suffer from limitations inherent to cDNA synthesis and amplification such as template switching³, artifactual splicing⁴, loss of strandedness information⁵, obscuring of base modifications⁶, and propagation of error⁷. In 2009, a method for RNA sequencing was developed on the Helicos Genetic Analysis System where poly(A) mRNA is sequenced by the step-wise synthesis and imaging of nucleotides labeled with an interfering but cleavable fluorescent dye⁸. While the input material requirements for this method are extremely low, the long workflow and short reads are limiting. Nevertheless, these approaches expose two major limitations of RNA sequencing: sequencing by synthesis and short read length. Overall, current technologies for sequencing RNA templates present difficulties in the assessment of base modifications, splice variants, and analysis of single RNA molecules.

Influenza A viruses are negative-sense segmented RNA viruses^9,10,11. Sequencing these viruses has played an important role in their understanding for 40 years^12,13 including the discovery of highly conserved vRNA termini¹⁴ (Fig. 1A). These 3′ and 5′ termini are 12 and 13 nucleotides in length, respectively, and they are highly conserved across the PB2, PB1, PA, HA, NP, NA, M, and NS genome segments of influenza A viruses, which enabled the development of a universal primer set for influenza A virus genome amplification^15,16. Even though these conserved vRNA termini have been readily exploited for efficient and sensitive next generation sequencing (NGS) of influenza A virus segments^16,17,18, current methods retain some of the limitations inherent to cDNA-based techniques^3,4,5,6,7. A new tool for long read direct RNA sequencing could reduce these biases and greatly aid efforts to directly sequence influenza A viruses and other RNA viruses.

Oxford Nanopore Technologies (ONT) recently released their direct RNA sequencing protocol. This method involves the sequential ligations of a reverse transcriptase adapter (RTA) and a sequencing adapter¹⁹. The RTA is a small dsDNA molecule (Fig. 1B) that contains a T₁₀ overhang designed to hybridize with poly(A) mRNA and a 5′ phosphate (P_i) that ligates to the RNA creating a DNA-RNA hybrid. The RTA also serves as a priming location for reverse transcription of the entire length of the RNA molecule, though the cDNA generated is not sequenced. The DNA-RNA hybrid is then ligated to the sequencing adapter which directs the RNA strand of the assembled library into the nanopore for sequencing¹⁹.

We describe direct RNA sequencing of five influenza A virus genomes through modification of recently released RNA methods from Oxford Nanopore Technologies¹⁹ (Fig. 1C) by targeting the conserved 3′ end of the influenza A virus genome with an adapter to capture it (Fig. 1D), rather than a primer to amplify it. The efficacy of the adapter is tested by sequencing the RNA genome of an influenza A virus generated by reverse genetics, A/Puerto Rico/8/1934 (H1N1), as well as genetically diverse contemporary human or avian influenza A viruses including A/Florida/20/2018 (H1N1pdm09), A/Texas/50/2012 (H3N2), A/chicken/Ghana/20/2015 (H5N1), and A/British Columbia/1/2015 (H7N9) (Table 1). The total RNA was purified from either allantoic fluid harvested from infected embryonated chicken eggs or infected MDCK cell culture supernatants. The results from the nanopore sequencing are compared to the current Illumina-based pipeline utilized by the Influenza Genomics Team at the Centers for Disease Control and Prevention.

Table 1 Influenza rA/Puerto Rico/8/1934 virus, a common laboratory strain and candidate vaccine virus backbone, was used in this study to demonstrate direct RNA sequencing effectiveness and repeatability using a crude starting material.

Results

RNA calibration strand: enolase II mRNA

First, the RNA calibration strand enolase was directly sequenced on the MinION platform. Three sequencing experiments covered 100% of the coding regions of the 1,314 nucleotide long RNA molecule to an average depth of 122,207 ± 8,126 (sd). Of the 171,135 ± 21,987 reads, 98.6 ± 1.4% mapped to the reference sequence (Tables 2 and S3), with 100% of the mapped reads in the sense orientation. The direction of the reads and the positive slope of the coverage diagram (Fig. S1) are indicative of directional sequencing of mRNA from the 3′ end. The distribution of read lengths (Table S1 and Fig. S2) accurately corresponds to the expected length of 1,314 nucleotides. The read level accuracy was 90.4 ± 0.8%, and the consensus sequence was 99.72% ± 0.04% in concordance with the known reference.

Table 2 An individual MiSeq experiment of influenza rA/Puerto Rico/8/1934 (H1N1) vRNA from crude virus is compared to MinION experiments of enolase mRNA (technical triplicate), influenza rA/Puerto Rico/8/1934 (H1N1) vRNA from crude virus (triplicate), and single runs of crude material containing influenza A/Florida/20/2018 (H1N1pdm09), A/Texas/50/2012 (H3N2), A/chicken/Ghana/20/2015 (HPAI H5N1), and A/British Columbia/1/2015 (LPAI H7N9) viruses. Values from triplicate experiments are presented as averages ± standard deviation. These data are expanded in Table S3.

Sequencing RNA from crude versus purified influenza rA/Puerto Rico/8/1934 (H1N1) virus

Based on available details on the RTA system, it was possible to make further modification to target other RNA species (Fig. 1). To adapt this technique for the influenza A virus genome, the target sequence of the RTA was changed from an oligo-dT to a sequence complementary to the 12 nucleotides that are conserved at the 3′ end of the RNA segments of influenza A viruses (Table S2).

As a favorable substrate for the modified adapter and a positive control for future experiments, RNA from two sucrose-purified influenza rA/Puerto Rico/8/1934 (H1N1) virus (EID₅₀ 4.2 × 10¹¹) preparations (pure) was sequenced via MinION. Two sequencing experiments covered 100% of the coding regions of the PB2, PB1, PA, HA, NP, NA, M, and NS vRNA segments to an average depth of 8,360 and 936, respectively (Fig. S3). Of the 119,350 and 13,721 reads acquired in each run, 99.6 and 99.1% mapped to influenza rA/Puerto Rico/8/1934 (H1N1) virus, respectively (Table S3), in a roughly even distribution among the eight vRNA segments (Fig. S4) with 100% of the mapped reads in the negative-sense orientation. The distribution of read lengths (Fig. S5 and Table S1) corresponds to the expected lengths of each respective segment. The read level accuracies for the two runs were 85.2 and 83.8%, and the consensus sequences were 98.7 and 98.5% in concordance with the consensus sequence generated using our standardized M-RTPCR^15,16 amplified genome and MiSeq approach (Table S3).

To determine the effectiveness of the modified adapter, total RNA from allantoic fluid (crude) harvested from chicken eggs infected with a genetically defined recombinant virus, rA/Puerto Rico/8/1934 (H1N1) (EID₅₀ 6.8 × 10⁹), was sequenced via MinION. Three independent sequencing experiments each covered 100% of the coding regions of the PB2, PB1, PA, HA, NP, NA, M, and NS gene segments to an average depth of 2,789 ± 752 (Fig. 2) with reduced coverage at the extreme termini (Fig. 3). Since this approach reads from the 3′ to 5′ end of the molecule, there is a heavy coverage bias towards the 3′ terminus of the negative sense RNA. Of the 54,353 ± 15,314 reads, 98.8 ± 0.1% mapped to influenza rA/Puerto Rico/8/1934 (H1N1) virus (Tables 2 and S3) in a roughly even distribution among the eight segments (Fig. S4), with 100% of the mapped reads in the negative-sense orientation. The distribution of read lengths (Fig. 4 and Table S1) corresponds well to the expected length of the respective segment. The read level accuracy was 86.2 ± 0.3%, and the consensus sequence was 98.97 ± 0.01% in concordance with the consensus sequence generated using our standardized multi-segment reverse transcriptase polymerase chain reaction (M-RTPCR)^15,16, Nextera, and MiSeq approach (Tables 2 and S3).
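The 3′ coverage bias follows from how the reads are produced: every read starts at the adapter-ligated 3′ end, so a read that ends early contributes nothing to positions nearer the 5′ end. The Python sketch below illustrates this with made-up read lengths; the function name and the numbers are illustrative assumptions, not data from this study.

```python
def coverage_from_3prime(read_lengths, segment_length):
    """Per-position depth for reads that all begin at the 3' end.

    Index 0 is the 3'-terminal nucleotide of the sequenced RNA;
    index segment_length - 1 is the 5'-terminal nucleotide.
    """
    coverage = [0] * segment_length
    for length in read_lengths:
        # A read cannot extend beyond the end of the molecule.
        for i in range(min(length, segment_length)):
            coverage[i] += 1
    return coverage


# Hypothetical toy case: one full-length read and two truncated reads
# on a 10-nucleotide segment.
toy_coverage = coverage_from_3prime([10, 6, 3], 10)
print(toy_coverage)
```

Under this simplified model, depth can only stay flat or fall when moving away from the 3′ terminus, which matches the direction of the bias described above.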

To compare the accuracy of the consensus sequence generated from direct RNA sequencing, the vRNA segments from the influenza rA/Puerto Rico/8/1934 (H1N1) pure and crude virus preparations were amplified by M-RTPCR^15,16 and sequenced on the Illumina MiSeq. Sequencing of the RNA from purified virus and crude virus produced 163,264 and 143,572 reads, respectively, of which 99.9% mapped to influenza rA/Puerto Rico/8/1934 (H1N1) virus (Tables 2 and S3). The reads were roughly evenly distributed among the eight vRNA segments (Fig. S4). The mapped reads covered 100% of the coding regions of the PB2, PB1, PA, HA, NP, NA, M, and NS vRNA genome segments (Figs 2 and S3) with reduced coverage at the extreme termini (Fig. S6). The read level accuracy was 99.6% and the consensus sequences, which were used as the reference genome for the nanopore assemblies, were defined as 100% accurate and were 100% identical to each other.

Contemporary influenza A viruses

To demonstrate that the adapter targets a region highly conserved among influenza A viruses, we directly sequenced vRNA from four contemporary influenza A viruses: A/Florida/20/2018 (H1N1pdm09), A/Texas/50/2012 (H3N2), A/chicken/Ghana/20/2015 high pathogenic avian influenza (HPAI H5N1) and A/British Columbia/1/2015 low pathogenic avian influenza (LPAI H7N9) (Table 1). Single sequencing experiments demonstrated that the coding complete genomic RNA was sequenced for each of the vRNA segments (PB2, PB1, PA, HA, NP, NA, M, and NS) with an average depth greater than 650 (Figs S7–S10 and Table S3). For these experiments, >96% of reads mapped to the respective influenza A virus genome generated by M-RTPCR and Illumina MiSeq (Table 2). The high percentage of mapped reads from crude lysates indicates that the modified adapter effectively targets a diverse subset of influenza A viruses. All of these contemporary influenza A viruses were sequenced via our standardized M-RTPCR^15,16 amplified genome and MiSeq approach, and the data were deposited in GenBank (NCBI).

Limit of detection

The sensitivity of direct RNA sequencing of influenza A virus was determined through serial dilution of RNA from influenza A/Florida/20/2018 (H1N1pdm09) virus. RNA was extracted and serially diluted fivefold to generate five RNA samples with Ct values of 11.6, 14.2, 17.0, 19.6, and 22.3 (Table S4). RNA was aliquoted and sequenced via MinION in triplicate. While some influenza A/Florida/20/2018 virus reads were detected in the more dilute samples, the most dilute sample that yielded at least 10x coverage and 90% consensus identity (Table S5) had a Ct of 17.0 and a calculated TCID₅₀ of 1.89 × 10⁷. This is well outside the range of most original clinical samples and roughly four orders of magnitude less sensitive than M-RTPCR^15,16.
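As a quick consistency check on the dilution series, the Ct step between successive samples can be compared with the step expected for a fivefold dilution. The sketch below assumes idealized amplification in which product doubles every cycle, so the expected step is log2(5); that assumption and the function names are ours, not part of the reported method.

```python
import math


def expected_delta_ct(fold_dilution):
    """Ct shift per dilution step if product exactly doubles each cycle."""
    return math.log2(fold_dilution)


def observed_delta_ct(ct_values):
    """Differences between successive Ct values, rounded to one decimal."""
    return [round(b - a, 1) for a, b in zip(ct_values, ct_values[1:])]


# Ct values reported for the five serially diluted RNA samples.
reported_cts = [11.6, 14.2, 17.0, 19.6, 22.3]

print(f"expected step: {expected_delta_ct(5):.2f} cycles")
print("observed steps:", observed_delta_ct(reported_cts))
```

The observed steps come out slightly larger than the idealized value. Imperfect amplification efficiency or pipetting variability are possible explanations, but the text does not state a cause.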

Discussion

We have demonstrated, for the first time, coding complete²⁰ sequencing of an RNA virus genome by direct RNA sequencing. Using a method originally designed to sequence mRNA, we adapted the target sequence to bind the 3′ sequence conserved among influenza A viruses. The specificity of this adapter allowed efficient sequencing of influenza rA/Puerto Rico/8/1934 virus RNA genomic segments from RNA isolated from purified virus particles (control) or from RNA isolated from a crude extract that contains a myriad of viral and host (chicken) RNAs. Using this adapter, 98.8% of reads from the crude virus RNA preparation mapped to influenza rA/Puerto Rico/8/1934 virus, which is practically as efficient as with the purified virus RNA sample (99.3%). This performance on crude virus stocks demonstrates that the sequence-directed library preparation is a very effective method to select specific target RNA species among a population of RNAs, as the vast majority of reads were to influenza rA/Puerto Rico/8/1934 virus using 12 ribonucleotides as the target sequence.

The utility of this adapter was demonstrated by directly sequencing RNA from crude stocks of contemporary influenza A/Florida/20/2018 (H1N1pdm09), A/Texas/50/2012 (H3N2), A/chicken/Ghana/20/2015 (H5N1), and A/British Columbia/1/2015 (H7N9) viruses. The adapter was able to target the conserved 3′ termini of this diverse subset of influenza A viruses, as all four were sequenced to coding complete coverage and roughly 98% consensus identity to M-RTPCR and MiSeq results. Moreover, the adapter remained efficient with these diverse viruses, with >96% of reads mapping to their respective influenza A virus genomes.

The data show that further modifications to the adapter could target other RNA species such as RNAs from specific pathogens and different RNA species within a particular pathogen. For example, one could compare (+) sense cRNA [replication intermediate of (−) sense vRNAs], (+) sense mRNAs, or (−) sense RNAs present during RNA virus infections (such as for influenza A viruses). The adapter sequence could be modified to target specific viral families, genera, or species by extending the target sequence and/or by adding degeneracies. This is an advantage over poly(A) methods that have a reduced signal-to-noise ratio due to host mRNA. Targeting influenza A vRNA and cRNA independently may prove difficult as there is complementarity between the two conserved termini of the vRNA segments, and therefore high sequence identity between the 3′ termini of the (−) sense vRNA and (+) sense cRNA. Rather, cRNA and vRNA reads can be sorted based on their (+) and (−) polarity, respectively. Moreover, this technique is highly amenable to sequencing a variety of non-poly-adenylated RNAs from hosts and pathogens, including untranslated regions (UTRs), without biasing the sequence to the primer. This allows the examination of the UTRs in their native form, which we have done here with influenza A virus. Genomic length and quantitative sequencing of viral mRNA species, using unmodified kit components, has the potential to provide direct detection of base modifications, splice variants, and transcriptional changes. By examining (−) sense vRNA, native UTRs, (+) sense cRNA, viral mRNAs, and host mRNAs activated during an influenza infection, one could dissect the viral replication processes and observe changes at a given point in time and under different replication conditions, such as viruses used for vaccine production.
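Sorting reads by polarity, as suggested above, can be done from the alignment alone. The minimal sketch below assumes the reads were aligned to a reference written in the (+) sense (coding) orientation and that the aligner reports standard SAM flags; under that assumption, a direct RNA read aligned to the reverse strand is (−) sense. The function name is illustrative.

```python
REVERSE_STRAND = 0x10  # SAM flag bit: read aligned to the reverse strand
UNMAPPED = 0x4         # SAM flag bit: read is unmapped


def read_polarity(sam_flag):
    """Classify a direct RNA read by the strand it aligned to.

    Assumes the reference sequence is in (+) sense orientation.
    """
    if sam_flag & UNMAPPED:
        return "unmapped"
    if sam_flag & REVERSE_STRAND:
        return "(-) sense"  # e.g. vRNA
    return "(+) sense"      # e.g. cRNA or mRNA


for flag in (0, 16, 4):
    print(flag, read_polarity(flag))
```

Separating cRNA from mRNA among the (+) sense reads would need additional evidence, such as the presence of a poly(A) tail, which this sketch does not attempt.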

The primary limitations of this technology are the high read level error rate and high input material requirements. Reducing the error rate would enable multiplexing and more accurate consensus sequence determination and is a requirement for understanding nucleotide polymorphisms and genome sub-populations, particularly in viruses such as influenza that have significant intra-host diversity and/or base modifications to be identified. There are currently several bioinformatic tools for detecting DNA base modifications such as Tombo, Nanopolish, SignalAlign, and mCaller; however, RNA specific tools have yet to be released¹⁹. Currently, the RNA input requirements for direct RNA sequencing are high and are not physically achievable with most original clinical samples. While we were able to successfully sequence influenza A vRNA using much less input material than is recommended by ONT, direct sequencing of serially diluted influenza A vRNA revealed that this technique is not sensitive enough for most clinical samples and is roughly four orders of magnitude less sensitive than M-RTPCR based sequencing. Hence, direct RNA sequencing is currently limited to cultured viruses. Lessening the RNA input requirement of direct RNA sequencing would take full advantage of its unbiased nature and allow for the detection and description of the rich diversity intrinsic to influenza and other viruses. The continuing effort to advance this technology by ONT will undoubtedly result in higher accuracy reads and greatly improved utility.

Methods

Concentration and purification of A/Puerto Rico/8/1934 reassortant virus

Genetically defined rA/Puerto Rico/8/1934 virus was created by reverse genetics²¹ and propagated in 11 day-old embryonated hen eggs at 35 °C for 48 hours. Allantoic fluid was harvested from the chilled eggs and clarified at 5,400 × g, 10 minutes, 4 °C (Sorvall SLA-1500 rotor). The virus was clarified twice more by centrifugation at 15,000 × g, 5 minutes, 4 °C (Sorvall SLA-1500 rotor). Virus was pelleted by centrifugation at 39,000 × g, 3 hours at 4 °C (Sorvall A621 rotor). Virus pellets were resuspended overnight in PBS and loaded onto a 30%/55% (w/w) density sucrose gradient. The gradient was centrifuged at 90,000 × g for 14 hours at 4 °C (Sorvall AH629 rotor). The virus fractions were harvested and sedimented at 131,000 × g (Sorvall AH629 rotor) for 2.5 hours. The resulting virus pellet was resuspended in PBS and aliquoted for future use.

Propagation of contemporary influenza A viruses

A/Florida/20/2018 (H1N1pdm09), A/Texas/50/2012 (H3N2), and A/British Columbia/1/2015 were propagated in MDCK cells. A/chicken/Ghana/20/2015 was propagated in embryonated hen eggs and harvested as an E1/E3 passage.

RNA isolation

Enolase II (YHR174W) mRNA is supplied in the ONT materials as the RNA calibration strand (RCS) at a concentration of 50 ng/µL. For influenza A virus samples, total RNA was isolated by Invitrogen™ TRIzol® extraction²² according to the manufacturer’s instructions with additional considerations for biosafety. The virus was inactivated by the addition of 10 volumes of TRIzol® in a Biosafety Level 2 biosafety cabinet. Influenza A/British Columbia/1/2015 (LPAI H7N9) and A/chicken/Ghana/20/2015 (HPAI H5N1) viruses were inactivated by the addition of 3 volumes of TRIzol® in a Biosafety Level 3 enhanced laboratory before removal. Following inactivation, a fume hood was used for the chloroform addition and aqueous phase removal steps. RNA pellets were resuspended in 10–40 µL nuclease free water and quantified by Quant-iT™ RiboGreen® RNA Assay Kit or a Qubit™ RNA Assay Kit. Due to the difficulty in acquiring sucrose-purified material, the pure controls were limited to one MiSeq run and two separate MinION experiments. RNA from influenza A/Florida/20/2018 (H1N1pdm09) virus was diluted serially and aliquoted for determining the limit of detection.

Nanopore Sequencing

The ONT direct RNA library preparation input material requirement is 500 ng of target molecule in a 9.5 µL volume (Table S6). For mRNA sequencing of the enolase control, the protocol was used according to the manufacturer’s instructions. For influenza vRNA sequencing, modifications were made to the protocol components (Table S2). We replaced the T₁₀ overhang (T_m ~ 20 °C) of the supplied reverse transcriptase adapter (RTA), which targets ligation of the RTA to mRNA, with 12 nucleotides complementary to the conserved 3′ end of influenza A vRNA²³ (Fig. 1). RTA-U12 and RTA-U12.4 contained the target sequences (5′ to 3′) AGC AAA AGC AGG and AGC GAA AGC AGG (T_m ~ 50 °C), respectively, and were combined in a 2:3 molar ratio to a total concentration of 1.4 µM. This mixture was used as a direct replacement for the RTA supplied in the protocol for influenza A vRNA samples. Though there is some disagreement regarding the segment specific degeneracies of the 12 nucleotides at the 3′ end of the genome, RTA-U12 is expected to target the segments PA, NP, M, and NS, and RTA-U12.4 is expected to target the segments PB2, PB1, HA, and NA^24,25. For the pure, crude, and contemporary virus experiments, 10 µL of vRNA was ligated to 1 µL of RTA-U12. For the LOD experiment, which also used influenza A (H1N1pdm09) virus, 9 µL of vRNA and 0.5 µL of 50 ng/µL enolase mRNA were combined and ligated to 1 µL of RTA-U12 and 1 µL of the stock RTA.
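The adapter design can be checked with a few lines of Python: the reverse complement of each 12-nucleotide target sequence gives the vRNA 3′ terminus it is expected to base-pair with, and the 2:3 molar ratio fixes the concentration of each adapter in the 1.4 µM mixture. This is a sketch; the helper names are ours, and the ratio is read as RTA-U12 : RTA-U12.4, which is an assumption based on the order in which the adapters are listed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def vrna_target(adapter_dna):
    """RNA sequence (5'->3') that the adapter overhang would base-pair with."""
    return reverse_complement(adapter_dna).replace("T", "U")


def split_by_ratio(total, parts):
    """Divide a total concentration among components in the given ratio."""
    whole = sum(parts)
    return [total * p / whole for p in parts]


# Target sequences from the text, written without spaces.
adapters = {"RTA-U12": "AGCAAAAGCAGG", "RTA-U12.4": "AGCGAAAGCAGG"}
for name, seq in adapters.items():
    print(f"{name}: 5'-{seq}-3' pairs with vRNA 5'-...{vrna_target(seq)}-3'")

u12_um, u12_4_um = split_by_ratio(1.4, [2, 3])
print(f"RTA-U12 {u12_um:.2f} uM, RTA-U12.4 {u12_4_um:.2f} uM")
```

The two targets differ only at the fourth nucleotide from the vRNA 3′ end (U versus C), which is the segment-specific position the two adapters are meant to cover.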

Adapter ligated RNA was directly sequenced on the MinION nanopore sequencer using a FLO-MIN107 flowcell equipped with the R9 chemistry. The enolase sequencing experiments were operated through MinKNOW versions 1.4.2, 1.7.7, and 1.10.11; the pure sequencing experiments were operated through MinKNOW 1.7.7; the crude, contemporary virus, and LOD 1 sequencing experiments were operated through MinKNOW 1.10.11; and the LOD 2–5 sequencing experiments were operated through MinKNOW 2.1. Raw data was basecalled using Albacore 2.1.10 (released 01/26/2018), and reads were assembled using IRMA²⁶ with the FLU-MinION preset configuration to produce influenza A virus consensus sequences for comparison to MiSeq-derived consensuses. The FLU-MinION preset differs from the default FLU module settings by the following: dropping the median read Q-score filter from 30 to 0, raising the minimum read length from 125 to 150, raising the frequency threshold for insertion and deletion refinement from 0.25 to 0.75 and from 0.6 to 0.75, respectively, and lowering the Smith-Waterman mismatch penalty from 5 to 3 and the gap open penalty from 10 to 6. For read-level comparisons of MinION to MiSeq, raw fastqs from both sequencing platforms were mapped with the bwa-mem v.0.7.7 algorithm²⁷ to MiSeq + IRMA derived consensus sequences as references. Bwa-mem settings were left at their defaults except for the following arguments: “-A 2” and “-B 3”. Figures and tables were created in Tableau v.10.4.3.
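For readers who want to reproduce the read-level mapping, the bwa-mem call described above can be assembled as follows. This is a sketch only: the file names are hypothetical placeholders, the reference is assumed to have been indexed beforehand with `bwa index`, and the command is built but not run here. In bwa-mem, `-A` is the matching score and `-B` the mismatch penalty (documented defaults 1 and 4).

```python
import shlex


def bwa_mem_command(reference_fasta, reads_fastq):
    """Build a bwa mem command with the two non-default arguments
    named in the text; all other settings stay at bwa's defaults."""
    return [
        "bwa", "mem",
        "-A", "2",  # matching score
        "-B", "3",  # mismatch penalty
        reference_fasta,
        reads_fastq,
    ]


# Hypothetical file names, for illustration only.
command = bwa_mem_command("miseq_irma_consensus.fasta", "minion_reads.fastq")
print(shlex.join(command))
```

bwa mem writes SAM records to standard output, so in practice the output would be redirected to a file or piped into a downstream tool.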

Error rates were calculated against the aligned plurality consensus sequence as follows:

Accuracy rate = 1 − (average number of insertions, deletions, and minority alleles) / (sum of aligned bases + number of deletions and insertions at the left-adjacent (upstream or 5′ to the site) base), per position per segment.
Insertion rate = (average number of insertions, irrespective of insertion length) / (sum of aligned bases + number of insertions at the left-adjacent base), per position per segment.
Deletion rate = (average number of deletions, irrespective of deletion length) / (sum of aligned bases + number of deletions at the left-adjacent base), per position per segment.
Substitution rate = (average number of minority bases) / (sum of aligned bases), per position per segment.
Alignment read lengths were calculated as matching + inserted bases per read (CIGAR M + I).
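The definitions above can be sketched in Python. The per-position calculation below is one reading of the stated formulas (the numerator and denominator groupings are our interpretation), and the class, field, and function names are illustrative rather than taken from the analysis code used in the study. The second function computes the alignment read length from a SAM CIGAR string as M + I.

```python
import re
from dataclasses import dataclass

CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


@dataclass
class PositionCounts:
    aligned: int     # bases aligned at this position
    minority: int    # aligned bases differing from the plurality consensus
    insertions: int  # insertion events at the left-adjacent base
    deletions: int   # deletion events at the left-adjacent base


def position_rates(p):
    """Error rates at one position, following the definitions in the text."""
    indels = p.insertions + p.deletions
    return {
        "accuracy": 1 - (indels + p.minority) / (p.aligned + indels),
        "insertion": p.insertions / (p.aligned + p.insertions),
        "deletion": p.deletions / (p.aligned + p.deletions),
        "substitution": p.minority / p.aligned,
    }


def alignment_read_length(cigar):
    """Matching plus inserted bases per read (CIGAR M + I)."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op in "MI")


# Hypothetical counts and CIGAR string, for illustration only.
example = position_rates(PositionCounts(aligned=90, minority=5, insertions=5, deletions=5))
print(example)
print(alignment_read_length("5S100M2I50M3D20M"))
```

Per-segment values would then be obtained by averaging the per-position rates over all positions in a segment. Soft-clipped bases (S) and deletions (D) do not count toward the alignment read length.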

Illumina MiSeq Sequencing

The coding complete influenza A virus genome was amplified from the RNA of all viral samples. The M-RTPCR used the Uni/Inf primer set¹⁶ with SuperScript III One-Step RT-PCR with Platinum Taq High Fidelity (Invitrogen). Following amplification, indexed paired-end libraries were generated from 2.5 µL of amplified product at 0.2 ng/µL using the Nextera XT Sample Preparation Kit (Illumina) following the manufacturer’s protocol with half-volume tagmentation reactions. Libraries were purified with 0.8X AMPure XP beads (Beckman Coulter, Inc.), assessed for fragment size (QIAxcel Advanced System, Qiagen), and quantitated using the Quant-iT dsDNA High Sensitivity Assay (Invitrogen). Six pmol of pooled libraries were sequenced on the Illumina MiSeq with a MiSeq v2 300 cycle kit and a 5% PhiX spike-in to increase the sequence diversity. Sequence analysis was performed using IRMA²⁶ as part of the current Illumina-based pipeline utilized by the Influenza Genomics Team at the Centers for Disease Control and Prevention.

Data Availability

Sequence data is accessioned at NCBI: PRJNA449380.

Change history

19 October 2018
A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has been fixed in the paper.

References

1. Peattie, D. A. Direct chemical method for sequencing RNA. Proc Natl Acad Sci USA 76, 1760–1764 (1979).
2. Wang, Z., Gerstein, M. & Snyder, M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10, 57–63 (2009).
3. Cocquet, J., Chong, A., Zhang, G. & Veitia, R. A. Reverse transcriptase template switching and false alternative transcripts. Genomics 88, 127–131 (2006).
4. Roy, S. W. & Irimia, M. When good transcripts go bad: artifactual RT-PCR ‘splicing’ and genome analysis. Bioessays 30, 601–605 (2008).
5. Haddad, F., Qin, A. X., Giger, J. M., Guo, H. & Baldwin, K. M. Potential pitfalls in the accuracy of analysis of natural sense-antisense RNA pairs by reverse transcription-PCR. BMC Biotechnol 7, 21 (2007).
6. Ebhardt, H. A. et al. Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications. Nucleic Acids Res 37, 2461–2470 (2009).
7. Nordgård, O., Kvaløy, J. T., Farmen, R. K. & Heikkilä, R. Error propagation in relative real-time reverse transcription polymerase chain reaction quantification models: The balance between accuracy and precision. Anal Biochem 356, 182–193 (2006).
8. Ozsolak, F. et al. Direct RNA sequencing. Nature 461, 814–818 (2009).
9. Andrewes, C. H., Bang, F. B. & Burnet, F. M. A short description of the Myxovirus group (influenza and related viruses). Virology 1, 176–184 (1955).
10. Le Clerc, J. Action of ribonuclease on the multiplication of the influenza virus. Nature 177, 578–579 (1956).
11. Pons, M. W. Studies on influenza virus ribonucleic acid. Virology 31, 523–531 (1967).
12. Air, G. M. Nucleotide sequence coding for the “signal peptide” and N terminus of the hemagglutinin from an Asian (H2N2) strain of influenza virus. Virology 97, 468–472 (1979).
13. Air, G. M. Sequence relationships among the hemagglutinin genes of 12 subtypes of influenza A virus. Proc Natl Acad Sci USA 78, 7639–7643 (1981).
14. Desselberger, U., Racaniello, V. R., Zazra, J. J. & Palese, P. The 3′ and 5′-terminal sequences of influenza A, B and C virus RNA segments are highly conserved and show partial inverted complementarity. Gene 8, 315–328 (1980).
15. Zhou, B. et al. Single-reaction genomic amplification accelerates sequencing and vaccine production for classical and swine origin human influenza a viruses. J Virol 83, 10309–10313 (2009).
16. Zhou, B. & Wentworth, D. E. In Influenza Virus: Methods and Protocols (eds Kawaoka, Y. & Neumann, G.) 175–192 (Humana Press, 2012).
17. Zhao, J. et al. Nanomicroarray and multiplex next-generation sequencing for simultaneous identification and characterization of influenza viruses. Emerg Infect Dis 21, 400–408 (2015).
18. Wang, J., Moore, N., Deng, Y.-M., Eccles, D. & Hall, R. MinION nanopore sequencing of an influenza genome. Front Microbiol 6 (2015).
19. Garalde, D. R. et al. Highly parallel direct RNA sequencing on an array of nanopores. Nat Methods (2018).
20. Ladner, J. T. et al. Standards for sequencing viral genomes in the era of high-throughput sequencing. mBio 5 (2014).
21. Johnson, A. et al. Identification of influenza A/PR/8/34 donor viruses imparting high hemagglutinin yields to candidate vaccine viruses in eggs. PLoS ONE 10, e0128982 (2015).
22. Chomczynski, P. A reagent for the single-step simultaneous isolation of RNA, DNA and proteins from cell and tissue samples. Biotechniques 15, 532–534, 536–537 (1993).
23. Hoffmann, E., Stech, J., Guan, Y., Webster, R. & Perez, D. Universal primer set for the full-length amplification of all influenza A viruses. Arch Virol 146, 2275–2289 (2001).
24. Ma, J. et al. Impact of the segment-specific region of the 3′-untranslated region of the influenza A virus PB1 segment on protein expression. Virus Genes 47, 429–438 (2013).
25. Widjaja, I., de Vries, E., Rottier, P. J. M. & de Haan, C. A. M. Competition between Influenza A Virus Genome Segments. PLoS ONE 7, e47529 (2012).
26. Shepard, S. S. et al. Viral deep sequencing needs an adaptive approach: IRMA, the iterative refinement meta-assembler. BMC Genomics 17, 708 (2016).
27. Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).

Acknowledgements

Research reported in this publication was supported by the Office of Advanced Molecular Detection (AMD CAN 939018 C) at the Centers for Disease Control and Prevention. We thank the Oxford Nanopore Technologies technical support team, Bryant Catano in particular, for the recovery of QC data from early sequencing experiments.

Author information

Matthew W. Keller and Benjamin L. Rambo-Martin contributed equally

Authors and Affiliations

Oak Ridge Institute of Science and Education (ORISE), Oak Ridge, Tennessee, USA
Matthew W. Keller
Battelle Memorial Institute, Atlanta, Georgia, USA
Benjamin L. Rambo-Martin, Malania M. Wilson & Callie A. Ridenour
Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, Georgia, USA
Samuel S. Shepard, Thomas J. Stark, Elizabeth B. Neuhaus, Vivien G. Dugan, David E. Wentworth & John R. Barnes

Authors

Matthew W. Keller
Benjamin L. Rambo-Martin
Malania M. Wilson
Callie A. Ridenour
Samuel S. Shepard
Thomas J. Stark
Elizabeth B. Neuhaus
Vivien G. Dugan
David E. Wentworth
John R. Barnes

Contributions

D.W. and J.B. conceived the research. M.K., M.W. and C.R. conducted the experiments. M.K., B.R.-M., T.S. and S.S. analyzed the results. B.R.-M. accessioned the raw data. M.K., B.R.-M., M.W., C.R., S.S., T.S., E.N., V.D., D.W. and J.B. edited the manuscript.

Corresponding author

Correspondence to John R. Barnes.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Figures and Table Legends

Dataset 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Keller, M.W., Rambo-Martin, B.L., Wilson, M.M. et al. Direct RNA Sequencing of the Coding Complete Influenza A Virus Genome. Sci Rep 8, 14408 (2018). https://doi.org/10.1038/s41598-018-32615-8

Received: 23 April 2018
Accepted: 05 September 2018
Published: 26 September 2018
DOI: https://doi.org/10.1038/s41598-018-32615-8
