Barley RNA viromes in six different geographical regions in Korea

Jo, Yeonhwa; Bae, Ju-Young; Kim, Sang-Min; Choi, Hoseong; Lee, Bong Choon; Cho, Won Kyong

doi:10.1038/s41598-018-31671-4

Download PDF

Article
Open access
Published: 05 September 2018

Barley RNA viromes in six different geographical regions in Korea

Yeonhwa Jo¹,
Ju-Young Bae²,
Sang-Min Kim²,
Hoseong Choi¹,
Bong Choon Lee² &
…
Won Kyong Cho ORCID: orcid.org/0000-0002-8416-5173¹

Scientific Reports volume 8, Article number: 13237 (2018) Cite this article

2000 Accesses
20 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Barley is a kind of cereal grass belonging to the family Poaceae. To examine viruses infecting winter barley in Korea, we carried out a comprehensive study of barley RNA viromes using next-generation sequencing (NGS). A total of 110 barley leaf samples from 17 geographical locations were collected. NGS followed by extensive bioinformatics analyses revealed six different barley viromes: Barley yellow mosaic virus (BaYMV), Barley mild mosaic virus (BaMMV), Barley yellow dwarf virus (BYDV), Hordeum vulgare endornavirus (HvEV), and Barley virus G (BVG). BaYMV and HvEV were identified in all libraries, while other viruses were identified in some specific library. Based on the number of virus-associated reads, BaYMV was a dominant virus infecting winter barley in Korea causing yellow disease symptoms. We obtained nearly complete genomes of six BaYMV isolates and two BaMMV isolates. Phylogenetic analyses indicate that BaYMV and BaMMV were largely grouped based on geographical regions such as Asia and Europe. Single nucleotide polymorphisms analyses suggested that most BaYMV and BaMMV showed strong genetic variations; however, BaYMV isolate Jeonju and BaMMV isolate Gunsan exhibited a few and no SNPs, respectively, suggesting low level of genetic variation. Taken together, this is the first study of barley RNA viromes in Korea.

Evolutionary study of maize dwarf mosaic virus using nearly complete genome sequences acquired by next-generation sequencing

Article Open access 22 September 2021

Blueberry red ringspot virus genomes from Florida inferred through analysis of blueberry root transcriptomes

Article Open access 21 July 2020

Sweet potato viromes in eight different geographical regions in Korea and two different cultivars

Article Open access 13 February 2020

Introduction

Barley (Hordeum vulgare L.) is a kind of cereal grass belonging to the family Poaceae. Since its first cultivation in temperate regions in Eurasia about 10,000 years ago, barley has been used as animal feed and fermentable materials for beer and whisky production¹. Barley is the fourth biggest import crop, after maize, rice, and wheat, based on production. In Korea, winter barley is usually cultivated and is mostly consumed as polished barley mixed with rice; however, it can also be used as materials for bread, noodle, pastes, tea, beverage, and oils.

Barley is susceptible to diverse pathogens, including bacteria, fungi, and viruses. For example, the fungus Pyrenophora teres Drechsler, causing net blotch of barley, is a major disease in barley-growing regions². Furthermore, soil-borne Fusarium species causing Fusarium head blight results in high yield losses³.

To date, several viruses infecting barley have been identified (Table 1). Of identified viruses infecting barley, the two closely related ones, Barley yellow mosaic virus (BaYMV) and Barley mild mosaic virus (BaMMV), which are members of the genus Bymovirus in the family Potyviridae, are well known. Both are important viruses infecting barley, causing yellow mosaic disease and leading to serious yield loss⁴. Both BaYMV and BaMMV are single-strand viruses, and their genomes are composed of two RNA segments, RNA1 and RNA2^5,6,7,8. Both soil-borne BaYMV and BaMMV are transmitted by the fungal vector Polymyxa graminis^9,10.

Table 1 List of known major viruses infecting barley.

Full size table

Barley yellow dwarf virus (BYDV) is a single-stranded RNA virus in the genus Luteovirus in the family Luteoviridae causing yellowing symptoms¹¹. BYDV presenting in the phloem of the infected plants is transmitted by aphids, and several serotypes of BYDV are classified based on the aphid vector¹².

Barley stripe mosaic virus (BSMV) in the genus Hordeivirus usually infects two monocot crops such as barley and wheat; however, BSMV infects more than 250 plant species¹³. In addition, BSMV based vector is widely used for virus-induced gene silencing (VIGS) in barley and wheat¹⁴. Hordeum mosaic virus (HoMV) in the genus Rymovirus was initially isolated from barley in Alberta in Canada and infects plants in the family Poaceae such as wheat, oat, and rye¹⁵.

Several studies to identify genes conferring resistance to two bymoviruses have been reported. For example, two resistance genes, rym4 and rym5, are known to be effective against a barley yellow disease complex composed of BaMMV and BaYMV¹⁶. However, a previous study has identified a new BaYMV strain capable of breaking rym4-associated resistance in barley in Belgium¹⁷. In addition, The barley accession PI1963 carrying the rym11 gene confers resistance against all European strains of barley yellow mosaic disease^18,19.

Recent rapid advances of next-generation sequencing (NGS) facilitate the identification of novel plant viruses and assembly of viral genomes^20,21,22. Several different kinds of libraries have been prepared by using small RNAs, double-stranded (ds) RNAs, mRNAs, and ribosomal RNA-depleted total RNAs for NGS. For example, complete genome sequence of Hordeum vulgare endornavirus (HvEV), which has a dsRNA genome, has been obtained from dsRNA extraction followed by NGS using MiSeq²³. In addition, complete genome sequence of Barley yellow striate mosaic virus (BYSMV) in the genus Cytorhabdovirus was assembled by small RNA sequencing followed by Sanger-sequencing²⁴. Moreover, RNA-sequencing followed by Sanger-sequencing revealed complete genome sequence of Barley virus G (BVG) in the genus Polerovirus²⁵. Furthermore, a recent study has revealed a diversity of bymoviruses in barley in France using NGS and Sanger-sequencing methods²⁶.

In order to examine viruses infecting barley in Korea, we carried out a comprehensive study of barley RNA viromes using NGS. For that, we collected 110 barley samples from 17 geographical locations in six different provinces. Samples were pooled based on the collected provinces and used for library preparation. Extensive bioinformatics analyses revealed six different barley viromes in Korea.

Results

Collection of barley samples and library preparation

We collected 110 barley samples from 17 geographical locations (Table 2). Collected barley leaves showed yellow mosaic and dwarf disease symptoms (Fig. 1). Samples were pooled for total RNA extraction and library preparation based on province of collection. Six different libraries were prepared. For example, seven samples collected from Yeonggwang were assigned as library A, while 12 samples from Yeongduk and Daegu were assigned as library F. We conducted paired-end sequencing for six different libraries by HiSeq2000 system. The number of obtained reads ranged from 19,751,300 (library B) to 30,607,568 (library D) (Table 3). Obtained raw read sequences from each library were subjected to de novo transcriptome assembly using two different assemblers, Trinity and Velvet. The number of obtained contigs ranged from 62,074 (library B) to 250,090 (library D) by Trinity, whereas the number of contigs ranged from 997,502 (library B) to 4,103,731 (library D) (Table 4). In general, compared to Trinity, Velvet assembler produces a large number of contigs with short read length. Furthermore, the number of assembled contigs was much higher compared to other dicot plants, since the barley has a large genome.

Table 2 Detailed information for name of library, sample location, and number of samples in each library.

Full size table

Table 3 Summary of paired-end sequencing results for barley viromes using HiSeq2000 system.

Full size table

Table 4 Summary of de novo assembly by Trinity and Velvet assemblers.

Full size table

Identification of viruses infecting barley

The contigs obtained by Trinity and Velvet as well as raw sequence reads in each library were blasted against a viral reference database to identify viruses infecting barley. From contigs assembled by Trinity, the number of virus-associated contigs ranged from 16 (libraries B, D, E) to 36 (library F) (Fig. 2a and Table S1). The number of virus-associated contigs assembled by Velvet was very high as compared to those of Trinity, ranging from 49 contigs (library A) to 139 (library D). The number of raw sequence reads ranged from 70,727 (library D) to 1,236,749 (library B). The virus-associated contigs were matched to BaYMV, BaMMV, BYDV, HvEV, Barley virus G (BVG), and Valsa ceratosperma hypovirus 1 (VcHV1) (Table S1). Based on number of contigs assembled by Trinity, BaYMV (50 contigs) was the dominant virus followed by BYDV (38 contigs), BaMMV (23 contigs), and HvEV (21 contigs) (Fig. 2b). In contrast, HvEV (217 contigs) was the dominant virus followed by BaYMV (244 contigs) and BaMMV (36 contigs) based on number of contigs assembled by Velvet. Based on the number of raw sequence reads, most virus-associated reads were derived from RNA1 (1,283,996 reads) and RNA2 (796,777 reads) of BaYMV.

Next, we calculated the proportion of virus-associated contigs and reads in each library (Fig. 2c). The proportion of virus-associated contigs assembled by Trinity was very low, ranging from 0.006% (library D) to 0.033% (library F), and the proportion of virus-associated contigs assembled by Velvet ranged from 0.003% (library E) and 0.006% (library B). However, the proportion of virus-associated reads in each library was increased, ranging from 0.231% (library D) to 6.262% (library B).

Distribution of identified viruses based on geographical regions

We examined the geographical distribution of identified viruses (Fig. 3). We found that the partial sequence of VcHV1 was derived from barley host gene. Therefore, VcHV1 was excluded for further analysis. In total, we identified five viruses infecting barley. The identified viruses in each library were diverse. For example, we identified three viruses, including BaMMV, BaYMV, and HvEV, from library A, while BYDV, BaYMV, and HvEV were identified from library C. BaYMV and HvEV, identified from all six libraries, were the most common viruses infecting barley, followed by BaMMV (identified from five libraries). BVG was identified from libraries D and E, whereas BYDV was identified from libraries C, D, F.

Calculation of virus accumulation in each library

We calculated virus accumulation in each library based on read number and copy number. Copy number was calculated by total read number divided by each viral genome size (Fig. 4). In all libraries, BaYMV was the dominant virus based on read number and copy number. Size of BaYMV RNA1 is greater than that of BaYMV RNA2. Therefore, the read number of BaYMV RNA1 is higher than that of BaYMV RNA2. However, based on copy number, the proportions of BaYMV RNA1 and RNA2 were very similar in libraries A and C. Furthermore, the proportion of BaMMV RNA1 and RNA2 was very similar in library A.

De novo genome assembly and calculation of mutation rates

Using virus-associated contigs, we obtained six nearly complete genomes of BaYMV from six libraries (Fig. 5) and two BaMMV from libraries A and B (Fig. 6). Both viruses are composed of two RNA segments. The sizes of assembled BaYMV RNA1 ranged from 7,630 nucleotides (nt) (library F) to 7,644 nt (library C), while the size of assembled BaYMV RNA2 ranged from 3,537 nt (library B) to 3,584 nt (libraries D and E) (Fig. 5a–f). In order to analyze the SNP of each identified virus within each library, we mapped raw sequence reads on the assembled RNA genome. As shown in Fig. 5a–f, the number of reads mapped on each virus genome was sufficient to cover the nearly complete viral genome. We identified SNPs of BaYMV in each library (Fig. 5g). The numbers of identified SNPs were diverse among six libraries, for example ranging from three (library D) to 341 (library A) for BaYMV composed of two RNA segments. In case of BaYMV in libraries A and E, the number of identified SNPs for RNA1 was much higher than that for RNA2. However, the difference in identified SNP number between RNA1 and RNA2 was not significant for BaYMV in libraries C and F. Mutation rates of BaYMV ranged from 0.01 to 3.8 (Fig. 5h).

The sizes of assembled BaMMV RNA1 were 7,269 (library A) and 7,263 nt (library B), whereas the sizes of assembled BaMMV RNA2 were 3,523 nt (library A) and 3,524 nt (library B) (Fig. 6a,b). Interestingly, SNPs of BaMMV were only identified from library A while there was no SNP in BaMMV from library B. The numbers of SNPs for BaMMV from library A were 337 (BaMMV RNA1) and 44 (BaMMV RNA2), and their mutation rates were 4.6% and 1.2%, respectively.

Phylogenetic relationships of BaYMV and BaMMV

Based on assembled genome sequences for BaYMV and BaMMV, we generated phylogenetic trees. Since each virus is composed of two RNA fragments, two independent phylogenetic trees for each virus were generated (Fig. 7). The two phylogenetic trees using BaYMV RNA1 and RNA2 sequences showed that all six BaYMV isolates from this study were included in group A (Fig. 7a,b). Based on BaYMV RNA1 and RNA2 sequences, four isolates from libraries A, B, D, and E were grouped in the same clade with BaYMV isolate K05 from Kurashiki in Japan. The two isolates Goseong and Daegu, from libraries C and F, respectively, were closely related. In addition, BaYMV isolate Goseong showed sequence similarity to BaYMV strain III from Tochigi in Japan based on BaYMV RNA1 sequences. Interestingly, group A contains BaYMV isolates from three countries, China, Japan, and Korea, while group B contains BaYMV isolates from the United Kingdom and Germany.

Phylogenetic trees based on BaMMV RNA1 and RNA2 sequences also displayed two different groups of BaMMV isolates (Fig. 7c,d). The two isolates Gunsan and Yeonggwang were clustered together in group B. Group A again contains BaMMV isolates from three countries, Germany, France, and the United Kingdom, while Group B includes BaMMV isolates from Japan and Korea.

Confirmation of NGS results by RT-PCR

In order to confirm infection of identified viruses by NGS, we conducted RT-PCR using newly designed primers. To increase reliability of RT-PCR, we designed at least two different primer-pairs for each RNA fragment of four viruses, BaYMV, BaMMV, HvEV, and BVG (Fig. 8a and Table S2). Using four different primer-pairs for BaYMV, RT-PCR results clearly showed that all six libraries contained BaYMV sequences (Fig. 8b). In case of BaMMV, 285-bp PCR products were amplified in all six libraries, while 507-bp and 703-bp PCR products were not amplified from the library C. Furthermore, very weak bands were detected in libraries D, E, and F using primer-pairs amplifying 507-bp, 703-bp, and 347-bp of BaMMV sequences. Two primer-pairs of HvEV successfully amplified two PCR products (732-bp and 565-bp) from all six libraries. In addition, RT-PCR results showed that BVG was identified only from libraries D and E.

Discussion

Recently, several NGS-based studies have been conducted for viruses infecting barley. For instance, dsRNA extraction followed by NGS was used to determine the complete genome sequences of HvEV, which is composed of dsRNA²³. A recent study has examined genetic diversity of bymoviruses infecting barley in France using NGS and Sanger-sequencing, revealing that BaYMV-2 was responsible for the symptoms observed in varieties carrying the resistance gene rym4.

As compared to the previous studies associated with viruses infecting barley, our study focuses on barley RNA viromes associated with identification of viruses infecting barley, as well as geographical distribution and genetic diversity of identified viruses in Korea. We successfully identified several viruses (BaYMV, BaMMV, BYDV, BVG, and HvEV) by NGS followed by bioinformatics analyses. In addition, our study found that there were at least three different serotypes of BYDV infecting barley in Korea. Infection of BYDV strains PAV and MAV in barley and wheat have been reported in Korea; however, BYDV strains GAV and PAS infecting barley have not been reported in Korea. Moreover, we identified HvEV infecting barley in Korea for the first time. Prior to conducting barley virome study, we examined infection of three major viruses, BaYMV, BaMMV, and Soil-borne wheat mosaic virus (SBWMV), by RT-PCR using individual barley samples collected from the same major winter barley cultivation regions²⁷. However, we did not identify SBWMV in our study. Moreover, BSMV was not identified in our study. BSMV has been identified in a wide range of areas in the world, including North Africa, North America, Europe, and Asia²⁸. Thus, it is necessary to prevent the introduction of BSMV to avoid causing a serious loss of barley yield in Korea.

Although NGS is now generally used in many research areas, however, the price of NGS is still high for several individual samples in parallel. For the virome study of fruit trees that are mechanically infected by viruses, the library preparation using individual fruit trees followed by NGS is useful to decipher the viromes of individual fruit cultivars²⁹. On the other hand, seed propagated plants are usually transmitted by insect vectors; therefore, it is efficient to pool samples for the identification of viruses infecting a plant species in different geographical regions. Accordingly, 110 samples from 17 geographical regions were pooled according to six different provinces. As we expected, each library contains a different list of viruses infecting barley, suggesting that geographical region is an important factor for the composition of viruses in each virome. However, several problems can be generated by pooling of samples which is a cost-effective approach in the virome study. For example, as discussed previously³⁰, the misalignment of sequence reads could have occurred due to the presence of different virus isolates and the improper reference genome, although we used a consensus genome sequence assembled from NGS data. In fact, NGS is an effective method for identifying viral recombination³¹. However, the pooling of samples might interfere with the correct interpretation of recombination events due to a possible mixture of multiple virus isolates. Thus, our results might contain such problems.

There is still a dispute over the optimal method for plant virome study. A recent study has compared two different libraries using as small RNAs and ribosomal RNA-depleted total RNA, respectively, for NGS-based virus identification in plants²⁰. Based on their results, the yield of viral sequences was dependent on each viral genome organization. In our study, we prepared cDNA libraries for NGS using oligo-dT primers in order to facilitate identification of two bymoviruses containing poly-A tail: BaYMV and BaMMV. However, it was not surprising that viruses composed of dsRNAs can be also be identified by mRNA libraries, as shown in other previous studies^32,33.

Viruses most commonly infecting barley were BaYMV and HvEV, while other viruses including BaMMV, BYDV, and BVG were identified in specific libraries. Based on read number and copy number associated with identified viruses, BaYMV was the dominant virus in all six libraries. This result suggests that BaYMV could be a major virus in barley grown in Korea, which might be associated with yellow mosaic symptoms. Similarly, our previous study using RT-PCR also showed a high infection rate of single BaYMV in barley in Korea²⁷. Furthermore, double infection of BaYMV and BaMMV was also confirmed in the barley samples of libraries A and E by NGS, as shown in the previous study by RT-PCR²⁷. In particular, RT-PCR confirmed that all seven barley samples in Yeonggwang (library A) were co-infected by BaYMV, BaMMV, and HvEV (data not shown). HvEV is vertically transmitted by pollen and ovule infection³⁴ and appears to be a non-pathogenic virus like other known endornaviruses that infect plants. Thus, the infection rate of HvEV in cultivated barley may be very high due to vertical transmission via seeds.

For four viruses, we conducted RT-PCR with two independent primer-pairs. Read numbers and RT-PCR results for each virus give the level of virus accumulation in each library. Except for HvEV, read number from NGS was correlated with band intensity from RT-PCR. In case of HvEV, the read number from NGS was low in all six libraries; however, the intensity of amplified RT-PCR products was high. Since HvEV is composed of dsRNA, it cannot be well recovered by an mRNA library using oligo-dT primer followed by NGS. Therefore, the quantification of virus accumulation should be carefully assessed by a combination of several methods.

Size of RNA fragment was an important factor to estimate virus accumulation by NGS. In general, a large number of virus-associated reads could be obtained from a large genome fragment of a virus. In case of BaYMV and BaMMV, which each consist of two RNA fragments, the number of reads from RNA1 was always higher than that from RNA2, since it is bigger than RNA1. In contrast, the copy number of BaYMV RNA2 was slightly higher than that of BaYMV RNA1, although we hypothesized that the copy numbers for RNA1 and RNA2 might be equal.

Due to the presence of poly-A tail in two bymoviruses, reads associated with two viruses were high enough to assemble two genomes de novo. Six BaYMV genomes and two BaMMV genomes were assembled by NGS followed by two de novo genome assemblers, Trinity and Velvet. The advantages and disadvantage for these two assemblers for virome study have previously been described³². Using assembled genomes, we studied phylogenetic relationships of BaYMV and BaMMV. Phylogenetic trees showed two distinct groups, which were divided by geographical region, Europe and Far East Asia. Historically, many barley plants in Korea have been collected by the Japanese during Japanese occupation, and many of them have been reintroduced to Korea³⁵. Therefore, all Korean BaYMV and BaMMV isolates are in the same clade with Japanese isolates. Among five BaYMV isolates, four isolates showed sequence similarity to the known Japan strain K05. The BaYMV strain K05 was used as a template for the construction of infectious cDNA clones of a BaYMV leading to yellow mosaic disease in winter barley³⁶. Based on those results, we suppose that BaYMV infecting barley in Korea that is closely related to strain K05 could be a major virus causing yellow mosaic disease in winter barley in Korea. Moreover, two BaMMV isolates were grouped in the known Japanese BaMMV strain Na1, which showed pathogenicity in different barley cultivars³⁷, suggesting their possible pathogenicity.

In our study, the collected samples were pooled for the library preparation. The advantages of pooling samples might be the reduction of NGS cost and increase in the possibility of identifying viruses at a time. However, the disadvantage of pooling samples is that we cannot obtain detailed information on the viruses infecting a single plant. In general, a single plant is also frequently infected by diverse viruses or different isolates of a virus. Although we examined the SNPs of an individual assembled virus in our study, the SNP information did not represent the individual virus, but a collection of several isolates due to pooling samples. For example, SNPs for BaYMV and BaMMV using NGS data showed high mutation rates because pooled samples might contain a mixture of diverse isolates/strains for two viruses. However, BaYMV isolate Jeonju and BaMMV isolate Gunsan exhibited a few and no SNPs, respectively, suggesting low levels of genetic variation. Although samples were pooled, the mutation rates for two viruses were not correlated with the number of pooled samples. Library D pooled from 25 samples exhibited three SNPs for BaYMV, whereas Library A pooled from seven samples and Library F pooled from 12 samples showed 351 and 289 SNPs, respectively, for BaYMV. We next examined the correlation between the number of SNPs and the number of collected regions. The number of SNPs for BaYMV in Libraries A (one region), E (six regions), and F (two regions) was very high, while that in Library D (five regions) was very low, suggesting no correlation between the mutation rates and the number of collected regions. In fact, virus mutation is highly dependent on virus type, host type, and environmental conditions³⁸. Our previous study also showed that the mutation rate of a plant virus or a viroid varied in different plants²⁹. Therefore, we carefully hypothesized that libraries showing high mutation rates for identified viruses in our study might contain barley samples possessing the viruses with high mutation rates. Moreover, it is likely that not all, but a few, barley samples in the same library contained the viruses with high mutation rates.

In summary, six barley RNA viromes in this study provide a comprehensive overview of viruses infecting winter barley in Korea. Although several viruses infecting barley have been identified, we found that BaYMV was the dominant virus associated with yellow mosaic disease symptoms in Korea. Furthermore, phylogenetic trees using assembled viral genomes suggest that geographical region is a main factor to group BaYMV and BaMMV isolates/strains. Moreover, SNP analyses for BaYMV and BaMMV revealed genetic variations of two viruses in different geographical regions.

Methods

Plant materials

We collected 110 winter barley leaf samples from 17 geographical regions in Korea in March 2016. The 17 regions are the main winter barley production areas in Korea. Most barley samples showed yellow mosaic disease symptoms; however, some barley samples did not show any symptoms. The 110 samples were pooled according to six different provinces in Korea. Detailed information about geographical regions and the six different provinces can be found in Table 2 and Fig. 3.

Total RNA extraction and library preparation

Pooled barley leaf samples were frozen using liquid nitrogen and ground with a pestle and mortar. Total RNA was extracted using the RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) according to manufacturer’s manuals. The quality and quantity of extracted total RNA were measured using an Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA) and gel electrophoresis. The extracted total RNAs were used for the library preparation for RNA sequencing using the NEBNext Ultra™ RNA Library Prep Kit for Illumina in accordance with the manufacturer’s instructions (NEB, Ipswich, Massachusetts, U.S.A.). In brief, we extracted mRNAs with poly-A tail using poly-T oligo-attached magnetic beads. The first strand of cDNA was synthesized by the purified mRNAs followed by a second strand of cDNA. After that, the adenylation of 3′ ends was conducted. Adapters were ligated, and PCR amplification was carried out to selectively enrich DNA fragments with adapters and amplify the amount of DNA in the library, respectively. The 2100 Bioanalyzer was used for quality control of the generated libraries (Agilent, Santa Clara, U.S.A.). The six prepared libraries were paired-end sequenced by Macrogen Co. (Seoul, South Korea) using the HiSeq2000 platform.

Bioinformatic analyses to identify viruses in the assembled transcriptome

We used the same bioinformatics analyses to identify viruses in the assembled transcriptome as described previously²⁹. In brief, two different methods, Trinity program (version 2.0.2, released 22^nd January 2015) with default parameters³⁹ and Velvet/Oases assembler (version 0.2.08)⁴⁰, were used for de novo transcriptome assembly. Assembled contigs in each transcriptome were subjected to MEGABLAST⁴¹ with a cut-off E-value of 1e⁻⁶ search against NCBI’s viral reference database downloaded from https://www.ncbi.nlm.nih.gov/genome/viruses/. Only virus-associated contigs were selected after deleting endogenous virus-like sequences.

De novo genome assembly of BaYMV and BaMMV

Nearly complete genomes for six BaYMV and two BaMMV genomes were assembled based on contigs generated by Trinity and Velvet assemblers as previously described^29,32. ClustalW program implemented in the MEGA7 program was used to align contigs on the assembled viral genomes⁴². Again, we aligned raw sequence reads on the assembled viral genome to confirm consensus sequences using a Burrows-Wheeler Aligner (BWA) program with default parameters⁴³.

Generation of phylogenetic trees for BaYMV and BaMMV

To generate phylogenetic trees for BaYMV and BaMMV, six BaYMV and two BaMMV genomes were used. Each virus is a bipartite virus composed of two RNA fragments. Therefore, four independent phylogenetic trees were constructed. BLASTN was conducted to find known BaYMV and BaMMV genome sequences using the assembled BaYMV and BaMMV RNA fragments as a query against GenBank (http://www.ncbi.nlm.nih.gov/genbank/). Each RNA fragment sequence for each virus was aligned using the ClustalW program with default parameters. In de novo transcriptome assembly, some virus-associated contigs include partial sequences derived from the host plant. Those sequences can be identified by BLASTN search and sequence alignment on the reference virus genomes. Those unnecessary sequences derived from the host plant and poly-A tails at the 5′ and 3′ regions were deleted. We manually edited aligned sequences. A phylogenetic tree was constructed using the MEGA7 program with the neighbor-joining method with 1,000 bootstrap replicates and Kimura 2-parameter distance⁴².

Analyses of SNPs for BaYMV and BaMMV using transcriptome data

SNPs for six BaYMV and two BaMMV transcriptomes were analyzed as described previously²⁹. In brief, the raw sequence reads from each transcriptome were aligned on the identified individual viral genome using the BWA program with default parameters. The assembled viral genome derived from each transcriptome was used for the reference to increase SNP specificity. According to our experience, the application of known viral reference genomes results in the identification of unexpected SNPs. The SAM files generated by the BWA program were converted into BAM files using SAMtools⁴⁴. The sorted BAM files were used to generate the VCF file format using the mpileup function of SAMtools for SNP calling. Finally, we used BCFtools implemented in SAMtools to call SNPs. The positions of identified SNPs on each viral genome were visualized by the Tablet program⁴⁵.

Design of primer-pairs and RT-PCR

We designed primer-pairs for four viruses, BaYMV, BaMMV, HvEV, and BVG. For each virus, two independent primer-pairs were designed. In case of BaYMV and BaMMV, two different primer-pairs for each RNA fragment were used. The same total RNA from the pooled sample was used as a template in RT-PCR. RT-PCR was conducted using the DiaStar™ OneStep RT-PCR Kit (SolGent, Daejeon, Korea), and the cycling conditions were 50 °C for 30 min, 95 °C for 15 min followed by 30 cycles at 95 °C for 20 sec, 50 °C to 56 °C for 40 sec (the annealing temperature can be variable depending on Tm values of primers), and 72 °C for 1 min, with a final extension at 72 °C for 5 min. The amplified RT-PCR products were confirmed by gel electrophoresis followed by EtBr staining. The amplified RT-PCR product was cloned in the pGEM-T-Easy Vector (Promega, Wisconsin, US) followed by Sanger sequencing.

Data Availability

The raw dataset in this study will be available, upon publication, in the Sequence Read Archive (SRA) repository with accession numbers SRR6706097, SRR6706098, SRR6706099, SRR6706100, SRR6706101, and SRR6706102. The six BaYMV and two BaMMV genome sequences obtained from this study were also deposited in GenBank, NCBI, with respective accession numbers.

References

Gupta, M., Abu‐Ghannam, N. & Gallaghar, E. Barley for brewing: Characteristic changes during malting, brewing and applications of its by‐products. Comprehensive reviews in food science and food safety 9, 318–328 (2010).
Article CAS Google Scholar
Liu, Z., Ellwood, S. R., Oliver, R. P. & Friesen, T. L. Pyrenophora teres: profile of an increasingly damaging barley pathogen. Molecular Plant Pathology 12, 1–19 (2011).
Article PubMed Google Scholar
Salas, B. et al. Fusarium species pathogenic to barley and their associated mycotoxins. Plant Disease 83, 667–674 (1999).
Article CAS Google Scholar
Adams, M. The distribution of barley yellow mosaic virus (BaYMV) and barley mild mosaic virus (BaMMV) in UK winter barley samples, 1987–1990. Plant Pathology 40, 53–58 (1991).
Article Google Scholar
Peerenboom, E. et al. The complete nucleotide sequence of RNA-2 of a fungally-transmitted UK isolate of barley mild mosaic bymovirus and identification of amino acid combinations possibly involved in fungus transmission. Virus research 40, 149–159 (1996).
Article PubMed CAS Google Scholar
Peerenboom, E. et al. Complete RNA1 sequences of two UK isolates of barley mild mosaic virus: a wild-type fungus-transmissible isolate and a non-fungus-transmissible derivative. Virus research 50, 175–183 (1997).
Article PubMed CAS Google Scholar
Kashiwazaki, S., Minobe, Y., Omura, T. & Hibino, H. Nucleotide sequence of barley yellow mosaic virus RNA 1: a close evolutionary relationship with potyviruses. Journal of general virology 71, 2781–2790 (1990).
Article PubMed CAS Google Scholar
Kashiwazaki, S., Minobe, Y. & Hibino, H. Nucleotide sequence of barley yellow mosaic virus RNA 2. Journal of general virology 72, 995–999 (1991).
Article PubMed CAS Google Scholar
Adams, M., Swaby, A. & Jones, P. Confirmation of the transmission of barley yellow mosaic virus (BaYMV) by the fungus Polymyxa graminis. Annals of Applied Biology 112, 133–141 (1988).
Article Google Scholar
Jianping, C., Swaby, A., Adams, M. & Yili, R. Barley mild mosaic virus inside its fungal vector, Polymyxa graminis. Annals of applied biology 118, 615–621 (1991).
Article Google Scholar
Miller, W. A. & Rasochová, L. Barley yellow dwarf viruses. Annual review of phytopathology 35, 167–190 (1997).
Article PubMed CAS Google Scholar
Gray, S. & Gildow, F. E. Luteovirus-aphid interactions. Annual review of phytopathology 41, 539–566 (2003).
Article PubMed CAS Google Scholar
Lee, W.-S., Hammond-Kosack, K. E. & Kanyuka, K. Barley stripe mosaic virus-mediated tools for investigating gene function in cereal plants and their pathogens: virus-induced gene silencing, host-mediated gene silencing, and virus-mediated overexpression of heterologous protein. Plant Physiology 160, 582–590 (2012).
Article PubMed PubMed Central CAS Google Scholar
Scofield, S. R. & Nelson, R. S. Resources for virus-induced gene silencing in the grasses. Plant Physiology 149, 152–157 (2009).
Article PubMed PubMed Central CAS Google Scholar
Slykhuis, J. & Bell, W. Differentiation of Agropyron mosaic, wheat streak mosaic, and a hitherto unrecognized Hordeum mosaic virus in Canada. Canadian Journal of Botany 44, 1191–1208 (1966).
Article Google Scholar
Pellio, B. et al. High-resolution mapping of the Rym4/Rym5 locus conferring resistance to the barley yellow mosaic virus complex (BaMMV, BaYMV, BaYMV-2) in barley (Hordeum vulgare ssp. vulgare L.). Theoretical and applied genetics 110, 283–293 (2005).
Article PubMed CAS Google Scholar
Vaïanopoulos, C. et al. Barley yellow mosaic virus is overcoming RYM4 resistance in Belgium. Communications in agricultural and applied biological sciences 72, 333–339 (2007).
PubMed Google Scholar
Nissan-Azzouz, F., Graner, A., Friedt, W. & Ordon, F. Fine-mapping of the BaMMV, BaYMV-1 and BaYMV-2 resistance of barley (Hordeum vulgare) accession PI1963. Theoretical and applied genetics 110, 212–218 (2005).
Article PubMed CAS Google Scholar
Lüpken, T. et al. Genomics-based high-resolution mapping of the BaMMV/BaYMV resistance gene rym11 in barley (Hordeum vulgare L.). Theoretical and Applied Genetics 126, 1201–1212 (2013).
Article PubMed CAS Google Scholar
Pecman, A. et al. Next Generation Sequencing for Detection and Discovery of Plant Viruses and Viroids: Comparison of Two Approaches. Frontiers in Microbiology 8, 1998 (2017).
Article PubMed PubMed Central Google Scholar
Barrero, R. A. et al. An internet-based bioinformatics toolkit for plant biosecurity diagnosis and surveillance of viruses and viroids. BMC bioinformatics 18, 26 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Wu, Q., Ding, S.-W., Zhang, Y. & Zhu, S. Identification of viruses and viroids by next-generation sequencing and homology-dependent and homology-independent algorithms. Annual review of phytopathology 53, 425–444 (2015).
Article PubMed CAS Google Scholar
Candresse, T. et al. Complete genomic sequence of barley (Hordeum vulgare) endornavirus (HvEV) determined by next-generation sequencing. Archives of virology 161, 741–743 (2016).
Article PubMed CAS Google Scholar
Yan, T. et al. Characterization of the complete genome of Barley yellow striate mosaic virus reveals a nested gene encoding a small hydrophobic protein. Virology 478, 112–122 (2015).
Article PubMed CAS Google Scholar
Zhao, F. et al. The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea. Archives of virology 161, 2047–2050 (2016).
Article PubMed CAS Google Scholar
Rolland, M. et al. Classical and next generation sequencing approaches unravel Bymovirus diversity in barley crops in France. Plos one 12, e0188495 (2017).
Article PubMed PubMed Central CAS Google Scholar
Bae, J.-Y. et al. Occurrence of barley virus diseases in southern part of Korea. Korean journal of organic agriculture 23, 859–866 (2015).
Article Google Scholar
Smith, O. et al. A complete ancient RNA genome: identification, reconstruction and evolutionary history of archaeological Barley Stripe Mosaic Virus. Scientific reports 4, 4003 (2014).
Article PubMed PubMed Central CAS Google Scholar
Jo, Y. et al. Peach RNA viromes in six different peach cultivars. Scientific reports 8, 1844 (2018).
Article ADS PubMed PubMed Central Google Scholar
Schlötterer, C., Tobler, R., Kofler, R. & Nolte, V. Sequencing pools of individuals—mining genome-wide polymorphism data without big funding. Nature Reviews Genetics 15, 749 (2014).
Article PubMed CAS Google Scholar
Iles, J. C. et al. Characterization of hepatitis C virus recombination in Cameroon by use of nonspecific next-generation sequencing. Journal of clinical microbiology 53, 3155–3164 (2015).
Article PubMed PubMed Central CAS Google Scholar
Jo, Y. et al. The pepper virome: natural co-infection of diverse viruses and their quasispecies. BMC genomics 18, 453 (2017).
Article PubMed PubMed Central CAS Google Scholar
Jo, Y., Choi, H., Yoon, J.-Y., Choi, S.-K. & Cho, W. K. In silico identification of Bell pepper endornavirus from pepper transcriptomes and their phylogenetic and recombination analyses. Gene 575, 712–717 (2016).
Article PubMed CAS Google Scholar
Zabalgogeazcoa, I., Cox-Foster, D. C. & Gildow, F. E. Pedigree analysis of the transmission of a double-stranded RNA in barley cultivars. Plant Science 91, 45–53 (1993).
Article CAS Google Scholar
Cho, W.-K., Lee, J.-M., Kwon, M.-S. & Chung, T.-Y. Evaluation of Morphological Characteristics and RAPD Analysis in Korean Landraces of Naked Barley. Journal of Plant Biotechnology 29, 217–222 (2002).
Article Google Scholar
You, Y. & Shirako, Y. Bymovirus reverse genetics: requirements for RNA2‐encoded proteins in systemic infection. Molecular plant pathology 11, 383–394 (2010).
Article PubMed CAS Google Scholar
Kashiwazaki, S. & Hibino, H. Genomic reassortment of barley mild mosaic virus: evidence for the involvement of RNA1 in pathogenicity. Journal of general virology 77, 581–585 (1996).
Article PubMed CAS Google Scholar
Roossinck, M. J. Plants, viruses and the environment: ecology and mutualism. Virology 479, 271–277 (2015).
Article PubMed CAS Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature biotechnology 29, 644–652 (2011).
Article PubMed PubMed Central CAS Google Scholar
Zerbino, D. R. & Birney, E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome research 18, 821–829 (2008).
Article PubMed PubMed Central CAS Google Scholar
Morgulis, A. et al. Database indexing for production MegaBLAST searches. Bioinformatics 24, 1757–1764 (2008).
Article PubMed PubMed Central CAS Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Molecular biology and evolution 33, 1870–1874 (2016).
Article PubMed CAS Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Milne, I. et al. Using Tablet for visual exploration of second-generation sequencing data. Briefings bioinformatics 14, 193–202 (2012).
Article CAS Google Scholar
Gustafson, G., Armour, S., Gamboa, G. C., Burgett, S. G. & Shepherd, J. W. Nucleotide sequence of barley stripe mosaic virus RNAα: RNAα encodes a single polypeptide with homology to corresponding proteins from other viruses. Virology 170, 370–377 (1989).
Article PubMed CAS Google Scholar
Gustafson, G. & Armour, S. L. The complete nucleotide sequence of RNAβ from the type strain of barley stripe mosaic virus. Nucleic acids research 14, 3895–3909 (1986).
Article PubMed PubMed Central CAS Google Scholar
Gustafson, G., Hunter, B., Hanau, R., Armour, S. & Jackson, A. Nucleotide sequence and genetic organization of barley stripe mosaic virus RNA-γ. Virology 158, 394–406 (1987).
Article PubMed CAS Google Scholar
Chen, J. et al. Molecular analysis of barley yellow mosaic virus isolates from China. Virus research 64, 13–21 (1999).
Article PubMed CAS Google Scholar
Miller, W. A., Waterhouse, P. & Gerlach, W. Sequence and organization of barley yellow dwarf virus genomic RNA. Nucleic acids research 16, 6097–6111 (1988).
Article PubMed PubMed Central CAS Google Scholar
French, R. & Stenger, D. Genome sequences of Agropyron mosaic virus and Hordeum mosaic virus support reciprocal monophyly of the genera Potyvirus and Rymovirus in the family Potyviridae. Archives of virology 150, 299–312 (2005).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

This work was supported by the support of the “Cooperative Research Program for Agriculture Science & Technology Development” (Project No. PJ01186101) conducted by the Rural Development Administration, Republic of Korea.

Author information

Authors and Affiliations

Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University, Seoul, 08826, Republic of Korea
Yeonhwa Jo, Hoseong Choi & Won Kyong Cho
Crop Foundation Division, National Institute of Crop Science, RDA, Wanju, 55365, Republic of Korea
Ju-Young Bae, Sang-Min Kim & Bong Choon Lee

Authors

Yeonhwa Jo
View author publications
You can also search for this author in PubMed Google Scholar
Ju-Young Bae
View author publications
You can also search for this author in PubMed Google Scholar
Sang-Min Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hoseong Choi
View author publications
You can also search for this author in PubMed Google Scholar
Bong Choon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Won Kyong Cho
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.J. is the main author of this paper. J.Y.B., S.M.K., H.C., B.C.L. harvested samples and observed disease symptoms. Y.J. performed the RNA extraction, library preparation and sequencing. Y.J. performed most bioinformatics analyses. Y.J., J.Y.B., S.M.K., H.C., B.C.L. and W.K.C. interpreted the data and discussed the results. This project was conceived and designed by B.C.L. and W.K.C. Y.J., B.C.L. and W.K.C. wrote the manuscript. All the authors have read, revised, and approved the manuscript.

Corresponding authors

Correspondence to Bong Choon Lee or Won Kyong Cho.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supporting information

Table S1

Table S2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jo, Y., Bae, JY., Kim, SM. et al. Barley RNA viromes in six different geographical regions in Korea. Sci Rep 8, 13237 (2018). https://doi.org/10.1038/s41598-018-31671-4

Download citation

Received: 06 March 2018
Accepted: 15 August 2018
Published: 05 September 2018
DOI: https://doi.org/10.1038/s41598-018-31671-4

This article is cited by

Genomic diversity of Areca Palm Velarivirus 1 (APV1) in Areca palm (Areca catechu) plantations in Hainan, China
- Xianmei Cao
- Ruibai Zhao
- Xi Huang
BMC Genomics (2021)
Sweet potato viromes in eight different geographical regions in Korea and two different cultivars
- Yeonhwa Jo
- Sang-Min Kim
- Won Kyong Cho
Scientific Reports (2020)
Bymovirus-induced yellow mosaic diseases in barley and wheat: viruses, genetic resistances and functional aspects
- Congcong Jiang
- Jinhong Kan
- Ping Yang
Theoretical and Applied Genetics (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Collection of barley samples and library preparation

Identification of viruses infecting barley

Distribution of identified viruses based on geographical regions

Calculation of virus accumulation in each library

De novo genome assembly and calculation of mutation rates

Phylogenetic relationships of BaYMV and BaMMV

Confirmation of NGS results by RT-PCR

Discussion

Methods

Plant materials

Total RNA extraction and library preparation

Bioinformatic analyses to identify viruses in the assembled transcriptome

De novo genome assembly of BaYMV and BaMMV

Generation of phylogenetic trees for BaYMV and BaMMV

Analyses of SNPs for BaYMV and BaMMV using transcriptome data

Design of primer-pairs and RT-PCR

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links