De novo transcriptome assembly, polymorphic SSR markers development and population genetics analyses for southern corn rust (Puccinia polysora)

Sun, Qiuyu; Liu, Jie; Zhang, Keyu; Huang, Chong; Li, Leifu; Dong, Jiayu; Luo, Yong; Ma, Zhanhong

doi:10.1038/s41598-021-97556-1

Download PDF

Article
Open access
Published: 09 September 2021

De novo transcriptome assembly, polymorphic SSR markers development and population genetics analyses for southern corn rust (Puccinia polysora)

Qiuyu Sun¹,
Jie Liu²,
Keyu Zhang¹,
Chong Huang²,
Leifu Li¹,
Jiayu Dong¹,
Yong Luo¹ &
…
Zhanhong Ma¹

Scientific Reports volume 11, Article number: 18029 (2021) Cite this article

1469 Accesses
3 Citations
Metrics details

Subjects

Abstract

Southern corn rust is a destructive maize disease caused by Puccinia polysora Underw that can lead to severe yield losses. However, genomic information and microsatellite markers are currently unavailable for this disease. In this study, we generated a total of 27,295,216 high-quality cDNA sequence reads using Illumina sequencing technology. These reads were assembled into 17,496 unigenes with an average length of 1015 bp. The functional annotation indicated that 8113 (46.37%), 1933 (11.04%) and 5516 (31.52%) unigenes showed significant similarity to known proteins in the NCBI Nr, Nt and Swiss-Prot databases, respectively. In addition, 2921 (16.70%) unigenes were assigned to KEGG database categories; 4218 (24.11%), to KOG database categories; and 6,603 (37.74%), to GO database categories. Furthermore, we identified 8,798 potential SSRs among 6653 unigenes. A total of 9 polymorphic SSR markers were developed to evaluate the genetic diversity and population structure of 96 isolates collected from Guangdong Province in China. Clonal reproduction of P. polysora in Guangdong was dominant. The YJ (Yangjiang) population had the highest genotypic diversity and the greatest number of the multilocus genotypes, followed by the HY (Heyuan), HZ (Huizhou) and XY (Xinyi) populations. These results provide valuable information for the molecular genetic analysis of P. polysora and related species.

Comparative transcriptomics provides a strategy for phylogenetic analysis and SSR marker development in Chaenomeles

Article Open access 12 August 2021

Transcriptome sequence resource for the cucurbit powdery mildew pathogen Podosphaera xanthii

Article Open access 19 June 2019

Genome-Wide SNP discovery and genomic characterization in avocado (Persea americana Mill.)

Article Open access 27 December 2019

Introduction

Southern corn rust caused by Puccinia polysora is one of most devastating fungal diseases of corn; it was first described and named by Underw in 1897¹. SCR is distributed in tropical, subtropical and temperate cultivation regions, including Asia², the United States^3,4, Australia⁵ and Africa^1,6. An increase in disease occurrence has been reported in recent years in China. The lack of varieties resistant to SCR and long-distance migration of the disease are the main reasons for severe disease epidemics^3,7,8.

SCR appears to have great potential to damage plants. Symptoms initially appear on leaves and expand through an entire plant, including the leaves, stalks, leaf sheaths and husks, in late-season planting, which can result in leaf necrosis and complete destruction of photosynthetic areas, followed by death of the plant^9,10,11,12. The risk of the pandemics of this disease is a big threat¹¹. Although several physiologic races of P. polysora have been described^{13,14,15,16,17} and resistance has been reported^3,7, the majority of commercial hybrids grown in China and the United Stated are rated as susceptible to P. polysora^18,19,20. In the past, the disease has caused several serious yield losses in various countries, including West Africa (50%⁶), the Philippines (80–84%²¹), Brazil (40%²²), the United States (30–50%³) and China (42–53%²³).

The life cycle and mating system of P. polysora remain unknown. The alternate host of P. polysora has never been found. Urediniospores serve as both primary and secondary inoculum sources of mini cycle of the disease². The aecidial and pycnidial stages of P. polysora are absent⁹. Teliospores are rarely or not produced². All experiments attempting to induce the germination of teliospores have been unsuccessful^2,9. The function of teliospores in the life cycle remains mystery, although they could be of significance in assuring extended survival of the pathogen. The highly successful uredo stage and the ease of continuity of this stage may have resulted in the suppression of the sexual phase in the life cycle⁹. Due to the lack of evidence, Cammack⁹ provisionally classified P. polysora as a microcyclic and autoecious hemiform.

Although the disease is a main problem of corn production, few nucleotide sequences of P. polysora have been deposited in GenBank. The lack of whole-genome sequences means that little molecular information is available for the pathogen, which limits the research on this pathogen. Illumina sequencing of the transcriptomes of model and nonmodel organisms previously confirmed that short reads can be effectively assembled^24,25,26. For nonmodel organisms with limited genomic information, transcriptome sequencing, which focuses on the sequencing of functional and protein-coding RNAs, is a cost-effective method^27,28. Transcriptome data are invaluable in the discovery of gene functions, metabolic pathways and molecular markers²⁵.

Polymorphic simple sequence repeats (SSRs) have been proven to be important for assessing genetic diversity and population structure^29,30, however, no SSR markers for P. polysora have been developed and reported to date. Without effective molecular markers, the genetic diversity of P. polysora has rarely been reported. The population diversity of P. polysora was investigated using restriction fragment length polymorphisms (RFLPs) in Japan³¹ and inter-simple sequence repeats (ISSRs) in Thailand³² and China³³. The traditional and standard methods for developing SSR markers are time-consuming and expensive, whereas transcriptome sequencing provides a high-throughput source for investigating SSRs^34,35.

The objectives of this study were to (1) sequence the transcriptomes of urediniospores of P. polysora using an RNA-seq sequencing platform, Illumina HiSeq2000, (2) assemble and annotate the unigenes of P. polysora, (3) develop polymorphic SSR markers, and (4) genotype and characterize the population genetics of P. polysora isolates collected from the cities of Xinyi, Huizhou, Yangjiang and Heyuan in Guangdong Province. The transcriptome sequences provide a valuable genomic resource for the molecular study of P. polysora, and the findings provide the first perspective into the P. polysora population structure in Guangdong Province.

Results

Sequence assembly

We sequenced the transcriptomes of urediniospores of P. polysora. A total of 28,252,282 raw reads (4.09 Gbp) were obtained. The percentages of Q20 and Q30 were 96.62% and 91.76%, respectively. The GC content was approximately 49.49%. After quality control, the remaining high-quality reads were assembled into 10,539 transcripts with an average length of 1,229 bp and an N50 of 2,083 bp (Table 1). We obtained 17,496 unigenes varying from 201 bp to 15,021 bp with an average of 1,015 bp and an N50 of 1,922 bp (Table 1). The length distribution of the unigenes showed that 10,539 unigenes were 201–500 bp, 4,795 unigenes were 500–1,000 bp, 6,292 unigenes were 1,000–2,000 bp and 5,623 unigenes were longer than 2,000 bp (Fig. 1).

Table 1 Summary of sequence assembly.

Full size table

Functional annotation

The functional annotation of all the unigenes was conducted by searching against published databases. Based on the sequence similarity, a total of 9550 (54.58%) unigenes were annotated. Because of the lack of genome information for P. polysora, 7946 unigenes could not be matched to known genes. The overall functional annotation across each database is described in Table 2.

Table 2 Summary of functional annotation of unigenes of P. polysora.

Full size table

There were 6,603 unigenes categorized into three main GO categories: biological process (17,725), cellular component (7,824) and molecular function (12,942) (Fig. 2, Table S1). Within the three categories, ‘cellular process’ (4,040), ‘binding’ (3,495), ‘cell part’ (2,685) and ‘cell’ (2,685) were the most prevalent. A high percentage of genes as classified under the ‘single-organism process’ (2,958), ‘catalytic activity’ (2,964) and ‘metabolic process’ (3,772) terms. The categories ‘metallochaperone activity’ (3), ‘cell aggregation’ (1) and ‘cell killing’ (2) represented the smallest groups.

A total of 4,218 unigenes were clustered into 25 orthologous groups (KOG/COG) (Fig. S1, Table S2). Among these categories, the cluster ‘General function prediction only’ (567) was the largest group, followed by the cluster ‘Translation, ribosomal structure and biogenesis’ (533) and ‘Posttranslational modification, protein turnover, chaperones’ (477), while ‘Cell motility’ (3) and ‘Extracellular structures’ (2) represented the smallest groups.

To further analyse the transcriptome of P. polysora, all the unigenes were analysed in the KEGG pathway database. A total of 2,921 unigenes with significant matches were assigned to 32 KEGG pathways in five main categories (Fig. 3, Table S3). Among these five main categories, ‘Translation’ (480) constituted the largest category, followed by ‘Signal transduction’ (249), ‘Sorting and degradation’ (237) and ‘Amino acid metabolism’ (203).

SSR marker detection

After a bibliographic research, we have not found any SSR markers for P. polysora; therefore, we detected novel SSRs. In this study, all the assembled unigenes were used to mine potential SSRs. We identified a total of 8,798 potential SSRs in 6,653 unigenes (Table S4, Table S5). There were 2,157 unigenes containing more than one SSR. The largest fraction of identified SSR markers consisted of mononucleotides, which accounted for 52.4% (4,609), followed by those with trinucleotide repeats (2,461), dinucleotide repeats (1,351) and tetranucleotide repeats (290) (Table 3). In the mononucleotide repeats, the dominant repeat motif was A/T (3,631), followed by C/G (978). For the dinucleotide repeats, the AG/CT (898) motif was much more common than AC/GT (252), AT/AT (198) and CG/CG (3). For the trinucleotide repeats, the most common motif was ATC/ATG (826), followed by AGG/CCT (375), ACC/GGT (312) and AAG/CTT (278) (Table 3).

Table 3 Summary of SSRs identified from the transcriptome of P. polysora.

Full size table

Identification of novel polymorphic SSR markers

Based on their flanking sequences, a total of 10,440 primer pairs for 3,480 out of 8,798 potential SSRs were successfully designed. We selected 400 high-quality SSR primer pairs for amplification and polymorphism detection via PCR with 96 isolates of P. polysora sampled in Guangdong Province. Of the primers we designed, 9 primers produced polymorphic fragments with the 96 isolates tested (Table 4). For the polymorphic loci, the number of alleles per locus ranged from 2 to 4, with an average of 3.0 alleles per locus. Nei’s gene diversity (H_exp), which estimates gene diversity, ranged from 0.021 to 0.502³⁶. Simpson’s index λ (the genotypic diversity index) ranged from 0.021 to 0.499, and evenness ranged from 0.353 to 0.998 (Table 4)³⁷. Locus Ppoly8 had the highest Simpson diversity (0.499) and the most evenly distributed alleles (0.998). A genotype accumulation curve showed that these 9 loci are informative for population genetic analysis (Fig. S2).

Table 4 Polymorphic microsatellite primers of P. polysora. Locus represent each SSR. Motif is the repeat motif of each SSR. F is the forward primer and R is the reverse primer of SSR. Forward primers are labeled with fluorescent tags (FAM, HEX or TAMRA). Nb (allele sizes) is the number of alleles. Tm represents annealing temperature. λ and H_exp are diversity indices.

Full size table

Population genetic diversity

The population genetic diversity of four populations (HY, HZ, XY and YJ) of P. polysora was analysed (Table 5). A total of 32 multilocus genotypes (MLGs) were detected in the 96 isolates based on the 9 novel SSR markers. The number of MLGs observed for each population ranged from 6 to 14. To reduce the influence of sample size, a more appropriate comparison, eMLG, was performed. eMLG is a measure of the number of genotypes that would be expected at the largest, shared sample size based on rarefaction. The YJ population had the highest genotypic diversity and the most eMLGs, followed by HZ, HY and XY. In addition, YJ was also the most even population, followed by XY, HZ and HY. Except for the XY population, the index of association \(\overline{r}_{d}\) and the standardized index of association I_A showed that P. polysora reproduced clonally.

Table 5 Genotypic diversity statistics of four populations in Guangdong Province for P. polysora. N is the number of isolates in each population. MLG is the number of the multilocus genotypes observed. eMLG is the number of expected MLG. SE is the standard error based on eMLG. The next four columns are diversity indices. I_A and \(\overline{r}_{d}\) are the index of association and the standardized index of association, respectively. *P < 0.05.

Full size table

Population structure

Discriminant analysis of principal components (DAPC) was performed to analyse the genetic structure of 96 P. polysora isolates. The optimal number of PCs (30) was assessed using the find.clusters function. The cluster membership probabilities of each isolate based on the discriminant functions of DAPC were conducted with cluster numbers (K) from 2 to 8 (Fig. 4). With the increase in cluster number, population subdivision was gradually generated among the populations. All the populations in Guangdong Province exhibited admixture with other populations, and YJ showed the most admixture.

A neighbour-joining (NJ) phylogenetic tree and minimum spanning network (MSN) were conducted to further assess the population structure of P. polysora. The structure of P. polysora was dominated by the presence of two clonal linages (Fig. 5B). There were 44 and 17 isolates from all the populations clustered into these two clonal linages, respectively. Most isolates with high genetic distance from the YJ population were clustered into groups with cryptic diversity. The MSN results showed that two dominant MLGs were observed in all the populations (Fig. 5A). Populations HY and HZ contributed substantially to the major groups of MLGs, and HY had more unique MLGs than the other populations. The MSN results were consistent with the NJ tree analysis.

Discussion

Recently, southern corn rust has become research focus because of its wide distribution, great damage and high incidence in China. Although southern corn rust is a devastating disease worldwide, the complete genome sequence and transcriptomic data for P. polysora remain unknown. Transcriptome analysis is an attractive alternative to examine the properties of a transcriptome as a proxy for the whole genome³⁸. Transcriptome sequencing is a powerful and cost-effective method for generating large-scale transcriptomes and is used to annotate novel genes for model and nonmodel organisms and to develop molecular markers^24,25,26. To our knowledge, the present study is the first attempt to perform de novo assembly of the transcriptome and to develop SSR markers of P. polysora.

In this study, we obtained a total of 27,295,216 high-quality reads with a Q20 of 96.62% using Illumina paired-end sequencing. A total of 17,496 unigenes were assembled, which is similar to the gene numbers predicted for other rust fungi, such as Melampsora larici-populina (16,399³⁹), M. lini (16,271⁴⁰) and P. striiformis (20–25,000^41,42). The average length of the unigenes with an N50 of 1,922 bp was 1,015 bp. The GC content was 49.49%. These results indicated that the transcripts of P. polysora were of high quality and provided a large amount of valuable transcriptome information for gene annotation, novel gene discovery, and the investigation of molecular evolutionary mechanisms in P. polysora^24,25,26.

To annotate the biological function of unigenes at the transcriptome level, sequence similarity searching was conducted against the Nr, Nt, Swiss-Prot, GO, Pfam, KOG/COG and KEGG databases. More than half of the unigenes (9,550; 54.58%) were matched with known proteins in public databases. Due to the absence of a reference genome for P. polysora, almost half of the unigenes might represent novel genes whose function is unknown. In the Nr database, most unigenes (77.00%) were annotated to Puccinia graminis f. sp. tritici, which causes stem rust of wheat. This result confirmed that P. polysora is closely related to P. graminis which has been completely sequenced⁴³. In addition, we categorized 6,603, 4,218 and 2,921 unigenes into GO categories, orthologous groups (KOG/COG) and KEGG pathways, respectively, indicating that one-third of the unigenes could be well annotated with potential functions. The annotated unigenes might be useful in the investigation of gene function and valuable for further research in the future. These results again demonstrated that genes can be investigated by transcriptome analysis for nonmodel plant species.

In the present research, a total of 8,798 potential SSRs were identified in 6,653 unigenes. We selected 400 high-quality SSR primer pairs for amplification and polymorphism detection via PCR with 96 isolates of P. polysora. Only 9 primer pairs with the expected size were polymorphic among the 96 isolates of P. polysora. The primer pairs resulting in PCR products larger or smaller than expected may be due to the presence of introns, insertions or repeats, a lack of specificity, or assembly errors⁴⁴. The low percentage of polymorphic loci may be due to the close geographic origin of isolates or the clonal reproduction of the pathogen⁴⁵.

Previous genetic diversity studies of P. polysora populations fully support our results^32,33. Using ISSR markers, Unartngam et al.³² found that P. polysora isolates from different provinces in Thailand have similar genetic characteristics and are present in the same groups, likely resulting from spore migration. The same result was found in Chinese populations using ISSR markers³³. The urediniospores of P. polysora can be dispersed by air currents up to altitudes greater than 15,000 ft high and spread long distances by wind¹. The present study again confirmed that the isolates from different localities presented similar genetic characteristics, which might be due to spore migration.

All the populations in Guangdong Province exhibited admixture with other populations and were clustered into two clonal groups, while the YJ population showed the highest genotypic diversity and the most eMLGs. More isolates with unique MLGs were found in the YJ population. The pathogen can survive year-round in the most south city Yangjiang belonging to the tropical zone, which might increase the possibility of new genotypes arising.

Our results confirm the previous study showing that P. polysora is a microcyclic and autoecious hemiform pathogen^9,32. The index of association \(\overline{r}_{d}\) and the standardized index of association I_A rejected the null hypothesis of linkage equilibrium in three out of four populations, suggesting high rates of selfing and clonal reproduction. Population XY was the exception and was characterized by only 6 MLGs, of which 5 MLGs were shared with other populations. Eight out of 13 isolates clustered into the clonal groups, and 5 isolates showed slight genetic differentiation. The differences remaining in population XY could lead to not reject the null hypothesis of linkage equilibrium, despite the pathogen truly reproducing clonally. Failure to reject the null hypothesis of linkage equilibrium does not mean the confirmation of sexual reproduction. More isolates in population XY are needed for further confirmation. The majority of isolates were clustered into two dominant clonal groups, which could be additional support for clonality and selfing.

The observed cryptic diversity appeared to be sampling bias. Members of the diverse groups were from HY, HZ and XY; however, most isolates of these populations were clustered into the dominant clonal groups. Furthermore, the index of association \(\overline{r}_{d}\) and the standardized index of association I_A showed that the populations HY and HZ are highly clonal or selfing. Therefore, we propose that the observed cryptic diversity is because of sampling bias. Isolates of the cryptically diversity groups likely belong to their own clonal group or groups, that were not sampled deeply enough to obtain their membership.

This work provides novel insights into the transcriptome, annotated unigenes, polymorphic SSRs and population genetics of P. polysora. More work is needed to investigate the gene function and population structure of P. polysora.

Methods

Pathogen materials

There is no research report on the physiological races and pathogenic types of P. polysora in China. To investigate the population genetics of P. polysora in Guangdong Province, we collected diseased leaves from the cities of Xinyi, Huizhou, Yangjiang and Heyuan in Guangdong Province. In each city, two to ten sampling sites with distances of at least 500 m apart from each other were identified, and 10 diseased leaves were randomly collected in each sampling site (Table 6). The plant materials used in this study were obtained from the wild and permission was obtained to collect samples. The collection of plant materials also complied with institutional, national, or international guidelines and legislation. The voucher specimens were preserved in the public herbarium of the Key Lab of Pest Monitoring and Green Management, Ministry of Agriculture and Rural Affairs of China.

Table 6 Summary information of the voucher specimens for Puccinia polysora populations. P represents the population. Isolates collected from the same city were considered to be one population.

Full size table

To obtain enough pure urediniospores for RNA extraction, reproduction of P. polysora was conducted in a greenhouse. P. polysora is an obligate parasite. Three-leaf maize seedlings of susceptible ‘Zhengdan 958’ grown in a pot were rubbed to remove wax and sprayed with water mist. The urediniospores collected from diseased leaves were inoculated on the leaf surface. The inoculated plants were moisturized at 25 to 30 °C for 24 h in a sealed box to promote infection. The inoculated plants were transferred to a greenhouse at 25 to 30 °C. After 10 to 15 days, urediniospores were harvested and used for RNA extraction.

Populations were grouped based on cities and assigned as follows: XY (Xinyi), HZ (Huizhou), YJ (Yangjiang) and HY (Heyuan). A single pustule was collected from the sampled leaf to obtain each isolate on the assumption that each pustule was caused by a single urediniospore. To obtain enough urediniospores for DNA extraction, reproduction of isolates was conducted using the same method described above. A total of 98 isolates were obtained (Tables 5, 6).

RNA extraction

Total RNA was extracted using a TRIzol Kit (Promega, Beijing, China) following the manufacturer’s instructions. Residual DNA was removed using RNase-free DNase I (Takara Biotech Incorporation, Otsu, Japan) for 30 min at 37 °C. The purified RNA quality and quantity were verified using a spectrophotometer (Thermo Scientific, Waltham, MA, USA) and an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA).

cDNA library construction and Illumina sequencing

Poly(A) mRNA was enriched by oligo (dT) magnetic beads. All mRNA sequences were cut into short fragments by using fragmentation buffer. Taking the short fragments as templates, random hexamer primers were used to synthesize first-strand cDNA. Then, buffer, dNTPs, polymerase I and RNase H were added to synthesize second-strand cDNA. The cDNA fragments were purified using the AMPure XP system (Beckman Coulter, Beverly, MA, USA), resolved with EB buffer for end repair and A tailing, and connected with sequencing adapters. PCR was performed using Phusion High-Fidelity DNA polymerase, universal PCR primers and an index primer. PCR products were purified using the AMPure XP system and assessed on DNA high-sensitivity chips using the Agilent Bioanalyzer 2100 system to construct the final cDNA library. The cDNA library was sequenced by Illumina HiSeq2000 sequencing performed by Novogene Bioinformatics Technology Co., Ltd., Beijing, China.

Sequence assembly and gene annotation

The high-quality clean reads for assembly were separated from adapters. Reads with more than 50% Q20 bases and those with more than 10% ambiguous “N” bases were removed. De novo transcriptome assembly was conducted using Trinity software with an optimized k-mer length of 25⁴⁶. Unigenes of the transcriptome were annotated using the NCBI nonredundant protein (Nr) database, NCBI nonredundant nucleotide sequence (Nt) database, Kyoto Encyclopedia of Genes and Genomes (KEGG) database, a manually annotated and reviewed protein sequence (Swiss-Prot) database, Cluster of Orthologous Groups of proteins (KOG/COG) database, Gene Ontology (GO) database and protein family (Pfam) database. The statistical enrichment of the unigenes in KEGG pathways was tested by KOBAS software as shown previously^47,48. The annotation with the Nr, Nt, Swiss-Prot and KOG databases used NCBI Blast version 2.2.28 + with an e-value of 1e⁻⁵. GO analysis was performed using Blast2GO version 2.5⁴⁹. Pfam protein domain prediction was conducted using HMMER version 3.0⁵⁰.

Polymorphic SSR validation and assessment

Microsatellites were identified using MISA software⁵¹. The parameters were adjusted to identify mono-, di-, tri-, tetra-, penta- and hexanucleotide motifs with a minimum of ten, six, five, five, five and five repeats, respectively. Primer 3 version 2.2.3 was used to design the primer pairs with default settings. Forward primers were labelled with fluorescent tags (FAM, HEX or TAMRA) to improve the separation of overlapping markers during multiplexing. Ninety-six isolates sampled from Guangdong Province were used for PCR amplification and polymorphic identification. Amplifications were performed with a thermal cycler (Eppendorf AG) under the following conditions: initial denaturation at 94 °C for 5 min, followed by 35 cycles of 94 °C for 30 s, annealing temperature (Tm) (Table 1) for 30 s, and 72 °C for 1 min, and a final extension at 72 °C for 10 min. Electrophoresis and visualization of alleles were performed on an ABI 3730 DNA analyser (Applied Biosystems, Carlsbad, CA, USA) by Beijing Tsingke Biotech Co., Ltd. Allele sizes were estimated using GeneMarker version 2.2.0 software and compared to a GS500 (35–500 bp) internal standard.

Population genetics data analysis

We used various R packages to perform population genetic analysis⁵². The R package poppr⁵³ was used to calculate the number of MLGs and eMLGs, genotypic diversity (the Shannon–Wiener index H)⁵⁴, Stoddard and Taylor’s index G⁵⁵, and Simpson’s index λ³⁷, evenness E₅⁵⁶, Nei’s unbiased gene diversity H_exp³⁶, and index of association \(\overline{r}_{d}\) and standardized index of association I_A⁵⁷. eMLG estimates the number of expected MLGs that share the same sample size based on rarefaction. H and G both measure richness and evenness, increasing as genotypic richness and evenness increase. G weights abundant MLGs more heavily than less abundant MLGs. λ is an estimation of the probability that two randomly selected genotypes are different. E₅ is a measure of the distribution of genotype abundances. E₅ is equal to one when a population has equally abundant genotypes and closer to zero when it is dominated by a single genotype. H_exp estimates the average heterozygosity. \(\overline{r}_{d}\) and I_A are measures of linkage disequilibrium. Significant disequilibrium is expected due to linkage among loci when a population is clonal, and linkage among loci is not expected in sexually reproducing populations.

The package ape⁵⁸ was used to draw a neighbour-joining dendrogram with Prevosti’s distance⁵⁹. Minimum spanning networks (MSNs) were constructed with the genetic distance of Bruvo⁶⁰ using the package poppr. Discriminant analysis of principal components (DAPC) was plotted using the packages adegenet⁶¹ and ade4⁶².

Data deposit

The transcriptome has been submitted to the National Center for Biotechnology Information (NCBI). BioSample accession: SAMN18713839.

References

Orian, G. Occurrence of Puccinia polysora underwood in the Indian ocean area. Nature 173, 505 (1954).
Article ADS Google Scholar
Brewbaker, J. L., Kim, S. K., So, Y. S. & Logrono M. General resistance in maize to southern rust (Puccinia polysora Underw). Crop Sci. 51, 1393–1409 (2011).
Futrell, M. C., Hookerm, A. L. & Scott, G. E. Resistance in maize to corn rust, controlled by a single dominant gene. Crop Sci. 15, 597–599 (1975).
Article Google Scholar
Sim, T. IV. Southern rust of corn recognized in Kansas. Plant Dis. 64, 500 (1980).
Article Google Scholar
Barker, S. J. Testing of maize hybrids resistant to Puccinia polysora on the Atherton Tableland, Queensland. J. Agric. Anim Sci. 26, 319–327 (1969).
Google Scholar
Rhind, D., Waterston, J. M. & Deighton, F. C. Occurrence of Puccinia polysora Underw. in West Africa. Nature 169, 631 (1952).
Article ADS Google Scholar
Scott, G. E., King, S. B. & Armour, J. W. Jr. Inheritance of resistance to southern corn rust in maize populations. Crop Sci. 24, 268–267 (1984).
Article Google Scholar
Crouch, J. A. & Szabo, L. J. Real-time PCR detection and discrimination of the southern and common corn rust pathogens Puccinia polysora and Puccinia sorghi. Plant Dis. 95, 624–632 (2011).
Article PubMed Google Scholar
Cammack, R. H. Studies on Puccinia polysora underw: II. A consideration of the method of introduction of P. Polysora into Africa. Trans. Br. Mycol. Soc. 42, 27–32 (1959).
Article Google Scholar
Futrell, M. C. Puccinia polysora on maize associated with cropping practice and genetic homogeneity. Phytopathology 65, 1041–1042 (1975).
Article Google Scholar
Rodriguez-Ardon, R., Scott, G. E. & King, S. B. Maize yield losses caused by southern corn rust. Crop Sci. 20, 812–814 (1980).
Article Google Scholar
Scott, G. E. & Futrell, M. C. Southern corn rust. Crops Soils Mag. April-May, 16–18 (1976).
Casela, C. R. & Ferreira, A. S. Variability in isolates of Puccinia polysora in Brazil. Fitopatol. Bras. 4, 414–416 (2002).
Article Google Scholar
Robert, A. L. Host range and races of the corn rusts. Phytopathology 52, 1010–1015 (1962).
Google Scholar
Ryland, A. K. & Storey, H. H. Physiological races of Puccina polysora Underw. Nature 176, 655–656 (1955).
Article ADS Google Scholar
Storey, H. H. & Howland, K. A. Resistance in maize to a third east African race of Puccinia polysora Underw. Ann. Appl. Biol. 60, 297 (1967).
Article Google Scholar
Ullstrup, A. J. Inheritance and linkage of a gene determining resistance in maize American race of Puccinia polysora. Phytopathology 55, 425–430 (1965).
Google Scholar
Chen, C. X. et al. Molecular tagging and genetic mapping of the disease resistance gene RppQ to southern corn rust. Theor. Appl. Genet. 108, 945–950 (2004).
Article CAS PubMed Google Scholar
Ru-Hong, Z., King, S. B. & Scott, G. E. A study of slow rusting of southern rust of corn: preliminary report. USDA Bull, 925 (1983).
Wu, X. J. et al. Geographic and genetic identification of RppS, a novel locus conferring broad resistance to southern corn rust disease in China. Euphytica 205, 17–23 (2015).
Article ADS Google Scholar
Reyes, G. M. An epidemic outbreak of the maize rust in eastern and central Visayas, Philippines. Philipp. J. Agric. 18, 115–128 (1953).
Google Scholar
Von Bülow, J. F. W. As ferrugens (Puccinia sorghi, P. polysora, Physopella zeae) do milho (Zea mays). I. Revisão Bibliográfica. Pesq. Agropec. Bras. 1, 249–262 (1966).
Liu, Y. Y. & Wang, J. Southern corn rust occurred in Hebei province in 1998. Plant Prot. 25, 53–58 (1999).
Google Scholar
Hegedűs, Z. et al. Deep sequencing of the zebrafish transcriptome response to mycobacterium infection. Mol. Immunol. 46, 2918–2930 (2009).
Article PubMed CAS Google Scholar
Liu, J. J. et al. Comparative transcriptomics and RNA-Seq-based bulked segregant analysis reveals genomic basis underlying Cronartium ribicola vcr2 Virulence. Front. Microbiol. 12, 1–17 (2021).
Google Scholar
Rosenkranz, R., Borodina, T., Lehrach, H. & Himmelbauer, H. Characterizing the mouse ES cell transcriptome with Illumina sequencing. Genomics 92, 187–194 (2008).
Article CAS PubMed Google Scholar
Geng, Y. et al. A De novo transcriptome assembly of Ceratopteris richardii provides insights into the evolutionary dynamics of complex gene families in land plants. Genome Biol. Evol. 13, 1–14 (2021).
Article Google Scholar
Wang, Z., Gerstein, M. & Snyder, M. RNA-Seq: A revolutionary tool for transcriptomics. Nat. Rev. Genet. 10, 57–63 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cowger, C., Parks, R. & Kosman, E. Structure and migration in US Blumeria graminis f. sp tritici populations. Phytopathology 106, 295–304 (2016).
Article CAS PubMed Google Scholar
Wang, C. C. et al. Genetic diversity and population structure of Puccinia striiformis f. sp. tritici reveal its migration from central to eastern China. Crop Prot. 128(104974), 0261–2194 (2020).
Google Scholar
Hirayae, K. et al. Genetic variation of southern rust fungus of corn in Japan. Kyushu Pl. Prot. Res. 44, 12–14 (1998).
Article CAS Google Scholar
Unartngam, J., Janruang, P. & To-anan, C. Genetic diversity of Puccinia polysora in Thailand based on inter simple sequence repeat (ISSR) markers analysis. Int. J. Agric. Technol. 7, 1125–1137 (2011).
Google Scholar
Guo, Y. Y. et al. Genetic diversity of Puccinia polysora Underw. in China. Sci. Agric. Sin. 46, 4523–4533 (2013).
CAS Google Scholar
Cardoso, S. D. et al. Efficient isolation of polymorphic microsatellites from high throughput sequence data based on number of repeats. Mar. Genom. 11, 11–16 (2013).
Article Google Scholar
Chen, W., Liu, Y. X. & Jiang, G. F. De novo assembly and characterization of the testis transcriptome and development of EST-SSR markers in the cockroach Periplaneta americana. Sci. Rep. 5, 11144. https://doi.org/10.1038/srep11144 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Nei, M. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics 89, 583–590 (1978).
Article CAS PubMed PubMed Central Google Scholar
Simpson, E. H. Measurement of diversity. Nature 163, 688. https://doi.org/10.1038/163688a0 (1949).
Article ADS MATH Google Scholar
Onda, Y. et al. Transcriptome analysis of thermogenic Arum concinnatum reveals the molecular components of floral scent production. Sci. Rep. 5, 8753. https://doi.org/10.1038/srep08753 (2015).
Article CAS PubMed PubMed Central Google Scholar
Duplessis, S. et al. Obligate biotrophy features unraveled by the genomic analysis of rust fungi. Proc. Natl. Acad. Sci. 108, 9166–9171 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Nemri, A. et al. The genome sequence and effector complement of the flax rust pathogen Melampsora lini. Front. Plant Sci. 5, 98. https://doi.org/10.3389/fpls.2014.00098 (2014).
Article PubMed PubMed Central Google Scholar
Cantu, D. et al. Genome analyses of the wheat yellow(stripe) rust pathogen Puccinia striiformis f. sp. tritici reveal polymorphic and haustorial expressed secreted proteins as candidate effectors. BMC Genom. 14, 270. https://doi.org/10.1186/1471-2164-14-270 (2013).
Article CAS Google Scholar
Zheng, W. et al. High genome heterozygosity and endemic genetic recombination in the wheat stripe rust fungus. Nat. Commun. 4, 2673 (2013).
Article ADS PubMed CAS Google Scholar
Upadhyaya, N. M. et al. Comparative genomics of Australian isolates of the wheat stem rust pathogen Puccinia graminis f. sp. tritici reveals extensive polymorphism in candidate effector genes. Front. Plant Sci. 5, 759. https://doi.org/10.3389/fpls.2014.00759 (2015).
Article PubMed PubMed Central Google Scholar
Wei, W. L. et al. Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genom. 12, 451 (2011).
Article CAS Google Scholar
Wu, J., Cai, C. F., Cheng, F. Y., Cui, H. L. & Zhou, H. Characterisation and development of EST-SSR markers in tree peony using transcriptome sequences. Mol. Breed. 34, 1853–1866 (2014).
Article CAS Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Mao, X., Tao, C. J. G. O. & Wei, L. Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary. Bioinformatics 21, 3787–3793 (2005).
Article CAS PubMed Google Scholar
Götz, S. et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 36, 3420–3435 (2008).
Article PubMed PubMed Central CAS Google Scholar
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: Interactive sequence similarity searching. Nucleic Acids Res. 39, 29–37 (2011).
Article CAS Google Scholar
Thiel, T., Michalek, W., Varshney, R. K. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–422 (2013).
Article CAS Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2017).
Kamvar, Z. N., Tabima, J. F. & Grünwald, N. J. Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2, e281. https://doi.org/10.7717/peerj.281 (2014).
Article PubMed PubMed Central Google Scholar
Shannon, C. E. The mathematical theory of communication. 1963. MD Comput. 14, 306–317 (1997).
CAS PubMed Google Scholar
Stoddart, J. A. & Taylor, J. F. Genotypic diversity: Estimation and prediction in samples. Genetics 118, 705–711 (1988).
Article CAS PubMed PubMed Central Google Scholar
Pielou, E. C. Ecological Diversity. Wiley, New York. ISBN-10: 0471689254 (1975).
Grünwald, N. J., Goodwin, S. B., Milgroom, M. G. & Fry, W. E. Analysis of genotypic diversity data for populations of microorganisms. Phytopathology 93, 738–746 (2003).
Article PubMed Google Scholar
Paradis, E. & Schliep, K. ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35, 526–528 (2019).
Article CAS PubMed Google Scholar
Prevosti, A., Ocaña, J. & Alonso, G. Distances between populations of Drosophila subobscura, based on chromosome arrangement frequencies. Theor. Appl. Genet. 45, 231–241 (1975).
Article CAS PubMed Google Scholar
Bruvo, R., Michiels, N. K., D’Sousa, T. G. & Schulenberg, H. A simple method for calculation of microsatellite genotypes irrespective of ploidy level. Mol. Ecol. 13, 2101–2106 (2004).
Article CAS PubMed Google Scholar
Jombart, T. Adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).
Article CAS PubMed Google Scholar
Dray, S. & Dufour, A. B. The ade4 Package: Implementing the duality diagram for ecologists. J. Stat. Softw. https://doi.org/10.18637/jss.v022.i04 (2007).
Article Google Scholar

Download references

Acknowledgements

We thank National Agro-tech Extension and Service Center, and plant protective stations for collecting disease samples. We thank Brian J. Knaus and Shankar K. Shakya for excellent technical support. This study was supported by the Natural Science Foundation of China (grant 31772101, 31972211).

Author information

Authors and Affiliations

College of Plant Protection, China Agricultural University, Beijing, 100193, People’s Republic of China
Qiuyu Sun, Keyu Zhang, Leifu Li, Jiayu Dong, Yong Luo & Zhanhong Ma
National Agro-Tech Extension and Service Center, Beijing, 100125, People’s Republic of China
Jie Liu & Chong Huang

Authors

Qiuyu Sun
View author publications
You can also search for this author in PubMed Google Scholar
Jie Liu
View author publications
You can also search for this author in PubMed Google Scholar
Keyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chong Huang
View author publications
You can also search for this author in PubMed Google Scholar
Leifu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiayu Dong
View author publications
You can also search for this author in PubMed Google Scholar
Yong Luo
View author publications
You can also search for this author in PubMed Google Scholar
Zhanhong Ma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.Y.S., Y.L., and Z.H.M. conceived and designed the experiments. Q.Y.S. performed the experiments, analysed the data, and wrote the paper. J.L. and C.H. participated in collection of samples of SCR. K.Y.Z., L.F.L. and J. Y. D. helped in propagating pathogens.

Corresponding author

Correspondence to Zhanhong Ma.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sun, Q., Liu, J., Zhang, K. et al. De novo transcriptome assembly, polymorphic SSR markers development and population genetics analyses for southern corn rust (Puccinia polysora). Sci Rep 11, 18029 (2021). https://doi.org/10.1038/s41598-021-97556-1

Download citation

Received: 11 May 2021
Accepted: 25 August 2021
Published: 09 September 2021
DOI: https://doi.org/10.1038/s41598-021-97556-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sequence assembly

Functional annotation

SSR marker detection

Identification of novel polymorphic SSR markers

Population genetic diversity

Population structure

Discussion

Methods

Pathogen materials

RNA extraction

cDNA library construction and Illumina sequencing

Sequence assembly and gene annotation

Polymorphic SSR validation and assessment

Population genetics data analysis

Data deposit

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links