Genome survey sequencing and characterization of simple sequence repeat (SSR) markers in Platostoma palustre (Blume) A.J.Paton (Chinese mesona)

Zheng, Zhao; Zhang, Nannan; Huang, Zhenghui; Zeng, Qiaoying; Huang, Yonghong; Qi, Yongwen

doi:10.1038/s41598-021-04264-x

Article
Open access
Published: 10 January 2022

Zhao Zheng^1,2^na1,
Nannan Zhang¹^na1,
Zhenghui Huang^1,2,
Qiaoying Zeng¹,
Yonghong Huang¹ &
…
Yongwen Qi^1,2

Scientific Reports volume 12, Article number: 355 (2022)

Abstract

Platostoma palustre (Blume) A.J.Paton is an annual herbaceous persistent plant of the Labiatae family. However, genomic data for this plant are lacking, which severely restricts its genetic improvement. In this study, we performed genome survey sequencing of P. palustre and developed simple sequence repeat (SSR) markers based on the resulting sequence. K-mer analysis revealed that the estimated genome size was approximately 1.21 Gb. A total of 15,498 SSR motifs were identified and characterized in this study; among them, dinucleotide and hexanucleotide repeats had the highest and lowest frequencies, respectively. Among the dinucleotide repeat motifs, AT/TA repeat motifs were the most abundant and GC/CG repeat motifs were rather rare, accounting for 44.28% and 0.63%, respectively. Genetic similarity coefficient analysis by the UPGMA method clustered 12 accessions of P. palustre and related species into two subgroups. These results provide helpful information for further research on P. palustre resources and variety improvement.

Introduction

Platostoma palustre (Blume) A.J.Paton, also known as Chinese mesona, is an annual herbaceous persistent plant of the Labiatae (Lamiaceae) family¹. In China, P. palustre is mainly distributed in Taiwan, Zhejiang, Jiangxi, Guangdong, Fujian, and Guangxi provinces². As a traditional Chinese edible and medicinal plant, it contains polysaccharides³, triterpenoid acids^4,5, flavonoids⁶, phenolic compounds (such as epicatechins⁷ and caffeic acid⁸), and trace elements⁸. Wang et al.⁹ isolated five new caffeic acid oligomers, as well as four known analogues, and one compound showed significant in vitro antiviral activity against respiratory syncytial virus. A study by Song et al.¹⁰ showed that an extract of P. palustre had antioxidant and α-glucosidase inhibitory activities. P. palustre is widely used as a raw material for herbal tea, Guiling paste, and Chinese medicine. The caffeic acid extracted from P. palustre was proven to have antioxidative activity¹¹. Moreover, it was also reported that P. palustre polysaccharide (MP) treatment can increase immunomodulatory activity in mice¹². Water and alcohol extracts of P. palustre were reported to be effective in ameliorating hypertension¹³ and hyperglycaemia¹⁴ in rats and can inhibit the growth of Escherichia coli and Salmonella^15,16.

To date, research on P. palustre has mainly focused on component extraction, activity, and development for food, and few studies on the genetic diversity of germplasm resources have been reported because of the limited genetic and genomic resources for this species. The concentrations of polysaccharides, triterpenoid acid, flavonoids, and other compounds of different P. palustre varieties vary widely, which directly affects their palatability and use in production¹⁷. Hence, variety identification is very important for P. palustre.

For the identification of P. palustre, morphological features such as leaf colour, tillering number, and flowering time have been employed, but this method relies on the accumulated experience of the appraiser, is vulnerable to environmental and subjective factors, and is time-consuming, laborious, and inaccurate. Therefore, it is very important to establish a set of rapid, accurate, and economical identification technologies to promote the utilization of P. palustre. Simple sequence repeat (SSR) markers are a powerful and cost-effective molecular tool for quantifying genetic variation in plants due to their abundance in the genome, polymorphism, co-dominance and high reproducibility¹⁸, and have been developed for many plant species¹⁹, especially those used in traditional Chinese herbal medicine, such as watermelon²⁰, Psidium²¹, Bupleurum falcatum²², and Ligusticum chuanxiong²³. These SSR markers have been broadly applied for genetic purity detection (for identifying off-types and selfed females in many hybrid seeds)²⁰, species identification²¹, haplotype determination, quantitative trait locus (QTL) discovery, marker-assisted selection (MAS) for desired traits and breeding, cultivar DNA fingerprinting, genome-wide association studies (GWASs), and harnessing heterosis^19,24. It is almost certain that, when developed, SSR markers can be used for P. palustre identification and for accelerating breeding. There have been a few studies on SSR marker development for Chinese herbal medicines; however, few studies on P. palustre molecular markers have been reported. Thus, it is urgent to develop SSR markers for P. palustre.

With advances in next-generation sequencing (NGS) technology, genome survey sequencing has proven to be an important and cost-effective strategy for exploring genomic information and developing molecular markers for plants²⁵, especially for non-model plants for which no genetic information is known¹⁹. In this study, genome survey sequencing was employed to investigate the genome of P. palustre. We first mined SSRs from genome survey sequences of P. palustre and validated 90 SSRs to understand the genetic relationships among six P. palustre varieties and six other Labiatae species. We aimed to provide a reference for the genotyping, breeding, germplasm collection, and management of P. palustre.

Results

Genome sequencing and estimation of genome size

Paired-end sequencing with 270-bp short inserts of P. palustre was conducted using genomic DNA from sample MX 1. A total of 54.99 Gb of raw data was generated by the Illumina HiSeq sequencing platform, which was approximately 45.37-fold the estimated genome size. All reads were used for k-mer analysis, and abnormal k-mers were removed to calculate genome size, the repeat rate, and heterozygosity. We used 270-bp library data to construct a k-mer distribution map with k = 19 (Fig. 1). For the 19-mer frequency distribution, the peak of the depth distribution was approximately 38. K-mers with a depth of more than twice that of the main peak (more than 76) can be attributed to repeated sequences. Moreover, k-mers at half the depth of the main peak (near 19) represent heterozygous sequences.

The sequencing data yielded a total of 48,380,469,234 k-mers. When k-mers with depth abnormalities were removed, 46,608,868,033 k-mers remained. These values were further used to estimate genome characteristics. The genome size was estimated to be 1.21 Gbp, using the following formula: genome size = k-mer count/peak of the k-mer distribution. To ensure the accuracy of the genome size prediction, GenomeScope2 and findGSE software with different k-mer sizes (k = 21, 23, 25, and 27) as well as MGSE were used for genome size prediction. The genome sizes predicted by the different tools with different parameters were in the range of 1.3 Gb to 1.4 Gb (Supplementary Table 1). Based on the k-mer distribution, almost 70.62% of the sequence was repeated. The heterozygosity rate was as low as 0.33%; thus, there was no obvious heterozygosity. The results suggest that the genome of P. palustre is highly complex and has a high degree of repetition.
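
The genome-size formula above can be illustrated with a short calculation. This is a minimal sketch rather than the authors' pipeline: the function name `estimate_genome_size` is hypothetical, the k-mer total assumes the filtered count reads as 46,608,868,033, and the peak depth of 38 is the approximate value quoted for the 19-mer distribution.

```python
def estimate_genome_size(kmer_count, peak_depth):
    """Genome size (bp) = total k-mer count / depth at the main k-mer peak."""
    if peak_depth <= 0:
        raise ValueError("peak_depth must be positive")
    return kmer_count / peak_depth


# Assumed inputs: filtered 19-mer total and the approximate main-peak depth.
filtered_kmers = 46_608_868_033
peak_depth = 38

size_bp = estimate_genome_size(filtered_kmers, peak_depth)
print(f"Estimated genome size: {size_bp / 1e9:.2f} Gb")
```

With the rounded peak depth of 38, the result lands slightly above the reported 1.21 Gbp, which would be consistent with the true peak depth being marginally higher than 38.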

The sequencing data were assembled de novo using SOAPdenovo software. A total of 6,968,859 raw contigs were obtained. Unique contigs for scaffold generation were obtained after blasting reads and contigs. Gaps resulting from sequence repetition were filled with paired-end reads. Consequently, the genome was assembled into a total of 5,822,179 scaffolds with a total length of 1,374,372,218 bp. In total, 401,762,775 raw reads were obtained. Among them, 393,971,228 (98.06%) reads were properly mapped to the assembled sequence by BWA-MEM software. The scaffold N50 was 191 bp and the L50 was 1,359,845 in the SOAPdenovo assembly, as shown in Table 1. The raw sequencing data have been submitted to the NCBI database (accession number: PRJNA706453). As the N50 value was very low for contigs as well as scaffolds, we also performed assembly with another software programme, SPAdes. The results showed that the contig N50 was 193 bp, which was consistent with the results from SOAPdenovo. The low quality of the assembled sequence might have been due to the complexity of the P. palustre genome.
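
For readers unfamiliar with the N50 and L50 statistics quoted above, the following sketch shows how they are conventionally computed from a list of sequence lengths: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly, and L50 is the number of sequences in that set. The function name `n50_l50` and the toy lengths are illustrative assumptions, not data from this study.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first sequence that pushes the running sum past half the
        # assembly defines N50; its rank is L50.
        if running >= half_total:
            return length, count
    raise ValueError("lengths must be non-empty")


# Invented toy assembly with five sequences.
toy_lengths = [100, 200, 300, 400, 500]
print(n50_l50(toy_lengths))
```

An N50 of about 190 bp, as reported here, means that half of the assembled bases sit in sequences barely longer than a single 150-bp read, which is why the assembly is described as low quality.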

Table 1 Contigs and scaffolds of P. palustre.

Identification and characterisation of SSR motifs

In total, 15,498 SSRs were identified from the P. palustre genome survey results (Supplementary Table 2). The identified SSR motifs included dinucleotide (71.96%), trinucleotide (26.26%), tetranucleotide (1.52%), pentanucleotide (0.19%), and hexanucleotide (0.07%) repeats, as shown in Fig. 2a. AT/TA was the most abundant type of dinucleotide repeat (4939 repeats, 44.28% of all dinucleotide repeats). The AG/CT content was 43.54% (4856 repeats), and the very rare type GC/CG accounted for only 0.63% (70 repeats) (Fig. 2b). In the case of trinucleotides, the most abundant type was ATT/AAT (1183 repeats, 29.07% of all trinucleotide repeats), followed by ATG/CAT (17.47%, 711 repeats) and AAC/GTT (14.40%, 586 repeats) (Fig. 2c). Among the tetra-, penta-, and hexanucleotide repeats, the most abundant type was TTTA/TAAA (48.94%, 115 repeats). Furthermore, 99.99% of di-, 99.92% of tri-, 100% of tetra-, 79.31% of penta- and 90.00% of hexanucleotide repeats were shorter than 30 bp.

SSR motif analysis of P. palustre revealed repeat frequencies of 6–15, 5–10, and 5–6 for dinucleotide, trinucleotide, and hexanucleotide repeats, respectively. The repeat frequencies for both tetra- and pentanucleotides were in the range of 5–7, as shown in Fig. 3. The results further revealed the highest frequency for motifs with 6 tandem repeats (37.11%, 5751), followed by motifs with 5 tandem repeats (18.93%, 2934), 7 tandem repeats (18.07%, 2801), and 8 tandem repeats (11.60%, 1789).
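
Paired labels such as AT/TA, AG/CT, and ATT/AAT suggest that motifs were pooled when they differ only by starting position or by strand. The sketch below shows one common way to do this, by mapping each motif to the alphabetically smallest string among its rotations and the rotations of its reverse complement. This pooling rule is an assumption about how the classes were formed, and the helper names are hypothetical.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(motif):
    """Reverse complement of a DNA motif written with A, C, G, T."""
    return motif.translate(COMPLEMENT)[::-1]


def rotations(motif):
    """All cyclic shifts of the motif (e.g. ATT -> ATT, TTA, TAT)."""
    return [motif[i:] + motif[:i] for i in range(len(motif))]


def canonical_motif(motif):
    """Smallest string among rotations of the motif and of its reverse complement."""
    motif = motif.upper()
    return min(rotations(motif) + rotations(reverse_complement(motif)))


# Invented motif observations, pooled into canonical classes.
observed = ["AT", "TA", "AG", "CT", "TC", "ATT", "AAT", "GC", "CG"]
class_counts = Counter(canonical_motif(m) for m in observed)
print(class_counts)
```

Under this rule, AG, CT, and TC fall into one class, as do ATT and AAT, which matches the way the motif classes are labelled in the text.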

SSR marker verification

A subset of 90 SSR markers representing each repeat class was randomly selected for validation through PCR amplification (Supplementary Table 3). Among the selected markers, the di-, tri-, tetra-, penta-, and hexanucleotide repeat classes were represented by 8, 6, 37, 29, and 10 SSR primer pairs, respectively. The results showed that 79 SSRs (87.78%) were successfully amplified, and 37 of these (46.83%) demonstrated polymorphic banding patterns.

The thirty-seven SSR markers were further investigated among P. palustre and related Labiatae genera including Mentha haplocalyx, M. spicata, Prunella vulgaris, Salvia miltiorrhiza, Scutellaria indica, and S. barbata (Supplementary Table 4). A total of 685 fragments were generated through the PCR amplification of the 12 accessions, with a mean of 18.5 alleles per marker locus (Fig. 4 and Supplementary Table 4; the full-length gels are presented in Supplementary Fig. 1). Among the tested SSRs, 10 were specifically amplified in P. palustre, while the remaining 27 showed varied levels of cross-transferability to other related taxa. According to the clustering analysis performed with 27 SSR markers (Fig. 5), 12 accessions were divided into two groups. The 6 P. palustre accessions were clustered into one group, and the 6 accessions of related Labiatae genera were clustered into another group.

Discussion

P. palustre is an important traditional Chinese medicine and edible plant resource with heat-clearing and detoxifying functions. The leaves, roots, and stems of P. palustre have been widely found to contain gel mainly consisting of cortex phellodendri, benzoic acid, ursolic acid, organic acids, flavones, and catechins^3,4,5. Because food and medicinal products of P. palustre have different requirements in terms of quality, it is necessary to breed varieties with different characteristics through genetic improvement. In addition, adulterant plants are common in P. palustre collections. Thus, establishing an accurate and rapid method by molecular markers to identify P. palustre and related species is important for the genetic identification and improvement of P. palustre. Shi et al.² analysed P. palustre and its adulterants using the internal transcribed spacer 2 (ITS2) region and found that the ITS2 region, as a DNA barcode, could accurately and effectively distinguish P. palustre from its adulterants, including Isodon serra Maxim. However, the study showed that there was no difference in the ITS2 region among the 26 P. palustre accessions from Guangxi province, Guangdong province, Jiangxi province, Fujian province, and Hainan province in China. The results showed that the ITS2 region is not suitable for identifying P. palustre cultivars. Therefore, it is necessary to develop alternative molecular markers for genetic resource evaluation and improvement.

A genome survey of P. palustre was applied for the first time in this study, with the aim of identifying markers for P. palustre and understanding the genetic diversity and relationships among cultivars and related species. According to the k-mer analysis of the genome survey sequences, the genome of P. palustre is approximately 1.21 Gbp and is complex with a low level of heterozygosity (0.33%). The genome of P. palustre is smaller than that of its related species; for instance, the genome of S. miltiorrhiza is 8.19 Gbp²⁶. However, it is much larger than that of other dicotyledons, such as buckwheat (497 Mb)²⁷, shantung maple (529 Mb)²⁸ and jute (338 Mb)²⁹. In plants, there is a positive correlation between genome size and repetitive elements³⁰. For example, the repetitive element content of P. palustre is 70.62%, which is higher than that of shantung maple (529 Mb, 48.8%) and lower than that of Radix bupleuri (2.11 Gb, 83.89%)³¹.

A draft reference de novo assembly with sequencing data was used to explore SSRs. A total of 54.99 Gb of clean reads were generated and de novo assembled into 6,968,859 contigs. Due to the complex genome of P. palustre, the contig N50 value was low. SSRs with high polymorphism and codominance have been used to evaluate genetic resources and in a variety of improvement programs^32,33. In this study, a total of 15,498 SSRs were identified in P. palustre using genome survey sequencing. Morgante et al. claimed that there was a negative correlation between genome size and SSR distribution frequency³⁴. However, the SSR distribution frequency in this genome survey was estimated to be 12.80 SSRs per Mb, which is lower than that in R. bupleuri (43.11 SSRs per Mb)³¹ and buckwheat (49.30 SSRs per Mb)²⁷. Because the R. bupleuri genome (2.11 Gb) is larger than that of P. palustre yet has a higher SSR frequency, P. palustre did not follow this rule. Among the five tandem repeat types of SSRs in P. palustre, di- and trinucleotide repeats accounted for 98.22% of the total SSRs, while tetra-, penta-, and hexanucleotide repeat SSRs together accounted for only 1.78%. In P. palustre, we found that AT/TA (44.28%) and ATT/AAT (29.07%) were frequent among the di- and trinucleotide repeat SSRs; these percentages are different not only from those in sorghum³⁵ (AT/AT, 54.4% and CCG/CGG, 18.1%), rice³³ (AG/CT, 41.9% and CCG/CGG, 47.5%), and buckwheat²⁷ (AT/AT, 78.60% and AAT/TTA, 31.83%) but also from those in the majority of grasses (GA/TC dimers, A/T monomers, and GCG/CGC trimers were the most abundant SSR types), with some exceptions³⁴. Interestingly, in a study of 16 tree species, a similar trend was observed, where AT/TA motifs were found to be the most prevalent dimers, followed by AG/TC, and AAT/TTA were the most frequent trimers³⁶.
In summary, SSR types have different distribution patterns among species at a large evolutionary scale^37,38, and the distribution patterns of closely related species and even of different parts of the same species also differ^39,40. The reason for the high polymorphism at these loci needs much more exploration.
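
The SSR frequency quoted above can be checked with simple arithmetic. The text does not state which genome length was used as the denominator, so the sketch below tries both candidates given in this article; treating the k-mer-based estimate of 1.21 Gb as the denominator is an assumption, and the function name `ssr_density_per_mb` is hypothetical.

```python
def ssr_density_per_mb(ssr_count, genome_size_bp):
    """Number of SSRs per megabase of sequence."""
    return ssr_count / (genome_size_bp / 1e6)


SSR_COUNT = 15_498
ESTIMATED_SIZE_BP = 1.21e9          # k-mer-based genome size estimate
ASSEMBLED_SIZE_BP = 1_374_372_218   # total scaffold length of the assembly

density_estimated = ssr_density_per_mb(SSR_COUNT, ESTIMATED_SIZE_BP)
density_assembled = ssr_density_per_mb(SSR_COUNT, ASSEMBLED_SIZE_BP)
print(f"per Mb (estimated size): {density_estimated:.2f}")
print(f"per Mb (assembled size): {density_assembled:.2f}")
```

The value based on the estimated genome size is the one that comes close to the reported figure of about 12.8 SSRs per Mb, which suggests that the estimate rather than the assembly length was used.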

As high variability in repeat unit number is observed, SSRs are highly polymorphic and are suitable for use as specific markers for different species/genera and germplasm characterization. In this study, we identified 64 SSRs with polymorphisms among the P. palustre accessions. By using 37 of the 64 SSRs, 395 specific fragments of P. palustre, accounting for 58.96% of all fragments, were detected. The results showed that there was significant genetic differentiation between P. palustre and related Labiatae species. The high polymorphism and specificity of the SSR markers developed in this research suggest that these SSRs could be further used in genetic linkage mapping, MAS, and the identification of genuine hybrids between cultivated P. palustre varieties and the other 6 related Labiatae genera.

This study revealed genomic information for P. palustre and unique SSR loci, providing valuable information for follow-up studies on cultivar identification, improvement and genetic resource management. However, because of the current absence of a reference genome sequence for this species, the genome location and genome coverage of these SSR markers are unknown. In the future, as more genome information for P. palustre is revealed, more molecular markers could be developed to accelerate the genetic improvement of P. palustre.

Methods

Plant materials

The plant materials comprised six P. palustre accessions (MX 1, TW 1, ZC 1, ZC 2, XU 1, and XU 2) and six accessions of related Labiatae species, including M. haplocalyx, M. spicata, P. vulgaris, S. miltiorrhiza, S. indica, and S. barbata. Of the six P. palustre accessions, MX 1 and TW 1 were from Fujian province, China, while ZC 1, ZC 2, XU 1 and XU 2 were from Guangdong province, China.

Library construction, genome sequencing and genome character estimation

Total genomic DNA was isolated from young leaf tissue of all plants following a modified CTAB procedure⁴¹, and the quality was evaluated by 1% agarose gel electrophoresis. The concentrations of DNA were checked by a BioPhotometer (Eppendorf, Germany). The most widely planted P. palustre accession, MX 1, was selected for the genome survey.

The genomic DNA was broken into fragments of approximately 270 bp by ultrasonic vibration. The small-insert fragment library was constructed from randomly fragmented genomic DNA following the manufacturer’s instructions (NEBNext® Ultra DNA Library Prep Kit for Illumina). Adapter ligation and DNA cluster preparation were performed, followed by sequencing using an Illumina Genome Analyzer (Illumina HiSeq 2000, USA) according to the manufacturer’s standard protocol.

In total, four paired-end sequencing libraries with insert sizes of approximately 270 bp were constructed, and paired-end reads of 150 bp were sequenced using the Illumina HiSeq 2100 platform. The quality control and pre-processing of sequencing raw reads were carried out using the fastp software⁴². Raw reads were filtered by Trimmomatic software (v0.39; http://www.usadellab.org/cms/?page=trimmomatic) to remove low quality reads and adaptor sequences. GC distribution analysis was performed by in-house Perl code. After filtering, clean reads were obtained and used for the following analyses. K-mer (k = 19) analysis was performed, and the abnormal k-mers were filtered out for subsequent analysis. The rate of heterozygosity and the repeat rate were estimated according to k-mer analysis⁴³. GenomeScope2⁴⁴ and findGSE⁴⁵ with different k-mer sizes (k = 21, 23, 25, and 27) as well as MGSE software⁴⁶ were employed to predict genome size. The genome size was estimated with the formula: genome size = k-mer count/peak k-mer depth⁴⁷.

Genome assembly and SSR marker development

After removing the adapters, raw sequencing data were further cleaned for downstream analysis by filtering out reads containing low-quality bases, reads < 100 bp in length, and duplicated reads. The clean reads of all the libraries were assembled into scaffolds and contigs using SOAPdenovo v2 (http://soap.genomics.org.cn/soapdenovo.html) software. SSRs in the DNA sequences were identified using MIcro-SAtellite (MISA) software (version 1.0)⁴⁸. SSR identification was based on two parameters. First, minimum repeat numbers of 6, 5, 5, 5, and 5 were adopted for the identification of di-, tri-, tetra-, penta-, and hexanucleotide repeats, respectively. Second, two SSRs separated by an interruption of less than 100 bp were defined as a compound SSR. Primer Premier V5.0 software (Premier Biosoft International, Palo Alto, CA) was used for primer design with the following parameters: 100–300 bp for final product length, 18–25 bp for primer size (with an optimum size of 20 nucleotides), 35–70% for GC content, and 55–65 °C for annealing temperature.
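
The minimum repeat thresholds described above can be illustrated with a small regular-expression scan. This is a simplified sketch and not the MISA implementation: it finds only perfect repeats, does not merge neighbouring SSRs into compound repeats, and the names `MIN_REPEATS`, `is_primitive`, and `find_ssrs` are hypothetical.

```python
import re

# Minimum repeat counts per motif length (di- to hexanucleotide),
# taken from the thresholds stated in the text.
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_primitive(motif):
                hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


# Invented example sequence containing an (AT)7 and an (AAT)5 repeat.
example = "GGC" + "AT" * 7 + "CCGGC" + "AAT" * 5 + "G"
for start, motif, count in find_ssrs(example):
    print(start, motif, count)
```

The `is_primitive` check prevents a run such as poly-A from being reported as a dinucleotide or trinucleotide SSR, which keeps the class counts comparable across motif lengths.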

Verification of SSR markers and genetic similarity analysis

A total of six accessions of P. palustre and six related species, including M. haplocalyx, M. spicata, P. vulgaris, S. miltiorrhiza, S. indica, and S. barbata, were used for the verification of SSR markers developed by genome survey sequencing. In total, 90 SSR markers were selected to verify the quality of SSR markers and polymorphisms in the six accessions of P. palustre. Thirty-seven SSR markers were used to analyse the genetic similarity among the 12 accessions of P. palustre and related species. PCR was performed using EasyTaq® DNA Polymerase (TransGen Biotech, China) with the following programme: 94 °C for 5 min (initial denaturation) followed by 35 cycles of 94 °C for 30 s, 58–61 °C for 30 s, and 72 °C for 1 min, with a final extension at 72 °C for 10 min and a hold at 4 °C. The products obtained from the PCR were analyzed with 7% polyacrylamide gel electrophoresis (PAGE) and detected by staining with AgNO₃ solution. Clear and strong allelic fragments in the same horizontal position were scored manually as 0 (absent) or 1 (present), and the number of alleles (Na), effective number of alleles (Ne), percentage of polymorphic loci (PIC) and expected heterozygosity were calculated using GenAlEx 6.5^49,50. The genetic similarity coefficients of these clones were calculated and cluster analysis was performed based on the neighbor-joining method using the pvclust R package⁵¹.
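
To make the 0/1 band scoring concrete, the sketch below computes pairwise similarity from presence/absence profiles. The text does not name the similarity coefficient that was used, so the Jaccard coefficient here is an assumption chosen as one common option for dominant band data; the accession names and band scores are invented, and the clustering step performed in R is not reproduced.

```python
def jaccard_similarity(a, b):
    """Shared bands divided by bands present in at least one of the two profiles."""
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return shared / either if either else 0.0


def similarity_matrix(profiles):
    """Pairwise similarity for a dict of {accession: [0/1 band scores]}."""
    names = list(profiles)
    return {
        (p, q): jaccard_similarity(profiles[p], profiles[q])
        for p in names
        for q in names
    }


# Invented band scores for three hypothetical accessions at six band positions.
toy_profiles = {
    "acc_A": [1, 1, 0, 1, 0, 0],
    "acc_B": [1, 1, 0, 0, 0, 1],
    "acc_C": [0, 0, 1, 0, 1, 1],
}
sim = similarity_matrix(toy_profiles)
for pair, value in sim.items():
    print(pair, round(value, 2))
```

A matrix of this kind, converted to distances as one minus similarity, is the usual input to hierarchical or neighbor-joining clustering.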

References

Santoso, H. Platostoma palustre (Blume) A.J.Paton Lamiaceae. In Ethnobotany of the Mountain Regions of Southeast Asia. Ethnobotany of Mountain Regions (ed. Franco, F. M.) (Springer, 2021).
Google Scholar
Shi, Y. H. et al. Identification of Herbal tea ingredient Mesona chinensis and its adulterants using ITS2 barcode. Chin. J. Pharm. 50, 1282–1285 (2015).
CAS Google Scholar
Huang, L. et al. Effect of highpressure microfluidization treatment on the physicochemical properties and antioxidant activities of polysaccharide from Mesona chinensis Benth. Carbohydr. Polym. 200, 191–199 (2018).
CAS PubMed Google Scholar
Zheng, L. et al. The analysis of natural acid-benzoic acid from Mesona chinensis benth by HPLC. China Food Addit. 5, 206–213 (2013).
Google Scholar
Liu, B. R. Determination of ursolic acid in Mesona chinensis Benth by HPLC. J. Foshan Univ. 6, 13–15 (2008).
Google Scholar
Liu, Z. W., Wu, H. M. & Zhang, C. Enzymatic extraction of flavonoids from Mesona chinensis Benth. Lishizhen Med. Med. Res. 11, 2903–2904 (2010).
Google Scholar
Qiu, T., Lin, X. C. & Wang, B. Y. Identification of epicatechin from Mesona chinensis Benth. Natl. Process. Res. Dev. 5, 798–800 (2010).
Google Scholar
Hung, C. Y. & Yen, G. Antioxidant activity of phenolic compounds isolated from Mesona procumbens Hemsl. J. Agric. Food Chem. 50, 2993–2997 (2002).
CAS PubMed Google Scholar
Wang, Z. Q. et al. Caffeic acid oligomers from Mesona chinensis and their In Vitro antiviral activities. Fitoterapia 144, 104603 (2020).
CAS PubMed Google Scholar
Song, X. et al. Study on antioxidant activity and inhibitory effect of α-glucosidase of different polar extracts of Mesona chinensis. Herald Medic. 39, 286–291 (2020).
Google Scholar
Qin, L. H. et al. Anti-anoxic constituents from Mesona chinensis Benth. J. Shenyang Pharm. Univ. 23(10), 633–636 (2006).
CAS Google Scholar
Huang, L. et al. Mesona chinensis Benth polysaccharides protect against oxidative stress and immunosuppression in cyclophosphamide-treated mice via MAPKs signal transduction pathways. Int. J. Biol. Macromol. 152, 766–774 (2020).
CAS PubMed Google Scholar
Yeh, C. T. et al. Antihypertensive effects of Hsian-tsao and its active compound in spontaneously hypertensive rats. J. Nutr. Biochem. 20, 866–875 (2009).
CAS PubMed Google Scholar
Liu, Y. et al. Hypoglycemic effect and acute toxicity test of Mesona chinensis Benth. J. Fuzhou Gen. Hosp. 12(4–5), 266–267 (2005).
Google Scholar
Liu, F. L. & Feng, C. L. Study on substituting antibiotics in duck production with Mesona chinensis Benth. Chin. J. Vet. Med. 2, 7–10 (2009).
Google Scholar
Liu, F. L. & Feng, C. L. The in vitro bacteriostasis test on avian Escherichia coli of Herba Herba. Guangdong J. Anim. Vet. Sci. 33(6), 17–43 (2008).
Google Scholar
Li, X. H., Li, Y. J., Huang, R. S., Jiang, M. L. & Wang, H. H. SCoT an ISSR analysis of genetic diversity of Mesona chinensis. Southwest China J. Agric. Sci. 25, 1834–1840 (2012).
Google Scholar
Vieira, M. L., Santini, L., Diniz, A. L. & Munhoz, C. Microsatellite markers: what they mean and why they are so useful. Genet. Mol. Biol. 39(3), 312–328 (2016).
PubMed PubMed Central Google Scholar
Sima, T. et al. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules 23(2), 399–399 (2018).
Google Scholar
Lu, X. et al. Identification of high-efficiency SSR markers for assessing watermelon genetic purity. J. Genet. 97(5), 1295–1306 (2018).
CAS PubMed Google Scholar
Tuler, A. C. et al. SSR markers: a tool for species identification in Psidium (myrtaceae). Mol. Biol. Rep. 42(11), 1501–1503 (2015).
CAS PubMed Google Scholar
Zhu, C. R. et al. Genome survey analysis and SSR loci mining of Bupleurum falcatum. Zhongguo Zhong Yao Za Zhi 44(18), 3960–3966 (2019).
PubMed Google Scholar
Yuan, C. et al. EST-SSR identification, markers development of Ligusticum chuanxiong based on Ligusticum chuanxiong transcriptome sequences. Zhongguo Zhong Yao Za Zhi 42(17), 3332–3340 (2017).
PubMed Google Scholar
Tabkhkar, N., Rabiei, B., Lahiji, H. S. & Chaleshtori, M. H. Genetic variation and association analysis of the ssr markers linked to the major drought-yield QTLs of rice. Biochem. Genet. 56(4), 356–374 (2018).
CAS PubMed Google Scholar
Davey, J. et al. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat. Rev. Genet. 12, 499–510 (2011).
CAS ADS PubMed Google Scholar
Xu, H. et al. Analysis of the genome sequence of the medicinal plant Salvia miltiorrhiza. Mol. Plant. 9, 949–952 (2016).
CAS PubMed Google Scholar
Hou, S. et al. Genetic diversity of buckwheat cultivars (Fagopyrum tartaricum Gaertn.) assessed with SSR markers developed from genome survey sequences. Plant Mol. Biol. Rep. 34, 233–241 (2016).
Google Scholar
Wang, R. K. et al. Genome survey sequencing of Acer truncatum Bunge to identify genomic information, simple sequence repeat (SSR) markers and complete chloroplast genome. Forest 10(2), 87 (2019).
Google Scholar
Yao, J. Y. et al. Evaluation and characteristic analysis of SSRs from the whole genome of jute (Corchorus capsularis). Acta Agron. Sin. 45(1), 10–17 (2019).
Google Scholar
Wang, C. et al. Genome survey sequencing of purple elephant grass (Pennisetum purpureum Schum ‘Zise’) and identification of its SSR markers. Mol. Breed. 38(7), 1–10 (2018).
Google Scholar
Zhu, C. R. et al. Genome survey analysis and SSR loci mining of Bupleurum falcatum. China J. Chin. Mater. Med. 44(18), 3960–3966 (2019).
Google Scholar
Tautz, D. Hypervariabilty of simple sequences as a general source of polymorphic DNA markers. Nucleic Acids Res. 17, 6463–6471 (1989).
CAS PubMed PubMed Central Google Scholar
McCouch, S. R. et al. Microsatellite marker development, mapping and applications in rice genetics and breeding. Plant Mol. Biol. 35, 89–99 (1997).
CAS PubMed Google Scholar
Morgante, M. et al. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat. Genet. 30(2), 194–200 (2002).
CAS PubMed Google Scholar
Sonah, H. et al. Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium. PLoS ONE 6, e21298 (2011).
CAS ADS PubMed PubMed Central Google Scholar
Xia, X. et al. Using the Genome-wide analysis of SSR and ILP markers in trees: diversity profiling, alternate distribution, and applications in duplication. Sci. Rep. 7, 17902 (2017).
ADS PubMed PubMed Central Google Scholar
Srivastava, S. et al. Patterns of microsatellite distribution across eukaryotic genomes. BMC Genom. 20(1), 153 (2019).
Google Scholar
Tóth, G., Gáspári, Z. & Jurka, J. Microsatellites in different eukaryotic genomes survey and analysis. Genome Res. 10, 967–981 (2000).
PubMed PubMed Central Google Scholar
Wang, X. T. et al. Comparative analyses of simple sequence repeats SSRs in 23 mosquito species genomes: Identification, characterization and distribution (Diptera: Culicidae). Insect Sci. 26, 606–619 (2018).
Google Scholar
Xu, Y. et al. Characterization of perfect microsatellite based on genome-wide and chromosome level in Rhesus monkey (Macaca mulatta). Gene 592, 269–275 (2016).
CAS PubMed Google Scholar
Saghi Maroof, M. A., Soliman, K. M., Jorgensen, A. R. & Allard, R. W. Ribisinal DNA space length polymorphism in barley: mendelian inheritance, chromosomal location and population dynamics. Proc. Natl. Acad. Sci. USA 81, 8014–8018 (1984).
ADS Google Scholar
Chen, S. et al. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
PubMed PubMed Central Google Scholar
Zeng, Q. et al. Genome survey sequence and the development of simple sequence repeat (SSR) markers in Erianthus arundinaceus. Sugar Tech https://doi.org/10.1007/s12355-020-00872-5 (2020).
Article Google Scholar
Vurture, G. W. et al. GenomeScope: Fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
CAS PubMed PubMed Central Google Scholar
Sun, H., Ding, J., Mathieu, P. & Korbinian, S. findGSE: Estimating genome size variation within human and Arabidopsis using k-mer frequencies. Bioinformatics 34, 550–557 (2018).
CAS PubMed Google Scholar
Shi, J. P. et al. Chromosome conformation capture resolved near complete genome assembly of broomcorn millet. Nat. Commun. 10(1), 464 (2019).
CAS ADS PubMed PubMed Central Google Scholar
Pucker, B. Mapping-based genome size estimation. Boas Pucker. bioRxiv https://doi.org/10.1101/607390 (2019).
Article Google Scholar
Thiel, T. et al. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106(3), 411–422 (2003).
CAS PubMed Google Scholar
Peakall, R. & Smouse, P. E. GenAlEx 6.5: Genetic analysis in Excel Population genetic software for teaching and research—An update. Bioinformatics 28, 2537–2539 (2012).
Peakall, R. & Smouse, P. E. GENALEX 6: Genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 6, 288–295 (2006).
Suzuki, R. & Shimodaira, H. Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics 22, 1540–1542 (2006).

Acknowledgements

We are grateful to Xiangbo Zhang, Xiaomin Feng, and the staff at INSI, GAAS, for critical reading and revision of the manuscript. This work was supported by the GDAS Project of Science and Technology Development (2019GDASYL-0104013; 2020GDASYL-20200302005), the National Natural Science Foundation of China (32072027), and the Science and Technology Planning Project of Guangdong Province, China (2019B020238001).

Author information

These authors contributed equally: Zhao Zheng and Nannan Zhang.

Authors and Affiliations

Guangdong Sugarcane Genetic Improvement Engineering Center, Institute of Bioengineering, Guangdong Academy of Sciences, Guangzhou, 510316, China
Zhao Zheng, Nannan Zhang, Zhenghui Huang, Qiaoying Zeng, Yonghong Huang & Yongwen Qi
Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
Zhao Zheng, Zhenghui Huang & Yongwen Qi

Contributions

Y.Q. and Q.Z. designed the experiments; N.Z. and Y.H. collected plant materials; N.Z., Z.Z. and Z.H. performed the SSR experiments and analyzed the data; the whole genome sequencing assembly was performed by Q.Z.; Y.Q., N.Z. and Z.Z. drafted this manuscript.

Corresponding author

Correspondence to Yongwen Qi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zheng, Z., Zhang, N., Huang, Z. et al. Genome survey sequencing and characterization of simple sequence repeat (SSR) markers in Platostoma palustre (Blume) A.J.Paton (Chinese mesona). Sci Rep 12, 355 (2022). https://doi.org/10.1038/s41598-021-04264-x

Received: 07 December 2020
Accepted: 30 November 2021
Published: 10 January 2022
DOI: https://doi.org/10.1038/s41598-021-04264-x
