Genic microsatellite marker characterization and development in little millet (Panicum sumatrense) using transcriptome sequencing

Desai, Hiral; Hamid, Rasmieh; Ghorbanzadeh, Zahra; Bhut, Nishant; Padhiyar, Shital M.; Kheni, Jasminkumar; Tomar, Rukam S.

doi:10.1038/s41598-021-00100-4

Download PDF

Article
Open access
Published: 18 October 2021

Genic microsatellite marker characterization and development in little millet (Panicum sumatrense) using transcriptome sequencing

Hiral Desai¹^na1,
Rasmieh Hamid²^na1,
Zahra Ghorbanzadeh³,
Nishant Bhut¹,
Shital M. Padhiyar¹,
Jasminkumar Kheni¹ &
…
Rukam S. Tomar⁴

Scientific Reports volume 11, Article number: 20620 (2021) Cite this article

2479 Accesses
18 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Little millet is a climate-resilient and high-nutrient value plant. The lack of molecular markers severely limits the adoption of modern genomic approaches in millet breeding studies. Here the transcriptome of three samples were sequenced. A total of 4443 genic-SSR motifs were identified in 30,220 unigene sequences. SSRs were found at a rate of 12.25 percent, with an average of one SSR locus per 10 kb. Among different repeat motifs, tri-nucleotide repeat (66.67) was the most abundant one, followed by di- (27.39P), and tetra- (3.83P) repeats. CDS contained fewer motifs with the majority of tri-nucleotides, while 3′ and 5′ UTR carry more motifs but have shorter repeats. Functional annotation of unigenes containing microsatellites, revealed that most of them were linked to metabolism, gene expression regulation, and response to environmental stresses. Fifty primers were randomly chosen and validated in five little millet and 20 minor millet genotypes; 48% showed polymorphism, with a high transferability (70%) rate. Identified microsatellites can be a noteworthy resource for future research into QTL-based breeding, genetic resource conservation, MAS selection, and evolutionary genetics.

Transcriptome sequencing assisted discovery and computational analysis of novel SNPs associated with flowering in Raphanus sativus in-bred lines for marker-assisted backcross breeding

Article Open access 01 November 2019

Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia

Article Open access 22 April 2021

Microsatellite analysis and polymorphic marker development based on the full-length transcriptome of Camellia chekiangoleosa

Article Open access 07 November 2022

Introduction

Minor millets belong to the Poaceae family, and are small-grained cereal crops known for their climate-resilient cultivation¹. Small millets or “Minor Millets” are raised in the semi-arid tropical realms of Africa and Asia². They include foxtail millet (Setaria italica), proso millet (Panicum miliaceum), barnyard millet (Echinocloa frumentacea), little millet (Panicum sumatrense), kodo millet (Paspalum scrobiculatum), and finger millet (Eleucine coracana). They are one of the most important crops with many nutritional benefits. They have a good amino acid profile, are rich in micronutrients e.g., calcium, zinc, iron, and iodine, and are gluten-free. They have excellent health benefits for patients who have atherosclerosis, diabetes, heart attack, blood pressure, migraine, and asthma. Its high fiber content prevents gallstone formation³. Minor millets are usually neglected in the society. Thus, very little information is available about their genome, transcriptome, and other essential aspects, which is necessary for initiating new research programs for food security globally⁴. Few genetic resources developed in finger millet, foxtail millet, proso millet, and Japanese barnyard millet⁵. Progress towards the desired goals necessitates modern breeding approaches based on DNA-based molecular markers⁶, particularly co-dominant marker systems⁷. Recent, crop improvement programs widely done using molecular breeding or marker-assisted breeding approach that utilizes molecular marker like EST-SSR or Genic SSR and genome-wide SSR marker with the use of publicly available genomic and transcriptomic resources available⁸.

Among the molecular markers, SSRs are the most commonly used molecular markers; they are also known as microsatellites and are widely distributed throughout the genome, and are extremely polymorphic, chromosome-specific, and frequently inherited in a Mendelian co-dominant fashion⁹. In contrast to genomic SSR markers, Genic SSRs or Expressed sequence tags microsatellites (eSSRs) are acquired through expression sequence tags generated by converting gene transcripts into cDNA¹⁰. As they originate from coding regions, eSSRs markers are preferred for studying plant species¹¹. Genic SSRs are found within gene sequences and are conserved; hence they can be applied to define alleles associated with important agronomical traits^12,13.

Furthermore, because of their high polymorphism, multi-allelic nature, reproducibility, and transferability, eSSRs are widely used as a robust molecular marker in marker-assisted breeding for crop improvement^14,15. Traditionally, SSR development is labor-intensive, like cloning DNA and building a library, and produces significantly fewer SSRs than next-generation sequencing (NGS) technology¹⁶. The added value of NGS technologies, particularly transcriptome sequencing, is that they can provide a plethora of high-quality and cost-effective sequences in a short period¹⁷. RNA sequencing (RNA-seq) helps us achieve details on functional genes that can be applied to detect eSSR markers in a reliable and high-throughput manner^10,18. Applying RNA-seq techniques, eSSRs have been identified in several plant species, including Mulberry¹⁹, bean²⁰, avocado¹⁸, turmeric²¹, apple²², barley¹⁵, cotton²³, wheat²⁴, Coriander²⁵, peanut¹³, and Pistacia²⁶. In this research, we used transcriptome sequencing to mine microsatellite loci from expressing sequences of little millet. The goals of this research are (1) to identify the frequency, distribution, and the role of genic microsatellites in the transcriptome of little millet; (2) establish polymorphic microsatellite markers and validate their polymorphism degree; and (3) assess cross transferability between/among other millet spices. The current study's flowchart is showed in Fig. 1.

Materials and methods

Plant materials

Little millet seed material was collected from the Indian Institute of Millets Research, Rajendranagar (Hyderabad, Telangana, India) and planted in the Department of Biotechnology, Junagadh Agricultural University's controlled climatic conditions of 70% relative humidity and 20/15 °C (14 h/10 h) day/night temperature regime. Leaves from the vegetative, flowering, and maturity stages were collected for RNA sequencing. All specimens were frozen in liquid nitrogen, immediately, and kept at − 80 °C until used. Young leaves from 25 genotypes (Table S1) were also collected for DNA extraction, which then used in the EST-SSR marker analysis. The collection of seeds and the complete experiment was carried out according to the national guidelines²⁷.

RNA extraction and cDNA library preparation

The RNeasy® Plant mini kit (Qiagen, Valencia, CA) was utilized to isolate total RNA from leaf samples obtained at the vegetative, flowering, and maturity stages, according to the manufacturer's instructions. The consistency and concentration of RNA were determined using a 1.2 percent agarose gel electrophoresis and a Qubit® 2.0 Fluorometer (Thermo Scientific, USA). The RNA Nano 6000 Assay Kit was also used to determine the integrity of the RNA. mRNA isolation from total RNA was carried out using oligo-dT-attached magnetic beads (Dynabeads® mRNA DIRECT™ Micro Kit), and Ion total RNA seq-kit v2 was used to create cDNA libraries (Thermo Fisher Scientific). For cDNA library preparation, purified mRNA randomly fragmented into short fragments (~ 200 bp) by RNaseIII enzyme then subjected to Ion adapter v2 hybridization, ligation and cDNA was synthesized and amplified as per manufacturer’s guidelines. After amplification, cDNA libraries were selected for target fragments of ~ 200 bp on 2% Agarose EGelTM (Thermo Fisher Scientific) and stored at − 20 °C until used^10,28. The selected ~ 200 bp cDNA library was subjected to emulsion PCR (Ion OneTouch™ 2 System, Thermofisher Scientific, USA), followed by enrichment of template positive bead recovery. Sequencing was conducted in an Ion S5 Machine (S5TM, Life Technologies). During the library preparation, each sample was assigned a unique molecular barcode.

Reads processing and assembly

PRINSEQ (0.20.4) was utilized for omitting low-quality sequences after FastQC (v0.11.9); the quality of the reads (if the reads had more than 10% nucleotide bases with Q value 25) were checked. Trinity (v2.13.2) reconstructed and assembled the high-quality reads into unigenes, using the default parameter²⁵. To benchmark completeness of individual and combined P. sumatrense transcriptome assembly, the Benchmarking Universal Single-Copy Orthologs (BUSCO) version v5.2.2 was used with the default E-value cut-off of 1e−03 against the ortholog set of Embryophyta_odb9 lineage (creation date: 2020-09-10, number of species: 70, and number of BUSCOs: 255) from OrthoDB v9²⁹. Further, TransDecoder (v. 5.5.0) was used for estimating the coding sequence present in Master assembly with the default parameters³⁰. The single best open reading frame (ORF) per transcript, longer than 200 peptides were selected. Further CD-HIT-EST program (v. 4.8.1) was used to reduce redundancy in transcript sequence and generate Unigene³¹. Then standalone blast with a command line for Basic Local Alignment Search Tool (BLAST) against the Uniprot database for functional annotation was used³².

SSR marker identification and primer design

The MIcroSAtellite (MISA) Identification Tool Perl script (http://pgrc.ipk-gatersleben.de/misa/) was used to find simple sequence repeat regions. The assembled transcriptome was screened for di-, tri-, tetra, Penta-, and hexanucleotide repeat motifs. Primer 3.0 (http://fokker.wi.mit.edu) was used to design primers from sequences containing SSRs. In addition, the following parameters were considered for primer designing: primer length 18–28 bp optimum length 20 bp; Tm-55–63 °C, with an optimum of 60 °C; GC content 40–60%, optimum value 50%; maximum Tm difference between forward and reverse primer 1.5 °C and product size range 100–300 bp, 150 bp being the ideal size^23,33.

Assigning functions to unigenes containing microsatellites

Functional annotation of SSR containing sequence carried out using standalone blast with NCBI non-redundant Nr database (e-value cut-off 10⁻⁶), Swiss-Prot (A manually annotated and reviewed protein sequence database), KOG (Clusters of eukaryotic Orthologous Groups), and Pfam (Protein family, assigned using the HMMER3.0 package)³⁰. Furthermore, the Blast2GO software with a cut-off E value of 1e⁻⁵ was used to assign GO (Gene Ontology) annotation on all unigene containing microsatellite motifs. Also, KEGG (KAAS, http://www.genome.jp/kegg) was used to perform metabolic pathways analyses (E-value 10⁻¹⁰).

DNA extraction, PCR, and eSSR validation

A total of 50 randomly selected sets of primers were selected and synthesized (Merck, India). Genomic DNA isolation was illustrated using DNeasy plant Mini Kit (Qiagen, Valencia, CA) from young leaves samples of five little millet, five proso millet, five Kodo millet, five barnyard millet, and five finger millet genotypes (Table S1). The polymerase chain reaction was carried out using the following PCR components: contained 1μL template DNA (50 ng), 1.0 μL forward primers, 1.0 μL reverse primers 2 μL 10 × Taq buffer + MgCl2 (15 mM), 2 μL dNTP (2 mM), 0.4 μL Taq polymerase (Promega 5U μL⁻¹) and 2.6 μL sterile distilled water. PCR reaction was performed in Veriti thermal cycler (Applied Biosystems, USA) with the following condition: Initial denaturation 94 °C for 3 min, denaturation 94 °C for 30 s, annealing 30 cycles 58–59 °C for 30 s and final extension at 72 °C for 7 min. Each primer's PCR product was visualized at 5 V/cm using 3 percent MetaPhor Agarose Gel Electrophoresis (MAGE) (Biowittaker, USA) in TBE buffer^34,35. The gel was stained with EtBr (0.5 mg/ml) after the run³⁶, and Agarose gel scanning was carried out in a Gel Documentation system (Syngene, India). A 50 bp Plus DNA Ruler (Thermofisher, USA) was used as a molecular size standard.

Polymorphism estimation and cluster analysis

The 24 polymorphic eSSR products were scored manually as ‘1’ (presence) and ‘0’ (absence). The PIC was calculated using the score from the polymorphic loci and the formula PIC = ∑1 − gpi (where pi is the frequency of ith allele for each locus)³⁷. The Popgene software version 1.31 was used to compute the observed heterozygosity (Ho) and expected heterozygosity (He). A matrix for genetic similarity was generated using the NTSYSpc 2.1 software³⁸. UPGMA algorithm in NTSYS pc software was used to depict similarity coefficients and genetic similarity among millet genotypes. Dendrogram and All the other graphs were prepared using R software version R-4.1.1³⁹ by ggplot277 R-packages^40,41.

Results and discussion

De novo assembly and characterization of unigene

In this study 11.4, 7.6, and 10.5 million reads with average read length 158 bp, 165 bp, and 150 bp were generated in leaf transcriptome at the vegetative stage, flowering, and maturity stage, respectively. Reads with Q25 bases (i.e., reads with a base quality ≥ 25) were selected as high-quality reads, through additional analysis (Table 1). Finally, using Trinity, a total of 29,629,186 (81.36 P) reads were assembled into 47,358 unigenes. All unigenes were classified based on the length of the sequences, and the average length of the unigenes was estimated to be 521.72 bp. The majority of the reads (91.7%) were in the 200–1000 bp range. There were 25,743 unigenes less than 1000 bp in length and 5,038 with more than 1000 bp (16.36P) (Fig. S1). The completeness of transcripts was determined by comparing them to universal single-copy orthologs (BUSCO). The percentage of detected BUSCOs is represented on the x-axis (Fig. 2). The powder blue diamond illustrates the complete (C) and single-copy (S) genes; the sunglow stands in for complete and duplicated (D) genes; the tacao diamond represents fragmented (F) genes; the coral diamond act for the missing (M) genes. In the vegetative, flowering, and maturity stages, the total number of core genes queried were 45,852, 47,851, and 48,370, respectively. BUSCO evaluated the completeness of transcripts. BUSCO assessment of the quality of transcriptome assemblies revealed that individual transcriptome assemblies have low completeness (17.3 to 22%), higher fragmentation (35.3% to 43.9%) and higher missing sequences (23.5 to 33%) compared with the combined assembly. The combined assembly showed high percentage of gene representation (40%), high completeness (39.6%), low percentage of fragmented (36%) and low missing sequences (23.9%). The single-copy BUSCOs accounted for 22 percent, 18 percent, and 17.3 percent of the vegetative, flowering, and maturity stages, respectively. However, the single-copy BUSCOs in the combined assemble was 39.6% with only 0.4% duplicates. Of the total 255, BUSCO groups searched, only 35.3%, 35.7%, 43.9% fragmented BUSCOs, and 23.5%, 33.0%, and 30.2% missing BUSCOs were found in our database, respectively in the vegetative stage, flowering stage, and maturity stage (Fig. 2). All of these findings indicated that our database was complete and ready for further investigation.

Table 1 Statistics of RNA sequencing data generated with Ion Torrent S5.

Full size table

EST-SSR discovery, frequencies, and distribution

MISA software was used to identify potential microsatellites from all 47,358 generated unigenes. A total of 4443 EST-SSRs were discovered (Table S2), and 3593 primers were created (Table S3). The distribution of SSRs in the little millet genome was found to be one SSRs per 10 Kb, comparable to the distribution of SSRs in other cereal genomes⁴². Among the 4,443 EST- SSRs identified in little millet, the tri-nucleotide motif (66.67%) was the most common, followed by di- (27.39%), tetra- (3.83%), Penta- (1.37%), and hexanucleotide motifs (0.71%). This trend demonstrated that the frequency of repeats decreased as motif length increased (See Fig. 3a). The number of microsatellite motif tandem repeats varied from 4 to 98. The most popular microsatellite had eight tandem repeats (1,245, 28.02%), followed by six tandem repeats (890, 20.03%), four tandem repeats (537, 12.08%), and five tandem repeats (465, 10.46%). Microsatellite motifs with more than 24 tandem repeats accounted for just 1.9 percent of the total (Fig. 3b). AG/CT was found to be the amplest type of di-nucleotide repeat, and it was observed to be 59.70% of the repeats, followed by AC/GT (24.50%) and AT/AT (11.10%). Among the tri-nucleotide repeats, the most frequent motifs were AGC/GCT (27.84%) and CCG/CGG (17.65%). The most common tetra-nucleotide repeat motif was AGAT/ATCT (10.52%), whereas the most prevalent Penta nucleotide repeat motif was AGAGG/CCTCT (21.31%). AGGATG/ATCCTC and ACACAG/CTGTGT (15.62%) had the most hexanucleotide motifs (Fig. 3c).

Microsatellite distribution through genic regions

The distribution of microsatellite loci in the transcriptome of little millet was studied. Of the 4,443 genic SSR, 751 and 3442 were located in coding sequence regions (CDS) and untranslated regions (UTRs), respectively (Fig. 4a). The remaining 251 microsatellites were derived from the sample due to a lack of information about their distribution. Microsatellites from different genic regions (CDS, 5′-UTRs, and 3′-UTRs) represented distinct distribution patterns (χ² = 2867.1, P 2.2e−16). The CDS had fewer microsatellites and was dominated by tri-nucleotides (542, 72.17 percent), while the UTRs had a greater density of mono- (1442, 41.89 percent) and di-nucleotide (1585, 46.07 percent) microsatellites. Shorter motif forms (mono-, di-, and tri-nucleotide microsatellites) were found to be more prevalent in the transcriptome (Fig. 4a). Furthermore, there were substantial variations in microsatellite length between three regions (CDS, 5′-UTR, and 3′-UTR), the average length of microsatellites in CDS regions (18.24 bp) was slightly greater than that of UTRs (17.45 bp). (Fig. S2). In these regions, the 5′UTRs had the highest GC content (57.86%) followed by CDSs (52.94%) and 3′UTRs (44.23%). The AT- and GC-content of mono- to hexanucleotide SSRs of these expressed areas were computed, and the findings are shown in Fig. 4b, and Table S4.

Functional annotation of microsatellites-containing unigenes

To functionally annotate the assembled non-redundant unigenes, a sequence similarity search against the 'nr' database and the Swiss-Prot database was performed using the BLASTx algorithm with cut-off E values of (1e−5) and 1e−10, respectively. According to our findings, 33,160 (69.4%) and 20,686 (43.29%) unigenes were homologous with sequences in the 'nr' and Swiss-Prot databases, respectively. Our result revealed that when subjected to BLAST against 'nr’ database, the majority of the unigenes i.e., more than 78.25 percent of unigenes with lengths greater than 500 bp, matched, whereas 21.75 percent with lengths less than 300 bp matched (Fig. S3).

GO and KEGG enrichment analysis of microsatellites-containing unigenes

The Blast2GO was used to classify the predicted functions of the assembled unigenes. Figure 5a depicts an overview of the unigene classification in each GO slim expression. GO analysis revealed that 22,717 unigenes (47.96 percent) could be classified into 60 Go terms. Among these unigenes, 8199 (36 P) were classified as Biological Processes, with the most enriched terms being organic substance metabolic process (1,543), cellular metabolic process (1457), and primary metabolic process (1446). The second annotated category was Molecular Function, with organic cyclic compound binding (1331), heterocyclic compound binding (1331), and ion binding (1235) being the most represented GO groups. In the cellular component organization, intracellular anatomical structure (1053), membrane (1009), and organelle (888) were the most represented terms (Fig. 5a and Table S5).

To evaluate the comprehensiveness of our transcriptome library, and thus the efficacy of the annotation protocol, BLASTx was used to align the Unigenes with KEGG, the E-value was kept below 1e−5, and the associated pathways were identified. While 30,400 (60.46%) of the 47,358 unigenes were annotated, only 5,644 (18.56%) were assigned to KEGG pathways. Thiamine and Purine metabolism were the most enriched pathways of the five major KEGG pathways, which included Thiamine metabolism, Purine metabolism (399), Starch and sucrose metabolism (387), Riboflavin metabolism (191), Folate biosynthesis (172), and Pyrimidine metabolism (159). In addition to starch and thiamine metabolism, a plethora of unigenes were involved in the synthesis of other essential vitamins and amino acids, photosynthesis, and secondary metabolite biosynthesis (Fig. 5b and Table S6).

SSR marker polymorphism assay

The 50 genic SSR primer pairs were randomly chosen from the total number of designed EST-SSR. In total, 39 were amplified and produced expected band size in all five tested minor millets’ species, 24 microsatellite loci showed allelic polymorphism. They were used to analyze the genetic diversity of 25 minor millet genotypes. The observed heterozygosity (Ho) ranged from 0.12 to 0.91, with an average of Ho = 0.49 (Table 2). 217 alleles were obtained from 48 SSR, and the allele rate was 3 to 16 per locus. Most alleles were found at the LtM 38 locus. Other loci with a high number of alleles included LtM 28, LtM 1903, and LtM 103. The average He value obtained was 0.72, which ranged between 0.32 (LtM25) and 0.92 (LtM 3287). The polymorphism percentage obtained for SSR primers ranged from 0 to 100%, with an average value of 91.19 percent per primer. The calculated PIC for each primer ranged from 0.11 to 0.90, with an average of 0.57 for each primer. The SSR primer index (SPI) ranged from 0.5 to 10.2, with an average of 4.21 (Fig. S4 and Table 2). At the species level, barnyard millet showed the largest degree of genetic variation (Na = 5.92, Ne = 3.28, Ho = 0.71, and He = 0.81), while Finger millet had the lowest (Na = 2.67, Ne = 1.44, Ho = 0.44, and He = 0.54) (Table 3).

Table 2 Novel genic SSR genetic diversity values in 25 Millet individuals: allele ranges, observed heterozygosity (Ho), expected heterozygosity (He), and (PIC) values of 24 loci.

Full size table

Table 3 Mean of species genetic parameters SSR loci in each of Millet species.

Full size table

The SSR polymorphism data were further used to perform genetic correlation analysis. The genetic similarity matrix was used to generate a dendrogram and accessions. The SSR data of 25 millet genotypes were subjected to similarity index and cluster analysis using Jaccard's coefficient, and UPGMA was performed using the NTSYSpc-2.02i package (Table S7). The dendrogram created using Jaccard's similarity coefficient, and the UPGMA method revealed the highest (80.5 percent) similarity between BAR-1406 and BAR-1407 and the lowest (18.3 percent) similarity between LIT-171 and FIN-1831. The tree plot was created with three main clusters, with an average similarity of 59% (Fig. 6). Cluster I consisted of 15 genotypes of all little millet and Kodo millet genotypes, and it consisted of three sub-clusters, the first sub-cluster of which has two branches, first one includes all little millet genotypes, the second sub-cluster is branched into two, the first one includes two genotypes of barnyard millet, and the second branch includes the remaining genotype, third sub-cluster include Proso Millet genotypes in two branches. The second cluster had two sub-cluster D and E with near about 41% likeness and included Kodo millet genotypes divided between four branches. The last cluster was further subdivided into three sub-cluster consisted of Finger Millet genotypes; among different genotypes, FIN-18301 and FIN-18322 are closely related with 78% similarity, while FIN-1828 has the greatest genetic distance from other genotypes.

The genes and enzymes involved in the biosynthesis of thiamin

Thiamine (vitamin B1) plays a crucial molecular role for all living organisms. Thiamine diphosphate (ThDP), also known as vitamin B1, serves as an enzymatic cofactor in glucose metabolism, the Krebs cycle, and the biosynthesis branched-chain amino acids in all living organisms. Humans, unlike plants and microorganisms, cannot synthesize ThDP from scratch and must obtain it from their diet. It is shown that severe thiamine deficiency would likely cause the lethal disease beriberi⁴³. Therefore, thiamine is a vital micronutrient for humans. Plants are the primary dietary source of thiamine. Little millet is rich in thiamine (0.2–0.48 mg/100 g)⁴⁴, riboflavin (0.12 mg/100 g)⁴⁵, potassium, zinc, magnesium, and manganese minerals, which are essential for healthy human diet^46,47. According to KEGG analysis results the most enriched pathway was thiamine metabolism; hence, all unigenes were studied and 12 unigenes involved in thiamine metabolism identified (Table 4). Among them, the expression level of TDPK1, TH1, THI1, THIC, TPK1, TPK2, PALE1 and TPK3 were higher in flowering and vegetative than the maturity phase, while transcript level of TDPC2, TENAC, THI42 and TPS1 was higher in maturity in comparison with other samples (Table 4).

Table 4 Transcript abundance for candidate genes involved in terpenoids biosynthesis.

Full size table

Discussion

Transcriptome sequencing, assembly, and genic SSR characterization

Minor millets, in particular, little millet, is a highly underappreciated species in terms of popularity and research interest, despite being a traditional food and fodder crop with a high nutrient profile. Millet molecular breeding is hampered by a lack of literature on molecular breeding and the inaccessibility of co-dominant markers. The identification and use of eSSR markers to screen accessions, varieties, germplasm, or cultivars will speed up the breeding process⁴⁸. Transcriptome sequencing is a sophisticated and efficient method to detect new genes, identifying expression patterns, and developing molecular markers⁴⁹. In this research, the Ion torrent S5 technology platform was utilized, and a total of 30 million reads (about 15 GB) were generated from three different samples. For sequence annotation, BLASTx against the NCBI database's ‘nr' protein revealed that 57.30% (27,340) of the 47,358 unigenes in our dataset had at least one significant homologous gene in other species, and approximately 42.7 percent of unigenes could not be functionally annotated either because they performed similar properties as protein with an uncharacterized function or there were no BLAST matches. In most cases, the length of the query sequence influences the ability to discover essential similarities between the sequences. Several previous studies found that the likelihood of BLAST matches in protein databases could be raised while increasing the unigenes length⁵⁰. This was also perceived in the current research, where 80 percent of the unigenes with lengths greater than 500 bp had homologs, whereas only 20 percent of the unigenes with lengths less than 300 bp were detected be similar to other homologs. Furthermore, there is limited data on the transcriptome, genome, and genes of little millet; in consequence, numerous little millet genes are not publicly available.

The scarcity of SSR markers limits basic and applied genomics research in little millet. Transcriptome sequencing generates a massive amount of sequence data, which can be used to create a large number of SSRs. Markers derived from transcriptomic sequences are far more helpful than markers derived from genomic sequence data for gene-based interpretation and detecting functional variation⁵¹. A total of, 9764 potential EST-SSR markers were identified from 47,358 unigene sequences. The genic microsatellites frequency was 11.25 percent, and the distribution density was one SSR per 10 kb, which was remarkably higher than sesame (8.9%), barley (2.8%), Asian lotus (8%), maize (7%), but lower than cotton (12%)^{23,52,53,54,55}. Some variability in the data mining method, search parameters, and scale of the unigene assembly dataset maybe addressed by variations in SSR abundance⁵⁶. Here we discovered six distinct repeat motifs. Tri-nucleotide repeats (66.67percent) were the most prevalent type, followed by di-nucleotide repeats (27.39 percent) and tetra- nucleotide (3.83 percent). The AGC/GCT motif (825, 30.99 percent) was the most common of the tri-nucleotide repeats, followed by the CCG/CGG motif (523, 17.52 percent). AG/CT (726, 20.44 percent) was the most dominant motif among the di-nucleotide repeats, which was similar with study in Triticum aestivum L.⁵⁷, and Raphanus sativus L.⁵⁸. In comparison, the CG/CG motif had the lowest frequency.

Genetic relationship, polymorphism, and transferability of EST-SSR markers

The majority of reports on genetic diversity analyses in little millet germplasm have used RAPD⁵⁹, SNP and genomic SSR markers⁶⁰. Tiwari, et al.⁵⁹ using 60 RAPD investigated the diversity of 36 little millet (Panicum sumatrense) genotypes, although these researchers observed high polymorphisms among the markers used, but as RAPD technique is notoriously laboratory dependent their results are hardly reproducible in other laboratory or in other genotypes. And there are only a few reports of EST-SSR application in little millet. Ali et al.⁶¹ using 22,961 EST sequences of switchgrass (Pancium virgatum) developed 48 species transferable EST-SSR markers. Das et al.⁶² also applied salinity response transcriptome of P. sumatrense to develop 37,100 genic SSR markers, that might be associated to salinity and drought stress tolerance. Genic SSRs are advantageous and frequently preferred for coding regions of the genome. Furthermore, the degree of transferability of this type in related species can provide a high level of acceptability⁶³. In this research using transcriptome sequencing of samples of three developmental stages, species transferable genic SSR marker sets were mined. To verify generated SSR markers, 50 primer pairs were randomly selected and validated, 39 of which were positively amplified in 25 millet genotypes (Table S1). The lack of amplicons from other primer pairs is most likely due to primer positioning across large introns, splice sites, or poor-quality sequences⁶⁴. Because of more significant sequence conservation in the transcribed regions, the rate of polymorphism of genic SSRs is typically lower than that of genomic SSRs⁶⁵. However; this was not the case in this research, because the genic SSRs discovered were highly polymorphic. The average number of alleles recorded in this research using genic SSR markers was 8.68 per locus, with values ranging from 3 to 16 alleles. Among the 39 primer pairs tested in our study, 24 (61 percent) exhibited polymorphism. The determined value was greater than the polymorphism recorded by Senthilvel et al.⁶⁶ in the examined pearl millet varieties. Polymorphism variation is related to the geographic origins of specimens, the number of samples, the genome sequence of the samples, and the primers used⁶⁷. In addition, the average PIC value was higher (0.57) as the result of the greater depth of sequencing coverage achieved in this research. One of the critical factors influencing polymorphism is the length of the microsatellite. SSR length may be classified as short (12 bp), medium (12–20 bp), or high (> 20 bp)⁶⁸. In this study, medium SSR lengths (2,055, 46.24 percent) had the greatest proportion. Thus, the microsatellites created from little millet transcriptome probably show a moderate degree of polymorphism. Six microsatellite motifs show high variations in length (Kruskal–Wallis rank-sum test, P 1.8 e−16), and the frequency of each decreased with increasing motif size (Nemenyi test, P 2.2e−16). Using five populations of minor millets, we confirmed the polymorphism of the 48 microsatellite markers. A total of 217 alleles were found. At the species level, Barnyard millet and Finger millet showed the highest and the lowest genetic diversity, respectively. Cluster analysis divided genotypes into three main clusters, with an average similarity of 59 percent. Cluster-I contains two species, little millet, and Kodo millet, with 62 percent likeness. Cluster-II has Kodo millet genotypes and consists of four branches with a 58 percent likeness. The third cluster, which is divided into three sub-clusters, contains Finger Millet genotypes. Genetic variation in species is affected by internal and external (historical or evolution causes) factors, little millet-as an ancient plant, has collected a large number of genetic variants during the long-term evolution, and it is supposed to retain a high degree of genetic diversity. In this study at the population level, all five minor millet species showed low genetic diversity, which could be a result of a small number of members in each population and distribution properties. Kodo millet showed the largest genetic diversity among all studied spices. While Barnyard millet and Proso Millet revealed limited genetic diversity, which could be the reason of gene flow between the two species during evolution. Generally, Microsatellite markers derived from transcriptome data/ESTs have a higher degree of transferability in related species⁶⁹. In the current study, 39 microsatellite markers were successfully amplified in minor millets, with 24 of them being polymorphic in all minor millets’ genotypes. The transferability ratio was 78%, which was higher than the transferability ratios recorded in Pistacia²⁶ but less than Moon Seed⁷⁰ and Arrowhead⁷¹. Genic SSRs generated in this research have strong potential for cross-species amplification and could be utilized in future genetic studies of other cereals species.

Potential function of unigenes containing microsatellites

Microsatellites derived from transcribed sequences can be closely associated with gene function and thereby may have important influences on gene products, causing phenotypic changes, and controlling gene expression. To expose the possible roles of these unigenes, functional annotations and classification of microsatellites-containing unigene sequences are performed. According to GO functional annotations, a significant amount of unigenes, including microsatellites were assigned to terms such as "regulation of cellular process, cellular metabolic process, biosynthetic process, cell communication catalytic action" and response to stress, implying that they could be associated with little millet’s basic metabolism, growth and developmental activities. Likewise, KEGG analyses revealed that SSRs, including unigenes, served various biological roles and were involved in different vital features of little millets. Notably, thiamine metabolism and purine metabolism were the most enriched pathway in KEGG analysis. Looking deeper, we identified twelve genes involved in the thiamine biosynthetic pathway. Compared to the maturity stage, expression levels of genes involved in the vitamin B1 pathway were higher during the vegetative and flowering phases. These results are in agreement with the report of Colinas and Fitzpatrick, 2015, where the expression of genes involved in thiamine biosynthesis was higher in photosynthesizing tissues⁷². Most of the genes involved in thiamine biosynthesis are not fully known; hence to improve human health and nutrition, it is vital to identify the genes and enzymes involved in thiamine biosynthesis⁷³.

Conclusions

Using transcriptome data, a total of 4443 new SSR markers were developed, and the frequency, distribution, and function of these genic microsatellites were characterized. The unigenes containing microsatellites performed a spread spectrum of biological roles, the majority of which were related to thiamine metabolism, purine metabolism, other metabolic process, and signaling pathways. These findings shed light on the function of microsatellites in the transcriptome. Furthermore, the polymorphism and transferability of 24 eSSR markers were examined in 25 minor millet genotypes. The results of this research can present an excellent resource for germplasm identification, genetic relationship studies, linkage maps, MAS reproduction, and diversity analysis in little millet and other related species in future genetic and genomic studies, as well as play a substantial role to update the millet genic SSR markers database.

Data availability

Transcriptome data generated in this study were submitted to NCBI with the SRA ID SRR14509267, SRX10855102, SRR14509268, SRX1085510, SRR14509269, SRX10855100, and SAMN19114419.

References

Bandyopadhyay, T., Muthamilarasan, M. & Prasad, M. Millets for next generation climate-smart agriculture. Front. Plant Sci. 8, 1266 (2017).
Article PubMed PubMed Central Google Scholar
Vetriventhan, M. et al. Genetic and genomic resources, and breeding for accelerating improvement of small millets: Current status and future interventions. Nucleus 63, 1–23 (2020).
Google Scholar
Jones, J. Grain-based foods and health. Cereal Foods World 51, 108 (2006).
ADS Google Scholar
Lata, C., Gupta, S. & Prasad, M. Foxtail millet: A model crop for genetic and genomic studies in bioenergy grasses. Crit. Rev. Biotechnol. 33, 328–343 (2013).
Article PubMed Google Scholar
Upadhyaya, H. D., Vetriventhan, M., Dwivedi, S. L., Pattanashetti, S. K. & Singh, S. K. Genetic and Genomic Resources for Grain Cereals Improvement 321–343 (Elsevier, 2016).
Book Google Scholar
Hamid, R., Siahpoosh, M., Mamaghani, R. & Siahpoosh, A. Evaluation the genetic diversity of 10 milk thistle (Silybum marianum L.) ecotypes using morphological, phenological and phytochemical traits (2014).
Zarei, A., Zamani, Z. & Sarkhosh, A. Biodiversity, germplasm resources and breeding methods. In The Pomegranate: Botany, Production and Uses 94 (2020).
Sandhu, N. et al. Marker assisted breeding to develop multiple stress tolerant varieties for flood and drought prone areas. Rice 12, 1–16 (2019).
Article Google Scholar
Boopathi, N. M. Genetic Mapping and Marker Assisted Selection 107–178 (Springer, 2020).
Book Google Scholar
Rathod, V. et al. Peanut (Arachis hypogaea) transcriptome revealed the molecular interactions of the defense mechanism in response to early leaf spot fungi (Cercospora arachidicola). Plant Gene 23, 100243 (2020).
Article CAS Google Scholar
Biswas, M. K. et al. Transcriptome wide SSR discovery cross-taxa transferability and development of marker database for studying genetic diversity population structure of Lilium species. Sci. Rep. 10, 1–13 (2020).
Article CAS Google Scholar
Nadeem, M. A. et al. DNA molecular markers in plant breeding: Current status and recent advancements in genomic selection and genome editing. Biotechnol. Biotechnol. Equip. 32, 261–285 (2018).
Article CAS Google Scholar
Rathod, V. et al. Comparative RNA-Seq profiling of a resistant and susceptible peanut (Arachis hypogaea) genotypes in response to leaf rust infection caused by Puccinia arachidis. 3 Biotech 10, 1–15 (2020).
Article Google Scholar
Cho, Y. G. et al. Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza sativa L.). Theor. Appl. Genet. 100, 713–722 (2000).
Article CAS Google Scholar
Zhang, M., Mao, W., Zhang, G. & Wu, F. Development and characterization of polymorphic EST-SSR and genomic SSR markers for Tibetan annual wild barley. PLoS ONE 9, e94881 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Taheri, S. et al. De novo assembly of transcriptomes, mining, and development of novel EST-SSR markers in Curcuma alismatifolia (Zingiberaceae family) through Illumina sequencing. Sci. Rep. 9, 1–14 (2019).
Google Scholar
Hamid, R., Marashi, H., Tomar, R. S., Malekzadeh Shafaroudi, S. & Sabara, P. H. Transcriptome analysis identified aberrant gene expression in pollen developmental pathways leading to CGMS in cotton (Gossypium hirsutum L.). PLoS ONE 14, e0218381 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ge, Y. et al. Transcriptome sequencing of different avocado ecotypes: De novo transcriptome assembly, annotation, identification and validation of EST-SSR markers. Forests 10, 411 (2019).
Article Google Scholar
Mathi Thumilan, B. et al. Development and characterization of genic SSR markers from Indian mulberry transcriptome and their transferability to related species of Moraceae. PLoS ONE 11, e0162909 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, H. et al. Development and validation of EST-SSR markers from the transcriptome of adzuki bean (Vigna angularis). PLoS ONE 10, e0131939 (2015).
Article PubMed PubMed Central CAS Google Scholar
Sabu, K., Shehenaz, M. & Amrutha, J. Transcriptome mining for Est-Indels and development of EST-SSR markers in turmeric (Curcuma longa L.). Int. J. Agric., Environ. Biotechnol. 11, 487–491 (2018).
Google Scholar
Tao, S.-Q., Cao, B., Tian, C.-M. & Liang, Y.-M. Development and characterization of novel genic-SSR markers in apple-Juniper rust pathogen Gymnosporangium yamadae (Pucciniales: Pucciniaceae) using next-generation sequencing. Int. J. Mol. Sci. 19, 1178 (2018).
Article PubMed Central CAS Google Scholar
Hamid, R. et al. Transcriptome profiling and cataloging differential gene expression in floral buds of fertile and sterile lines of cotton (Gossypium hirsutum L.). Gene 660, 80–91 (2018).
Article CAS PubMed Google Scholar
Gupta, P. K. et al. Transferable EST-SSR markers for the study of polymorphism and genetic diversity in bread wheat. Mol. Genet. Genom. 270, 315–323 (2003).
Article CAS Google Scholar
Tulsani, N. J. et al. Transcriptome landscaping for gene mining and SSR marker development in coriander (Coriandrum sativum L.). Genomics 112, 1545–1553 (2020).
Article CAS PubMed Google Scholar
Karcι, H., Paizila, A., Topçu, H., Ilikçioğlu, E. & Kafkas, S. Transcriptome sequencing and development of novel genic SSR markers from Pistacia vera L. Front. Genet. 11, 1021 (2020).
Article PubMed PubMed Central CAS Google Scholar
Pedrini, S. & Dixon, K. W. International principles and standards for native seeds in ecological restoration. Restor. Ecol. 28, S286–S303 (2020).
Google Scholar
Hamid, R., Jacob, F., Marashi, H., Rathod, V. & Tomar, R. S. Uncloaking lncRNA-meditated gene expression as a potential regulator of CMS in cotton (Gossypium hirsutum L.). Genomics 112, 3354–3364 (2020).
Article CAS PubMed Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed CAS Google Scholar
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Article CAS PubMed Google Scholar
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: Accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bosamia, T. C., Mishra, G. P., Thankappan, R. & Dobaria, J. R. Novel and stress relevant EST derived SSR markers developed and validated in peanut. PLoS ONE 10, e0129127 (2015).
Article PubMed PubMed Central CAS Google Scholar
Parekh, M. J. et al. Development and validation of novel fiber relevant dbEST–SSR markers and their utility in revealing genetic diversity in diploid cotton (Gossypium herbaceum and G. arboreum). Ind. Crops Prod. 83, 620–629 (2016).
Article CAS Google Scholar
Kristamtini, K., Taryono, T., Basunanda, P. & Murti, R. H. High resolution microsatellite marker analysis of some rice landraces using metaphor agarose gel electrophoresis. Indones. J. Biotechnol. 20, 54–61 (2016).
Article Google Scholar
Asif, M., Mirza, J. & Zafar, Y. High resolution metaphor agarose gel electrophoresis for genotyping with microsatellite markers. Pak. J. Agric. Sci. 45, 75–79 (2008).
Google Scholar
Sánchez-Pérez, R., Ballester, J., Dicenta, F., Arús, P. & Martínez-Gómez, P. Comparison of SSR polymorphisms using automated capillary sequencers, and polyacrylamide and agarose gel electrophoresis: Implications for the assessment of genetic diversity and relatedness in almond. Sci. Hortic. 108, 310–316 (2006).
Article CAS Google Scholar
Weir, B. S. Genetic Data Analysis Methods for Discrete Population Genetic Data (Sinauer Associates, Inc. Publishers, 1990).
Google Scholar
Rohlf, F. NTSYS-pc. Numerical Taxonomy and Multivariate Analysis: Version 2.02 (Exeter Software, 1998).
Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing (R Core Team, 2013).
Google Scholar
Meyer, S., Held, L. & Höhle, M. hhh4: Endemic-epidemic modeling of areal count time series. J. Stat. Softw. 1, 1–55 (2016).
Google Scholar
Wickham, H. Elegant graphics for data analysis. Media 35, 10–1007 (2009).
MATH Google Scholar
Sonah, H., Deshmukh, R., Sharma, A., Singh, V. & Gupta, D. Genome-wide distribution and organization of microsatellites in plants: An insight. PLoS ONE 6, e21298 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, Y. et al. Benefiting others and self: Production of vitamins in plants. J. Integr. Plant Biol. 63, 210–227 (2021).
Article CAS PubMed Google Scholar
Saleh, A. S. M. et al. Millet grains: nutritional quality, processing, and potential health benefits. Compr. Rev. Food Sci. Food Saf. 12(3), 281–295 (2013).
Article CAS Google Scholar
Devi, P. B., Vijayabharathi, R., Sathyabama, S., Malleshi, N. G. & Priyadarisini, V. B. Health benefits of finger millet (Eleusine coracana L.) polyphenols and dietary fiber: A review. J. Food Sci. Technol. 51, 1021–1040 (2014).
Article CAS PubMed Google Scholar
De, L. Edible seeds and nuts in human diet for immunity development. Int. J. Recent Sci. Res. 6, 38877–38881 (2020).
Google Scholar
Ramashia, S. E., Anyasi, T. A., Gwata, E. T., Meddows-Taylor, S. & Jideani, A. I. O. Processing, nutritional composition and health benefits of finger millet in sub-Saharan Africa. Food Sci. Technol. 39, 253–266 (2019).
Article Google Scholar
Singh, R. K. & Prasad, M. The Foxtail Millet Genome 63–75 (Springer, 2017).
Book Google Scholar
Huang, X. et al. De novo transcriptome analysis and molecular marker development of two Hemarthria species. Front. Plant Sci. 7, 496 (2016).
Article PubMed PubMed Central Google Scholar
Zhao, H. et al. High-throughput sequencing analysis reveals effects of short-term low-temperature storage on miRNA-mediated flavonoid accumulation in postharvest toon buds. Plant Gene 26, 100291 (2021).
Article CAS Google Scholar
Zheng, X. et al. Development of microsatellite markers by transcriptome sequencing in two species of Amorphophallus (Araceae). BMC Genom. 14, 490 (2013).
Article CAS Google Scholar
Wei, W. et al. Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genom. 12, 451 (2011).
Article CAS Google Scholar
Zhang, W. et al. Characterization of flower-bud transcriptome and development of genic SSR markers in Asian lotus (Nelumbo nucifera Gaertn.). PLoS ONE 9, e112223 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Varshney, R. et al. Genetic mapping and BAC assignment of EST-derived SSR markers shows non-uniform distribution of genes in the barley genome. Theor. Appl. Genet. 113, 239 (2006).
Article CAS PubMed Google Scholar
Peng, J. & Lapitan, N. L. Characterization of EST-derived microsatellites in the wheat genome and development of eSSR markers. Funct. Integr. Genom. 5, 80–96 (2005).
Article CAS Google Scholar
Raju, N. L. et al. The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.). BMC Plant Biol. 10, 45 (2010).
Article PubMed PubMed Central CAS Google Scholar
Yang, Z., Peng, Z. & Yang, H. Identification of novel and useful EST-SSR markers from de novo transcriptome sequence of wheat (Triticum aestivum L.). Genet. Mol. Res. 15, 15017509 (2016).
Google Scholar
Zhai, L. et al. Novel and useful genic-SSR markers from de novo transcriptome sequencing of radish (Raphanus sativus L.). Mol. Breed. 33, 611–624 (2014).
Article CAS Google Scholar
Tiwari, N., Tiwari, S. & Tripathi, N. Genetic characterization of Indian little millet (Panicum sumatrense) genotypes using random amplified polymorphic DNA markers. Agric. Nat. Resour. 52, 347–353 (2018).
Google Scholar
Johnson, M., Deshpande, S., Vetriventhan, M., Upadhyaya, H. D. & Wallace, J. G. Genome-wide population structure analyses of three minor millets: Kodo millet, little millet, and proso millet. Plant Genome 12, 190021 (2019).
Article CAS Google Scholar
Ali, A. et al. Development of EST-SSRs and assessment of genetic diversity in little millet (Panicum sumatrense) germplasm. Korean J. Plant Resour. 30, 287–297 (2017).
Google Scholar
Das, R. R., Pradhan, S. & Parida, A. De-novo transcriptome analysis unveils differentially expressed genes regulating drought and salt stress response in Panicum sumatrense. Sci. Rep. 10, 1–14 (2020).
Article CAS Google Scholar
Vendramin, E. et al. A set of EST-SSRs isolated from peach fruit transcriptome and their transportability across Prunus species. Mol. Ecol. Notes 7, 307–310 (2007).
Article CAS Google Scholar
Varshney, R. K., Graner, A. & Sorrells, M. E. Genic microsatellite markers in plants: Features and applications. Trends Biotechnol. 23, 48–55 (2005).
Article CAS PubMed Google Scholar
Vieira, M. L. C., Santini, L., Diniz, A. L. & Munhoz, C. D. F. Microsatellite markers: What they mean and why they are so useful. Genet. Mol. Biol. 39, 312–328 (2016).
Article PubMed PubMed Central Google Scholar
Senthilvel, S. et al. Development and mapping of simple sequence repeat markers for pearl millet from data mining of expressed sequence tags. BMC Plant Biol. 8, 1–9 (2008).
Article CAS Google Scholar
Sonah, H. et al. Genome-wide distribution and organization of microsatellites in plants: An insight into marker development in Brachypodium. PLoS ONE 6, e21298 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Temnykh, S. et al. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): Frequency, length variation, transposon associations, and genetic marker potential. Genom. Res. 11, 1441–1452 (2001).
Article CAS Google Scholar
Xu, R., Wang, Z., Su, Y. & Wang, T. Characterization and development of microsatellite markers in Pseudotaxus chienii (Taxaceae) based on transcriptome sequencing. Front. Genet. 11, 1249 (2020).
Article Google Scholar
Hina, F., Yisilam, G., Wang, S., Li, P. & Fu, C. D. novo transcriptome assembly, gene annotation and SSR marker development in the moon seed genus Menispermum (Menispermaceae). Front. Genet. 11, 380 (2020).
Article PubMed PubMed Central Google Scholar
You, Y. et al. Leaf transcriptome analysis and development of EST-SSR markers in arrowhead (Sagittaria trifolia L. var. Sinensis). Trop. Plant Biol. 13, 1–12 (2020).
Article CAS Google Scholar
Colinas, M. & Fitzpatrick, T. B. Natures balancing act: Examining biosynthesis de novo, recycling and processing damaged vitamin B metabolites. Curr. Opin. Plant Biol. 25, 98–106 (2015).
Article CAS PubMed Google Scholar
Strobbe, S. & Van Der Straeten, D. Toward eradication of B-vitamin deficiencies: Considerations for crop biofortification. Front. Plant Sci. 9, 443 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to express their gratitude to Dr. Mohammad Reza Ghaffari for providing enrichment analysis facilities. Also, we would like to give special thanks to Junagadh Agricultural University for fulfilling laboratory space during the experiment. We are also thankful to the Director, ICAR-Indian Institute of Millets Research (https://www.millets.res.in/), Hyderabad for providing accessions of minor millet used in the research under the Material Transfer Agreement (MTA).

Funding

This research was not supported by any agency.

Author information

These authors contributed equally: Hiral Desai and Rasmieh Hamid.

Authors and Affiliations

Department of Biotechnology, Junagadh Agricultural University, Junagadh, Gujarat, India
Hiral Desai, Nishant Bhut, Shital M. Padhiyar & Jasminkumar Kheni
Department of Plant Breeding, Cotton Research Institutes of Iran (CRII), Agricultural Research, Education and Extension Organization (AREEO), Gorgān, Iran
Rasmieh Hamid
Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran, (ABRII), Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran
Zahra Ghorbanzadeh
Main Oilseeds Research Station, Junagadh Agricultural University, Junagadh, Gujarat, India
Rukam S. Tomar

Authors

Hiral Desai
View author publications
You can also search for this author in PubMed Google Scholar
Rasmieh Hamid
View author publications
You can also search for this author in PubMed Google Scholar
Zahra Ghorbanzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Nishant Bhut
View author publications
You can also search for this author in PubMed Google Scholar
Shital M. Padhiyar
View author publications
You can also search for this author in PubMed Google Scholar
Jasminkumar Kheni
View author publications
You can also search for this author in PubMed Google Scholar
Rukam S. Tomar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.D. samples collection, executed laboratory procedures of the project, and preparing the initial draft of the manuscript; R.H. performed data analysis, data visualization, and diagram preparation, as well as Review & Editing the manuscript, Z.G.H. assisted in improving the manuscript and data analysis as well as diagram preparation, N.B. executed laboratory and fieldwork of the project also writing the initial draft, S.H.M.P. and J.K.H. executed laboratory work of the project and Draft Preparation, R.S.T. Project Administration, guided throughout the experiment, extended laboratory facility and helped in improving the manuscript.

Corresponding author

Correspondence to Rukam S. Tomar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Figure S1.

Supplementary Figure S2.

Supplementary Figure S3.

Supplementary Figure S4.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Desai, H., Hamid, R., Ghorbanzadeh, Z. et al. Genic microsatellite marker characterization and development in little millet (Panicum sumatrense) using transcriptome sequencing. Sci Rep 11, 20620 (2021). https://doi.org/10.1038/s41598-021-00100-4

Download citation

Received: 20 May 2021
Accepted: 29 September 2021
Published: 18 October 2021
DOI: https://doi.org/10.1038/s41598-021-00100-4

This article is cited by

Nutritional and nutraceutical richness of neglected little millet genotypes from Eastern Ghats of India: implications for breeding and food value
- Debabrata Panda
- Pramila Muni
- Prashant K. Parida
Planta (2024)
De novo transcriptome sequencing of drought tolerance–associated genes in little millet (Panicum sumatrense L.)
- Dhawale Ramesh Narayanrao
- R. S. Tomar
- Sunil Tulshiram Hajare
Functional & Integrative Genomics (2023)
Identification of heat tolerant bread wheat (Triticum aestivum L.) genotypes through heat susceptibility index (HSI) and SSR markers
- Nikita S. Patel
- Jagdish B. Patel
- Rukam S. Tomar
Cereal Research Communications (2023)
Novel Microsatellite Markers Derived from Arachis pintoi Transcriptome Sequencing for Cross-Species Transferability and Varietal Identification
- Jônatas Chagas de Oliveira
- André Lucas Domingos da Silva
- Tatiana de Campos
Plant Molecular Biology Reporter (2023)
Unravelling the treasure trove of drought-responsive genes in wild-type peanut through transcriptomics and physiological analyses of root
- Feba Jacob Thoppurathu
- Zahra Ghorbanzadeh
- Meera Joshi
Functional & Integrative Genomics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Plant materials

RNA extraction and cDNA library preparation

Reads processing and assembly

SSR marker identification and primer design

Assigning functions to unigenes containing microsatellites

DNA extraction, PCR, and eSSR validation

Polymorphism estimation and cluster analysis

Results and discussion

De novo assembly and characterization of unigene

EST-SSR discovery, frequencies, and distribution

Microsatellite distribution through genic regions

Functional annotation of microsatellites-containing unigenes

GO and KEGG enrichment analysis of microsatellites-containing unigenes

SSR marker polymorphism assay

The genes and enzymes involved in the biosynthesis of thiamin

Discussion

Transcriptome sequencing, assembly, and genic SSR characterization

Genetic relationship, polymorphism, and transferability of EST-SSR markers

Potential function of unigenes containing microsatellites

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links