RNA-Seq transcriptome profiling of immature grain wheat is a technique for understanding comparative modeling of baking quality

Ahmadi-Ochtapeh, Hossein; Soltanloo, Hassan; Ramezanpour, Seyyede Sanaz; Yamchi, Ahad; Shariati, Vahid

doi:10.1038/s41598-024-61528-y

Download PDF

Article
Open access
Published: 13 May 2024

RNA-Seq transcriptome profiling of immature grain wheat is a technique for understanding comparative modeling of baking quality

Hossein Ahmadi-Ochtapeh¹,
Hassan Soltanloo²,
Seyyede Sanaz Ramezanpour²,
Ahad Yamchi² &
…
Vahid Shariati³

Scientific Reports volume 14, Article number: 10940 (2024) Cite this article

Metrics details

Subjects

Abstract

Improving the baking quality is a primary challenge in the wheat flour production value chain, as baking quality represents a crucial factor in determining its overall value. In the present study, we conducted a comparative RNA-Seq analysis on the high baking quality mutant “O-64.1.10” genotype and its low baking quality wild type "Omid" cultivar to recognize potential genes associated with bread quality. The cDNA libraries were constructed from immature grains that were 15 days post-anthesis, with an average of 16.24 and 18.97 million paired-end short-read sequences in the mutant and wild-type, respectively. A total number of 733 transcripts with differential expression were identified, 585 genes up-regulated and 188 genes down-regulated in the “O-64.1.10” genotype compared to the “Omid”. In addition, the families of HSF, bZIP, C2C2-Dof, B3-ARF, BES1, C3H, GRF, HB-HD-ZIP, PLATZ, MADS-MIKC, GARP-G2-like, NAC, OFP and TUB were appeared as the key transcription factors with specific expression in the “O-64.1.10” genotype. At the same time, pathways related to baking quality were identified through Kyoto Encyclopedia of Genes and Genomes. Collectively, we found that the endoplasmic network, metabolic pathways, secondary metabolite biosynthesis, hormone signaling pathway, B group vitamins, protein pathways, pathways associated with carbohydrate and fat metabolism, as well as the biosynthesis and metabolism of various amino acids, have a great deal of potential to play a significant role in the baking quality. Ultimately, the RNA-seq results were confirmed using quantitative Reverse Transcription PCR for some hub genes such as alpha-gliadin, low molecular weight glutenin subunit and terpene synthase (gibberellin) and as a resource for future study, 127 EST-SSR primers were generated using RNA-seq data.

Unraveling the complexity of faba bean (Vicia faba L.) transcriptome to reveal cold-stress-responsive genes using long-read isoform sequencing technology

Article Open access 26 October 2021

Transcriptome dynamics of Gossypium purpurascens in response to abiotic stresses by Iso-seq and RNA-seq data

Article Open access 09 May 2024

De-novo transcriptome analysis unveils differentially expressed genes regulating drought and salt stress response in Panicum sumatrense

Article Open access 04 December 2020

Introduction

Wheat (Triticum aestivum L., 2n = 6x = 42) is an important food crop in the world and possesses unique flour quality that can be used to make various food products¹. However, current wheat varieties need improvement in processing quality to meet the increasing demand for better quality food products². Processing quality of wheat flour is determined by grain protein concentration and its composition which confer wheat dough with unique rheological properties, making it possible to produce a series of quality foods for human consumption^3,4,5. Baking quality is one of the most important parameters throughout the value chain of wheat flour⁶. Seed is the primary storage organ in plants for storing nutrients such as starch, lipids, and proteins⁷ and the baking quality is largely influenced by these nutrients. The content of gliadins and glutenins⁶, optimal water absorption, rheological parameters⁸ and grain hardness⁹ are correlated to high baking quality. Determination of flour water absorption, as well as the results of the farinographic analysis of the flour play a key role in the assessment of wheat flour baking quality ¹⁰. However, improving baking quality is a challenge for wheat breeders due to the time-consuming and costly testing forcing breeders to postpone sophisticated quality tests to the very last phases of variety development¹¹. The genome of a plant is the most critical factor to control baking quality trait in wheat¹². For example,² processing quality related key genes such as glutenin and gliadins, puroindolines, grain softness protein, alpha and beta amylases, proteases, were identified, and many other candidate genes related to cellular and molecular functions. Therefore, an understanding of the molecular basis of this trait would be a major advantage in wheat breeding¹³. Advances in molecular techniques offer new opportunities to identify the genetic and molecular basis of bread quality in wheat^1,2,14.

RNA-Seq analysis of various wheat genotypes has opened up new avenues for research on grain development and its composition, which ultimately determines the nutritional and functional quality of wheat^15,16. The gene expression during wheat grain development plays a crucial role in determining the yield and nutritional properties of the crop. Identifying genes expressed during the grain filling stage can be useful for improving yield and grain quality^17,18. Several studies have applied transcriptomics approaches to investigate the gene expression during grain development stages in wheat^{2,15,16,17,19,20,21}. The grain development process of wheat can be categorized into three stages, including cell division and expansion (0 ~ 14 DPA (days post-anthesis)), effective grain filling (14 ~ 28 DPA), and maturation and desiccation (28 DPA to maturity)²². Understanding these stages and the associated gene expression can lead to the development of new and improved wheat varieties with superior nutritional and functional properties^17,18. Early grain development, specifically the first 14 days after pollination, is critical for the final yield and quality of wheat. During this time, rapid grain expansion occurs through continuous cell division and endosperm filling with starch and gluten^16,23. At 30 DPA, a diverse range of genes are expressed at low levels with a predominance of genes associated with seed defense and stress tolerance¹³.

Wheat flour quality is highly influenced by protein content and composition, with glutens being the most abundant storage proteins, comprising about 80% of total grain proteins. Glutenins consist of high and low-molecular-weight glutenin subunits (HMW-GS and LMW-GS). Gene expression from these protein families occurs early in grain development^16,23. Zamani et al.²⁴ and Izadi-Darbandi et al.²⁵ reported that mutant, “O-64.1.10” and its parent, “Omid” wheat cultivar have two HMW-GS genes, Dx2 + Dy12 and Bx7 + By8 but the improvement of baking quality in the mutant, “O-64.1.10” was unknown. A negative effect of Dx2 + Dy12 was published previously by several studies^{26,27,28,29,30,31}. The allelic pair Bx7 + By8* (Glu-B1a1) strongly associated with dough strength³². The complex processing quality of wheat is controlled by many genes, which have not been completely explored². For example, phytohormones are the predominant biochemical basis for grain morphogenesis³³ and reported to be important signals in controlling seed development, maturation, and nutrients accumulation³⁴. Gene ontology (GO) enrichment and pathway enrichment analysis showed that many transcription products and transcription factors (TFs) associated with carbohydrate and protein metabolism were abundantly expressed in the grain³⁵. TFs regulate target genes to ensure tightly regulated developmental process¹⁵. bZIP TFs are regulators of important plant processes such as seed storage protein gene regulation³⁶, energy metabolism³⁷, unfolded protein response³⁶, and hormone and sugar signalling³⁸. NAC TF play a significant role in regulating the accumulation of glutenin and starch in wheat endosperm, which are crucial for the grains quality³⁹.

Rahemi et al.⁴⁰ reported that water absorption percentage, valorimeter value, farinograph quality number, zeleny number, hardness, wet gluten, protein content in high baking quality mutant “O-64.1.10” exhibited greater than “Omid”. We compared RNA-Seq data from immature grains of a “O-64.1.10” wheat genotype and its wild type cultivar, "Omid," which have significantly different baking qualities. The purpose of our study was to identify genes that are primarily expressed in wheat grain and are related to baking quality. Through this study, we gained new insights into the molecular mechanisms that underlie seed quality and identified potential candidate genes related to wheat grain quality. Our findings provide a solid foundation for future research aimed at improving bread-making quality.

Material and methods

At all stages, the research complied with relevant institutional, national, and international guidelines and legislation.

Plant materials and sampling

To investigate the transcriptome of immature seeds of Triticum aestivum L, mutant “O-64.1.10” genotype and its low baking quality wild type "Omid" cultivar were obtained from the GenBank of Nuclear Science and Technology Research Institute in Karaj, Iran. We used two genotypes of Iranian bread wheat, namely “O-64.1.10” (with high baking quality) and its parent cultivar “Omid” (with poor baking quality), to conduct a comparative analysis of their baking quality (Table 1). Mutant genotype “O-64.1.10” was produced by the gamma irradiation approach and evaluated for baking quality through rheological and proteomics trials^25,40. The “Omid” and its mutant “O-64–1-10”, were cultivated at the research field of Gorgan University of Agricultural and Natural Resources Gorgan, Iran. The main stem heads were marked with the anthesis date. The grains from the middle ear were collected at 5, 10, 15, 20, and 30 DPA between 9:00 and 10:00 am. For dynamic comparison of developing grains three biological replications of samples were used for the five stages. Also for RNA-seq samples, the seeds of “Omid” and “O-64.1.10” were used at 15 DPA. For any genotypes the seeds of 9 spikes (10 grains per spike) harvested from 9 different plants (three plants per three replications) and pooled. The samples were promptly frozen in liquid nitrogen and stored at − 80 °C for subsequent total RNA extraction.

Table 1 Comparison the baking quality traits in mutant genotype “O-64.1.10” and its wild type “Omid” cultivar¹².

Full size table

RNA isolation and cDNA library construction

We utilized 100 mg seed samples at 15 DPA and p-Biozol Buffer (BioFlux, Japan) to extract total RNA, following the manufacturer's recommendations. Additionally, 1% (w/v) RNase-free agarose gel electrophoresis and a Nanophotometer (Implen, Germany) were used to evaluate the amount and quality of RNA samples. The RNA samples of 15 DPA, passing the quality and quantity control, were sent to Beijing Genomics Institute (BGI), Hong Kong, China for cDNA library construction and sequencing on the Illumina sequencing platform (Illumina HiSeq™ 2500). A comprehensive quantitative evaluation of each RNA samples was performed using a Nanodrop 8000 Spectrophotometer (ThermoScientific, USA) and an Agilent 2100 Bioanalyzer System (Agilent Technologies, USA) to produce information on RNA concentration. The high-quality RNA (OD 260/280 = 2.06–2.08; OD 260/230 = 1.9–2.09; RIN value 7.5) was further processed for cDNA library creation using the Illumina TrueSeq RNA Sample Prep kit. To produce 150 bp paired-end reads, NGS sequencing of cDNA from seed tissue was performed.

Data processing and analysis

The quality of paired-end reads from raw sequencing data was assessed both before and after trimming. The sequencing quality of the raw reads from each sequenced sample was assessed using FASTQC software (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) accessed on January 8th, 2019)) and CLC Genomic Workbench 7.5.1 (CLC Bio-Qiagen, Denmark). Trimmomatic software (version 0.36) was used to remove adapter sequences, low-quality nucleotides/sequences, and reads shorter than 150 bp. The remaining high-quality reads with a Phred score of ≥ 30 was used for downstream analyses. These reads were aligned against the wheat reference genome (version IWGSC RefSeq v1.0, http://plants.ensembl.org/Triticum_aestivum/) using Hisat2 software (version 2.2.1.0)⁴¹. The mapped reads from each sample were assembled using Cufflinks v2.0.2 and htseq. These Cufflink assemblies were merged using Cuffmerge. The outputs were then used for differential expression analysis by Cuffdiff and EdgeR packages⁴². The normalization of gene expression values of two samples was estimated as fragments per kilobase of transcript per million fragments mapped (FKPM) and fold change (FC). Data analysis with an adjusted p-value threshold of 0.001 and Log2 Fold Change (log2FC) of ≥ 2 were assigned as differentially expressed genes (DEGs). GO enrichment analysis for DEGs was performed using goseq and AgriGO version 2.0. The Benjamini and Hochberg approach for controlling the false discovery rate (FDR) was used to adjust P-values. The threshold of 0.01 was set for the FDR value. The GO analysis was carried out to functionally categorize the DEGs from 15 DPA into three major aspects: "cellular component (CC)," "molecular function (MF)," and "biological process (BP)". Pathway enrichment analysis of DEGs was performed using the KEGG (Kyoto Encyclopedia of genes and genomes) database (http://www.genome.ad.jp/kegg/). To identify TFs, transcriptional regulators (TRs), and protein kinases (PKs) encoding genes, The DEGs were screened using the Plant Transcription Factor & Protein Kinase Identifier and Classifier database (iTAK v1.6). Simple Sequence Repeats (SSRs) were identified from RNA-seq data using the Perl script of MISA (MIcroSAtellite identification tool)⁴³ with default parameters⁴⁴.

Validation of RNA-seq data by real-time PCR

The RNA-Seq results were validated through the use of quantitative Reverse Transcription PCR (qRT-PCR). The qRT-PCR was performed on three genes that were chosen from the DEG analysis. The primers were designed based on the 3́-UTR region of the sequence (Table 2) using the Primer3 online software (http://www.embnet.sk/cgi-bin/primer3_www.cgi)⁴⁵.

Table 2 List of primers used in the experiment for quantitative Reverse Transcription PCR amplification.

Full size table

Before first-strand cDNA synthesis, possible genomic DNA contamination was removed by RNase-free DNaseI (Thermo Scientific, USA) at a ratio of 1 U DNaseI to 2 µg RNA, 1X DNaseI buffer, 10 U Ribolock RNase inhibitor (Thermo Scientific, USA), and DEPC water up to 9 µl, followed by incubation at 37 °C for 30 min. To finalize DNaseI activity, 25 mM EDTA buffer was added to the reaction and the RNA was then heat-denatured at 65 °C for 10 min. First-strand cDNA synthesis was carried out using 1 µg of total RNA in the reaction mixture containing 0.5 µg Oligo (dT) primer and DEPC water (nuclease-free) up to 11 µl. To eliminate secondary structures, the liquid was gently mixed, quickly centrifuged, then incubated at 70 °C for 5 min. Then, 4 µl 5X cDNA reaction buffer, 10 mM dNTP mix, and 20 U Ribolock RNase inhibitor were added to the reaction mixture. The final volume was adjusted to 19 µl with DEPC-treated water, and the mixture was incubated for 5 min at 37 °C. 200 U Revert Aid enzyme (Thermo Scientific, USA) was added to the reaction mixture and incubated at 42 °C for 1 h. A qRT-PCR was performed using an iCycler thermal cycler (Bio-Rad, iQ5, USA) with a reaction volume containing 3 µl of diluted cDNA, 10 µl of 2X SYBR Bio Pars PCR Master Mix (Gorgan University of Agricultural Sciences and Natural Resources, Iran), and 1 µl of each gene-specific primer (10 pmol) in a final volume 20 µl with double distilled water. The time course of qRT-PCR were done at the following conditions: 3 min at 95 °C for 1 cycle; 10 s at 95 °C, 10 s at 62 °C and 10 s at 72 °C for 35 cycles, and 2 min at 72 °C for 1 cycle. Upon that, PCR was done on each sample in three technical and biological replications. The fluorescence signal was detected at 72 °C. The REST software was used for the gene expression analysis⁴⁶, and the relative expression was computed utilizing the comparative Ct (2-∆∆^Ct) method⁴⁷ and compared with the expression levels of RNA-Seq. For qRT-PCR normalization, glyceraldehyde 3-phosphate dehydrogenase (GAPDH) was utilized as the reference gene.

Gene ontology analysis

The major biological activities of DEGs were determined using a GO enrichment analysis. DEGs from the 15 DPA stage were annotated and grouped into three major groups based on their functional characteristics: CC, MF, and BP categories.

Result

Grain morphology changes during seed development

The dynamic comparison of developing grains was examined at 5, 10, 15, 20, and 30 DPA stages between the “O-64.1.10” genotype and its “Omid” counterpart (Fig. 1a). The dry weight of the seeds showed a continuous increase from 5 to 20 DPA and reached its highest point at 20 DPA in both the mutant genotype “O-64.1.10” and its parent genotype “Omid” (Fig. 1b). Our results showed that mutant “O-64.1.10” exhibited greater grain weight growth than “Omid” at15, 20 and 30 DPA. Also highest rate of dry weight accumulation per day occurred between 10 and 15 DPA. The 15 DPA stage are beginning of effective grain filling stage in this stage the grain dry weight increases by about two-fold. The 15 DPA stage for the RNA-seq analysis. This finding is in agreement with earlier study conducted by Shewry et al.¹².

Identification of differentially expressed genes

The quality of RNA and quality of high-throughput sequencing data showed in supplementary data (S1 Figure). A total of 18.97 and 16.24 million paired-end short-read sequences with high-quality (Q30 > 99%) obtained by Illumina sequencing of the “Omid” and “O-64.1.10” cDNA libraries, respectively. The GC (guanine-cytosine) content for the clean data was 51% and 52% for the “Omid” and “O-64.1.10”, respectively (Table 3). DEGs were found in 773 genes between “O-64.1.10” and “Omid” (supplemental data 1). 585 genes up-regulated and 188 genes down-regulated in the “O-64.1.10” genotype compared to the “Omid”. Our study showed that most of the DEGs (339 genes, 43.85%) were anchored on B-genome (Fig. 2d, supplemental data 2). In addition, we found that up-regulated genes in “O-64.1.10” were enriched on chromosomes 2A (5.95%) and 3A (6.33%) and chromosomes 2B (6.59%) and 3B (6.2%), whereas down-regulated genes in “O-64.1.10” were overrepresented on chromosome 1B (14.5%) (Fig. 2).

Table 3 Summary of high-throughput sequencing data in “Omid” and mutant at 15 DPA.

Full size table

Functional classification and gene ontology analysis

GO enrichment analysis was carried out to characterize the main biological functions of DEGs in seeds 15 DPA. The DEGs from 15 DPA stage were functionally annotated and classified ontologically into three principal categories: CC, MF, and BP. Furthermore, 62 GO terms were enriched in the DEG from O-64.1.10 and Omid, including 19 CC, 15 MF, and 28 BP. A total of 733 DEGs were assigned to three main GO functional categories and then divided into 21 sub-categories. Metabolic process, cellular process, developmental process, and multicellular organismal process were found to be dominant in the BP category. The MF category was primarily dominated by catalytic activity and binding. Cell, cell part, and organelle were foremost in the cellular CC category (Fig. 3). The “O-64.1.10” genotype had the highest number of genes involved in specific BP, such as cellular protein catabolic process, diterpenoid biosynthetic process, gibberellin metabolic process, cellular nitrogen compound biosynthetic process, cellular amide metabolic process, lipid biosynthetic process and response to abiotic stimulus. Overall, these results suggest that the genes associated with wheat early grain development play crucial roles in encoding diverse regulators and proteins.

At the same time pathway enrichment analysis using KEGG database was performed to reveal the active biological pathways involved in baking quality of immature wheat grains. In this study, 68 and 34 pathways were identified in the “O-64.1.10” and the wild-type, respectively. Among all the pathways, the largest pathways were metabolic pathway, protein processing in endoplasmic reticulum and biosynthesis of secondary. An analysis using the KEGG database on biological pathways in the “O-64.1.10” genotype showed that a total of 187 annotated DEGs were assigned to 68 pathways. Among the 68 pathways, the largest pathway was metabolic pathways, which contained 32 genes, followed by protein processing in endoplasmic reticulum (19 genes), biosynthesis of secondary metabolites (16 genes), spliceosome (9 genes), ribosome (8 genes), plant hormone signal transduction (4 genes), and biosynthesis of amino acids (4 genes). The KEGG analysis in the “O-64.1.10” genotype indicated that 39 DEGs were involved in protein processing in endoplasmic reticulum (19 genes), biosynthesis of different amino acids pathways (14 genes), and different vitamins metabolism pathways (6 genes), which might participate in regulating baking quality (Fig. 6). Furthermore, the carbohydrate metabolism pathways (5 genes) in the wild-type genotype might participate in regulating baking quality. Pathway enrichment analysis of “Omid” genotype displayed the total of 68 annotated DEGs were assigned to 34 pathways. The top three enriched pathways include metabolic pathways (12 genes), biosynthesis of secondary metabolites (9 genes), and ribosome (4 genes) were largest pathways (Fig. 4, Supplemental data 4).

Functional annotations of these DEGs were achieved by using Mercator tool (Mercator V3.6). Results of differential expression showed that out of 773 differentially expressed genes, 109 (13.05%) BINs assigned to protein process. The number of genes were assigned to the functional category “protein” was highest (103), followed by “RNA” (74), “stress” (52) and “misc” (42) (Fig. 5). Lastly, In the “O-64.1.10”, 82 out of 103 DEGs involved in protein processing (Fig. 5), were up-regulated compared to the wild-type (Table 4). Elongation factor 1-alpha (EF1α) play a crucial role in the protein synthesis machinery of cells, and its function extends to various physiological processes in plants, including wheat⁴⁸. Our results indicated that this gene were highly expressed in “O-64.1.10” genotype.

Table 4 Categorization of differentially expressed hub genes (up/down-regulated) in “O-64.1.10” at 15 DPA into protein process; BIN code 29.

Full size table

Transcription factor, protein kinases and transcriptional regulators analysis

From DEGs, it was found that out of 22 TF families that the highest expression occurred for heat shock factor (HSF), AP2/ERF-ERF and basic leucine zipper (bZIP). Additionally, the HSF, bZIP, C2C2-Dof, B3-ARF, BES1, C3H, GRF, HB-HD-ZIP, PLATZ, MADS-MIKC, GARP-G2-like, NAC, OFP, and TUB TF families were exclusively expressed in the "O-64.1.10" genotype at the 15 DPA stage (Fig. 6a, supplemental data 3). Our findings revealed that, the bZIP TFs was related to storage protein gene regulation. TFs of the bZIP family are regulators of many central developmental and physiological processes including storage protein gene regulation⁴⁹, energy metabolism³⁷, light responses and oxidative stress signaling⁵⁰. The regulations of the grain-filling process may be regulated by several TF families, such as bZIP and HSF TFs. The HSF TFs are regulators in response to stress⁵¹, early grain filling¹⁶, seed maturation⁵² and pollen development⁵³. The HSF family is a transcriptional activator of heat shock protein (HSP) genes⁵⁴. The HSP family is an important factor ensuring correct protein folding that plays a significant role in degradation pathways, such as endoplasmic reticulum-associated degradation⁵⁵. Our results showed that LMW-GS genes and ERF TF had a higher expression in “Omid” and “O-64.1.10”, respectively. Hasrak et al.⁵⁶ reported there is a negative correlation between ERF TFs and lower expression of LMW-GS. Our data indicated that the NAC TF (TraesCS2B01G359200.1) were found from highly expressed in “O-64.1.10” genotype. The NAC019 TF regulates glutenin and starch accumulation and its elite allele improves wheat grain quality³⁹. The TKL, STE, CAMK, and Others of PKs were found from highly expressed in the “O-64.1.10” genotypes, whereas the AGC of PK families was expressed only in the “Omid” genotypes (Fig. 6b, supplemental data 3). The PKs may cooperate with the transcriptional networks to refine the regulation of genes during seed development³⁴. In this study, among all genes with differential expression, seven important families of TRs such as HMG, MBF1, MED, mTERF, SET, TAZ, and others genes were found from highly expressed in the “O-64.1.10” genotype. In contrast, one family of TRs (GNAT) only was found from highly expressed in the wild-type genotype (Fig. 6c, supplemental data 3).

Plant hormones and transposable elements (TEs) analysis

Among of the five hormone-related genes that were up-regulated in each of genotypes, three genes associated with gibberellic acid, brassinosteroid, and auxin were expressed only in “O-64.1.10” genotypes (Fig. 6d, supplemental data 3). The plant hormones regulate multiple biological processes in early grain development of wheat and the comprehensive expression profiling provides useful information for hormone regulatory mechanisms¹⁶. The hormone response genes affect the wheat grain quality through indirect pathways⁵⁷.

Our research displayed eleven TEs that were expressed only in “O-64.1.10” genotypes. These genes related to transposon may be responsible for causing the high baking quality in the mutant genotype “O-64.1.10”. So, further research is needed to find mutations governing genes related baking quality. TEs are mobile genetic elements in the eukaryotic genome which alter the expression of neighboring genes via insertion into promoter regions, or disrupt the function of protein-coding genes when inserted into the genes, or even change gene structure by altering its splicing and polyadenylation patterns^58,59,60.

Identification of SSRs

In the context of differential gene expression between the “O-64.1.10” and wild-type genotypes, 127 SSRs were identified. The repeat pattern of the identified SSRs indicated a prevalence of trinucleotide repeats (122) were the most abundant, followed by di- (3) and hexa- (2) nucleotide repeats (Table 5, supplemental data 3). Within the trinucleotide repeats, CCG/CGG (54) was the most frequent motif, followed by AGG/CCT (26) and AGG/CCT (22). The majority of motifs (96.06%) consisted of 5–7 repeats, while motifs with 8–10 repeats were rare (3.94%). For instance, gamma-gliadin contained AAC motifs with 10 repeats. A total of 109 sequences containing SSRs, 17 sequences containing more than one SSR, and 8 sequences containing SSRs in compound formation were identified. For instance, alpha-gliadin contained (ACA)9 and (CAA)8 motifs. These genic SSRs/RNAseq-SSR markers were developed from transcriptome sequences and can be used for marker-assisted selection (MAS).

Table 5 Details of SSR markers obtained from DEGs of “Omid” and “O-64.1.10”.

Full size table

Quantitative Reverse Transcription PCR Validation

qRT-PCR was used to validate the RNA-seq results. Therefore, out of DEGs, three important genes involved in baking quality were selected from the DEGs, including LMW-GS, Alpha-gliadin and Gibberellin. The LMW-GS and gliadins (alfa/beta, omega, and gamma) are the main storage proteins and the major components of the gluten polymer. The accumulation of these storage proteins affects wheat quality formation¹⁷. The relative fold changes in gene expression measured by qRT-PCR were found to be consistent with their expression levels determined by RNA-Seq. Both techniques yielded similar patterns of changes, validating the results obtained from RNA-Seq (Fig. 7).

Conclusions

The transcriptomics analysis is a powerful tool to study molecular processes taking place during grain development. In this study, differential expressed genes of two breed wheat genotypes “O-64.1.10” and “Omid” were investigated at 15 DPA by transcriptomics approaches. a total number of 733 DEGs were identified between each genotype, as well as the DEGs analysis suggested that up-regulated genes in “O-64.1.10” were enriched on A-genome, whereas down-regulated genes in “O-64.1.10” were overexpression on B-genome. Our recent studies have identified several crucial factors that regulate the baking quality of wheat, including grain storage proteins (gluten), TFs, TRs, PKs, plant hormones, carbohydrates (starch and sucrose), lipids (unsaturated fatty acids), and vitamins. Using the KEGG pathway enrichment analysis of DEGs comparing “O-64.1.10” with “Omid” showed that various pathways of metabolic pathways, protein processing in endoplasmic reticulum, biosynthesis of secondary metabolites, spliceosome, ribosome, plant hormone signal transduction, and biosynthesis of amino acids were enriched in the “O-64.1.10” genotype with the highest number of the genes, whereas the pathways associated with metabolic pathways, biosynthesis of secondary metabolites, and ribosome were frequent in the “Omid” genotype. Yu et al.¹⁷ reported that metabolic pathway network analysis that major and minormetabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. Yan et al.⁶¹ showed that carbohydrate and hormone metabolism has play important roles in the grain size and weight in wheat.

Our results indicated that TraesCS2A01G483600 (encodes Elongation factor 1-alpha) expressed only in “O-64.1.10” genotypes. Paul et al.⁴⁸ suggested that Elongation factor 1-alpha gene play an important role in cell expansion in early developmental wheat grain. The validation of transcripts via qRT-PCR presented in this study holds significant academic value. These transcripts have been found to have a positive correlation with the enhancement of the baking quality of wheat grains. Lastly, in addition, we also identified 127 RNA-seq SSR markers among all genes with differential expression in “O-64.1.10” and its wild type genotype. The repeat pattern of identified SSRs indicated an abundance of dinucleotides followed by tri, and hexanucleotide repeats.

In conclusion, results from our study provides valuable shed light into the genetic mechanisms underlying baking quality in wheat flour production, and could have significant implications for the baking industry.

Data availability

The datasets generated and/or analysed during the current study are available in the SRA NCBI repository, https://www.ncbi.nlm.nih.gov/sra/?term=PRJNA979921. BioSample accessions: https://www.ncbi.nlm.nih.gov/biosample/SAMN35615225. https://www.ncbi.nlm.nih.gov/biosample/?term=SAMN35615224.

References

Duan, J., Xia, C., Zhao, G., Jia, J. & Kong, X. Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data. BMC Genomics 13, 1–2 (2012).
Article Google Scholar
Singh, A. et al. Genome-wide transcriptome study in wheat identified candidate genes related to processing quality, majority of them showing interaction (quality x development) and having temporal and spatial distributions. BMC Genomics 15, 1–9 (2014).
Article Google Scholar
Takač, V. et al. Differences in processing quality traits, protein content and composition between spelt and bread wheat genotypes grown under conventional and organic production. Foods 10(1), 156 (2021).
Article PubMed PubMed Central Google Scholar
Xue, C., Matros, A., Mock, H. P. & Mühling, K. H. Protein composition and baking quality of wheat flour as affected by split nitrogen application. Front. Plant Sci. 10, 642. https://doi.org/10.3389/fpls.2019.00642 (2019).
Article PubMed PubMed Central Google Scholar
Peng, Y. et al. Wheat quality formation and its regulatory mechanism. Front. Plant Sci. 13, 834654. https://doi.org/10.3389/fpls.2022.834654 (2022).
Article PubMed PubMed Central Google Scholar
Schuster, C., Huen, J. & Scherf, K. A. Comprehensive study on gluten composition and baking quality of winter wheat. Cereal Chem. 100, 142–155 (2023).
Article CAS Google Scholar
Xue, G. P., Sadat, S., Drenth, J. & Lynne, M. L. The heat shock factor family from Triticum aestivum in response to heat and other major abiotic stresses and their role in regulation of heat shock protein genes. J. Exp. Bot. 65(2), 539–557 (2012).
Article Google Scholar
Oyeyinka, S. A. & Bassey, I. A. Composition, functionality, and baking quality of flour from four brands of wheat flour. J. Culinary Sci. Technol. https://doi.org/10.1080/15428052.2023.21918745 (2023).
Article Google Scholar
Salmanowicz, B. P. et al. The relationship between grain hardness, dough mixing parameters and bread-making quality in winter wheat. Int. J. Mol. Sci. 13(4), 4186–4201 (2012).
Article CAS PubMed PubMed Central Google Scholar
Jańczak-Pieniążek, M. et al. Comparative assessment of the baking quality of hybrid and population wheat cultivars. Appl. Sci. 10, 7104. https://doi.org/10.3390/app10207104 (2020).
Article CAS Google Scholar
Michel, S. et al. Improving the baking quality of bread wheat by genomic selection in early generations. Theor. Appl. Genet. 131, 477–493 (2018).
Article CAS PubMed Google Scholar
Rahemi, M. R., Yamchi, A., Navabpour, S., Soltanloo, H. & Roepstorff, P. Evaluation of gamma ray effect on wheat bakery properties in Omid, Roshan and Tabasai cultivars by artificial neural network. JonSat. 40(4), 119–127 (2020).
Google Scholar
Rangan, P., Furtado, A. & Henry, R. The transcriptome of the developing grain: A resource for understanding seed development and the molecular control of the functional and nutritional properties of wheat. BMC Genomics 18, 766. https://doi.org/10.1186/s12864-017-4154-z (2017).
Article CAS PubMed PubMed Central Google Scholar
Furtado, A. et al. A novel highly differentially expressed gene in wheat endosperm associated with bread quality. Sci. Rep. 5, 10446. https://doi.org/10.1038/srep10446 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Chi, Q. et al. Global transcriptome analysis uncovers the gene co-expression regulation network and key genes involved in grain development of wheat (Triticum aestivum L.). Funct. Integr. Genomics 19, 853–866 (2019).
Article CAS PubMed PubMed Central Google Scholar
Guan, J. et al. Transcriptome analysis of developing wheat grains at rapid expanding phase reveals dynamic gene expression patterns. Biology 11(2), 281 (2022).
Article CAS PubMed PubMed Central Google Scholar
Yu, Y. et al. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development. Crop J. 4(2), 92–106 (2016).
Article Google Scholar
Henry, R. J., Furtado, A. & Rangan, P. Wheat seed transcriptome reveals genes controlling key traits for human preference and crop adaptation. Curr. Opin. Plant Biol. 1(45), 231–236 (2018).
Article Google Scholar
Paul, S. et al. RNA-Seq analysis of developing grains of wheat to intrigue into the complex molecular mechanism of the heat stress response. Front. Plant Sci. 13, 904392 (2022).
Article PubMed PubMed Central Google Scholar
Curtis, T. Y. et al. Contrasting gene expression patterns in grain of high and low asparagine wheat genotypes in response to sulphur supply. BMC Genomics 20, 628 (2019).
Article PubMed PubMed Central Google Scholar
Pfeifer, M. et al. International wheat genome sequencing consortium genome interplay in the grain transcriptome of hexaploid bread wheat. Science 345, 1250091 (2014).
Article PubMed Google Scholar
Shewry, P. R. et al. An integrated study of grain development of wheat (cv. Hereward). J. Cereal Sci. 56(1), 21–30 (2012).
Article Google Scholar
Saulnier, L., Guillon, F. & Chateigner-Boutin, A. L. Cell wall deposition and metabolism in wheat grain. J. Cereal Sci. 56(1), 91–108 (2012).
Article CAS Google Scholar
Zamani, M. J. et al. Marker-assisted selection for recognizing wheat mutant genotypes carrying HMW glutenin alleles related to baking quality. Sci. World J. 2014, 1–5. https://doi.org/10.1155/2014/387912 (2014).
Article Google Scholar
Izadi-Darbandi, A., Yazdi-Samadi, B., Shanejat-Boushehri, A. A. & Mohammadi, M. Allelic variations in Glu-1 and Glu-3 loci of historical and modern Iranian bread wheat (Triticum aestivum L.) cultivars. J. Genet. 89, 193–9 (2010).
Article PubMed Google Scholar
Payne, P. I. Genetics of wheat storage proteins and the effect of allelic variation on bread-making quality. Plant Biol. 38, 141–153 (1987).
CAS Google Scholar
Shewry, P. R., Halford, N. G. & Tatham, A. S. High molecular weight subunits of wheat glutenin. Cereal Sci. 15, 115–119 (1992).
Article Google Scholar
Ahmad, M. Molecular marker-assisted selection of HMW glutenin alleles related to wheat bread quality by PCR-generated DNA markers. Theor. Appl. Genet. 101, 892–896 (2000).
Article CAS Google Scholar
Tabiki, T., Ikeguschi, S. & Ikeda, T. M. Effects of High-molecular-weight and Low-molecular-weight glutenin subunit alleles on common wheat flour quality. Breed. Sci. 56, 131–136 (2006).
Article CAS Google Scholar
Shahnejat-Bushehri, A. A., Gomarian, M. & Yazdi-Samadi, B. The high molecular weight glutenin subunit composition in old and modern bread wheat’s cultivated in Iran. Aust. J. Agric. Res. 57, 1109–1114 (2006).
Article CAS Google Scholar
Kocourková, Z. et al. Wheat breeding for the improved bread-making quality using PCR based markers of glutenins. Czech J. Genet. Plant Breed. 44(3), 105–113 (2008).
Article Google Scholar
Gianibelli, M. C., Echaide, M., Larroque, O. R., Carrillo, J. M. & Dubcovsky, J. Biochemical and molecular characterization of Glu-1 loci in Argentinean wheat cultivars. Euphytica 128, 61–73 (2002).
Article CAS Google Scholar
Locascio, A., Roig-Villanova, I., Bernardi, J. & Varotto, S. Current perspectives on the hormonal control of seed development in Arabidopsis and maize: A focus on auxin. Front. Plant Sci. 5, 412 (2014).
Article PubMed PubMed Central Google Scholar
Xue, L. J., Zhang, J. J. & Xue, H. W. Genome-wide analysis of the complex transcriptional networks of rice developing seeds. PloS One 7(2), e31081. https://doi.org/10.1371/journal.pone.0031081 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Wei, L. et al. Transcriptome analysis of wheat grain using RNA-Seq. Front. Agric. Sci. Eng. 1(3), 214–222 (2015).
Article Google Scholar
Iwata, Y. & Koizumi, N. An Arabidopsis transcription factor, AtbZIP60, regulates the endoplasmic reticulum stress response in a manner unique to plants. Proc Natl Acad Sci USA 102, 5280–5285 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Baena-González, E., Rolland, F., Thevelein, J. M. & Sheen, J. A central integrator of transcription networks in plant stress and energy signalling. Nature 448(7156), 938–942 (2007).
Article ADS PubMed Google Scholar
Finkelstein, R. R. & Lynch, T. J. Abscisic acid inhibition of radicle emergence but not seedling growth is suppressed by sugars. Plant Physiol. 122, 1179–1186 (2000).
Article CAS PubMed PubMed Central Google Scholar
Gao, Y. et al. The endosperm-specific transcription factor TaNAC019 regulates glutenin and starch accumulation and its elite allele improves wheat grain quality. Plant Cell 33, 603–622 (2021).
Article PubMed PubMed Central Google Scholar
Rahemi, M. R., Yamchi, A., Navabpour, S., Soltanloo, H. & Roepstorff, P. Evaluation of gamma ray effect on wheat bakery properties in Omid, Roshan and Tabasai cultivars by artificial neural network. J. Nuclear Sci. Technol. 40(4), 119–127 (2020).
Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12(4), 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28(5), 511–515 (2010).
Article CAS PubMed PubMed Central Google Scholar
Beier, S., Thiel, T., Münch, T., Scholz, U. & Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 33(16), 2583–2585 (2017).
Article CAS PubMed PubMed Central Google Scholar
Thiel, T., Michalek, W., Varshney, R. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–22 (2003).
Article CAS PubMed Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for Biologist Programmers. In Bioinformatics Methods and Protocols (eds Misener, S. & Krawetz, S. A.) 365–386 (Humana Press, 1999). https://doi.org/10.1385/1-59259-192-2:365.
Chapter Google Scholar
Pfaffl, M. W., Horgan, G. W. & Dempfle, L. Relative expression software tool (REST©) for group-wise comparison and statistical analysis of relative expression results in real-time PCR. Nucleic Acids Res. 30(9), e36 (2002).
Article PubMed PubMed Central Google Scholar
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 25(4), 402–408. https://doi.org/10.1006/meth.2001.1262 (2001).
Article CAS PubMed Google Scholar
Paul, S. et al. RNA-Seq analysis of developing grains of wheat to intrigue into the complex molecular mechanism of the heat stress response. Front. Plant Sci. https://doi.org/10.3389/fpls.2022.904392 (2022).
Article PubMed PubMed Central Google Scholar
Lara, P. et al. Synergistic activation of seed storage protein gene expression in Arabidopsis by ABI3 and two bZIPs related to OPAQUE2. J. Biol. Chem. 278(23), 21003–21011 (2003).
Article CAS PubMed Google Scholar
Guedes Corrêa, L. G. et al. The role of bZIP transcription factors in green plant evolution: adaptive features emerging from four founder genes. PLoS One 3(8), e2944. https://doi.org/10.1371/journal.pone.0002944 (2008).
Article ADS CAS Google Scholar
Skylas, D. J. et al. Heat shock of wheat during grain filling: Proteins associated with heat-tolerance. J. Cereal Sci. 35(2), 175–188 (2002).
Article CAS Google Scholar
Reňák, D., Gibalova, A., Šolcová, K. & Honys, D. A new link between stress response and nucleolar function during pollen development in A rabidopsis mediated by AtREN 1 protein. Plant Cell Environ. 37(3), 670–83 (2014).
Article PubMed Google Scholar
Kotak, S., Vierling, E., Baumlein, H. & Koskull-Döring, P. V. A novel transcriptional cascade regulating expression of heat stress proteins during seed development of Arabidopsis. Plant Cell. 19(1), 182–195 (2007).
Article CAS PubMed PubMed Central Google Scholar
Xue, G. P., Sadat, S., Drenth, J. & McIntyre, C. L. The heat shock factor family from Triticum aestivum in response to heat and other major abiotic stresses and their role in regulation of heat shock protein genes. J. Exp. Botany 65(2), 539–557 (2014).
Article CAS Google Scholar
Bozaykut, P., Ozer, N. K. & Karademir, B. Regulation of protein turnover by heat shock proteins. Free Radic. Biol. Med. 1(77), 195–209 (2014).
Article Google Scholar
Hasrak, S. et al. A study to assess the role of gluten encoded genes and their regulatory elements in bread making quality of wheat. Iran. J. Biotech. 17(4), e2164 (2019).
Google Scholar
Yang, Y. et al. Multi-locus GWAS of quality traits in bread wheat: Mining more candidate genes and possible regulatory network. Front. Plant Sci. https://doi.org/10.3389/fpls.2020.01091 (2020).
Article PubMed PubMed Central Google Scholar
Butelli, E. et al. Retrotransposons control fruit-specific, cold-dependent accumulation of anthocyanins in blood oranges. Plant Cell. 24, 1242–1255 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kumar, A. & Bennetzen, J. L. Plant retrotransposons. Annu. Rev. Genet. 33, 479–532 (1999).
Article CAS PubMed Google Scholar
Moden, Y. et al. Comprehensive survey of transposon mPing insertion sites and transcriptome analysis for identifying candidate genes controlling high protein content of rice. Front. Plant Sci. 13, 969582 (2022).
Article Google Scholar
Yan, L. et al. Transcriptome analysis reveals potential mechanisms for different grain size between natural and resynthesized allohexaploid wheats with near-identical AABB genomes. BMC Plant Biol. 18, 28 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge the reviewers for their helpful and constructive comments on this manuscript.

Funding

This study was supported by a grant from Gorgan University of Agricultural and Natural Resources.

Author information

Authors and Affiliations

Crop and Horticultural Science Research Department, Agricultural Research, Education and Extension Organization (AREEO), Golestan Agricultural and Natural Resources Research and Education Center, Gorgan, Iran
Hossein Ahmadi-Ochtapeh
Plant Breeding and Biotechnology Department, Gorgan University of Agricultural Sciences and Natural Resources (GUASNR), Gorgan, Iran
Hassan Soltanloo, Seyyede Sanaz Ramezanpour & Ahad Yamchi
Department of Plant Molecular Biotechnology, Assistant Professor in National Institute of Genetic Engineering and Biotechnology, Karaj, Iran
Vahid Shariati

Authors

Hossein Ahmadi-Ochtapeh
View author publications
You can also search for this author in PubMed Google Scholar
Hassan Soltanloo
View author publications
You can also search for this author in PubMed Google Scholar
Seyyede Sanaz Ramezanpour
View author publications
You can also search for this author in PubMed Google Scholar
Ahad Yamchi
View author publications
You can also search for this author in PubMed Google Scholar
Vahid Shariati
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors reviewed the manuscript Hossein Ahmadi-Ochtapeh wrote the main manuscript and Hassan Soltanloo edited Advisors, Seyyede Sanaz Ramezanpour, Ahad Yamchi and Vahid Shariati respectively.

Corresponding author

Correspondence to Hassan Soltanloo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ahmadi-Ochtapeh, H., Soltanloo, H., Ramezanpour, S.S. et al. RNA-Seq transcriptome profiling of immature grain wheat is a technique for understanding comparative modeling of baking quality. Sci Rep 14, 10940 (2024). https://doi.org/10.1038/s41598-024-61528-y

Download citation

Received: 23 May 2023
Accepted: 07 May 2024
Published: 13 May 2024
DOI: https://doi.org/10.1038/s41598-024-61528-y

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Unraveling the complexity of faba bean (Vicia faba L.) transcriptome to reveal cold-stress-responsive genes using long-read isoform sequencing technology

Transcriptome dynamics of Gossypium purpurascens in response to abiotic stresses by Iso-seq and RNA-seq data

De-novo transcriptome analysis unveils differentially expressed genes regulating drought and salt stress response in Panicum sumatrense

Introduction

Material and methods

Plant materials and sampling

RNA isolation and cDNA library construction

Data processing and analysis

Validation of RNA-seq data by real-time PCR

Gene ontology analysis

Result

Grain morphology changes during seed development

Identification of differentially expressed genes

Functional classification and gene ontology analysis

Transcription factor, protein kinases and transcriptional regulators analysis

Plant hormones and transposable elements (TEs) analysis

Identification of SSRs

Quantitative Reverse Transcription PCR Validation

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

Search

Quick links