Transcriptome analysis of differentially expressed unigenes involved in flavonoid biosynthesis during flower development of Chrysanthemum morifolium ‘Chuju’

Yue, Junyang; Zhu, Chuanxue; Zhou, Yu; Niu, Xiangli; Miao, Min; Tang, Xiaofeng; Chen, Fadi; Zhao, Weiping; Liu, Yongsheng

doi:10.1038/s41598-018-31831-6

Download PDF

Article
Open access
Published: 07 September 2018

Transcriptome analysis of differentially expressed unigenes involved in flavonoid biosynthesis during flower development of Chrysanthemum morifolium ‘Chuju’

Junyang Yue^1,2,3^na1,
Chuanxue Zhu¹^na1,
Yu Zhou¹,
Xiangli Niu¹,
Min Miao¹,
Xiaofeng Tang¹,
Fadi Chen⁴,
Weiping Zhao⁵ &
…
Yongsheng Liu^1,3

Scientific Reports volume 8, Article number: 13414 (2018) Cite this article

4754 Accesses
42 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Chrysanthemum morifolium is an ornamentally and medicinally important plant species. Up to date, molecular and genetic investigations have largely focused on determination of flowering time in the ornamental species. However, little is known about gene regulatory networks for the biosynthesis of flavonoids in the medicinal species. In the current study, we employed the high-throughput sequencing technology to profile the genome-wide transcriptome of C. morifolium ‘Chuju’, a famous medicinal species in traditional Chinese medicine. A total of 63,854 unigenes with an average length of 741 bp were obtained. Bioinformatic analysis has identified a great number of structural and regulatory unigenes potentially participating in the flavonoid biosynthetic pathway. According to the comparison of digital gene expression, 8,370 (3,026 up-regulated and 5,344 down-regulated), 1,348 (717 up-regulated and 631 down-regulated) and 944 (206 up-regulated and 738 down-regulated) differentially expressed unigenes (DEUs) were detected in the early, middle and mature growth phases, respectively. Among them, many DEUs were implicated in controlling the biosynthesis and composition of flavonoids from the budding to full blooming stages during flower development. Furthermore, the expression patterns of 12 unigenes involved in flavonoid biosynthesis were generally validated by using quantitative real time PCR. These findings could shed light on the molecular basis of flavonoid biosynthesis in C. morifolium ‘Chuju’ and provide a genetic resource for breeding varieties with improved nutritional quality.

Spatial co-transcriptomics reveals discrete stages of the arbuscular mycorrhizal symbiosis

Article Open access 08 April 2024

Karen Serrano, Margaret Bezrutczyk, … Benjamin Cole

Single-cell and spatial RNA sequencing reveal the spatiotemporal trajectories of fruit senescence

Article Open access 10 April 2024

Xin Li, Bairu Li, … Robert Henry

A chromosomal-scale genome assembly of modern cultivated hybrid sugarcane provides insights into origination and evolution

Article Open access 08 April 2024

Yixue Bao, Qing Zhang, … Muqing Zhang

Introduction

Chrysanthemum morifolium, a species of herbaceous perennial plant from Asteraceae family, is commonly called Flos chrysanthemi in Latin and Juhua in mandarin¹. It is native of China although nowadays is widely cultivated worldwide. As one of the most important ornamental species, C. morifolium has already become the second economically most important floricultural crop following rose and its industry is still flourishing due to a steadily increased demand in cut flower market². Furthermore, the dry capitulum of C. morifolium is also a medicinal and edible cognate that has been used as an herbal tea or a food supplement for over 2000 years of history in China³.

Through a long time of selection, many commercial cultivars of C. morifolium are used specially for their curative effects rather than esthetic values⁴. According to growing regions, the main medicinal cultivars available in the herb markets are divided into Boju (Bozhou, Anhui Province), Chuanju (Zhongjiang, Sichuan Province), Chuju (Chuzhou, Anhui Province), Hangju (Tongxiang, Zhejiang Province), Huaiju (Wuzhi, Henan Province), Qiju (Anguo, Hebei Province), Yanju (Yancheng, Jiangsu Province). Among them, Chuju, Hangju, Huaiju and Boju are the four cultivars with highest reputations due to their quality and application in traditional Chinese medicine (TCM)⁴. Accumulating pharmacological studies have shown that the capitulum of medicinal C. morifolium possesses significant amounts of natural antioxidants, which execute essential functions in anti-inflammatory, anti-bacterial, anti-tumor and cardiovascular protective^5,6,7,8. Therefore, it is important to better understand molecular mechanisms involved in the biosynthesis and accumulation of antioxidant metabolites throughout the flower development of C. morifolium.

Flavonoids, a class of powerful antioxidant compounds, are very rich and primarily responsible for the aforementioned bioactivities in medicinal C. morifolium⁹. They are synthesized by starting with the phenylpropanoid pathway and subsequently leading to nine different metabolic branches, including the products of chalcones, aurones, isoflavonoids, flavones, flavonols, flavandiols, anthocyanins, condensed tannins, and phlobaphene¹⁰. Up to now, flavonoid-related genes have been extensively characterized through both forward and reverse genetic approaches in a range of model species¹¹. But in C. morifolium, only several homologous genes, such as flavonoid 3′-hydroxylase (F3′H), flavonoid 3′,5′-hydroxylase (F3′5′H) and cytochrome P450 have been isolated so far^12,13,14. The majority of genes that involve in the flavonoid accumulation are not yet known in the medicinal C. morifolium species.

Total RNA sequencing (RNA-Seq) could provide comprehensive insights into biological pathways by revealing the presence and quantity of gene expression levels¹⁵. In C. morifolium ‘Fenditan’, a large number of transcription factors and structural genes involved in the photoperiod pathway as well as flower organ determination have been identified from whole transcriptome data by using the RNA-Seq technology¹⁶. Meanwhile, a transcriptome analysis of another cultivar C. morifolium ‘Yuuka’ succeeded in identifying many differentially expressed genes under the treatment of various daylengths¹⁷. While this technology has been intensively applied for studying flowering-time regulation in ornamental C. morifolium cultivars, but to our knowledge, no transcriptomic analysis regarding the flavonoid biosynthesis has been reported in medicinal C. morifolium cultivars.

In the present study, whole transcriptome sequencing and comprehensive comparative analysis were performed using flowers of four sequentially developmental stages to identify flavonoid-associated genes in C. morifolium ‘Chuju’, a cultivar derived from TCM with top medicinal quality. Bioinformatics analysis of the sequencing data revealed a wide range of unigenes putatively participating in the flavonoid pathway, some of which represented differential expression patterns across the whole period of flower development. These results we obtained could help further elucidating the molecular mechanisms of flavonoid biosynthesis and better understanding the regulatory networks of gene expression.

Results

Transcriptome sequencing and de novo assembly

To obtain a comprehensive view of candidate genes involved in the flavonoid biosynthetic pathway, we have pooled and sequenced equivalent quantities of RNA extracts isolated from four sequential stages throughout flower development in C. morifolium ‘Chuju’, which were denoted as the budding (BD), bud breaking (BB), early blooming (EB) and full blooming (FB) stages (Fig. 1, detailed in Methods). In total, 46,527,128 raw reads were generated using the RNA-Seq technology. After removing adaptors, cleaning up contaminations and filtering out low quality reads, 38,454,857 clean reads with a total data of 7,767,261,963 bp (7.77 GB) were obtained. Their average Q30 and GC percentage were 95.07% and 43.16%, respectively. Based on the high-quality data, 3,475,234 contigs with 225,683,083 bp of data were subsequently assembled. Finally, 195,160 transcripts and 63,854 unigenes were successively recognized from these contigs via the paired-end relationships. The non-redundant unigenes obtained were associated with an average length of 741 bp and an N50 value of 1,234 bp (Table 1). The raw sequencing data have been deposited at the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI) database under accession number PRJNA397042 (http://www.ncbi.nlm.nih.gov/bioproject/PRJNA397042).

Table 1 Summary for the transcriptome assembly.

Full size table

Gene annotation and functional classification

In order to obtain annotations of the assembled unigenes, their sequences were aligned against a series of publicly available nucleotide and protein databases. Of all these 63,854 unigenes, 34,362, 23,579, 7,694 and 10,578 were respectively aligned to the NCBI non-redundant (nr), Swiss-Prot, Kyoto encyclopedia of genes and genomes (KEGG) and Clusters of orthologous groups (COG) databases by using an E-value cutoff of 1e−5 (Table 2). After eliminating redundancy from different databases, a total of 34,605 unigenes were annotated at least once, accounting for approximately 54.19%. Among them, 34,362 could be annotated to the nr database, covering up to 99.30% of the total annotated unigenes. A statistical analysis of the distributed E-value has revealed that 79.02% of the mapped sequences have strong homologies (E-value < 1e-20) and 54.44% sequences have extremely strong homologies (E-value < 1e-50) to the available plant sequences in the nr database (Fig. 2a). Furthermore, species distribution of the top BLAST hits for the best alignment against the nr database was presented in Fig. 2b. The top scoring is Vitis vinifera (22.14%), followed by Solanum lycopersicum (13.81%) and Theobroma cacao (9.68%). In comparison, there are only 3.02% of the aligned sequences from the Asteraceae species, including Artemisia (0.52%), Carthamus (0.07%), Chrysanthemum (0.40%), Gerbera (0.11%), Helianthus (1.26%), Lactuca (0.18%), Senecio (0.07%) and Zinnia (0.25%).

Table 2 Annotation of unigenes against diverse databases.

Full size table

The Swiss-Prot database was reviewed by manual curation, consequently representing a high quality and accuracy. In the present study, 23,579 unigenes were annotated to the Swiss-Prot database, with a sharing of 23,386 annotated unigenes to the nr database. In addition, 7,694 and 10,578 unigenes were mapped into 118 KEGG pathways and 25 COG categories, respectively. The KEGG pathways for ‘metabolic pathways’ represented the largest group, followed by ‘biosynthesis of secondary metabolites’, ‘ribosome’, ‘protein processing in endoplasmic reticulum’, ‘plant hormone signal transduction’, ‘spliceosome’ and ‘RNA transport’ (Supplementary Table S1). Among the 25 COG categories, the clusters that frequently mapped were ‘general function prediction only’, ‘replication, recombination and repair’, ‘transcription’, ‘signal transduction mechanisms’ and ‘posttranslational modification, protein turnover, chaperones’ (Fig. 3).

On the basis of Gene Ontology (GO) annotation, 25,394 unigenes could be assigned to at least one GO category using the Blast2GO pipeline¹⁸. Among them, 19,856 unigenes were classified into the biological process category, 18,447 unigenes were classified into the cellular component category, and 19,130 unigenes were classified into the molecular function category. As summarized at the level 2, a total of 49 functional GO terms were annotated (Fig. 4). For each of the three main categories, the dominant GO terms were ‘cellular process’ (in ‘biological process’), ‘cell or cell part’ (in ‘cellular component’) and ‘catalytic activity’ (in ‘molecular function’), which indicated that the basic cellular process in cells were catalytic activities during flower development of C. morifolium ‘Chuju’. Also, the high percent of ‘metabolic process’ (in ‘biological process’) and ‘binding’ (in ‘molecular function’) might suggest that metabolic processes are regulated by a wide range of functional protein complex through interacting with each other. By contrast, there were relatively few genes from ‘locomotion’ (in ‘biological process’), ‘extracellular region part’ (in ‘cellular component’) and ‘translation regulator activity’ (in ‘molecular function’) in the three main categories, respectively.

Digital gene expression profiling and in silico analysis

To explore the patterns of gene expression throughout flower development, digital gene expression (DGE) profiling was also performed for each flower sample from the BD, BB, EB and FB stages of C. morifolium ‘Chuju’ by using the RNA-Seq technology. As a result, the average number of raw reads generated from these four samples was 10,432,456, 10,324,977, 11,330,110 and 10,814,843, respectively. After filtering and cleaning, 7,560,601, 8,270,676, 8,813,243 and 8,561,427 high-quality reads were obtained on average. Using the assembled unigenes as references, 48,336, 57,768, 56,679 and 54,873 sequences were respectively detected with a threshold RPKM (reads per kilobase per million mapped reads) value of 0.1. Among them, 42,348 unigenes were commonly expressed in all the four developmental stages, accounting for most of the identified unigenes. On the other hand, only 225, 1,254, 678 and 1,001 unigenes were specifically expressed in the BD, BB, EB and FB stages, respectively (Fig. 5a). These expression profile data were also deposited at the SRA database under the accession number PRJNA397042 (http://www.ncbi.nlm.nih.gov/bioproject/PRJNA397042).

Subsequently, differentially expressed unigenes (DEUs) were identified between every two sequential stages, including the early (between the BD and BB stages), middle (between the BB and EB stages) and mature (between the EB and FB stages) growth phases. The comparative results revealed that there were 8,370 DEUs (3,026 up-regulated and 5,344 down-regulated) in the early growth phase, 1,348 DEUs (717 up-regulated and 631 down-regulated) in the middle growth phase, and 944 DEUs (206 up-regulated and 738 down-regulated) in the mature growth phase (Fig. 5b). As shown in the figure, an overall decreasing number of unigenes were differentially expressed during flower development of C. morifolium.

In order to inspect the functional classification of these DEUs, all the unigenes involved were assigned to biological process and molecular function categories according to the GO annotation¹⁹. Generally, a great number of functional enzymes and regulatory proteins were identified (Fig. 6). Using the entire unigenes assembled from transcriptome analysis as background, significantly enriched GO terms for DEUs in each growth phase were obtained based on the Hypergometric test. In the early growth phase, the up-regulated unigenes were significantly enriched in ‘microtubule binding’ (in ‘molecular function’), ‘microtubule motor activity’ (in ‘molecular function’) and ‘regulation of flower development’ (in ‘biological process’) (Supplementary Table S2), while the down-regulated unigenes were significantly enriched in ‘chlorophyll binding’ (in ‘molecular function’), ‘response to cadmium ion’ (in ‘biological process’) and ‘water transport’ (in ‘biological process’) (Supplementary Table S3). In the middle growth phase, the up-regulated unigenes were significantly enriched in ‘plant-type cell wall modification’ (in ‘biological process’), ‘pollen tube growth’ (in ‘biological process’) and ‘hydrogen-exporting ATPase activity, phosphorylative mechanism’ (in ‘molecular function’) (Supplementary Table S4), while the down-regulated unigenes were significantly enriched in ‘cell proliferation’ (in ‘biological process’), ‘nucleosome assembly’ (in ‘biological process’) and ‘DNA unwinding involved in DNA replication’ (in ‘biological process’) (Supplementary Table S5). In the mature growth phase, the up-regulated unigenes were significantly enriched in ‘regulation of cellular component organization’ (in ‘biological process’), ‘protein complex subunit organization’ (in ‘biological process’) and ‘phosphatidylinositol phosphate binding’ (in ‘molecular function’) (Supplementary Table S6), while the down-regulated unigenes were significantly enriched in ‘fatty acid metabolic process’ (in ‘biological process’), ‘alkane 1-monooxygenase activity’ (in ‘molecular function’) and ‘ATP hydrolysis coupled proton transport’ (in ‘biological process’) (Supplementary Table S7).

Characterization of functional genes involved in flavonoid biosynthesis

Generally, genes involved in the flavonoid pathway are divided into two categories: structural genes that encode flavonoid biosynthetic enzymes and regulatory genes that control the expression of structural genes²⁰. According to existing experimental findings, the central pathway for flavonoid biosynthesis is conserved in plants and many functional genes involved have been extensively characterized in model species²¹. Based on bioinformatics analysis from transcriptome of C. morifolium, many unigenes were identified as homologous sequences of the major structural genes (Table 3), including chalcone synthase (CHS), chalcone isomerase (CHI), flavone synthase (FNS), flavanone 3-hydroxylase (F3H), flavonoid 3′-hydroxylase (F3′H), flavonoid 3′,5′-hydroxylase (F3'5′H), flavonol synthase (FLS), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS) and UDP-glucose-flavonoid 3-O-glucosyltransferase (3GT). Meanwhile, 1,077 unigenes were predicted as transcription factors and distributed in 66 different families by using the iTAK online program (Supplementary Table S8)²². Among them, the most frequently occurring was the C2H2 family, followed by the AP2/ERF-ERF, bHLH, MYB and MYB-related families. With respect to transcription factors involved in the regulation of flavonoid biosynthesis, the bHLH, MYB and MYB-related families have been extensively documented to activate or repress transcription of specific target genes through formation of the highly dynamic MYB-bHLH-WD40 (MBW) complex in a number of plant species^23,24. Specially, the common function of WD40 proteins is coordinating multiple proteins to form complex assemblies by protein-protein interactions and a total of 37 homologous members were identified in this study.

Table 3 Homologous sequences of the major structural genes involved in flavonoid biosynthesis.

Full size table

Based on the above analysis of DEG profiling, a majority of the structural genes were expressed throughout flower development. Besides, there are two unigenes (unigene ID: c101558.graph_c0, a member of CHS homologies; unigene ID: c33294.graph_c0, a member of FLS homologies) that were not always expressed (Fig. 7). Among them, 18 homologous members were identified as DEUs and their expression patterns were compared in detail (Fig. 8). Interestingly, differential expression of all these unigenes was detected only once among the three comparisons and a majority of them were significantly down-regulated in the early growth phase.

Using the pattern score described by Muhlemann²⁵, we also found that MYB, bHLH and WD40 were expressed in 8, 11 and 3 different patterns, respectively (Fig. 9). Overall, the expression of most unigenes did not significantly fluctuate throughout flower development as they fell into the pattern score {0,0,0}. While with regard to the DEUs, a majority fell into the patterns {1,0,0} and {−1,0,0}, indicating that these unigenes were differentially expressed from BD to BB stages. Interestingly, the expression of WD40 only fell into the patterns {1,0,0} and {−1,0,0}.

Quantitative real time PCR validation of gene expression

To confirm the high-throughput sequencing data, quantitative real-time PCR (qRT-PCR) was employed as an effective method due to its sensitivity, specificity and rapidity²⁶. Specifically, 12 unigenes putatively participating in the flavonoid pathway were randomly chosen for validation of their expression levels and patterns at different developmental stages. As a result, the qRT-PCR analysis revealed that most of the tested unigenes had a strong correlation with the expression data obtained from RNA-Seq assay (Fig. 10). By contrast, two unigenes (c72593.graph_c0 and c82790.graph_c0) showed inconsistent patterns between qRT-PCR and RNA-Seq data. The reason might be that these two genes are expressed at extremely low levels at certain stages, which tends to bring error rates²⁷. Specially, the contradiction is mainly from the EB stage in c72593.graph_c0 gene expression and the BD stage in c82790.graph_c0 gene expression.

Discussion

For the transcriptome of mixed samples from four sequentially developmental stages of C. morifolium ‘Chuju’ flowers, approximately 7.77 Gb of high-quality data were obtained and assembled into 63,854 unigenes with an average length of 741 bp. Among them, 34,605 unigenes were assigned a specific or general function on the basis of comparison against sequences in the nr, Swiss-Prot, KEGG, COG and GO databases, corresponding to approximately 54.19% of the total unigenes. This value is similar to that in C. morifolium ‘Fenditan’ (47%)¹⁶, C. morifolium ‘Yuuka’ (60%)¹⁷ and C. lavandulifolium (53%)²⁸ based on an equivalent strategy of transcriptome analysis.

Subsequently, DGE profiling was performed in the individually developmental stages. Due to a lack of genomic information of C. morifolium, the sequencing data were aligned against the unigenes assembled from previous transcriptome analysis. As mentioned above, flowers from the BD, BB, EB and FB stages shared 42,348 unigenes in common, indicating that the majority of unigenes were constantly expressed across the entire developmental stages. Meanwhile, comparative analysis further demonstrated that the expression levels of most unigenes were generally stably throughout flower development. Undoubtedly, the constant and stable expression of unigenes should suggest their essential roles for maintaining flower development. By contrast, differentially expressed unigenes might imply their regulatory function during flower development in C. morifolium.

Actually, flavonoid biosynthetic pathway was well established in model species. CHS is the first committed enzyme specific for the flavonoid biosynthesis²⁹. Then, the following steps catalyzed by CHI, FNS, F3H, F3′H, F3′5′H, FLS, DFR, ANS and 3GT will lead to production of different flavonoid subgroups by modifying molecular skeleton and/or backbone. In this work, homologous unigenes of fourteen CHS, four CHI, two FNS, ten F3H, seven F3′′H, eighteen F3′5′H, seventeen FLS, eight DFR, four ANS, and thirteen 3GT were identified (Table 3). The average values of their expression levels are 33.3, 31.4, 27.9 and 11.6 in the four successive stages of flower development, respectively. Using the Wilcoxon test, we found that these values except for 11.6 are significantly higher than the average values of 14.5, 13.1, 13.3 and 13.0 for the entire unigene expression levels in each developmental stage (P-value < 1e-10). It should be noteworthy that a higher expression level of a gene stands for a more important function in a given cell³⁰. Meanwhile, different members from the same families of structural genes might be responsible for enzymes that differ for substrate specificities³¹. Therefore, these results could partially explain why a variety of flavonoid compounds were accumulated in the dry C. morifolium ‘Chuju’, and might indicate that the biosynthesis of flavonoids is enhanced at the beginning but not in the process of flowering.

Furthermore, we observed that the ray florets undergo a gradual change in color, ranging from yellow to white during the flower development (Fig. 1). This change is due to an increase in flavone and flavonol content as well as a possible decrease in anthocyanin accumulation³². Chemical component analysis also revealed that the dry capitulum is rich in flavone and flavonol, such as acacetin, apigenin, luteolin and quercetin³³. Hence, it is believed that there may be a dynamic variation of dominate metabolic pathway at different stages of flower development in C. morifolium ‘Chuju’. Coincident with the reduction of anthocyanin content, a majority of structural genes involved in the flavonoid biosynthesis were significantly down-regulated in the early growth phase as shown in Fig. 8. On the other hand, the only up-regulated gene FLS (unigene ID: c78657.graph_c0) in the mature growth phase, will make the flowers produce more amount of flavonols. These findings indicated that flavonoid biosynthesis might be synchronously controlled by the process of flower development.

Generally, the relationship of branched metabolic pathways involved in flavonoid biosynthesis could be regulated by the common upstream signaling. Emerging evidence has shown that the expression of structural genes participating in the flavonoid biosynthetic pathway is tightly regulated during flower development. To date, the ternary MYB-bHLH-WD40 (MBW) transcriptional activator responsible for the regulation of flavonoid metabolic pathway is well characterized and highly conserved throughout the plant kingdom. In Arabidopsis, TT2 (a MYB transcription factor), TT8 (a bHLH transcription factor) and TTG1 (a WD40 repeat protein) could interact with each other to form a MBW complex and subsequently activate the expression of downstream genes involved in flavonoid metabolism to promote the biosynthesis of proanthocyanidin in the seed coat³⁴. In grape, VvMYBA1 and VvMYBA2 could promote anthocyanin biosynthesis by activating the expression of an anthocyanin acyltransferase³⁵. In apple, MdbHLH3 could also increase anthocyanin biosynthesis through activating the expression of structural genes in isolation³⁶. Nonetheless, some members of the MYB family, such as MYBL2 in Arabidopsis³⁷, VvMYBC2 in grape³⁸, MYB182 in poplar³⁹, MYB27 in petunia⁴⁰, GmMYB100 in soybean⁴¹, may act as repressors to negatively regulate the biosynthesis of flavonoids by interfering with the formation of functional MBW complex.

In the present study, a number of unigenes homologous to MYB, bHLH and WD40 were identified with diverse expression patterns. While the different expression patterns of unigenes usually stand for their different roles in regulatory network³⁰, the huge expression variation of regulatory unigenes in the early growth phase might largely attributed to the metabolic adjustments at flower bud developmental stage (Fig. 9). Notably, these findings are in line with previous expression analysis that a majority of the structural unigenes were significantly regulated and the metabolic flux was redistributed towards a production of more flavonols following the flower bud developmental stage. Therefore, an implication is that in C. morifolium ‘Chuju’, like in many other plant species, the ternary MBW complex may also play important roles in controlling the biosynthesis and composition of flavonoids along with flower development.

To further explore the functional relationships between MBW complex and structural genes, we performed a gene co-expression network analysis and built a potential regulatory network. Each of the structural genes with any available regulatory genes were defined as a “network motif”. As a result, 15 network motifs including 18 regulatory genes and 8 structural genes were constructed (P-value < 0.01) (Fig. 11; Supplementary Table S9). Our data support that different members of the MBW complex might undergo different regulatory mechanisms with either positive control or negative control. Also, different members or combinations are likely responsible for different enzymes. For example, CHS might be regulated by MBW complex and 3GT could be regulated by the MYB transcriptional factor independently. Specially, the common function of WD40 proteins is coordinating multi-protein complex assemblies. As shown here, no WD40 proteins were identified alone, but always associated with the MYB or bHLH transcriptional factors.

In summary, the comparative transcriptome analysis of C. morifolium ‘Chuju’ has revealed that the biosynthesis of flavonoids is detected from the beginning of flower bud formation and tightly regulated by the MBW complex throughout diverse flower developmental stages. After flower bud development, the distribution of metabolic flux was varied towards different branches in the flavonoid biosynthetic pathway. Many regulatory and structural unigenes possibly participating in flavonoid biosynthesis were identified and bridged, which could provide valuable genetic resources for studies of metabolic pathway, nutrition improvement and transgenic breeding in chrysanthemum.

Methods

Plant material and sample collection

C. morifolium Ramat. cv. ‘Chuju’ was grown under natural conditions in the experimental field of Chuzhou University at Chuzhou, Anhui province, People’s Republic of China (N32°17′, E118°18′). The annual average of temperature and precipitation is approximately 15.42 °C and 1,054 millimeters, respectively. Based on the pre-existing study⁴², flower cuttings were collected from four sequentially developmental stages: budding (BD), bud breaking (BB), early blooming (EB) and full blooming (FB) stages (Fig. 1). The BD stage started on around Oct 25th and the subsequent stages are approximately one week later in turn. Meanwhile, multiple flowers from three plants were sampled for biological replicates. After harvest, these flower tissues were frozen immediately in liquid nitrogen and stored at −80 °C until further required. The location of this field nursery is not privately-owned or protected in any way.

RNA extraction and high-throughput sequencing

Total RNAs were extracted from flower tissues with the TRIzol reagent (Invitrogen, MA, the United States) by following the manufacturer’s protocol. RNA quality and quantity were evaluated using an Agilent 2100 Bioanalyzer device (Agilent, CA, the United States). Only the samples with a λ_260/280 ratio of 1.8–2.1 and a λ_260/230 ratio of 2.0–2.5 were retained. Next, RNase-free DNase I (Takara, Dalian, China) was used to remove the contaminating DNA, and oligo (dT) coated magnetic beads was mixed to separate the poly (A) fraction. The obtained mRNA molecules were sequentially broken into short fragments and converted to cDNA by DNA polymerase I (Takara, Dalian, China) through the reverse transcription polymerase chain reaction (RT-PCR) amplifications. The reaction mixture was subjected to polyacrylamide gel electrophoresis (PAGE) and the suitable fragments were isolated and purified. Finally, 30 µg library that has been equivalently pooled from the four stages of flower tissues was submitted for whole transcriptome sequencing and about 10 µg libraries remaining from each flower tissue were directly used for digital gene expression sequencing. All the high-throughput sequencing assay was performed using the HiSeq. 2000 system (Illumina, CA, the United States) at the Biomarker Institute (Beijing, China) with two independent technical replicates.

Transcriptome assembly and gene annotation

After removing the 5′ adaptor, trimming the 3′ acceptor, filtering those low quality reads with a quality value (Q) less than 10 (the quality value was calculated as following: Q = ASCII character code – 64) and cleaning up contaminated reads⁴³, we consequently obtained the clean reads. First, the distinct contigs were assembled with the short reads by using the software program SOAP2⁴⁴. Then, these reads were aligned back to the assembled contigs, which could detect those contigs from the same transcript. Next, scaffolds between closed contigs were constructed by employing the paired-end mapping analysis. Finally, paired-end reads were used again to fill the intra-scaffold gaps and a set of non-redundant unigenes were constructed with the least amount of unaligned reads.

For functional annotation, the BLAST program (E-value = 1e−5) was conducted between unigenes and various nucleotide/protein databases, namely the nr, Swiss-Prot, KEGG, COG and GO databases. Within the alignment against each database, the best aligning results were reserved. Specifically, when results from different databases conflicted with each other, a priority order of nr, Swiss-Prot, KEGG, COG and GO was followed to determine the annotated unigenes. Subsequently, the Blast2GO pipeline¹⁸ and WEGO online tool⁴⁵ were performed to assign and compare GO terms of unigenes in turn. Furthermore, the transcription factors were predicted by the iTAK online program²².

DEU identification and enrichment analysis

To eliminate any possible bias generated from variation in sequence length, the expression levels of unigenes were calculated using the RPKM method⁴⁶. DEUs were identified between every two successive stages based on a DESeq algorithm⁴⁷. The false discovery rate (FDR) method⁴⁸ was applied to correct for P-value when calculating the differential expression in multiple tests. Obviously, the smaller FDR and higher ratio of a given unigenes represent the larger variation in expression. Here, an FDR threshold of <0.01 and a fold-change threshold of >2 were selected to define the significant differences in unigene expression.

Subsequently, the identified DEUs were assigned to GO functional annotation. The enrichment analysis of each GO term was performed using the Hypergenometric test as follows:

$${f}({k}\,;\,{n},\,{m},\,{N})=\frac{(\begin{array}{c}{m}\\ {k}\end{array})(\begin{array}{c}{N}-{m}\\ {n}-{k}\end{array})}{(\begin{array}{c}{N}\\ {n}\end{array})}$$

$$P \mbox{-} value=1-{f}({k}\,{;}\,n{,}\,m{,}\,{N})$$

where N was defined as total number of annotated GO terms in all the unigenes, M was total number of a specific GO term in all the unigenes, n was total number of GO terms in all the DEUs, k was total number of a specific GO term in all the DEUs.

Building a gene regulatory network

The potential gene regulatory network was constructed based on the assessment of co-expression intensity between regulatory genes and structural genes as described in our previous work⁴⁹. For each assessment, all the DGE data across the four sequentially developmental stages were extracted and used. Co-expression intensity was calculated as a Pearson’s correlation coefficient (R), which takes a range of values from +1 to −1. In the current study, an R-value of 0.8 and P-value of 0.01 were firstly employed as thresholds. Then, positive maximum values and negative maximum values were retained, which stand for the most likely positive regulation and negative regulation, respectively.

Verification by qRT-PCR analysis

The expression patterns of unigenes in the RNA-Seq analysis were further confirmed by qRT-PCR method using an Applied Biosystems StepOne^TM Real-Time PCR System (Applied Biosystems, MA, the United States) and a FastStart Universal SYBR Green Master (Roche, Basel, Switzerland). A total of 12 unigenes that participate in the flavonoid biosynthetic pathway were randomly chosen and their appropriate primers were designed by using the Primer5 software (Table 4). The reactions were performed in a 96-well optical plate using the following conditions: an initial polymerase activation step for 30 s at 95 °C, 40 cycles of 5 s at 95 °C for denaturation, 20 s at 60 °C for annealing and elongation. After reaction, the threshold cycle (Ct), defined as the fractional cycle number at which the fluorescence passed a fixed threshold, was determined using the default threshold settings. Relative expression levels were calculated by the ${2}^{-{\rm{\Delta }}{\rm{\Delta }}{C}_{{\rm{T}}}}$ method⁵⁰. Each sample of the BD, BB, EB and FB tissues that were originally used for RNA-Seq assay was represented with three biological replicates. The catalytic subunit of protein phosphatase 2A (PP2Acs) encoding gene (unigene ID: c85200.graph_c0) was used as an internal reference (Table 4)⁵¹.

Table 4 Primers used for the qRT-PCR analysis.

Full size table

References

Hu, C. Chrysanthemum morifolium Ramat 菊花 (Juhua, Florists Chrysanthemum). In: Liu, Y. et al. (eds) Dietary Chinese Herbs. Vienna : Springer 681–691 (2015).
Teixeira, da S. J. A. Chrysanthemum: advances in tissue culture, cryopreservation, postharvest technology, genetics and transgenic biotechnology. Biotechnology Advances 21, 715–766 (2003).
Article CAS Google Scholar
Liu, D. H. et al. Nitrogen effects on total flavonoids, chlorogenic acid, and antioxidant activity of the medicinal plant Chrysanthemum morifolium. Journal of Plant Nutrition and Soil Science 173, 268–274 (2010).
Article CAS Google Scholar
Chinese Pharmmacopoeia Editorial Committee. Pharmmacopoeia of the People’s Republic of China. Beijing: Chemical Industry Press 1, 292 (2010).
Google Scholar
Ukiya, M. et al. Constituents of compositae plants. 2. Triterpene diols, triols, and their 3-o-fatty acid esters from edible chrysanthemum flower extract and their anti-inflammatory effects. Journal of Agricultural and Food Chemistry 49, 3187–3197 (2001).
Article PubMed CAS Google Scholar
Shan, B. et al. The in vitro antibacterial activity of dietary spice and medicinal herb extracts. International Journal of Food Microbiology 117, 112–119 (2007).
Article PubMed CAS Google Scholar
Xie, Y. Y. et al. Cytotoxic activity of flavonoids from the flowers of Chrysanthemum morifolium on human colon cancer Colon205 cells. Journal of Asian Natural Products Research 11, 771–778 (2009).
Article PubMed CAS Google Scholar
He, D. et al. Total flavonoids of Flos Chrysanthemi protect arterial endothelial cells against oxidative stress. Journal of Ethnopharmacology 139, 68–73 (2012).
Article PubMed CAS Google Scholar
Sun, Q. L. et al. Flavonoids and volatiles in Chrysanthemum morifolium Ramat flower from Tongxiang County in China. African Journal of Biotechnology 9, 3817–3821 (2010).
Article CAS Google Scholar
Winkel-Shirley, B. Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiology 126, 485–493 (2001).
Article PubMed PubMed Central CAS Google Scholar
Falcone-Ferreyra, M. L. et al. Flavonoids: biosynthesis, biological functions, and biotechnological applications. Frontiers in Plant Science 3, 222 (2012).
PubMed PubMed Central CAS Google Scholar
Seitz, C. et al. Cloning, functional identification and sequence analysis of flavonoid 3′-hydroxylase and flavonoid 3′,5′-hydroxylase cDNAs reveals independent evolution of flavonoid 3′,5′hydroxylase in the Asteraceae family. Plant Molecular Biology 61, 365–381 (2006).
Article PubMed CAS Google Scholar
Tanaka, Y. & Brugliera, F. Flower colour and cytochromes P450. Philosophical Transactions of the Royal Society of London Series B-biological Sciences 368, 20120432 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zhao, W. P. et al. Molecular structure and the second introns variation of gene F3’H of two medicinal Chrysanthemum morifolium populations. Biochemical Systematics and Ecology 51, 251–258 (2013).
Article CAS Google Scholar
Lowe, R. et al. Transcriptomics technologies. PLoS Computational Biology 13, e1005457 (2017).
Article PubMed PubMed Central CAS Google Scholar
Liu, H. et al. Whole-Transcriptome Analysis of Differentially Expressed Genes in the Vegetative Buds, Floral Buds and Buds of Chrysanthemum morifolium. PLoS One 10, e0128009 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ren, L. et al. Transcriptomic analysis of differentially expressed genes in the floral transition of the summer flowering chrysanthemum. BMC Genomics 17, 673 (2016).
Article PubMed PubMed Central CAS Google Scholar
Götz, S. et al. B2G-FAR, a species-centered GO annotation repository. Bioinformatics 27, 919–924 (2011).
Article PubMed PubMed Central CAS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genetics 25, 25–29 (2000).
Article PubMed PubMed Central CAS Google Scholar
Mola, J. et al. How genes paint flowers and seeds. Trends in Plant Science 3, 212–217 (1998).
Article Google Scholar
Gandía-Herrero, F. & García-Carmona, F. Biosynthesis of betalains: yellow and violet plant pigments. Trends in Plant Science 18, 334–343 (2013).
Article PubMed CAS Google Scholar
Zheng, Y. et al. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Molecular Plant 9, 1667–1670 (2016).
Article PubMed CAS Google Scholar
Li, S. Transcriptional control of flavonoid biosynthesis: fine-tuning of the MYB-bHLH-WD40 (MBW) complex. Plant Signaling and Behavior 9, e27522 (2014).
Article PubMed CAS PubMed Central Google Scholar
Xu, W. et al. Transcriptional control of flavonoid biosynthesis by MYB-bHLH-WDR complexes. Trends in Plant Science 20, 176–185 (2015).
Article PubMed CAS Google Scholar
Muhlemann, J. K. et al. Developmental changes in the metabolic network of snapdragon flowers. PLoS One 7, e40381 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Gachon, C. et al. Real-time PCR: what relevance to plant studies? Journal of Experimental Botany 55, 1445–1454 (2004).
Article PubMed CAS Google Scholar
Everaert, C. et al. Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data. Scientific Reports 7, 1559 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Wang, Y. et al. Construction and de novo characterization of a transcriptome of Chrysanthemum lavandulifolium: analysis of gene expression patterns in floral bud emergence. Plant Cell Tissue and Organ Culture 116, 297–309 (2014).
Article CAS Google Scholar
Tohge, T. et al. Phytochemical genomics in Arabidopsis thaliana: A case study for functional identification of flavonoid biosynthesis genes. Pure and Applied Chemistry 79, 811–823 (2007).
Article CAS Google Scholar
Pennacchio, L. A. et al. Enhancers: five essential questions. Nature Reviews Genetics 14, 288–295 (2013).
Article PubMed PubMed Central CAS Google Scholar
Shimada, N. et al. A comprehensive analysis of six dihydroflavonol 4-reductases encoded by a gene cluster of the Lotus japonicus genome. Journal of Experimental Botany 56, 2573–2585 (2005).
Article PubMed CAS Google Scholar
Chen, S. M. et al. The identification of flavonoids and the expression of genes of anthocyanin biosynthesis in the chrysanthemum flowers. Biologia Plantarum 56, 458–464 (2012).
Article CAS Google Scholar
Jin, M. et al. Growth and accumulation of bioactive compounds in medicinal Chrysanthemum morifolium Ramat. cv. ‘Chuju’ under different colored shade polyethylene. Journal of Medicinal Plants Research 6, 398–404 (2012).
CAS Google Scholar
Nesi, N. et al. The Arabidopsis TT2 gene encodes an R2R3 MYB domain protein that acts as a key determinant for proanthocyanidin accumulation in developing seed. Plant Cell 13, 2099–2114 (2001).
PubMed PubMed Central CAS Google Scholar
Rinaldo, A. R. et al. A grapevine anthocyanin acyltransferase, transcriptionally regulated by VvMYBA, can produce most acylated anthocyanins present in grape skins. Plant Physiology 169, 1897–1916 (2015).
PubMed PubMed Central CAS Google Scholar
Hu, D. G. et al. Glucose Sensor MdHXK1 Phosphorylates and Stabilizes MdbHLH3 to Promote Anthocyanin Biosynthesis in Apple. PLoS Genetics 12, e1006273 (2016).
Article PubMed PubMed Central CAS Google Scholar
Matsui, K. et al. AtMYBL2, a protein with a single MYB domain, acts as a negative regulator of anthocyanin biosynthesis in Arabidopsis. Plant Journal 55, 954–967 (2008).
Article CAS Google Scholar
Huang, Y. F. et al. A negative MYB regulator of proanthocyanidin accumulation, identified through expression quantitative locus mapping in the grape berry. New Phytologist 201, 795–809 (2014).
Article PubMed CAS Google Scholar
Yoshida, K. et al. The MYB182 protein down-regulates proanthocyanidin and anthocyanin biosynthesis in poplar by repressing both structural and regulatory flavonoid genes. Plant Physiology 167, 693–710 (2015).
Article PubMed PubMed Central CAS Google Scholar
Albert, N. W. et al. A conserved network of transcriptional activators and repressors regulates anthocyanin pigmentation in eudicots. Plant Cell 26, 962–980 (2014).
Article PubMed PubMed Central CAS Google Scholar
Yan, J. et al. The soybean R2R3 MYB transcription factor GmMYB100 negatively regulates plant flavonoid biosynthesis. Plant Molecular Biology 89, 35–48 (2015).
Article PubMed CAS Google Scholar
Wang, J. et al. Transcriptomic and hormone analyses reveal mechanisms underlying petal elongation in Chrysanthemum morifolium Jinba. Plant Molecular Biology 93, 593–606 (2017).
Article PubMed CAS Google Scholar
Li, B. et al. Evaluation of de novo transcriptome assemblies from RNA-Seq data. Genome Biology 15, 553 (2014).
Article PubMed PubMed Central CAS Google Scholar
Li, R. et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25, 1966 (2009).
Article PubMed CAS Google Scholar
Ye, J. et al. WEGO: a web tool for plotting GO annotations. Nucleic Acids Research 34, W293–W297 (2006).
Article PubMed PubMed Central CAS Google Scholar
Mortazavi, A. et al. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods 5, 621–628 (2008).
Article PubMed CAS Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biology 11, R106 (2010).
Article PubMed PubMed Central CAS Google Scholar
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Annals of Statistics 29, 1165–1188 (2001).
Article MathSciNet MATH Google Scholar
Yue, J. Y. et al. PTIR: Predicted Tomato Interactome Resource. Scientific Reports 6, 25047 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25, 402–408 (2001).
Article PubMed CAS Google Scholar
Gu, C. et al. Reference gene selection for quantitative real-time PCR in Chrysanthemum subjected to biotic and abiotic stress. Molecular Biotechnology 49, 192–197 (2011).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We thank Professor Enhua Xia from Anhui Agricultural University for the assistance of bioinformatics analysis. This work was supported by funds from the National Natural Science Foundation of China (31171179, 90717110, 31461143008 and 31471157), the Joint NSFC-ISF Research Program, jointly funded by the National Natural Science Foundation of China and the Israel Science Foundation (1936/14), the Fundamental Research Funds for the Central Universities (JZ2016HGBZ1017), the National Science Fund for Distinguished Young Scholars (30825030), Advanced Program of Doctoral Fund of Ministry of Education of China (20110181130009), a Key Project from the Government of Anhui Province (1401032006).

Author information

Junyang Yue and Chuanxue Zhu contributed equally.

Authors and Affiliations

School of Food Science and Engineering, Hefei University of Technology, Hefei, 230009, China
Junyang Yue, Chuanxue Zhu, Yu Zhou, Xiangli Niu, Min Miao, Xiaofeng Tang & Yongsheng Liu
School of Computer and Information, Hefei University of Technology, Hefei, 230009, China
Junyang Yue
School of Horticulture, Anhui Agricultural University, Hefei, 230036, China
Junyang Yue & Yongsheng Liu
College of Horticulture, Key Laboratory of Landscape Agriculture, Ministry of Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
Fadi Chen
School of Biological Science and Food Engineering, Chuzhou University, Chuzhou, 239000, China
Weiping Zhao

Authors

Junyang Yue
View author publications
You can also search for this author in PubMed Google Scholar
Chuanxue Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Xiangli Niu
View author publications
You can also search for this author in PubMed Google Scholar
Min Miao
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofeng Tang
View author publications
You can also search for this author in PubMed Google Scholar
Fadi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Weiping Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yongsheng Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Y.Y., W.P.Z. and Y.S.L. planned the project. W.P.Z. provided the flower samples. J.Y.Y. finished the bioinformatics analysis. C.X.Z., Y.Z., X.L.N., M.M. and X.F.T. performed the experimental work. J.Y.Y., F.D.C., W.P.Z. and Y.S.L. discussed and produced the first draft manuscript. All authors have approved this manuscript.

Corresponding authors

Correspondence to Weiping Zhao or Yongsheng Liu.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yue, J., Zhu, C., Zhou, Y. et al. Transcriptome analysis of differentially expressed unigenes involved in flavonoid biosynthesis during flower development of Chrysanthemum morifolium ‘Chuju’. Sci Rep 8, 13414 (2018). https://doi.org/10.1038/s41598-018-31831-6

Download citation

Received: 18 April 2018
Accepted: 28 August 2018
Published: 07 September 2018
DOI: https://doi.org/10.1038/s41598-018-31831-6

This article is cited by

Transcriptomic-based analysis to identify candidate genes for blue color rose breeding
- Sheng-Hang Jiang
- Huan-Huan Wang
- Feng Ming
Plant Molecular Biology (2023)
Reference genes selection of Paeonia ostii ‘Fengdan’ under osmotic stresses and hormone treatments by RT-qPCR
- Lili Guo
- Yuying Li
- Xiaogai Hou
Molecular Biology Reports (2023)
Combination of long-read and short-read sequencing provides comprehensive transcriptome and new insight for Chrysanthemum morifolium ray-floret colorization
- Mitsuko Kishi-Kaboshi
- Tsuyoshi Tanaka
- Ryutaro Aida
Scientific Reports (2022)
Transcriptomic and metabolomic analysis reveals the difference between large and small flower taxa of Herba Epimedii during flavonoid accumulation
- Weihan Qin
- Yong Yang
- Xiang Liu
Scientific Reports (2022)
RNA-Seq analysis reveals the role of MYB12, MYB111 and MBW complex repressors in regulation of flavonoid biosynthesis in stigmas of saffron (Crocus sativus L.) flowers
- Jyoti Pathak
- Upasna Chettry
- Arun B. Gurung
Acta Physiologiae Plantarum (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Transcriptome sequencing and de novo assembly

Gene annotation and functional classification

Digital gene expression profiling and in silico analysis

Characterization of functional genes involved in flavonoid biosynthesis

Quantitative real time PCR validation of gene expression

Discussion

Methods

Plant material and sample collection

RNA extraction and high-throughput sequencing

Transcriptome assembly and gene annotation

DEU identification and enrichment analysis

Building a gene regulatory network

Verification by qRT-PCR analysis

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links