Abstract
Angiosperms are one of the most diverse and abundant plant groups that are widely distributed on Earth, from tropical to temperate and polar zones. The wide distribution of angiosperms may be attributed to the evolution of sophisticated mechanisms of environmental adaptability, including cold tolerance. Since the development of high-throughput sequencing, transcriptome has been widely utilized to gain insights into the molecular mechanisms of plants in response to cold stress. However, previous studies generally focused on single or two species, and comparative transcriptome analyses for multispecies responding to cold stress were limited. In this study, we selected 11 representative angiosperm species, performed phylotranscriptome experiments at four time points before and after cold stress, and presented a profile of cold-induced transcriptome changes in angiosperms. Our multispecies cold-responsive RNA-seq datasets provide valuable references for exploring conserved and evolutionary mechanisms of angiosperms in adaptation to cold stress.
Similar content being viewed by others
Background & Summary
Due to their sessile nature, plants have evolved sophisticated signaling pathways in adaptation to complex environmental fluctuations such as extreme temperature, high salinity, and drought1,2. Low temperature is one of the environmental stresses that causes a series of physiological and metabolic changes in plants3,4, and severe low temperature even causes plant death, which seriously threatens food and seed security. However, plants are not completely as passive as they seem. Generally, temperate plants could increase their freezing tolerance by preexposure to low but non-freezing temperature environment for a few days or weeks, the process is known as cold acclimation5. Cold acclimation is a complex process that involves multiple physiological and biochemical changes, such as modification of lipid composition, changes of protein and carbohydrate composition, and accumulation of anti-freezing and anti-oxidative substances5. Accumulating evidence demonstrates that most of these changes are mainly due to the expression of cold-responsive (COR) genes, which are induced by cold acclimation and play important roles in plants resistance to low temperatures6,7,8. Exploring the regulatory mechanisms of COR genes in cold acclimation contributes to our understanding of plants adapting to extremely low temperatures and molecular breeding.
Since the development of high-throughput sequencing, RNA sequencing (RNA-seq) has been widely utilized to gain insights into the molecular mechanisms of plants in response to cold stress. To date, molecular responses of cold stress have been widely investigated and characterized in many plant species, such as Arabidopsis thaliana9, Oryza sativa10, Zea mays11, Ocimum americanum12, Malus sieversii13, Nicotiana tabacum14, Phyllostachys edulis15, Betula platyphylla16, and Populus trichocarpa17,18. However, these prior studies generally focus on single or two species, and comparative transcriptome analyses for multispecies responding to cold stress were less investigated.
As the most diverse and abundant plant groups, angiosperms are widely distributed on Earth from tropical to polar terrestrial zones. The wide distribution of angiosperms may be attributed to the evolution of advanced and sophisticated mechanisms of environmental adaptability19. In this study, we present cold-stress transcriptome analyses of 11 angiosperm species, including six eudicots (Arabidopsis thaliana, Betula pendula, Populus trichocarpa, Carya illinoinensis, Glycine max, and Cucumis sativus) and five monocots (Oryza sativa, Setaria italica, Hordeum vulgare, Zea mays, and Phyllostachys edulis). For each of the species, seedlings were treated at four different time points of cold stress (0, 2, 24, and 168 hours (h)), and three biological replicates were performed for each time point. Finally, we generated a total of 132 RNA-seq datasets from the 11 angiosperms under cold treatments. Our multispecies cold-responsive RNA-seq datasets provide valuable references for exploring the conservation and evolution of cold-responsive molecular mechanisms of angiosperms.
Methods
Plant materials and growth conditions
Seedlings of 11 selected representative angiosperm species, including six eudicots (A. thaliana, B. pendula, P. trichocarpa, C. illinoinensis, G. max, and C. sativus) and five monocots (O. sativa, S. italica, H. vulgare, Z. mays, and P. edulis) (Fig. 1a), were cultured in an artificial climate chamber with 25 °C at a photoperiod of 16/8 h light/dark cycle. For A. thaliana, three-week-old seedlings were prepared for utilization in cold treatment. For other species, young seedlings growing up to ~30 cm in height were prepared.
Cold stress treatment
Under cold stress, gene expression is reprogrammed to form a hierarchical regulatory network, which is constituted of rapid, early, and late cold-responsive genes. To obtain these genes, we performed cold stress treatments of the 11 selected representative angiosperm species under different time points (0, 2, 24, and 168 h). For each species, seedlings with relatively uniform growth and physiological state were selected and divided into four groups (Group 1 to Group 4). To ensure that the four seedling groups of cold treatments (0, 2, 24, and 168 h) could be harvested in the same development stage at the same time on a day, the seedling group of cold treatment for 168 h was first cultured in the artificial climate chamber with 4 °C a week (168 h) before harvest, and then were seedling groups of cold treatment, respectively, for 24, 2, and 0 h at proper times. After cold stress treatments, we collected the fourth expanded leaves of the treated seedlings, which are generally considered as mature healthy leaves at similar developmental stages. For each cold treatment, three biological replicates were performed (Fig. 1b).
RNA extraction, library construction, and sequencing
The total RNA of the collected leaves from each species was isolated and purified using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) following the manufacturer’s procedure. The RNA amount and purity of each sample were assessed by NanoDrop ND-1000 (NanoDrop Technologies, Wilmington, DE, USA) and Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, California, USA). Poly (A) RNA was purified from total RNA using poly-T oligo-attached magnetic beads to generate strand-specific cDNA libraries containing inserts of approximately 150–200 bp in size. In total, 132 cDNA libraries from 11 species under four time points before and after cold stress (0, 2, 24, and 168 h) were constructed for transcriptome analysis. The libraries were sequenced on Illumina NovaSeq 6000 sequencing system (2 × 150 bp paired-end reads) at LC-Bio Technology CO., Ltd. (Hangzhou, China) according to the manufacturer’s instructions (Fig. 1c).
Data Records
The above 132 RNA-seq datasets have been deposited into the NCBI BioProject with the accession number PRJNA76719620. The read-count data matrix of cold-treated samples in each of the 11 species is available at figshare data repository (https://doi.org/10.6084/m9.figshare.22643245.v1)21. DEGs between different time points of cold treatments and normal conditions for each species time points are also available at figshare (https://doi.org/10.6084/m9.figshare.22643074.v1)22.
Technical Validation
Data filtering and quality control
The raw data in fastq format were processed by Trimmomatic v0.3923 to remove the Illumina adapter contamination and low-quality bases. After filtering, quality control of the clean reads in each RNA-seq dataset was assessed using FastQC24 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Q20 and Q30 average values of the 132 libraries were 99.84% and 98.29%, respectively (Dataset 1). The average GC content of the dataset was 47.84% (Dataset 1). Overall results of filtering and quality of clean reads indicated that the sequencing progressed adequately composing a series of high-quality RNA-seq datasets.
Reference genomes assessment and reads mapping
Genome sequences of 11 angiosperms were downloaded from public databases. In detail, genome sequences of A. thaliana were downloaded from TAIR1025, B. pendula and C. sativus were downloaded from NCBI26, P. trichocarpa, G. max, O. sativa and S. italic were downloaded from Phytozome v13.127, H. vulgare and Z. mays were downloaded from Ensembl Plants28, C. illinoinensis and P. edulis were downloaded from GigaDB29. We first used BUSCO v.3.0.230 to detect the quality of the reference genomes. The complete BUSCO values of the 11 reference genomes ranged from 86.3% to 99.3%, among which 10 genomes were over 90% (Fig. 2a), indicating that the reference genomes were appropriate and of high quality. To detect the mapping ratio of the transcriptomes, clean reads from each sample were mapped to their corresponding reference genome by HISAT2 v2.1.031. The average mapping ratio of the RNA-seq samples in each species ranged from 94.23% to 98.84% (Dataset 2 and Fig. 2b).
Principal component analysis
StringTie v2.0.332 was further utilized to estimate the gene expression level (trimmed mean of M value, TMM) based on all mapped reads to the reference genome, and the expressed genes in each sample were identified with the average TMM > 0.5 across three replicates of the sample (Dataset 3). Read-count data can be available at Figshare (https://doi.org/10.6084/m9.figshare.22643245.v1)21. Using the script of PtR in Trinity33, we performed principal component analysis (PCA) based on the mapped count read table of the RNA-seq samples for each species. PCA results showed a clear clustering according to the same time point of cold treatments (Fig. 3), suggesting a high consistency of the biological replicates.
Identification of differentially expressed genes
Differentially expressed genes (DEGs) between two time points before and after cold treatments (0 h versus 2, 24, or 168 h) were obtained using edgeR34, DESeq 235, and Ballgown36. In brief, analyzed from at least two of the three methods, those genes with a mean TMM ≥ 1 across the compared samples that had an adjusted P-value or false discovery rate (FDR) <0.05 and an absolute value of fold change ≥ 2 were considered to be DEGs. For each species, the number of DEGs under different time points of cold treatments (2, 24, and 168 h) compared with the control (0 h) was calculated (Fig. 4a,b, Dataset 4). The meta-data of DEGs are available under Figshare DEG tables of 11 angiosperm species under different time points of cold treatments22. Venn diagrams of the DEGs obtained from different time points of cold treatments were analyzed (Fig. 4c). Additionally, we used R programming to plot heatmaps of the expression of DEGs in each species (Fig. 4d, Dataset 5).
Code availability
Software and their versions used for RNA-seq analysis were described in Methods. No custom code was used to generate or process the data described in the manuscript.
References
Zhu, J. K. Abiotic Stress Signaling and Responses in Plants. Cell 167, 313–324 (2016).
Nie, Y. et al. Innovations and Stepwise Evolution of CBFs/DREB1s and Their Regulatory Networks in Angiosperms. J. Integr. Plant Biol. 64, 2111–2125 (2022).
Chinnusamy, V., Zhu, J. & Zhu, J. K. Cold stress regulation of gene expression in plants. Trends Plant Sci. 12, 444–451 (2007).
Ding, Y., Shi, Y. & Yang, S. Advances and challenges in uncovering cold tolerance regulatory mechanisms in plants. New Phytol. 222, 1690–1704 (2019).
Thomashow, M. F. PLANT COLD ACCLIMATION: Freezing Tolerance Genes and Regulatory Mechanisms. Annu. Rev. Plant Physiol. Plant Mol. Biol. 50, 571–599 (1999).
Cook, D., Fowler, S., Fiehn, O. & Thomashow, M. F. A prominent role for the CBF cold response pathway in configuring the low-temperature metabolome of Arabidopsis. Proc. Natl. Acad Sci. USA 101, 15243–15248 (2004).
Hannah, M. A., Heyer, A. G. & Hincha, D. K. A global survey of gene regulation during cold acclimation in Arabidopsis thaliana. PLoS Genet. 1, e26 (2005).
Maruyama, K. et al. Metabolic pathways involved in cold acclimation identified by integrated analysis of metabolites and transcripts regulated by DREB1A and DREB2A. Plant Physiol. 150, 1972–1980 (2009).
Shi, Y., Ding, Y. & Yang, S. Molecular Regulation of CBF Signaling in Cold Acclimation. Trends Plant Sci. 23, 623–637 (2018).
Yang, S. Cold responses in rice: From physiology to molecular biology. J. Plant Physiol. 269, 153602 (2022).
Guo, Q., Li, X., Niu, L., Jameson, P. E. & Zhou, W. Transcription-associated metabolomic adjustments in maize occur during combined drought and cold stress. Plant Physiol. 186, 677–695 (2021).
Zhan, X., Yang, L., Wang, D., Zhu, J. K. & Lang, Z. De novo assembly and analysis of the transcriptome of Ocimum americanum var. pilosum under cold stress. BMC Genomics 17, 209 (2016).
Zhou, P. et al. Transcriptome profiling of Malus sieversii under freezing stress after being cold-acclimated. BMC Genomics 22, 681 (2021).
Jin, J. et al. Integrated transcriptomics and metabolomics analysis to characterize cold stress responses in Nicotiana tabacum. BMC Genomics 18, 496 (2017).
Liu, Y. et al. Transcriptome profiling reveals the crucial biological pathways involved in cold response in Moso bamboo (Phyllostachys edulis). Tree Physiol. 40, 538–556 (2020).
Chen, S. et al. Genome sequence and evolution of Betula platyphylla. Hortic. Res. 8, 37 (2021).
Wang, S. et al. Innovation and Emerging Roles of Populus trichocarpa TEOSINTE BRANCHED1/CYCLOIDEA/PROLIFERATING CELL FACTOR Transcription Factors in Abiotic Stresses by Whole-Genome Duplication. Front. Plant Sci. 13, 850064 (2022).
Zhao, X. et al. The SR Splicing Factors: Providing Perspectives on Their Evolution, Expression, Alternative Splicing, and Function in Populus trichocarpa. Int. J. Mol. Sci. 22, 11369 (2021).
Guo, L. et al. Convergent evolution of AP2/ERF III and IX subfamilies through recurrent polyploidization and tandem duplication during eudicot adaptation to paleoenvironmental changes. Plant Commun. 3, 100420 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP339213 (2022).
Wang, S., Zhang, Y., Ye, X. & Wu, W. Read-count data of 11 angiosperm species under different time points of cold treatments. figshare https://doi.org/10.6084/m9.figshare.22643245.v1 (2023).
Wang, S., Zhang, Y., Ye, X. & Wu, W. DEG tables of 11 angiosperm species under different time points of cold treatments. figshare https://doi.org/10.6084/m9.figshare.22643074.v1 (2023).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Andrews, S. FastQC: A Quality Control tool for High Throughput Sequence Data. (2010).
Huala, E. et al. The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res. 29, 102–105 (2001).
Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, D501–504 (2005).
Goodstein, D. M. et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40, D1178–1186 (2012).
Bolser, D. M., Staines, D. M., Perry, E. & Kersey, P. J. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomic Data. Methods Mol. Biol. 1533, 1–31 (2017).
Sneddon, T. P., Li, P. & Edmunds, S. C. GigaDB: announcing the GigaScience database. GigaScience 1, 11 (2012).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq 2. Genome Biol. 15, 550 (2014).
Frazee, A. C. et al. Ballgown bridges the gap between transcriptome assembly and expression analysis. Nat. Biotechnol. 33, 243–246 (2015).
Acknowledgements
We thank Jianqin Huang, Yuxiang Cheng, and Jing Jiang for providing young seedlings of Carya illinoinensis, Populus trichocarpa and Betula pendula, respectively. We thank Guang Yang, Chaofeng Yang, Yanbing Yang, Yinping Xu, and Yiqiang Han for providing seeds for Zea mays, Phyllostachys edulis, Setaria italic, Hordeum vulgare, and Glycine max respectively. This work is supported by the National Natural Science Foundation of China (grant number 31871233).
Author information
Authors and Affiliations
Contributions
W.W. designed the research and secured funding for the project. S.W., Y.Z., Y.S., H.L., X.Z. and Y.D. prepared plant samples and performed cold stress treatments. X.Y. and L.G. analyzed the RNA-seq data. L.C. submitted the RNA-seq data to NCBI. S.W., X.Y. and W.W. wrote and revised the manuscript. All authors have read and approved the final manuscript for submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, S., Zhang, Y., Ye, X. et al. A phylotranscriptomic dataset of angiosperm species under cold stress. Sci Data 10, 399 (2023). https://doi.org/10.1038/s41597-023-02307-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02307-8