Abstract
In plants due to their sessile nature, secondary metabolites are important components against different abiotic and biotic stress, such as extended darkness. For this reason, the variation of secondary metabolite content of the Arabidopsis thaliana HapMap natural population following 0-and 6-d darkness treatment were detected and the raw data of different accessions at two timepoints were deposited in the Zenodo database. Moreover, the annotated secondary metabolites of these samples are presented in this data descriptor, which we believe will be a usefully re-usable resource for future integrative analysis with dark-treated transcripts, proteins or other phenotypic data in order to comprehensively illustrate the multiomic landscape of Arabidopsis in response to the stresses exerted by extended darkness.
Similar content being viewed by others
Background & Summary
Being significantly different to primary metabolites which are the simple structure components that harbor vital roles in development, plant secondary metabolites are derived from the primary metabolites and non-essential for the normal metabolism of the organism but exhibited the survival function different abiotic and biotic stresses1,2,3,4. For example, although sunlight is the energy resource of plant, ultraviolet (UV) light whose wavelength ranges from 280 to 320 nm (UV-B) potentially causes the damage of nucleic acid and proteins and promotes transposon activity which may induce mutations5. Our former study has demonstrated that the content of phenylacylated-flavonols (saiginols) is high-correlated with UV light intensity and the production of saiginols is an important mechanism of Arabidopsis against the UV light damage1,6. Unlike the damage of UV radiation on nucleic acid and proteins, the extended darkness stress causes the cessation of photosynthesis and nutrient starvation of plant. The genomic landscape of primary metabolite and lipids have been illustrated but the genetic regulation of secondary metabolites under extended darkness stress is still unknown7. In general, secondary metabolites have been divided into different classification based on the characteristic chemical structures, such as phenolic acids, flavonoids, terpenoids and steroids, and alkaloids8. Recently, with the explosive development of mass spectrometry methods, liquid chromatography-mass spectrometry (LC-MS) has widely been used as the high-throughput method to analyze the complex plant secondary metabolism7,9,10,11.
Here, the raw data of secondary metabolites of Arabidopsis thaliana HapMap natural population after 0d and 6d darkness treatment are deposited to zenodo (https://zenodo.org/)12,13,14,15,16,17,18,19,20,21,22,23,24. Among the samples, 95 metabolites were well-annotated based on standards, or MS/MS data from former studies1,25,26,27 and the detailed information of the metabolites content of different accession in different timepoints can been found in figshare28,29. A principal component analysis (PCA) of the different time points and accessions based on the secondary metabolites content exhibited the significantly separation between 0d and the 6d samples, indicating the remarkably effect of darkness on the secondary metabolome (Fig. 1)30. Genome-wide association study (GWAS) analysis based on the data in our present study identified the strong associations between the well-known gene (Bglu6, AT1G60270) with Quercetin 3-O-glucoside 7-O-rhamnoside, which indicated the high quality of the data normalization pipeline and the accuracy of the analysis concerning the different metabolite intensities between time points (Fig. 2)31. Moreover, some novel associations such as with formimino-L-aspartic acid which can just be identified in 6d after dark-treated dataset (Fig. 3)32 indicated that extend darkness can significantly change the genetic regulation of metabolites and further supported that these data could be used not only in future to further mine the novel associations with secondary metabolites but also in the mutiomics analysis alongside dark-treated transcriptome, proteome and phenome data in order to figure out the global regulation of Arabidopsis against the extended darkness stress.
Methods
Plant material and sample preparation
The seeds of 259 Arabidopsis accession of Arabidopsis thaliana HapMap natural population were grown on soil under a short-day (SD) photoperiod in a greenhouse for 5 weeks. At 35-days after germination, one plant of each accession was harvested (0 days) and quickly frozen in liquid nitrogen and after 6d,another plant of each accession was harvested within 30 min at 10 am (GMT + 2). All samples were stored at –80 °C prior to further analysis. The positioning of the plants was randomized to avoid block effects during growth and one Col-0 plant was grown in each tray to analyze the spatial variation between different trays. Moreover, the experiment was performed twice in autumn 2018 and spring 2019 as two independent biological replicates. Two pattern quality controls were included in the experiment to remove extraction and batch effects for the final processed data. First, one Ex-QC sample from the same pool material before extended darkness treatment was added per 40 analyzed samples during metabolite extraction and followed the same analytical pathway. Second, an identical quality control was added to every 14 samples during metabolite profiling, and each batch of 60 samples (including four quality controls) was then standardized to the four quality controls7.
LC-MS analysis
The extraction of metabolites was carried out following the method described in a previous study33. One mL pre-cooled extraction solvent (methyl tert-butyl ether/methanol 3:1 vol/vol) was added to 50 mg grinded Arabidopsis leaf. The tube containing sample and extraction buffer was vortexed for 1 min and then shaken on an orbital shaker (100 rpm) for 45 min at 4 °C and sonicated for 15 min. An additional 500 µL of phase separation buffer (water/methanol 3:1 vol/vol) was added to tube and the samples were thoroughly vortexed again for 1 min. After that, the samples are centrifuged at a speed of 20,000 g for 5 min at 4 °C. The 360 µL polar phase underwent drying using a SpeedVac concentrator and was then suspended in 150 µL of 50% methanol (methanol/water 50:50 vol/vol). Then, 3 μL of supernatant was injected and analyzed by the Waters Acquity UPLC system coupled to an Exactive Orbitrap mass detector according to the previously published protocol33. 0.1% formic acid in water (Solvent A) and 0.1% formic acid in acetonitrile (Solvent B) were used as mobile phases. The mobile phase flow rate was 400 µL/min. The spectra were captured in negative ion detection full scan mode, covering a mass range of 100 to 1500 m/z. The maximum scan time was set to 250 ms and the resolution was set to 25,000. From 0 min to min 19 of the UPLC gradient, MS spectra were recorded. RefinerMS software (version 5.3; GeneData), MetAlign34, and Xcalibur software files (Thermo Fisher Scientific) were used to extract molecular masses, retention times, and corresponding peak intensities from the raw. Metabolite identification and annotation were performed using in house standard library, MS2 fragmentation battens, literature base, and metabolomics databases1,9,25,27. Metabolites are reported following the recently updated reporting standards for metabolomics9.
Data processing and metabolite data analysis
The data was analyzed with the Xcalibur 2.1.0 program, and peak identification and annotation were carried out by integrative method: standard chemical confirmation, MS fragmentation and retention time profiling1,25,27,35. Xcalibur Quan Browser was used to choose the best peaks under the parameter as Window (sec), 30; highest peak; minimum peak height (S/N), 3.0; Baseline window, 80–150; area noise factor, 2; peak noise factor, 10; peak height (%), 5.0, tailing factor, 1.51,25,27,35.
GWAS analysis
The normalized metabolite intensity values were used to map the phenotypic observations to loci in the A. thaliana genome based on 199455 SNP markers (minor allele frequency > 1%) of Affymetrix GeneChip Array 6.036,37. The SNP fraction parameter was set to 1, and other parameters were set to default values as recommended by the GAPIT user manual. The genome-wide suggestive and significant threshold of SNP was set to p-value = 3.16 × 10−5 and 5.01 × 10−6, respectively as described previously38.
Data Records
A total of 1244 raw data of the different conditions and Arabidopsis accessions and quality control samples of two different years have been deposited to zenodo (https://zenodo.org/)12,13,14,15,16,17,18,19,20,21,22,23,24. The detailed information of the annotated metabolites and metabolites content of different accessions in different timepoints were deposited in the figshare (https://doi.org/10.6084/m9.figshare.24407896.v3 and https://doi.org/10.6084/m9.figshare.24407812.v3)28,29.
Technical Validation
Two pattern quality controls have been included in the analysis to ensure the accuracy of the analysis about the different metabolite intensity between different conditions. The metabolites result of two different years’ samples have been used to calculate the best linear unbiased predictions (BLUPs) using the R package lme439.
Usage Notes
Figure 1 shows the principle component analysis of metabolite levels of different Arabidopsis accessions for two time points30. Figure 2 shows the Manhattan plot of the association of Bglu6 (AT1G60270) and Quercetin 3-O-glucoside 7-O-rhamnoside under 0d and 6d after darkness treatment31. Figure 3 shows the Manhattan plot of the association of formimino-L-Aspartic acid under 0d and 6d after darkness treatment32.
Code availability
Genome Association and Prediction Integrated Tool (GAPIT) R package: https://www.maizegenetics.net/gapit40. The principle component analysis were performed using MetaboAnalyst 6.041.
References
Tohge, T. et al. Characterization of a recently evolved flavonol-phenylacyltransferase gene provides signatures of natural light selection in Brassicaceae. Nat. Commun. 7, 12399 (2016).
Fàbregas, N. & Fernie, A. R. The metabolic response to drought. J. Exp. Bot. 70, 1077–1085 (2019).
Erb, M. & Kliebenstein, D. J. Plant secondary metabolites as defenses, regulators, and primary metabolites: the blurred functional trichotomy. Plant Physiol. 184, 39–52 (2020).
Akula, R. & Ravishankar, G. A. Influence of abiotic stress signals on secondary metabolites in plants. Plant Signal. Behav. 6, 1720–1731 (2011).
Ries, G. et al. Elevated UV-B radiation reduces genome stability in plants. Nature 406, 98–101 (2000).
Nisa, M.-U., Huang, Y., Benhamed, M. & Raynaud, C. The plant DNA damage response: signaling pathways leading to growth inhibition and putative role in response to stress conditions. Front. Plant Sci. 10, 653 (2019).
Zhu, F. et al. Genome-wide association of the metabolic shifts underpinning dark-induced senescence in Arabidopsis. Plant Cell 34, 557–578 (2021).
Tohge, T., Alseekh, S. & Fernie, A. R. On the regulation and function of secondary metabolism during fruit development and ripening. J. Exp. Bot. 65, 4599–4611 (2014).
Alseekh, S. et al. Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices. Nat. Methods 18, 747–756 (2021).
Tohge, T. et al. Exploiting natural variation in tomato to define pathway structure and metabolic regulation of fruit polyphenolics in the lycopersicum complex. Mol. Plant 13, 1027–1046 (2020).
Zhu, G. et al. Rewiring of the fruit metabolome in tomato breeding. Cell 172, 249–261 (2018).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-1. Zenodo https://doi.org/10.5281/zenodo.12341588 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-2. Zenodo https://doi.org/10.5281/zenodo.12200393 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-3. Zenodo https://doi.org/10.5281/zenodo.12251706 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-4. Zenodo https://doi.org/10.5281/zenodo.12251770 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-5. Zenodo https://doi.org/10.5281/zenodo.12251824 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-6. Zenodo https://doi.org/10.5281/zenodo.12251892 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-7. Zenodo https://doi.org/10.5281/zenodo.12251936 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-8. Zenodo https://doi.org/10.5281/zenodo.12290707 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-9. Zenodo https://doi.org/10.5281/zenodo.12290785 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-10. Zenodo https://doi.org/10.5281/zenodo.12291751 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-11. Zenodo https://doi.org/10.5281/zenodo.12291819 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-12. Zenodo https://doi.org/10.5281/zenodo.12291915 (2024).
Zhu, F. The natural variance of Arabidopsis secondary metabolism on extended darkness-13. Zenodo https://doi.org/10.5281/zenodo.12291977 (2024).
Alseekh, S. et al. Canalization of tomato fruit metabolism. Plant Cell 29, 2753–2765 (2017).
Alseekh, S. et al. Identification and mode of inheritance of quantitative trait loci for secondary metabolite abundance in tomato. Plant Cell 27, 485–512 (2015).
Wu, S. et al. Mapping the Arabidopsis Metabolic Landscape by Untargeted Metabolomics at Different Environmental Conditions. Mol. Plant 11, 118–134 (2018).
Zhu, F. Identified Arabidopsis secondary metabolites in extended darkness treatment. Figshare https://doi.org/10.6084/m9.figshare.24407812.v3 (2023).
Zhu, F. BLUP value of Arabidopsis secondary metabolites in different accession at 0d and 6d after darkness. Figshare https://doi.org/10.6084/m9.figshare.24407896.v3 (2023).
Zhu, F. The principle component analysis of metabolite levels of different Arabidopsis accessions for two time points. Figshare https://doi.org/10.6084/m9.figshare.24407926.v2 (2023).
Zhu, F. The Manhattan plot of the association of Bglu6 (AT1G60270) and Quercetin 3-O-glucoside 7-O-rhamnoside under 0d and 6d after darkness treatment. Figshare https://doi.org/10.6084/m9.figshare.24407944.v2 (2023).
Zhu, F. The Manhattan plot of the association of formimino-L-Aspartic acid under 0d and 6d after darkness treatment. Figshare https://doi.org/10.6084/m9.figshare.24407947.v2 (2023).
Salem, M. A. et al. An improved extraction method enables the comprehensive analysis of lipids, proteins, metabolites and phytohormones from a single sample of leaf tissue under water-deficit stress. Plant J. 103, 1614–1632 (2020).
Lommen, A. MetAlign: Interface-driven, versatile metabolomics tool for hyphenated full-scan mass spectrometry data preprocessing. Anal. Chem. 81, 3079–3086 (2009).
Tohge, T., Scossa, F. & Fernie, A. R. Integrative approaches to enhance understanding of plant metabolic pathway structure and regulation. Plant Physiol. 169, 1499–1511 (2015).
Horton, M. W. et al. Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel. Nat. Genet. 44, 212–216 (2012).
Li, Y., Huang, Y., Bergelson, J., Nordborg, M. & Borevitz, J. O. Association mapping of local climate-sensitive quantitative trait loci in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 107, 21199–21204 (2010).
Wu, S. et al. Combined use of genome-wide association data and correlation networks unravels key regulators of primary metabolism in Arabidopsis thaliana. PLoS Genet. 12, e1006363 (2016).
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting. linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Lipka, A. E. et al. GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399 (2012).
Pang, Z. et al. MetaboAnalyst 6.0: towards a unified platform for metabolomics data processing, analysis and interpretation. Nucleic Acids Res. 8, gkae253 (2024).
Acknowledgements
We acknowledge Anne Michaelis and Gudrun Wolter for excellent technical assistance. We thank Dr Youjun Zhang, Dr Leonardo Perez de Souza, and Mustafa Bulut for critical and helpful discussions. We acknowledge Regina Wendenburg, David Barbosa Medeiros, Fayezeh Arabi, Antje Bolze, and Ina Krahnert for help in harvesting plant material.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
F.Z., S.A. and A.R.F. conceived this project and designed all of the experiments., F.Z. and M.W.A. performed the experiments. S.A., W.W., Y.C. and A.R.F. supervised the experiments. F.Z. and A.R.F. wrote the article.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhu, F., Ahchige, M.W., Wen, W. et al. The natural variance of Arabidopsis secondary metabolism on extended darkness. Sci Data 11, 841 (2024). https://doi.org/10.1038/s41597-024-03694-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-03694-2