HeliantHOME, a public and centralized database of phenotypic sunflower data

Bercovich, Natalia; Genze, Nikita; Todesco, Marco; Owens, Gregory L.; Légaré, Jean-Sébastien; Huang, Kaichi; Rieseberg, Loren H.; Grimm, Dominik G.

doi:10.1038/s41597-022-01842-0


Data Descriptor
Open access
Published: 30 November 2022

Scientific Data volume 9, Article number: 735 (2022)

This article has been updated

Abstract

Genomic studies often attempt to link natural genetic variation with important phenotypic variation. To succeed, robust and reliable phenotypic data, as well as curated genomic assemblies, are required. Wild sunflowers, originally from North America, are adapted to diverse and often extreme environments and have historically been a widely used model plant system for the study of population genomics, adaptation, and speciation. Moreover, cultivated sunflower, domesticated from a wild relative (Helianthus annuus), is a global oil crop, ranking fourth in production of vegetable oils worldwide. Public availability of data resources, both for the plant research community and for the associated agricultural sector, is extremely valuable. We have created HeliantHOME (http://www.helianthome.org), a curated, public, and interactive database of phenotypes, including developmental, structural and environmental ones, obtained from a large collection of both wild and cultivated sunflower individuals. Additionally, the database is enriched with external genomic data and results of genome-wide association studies. Finally, being a community open-source platform, HeliantHOME is expected to expand as new knowledge and resources become available.

Measurement(s)	plant trait
Technology Type(s)	ImageJ
Factor Type(s)	Country • Sitename • Latitude • Longitude • Elevation
Sample Characteristic - Organism	Helianthus annuus var. macrocarpus • Helianthus annuus • Helianthus argophyllus • Helianthus niveus subsp. canescens (taxid: 74145) • Helianthus petiolaris subsp. fallax (taxid: 74150) • Helianthus petiolaris subsp. petiolaris (taxid: 74151)
Sample Characteristic - Location	United States of America • Canada

Background & Summary

The sunflower genus comprises more than 50 wild species and 19 subspecies^1,2, all of which are native to North America. Wild sunflowers colonize all kinds of extreme environments, from sand dunes to coastal salt marshes, displaying remarkable inter- and intra-specific plasticity and ability for adaptation. Moreover, the common sunflower, Helianthus annuus var. macrocarpus, is one of the seven major oilseed crops produced around the world³. The study of natural variation in wild relatives of crops like sunflower offers opportunities to increase our understanding of their evolutionary history and adaptation to different environments, as well as to enhance breeding programs. In the past decade, large efforts from numerous research groups around the globe have completed the sequencing and assembly of the common sunflower genome and produced other genomic and germplasm resources^4,5,6,7,8. The current availability of several high-quality reference genomes for sunflower has made it possible to start asking new types of questions about sunflower molecular ecology and its evolutionary history^6,7,8. It also makes it possible to investigate genetic associations between sequence and gene function more broadly and more accurately (e.g. Duriez et al.; Todesco et al.)^9,10.

We have recently carried out a comprehensive study of the genetic and phenotypic diversity as well as of their association with environmental variables for four annual wild sunflower species: H. annuus, H. petiolaris, H. niveus and H. argophyllus⁶. These species were selected based on their ability to grow in a broad variety of – often extreme – environments and their potential use for sunflower breeding. Populations for these four species are found all across the US (see Fig. 1), coexisting at times, and showing adaptation to diverse environments often rising to the ecotypic level (subpopulations within the same species).

In a recent work, we carried out a common garden experiment at a single location in Vancouver, Canada, and collected phenotypic data for 1,510 individuals belonging to 151 populations of the wild sunflower species mentioned above. We additionally re-sequenced the genomes of most of those individuals, resulting in sets of >4 M high-quality single nucleotide polymorphisms (SNPs) for each of the four species. Given the broad relevance of these datasets for both the sunflower research and breeding communities, we decided to generate HeliantHOME, an interactive database that contains all these data in a way that can be easily reused, revisited and even enriched by the research community. One of the features we took into consideration was the universality and reproducibility of data. Therefore, whenever possible, we aimed to follow the ‘minimum information about plant phenotyping experiments’ standards checklist^11,12 (MIAPPE: https://www.miappe.org/), to ensure we could offer the most adequate data, including unique identifiers, detailed descriptions of experimental procedures, measurements, units, and others. We also provide data in the most commonly used formats, such as PLINK and CSV files for GWAS analysis, and, more importantly, a REST API that conveniently gives access to additional metadata.

Both raw and extracted phenotypic data have been incorporated in the database; this includes a large collection of high-resolution images of different plant organs, architectural and developmental trait measurements as well as complementary environmental variables (climate and soil data) for the population of origin of the individuals included in the datasets. The corresponding re-sequencing data for most of the individuals is also linked and available for public download.

In addition, we have included in HeliantHOME phenotypic data for the UGA-SAM1 (SAM) population, a collection of 288 cultivated sunflower lines that were selected because they capture nearly 90% of the allelic diversity present within cultivated sunflower¹³. Most individuals represent modern oil or confectionery cultivars; however, a handful of open-pollinated varieties, landraces and prebred lines are also included. This collection is publicly available and can be retrieved from both the Agricultural Research Service of the US Department of Agriculture (USDA-ARS)¹⁴ and the French National Institute for Agricultural Research & Environment (INRAE)¹⁵. These individuals are propagated and maintained as isogenic lines for research purposes and have become a standard tool for the study of phenotypic variation and adaptation in sunflower^16,17,18,19,20. A list of the SAM population lines is available in Supplementary Table 1.

The two datasets provided in HeliantHOME (wild sunflowers and SAM population) offer complementary information on sunflower diversity and its potential use for crop improvement. Many genes that confer adaptation to extreme environments or disease in cultivated sunflower have been introgressed from wild sunflower populations^13,21,22,23 and often, close wild relatives are the source of novel and advantageous alleles sought by the agricultural sector². Thus, the wild sunflowers dataset contained in HeliantHOME represents an unparalleled tool. However, the wild sunflower individuals that we have genotyped could not be maintained (wild sunflowers are obligate outcrossers) and consequently, no further phenotypic information can be added to the dataset.

The SAM population instead, is composed of inbred lines, which can be grown repeatedly in different conditions. The SAM phenotypic dataset is therefore in continuous expansion and constitutes a powerful complementary tool for studies looking at the genetic basis of phenotypic diversity and domestication in sunflowers.

HeliantHOME includes, among others, a rich dataset of high-quality images for individual plants and plant organs, arguably making it one of the finest and most extensive collections of population scale phenotypic data of its kind existing so far. In addition to its obvious utility for the sunflower research community, this collection could as well be suitable as a labeled high-quality dataset, for the development of novel machine learning methods for automatic phenotype extraction or computer vision in general.

Previous experience has shown the usefulness to the scientific community of a curated database like the one we are presenting. AraPheno²⁴, a similar public database for phenotypic data in Arabidopsis thaliana as well as AraGWAS^25,26, a manually curated and standardized GWAS (Genome Wide Association Studies) catalog, have been recently developed and both have been broadly used and expanded since. Once data is centralized and publicly shared, new discoveries can emerge and new analyses become possible.

In summary, HeliantHOME will be a fundamental tool for researchers coming from different fields and not necessarily working with sunflower. It offers a broad dataset for both basic and applied plant sciences, from an evolutionary, ecological and comparative genomics perspective to a computational and even machine learning standpoint.

Methods

Wild sunflower data

In the summer of 2016, ten mother plants were randomly selected from each of 151 populations of wild sunflowers collected in previous years (2011 and 2015) from across the US and southern Canada, for four sunflower species: H. annuus, H. petiolaris, H. niveus and H. argophyllus. Seeds from each of these plants were germinated and eventually transplanted into three separate fields at the Totem Plant Science Field Station of the University of British Columbia, Vancouver campus, Vancouver, Canada. Within each field, pairs of plants from the same population were sown using a completely randomized design. Phenotypic measurements were taken daily throughout plant development, and leaves, stem sections, inflorescences and seeds were collected and digitally recorded for further analyses. We measured up to 87 different traits per individual, which can be divided into four main categories:

1) Plant development and architecture: including traits related to plant growth, such as days to flowering, number of primary branches and final height. These traits were measured manually in the field, using precision tools.

2) Inflorescence traits: including traits such as flowerhead diameter and number of ligules (petals of the outermost whorl of flowers in the inflorescence circumference). In this case, high-resolution images were recorded and later analyzed using the Fiji software²⁷ (see Fig. 2A,B).

3) Leaf and branch traits: including traits such as leaf area, C/N content and trichome density. Most leaf traits required scanned high-resolution images and subsequent digital analysis (see Fig. 2C,G), using the Tomato Analyzer software or Fiji^27,28. For C/N content, we collected and ground tissue, and submitted samples to EA-IRMS (elemental analyzer isotope ratio mass spectrometry) analysis at the Stable Isotope Facility, Faculty of Forestry, UBC, Vancouver, Canada.

4) Seed traits: including traits such as seed height and perimeter. The seeds were obtained from restricted crosses (only crosses between individuals from the same population) carried out using pollination bags. Seeds were harvested when dry and scanned at high resolution. Images were captured for further digital analysis²⁸ (see Fig. 2H,I).
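As a rough illustration of the kind of morphometric measurement that can be extracted from a scanned image, the following Python sketch computes the area and perimeter of an organ outline and converts pixel units to centimetres. It is not the Fiji or Tomato Analyzer procedure used in the study; the outline coordinates, the 600 dpi scan resolution and all function names are assumptions made for the example.

```python
import math

def polygon_area_px(outline):
    """Shoelace formula: area enclosed by a closed outline, in square pixels."""
    total = 0.0
    n = len(outline)
    for i in range(n):
        x1, y1 = outline[i]
        x2, y2 = outline[(i + 1) % n]  # wrap around to close the outline
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0

def polygon_perimeter_px(outline):
    """Sum of the edge lengths of the closed outline, in pixels."""
    n = len(outline)
    return sum(math.dist(outline[i], outline[(i + 1) % n]) for i in range(n))

def px_to_cm(length_px, dpi):
    """Convert a length in pixels to centimetres (1 inch = 2.54 cm)."""
    return length_px * 2.54 / dpi

def px2_to_cm2(area_px, dpi):
    """Convert an area in square pixels to square centimetres."""
    return area_px * (2.54 / dpi) ** 2

# Hypothetical outline: a 600 x 1200 pixel rectangle from a scan assumed to be 600 dpi.
outline = [(0, 0), (600, 0), (600, 1200), (0, 1200)]
DPI = 600

area_cm2 = px2_to_cm2(polygon_area_px(outline), DPI)         # 2 square inches
perimeter_cm = px_to_cm(polygon_perimeter_px(outline), DPI)  # 6 inches
```

The scan resolution matters as much as the outline itself: any length scales with 1/dpi and any area with 1/dpi², so the resolution has to be recorded alongside each image.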

Detailed information about the methods used to collect phenotypic developmental data and to obtain morphometrics from the associated image analyses is provided in Supplementary Table 2. The table includes phenotype name, methodological tools and a brief description of the procedures. More information can also be found in the original publication⁶.

In summary, we provide a) developmental data obtained from this experiment by direct observation of live plants; b) data associated with architecture and development of the plants (by indirect measurement performed on recorded images) as well as c) the high-resolution color images collected for all the studied individuals and their harvested organs.

Cultivated sunflower data

The UGA-SAM1 (SAM) population phenotypic data available in the sunflower community is a collection in continuous expansion that has been growing for the past 10 years, with data obtained both in greenhouse and field settings. We have for now added data for one study carried out in a greenhouse setting to illustrate the potential benefits of including these data in HeliantHOME and are working on the inclusion of a larger dataset^16,17. The evaluated phenotypes for the cultivated individuals currently available are described in Table 1.

Table 1 SAM population phenotypes.

HeliantHOME

To facilitate exploration, search, filtering and download of phenotypic as well as meta-data we developed the public web-application HeliantHOME (http://www.helianthome.org). The primary purpose of HeliantHOME is to simplify data access and to provide detailed information about the different sunflower species, populations, phenotypes and associated images. The database can be easily queried via a public web-interface as well as programmatically crawled via the implemented public Representational State Transfer (REST) interface. Detailed FAQs, tutorials and guided tours are implemented to guide novice users and to help them navigate through the different views. Figure 3 shows a screenshot of the landing page of HeliantHOME.

Various views are provided to summarize information about the different species, populations and phenotypes. All integrated data is associated with additional meta-information. For example, the different sunflower populations are linked with location information (latitude and longitude), as well as detailed information about climate and soil variables. The phenotype view summarizes detailed information about the phenotype scoring, as well as additional meta-information and interactive visualizations to analyze the distribution of phenotypic values (see screenshot in Fig. 4). In addition, phenotypes are linked to genome-wide association study (GWAS) results, available at easyGWAS²⁹. The database also hosts high-resolution imaging data for all wild sunflower individuals (http://www.helianthome.org/images/). All displayed information and data can be easily downloaded by the user in various data formats, such as CSV, PLINK, JSON or as imaging files. An integrated REST API allows fast, scalable and customizable programmatic access to the data. Further, additional external resources to genetic data are linked in the Download Center of HeliantHOME (http://www.helianthome.org/download/), to allow the simple download of associated material.
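To make the relationship between the CSV and PLINK download formats concrete, the sketch below converts a small phenotype table from CSV into the whitespace-delimited layout PLINK expects for phenotype files (family ID, individual ID, phenotype value, with -9 as the conventional missing-value code). HeliantHOME already offers PLINK downloads, so this is only an illustration of the format, not the database's export code. The column names `individual_id` and `plant_height_cm`, the sample identifiers and the choice to reuse the individual ID as family ID are assumptions.

```python
import csv
import io

def csv_to_plink_pheno(csv_text, id_column, value_column, missing="-9"):
    """Convert phenotype CSV text into PLINK phenotype-file text (FID IID PHENO)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = ["FID\tIID\t" + value_column]
    for row in reader:
        sample = row[id_column].strip()
        # Empty cells become the missing-value code.
        value = row[value_column].strip() or missing
        # Assumption: the family ID is set equal to the individual ID.
        lines.append(f"{sample}\t{sample}\t{value}")
    return "\n".join(lines) + "\n"

# Hypothetical CSV content; real column names may differ.
example_csv = (
    "individual_id,plant_height_cm\n"
    "IND_001,152.5\n"
    "IND_002,\n"
    "IND_003,98.0\n"
)

pheno_text = csv_to_plink_pheno(example_csv, "individual_id", "plant_height_cm")
```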

Implementation

HeliantHOME is a public and manually curated database implemented using the open-source Python Django (v.3.2.4) web-application framework (https://www.djangoproject.com). The database backend is based on SQLite (https://www.sqlite.org/index.html). Several third-party Python libraries, such as NumPy³⁰, SciPy³¹ and pandas³², have been included to allow efficient filtering of the database and to provide common data handling and statistical analysis methods.
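The article does not describe the database schema, so the following is only a minimal sketch of how populations, individuals and phenotype values could be related in an SQLite backend and filtered by phenotype. It uses Python's built-in `sqlite3` module directly rather than Django models, and every table name, column name and value is hypothetical.

```python
import sqlite3

# In-memory database with a hypothetical three-table layout.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE population (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    species TEXT NOT NULL,
    latitude REAL,
    longitude REAL
);
CREATE TABLE individual (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    population_id INTEGER NOT NULL REFERENCES population(id)
);
CREATE TABLE phenotype_value (
    individual_id INTEGER NOT NULL REFERENCES individual(id),
    phenotype TEXT NOT NULL,
    value REAL
);
""")

# Made-up records, for illustration only.
conn.executemany("INSERT INTO population VALUES (?, ?, ?, ?, ?)", [
    (1, "POP_A", "H. annuus", 40.0, -100.0),
    (2, "POP_B", "H. petiolaris", 35.0, -105.0),
])
conn.executemany("INSERT INTO individual VALUES (?, ?, ?)", [
    (1, "POP_A_1", 1), (2, "POP_A_2", 1), (3, "POP_B_1", 2),
])
conn.executemany("INSERT INTO phenotype_value VALUES (?, ?, ?)", [
    (1, "days_to_flowering", 60.0),
    (2, "days_to_flowering", 70.0),
    (3, "days_to_flowering", 55.0),
])

def mean_by_species(connection, phenotype):
    """Average one phenotype per species by joining values to their populations."""
    rows = connection.execute("""
        SELECT p.species, AVG(v.value)
        FROM phenotype_value AS v
        JOIN individual AS i ON i.id = v.individual_id
        JOIN population AS p ON p.id = i.population_id
        WHERE v.phenotype = ?
        GROUP BY p.species
        ORDER BY p.species
    """, (phenotype,))
    return dict(rows.fetchall())

species_means = mean_by_species(conn, "days_to_flowering")
```

Storing one row per measurement, rather than one column per trait, is one way to accommodate individuals that were scored for different subsets of traits.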

In addition, the Django REST framework (https://www.django-rest-framework.org) has been used to implement various REST endpoints to simplify programmatic access to, and download of, all the data stored in HeliantHOME. Detailed documentation of all REST endpoints can be found at http://www.helianthome.org/rest/api.
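A client for such a REST interface might be sketched as follows, using only the Python standard library. The endpoint paths and the JSON layout are not given in this article, so the example deliberately takes the full URL as an argument and parses a made-up response body; the keys `individual` and `value` are hypothetical, and the REST documentation linked above should be consulted for the real ones.

```python
import json
import urllib.request

def fetch_json(url, timeout=30):
    """Download a URL and decode its JSON body."""
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))

def values_by_individual(records, id_key, value_key):
    """Turn a list of JSON records into {individual: value}, skipping missing values."""
    return {
        record[id_key]: record[value_key]
        for record in records
        if record.get(value_key) is not None
    }

# Hypothetical response body, standing in for the result of
# fetch_json(<an endpoint URL taken from the REST documentation>).
sample_body = (
    '[{"individual": "IND_001", "value": 12.5},'
    ' {"individual": "IND_002", "value": null}]'
)

parsed = values_by_individual(json.loads(sample_body), "individual", "value")
```

Keeping the download step separate from the parsing step means the parsing logic can be checked offline against a saved response.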

The web-application frontend is based on HTML5 and Bootstrap (https://getbootstrap.com), a modern and responsive CSS framework. The interactive parts are implemented using jQuery (https://jquery.com). Dynamic visualizations and plots are integrated and are based on the JavaScript libraries Chart.js (https://www.chartjs.org) and jVectorMap (https://jvectormap.com).

To guide novice users through the web interface and to introduce the different views a fully guided and interactive tour is provided using intro.js (https://github.com/usablica/intro.js).

The code for HeliantHOME is open-source and hosted on GitHub. Further, the repository includes a detailed list of all used packages and allows users to report issues or to submit feedback: https://github.com/grimmlab/HeliantHome.

Data Records

In addition to HeliantHOME, all phenotypic data and images are also deposited at the digital library of the Technical University of Munich (https://doi.org/10.14459/2022mp1649709)³³. The data consist of a large collection of high-resolution images and extracted phenotypic measurements for 1,510 wild sunflower individuals and 288 cultivated sunflowers. All the data provided here have been previously published and/or are currently public⁶. The records cover 343 different phenotypes and 88,050 measurements; see Table 2 for additional featured data.

Table 2 HeliantHOME statistics.

HeliantHOME is a modern responsive web interface that allows easy access, filtering and download of phenotypic and complementary genotypic sunflower data for a large collection of both wild and cultivated sunflowers.

Among the different features, HeliantHOME holds 9,751 high resolution images corresponding to the wild sunflower individuals and their different organs, as illustrated in the screenshot shown in Fig. 2J.

The data stored at the digital library of the Technical University of Munich can be downloaded as a full dataset at once, whereas HeliantHOME allows the specific download of individual data entries via the included REST API, as well as the download of custom imaging data for certain individuals.

Technical Validation

Wild sunflower data

The common garden experiment involved fully randomized (in pairs) planting of all the individual plants to avoid location effects. In addition, assorted wild sunflower plants were planted all around the edges of the field to minimize border effects. Standard procedures were used to determine developmental traits. Restricted pollination was used to produce seeds while avoiding cross-pollination from different populations. Unlike cultivated sunflower, wild sunflower is self-incompatible.
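The paired, randomized planting described above can be sketched in a few lines of Python. The article does not specify how pairs were distributed across the three fields, so the round-robin assignment below is an assumption, as are the population labels, the plant naming scheme and the fixed random seed.

```python
import random

def randomized_pair_layout(populations, plants_per_population, n_fields, seed=None):
    """Form same-population pairs, shuffle all pairs, and deal them out to fields."""
    rng = random.Random(seed)
    pairs = []
    for population in populations:
        plants = [f"{population}_{k + 1}" for k in range(plants_per_population)]
        rng.shuffle(plants)
        # Consecutive plants in the shuffled list form one pair.
        pairs.extend(tuple(plants[k:k + 2]) for k in range(0, len(plants), 2))
    # Randomize pair order across populations before assigning positions.
    rng.shuffle(pairs)
    fields = [[] for _ in range(n_fields)]
    for index, pair in enumerate(pairs):
        # Assumption: pairs are dealt to fields in round-robin fashion.
        fields[index % n_fields].append(pair)
    return fields

# Hypothetical example: 3 populations, 10 plants each, 3 fields.
layout = randomized_pair_layout(
    ["POP_A", "POP_B", "POP_C"], plants_per_population=10, n_fields=3, seed=1
)
```

Each inner list gives the planting order of pairs within one field. Because the pairs are shuffled before assignment, a population's position in the field is independent of its identity, which is what guards against location effects.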

Usage Notes

Phenotypic and imaging data can be downloaded directly from the digital library of the Technical University of Munich (https://doi.org/10.14459/2022mp1649709)³³, from HeliantHOME using the web interface, or via custom Python scripts using the publicly available REST interface. Various statistical methods can be used to analyse the phenotypic data. A primary example is genome-wide association studies (GWAS). For this purpose, univariate association tests (associations between single point mutations and a certain phenotype) that account for population structure, such as FaST-LMM (Factored Spectrally Transformed Linear Mixed Models)³⁴ or permutation-based GWAS³⁵, can be used. The data might also be used for the comparison or development of new phenotype prediction methods. The imaging data can be analyzed using custom Python scripts and might also serve the computer science community to develop novel machine learning and computer vision methods for automatic phenotyping^36,37,38.
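To illustrate what a univariate association test does, the sketch below runs a simple permutation test for a single SNP: it measures the correlation between genotype (coded as 0, 1 or 2 copies of an allele) and phenotype, then asks how often a random reshuffling of the phenotypes gives a correlation at least as strong. This is a deliberately simplified stand-in. Unlike FaST-LMM³⁴ or the permutation-based GWAS method cited above³⁵, it does not account for population structure. The genotype and phenotype values are invented.

```python
import random
import statistics

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5

def permutation_p_value(genotypes, phenotypes, n_permutations=999, seed=0):
    """Two-sided permutation p-value for the genotype-phenotype correlation."""
    rng = random.Random(seed)
    observed = abs(pearson_r(genotypes, phenotypes))
    shuffled = list(phenotypes)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # break any real genotype-phenotype link
        if abs(pearson_r(genotypes, shuffled)) >= observed:
            hits += 1
    # Add one to numerator and denominator so the p-value is never exactly zero.
    return (hits + 1) / (n_permutations + 1)

# Invented data: phenotype increases with the number of allele copies.
genotypes = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]
phenotypes = [10.1, 9.8, 10.4, 9.9, 12.0, 12.3, 11.8, 12.1, 14.2, 13.9, 14.1, 14.0]

p_value = permutation_p_value(genotypes, phenotypes)
```

In a real GWAS this test would be repeated for every SNP, with correction for multiple testing and for relatedness among individuals.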

Code availability

All code for the web server backend and frontend is publicly available for download on GitHub: https://github.com/grimmlab/HeliantHome. The web-application can be accessed via http://www.helianthome.org.

Change history

23 January 2023
Missing Open Access funding information has been added in the Funding Note

References

1. Heiser, C. B. & Smith, D. M. The North American sunflowers (Helianthus). (Durham, N.C., Published for the Club by the Seeman Printery, 1969).
2. Schilling, E. E. Helianthus. Flora of North America north of Mexico 21, 141–169 (2006).
3. Foreign Agricultural Service. Oilseeds: World Markets and Trade. (2022).
4. Badouin, H. et al. The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution. Nature 546, 148–152 (2017).
5. Kane, N. C. et al. Sunflower genetic, genomic and ecological resources. Mol Ecol Resour 13, 10–20 (2013).
6. Todesco, M. et al. Massive haplotypes underlie ecotypic differentiation in sunflowers. Nature 584, 602–607 (2020).
7. INRA Sunflower Bioinformatics Resources. https://www.heliagene.org/.
8. Sunflower Genome Database. https://www.sunflowergenome.org/.
9. Duriez, P. et al. A receptor-like kinase enhances sunflower resistance to Orobanche cumana. Nat Plants 5, 1211–1215 (2019).
10. Todesco, M. et al. Genetic basis and dual adaptive role of floral pigmentation in sunflowers. Elife 11 (2022).
11. Krajewski, P. et al. Towards recommendations for metadata and data handling in plant phenotyping. J Exp Bot 66, 5417–5427 (2015).
12. Papoutsoglou, E. A. et al. Enabling reusability of plant phenomic datasets with MIAPPE 1.1. New Phytologist 227, 260–273 (2020).
13. Mandel, J. R., Dechaine, J. M., Marek, L. F. & Burke, J. M. Genetic diversity and population structure in cultivated sunflower and a comparison to its wild progenitor, Helianthus annuus L. Theoretical and Applied Genetics 123, 693–704 (2011).
14. USDA Agricultural Research Service. National Plant Germplasm System. https://data.nal.usda.gov/dataset/national-plant-germplasm-system (2017).
15. A Biological Resource Center for Sunflower. https://www.inrae.fr/actualites/centre-ressources-biologiques-tournesol (2018).
16. Mandel, J. R. et al. Association Mapping and the Genomic Consequences of Selection in Sunflower. PLoS Genet 9, e1003378 (2013).
17. Gao, L. et al. Genetic and phenotypic analyses indicate that resistance to flooding stress is uncoupled from performance in cultivated sunflower. New Phytologist 223, 1657–1670 (2019).
18. Hübner, S. et al. Sunflower pan-genome analysis shows that hybridization altered gene content and disease resistance. Nat Plants 5, 54–62 (2019).
19. Terzić, S., Zorić, M. & Seiler, G. J. Qualitative traits in sunflower breeding: UGA‐SAM1 phenotyping case study. Crop Sci 60, 303–319 (2020).
20. Nambeesan, S. U. et al. Association mapping in sunflower (Helianthus annuus L.) reveals independent control of apical vs. basal branching. BMC Plant Biol 15, 84 (2015).
21. Baute, G. J., Kane, N. C., Grassa, C. J., Lai, Z. & Rieseberg, L. H. Genome scans reveal candidate domestication and improvement genes in cultivated sunflower, as well as post‐domestication introgression with wild relatives. New Phytologist 206, 830–838 (2015).
22. Kantar, M. B. et al. Ecogeography and utility to plant breeding of the crop wild relatives of sunflower (Helianthus annuus L.). Front Plant Sci 6 (2015).
23. Seiler, G. & Marek, F. Germplasm resources for increasing the genetic diversity of global cultivated sunflower. Helia 34, 1–20 (2011).
24. Seren, Ü. et al. AraPheno: a public database for Arabidopsis thaliana phenotypes. Nucleic Acids Res 45, D1054–D1059 (2017).
25. Togninalli, M. et al. The AraGWAS Catalog: a curated and standardized Arabidopsis thaliana GWAS catalog. Nucleic Acids Res 46, D1150–D1156 (2018).
26. Togninalli, M. et al. AraPheno and the AraGWAS Catalog 2020: a major database update including RNA-Seq and knockout mutation data for Arabidopsis thaliana. Nucleic Acids Res https://doi.org/10.1093/nar/gkz925 (2019).
27. Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat Methods 9, 676–682 (2012).
28. Rodríguez, G. R. et al. Tomato Analyzer: A Useful Software Application to Collect Accurate and Detailed Morphological and Colorimetric Data from Two-dimensional Objects. Journal of Visualized Experiments https://doi.org/10.3791/1856 (2010).
29. Grimm, D. G. et al. easyGWAS: A Cloud-Based Platform for Comparing the Results of Genome-Wide Association Studies. Plant Cell 29, 5–19 (2017).
30. Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
31. Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods 17, 261–272 (2020).
32. McKinney, W. Data Structures for Statistical Computing in Python. in Proceedings of the 9th Python in Science Conference 56–61, https://doi.org/10.25080/Majora-92bf1922-00a (2010).
33. Bercovich, N. et al. HeliantHOME: a public and centralized database of phenotypic sunflower data. Technical University of Munich, mediaTUM https://doi.org/10.14459/2022mp1649709 (2022).
34. Lippert, C. et al. FaST linear mixed models for genome-wide association studies. Nat Methods 8, 833–835 (2011).
35. John, M. et al. Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions. Bioinformatics 38, ii5–ii12 (2022).
36. John, M. et al. A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species. Frontiers in Plant Science 2904 (2022).
37. Hüther, P., Schandry, N., Jandrasits, K., Bezrukov, I. & Becker, C. ARADEEPOPSIS, an automated workflow for top-view plant phenomics using semantic segmentation of leaf states. The Plant Cell 32, 3674–3688 (2020).
38. Genze, N., Bharti, R., Grieb, M., Schultheiss, S. J. & Grimm, D. G. Accurate machine learning-based germination detection, prediction and quality assessment of three grain crops. Plant Methods 16, 1–11 (2020).

Acknowledgements

We thank D. Skonieczny, A. Kim, A. Parra, N. Garrett and C. Konecny for assistance with fieldwork and data acquisition, and J. Mandel, J. M. Burke and L. Gao for sharing raw phenotypic data for the SAM population. Further, we thank the Leibniz Supercomputing Centre of the Bavarian Academy of Sciences and Humanities and the Library of the Technical University of Munich for hosting our web-application and data.

Funding

Open Access funding enabled and organized by Projekt DEAL. Funding for the common garden experiment of wild sunflower was provided by Genome Canada and Genome BC (LSARP2014-223SUN) and the NSF Plant Genome Program (IOS-1444522).

Author information

Authors and Affiliations

Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
Natalia Bercovich, Marco Todesco, Gregory L. Owens, Jean-Sébastien Légaré, Kaichi Huang & Loren H. Rieseberg
Biodiversity Research Centre, University of British Columbia, Vancouver, Canada
Natalia Bercovich, Marco Todesco, Gregory L. Owens, Jean-Sébastien Légaré, Kaichi Huang & Loren H. Rieseberg
Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing, Germany
Nikita Genze & Dominik G. Grimm
Weihenstephan-Triesdorf University of Applied Sciences, Straubing, Germany
Nikita Genze & Dominik G. Grimm
Department of Biology, University of Victoria, Victoria, BC, Canada
Gregory L. Owens
Department of Computer Science, University of British Columbia, Vancouver, British Columbia, Canada
Jean-Sébastien Légaré
Data Science Institute, University of British Columbia, Vancouver, British Columbia, Canada
Jean-Sébastien Légaré
Technical University of Munich, Department of Informatics, Garching, Germany
Dominik G. Grimm

Contributions

N.B. and D.G.G. conceived the study. N.B. and M.T. coordinated, collected and analyzed phenotypic data and produced the genetic data. G.L.O., J.S.L. and K.H. analyzed genotypic data and carried out the SNP calling. D.G.G. implemented the webservice with the help of N.G. N.B. and D.G.G. wrote the manuscript. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Natalia Bercovich or Dominik G. Grimm.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table 1

Supplementary Table 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Bercovich, N., Genze, N., Todesco, M. et al. HeliantHOME, a public and centralized database of phenotypic sunflower data. Sci Data 9, 735 (2022). https://doi.org/10.1038/s41597-022-01842-0

Received: 01 April 2022
Accepted: 11 November 2022
Published: 30 November 2022
DOI: https://doi.org/10.1038/s41597-022-01842-0