Abstract
The scarce knowledge on phenotypic characterization restricts the usage of genetic diversity of plant genetic resources in research and breeding. We describe original and ready-to-use processed data for approximately 60% of ~22,000 barley accessions hosted at the Federal ex situ Genebank for Agricultural and Horticultural Plant Species. The dataset gathers records for three traits with agronomic relevance: flowering time, plant height and thousand grain weight. This information was collected for seven decades for winter and spring barley during the seed regeneration routine. The curated data represent a source for research on genetics and genomics of adaptive and yield related traits in cereals due to the importance of barley as model organism. This data could be used to predict the performance of non-phenotyped individuals in other collections through genomic prediction. Moreover, the dataset empowers the utilization of phenotypic diversity of genetic resources for crop improvement.
Design Type(s) | data integration objective • metadata search and retrieval objective |
Measurement Type(s) | Phenotypic_Measurement |
Technology Type(s) | digital curation |
Factor Type(s) | temporal_instant • geographic location • season |
Sample Characteristic(s) | Hordeum vulgare • Hordeum vulgare f. agriocrithon • Hordeum sp. • Hordeum vulgare subsp. spontaneum • cropland biome |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Similar content being viewed by others
Background & Summary
Cereals are staple food and a valuable source of nutrients around the world1. Among them, barley (Hordeum vulgare sp.) is the fourth most produced crop2. The main end-uses of barley are brewing, feed, and food production3. In terms of crop adaptation barley can be classified into two distinct gene pools: winter and spring type4–6. While winter type barley needs vernalization for flowering stimulation, spring type barley does not require it3. Barley has a diploid genome and its 7 chromosomes represent the base genome of all Triticeae species. For this and many more reasons barley has become a model organism in cereal genetics and genomics7. In addition, the availability of a high quality reference sequence of the barley genome, well established protocols for genome editing and elaborated approaches for genomic selection will greatly benefit barley breeding in the future7–11.
Establishing germplasm collections has involved assemblage and preservation of the existing allelic diversity and their utilization12,13. In the case of barley, more than seven decades of major efforts have resulted in about half a million ex situ accessions worldwide13–15. Germplasm collections are an outstanding resource of genetic diversity for research and plant improvement. For instance, genebank collections represent a rich source of unexplored trait variation which is absent in public and private breeding programs. This variation could potentially boost selection gain in plant breeding to increase both yield potential and sustainability and to facilitate adaptation to global change16,17. However, leveraging genetic resources of public germplasm collections is still a challenge due to the lack of phenotypic information and the high investments required for the systematic characterization of plant material9,18,19. Recently, a method for the exploitation of germplasm based on genomics was proposed19. In this context, genebanks are encouraged to maximize the reuse of both phenotypic and genotypic data by the implementation of the FAIR principles referring to: Findability, Accessibility, Interoperability, and Reusability20. For example, historical phenotypic records for traits with agronomical relevance have been accumulated during the seed regeneration process at genebanks but are not publicly available or the access to them is limited16,19–23.
This study presents original and ready-to-use processed phenotypic data with the aim of leveraging the use of historical information collected during seed regeneration. The data correspond to historical records on traits flowering time (FT), plant height (PH), and thousand grain weight (TGW) accumulated for seven decades plus the outlier status of all data points and the Best Linear Unbiased Estimations (BLUEs) for winter and spring barley accessions pertaining to these traits. This historical information belongs to the barley collection of the Federal ex situ Genebank for Agricultural and Horticultural Plant Species hosted at the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) in Gatersleben (Germany). Conserving and managing a total of ~22,000 accessions, the IPK Genebank manages the sixth largest collection worldwide, which covers a broad range of phenotypic variation14,15,24,25. This data publication complements a previous research publication25 which focuses on the valorization of genetic resources by developing, validating and employing a curated data set from seed regeneration. Moreover, part of these BLUEs was recently used to show the potetial of genome wide association for FT in genebank materials of spring barley26.
Methods
Plant material
The barley collection at the IPK amounts to ~22,000 accessions. These accessions were assembled by means of worldwide collecting expeditions, seed exchange with other institutes, and donations. Accession-related information is being documented in the genebank information system of the IPK (GBIS)22. This study includes FT, PH, and TGW data recorded during seed regeneration for approximatelly 60% of the barley accessions.
Seed regeneration produced an unbalanced historical data source
Seed regeneration is aimed to supply seed requirements for (i) safeguarding the stored genetic diversity when sample size and seed viability drop beneath a pre-stablished treshold, (ii) conserving new genotypes within the genebank, (iii) research, and (iv) fulfilling external demands of germplasm27. The seed regeneration routine in the genebank generated non-orthogonal phenotypic data23,28,29 across traits and years, e.g., only 12 accessions were evaluated for TGW in 1984 while a record number of 4,789 accessions were characterized in 1970 for PH. Additionally, there were 1% of cases when accessions were multiplied more than once in a year. One of the reasons for this was, for instance, the need to check whether the plant material required vernalization or not. Moreover, the introduction of cold storage in the year 1976 abruptly decreased the periodicity of data generation during seed regeneration, because storage time switched from ~3 to >20 years27 . Furthermore, the use of the collection, or parts of it, in research projects had a positive impact in the amount of data collected per year. For example, the protein screening of cereal genetic resources carried in 1970 brought the largest number of regenerated accessions in a single year (Fig. 1). The data of the present study is based on seed regenerations during the 1946–2015 period. Seed regeneration for barley was conducted in Gatersleben since 1946 in different seasons according to the growth habit of accessions. In more detail, winter accessions were planted between September and December while spring accessions were sown from February until April.
Traits assessed on seed germplasm regeneration
Each accession was multiplied using plots of at least 3 m2 and traits FT, PH, and TGW were assessed during seed regeneration. FT stands for the number of days when 50% of the plants reached flowering. For winter barley, FT is expressed in days after the 1st of January of each year. For spring barley, FT was expressed in days after the sowing date. PH was assessed in cm from the soil surface to the top of spike including awns. TGW was determined after seed harvest and expressed in g on a ~12.5% grain moisture basis. Seeds were harvested at maturity stage and were temporary stored at room temperature. Before the 2005/2006 season the standard protocol for TGW assessment at the genebank was based on the average weight of three samples, each containing 100 grains, which was then extrapolated to 1000 grains. From the 2005/2006 season onwards TGW has been determined by using an automatic Marvin digital seed analyzer and considering a seed sample with up to 100 grains. The data management at the genebank was manual until 2011. In this sense, the information was first recorded in field books, then transferred to card files and lately digitized for data storage and computational analysis. From 2011 onwards Personal Digital Assistants (PDAs) were used.
Methods for data processing
Statistical model
No formal field experimental design was used during seed regeneration while the dataset contains only 1% of cases when accessions were evaluated more than once in a year. For this reason, an unreplicated completely randomized experimental design was assumed for each regeneration cycle during data processing. According to the assumed design, the experimental unit corresponded to a plot. Phenotypic data of each barley type were analyzed separately based on the following mixed model:
where μ is the population mean and “Genotypes” were the genetic effects of accessions, which were assumed as fixed factors, while years and error were treated as random. Variances of errors were modelled as specific for each year. In a first step, Equation (1) was used for outlier detection. Later, the BLUEs of accessions were computed by re-fitting the model in Equation (1) but using and enhanced historical dataset in which data points detected as outliers during the first step were discarded.
Code availability
Mixed model equations were solved using the Restricted Maximum Likelihood (REML) algorithm as implemented in ASReml-R30. All described statistical approaches were performed in R environment (Version 2.15.3)31. Scripts used for outlier detection and estimating BLUEs are included together with the dataset in the public repository described below (Data Citation 1). The use of the code requires the download of the datasets, save them in a working directory and set the working directory in the scripts. The scripts run for a single trait according to one growth habit. For instance, the example scripts run for flowering time (FT) for spring barley. In this case, the resulting files are labeled as “Data.corrected.FT.txt” or “BLUEs.FT.txt” for outlier detection and estimating BLUEs, respectively. In this regard, this study involves 12 outputs that were compiled in four files which are described below.
Data records
The data compiled for this study is publicily available in the Plant Genomics and Phenomics Research Data Repository (PGP) (http://edal-pgp.ipk-gatersleben.de/)32 and can be accessed here as (Data Citation 1). The dataset is formated using the ISA-Tab format33 to guarantee a uniform and easy-readable semantical description. It contains the original data as well as the processed data. While the investigation file describes the general project information, the two study files (“s_Spring_Barley.txt” and “s_Winter_Barley.txt”) provide information about the investigated accessions. They contain information such as: (i) accession identifiers, e.g., the accession ID as an unique and stable database generated code at the genebank and accession number wich is typically used for researchers but is not stable over the time, (ii) sowing_date corresponding to day.month.year, (iii) harvest_year, (iv) country as geographic place of collection reported by donors or collectors, and (v) the comment column which shows two groups of accessions whose countries are mentioned in the manuscript as Germany and Soviet Union. In this regard, the group Germany includes accessions from Germany and [Former] East Germany. The group Soviet Union stands for accessions from [Former] Union of Soviet Socialist Republics, Armenia, Azerbaijan, Belarus, Georgia, Estonia, Kyrgystan, Latvia, Lithuania, Moldova, Russia, Tajikistan, Turkmenistan, Ukraine and Uzbekistan. Furthermore, some modifications were done with respect to the original data, e.g. the harvest year 1946 contained only 2 records for PH in winter type barley, which caused serious convergency problems during the fitting of mixed models. For this reason, these two datapoints were removed from the PH records of winter barley.
The assay files of the present study were separated in the historical phenotypic data (“a_Historical.Data_Spring.txt” and “a_Historical.Data_Winter.txt”), which was provided from the IPK genebank information system and was first screened for outliers. Then, outliers were excluded to produce the enhanced assay files (“a_Enhanced_Historical.Data_Spring.txt” and “a_Enhanced_Historical.Data_Winter.txt”). These files accomodated records for up to 2,967 and 9,898 winter and spring accessions, respectively (Table 1). Each accession was phenotyped from 1 to 22 years (Fig. 2) and in each year a range from 12 to 4,789 accessions, across traits, were evaluated (Fig. 1). The heritability for all traits was high and it increased further by up to 17% when applying an outlier correction25 (Table 2). The Pearson’s correlation coefficient (r) estimated on the enhanced data for pairs of years with at least 50 overlapping accessions ranged from 0.60 to 0.72 (Table 3). The precision in computing the BLUEs amounted to 0.89 for TGW and 0.85 for both FT and PH, respectively25. Moreover, the maximum coefficient of variation of the year on the enhanced data set was 0.22 (Table 4). Ninety percent of these genetic resources were collected or originated from 30 geographic places. Ethiopia with 32.1% of accessions was a predominant origin for spring barley followed by 7.2% from Germany. Interestingly, although 12.4% of winter barley accessions were collected or originated from the Soviet Union, there was not a clear predominant place of collection for this type of barley which was reflected by a more uniform frequency distribution of accessions according to collection places (Table 5). Furthermore, the dataset contains an additional folder with the BLUEs of accessions included in the files “BLUEs_Spring.txt” and “BLUEs_Winter.txt” (Fig. 3), that were estimated based on the enhanced historical data files. The corresponding study files are labeled as “s_Spring_Barley.txt” and “s_Winter_Barley.txt”.
Technical Validation
Validation involves outlier detection, bias assesment for first and second degree statistics and validation of BLUEs of accessions. Methods, results and discussion of this strategy were described in a previous research publication25. However, here we make a brief description of validation methods.
Enhancing the quality of the historical data set by implementing an outlier detection approach
Outliers may jeopardize the quality of the data negatively affecting statistical estimates34,35. The presence of outliers in the historical dataset (Data Citation 1) is plausible because the data was assembled for seven decades under fluctuating conditions of data and seed regeneration management, as well as contrasting weather conditions across years, among others. Both, the assessment and management of outliers in unbalanced historical datasets are challenging. We used an outlier inspection approach by combining re-scaled median absolute deviation of standardized residuals with a Bonferroni-Holm test to flag data points as outliers35. A data-point was declared as outlier by the implemented test according to a predefined significance threshold of p-value < 0.05. We removed the outliers from the historical data set to obtain an enhanced historical dataset (Data Citation 1). Considering genotypes and years as random effects, Equation (1) was re-fitted in order to check the impact of outlier exclusion on variance components and heritability. Heritability was computed as follows: , where denotes the estimator of the genetic variance, corresponds to the average variance estimated for the errors, and stands for the average number of years when genotypes have been tested. Assuming random genotype and fixed year effects on Equation (1), the coefficient of variation of the year was computed as:, where corresponds to the year-specific error variance and YE refers to the year effect.
Studying the potential bias in estimating first- and second-degree statistics for different missing data scenarios
On average, seed regeneration activities before 1976 were carried out every 3 years for each accession. This was mainly because seed storage was formerly performed at room temperature27. However, this condition led to evaluate blocks of accessions corresponding to the year when they entered the genebank, which is often reflecting specific collection hotspots. Therefore, the missing value structure of the phenotypic data collected is potentially deviated from the random scenario. Since estimating first and second degree statistics is potentially biased by the missing data structure, a resampling study was performed considering three missing data scenarios. Firstly, a balanced dataset was derived from the enhanced historical dataset of spring barley. This balanced set included phenotypic records for FT and PH available for the years 1948, 1951, 1954, 1957, 1961, and 1970 for 400 spring accessions. These accessions were collected in 10 geographic places: Turkey (99), Greece (91), Germany (56), United States of America (49), Bulgaria (36), Sweden (18), Japan (14), Albania (13), Austria (12), and countries of the former Soviet Union (12). Later, the balanced dataset was sampled based on three missing data scenarios as follows: in Scenario 1, phenotypic records were randomly sampled from three out of six test years for each accession, which amounted to 1,200 phenotypic data points in total. In Scenario 2, the 400 accessions were randomly grouped into 10 clusters and the phenotypic data for each group was randomly subsampled from 3 years gathering 1,200 phenotypic data points in total. In Scenario 3 the 10 places of collection were considered as groups of accessions and phenotypic data from 3 years was randomly subsampled for each group resulting in 1,200 phenotypic data points. Each scenario was sampled 100 times.
Biases in estimating variances of genotypes and errors were calculated as , where stands for the estimated parameters in each sampling run and d corresponds to the parameter estimated from the balanced dataset. Moreover, we performed a linear regression of the BLUEs computed for each of 100 resampling runs on the BLUEs from the balanced data set. In this respect, the intercept, the slope, and the coefficient of determination of the linear regression model were considered to measure bias.
Resampling procedure for assessing the precision in computing BLUEs of accessions
Precise estimates of trait performance are pivotal for decisions makers on research and breeding. Thus, we performed a resampling procedure36,37 to assess the precision in estimating BLUEs. The enhanced data set of spring barley was randomly split into two equally sized subsets. Only accessions for which phenotypic data was available in both subsets were considered in each of the 100 resampling runs. Therefore, across 100 runs 3,691, 3,474, and 3,066 accessions were included on average for FT, PH and TGW, respectively. We fitted the model specified in Equation (1) to estimate the BLUEs of accessions in both subsets. Subsequently, precision of estimation was computed as the correlation of BLUEs of accessions between subsets.
Usage Notes
Maximizing the use of genetic resources will benefit current and future efforts to breed new cultivars that are required to address needs in food security, climate resilience, and sustainability16,38,39. However, restricted resources limit the systematic phenotyping of germplasm collections9,18,19. The strategy described here is based on data that was routinely collected by curators during seed multiplication cycles and is embedded in the scripts used for outlier detection and BLUEs computation. The scripts run for a single trait according to one growth habit. This strategy could be adapted to other genebanks for the validation of their own data in order to increase the amount of data for well characterized accessions at no extra cost. The value of the data will be further leveraged by genotypic information which will become publicly available soon for the IPK barley collection. In the future, both, phenotypic and genotypic information will facilitate the implementation of genomic prediction which is expected to further boost the utilization of genetic resources for research and breeding19,40–42. By providing the investigated data using the ISA-Tab format and publishing them via DOI, all research data and the presented results are available in a FAIR-way20 and can be easily re-used.
Additional information
How to cite this article: Gonzalez, M. Y. et al. Unbalanced historical phenotypic data from seed regeneration of a barley ex situ collection. Sci. Data. 5:180278 doi: 10.1038/sdata.2018.278 (2018).
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
References
McKevith, B. Nutritional aspects of cereals. Nutr Bull 29, 111–142 (2004).
Food, FAO. Outlook: Biannual Report on Global Food Markets. Report of June http://www.fao.org/3/a-i7343e.pdf (2017).
Verstegen, H., Köneke, O., Korzun, V., von Broock, R. in Biotechnological Approaches to Barley Improvement Kumlehn J. & Stein N. eds. Ch. 1. Springer, (2014).
Thiel, T., Michalek, W., Varshney, R.K. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–422 (2003).
Malysheva-Otto, L.V., Ganal, M.W. & Röder, M.S. Analysis of molecular diversity, population structure and linkage disequilibrium in a worldwide survey of cultivated barley germplasm (Hordeum vulgare L.). BMC Genet. 7, 6 (2006).
Stracke, S. et al. Effects of introgression and recombination on haplotype structure and linkage disequilibrium surrounding a locus encoding Bymovirus resistance in barley. Genetics 175, 805–817 (2007).
Mascher, M. et al. A chromosome conformation capture ordered sequence of the barley genome. Nature 544, 427–433 (2017).
Beier, S. et al. Construction of a map-based reference genome sequence for barley, Hordeum vulgare L. Sci. Data 4, 170044 (2017).
Kilian, B. & Graner, A. NGS technologies for analyzing germplasm diversity in genebanks. Brief Funct Genomics 11, 38–50 (2012).
Philipp, N. et al. Genomic Prediction of Barley Hybrid Performance. Plant Genome 9 (2016).
Heslot, N., Jannink, J.-L. & Sorrells, M.E. Using genomic prediction to characterize environments and optimize prediction accuracy in applied breeding data. Crop Sci 53, 921–933 (2013).
Gepts, P. Plant genetic resources conservation and utilization. Crop Sci 46, 2278–2292 (2006).
van Hintum, T., Menting, F. in Diversity in Barley (Hordeum vulgare)von Bothmer, R., van Hintum, T., Knüpffer, H & Sato, K. eds. Ch. 12. Elsevier Science B. V., (2003).
FAO. The Second Report on the State of the World’s Plant Genetic Resources for Food and Agriculture http://www.fao.org/docrep/013/i1500e/i1500e.pdf (2010).
Sato, K., Flavell, A., Russell, J., Börner, A., Valkoun, J. in Biotechnological Approaches to Barley Improvement Kumlehn J. & Stein N. ) Ch. 2 (Springer, (2014).
de Carvalho, M. A. A. P. et al. Cereal landraces genetic resources in worldwide GeneBanks. A review. Agron Sustain Dev 33, 177–203 (2013).
Roa, C., Hamilton, R. S., Wenzl, P. & Powell, W. Plant Genetic Resources: Needs, Rights, and Opportunities. Trends Plant Sci 21, 633–636 (2016).
Graebner, R. C., Hayes, P. M., Hagerty, C. H. & Cuesta-Marcos, A. A comparison of polymorphism information content and mean of transformed kinships as criteria for selecting informative subsets of barley (Hordeum vulgare L. sl) from the USDA Barley Core Collection. Genet. Resour. Crop. Evol 63, 477–482 (2016).
Yu, X. et al. Genomic prediction contributing to a promising global strategy to turbocharge gene banks. Nat. Plants 2, 16150 (2016).
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
Krajewski, P. et al. Towards recommendations for metadata and data handling in plant phenotyping. J. Exp. Bot. 66, 5417–5427 (2015).
Oppermann, M., Weise, S., Dittmann, C. & Knupffer, H. GBIS: the information system of the German Genebank. Database 2015, bav021 (2015).
Hartung, K., Piepho, H.-P. & Knüpffer, H. Analysis of genebank evaluation data by using geostatistical methods. Genet. Resour. Crop. Evol 53, 737–751 (2006).
Haseneyer, G. et al. Population structure and phenotypic variation of a spring barley world collection set up for association studies. Plant Breed 129, 271–279 (2010).
González, M. Y. et al. Unlocking historical phenotypic data from an ex situ collection to enhance the informed utilization of genetic resources of barley (Hordeum sp.). Theor. Appl. Genet. 131, 2009–2019 (2018).
Milner, S. et al. Genebank genomics highlights the diversity of a global barley collection. Nat.Genet. 10.1038/s41588-018-0266-x (2019).
Börner, A. Preservation of plant genetic resources in the biotechnology era. Biotechnol J 1, 1393–1404 (2006).
Keilwagen, J. et al. Separating the wheat from the chaff–a strategy to utilize plant genetic resources from ex situ genebanks. Sci Rep 4, 5231 (2014).
Philipp, N. et al. Leveraging the use of historical data gathered during seed regeneration of an ex situ genebank collection of wheat. Front Plant Sci 9, 609 (2018).
Butler, D., Cullis, B. R., Gilmour, A. & Gogel, B. ASReml-R Reference Manual, release 3.0. Brisbane: Queensland Department of Primary Industries https://www.vsni.co.uk/downloads/asreml/release3/asreml-R.pdf (2009).
R Core Team. R: A Language and Environment for Statistical Computing, version 2.15.3. The R foundation for statistical computing (2013) Available at https://www.r-project.org/.
Arend, D. et al. PGP repository: a plant phenomics and genomics data publication infrastructure. Database 2016, 1–11 (2016).
Sansone, S.-A. et al. Toward interoperable bioscience data. Nat. Genet. 44, 121–126 (2012).
Estaghvirou, S. B. O., Ogutu, J.O. & Piepho, H.-P. Influence of outliers on accuracy estimation in genomic prediction in plant breeding. G3(Bethesda) 4, 2317–2328 (2014).
Bernal-Vasquez, A. M., Utz, H. F. & Piepho, H.P. Outlier detection methods for generalized lattices: a case study on the transition from ANOVA to REML. Theor. Appl. Genet. 129, 787–804 (2016).
Bischl, B., Mersmann, O., Trautmann, H. & Weihs, C. Resampling methods for meta-model validation with recommendations for evolutionary computation. Evol Comput 20, 249–275 (2012).
Stone, M. Cross-validatory choice and assessment of statistical predictions. J R Stat Soc B Stat Methodol 36, 111–147 (1974).
Vikram, P. et al. Unlocking the genetic diversity of Creole wheats. Sci Rep 6, 23092 (2016).
Muñoz-Amatriaín, M. et al. The USDA barley core collection: genetic diversity, population structure, and potential for genome-wide association studies. PloS ONE 9, e94688 (2014).
Meuwissen, T.H., Hayes, B.J. & Goddard, M.E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829 (2001).
Crossa, J. et al. Genomic Prediction of Gene Bank Wheat Landraces. G3 (Bethesda) 6, 1819–1834 (2016).
Gorjanc, G., Jenko, J., Hearne, S.J. & Hickey, J.M. Initiating maize pre-breeding programs using genomic selection to harness polygenic variation from landrace populations. BMC genomics 17, 30 (2016).
Data Citations
Gonzalez, M. Y. et al. IPK Gatersleben https://doi.org/10.5447/IPK/2018/10 (2018)
Acknowledgements
The Federal Ministry of Education and Research of Germany is acknowledged for funding (grant FKZ031B0184A (AWS) and FKZ031B0190A (MYG)).
Author information
Authors and Affiliations
Contributions
M.Y.G., A.G., Y.Z., N.P., and J.C.R. designed the study. M.Y.G. and A.W.S. wrote the paper. S.W., A.B., M.O. gathered and cleansed the historical phenotypic data. M.Y.G., Y.Z., and N.P. devised and conducted the computational experiments of the validation methods and processed the data. D.A. formatted the ISA-Tab - compliant metadata description for the presented data. All authors helped to enhance the manuscript. All authors agree with the current statement.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
ISA-Tab metadata
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.
About this article
Cite this article
Gonzalez, M., Weise, S., Zhao, Y. et al. Unbalanced historical phenotypic data from seed regeneration of a barley ex situ collection. Sci Data 5, 180278 (2018). https://doi.org/10.1038/sdata.2018.278
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/sdata.2018.278
This article is cited by
-
Genetic Trends Estimation in IRRIs Rice Drought Breeding Program and Identification of High Yielding Drought-Tolerant Lines
Rice (2022)
-
Opportunities and limits of controlled-environment plant phenotyping for climate response traits
Theoretical and Applied Genetics (2022)
-
Genomic prediction models trained with historical records enable populating the German ex situ genebank bio-digital resource center of barley (Hordeum sp.) with information on resistances to soilborne barley mosaic viruses
Theoretical and Applied Genetics (2021)
-
Historical phenotypic data from seven decades of seed regeneration in a wheat ex situ collection
Scientific Data (2019)