Genomic and metabolic prediction of complex heterotic traits in hybrid maize

Journal name:
Nature Genetics
Volume:
44,
Pages:
217–220
Year published:
DOI:
doi:10.1038/ng.1033
Received
Accepted
Published online

Maize is both an exciting model organism in plant genetics and also the most important crop worldwide for food, animal feed and bioenergy production. Recent genome-wide association and metabolic profiling studies aimed to resolve quantitative traits to their causal genetic loci and key metabolic regulators. Here we present a complementary approach that exploits large-scale genomic and metabolic information to predict complex, highly polygenic traits in hybrid testcrosses. We crossed 285 diverse Dent inbred lines from worldwide sources with two testers and predicted their combining abilities for seven biomass- and bioenergy-related traits using 56,110 SNPs and 130 metabolites. Whole-genome and metabolic prediction models were built by fitting effects for all SNPs or metabolites. Prediction accuracies ranged from 0.72 to 0.81 for SNPs and from 0.60 to 0.80 for metabolites, allowing a reliable screening of large collections of diverse inbred lines for their potential to create superior hybrids.

At a glance

Figures

  1. Genealogy of the population.
    Figure 1: Genealogy of the population.

    The tree was reconstructed from SNP data with the balanced minimum evolution (BME) algorithm47 on genetic distances. The breeding subgroups are distinguished by their color. T, tropical lines.

  2. Structure of whole-genome LD.
    Figure 2: Structure of whole-genome LD.

    (a) Decay of LD (r2) with distance between pairs of SNPs. LD averaged over chromosomes is given in distance bins of 50 kb. The value of r2 = 0.1 was reached at approximately 500 kb. (b) Probability distribution of LD between adjacent SNPs (bin size = 0.01).

  3. Observed versus whole-genome-predicted GCA for dry matter yield.
    Figure 3: Observed versus whole-genome–predicted GCA for dry matter yield.

    Results were averaged over all cross-validation runs. Dividing the set of genotypes based solely on their SNPs would result in 203 (74.9%) correct classifications (blue), 36 (13.3%) false negatives (red) and 32 (11.8%) false positives (yellow).

  4. Analysis of the core set of 124 lines.
    Figure 4: Analysis of the core set of 124 lines.

    (a) Results of a principal-component analysis on the SNP data. (b) Decay of LD (r2) with distance between pairs of SNPs. LD averaged over chromosomes is given in distance bins of 50 kb. The value of r2 = 0.1 was reached at approximately 275 kb. (c) Genealogy of the core set, estimated with the BME algorithm on genetic distances. (d) Observed versus whole-genome–predicted GCA for dry matter yield with the breeding subgroups distinguished by color. (e) Observed versus whole-genome–predicted GCA for dry matter yield, with different geographical origins distinguished by color. EU, Europe; NA, North America.

References

  1. Schnable, P.S. et al. The B73 maize genome: complexity, diversity, and dynamics. Science 326, 11121115 (2009).
  2. Montes, J.M., Technow, F., Dhillon, B.S., Mauch, F. & Melchinger, A.E. High-throughput non-destructive biomass determination during early plant development in maize under field conditions. Field Crops Res. 121, 268273 (2011).
  3. Montes, J.M., Melchinger, A.E. & Reif, J.C. Novel throughput phenotyping platforms in plant genetic studies. Trends Plant Sci. 12, 433436 (2007).
  4. Steinfath, M. et al. Discovering plant metabolic biomarkers for phenotype prediction using an untargeted approach. Plant Biotechnol. J. 8, 900911 (2010).
  5. Meyer, R.C. et al. The metabolic signature related to high plant growth in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 104, 47594764 (2007).
  6. Schauer, N. et al. Comprehensive metabolic profiling and phenotyping of interspecific introgression lines for tomato improvement. Nat. Biotechnol. 24, 447454 (2006).
  7. Kump, K.L. et al. Genome-wide association study of quantitative resistance to southern leaf blight in the maize association mapping population. Nat. Genet. 43, 163168 (2011).
  8. Tian, F. et al. Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat. Genet. 43, 159162 (2011).
  9. Poland, J.A. et al. Genome-wide nested association mapping of quantitative resistance to northern leaf blight in maize. Proc. Natl. Acad. Sci. USA 108, 68936898 (2011).
  10. Bernardo, R. Molecular markers and selection for complex traits in plants: learning from the last 20 years. Crop Sci. 48, 16491664 (2008).
  11. Xu, Y. & Crouch, J. Marker-assisted selection in plant breeding: from publications to practice. Crop Sci. 48, 391407 (2008).
  12. Heffner, E.L., Sorrells, M.E. & Jannink, J.-L. Genomic selection for crop improvement. Crop Sci. 49, 112 (2009).
  13. Lippman, Z.B. & Zamir, D. Heterosis: revisiting the magic. Trends Genet. 23, 6066 (2007).
  14. Hallauer, A.R., Carena, M.J. & Filho, J.B.M. Quantitative Genetics in Maize Breeding (Iowa State University Press, 2010).
  15. Meuwissen, T.H.E., Hayes, B.J. & Goddard, M.E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 18191829 (2001).
  16. Hayes, B.J. et al. Invited review: genomic selection in dairy cattle: progress and challenges. J. Dairy Sci. 92, 433443 (2009).
  17. Jannink, J.-L., Lorenz, A.J. & Iwata, H. Genomic selection in plant breeding: from theory to practice. Brief Funct. Genomics 9, 166177 (2010).
  18. Lorenzana, R.E. & Bernardo, R. Accuracy of genotypic value predictions for marker-based selection in biparental plant populations. Theor. Appl. Genet. 120, 151161 (2009).
  19. Yang, J. et al. Common SNPs explain a large proportion oft the heritability for human height. Nat. Genet. 42, 565569 (2010).
  20. Yang, J. et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat. Genet. 43, 519525 (2011).
  21. de los Campos, G., Gianola, D. & Allison, D.B. Predicting genetic predispositions in humans: the promise of whole-genome markers. Nat. Rev. Genet. 11, 880886 (2010).
  22. Smith, J.S.C. et al. Use of doubled haploids in maize breeding: implications for intellectual property protection and genetic diversity in hybrid crops. Mol. Breed. 22, 5159 (2008).
  23. Slatkin, M. Linkage disequilibrium—understanding the evolutionary past and mapping the medical future. Nat. Rev. Genet. 9, 477485 (2008).
  24. Ching, A. et al. SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genet. 3, 19 (2002).
  25. Van Inghelandt, D. et al. Extent and genome-wide distribution of linkage disequilibrium in commercial maize germplasm. Theor. Appl. Genet. 123, 1120 (2011).
  26. Yan, J. et al. Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers. PLoS ONE 4, e8451 (2009).
  27. Elshire, R.J. et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6, e19379 (2011).
  28. Meuwissen, T. & Goddard, M. Accurate prediction of genetic values for complex traits by whole-genome resequencing. Genetics 185, 623631 (2010).
  29. Fernie, A.R. & Schauer, N. Metabolomics-assisted breeding: a viable option for crop improvement? Trends Genet. 25, 3948 (2009).
  30. Stitt, M., Sulpice, R. & Keurentjes, J. Metabolic networks: how to identify key components in the regulation of metabolism and growth. Plant Physiol. 152, 428444 (2010).
  31. Sulpice, R. et al. Starch as a major integrator in the regulation of plant growth. Proc. Natl. Acad. Sci. USA 106, 1034810353 (2009).
  32. Lisec, J., Schauer, N., Kopka, J., Willmitzer, L. & Fernie, A.R. Gas chromatography mass spectrometry-based metabolite profiling in plants. Nat. Protoc. 1, 387396 (2006).
  33. Keurentjes, J.J.B. et al. The genetics of plant metabolism. Nat. Genet. 38, 842849 (2006).
  34. Yu, J. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203208 (2006).
  35. Piepho, H.-P. Ridge regression and extensions for genomewide selection in maize. Crop Sci. 49, 11651176 (2009).
  36. Albrecht, T. et al. Genome-based prediction of testcross values in maize. Theor. Appl. Genet. 123, 339350 (2011).
  37. Habier, D., Fernando, R.L., Kizilkaya, K. & Garrick, D.J. Extension of the bayesian alphabet for genomic selection. BMC Bioinformatics 12, 186 (2011).
  38. Clark, S.A., Hickey, J.M. & van der Werf, J.H.J. Different models of genetic variation and their effect on genomic evaluation. Genet. Sel. Evol. 43, 18 (2011).
  39. Goddard, M. Genomic selection: prediction of accuracy and maximation of long term response. Genetica 136, 245257 (2009).
  40. Duvick, D.N., Smith, J.S.C. & Cooper, M. Long term selection in a commercial hybrid maize breeding program. Plant Breed. Rev. 24, 109151 (2004).
  41. Eyre-Walker, A., Gaut, R.L., Hilton, H., Feldman, D.L. & Gaut, B.S. Investigation of the bottleneck leading to the domestication of maize. Proc. Natl. Acad. Sci. USA 95, 44414446 (1998).
  42. Hoisington, D. et al. Plant genetic resources: what can they contribute toward increased crop productivity. Proc. Natl. Acad. Sci. USA 96, 59375943 (1999).
  43. Nelson, P.T. et al. Molecular characterization of maize inbreds with expired U.S. plant variety protection. Crop Sci. 48, 16731685 (2008).
  44. Melchinger, A.E., Gumber, R.K., Leipert, R.B., Vuylsteke, M. & Kuiper, M. Prediction of testcross means and variances among F3 progenies of F1 crosses from testcross means and genetic distances of their parents in maize. Theor. Appl. Genet. 96, 503512 (1999).
  45. Saito, K. & Matsuda, F. Metabolomics for functional genomics, systems biology, and biotechnology. Annu. Rev. Plant Biol. 61, 463489 (2010).
  46. Fischer, S. et al. Trends in genetic variance components during 30 years of hybrid maize breeding at the University of Hohenheim. Plant Breed. 127, 446451 (2009).
  47. Desper, R. & Gascuel, O. Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle. J. Comput. Biol. 9, 687705 (2002).
  48. Cross, J.M. et al. Variation of enzyme activities and metabolite levels in 24 Arabidopsis accessions growing in carbon-limited conditions. Plant Physiol. 142, 15741588 (2006).
  49. Cuadros-Inostroza, Á. et al. TargetSearch—a bioconductor package for the efficient preprocessing of GC-MS metabolite profiling data. BMC Bioinformatics 10, 428 (2009).
  50. Chen, W.-M. & Abecasis, G.R. Family-based association tests for genome-wide association scans. Am. J. Hum. Genet. 81, 913926 (2007).
  51. Hayes, B.J., Visscher, P.M. & Goddard, M.E. Increased accuracy of artificial selection by using the realized relationship matrix. Genet. Res. (Camb). 91, 4760 (2009).
  52. Yang, J., Lee, H., Goddard, M.E. & Visscher, P.M. GCTA: a tool for fenome-wide complex trait analysis. Am. J. Hum. Genet. 88, 7682 (2011).
  53. Legarra, A., Robert-Granié, C., Manfredi, E. & Elsen, J.-M. Performance of genomic selection in mice. Genetics 180, 611618 (2008).

Download references

Author information

Affiliations

  1. Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany.

    • Christian Riedelsheimer,
    • Christoph Grieder,
    • Frank Technow &
    • Albrecht E Melchinger
  2. Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany.

    • Angelika Czedik-Eysenberg,
    • Jan Lisec,
    • Ronan Sulpice,
    • Mark Stitt &
    • Lothar Willmitzer
  3. Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany.

    • Thomas Altmann
  4. King Abdulaziz University, Jeddah, Saudi Arabia.

    • Lothar Willmitzer

Contributions

T.A., M.S., L.W. and A.E.M. designed the experiments and supervised research. C.R. and C.G. conducted the field experiments. C.R., A.C.-E., R.S. and J.L. performed the metabolic profiling. C.G. analyzed the phenotypic data. C.R. analyzed genomic and metabolic data and developed software. F.T. contributed to the statistical analysis. C.R. and A.E.M. wrote the manuscript.

Competing financial interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to:

Author details

Supplementary information

PDF files

  1. Supplementary Text and Figures (4M)

    Supplementary Figures 1–12, Supplementary Tables 1–4 and Supplementary Note

Additional data