Review Article | Published:

Evolution of crop species: genetics of domestication and diversification

Nature Reviews Genetics volume 14, pages 840852 (2013) | Download Citation


Domestication is a good model for the study of evolutionary processes because of the recent evolution of crop species (<12,000 years ago), the key role of selection in their origins, and good archaeological and historical data on their spread and diversification. Recent studies, such as quantitative trait locus mapping, genome-wide association studies and whole-genome resequencing studies, have identified genes that are associated with the initial domestication and subsequent diversification of crops. Together, these studies reveal the functions of genes that are involved in the evolution of crops that are under domestication, the types of mutations that occur during this process and the parallelism of mutations that occur in the same pathways and proteins, as well as the selective forces that are acting on these mutations and that are associated with geographical adaptation of crop species.

Key points

  • Quantitative trait locus mapping techniques, genome-wide association studies and next-generation sequencing technologies have led to the discovery of genes that drive domestication and that lead to evolutionary diversification of cultivated plant species.

  • Molecular data have provided insights into the nature of selection on these evolutionary genes in crops, as well as the nature of the genes and mutations that are associated with the process.

  • Early steps in domestication seem to be associated with transcription factor loci, whereas in later crop diversification, enzyme-coding genes are targeted by selection.

  • Loss-of-function point mutations are the most common mutational lesion that is found in domestication genes.

  • Although only a handful of species have been studied in-depth, shifts in both domestication- and diversification-related traits can be examined in population demographic analyses using molecular, historic and archaeological data.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.


  1. 1.

    , , , & The evolution of agriculture in insects. Annu. Rev. Ecol. Evol. Syst. 36, 563–595 (2005).

  2. 2.

    , & Rapid domestication of marine species. Science 316, 382–383 (2007).

  3. 3.

    The Variation of Animals and Plants Under Domestication (John Murray, 1868).

  4. 4.

    On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life (John Murray, 1859).

  5. 5.

    et al. A map of rice genome variation reveals the origin of cultivated rice. Nature 490, 497–501 (2012). This paper uses more than 1,500 accessions in whole-genome resequencing and high-resolution mapping to reveal the origin of rice, and the selective sweeps and fixed domestication mutations.

  6. 6.

    et al. Variation revealed by SNP genotyping and morphology provides insight into the origin of the tomato. PLoS ONE 7, e48198 (2012).

  7. 7.

    Biodiversity in Agriculture: Domestication, Evolution, and Sustainability. 110–135 (Cambridge Univ. Press, 2012).

  8. 8.

    , & Early agricultural pathways: moving outside the 'core area' hypothesis in Southwest Asia. J. Exp. Bot. 63, 617–633 (2012).

  9. 9.

    , & Patterns and processes in crop domestication: an historical review and quantitative analysis of 203 global food crops. New Phytol. 196, 29–48 (2012). This study provides an overview of the features of domesticated plant species.

  10. 10.

    & Man the Hunter (Aldine De Gruyter, 1999).

  11. 11.

    & Global state of biodiversity and loss. Annu. Rev. Environ. Resour. 28, 137–167 (2003).

  12. 12.

    et al. (eds.) Biodiversity in Agriculture: Domestication, Evolution, and Sustainability (Cambridge Univ. Press, 2012).

  13. 13.

    in Biodiversity in Agriculture: Domestication, Evolution, and Sustainability (eds Gepts, P. et al.) 92–109 (Cambridge Univ. Press, 2012).

  14. 14.

    & Measured domestication rates in wild wheats and barley under primitive cultivation, and their archaeological implications. J. World Prehist. 4, 157–222 (1990).

  15. 15.

    & Archaeological data reveal slow rates of evolution during plant domestication. Evolution. 65, 171–183 (2011).

  16. 16.

    Evolution of Crop Plants (Longman, 1976).

  17. 17.

    & Seed dispersal and crop domestication: shattering, germination, and seasonality in evolution under cultivation. Annu. Plant Rev. 38, 238–295 (2009).

  18. 18.

    , & Early domesticated fig in the Jordan Valley. Science. 312, 1372–1374 (2006).

  19. 19.

    Unconscious selection and the evolution of domesticated plants. Econ. Bot. 58, 5–10 (2004).

  20. 20.

    , , , & Ecogeographical diversity — a Vavilovian appraoch. Dev. Plant Genet. Breed. 7, 53–76 (2003).

  21. 21.

    et al. Preceramic maize from Paredones and Huaca Prieta, Peru. Proc. Natl Acad. Sci. USA 109, 1755–1759 (2012).

  22. 22.

    Origin and domestication of Lactuca sativa L. Genet. Res. Crop Evol. 44, 165–174 (1997).

  23. 23.

    Contrasting patterns in crop domestication and domestication rates: recent archaeobotanical insights from the Old World. Ann. Bot. 100, 903–924 (2007).

  24. 24.

    & Genetic approaches to crop improvement: responding to environmental and population changes. Nature Rev. Genet. 9, 444–457 (2008).

  25. 25.

    , , , & Conservation of gene function in the Solanaceae as revealed by comparative mapping of domestication traits in eggplant. Genetics 161, 1713–1726 (2002).

  26. 26.

    , & Suppressor of sessile spikelets1 (Sos1): a dominant mutant affecting inflorescence development in maize. Am. J. Bot. 82, 571–577 (1995).

  27. 27.

    , & The evolution of apical dominance in maize. Nature 386, 485–481 (1997). This paper reports the isolation of one of the first domestication genes.

  28. 28.

    , & Genetic control of the domestication syndrome in common bean. Crop Sci. 36, 1037–1045 (1996).

  29. 29.

    Crop domestication as a long-term selection experiment. Plant Breed. Rev. 24, 1–44 (2004).

  30. 30.

    et al. Cereal domestication and evolution of branching: evidence for soft selection in the Tb1 orthologue of pearl millet (Pennisetum glaucum [L.] R. Br.). PLoS ONE 6, e22404 (2011).

  31. 31.

    et al. Comparative analysis of QTLs affecting domestication traits between two domesticated x wild pearl millet (Pennisetum glaucum L., Poaceae) crosses. Theor. Appl. Genet. 104, 965–975 (2002).

  32. 32.

    & QTL clusters reflect character associations in wild and cultivated rice. Theor. Appl. Genet. 104, 1217–1228 (2002).

  33. 33.

    Das domestikationssyndrom. Die Kulturpflanze 32, 11–34 (in German) (1984).

  34. 34.

    & Inheritance of the morphological differences between maize and teosinte: comparison of results for two F2 populations. Genetics 134, 559–570 (1993). This is one of the classic papers on QTL mapping of domestication traits.

  35. 35.

    et al. Mapping quantitative trait loci for yield, yield components and morphological traits in an advanced backcross population between Oryza rufipogon and the Oryza sativa cultivar Jefferson. Theor. Appl. Genet. 107, 479–493 (2003).

  36. 36.

    et al. The genetic architecture of flowering time and photoperiod sensitivity in maize as revealed by QTL review and meta analysis. J. Integr. Plant. Biol. 54, 358–373 (2012).

  37. 37.

    et al. Genetic control and comparative genomic analysis of flowering time in Setaria (Poaceae). G3 3, 283–295 (2013).

  38. 38.

    Genetic and molecular dissection of naturally occurring variation. Curr. Opin. Plant Biol. 4, 130–135 (2001).

  39. 39.

    et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nature Commun. 2, 467 (2011).

  40. 40.

    et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nature Genet. 42, 961–967 (2010).

  41. 41.

    et al. Population genomic and genome-wide association studies of agroclimatic traits in sorghum. Proc. Natl Acad. Sci. USA 110, 453–458 (2013). This is a large-scale study that identifies regions with selective sweeps that indicate potential diversification genomic regions.

  42. 42.

    et al. A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica). Nature Genet. 45, 957–961 (2013).

  43. 43.

    et al. Comparative population genomics of maize domestication and improvement. Nature Genet. 44, 808–811 (2012).

  44. 44.

    , , & Long-range patterns of diversity and linkage disequilibrium surrounding the maize Y1 gene are indicative of an asymmetric selective sweep. Proc. Natl Acad. Sci. USA 101, 9885–9890 (2004).

  45. 45.

    , & Tzeltal and Tzotzil farmer knowledge and maize diversity in Chiapas, Mexico. Curr. Anthropol. 48, 289–300 (2007).

  46. 46.

    et al. Artificial selection for a Green Revolution gene during japonica rice domestication. Proc. Natl Acad. Sci. USA 108, 11034–11039 (2011).

  47. 47.

    et al. Genetic and molecular analysis of utility of sd1 alleles in rice breeding. Breed. Sci. 57, 53–58 (2007).

  48. 48.

    & A bountiful harvest: genomic insights into crop domestication phenotypes. Annu. Rev. Plant. Biol. 64, 47–70 (2013).

  49. 49.

    & The nature of selection during plant domestication. Nature. 457, 843–848 (2009).

  50. 50.

    et al. Genic and non-genic contributions to natural variation of quantitative traits in maize. Genome Res. 22, 2436–2444 (2012).

  51. 51.

    , , , & Fine mapping, physical mapping and development of diagnostic markers for the Rrs2 scald resistance gene in barley. Theor. Appl. Genet. 119, 1507–1522 (2009).

  52. 52.

    , , , Pattern of diversity in the genomic region near the maize domestication gene tb1. Proc. Natl Acad. Sci. USA 101, 700–707 (2004).

  53. 53.

    , , & Identification of a functional transposon insertion in the maize domestication gene tb 1. Nature Genet. 43, 1160–1163 (2011). This is a functional analysis of a cis -regulatory polymorphism that results in a domesticated trait.

  54. 54.

    et al. The effects of artificial selection on the maize genome. Science 308, 1310–1314 (2005). This is one of the first systematic estimates of the number of genes in a domesticated plant genome that shows evidence of positive selection.

  55. 55.

    et al. Construction of a genetic linkage map and genetic analysis of domestication related traits in mungbean (Vigna radiata). PLoS ONE 7, e41304 (2012).

  56. 56.

    et al. Molecular evidence for a single evolutionary origin of domesticated rice. Proc. Natl Acad. Sci. USA 108, 8351–8356 (2011).

  57. 57.

    , & Natural alleles at a tomato fruit size quantitative trait locus differ by heterochronic regulatory mutations. Proc. Natl Acad. Sci. USA 99, 13606–13611 (2002).

  58. 58.

    & Genetic and molecular regulation of fruit and plant domestication traits in tomato and pepper. J. Exp. Bot. 58, 3841–3852 (2007).

  59. 59.

    , & Rice domestication by reducing shattering. Science 311, 1936–1939 (2006).

  60. 60.

    Population-based resequencing analysis of wild and cultivated barley revealed weak domestication signal of selection and bottleneck in the Rrs2 scald resistance gene region. Genome 55, 93–104 (2012).

  61. 61.

    et al. Distribution of SUN, OVATE, LC, and FAS in the tomato germplasm and the relationship to fruit shape diversity. Plant Physiol. 156, 275–285 (2011).

  62. 62.

    , & How reliable are empirical genomic scans for selective sweeps? Genome Res. 16, 702–712 (2006).

  63. 63.

    , & Plant domestication, a unique opportunity to identify the genetic basis of adaptation. Proc. Natl Acad. Sci. USA 104, 8641–8648 (2007).

  64. 64.

    & Detecting local adaptation using the joint sampling of polymorphism data in the parental and derived populations. Genetics 179, 1713–1720 (2008).

  65. 65.

    et al. OsLG1 regulates a closed panicle trait in domesticated rice. Nature Genet. 45, 462–465 (2013). This is an excellent example of fine mapping, identification of a selective sweep, and functional characterization and identification of a causative mutation that validates LG1 as a domestication gene.

  66. 66.

    et al. Genetic control of inflorescence architecture during rice domestication. Nature Commun. 4, 2200 (2013).

  67. 67.

    et al. The genetic architecture of complex traits in teosinte (Zea mays ssp. parviglumis): new evidence from association mapping. Genetics 180, 1221–1232 (2008).

  68. 68.

    et al.INTERMEDIUM-C, a modifier of lateral spikelet fertility in barley, is an ortholog of the maize domestication gene TEOSINTE BRANCHED 1. Nature Genet. 43, 169–172 (2011).

  69. 69.

    et al. Population-based resequencing reveals that the flowering time adaptation of cultivated barley originated east of the Fertile Crescent. Mol. Biol. Evol. 25, 2211–2219 (2008).

  70. 70.

    , & The molecular genetics of crop domestication. Cell 127, 1309–1321 (2006).

  71. 71.

    Phylogenetic Systematics (Univ. of Illinois Press, 1966).

  72. 72.

    , & Parallel genotypic adaptation: when evolution repeats itself. Genetica 123, 157–170 (2005).

  73. 73.

    & Parallel adaptation: one or many waves of advance of an advantageous allele. Genetics 186, 647–668 (2010).

  74. 74.

    The law of homologous series in variation. J. Genet. 12, 47–89 (1922). This is a classic paper on the recurring traits that are seen among crops and their influence on the development of core evolutionary concepts.

  75. 75.

    et al. Parallel domestication of the Shattering1 genes in cereals. Nature Genet. 44, 720–724 (2012).

  76. 76.

    et al. Genetic control of rice plant architecture under domestication. Nature Genet. 40, 1365–1369 (2008).

  77. 77.

    et al. Control of a key transition from prostrate to erect growth in rice domestication. Nature Genet. 40, 1360–1364 (2008).

  78. 78.

    et al. The Rc and Rd genes are involved in proanthocyanidin synthesis in rice pericarp. Plant J. 49, 91–102 (2007).

  79. 79.

    , , & Caught red-handed: Rc encodes a basic helix-loop-helix protein conditioning red pericarp in rice. Plant Cell. 18, 283–294 (2006).

  80. 80.

    , , & A natural mutation in rc reverts white-rice-pericarp to red and results in a new, dominant, wild-type allele: Rc-g. Theor. Appl. Genet. 117, 575–580 (2008).

  81. 81.

    et al. Seeing red: the origin of grain pigmentation in US weedy rice. Mol. Ecol. 19, 3380–3393 (2010).

  82. 82.

    et al. Global dissemination of a single mutation conferring white pericarp in rice. PLoS Genetics 3, e133 (2007). This paper shows how introgression between populations leads to the spread of a domestication trait.

  83. 83.

    et al. Convergent domestication of cereal crops by independent mutations at corresponding genetic loci. Science 269, 1714–1718 (1995).

  84. 84.

    et al. The amylose content in rice endosperm is related to the post-transcriptional regulation of the waxy gene. Plant J. 7, 613–622 (1995).

  85. 85.

    & Molecular evidence on the origin and evolution of glutinous rice. Genetics 162, 941–950 (2002).

  86. 86.

    , , , & Molecular basis of the waxy endosperm starch phenotype in broomcorn millet (Panicum miliaceum L.). Mol. Biol. Evol. 27, 1478–1494 (2010).

  87. 87.

    , & Structural variation in the Waxy gene and differentiation in foxtail millet [Setaria italica (L.) P. Beauv.]: implications for multiple origins of the waxy phenotype. Mol. Genet. Genom. 268, 214–222 (2002).

  88. 88.

    , , & The molecular basis of mutations at the Waxy locus from Amaranthus caudatus L.: evolution of the waxy phenotype in three species of grain amaranth. Mol. Breed. 30, 511–520 (2012).

  89. 89.

    , & The molecular basis of white pericarps in African domesticated rice: novel mutations at the Rc gene. J. Evol. Biol. 23, 2747–2753 (2010).

  90. 90.

    SHAT1, a new player in seed shattering of rice. Plant Cell. 24, 839 (2012).

  91. 91.

    et al. Genetic control of seed shattering in rice by the APETALA2 transcription factor SHATTERING ABORTION1. Plant Cell 24, 1034–1048 (2012).

  92. 92.

    , , & The origin and evolution of fragrance in rice (Oryza sativa L.). Proc. Natl Acad. Sci. USA 106, 14444–14449 (2009).

  93. 93.

    , , , & Starch grain and phytolith evidence for early ninth millennium B.P. maize from the Central Balsas River Valley, Mexico. Proc. Natl Acad. Sci. USA 106, 5019–5024 (2009).

  94. 94.

    et al. Early allelic selection in maize as revealed by ancient DNA. Science. 302, 1206–1208 (2003).

  95. 95.

    & Developmental analysis of teosinte glume architecture1: a key locus in the evolution of maize (Poaceae). Am. J. Bot. 84, 1313 (1997).

  96. 96.

    et al. The origin of the naked grains of maize. Nature. 436, 714–719 (2005).

  97. 97.

    et al. Angiosperm phylogeny: 17 genes, 640 taxa. Am. J. Bot. 98, 704–730 (2011).

  98. 98.

    Angiosperm Phylogeny Website, Version 12. , (2012).

  99. 99.

    Waxy endosperm in coix and sorghum. J. Hered. 12, 396–400 (1921).

  100. 100.

    Differential regulation of waxy gene expression in rice endosperm. Theor. Appl. Genet. 68, 467–473 (1984).

  101. 101.

    et al. Selection under domestication: evidence for a sweep in the rice waxy genomic region. Genetics 173, 975–983 (2006). This is an early molecular population genetic analysis of a crop diversification gene.

  102. 102.

    et al. Haplotype structure at seven barley genes: relevance to gene pool bottlenecks, phylogeny of ear type and site of barley domestication. Mol. Genet. Genom. 276, 230–241 (2006).

  103. 103.

    et al. Phylogeographic evidence of crop neodiversity in sorghum. Genetics 179, 997–1008 (2008).

  104. 104.

    in Redefining Nature: Ecology, Culture and Domestication (eds Ellen, R. & Fukui, K.) 215–231 (Berg, 1996).

  105. 105.

    et al. Deletion commonly found in Waxy gene of Japanese and Korean cultivars of Job's tears (Coix lacryma-jobi L.). Mol. Breed. 30, 1747–1756 (2012).

  106. 106.

    , , & Origin of waxy common millet, Panicum miliaceum L. in Japan. Genet. Res. Crop Evol. 59, 1303–1308 (2012).

  107. 107.

    , & Diverse origins of waxy foxtail millet crops in East and Southeast Asia mediated by multiple transposable element insertions. Mol. Genet. Genom. 274, 131–140 (2005).

  108. 108.

    On waxy barleys in Japan. Seiken Jiho. 4, 111–113 (in Japanese) (1950).

  109. 109.

    The grain amaranths and their relatives: a revised taxonomic and geographic. Ann. Missouri Bot. Gard. 54, 103–137 (1967).

  110. 110.

    et al. Assessment of genetic diversity in Peruvian amaranth (Amaranthus caudatus and A. hybridus) germplasm using single nucleotide polymorphism markers. Crop Sci. 53, 532–541 (2013).

  111. 111.

    et al. Grinding up wheat: a massive loss of nucleotide diversity since domestication. Mol. Biol. Evol. 24, 1506–1517 (2007).

  112. 112.

    et al. New insight into the history of domesticated apple: secondary contribution of the European wild apple to the genome of cultivated varieties. PLoS Genet. 8, e1002703 (2012).

  113. 113.

    et al. Genetic structure and domestication of carrot (Daucus carota subsp. sativus) (Apiaceae). Am. J. Bot. 100, 930–938 (2013).

  114. 114.

    , , , & Reproductive isolation during domestication. Plant Cell. 24, 2710–2717 (2012).

  115. 115.

    & From forest to field: perennial fruit crop domestication. Am. J. Bot. 98, 1389–1414 (2011). This paper is an overview of the state of understanding about perennial crop domestication traits and demographic histories.

  116. 116.

    et al. Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nature Genet. 30, 105–111 (2011).

  117. 117.

    et al. An SNP caused loss of seed shattering during rice domestication. Science 312, 1392–1396 (2006). This is a classic paper on the isolation of a gene for non-shattering, which is a major domestication trait.

  118. 118.

    , & Inference of the japonica rice domestication process from the distribution of six functional nucleotide polymorphisms of domestication-related genes in various landraces and modern cultivars. Plant Cell Physiol. 49, 1283–1293 (2008).

  119. 119.

    , & The common bean growth habit gene PvTFL1y is a functional homolog of Arabidopsis TFL1. Theor. Appl. Genet. 124, 1539–1547 (2012).

  120. 120.

    et al. Molecular genetic basis of pod corn (Tunicate maize). Proc. Natl Acad. Sci. USA 109, 7115–7120 (2012).

Download references


The authors thank the members of the Purugganan laboratory for their feedback, particularly J. Flowers, A. Plessis and U. Rosas. R.S.M. is supported by a postdoctoral fellowship from the US National Science Foundation Plant Genome Research Program (NSF PGRP). Work on domestication in the Purugganan laboratory is also funded by grants from NSF PGRP and the New York University Abu Dhabi Research Institute, United Arab Emirates.

Author information


  1. Center for Genomics and Systems Biology, Department of Biology, 12 Waverly Place, New York University, New York 10003, USA.

    • Rachel S. Meyer
    •  & Michael D. Purugganan
  2. Center for Genomics and Systems Biology, New York University Abu Dhabi Research Institute, Abu Dhabi, United Arab Emirates.

    • Michael D. Purugganan


  1. Search for Rachel S. Meyer in:

  2. Search for Michael D. Purugganan in:

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Michael D. Purugganan.

Supplementary information

PDF files

  1. 1.

    Supplementary information S1 (table)

    List of 353 global domesticated food crops

  2. 2.

    Supplementary information S3 (table)

Excel files

  1. 1.

    Supplementary information S2 (table)

    Expanded table of domestication and diversification genes


Quantitative trait locus

(QTL). A genomic region with a gene (or multiple linked genes) that contains mutations which result in phenotypic variation in populations.

Genome-wide association studies

(GWASs). Studies that use linkage disequilibrium between dense, usually single-nucleotide polymorphism, markers across the genome to identify significant associations between genes (or genomic regions) and trait phenotypes.

Conscious selection

The intentional choice, made by humans, of preferred phenotypes in cultivated plants for use and propagation.

Unconscious selection

Natural selection in crop species as a result of human cultivation practices and of growth in agro-ecological environments.

Green Revolution

A series of research, breeding and technology transfer programmes in the mid-twentieth century that resulted in marked increases in agricultural productivity in developing countries.


Introduction of a wild-type allele into a mutant individual, through either genetic crosses or transgenic methods, to confirm that a particular gene causes a specific phenotype.

Causative mutations

Mutations that lead to altered gene functions, which result in specific phenotypes.


Increase in the frequency of an allelic variant until it is found in all individuals in a population.

Selective sweeps

Rapid increases in population frequencies of positively selected mutations and linked neutral mutations, which result in significant reductions in nucleotide diversity in localized regions of the genome that flank the selected mutations.


Recurrent crossing that leads to the sharing of alleles between gene pools (which can be unidirectional), such as between domesticated and wild populations.

Genetic bottlenecks

Marked decreases in genetic diversity that are caused by reductions in effective population sizes.

Domestication syndrome

The selection of traits that distinguish domesticated species from their wild progenitors; similar traits were often observed to occur in different crops, which led people to view them as a 'syndrome'.

Nonsense mutations

Point mutations that transform amino acid-encoding codons into premature stop codons, which result in the generation of truncated proteins.

cis-regulatory mutations

Mutations in linked, usually non-coding portions, of genes that alter levels and/or patterns of transcription of the linked gene.

Missense mutations

Point mutations that change the identities of encoded amino acids, which result in changes in protein sequences.

Nucleotide diversity

The number of single-nucleotide polymorphism in a genomic region, usually estimated as the mean level of pairwise nucleotide divergence in a sample or a population.

Parallel evolution

Independent evolution of the same trait in different species.

About this article

Publication history



Further reading