Review Article

Turning a hobby into a job: How duplicated genes find new functions

Nature Reviews Genetics volume 9, pages 938–950 (2008)

Abstract

Gene duplication provides raw material for functional innovation. Recent advances have shed light on two fundamental questions regarding gene duplication: which genes tend to undergo duplication? And how does natural selection subsequently act on them? Genomic data suggest that different gene classes tend to be retained after single-gene and whole-genome duplications. We also know that functional differences between duplicate genes can originate in several different ways, including mutations that directly impart new functions, subdivision of ancestral functions and selection for changes in gene dosage. Interestingly, in many cases the 'new' function of one copy is a secondary property that was always present, but that has been co-opted to a primary role after the duplication.

Key points

  • Natural selection uses duplicated genes as raw material for functional innovation, co-opting their existing features to new functions.

  • Understanding genetic innovation requires two questions to be addressed: which gene was involved in the duplication; and how has natural selection acted on that duplication to optimize the novel function?

  • Genes encoding enzymes, transporters and transcription factors often survive in duplicate. However, the mechanism of duplication is important: genes that are part of complex cellular networks are more easily duplicated by whole-genome duplication (WGD) than by small-scale duplication (SSD).

  • In order to have the potential to acquire a new function, a duplicate gene must come under the protection of natural selection so that it is not eliminated by degenerative mutations. At least three mechanisms can allow natural selection to preserve a duplicate gene pair: neofunctionalization, subfunctionalization and selection for gene dosage.

  • Strikingly, all three of the above mechanisms have been involved in the appearance of novel functions. For instance, dosage selection can maintain a gene duplication in order to provide sufficient expression of a gene product with a weak but beneficial new activity.

  • Such existing minor activities in genes might or might not be related to the gene's evolved function. Examples include enzymes with minor activities for substrates related to their primary substrate, and receptors with affinities for several ligands.

  • Subfunctionalization can also be involved in the process of generating novelty. An example is the GAL1–GAL3 gene duplication in Saccharomyces cerevisiae, in which a single gene first gained a novel function that was then optimized by duplication and adaptive subfunctionalization.

Acknowledgements

We thank K. Byrne, B. Cusack, J. Gordon, N. Khaldi, and J. Mower for discussions regarding the fates of duplicated genes. We would also like to thank three anonymous reviewers for critical comments. This work was supported by Science Foundation Ireland.

Author information

Affiliations

  1. Division of Animal Sciences, University of Missouri–Columbia, 163B Animal Sciences Center, 920 East Campus Drive, Columbia, Missouri 65211-5300, USA.

    • Gavin C. Conant
  2. Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin 2, Ireland.

    • Kenneth H. Wolfe

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Kenneth H. Wolfe.

Glossary

Subfunctionalization

A pair of duplicate genes are said to be subfunctionalized if each of the two copies of the gene performs only a subset of the functions of the ancestral single copy gene.

Genetic drift

Random fluctuations through time in the allele frequencies of a population, caused by a sampling effect in small populations. Drift can overcome the effects of natural selection if the selective differences between alleles are small.
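The sampling effect described above can be illustrated with a minimal neutral Wright-Fisher simulation. This is only a sketch under stated assumptions, not a method from the article: the function name `wright_fisher`, the diploid population with `2 * pop_size` gene copies, the absence of selection and mutation, and the fixed random seed are all choices made here for illustration.

```python
import random


def wright_fisher(pop_size, start_freq, generations, seed=0):
    """Track one allele's frequency under pure drift (hypothetical helper).

    Assumes a diploid population of constant size, so each generation
    consists of 2 * pop_size gene copies drawn at random from the
    previous generation. No selection or mutation is modelled.
    """
    rng = random.Random(seed)
    n_copies = 2 * pop_size
    count = round(start_freq * n_copies)
    trajectory = [count / n_copies]
    for _ in range(generations):
        p = count / n_copies
        # Binomial sampling: each new copy carries the allele with probability p.
        count = sum(1 for _ in range(n_copies) if rng.random() < p)
        trajectory.append(count / n_copies)
    return trajectory


# Same starting frequency, two population sizes (illustrative values only).
small = wright_fisher(pop_size=10, start_freq=0.5, generations=200)
large = wright_fisher(pop_size=1000, start_freq=0.5, generations=200)
```

Comparing `small` and `large` shows the expected pattern: the per-generation sampling variance is p(1 − p)/2N, so frequencies wander much more widely in the smaller population. Once a frequency reaches 0 or 1 it stays there, because no mutation is modelled.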

Neofunctionalization

A pair of duplicate genes in a population are said to be neofunctionalized if one of the two genes possesses a new, selectively beneficial function that was absent in the population before the duplication.

Retrotransposed

Describes a gene that has undergone duplication through a process that involves an mRNA intermediate. It occurs when a reverse transcriptase enzyme synthesizes DNA from an mRNA template and the DNA is then integrated into the genome. Because retrotransposition usually uses mature mRNAs as a substrate, the resulting duplicate genes often lack introns.

Degree distribution

The degree of a node in a network (in this case, a gene) is the number of interactions it has with other nodes in the network. Thus, in a protein–protein interaction network, the degree of a gene is the number of proteins that the product of the gene interacts with. The degree distribution of a network describes the frequency of nodes in that network with a given degree: many networks of biological interest show a power-law degree distribution.
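As a small illustration of this definition, the sketch below computes node degrees and the degree distribution for a toy undirected interaction network. The function name `degree_distribution`, the edge-list input and the toy protein labels are assumptions introduced here; self-interactions are ignored when counting partners, which is a simplification of this sketch rather than a convention taken from the article.

```python
from collections import Counter


def degree_distribution(edges):
    """Return (degrees, distribution) for an undirected network.

    degrees maps each node to its number of distinct interaction partners;
    distribution maps each degree to the fraction of nodes having it.
    """
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set())
        neighbours.setdefault(b, set())
        if a != b:  # assumption: a self-interaction does not add to the degree
            neighbours[a].add(b)
            neighbours[b].add(a)
    degrees = {node: len(partners) for node, partners in neighbours.items()}
    counts = Counter(degrees.values())
    total = len(degrees)
    distribution = {k: counts[k] / total for k in sorted(counts)}
    return degrees, distribution


# Toy network with hypothetical proteins A to E.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
degrees, dist = degree_distribution(toy_edges)
```

Here protein A has three partners, E has one and the others have two each. With a real interaction data set, plotting the distribution on log-log axes is one common way to check for the power-law shape mentioned above.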

About this article

DOI

https://doi.org/10.1038/nrg2482
