Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes1, 2, 3. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures4, 5. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6, 7, 8, 9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9, 10, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized.
At a glance
- Mutational effects and the evolution of new protein functions. Nature Rev. Genet. 11, 572–582 (2010) &
- Evolution of genes and genomes on the Drosophila phylogeny. Nature 450, 203–218 (2007) et al.
- Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature 463, 536–539 (2010) et al.
- Evolution. Convergent evolution of hearing. Science 338, 894–895 (2012)
- Convergent evolution of Darwin’s finches caused by introgressive hybridization and selection. Evolution 58, 1588–1599 (2004) , , , &
- Detection of convergent and parallel evolution at the amino acid sequence level. Mol. Biol. Evol. 14, 527–536 (1997) &
- Convergent evolution of major histocompatibility complex molecules in humans and New World monkeys. Immunogenetics 51, 169–178 (2000) , , &
- The hearing gene Prestin reunites echolocating bats. Proc. Natl Acad. Sci. USA 105, 13959–13964 (2008) et al.
- Evidence for an ancient adaptive episode of convergent molecular evolution. Proc. Natl Acad. Sci. USA 106, 8986–8991 (2009) et al.
- Cetaceans on a molecular fast track to ultrasonic hearing. Curr. Biol. 20, 1834–1839 (2010) , , , &
- 89–98 (Univ. Chicago Press, 2004) & in Echolocation in Bats and Dolphins (eds , & )
- Echolocation in dolphins and bats. Phys.Today 60, 40–45 (2007) &
- Microbat paraphyly and the convergent evolution of a key innovation in Old World rhinolophoid microbats. Proc. Natl Acad. Sci. USA 99, 1431–1436 (2002) et al.
- Molecular evidence regarding the origin of echolocation and flight in bats. Nature 403, 188–192 (2000) et al.
- Bat echolocation calls: adaptation and convergent evolution. Proc. R. Soc. B 274, 905–912 (2007) &
- Parallel signatures of sequence evolution among hearing genes in echolocating mammals: an emerging model of genetic convergence. Heredity 108, 480–489 (2012) , , , &
- The voltage-gated potassium channel subfamily KQT member 4 (KCNQ4) displays parallel evolution in echolocating bats. Mol. Biol. Evol. 29, 1441–1450 (2012) et al.
- Parallel evolution of auditory genes for echolocation in bats and toothed whales. PLoS Genet. 8, e1002788 (2012) , , , &
- Convergent sequence evolution between echolocating bats and dolphins. Curr.Biol. 20, R53–R54 (2010) et al.
- Comparative analysis of bat genomes provides insight into the evolution of flight and immunity. Science 339, 456–460 (2013) et al.
- Genome-wide scans for candidate genes involved in the aquatic adaptation of dolphins. Genome Biol. Evol. 5, 130–139 (2013) et al.
- The evolution of echolocation in bats. Trends Ecol. Evol. 21, 149–156 (2006) &
- Using genomic data to unravel the root of the placental mammal phylogeny. Genome Res. 17, 413–421 (2007) , , , &
- A high-resolution map of human evolutionary constraint using 29 mammals. Nature 478, 476–482 (2011) et al.
- Phylogenomic analysis resolves the interordinal relationships and rapid diversification of the laurasiatherian mammals. Syst. Biol. 61, 150–164 (2012) et al.
- The evolution of color vision in nocturnal mammals. Proc. Natl Acad. Sci. USA 106, 8980–8985 (2009) et al.
- Rhodopsin molecular evolution in mammals inhabiting low light environments. PLoS ONE 4, e8326 (2009) et al.
- Spectral-tuning mechanisms of marine mammal rhodopsins and correlations with foraging depth. Vis. Neurosci. 17, 781–788 (2000) &
- Role of p63 and the Notch pathway in cochlea development and sensorineural deafness. Proc. Natl. Acad. Sci. USA 110, 7300–7305 (2013) et al.
- The cell cycle and the development and regeneration of hair cells. Curr. Top. Dev. Biol. 57, 449–466 (2003)
- Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973 (2012) , , &
- SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25, 1966–1967 (2009) et al.
- genBlastG: using BLAST searches to build homologous gene models. Bioinformatics 27, 2141–2143 (2011) et al.
- Genome sequencing reveals insights into physiology and longevity of the naked mole rat. Nature 479, 223–227 (2011) et al.
- CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067 (2007) , &
- Assessing the gene space in draft genomes. Nucleic Acids Res. 37, 289–297 (2009) , , , &
- transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences. BMC Bioinform. 6, 156 (2005)
- MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511–518 (2005) , , &
- Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000)
- Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols 4, 44–57 (2009) , &
- MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 5, 113 (2004)
- RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690 (2006)
- 2006) in Proceedings of 20th IEEE/ACM International Parallel and Distributed Processing Symposium (IPDPS2006) (High Performance Computational Biology Workshop,
- Human adaptations to diet, subsistence, and ecoregion are due to subtle shifts in allele frequency. Proc. Natl Acad. Sci. USA 107, 8924–8930 (2010) et al.
- PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007)
- The Newick utilities: high-throughput phylogenetic tree processing in the UNIX shell. Bioinformatics 26, 1669–1670 (2010) &
- OrthoMaM: a database of orthologous genomic markers for placental mammal phylogenetics. BMC Evol.Biol. 7, 241 (2007) et al.
- PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25, 2286–2288 (2009) , &
- Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155, 431–449 (2000) , , &
- Controlling the false discovery rate—a practical and powerful approach to multiple testing. J. R. Stat. Soc. B. 57, 289–300 (1995) &
- A maximum likelihood method for detecting functional divergence at individual codon sites, with application to gene family evolution. J. Mol. Evol. 59, 121–132 (2004) &
- Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites. Genetics 168, 1041–1051 (2004) , , &
- Construction of phylogenetic trees. Science 155, 279–284 (1967) &
- Estimates of positive Darwinian selection are inflated by errors in sequencing, annotation, and alignment. Genome Biol. Evol. 1, 114–118 (2009) et al.
- The Gene Ontology Consortium. Gene ontology: tool for the unification of biology. Nature Genet. 25, 25–29 (2000)
- KEGG for integration and interpretation of large-scale molecular datasets. Nucleic Acids Res. 40, D109–D114 (2012) , , , &
- Human protein reference database - 2009 update. Nucleic Acids Res. 37, D767–D772 (2008) et al.
- Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120, 15–20 (2005) , &
- The Genetic Association Database. Nature Genet. 36, 431–432 (2004) , , &
- Supplementary Information (3.1 MB)
This file contains Supplementary Tables 1-13, Supplementary Figures 1- 9, Supplementary Methods and Supplementary References.