The genomic basis of adaptive evolution in threespine sticklebacks

Jones, Felicity C.; Grabherr, Manfred G.; Chan, Yingguang Frank; Russell, Pamela; Mauceli, Evan; Johnson, Jeremy; Swofford, Ross; Pirun, Mono; Zody, Michael C.; White, Simon; Birney, Ewan; Searle, Stephen; Schmutz, Jeremy; Grimwood, Jane; Dickson, Mark C.; Myers, Richard M.; Miller, Craig T.; Summers, Brian R.; Knecht, Anne K.; Brady, Shannon D.; Zhang, Haili; Pollen, Alex A.; Howes, Timothy; Amemiya, Chris; Lander, Eric S.; Di Palma, Federica; Lindblad-Toh, Kerstin; Kingsley, David M.

doi:10.1038/nature10944

Download PDF

Article
Open access
Published: 04 April 2012

The genomic basis of adaptive evolution in threespine sticklebacks

Felicity C. Jones¹^na1,
Manfred G. Grabherr^2,3^na1,
Yingguang Frank Chan¹^na1^nAff11,
Pamela Russell²^na1,
Evan Mauceli²^nAff11,
Jeremy Johnson²,
Ross Swofford²,
Mono Pirun²^nAff11,
Michael C. Zody²,
Simon White⁴,
Ewan Birney⁵,
Stephen Searle⁴,
Jeremy Schmutz⁶,
Jane Grimwood⁶,
Mark C. Dickson⁶,
Richard M. Myers⁶,
Craig T. Miller¹^nAff11,
Brian R. Summers¹,
Anne K. Knecht¹,
Shannon D. Brady¹,
Haili Zhang¹,
Alex A. Pollen¹,
Timothy Howes¹,
Chris Amemiya⁷,
Broad Institute Genome Sequencing Platform & Whole Genome Assembly Team,
Eric S. Lander²,
Federica Di Palma²,
Kerstin Lindblad-Toh^2,3 &
…
David M. Kingsley^1,8

Nature volume 484, pages 55–61 (2012)Cite this article

53k Accesses
1214 Citations
86 Altmetric
Metrics details

Subjects

Abstract

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine–freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine–freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.

Hybrid speciation driven by multilocus introgression of ecological traits

Article Open access 17 April 2024

Evolution of tissue-specific expression of ancestral genes across vertebrates and insects

Article 15 April 2024

Diversity-dependent speciation and extinction in hominins

Article Open access 17 April 2024

Main

The genetic and molecular basis of adaptive evolution is still largely unknown. Some researchers have championed a pre-eminent role for regulatory changes during evolution of adaptive phenotypes, because such changes may avoid pleiotropic consequences of protein-coding alterations^1,2,3. Others have catalogued known phenotypic differences caused by protein-coding changes and have questioned whether sufficient case histories exist to estimate the relative frequency of regulatory and coding changes during adaptive evolution⁴. Despite progress on individual traits⁵, it has been difficult to accumulate enough examples in any particular group to obtain an overall picture of molecular mechanisms underlying evolutionary change, particularly for clearly adaptive phenotypes in wild organisms.

Threespine sticklebacks offer a powerful system for studying the molecular basis of adaptive evolution in vertebrates. After the retreat of Pleistocene glaciers, marine sticklebacks colonized and adapted to many newly formed freshwater habitats, evolving repeated changes in body shape, skeletal armour, trophic specializations, pigmentation, salt handling, life history and mating preferences^6,7. Recurrent evolution of similar phenotypes in similar environments indicates that these traits evolve by natural selection⁸. Distinctive marine and freshwater forms can still hybridize, making it possible to map the genetic basis of individual traits, and identify particular genes underlying armour, pelvic and pigmentation evolution^9,10,11,12. At two of these key loci, distinctive haplotypes were found to be reused when similar phenotypes evolve in different populations^11,12, a pattern that was later found at additional loci^13,14. Ongoing gene flow between marine and freshwater forms occurs along coastal rivers^15,16, making it possible to spread adaptive alleles among populations, and homogenizing neutral genomic regions¹⁷. Here we use signatures of allele sharing to identify a genome-wide set of adaptive loci consistently associated with recurrent marine–freshwater evolution.

Generation of reference genome assembly

To facilitate studies of stickleback evolution, we first generated a reference genome assembly from a homogametic (female) freshwater stickleback (Gasterosteus aculeatus) from Bear Paw Lake, Alaska. The sequenced individual was partially inbred and retained heterozygosity at approximately 1 per 700 base pairs (bp). The assembly, gasAcu1.0, was generated with 9.0× coverage in Sanger sequence data (ABI3730), and has a length-weighted median (N50) contig size of 83.2 kilobases (kb), a length-weighted median (N50) scaffold size of 10.8 megabases (Mb) and a total gapped size of 463 Mb, close to previous estimates of 530 Mb (ref. 18). The 113 largest scaffolds (86.9%, 400.4 Mb) were anchored to stickleback linkage groups in an F₂ marine × freshwater intercross, whereas 60.7 Mb in 1,812 smaller scaffolds (N50 = 0.3 Mb) remain unanchored. Use of a single partially inbred individual, construction and assembly of a range of genomic library sizes, and the relatively low repeat and duplication content of the stickleback genome have produced a highly contiguous anchored genome assembly with contig and scaffold sizes much larger than other published teleosts^19,20,21,22 (Supplementary Table 1).

The stickleback sequence was annotated using the Ensembl pipeline, which predicted 20,787 protein-coding and 1,617 RNA genes (Supplementary Table 2). Of the protein-coding genes, 7,614 showed one-to-one orthology with mammals and an additional 7,192 showed one-to-one orthology among fishes. The other 5,981 genes showed complex orthology relationships, including some lineage-specific gene expansions that contribute to stickleback adaptations (for example, a duplicated mucin family encoding glue proteins used for male nest building²³). A total of 13.4% of the stickleback genome appeared to be under evolutionary constraint when compared with other fishes using PhastCons²⁴. The conserved portion was roughly equally divided between protein-coding and non-coding sequences, with ∼71% of the latter shared with mammals and ∼29% representing fish-specific conserved sequences (Supplementary Table 3).

Sequencing additional population pairs

To search for loci underlying repeated evolution in sticklebacks, we first identified populations showing characteristic marine and freshwater morphology (Fig. 1a, Supplementary Fig. 1 and Supplementary Table 4). Repeated adaptation to divergent marine and freshwater environments resulted in marked correlated changes in body shape, length, depth, fin position, spine length, eye size and armour plate number (Fig. 1b). Because quantitative trait loci (QTL) controlling these traits map to many different chromosomes^{12,25,26,27,28,29,30}, this morphological screen should identify populations differing in a genome-wide range of adaptive loci underlying marine–freshwater differences.

Figure 1: **Genome scans for parallel marine–freshwater divergence.**

From the distinct morphological clusters of marine and freshwater fish, we selected multiple marine–freshwater pairs, from both Pacific and Atlantic populations, including individuals from opposite ends of rivers with marine–freshwater hybrid zones¹⁶ (21 fish in total, including the reference genome individual). The sampling strategy should minimize geographic bias in the data set, while maximizing the chance for local exchange of neutral regions of the genome.

We generated 2.3× average coverage per individual using Illumina sequencing (Supplementary Table 5 and Supplementary Information). To identify single nucleotide polymorphisms (SNPs), we pooled data from all fish and identified positions where at least four reads support a variant allele. This criterion identified 5,897,368 candidate SNPs (Supplementary Table 6), with most being true positives based on experimental validation (n = 48 tested, 82.6% confirmed; Supplementary Information).

Genome-wide survey of parallel evolution

Previous studies have shown that repeated armour evolution in sticklebacks occurs through ancient variants at the EDA locus, which are reused in multiple freshwater populations¹¹ and are subject to strong selection³¹. To identify loci where alleles have similarly been used repeatedly during adaptive divergence of marine and freshwater fish, we used two methods to look for regions where sequences of most freshwater fish were similar to each other, but differed from sequences typically found in marine populations. Note that this pattern will not identify adaptive variants that are unique to individual freshwater populations, but instead focuses on variants with striking evidence of biological replication across populations.

First, we developed a self-organizing map-based iterative Hidden Markov Model (SOM/HMM) to identify the 20 most common patterns of genetic relationships (‘trees’) among the 21 individuals. Genomic regions were assigned to pattern types on the basis of likelihood, with boundaries defined using HMM transitions. This method iteratively models recurring phylogenetic patterns on a local genomic basis with increasing resolution (Fig. 1c and Supplementary Information). Most of the genome was assigned to trees describing geographic relationships between populations (for example, distinct Pacific versus Atlantic clades, each containing marine and freshwater fish; Supplementary Table 7 and Supplementary Figs 2 and 3). A total of 215 regions comprising 2,096,101 bp (0.46% of the genome; median size: 4,684 bp) were assigned to one of four trees separating most marine from most freshwater fish (Supplementary Fig. 3, trees a–d). After filtering, the most prevalent marine–freshwater divergent tree identified 90 genomic regions with a median size of 4,266 bp covering 848,691 bp (0.18% of the genome).

Second, we used a genetic distance-based approach (Fig. 1c) based on building 21 × 21 pairwise nucleotide divergence (π) matrices for each of 877,568 overlapping windows across the genome (2,500 bp, step size: 500 bp). Each distance matrix was used to calculate a marine–freshwater cluster separation score (CSS), quantifying the average distance between marine and freshwater clusters after accounting for variance within ecotypes (Supplementary Information). The score is highly correlated with genetic distance (F_ST), but provides increased resolution under high divergence (Supplementary Fig. 4). After permutation testing, we recovered 174 marine–freshwater divergent regions, covering a total of 1,214,500 bp (0.26% of the genome; median size: 3,000 bp) at a 5% false discovery rate (FDR), and 84 divergent regions covering 479,500 bp (0.10% of the genome; median: 4,000 bp) at 2% FDR. To assign cluster membership in highly divergent genomic regions, we also used an unguided Bayesian model-based data-driven clustering (DDC; Fig. 2c and Supplementary Information). For each window of the genome, we estimated the most likely number of distinct clusters of fish (k = 0 to 5) and the cluster memberships.

Figure 2: **Parallel divergence signals at known armour plate locus.**

The independent SOM/HMM and CSS approaches both successfully recover the previously described chromosome IV EDA locus among the top-scoring marine–freshwater divergent regions (Fig. 2). Notably, the cluster membership assigned by DDC successfully recapitulates the breakpoints of the minimal 16-kb shared freshwater EDA haplotype (Fig. 2c) previously defined by a multi-year positional cloning study of the major locus controlling armour plate differences in sticklebacks¹¹. Additional regions were identified on the same chromosome with similar marine–freshwater divergence patterns, including regions surrounding the developmental signalling gene WNT7B (Supplementary Fig. 5), and a locus involved in hormone and neurotransmitter binding and metabolism (sulphotransferase 4a1, SULT4A (ref. 32)). SOM/HMM and CSS defined many other loci that also show globally shared marine–freshwater divergence, including 242 regions identified by either method (0.5% of the genome), and 147 regions identified by both (0.2% of the genome). The median size of recovered regions (<5 kb) approaches the size of individual genes, and often highlights purely intergenic regions, such as the exclusively non-coding region identified between BANP and RAS on chromosome XIX (Supplementary Fig. 6). The genomic distribution, sizes and overlaps of recovered regions are described in Fig. 3, Supplementary Fig. 7 and Supplementary Table 8, including a list of specific genes identified in top-scoring regions (Supplementary Data 1). Using genotyping assays for SNPs in 11 regions recovered by both SOM/HMM and CSS analyses, we found that 91% of tested regions show significant enrichment of ecotypic alleles in independent marine and freshwater populations (Supplementary Information). These results confirm that our experimental design successfully identifies both known and novel loci consistently associated with parallel evolution of distinct marine and freshwater ecotypes.

Figure 3: **Genome-wide distribution of marine–freshwater divergence regions.**

Compared to the genome overall, the 242 regions implicated in repeated marine–freshwater evolution show higher gene density (Supplementary Fig. 8, P < 4.5 × 10⁻¹³) and higher concentration of conserved non-coding sequences in intergenic regions (Supplementary Fig. 9, P < 1.9 × 10⁻¹¹), probably reflecting a more complex regulatory architecture³³. Gene Ontology analysis shows significant enrichment of genes involved in cellular response to signals, behavioural interaction between organisms, amine and fatty acid metabolism, cell–cell junctions and WNT signalling (Supplementary Table 9). Changes in these biological processes, and in the individual genes defined by parallel divergence analysis, probably underlie recurrent differences in morphology, physiology and behaviour previously described in marine and freshwater sticklebacks⁷. For example, the WNT7B and WNT11 family members identified by the genomic survey have previously been implicated in a paracrine signalling pathway that controls kidney collecting tubule length and diameter³⁴. Fish living in fresh water produce copious hypotonic urine compared to marine fish³⁵, and long-term adaptation to freshwater may select for variants in the same developmental signalling pathways that polarize epithelial cell divisions and regulate kidney tubule formation in other animals.

Extent of parallel reuse in hybrid zones

Although our method identifies regions used repeatedly during stickleback evolution, it does not tell us how prevalent such regions are among all differentiated loci in a particular marine–freshwater species pair. To address this, we analysed patterns of genomic differentiation across a marine–freshwater hybrid zone in River Tyne, Scotland (Fig. 4a). Previous studies show that ecologically mediated postzygotic selection maintains distinct ecotypes in this system, despite hybridization and opportunity for extensive gene flow¹⁶. Whole-genome sequencing of a pair of marine and freshwater fish from either end of the Tyne hybrid zone identified a set of genomic windows with high divergence. Within the top 0.1% divergent windows, 35.3% contain elevated globally shared marine–freshwater divergence (Fig. 4b and Supplementary Information), indicating an ancient shared origin for many, but not all, loci with highly differentiated alleles in this marine–freshwater species pair. Previous studies have shown that some traits in sticklebacks evolve by independent mutations that vary among populations¹⁰. The non-globally shared divergent alleles in the Tyne may also represent recent, or locally arising, adaptive variants, although further studies will be required to link such variants to particular traits, or to distinguish them from neutral but highly variable regions of the stickleback genome.

Figure 4: **How much of local marine–freshwater adaptation occurs by reuse of global variants?**

Marine–freshwater chromosome inversions

When adaptive divergence occurs in hybridizing systems, theory predicts that selection can favour molecular mechanisms that suppress recombination between independent adaptive loci¹⁷. We observed extended stretches of elevated CSS spanning 442 kb, 412 kb and 1,700 kb on chromosomes I, XI and XXI, respectively (Fig. 3). On the basis of sharp transitions in CSS score and DDC cluster assignments at the boundaries, we hypothesized that chromosomal inversions explain these extended regions. By analysing paired-end sequence reads from a marine large-insert (∼220 kb) bacterial artificial chromosome (BAC) library³⁶, we identified individual clones with size and orientation anomalies relative to the freshwater reference genome assembly. The only locations with five or more anomalous clones mapped to chromosomes I, XI and XXI, and these anomalies could be resolved by the presence of inverted chromosome segments between the marine fish and the freshwater reference genome (Fig. 5a, b). Sequences flanking the predicted inversion breakpoints contain inverted repeats, consistent with generation of inversions by intra-chromosomal recombination (Supplementary Fig. 10). Notably, repeats flanking the chromosome XI inversion contained alternative 3′ exons for the voltage-gated potassium channel gene KCNH4. Because KCNH4 transcription is initiated within the inversion, alternative inversion orientations could generate marine- and freshwater-specific KCNH4 isoforms (Fig. 5c). Although the functional consequences of such ecotype-specific isoforms remain unknown, KCNH4 homologues help to maintain resting currents, affect cardiac contractility, and alter performance on cognitive tasks if perturbed in mice^37,38,39. Furthermore, QTL for two distinct marine–freshwater divergent traits have previously been mapped to the broad region of the chromosome XXI inversion (Fig. 5d)^27,30, as expected if inversions help to maintain linkage between different adaptive QTLs⁴⁰.

Figure 5: **Chromosome inversions and marine–freshwater divergence.**

Importantly, cluster assignment of individual fish by DDC shows that most marine and freshwater populations in the Pacific carry contrasting forms of the inversion regions (Supplementary Table 10). Similar ecotype associations are seen in the Atlantic basin for chromosome I (no exceptions), XI (two exceptions), and to a lesser extent for chromosome XXI (three freshwater exceptions). Genetic markers within the chromosome I and XXI regions are polymorphic in hybrid zones, and show large frequency differences when genotyped in adjacent upstream and downstream fish, confirming that these regions are subject to divergent selection in marine and freshwater habitats (Supplementary Table 10). Our results help to explain the broader patterns of genomic divergence seen in Fig. 3, and add to growing evidence that chromosome inversions are a common genomic mechanism that maintains contrasting ecotypes in hybridizing natural populations^41,42,43,44.

Proportion of regulatory and coding change

Identification of a genome-wide set of loci used repeatedly in stickleback adaptation provides a rare opportunity to estimate the relative contribution of coding and regulatory changes underlying adaptive evolution in natural populations. To examine this issue, we analysed 64 marine–freshwater divergent regions with the strongest evidence of parallel evolution: those identified by both SOM/HMM and CSS analyses using the strictest significance thresholds (Supplementary Information and Supplementary Data 1), and containing SNPs showing perfect allele–ecotype association between marine and freshwater fish. Many of these 64 regions (41%) mapped entirely to non-coding regions of the genome, and presumably contain regulatory changes (Fig. 6a). A smaller fraction contains protein-coding sequences with consistent non-synonymous substitutions between marine and freshwater fish (17%). Finally, a fraction of regions (43%) include both coding and non-coding sequences (including non-coding RNAs), but lack ecotype-specific amino acid substitutions (Supplementary Data 1). Because all of these regions contain SNPs with perfect allele–ecotype association that do not cause protein-coding changes, they also probably contribute to adaptive divergence by regulatory alterations. The combined data suggest that both coding and regulatory differences contribute to parallel stickleback evolution, with regulatory changes accounting for a much larger proportion of the overall set of loci repeatedly selected during marine–freshwater divergence.

Figure 6: **Contributions of coding and regulatory changes to parallel marine–freshwater stickleback adaptation.**

To assess further the possible role of gene regulatory evolution in stickleback evolution, we constructed whole-genome expression arrays to compare levels of gene expression in tissues from Little Campbell River (LITC) marine and Fish Trap Creek (FTC) freshwater fish. Of 12,594 informative genes across the genome, 2,817 showed significant expression differences between ecotypes. Genes with marine–freshwater expression differences were significantly more likely to occur in or near the adaptive regions recovered by SOM/HMM or CSS analysis (Fig. 6b, P < 7.1 × 10⁻⁸). Although expression differences can be due to either cis- or trans-acting changes, the expression data are consistent with an important role of regulatory changes during parallel evolution of marine and freshwater sticklebacks.

Discussion

Progress in genetic mapping and positional cloning approaches has recently made it possible to identify a few individual genes and mutations that contribute to phenotypic differences between stickleback populations^10,11,12,25. Despite this progress, identifying many such examples using genetic linkage mapping alone would require years of additional effort. Fortunately, the highly replicated nature of stickleback evolution provides clear molecular signatures that can be used to recover many loci consistently associated with parallel marine–freshwater adaptation. The signal resolution of repeatedly used adaptive loci approaches ∼5 kb, often identifying single genes or intergenic regions, and offering a significant advantage over the several hundred kilobase candidate intervals typically identified in genetic mapping crosses^11,12, or the megabase or larger regions identified in previous selection scans of the stickleback genome¹³. The many marine–freshwater divergent loci and gene expression changes identified in the current study will substantially accelerate ongoing searches for the genetic and molecular basis of fitness-related morphological, physiological and behavioural differences between marine and freshwater fish.

In addition, the genome-wide set of divergent regions already provides new insights into evolutionary processes shaping adaptive evolution and ecological speciation. Our results indicate that parallel evolution of marine and freshwater sticklebacks occurs by dynamic reassembly of many ‘islands’ of divergence distributed across many chromosomes. Reassembly by linkage is probably strengthened by inversions that distinguish marine and freshwater ecotypes. Differences in both globally shared and locally restricted genetic variation actively maintained across a hybrid zone provide a snapshot of the genomic architecture and evolutionary processes contributing to the early stages of reproductive isolation. Finally, our data indicate that repeated evolution of marine–freshwater differences depends on both protein-coding and regulatory changes. Regulatory evolution seems to have a particularly prominent role, as indicated by the increased density of conserved non-coding intergenic sequences found near marine–freshwater divergent loci (Supplementary Fig. 9); the substantial fraction of loci mapping entirely to non-coding regions (Fig. 6a); and the significant enrichment of genes with expression differences near key regions used for parallel evolution (Fig. 6b). Mutations causing structural changes in proteins are the most abundant variants recovered in laboratory Escherichia coli and yeast evolution experiments^45,46. They make up 90% of 40 published examples of adaptive changes between closely related taxa⁴, and 63–77% of the known molecular basis of phenotypic traits in domesticated or wild species⁵. The larger fraction of regulatory changes implicated during repeated stickleback evolution may reflect our use of whole-genome rather than candidate gene approaches, stronger selection against loss-of-function and pleiotropic protein-coding changes in natural populations than in laboratory or domesticated organisms^1,2,3, or an increasing prevalence of regulatory changes at interspecific compared to intraspecific levels^5,47, including emerging species such as marine and freshwater sticklebacks.

Although our study has focused on marine–freshwater divergence, freshwater sticklebacks also repeatedly evolve characteristic lake–stream differences; open-water and bottom-dwelling lake ecotypes; gigantism in particular lakes; and substantial changes in seasonality and life history^6,7,48,49,50. Given the considerable fraction of parallel stickleback evolution probably occurring by shared variants (Fig. 4b), sequencing of additional populations should make it possible to identify similarly shared loci contributing to other ecological traits, again using the power of replicated evolution to illuminate both specific and general mechanisms underlying evolutionary change in natural populations.

Methods Summary

A reference stickleback genome sequence was assembled from a single female freshwater stickleback (Bear Paw Lake, Alaska), using 9× coverage of paired-end Sanger-sequenced reads from multiple insert size libraries. Scaffolds were assigned to linkage groups in a genetic cross, and annotation was carried out using the Ensembl evidence-based pipeline. Twenty-one fish from independent populations were chosen for short-read sequencing (48× combined coverage) based on morphometric analysis. Patterns of genetic variation were analysed for divergence between marine and freshwater fish, using both a self-organizing map/Hidden Markov Model and a pairwise distance matrix approach (see Supplementary Information). Paired-end reads from a marine BAC library were placed against the reference freshwater genome sequence to identify possible chromosome rearrangements. Sequenom iPlex genotyping assays were carried out to verify predicted SNPs and divergent marine–freshwater regions. RNA samples were prepared from tissues of marine and freshwater fish born and raised under identical laboratory conditions. Significant expression differences were detected with Agilent microarrays using eBayes (limma R package). GO category enrichments were analysed using GOstats (BioConductor 2.7). Additional methods and analyses are provided in Supplementary Information.

Accession codes

Primary accessions

Gene Expression Omnibus

GSE34783

Data deposits

UCSC Genome browser tracks showing genome-wide analyses are available at http://sticklebrowser.stanford.edu. Microarray expression data are deposited at the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under accession number GSE34783. BAC-end sequences are deposited at http://www.ncbi.nlm.nih.gov/dbGSS (accession numbers JS583469 to JS583576).

References

Stern, D. L. Perspective: Evolutionary developmental biology and the problem of variation. Evolution 54, 1079–1091 (2000)
Article CAS Google Scholar
Carroll, S. B. Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell 134, 25–36 (2008)
Article CAS Google Scholar
Wray, G. The evolutionary significance of cis-regulatory mutations. Nature Rev. Genet. 8, 206–216 (2007)
Article CAS Google Scholar
Hoekstra, H. E. & Coyne, J. A. The locus of evolution: evo devo and the genetics of adaptation. Evolution 61, 995–1016 (2007)
Article Google Scholar
Stern, D. L. & Orgogozo, V. The loci of evolution: how predictable is genetic evolution? Evolution 62, 2155–2177 (2008)
Article Google Scholar
McKinnon, J. S. & Rundle, H. D. Speciation in nature: the threespine stickleback model systems. Trends Ecol. Evol. 17, 480–488 (2002)
Article Google Scholar
Bell, M. A. & Foster, S. A. The Evolutionary Biology of the Threespine Stickleback (Oxford Univ. Press, 1994)
Google Scholar
Endler, J. A. Natural selection in the wild. Monogr. Popul. Biol. 21, 1–336 (1986)
Google Scholar
Kingsley, D. M. & Peichel, C. L. The molecular genetics of evolutionary change in sticklebacks. In Biology of the Threespine Stickleback 41–81 (CRC Press, 2007)
Google Scholar
Chan, Y. F. et al. Adaptive evolution of pelvic reduction in sticklebacks by recurrent deletion of a Pitx1 enhancer. Science 327, 302–305 (2010)
Article ADS CAS Google Scholar
Colosimo, P. F. et al. Widespread parallel evolution in sticklebacks by repeated fixation of Ectodysplasin alleles. Science 307, 1928–1933 (2005)
Article ADS CAS Google Scholar
Miller, C. T. et al. cis-Regulatory changes in Kit ligand expression and parallel evolution of pigmentation in sticklebacks and humans. Cell 131, 1179–1189 (2007)
Article CAS Google Scholar
Hohenlohe, P. A. et al. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet. 6, e1000862 (2010)
Article Google Scholar
Kitano, J. et al. Adaptive divergence in the thyroid hormone signaling pathway in the stickleback radiation. Curr. Biol. 20, 2124–2130 (2010)
Article CAS Google Scholar
Hagen, D. Isolating mechanisms in threespine sticklebacks (Gasterosteus). J. Fish. Res. Board Can. 24, 1637–1692 (1967)
Article Google Scholar
Jones, F., Brown, C., Pemberton, J. & Braithwaite, V. Reproductive isolation in a threespine stickleback hybrid zone. J. Evol. Biol. 19, 1531–1544 (2006)
Article CAS Google Scholar
Barton, N. H. & Gale, K. S. Genetic analysis of hybrid zones. In Hybrid Zones and the Evolutionary Process 13–45 (Oxford Univ. Press, 1993)
Google Scholar
Vinogradov, A. E. Genome size and GC-percent in vertebrates as determined by flow cytometry: the triangular relationship. Cytometry 31, 100–109 (1998)
Article CAS Google Scholar
Aparicio, S. et al. Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science 297, 1301–1310 (2002)
Article ADS CAS Google Scholar
Jaillon, O. et al. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431, 946–957 (2004)
Article ADS Google Scholar
Kasahara, M. et al. The medaka draft genome and insights into vertebrate genome evolution. Nature 447, 714–719 (2007)
Article ADS CAS Google Scholar
Star, B. et al. The genome sequence of Atlantic cod reveals a unique immune system. Nature 477, 207–210 (2011)
Article ADS CAS Google Scholar
Kawahara, R. & Nishida, M. Extensive lineage-specific gene duplication and evolution of the spiggin multi-gene family in stickleback. BMC Evol. Biol. 7, 209 (2007)
Article Google Scholar
Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005)
Article CAS Google Scholar
Shapiro, M. D. et al. Genetic and developmental basis of evolutionary pelvic reduction in threespine sticklebacks. Nature 428, 717–723 (2004)
Article ADS CAS Google Scholar
Peichel, C. L. et al. The genetic architecture of divergence between threespine stickleback species. Nature 414, 901–905 (2001)
Article ADS CAS Google Scholar
Colosimo, P. F. et al. The genetic architecture of parallel armor plate reduction in threespine sticklebacks. PLoS Biol. 2, 635–641 (2004)
Article CAS Google Scholar
Cresko, W. A. et al. Parallel genetic basis for repeated evolution of armor loss in Alaskan threespine stickleback populations. Proc. Natl Acad. Sci. USA 101, 6050–6055 (2004)
Article ADS CAS Google Scholar
Kimmel, C. B. et al. Evolution and development of facial bone morphology in threespine sticklebacks. Proc. Natl Acad. Sci. USA 102, 5791–5796 (2005)
Article ADS CAS Google Scholar
Albert, A. Y. K. et al. The genetics of adaptive shape shift in stickleback: pleiotropy and effect size. Evolution 62, 76–85 (2008)
PubMed Google Scholar
Barrett, R. D. H., Rogers, S. M. & Schluter, D. Natural selection on a major armor gene in threespine stickleback. Science 322, 255–257 (2008)
Article ADS CAS Google Scholar
Allali-Hassani, A. et al. Structural and chemical profiling of the human cytosolic sulfotransferases. PLoS Biol. 5, e97 (2007)
Article Google Scholar
Knecht, A. K., Hosemann, K. E. & Kingsley, D. M. Constraints on utilization of the EDA-signaling pathway in threespine stickleback evolution. Evol. Dev. 9, 141–154 (2007)
Article CAS Google Scholar
Yu, J. et al. A Wnt7b-dependent pathway regulates the orientation of epithelial cell division and establishes the cortico-medullary axis of the mammalian kidney. Development 136, 161–171 (2009)
Article CAS Google Scholar
Marshall, W. S. & Grosell, M. Ion transport, osmoregulation and acid-base balance. In The Physiology of Fishes 177–230 (CRC Press, 2006)
Google Scholar
Kingsley, D. M. et al. New genomic tools for molecular studies of evolutionary change in threespine sticklebacks. Behaviour 141, 1331–1344 (2004)
Article Google Scholar
Miyake, A., Mochizuki, S., Yokoi, H., Kohda, M. & Furuichi, K. New ether-à-go-go K⁺ channel family members localized in human telencephalon. J. Biol. Chem. 274, 25018–25025 (1999)
Article CAS Google Scholar
Miyake, A. et al. Disruption of the ether-à-go-go K⁺ channel gene BEC1/KCNH3 enhances cognitive function. J. Neurosci. 29, 14637–14645 (2009)
Article CAS Google Scholar
Gutman, G. A. et al. International Union of Pharmacology. LIII. Nomenclature and molecular relationships of voltage-gated potassium channels. Pharmacol. Rev. 57, 473–508 (2005)
Article CAS Google Scholar
Kirkpatrick, M. & Barton, N. Chromosome inversions, local adaptation and speciation. Genetics 173, 419–434 (2006)
Article CAS Google Scholar
Hoffmann, A. A. & Rieseberg, L. H. Revisiting the impact of inversions in evolution: from population genetic markers to drivers of adaptive shifts and speciation? Annu. Rev. Ecol. Evol. Syst. 39, 21–42 (2008)
Article Google Scholar
Lowry, D. B. & Willis, J. H. A widespread chromosomal inversion polymorphism contributes to a major life-history transition, local adaptation, and reproductive isolation. PLoS Biol. 8, e1000500 (2010)
Article Google Scholar
Joron, M. et al. Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry. Nature 477, 203–206 (2011)
Article ADS CAS Google Scholar
Feder, J. L., Roethele, J. B., Filchak, K., Niedbalski, J. & Romero-Severson, J. Evidence for inversion polymorphism related to sympatric host race formation in the apple maggot fly, Rhagoletis pomonella. Genetics 163, 939–953 (2003)
CAS PubMed PubMed Central Google Scholar
Barrick, J. E. et al. Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 461, 1243–1247 (2009)
Article ADS CAS Google Scholar
Kvitek, D. J. & Sherlock, G. Reciprocal sign epistasis between frequently experimentally evolved adaptive mutations causes a rugged fitness landscape. PLoS Genet. 7, e1002056 (2011)
Article CAS Google Scholar
Wittkopp, P. & Haerum, B. K. Regulatory changes underlying expression differences within and between Drosophila species. Nature Genet. 40, 346–350 (2008)
Article CAS Google Scholar
Reimchen, T. E., Stinson, E. M. & Nelson, J. S. Multivariate differentiation of parapatric and allopatric populations of threespine stickleback in the Sangan River watershed, Queen Charlotte Islands. Can. J. Zool. 63, 2944–2951 (1985)
Article Google Scholar
Deagle, B. E. et al. Population genomics of parallel phenotypic evolution in stickleback across stream–lake ecological transitions. Proc. R. Soc.. B 279, 1277–1286 (2011)
Article Google Scholar
McPhail, J. D. Speciation and the evolution of reproductive isolation in the sticklebacks (Gasterosteus) of south-western British Columbia. In The Evolutionary Biology of the Threespine Stickleback 399–437 (Oxford Univ. Pres, 1994)

Download references

Acknowledgements

Stickleback sequencing at Broad Institute was supported by grants from the National Human Genome Research Institute (NHGRI). R.M.M., J.S., J.G. and D.M.K. were supported by NHGRI CEGS Grant P50-HG002568; Y.F.C. by a Stanford Affymetrix Bio-X Graduate Fellowship; C.T.M. by the Jane Coffins Childs Fund; and B.R.S., T.R.H. and A.A.P. by graduate fellowships from NSF and NDSEG. D.M.K. is an investigator of the Howard Hughes Medical Institute. K.L.-T. is a EURYI award recipient funded by ESF. We thank W. Cresko for the BEPA individual used for reference genome sequencing; M. Bell, J. McKinnon, B. Jónsson, S. Mori, C. Peichel, D. Schluter, M. Kalbe, T. Reimchen, D.-P. Højgaard, M. McLaughlin, B. Geyti and B. Blackman for discussions and assistance with specimens used in population surveys; and G. Bejerano for useful discussions and assistance with computational analysis.

Author information

Yingguang Frank Chan, Evan Mauceli, Mono Pirun & Craig T. Miller
Present address: Present addresses: Max Planck Institute for Evolutionary Biology, August-Thienemann-Str. 2, Plön 24306, Germany (Y.F.C.); Children's Hospital Boston, Genetic Diagnostic Lab, 300 Longwood Avenue, Boston, Massachusetts 02115, USA (E.M.); Bioinformatics Core, Zuckerman Research Center, New York, New York 10065, USA (M.P.); Department of Molecular & Cell Biology, 142 LSA 3200, University of California, Berkeley, California 94720, USA (C.T.M.).,
Felicity C. Jones, Manfred G. Grabherr, Yingguang Frank Chan and Pamela Russell: These authors contributed equally to this work.

Authors and Affiliations

Department of Developmental Biology, Beckman Center B300, Stanford University School of Medicine, Stanford, 94305, California, USA
Felicity C. Jones, Yingguang Frank Chan, Craig T. Miller, Brian R. Summers, Anne K. Knecht, Shannon D. Brady, Haili Zhang, Alex A. Pollen, Timothy Howes & David M. Kingsley
Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, 02142, Massachusetts, USA
Manfred G. Grabherr, Pamela Russell, Evan Mauceli, Jeremy Johnson, Ross Swofford, Mono Pirun, Michael C. Zody, Eric S. Lander, Federica Di Palma & Kerstin Lindblad-Toh
Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala 751 23, Sweden,
Manfred G. Grabherr & Kerstin Lindblad-Toh
Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK,
Simon White & Stephen Searle
European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK,
Ewan Birney
HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, Alabama 35806, USA,
Jeremy Schmutz, Jane Grimwood, Mark C. Dickson & Richard M. Myers
Department of Molecular Genetics, Benaroya Research Institute at Virginia Mason, 1201 Ninth Avenue, Seattle Washington 98101, USA,
Chris Amemiya
Howard Hughes Medical Institute, Stanford University, Stanford, 94305, California, USA
David M. Kingsley
Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA.,
Jen Baldwin, Toby Bloom, David B. Jaffe, Robert Nicol & Jane Wilkinson

Authors

Felicity C. Jones
View author publications
You can also search for this author in PubMed Google Scholar
Manfred G. Grabherr
View author publications
You can also search for this author in PubMed Google Scholar
Yingguang Frank Chan
View author publications
You can also search for this author in PubMed Google Scholar
Pamela Russell
View author publications
You can also search for this author in PubMed Google Scholar
Evan Mauceli
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Ross Swofford
View author publications
You can also search for this author in PubMed Google Scholar
Mono Pirun
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Zody
View author publications
You can also search for this author in PubMed Google Scholar
Simon White
View author publications
You can also search for this author in PubMed Google Scholar
Ewan Birney
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Searle
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Schmutz
View author publications
You can also search for this author in PubMed Google Scholar
Jane Grimwood
View author publications
You can also search for this author in PubMed Google Scholar
Mark C. Dickson
View author publications
You can also search for this author in PubMed Google Scholar
Richard M. Myers
View author publications
You can also search for this author in PubMed Google Scholar
Craig T. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Brian R. Summers
View author publications
You can also search for this author in PubMed Google Scholar
Anne K. Knecht
View author publications
You can also search for this author in PubMed Google Scholar
Shannon D. Brady
View author publications
You can also search for this author in PubMed Google Scholar
Haili Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Alex A. Pollen
View author publications
You can also search for this author in PubMed Google Scholar
Timothy Howes
View author publications
You can also search for this author in PubMed Google Scholar
Chris Amemiya
View author publications
You can also search for this author in PubMed Google Scholar
Eric S. Lander
View author publications
You can also search for this author in PubMed Google Scholar
Federica Di Palma
View author publications
You can also search for this author in PubMed Google Scholar
Kerstin Lindblad-Toh
View author publications
You can also search for this author in PubMed Google Scholar
David M. Kingsley
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Broad Institute Genome Sequencing Platform & Whole Genome Assembly Team

Jen Baldwin
, Toby Bloom
, David B. Jaffe
, Robert Nicol
& Jane Wilkinson

Contributions

K.L.-T., F.D.P., E.S.L. and D.M.K. planned and oversaw the project and K.L.-T. and D.M.K. are co-senior authors. E.M. and M.G.G. assembled, J.S., J.G., M.C.D., A.K.K. and R.M.M. anchored, and S.W., E.B. and S.S. annotated the reference genome. C.A. constructed the BEPA BAC library. F.C.J., Y.F.C., D.M.K., K.L.-T., F.D. and M.G.G. designed the whole-genome re-sequencing experiment. F.C.J. and Y.F.C. performed morphometric analyses. S.D.B. and J.J. prepared and coordinated samples. M.G.G., M.P. and M.C.Z. analysed pilot data and performed simulations to evaluate sequencing strategies. M.G.G., P.R., E.M., F.C.J., Y.F.C., J.J. and R.S. analysed polymorphisms. P.R. and M.G.G. developed and carried out the SOM/HMM analysis. F.C.J. and Y.F.C. developed and carried out the CSS and DDC analysis. P.R. analysed gene and non-coding element density, and performed phylogenetic analysis. T.H. analysed GO term enrichments. F.C.J. and Y.F.C. carried out hybrid zone analysis. C.T.M., B.R.S., J.G., J.S., Y.F.C. and F.C.J. analysed chromosome inversions. F.C.J. and D.M.K. performed analysis of coding and regulatory changes. H.Z., A.A.P. and T.H. performed the whole-genome expression study. D.M.K., F.C.J., Y.F.C., P.R., F.D.P. and K.L.-T. wrote the paper with input from other authors.

Corresponding authors

Correspondence to Kerstin Lindblad-Toh or David M. Kingsley.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Methods and Data, Supplementary Figures 1-10 and Supplementary Tables 1-10. The Supplementary Figures and Tables contain additional details on the genome assembly; gene annotation, sampled populations; morphometric analysis; number and details regarding the SOM/HMM and CSS methods; additional example loci; genome-wide summary of re-sequencing data; specific features of marine–freshwater divergent regions; and chromosome inversions. (PDF 8383 kb)

Supplementary Data

This file contains data supporting the conclusions in Figure 6, specifically genes located in the 81 regions jointly identified by both the SOM/HMM and the CSS methods. Annotations on protein coding changes and gene expression differences are also included. (XLS 59 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

PowerPoint slide for Fig. 5

PowerPoint slide for Fig. 6

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/).

Reprints and permissions

About this article

Cite this article

Jones, F., Grabherr, M., Chan, Y. et al. The genomic basis of adaptive evolution in threespine sticklebacks. Nature 484, 55–61 (2012). https://doi.org/10.1038/nature10944

Download citation

Received: 17 September 2011
Accepted: 08 February 2012
Published: 04 April 2012
Issue Date: 05 April 2012
DOI: https://doi.org/10.1038/nature10944

This article is cited by

Alternative splicing and environmental adaptation in wild house mice
- David N. Manahan
- Michael W. Nachman
Heredity (2024)
Lack of genetic differentiation between two sympatric lacustrine morpho-species within the Astyanax (Characidae: Teleostei) genus, Mexico
- Claudia Patricia Ornelas-García
- Elena G. Gonzalez
- Ignacio Doadrio
Ichthyological Research (2024)
A novel tetra-primer ARMS-PCR approach for the molecular karyotyping of chromosomal inversion 2Ru in the main malaria vectors Anopheles gambiae and Anopheles coluzzii
- Verena Pichler
- Antoine Sanou
- Nora J. Besansky
Parasites & Vectors (2023)
Chromosome fusions repatterned recombination rate and facilitated reproductive isolation during Pristionchus nematode speciation
- Kohta Yoshida
- Christian Rödelsperger
- Ralf J. Sommer
Nature Ecology & Evolution (2023)
Mixed strain pathogen populations accelerate the evolution of antibiotic resistance in patients
- Julio Diaz Caballero
- Rachel M. Wheatley
- R. Craig MacLean
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.