Abstract
Repeated evolution can provide insight into the mechanisms that facilitate adaptation to novel or changing environments. Here we study adaptation to altitude in two tropical butterflies, Heliconius erato and H. melpomene, which have repeatedly and independently adapted to montane habitats on either side of the Andes. We sequenced 518 whole genomes from altitudinal transects and found many regions differentiated between highland (~ 1200 m) and lowland (~ 200 m) populations. We show repeated genetic differentiation across replicate populations within species, including allopatric comparisons. In contrast, there is little molecular parallelism between the two species. By sampling five close relatives, we find that a large proportion of divergent regions identified within species have arisen from standing variation and putative adaptive introgression from high-altitude specialist species. Taken together our study supports a role for both standing genetic variation and gene flow from independently adapted species in promoting parallel local adaptation to the environment.
Similar content being viewed by others
Introduction
Understanding how organisms adapt to the environment is a central goal of evolutionary biology and highly relevant given the pace of global change. One approach is to explore the repeatability of local adaptation in the wild in order to understand whether phenotypic and genetic changes are predictable. On the one hand, repeated adaptation to similar environments can act as a ‘natural experiment’ and provide the means to identify the targets of selection, by distinguishing locally adaptive from neutral or globally beneficial changes1. On the other hand, these scenarios can allow us to test whether the same loci are repeatedly targeted across populations and species2. Despite many studies reporting repeated adaptation involving the same genes or alleles across lineages3,4,5, which we here term ‘molecular parallelism’, we know relatively little about the evolutionary mechanisms that facilitate this process.
Three main mechanisms can give rise to molecular parallelism in repeated adaptation (Fig. 1). Genetic variation upon which selection repeatedly acts may arise via independent mutations at the same gene or locus6. Beneficial variants may be recruited from ancestral standing variation7 or shared across populations of the same species via migration and gene flow8. Lastly, gene flow between species can facilitate the introgression of adaptive alleles9,10,11. A combination of these mechanisms may also be at play, for instance the high altitude adaptation Tibetan-EPAS1 haplotype was introgressed from Denisovan hominins but remained as neutral standing variation before positive selection occurred12.
The likelihood of molecular parallelism and the relative importance of each mechanism may be largely dependent on the divergence between the lineages under study (Fig. 1), but this has seldom been empirically tested3. For instance, populations that diverged recently and retain a large pool of standing genetic variation tend to reuse pre-existing alleles during repeated adaptation, as seen in freshwater adaptation in sticklebacks13, crypsis in beach mice14, or coastal ecotypes of bottlenose dolphins7. Similarly, organisms that readily hybridise in the wild are more likely to share beneficial alleles via adaptive introgression. This allows populations to rapidly adapt to, for instance, novel anthropogenic stressors such as pollutants15 or insecticides16.
Anthropogenic change is forcing organisms to move, adapt, or die, with many predicted to expand their ranges towards the highlands to escape warming and degrading lowland habitats17. We know very little about the genomics or predictability of adaptation to altitude, especially in the tropical insects despite making up about half of all described species (but see)18,19,20. The type and genomic architecture of the trait under study may also determine its predictability21. Phenotypes controlled by few, large-effect loci typically show predictable genetic paths of evolution, such as melanic colouration in mammals, fish, and birds22,23,24,25, perhaps due to selective constraints on genetic pathways21,26,27. Organisms adapting to complex environmental challenges that face multifarious selective regimes may show less predictable patterns, with functional redundancy among genes allowing for different combinations of alleles to achieve similar phenotypes28,29. Thus, understanding the relative importance of these mechanisms in determining the predictability of adaptation to the environment could inform future research into conservation strategies to protect biodiversity30.
Here, we study the genetic basis of repeated adaptation to altitude in two divergent Heliconius tropical butterflies, H. erato and H. melpomene. These aposematic, toxic species have very wide ranges and co-mimic each other across South America, commonly found from sea level to around 1600 m on both sides of the Andes31. In contrast, most other species in this genus have specialised to either highland Andean or lowland Amazonian habitats, with topography and climate shown to correlate with speciation rates across the clade32. The biogeography of these species’ ancestral ranges across the Andes is uncertain, with multiple plausible adaptive histories: ancestral adaptation to the highlands on one side of the Andes followed by expansion, gene flow of adaptive alleles between sides of the Andes, or fully independent in situ adaptation to altitude on either side of the Andes.
A recent study in this system showed that even a modest 800 m change in elevation in the Ecuadorian Andes leads to strong environmental differences, with temperatures in the lowlands more variable and, on average, 4 °C hotter, as well as threefold increase in the ‘drying power of the air’ (Vapour Pressure Deficit) which correlates with desiccation risk in ectotherms33. Phenotypic differences between highland and lowland populations of Heliconius have recently been identified, such as in wing shape34,35 and heat tolerance33, and are of similar magnitude in H. erato and H. melpomene. Thus, the observed environmental and phenotypic differences point towards an important role of local adaptation enabling these two species to inhabit wide altitudinal ranges.
We search for signatures of local adaptation to montane habitats with extensive sampling that harnesses the power of natural spatial replication within and across sides of the Andes, to assess the extent of molecular parallelism in adaptation to montane habitats at the upper end of the species altitudinal range (∼1200 m). We quantify parallelism at multiple levels of divergence: (i) replicate three-population transects within sides of the Andes connected via gene flow, (ii) trans-Andean allopatric three-population transects with no gene flow at these latitudes (but potential for gene flow in other regions), and (iii) two species that diverged 12 million years ago (Fig. 1A). Furthermore, we test whether the same haplotypes are under selection across transects and search for the mutational origin of candidate adaptive alleles (Fig. 1B). Overall, this large empirical study deepens our understanding of how organisms adapt to the environment and identifies both standing genetic variation and adaptive introgression from pre-adapted species as important mechanisms facilitating local adaptation.
Results and discussion
Divergence and diversity across elevations and transects
To study adaptation to altitude in H. erato and H. melpomene, we used whole-genome data from 518 re-sequenced individuals, 444 of which were sequenced for this study. Samples were collected from 111 different locations, which we grouped into 30 populations, corresponding to four three-population transects: Colombia West/East, and Ecuador West/East (Fig. 2, Supplementary Table 1). In each three-population transect, populations were either in the highlands (~1200 m) or lowlands (~200 m), itself divided into nearby or distant lowland sites (Fig. 2A). The Andes act as a barrier to gene flow at these latitudes, with populations on opposite sides of the Andes thought to have split ~0.9 and ~1.7 million years ago for H. erato and H. melpomene, respectively36,37,38 (for natural history see Supplementary Note 1). Individuals of each species clustered strongly into Western and Eastern groups in genome wide PCAs (S.I., Supplementary Fig. 1). In PCAs that only included populations from replicate transects of the same side of the Andes (two per species), structuring by altitude was absent in all but one comparison, H. erato East, where the H. erato highland population in Colombia corresponds to a different colour pattern subspecies and diverged moderately from other populations (S.I., Supplementary Fig. 1). Intraspecific pairwise differentiation between populations on the same side of the Andes increased with geographic distance but was generally low (Fst < 0.1; S.I., Supplementary Fig. 2). The effective replication over space and extensive gene flow within transects provide a powerful setting to study the genomics of parallel adaptation to altitude in the wild.
Genetic distance between populations can be increased by environmental differences that affect dispersal or survival of locally adapted migrants39,40. We tested for such isolation by environment, in our case altitude, with pairwise Fst across all populations of the same side of the Andes and species. At similar geographic distances, genetic differentiation was higher when comparing highland vs. lowland populations than when comparing lowland vs lowland populations. Isolation by altitude could be driven by local adaptation reducing gene flow between elevations or due to many adaptive sites diverging across the genome. This difference was stronger in H. erato than in H. melpomene (S.I., Supplementary Fig. 2). Fst was generally highest when comparing two highland sampling sites at relatively short distances, despite using a topographically informed ‘least-cost path’ as our measure of geographic distance (Supplementary Fig. 3). This could indicate topographical barriers decreasing gene flow, beyond what was captured by the least-cost path, or local adaptation in the highlands leading to increased selection against migrants. A pattern of isolation by environment could also arise due to, for instance, different demographic histories and higher levels of inbreeding in range-edge populations41,42,43. We found no consistent differences in nucleotide diversity (π) between elevations (S.I., Supplementary Fig. 4). Tajima’s D was negative across populations, characteristic of population expansion, but generally less negative in highland populations, suggesting less pronounced expansion or more recent/ongoing contractions in the highlands (S.I., Supplementary Fig. 4). Thus, both heterogenous demographic histories and selection against locally adapted migrants across elevations may lead to genome-wide isolation by environment.
Parallel high-altitude differentiation detected with population branch statistics
To identify genomic regions with high-altitude specific differentiation we calculated Population Branch Statistics (PBS) for three transects in each species, and Fst for the remaining two transects with more limited sampling (H. erato Colombia West, H. melpomene Ecuador West; Fig. 2A). PBS was originally developed to study high-altitude adaptation in humans44 and can distinguish between global and lineage specific differentiation by constructing a trifurcating population tree based on Fst that includes a geographically distant population15,42,45,46,47,48,49. By attributing fractions of differentiation to each branch, PBS identifies genomic regions disproportionally diverged in the focal population, consistent with loci positively selected in the highlands and either selected against or neutral in the lowlands44,50.
When assessing genome-wide average PBS trees the longest PBS branches corresponded to the low distant populations of both species, as expected under a model of neutral isolation by distance (Fig. 2B). H. erato had a consistently longer high-altitude branch compared to the lowland, but geographically nearby population, which could be indicative of increased drift or extensive local adaptation to altitude throughout the genome. Indeed, we detected many regions across the genome strongly differentiated in the highland populations across transects and species (Fig. 3). We defined High Differentiation Regions (HDRs hereafter) by adding a ± 50 kb buffer around outlier windows, i.e. those with zPBShigh (high-altitude branch) or zFst values above 4 (standardised Z-transformed, equivalent to >4 SD51), and merging overlapping intervals into discrete regions. The transects for which only two populations were sampled (Fst), had a higher number of HDRs: 400 and 405 HDRs, covering 11.4% and 17.1% of the genome for H. erato and H. melpomene, respectively (compared to, on average, 229 PBS-based HDRs covering 8%; details on S.I., Supplementary Note 2). This likely reflects the property of PBS to discern between population-specific and globally differentiated alleles.
To test for molecular parallelism in local adaptation, we assessed whether the same individual HDRs were repeatedly found in the highlands across replicate (same side of the Andes) or allopatric (opposite side of the Andes) transects within each species (Fig. 1A). In H. erato, 45% (±3.8 SD) and 38% (±4.4 SD) of HDRs overlapped between replicate transects within the Western and the Eastern Andes, respectively (shared HDRs, SHDRs hereafter; Fig. 3D blue, green). Of those SHDRs, more than a third were also shared across allopatric transects not connected via gene flow, 15% of the total HDRs (allopatric SHDRs hereafter; Fig. 3D red). H. melpomene had a slightly lower percentage of HDRs shared within sides of the Andes (West 33% ± 11 SD; East 27% ± 11 SD), but very few shared across sides of the Andes (allopatric SHDRs: 4% of the total). We then tested if the observed level of sharing was higher than expected under a null distribution of genomic regions, obtained by assessing sharing in randomly placed blocks across the genome of the same size and number as observed HDRs per transect. H. erato HDR sharing was higher than predicted under a null distribution in all three comparisons (replicate Eastern/Western and all transects, Fig. 3B). In H. melpomene, HDR sharing was significant between transects where gene flow is present (replicate Eastern/Western transects), whereas the low levels of allopatric parallelism did not differ from the null distribution (Fig. 3B, red). On average, 15.6% of SHDRs detected overlapped between species, but this fell within the null distribution given the number and size of SHDRs in each species (mean overlap = 14.06% ± 1.19).
In summary, we show that levels of parallelism are highest between populations of the same species connected by gene flow or sharing considerable ancestral variation, but low between species. Molecular parallelism within species could be facilitated by the high levels of nucleotide diversity observed52,53, indicating a large pool of shared variants upon which selection can repeatedly act. Furthermore, recurrent gene flow between replicate populations could facilitate the recruitment of new or standing adaptive alleles, as expected from theory54 and seen in other systems such as maize, Arabidopsis, or sticklebacks3,8,55. The reduced parallelism observed between allopatric populations of H. melpomene could be due to lower standing variation or less gene flow across the Andes, compared to H. erato.
The lack of significant molecular parallelism in altitude candidate loci between H. erato and H. melpomene, whose clades diverged 12 million years ago38, is in stark contrast with colour patterns56, where near-perfect local Müllerian mimics have arisen repeatedly in both species through independent mutations at a handful of conserved loci57,58. This difference in the extent of molecular parallelism might be explained by the nature of the trait under study: adaptation to altitude is multifarious and, as shown here, involves many genes. Genetic redundancy in polygenic adaptation may lead to evolution taking different paths to reach similar solutions, as shown for quantitative iridescent colouration in these two Heliconius species59 or in recent experimental evolution of thermal adaptation in Drosophila28,60. In contrast, the major effect loci that underlie switches in discrete colour patterning may favour genetic predictability even across divergent taxa, especially when selection is strong and reaching adaptive peaks requires large phenotypic shifts.
Highly differentiated regions show additional signatures of selection and, generally, are not associated with low recombination rates
We tested whether highly differentiated genomic regions shared across transects (SHDRs) showed additional evidence of positive selection by computing difference in nucleotide diversity (π) across elevations (Δπ = πhigh− πlow), deviation from neutrality in site frequency distributions (Tajima’s D), and absolute divergence (Dxy) for the same 5 kb windows. Processes other than positive selection, such as background selection, can decrease within-population diversity and thus lead to increased relative differentiation (Fst), especially in the absence of gene flow between populations61,62. Thus, it is important to test for enrichment of different selection statistics to strengthen our inference of locally adaptive loci. A reduced Δπ compared to the background would indicate that a selective sweep in the highlands reduced nucleotide diversity compared to the lowlands. In selective sweep regions, Tajima’s D is expected to be low, as regions with selected haplotypes that rapidly increased in frequency would have an excess of rare alleles. Finally, absolute sequence divergence (Dxy), is expected to be high in old selective sweeps or variants, and less affected by genetic variation within populations than relative measures of differentiation such as Fst63.
SHDRs were considered outliers for Δπ, Tajima’s D, or Dxy, if the observed maximum or minimum values within SHDRs < 10th percentile (or > 90th in Tajima’s D) of the simulated values, obtained from 10000 permutations that randomly placed blocks of equal number and size to observed HDRs across the genome. Of the SHDRs differentiated on both Ecuadorian and Colombian transects but in one side of the Andes only, i.e. across replicate transects, on average 74% and 48% of H. erato and H. melpomene SHDRs, respectively, were outliers for at least one other statistic, in addition to zPBS/zFst (Fig. 4A grey). Of the H. erato and H. melpomene SHDRs shared across all transects (allopatric SHDRs), 94% and 86% had at least one additional outlier statistic, respectively (Fig. 4A, C). In H. erato, SHDRs were often outliers for both, high Dxy and reduced Tajima’s D (36% of SHDRs with additional outlier statistics, on average). In contrast, H. melpomene SHDRs were rarely outliers for Tajima’s D, whereas 22-33% of them were outliers for both Δπ and Dxy. This could point towards different selection histories in each species, with H. erato showing signatures of recent or ongoing selective sweeps leading to an excess of rare alleles (negative Tajima’s D). The Andean split is dated ~0.8 million years older in H. melpomene populations36,37 and their altitudinal range is wider than that of H. erato, which is rarely found > 1500 m at these latitudes. Thus, it is possible that H. melpomene SHDRs represent more ancient sweeps, reflected in the high prevalence of SHDRs outliers for Dxy. Alternatively, high absolute sequence divergence (Dxy) between elevations could be indicative of selected haplotypes arising through adaptive introgression from other species into the highland populations.
As an additional independent line of evidence that SHDRs are under selection, we checked for overlaps with selection statistics and altitude-associated regions obtained from an altitudinal transect in southern Ecuador on the East of the Andes, sequenced with a linked-read technology called ‘haplotagging’64. We found that, on average, 59% ( ± 10 SD) of Eastern SHDRs of each species overlapped with at least one additional outlier selection statistic estimated with the haplotagging dataset, whereas, as expected, fewer Western SHDRs did (32% ±8 SD, on average; S.I., Supplementary Note 3, Supplementary Fig. 5). In contrast, SHDRs shared in all transects showed high levels of overlap with haplotagging-derived selection statistics in all transects (52% ± 15 SD, on average).
We tested if SHDRs were associated with low-recombining regions. Regions of high differentiation and low recombination could be indicative of purifying selection against deleterious mutations (background selection) or maladaptive introgression62,65,66. Background selection has been shown to be a major driver of differentiation landscapes between populations with little gene flow63 but is less plausible when populations readily exchange genetic material, as in this study67,68. However, in low recombining regions, selection may be more efficient due to reduced effective gene flow and segregation of co-adapted alleles, and thus facilitate the maintenance of locally adaptive loci62,69,70. Several strongly selected Heliconius colour pattern loci have been previously associated with regions of low recombination (Fig. 3 chromosomes 15 and 18)58,71. Nevertheless, here we found that recombination rate at SHDRs did not differ from background levels, except in comparisons that included strongly selected colour pattern loci (Fig. 3C, Supplementary Fig. 6). Overall, these additional signatures of selection strongly support the action of repeated divergent selection in the highlands rather than background selection driving the differentiation detected at SHDRs.
Known genes of interest overlap with SHDRs
We retrieved 908 and 747 genes overlapping with SHDRs in H. erato and H. melpomene, respectively. Adaptation to altitude is likely driven by a suite of selective factors such as temperature, air pressure, or habitat changes. Here, we do not distinguish among individual selective pressures, as these are often correlated and we do not have the power to tease them apart. Additionally, populations adapting to new or changing environments are unlikely to univariately respond to these pressures, with adaptive walks in the wild predicted to be as complex as the environments themselves. Nevertheless, future studies should test different selective pressures in laboratory settings and look for overlaps between genes of interest and SHDRs.
With so many potential targets of selection within SHDRs, we do not attempt to infer biological function or adaptive significance from the whole gene set. Instead, we checked for overlaps with regions recently associated with wing shape variation across an altitudinal cline of Southern Ecuador in H. erato and H. melpomene35. Rounder wings are generally associated with the highlands across 13 species of Heliconius34. There is also subtle but highly heritable variation in wing shape associated with altitude along the Ecuadorian Eastern transect here studied35. We found that five out of 12 previously identified candidate wing shape loci35, overlapped with SHDRs in H. erato, three of which corresponded to SHDRs detected in all transects. In contrast, only two wing shape loci (out of 16) overlapped with H. melpomene SHDRs, one of which was a SHDR shared by all transects. The number of overlaps between candidate wing shape loci and SHDRs in H. erato (n = 5) was higher than the 90th quantile of 10,000 permutations, but not in H. melpomene (n = 2; Fig. S7).
One gene on chromosome 13 stood out, rugose, as it was associated with wing shape in both H. erato and H. melpomene35 and overlapped with SHDRs shared in all transects in both species. In Drosophila mutants, rugose has been shown to affect social interactions, locomotion, and hyperactivity72. The highland incipient species of the H. erato clade, H. himera, has been shown to fly for more hours per day than lowland H. erato73, suggesting a potentially important role of locomotion to adapt to highland habitats. Additionally, we found that an H. erato Eastern SHDR (Fig. 4B) overlapped with a locus recently identified to be differentiated across many pairs of subspecies in several Heliconius species and shown to affect wing beat frequency in Drosophila74. Thus, future studies could focus on functionally testing some of these candidates and ascertain the potentially adaptive functions of candidate regions.
Same haplotypes underlie parallel adaptation to altitude
High altitude differentiation at the same locus could be driven by the same or different haplotypes under selection. For instance, different de-novo mutations at one locus were recently found to confer parallel adaptation to toxic soils in Arabidopsis, although most parallel regions were sourced from a common pool of standing alleles75. To test whether our candidate regions shared the same haplotypes, we performed local Principal Component Analyses (PCA) with outlier windows of each SHDR (Fig. 5A). While ‘global’ PCAs tend to show relatedness between individuals due to geographic structure or partial reproductive isolation, local PCAs of smaller genomic regions can highlight divergent haplotypes due to, for instance, structural variation or positive selection leading to similar haplotypes in adapted individuals76,77. Here we assessed whether genetic variation across individuals at SHDRs (local PCA PC1) could be significantly explained by altitude while accounting for genome-wide (‘global’) structuring (Fig. 5A) to test for evidence for shared allelic basis for altitude adaptation.
PCAs in each SHDR were performed with individuals from all altitudes in transects connected via gene flow (replicate Colombia/Ecuador transects). Local PCAs at SHDRs often showed individual clustering that differed from the neutral geographic expectations (whole-genome ‘global’ PCAs that included Western or Eastern transects, Supplementary Fig. 1), and the first axes of variation tended to explain a much larger proportion of the variation observed (PC1 explained 55% ± 20 SD compared to 19% in global PCAs, on average; Supplementary Table 3). Out of the four genome-wide PCAs including individuals from replicate transects within sides of the Andes, altitude only explained clustering along PC1 in the Eastern H. erato transects (Supplementary Table 3, Supplementary Fig. 1). This can be explained by a different highland colour-pattern subspecies in Colombia, H. e. dignus, reducing gene flow across the genome (Fig. 2). In contrast, we found that a large proportion of local SHDR PCAs had PC1s that correlated significantly with individual altitude (P < 0.05, H. erato: East = 48%, H. melpomene: West = 74%, East = 66%, Supplementary Table 3), except in H. erato West where only 25% did (Fig. 5B, Supplementary Figs. 8–11). Altitude explained, on average, 12% of the variation in local PC1 while controlling for the global PC1 (altitude partial R2, H. erato: West = 0.05, East = 0.16, H. melpomene: West = 0.10, East = 0.15, Supplementary Table 3).
Local PCAs can also highlight divergent haplotypes, putatively associated with inversions, by clustering individuals that possess homozygous or heterozygous haplotypes in those regions78,79. We found several H. erato SHDRs within a high differentiation block in chromosome 2 (6.5 mega base pairs, Mbp, Fig. 5C), ~0.75 Mbp downstream from a recently identified inversion exclusively present in lowland individuals of southern Ecuador64. We performed an additional local PCA across the large putative inversion and found a three-cluster pattern, consistent with the presence of the three inversion genotypes (Fig. 5C), and a neighbour-joining tree with outgroups supported its appearance in the Eastern lowlands (Supplementary Fig. 12). Local PCAs of SHDRs within the inversion region correlated more strongly with altitude than putatively inverted-only haplotypes, indicating that adaptive loci remain differentiated in the highlands and may pre-date the inversion event in the lowlands (Supplementary Fig. 8). This is consistent with a model in which the inversion could enhance local adaptation by reducing gene flow between elevations at pre-existing locally-adapted alleles80.
Overall, the majority of SHDRs involve the same alleles across transects connected via gene flow. Those SHDRs that do not correlate with altitude in local PCAs could represent false positives where there is no selection associated with altitude. However, they may also represent cases where the original beneficial mutation recombined onto distinct haplotypes that spread in different regions; cases where different de-novo mutations arose at the same or nearby loci in different genetic backgrounds in different regions; or where beneficial alleles entered the population through independent introgression events. An alternative hypothesis is that the observed parallelism is partly driven by different selective pressures on each cline, and that these peaks represent ‘evolutionary hotspots’ — regions particularly prone to driving phenotypic evolution across lineages5. This seems less likely given that the same haplotypes underlie parallel SHDRs and there is low parallelism between the two species. Nevertheless, putative intra-lineage hotspots could be investigated by assessing mutational biases and chromosomal instability in these regions, as well as disentangling the individual selective factors at play.
The large putative inversion found exclusively in the lowlands may represent a case of structural variation facilitating adaptation in the highlands80. Recent studies on environmental adaptation in seaweed flies and sunflowers, among others, have demonstrated a key role for inversions in maintaining adaptive alleles together and facilitating the evolution of locally adapted ecotypes76,77. By studying individual clustering across differentiated loci we have shown that the same alleles often drive parallelisms involved in local adaptation. We thus next turned to identifying the source of the genetic variation causing molecular parallelism across populations.
The source of parallelism: standing variation and adaptive introgression with high-altitude relatives
The presence of the same putatively adaptive haplotypes on several transects could either reflect: (i) standing variation being repeatedly selected in the highlands or shared via intraspecific gene flow, or (ii) recruitment of adaptations from other high-altitude adapted species through introgression. With five high-depth individuals per population of H. erato and H. melpomene, and 116 additional whole-genomes of high-altitude specialist species and outgroups, we tested for signatures of shared standing variation within species and of adaptive introgression between species in SHDRs.
To test for excess allele sharing at SHDR, we calculated the FdM statistic in 50 kb windows across the genome81,82. For each test, we used a tree with four populations (((P1, P2), P3), O), where P1/P2 reflect the lowland and highland populations, respectively, and P3 is an allopatric high-altitude population of the same species or a sympatric high-altitude specialist species (Fig. 6A). Positive FdM values indicate excess allele sharing between P3 and P2 (i.e., between non-sister high-altitude lineages), and negative values indicate excess allele sharing between P3 and P1 (i.e., between non-sister high- and low-altitude lineages, Fig. 6A). We then tested if SHDRs are enriched for outlier positive FdM (i.e., excess allele sharing with the highlands), using the distribution of absolute negative FdM across SHDRs as a null (see Methods for details). This specifically tests whether genomic regions that are differentiated in high-altitude populations (SHDRs) are systematically enriched for alleles shared with allopatric high-altitude populations of the same species or with sympatric specialist high-altitude species.
We first assessed intraspecific allele sharing between allopatric highland populations on opposite sides of the Andes, which split ~0.9 million years ago (Mya) and ~1.7 Mya in H. erato and H. melpomene, respectively36,37. Signatures of allele sharing likely represent shared ancestral standing variation that pre-dates the Andean split (Fig. 1B), but we cannot rule out gene flow via distant contact zones in the north and south edges of the Andes or periods of secondary contact in the past37. Nearly half of H. erato Eastern SHDRs had outlier excess allele sharing with the Western highlands in Colombia, whereas only 9% of Western SHDRs did (Fig. 6B, Supplementary Fig. 15). Both comparisons resulted in a significant enrichment of excess allele sharing between allopatric highland populations compared to sharing with the lowlands across all SHDRs (Kolmogorov-Smirnov tests P < 0.05 as stars in Fig. 6B, Supplementary Fig. 15). H. melpomene only showed significant enrichment of excess allele sharing between allopatric highland populations of Ecuador in Eastern SHDRs, and the percentage of SHDRs with excess allele sharing was generally lower than in H. erato (Fig. 6B). Overall, shared standing variation has been an important mechanism facilitating molecular parallelism in H. erato, in which trans-Andean populations share a more recent common ancestor.
We then explored allele sharing between highland populations of H. erato and H. melpomene and five sympatric high-altitude specialist relatives (Fig. 6C). These highland species are known to readily or occasionally hybridise with H. erato or H. melpomene37,83,84,85. To quantify genome-wide evidence of allele sharing we first computed f-branch statistics, which test for gene flow between branches of a phylogeny86. As expected, we found evidence of excess allele sharing between all relevant pairs (details in Supplementary Note 4, Supplementary Figs. 13, 14)87. Excess allele sharing at SHDR between highland populations and sympatric high-altitude specialist species likely represent cases of adaptive introgression. Generally, the more closely related putative donor and recipient species were, the higher proportion of SHDRs that showed excess allele sharing (Fig. 6B, Supplementary Fig. 16). For instance, levels of allele sharing in SHDRs were much higher between highland H. erato and the closely related H. himera than with the distantly related H. telesiphe (Fig. 6B). Context-dependant mutational effects (epistasis) in divergent genetic backgrounds could explain the lower levels of excess allele sharing between distantly related species21, as shown for parallel adaptation to altitude in birds88. Interestingly, in Eastern H. melpomene, SHDR sharing with a closely related sympatric species, H. timareta, was more prevalent in SHDR than shared variation with allopatric highland populations of its own species (Supplementary Fig. 16D). Admixture between adjacent H. melpomene and H. timareta populations is well-documented, with strongly selected colour pattern loci having been shared across the species barrier84,85,89.
Levels of putative adaptive introgression at SHDRs were also high between H. himera and highland H. erato, and significantly enriched across SHDRs in Colombia (Fig. 6B). H. himera is a closely related species that split from within the Eastern H. erato clade 215,000−527,000 years ago37, with pre-mating isolation and a range of divergent life-history phenotypes adapted to the highland dry forests it inhabits90,91,92. Admixture is predominantly from H. himera into H. erato37, supporting our hypothesis that the high levels of excess allele sharing at SHDRs between the two may represent cases of adaptive introgression into H. erato. Our study is the first to show that putative adaptive alleles other than colour pattern loci have been shared between high-altitude specialist species and H. erato and H. melpomene, potentially facilitating their adaptation to montane habitats.
The 6.5 Mbp putative inversion detected in chromosome 2 of Eastern H. erato individuals showed high levels of allele sharing between highland H. erato populations and H. himera, whereas there was no excess allele sharing with either the highlands or the lowlands when the putative donor species was a distantly related species (Supplementary Fig. 17). A neighbour-joining tree of this region revealed that lowland distant individuals that clustered in the local PCA formed a monophyletic group divergent from all other H. erato populations, including allopatric populations in the west of the Andes (Supplementary Fig. 12). This suggests that the inversion may have arisen anciently, prior to the western and eastern Andean split of H. erato. Its maintenance in the lowland populations may protect locally adaptive alleles from maladaptive migration load and/or promote the accumulation of novel, locally adaptive mutations77. Furthermore, its absence in the highlands allows for ongoing gene flow between highland H. erato populations and the closely related highland specialist H. himera. Supergene evolution in another species of this genus, H. numata, has been linked to the introgression of a chromosomal inversion93, highlighting the role of structural variation and hybridisation in providing novel genetic architectures that can promote adaptation. Future work could investigate the potential role of this inversion in maintaining locally beneficial allele clusters and their associated adaptive phenotypes.
By studying recently and anciently diverged populations at different altitudes within and across sides of the Andes of two species we have uncovered (i) strong signatures of high-altitude differentiation in narrow regions across the genome, consistent with positive selection, (ii) high levels of molecular parallelism between transects of the same species but no sharing across species, and (iii) an important role of standing variation and adaptive introgression from high-altitude specialist species in adaptation to these montane environments. The overall lack of molecular parallelism across species points towards genetic redundancy of polygenic evolution that allows different combinations of alleles to confer adaptation to the same environments or may reflect variation in the importance of specific selection pressures across species28. The evolutionary success of H. erato and H. melpomene in inhabiting a wide range of altitudes has likely been facilitated by abundant genetic diversity, as well as by intra- and interspecific gene flow allowing for the sharing of pre-existing adaptive alleles. Together, our study highlights the value of extensive replication across space and large whole-genome datasets for understanding the molecular underpinnings of local adaptation in the wild. Both standing genetic variation and recent hybridization can supply the selection targets required for adaptation to the environment, which emphasizes the importance of preserving gene flow and connectivity between populations if organisms are to adapt to everchanging environmental pressures.
Methods
Our study did not require ethical approvals. In Colombia, field collections were conducted under permit no. 530 issued by the Autoridad Nacional de 539 Licencias Ambientales of Colombia (ANLA). In Ecuador, collections during November-December 2011, and September-October 2012, were done under permit 0033-FAU-MAE-DPO-PNY and exported under permits 001-FAU-MAE-DPO-PNY and 006-EXP-CIEN-FAU-DPO-PNY. Permits were obtained from Parque Nacional Yasuní, Ministerio Del Ambiente, La Dirección Provincial de Orellana. Collections in Ecuador during 2017-2019 were conducted under the permit provided by the Ministerio del Ambiente, Ecuador (MAE-DNB-CM-2017-0058).
Study system and wild butterfly collection
H. erato and H. melpomene can be found across most of the Neotropics and have Müllerian aposematic mimicry to advertise their toxicity to predators, thus share colour pattern when inhabiting the same areas94. They can be found continuously coexisting across altitudinal transects ranging from sea level up to 1600 m along the Andean mountains, and H. melpomene can be found across elevations up to 1800 m. Butterflies were collected from 111 different locations, which we grouped into 30 populations, corresponding to four transects Colombia West/East, and Ecuador West/East (Supplementary Table 1). In each transect populations were either in the highlands (altitude mean = 1235 m), lowlands (altitude mean = 364 m), or distant lowlands (altitude mean = 95 m) to control for genetic drift due to isolation by distance (Fig. 2A). The Andes acts as a barrier to gene flow, as elevations in these latitudes are too high for butterflies to fly across and have been for at least 8 million years95, which pre-dates the expansion of both species across these latitudes36,37. Microclimatic variation across the Ecuadorian altitudinal transects here studied were found to be remarkably similar between Eastern and Western transects33, making them ideal for the study of repeated adaptation. All but one of these transects (Colombia East) had the same subspecies, i.e., geographic colour morph, in the three altitudes, to avoid differentiation due to highly divergent colour pattern loci (Fig. 2A). Detached wings were stored in glassine envelopes and bodies in EtOH (96%) vials. We additionally collected high-altitude specialist relatives of H. erato and H. melpomene that have potential for admixture between them. The H. erato relatives were H. himera and H. telesiphe from the Eastern Andes, and H. clysonymus which is found on both sides of the Andes. The H. melpomene relatives were H. timareta and H. cydno, from the Eastern and Western Andes, respectively. More distantly related outgroups were also sampled, H. eleuchia and H. hecale for H. erato and H. melpomene, respectively.
Whole-genome sequencing
Whole genome sequence data from 518 individuals was analysed in this study, 444 were sequenced for this study, while the rest were obtained from published studies (n = 74). Of the individuals sequenced for this study 365 were sequenced at low-medium depth with BGI (~6X), and 79 were sequenced at high depth with Novogene (~18X-30X), at least 5 per population. For the high-altitude specialist species dataset and outgroup species, we obtained high-depth whole genome sequencing data for 116 individuals, 63 of which were sequenced for this study at ~20X depth with BGI. A full list of individuals, localities, and accession numbers can be found in the Supplementary Data 1. Individuals with H. melpomene malleti phenotypes (Fig. 1) were genotyped with PCR amplification with the primer Jerry followed by a restriction digest, following56, to identify cryptic individuals of the species H. timareta ssp. nov., which are indistinguishable phenotypically from H. m. malleti. We extracted DNA with QIAGEN DNeasy Blood and Tissue extraction kits, including RNA removal, and confirmed DNA integrity and concentration (minimum of 10 ng/μL) using Qubit. DNA samples were stored at −20 °C until library preparation. For the individuals that were sequenced with low-medium depth, a secondary purification was performed with magnetic SpeedBeadsTM (Sigma) and we prepared Nextera DNA libraries (Illumina, Inc.) with purified Tn5 transposase96. PCR extension with an i7-index primer (N701–N783) and the N501 i5-index primer was performed to barcode the samples. Library purification and size selection was done using magnetic SpeedBeadsTM (Sigma). We confirmed adaptor lengths through TapeStation High sensitivity T1000 (Agilent Technologies, CA, USA) and gel electrophoresis. Pooled libraries were sequenced by The Beijing Genomics Institute (China) using HiSeq X Ten (Illumina). Library preparation and sequencing of the high-depth H. erato and H. melpomene individuals was carried with HiSeq X platform (150 bp paired-end) by Novogene.
Statistical analyses
All non-genomic analyses were run in R V2.13 (R Development Core Team 2011) and graphics were generated with the package ggplot297.
Read mapping and genotype calling
We aligned the sequence data of all individuals of the two focal species and their relatives to their corresponding reference genomes, either H. melpomene version 2.598,99 or H. erato demophoon58, obtained from Lepbase98, using bwa mem (v 0.7.15100). We used samtools (v 1.9101) to sort and index the alignment files. Duplicates were removed using the MarkDuplicates program in Picard tools (v 1.92 Broad Institute, 2018102). Genome-wide mean sequencing depth was calculated with samtools (v 1.9101). Mean sequencing depth was very similar across areas of H. erato (mean = 8.93, Supplementary Fig. 18 and was generally higher for H. melpomene (mean = 12.3), but more variable, especially in Colombia where many sequences were obtained from published studies (Supplementary Data 1). Most of the analyses described below for H. erato and H. melpomene were performed with genotype likelihoods in ANGSD and low or variable sequencing depths are thus accounted for103.
However, for our phylogenetic datasets combining our samples with other species for phylogenetic tree reconstruction and tests of admixture, we restricted the H. erato and H. melpomene samples to the five individuals per population with high sequencing depth. We mapped the high-altitude specialist relatives and outgroups of H. erato and H. melpomene to the respective reference genomes as explained above (Supplementary Data 1). We used a genotype calling approach with GATK v. 3.7104 to obtain a vcf file each for the H. erato and H. melpomene clade. Genotypes were called with HaplotypeCaller for each individual and variants were then called with GenotypeGVCFs across all individuals combined. The vcf files were filtered with vcftools v. 0.1.15105 to remove genotypes with less than 3 reads, monomorphic sites, multi-allelic sites, insertions and deletions (indels), and sites with more than 50% missing data.
Isolation by distance and Isolation by environment
To study Isolation by Distance (IBD) and Isolation by Environment (IBE), we first calculated all pairwise genetic differentiation (Fst) between all populations on each side of the Andes that had at least 5 individuals each, i.e., H. erato west (npopulation = 7), H. erato east (npopulation = 11), H. melpomene west (npopulation = 7), H. melpomene east (npopulation = 9), hereafter side-species replicate (Supplementary Table 1). We calculated pairwise population genetic distance with the function calculate.all.pairwise.Fst() from the R package BEDASSLE39. This requires a matrix of allele count data, with populations as rows and number of biallelic unlinked loci sampled as columns, which we obtained with ANGSD and custom scripts. First, we obtained a list of highly polymorphic SNPs per side-species replicate, by (i) heavily filtering 10 random individuals and obtaining minor allele frequencies (-doMaf 1) by forcing the major allele to match the reference state, so that it is the same across all populations (-doMajorMinor 4) (ii) extracting the sites and the major/minor allele frequencies, (iii) creating an indexed sites file (angsd sites index), (iv) subsetting so that sites are at least 2 kb apart, to prune for linkage disequilibrium. This list of sites and regions was then used to obtain minor/major allele frequency counts with all individuals per population and forcing the major/minor allele to match the ones given by the sites file (-doMajorMinor 3). From the resulting allele frequencies per population, we calculated allele counts, by multiplying by the number of individuals per population and the number of chromosomes samples (2, diploid). We obtained the required allele count matrix by concatenating all populations per side-species replicate, and only keeping loci with allele counts for all populations.
Geographical distance between populations measured as a straight line through the landscape is not biologically representative of organisms moving through space. To account for topographic complexity, we obtained topographic least cost paths with the R package topodistance. With historical records from the Earthcape database106, we created a binary habitat suitability raster based on the altitudinal range of each species, so that least cost paths between populations never included elevations that these butterflies do not inhabit. Then we used the function topoLCP() to get the least cost path distance between populations (Supplementary Fig. 3107), and use this distance as a proxy of isolation by distance between populations.
Differentiation and selection statistics
To search for signatures of local adaptation to montane habitats we use a measure of lineage-specific differentiation, population branch statistics (PBS)44. PBS is a summary statistic based on pairwise genetic differentiation (Fst) among three populations, two of which are located closely geographically (high, low) and one distant outgroup (low distant). For each population trio (high, low, low distant Fig. 1A), we computed PBS with ANGSD103. We first obtained genotype likelihoods to calculate the site-frequency spectrum (SFS) per population. Then, we computed 2D-SFS for each population pair (high-low, high-low.distant, low-low.distant) with the function realSFS. We then used the 2D-SFS as a prior for the joint allele frequency probabilities at each site, which are used to compute per-site pairwise Hudson’s Fst108 as interpreted by Bhatia (109, realSFS fst index -whichFst 1) among the three populations (Fsthigh-low, Fsthigh-low, Fsthigh-low.distant) and PBS per population (PBShigh, PBSlow, PBSlow.distant). This is achieved by first transforming pairwise Fst values into relative divergence times with Eq. (1):
To then obtain PBS for a given population (here the highlands) with Eq. (2):
This quantifies the magnitude of allele frequency change in the highland lineage since its divergence from the lowland and lowland distant populations. Regions of the genome with large PBS values represent loci that have undergone population-specific sequence differentiation consistent with positive selection. We computed weighted Fst and PBS averages per 5 kb window size and 1 kb steps (realSFS fst stats2), with the same window centres across datasets (-type 0). For transects where only two out of three populations had any individuals (Colombia west for H. erato and Ecuador west for H. melpomene Fig. 1A), we computed pairwise Fst only. Finally, we normalized PBS and Fst window values with z-scores, i.e. number of standard deviations from the mean, so that divergence is comparable across transects.
High Differentiation Region parallelism between replicates
To measure genetic parallelisms in adaptation to the highlands, we quantified overlap of outlier windows and adjacent regions, i.e. high differentiation regions (HDR), across replicate and allopatric transects and species. We considered outlier windows to be those with values above 4 standard deviations from their mean (zPBShigh > 4, zFst > 4; following51). We then expanded outlier window positions 50 kb upstream and 50 kb downstream from the window centre (HDR) and checked for overlaps with other transects, either within or across sides of the Andes for each species, to highlight parallelism when visualizing patterns of genome-wide divergence. High-differentiation regions with any overlap with other transects are termed shared HDRs (SHDRs). To check for HDR overlaps between species, we mapped the H. melpomene windows (starts and end positions) to the H. erato reference genome using a chainfile64 and the liftover utility (Hinrichs, 2006).
To test whether the level of parallelism observed between transects within and across sides of the Andes was higher than expected by chance, we used the R package intervals110. We first created outlier window intervals (HDRs), by obtaining the start and end positions of continuous blocks of outlier windows ( ± 50 kb buffers) with the function Intervals() (options Type=”Z”, closed). We obtained the observed proportion of total intervals that overlapped, at any of their positions, with outlier intervals in the other transect within sides of the Andes, or with outlier intervals in both, the other transect within the same side of the Andes and the two transects on the other side of the Andes (allopatric sharing). We then simulated 10,000 randomized distributions of outlier-window intervals across the genome per transect, per species (n = 8). In each simulation set, we randomly placed the same number of HDR intervals and of the same size as the observed outlier-window intervals for those transects. With these, we estimated the proportion of simulated intervals that overlapped with observed HDR outlier intervals within and across sides of the Andes, obtaining as a result a null distribution of interval overlap proportions. This approach does not account for the possibility that different parts of the genome may be more readily involved in local adaptation. Finally, to determine whether the level of overlap was significantly greater than expected by chance alone, we performed jackknife block resampling across the genome to estimate the 95% CI for the observed proportion of overlapping intervals and then assessed whether this interval included the expected value, which was computed as the mean of 10,000 random permutations described above.
Measures of nucleotide diversity, selective sweeps, and recombination rate
We studied genetic variation within and across populations by deriving three summary statistics from ANGSD thetas estimations103, Tajima’s D, which estimates the deviation of a sequence from neutrality, nucleotide diversity (pi, or population mutation rate), and Dxy or absolute divergence, which calculates pairwise differences between sequences of two populations excluding differences between sequences within populations. ANGSD has been found to be an accurate estimator of nucleotide diversity because it includes invariant sites111. Firstly, we obtained folded global site-frequency spectra for each population. Then we calculated pairwise nucleotide diversity per site (thetaD, realSFS saf2theta). Finally, we performed sliding window analysis of 5 kb window size and 1 kb steps (thetaStat do_stat) to obtain sum of pairwise differences, Tajima’s D, and total effective number of sites per window. Nucleotide diversity (pi) was obtained by dividing the sum of pairwise differences by the total number of sites per window. For all transects, we calculated the difference in nucleotide diversity per window between highland and lowland distant populations (or low populations if low distant individuals unavailable), which is expected to be negative if a selective sweep led to locally reduced diversity in highland populations. Absolute divergence (Dxy,112) between high (population A) and low/low distant (population B) populations was estimated by additionally obtaining pairwise nucleotide diversity per site (thetaD) for all individuals pooled from populations A and B (population AB), and then per-site Dxy obtained with Eq. (3):
With n being the number of individuals per population (A, B, or pooled AB) and thetaD obtained from realSFS saf2theta. Mean Dxy was estimated for the same 5 kb windows with 1 kb steps. Finally, we obtained recombination rate for 50 kb windows along the genome from the mean population recombination rate (ρ = 4Ner; r = probability of recombination per generation per bp) estimated for 13 H. erato populations from across the range in a recent study113 and for 100 kb windows of four H. melpomene populations71. Note that the H. erato genome is larger than H. melpomene (383 Mb and 275 Mb, respectively), hence the difference in window sizes.
Testing for significance of selection statistics in SHDRs
To test for positive selection we assessed Tajima’s D, difference in nucleotide diversity across elevations Δπ (πhigh− πlow), and absolute divergence (Dxy) within SHDRs and compared values to simulated distributions. We used the same permutation approach described for assessing HDR parallelism, by randomly placing the same number of intervals and of the same size as the observed HDRs for each transect 10000 times. We then obtained minimum Tajima’s D and Δπ and maximum Dxy within each simulated SHDR and permutation, and only considered a SHDR to be an outlier for a given selection statistic if the observed maximum (Tajima’s D, Δπ) or minimum (Dxy) value was above the 90th or below the 10th percentile of simulated values. Number of outlier selection statistics per SHDR was tallied and compared across replicate or allopatric sharing in each species.
Global and local PCAs
To assess neutral genetic variation between individuals and populations, we performed principal component analysis (PCA) in the eastern and western transects separately for each species, i.e., two transects per PCA (Fig. 1A). We first obtained a random subsample of 10% of windows that did not have high differentiation across populations, i.e., with zPBS/zFst < 4, and then pruned for linked sites by only retaining 1 site for every 10 kb, yielding 14995 and 8293 sites for H. erato and H. melpomene, respectively. We used the program ANGSD (v 0.933103) to obtain genotype likelihoods in beagle format (-doGlf 2) for all individuals. In H. erato, we excluded chromosome 2 as it contains a large inversion which could distort the neutral differentiation landscape. Genotype likelihoods were used as input for PCAngsd114, which incorporates genotype uncertainty from genotype likelihoods to obtain a covariance matrix across all individuals.
To assess whether the same haplotypes were involved in adaptation to altitude across replicate transects (within sides of the Andes), we performed local PCAs with outlier windows (zPBS/zFst > 4, i.e. >4 standard deviations from the mean) of each SHDR (total = 370 local PCAs). All individuals from replicate transects were included, leading to local PCAs for all Western and Eastern SHDRs of each species that included Colombia and Ecuador samples. We obtained genotype likelihoods in beagle format (-doGlf 2) as input for PCAngsd, similarly to the population structure analysis. We then assessed whether altitude was a significant predictor of individual clustering in each SHDR by building a linear model where local PCA PC1 was the response variable, and altitude and global (genome wide) PCA PC1 the predictors, to account for population structure. We considered that individuals in each replicate transect had the same haplotypes in SHDRs if altitude was a significant predictor of local PCA PC1, while controlling for global PC1 (Fig. 5). We additionally obtained the overall variation explained by the fitted linear models (R2) for each SHDR local PCA and the relative contributions of each explanatory variable (altitude and global PCA PC1, partial R2), estimated with the package relaimpo115.
Measures of excess allele sharing
We used ABBA-BABA-related statistics to examine patterns of allele sharing between closely or distantly related high-altitude species and our study H. erato and H. melpomene populations. These statistics test for an excess of shared derived variation between lineages to distinguish gene flow or ancestral population structure from the incomplete lineage sorting (ILS) that can occur during a simple tree-like branching process. To examine genome-wide patterns of excess allele sharing between populations and species, we obtained F branch statistics implemented with the package Dsuite86. Fbranch summarises and visualises patterns of excess allele sharing across phylogenetic datasets. We performed linkage-pruning to obtain a genome-wide average of excess allele sharing. Using a custom script (https://github.com/joanam/scripts/blob/master/ldPruning.sh) we removed sites above an LD-threshold of R2 > 0.1 with plink v. 1.07116. To reconstruct the backbone phylogeny for excess allele sharing tests, we extracted for each population or species the individual with highest sequencing depth from the vcf file using vcftools v. 0.1.15105. The vcf file was then converted to phylip with a custom script (https://github.com/joanam/scripts/blob/master/vcf2phylip.py). We reconstructed the phylogeny of the melpomene and erato clade separately with RAxML v. 8.2.9 using the GTRGAMMA model117. Using this backbone tree, we used the LD-pruned vcf files of all melpomene/erato clade individuals to computed f statistics (tests of excess allele sharing) across all possible sets of three populations or species with Dsuite Dtrios. Next, we summarized these statistics with Dsuite Fbranch118. The extent of gene flow in the eastern Andes between H. melpomene and H. timareta can skew genome-wide trees. Thus, to assess the levels of gene flow, we constrained the H. melpomene clade to be monophyletic following the species tree. In order to remove spurious signatures of excess allele sharing that are not significant, we set Fbranch values to 0 if the z-score was greater than 3 with a custom script (https://github.com/joanam/scripts/blob/master/removeNonsignDsuite.r). Lastly, we plotted the Fbranch statistics along the phylogeny with dtools.py of the Dsuite package.
To test for adaptive introgression from high-altitude specialist species into H. erato and H. melpomene highland populations, we computed fdM, a statistic of excess allele sharing suitable for small genomic regions. This test is based on a set of four populations, including two sister taxa (P1 & P2), a close relative (P3) that may have admixed with one of these sister taxa and an outgroup (O). Here, the sister taxa represent the H. erato or H. melpomene lowland (P1) and highland populations (P2), whereas P3 represents a highland specialist species that may have contributed beneficial gene variants to the H. erato and H. melpomene highland populations. fdM quantifies gene flow between P3 and P2 or between P3 and P1. In addition, we ran these tests with the allopatric erato/melpomene populations as P3, to test if at SHDRs, the same haplotypes are found in high-altitude populations on both sides of the Andes, potentially due to parallel selection on the same haplotypes. We estimated fdM for non-overlapping 50 kb windows across the genome with the ABBABABAwindows.py script by (Martin et al. 2014) from https://github.com/simonhmartin/genomics_general. We considered individual SHDRs as fdM outliers if their observed maximum fdM value was > 90th percentile of the absolute minimum fdM values across all SHDRs. Additionally, we tested for overall enrichment of excess allele sharing between P3 and P2 (i.e., with the highlands) across all SHDRs, by testing with a two-sample Kolmogorov-Smirnov tests if the distribution of maximum fdM values across all SHDRs (i.e., allele sharing with the highlands, P2) was significantly higher than the absolute minimum fdM values across all SHDRs (i.e. allele sharing with the lowlands, P1). This was repeated for each Colombia/Ecuadorian clines with their respective potential donors (P3).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The sequence data generated in this study have been deposited in the European Nucleotide Archive database under accession code PRJEB35570 or elsewhere on ENA if obtained from previous studies, as specified in Supplementary Data 1 for each individual. Reference genomes for H. erato and H. melpomene were taken from Lepbase v. 498 (H. erato demophoon58 and H. melpomene version 2.599, respectively). Source data are provided with this paper in the Source Data file and in the relevant public repository published in Zenodo (https://doi.org/10.5281/zenodo.6836103)119. All records associated to the individuals used for this study are available in the Heliconius Earthcape database (https://heliconius.ecdb.io/106). Source data are provided with this paper.
Code availability
Scripts and pipelines have been made available in the public repository Zenodo (https://doi.org/10.5281/zenodo.6836103)119.
References
Booker, T. R., Yeaman, S. & Whitlock, M. C. Global adaptation complicates the interpretation of genome scans for local adaptation. Evol. Lett. https://doi.org/10.1002/evl3.208 (2020).
Stern, D. L. The genetic causes of convergent evolution. Nat. Rev. Genet. 14, 751–764 (2013).
Bohutínská, M. et al. Genomic basis of parallel adaptation varies with divergence in Arabidopsis and its relatives. Proc. Natl. Acad. Sci. 118, e2022713118 (2021).
Conte, G. L., Arnegard, M. E., Peichel, C. L. & Schluter, D. The probability of genetic parallelism and convergence in natural populations. Proc. R. Soc. B Biol. Sci. 279, 5039–5047 (2012).
Martin, A. & Orgogozo, V. The loci of repeated evolution: A catalog of genetic hotspots of phenotypic variation. Evolution 67, 1235–1250 (2013).
Besnard, G. et al. Phylogenomics of C4 photosynthesis in sedges (Cyperaceae): Multiple appearances and genetic convergence. Mol. Biol. Evol. 26, 1909–1919 (2009).
Louis, M. et al. Selection on ancestral genetic variation fuels repeated ecotype formation in bottlenose dolphins. Sci. Adv. 7, eabg1245 (2021).
Wang, L. et al. Molecular parallelism underlies convergent highland adaptation of maize landraces. Mol. Biol. Evol. 38, 3567–3580 (2021).
Calfee, E., Agra, M. N., Palacio, M. A., Ramírez, S. R. & Coop, G. Selection and hybridization shaped the rapid spread of African honey bee ancestry in the Americas. PLOS Genet. 16, e1009038 (2020).
Jones, M. R. et al. Adaptive introgression underlies polymorphic seasonal camouflage in snowshoe hares. Science 360, 1355–1358 (2018).
Meier, J. I. et al. Ancient hybridization fuels rapid cichlid fish adaptive radiations. Nat. Commun. 8, 14363 (2017).
Zhang, X. et al. The history and evolution of the Denisovan-EPAS1 haplotype in Tibetans. Proc. Natl. Acad. Sci. 118, (2021).
Jones, F. C. et al. The genomic basis of adaptive evolution in threespine sticklebacks. Nature 484, 55–61 (2012).
Wooldridge, T.B. et al. An enhancer of Agouti contributes to parallel evolution of cryptically colored beach mice. Proc. Natl. Acad. Sci. USA 119, e2202862119 (2022).
Oziolor, E. M. et al. Adaptive introgression enables evolutionary rescue from extreme environmental pollution. Science 364, 455–457 (2019).
Valencia-Montoya, W. A. et al. Adaptive introgression across semipermeable species boundaries between local Helicoverpa zea and invasive Helicoverpa armigera moths. Mol. Biol. Evol. 37, 2568–2583 (2020).
Nogués-Bravo, D. et al. Cracking the code of biodiversity responses to past climate change. Trends Ecol. Evol. 33, 765–776 (2018).
Ding, D. et al. Genetic variation in PTPN1 contributes to metabolic adaptation to high-altitude hypoxia in Tibetan migratory locusts. Nat. Commun. 9, 4991 (2018).
Zhang, Q.-L. et al. Comparative transcriptomic analysis of Tibetan Gynaephora to explore the genetic basis of insect adaptation to divergent altitude environments. Sci. Rep. 7, 16972 (2017).
Mayhew, P. J. Why are there so many insect species? Perspectives from fossils and phylogenies. Biol. Rev. 82, 425–454 (2007).
Yeaman, S., Gerstein, A. C., Hodgins, K. A. & Whitlock, M. C. Quantifying how constraints limit the diversity of viable routes to adaptation. PLOS Genet. 14, e1007717 (2018).
Gross, J. B., Borowsky, R. & Tabin, C. J. A Novel Role for Mc1r in the parallel evolution of depigmentation in independent populations of the cavefish astyanax mexicanus. PLoS Genet. 5, e1000326 (2009).
Harris, R. B. et al. The population genetics of crypsis in vertebrates: recent insights from mice, hares, and lizards. Heredity 124, 1–14 (2020).
Mundy, N. I. A window on the genetics of evolution: MC1R and plumage colouration in birds. Proc. R. Soc. Lond. B Biol. Sci. 272, 1633–1640 (2005).
Orteu, A. & Jiggins, C. D. The genomics of coloration provides insights into adaptive evolution. Nat. Rev. Genet. 21, 461–475 (2020).
Castro, J. P. et al. An integrative genomic analysis of the Longshanks selection experiment for longer limbs in mice. eLife 8, e42014 (2019).
Orr, H. A. The probability of parallel evolution. Evolution 59, 216–220 (2005).
Barghi, N., Hermisson, J. & Schlötterer, C. Polygenic adaptation: a unifying framework to understand positive selection. Nat. Rev. Genet. 21, 769–781 (2020).
Hancock, A. M., Alkorta-Aranburu, G., Witonsky, D. B. & Di Rienzo, A. Adaptations to new environments in humans: the role of subtle allele frequency shifts. Philos. Trans. R. Soc. B Biol. Sci. 365, 2459–2468 (2010).
Capblancq, T., Fitzpatrick, M. C., Bay, R. A., Exposito-Alonso, M. & Keller, S. R. Genomic prediction of (mal) adaptation across current and future climatic landscapes. Annu. Rev. Ecol. Evol. Syst. 51, 245–269 (2020).
Rosser, N., Phillimore, A. B., Huertas, B., Willmott, K. R. & Mallet, J. Testing historical explanations for gradients in species richness in heliconiine butterflies of tropical America. Biol. J. Linn. Soc. 105, 479–497 (2012).
Rueda-M, N., Salgado-Roa, F. C., Gantiva-Q, C., Pardo-Diaz, C. & Salazar, C. Environmental drivers of diversification and hybridization in Neotropical butterflies. Front. Ecol. Evol. 746, (2021).
Montejo-Kovacevich, G. et al. Microclimate buffering and thermal tolerance across elevations in a tropical butterfly. J. Exp. Biol. 223, (2020).
Montejo‐Kovacevich, G. et al. Altitude and life-history shape the evolution of Heliconius wings. Evolution 73, 2436–2450 (2019).
Montejo-Kovacevich, G. et al. Genomics of altitude-associated wing shape in two tropical butterflies. Mol. Ecol. 0, (2021).
Moest, M. et al. Selective sweeps on novel and introgressed variation shape mimicry loci in a butterfly adaptive radiation. PLOS Biol. 18, e3000597 (2020).
Van Belleghem, S. M. et al. Selection and isolation define a heterogeneous divergence landscape between hybridizing Heliconius butterflies. Evolution (2021).
Kozak, K. M. et al. Multilocus species trees show the recent adaptive radiation of the mimetic heliconius butterflies. Syst. Biol. 64, 505–524 (2015).
Bradburd, G. S., Ralph, P. L. & Coop, G. M. Disentangling the effects of geographic and ecological isolation on genetic differentiation. Evolution 67, 3258–3273 (2013).
Wang, I. J. & Bradburd, G. S. Isolation by environment. Mol. Ecol. 23, 5649–5662 (2014).
González-Martínez, S. C., Ridout, K. & Pannell, J. R. Range Expansion Compromises Adaptive Evolution in an Outcrossing Plant. Curr. Biol. 27, 2544–2551.e4 (2017).
Hämälä, T. & Savolainen, O. Genomic Patterns of Local Adaptation under Gene Flow in Arabidopsis lyrata. Mol. Biol. Evol. 36, 2557–2571 (2019).
Jasper, R. J. & Yeaman, S. Local adaptation can cause both peaks and troughs in nucleotide diversity within populations. http://biorxiv.org/lookup/doi/10.1101/2020.06.03.132662. https://doi.org/10.1101/2020.06.03.132662 (2020).
Yi, X. et al. Sequencing of 50 Human Exomes Reveals Adaptation to High Altitude. Science 329, 75–78 (2010).
Amorim, C. E. G. et al. Genetic signature of natural selection in first Americans. Proc. Natl. Acad. Sci.https://doi.org/10.1073/pnas.1620541114 (2017).
Delmore, K. et al. The evolutionary history and genomics of European blackcap migration. eLife 9, e54462 (2020).
Lindo, J. et al. A time transect of exomes from a Native American population before and after European contact. Nat. Commun. 7, 13175 (2016).
Vijay, N. et al. Genomewide patterns of variation in genetic diversity are shared among populations, species and higher-order taxa. Mol. Ecol. 26, 4284–4295 (2017).
Wallberg, A., Pirk, C. W., Allsopp, M. H. & Webster, M. T. Identification of multiple loci associated with social parasitism in honeybees. PLOS Genet. 12, e1006097 (2016).
Librado, P. & Orlando, L. Detecting signatures of positive selection along defined branches of a population tree using LSD. Mol. Biol. Evol. 35, 1520–1535 (2018).
Salmón, P. et al. Continent-wide genomic signatures of adaptation to urbanisation in a songbird across Europe. Nat. Commun. 12, 1–14 (2021).
Martin, S. H. et al. Natural selection and genetic diversity in the butterfly Heliconius melpomene. Genetics 203, 525–541 (2016).
Belleghem, S. M. V. et al. Patterns of Z chromosome divergence among Heliconius species highlight the importance of historical demography. Mol. Ecol. 27, 3852–3872 (2018).
Ralph, P. L. & Coop, G. Convergent evolution during local adaptation to patchy landscapes. PLoS Genet. 11, e1005630 (2015).
Fang, B., Kemppainen, P., Momigliano, P., Feng, X. & Merilä, J. On the causes of geographically heterogeneous parallel evolution in sticklebacks. Nat. Ecol. Evol. 4, 1105–1115 (2020).
Nadeau, N. J. et al. Population genomics of parallel hybrid zones in the mimetic butterflies, H. melpomene and H. erato. Genome Res. 24, 1316–1333 (2014).
Nadeau, N. J. Genes controlling mimetic colour pattern variation in butterflies. Curr. Opin. Insect Sci. 17, 24–31 (2016).
Van Belleghem, S. M. et al. Complex modular architecture around a simple toolkit of wing pattern genes. Nat. Ecol. Evol. 1, 52 (2017).
Brien, M. N. et al. The genetic basis of structural colour variation in mimetic Heliconius butterflies. 2021.04.21.440746 https://www.biorxiv.org/content/10.1101/2021.04.21.440746v1, https://doi.org/10.1101/2021.04.21.440746 (2021).
Barghi, N. et al. Genetic redundancy fuels polygenic adaptation in Drosophila. PLOS Biol. 17, e3000128 (2019).
Charlesworth, B. Measures of divergence between populations and the effect of forces that reduce variability. Mol. Biol. Evol. 15, 538–543 (1998).
Nachman, M. W. & Payseur, B. A. Recombination rate variation and speciation: theoretical predictions and empirical results from rabbits and mice. Philos. Trans. R. Soc. B Biol. Sci. 367, 409–421 (2012).
Cruickshank, T. E. & Hahn, M. W. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Mol. Ecol. 23, 3133–3157 (2014).
Meier, J. I. et al. Haplotype tagging reveals parallel formation of hybrid races in two butterfly species. Proc. Natl. Acad. Sci. 118, (2021).
Charlesworth, B., Nordborg, M. & Charlesworth, D. The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet. Res. 70, 155–174 (1997).
Martin, S. H. & Jiggins, C. D. Interpreting the genomic landscape of introgression. Curr. Opin. Genet. Dev. 47, 69–74 (2017).
Burri, R. Interpreting differentiation landscapes in the light of long-term linked selection. Evol. Lett. 1, 118–131 (2017).
Matthey-Doret, R. & Whitlock, M. C. Background selection and FST: Consequences for detecting local adaptation. Mol. Ecol. 28, 3902–3914 (2019).
Meier, J. I., Marques, D. A., Wagner, C. E., Excoffier, L. & Seehausen, O. Genomics of parallel ecological speciation in Lake Victoria cichlids. Mol. Biol. Evol. 35, 1489–1506 (2018).
Yeaman, S. & Whitlock, M. C. The genetic architecture of adaptation under migration–selection balance. Evol. Int. J. Org. Evol. 65, 1897–1911 (2011).
Martin, S. H., Davey, J. W., Salazar, C. & Jiggins, C. D. Recombination rate variation shapes barriers to introgression across butterfly genomes. PLoS Biol. 17, e2006288 (2019).
Wise, A. et al. Drosophila mutants of the autism candidate gene neurobeachin (rugose) exhibit neuro-developmental disorders, aberrant synaptic properties, altered locomotion, and impaired adult social behavior and activity patterns. J. Neurogenet. 29, 135–143 (2015).
Davison, A., McMillan, W. O., Griffin, A. S., Jiggins, C. D. & Mallet, J. L. B. Behavioral and physiological differences between two parapatric Heliconius species. Biotropica https://doi.org/10.1111/j.1744-7429.1999.tb00415.x (1999).
Zhang, Y. et al. A widely diverged locus involved in locomotor adaptation in Heliconius butterflies. Sci. Adv. 7, eabh2340 (2021).
Konečná, V. et al. Parallel adaptation in autopolyploid Arabidopsis arenosa is dominated by repeated recruitment of shared alleles. Nat. Commun. 12, 4979 (2021).
Mérot, C. et al. Locally adaptive inversions modulate genetic variation at different geographic scales in a seaweed fly. Mol. Biol. Evol.
Todesco, M. et al. Massive haplotypes underlie ecotypic differentiation in sunflowers. Nature 584, 602–607 (2020).
Li, H. & Ralph, P. Local PCA shows how the effect of population structure differs along the genome. Genetics 211, 289–304 (2019).
Mérot, C., Oomen, R. A., Tigano, A. & Wellenreuther, M. A roadmap for understanding the evolutionary significance of structural genomic variation. Trends Ecol. Evol. 35, 561–572 (2020).
Kirkpatrick, M. & Barton, N. Chromosome inversions, local adaptation and speciation. Genetics 173, 419–434 (2006).
Malinsky, M. et al. Genomic islands of speciation separate cichlid ecomorphs in an East African crater lake. Science 350, 1493–1498 (2015).
Martin, S. H., Davey, J. W. & Jiggins, C. D. Evaluating the Use of ABBA–BABA Statistics to Locate Introgressed Loci. Mol. Biol. Evol. 32, 244–257 (2015).
Bull, V. et al. Polyphyly and gene flow between non-sibling Heliconius species. BMC Biol. 4, 1–17 (2006).
Martin, S. et al. Genome-wide evidence for speciation with gene flow in Heliconius butterflies. Genome Res. (2013) https://doi.org/10.1101/gr.159426.113.
Pardo-Diaz, C. et al. Adaptive introgression across species boundaries in Heliconius butterflies. PLoS Genet. 8, e1002752 (2012).
Malinsky, M., Matschiner, M. & Svardal, H. Dsuite-Fast D-statistics and related admixture evidence from VCF files. Mol. Ecol. Resour. 21, 584–595 (2021).
Kozak, K. M., Joron, M., McMillan, W. O. & Jiggins, C. D. Rampant genome-wide admixture across the Heliconius radiation. Genome Biol. Evol. (2021).
Natarajan, C. et al. Predictable convergence in hemoglobin function has unpredictable molecular underpinnings. Science 354, 336–339 (2016).
Dasmahapatra, K. K. et al. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature 487, 94 (2012).
Jiggins, C. D., Mcmillan, O., Neukirchen, W., Mallet, J. & Nw, L. What can hybrid zones tell us about speciation? The case of Heliconius erato and H. himera (Lepidoptera: Nymphalidae). Biol. J. Linn. Soc. 221–242 https://doi.org/10.1111/j.1095-8312.1996.tb01464.x (1996).
Merrill, R. M., Chia, A. & Nadeau, N. J. Divergent warning patterns contribute to assortative mating between incipient Heliconius species. Ecol. Evol. 4, 911–917 (2014).
Montgomery, S. H. & Merrill, R. M. Divergence in brain composition during the early stages of ecological specialization in Heliconius butterflies. J. Evol. Biol. 30, 571–582 (2017).
Jay, P. et al. Supergene evolution triggered by the introgression of a chromosomal inversion. Curr. Biol. 28, 1839–1845 (2018).
Jiggins, C. D. The Ecology and Evolution of Heliconius Butterflies. (Oxford University Press, 2016).
Hoorn, C. et al. Amazonia through time: Andean uplift, climate change, landscape evolution, and biodiversity. science 330, 927–931 (2010).
Picelli, S. et al. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Res. 24, 2033–2040 (2014).
Ginestet, C. ggplot2: Elegant Graphics for Data Analysis. J. R. Stat. Soc. Ser. A Stat. Soc. (2011) https://doi.org/10.1111/j.1467-985X.2010.00676_9.x.
Challis, R. J., Kumar, S., Dasmahapatra, K. K., Jiggins, C. D. & Blaxter, M. Lepbase: the Lepidopteran genome database. http://biorxiv.org/lookup/doi/10.1101/056994, https://doi.org/10.1101/056994 (2016).
Davey, J. W. et al. Major Improvements to the Heliconius melpomene Genome Assembly Used to Confirm 10 Chromosome Fusion Events in 6 Million Years of Butterfly Evolution. G3 GenesGenomesGenetics g3.115.023655 https://doi.org/10.1534/g3.115.023655.(2016)
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv Prepr. ArXiv13033997 (2013).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinforma. Oxf. Engl. 25, 2078–2079 (2009).
Broad Institute. Picard tools. Broad Inst. GitHub Repos. (2018).
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinforma. 15, 356 (2014).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Jiggins, C. D., Salazar, P. A. & Montejo-Kovacevich, G. Heliconiine Butterfly Collection Records from University of Cambridge. Department of Zoology, Cambridge. (2019).
Wang, I. J. Topographic path analysis for modelling dispersal and functional connectivity: Calculating topographic distances using the topoDistance r package. Methods Ecol. Evol. 11, 265–272 (2020).
Hudson, R. R., Slatkin, M. & Maddison, W. P. Estimation of levels of gene flow from DNA sequence data. Genetics 132, 583–589 (1992).
Bhatia, G., Patterson, N., Sankararaman, S. & Price, A. L. Estimating and interpreting FST: the impact of rare variants. Genome Res. 23, 1514–1521 (2013).
Bourgon, R. intervals: Tools for working with points and intervals. https://CRAN.R-project.org/package=intervals (2020).
Korunes, K. L. & Samuk, K. pixy: Unbiased estimation of nucleotide diversity and divergence in the presence of missing data. bioRxiv 2020.06.27.175091 https://doi.org/10.1101/2020.06.27.175091 (2020).
Nei, M. The theory of genetic distance and evolution of human races. Jpn. J. Hum. Genet. 23, 341–369 (1978).
Belleghem, S. M. V. et al. Selection and gene flow define polygenic barriers between incipient butterfly species. bioRxiv 2020.04.09.034470 https://doi.org/10.1101/2020.04.09.034470 (2020).
Meisner, J. & Albrechtsen, A. Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data. Genetics 210, 719–731 (2018).
Grömping, U. Relative importance for linear regression in R: The package relaimpo. J. Stat. Softw. https://doi.org/10.18637/jss.v017.i01 (2006).
Chang, C. C. et al. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 4, s13742–015 (2015).
Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Malinsky, M. et al. Whole-genome sequences of Malawi cichlids reveal multiple radiations interconnected by gene flow. Nat. Ecol. Evol. 2, 1940–1955 (2018).
Montejo-Kovacevich, G. gmkov/2021-altitude-heliconius: v1. (Zenodo, 2022). https://doi.org/10.5281/zenodo.6836103.
Loecher, M. & Loecher, M. M. Package ‘RgoogleMaps’. (2020).
Acknowledgements
We are grateful to all the field assistants who have collected samples for this study, Narupa Reserve (Jocotoco Foundation, Ecuador), Jatun Satcha Reserve (Ecuador), and Universidad Regional Amazónica Ikiam for their support. We thank Steven van Belleghem for sharing recombination rates data for H. erato, and Emma Curran and Juan Enciso for providing some of the genomic sequences, and the Butterfly Genetics Lab (Cambridge) for helpful feedback. We thank the McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, for providing access to their butterfly collection for photographing to C.D.J. (Figs. 2, 6). G.M.-K. was supported by a Natural Environment Research Council Doctoral Training Partnership (NE/L002507/1). This work, N.J.N. and C.D.J. were supported by the Natural Environment Research Council (grant number: NE/R010331/1) and by a European Research Council Grant (339873) to C.D.J. Some of the sequence data was generated under a NERC fellowship (NE/K008498/1) to N.J.N. Funding was provided to C.N.B. by the Spanish Agency for International Development Cooperation (AECID, grant number 2018SPE0000400194). C.S and N.R. were funded by Fondos Concursables Big - grant IV-FGD005/ IV-FGI006 Universidad del Rosario. SHM was supported by a NERC IRF (NE/N014936/1). Y.F.C. was supported by the European Research Council Starting Grant 639096 “HybridMiX” and the Max Planck Society. Smithsonian Institution Scholarly Studies Award to K.M.K. and W.O.M. Open access funding provided by the University of Cambridge.
Author information
Authors and Affiliations
Contributions
G.M-K, C.D.J., S.H.M., J.I.M., N.J.N. contributed to the design of the study. G.M.-K., C.N.B., C.S., N.R., S.H.M., W.O.M., K.M.K., N.J.N., S.H.M., C.D.J., collected butterflies and obtained permits. G.M.-K., J.I.M., I.A.W., Y.F.C., M.K., K.M.K., performed DNA extractions and library preparation. G.M-K, J.I.M., S.H.M., performed the analyses with input from C.D.J., N.J.N., Y.F.C. G.M.-K. wrote the first draft of the manuscript, and all authors revised it and approved the final version manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Bohao Fang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Montejo-Kovacevich, G., Meier, J.I., Bacquet, C.N. et al. Repeated genetic adaptation to altitude in two tropical butterflies. Nat Commun 13, 4676 (2022). https://doi.org/10.1038/s41467-022-32316-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-32316-x
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.