The Mediterranean basin is a hotspot of biodiversity, fuelled by climatic oscillation and geological change over the past 20 million years. Wall lizards of the genus Podarcis are among the most abundant, diverse, and conspicuous Mediterranean fauna. Here, we unravel the remarkably entangled evolutionary history of wall lizards by sequencing genomes of 34 major lineages covering 26 species. We demonstrate an early (>11 MYA) separation into two clades centred on the Iberian and Balkan Peninsulas, and two clades of Mediterranean island endemics. Diversification within these clades was pronounced between 6.5–4.0 MYA, a period spanning the Messinian Salinity Crisis, during which the Mediterranean Sea nearly dried up before rapidly refilling. However, genetic exchange between lineages has been a pervasive feature throughout the entire history of wall lizards. This has resulted in a highly reticulated pattern of evolution across the group, characterised by mosaic genomes with major contributions from two or more parental taxa. These hybrid lineages gave rise to several of the extant species that are endemic to Mediterranean islands. The mosaic genomes of island endemics may have promoted their extraordinary adaptability and striking diversity in body size, shape and colouration, which have puzzled biologists for centuries.
Hybridization is a powerful source of genetic variation. While hybrids are usually rare in vertebrates, and may be unfit, back-crossing with parental lineages can enable transfer of adaptively relevant alleles between lineages that otherwise remain distinct1,2. The establishment of evolutionarily independent lineages with more evenly shared ancestry is considered exceptional, and the origin and evolutionary potential of these hybrid lineages are highly contentious2,3,4,5,6. Reticulated evolution can be an important feature of adaptive radiation7 (e.g. Heliconius butterflies8 and cichlid fish9,10), but it does not need to be restricted to rapidly evolving clades. Evolution of reproductive isolation is a protracted process11, allowing genetic exchange between lineages to be a persistent feature of adaptation and diversification of a clade. This is particularly the case for organisms evolving in regions like the Mediterranean, where geologic and climatic change have caused repeated range contraction and expansion over millions of years. Still, it remains to be seen to what extent introgressive hybridization has contributed to the exceptional levels of biodiversity and endemism of the Mediterranean fauna.
In this study, we reveal the entangled evolutionary history of wall lizards, among the most abundant, conspicuous and charismatic animals of the Mediterranean. Island endemic species in particular are strikingly variable in colouration and morphology (Fig. 1), a feature that influenced early theorizing about the origins of species and adaptation12,13. Using whole-genome sequences of representatives of all major lineages of the Podarcis genus, we show that genetic exchange has been pervasive throughout the history of this clade, resulting in lineages with highly mosaic genomes that contributed to the diversity and endemism of the Mediterranean fauna.
Genome sequencing and genetic diversity
Genomic DNA of 34 Podarcis wall lizards from different lineages, including 26 recognized species, were sequenced on an Illumina platform (Supplementary Table 1). Sequence reads were aligned to the P. muralis reference genome14 and variants called, generating a total of 28.4 million single-nucleotide variants (SNVs; Supplementary Fig. 1). The mean nucleotide diversity (π) for all individuals was 10.6 × 10−3 (range from 1.7 × 10−3 to 29.0 × 10−3; Supplementary Fig. 2). A principal component analysis (PCA) based on genetic distance separated all individuals into four distinct geographic species clusters (Balkan group, Iberian group, Sicilian-Maltese group and Western Islands group, where the latter includes the Balearic islands, Corsica and Sardinia), and two species that formed separate clusters (lineages of the Italian species P. siculus and the widely distributed species P. muralis; Supplementary Fig. 3).
The evolutionary history of wall lizards
Past attempts to construct a species tree for Podarcis have yielded unstable topologies, and this lack of a fully resolved phylogeny has limited our understanding of their evolutionary history15,16. To bridge this gap, we constructed a phylogenetic framework for wall lizards by adopting two approaches (Supplementary Table 2). We first generated two concatenated datasets by combining the SNVs from whole-genome sequence (WGS) and protein coding sequence (CDS) data, respectively, for all individuals, and inferred the phylogenies for the two datasets using a maximum likelihood (ML) approach. To alleviate biases due to the concatenation of loci with variable evolutionary histories, we complemented this strategy with a multispecies coalescence (MSC) approach. To this end, we divided the WGS data into small non-overlapping windows (200, 100, 50, 25, 10 and 5 kbs), inferred local phylogenies for each window, and reconstructed the consensus tree.
The phylogenies generated from the concatenation and MSC approaches were highly consistent (average normalized Robinson–Foulds pairwise tree distance: 5.21%; Fig. 2a, b), and clearly supported the geographic clusters of the lineages identified by the principal component analysis (Supplementary Fig. 3). The Iberian group and P. muralis formed a sister taxon to P. siculus, while the Balkan group and Sicilian-Maltese group formed a sister taxon to the Western Islands group. The topology among these major groups was supported by all phylogenies, although some species within the Iberian (e.g. P. carbonelli, P. guadarramae and P. vaucheri) and the Balkan (e.g. P. peloponnesiacus and P. tauricus) groups showed low support rates and conflicting topologies (Supplementary Fig. 4). Local trees based on window sizes of 200 kb produced the most stable results with the least polytomies (Supplementary Fig. 5), but also revealed extensive discordances (Fig. 2c). For example, the consensus tree topology only accounted for 8.58%, and the top 8 topologies accounted for only 31.46% of all local trees inferred from 200 kb windows (Supplementary Fig. 5). Most of the low-support nodes were found within the Iberian group. The mitochondrial DNA (mtDNA) tree was inconsistent with trees based on the nuclear genome and did not support the monophyly of the Western Islands group and the Sicilian-Maltese group (Fig. 2b, Supplementary Fig. 6, Supplementary Table 3). These discordances between nuclear and mitochondrial DNA indicate introgression of mtDNA from geographically adjacent, but sometimes distantly related, donor lineages into the P. hispanicus ‘GAL’ lineage, into the P. siculus lineage, into the P. tiliguerta lineage and into the Sicilian subclade (Supplementary Fig. 6).
We estimated the divergence times for all lineages based on the WGS dataset by using a relaxed clock approach. To reduce the influence of extensive introgression, we used only the genomic regions for which local trees were concordant with the consensus phylogeny. Two secondary calibrations were adopted from a recent phylogeny of the Lacertidae17; the root node (37.55 million years ago, MYA) and the crown node of Podarcis (18.60 MYA). We estimated that the split among the major clades of Podarcis wall lizards took place in the Miocene between ~16.7–9.8 MYA (Fig. 2, Supplementary Fig. 7). To characterize the pattern of diversification, we analyzed shifts in diversification rates on a lineage through time (LTT) curve using a birth–death model18. The results suggested two periods of rapid diversification around 17–15 MYA (coinciding with the Burdigalian–Langhian transition) and around 6.5–4.0 MYA (coinciding with the Messinian–Zanclean transition), the latter contributing to the rich diversity of the Balkan and Iberian groups (Fig. 2, Supplementary Fig. 7).
Rampant introgression throughout the evolutionary history of Podarcis
As we found extensive discordances among local trees, we systematically tested for signatures of introgression between all lineages. We first employed the Patterson’s D-statistics (ABBA-BABA test)19 for all 5984 triplets of the 34 lineages and Archaeolacerta bedriagae as outgroup. The majority of the triplets (77.0%) showed significant deviation from neutrality (|Zscore| > 3.3), suggesting that the discordances among local trees and the phylogenetic uncertainties were not only caused by incomplete lineage sorting (ILS), but also provide strong evidence of past admixture between lineages (Supplementary Fig. 11). We also tested the sharing of particularly long genomic blocks between distantly related species by reconstructing co-ancestry matrices in fineSTRUCTURE20, which revealed patterns that were consistent with the D-statistics (Supplementary Fig. 11a).
To quantify the extent of admixture among lineages leading to the major groups of the Podarcis tree, we inferred reticulate phylogenetic networks of the species using phyloNet21. A total of 15 reticulations were identified by phyloNet, all of which were supported by introgression models with minimal errors from D-statistics (Supplementary Table 4, Supplementary Fig. 12). The results revealed extensive introgression and admixture throughout the evolution of Podarcis species (Fig. 3a, Supplementary Fig. 12). An estimation of the proportion of introgressed alleles from parental nodes in qpGraph19 indicated that the reticulations involved 3–49% of alleles from the minority ancestry (Fig. 3b, Supplementary Fig. 12). In 4 out of 12 lineages that experienced introgression, the rate of protein-coding gene evolution (dN/dS ratios; a common estimate of selection), was higher for the introgressed genes than for genes with a history consistent with the consensus phylogeny (Supplementary Fig. 13a and Supplementary Table 5). These results are consistent with adaptive evolution of introgressed genes.
We further fitted admixture models for evolutionary scenarios containing subsets of species to validate the reticulation events using Nelder–Mead optimization22. The results revealed extensive, ancient reticulations between the major clades (Fig. 3). For example, the most recent common ancestor (MRCA) of the Western Islands group received 41% of alleles from the MRCA of the Sicilian-Maltese group, and 44% of alleles from the MRCA of the Iberian group and P. muralis (Fig. 3b). P. siculus, a species that is widely distributed across the Italian Peninsula and adjacent islands, received 24% of its alleles from the MRCA of the Balkan and the Sicilian-Maltese groups.
The phylogenetic network also suggested that introgression between evolutionarily younger lineages accounts for the discordances among local trees. Introgression was particularly prevalent for the Iberian group. First, three Iberian species (P. carbonelli, P. guadarramae and P. vaucheri) showed evidence of multiple hybridization events between different lineages (Supplementary Fig. 12b). Second, the Iberian group hybridized with the P. muralis lineage inhabiting the Iberian Peninsula (Supplementary Fig. 12d). Similar patterns of extensive introgression were evident within the Balkan group (Supplementary Fig. 12c), with the phylogenetic network suggesting that P. tauricus has a mosaic genome with contributions from P. gaigeae (66% of alleles) and P. ionicus (34% of alleles). Similarly, the eastern lineage of P. peloponnesiacus has received 35% of its alleles from an extinct lineage.
Mosaic genomes of Mediterranean island endemic species
Across Mediterranean wall lizards, the hybridization events that resulted in a rather evenly shared ancestry were prevalent in the evolutionary history of extant island endemics. For the Western Islands group now occurring on Corsica, Sardinia and the Balearic islands, their ancestral lineage experienced introgression of 41% of alleles from the MRCA of the Sicilian-Maltese group, and 44% of alleles from the MRCA of P. muralis and the Iberian group. Following its separation from P. tiliguerta, the MRCA of the Balearic islands species provided 21% of the alleles into the Iberian group, before diversifying into the extant species (P. lilfordi and P. pityusensis; Fig. 3b). The admixture model based on D-statistics supported the inferred patterns of introgression with minimal error (M3: 122.51; versus M1: 13067.80, M2: 8281.35) by showing significant excess of allele sharing (Supplementary Fig. 14).
Similarly, two extant species in the Sicilian-Maltese group (P. raffonei and P. waglerianus), inhabiting Sicily and surrounding islands, descended from a lineage that received 28% of its alleles from P. muralis and P. siculus, with the latter descending from another hybrid lineage that shared 24% of its alleles with the MRCA of the Balkan and Sicilian-Maltese group (Figs. 3b and 4a). The D-statistics from the admixture model supported this complex scenario with three reticulation events (minimal error for M4: 175.60; versus M1: 3560.91, M2: 2813.70, M3: 1040.19; Fig. 4b, c).
Following hybridization, the introgressed genomic regions of parental species are bound to break down due to recombination, and regions that carry incompatible alleles will be purged23. Accordingly, introgressed loci are expected to be enriched in genomic regions with a high recombination rate and few genes8,24,25. Unusually long genomic blocks of different ancestry may therefore be putative candidates for adaptive introgression. One particularly important selective pressure during genome stabilization is mito-nuclear compatibility26, suggesting that genomic regions that contain genes involved in cellular respiration or energy metabolism should co-introgress with mitochondrial genomes.
An exceptionally well-suited case for testing this hypothesis is the two island endemic species P. raffonei and P. waglerianus (the Sicilian subclade in the Sicilian-Maltese group) that show evidence of mtDNA introgression from the P. muralis lineage (Supplementary Fig. 6). To look for evidence of co-introgression of the mitochondrial genome and genomic blocks, we scanned genomic windows for long runs of introgressed ancestry in these species (Supplementary Fig. 22). The longest region was a gene-dense region (36.3–39.5 Mb) on Chromosome 15, containing a total of 39 protein coding genes (Fig. 4f, Supplementary Table 6) that was shared with P. muralis (Fig. 4d, e). The genes residing in this introgressed genomic block included several genes (e.g. ATP6V1, RAB11) involved in energy metabolism and mitochondrial function. The finding that P. muralis and the Sicilian subclade formed sister taxa in the mitochondrial phylogeny (Supplementary Fig. 6) is consistent with a role of mito-nuclear compatibility in maintaining the introgressed genomic block. Furthermore, the average dN/dS ratios of the 39 genes residing in the introgressed genomic block was significantly higher than those of genes without a history of introgression (0.420 vs. 0.254; permutation test: P = 0.039; Supplementary Fig. 13b and Supplementary Table 5). To further confirm that this genomic block is best explained by an introgression between P. muralis and the Sicilian subclade, we compared genome-wide patterns of Dxy and fd statistics (Fig. 4f). The Dxy of the candidate block (0.099) was lower than comparable values for 90% of the genome (Fig. 4g), implying that the origin of this genomic block was more recent than the divergence of the majority of their genomes. The fd of the candidate block (0.098) was significantly larger than genome-wide levels (0.038; permutation test: P < 0.001), further supporting the introgression of this genomic block between P. muralis and the Sicilian subclade.
The evolutionary history of wall lizards demonstrates that introgressive hybridization can be a persistent feature of animal clades over millions of years. While hybridization is well known to allow transfer of a limited portion of the genome between otherwise reproductively isolated lineages1,27, the independent evolution and further diversification of animal lineages with major contributions of two or more taxa has been considered exceptional2,3,4,5,27. The rampant introgression throughout the diversification of wall lizards paints a different picture, and suggests that reticulated evolution may have played an important role in generating the exceptional diversity of the Mediterranean lizard fauna.
Climatic oscillations and changes in land masses have caused recurrent range contractions and expansions in the Mediterranean, promoting bouts of evolutionary divergence and secondary contact between lineages28. One particularly dramatic period in the history of the Mediterranean is the Messinian Salinity Crisis (~6.0–5.3 MYA), during which the Mediterranean Sea nearly desiccated before it was rapidly refilled to attain the approximate shape of today. While genomic data alone cannot provide conclusive evidence for the timing of diversification events29, the diversifications within the two species-rich Balkan and Iberian groups of wall lizards appear to coincide with, or follow closely, this event. It could be predicted that such bursts in diversification would spring from hybridization events or that they would promote genetic exchange between incipient species30,31. However, our results demonstrate that introgression has been pervasive throughout the evolutionary history of wall lizards, even between distantly related lineages. Thus, the reticulated evolution of wall lizards is the result of occasional, but often extensive, genetic exchange between steadily evolving lineages, rather than a burst of hybridization associated with a rapid adaptive radiation.
Such a high and persistent level of introgression is possible because evolution of complete reproductive isolation is typically a slow process11. For example, following introductions outside of its native range by humans, the Italian wall lizard, P. siculus, has hybridized with species from the Balkan, Sicilian-Maltese and Western Islands groups, despite the lineages not having shared a common ancestor for over 17 million years32,33. At the same time, however, even closely related species can co-exist in sympatry34, and the narrow hybrid zones between such species (e.g. within the Iberian group34) or subspecies (e.g. within the widely distributed P. muralis35) demonstrate that a few million years are typically sufficient to evolve pre- or post-copulatory mechanisms that prevent lineages from fully merging. It is, therefore, perhaps surprising that frequently more than 20%, and occasionally close to 50%, of the genomes of several extant species derive from hybridization between highly divergent lineages. This was particularly striking for the two species groups that occupy islands in the Tyrrhenian and Balearic seas, suggesting that Mediterranean islands have provided suitable conditions for the formation of hybrid lineages, or that hybrid lineages have proven particularly able to persist in island habitats. Further studies of population genomic data in combination with reconstruction of the geological history could potentially identify the processes responsible for the origin and maintenance of these mosaic genomes9,36.
Regardless of their origin, the evolutionary potential of lineages with extensive genomic admixture is evident from the fact that several of them went on to diversify into new species (e.g. P. raffonei and P. waglerianus in the Sicilian-Maltese group). Another striking feature of these species with mosaic genomes, in particular island endemics (e.g. P. pityusensis in the Western Islands group37), is their extraordinary phenotypic variability. Lizards from nearby islets are frequently more diverse and disparate in body size, shape and colouration than are different mainland species (Fig. 1), an observation that has puzzled naturalists and evolutionary theorists since the 19th century12,13. The mixed ancestry of these species provides a plausible explanation for their striking variability. The merging of genomes that have evolved independently for millions of years creates opportunities for novel phenotypes to emerge, and ample opportunity for drift and selection to cause populations to differentiate. Conversely, the extensive genomic introgression between lineages may have facilitated the long-term persistence of colour morphs that are shared between species14,38. Finally, the presence of long genomic blocks in otherwise highly recombined hybrid genomes suggests scope for genomic conflict and adaptive introgression to have shaped the genomic and phenotypic features of extant species. While further data is necessary to identify exactly how hybridization contributed to their evolvability and diversity, genetic exchange has been a pervasive feature of the evolution of Mediterranean lizards.
Sample information and sequencing
A total of 36 samples were considered in this study, representing 26 species of the Podarcis genus, eight lineages within recognized species, and two species (Atlantolacerta andreanskyi and Archaeolacerta bedriagae) as outgroup (Supplementary Table 1). All specimens were collected in accordance with the policy of the animal care and use ethics of local institutions (for collection permits, see Supplementary Table 7). Genomic DNA was extracted using a DNeasy Blood & Tissue Kit (Qiagen, USA) according to manufacturer’s instructions. Short-insert (300–500 bp) libraries were sequenced on an Illumina HiSeq X platform by NOVOGENE Ltd. (Hong Kong). The sequence reads were quality checked using FASTQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and trimmed using trimmomatic39 using default settings except for “LEADING:3, TRAILING:3, SLIDINGWINDOW:4:5, MINLEN:70”. Clean reads were aligned to the P. muralis reference genome version PodMur_1.014 using bwa-mem (version 0.7.1; http://bio-bwa.sourceforge.net/bwa.shtml#2/). Single nucleotide and short indel variants were called using the GATK best practice workflow40, with following cutoffs for filtering: SNP quality >100, base quality >30, mapping quality >50, quality by depth >2.0, minimum depth >288, and maximum depth <792, and other default parameters in GATK40.
Two strategies were applied to obtain a robust Podarcis phylogeny. First, we used a concatenation approach by concatenating all SNVs across the whole genome sequences (WGS) and protein-coding sequences (CDS), respectively, and inferred the Maximum likelihood (ML) trees. Second, we adopted a multispecies coalescent approach to infer the Podarcis tree using ASTRAL-III41. The linkage disequilibria of the 34 wall lizard lineages are unknown, and we therefore considered several different window sizes. We generated the local tree dataset by separately splitting the whole genome into fixed windows of 200, 100, 50, 25, 10 and 5 kb, and reconstructed the ML trees for each window. Only windows with the missing rate of SNVs <1% were retained in the following analyses. ML trees were inferred using IQTree42,43 with an GTR + ASC model and 1000 bootstrap replicates. The topologies of local ML trees (based on 50 kb fixed-windows) were quantified using Twisst44 and plotted with different colours in R using the function plot (https://www.r-project.org/). We also used the Bayesian coalescent model in SNAPP from the BEAST2 package45,46 to infer the Podarcis tree based on 3000 randomly selected variants due to computational constraints. Robinson–Foulds (RF) distances were calculated for each pair of phylogenetic topologies using the R package ‘ape’47 to assess the discrepancy among phylogenies.
The mitochondrial genome of each individual was de novo assembled using NOVOPlasty48. The mitochondrial genome of P. muralis (accession FJ460597 from MitoZoa49) was set as a starting reference. A total of 6 Gb sequence reads from each sample were randomly extracted for the baiting and iterative mapping with default parameters. For information on the completeness of the mitochondrial genome assemblies for each lineage, see Supplementary Table 3. Mitochondrial genomes were aligned using MUSCLE50. All ambiguous regions were excluded from the analyses to avoid false hypotheses of primary homology. IQTree was used to infer the phylogenetic tree (ML) with 1000 boostrap replicates after partition model selection42,43.
Divergence time estimation
Given the extensive admixture during the evolutionary history of Podarcis species, we estimated the lineage divergence times based on genomic regions for which local trees were concordant with the consensus phylogeny (resulting in 1.24 million SNVs, 4.37% of the WGS dataset). A relaxed clock45 model in MCMCtree from the package PAML51 was used to estimate divergence times. For MCMCTree, the calibration constraints were specified with soft boundaries by using 0.025 tail probabilities above and below the limit in the built-in function of MCMCtree. The independent rate model (clock = 2) was used to specify the rate priors for internal nodes. The MCMC run was first executed for 10,000,000 generations as burn-in and then sampled every 150 generations until a total of 100,000 samples were collected. Two MCMC runs using random seeds were compared for convergence, and similar results were found. For reference, we also estimated the divergence times based on the entire WGS dataset and on mitochondrial DNA, which yielded qualitatively identical, and quantitatively very similar, results (Supplementary Fig. 23).
Estimation of species diversification and demographic history
We estimated the diversification rate through time using a Lineage Through Time (LTT) curve and a sliding-window estimation18. Shifts in diversification rates were inferred using ML utilizing treePar52 and Bayesian inference using RevBayes53. Pairwise sequentially Markovian coalescence (PSMC) analysis54 was applied to estimate the demographic history of each genome with the command “psmc -N25 -t15 -r5 -p4 + 25*2 + 4 + 6” (Supplementary Figs. 24–27). Mutation rates were estimated by r8s55, and the generation time was set to two years.
Admixture and introgression analysis
We first applied the standard ABBA-BABA test (Patterson’s D-statistics) using the qpDstat command in AdmixTools19, and considered all triplets of the WGS tree, using A. bedriagae as the outgroup. We assessed significance through a block-jackknifing approach as implemented in AdmixTools, and applied a Bonferroni correction to assign significance at the 95% confidence level. In addition, we phased the genomes for each sample using BEAGLE 4.156 with a uniform recombination map and options “-x 1000000 -y 200000 -z 1000” and then used the chromopainter software in the fineSTRUCTURE package20 to calculate the ‘co-ancestry matrix’—a summary of nearest-neighbour haplotype relationships that is an indication of admixture and introgression.
We conducted phylogenetic network analyses using phyloNet21 to infer reticulation events among species. Due to computational limitations, we were unable to analyze the whole dataset at once. We therefore divided the samples into different groups, including (1) the major clades, (2) the Balkan group, (3) the Iberian group, and (4) P. muralis together with Iberian species. We made use of high-quality local trees with mean bootstrap >80, extracted 2000 random trees per run with a chain-length of 10,000,000 and a burn-in of 5,000,000 in the MCMC_gt module. We used 100 independent iterations for each run, and extracted all output networks with more than 50% posterior probability, and summarized the results by generating a correlation matrix of those networks based on Luay Nakhleh’s metric of reduced phylogenetic network similarity8. In addition, we also used the Infer_Network_MPL module to infer reticulation events based on maximum pseudo-likelihood by setting maximum reticulations of 5 for 50 iterations.
To validate the phylogenetic networks, we further fitted the evolutionary scenarios by admixture models for subsets of species or lineages related to introgression events using a combination of linear algebra and numerical optimization (Nelder–Mead) based on observed D-statistics19. The minimal error (F - f)t × S−1 × (F - f) is defined as the cost function of the model parameters, where F and f are the expected and observed D statistics, respectively, and S is the covariance matrix of f. The consensus tree topology was always fitted first, then the other inferred reticulations were gradually added. The best-fitting scenario was identified as the one with the minimum error. The analysis was performed using the R package ‘Admixgraph’22.
Based on the phylogenetic network, we used the program qpGraph from AdmixTools19 to fit the evolutionary history for all 34 Podarcis species or lineages together with introgressions. qpGraph optimizes the fit of a proposed admixture graph in which each node can be descended either from a mixture of two other nodes, or from a single ancestral node. The proportion of introgressed alleles was calculated by ƒ4-ratio tests.
The method quantifying introgression via branch lengths (QuIBL)8, based on the counts of triplet topologies of local ML trees, was used to quantify the frequency of introgression for species or lineages. For each set of trees we determined the likelihood that the branch lengths were best described by a simple exponential distribution as expected under ILS or a mixture of ILS and either introgression or speciation processes. The Bayesian information criterion (BIC) was used to identify the best-fitting model.
To confirm introgression of candidate genomic regions, we calculated the absolute genetic divergence (Dxy) between donor and recipient lineages, and the fd statistics with the same triplets as used for the calculation of D statistics based on 50 Kb fixed-window genomic regions. Dxy and fd were calculated using the genomics_general package (https://github.com/simonhmartin/genomics_general/), where a significantly lower Dxy and higher fd support introgression57 (tested by 1000 permutations).
Evolution of protein-coding genes
Signatures of selection were estimated by the ratios between non-synonymous and synonymous substitution rate (dN/dS) for protein-coding genes in codeml from the package PAML51 using runmode ‘-2’. For each recipient lineage with a history of introgression, we identified the coding sequences of genes from genomic regions derived from introgression events (foreground), and genes with a history consistent with the consensus phylogeny (background). We compared substitution rates between each of the 12 focal lineages that experienced a total of 15 introgression events (Supplementary Table 4) compared to the outgroup Archaeolacerta bedriagae. To test if the dN/dS ratios were significantly different between fore- and background genes, we used 1000 permutations.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
All sequence data generated in this study have been deposited in NCBI Sequence Reads Archive (SRA) with accession number PRJNA715201.
Hedrick, P. W. Adaptive introgression in animals: examples and comparison to new mutation and standing variation as sources of adaptive variation. Mol. Ecol. 22, 4606–4618 (2013).
Runemark, A., Vallejo-Marin, M. & Meier, J. I. Eukaryote hybrid genomes. PLoS Genet. 15, e1008404 (2019).
Abbott, R. et al. Hybridization and speciation. J. Evol. Biol. 26, 229–246 (2013).
Schumer, M., Rosenthal, G. G. & Andolfatto, P. How common is homoploid hybrid speciation? Evolution 68, 1553–1560 (2014).
Nieto Feliner, G. et al. Is homoploid hybrid speciation that rare? An empiricist’s view. Heredity 118, 513–516 (2017).
Mallet, J., Besansky, N. & Hahn, M. W. How reticulated are species? Bioessays 38, 140–149 (2016).
Marques, D. A., Meier, J. I. & Seehausen, O. A combinatorial view on speciation and adaptive radiation. Trends Ecol. Evol. 34, 531–544 (2019).
Edelman, N. B. et al. Genomic architecture and introgression shape a butterfly radiation. Science 366, 594–599 (2019).
McGee, M. D. et al. The ecological and genomic basis of explosive adaptive radiation. Nature 586, 75–79 (2020).
Malinsky, M. et al. Whole-genome sequences of Malawi cichlids reveal multiple radiations interconnected by gene flow. Nat. Ecol. Evol. 2, 1940–1955 (2018).
Kulmuni, J., Butlin, R. K., Lucek, K., Savolainen, V. & Westram, A. M. Towards the completion of speciation: the evolution of reproductive isolation beyond the first barriers. Philos. Trans. R. Soc. B 375, 20190528 (2020).
Eimer, G. H. T. Untersuchungen über das Variiren der Mauereidechse, ein Beitrag zur Theorie von der Entwicklung aus constitutionellen Ursachen, sowie zum Darwinismus (Nicolaische Verlags-Buchhandlung, 1881).
Kammerer, P. Der Artenwandel auf Inseln und seine Ursachen ermittelt durch Vergleich und Versuch an den Eidechsen der Dalmatinischen Eilande (Verlag von Franz Deuticke, 1926).
Andrade, P. et al. Regulatory changes in pterin and carotenoid genes underlie balanced color polymorphisms in the wall lizard. Proc. Natl Acad. Sci. USA 116, 5633–5642 (2019).
Harris, D. J. & Arnold, E. N. Relationships of wall lizards, Podarcis (Reptilia: Lacertidae) based on mitochondrial DNA sequences. Copeia 1999, 749–754 (1999).
Psonis, N. et al. Hidden diversity in the Podarcis tauricus (Sauria, Lacertidae) species subgroup in the light of multilocus phylogeny and species delimitation. Mol. Phylogenet. Evol. 106, 6–17 (2017).
Garcia-Porta, J. et al. Environmental temperatures shape thermal physiology as well as diversification and genome-wide substitution rates in lizards. Nat. Commun. 10, 4077 (2019).
Meredith, R. W. et al. Impacts of the Cretaceous terrestrial revolution and KPg extinction on mammal diversification. Science 334, 521–524 (2011).
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065 (2012).
Lawson, D. J., Hellenthal, G., Myers, S. & Falush, D. Inference of population structure using dense haplotype data. PLoS Genet. 8, e1002453 (2012).
Wen, D., Yu, Y., Zhu, J. & Nakhleh, L. Inferring phylogenetic networks using PhyloNet. Syst. Biol. 67, 735–740 (2018).
Leppala, K., Nielsen, S. V. & Mailund, T. admixturegraph: an R package for admixture graph manipulation and fitting. Bioinformatics 33, 1738–1740 (2017).
Coyne, J. A. & Orr, H. A. Speciation (Sinauer, 2004).
Schumer, M. et al. Natural selection interacts with recombination to shape the evolution of hybrid genomes. Science 360, 656 (2018).
Nelson, T. C. et al. Ancient and recent introgression shape the evolutionary history of pollinator adaptation and speciation in a model monkeyflower radiation (Mimulus section Erythranthe). PLoS Genet. 17, e1009095 (2021).
Wolff, J. N., Ladoukakis, E. D., Enríquez, J. A. & Dowling, D. K. Mitonuclear interactions: evolutionary consequences over multiple biological scales. Philos. Trans. R. Soc. B 369, 20130443 (2014).
Taylor, S. A. & Larson, E. L. Insights from genomes into the evolutionary importance and prevalence of hybridization in nature. Nat. Ecol. Evol. 3, 170–177 (2019).
Thompson, J. D. Plant evolution in the Mediterranean 2nd edn (Oxford Univ. Press, 2020).
Louca, S. & Pennell, M. W. Extant timetrees are consistent with a myriad of diversification histories. Nature 580, 502–505 (2020).
Seehausen, O. Hybridization and adaptive radiation. Trends Ecol. Evol. 19, 198–207 (2004).
Kagawa, K. & Seehausen, O. The propagation of admixture-derived adaptive radiation potential. Proc. Biol. Sci. 287, 20200941 (2020).
Capula, M. Natural hybridization in Podarcis sicula and P. wagleriana (Reptilia: Lacertidae). Biochem. Syst. Ecol. 21, 373–380 (1993).
Podnar, M., Haring, E., Pinsker, W. & Mayer, W. Unusual origin of a nuclear pseudogene in the Italian wall lizard: intergenomic and interspecific transfer of a large section of the mitochondrial genome in the genus Podarcis (Lacertidae). J. Mol. Evol. 64, 308–320 (2007).
Pinho, C., Kaliontzopoulou, A., Carretero, M. A., Harris, D. J. & Ferrand, N. Genetic admixture between the Iberian endemic lizards Podarcis bocagei and Podarcis carbonelli: evidence for limited natural hybridization and a bimodal hybrid zone. J. Zool. Syst. Evol. Res. 47, 368–377 (2009).
Yang, W. et al. Spatial variation in gene flow across a hybrid zone reveals causes of reproductive isolation and asymmetric introgression in wall lizards. Evolution 74, 1289–1300 (2020).
Runemark, A. et al. Variation and constraints in hybrid genome formation. Nat. Ecol. Evol. 2, 549–556 (2018).
Mellado, V. P. Les sargantanes de les Balears (Documenta Balear S.L., 2009).
Jamie, G. A. & Meier, J. I. The persistence of polymorphisms across species radiations. Trends Ecol. Evol. 35, 795–808 (2020).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Zhang, C., Rabiee, M., Sayyari, E. & Mirarab, S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinform. 19, 153 (2018).
Nguyen, L. T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Martin, S. H. & Van Belleghem, S. M. Exploring evolutionary relationships across the genome using topology weighting. Genetics 206, 429–438 (2017).
Bryant, D., Bouckaert, R., Felsenstein, J., Rosenberg, N. A. & RoyChoudhury, A. Inferring species trees directly from biallelic genetic markers: bypassing gene trees in a full coalescent analysis. Mol. Biol. Evol. 29, 1917–1932 (2012).
Bouckaert, R. et al. BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 15, e1006650 (2019).
Paradis, E. & Schliep, K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35, 526–528 (2019).
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18 (2017).
D’Onorio de Meo, P. et al. MitoZoa 2.0: a database resource and search tools for comparative and evolutionary analyses of mitochondrial genomes in Metazoa. Nucleic Acids Res. 40, D1168–D1172 (2012).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Stadler, T. Mammalian phylogeny reveals recent diversification rate shifts. Proc. Natl Acad. Sci. USA 108, 6187–6192 (2011).
Hohna, S. et al. RevBayes: bayesian phylogenetic inference using graphical models and an interactive model-specification language. Syst. Biol. 65, 726–736 (2016).
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Sanderson, M. J. r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 19, 301–302 (2003).
Browning, S. R. & Browning, B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097 (2007).
Martin, S. H., Davey, J. W. & Jiggins, C. D. Evaluating the use of ABBA-BABA statistics to locate introgressed loci. Mol. Biol. Evol. 32, 244–257 (2015).
A.K. thanks the FEDER Funds through the Operational Competitiveness Factors Program—COMPETE and National Funds through FCT—Foundation for Science and Technology within the scope of the project “PTDC/BIA- EVL/28090/2017-POCI-01-0145-FEDER-028090” for financial support in the collection of samples. D.S. thanks the Italian Ministry of Education, University and Research for financial support (PRIN project 2017KLZ3MA). We thank the following people for support in the collection of specimens: A. Castilla, A. Perera, B. Santos, B. Tomé, C. Rato, D. Rosado, E. Garcia-Muñoz, F. Jorge, G. Caeiro-Dias, G. Pérez i de Lanuza, I. Damas, I. Rocha, I. Tavares, J. Santos, M. Ribeiro, M. A. Carretero and V. Gomes. T.U. thanks the Knut and Alice Wallenberg foundation for support through a Wallenberg Academy fellowship, and project support from the Swedish Research Council (2014_04465 and 2017_03846) and the Crafoord Foundation (20160911 and 20190784).
Open access funding provided by Lund University.
The authors declare no competing interests.
Peer review information Nature Communications thanks Joana Meier and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Yang, W., Feiner, N., Pinho, C. et al. Extensive introgression and mosaic genomes of Mediterranean endemic lizards. Nat Commun 12, 2762 (2021). https://doi.org/10.1038/s41467-021-22949-9