Abstract
Previous studies of DNA sequence and karyotypic data have revealed high genetic diversity in the Physalaemus cuvieri – Physalaemus ephippifer species complex—a group of small leptodactylid frogs in South America. To date, seven major genetic lineages have been recognized in this group, with species delimitation tests supporting four to seven of them as valid species. Among these, only P. ephippifer shows heteromorphic sex chromosomes, but the implications of cytogenetic divergence for the evolution of this group are unknown. We analyzed karyotypic, mitochondrial DNA, and 3RAD genomic data to characterize a putative contact zone between P. ephippifer and P. cuvieri Lineage 1, finding evidence for admixture and karyotypic evolution. We also describe preliminary evidence for admixture between two other members of this species complex—Lineage 1 and Lineage 3 of P. cuvieri. Our study sheds new light on evolutionary relationships in the P. cuvieri – P. ephippifer species complex, suggesting an important role of karyotypic divergence in its evolutionary history and underscoring the importance of hybridization as a mechanism of sex chromosome evolution in amphibians.
Similar content being viewed by others
Introduction
Speciation is a central topic in evolutionary biology, and hybridization between divergent lineages may impact speciation in several ways (for reviews, see Refs.1,2,3,4). Briefly, hybridization can break down species boundaries, counteracting speciation (e.g. Ref.4). Alternatively, if hybrids have reduced fitness, natural selection may favor interspecific divergence and reproductive isolation through reinforcement of species boundaries5,6. In some intermediate cases, backcrossing of viable hybrids with parental species may result in introgression (e.g. Ref.7). Finally, hybridization can generate a hybrid lineage that is reproductively isolated from its parental taxa, thus originating a new species through hybrid speciation (e.g. Ref.8).
As hybridization generates admixed genomes, it can blur species boundaries, thus representing a potential problem for species delimitation and taxonomy (for a review, see Ref.9). Although rival species concepts have been debated for decades10,11,12, a common element of most contemporary concepts is an emphasis on the independent evolutionary trajectory of each species13. In this sense, species can be recognized as separately evolving lineages, following the “general lineage concept of species” (Ref.14; but see Ref.11), and genetic analyses have been commonly incorporated in species delimitation studies of diverse organisms (e.g. Ref.15,16,17). However, because hybridization and introgression are more common than previously assumed (for reviews, see Refs.1,9,18,19), the identification of genetic lineages and the assessment of their reproductive isolation are often challenging, necessitating careful analysis of comprehensive multilocus datasets to discern the evolutionary processes at play. For instance, cytonuclear discordance (conflicting genetic patterns in nuclear vs mitochondrial markers) is common in hybridization and introgression scenarios (for a review, see Ref.20), but can also result from incomplete lineage sorting in the absence of gene flow19,21. Thus, specific analyses—such as the ABBA-BABA test, which identifies excess allele sharing between taxa due to hybridization22—have been widely used to assess the relative roles of gene flow versus incomplete lineage sorting (e.g. Ref.23,24).
Here, we investigated the Physalaemus cuvieri – Physalaemus ephippifer species complex, a group of South American leptodactylid frogs that exhibits high genetic diversity and unresolved taxonomic issues25. Based on DNA sequences (mitochondrial DNA and 3RAD markers) and karyotypes, Nascimento et al.25 recognized seven major genetic lineages in this group, and species delimitation tests based on DNA sequences supported four to seven of these lineages as potential species. One of the uncertainties regarding species delimitation in this group involved a clade composed of P. ephippifer and Lineage 1 of P. cuvieri sensu lato (hereafter L1). The delimitation test based on 3RAD markers (BPP test) recovered P. ephippifer and L1 as distinct species, while the bPTP test based on mitochondrial DNA markers provided only partial support for the recognition of both lineages as different species25. However, this previous study was limited by relatively sparse geographic sampling and the absence of morphological or acoustic data, thus prompting further work to robustly characterize species boundaries.
Interestingly, heteromorphic sex chromosomes Z and W are present in P. ephippifer26 but not in L1 or in any other lineage in the species complex25,27. The W chromosome of P. ephippifer differs from its Z chromosome mainly through the presence of an additional nucleolar organizer region (NOR) and a distal heterochromatic band in the short arm26, but also by the presence of a larger site enriched in the satellite DNA PcP190 in the long arm28. Chromosome 9 of the L1 karyotype has been inferred to be homeologous to the sex chromosomes of P. ephippifer based on the presence of NORs that coincide with heterochromatic C-band enriched in the repetitive DNA PepBS29. This inference was further supported by the detection of a pericentromeric region of the short arm of chromosome 9 using a probe constructed from the microdissection of the pericentromeric region of the long arm of the P. ephippifer Z chromosome (the Zqper probe)29. In addition to the intrachromosomal location of the region detected by the Zqper probe, chromosome 9 of L1 differs from both the Z and W chromosomes of P. ephippifer by having a smaller short arm29. Although chromosome 9 of L1 is polymorphic with respect to NOR size and number, no sex-related heteromorphism has been reported in this lineage27,29.
Sex chromosomes are known to play important roles in speciation, as the presence of distinct sex chromosome systems may lead to reproductive barriers30,31,32,33. However, the extent of hybridization between P. ephippifer and L1 and the potential role of cytogenetic differences in their divergence remain unexplored. To address this, we analyzed specimens from sites located near the distribution boundaries of P. ephippifer and L1 and near their putative contact zone using karyotypic, mitochondrial, and 3RAD genomic data.
Results
Our analyses included 37 specimens from the geographic area of primary interest in this study, which comprises the distribution boundaries of P. ephippifer and L1 and their potential contact zone. This sampled area includes six localities, numbered 1–6 in Fig. 1, respectively: São Pedro da Água Branca (SPAB, admixed P. ephippifer x L1), Vila Nova dos Martírios (VNM, admixed P. ephippifer x L1), São Francisco do Brejão—Trecho Seco (TS, admixed P. ephippifer x L1), Imperatriz (Imp, admixed P. ephippifer x L1), Marabá (Mar, assigned to L4), and Parauapebas (Par, assigned to L4). Additionally, we included individuals from Vila Bela da Santíssima Trindade (VBST, site 8 in Fig. 1), a locality situated in west-central Brazil, Balsas (Bal, site 7 in Fig. 1), and Pirenópolis (Pir, site 9 in Fig. 1). We also expanded the specimen sampling of P. ephippifer, L1, L2, and L3 to 13, 27, 26, and 9 individuals, respectively. For detailed information on samples used in each analysis, refer to Supplementary Table S1.
Mitochondrial DNA analyses
In our phylogenetic analysis of the mtDNA dataset, we recovered all major genetic lineages previously recognized in the P. cuvieri – P. ephippifer species complex, including P. ephippifer, Physalaemus sp. (Western Pará and Viruá clades), L1 (composed of L1A and L1B subclades), L2, and L3 (Fig. 2A; Supplementary Fig. S1). At SPAB, VNM, and TS, we found two distinct haplogroups in syntopy (M1 and M2). The M1 haplogroup included sequences of nine specimens from SPAB, three from VNM, four from TS, and all specimens from Imp and Bal. This M1 haplogroup was nested within the L1B clade. In contrast, the M2 haplogroup included sequences of 10 specimens from SPAB, 1 from VNM, and 1 from TS, and was sister to the P. ephippifer clade (Fig. 2B). We estimated a genetic distance of 1.12% between M1 and M2 haplogroups using the MVZ59–16Sbr fragment, and of 1.39% using only the 16Sar–16Sbr fragment (Table 1).
At locality Bal, we found haplotypes typical of L3 in addition to haplotypes of L1 (Fig. 2, Supplementary Figs. S1, S2). Six specimens from this locality clustered within the L1 clade, while four other specimens were nested in the L3 clade, together with specimens from Porto Nacional (State of Tocantins) and Paranã (State of Tocantins) (Fig. 2). It is worth noting that in this analysis, the L3 clade was sister to the specimen from Vila Bela da Santíssima Trindade (VBST) (State of Mato Grosso). The mitochondrial haplotype found in VBST, however, was notably divergent from the remaining sequences in L3, as shown by the phylogenetic analysis (Fig. 2A), the haplotype network (Supplementary Fig. S2), and the genetic distance analyses (Table 1).
Finally, mtDNA analyses revealed a previously unrecognized lineage composed of specimens from Mar and Par, which we refer to as Lineage 4 (L4) (Fig. 2). The L4 clade was strongly supported (bootstrap value = 99%), as was the clade composed of L4, L1 (including the M1 haplogroup), and P. ephippifer (including the M2 haplogroup) (bootstrap value = 100%), (Fig. 2). L4 was inferred to be sister to the clade composed of L1 and P. ephippifer, although this latter clade was not supported by the bootstrap analysis (Fig. 2). We estimated a genetic distance of 1.48% between L4 and L1 (including L1A and L1B) using the MVZ59–16Sbr fragment and of 1.43% using only the 16Sar–16Sbr fragment. Between L4 and P. ephippifer, we found a divergence of 1.27% in the MVZ59–16Sbr fragment and of 1.43% in the 16Sar–16Sbr fragment (Table 1).
3RAD data assembly and analyses
Our full 3RAD assembly included > 230,000 loci. Of these, > 200,000 loci were variable and contained a total of > 2.7 million SNPs. This final assembly included between 13,468 and 45,449 loci per individual (Supplementary Table S2).
The maximum-likelihood cladograms inferred from the full 3RAD dataset and the dataset that included one random SNP per locus were quite congruent, with the few differences regarding relationships among individuals within specific lineages. In both 3RAD cladograms, we recovered samples from VBST, then L2, and then L3 as sequentially sister to the remaining ingroup samples (Fig. 3). In contrast to the analysis based on mtDNA, analyses of the 3RAD dataset recovered all the specimens from Bal (n = 7) as a clade, which was sister to a clade consisting of L1, L4, P. ephippifer, and all samples from the putative contact zone between L1 and P. ephippifer. Focusing on this contact zone, our analysis inferred a clade composed of samples from TS (n = 5), a clade composed of samples from Imp (n = 2), and a clade composed of samples from SPAB (n = 19) and VNM (n = 4)—with this last clade recovered as sister to P. ephippifer. Similar to the mtDNA results, the phylogenetic analysis of the 3RAD dataset recovered a clade corresponding to L4 (i.e. the samples from Mar and Par), a clade corresponding to L3, and a clade corresponding to Physalaemus sp. (Western Pará and Viruá clades). Another discordance between mtDNA and 3RAD data involved the placement of SMRP 92.307 from locality Pir, which did not group with L2 but was recovered as sister of L3 instead (Fig. 3; Supplementary Fig. S3).
The NeighborNet network inferred from 3RAD data (Supplementary Fig. S4) showed results that were largely concordant with those from the RAxML phylogenetic inference. For example, all samples from Bal clustered with each other but separate from L1. Additionally, we recovered the same major groups within the contact zone between P. ephippifer and L1, with samples from SPAB and VNM being closer to P. ephippifer than the samples from TS and Imp. Furthermore, we recovered SMRP 92.307 from Pirenópolis as separate from L3, which was concordant with our RAxML phylogenetic inference (Fig. 3; Supplementary Fig. S3). However, in the phylogenetic network, we recovered SMRP 92.307 with SMRP 92.308 (from Paranã), which was clustered in L3 in the RAxML analysis (Fig. 3; Supplementary Fig. S3).
In the first PCA (i.e. the analysis that included all samples), PC1 separated samples from Physalaemus sp. (Western Pará and Viruá clades) from all other samples and explained 24.4% of all variation. The second PC axis (explaining 9.4% of all variation) separated the remaining samples into two major groups: one consisting of the outgroup P. centralis, VBST, L2, and L3, and one consisting of L4, P. ephippifer, L1, and the contact zone between P. ephippifer and L1 (Supplementary Fig. S5). In the second PCA, which focused on this contact zone, the first PC axis separated L1 from P. ephippifer and explained 12.2% of all variation (Fig. 4). All samples from the contact zone fell at intermediate positions on this first PC axis, with those from SPAB and VNM being closest to P. ephippifer, those from Imp being closest to L1, and those from TS being intermediate in position. The second PC axis explained 7.6% of variation and mostly separated samples from the contact zone from those in both “parental” lineages (i.e. P. ephippifer and L1). Finally, in the third PCA, which focused on evaluating evidence for gene flow between L3 and samples from Bal (in L1), the first PC axis (24.2% of variation) separated P. ephippifer from L3, with all samples from L1 occupying intermediate positions on this axis (Fig. 5). Notably, samples from Bal showed positions on this PC axis closer to L3 than any other samples in L1. The second PC axis (13.9% of variation) separated L1 from the other two groups.
In the first set of Structure analyses (Fig. 4), we found the strongest support for K = 3 and plotted posterior probabilities of population assignment (hereafter “assignment probabilities”) for K = 2–4. At K = 2, samples from P. ephippifer had uniformly high assignment probabilities to one cluster and samples from L1 had uniformly high assignment probabilities to the second cluster; samples from the contact zone had intermediate assignment probabilities to each cluster, with those from SPAB and VNM having relatively higher assignment probabilities to the cluster corresponding to P. ephippifer, those from Imp having relatively higher assignment probabilities to the cluster corresponding to L1, and those from TS falling in-between. At K = 3, samples from SPAB and VNM had high assignment probabilities to a third cluster, with samples from TS and Imp having assignment probabilities split between this third cluster and the cluster corresponding to L1. The analyses at K = 4 appear similar to those at K = 3 and failed to reveal any additional, meaningful population structure (Fig. 4). In the second set of Structure analyses (Fig. 5), we found some support for K = 2 and K = 4, and we plotted assignment probabilities fork K = 2–4. At K = 2, most samples from L1 had uniformly high assignment probabilities to one cluster and samples from L3 had uniformly high assignment probabilities to the second. Some samples from Bal had low, but non-zero, assignment probabilities to L3. Analyses at K = 3 and K = 4 revealed additional population structure among samples from Bal, with samples SMRP 92.233 and SMRP 92.234 appearing to be the most distinct (Fig. 5).
The ABBA-BABA tests focused on the contact zone between L1 and P. ephippifer revealed statistically significant and negative D-statistics (i.e. indicating more L1 alleles in these samples than in P. ephippifer) for all analyzed sampling sites. These D-statistics varied from − 0.157 (p < 0.01) for Imp to − 0.088 for VNM (p < 0.01) (Fig. 4D). The ABBA-BABA test focused on samples from Bal also revealed a significant negative D-statistic of − 0.214 (p < 0.01) (i.e. indicating more L3 alleles in Bal than in other populations from L1) (Fig. 5).
Cline analyses of the contact zone between P. ephippifer and L1
In the cline analyses of the contact zone between P. ephippifer (hybrid index = 0) and L1 (hybrid index = 1) based on 3RAD data, we estimated mean hybrid indexes of 0.28 ± a standard deviation of 0.02 for SPAB, 0.29 ± 0.01 for VNM, 0.33 ± 0.2 for TS, and 0.42 ± 0.02 for Imp. All estimates of interspecific heterozygosity were low (range = 0.02–0.13) (Fig. 4E). In the geographic cline fit to the hybrid index, we estimated a cline center of 317 km (95% credible interval [CI] 188–596 km) and a cline width of 713 km (CI 219–1,800 km). In the geographic cline fit to the mtDNA haplotype frequencies, we estimated a cline center of 163 km (CI 102–253 km) and a cline width of 319 km (CI 100–1219 km) (Fig. 6).
Cytogenetic analyses of specimens from SPAB and VNM
All of the analyzed individuals from SPAB and VNM had a 2n = 22 karyotype, composed of five pairs of metacentric chromosomes and six pairs of submetacentric chromosomes (Fig. 7A–C). Chromosomes 7, 8, and 9 carried nucleolus organizer regions (NORs), as revealed by silver impregnation using the Ag-NOR method in metaphases of 13 individuals (Fig. 7D). In all of the analyzed specimens, chromosome pair 7 had interstitial NOR in the long arm. In four males and one female, pair 7 was heteromorphic with respect to the NOR size (Fig. 7).
Regarding chromosome pair 8, the presence of one interstitial NOR in the long arm of both homologues was the most frequent condition. However, in three of the 13 analyzed specimens (i.e. SMRP 92.1, SMRP 92.327, and SMRP 92.338), a single NOR-bearing chromosome 8 was present. In this latter specimen (SMRP 92.338), the NOR-bearing chromosome 8 had an additional distal NOR, also in the long arm (Fig. 7D). The variation regarding chromosome 8 was not sex-related.
In contrast, chromosome pair 9 was heteromorphic regarding the number of NORs exclusively in females, suggesting the presence of a ZZ/ZW chromosome system. In the ten males for which a NOR-pattern could be conclusively identified, both chromosomes 9 had only two NORs in the long arm (one interstitial and one terminal), while in the three females, one homologue of pair 9 had an additional distal NOR in the short arm (Fig. 7D).
The C-banding revealed heterochromatic bands in all centromeric regions and an interstitial band in the short arm of chromosome 5. A faint C-band adjacent to the NOR in chromosome 8 could be seen in some metaphases (Fig. 7C). Also, the NORs located in the long arm of chromosomes 9 (tentatively identified as Z and W chromosomes and hereafter referred to as such) coincided with heterochromatic bands (Fig. 7B,C). In addition, the Z chromosome had pericentromeric C-band in the short and long arms, whereas the W chromosome had a pericentromeric C band in the long arm (Fig. 7C). In the C-banded metaphases stained with DAPI, pericentromeric bands were visible in the short arm of chromosome 7 and in the long arm of chromosome 8 (Fig. 7B).
Using the chromosome mapping of the PepBS satellite DNA, we detected sites that coincided with all the NORs revealed by the Ag-NOR method. Accordingly, we documented variation in the size and number of PepBS sites (Fig. 8). Moreover, we noted a difference in probe signal intensity, as the sites in chromosome 9 were much brighter than those in chromosomes 7 and 8 (Fig. 8).
Using in situ hybridization with the 8p probe, which previously detected the sex chromosomes of P. ephippifer and chromosome 9 of L129, we documented a pericentromeric region of the short arm of both Z and W chromosomes of specimens from SPAB (Fig. 9). We observed size heteromorphism of this chromosomal region, but it was not related to sex (Fig. 9B).
Cytogenetic analyses of specimens from Balsas
The three analyzed males had a 2n = 22 karyotype, which was composed of five pairs of metacentric chromosomes and six pairs of submetacentric chromosomes (Fig. 10). The Ag-NOR method revealed NORs in chromosomes 8 and 9 in all the specimens, with chromosome 9 being polymorphic in relation to the presence of an interstitial NOR besides a distal one (Fig. 10D,E). In addition, extra pericentromeric NORs were present in one homologue of pair 10 and one homologue of pair 7 in the specimens SMRP 92.226 and 92.232, respectively (Fig. 10D,E).
The hybridization of the PepBS probe to the karyotype of the SMRP 92.226 detected all the NORs, and the signal probe coincident with the NORs in chromosome 9 was brighter than those observed in chromosomes 8 and 10 (Fig. 10C, Supplementary Fig. S6). Although the NOR of chromosome 10 was similar in size to those of chromosomes 8 and 9, the probe signal coincident with this NOR was much smaller (Fig. 10C, Supplementary Fig. S6). The PepBS probe also revealed a small signal near the centromere of chromosome 10 which had no NOR detected by the Ag-NOR method (Fig. 10C, Supplementary Fig. S6).
All the detected NORs, especially those of chromosomes 7, 8, and 10, coincided with secondary constrictions in Giemsa and DAPI-stained metaphases (Fig. 10A,D,E) and the NORs in chromosome 9 also coincided with C-bands (Fig. 10B). In some metaphases, a faint C-band was noted adjacent to the NOR in chromosome 8. The C-banding also revealed the centromeric region of all chromosomes, an interstitial band in the short arm of chromosome 5, and a pericentromeric band in the long arm of chromosome pair 10 (Fig. 10B).
Discussion
Together, our analyses of mtDNA, 3RAD, and cytogenetic data from the P. cuvieri – P. ephippifer species complex revealed evidence of admixture between P. ephippifer and L1, which suggests an important role for historic contact and introgression in structuring the current distribution of genetic diversity—and potentially the evolution of sex chromosomes—in this group. In addition, our results provide evidence of admixture between L1 and L3, raise questions about the relationship between L2 and L3, and expand our knowledge about the diversity within this species complex, leading to the recognition of two new lineages (i.e. L4 and VBST) that warrant further investigation. We discuss each of these topics in greater detail and highlight opportunities for additional research below.
Admixture between P. ephippifer and L1: a potential driver of sex chromosome evolution
With respect to P. ephippifer and L1, we found some discordant results between the phylogenetic analyses based on mtDNA and 3RAD data. Cytonuclear incongruence is typically expected either in scenarios involving hybridization or incomplete lineage sorting20,21,34. However, our ABBA-BABA tests support hybridization as the most likely explanation for the patterns we observed. Accordingly, our findings indicate that the region encompassing São Pedro da Água Branca, Vila Nova dos Martírios, São Francisco do Brejão—Trecho Seco, and Imperatriz (the SPAB-VNM-TS-Imp region) may represent a contact zone between the sister lineages P. ephippifer and L1.
The frequency of mtDNA haplotypes, the topology of the 3RAD phylogeny and phylogenetic network, the results of Structure analyses and PCAs, and the cline analyses are all consistent with the greatest P. ephippifer ancestry in SPAB (located at the western side of the inferred contact zone) and the greatest L1 ancestry in Imp (located at the southeastern limit of this region). If evidence of gene flow from ABBA-BABA tests reflected limited but ongoing hybridization between individuals of the two parental lineages (i.e. P. ephippifer and L1), we would expect to find a continuum of ancestry proportions in each site—reflecting F1 hybrids and early-generation backcrosses. Instead, we found similar ancestry proportions in all individuals collected at each site, and no evidence of pure parental individuals of P. ephippifer or L1 in any locality within this contact zone. The very low interspecific heterozygosity in the samples from SPAB, VNM, TS, and Imp supports this inference, with no F1 hybrid identified in the 3RAD analyses. Our geographic cline analyses of both 3RAD and mtDNA data suggest a west-to-east change in ancestry in this region, but a denser geographic sampling is still necessary for a proper analysis of the extension of the contact zone between P. ephippifer and L1.
Our cytogenetic data also did not show evidence of ongoing hybridization between parental P. ephippifer and L1. Because the karyotypes of P. ephippifer and L1 can be easily distinguished from each other, particularly by the presence of heteromorphic sex chromosomes in P. ephippifer26,27, we would expect to easily recognize first-generation hybrids if they exist. The karyotypes of such hypothetical F1 hybrids would include, for example, a heteromorphic chromosome pair 8, composed of only one NOR-bearing homologue (inherited from L1), and a pair 9 composed of one chromosome 9 from L1 and one sex chromosome (Z or W) from P. ephippifer. Instead, our cytogenetic analyses did not reveal karyotypes formed by the simple combination of chromosomes currently found in the two parental lineages. On the contrary, our analysis detected unique characteristics of the specimens collected in this hybrid zone, including particular Z and W sex chromosomes, which karyotypically distinguish these specimens from both P. ephippifer and L1. Also, in the specimens from SPAB and VNM, chromosome pair 7 bears a NOR, which is a feature not found in any P. ephippifer or L1 specimens analyzed to date.
Therefore, our data show the cytogenetic distinctiveness of the specimens from SPAB and VNM. This finding is consistent with the PCA of 3RAD markers—which clearly separated the samples from the SPAB-VNM-TS-Imp region—as well as with the Structure analysis—which showed the strongest support for K = 3 when comparing 3RAD markers of P. ephippifer, L1, and specimens from the SPAB-VNM-TS-Imp region. Whether these populations are reproductively isolated (e.g. through genomic incompatibility or behavior) remains unclear and warrants further research.
The origin of new species through hybridization, a process known as hybrid speciation, has long been studied18,35,36. Change of ploidy3 and chromosomal rearrangements are known to play major roles37 in allopolyploid and homoploid speciation, respectively. Among anurans, the emergence of new species involving polyploid lineages has been more commonly reported (e.g. Refs.38,39,40), but there is also evidence of hybrid speciation unrelated to polyploid species. An interesting case was reported for Litoria genimaculata (currently Ranoidea genimaculata41), which exhibited two divergent lineages that hybridized in two distinct contact zones42. In one of them, significant premating isolation of the hybrid lineage was found42, supporting the description of this lineage as a new species43. Another intriguing case refers to the ranid frog Glandirana rugosa, in which new lineages have emerged from the contact and hybridization of groups with diverging sex chromosomes44,45,46,47,48.
A detailed analysis of the sex chromosomes found in specimens from SPAB and VNM shows that their W chromosome is similar to that of P. ephippifer26 in having NOR in the short arm but differs in not having an evident terminal heterochromatic block in the short arm, adjacent to the NOR (Fig. 11). However, the long arm of the W chromosome in specimens from SPAB and VNM has NORs detectable by C-banding (Fig. 11), differentiating it from the long arm of the W chromosome of P. ephippifer, which bears NORs not coincident with C-bands26. Finally, the 8p probe detected a pericentromeric region in the short arm of the W chromosome of specimens from SPAB and VNM, while it mapped pericentromerically to the long arm of the W chromosome of P. ephippifer26 (Fig. 11).
The Z chromosome of specimens from the contact zone is more similar to chromosome 9 of L1 than to the Z chromosome of P. ephippifer (Fig. 11). The Z chromosome of specimens from SPAB and VNM and chromosome 9 of L127,29 have a pericentromeric C-band in the short arm, which coincides with the region detected by the 8p probe, and NORs in the long arm, which coincides with C-bands and PepBS sites. Another similarity between the karyotype of L1 and that found in SPAB and VNM is the presence of NOR-bearing chromosomes 8.
Based on the abovementioned comparative analysis of the sex chromosomes of P. ephippifer and their homeologous chromosome 9 of L1, the inferred scenario of admixture between P. ephippifer and L1 makes it highly plausible that these chromosomes have undergone recombination in the past. Given their similarity, it is reasonable to posit that 9p of L1 is homologous to Wq of P. ephippifer, and 9q of L1 to Wp of P. ephippifer (Fig. 11). Consequently, we hypothesize that the W chromosome from SPAB-VNM may have resulted from recombination between chromosome 9 of L1 and the W chromosome of P. ephippifer (Fig. 11). Hypothetical recombination between these chromosomes would result in a chromosome with (1) a pericentromeric block in the short arm detected by the 8p probe (as observed in 9p of L1 and Wq of P. ephippifer), (2) a distal NOR in the short arm that does not colocalize nor is adjacent to C-band (a condition also observed in Wq of P. ephippifer), and (3) NORs in the long arm that coincide with C-bands (as observed in 9q of L1) (Fig. 11). Under this scenario, admixture between P. ephippifer and L1 would have promoted the origin of a new sex chromosome system, composed of Z and W chromosomes with a lower level of differentiation than that observed between the sex chromosomes of P. ephippifer. Thus, gene flow (or interspecific hybridization) may have been an important driver of karyotypic evolution in this group of frogs (see further discussion below).
However, we cannot disregard alternative hypotheses in which the Z and W chromosomes currently found in SPAB and VNM might have arisen from multiple mutations/rearrangements occurring on ancestral chromosomes. Additional genomic studies would be valuable for identifying linkage groups within the W chromosomes of SPAB-VNM and P. ephippifer and chromosome 9 of L1 to further test these hypotheses. Additionally, the full extent of the contact zone, as well as the timing and duration of gene flow, should be addressed in future studies, which should include a denser geographic sampling—also comprising specimens from beyond the SPAB-VNM-TS-Imp region—and acoustic and morphological data. These data will also allow us to evaluate whether the populations in SPAB-VNM-TS-Imp represent a narrow hybrid zone facilitating ongoing (but limited) gene flow between P. ephippifer and L1 or these admixed populations are reproductively isolated from parental lineages.
Gene flow between L1 and L3 and between L2 and L3
In addition to the contact between P. ephippifer and L1, we found preliminary evidence of gene flow between two non-sister lineages—L1 and L3. In a previous study, delimitation tests based on mtDNA strongly supported L3 as a valid species, distinct from L125. However, in our analyses, from the ten specimens from Bal included in our mtDNA analyses, six had mitochondrial haplotypes that clustered within L1 and four had haplotypes that clustered within L3. Although all six specimens from Bal included in both the 3RAD and mtDNA analyses had mitochondrial haplotypes from L1, the topology of the 3RAD phylogeny and phylogenetic network and the results of the Structure analyses and PCA were all consistent with introgression from L3 into L1. Additionally, the ABBA-BABA test resulted in a D-statistic that was significantly negative, thus supporting the hypothesis of introgression as an explanation for the discordant inferences from mtDNA and 3RAD markers.
The cytogenetic analysis of specimens from Bal showed additional NORs in chromosomes 7 and 10. L3 is highly polymorphic regarding NOR location27 and NOR-bearing chromosomes 7 and 10 similar to those found in Bal were found in L3 (Supplementary File S1). Accordingly, we hypothesize that the NOR-bearing chromosomes 7 and 10 found in Bal have introgressed from L3. Therefore, the NOR polymorphism observed in Bal would have arisen from introgressive hybridization (see further discussion below). However, our small sample size and the limited geographic extent of our samples prevent us from drawing stronger conclusions about the timing or frequency of gene flow.
Finally, we found some preliminary evidence for gene flow between L2 and L3 in the discordance between the phylogenetic placement of a specimen from Pirenópolis (SMRP 92.307) in the mtDNA and 3RAD phylogenies. In the mtDNA phylogeny, we recovered this sample within L2 (Fig. 2), but it was sister to L3 in the 3RAD phylogeny (Fig. 3) and clustered separately with one sample from Paranã (SMRP 92.308) in the 3RAD phylogenetic network (Supplementary Fig. S4).
While secondary contact and gene flow between pairs of anuran species or lineages have been increasingly reported49,50,51,52,53, it is worth noting that admixture involving multiple lineages has also been documented54,55,56. When multiple divergent lineages come into contact, the extent of hybrid zones and the degree of admixture may vary among distinct contact zones. Therefore, comparative analyses of multiple contact zones can provide valuable insights into the evolutionary processes involved in species formation. For instance, Wen & Fu55’s study on the frog Odorrana margaretae revealed that one contact zone exhibited extensive admixture, whereas another zone occupied a narrow geographic area and showed limited admixture of parental lineages. In the case of the P. cuvieri – P. ephippifer species complex, the historical hybridization inferred between P. ephippifer and L1 potentially contributed to the emergence of a new karyotype (as discussed above). Conversely, we lack evidence indicating a similar impact from the inferred contact between L1 and L3 on karyotype divergence. Further characterization of each contact zone, including analyses of the direction, timing, and frequency of hybridization events, remains essential for a comprehensive study of this species complex. Such investigations may offer unique insights into the speciation process.
Additional genetic diversity unveiled in the P. cuvieri – P. ephippifer species complex
Our expanded geographic sampling also clarified some general aspects of the distribution of genetic diversity within the P. cuvieri – P. ephippifer species complex and raises new questions that warrant further study. For example, we demonstrated that L3, which was previously restricted to Porto Nacional25,57, also occurs in the region of Paranã.
In contrast, our analyses showed that the specimens from VBST, also located in Central Brazil, differ from those of Porto Nacional and Paranã. The mtDNA phylogeny suggested that VBST is sister to L3—a relationship that was not recovered in the phylogenetic analysis of the 3RAD dataset. In all the species delimitation tests reported previously25, L3 was recovered as a valid species. Here, we found approximately 3% of nucleotide divergence between VBST and L3 in the 16Sar–16Sbr fragment, a threshold value proposed by Fouquet et al.58 and Lyra et al.59 to flag candidate species of neotropical frogs. Therefore, further analyses are of fundamental importance to evaluate the taxonomic status of the VBST clade.
Finally, we recovered a clade composed of samples from Marabá and Parauapebas in the phylogenetic analyses of mtDNA and 3RAD data, which we refer to as Lineage 4 (L4). All analyses supported the close relationship between L4, L1, and P. ephippifer, but the precise phylogenetic relationships within this clade remain to be elucidated.
These findings and the abovementioned evidence of multiple contact zones reinforce the need for a comprehensive taxonomic revision of the P. cuvieri – P. ephippifer species complex. As anticipated25, it will be a challenging task due to formal issues regarding the original species descriptions—including the imprecise information on the type-locality for P. cuvieri Fitzinger 1826 and the fact that its type specimens are not included in recent type lists41—but also because it will need comprehensive morphological and acoustic analyses complementing genetic studies in an integrative framework.
Hybridization as a driver of chromosome evolution in anurans
Interspecific hybridization plays an important role in shaping karyotype diversity37,60. In scenarios where the hybrids are viable and fertile, backcrosses may lead to the introgression of chromosomal variants from one species to another, which may result in stable chromosomal polymorphisms61. In contrast, when hybrid incompatibility promotes an intrinsic postzygotic reproductive barrier (which might be complete or incomplete, depending on the severity of hybrid incompatibility), introgression is prevented and natural selection favors interspecific divergence, intensifying the reproductive isolation through reinforcement5,6. One interesting category of hybrid incompatibility is related to the presence of distinct sex chromosomes in the parental species, underscoring the importance of sex chromosomes in speciation30,31,32,33. In this scenario, interspecific differences in sex chromosomes may accumulate following reproductive isolation, but new sex chromosome systems might arise as a consequence of hybridization45,47,48.
In the species complex studied here, we found evidence supporting the involvement of natural hybridization between divergent lineages in two distinct phenomena: (1) the polymorphism of NORs (since the analyses of specimens from Bal suggested that NOR-bearing chromosomes from L3 were introgressed into L1); and (2) the origin of a new sex chromosome system (found in SPAB and VNM).
To date, the involvement of interpopulational hybridization in the origin of new sex chromosome systems in anurans had only been reported for Glandirana rugosa44,45,46,47,48. In this extraordinary case, six major geographic groups are recognized: two with male heterogamety and homomorphic sex chromosomes (the West-Japan and East-Japan groups, which are supposed to be original groups); two with female heterogamety and heteromorphic sex chromosomes (the ZW and Neo-ZW groups); one with male heterogamety and heteromorphic sex chromosomes (the XY group); and one with male heterogamety and homomorphic sex chromosomes (the Neo-West-Japan group)45,46,48. All these distinct sex chromosome systems would have emerged after hybridization between groups, with at least four hybrid zones where divergent groups would have secondarily contacted and hybridized45,47,48. During the evolution of these sex chromosome systems, a subtelocentric chromosome originated from a metacentric one (which gave rise to a heteromorphic condition) losses of W and Y chromosomes supposedly occurred in some of the contact zones, and new female-determining gene(s) emerged, promoting parallel transitions from male to female heterogamety45.
Cytogenetic analyses showed no differences between the karyotype found in the West-Japan group and that of the Neo-West-Japan group of Glandirana rugosa, two lineages that supposedly evolved from the hybridization between the West-Japan group and either the Neo-ZW or the XY group (both groups with heteromorphic sex chromosomes)48. Based on such evidence, Ogata et al.48 argued that the loss of W or Y chromosomes may have been involved in the process that secondarily originated a homomorphic sex chromosome system. Because molecular comparisons of the sex chromosomes of the West-Japan and Neo-West-Japan groups (including their sex-determining locus) remain to be done, whether the condition found in the Neo-West-Japan group is exactly a reversion to the original state or not is still an open question48. In either case, hybridization is involved in the origin of homomorphic sex chromosomes from a heteromorphic system, which led Ogata and colleagues to advocate that it is a mechanism that contributes to the prevalence of homomorphic sex chromosomes in anurans.
Here, we suggest that hybridization may have also played an important role in the origin of new sex chromosomes in the P. cuvieri – P. ephippifer species complex. The new ZZ/ZW sex chromosome system we found in specimens from SPAB and VNM shows a lower level of chromosome heteromorphism than found in P. ephippifer. We hypothesize that such a decrease in heteromorphism results from the recombination between chromosome 9 of L1 and chromosome W of P. ephippifer, after these lineages secondarily contacted and hybridized (Fig. 11). In this scenario, hybridization would be important to counteract sex chromosome differentiation, playing a role similar to that attributed to the male-to-female sex reversal in XX/XY systems in the fountain-of-youth model proposed by Perrin62. According to Perrin’s model, the occurrence of sex-reversed XY females in amphibians would increase the recombination between X and Y chromosomes, preventing sex chromosome differentiation and thus contributing to the prevalence of homomorphic sex chromosomes in these vertebrates. In this model, deleterious mutations accumulated in “old” Y chromosomes are lost by recombination, giving rise to “younger” Y chromosomes62. Therefore, we agree with Ogata et al.48 in proposing hybridization as an important force against sex chromosome differentiation and we suggest that hybridization could be considered an additional mechanism promoting recombination and rejuvenating sex chromosomes. In this sense, sex reversal in XX/XY systems and hybridization would be both considered fountains of youth.
Conclusion
We found evidence of admixture in multiple contact zones in the P. cuvieri – P. ephippifer species complex, suggesting that hybridization has promoted rapid karyotypic evolution in this group, contributing to NOR polymorphism and the emergence of a new sex chromosome system. Our results support previous hypotheses that hybridization may contribute to counter sex chromosome differentiation in anurans.
Methods
We studied an extended sample comprising five lineages of the P. cuvieri – P. ephippifer species complex, including specimens from a geographical area situated between previously documented localities of P. ephippifer, and lineages L1 and L3 of P. cuvieri sensu lato (Fig. 1). We generated mitochondrial sequences, 3RAD markers, and cytogenetic data from sampled specimens and combined these data with earlier datasets. Supplementary Table S1 lists the specimens used in each analysis and identifies new samples in this study. Details regarding each procedure and analysis are presented below.
Specimen and tissue collection
We collected new specimens of P. ephippifer (n = 9) and four lineages historically classified as P. cuvieri. Using informal lineage names as described in Lourenço et al.57, these four lineages are: Lineage 1A (L1A; n = 1), Lineage 1B (L1B; n = 17), Lineage 2 (L2; n = 4), and Lineage 3 (L3; n = 4) (Supplementary Table S1). We also collected 37 specimens from sites located between the known geographical distribution boundaries of P. ephippifer, L1, and L3, which included the municipalities of Imperatriz (Imp; n = 2), São Francisco do Brejão—Trecho Seco (TS; n = 5), Vila Nova dos Martírios (VNM; n = 4), São Pedro da Água Branca (SPAB; n = 21), Marabá (Mar; n = 1), and Parauapebas (Par; n = 4) (Fig. 1; Supplementary Table S1). We also sampled the municipality of Vila Bela da Santíssima Trindade (VBST) and included additional samples from Balsas (Bal)—a region located at the western limit of the known distribution area of L1 (Fig. 1; Supplementary Table S1) and formerly analyzed by Lourenço et al.57 and Nascimento et al.25. As an outgroup for 3RAD analyses, we also included specimens of P. centralis (Supplementary Table S1).
We collected all specimens under a permit issued by the Chico Mendes Institute for Biodiversity Conservation/Biodiversity Authorization and Information System (ICMBio/SISBIO) (process #32483). We anaesthetized all frogs using 2% lidocaine and dissected liver samples for DNA analyses and intestine samples for cytogenetic analysis. We obtained genomic DNA from liver samples following the protocol described by Medeiros et al.63. All experiments were performed in accordance with relevant guidelines and regulations and approved by the Committee for Ethics in Animal Use of the University of Campinas (CEUA/UNICAMP) (permit number 4802-1/2018). We deposited all specimens in the amphibian collection of the Zoology Museum “Prof. Adão José Cardoso” at the University of Campinas (ZUEC) or in the herpetological collection of the Federal University of Maranhão (HUFMA). When pertinent, this study was performed in compliance with the ARRIVE guidelines.
Mitochondrial DNA analyses
We generated mitochondrial DNA (mtDNA) sequence data, including the tRNA-Val gene and partial sequences of 12S and 16S rRNA genes, for 66 of the specimens collected. We isolated and amplified fragments of interest by PCR using the primer pairs MVZ5964 – Titus I65 and 12L1366 – 16Sbr67. We purified PCR products using a Wizard SV Gel and PCR Clean-up System (Promega) and sequenced amplicons using a BigDye Terminator kit (Applied Biosystems) in an ABI 3730xL DNA Analyzer automatic sequencer (Applied Biosystems). We conducted sequencing reactions with the aforementioned primers and also with primers MVZ5064, 16SL2a68, 16H1068, and 16Sar67. We edited the resulting nucleotide sequences using CodonCode Aligner software v.5.1.5 (www.codoncode.com) or BioEdit Sequence Alignment Editor software v.7.2.569, yielding concatenated segments delimited by primers MVZ59 and 16Sbr. All the obtained sequences were deposited in GenBank (see Supplementary Table S1 for accession numbers).
We aligned the newly generated sequences with those available on GenBank to create a dataset with mitochondrial sequences from 123 specimens from the P. cuvieri – P. ephippifer species complex (Supplementary Table S1). We also included sequences of representatives from the remaining eight species currently assigned to the P. cuvieri species group (i.e. P. albifrons, P. albonotatus, P. atim, P. centralis, P. cuqui, P. erikae, P. fischeri, and P. kroyeri), as well as sequences from representatives of each of the four other species groups in the P. cuvieri clade, and sequences of P. nattereri—a species of the P. signifer clade (which is the sister clade of the P. cuvieri clade)57—in our data matrix (Supplementary Table S1). We aligned all sequences with MAFFT v.770 (https://mafft.cbrc.jp/alignment/server/), using the G-INS-i option, resulting in a data matrix composed of 153 samples and 2324 characters.
We estimated a maximum-likelihood phylogeny from the mtDNA dataset using RAxML71 as implemented in the CIPRES Science Gateway v.3.372, employing the GTRCAT nucleotide substitution model. We assessed node support with a bootstrap algorithm, using different random seeds and 1000 replicates. We used P. nattereri as the outgroup to root the inferred phylogeny.
We estimated genetic distances between and within major lineages in the P. cuvieri – P. ephippifer species complex using MEGA v.7.073, treating alignment gaps and missing data as pairwise deletions. We calculated uncorrected genetic distances (p-distances) from the 12SrRNA-tRNA-val-16SrRNA fragment and also from the partial segment of 16S rRNA gene flanked by the primers 16Sar and 16Sbr, which is commonly used for evaluating interspecific variation in frogs58,59.
We obtained a list of 68 distinct mtDNA haplotypes related to the P. cuvieri-P. ephippifer species complex using DNAsp v.5.10.0174 and constructed a haplotype network using the median-joining method (MJN) in the software Network v.2.0.1.175. We ignored sites with missing data and alignment gaps in this analysis. Due to sequencing failure at the 3’ end of the 12S rRNA gene, specimens SMRP 92.228, SMRP 92.229, SMRP 92.242, SMRP 92.306, SMRP 92.307, and PS555 were excluded from this analysis, as they had 235 to 334 bp of missing data, leading to the loss of a common 151 bp segment.
3RAD library preparation, sequencing, and assembly
We generated genome-wide SNP data for 81 specimens of the P. cuvieri – P. ephippifer species complex and two specimens of P. centralis (Supplementary Table S1) using the 3RAD (triple-digest RADseq) protocol described by Bayona-Vásquez et al.76. In brief, we digested DNA samples from 57 specimens with the enzymes MspI, ClaI, and BamHI-HF, ligated the resulting DNA fragments to iTru adapters specific to MspI and BamHI-HF cutsites, and cleaned ligation products with SpeedBeads. We then conducted individual PCRs with iTru5 and iTru7 primers77 for these 57 ligation products plus 26 ligation products previously generated by Nascimento et al.25 from samples of the P. cuvieri – P. ephippifer species complex.
We normalized and pooled PCR products and size-selected this pool for 450–550 bp on a Pippin Prep. Finally, we combined these pooled, size-selected libraries for sequencing on an Illumina NovaSeq, targeting approximately 10 million paired-end, 150-basepair reads (PE 150) per sample.
We quality-filtered and assembled reads using ipyrad v.0.9.8478,79. We trimmed forward and reverse reads to 130 bp. We then used a clustering threshold of 0.85, used a minimum depth of six reads per locus, and required that a locus be present in at least four samples to be included in the final assembly. Raw sequence reads are available from the NCBI SRA (PRJNA997238).
3RAD data analyses
First, we inferred phylogenetic relationships from the full 3RAD dataset (n = 83 individuals, n = 2,757,035 SNPs) and the .u.snps.phy output file from ipyrad (i.e. using only one random SNP per locus; n = 83 individuals, n = 205,030 SNPs) using RAxML v.8.280 and RAxML71 in the CIPRES Science Gateway v.3.372, respectively. We used the GTRCAT model in both analyses and assessed node support with a bootstrap algorithm, using different random seeds and 100 replicates of the .u.snps.phy dataset. We used the outgroup P. centralis to root the inferred phylogeny.
Second, we inferred a distance-based phylogenetic network using Hamming distances and the NeighborNet algorithm81 as implemented in package phangorn v.2.11.182,83 in R v4.1.084. We also conducted this analysis using one random SNP per locus (n = 83 individuals, n = 205,030 SNPs).
Third, we conducted a series of principal component analyses (PCAs) using the ipyrad-analysis toolkit in ipyrad v.0.9.8478,79. For each analysis, we: (1) made “population maps” assigning each sample to groups defined by lineage (e.g. L1) or sampling site (e.g. VNM); (2) included only loci with data in at least 20% of individual samples within each group and in at least 50% of overall samples; (3) included all SNPs with a minor allele frequency (MAF) ≥ 0.05; and (4) imputed missing data using kmeans clustering, with k = 10. We conducted three separate PCAs. In the first PCA, we included all samples from our assembled dataset (n = 83 individuals, n = 20,224 SNPs). In the second PCA, we investigated potential evidence of gene flow between P. ephippifer and L1 by including only samples from these lineages and their putative contact zone (i.e. SPAB, VNM, TS, and Imp) (n = 53 individuals, n = 45,800 SNPs). In the third PCA, we evaluated potential evidence of gene flow between L1 and L3 by including only samples from these two lineages, plus samples of P. ephippifer—the closest relative of L1 (n = 27 individuals, n = 14,782 SNPs). For each analysis, we plotted the first two PC axes, jointly representing 20–38% variance in each dataset.
Fourth, we used the Bayesian population clustering program Structure v2.3.485 as implemented in the ipyrad-analysis toolkit in ipyrad v.0.9.84 to explore patterns of genetic structure and admixture78,79. We conducted two separate Structure analyses. In the first analysis, we assessed gene flow between P. ephippifer and L1 by including samples from these two lineages and their putative contact zone (i.e. SPAB, VNM, TS, and Imp). Previous studies have demonstrated that Structure analyses can be sensitive to missing data86 and that biological signal can be masked by the inclusion of singletons87. Thus, we filtered our SNPs to include: (1) only loci with data in at least 20% of individual samples within each group and in at least 80% of overall samples; and (2) only loci with a MAF ≥ 0.02 (to exclude singletons). This dataset consisted of 53 individuals and 3465 SNPs after filtering. We conducted 10 replicate runs of 300,000 MCMC steps (with 30,000 burnin steps) for K = 1–7 and visually evaluated relative support for the optimal number of clusters by plotting ΔK88. In the second analysis, we focused on evaluating gene flow between L1 and L3 by including only samples from those two lineages (n = 17 individuals, n = 1,772 SNPs). We conducted this analysis as described above, except that: (1) we included only SNPs with a MAF ≥ 0.04 (to exclude singletons, accounting for a smaller number of individuals in the dataset); and (2) we evaluated only K = 1–5.
Fifth, we conducted a series of targeted ABBA-BABA tests89,90 to provide independent assessments of introgression, again using the ipyrad-analysis toolkit in ipyrad v.0.9.8478,88. We conducted five analyses, four of which were designed to evaluate the relative contribution of P. ephippifer and L1 to the ancestry of samples from each site in the putative contact zone (i.e. SPAB, VNM, TS, and Imp). The fifth analysis was designed to evaluate evidence for introgression of alleles from L3 into samples from Bal relative to other sites from L1. For each analysis, we used samples of P. centralis as an outgroup, started with our full SNP dataset (n = 2,757,035 SNPs), and evaluated support for mean estimates of d-statistics by plotting the distributions from 1000 bootstrap replicates in R v.4.1.084. In each analysis, we conducted pooled tests in which we used all samples from a site and also separate tests for each constituent sample. For the first four analyses, the number of SNPs used in each test (i.e. the SNPs that fit either an ABBA or BABA pattern of allele sharing) varied from 2364–7575 for SPAB, from 3174–5923 for VNM, from 1907–3454 for TS, and from 2194–3114 for Imp; for the fifth analysis, the number of SNPs varied from 2146–4454. We plotted bootstrap replicates from both individual and pooled tests, and used Z-scores to calculate p-values and evaluate statistical significance.
Cline analysis of the putative contact zone between P. ephippifer and L1
To further characterize the putative contact zone between P. ephippifer and L1 we used a series of cline analyses. Using the packages vcfR v1.14.091 and SNPfiltR v1.0.192 in R v4.3.184, we filtered the full SNP dataset to include only samples from P. ephippifer and L1 (excluding the samples from Bal) and their putative contact zone (i.e. including SPAB, VNM, TS, and Imp) and meeting the following criteria (1) only biallelic SNPs; (2) only the first SNP per locus; (3) only SNPs with a minor allele count of ≥ 2; and (4) with ≤ 50% missing data (n = 46 individuals, n = 6815 SNPs). We then used code from93 to identify SNPs with fixed differences between P. ephippifer and L1 (n = 406 SNPs), and we used the package introgress v1.2.394 to estimate hybrid indexes and calculate interspecific heterozygosity, which we visualized through a triangle plot. Next, we created geographic clines separately from both these hybrid indexes and from mtDNA haplotype frequencies. First, we transformed our two-dimensional geographic coordinate data (i.e. the sampling site locations) into a linear scale. We did this by fitting a line between the westernmost sampling site for P. ephippifer (Belém) and the easternmost sampling site for L1 (Araruna) in this dataset, calculating the point at which a line drawn perpendicular from each other site would intersect this line, and using the package geosphere v1.5–1895 to calculate the linear distance between the westernmost sampling site (Belém) and these points. We then used the package hzar v0.2–996 to fit one geographic cline model using the hybrid indexes calculated in introgress and one using the mtDNA haplotype frequencies. In both cases, we estimated two parameters: the cline center and cline width. We fit each model using 1,000,000 MCMC steps with 1000 burnin steps.
Cytogenetic analyses
To assess karyotypic variation in the contact zone between P. ephippifer and L1, we conducted cytogenetic analyses of seventeen individuals from SPAB and four from VNM (Supplementary Table S1). We also included three specimens from Bal—the site with evidence of admixture between L1 and L3 (see “Results”). For full specimen details, see Supplementary Table S1.
For chromosomal analyses, we injected frogs intraperitoneally with 2% colchicine (0.02 mL/g body weight). After four hours, we euthanized them with a cutaneous administration of 2% lidocaine (an overdose of 50 mg/g body) and removed the intestines (protocol approved by CEUA/UNICAMP, permit number 4802–1/2018). We then made chromosome preparations following the methods described in King and Rofe97, with modifications described in Gatto et al.98. The preparations are deposited in the cytogenetic collection “Shirlei Maria Recco Pimentel” (SMRP, Universidade Estadual de Campinas, Brasil). We also used some chromosome preparations previously described by Quinderé et al.27.
Next, we sequentially subjected these chromosome preparations to 10% Giemsa staining, C-banding71, and silver impregnation using the Ag-NOR method99. We also stained some C-banded metaphases by DAPI (0.5 µg/mL) after removing the Giemsa stain in 50% acetic acid. Additionally, we used fluorescence in situ hybridization (FISH) to map the satellite DNA PepBS29. We labeled a cloned PepBS fragment, previously isolated from P. ephippifer and available from the SMRP cytogenetic collection, with digoxigenin-dUTP (Roche) by PCR with primers PepBS-F and PepBS-R29 and used it as probe in FISH assays.
For cytogenetic analyses of specimens from the secondary contact zone between P. ephippifer and L1, we also used FISH to map the segment Zqper/8p using a probe originally constructed by Gatto et al.29 from chromosome microdissection, which was available from the SMRP cytogenetic collection. This probe (named 8p probe) is specific to the pericentromeric region of the long arm of Z chromosome of P. ephippifer, the pericentromeric region of the short arm of chromosome 9 of individuals from L1, and an interstitial region of the short arm of chromosome 8 of Physalaemus sp. (Western Pará and Viruá clades)29.
In all FISH assays, the hybridization step followed the method of Viegas-Péquignot100. We detected probes by anti-digoxigenin conjugated to rhodamine (Roche) (0.06 µg/mL) and stained chromosomes with DAPI (0.5 µg/mL).
Data availability
Newly obtained sequences were deposited in GenBank (OR005428-OR005495; OR078427-OR078428). Raw sequence reads are available from the NCBI SRA (PRJNA997238).
References
Taylor, S. A. & Larson, E. L. Insights from genomes into the evolutionary importance and prevalence of hybridization in nature. Nat. Ecol. Evol. 3, 170–177 (2019).
Seehausen, O. et al. Genomics and the origin of species. Nat. Rev. Genet. 15, 176–192 (2014).
Abbott, R. et al. Hybridization and speciation. J. Evol. Biol. 26, 229–246 (2013).
Seehausen, O., Alphen, J. J. M. V. & Witte, F. Cichlid fish diversity threatened by eutrophication that curbs sexual selection. Science 277, 1808–1811 (1997).
Servedio, M. R. & Noor, M. A. F. The role of reinforcement in speciation: Theory and data. Annu. Rev. Ecol. Evol. Syst. 34, 339–364 (2003).
Coughlan, J. M. & Matute, D. R. The importance of intrinsic postzygotic barriers throughout the speciation process. Phil. Trans. R. Soc. B 375, 20190533 (2020).
Kumar, V. et al. The evolutionary history of bears is characterized by gene flow across species. Sci. Rep. 7, 46487 (2017).
Lamichhaney, S. et al. Rapid hybrid speciation in Darwin’s finches. Science 359, 224–228 (2018).
Naciri, Y. & Linder, H. P. Species delimitation and relationships: The dance of the seven veils. TAXON 64, 3–16 (2015).
De Queiroz, K. Species concepts and species delimitation. Syst. Biol. 56, 879–886 (2007).
Freudenstein, J. V., Broe, M. B., Folk, R. A. & Sinn, B. T. Biodiversity and the species concept—Lineages are not enough. Syst. Biol. 66, 644–656 (2016).
Padial, J. M. & De La Riva, I. A paradigm shift in our view of species drives current trends in biological classification. Biol. Rev. 96, 731–751 (2021).
Mayden, R. L. A hierarchy of species concepts: The denouement in the saga of the species problem. In Species: The Units of Diversity (ed. Mayden, R. L.) 381–423 (Claridge, M. F., Dawah, H. A. & Wilson, M. R., 1997).
De Queiroz, K. The general lineage concept of species, species criteria, and the process of speciation. In Endless Forms: Species and Speciation (ed. De Queiroz, K.) 57–75 (Howard, D. J. & Berlocher, S. H., 1998).
Doorenweerd, C. et al. A phylogenomic approach to species delimitation in the mango fruit fly (Bactrocera frauenfeldi) complex: A new synonym of an important pest species with variable morphotypes (Diptera: Tephritidae). Syst. Entomol. 48, 10–22 (2023).
Jeena, N. S. et al. Resolved and redeemed: A new fleck to the evolutionary divergence in the genus Scomberomorus Lacepède, 1801 (Scombridae) with cryptic speciation. Front. Mar. Sci. 9, 888463 (2022).
Main, D. C., van Vuuren, B. J., Tilbury, C. R. & Tolley, K. A. Out of southern Africa: Origins and cryptic speciation in Chamaeleo, the most widespread chameleon genus. Mol. Phylogenet. Evol. 175, 107578 (2022).
Dowling, T. E. & Secor, C. L. The role of hybridization and introgression in the diversification of animals. Annu. Rev. Ecol. Syst. 28, 593–619 (1997).
Mallo, D. & Posada, D. Multilocus inference of species trees and DNA barcoding. Phil. Trans. R. Soc. B 371, 20150335 (2016).
Sloan, D. B., Havird, J. C. & Sharbrough, J. The on-again, off-again relationship between mitochondrial genomes and species boundaries. Mol. Ecol. 26, 2212–2236 (2017).
Mirarab, S., Nakhleh, L. & Warnow, T. Multispecies coalescent: Theory and applications in phylogenetics. Annu. Rev. Ecol. Evol. Syst. 52, 247–268 (2021).
Durand, E. Y., Patterson, N., Reich, D. & Slatkin, M. Testing for ancient admixture between closely related populations. Mol. Biol. Evol. 28, 2239–2252 (2011).
Zarza, E. et al. Hidden histories of gene flow in highland birds revealed with genomic markers. Mol. Ecol. 25, 5144–5157 (2016).
Streicher, J. W. et al. Diversification and asymmetrical gene flow across time and space: Lineage sorting and hybridization in polytypic barking frogs. Mol. Ecol. 23, 3273–3291 (2014).
Nascimento, J. et al. Extensive cryptic diversity within the Physalaemus cuvieri–Physalaemus ephippifer species complex (Amphibia, Anura) revealed by cytogenetic, mitochondrial, and genomic markers. Front. Genet. 10, 719 (2019).
Nascimento, J., Quinderé, Y. R. S. D., Recco-Pimentel, S. M., Lima, J. R. F. & Lourenço, L. B. Heteromorphic Z and W sex chromosomes in Physalaemus ephippifer (Steindachner, 1864) (Anura, Leiuperidae). Genetica 138, 1127–1132 (2010).
Quinderé, Y. R. S. D. et al. Polytypic and polymorphic cytogenetic variations in the widespread anuran Physalaemus cuvieri (Anura, Leiuperidae) with emphasis on nucleolar organizing regions. Biol. Res. 42, 79–92 (2009).
Vittorazzi, S. E., Lourenço, L. B. & Recco-Pimentel, S. M. Long-time evolution and highly dynamic satellite DNA in leptodactylid and hylodid frogs. BMC Genet. 15, 111 (2014).
Gatto, K. P., Souza, L. H. B., Nascimento, J., Suárez, P. & Lourenço, L. B. Comparative mapping of a new repetitive DNA sequence and chromosome region-specific probes unveiling rearrangements in an Amazonian frog complex. Genome 64, 857–868 (2021).
Charlesworth, B., Coyne, J. A. & Barton, N. H. The relative rates of evolution of sex chromosomes and autosomes. Am. Nat. 130, 113–146 (1987).
Graves, J. A. M. Did sex chromosome turnover promote divergence of the major mammal groups?: De novo sex chromosomes and drastic rearrangements may have posed reproductive barriers between monotremes, marsupials and placental mammals. BioEssays 38, 734–743 (2016).
Presgraves, D. C. Sex chromosomes and speciation in Drosophila. Trends Genet. 24, 336–343 (2008).
Sæther, S. A. et al. Sex chromosome-linked species recognition and evolution of reproductive isolation in flycatchers. Science 318, 95–97 (2007).
Hibbins, M. S. & Hahn, M. W. Phylogenomic approaches to detecting and characterizing introgression. Genetics 220, iyab173 (2022).
Gross, B. L. & Rieseberg, L. H. The ecological genetics of homoploid hybrid speciation. J. Hered. 96, 241–252 (2005).
Mallet, J. Hybrid speciation. Nature 446, 279–283 (2007).
Rieseberg, L. H. Chromosomal rearrangements and speciation. Trends Ecol. Evol. 16, 351–358 (2001).
Haddad, C. F. B., Pombal, J. P. & Batistic, R. F. Natural hybridization between diploid and tetraploid species of leaf-frogs, genus Phyllomedusa (Amphibia). J. Herpetol. 28, 425 (1994).
Stöck, M. et al. A vertebrate reproductive system involving three ploidy levels: Hybrid origin of triploids in a contact zone of diploid and tetraploid paleartic green toads (Bufo viridis subgroup)*. Evolution 64, 944–959 (2009).
Betto-Colliard, C., Hofmann, S., Sermier, R., Perrin, N. & Stöck, M. Profound genetic divergence and asymmetric parental genome contributions as hallmarks of hybrid speciation in polyploid toads. Proc. R. Soc. B 285, 20172667 (2018).
Frost, D. R. Amphibian species of the world: An online reference. https://amphibiansoftheworld.amnh.org/index.php (2023).
Hoskin, C. J., Higgie, M., McDonald, K. R. & Moritz, C. Reinforcement drives rapid allopatric speciation. Nature 437, 1353–1356 (2005).
Hoskin, C. J. Description, biology and conservation of a new species of Australian tree frog (Amphibia: Anura: Hylidae: Litoria) and an assessment of the remaining populations of Litoria genimaculata Horst, 1883: Systematic and conservation implications of an unusual speciation event. Biol. J. Linn. Soc. 91, 549–563 (2007).
Miura, I. Sex determination and sex chromosomes in Amphibia. Sex. Dev. 11, 298–306 (2017).
Miura, I. & Ogata, M. Change of heterogametic sex from male to female: Why so easy in the frog?. Chromosome Sci. 16, 3–9 (2013).
Miura, I. et al. Identification of ancestral sex chromosomes in the frog Glandirana rugosa bearing XX-XY and ZZ-ZW sex-determining systems. Mol. Ecol. 31, 3859–3870 (2022).
Ogata, M., Shams, F., Yoshimura, Y., Ezaz, T. & Miura, I. W chromosome evolution by repeated recycling in the frog Glandirana rugosa. DNA 2, 172–184 (2022).
Ogata, M., Suzuki, K., Yuasa, Y. & Miura, I. Sex chromosome evolution from a heteromorphic to a homomorphic system by inter-population hybridization in a frog. Phil. Trans. R. Soc. B 376, 20200105 (2021).
Hsiao, Y.-W., Tseng, H.-Y., Nguyen, H. N. & Lin, S.-M. Asymmetric acoustic signal recognition led to asymmetric gene flow between two parapatric frogs. Zool. J. Linn. Soc. 192, 130–143 (2021).
Dufresnes, C. et al. Timeframe of speciation inferred from secondary contact zones in the European tree frog radiation (Hyla arborea group). BMC Evol. Biol. 15, 155 (2015).
Dufresnes, C. et al. Genomic evidence for cryptic speciation in tree frogs from the Apennine Peninsula, with description of Hyla perrini sp. nov. Front. Ecol. Evol. 6, 144 (2018).
Dufresnes, C. et al. Integrating hybrid zone analyses in species delimitation: Lessons from two anuran radiations of the Western Mediterranean. Heredity 124, 423–438 (2020).
Dufresnes, C. et al. Revisiting a speciation classic: Comparative analyses support sharp but leaky transitions between Bombina toads. J. Biogeogr. 48, 548–560 (2021).
Drillon, O., Dufresnes, G., Perrin, N., Crochet, P.-A. & Dufresnes, C. Reaching the edge of the speciation continuum: Hybridization between three sympatric species of Hyla tree frogs. Biol. J. Linn. Soc. 126, 743–750 (2019).
Wen, G. & Fu, J. Isolation and reconnection: Demographic history and multiple contact zones of the green odorous frog (Odorrana margaretae) around the Sichuan Basin. Mol. Ecol. 30, 4103–4117 (2021).
Rivera, D. et al. Phylogenomics, introgression, and demographic history of South American true toads (Rhinella). Mol. Ecol. 31, 978–992 (2022).
Lourenço, L. B. et al. Phylogeny of frogs from the genus Physalaemus (Anura, Leptodactylidae) inferred from mitochondrial and nuclear gene sequences. Mol. Phylogenet. Evol. 92, 204–216 (2015).
Fouquet, A. et al. Underestimation of species richness in neotropical frogs revealed by mtDNA analyses. PLoS One 2, e1109 (2007).
Lyra, M. L., Haddad, C. F. B. & De Azeredo-Espin, A. M. L. Meeting the challenge of DNA barcoding neotropical amphibians: Polymerase chain reaction optimization and new COI primers. Mol. Ecol. Resour. 17, 966–980 (2017).
Zhao, X., Fu, X., Yin, C. & Lu, F. Wheat speciation and adaptation: Perspectives from reticulate evolution. aBIOTECH 2, 386–402 (2021).
Lehnert, S. J. et al. Chromosome polymorphisms track trans-Atlantic divergence and secondary contact in Atlantic salmon. Mol. Ecol. 28, 2074–2087 (2019).
Perrin, N. Sex reversal: A fountain of youth for sex chromosomes?. Evolution 63, 3043–3049 (2009).
Medeiros, L. R. et al. Comparative cytogenetic analysis of some species of the Dendropsophus microcephalus group (Anura, Hylidae) in the light of phylogenetic inferences. BMC Genet. 14, 59 (2013).
Graybeal, A. Phylogenetic relationships of bufonid frogs and tests of alternate macroevolutionary hypotheses characterizing their radiation. Zool. J. Linn. Soc. 119, 297–338 (1997).
Titus, T. A. A Phylogenetic Analysis of the Desmognathinae (Caudata: Plethodontidae): Evolutionary Patterns Inferred from Mitochondrial DNA Sequences (University of Kansas, 1992).
Feller, A. E. & Hedges, S. B. Molecular evidence for the early history of living Amphibians. Mol. Phylogenet. Evol. 9, 509–516 (1998).
Palumbi, S. et al. The simple fool’s guide to PCR. https://stacks.stanford.edu/file/druid:yh393jm6703/Simple_Fool%27s_Master%20PCR.pdf (2002).
Hedges, S. B. Molecular evidence for the origin of birds. Proc. Natl. Acad. Sci. U.S.A. 91, 2621–2624 (1994).
Hall, T. A. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 41, 95–98 (1999).
Katoh, K., Rozewicki, J. & Yamada, K. D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 20, 1160–1166 (2019).
Stamatakis, A. RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690 (2006).
Miller, M. A., Pfeiffer, W. & Schwartz, T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. In 2010 Gateway Computing Environments Workshop (GCE) (eds Miller, M. A. et al.) 1–8 (IEEE, 2010).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Librado, P. & Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25, 1451–1452 (2009).
Bandelt, H. J., Forster, P. & Rohl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16, 37–48 (1999).
Bayona-Vásquez, N. J. et al. Adapterama III: Quadruple-indexed, double/triple-enzyme RADseq libraries (2RAD/3RAD). PeerJ 7, e7724 (2019).
Glenn, T. C. et al. Adapterama II: Universal amplicon sequencing on Illumina platforms (TaggiMatrix). PeerJ 7, e7786 (2019).
Eaton, D. A. R. PyRAD: Assembly of de novo RADseq loci for phylogenetic analyses. Bioinformatics 30, 1844–1849 (2014).
Eaton, D. A. R. & Overcast, I. ipyrad: Interactive assembly and analysis of RADseq datasets. Bioinformatics 36, 2592–2594 (2020).
Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Bryant, D. Neighbor-Net: An agglomerative method for the construction of phylogenetic networks. Mol. Biol. Evol. 21, 255–265 (2003).
Schliep, K. P. phangorn: Phylogenetic analysis in R. Bioinformatics 27, 592–593 (2011).
Schliep, K., Potts, A. A., Morrison, D. A. & Grimm, G. W. Intertwining phylogenetic trees and networks. https://peerj.com/preprints/2054v1 (2016). https://doi.org/10.7287/peerj.preprints.2054v1.
R Core Team. R: A language and environment for statistical computing. Vienna: R foundation for statistical computing (2021).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Gautier, M. et al. The effect of RAD allele dropout on the estimation of genetic variation within and between populations. Mol. Ecol. 22, 3165–3178 (2013).
Linck, E. & Battey, C. J. Minor allele frequency thresholds strongly affect population structure inference with genomic data sets. Mol. Ecol. Res. 19, 639–647 (2019).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software structure: A simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Green, R. E. et al. A draft sequence of the Neandertal genome. Science 328, 710–722 (2010).
Huson, D. H., Klöpper, T., Lockhart, P. J. & Steel, M. A. Reconstruction of reticulate networks from gene trees. In Research in Computational Molecular Biology: 9th Annual International Conference, RECOMB 2005, Cambridge, MA, USA, May 14–18, 2005. Proc. 9, 233–249 (Springer, 2005).
Knaus, B. J. & Grünwald, N. J. vcfr: A package to manipulate and visualize variant call format data in R. Mol. Ecol. Resour. 17, 44–53 (2017).
DeRaad, D. A. snpfiltr: An R package for interactive and reproducible SNP filtering. Mol. Ecol. Resour. 22, 2443–2453 (2022).
DeRaad, D. A., McCormack, J. E., Chen, N., Peterson, A. T. & Moyle, R. G. Combining species delimitation, species trees, and tests for gene flow clarifies complex speciation in Scrub-Jays. Syst. Biol. 71, 1453–1470 (2022).
Gompert, Z. & Alex Buerkle, C. introgress: A software package for mapping components of isolation in hybrids. Mol. Ecol. Resour. 10, 378–384 (2010).
Hijmans, R. J., Williams, E., Vennes, C. & Hijmans, M. R. J. Package ‘geosphere’. Sph. Trigonometry 1, 1–45 (2017).
Derryberry, E. P., Derryberry, G. E., Maley, J. M. & Brumfield, R. T. hzar: Hybrid zone analysis using an R software package. Mol. Ecol. Res. 14, 652–663 (2014).
King, M. & Rofe, R. Karyotypic variation in the Australian Gekko Phyllodactylus marmoratus (Gray) (Gekkonidae: Reptilia). Chromosoma 54, 75–87 (1976).
Gatto, K. P., Mattos, J. V., Seger, K. R. & Lourenço, L. B. Sex chromosome differentiation in the frog genus Pseudis involves satellite DNA and chromosome rearrangements. Front. Genet. 9, 301 (2018).
Howell, W. M. & Black, D. A. Controlled silver-staining of nucleolus organizer regions with a protective colloidal developer: A 1-step method. Experientia 36, 1014–1015 (1980).
Viegas-Péquignot, E. In situ hybridization to chromosomes with biotinylated probes. In In situ Hyridization, A Practical Approach (ed. Viegas-Péquignot, E.) 137–158 (Wilkinson D, 1992).
Acknowledgements
We thank Alessandra Ferreira da Costa and Stephanne Baumgartner Dourado for technical assistance and William Pinheiro Costa for collecting some specimens. This study was supported by the Brazilian agency São Paulo Research Foundation (FAPESP #2017/15842-8, #2016/06030-7, #2018/03428-5, and #2021/10639-5). LBL and CFBH are grateful to Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for research fellowships (#309827/2021-3 and #306623/2018-8). ROT, LHBS, and BCS are grateful to Coordenação de Aperfeiçoamento de Pessoal de Nível Superior–Brasil (CAPES) for graduate scholarships.
Author information
Authors and Affiliations
Contributions
L.H.B.S. generated and analyzed cytogenetic data and contributed to writing the original draft. T.W.P. generated and analyzed 3RAD data and contributed to writing the original draft. R.O.T. generated and analyzed cytogenetic data; generated and analyzed mitochondrial DNA sequences. J.M.F. contributed to analyzing cytogenetic data and writing the original draft. K.P.G. contributed to data curation. B.C.S. generated and analyzed cytogenetic data. G.V.A. contributed to field work and taxonomic identification. P.S. contributed to field work. C.F.B.H. contributed to field work. L.B.L. conceived and designed the project, contributed to analyzing mitochondrial and 3RAD data, supervised cytogenetic analyses, and contributed to writing the original draft. All authors discussed the results and contributed to the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Souza, L.H.B., Pierson, T.W., Tenório, R.O. et al. Multiple contact zones and karyotypic evolution in a neotropical frog species complex. Sci Rep 14, 1119 (2024). https://doi.org/10.1038/s41598-024-51421-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-51421-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.