Introduction

The European landscape is the result of the dynamic interaction between people and their natural environments1. This is particularly true for the Mediterranean basin, a biodiversity hotspot, in which the current mosaic of agricultural lands, traditional pastures, evergreen woodlands and maquis habitats, is the result of anthropogenic activities over millennia2,3. Particularly, traditional agricultural systems, derived from well-integrated and sustainable management practices, being spatially heterogeneous, have supported a high level of biodiversity1,4. However, in the second half of the twentieth century, the intensification of modern agricultural practices has increased habitat loss and fragmentation, spatial homogenization, and a reduction in biodiversity5. In general, amphibians are particularly sensitive to habitat destruction and fragmentation that have a negative impact at a demographic and genetic level6 with habitat connectivity playing a key role in their viability at the regional scale7. Indeed, agriculture and urban development activities may create barriers to amphibian dispersal8,9 resulting in isolated subpopulations more subject to increased risk of inbreeding and local extinction10,11, and/or less likely to be recolonized by dispersers from neighbouring subpopulations12. In an isolated population, a decrease in the immigration rate of individuals from other populations (i.e. gene flow) will inevitably produce a genetic diversity loss6. This happens when population size is reduced13 and the isolation among populations is so strong that inbreeding and genetic drift are no longer counterbalanced by gene flow14. These negative effects are further exacerbated by the fact that amphibians, due to their habitat specificity, strict ecophysiological requirements15, low dispersal abilities, and philopatry16, do not behave as single, infinitely large panmictic population17 but as patchy demes representing unique genetic entities despite that geographic proximity. In addition, due to their biphasic life cycle in aquatic and terrestrial habitats, amphibians are vulnerable to the alteration of both environments and to any factors reducing population connectivity18. Despite human activities negatively impact amphibian populations, many studies in the Mediterranean highlighted the importance of artificial water bodies, associated with traditional agriculture and cattle watering, for amphibian feeding and reproduction19,20,21,22. This suggests that such artificial facilities, often connected to natural water bodies, may represent important surrogate breeding habitats for amphibians19,20,23. Indeed, the ability of amphibians to colonise artificial aquatic sites (e.g. tanks, drinking troughs and reservoirs) has been well documented worldwide19,24,25. Moreover, they may act as stopovers during dispersal events, thereby facilitating amphibians’ movements by reducing effective inter-aquatic systems distances26,27. However, there are no studies on their role and importance as for the population genetic conservation and for stepping-stone function in the dispersal process and gene flow maintenance.

With increasing anthropogenic habitat modification, it is particularly important to understand the genetic consequences of changes in landscape composition and configuration28. In this context, microsatellite markers have proved to be sensitive indicators of change in dispersal rates and suitable tools for addressing questions related to population subdivision and interconnection even at fine scale29. Studies on the individual movements and the spatial scale of dispersal among subpopulations are needed in addition to basic demographic research for improving our knowledge of amphibian population dynamics and sustainability, and to assist landscape management and conservation planning12,30. However, species genetic management is often overlooked in conservation programmes due to lack of resources or expertise31. However, the determination of genetic relationships among populations may be particularly useful to detect areas of high conservation priority, dispersal corridors, population declines and individuals and/or populations fundamental for augmentation/translocation or reintroduction programs32. Here, we described the spatial population structure of the Italian smooth newt Lissotriton vulgaris meridionalis (Linnaeus 1758) in a rural landscape of Central Italy (Latium) within a patchy system interested by infrastructural and agricultural development (Fig. 1). We aimed at investigating the role of drinking troughs used for cattle watering and traditional agricultural practices in supporting viable amphibian breeding populations and if and how they are connected to other natural (i.e. brooks) and/or artificial aquatic systems. We selected the sub-endemic Italian smooth newt as a model species since it is widely distributed in the study area (Latium, Central Italy)33 where it is known to successfully colonize both natural and artificial aquatic sites34. Specifically, multilocus genotypes (microsatellite-based protocol) were used to (i) evaluate genetic diversity within and among artificial and natural breeding sites; (ii) infer population structure in the study area; (iii) detect migration and gene flow among demes; (iv) establish whether drinking troughs may provide alternative and/or complementary aquatic habitats for amphibian conservation in rural landscapes.

Figure 1
figure 1

Topographic map of Tolfa Mountains, with localities for 21 Lissotriton vulgaris meridionalis populations sampled for this study. Each coloured ID corresponds to the assigned cluster. White circles identify the five artificial-natural site pairs.

Results

Within-population patterns of genetic diversity

Genotypic data were obtained from 464 individuals, excluding six samples that had more than 20% missing genotypes. One locus (Lm_013) was also excluded from data analyses because of a large amount of missing data, due to an amplification failure.

Estimated null allele frequencies were not significant, indicating the absence of null alleles, large allele dropout and false homozygotes at any locus in all populations.

Microsatellite markers were highly polymorphic across populations sampled, except for LVG-210 and LVG-388, which were monomorphic in 3 and 9 populations, respectively (Table S1, Supplementary material). Overall, allele numbers per locus ranged from 3 (LVG-388) to 46 (Lm_488 and Lm_749). Across all loci, expected heterozygosity (He) ranged from 0.048 to 0.955, and observed heterozygosity (Ho) from 0.048 to 1.00 (Table S1, Supplementary material).

All but three loci (Lm_521, Tv3Ca19, Lm_488) conformed to Hardy–Weinberg equilibrium after Bonferroni correction, with ten populations showing a slight deviation at one locus (Table S1, Supplementary material). We assumed that processes causing disequilibrium were specific to these populations, and thus decided to include the three loci in subsequent analyses.

The number of alleles ranged from 4.5 (FP) to 10.6 (FER) in artificial sites and from 5.4 (RP) to 9.7 (RER) in natural water bodies. Allelic richness varied from 4.4 (FP) to 8.4 (FER) in drinking troughs and from 5.0 (RP) to 8.1 (RIC) in natural sites (Table 1).

Table 1 Details of study populations.

A few populations showed slightly lower levels of diversity (FT, RP, FP). Expected (He) heterozygosity varied from 0.507 (FT) to 0.659 (FER) in artificial sites, and from 0.547 (RP) to 0.681 (RR) in natural sites. Observed heterozygosity (Ho) ranged from 0.515 (FT) to 0.668 (FER) in artificial sites and from 0.550 (RP) to 0.670 (LE) in natural sites (Table 1).

Overall, FIS values ranged from 0.002 to 0.132 with no indication of homozygote excess (or heterozygosity deficit), except for three populations (FE, FUA and RT) showing FIS estimates significantly greater than zero after Bonferroni correction (Table 1).

No linkage disequilibrium was observed between any pair of loci, indicating independence of the eleven genetic markers.

Heterozygosity did not vary between artificial and natural sites (F1,19 = 1.537, p = 0.230). No significant differences were observed in He values within each site pair represented by a natural and an artificial site (N = 5, Z = 1.214, p = 0.225).

Among-population patterns of genetic diversity

Pairwise FST values ranged from 0.001 (FER-RER) to 0.239 (FT-FB) with an average FST = 0.066. FST values were significantly higher than zero in 202 of 210 cases after sequential Bonferroni correction for multiple comparisons. FST was not significant in three out five natural/artificial site pairs (FE-LE, F19-R19, RER-FER). Overall, low to moderate levels of population differentiation were observed except for FT showing the highest FST values (Table S2, Supplementary material).

The number of migrants (Nm) between populations, calculated directly from the FST values (Table S2, Supplementary material; above diagonal), ranged from 0.797 individuals (FT-FP) to 249.75 individuals per generation (FER-RER). The lowest Nm estimates calculated from pairwise FST values ranged from 0.797 to 1.845 for FT comparisons.

Genetic distances (FST) among pairs of all populations were significantly correlated with the geographical distance between localities (Mantel test, P = 0.013). However, geographical distance explained less than 14% of the genetic variation (r2 = 0.137) in the study area (Fig. 2a). The subsequent test for Isolation by Distance (IBD) without outliers (FT, FP, and RP) revealed a higher significant correlation with geographical distance (P = 0.001) explaining about 43% of the genetic variation (r2 = 0.428) (Fig. 2b).

Figure 2
figure 2

Relationship between pairwise geographic and genetic distances (linearized FST) across Lissotriton vulgaris meridionalis populations. Mantel tests reveal a significant correlation in both cases. (a) All sampled populations; (b) populations with exclusion of potential outliers.

The pairwise relatedness analysis showed a slight presence of close relatives within our populations. Based on Lynch & Ritland’s estimator, we identified the presence of second- and third-order of relationships (in FT and FP populations, and in FB, F29 and RP populations, respectively). No close relatives were present within the five site pairs (see Table S3, Supplementary material).

The discriminant analysis of principal components (DAPC) identified 5 clusters. The resulting scatterplot showed evidence of fair separation among clusters (Fig. 3a), with some individuals that did not clearly belong to a distinct group. Considering the membership probabilities of each individual for the different groups, FB-RR and FT populations formed two distinct clusters (clusters 2 and 4, respectively), whereas cluster 1, 3 and 5 consisted mainly of individuals from populations of north-western area (FFA, F32, FE, LE, F35, F29, FUA, F23), south-eastern area (FER, RER, RT) and eastern area (FP, RP), respectively. Individuals from the remaining populations seemed to be admixed and had no distinct division, otherwise belonging to clusters 1, 3 and 5.

Figure 3
figure 3

Population genetic structure by multivariate clustering methods (Discriminant Analysis of Principal Components, DAPC): (a) Discriminant analysis of the genetic variation in Italian smooth newts from 21 breeding sites; (b) analysis of the genetic differentiation among the five artificial-natural site pairs.

DAPC revealed a pronounced genetic differentiation among the five artificial-natural site pairs (Fig. 3b), highlighting that within each pair populations from drinking troughs and natural water bodies are genetically interconnected and represent a unique deme (Fig. 3b).

Bayesian estimates of genetic structure

The model-based clustering method suggested that the model with K = 5 (where K is the number of population genetic clusters) was substantially better than alternative models.

Five clusters were identified with FT was split from the other populations. FB-RR and FP-RP formed two clusters on their own. All the other populations clustered in two distinct groups corresponding to north-western and south-eastern geographical areas (Fig. 4).

Figure 4
figure 4

Population structure inferred by Bayesian assignment of 464 individuals of Lissotriton vulgaris meridionalis performed by STRUCTURE software. Italian smooth newt populations in Tolfa Mountains can be assigned to five clusters of populations (K = 5).

Mean membership coefficients for all individuals into the north-western and south-eastern demes was generally high, with a mean of 0.796 (SD = 0.160) and 0.751 (SD = 0.235), respectively. A few populations (F23, RIC, FBG, F19 and R19), located on the borders of these two clusters, showed membership coefficients less than 70% and thus may be considered as admixed demes (Figs. 4, 5).

Figure 5
figure 5

Topographic map of Tolfa Mountains, with localities for 21 Lissotriton vulgaris meridionalis populations sampled for this study. Pie charts describe the percentage membership to each cluster for K = 5.

The hierarchical analysis of molecular variance (AMOVA) confirmed the genetic structure detected by clustering methods, with 86.73% of genetic variation resided within populations, 7.89% distributed among populations within clusters, and 5.38% of the variance possibly explained by differentiation among the five clusters (Table 2). Although the proportion of genetic variation accountable at higher levels is small (5.38%), all fixation indices were statistically significant. Results were similar even if we excluded admixed populations (F23, RIC, FBG, F19 and R19; Table 2).

Table 2 Results of hierarchical Amova comparing genetic variation within breeding populations, among breeding populations within the five clusters, and among clusters.

Bottleneck detection

The hypothesis of a recent bottleneck was tested with infinite allele model (IAM), stepwise mutation model (SMM) and two-phase model (TPM). We observed the occurrence of a bottleneck event in 7 out of 21 populations under IAM model (RR, FFA, FE, FBG, FP, RT, and RER; Table 3). Significant heterozygote excess was detected in RT population (p = 0.032) also under SMM (Table 3). Moreover, under the more relaxed hypothesis of a certain proportion of mutations encompassing more than a single repeat unit (TPM), the results suggested that two populations had faced a recent bottleneck (FFA and FBG; Table 3). On the contrary, no evidence of heterozygote deficit (i.e. a sign of expansion) was detected.

Table 3 Wilcoxon test for 21 Lissotriton vulgaris meridionalis populations under three mutation models: the infinite allele model (IAM), the stepwise mutation model (SMM) and the two-phase model (TPM, with 90% single-step mutations and 10% multiple-step mutations).

Overall, no significant shift in distribution was detected in our study populations that remained in a normal L-shape, as revealed by the allele frequency distribution test (Fig. S1, Supplementary material).

Recent migration rates

Few pairwise comparisons between localities (5%) detected recent migrations. In 20 instances, the migration rate was greater than 0.04 indicating the occurrence of substantial contemporary gene flow. In 12 cases, migration rates were > 0.1 suggesting that more than 10% of the population is composed by recent immigrants. The highest level of gene flow was mostly detected in populations that were geographically closed and belonging to the same genetic cluster (FB-RR; FP-RP; FER-RER; Table S4, Supplementary material).

These incidences of migration included emigrations from only seven source populations (RR to FB; F32 to FFA, FE, LE, F35, FUA, F23, F11; F29 to FUA; R19 to FUA, F23, F11, RIC, FBG, F19, RT; FP to RP; FER to RER, RT; RER to FER; Fig. 6).

Figure 6
figure 6

Topographic map of Tolfa Mountains, with localities for 21 Lissotriton vulgaris meridionalis populations sampled for this study. Pie charts describe the percentage membership to each cluster for K = 5. The white arrows indicate the direction of gene flow among newt populations at the study area.

Discussion

To our knowledge, this is the first study to examine the distribution of genetic variation and the spatial genetic structure of Italian smooth newt (Lissotriton vulgaris meridionalis) within a rural area of Central Italy (Latium).

During the last century, this rural area was subjected to a reduction of natural water bodies and habitat fragmentation due to human activities. In a previous study, we showed how traditional livestock farming, by building drinking troughs for cattle watering, provided suitable sites for amphibian reproduction, representing a surrogate for the lost natural ones19. Here, we pointed out that their capillary distribution seems to mitigate habitat fragmentation by increasing the number of demes, supporting gene flow among them, and thus improving the overall network of aquatic systems.

Populations showed a similar genetic diversity with no differences between natural and artificial wetlands, comparable to other urodeles and temperate amphibians35. Moreover, newts from drinking troughs had a genetic diversity (i.e. heterozygosity) comparable to those from the closest brook. As the level of genetic diversity reflects population’s adaptive and evolutionary potential36, this finding demonstrates the importance of artificial habitats for the persistence of amphibians especially where original habitats have been degraded or lost20.

Smooth newts are highly philopatric, with short dispersal ability37, resulting in large genetic differentiation among populations that may facilitate local adaptation or cause genetic drift38. To date, for L. vulgaris meridionalis, genetic data are only available from17, who found moderate to high differentiation levels among close populations. In our study, genetic differentiation among populations was moderate (average FST = 0.066), thus indicating the existence of a network of populations connected by dispersing individuals39. This suggests that drinking troughs contributed to sustain gene flow, decreasing local genetic divergence, and mitigating genetic drift and population inbreeding40. Our findings are in agreement with previous fine-scale studies of urodeles employing microsatellite markers12,29,41,42,43.

Some populations showed higher pairwise FST-values not consistent with the geographic distribution (i.e. FT, FP, and RP). This may be due to the relatively longer timespan since isolation and the higher habitat fragmentation characterising FT and FP-RP demes. It is likely that, in the past, the study area consisted of a unique unfragmented woody landscape with many natural aquatic systems used by newts for reproduction. In the 1950s, wide-scale human impact caused habitat loss and fragmentation isolating some demes (i.e. FT and FP-RP) in less favourable areas. The transition forests-arable lands or grazed-cultivated lands, and the construction of roads may have removed suitable terrestrial shelters and artificial breeding sites used for reproduction44, affecting the connectivity of local populations, dispersal, and genetic diversity36,45. Our suppositions are in agreement with previous studies documenting the negative effects of habitat fragmentation on local genetic variability and connectedness in amphibians9,46,47,48.

We also found a correlation between genetic and geographical distance measures. Isolation by distance is expected to shape genetic variation, even on small spatial scales, leading to populations more differentiated from each other with increasing distance49,50. Indeed, we found a significant IBD pattern between populations separated by a maximum distance of 16.7 km (FFA-FER) and 17.8 km (FB-FE). We thus confirm the limited dispersal ability of L. vulgaris meridionalis even at small spatial scale43, whose average dispersal distance has been estimated to be 400–500 m51. However, this moderate correlation coefficient detected suggests that further processes other than distance may have structured populations’ genetic composition48. Indeed, dispersal capability is known to be not only determined by species’ intrinsic biology but also by landscape features and habitat quality52,53. In this context, it is likely that natural and artificial water bodies have increased survival probability and facilitated juvenile dispersal and adult breeding migration43. Drinking troughs, in particular, seem to act as stepping-stone system connecting distant natural breeding sites and building a network able to face potential isolation.

Five statistically significant population clusters with a moderate genetic differentiation have been identified. Gene flow seems to be extensive among populations within each cluster determining quite genetically homogeneous assemblages, with a genetic variation attributable to within-populations differences. Moreover, the five artificial-natural site pairs were genetically interconnected representing each a unique nearly panmictic deme. Such a positive association confirms the potential of drinking troughs to contrast drift and local loss of genetic diversity providing a network of permanent aquatic sources used for breeding and migration9,54.

Three clusters had a very similar cohesiveness concordant with the geographic distribution evidencing a significant positive relationship between genetic differentiation and geographical distance55. This probably reflects the counterbalance of different effects currently ongoing or acted in the past. On one hand, natural limits to dispersal and population-level processes may have affected site fidelity and low vagility increasing genetic distance56; on the other, gene flow among pristine habitats, once widespread in the study area, may have reduced genetic differentiation. Two clusters drifted apart probably because of landscape features alone or in combination with topographical differences. Indeed, both natural barriers and human-induced environmental changes affect occasional migration events among distant populations in L. vulgaris meridionalis hindering gene flow and resulting in a strong genetic structure43.

Five populations showed evidence of mixed genetic ancestry. Such admixed populations are expected in systems where geographical structure exists but there are no barriers to dispersal41. Tolfa Mountains conform to this landscape scenario being characterized by large patches of natural habitats interspersed with areas subject to traditional agricultural and husbandry activities without absolute geographic barriers limiting individual exchange among populations57.

Signals of current migration were found within clusters but not among populations from different sites. Interestingly, migration was unidirectional and influenced by the hydrology of the site (permanent vs. temporary) rather than its typology (natural vs. artificial). Surprisingly, not only natural but also artificial permanent aquatic sites acted as sources of individuals. The observed configuration of water network and the connectivity among drinking troughs, and between drinking troughs and natural aquatic systems, suggests the existence of a metapopulation with a source-sink configuration, in which recolonization following local extinctions, and natural restocking after demographic contractions are likely to occur58.

We found evidence of a slight occurrence of bottleneck events (RT, FFA and FBG). Amphibian populations are known to fluctuate from year to year with a demographic and connectivity instability42 resulting in a genetic bottleneck effect at local scale. This also suggests that populations did not recently experience severe effects of natural and anthropogenic pressures, or such pressures were mitigated by the positive impact of the observed configuration of water network. RT, a brook population subject to periodic drought, seems to have experienced a recent bottleneck probably counterbalanced by individuals migrating from neighbouring natural and artificial sites. FFA and FBG drinking troughs populations probably suffered inadequate management practices, associated with mechanical and chemical cleaning activities and water extraction19. Such operations may cause early desiccation and metamorphosis failure of larvae, with massive mortalities every year and consequent repercussions on the occurrence and abundance of species at local scale20 and a decline in genetic variation13.

Our study revealed that a complex system of artificial permanent aquatic sites and few natural wetlands can sustain a well-structured network of amphibian demes that seem to be quite connected. The current landscape matrix suggests that the observed genetic structure may not reflect present-day gene flow but an intermediate situation between the historic scenario after the 1950s57 and the current landscape. Drinking trough populations probably derived from individuals in dispersal from neighbouring natural sites subject to a relatively fast drift known to affect small and isolated populations59. Since 15–20 generations are needed for signals of genetic effects to be observable60, the genetic structure inferred is more likely to represent a snapshot of past landscape configuration and composition rather than contemporary ones61.

Our study highlights the significant role played by the Maremmana cow in L. vulgaris meridionalis conservation. The Maremmana cow seems to act as an ecosystem engineer (i) directly by structuring landscape features (i.e. increasing habitat heterogeneity) and (ii) indirectly by providing suitable artificial sites crucial for newt genetic diversity maintenance. We emphasize how drinking troughs may represent an interesting tool for promoting individual dispersal and gene flow, reinforcing metapopulation patterns observed in the study area. Intervention measures are timely to prevent the progressive abandonment of drinking troughs observed in the last 50 years in Italy due to expansion of modern and intensive farming62,63. At the local scale, conserving and managing the existing natural sites as well as incentivization of traditional farming accompanied by drinking trough restoration, preservation and proper management will increase the availability of water supplies to both the Maremmana cow and newts, preventing further population declines within the study area. Particularly, the correction of trap effects, invasive species eradication, cleaning practices respectful of newt phenology, intervention to avoid disrepair and permeability loss19,20,23 will help to preserve habitat suitability maximizing the effectiveness of drinking troughs as stepping-stones. At the regional scale, habitat suitability of both natural and artificial water bodies could be improved by increasing site connectivity64. Indeed, the co-occurrence of aquatic sites differing in sizes and other features (e.g. vegetation cover, feeding resources) in the landscape is known to favour species’ dispersal reinforcing their metapopulations64.

Our conclusions suggest that amphibian protection and social rural and cultural values preservation can be simultaneously achieved combining conservation biologists’ efforts and local livelihood needs19.

Methods

Study area

The study area falls within the Maremma district, extended throughout Latium and Tuscany regions. Fieldwork was carried out in the Comprensorio Tolfetano-Cerite-Manziate (Province of Rome, Latium Region, Central Italy; elevations between 150 and 450 m a.s.l.) a Special Protection Area (SPA, IT6030005), spanning 70,000 ha, included in the European Ecological Natura 2000 network. It is characterized by a high biodiversity value representing one of the highest wilderness areas in Italy despite the proximity to several main cities and its rural profile interested by traditional agriculture and husbandry. A mosaic of pastures, cultivated lands, and woodlands characterizes the vegetation landscape, mainly consisting of riverine woods, wide areas covered by bushy pastures and mesophilous forests19,65.

In this area, natural wetlands are scarce and are frequently dried due to agricultural practices and their strong seasonality. Indeed, they are characterized by a short hydroperiod, especially in summer, because of high temperatures and irregular rainfalls. However, artificial drinking troughs built by farmers after World War II to facilitate livestock watering (cattle, horses, and donkeys), are particularly widespread. Drinking troughs are small, shallow (20–60 cm depth) habitats, with rectangular shape and vertical walls, often in connection with natural permanent or semi-permanent aquatic systems19,66. Such artificial aquatic sites are particularly used by the Maremmana cow (Bos primigenius taurus), a robust and frugal native breed well adapted to difficult environments living freely in the marshes and maquis of the south coastal area of Italy for more than 1500 years67,68.

Population sampling

All the potential breeding sites (natural and artificial) were identified through orthophotos in Google Earth Pro software (version 7.3.1.4507). Overall, 7 natural aquatic sites (one pond and six brooks, named LE, RR, RIC, R19, RP, RT, RER) and 51 functioning drinking troughs were detected within an area covering approximately 18,000 ha. We focused on all 7 natural sites and on 14 drinking troughs (labelled with F in Table 1; highest elevation of 398 m a.s.l. at F29 site) characterized by the presence of L. vulgaris meridionalis and a distribution across the study area aimed at minimizing spatial autocorrelation (i.e. space distance of at least 1 km; Fig. 1).

Finally, we selected a study model consisting of five artificial-natural water body pairs, each consisting of a drinking trough built in the immediate surrounding (< 500 m) of a natural brook/pond (FB-RR; RP-FP; FE-LE; F19-R19; RER-FER; see Fig. 1). This approach has been adopted since natural and artificial sites are interconnected, and artificial sites most likely originated from the colonization of individuals from natural sites.

Study sites were sampled during the breeding season of the species (March–May) in 2015–2016 years. Since cohorts can vary in genetic composition, we avoided sampling the same site at different times during the year41. We surveyed drinking troughs by dip netting69, whereas newts were visually located and captured by hand or dip nets in natural sites. We collected 470 tissue samples by tail-clipping70 considering at least 20 adult individuals from each aquatic site useful for the analysis. Samples were stored in ethanol 95% at room temperature until genetic analysis.

Tissue samples were obtained with no anaesthetic and by careful handling of animals. After this procedure, individuals were temporarily housed in tanks filled with water for approximately 30 min to verify their return to normal activity and released at the same point of capture. No individuals died or were otherwise harmed during samplings. This study was conducted according to the ethical scientific guidelines described by the American Society of Ichthyologists and Herpetologists (ASIH), The Herpetological League (HL) and the Society for the Study of Amphibians and Reptiles (SSAR).

Experimental protocols were approved by Italian Ministry of the Environment that gave LV the permit [0001255/PNM] to manipulate amphibians and collect tissue samples.

All methods are reported in accordance with ARRIVE guidelines.

DNA extraction and microsatellite amplification

Total genomic DNA was extracted using DNeasy Blood and Tissue Kit (QIAGEN) and QIAcube robotic workstation, then amplified at 12 microsatellite loci (STRs) optimized for this species71. Three multiplex PCRs (M1 included Lm_521, LVG-210/EU568357, LVG-388/EU568352 and LVG-542/EU568354; M2 included Lm_488, Lm_632, Th09 and Tv3Ca9; M3 included Lm_013, Lm_749, LVG-398/EU568356 and Tv3Ca19) were performed to save time and reagents. PCRs were carried out using Qiagen Multiplex PCR Kit in an 8 μL mix composed of 3.50 μL HotStarTaq Master Mix, 0.70 μL Qsol, 1.50 μL DNA, 0.20 μL each primer (10 μM), brought to volume with H2O. Amplification conditions for all loci included an initial denaturing at 95 °C for 15 min; 40 cycles of denaturing at 94 °C for 30 s, annealing temperature of 55 °C for 1.30 min, and 72 °C extension for 1 min; and a final 10 min extension at 72 °C. Two replicates for each sample at each locus were executed to confirm the genotypes.

Forward primers for each locus were 5’-labelled with a fluorescent dye for detection (Applied Biosystems). PCR products were electrophoresed with internal size standard (GeneScan 1200 LIZ, Applied Biosystems) in an ABI Prism 3130XL sequencer and allele sizes were scored using the software GeneMapper v.5.0 (Applied Biosystems).

Intra and inter-population patterns of genetic diversity

The presence of null alleles was checked by Micro‐Checker v.2.2.372.

Genetic diversity indices (number of alleles, effective number of alleles and number of private alleles) were calculated for each population by GenAlex v.6.573,74. Allelic richness was computed in Fstat 2.9.375. Arlequin v.3.5.276 was used to estimate observed (HO) and expected (HE) heterozygosity and to test genotypic frequencies at each population and locus for statistically significant deviations from Hardy–Weinberg equilibrium (HWE). We used a Monte Carlo approximation of the Fisher’s exact test77 and a standard Bonferroni correction for multiple comparisons (adjusted P-value < 0.001, for a table-wide significance of α = 0.05). The Markov chain algorithm was run for 100,000 steps following 10,000 dememorization steps. The pairwise probability of linkage disequilibrium was estimated using a Fisher’s exact test implemented in Genepop v.4.278 with 10,000 steps following 1000 dememorization steps.

Analysis of variance (ANOVA) was used to reveal differences in expected heterozygosity as a function of site type (F, artificial; R, natural). Moreover, the Wilcoxon matched-pairs test was performed to compare heterozygosity values within each site pair represented by a natural and an artificial site. Statistical tests were performed using STATISTICA software (version 8.0 for Windows) with alpha set at 5%.

Fstat 2.9.3 was used to perform a global test of overall population differentiation (not assuming HWE within populations)75. Pairwise significance tests for FST79 were performed by permutation and resampling of multilocus genotypes among pairs of samples with 10,100 randomizations allowed for a table-wide significance at the 5% nominal level after standard Bonferroni corrections (adjusted P value = 0.001).

The degree of gene flow among demes, in terms of success of immigrants, was determined using Nm estimates, derived from the fixation coefficient FST80.

Pairwise population linearized FST values81 were correlated with geographical distances to test for patterns of spatial subdivision and isolation by distance (IBD) using a Mantel test with 1000 randomizations (implemented in GenAlex v.6.5). We performed two tests for IBD separately, on (i) all populations, and on (ii) populations with the exclusion of potential outliers (i.e. populations showing higher genetic distance despite the similar geographic distance in respect to the others).

Pairwise relatedness analysis was carried out to assess the presence of close relatives in our populations. We performed the analysis (implemented in GenAlex v.6.5) for each population and for the five artificial-natural site pairs using Lynch & Ritland’s82 LR estimator by 2 to give a maximum value of 1 and minimum of − 1.

A discriminant analysis of principal components (DAPC) was performed to assess the genetic divergence among populations and provide a description of the genetic clustering based on the largest between‐group and smallest within‐group variances in allele coefficients (R package “adegenet”)83,84. We used “find.clusters” function to identify the optimal number of clusters. k-means algorithm was run comparing different clustering solutions using Bayesian Information Criterion (BIC) values with increasing values of k. The optimal number of clusters was selected corresponding to the lowest BIC. Then, we determined the number of PCs to retain considering the cumulated variance explained by the eigenvalues of the PCA. After the cross-validation step, DAPC was performed with the retention of 60 PCs (> 80% of variance explained) and four discriminant functions. A further DAPC was conducted to verify genetic differentiation within the five artificial-natural site pairs previously identified, with the retention of 60 PCs and four discriminant functions.

Bayesian estimates of genetic structure

We performed Bayesian clustering analysis to characterize the population genetic structure of aquatic systems across the study area and assess the geographical scale of population differentiation using STRUCTURE v2.3.485. We ran 15 independent simulations of 1,000,000 iterations (following a burn-in period of 100,000) for each value of K85 between 1 and 21. The admixture model was applied assuming gene flow among populations and allowing for correlated allele frequencies across populations. We performed two different models: (i) the default mode for STRUCTURE that uses only genetic information to infer population structure, and (ii) the LOCPRIOR model that uses sampling locations as prior information to assist the clustering.

Appropriate K was selected based on the natural log of Pr(X|K) values and the ∆K criterion86 provided by STRUCTURE HARVESTER v.0.6.9487. Replicate runs were averaged using CLUMPP v.1.188 and plotted in DISTRUCT v.1.189 to generate graphical displays representing the mean membership of each individual over the replicate runs.

Considering the genetic clusters inferred from the Bayesian analyses as panmictic demes, an Analysis of Molecular Variance (AMOVA)90, implemented in Arlequin v.3.5.276, was performed on FST values to examine the distribution of genetic variation at three hierarchical levels: within populations, among populations within demes, and among demes. We assigned each population to a cluster based on its largest proportion of membership and then examined hierarchical genetic distribution considering only populations with a mean membership ≥ 0.7 and excluding admixed populations. The significance of the variance components was tested using 10,000 permutations.

Bottleneck detection

The software BOTTLENECK v1.2.0291 was used to test for evidence of recent bottlenecks in each population. The main assumption of this test is that allele frequency distribution results from an equilibrium between mutation and genetic drift92. In non-bottlenecked populations, the expected heterozygosity at mutation-drift equilibrium (Heq) equals the measured Hardy–Weinberg equilibrium heterozygosity (He). Instead, in case of recent bottlenecks, a correlative reduction of allelic diversity and heterozygosity at polymorphic loci occurs and the mutation-drift equilibrium is temporarily disrupted: He will be significantly greater than Heq, calculated from the number of alleles sampled92,93. The calculation of Heq depends on the mutation model considered analysing the loci92. In the present study, Heq values were calculated under three mutation models: the infinite allele model (IAM)94, the Stepwise Mutation Model (SMM)95 and a combination of those two extreme hypotheses, the two-phase model (TPM, with 90% single-step mutations and 10% multiple-step mutations)96. Significant deviations in observed heterozygosity across loci for each population were detected using a two-tailed Wilcoxon sign-rank test97 that allows the detection of the heterozygosity excess due to the faster loss of low frequency alleles and thus determines whether a population has been recently bottlenecked. Moreover, a qualitative descriptor of allele frequency distribution was used to infer bottlenecks. This method compares allele frequency distribution to the distribution expected at mutation-drift equilibrium (l-shaped distribution), considering that population bottlenecks cause a characteristic mode-shift distortion in the distribution of allele frequencies at selectively neutral loci97.

For fewer than 20 loci (i.e. 11 loci in our study), the Wilcoxon’s test is the most appropriate and powerful91. Since the IAM is mainly recommended for allozyme data and microsatellites are often described as being closer to the single step mutation model91 or to the Two-Phase Mutation model96, we may consider the results obtained under SMM and TPM more reliable.

Recent migration rates

We estimated recent migration rates between localities using the program BAYESASS 3.098, which employs a Bayesian MCMC approach to identify migrants or recent descendants of migrants based on transient linkage disequilibrium among multilocus genotypes from different source populations. This program was run following the recommendation of99 concerning convergence issues. We performed 10 separate runs, with different seed numbers, using 107 iterations, with a burn-in of 106 and a sampling frequency of 1000. This run length was sufficient for the posterior probability to achieve convergence. The MCMC mixing parameters were set to 0.50, 0.80 and 0.25, for allele frequency, level of inbreeding and migration rate, respectively. These delta values resulted in acceptance ratios between 40 and 60% to maximize log-likelihood values98. Migration rates were averaged over the 10 independent runs and compared to average migration rates of randomly permuted data set, generated to assess significance. Estimated migration rates were considered significant when the 95% confidence interval (CI) did not overlap with the 95% CI of the randomly permuted data.