The Korean Peninsula, located at the southern tip of Northeast Asia, has never been covered by ice sheets and was a temperate refugium during the Pleistocene. Karsenia koreana, the sole Asian plethodontid salamander species, occurs only on the southern half of the Korean Peninsula and is thought to have found various climatic refugia. Despite its phylogenetic and biogeographic importance, no population-level genetic analysis has been performed on this species. Here we study the population genetic structure of K. koreana using mitochondrial and microsatellite loci to understand the recent historical dispersion process that shaped its current distribution. Overall, the genetic distance between populations correlated well with the spatial distance, and the genetic structure among populations showed signs of a unilateral northward expansion from a southernmost refugium population. Given the distinct genetic structure formed among the populations, the level of historical gene flow among populations appears to have been very low. As the estimated effective population size of K. koreana was also small, these results suggest that the small, restricted populations of K. koreana are extremely vulnerable to environmental changes that may require high levels of genetic diversity to cope with. Thus, special management strategies are needed to preserve these remnant populations.
Within a geographic area, the distribution of a species reflects its historical dispersion and isolation in the context of geological change and Quaternary climate fluctuation1,2. To avoid the advances of Pleistocene ice sheets, many terrestrial animals extirpated or migrated, with some in the northern hemisphere dispersing to the south and occupying sites that served as refugia3. Even after the retraction of the ice sheets, numerous species remained in place, further adapting and diversifying, resulting in some southern regions exhibiting high species diversity and endemism4,5. Phylogeographic approaches can be used to reconstruct the historical migration routes and diversification processes that species or other higher taxa underwent following glacial period6,7.
The Korean Peninsula has features that make it attractive for phylogeographic studies. It is located at the southern tip of Northeast Asia and has never been covered by ice sheets, although it was indirectly affected by the northern glaciation of the Quaternary8. This area is characterized by a mountainous terrain resulting from complex geological activity, which offered temperate habitats or glacial refugia during the Quaternary glaciations9. Perhaps this is why the Korean Peninsula, despite its relatively small area, is a region with a reasonably high species diversity and endemism. This peninsula is home to ten species of salamander, of which eight are endemic10,11. Genetic studies of Hynobius and Onychodactylus salamander species on the Korean Peninsula detected a high level of intraspecific phylogeographic structure12,13, the existence of cryptic diversity11,12 and evolutionary relatedness to species inhabiting areas north of the peninsula. In total, these results suggest that the Korean Peninsula provided glacial refugia for salamander species that migrated southward.
One other monotypic genus of salamander is found on the Korean Peninsula—the Korean crevice salamander (Karsenia koreana). Karsenia korena is the only plethodontid species in Asia, and until this species was first discovered in Korea in 200514, plethodontid salamanders were thought to be native only to Europe and the Americas. There have been various attempts to elucidate the biogeographic and phylogenetic histories of plethodontid salamanders, the most diverse family in the order Urodela15,16,17,18,19. However, the existence of one plethodontid species in Northeast Asia has introduced a complication in the reconstruction of the historical dispersion process of Plethodontidae20. Since K. koreana was first described, researchers have attempted to determine its phylogenetic placement as a means of inferring how this species became distributed in Asia20,21. The most convincing hypothesis suggests that a small ancestral group of K. koreana migrated from western North America to Eurasia through the Bering Land Bridge around 65 Ma19,22.
Although officially considered an endangered species23, K. koreana is widely distributed in and around the mountainous regions of the Korean Peninsula24 (Fig. 1). Various aspects of Karsenia koreana biology have been studied including its cytogenetics25, morphology26,27 and ecology28,29,30. However, it is not yet known what recent biogeographic pathways this species has taken to form its current, restricted distribution within the Korean Peninsula. Given that no plethodontid species has been found elsewhere in Asia10, the populations from which this species originated (in Russia or northeastern China) have likely been extirpated. The populations on the Korean Peninsula may be considered remnants at the southern end of the biogeographic dispersion. The most effective way to identify the biogeographic pathways of K. koreana is to reconstruct the historical patterns of gene flow by measuring the genetic structure and diversity throughout its known distribution. To date, no population-level genetic analysis has been performed on this species.
This study was designed to uncover the population genetic structure of K. koreana on the Korean Peninsula to understand the historical dispersion process shaping the structure. Two types of genetic markers were used. First, mitochondrial COI (cytochrome c oxidase I) and Cyt b (cytochrome b) were used to estimate the historical migration and isolation processes of K. koreana populations. Second, microsatellites were used to determine the level of gene flow among populations and to estimate more recent demographic information compared to mitochondrial markers. Novel microsatellite markers were developed in this study. Considering that K. koreana is a rare species worldwide that needs careful management, the results of this study provide important baseline data that will inform future conservation strategies.
Karsenia koreana individuals were collected from April 2018 to August 2019 throughout all regions where this species is known to exist (Fig. 1). A tip of tail was clipped for each individual and stored in 70% EtOH until DNA extraction. After sample collection, the living individuals were released back to the original site of capture. A total of 204 individuals from 11 populations, whose localities are geographically separated, were used for this study (Table 1).
The sampling protocol was approved by the Korean Ministry of Environment. The sample collection in the field was carried out under the strict guideline on ethical animal experimentation protocols provided by Seoul National University Institutional Animal Care and Use Committee (SNUIACUC) and the guideline provided in the permits conforming to the Wildlife Protection and Management Act of the Korean Ministry of Environment.
Laboratory protocols for the loci characterization
Genomic DNA was extracted using a DNeasy Blood and Tissue Kit (QIAGEN, Hilden, Germany) following the manufacturer's protocol. The quantity and quality of each DNA sample were assessed using an Epoch Microplate Spectrophotometer (BioTek, Winooski, VT, USA). Successfully extracted DNA samples were diluted to 10–20 ng/μL and stored at − 20 ℃ until use for the genetic analyses.
For the two mitochondrial loci, two novel primers sets were developed (Supplementary Information Table S1) utilizing Primer331 in Geneious Prime 2019.1.3 (https://www.geneious.com) and Primer Premier 6.25 (PREMIER Biosoft, Palo Alto, CA, USA). Primers KkCOI_F6 and KkCOI_R7 were designed to amplify a complete COI gene (1,551 bp; Supplementary Information Table S1), while primers KtRNA_25F and KkCytb_R7 were designed and used to amplify a nearly complete Cyt b gene (1,116 bp; Supplementary Information Table S1). Polymerase chain reaction (PCR) for both mitochondrial loci were performed in a volume of 30 μL containing 1X Ex Taq buffer with 2 mM MgCl2, 0.2 mM dNTP mixture, 1 μL of 10 μM forward and reverse primers each, 1 U Ex Taq polymerase (Takara Bio, Shiga, Japan) and 1 μL of template DNA using a TaKaRa PCR Thermal Cycler Dice Gradient (Takara Bio). The thermal cycle profile for the loci consisted of an initial denaturing at 94 ℃ for 5 min; 35 cycles of a denaturing at 94 ℃ for 45 s, an annealing at 60 ℃ for 1 min and an extension at 72 ℃ for 3 min (for Cyt b: 65 ℃ for 5 min); and a final extension at 72 ℃ for 10 min. This extension step at lower temperature for longer time of Cyt b was necessary, probably due to the putative secondary structure of flanking regions by poly A and T32, which was verified in our preliminary experiments. The PCR products were purified and sequenced by Macrogen Inc. (Seoul, South Korea) and Cosmo Genetech Inc. (Seoul, South Korea) on an AB 3730xl DNA analyzer (Applied Biosystems, Foster City, CA, USA).
The novel microsatellite markers were isolated by Macogen. Whole genomic DNA was subjected to pair-end sequencing using a MiSeq platform (Illumina, San Diego, CA, USA). The extracted reads were processed to remove adaptor sequences, and RepeatModeler33 (http://www.repeatmasker.org/) was used for the identification of repetitive DNA sequences. SSR Finder34 (ftp://ftp.gramene.org/pub/gramene/archives/software/scripts/) was used for annotating reads containing simple repeat sequence motifs. A total of 52 candidate markers (32 tetra-, 10 tri- and 10 dinucleotide repeats) were selected for the testing of amplification and polymorphism with an 8-individual test panel (four populations, two samples each). Fourteen microsatellite markers were chosen to genotype the remaining samples. Microsatellite PCR amplification was performed in 20 μL volume consisting of 1X Taq buffer with 2 mM MgCl2, 0.2 mM dNTP mixture, 0.5 μL of 10 μM forward and reverse primers each, 1 U i-StarTaq polymerase (iNtRON Biotechnology, Seongnam, South Korea) and 1 μL of template DNA using a TaKaRa PCR Thermal Cycler Dice Gradient (Takara Bio). The PCR conditions consisted of an initial denaturing at 94 ℃ for 5 min; 20 cycles of denaturing at 94 ℃ for 20 s, touchdown annealing at 60–50 ℃ for 20 s and an extension at 72 ℃ for 20 s; additional 20 cycles of denaturing at for 20 s, annealing at 50 ℃ for 20 s and an extension at 72 ℃ for 20 s; and a final extension at 72 ℃ for 7 min. Amplified PCR products were genotyped on an ABI 3730xl by NICEM Inc. (Seoul, South Korea). Genotype quality check and peak calling were performed using GeneMapper 3.7 (Themo Fisher Scientific, Waltham, MA, USA).
Mitochondrial diversity and structure
The quality checking, trimming and editing of the mitochondrial sequence data were carried out using Geneious Prime. The sequences were aligned using ClustalW35 under the default setting, implemented in MEGA X36. No gaps or ambiguous bases were found.
We estimated the following parameters for each population based on the concatenated sequences of COI and Cyt b using DnaSP 5.10.0137; number of haplotypes, haplotype diversity (Hd)38, nucleotide diversity (π)38 and sequence diversity (k, average number of nucleotide differences)39. Haplotype sequences of the two loci obtained in this study were deposited in GenBank (Accession Nos. MT106778–MT106825; Supplementary Information Table S2). The number and frequency of private haplotypes (PH) were also calculated based on the haplotype data. The level of population divergence (ΦST40) and the average number of nucleotide differences per site between populations (DXY38) were estimated using Arlequin 184.108.40.206 and DnaSP 5.10.0137, respectively.
The haplotype network was reconstructed using a median-joining approach in NETWORK 10.0.0.042 (http://www.fluxus-engineering.com). For phylogenetic tree reconstruction, we used Hydromantes brunus (GenBank accession No. AY728234.1) as an outgroup species, as it is the most probable sister taxon of K. koreana18,22,43. Prior to phylogenetic tree reconstruction, the selection of partitioning scheme and substitution models were performed based on a greedy algorithm44 with the Bayesian Information Criterion (BIC) using PartitionFinder 2.1.145. Phylogenetic trees were reconstructed based on maximum likelihood (ML) and Bayesian inference (BI) methods were implemented in RaxML46 on the CIPRES platform47 and MrBayes 3.2.648, respectively.
For the ML analysis, we applied 'ML Thorough Boostrap' workflow with 1000 bootstrap replicates under the GTR + I substitution model49. For the BI analysis, two independent Metropolis Coupled Markov Chain Monte Carlo (MCMC) runs of 107 generations were conducted using four chains per run with three heated and one cold (temperature set to 0.1). We sampled trees every 500 generations and discarded 25% of the trees as burn-in. We used TRACER 1.7.150 to assess convergence of parameter estimates and posterior probabilities. The remaining trees were summarized to obtain a 50% majority-rule consensus tree. The tree was visualized using FigTree 1.4.3 (http://tree.bio.ed.ac.uk/software/figtree/).
Divergence time estimation and historical demography
Divergence times among populations of K. koreana were estimated using BEAST 2.6.151 based on the concatenated dataset of COI and Cyt b. Five outgroup taxa were used; Hydromantes brunus (AY728234.1), Aneides hardii (NC_006338.1), Desmognathus wrighti (NC_006337.1), Desmognathus fuscus (NC_006339.1) and Ensatina eschscholtzii (NC_006328.1). For molecular clock calibration, we adopted four molecular dating constraints (Karsenia-Aneides: 40.5 Ma, Aneides-Hydromantes: 39.4 Ma, Aneides-Desmognathus: 37.8 Ma, Ensatina-Hydromantes: 37.9 Ma) provided in Shen et al.19. The means and standard deviations of the normal distribution for these priors were chosen to reflect arithmetical medians of 95% credible intervals.
Given the population-level phylogenetic relationships among K. koreana lineages may be relatively close, we assigned a strict clock model52, coalescent constant population model and GTR + I + G substitution model under the default setting. Nested sampling algorithm53,54 posteriorly confirmed this combination of model parameters among multiple alternatives. The MCMC of BEAST consisted of three independent runs of 108 generations with sampling log and tree files every 1000 generations. After running, the convergence of chains was verified and burn-in periods were determined in TRACER to ensure the effective sample sizes (ESS) for all parameters were over 200. Three independent MCMC runs were combined in LogCombiner 2.6.1. Predetermined 10% burn-in trees were discarded and the final phylogenetic tree was annotated by maximum clade credibility type and median node heights in TreeAnnotator 2.6.0. The tree was drawn by FigTree 1.4.3.
Extended Bayesian skyline plot (EBSP)55 was implemented in BEAST to estimate historical demographic change. An unpartitioned HKY + I substitution model56 was applied, and prior selection, clock model, parameter settings and MCMC setup were the same as those used for the divergence time estimation. To incorporate a time scale into the analysis, we assigned two additional priors; normal distributed clock rate (mean: 6.8629E-3, sigma: 5.9743E-4) and lognormal distributed MRCA prior (mean: 2.318, sigma: 0.3031). The result was plotted with log-scale population size by time (Ma) using R package57 based on 'plotEBSP' function provided in BEAST. The likelihood of historical demographic expansion was tested based on Tajima's D58 and Fu's FS59, which was implemented in DnaSP 5.10.0137 with 10,000 replicates.
Micro-Checker 2.2.360 was used to detect null alleles, large allelic dropout and potential scoring errors. Hardy–Weinberg equilibrium and the likelihood of linkage disequilibrium among selected marker loci were examined using Fisher's exact test under 10,000 dememorization, 100 batches and 5000 iterations per batch, as implemented in Genepop 4.7.261. We calculated the following parameters using GenAlEx 6.50362: number of alleles (N), number of effective alleles (Na), observed heterozygosity (HO), expected heterozygosity (HE), and fixation index (FIS) for each locus and population and pairwise relatedness among samples to avoid duplicates. The signature of historical bottleneck was tested utilizing 1000 iterations of mode-shift at IAM, SMM and TPM (70% SMM, 30% variance) using BOTTLENECK 1.2.0263. Garza-Williamson index (M-ratio)64 was quantified using AGARst 3.365. When the M-ratio dropped significantly below 0.68 according to the traditional method, the population was considered to have experienced a severe historical population decline.
Microsatellite population structure
Pairwise FST66 and -RST67 values were computed using FSTAT 2.9.468 and Arlequin 220.127.116.11 with 1000 permutations. Isolation by distance (IBD), the relationship between geographic distance and Slatkin's linearized FST (FST / (1 − FST))69, was analyzed and visualized using GenAlEx 6.50362. Covariance-standardized Principal Coordinate Analysis (PCoA) was performed to identify the distribution of genetic characteristics among populations at the individual level using GenAlEx 6.50362. Genetic barriers against gene flow among populations were mapped using BARRIER 2.270 according to Monmonier's maximum difference algorithm71 with 1000 bootstrapped Nei's chord distance (DA)72 and FST matrices generated by MSA 4.0573 and R package FinePop74, respectively. The level and pattern of population structure was visualized using Structure 2.3.475 that was implemented with 10 iterations of 105 MCMC generations (including 10,000 burn-in) from K = 1 to K = 11 (the number of populations). The optimal K was determined on STRUCTURE HARVESTER web 0.6.9476 based on Evanno method77.
To infer the evolutionary history of K. koreana populations, over 100 scenarios were predefined to include both northward and southward dispersal patterns of different geographically probable orders among genetic clusters. The scenarios were examined in DIYABC 2.1.078 in a tournament fashion. Genetically close populations were grouped into a single cluster to lower the computational load, resulting in seven populations/clusters. More than 105 simulations per scenario were performed and scenarios were compared based on posterior probabilities. If the comparison results of the direct and logistic approaches were inconsistent, the confidence of high-ranked scenarios were evaluated again to discriminate among them. For the last two selected scenarios, 106 simulations were implemented for each to estimate the parameters of effective population size and branching point.
A total of 38 haplotypes were identified from the 11 K. koreana populations, 37 of which were unique to an individual population. The Hd value of the species in total was identified (0.942), while its π value was relatively low (0.00995). The ratios of private haplotypes (NP / N and PH / H) were relatively low in populations [BE], [JS], [PC] and [SC] (Supplementary Information Table S3). Population [GY] was the most genetically distinct of all the populations, followed by population [HC] (Supplementary Information Table S4). The levels of divergence among populations [JS], [PC] and [SC] were negligible (Supplementary Information Table S4).
Although populations [PC], [JS] and [SC] shared haplotypes, the other populations had their own unique haplotypes. The haplotypes of the populations in close geographical proximity tended to be close to each other in the haplotype network (Fig. 2).
The overall topology of ML and BI trees were nearly identical (Supplementary Information Figure S1 and S2). Consistent with the haplotype network results, the ML and BI trees grouped K. koreana populations into nine clusters, [BE + PC + JS + SC], [GJ], [BE], [JC], [GJ + DJ], [JA], [JE], [HC] and [GY]. Cluster membership tended to be related to geographical locations, with some exceptions. In particular, population [GY] was consistently distinct and separated from the other populations, followed by [HC] and [JE], the second and third most distinct populations.
Recovered divergence dates of prior calibration constraints were within the 95% credible intervals of dates recovered in a previous study (Karsenia-Aneides: 35.39 Ma, Aneides-Hydromantes: 36.18 Ma, Aneides-Desmognathus: 40.71 Ma, Ensatina-Hydromantes: 41.08 Ma)19. In our results, the first split between population [GY] and the others was estimated to have occurred around 2.30 Ma (95% highest posterior density range: 1.74 ~ 2.92 Ma) (Fig. 3). At approximately 1 Ma after this divergence, [HC] and [JE] split off, followed chronologically by [JA], [GJ + DJ], [JC], [BE] and [PC + JS + SC].
The historical demography analysis inferred a continuously stable population size, represented by a typical J-shaped skyline plot that reflects faster molecular evolution on a shorter time scale79 (Fig. 3). Tajima's D and Fu's FS tests were conducted for each mitochondrial locus and the results are summarized in Supplementary Information Table S3. None of these tests rejected the null hypothesis of neutral evolution with constant population size.
Ten tetra-, two tri- and two di-nucleotide microsatellite markers were chosen, as they successfully amplified and contained adequate levels of polymorphism within and among populations. Although no large allelic dropout was detected, the presence of null alleles was suspected in all loci analyzed. Null alleles were found in only one or two populations across the loci and were not found associated with any specific population. This suggests that the potential for substructure or inbreeding within populations was low. The overall null allele frequencies in loci K1039 and K1040 were higher than that in the low-frequency zone (see Dakin and Avise80), and loci K1011 and K1040 had null alleles in two or more populations. We included these three loci (K1011, K1039 and K1040) in subsequent analyses as the analyses results did not differ with or without these loci (see Oromi et al.81). We did not detect any signature of linkage disequilibrium among the loci used (data now shown).
Overall, the 14 microsatellite loci exhibited moderately high genetic diversity (Supplementary Information Table S5). The tetra-nucleotide loci generated higher levels of diversity (HE = 0.841 − 0.921), whereas the di- and tri-nucleotide loci showed slightly lower levels of diversity (HE = 0.677 − 0.874) and were associated with high FIS values, indicating the likely presence of null alleles (Supplementary Information Table S5). The levels of genetic diversity among the populations differed, especially among populations [PC], [JS] and [SC] (Table 2). No signature of genetic bottleneck was detected in any of the populations (Supplementary Information Table S6). Moreover, all populations had an M-ratio greater than 0.68, considered as the threshold level of historical bottleneck (Supplementary Information Table S6).
Based on pairwise-FST and -RST, the overall level of genetic differentiation among populations was fairly high (Table 3). Since the overall values of pairwise-RST were much larger than those of pairwise-FST (Table 3), this genetic differentiation is likely due to spatial isolation between populations rather than genetic drift, resulting from population size fluctuations. In particular, populations [JA], [JE] and [GY] exhibited higher levels of genetic differentiation compared to those of the other populations (Table 3). Genetic differentiation among populations was correlated with geographic distance. For example, in each set of populations comprising [DJ], [GJ], [BE] and populations [PC], [JS], [SC], the populations are in close spatial proximity, and they show exceptionally low levels of genetic differentiations between them (Table 3). This pattern was also evident in the IBD test results (Fig. 4).
PCoA grouped populations into four clusters: [JS + PC + SC], [BE + JC], [DJ + GJ + GY + HC] and [JA + JE], with slight overlaps between groups (Supplementary Information Figure S3). Group [JA + JE] was clearly separated from the other groups without overlap (Supplementary Information Figure S3). The membership of each of these four groups is related to spatial proximity (Supplementary Information Figure S3). In our Bayesian Structure analysis, the delta K method implemented in Structure Harvester was unable to identify an optimal number of genetically distinguishable clusters. The result, K = 2, is the minimal value, and may be a result of underestimation77,82. The overall clustering pattern generated by PCoA is reflected in the Structure analysis (Fig. 5), except that populations [JA] and [JE] are distinct from each other (Fig. 5). Bootstrapped values of DA provide estimates of barriers to gene flow between populations; based on this value, populations [JA] and [JE] are completely isolated from the other populations (Supplementary Information Figure S4). Conversely, no barrier to gene flow was detected among populations [BE], [JC], [PC], [JS] and [SC] (Supplementary Information Figure S4).
DIYABC found two scenarios having the highest statistical probability (Supplementary Information Table S7 and Figure S5). In the first scenario (Supplementary Information Table S7 and Figure S5), [PC + JS + SC], [JA] and [GY] diverged from a common ancestor approximately 5,380 generations ago, while [JE] diverged approximately 2,950 generations ago from [JA]. The remaining populations diverged into [PC + JS + SC], [DJ + GJ], [BE + JC] and [HC] approximately 2,360 generations ago. In the second scenario (Supplementary Information Table S7 and Figure S5), [JA] and [GY] diverged from a common ancestor approximately 13,200 generations ago and [JE] branched off from [JA] approximately 2,090 generations ago. The remaining populations diverged into [JS + PC + SC], [DJ + GJ], [BE + JC] and [HC] approximately 5,710 generations ago. [JS + PC + SC] and [BE + JC] diverged recently, approximately 263 generations ago. Logistic regression and confidence testing (linear discriminant analysis) results indicate that the second scenario is more probable (Supplementary Information Table S8 and Figure S5).
In general, Karsenia koreana showed distinct population genetic structure. The various approaches that we implemented consistently show clear genetic clustering. Both mitochondrial and microsatellite data generally demonstrate a positive correlation between pairwise genetic differentiations and pairwise geographic distances among populations in general. Moreover, each of the southern populations ([GY], [HC], [JE] and [JA]) was differentiated from the other populations. Furthermore, both the BEAST divergence and DIYABC analyses indicate that the most likely evolutionary scenario consists of northward divergence events originating from a southernmost population [GY].
Population genetic diversity and structure
The range of values of mitochondrial diversity of K. koreana (π = 0.00011–0.00237) are relatively lower than those of other plethodontid species, e.g., European Hydromantes species (π = 0.003–0.037)83,84 and Gyrinophilus porphyriticus (π = 0.031)85; and East Asian salamander species, e.g., Pachyhynobius shangchengensis (π = 0.0345)86 and Hynobius quelpaertensis (π = 0.0174–0.0214)87. However, the frequency of heterozygotes in the microsatellite data (HO = 0.449–0.744) suggests that K. koreana is more genetically diverse than other plethodontid species, e.g., Plethodon cinereus (HO = 0.189–0.420)88; and other Northeast Asian salamanders, e.g., Japanese Onychodactylus species (HO = 0.28–0.61)89. Even at the population level, mitochondrial and microsatellite data show different patterns of genetic diversity, although differences in sample sizes should be taken into account. For example, populations located in the Taebaek Mountain range ([PC], [JS] and [SC]) showed the lowest levels of microsatellite diversity, while the mitochondrial diversity of these populations was slightly higher than those of populations [BE] and [HC], which had shallow levels of genetic differentiation.
Overall, the results of genetic structure at the population-level were congruent for mitochondrial DNA and microsatellite data. The spatial location of the populations correlated well with the genetic distance between populations using both genetic markers. Considering that this species does not actively move long distances and tends to stay in a limited habitat, it is unlikely that gene flow over long distances occurs. However, results from both mitochondrial and microsatellite loci, suggest strong gene flow has occurred between populations [PC], [JS] and [SC]. This is probably because populations occupying a complex, continuous mountain terrain have access to a shady, humid environment that allows for the relatively active movement of individuals.
An incongruence between the population structures results based on the two genetic markers (microsatellite, mitochondrial DNA) should be noted. Microsatellite analysis via Structure and PCoA clustered together populations [DJ + GJ] and [HC], while [HC] was relatively well separated from the other populations based on mitochondrial analyses (the haplotype network and phylogenetic trees). In the mitochondrial results, [GJ] was relatively well separated from [DJ]; yet one haplotype of [GJ] grouped with the [DJ] haplotypes. However, these two genetic markers exhibit different modes of inheritance and act on different evolutionary time scales, thus it is not uncommon to observe such incongruences, and many theories and hypotheses provide plausible explanations for such findings90,91,92,93.
Microsatellites and mitochondrial DNA provided different estimates in the effective population size, a finding that should be noted considering that this species is endangered90,94. Fluctuation by genetic drift is likely to occur in mitochondrial data, which provided smaller estimates of effective population size. Thus genetic drift may have caused the differences in the patterns of diversity and structure between the two markers. Historical demographic analyses based on both of the markers used in this study was unable to reject the null hypothesis (a constant population size) and additionally indicated small population sizes.
During the glacial advances of the Pliocene and the glacial cycles of the Pleistocene, animal species in the Northern Hemisphere were restricted to a southern refugium3,95. Later, the retreating ice sheets allowed many animals to recolonize northward. However, if the environments in the southern refugium contain sufficient resources, populations may stay in the refugium and/or adapt to the surrounding areas, rather than recolonize their original habitats. Gómez and Lunt4 proposed that within a big refugium composed of complex mountain ranges, multiple small refugia could form. The idea of small refugia existing within a big refugium has been proposed in many studies5,96,97,98, including empirical studies on plethodontids99,100 and other East Asian animals101,102,103,104,105. This scenario also provides a reasonable interpretation of our data, i.e., genetically distinct K. koreana populations exist in small refugia that are distributed in or around the mountainous terrain of the Korean Peninsula, which in total may be considered one big refugium.
Previous studies inferred that the ancestor of K. koreana crossed from western North America to East Asia approximately 65 Ma19,22. Since K. koreana is confined to the Korean Peninsula, we hypothesize that the current populations are the surviving relics of the species, and the present-day population genetic structure is the result of a relatively recent northward expansion from a southernmost population. With a few exceptions, the results of Structure, PCoA and haplotype network analyses indicate that the population structure of this species was formed in a unidirectional fashion that matches the terrain of the Korean Peninsula's mountain range. The BEAST divergence tree and the most probable DIYABC evolutionary scenario from mitochondrial and microsatellite data, respectively, are both consistent in their support for a historical northward dispersal and divergence from the southernmost population [GY]. Based on mitochondrial data, most K. koreana populations diverged approximately after 1 Ma; and the pattern of isolation by distance from genetically more differentiated southern populations [GY], [HC], [JE] and [JA] further support this idea. Taken overall, these data suggest that K. koreana populations that survived in the southern part of the peninsula throughout the Pleistocene glaciation migrated northward and most recently recolonized favorable habitats along the complex mountain ranges. To evaluate this hypothesis, future work should be done to locate additional populations in the northern part of the Korean Peninsula (i.e., North Korea), and if there are, determine if they exhibit the same evolutionary pattern. Additionally, one outlier that needs further investigation is the divergence between [GY] and all the other populations, estimated to have occurred approximately 2.3 Ma (see also Fig. 3). To clarify this, additional phylogenetic analyses are needed using different types of genetic markers, such as nuclear genes or ddRAD markers.
Implications for conservation
The delimitation of a species′ 'Management Unit' (MU) is important in conserving its current population genetic diversity106. A MU is defined as "a functionally independent population of a species that has formed under restricted levels of gene flow."106. Our results indicate that there are seven K. koreana MUs: [GY], [HC], [JE], [JA], [DJ + GJ], [BE + JC] and [PC + JS + SC]. Such a proposal has an important implication for the conservation of this species.
Presently, the IUCN red list designates Karsenia koreana as a species of "Least Concern,"23 mostly because it is widespread on the Korean Peninsula. However, our data suggest that historical gene flow between the populations was low, isolating populations and increasing the influence of genetic drift. Therefore, although this species is widespread, the average effective population size is very small. Conserving this species requires special conservation strategies that take into account the preservation of all seven MUs.
All data needed to evaluate the conclusions in the paper are present in the paper and the Supplementary Information materials. Sequences in the paper have been deposited at GenBank (Accession Nos. MT106778–MT106825).
Hewitt, G. M. Quaternary phylogeography: the roots of hybrid zones. Genetica 139, 617–638 (2011).
Gillespie, R. G. & Roderick, G. K. Evolution: geology and climate drive diversification. Nature 509, 297–298 (2014).
Hewitt, G. M. The genetic legacy of the quaternary ice ages. Nature 405, 907–913 (2000).
Gómez, A. & Lunt, D. H. Refugia within refugia: patterns of phylogeographic concordance in the Iberian Peninsula. In Phylogeography of Southern European Refugia (eds Weiss, S. & Ferrand, N.) 155–188 (Springer, 2007).
Abellán, P. & Svenning, J. C. Refugia within refugia—patterns in endemism and genetic divergence are linked to Late Quaternary climate stability in the Iberian Peninsula. Biol. J. Linn. Soc. 113, 13–28 (2014).
Avise, J. C. Phylogeography: The History and Formation of Species (Harvard University Press, 2000).
Juan, C., Emerson, B. C., Oromí, P. & Hewitt, G. M. Colonization and diversification: towards a phylogeographic synthesis for the Canary Islands. Trends Ecol. Evol. 15, 104–109 (2000).
Chung, M. Y., López-Pujol, J. & Chung, M. G. The role of the Baekdudaegan (Korean Peninsula) as a major glacial refugium for plant species: a priority for conservation. Biol. Conserv. 206, 236–248 (2017).
Chung, M. Y. et al. The Korean baekdudaegan mountains: a glacial refugium and a biodiversity hotspot that needs to be conserved. Front. Genet. 9, 489 (2018).
AmphibiaWeb. https://amphibiaweb.org/. (Accessed: 15th July 2020).
Borzée, A. & Min, M.-S. Disentangling the impacts of speciation, sympatry and the island effect on the morphology of seven Hynobius sp. salamanders. Animals 11, 187 (2021).
Baek, H.-J., Lee, M.-Y., Lee, H. & Min, M.-S. Mitochondrial DNA data unveil highly divergent populations within the Genus Hynobius (Caudata: Hynobiidae) in South Korea. Mol. Cells 31, 105–112 (2011).
Suk, H. Y. et al. Phylogenetic structure and ancestry of Korean clawed salamander, Onychodactylus koreanus (Caudata: Hynobiidae). Mitochondrial DNA Part A DNA Mapp. Seq. Anal. 29, 650–658 (2018).
Min, M.-S. et al. Discovery of the first Asian plethodontid salamander. Nature 435, 87–90 (2005).
Wiens, J. J., Engstrom, T. N. & Chippindale, P. T. Rapid diversification, incomplete isolation, and the "speciation clock" in North American salamanders (genus Plethodon): testing the hybrid swarm hypothesis of rapid radiation. Evolution 60, 2585–2603 (2006).
Kozak, K. H., Mendyk, R. W. & Wiens, J. J. Can parallel diversification occur in sympatry? Repeated patterns of body-size evolution in coexisting clades of North American salamanders. Evolution 63, 1769–1784 (2009).
Zhang, P. & Wake, D. B. Higher-level salamander relationships and divergence dates inferred from complete mitochondrial genomes. Mol. Phylogenet. Evol. 53, 492–508 (2009).
Vieites, D. R., Román, S. N., Wake, M. H. & Wake, D. B. A multigenic perspective on phylogenetic relationships in the largest family of salamanders, the Plethodontidae. Mol. Phylogenet. Evol. 59, 623–635 (2011).
Shen, X. X. et al. Enlarged multilocus data set provides surprisingly younger time of origin for the Plethodontidae, the largest family of salamanders. Syst. Biol. 65, 66–81 (2016).
Wake, D. B. Persistent plethodontid themes: species, phylogenies, and biogeography. Herpetologica 73, 242–251 (2017).
Wake, D. B. The enigmatic history of the European, Asian and American plethodontid salamanders. Amphib-reptil 34, 323–336 (2013).
Vieites, D. R., Min, M.-S. & Wake, D. B. Rapid diversification and dispersal during periods of global warming by plethodontid salamanders. Proc. Natl. Acad. Sci. U.S.A. 104, 19903–19907 (2007).
IUCN SSC Amphibian Specialist Group. Karsenia koreana. The IUCN Red List of Threatened Species 2019. (2019). https://www.iucnredlist.org/species/61903/110101886. (Accessed: 15th July 2020).
Borzée, A. et al. Climate change-based models predict range shifts in the distribution of the only Asian plethodontid salamander: Karsenia koreana. Sci. Rep. 9, 1–9 (2019).
Sessions, S. K. et al. Cytogenetic analysis of the Asian Plethodontid salamander, Karsenia koreana: evidence for karyotypic conservation, chromosome repatterning, and genome size evolution. Chromosom. Res. 16, 563–574 (2008).
Buckley, D., Wake, M. H. & Wake, D. B. Comparative skull osteology of Karsenia koreana (Amphibia, Caudata, Plethodontidae). J. Morphol. 271, 533–558 (2010).
Sever, D. M., Pinsoneault, A. D., Mackenzie, B. W., Siegel, D. S. & Staub, N. L. A description of the skin glands and cloacal morphology of the plethodontid salamander Karsenia koreana. Copeia 104, 816–823 (2016).
Moon, K. Y. & Park, D. Report of Karsenia koreana eggs oviposited within a semi-natural terrarium constructed at natural habitat. Korean J. Herpetol. 7, 1–5 (2016).
Song, J.-Y. et al. Life history of a unique Asian plethodontid salamander, Karsenia koreana. Zool. Sci. 34, 122–128 (2017).
Jung, J.-H., Lee, E.-J., Lee, W.-S. & Park, C.-D. Habitat suitability models of Korean crevice salamander (Karsenia koreana) at forested area in Daejeon metropolitan city, Republic of Korea. J. For. Res. 24, 349–355 (2019).
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. In Bioinformatics Methods and Protocols: Methods in Molecular Biology Vol. 132 (eds Misener, S. & Krawetz, S. A.) 365–386 (Humana Press, 2000).
Su, X. Z., Wu, Y., Sifri, C. D. & Wellems, T. E. Reduced extension temperatures required for PCR amplification of extremely A+T-rich DNA. Nucleic Acids Res. 24, 1574–1575 (1996).
Smit, A. F. A., Hubley, R. & Green, P. RepeatModeler Open-1.0. 2008–2015. (2014). http://www.repeatmasker.org.
Stieneke, D. L. & Eujayl, I. L. Imperfect SSR Finder Version 1.0. United States Department of Agriculture. (2019). https://data.nal.usda.gov/dataset/imperfect-ssr-finder.
Thompson, J. D., Higgins, D. G. & Gibson, T. J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (1994).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
Librado, P. & Rozas, J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25, 1451–1452 (2009).
Nei, M. Molecular Evolutionary Genetics (Columbia University Press, 1987).
Tajima, F. Evolutionary relationship of DNA sequences in finite populations. Genetics 105, 437–460 (1983).
Excoffier, L., Smouse, P. & Quattro, J. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131, 479–491 (1992).
Excoffier, L. & Lischer, H. E. L. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567 (2010).
Bandelt, H.-J., Forster, P. & Röhl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16, 37–48 (1999).
Wake, D. B. Taxonomy of salamanders of the family Plethodontidae (Amphibia: Caudata). Zootaxa 3484, 75–82 (2012).
Lanfear, R., Calcott, B., Ho, S. Y. & Guidon, S. PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol. Biol. Evol. 29, 1695–1701 (2012).
Lanfear, R., Frandsen, P. B., Wright, A. M., Senfeld, T. & Calcott, B. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol. Biol. Evol. 34, 772–773 (2017).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Miller, M. A., Pfeiffer, W. & Schwartz, T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. In Proceedings of the Gateway Computing Environment Workshop (GCE) 1–8 (2010).
Ronquist, F. & Huelsenbeck, J. P. Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Tavaré, S. Some probabilistic and statistical problems in the analysis of DNA sequences. In Some Mathematical Questions in Biology: DNA Sequence Analysis (ed. Miura, R. M.) 57–86 (American Mathematical Society, 1986).
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).
Bouckaert, R. et al. BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 15, e1006650 (2019).
Brown, R. P. & Yang, Z. Rate variation and estimation of divergence times using strict and relaxed clocks. BMC Evol. Biol. 11, 271 (2011).
Skilling, J. Nested sampling for general Bayesian computation. Bayesian Anal. 1, 833–860 (2006).
Russel, P. M., Brewer, B. J., Klaere, S. & Bouckaert, R. Model selection and parameter inference in phylogenetics using nested sampling. Syst. Biol. 68, 219–233 (2018).
Heled, J. & Drummond, A. J. Bayesian inference of population size history from multiple loci. BMC Evol. Biol. 8, 289 (2008).
Hasegawa, M., Kishino, H. & Yano, T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22, 160–174 (1985).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. (2013). http://www.r-project.org/.
Tajima, F. The effect of change in population size on DNA polymorphism. Genetics 123, 597–601 (1989).
Fu, Y.-X. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147, 915–9125 (1997).
van Oosterhout, C., Hutchinson, B., Wills, D. & Shipley, P. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes 4, 535–538 (2004).
Rousset, F. Genepop'007: a complete reimplementation of the Genepop software for Windows and Linux. Mol. Ecol. Resour. 8, 103–106 (2008).
Peakall, R. O. D. & Smouse, P. E. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 6, 288–295 (2006).
Cornuet, J. M. & Luikart, G. Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data. Genetics 144, 2001–2014 (1997).
Garza, J. C. & Williamson, E. G. Detection of reduction in population size using data from microsatellite loci. Mol. Ecol. 10, 305–318 (2001).
Harley, E. H. AGARst: a programme for calculating allele frequencies, Gst and Rst from microsatellite data, version 2. University of Cape Town, Cape Town, South Africa. (2001).
Wright, S. The interpretation of population structure by F-statistics with special regard to systems of mating. Evolution 19, 395–420 (1965).
Slatkin, M. A measure of population subdivision based on microsatellite allele frequencies. Genetics 139, 457–462 (1995).
Goudet, J. Fstat (ver. 2.9.4), a program to estimate and test population genetics parameters. (2003). https://www2.unil.ch/popgen/softwares/fstat.htm
Rousset, F. Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics 145, 1219–1228 (1997).
Manni, F., Guérard, E. & Heyer, E. Geographic patterns of (genetic, morphologic, linguistic) variation: how barriers can be detected by "Monmonier's algorithm". Hum. Biol. 76, 173–190 (2004).
Monmonier, M. Maximum-difference barriers: an alternative numerical regionalization method. Geogr. Anal. 3, 245–261 (1973).
Nei, M., Tajima, F. & Tateno, Y. Accuracy of estimated phylogenetic trees from molecular data. J. Mol. Evol. 19, 153–170 (1983).
Dieringer, D. & Schlötterer, C. Microsatellite analyser (MSA): a platform independent analysis tool for large microsatellite data sets. Mol. Ecol. Notes 3, 167–169 (2003).
Kitada, S., Nakamichi, R. & Kishino, H. The empirical Bayes estimators of fine-scale population structure in high gene flow species. Mol. Ecol. Resour. 17, 1210–1222 (2017).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Earl, D. A. & Vonholdt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Cornuet, J.-M. et al. DIYABC v2.0: a software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data. Bioinformatics 30, 1187–1189 (2014).
Penny, D. Relativity for molecular clocks. Nature 436, 183–184 (2005).
Dakin, E. E. & Avise, J. C. Microsatellite null alleles in parentage analysis. Heredity 93, 504–509 (2004).
Oromi, N. et al. Genetic structure of lake and stream populations in a Pyrenean amphibian (Calotriton asper) reveals evolutionary significant units associated with paedomorphosis. J. Zool. Syst. Evol. Res. 57, 418–430 (2019).
Janes, J. K. et al. The K = 2 conundrum. Mol. Ecol. 26, 3594–3602 (2017).
Chiari, Y. et al. Phylogeography of Sardinian cave salamanders (genus Hydromantes) is mainly determined by geomorphology. PLoS ONE 7, e32332 (2012).
Cimmaruta, R., Lucente, D. & Nascetti, G. Persistence, isolation and diversification of a naturally fragmented species in local refugia: the case of Hydromantes strinatii. PLoS ONE 10, e0131298 (2015).
Kuchta, S. R., Haughey, M., Wynn, A. H., Jacobs, J. F. & Highton, R. Ancient river systems and phylogeographical structure in the spring salamander, Gyrinophilus porphyriticus. J. Biogeogr. 43, 639–652 (2016).
Pan, T. et al. Long-term sky islands generate highly divergent lineages of a narrowly distributed stream salamander (Pachyhynobius shangchengensis) in mid-latitude mountains of East Asia. BMC Evol. Biol. 19, 1–15 (2019).
Suk, H. Y. et al. Genetic and phylogenetic structure of Hynobius quelpaertensis, an endangered endemic salamander species on the Korean Peninsula. Genes Genom. 42, 165–178 (2020).
Cameron, A. C., Anderson, J. J. & Page, R. B. Assessment of intra and interregional genetic variation in the Eastern Red-backed Salamander, Plethodon cinereus, via analysis of novel microsatellite markers. PLoS ONE 12, e0186866 (2017).
Yoshikawa, N. & Nagata, N. Eighteen SSR Markers for the Japanese clawed salamander, Onychodactylus japonicus, and cross-amplification in its congeners. Curr. Herpetol. 36, 153–158 (2017).
Moritz, C. Applications of mitochondrial DNA analysis in conservation: a critical review. Mol. Ecol. 3, 401–411 (1994).
Estoup, A., Jarne, P. & Cornuet, J. M. Homoplasy and mutation model at microsatellite loci and their consequences for population genetics analysis. Mol. Ecol. 11, 1591–1604 (2002).
Toews, D. P. L. & Brelsford, A. The biogeography of mitochondrial and nuclear discordance in animals. Mol. Ecol. 21, 3907–3930 (2012).
Karl, S. A., Toonen, R. J., Grant, W. S. & Bowen, B. W. Common misconceptions in molecular ecology: echoes of the modern synthesis. Mol. Ecol. 21, 4171–4189 (2012).
Cook, B. D., Bunn, S. E. & Hughes, J. M. Molecular genetic and stable isotope signatures reveal complementary patterns of population connectivity in the regionally vulnerable southern pygmy perch (Nannoperca australis). Biol. Conserv. 138, 60–72 (2007).
Hewitt, G. M. Some genetic consequences of ice ages, and their role in divergence and speciation. Biol. J. Linn. Soc. 58, 247–276 (1996).
Avise, J. C., Walker, D. & Johns, G. C. Speciation durations and Pleistocene effects on vertebrate phylogeography. Proc. R. Soc. Lond. Ser. B Biol. Sci. 265, 1707–1712 (1998).
Alexandrino, J., Froufe, E., Arntzen, J. W. & Ferrand, N. Genetic subdivision, glacial refugia and postglacial recolonization in the golden-striped salamander, Chioglossa lusitanica (Amphibia: Urodela). Mol. Ecol. 9, 771–781 (2000).
Alexandrino, J., Arntzen, J. W. & Ferrand, N. Nested clade analysis and the genetic evidence for population expansion in the phylogeography of the golden-striped salamander, Chioglossa lusitanica (Amphibia: Urodela). Heredity 88, 66–74 (2002).
Rovito, S. M. Lineage divergence and speciation in the Web-toed Salamanders (Plethodontidae: Hydromantes) of the Sierra Nevada, California . Mol. Ecol. 19, 4554–4571 (2010).
Shafer, A. B. A., Cullingham, C. I., Côté, S. D. & Coltman, D. W. Of glaciers and refugia: a decade of study sheds new light on the phylogeography of northwestern North America. Mol. Ecol. 19, 4589–4621 (2010).
Zhang, R.-Z. Geological events and mammalian distribution in China. Acta Zool. Sin. 48, 141–153 (2002).
Matsui, M., Tominaga, A., Liu, W. Z. & Tanaka-Ueno, T. Reduced genetic variation in the Japanese giant salamander, Andrias japonicus (Amphibia: Caudata). Mol. Phylogenet. Evol. 49, 318–326 (2008).
Matsui, M. et al. Phylogenetic relationships of two Salamandrella species as revealed by mitochondrial DNA and allozyme variation (Amphibia: Caudata: Hynobiidae). Mol. Phylogenet. Evol. 48, 84–93 (2008).
Malyarchuk, B., Derenko, M. & Denisova, G. Phylogeny and genetic history of the Siberian salamander (Salamandrella keyserlingii, Dybowski, 1870) inferred from complete mitochondrial genomes. Mol. Phylogenet. Evol. 67, 348–357 (2013).
Honda, A. et al. Late Pleistocene climate change and population dynamics of Japanese Myodes voles inferred from mitochondrial cytochrome b sequences. J. Mammal. 100, 1156–1168 (2019).
Moritz, C. Defining "evolutionarily significant units" for conservation. Trends Ecol. Evol. 9, 373–375 (1994).
We sincerely thank to Kyo-Soung Koo, Jeong-Hyun Lee and Sang-Cheol Lee for sharing sampling site information, and Jaejin Park for providing a sample from Jeongseon. This work was supported by a grant from the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2016R1D1A1B03934071).
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Jeon, J.Y., Jung, Jh., Suk, H.Y. et al. The Asian plethodontid salamander preserves historical genetic imprints of recent northern expansion. Sci Rep 11, 9193 (2021). https://doi.org/10.1038/s41598-021-88238-z