Introduction

Studies of Southern Ocean (SO) biodiversity have focused on eurybathy, circumpolarity and the high prevalence of brooders1,2. For example, poor dispersers, which include many species, present seemingly high levels of endemism in the SO3. The high prevalence of brooders can be explained partially by glacial cycles4. This explanation is based on the glacial refugium hypothesis5. In this scenario, hypothetical glacial refuges in the Antarctic would have allowed Antarctic species to be protected during the interglacial periods2,6, and processes of expansion and contraction could have taken place7. Currently, numerous lines of evidence of an increase in population size during the early stages of the last glaciation seem to support the view that glacial contractions and interglacial expansions were a recurrent pattern that eventually led to species adapting to cold conditions during the Late Quaternary6,7,8,9,10. Species colonizing different environments have managed to survive through the Pliocene and Pleistocene glaciation cycles by shifting between habitable areas of polynyas or ice fracture zones under former or extant Antarctic ice shelves11,12,13. Some hypotheses that have emerged to explain the survival of species in the last glacial periods are derived from the Expansion‐Contraction Model14. One of these hypotheses is in situ persistence in Antarctic glacial refuges11,15,16. The in situ persistence scenario suggests the presence of one or several isolated refugia on the shelf; these refugia are associated with strong population bottlenecks4,16. A second hypothesis is that of island refuges, which proposes that shallow marine species survived off the Antarctic continental shelf on adjacent Antarctic islands such as the South Shetland Islands (SSIs)11,17. This area has been defined as an in situ refuge associated with volcanoes or areas of geothermal activity18. These refuges have been particularly informative in reconstructing the periglacial and postglacial history of marine organisms19,20. Finding a pronounced geographical structure may provide evidence of putative refugial areas21,22,23. Refugial populations are usually composed of subsets of the genetic diversity, and long-term isolation of populations within geographically separate refugia will lead to genetic differentiation14,24.

Despite the fact that several genera of bivalves have survived the glacial periods, a reduced number of these persist in Antarctic benthic systems and their biogeographic patterns have yet to be exhaustively studied. The review of Seymour Island Paleocene molluscs25 lists 57 species and 41 genera, of which only 12 genera are still represented in the SO; most of the remaining genera occur at present in the seas north of the Polar Front26 and a few still live around Antarctica (12.5%)25. Moreover, the analysis of species richness in bivalve families revealed high richness north of the Antarctic Peninsula and low richness to the south, for example in families such as Pectinidae, Nuculidae, Mytilidae and Gaimardiidae26,27,28,29,30,31,32,33,34,35. Antarctic bivalves have a varied range of reproductive strategies. In some species the sexes are separate (gonochoric organisms) while others are simultaneous hermaphrodites36,37; some display indirect development and external fertilization (planktonic larvae)38 while others exhibit direct development in which the embryos are retained by the female, who provides parental care to the offspring (brooder species)39,40,41,42,43,44,45.

The microbivalve Kidderia subquadrata Pelseneer (1903)46,47 is a brooder species with an average shell length of 4.5 mm47. Individuals incubate their offspring until the juvenile stage and have been reported inhabiting the rocky intertidal of islands adjacent to the West Antarctic Peninsula (WAP)48. No genetic information on this species has been reported prior to this work. In this paper, we aim to increase the available information about Antarctic biodiversity and microbivalve species. To that end, we carried out a spatial analysis of the genetic diversity in populations of K. subquadrata from the WAP to advance the understanding of the evolutionary history of these microbivalves that inhabit Antarctica.

Based on the absence of the larval phase, we hypothesized that due to a low dispersal potential in this species we would find a high spatial genetic structure among populations with evidence of in situ refugia in the SSIs. To test this hypothesis, we used the mitochondrial molecular marker, Cytochrome oxidase I (CoxI) and a suite of phylogeographic analyses to estimate the spatial genetic diversity and the historical demography of this species.

Results

Genetic diversity and spatial genetic structure

The study included seven localities across the WAP (Fig. 1). No insertions/deletions or stop codons were detected in the sequence dataset. In total, 37 haplotypes were recovered (Table S1), with a whole-dataset haplotype diversity of 0.722 ± 0.032 and 51 polymorphic sites (Table 1). The pairwise FST distances revealed high differentiation among sites, with Doumer Island being highly and significantly differentiated from the other locations (Table S1). The IBD Mantel test indicated that the relationship between geographic distance and linearized genetic distance was not significant among islands (r = 0.196; p-value = 0.25). The general AMOVA showed high genetic structure among populations (FST = 0.87, p-value = 0.0001). The highest values of genetic differentiation in the SAMOVA analysis were detected when the populations were separated into four groups (Group 1: Signy Island; Group 2: King George, Penguin, Greenwich and Deception Islands; Group 3: Livingston Island; and Group 4: Doumer Island), with FCT = 0.92 (p-value = 0.029) (Table 2). The Bayesian clustering model implemented in Geneland detected three clusters in the dataset (K = 3), with a high posterior probability of cluster membership (0.9), demonstrating the strong genetic structure of K. subquadrata populations from the WAP. Cluster 1 (Fig. 2) includes samples exclusively from Signy Island; Cluster 2 includes samples from King George, Penguin, Greenwich, Deception and Livingston Islands; and, finally, Cluster 3 includes samples from Doumer Island.

Figure 1

Map of the West Antarctic Peninsula depicting sampling locations of K. subquadrata: (1) Signy, (2) Penguin, (3) King George, (4) Greenwich, (5) Deception, (6) Livingston and (7) Doumer Islands.

Table 1 Geographic information and standard genetic statistics of Kidderia subquadrata sampling localities.
Table 2 Analysis of molecular variance in K. subquadrata.
Figure 2

Spatial output of the Geneland analysis of K. subquadrata populations. Black circles indicate the relative positions of the sampled populations. Shading is proportional to the posterior probability of cluster membership, with lighter (yellow) areas showing the highest probabilities. Cluster 1: Signy Island; Cluster 2: Penguin, King George, Greenwich, Livingston and Deception Islands; Cluster 3: Doumer Island.

The genealogical reconstruction of the haplotype network comprised 37 different haplotypes and showed a central haplotype (H1, Fig. 3) with the highest frequency (49.6%) that is distributed among five islands (Signy, King George, Penguin, Greenwich and Deception Islands) in a star-like topology. Additionally, most of the islands presented unique haplotypes with frequencies of 0.6% (Signy), 3.4% (Deception), 3.9% (Penguin), 5.6% (King George) and 7.8% (Greenwich). Ten mutational steps separated H1 from the samples of Livingston Island, where 4 unique haplotypes (3.4% of the total haplotypes) were detected. Another 10 mutational steps separated these groups from the second most frequent haplotype (H2, 19.5%); this group includes exclusively samples from Doumer Island, where 2 unique haplotypes were found. A unique haplotype from Signy Island (H3) is separated from those of Doumer Island by one mutational step (Fig. 3).

Figure 3

Median-joining haplotype network. Each haplotype is represented by a colored circle indicating the main area where it was collected; the size of the circle is proportional to its frequency in the overall sampling effort.

Historical demography

The Tajima D test and the Fu Fs statistic (Table 1) were negative but not significant for the dataset as a whole (Tajima D = -1.027, p-value = 0.2; Fu Fs = -1.906, p-value = 0.2). However, both indices were negative and significant for most of the individual islands. Signy, Deception and Doumer Islands were the exceptions, with Signy Island showing positive but not significant values. The Bayesian skyline plot analysis, which included Signy, Penguin, King George, Greenwich and Deception Islands (star-like topology in the haplotype network), supports the hypothesis of a recent population expansion of K. subquadrata (Fig. 4). Based on this analysis, the time to the most recent common ancestor (tMRCA) was 5500 years ago, while the onset of population expansion occurred approximately 65,000 years ago.

Figure 4

Historical demographic trends. Changes in effective population size (Ne) over time reconstructed using a Bayesian skyline plot approach based on CoxI haplotypes. The y-axis is the product of effective population size (Ne) and generation length on a log scale, while the x-axis is the time before present. The median estimate (solid black line) and 95% highest posterior density (HPD) limits (purple area) are shown. The thick dashed line represents the time of the most recent common ancestor, and the thin dashed line represents the time during which the expansion took place.

Figure 5

Resulting plots from the DIYABC analysis. (a) The two models evaluated after the hierarchical analysis. (b) Pre-evaluation of the tested scenarios, conducted through a PCA on summary statistics of the simulated and observed datasets. The observed dataset (yellow dot) falls within the cloud of simulated points. See Figure S1 for details on the tested scenarios.

The pre‐evaluation step of the ABC procedure performed in DIYABC allowed us to reduce the number of scenarios to test and improved the ability of DIYABC to reveal the true demographic model (Fig. S2–S4). The final analysis (Fig. 5) suggested that both scenarios were realistic. Scenarios 1 and 2 both support an initial divergence between Doumer and Signy Islands followed by an admixture event from which the population of Livingston Island originated. Scenario 1 reveals an admixture event between Signy and Greenwich Islands giving rise to admixed populations in the SSIs (King George, Penguin and Deception Islands). Scenario 2 supports a recent divergence in the SSIs (Fig. 5). When the scenarios were compared, the posterior probability of scenario 1 based on direct estimation reached the highest value, 0.8220 (0.4867, 1.0000), whereas based on the logistic approach scenario 2 reached 0.9212 (0.9067, 0.9358). Type I errors for both scenarios were low (scenario 1: Type I error = 0.038 for direct estimation; 0.009 for logistic approach; scenario 2: Type I error = 0.0039 for direct estimation; 0.010 for logistic approach).

Discussion

Our analysis supports genetic differentiation among the sampled populations of K. subquadrata, in accordance with the prediction that in the WAP this brooder species has limited dispersal potential and would therefore display a highly structured population2,49,50. Reproductive life history might have a crucial influence on patterns of genetic structure and consequently on the degree of connectivity among populations of Antarctic organisms. For example, a study of two common Antarctic benthic surface-grazing gastropods with contrasting development strategies revealed high spatial structure among populations of the brooder Margarella antarctica, while no spatial population structure was found in the broadcaster Nacella concinna (planktonic larva) throughout the geographical range studied51. Similar results were reported in other works on this Antarctic limpet; these studies, whose sampling sites covered a large geographical range, suggest the existence of a single panmictic unit17,52. More recently, a study of sea stars concluded that broadcasters are less spatially structured than brooders53.

The local oceanographic dynamics may also have played a relevant role in the spatial interaction among populations in the interglacial periods in the SO. The spatial genetic structure pattern of K. subquadrata may have been enhanced by the Bransfield Strait current system. The surface circulation in Bransfield Strait is influenced by the Antarctic Coastal Current (ACC) and the Antarctic Slope Front54. Seaward of the South Shetland Islands (SSIs), the southwestward slope currents persist over the long term; they originate from the Antarctic Slope Current that flows around the Antarctic Peninsula55. The water masses of the ACC transport nutrients west of the SSIs and enter the area through the Bransfield Strait, flowing strongly to the east-northeast along the southern margin of the SSIs56. The Weddell Sea water enters through the Bransfield Strait mainly from the north-east and flows in a south-west direction along the Antarctic Peninsula. A similar process has been proposed to explain a genetic barrier identified in Antarctic annelids of the family Phyllodocidae in the strait north of Deception Island57, which coincides with the oceanic front that is generated by the intrusion of seawater masses from the Weddell Sea into the Bransfield Strait55. Therefore, the structure in K. subquadrata populations could be explained by the system of currents in the Bransfield Strait that forms a cyclonic circulation58,59. In addition, Livingston Island showed the presence of unique haplotypes that are separated by more than 10 mutational steps from the ancestral polymorphic haplotype even though the island lies geographically within the SSIs. This undoubtedly requires a more detailed explanation and further investigation. According to a modified map from Barlett (2018), Livingston Island has shallow bathymetry (100–200 m)60. This feature could result in water retention processes that increase the retention of marine invertebrates around the island, thus acting as a sink that results in strong spatial genetic differentiation of its K. subquadrata population. However, the number of individuals analyzed from this locality is small, and an increased sample size would provide a more complete picture of the spatial process in this locality and could reveal the actual phylogeographic pattern.

On the other hand, glacial activity in Antarctica has been suggested as a driver of phylogeographic patterns for species by causing isolation and the presence of in situ glacial refuges2,12,13,14,61. It now appears that the continental shelf was not ice-covered equally across the Antarctic coastline, which allowed some ice-free refuges for fauna during the glacial maxima62. According to the literature, the last glacial maximum occurred 20,000 to 17,000 years ago, and West Antarctica and the adjacent islands were apparently covered in ice62,63. The deglaciation of Signy Island and the SSIs began 14,000 to 11,000 years ago and continued during the Holocene until 8,000 to 6,000 years ago64,65,66,67. Strong signals of recent demographic expansion as well as a founder effect associated with recolonization can be inferred from our haplotype network and DIYABC results. Therefore, a potential historical demographic hypothesis is that the strong genetic subdivision is a consequence of a pattern of multiple glacial refugia followed by post-glacial expansion19,21.

In particular, the most widely distributed and frequent haplotypes were shared by five islands: Signy, Penguin, King George, Greenwich and Deception. Signy and Doumer Islands are the most geographically distant localities within the study’s sampling range. The DIYABC analysis reveals that Signy and Doumer are the ancestral populations and that, probably after the original split, Signy Island constituted a source population (refuge) for the populations of the SSIs. Interestingly, the haplotype network showed that Signy also has a unique haplotype that is different from the ancestral polymorphic haplotype and more closely related to the unique Doumer haplotypes. Some individuals of K. subquadrata that were collected as part of our samples lived as epibionts on macroalgae. Macroalgal rafting has been suggested to explain the low genetic differentiation of marine communities across the Subantarctic region68, and recent investigations have revealed the arrival of invasive species that reached the Antarctic continent alive on kelp rafts69,70. Thus, a potential explanation for this unusual spatial pattern of genetic distribution in K. subquadrata populations could be that they have used macroalgae as a transport vector. This hypothesis implies some degree of successful colonization and subsequent isolation of these unique haplotypes. In the case of Doumer Island, the southernmost locality, we identified exclusively private haplotypes, which displayed low genetic diversity and the highest FST values, suggesting a process known as leptokurtic dispersion71,72,73. Since then, Doumer Island has probably served as its own refuge and undergone divergence in isolation19. Similar patterns of genetic structure have been reported in the marine bivalve Arctica islandica74 in the Northern Hemisphere.

Non‐pelagic development has been described in the Antarctic fauna, where brood protection appears dominant75. It is worth noting that cryptic species are commonly found among brooding species in the Antarctic Peninsula, for example in echinoderms such as ophiuroids and echinoids and in brooding pycnogonids44,76,77,78. The study of the bivalve Lissarca notorcadensis was one of the first to show indications of cryptic speciation in Antarctic bivalves27. New evidence of two cryptic lineages in the Antarctic Peninsula was recently reported in the bivalve Aequiyoldia eightsii, with an estimated genetic distance of 5.79% for CoxI35. Considering K. subquadrata’s intrinsically low dispersal and the evidence of the probable occurrence of in situ refuges revealed in this study, a cryptic speciation process could potentially explain the genetic diversity in the population of Doumer Island. The samples obtained from Doumer Island were collected in a protected bay surrounded by glaciers, semi-isolated from the marine currents since there is only one entrance to the bay; these geographical features indicate that gene flow out of the bay could be difficult. Here, based on CoxI, we report a genetic distance of 2% between the central haplotype (H1) and the two unique haplotypes from Doumer Island; this value is within the limit established in the existing literature79 for inferring a speciation process. Intraspecific divergence in CoxI is rarely greater than 2% in phylogeographic analyses; even so, it can serve as an effective tool in recognizing putative new lineages61,73,80,81. Moreover, there are reports of distinct glacial refugia in the SSIs and Antarctic Peninsula harboring cryptic species that have diverged recently in micro-allopatry82,83,84.

In conclusion, we report new evidence of a strong spatial genetic structure in a brooder microbivalve species unique to the WAP. We also suggest the possible presence of in situ glacial refuges and infer that a cryptic speciation process may be in progress. Mitochondrial DNA has been widely used in numerous studies of phylogeography5,16,17,21,27,35,42; its greatest utility is in non-model species for which there are no previous genetic data, since its use enables comparisons with previous studies. However, recent studies have reported significant dissimilarities in the genetic diversity and structure encountered in marine species depending on the genetic markers used85. Differences in patterns of genetic structure have been linked to the fact that organelle DNA is more sensitive to introgression and/or rapid sweeps (due to selection or strong genetic drift) than is nuclear DNA86. Therefore, it is imperative to develop new nuclear markers (e.g., microsatellites or SNPs) in Antarctic marine invertebrates. Further studies could increase the number of populations sampled in the Southern Ocean to test for the existence of local adaptation using recently developed genomics and transcriptomics tools.

Methods

Sample sites and collection

A total of 179 individuals of K. subquadrata were collected from the following islands (Fig. 1): (1) Signy (60°43′S, 45°36′W; n = 16), (2) Penguin (62°05′S, 57°55′W; n = 34), (3) King George (62°05′S, 57°56′W; n = 32), (4) Greenwich (62°48′S, 59°66′W; n = 33), (5) Livingston (62°39′S, 60°36′W; n = 6), (6) Deception (62°58′S, 60°33′W; n = 23) and (7) Doumer (64°52′S, 63°35′W; n = 35). In addition, we sampled twice at the Chilean Antarctic O’Higgins military base on Kopaitik Island (63°19′S, 57°53′W), but we were unable to find specimens of K. subquadrata in this area. All samples were collected in rocky intertidal habitats with boulders or stones and in shallow subtidal environments. The collections were performed during austral summer expeditions in 2018 and 2019.
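
How the geographic distances between localities were obtained is not stated here. As one possible approach, the following minimal sketch converts the degree and minute coordinates listed above to decimal degrees and computes a great-circle (haversine) distance. The function names, the spherical Earth radius of 6371 km and the choice of the haversine formula are assumptions made for illustration, not the procedure used in this study.

```python
import math


def dm_to_decimal(degrees, minutes, hemisphere):
    """Convert degrees and minutes to signed decimal degrees (S and W negative)."""
    value = degrees + minutes / 60.0
    return -value if hemisphere in ("S", "W") else value


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in km, assuming a spherical Earth of mean radius."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


# Coordinates as listed in the text for Signy and Doumer Islands.
signy = (dm_to_decimal(60, 43, "S"), dm_to_decimal(45, 36, "W"))
doumer = (dm_to_decimal(64, 52, "S"), dm_to_decimal(63, 35, "W"))

signy_doumer_km = haversine_km(*signy, *doumer)
print(f"Signy to Doumer: {signy_doumer_km:.0f} km")
```

Applying the same function to every pair of localities would give a symmetric distance matrix of the kind a Mantel test takes as input.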

DNA extraction, amplification, sequencing and alignment

The total DNA of each individual was isolated with the Quick-DNA Plus commercial kit (Zymo Research) following the manufacturer's instructions. Given the absence of genetic information for the genus Kidderia, the strategy to generate molecular markers was to perform Illumina MiSeq sequencing of genomic DNA followed by in silico enrichment to obtain fragments of the mitochondrial genome. The total length of the partial mitochondrial genome recovered was 1733 bp. Using these data, two pairs of specific primers were designed for the Cytochrome oxidase I (CoxI) gene: the first pair, kg1F (5′-TTG GGC TGG GTT AAT AGG TACA-3′) and kg7R (5′-GAA AAC CAG CAA ACA TAG CA-3′), flanks a fragment with a total length of 861 bp; the second pair, kg7F (5′-TGC TAT GTT TGC TGG TTT TC-3′) and kg8R (5′-CCC AAA AAG ACA TTT GAC CC-3′), was used to recover a fragment of 279 bp. The amplified CoxI region had a total length of 1140 bp, coding for 380 amino acids.
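
The primer sequences and fragment lengths reported above can be checked for internal consistency. The short sketch below (helper names are illustrative, not part of the original workflow) verifies that kg7R is the reverse complement of kg7F, which suggests that the two amplicons meet at the same priming site, and that the two reported fragment lengths add up to a whole number of codons.

```python
# Primer sequences as given in the text, with spaces removed.
KG7R = "GAAAACCAGCAAACATAGCA"
KG7F = "TGCTATGTTTGCTGGTTTTC"

# Fragment lengths reported in the text (bp).
FRAGMENT_KG1F_KG7R_BP = 861
FRAGMENT_KG7F_KG8R_BP = 279

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def codon_count(length_bp):
    """Number of codons in an in-frame sequence; raises if not a multiple of 3."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_bp // 3


same_site = reverse_complement(KG7F) == KG7R
total_bp = FRAGMENT_KG1F_KG7R_BP + FRAGMENT_KG7F_KG8R_BP
amino_acids = codon_count(total_bp)

print(same_site, total_bp, amino_acids)
```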

All PCRs were performed in a final volume of 25 μL containing 1X PCR buffer, 3.5 mM MgCl2, 0.2 mM each dNTP, 0.25 μM each primer, 0.2X BSA, 1 μL DNA concentrate (10–50 ng), 0.6 units GoTaq DNA polymerase (Promega) and H2O to reach the final volume. The CoxI gene was amplified using the following thermocycling profile: an initial denaturation step (95 °C for 5 min); 35 cycles of amplification (94 °C for 30 s, 60 °C for 1 min and 72 °C for 2 min); and a final extension step (72 °C for 8 min). The PCR products were purified using E.Z.N.A. Cycle Pure PCR Purification kits (Omega Bio-tek) and sequenced in both directions by the sequencing service of Macrogen Inc. (www.macrogen.com). Sequences were aligned in Geneious R10.2.4 software87 and manually edited to resolve unclear base calls. CoxI consensus sequences were translated into amino acids using the invertebrate mitochondrial genetic code to check for stop codons. Alignments were performed using the default settings of the ClustalW alignment algorithm implemented in Geneious (cost matrix: IUB; gap open cost: 15; gap extend cost: 6.6).

Spatial pattern of the genetic diversity

To estimate the levels of genetic polymorphism in populations of K. subquadrata we used standard diversity indices: number of haplotypes (k), number of segregating sites (S), haplotype diversity (H) and the average number of pairwise differences between sequences (π) for each region, using DnaSP v588. Pairwise genetic differentiation (pairwise FST) was estimated to determine the genetic structure among populations using the ARLEQUIN program v.3.5189. To test the hypothesis of isolation by distance (IBD) we used the relationship between the linearized genetic distance (FST/(1 − FST))90 and the geographic distances among Signy, King George, Penguin, Deception, Greenwich, Livingston and Doumer Islands to perform a Mantel test91 executed in the R environment with the vegan package (version 2.5-2)92.
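
The logic of the IBD test can be sketched in a few lines: pairwise FST values are linearized as FST/(1 − FST), and the correlation between the genetic and geographic distance matrices is compared with the correlations obtained after randomly permuting the localities. The sketch below is a plain-Python illustration of a one-sided permutation Mantel test, not the vegan implementation used in this study; the function names and the two small matrices are hypothetical and do not correspond to the data in Table S1.

```python
import math
import random


def linearize_fst(fst):
    """Linearized genetic distance FST / (1 - FST)."""
    return fst / (1.0 - fst)


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square matrix into a list."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mantel(gen, geo, permutations=999, seed=0):
    """One-sided permutation Mantel test; returns (r, p-value)."""
    rng = random.Random(seed)
    n = len(gen)
    geo_vec = upper_triangle(geo)
    r_obs = pearson(upper_triangle(gen), geo_vec)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)  # permute rows and columns of gen together
        permuted = [[gen[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(upper_triangle(permuted), geo_vec) >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (permutations + 1)


# Hypothetical pairwise FST and distances (km) for four sites, for illustration only.
toy_fst = [[0.0, 0.1, 0.4, 0.8],
           [0.1, 0.0, 0.3, 0.7],
           [0.4, 0.3, 0.0, 0.6],
           [0.8, 0.7, 0.6, 0.0]]
toy_geo = [[0, 10, 200, 900],
           [10, 0, 190, 890],
           [200, 190, 0, 700],
           [900, 890, 700, 0]]

toy_gen = [[linearize_fst(v) for v in row] for row in toy_fst]
r, p = mantel(toy_gen, toy_geo)
print(f"Mantel r = {r:.3f}, p = {p:.3f}")
```

A non-significant result from such a test, as reported here, indicates that genetic differentiation does not simply increase with geographic distance.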

To test for spatial genetic differentiation, we calculated a global FST by performing a global AMOVA analysis with ARLEQUIN89. Next, we developed a set of analyses using the geographic coordinates of each sampled island. Here, we analyzed the population structure without an a priori cluster hypothesis using a spatial analysis of molecular variance (SAMOVA)93. In this approach, sample sites are clustered using a simulated annealing procedure that aims to maximize the proportion of total genetic variance caused by differences among groups of populations.

To infer the spatial pattern of genetic diversity in K. subquadrata we estimated the number and composition of panmictic groups and the spatial boundaries among them using a clustering method based on the Bayesian model computed with the GENELAND package, version 4.0.094 in the R environment version 2.4.192. This software implements a Markov Chain Monte Carlo (MCMC) procedure to determine the best clustering of samples with regard to genetic and geographic information. Geographic information is considered at the Bayesian prior level, so clusters corresponding to spatially structured groups are considered to be more probable than clusters that are randomly distributed in space. A total of 5 × 10⁶ MCMC iterations were sampled every 1000 steps with a 500-step burn-in period, and a maximum number of clusters K = 8 was set to estimate the model parameters and posterior probabilities of group membership.

Finally, the genealogical relationships among CoxI haplotypes for the whole dataset were characterized using median joining networks in HapView (http://www.cibiv.at/~greg/haploviewer)95.

Historical demography

To examine the historical population demography, the Tajima D test96 and the Fu Fs statistic97 were calculated. The Tajima D test is based on the fact that, under a neutral model, estimates of the number of segregating sites and of the average number of nucleotide differences are correlated. Fu’s test is based on the infinite-sites model without recombination; it gives the probability of observing a random sample with a number of alleles equal to or less than the value observed, given the observed level of diversity and the assumption that all alleles are selectively neutral. Both statistics were calculated in ARLEQUIN. In addition, we estimated past population dynamics over time using the Bayesian skyline plot method implemented in BEAST 298. For this analysis, we selected the five sample sites (Signy, Penguin, King George, Greenwich and Deception Islands) that shared the most frequently observed haplotype; as described earlier, these sample sites showed a star-like topology in the haplotype network. We conducted two independent Bayesian MCMC runs using the generalized time-reversible (GTR) model with a gamma distribution (G), previously estimated with JModelTest299, and a mutation rate calibrated for CoxI sequences of bivalves (1.0% Myr⁻¹)100,101. Substitution rates were modified to a tenfold evolutionary rate (10% per million years), considering the correction for time dependence of molecular rates at the population level102,103. The two independent MCMC calculations were run for 1.5 × 10⁷ generations (sampled every 1000 iterations), and the first 10% of the trees were discarded as burn-in. The convergence of runs was confirmed with Tracer v1.6104, ensuring a minimum effective sample size (ESS) of 600 for each statistic. The median and corresponding credibility intervals of the Bayesian skyline plot were depicted with Tracer v1.6.
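
To make the Tajima D statistic concrete, the sketch below computes it from the number of sequences (n), the number of segregating sites (S) and the mean number of pairwise differences (π), following the standard formulation of Tajima (1989). It is a minimal illustration and not the ARLEQUIN implementation used in this study; the function names and the toy alignment are hypothetical, and no significance test is included.

```python
import math
from itertools import combinations


def tajimas_d_from_summary(n, seg_sites, pi):
    """Tajima's D from sample size, segregating sites and mean pairwise differences."""
    if n < 4 or seg_sites == 0:
        raise ValueError("need at least 4 sequences and 1 segregating site")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n ** 2 + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    # Difference between the two diversity estimators, scaled by its standard deviation.
    return (pi - seg_sites / a1) / math.sqrt(variance)


def tajimas_d(sequences):
    """Tajima's D from a list of aligned, equal-length sequences without gaps."""
    n = len(sequences)
    seg_sites = sum(1 for column in zip(*sequences) if len(set(column)) > 1)
    pairs = list(combinations(sequences, 2))
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    pi = total_diffs / len(pairs)
    return tajimas_d_from_summary(n, seg_sites, pi)


# Hypothetical toy alignment in which every variant is a singleton.
toy_alignment = ["AAAA", "AAAT", "AATA", "ATAA"]
toy_d = tajimas_d(toy_alignment)
print(f"Tajima's D (toy alignment) = {toy_d:.3f}")
```

An excess of rare variants, as in the toy alignment, pulls π below S/a1 and yields a negative D, which is the signal interpreted here as consistent with population expansion.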

In order to better understand the population history of the species we used the program DIYABC v2.1105 to test different demographic scenarios. This software evaluates population histories using Approximate Bayesian Computation (ABC) with genetic data, by testing scenarios that are built through a combination of population divergence, admixture and population size changes. Following the recommendations of Cabrera and Palsbøll106 to improve DIYABC’s ability to reveal the true demographic model, we focused on simple contrasting models and reduced the number of candidate scenarios through a set of hierarchical analyses beginning with 5 preliminary scenarios (Fig. S1). Subsequently, and following the results of the previous genetic analyses, two models were evaluated at the last level (Fig. 5). The mutation rate was set with a minimum of 1 × 10⁻⁹ and a maximum of 1 × 10⁻⁴, and the mutation model used was the Kimura two-parameter model. For the historical models, priors were left at their default values and, in accordance with the recommendations of the authors of the software, we performed 2,000,000 simulations.