Introduction

Large-scale releases of plant and animal populations has increased worldwide to augment overexploited fisheries, forests, and wildlife (e.g., Laikre et al. 2010). For coastal fisheries, which have seen sharp declines over the last half century (Jackson et al. 2001; Myers and Worm 2003), the release of hatchery-propagated stocks has become an important component of fisheries management strategies, used for restocking, stock enhancement, or ranching in 100 s of species around the world (see Bell et al. 2008 for definitions; Laikre et al. 2010; Lorenzen et al. 2012). Restocking aims to re-establish a locally extinct commercial species and/or to restore depleted spawning biomass to a level where it can provide regular substantial yields, whereas stock enhancement aims at sustaining or improving fisheries in the face of decline (Bell et al. 2008; Lorenzen et al. 2012). Although the genetic risks associated with restocking and stock enhancement have been well described (e.g., Blankenship and Leber 1995; Lorenzen et al. 2012), these programs are rarely monitored (Laikre et al. 2010).

Restocking and stock enhancement are effective at increasing the abundances of depleted or declining populations in the short-term, but these programs risk dramatically altering the genetic composition and diversity of populations, which can negatively impact long-term population resilience (reviewed in Frankham et al. 2010). For example, unintentional domestication selection (Frankham 2008) can result in the release of individuals with adaptive mismatches to local environmental conditions (e.g., Waal et al. 2013), and reduced fitness compared with their wild-born counterparts (Araki et al. 2007, 2008). Programs using a reduced number of broodstock can cause reductions in wild population genetic diversity and effective population size (Ne), leading to increased rates of genetic drift and increased potential for inbreeding, thereby limiting evolutionary potential (e.g., Allendorf and Ryman 1987; Christie et al. 2012). When hatchery Ne is lower than wild (recipient) Ne and reproductive contribution to wild populations is high, reductions in diversity of wild populations can be acute, producing the so-called Ryman-Laikre effect, which is a concern for the enhancement of salmonids and other fish (Ryman and Laikre 1991; Gold et al. 2008; Christie et al. 2012). Genetically aware, “conservation-based” strategies are increasingly used in enhancement and/or restocking programs to maintain high-diversity, self-sustaining populations (e.g., Caughley and Gunn 1996; Heggenes et al. 2006). However, less is known about the long-term genetic impacts of these programs and genetic monitoring is often limited or absent (e.g., Laikre et al. 2010).

Restoration of marine shellfish populations, primarily through juvenile seeding (Gaffney 2006; Laing et al. 2006), is increasing as coastal populations succumb to habitat degradation and overfishing (e.g., Beck et al. 2011). Although the potential for diversity loss is widely appreciated, monitoring is rare and few studies of the genetic impacts of these programs have been conducted (e.g., Gaffney 2006; Camara and Vadopalas 2009). Characteristics of marine shellfish life-history, including high fecundity, high early mortality, and high variance in reproductive success (e.g., Hedgecock and Pudovkin 2011; Plough 2016; Plough et al. 2016), may exacerbate many of the underlying genetic risks associated with restocking or restoration, and can limit Ne to a small fraction of the census population size (He et al. 2012). Few studies have attempted to connect genetic changes in the hatchery (larval) phase to the genetic composition and diversity of wild or restored populations (but see Hanley et al. 2016), and less is known about the Ryman-Laikre effect in iteroparous species like shellfish (Waples et al. 2016). A number of laboratory studies have examined diversity during the larval stages of shellfish species (e.g., Boudry et al. 2002; Lind et al. 2009; Lallias et al. 2010), but less work has been done to connect these changes in the hatchery phase to the genetic composition and diversity in wild or restored populations. Except for work by Morvezen et al. (2016), there is a dearth of data on the genetic changes associated with shellfish hatchery-based enhancement and/or restocking programs from field samples or experiments, and no studies have monitored the genetic impacts of an intensive restoration program.

In this study, we examined the genetic impacts of a large-scale restoration program for the eastern oyster Crassostrea virginica (Gmelin, 1791) in the Chesapeake Bay, with two primary objectives: (1) to characterize how genetic diversity changes during the hatchery production phase under two natural spawning protocols, and (2) to examine how genetic diversity of a restored reef population compares to surrounding wild reefs, to provide an initial assessment of how genetic diversity has been maintained or lost as a result of the restoration program. Oysters once supported major fisheries along the US East Coast but have declined to < 1 % of their historic abundances due to overfishing, habitat destruction, and disease (Newell 1988; Rothschild et al. 1994). In the Chesapeake Bay, historically one of the most productive oyster fisheries on the US East Coast, oyster restoration activities of various sizes have been undertaken over the last half century to restore the fishery and ecosystem services provided by oyster reefs (e.g., Kennedy et al. 2011). Recently, a federal mandate to restore 20 Chesapeake Bay tributaries by 2025 has provided support for more comprehensive restoration in the Choptank River region (Maryland, USA), with the first of three sub-tributaries, Harris Creek (MD), completed in 2016 (Westby et al. 2017). The University of Maryland Center for Environmental Science’s (UMCES) Horn Point Laboratory (HPL) Oyster Hatchery has been producing spat (juvenile oysters) on shell for the Harris Creek Sanctuary (and other locations), through natural (temperature-stimulated) group or “mass” spawning of wild Choptank River broodstock (e.g., Wallace et al. 2008). However, no genetic monitoring of this program has been conducted, and the potential impacts of spawning protocol, hatchery propagation, and planting on genetic diversity are unknown.

To quantify genetic changes during hatchery production (objective 1), we conducted parentage analyses on six mass-spawned cohorts and three controlled-spawned cohorts (isolated natural spawns with single-male fertilizations) using up to nine microsatellite markers. Parental contribution (variance in reproductive success) and genetic diversity metrics, including effective number of breeders, were calculated in the offspring at the spat (planting) stage and compared with their parents. To quantify the broader population genetic impact of this restoration program (i.e., genetic diversity in restored vs wild populations; objective (2), we estimated and compared Ne, genetic diversity metrics, and relatedness among five wild broodstock sources, an aquaculture line, and a restored Harris Creek reef sample planted in 2012, to provide an initial assessment of how diversity is maintained following enhancement with hatchery-produced cohorts. This work provides important baseline data for monitoring genetic diversity of oyster restoration efforts in Chesapeake Bay, and more broadly, for examining genetic impacts of other large-scale shellfish restoration efforts ongoing or planned in the US (e.g., Holley et al. 2018; Brumbaugh and Coen 2009; Dinnel et al. 2009).

Materials and methods

Spawning methods, collection of broodstock or hatchery sources, and sampling of spat

Oyster cohorts (offspring from mass- and controlled-spawns) were produced at the UMCES HPL Oyster Hatchery in Cambridge, MD, USA over the summers of 2014–2016. Wild Choptank River broodstock was used for the production of all cohorts (Table 1), except for cohort M6, which was produced from the Louisiana-derived “LoLA” aquaculture line, obtained from the Aquaculture Genetics and Breeding Technology Center (ABC) at the Virginia Institute of Marine Sciences. All broodstock were conditioned in the HPL Oyster Hatchery at ambient salinity and 20 °C for 6–8 weeks prior to spawning. The HPL Oyster Hatchery produces all spat on shell for restoration planting using a mass-spawn protocol, and thus, we followed their protocols for the mass-spawned cohorts.

Table 1 Detailed cohort information of C. virginica populations used in this study

For the mass-spawned cohorts (M1–M6; Table 1), groups of ~ 50 ripe broodstock were placed in an open aquarium (spawning table) with heated (~ 28–30 °C) flow-through filtered Choptank River water (~ 10–12 ppt; hereafter seawater) to initiate spawning. As oysters spawned, females and males were removed from the spawning table and placed into separate containers (one for all males and one for all females) and allowed to finish spawning. When spawning ceased, pooled eggs were counted volumetrically, and an appropriate amount of pooled sperm was used to fertilize the pool of eggs (Galtsoff 1964). For all mass-spawned cohorts, spawning order was recorded for males and females and fertilization occurred no more than 45 min after the first oyster was observed to spawn. For the controlled-spawned cohorts (C1–C3; Table 1), broodstock oysters were placed in individual 1.8 L aquaria (Aquaneering, San Diego, CA) arranged on a multi-tiered rack with heated, flowing seawater (independent in- and out-flow for each vessel). When evidence of spawning was observed, water to the particular aquaria was turned off to allow gamete accumulation. The time of spawning for each oyster was noted. Once a sufficient number of spawning males and females were identified, eggs were pooled, counted volumetrically, and divided evenly among eight containers (the number of males that spawned) for individual male fertilizations (no sperm competition). Cohorts M1–M3 were spawned in June 2014, M4–M6 in June 2015, and C1–C3 in July 2016. Tissue samples (adductor muscle or mantle) were collected from all adult broodstock and preserved in 70–95% ethanol. All broodstock adults (LoLA and wild) were also utilized in the population genetic analyses of diversity, population structure, and Ne (see below). In addition, 48 adult oysters (mixed-age classes) were sampled from a restored reef in the Harris Creek Sanctuary (Seed 2; 38.71298 N, −76.31985 W) in 2015, which had been planted with hatchery-produced oysters in 2012 (Fig. 1). All details concerning the larval culturing and juvenile nursing are presented in the Supplementary Materials and Methods.

Fig. 1
figure 1

Map of the Chesapeake Bay showing the approximate locations of wild C. virginica broodstock sources from the Choptank River (Chesapeake Bay inset) and the Harris Creek Sanctuary. ‘CLP’ is Chlora Point., ‘SH’ is Sandy Hill, ‘BBY’ is Black Buoy, ‘TB’ is States Bank and ‘HC’ is the Harris Creek Sanctuary site

Whole individual spat were randomly sampled from each cohort and stored in tissue lysis buffer at − 80 °C prior to extraction. One hundred twenty spat were sampled for cohorts M1–M3 and C1–C3, and 115 spat were sampled for cohorts M4–M6 (Table 1).

DNA extraction and microsatellite genotyping

DNA was extracted from adult tissue and spat using the E.Z.N.A.® Tissue DNA Kit (Omega-Biotek, Norcross, GA) or the Qiagen DNeasy Tissue kit (Qiagen Inc, Valencia, CA), following the protocol for animal tissues. DNA concentrations were estimated using a Qubit Fluorometer (Thermo Fisher Scientific Inc., Waltham, MA, USA) and all samples were diluted to a concentration of 10 ng µL−1. Genetic analysis was conducted on nine microsatellite loci previously developed by Brown et al. (2000), Reece et al. (2004), Carlsson and Reece (2007), Wang and Guo (2007), and Wang et al. (2009) and were named as in the source publication (see Supplementary Table S1 for loci ID). Samples from the restored Harris Creek reef and cohorts M1–M6 (broodstock parents and offspring) were genotyped at all nine microsatellite loci while cohorts C1–C3 were genotyped at six of these loci (Supplementary Table S1). Polymerase chain reactions were carried out in volumes of 12.5 µl with 1× GoTaq® flexi PCR buffer (Promega, Madison, WI, USA), 200 µm dNTPs, 0.04 μm of the M13-tailed forward primer, 0.16 μm of both the reverse primer and the M13 fluorescent dye-labeled primer, and 1.25 units of GoTaq® Flexi DNA Polymerase (Promega, Madison, WI, USA). The PCR cycling program was run in two phases; the first to amplify the target microsatellite marker, and the second to incorporate a fluorescently labeled dye attached to the M13 tag sequence (see Schuelke 2000 for details).

Microsatellite PCR products were pooled, precipitated, and shipped dry to the Arizona State University DNA Laboratory, where fragment analysis was performed on an ABI 3730 capillary sequencer. Genotypes were scored by eye using the LIZ500 (Applied Biosystems) as an internal size standard on the Peak Scanner v1.0 software (Applied Biosystems) and verified using Genemapper (Applied Biosystems). For each cohort, 25% of the offspring were genotyped in duplicate to calculate genotype error rate at each locus.

Parentage assignment

Parentage assignment was performed in CERVUS v.3.0.7 (Marshall et al. 1998; Kalinowski et al. 2007) using the parent–pair analysis option and the Delta method. For each cohort, simulations in CERVUS were run to estimate the critical values of Delta for each cohort with strict (95%) and relaxed (80%) levels of statistical confidence in assignment (Kalinowski et al. 2007). Each set of simulations was specific to the number of loci typed, the number of candidate parents, and the locus error rates for that particular cohort. Putative candidate parent pairs were considered accurate if the offspring-dam-sire trio statistical confidence was ≥ 95% (strict confidence). We excluded individuals assigned at relaxed confidence (80%), or those that were not assigned at all, from downstream analysis.

Effective number of breeders (N b) and related statistical analyses

After assigning offspring to broodstock parent pairs, the mean number of offspring (\(\overline k\)) per parent and variance in reproductive success (Vk) was calculated for male and female broodstock from each cohort. Vk was standardized by the mean number of offspring in a family (σ2/μ2) and is reported as such from here on. From these parameters, the inbreeding number of breeders (NbI) was estimated for each sex following equations 1 and 2 from Christie et al. (2012). Equation 6 from Waples and Waples (2011) was used to estimate Nb (hereafter NbS) using information from sibship reconstruction analysis in COLONY v2.0.6.1 (Jones and Wang, 2010). We then compared the estimated NbI to the number of successful broodstock spawned (N) for all cohorts. Differences in Vk between males and females were tested within each cohort using a two-tailed parametric F-test in R (R Core Team 2017). Skew in reproductive contribution of males and females was assessed via Chi-square goodness-of-fit tests in R, with the null hypothesis of equal reproductive contribution.

Population genetic analyses and statistical comparisons

Allelic richness (Ar) for each locus and cohort was calculated in FSTAT v2.9.3.2 (Goudet 2002), which uses a sample-size independent rarefaction analysis of allelic richness. Observed heterozygosity (Ho) was calculated in the R package “hierfstat” (Goudet 2005). Tests for Hardy–Weinberg equilibrium (HWE) were performed in GenoDive v2.0b27 (Meirmans and Van Tienderen 2004) using 9 999 one-sided permutation tests, with P values corrected for multiple tests via sequential Bonferroni correction (Rice 1989). For markers that differed significantly from HWE, null allele frequencies were estimated using Microchecker v.2.2.3 (Van Oosterhout et al. 2004) and FreeNA (Chapuis and Estoup 2007). To test whether parents and offspring from a given cohort differed in genetic diversity metrics (Ar, Ho), paired Wilcoxon signed-rank tests between parents and offspring were performed (see Nei 1987, p. 183), with P values adjusted for multiple tests using the false discovery rate correction (Benjamini and Hochberg 1995). Relatedness was estimated for parents and offspring (grouped by broodstock source) from hatchery cohorts and for the Harris Creek sample using the Lynch and Ritland method (Lynch and Ritland 1999) in COANCESTRY v1.0 (Wang 2011), because it performs well in panmictic populations (e.g., Oliehoek et al. 2006).

Contemporary effective population sizes (Ne) were calculated for adult broodstock source populations, hatchery-produced oysters, and a restored population from Harris Creek reef using the linkage disequilibrium (LD) method (Waples and Do 2008) implemented in NeEstimator v2.01 (Do et al. 2014; minor allele frequencies < 0.05 excluded). For this analysis, if there were multiple adult broodstock groups from the same reef/population, they were pooled within population (i.e., Sandy Hill 1, 2, and 3 were combined into a single Sandy Hill population; see Table 3). Finally, all wild broodstock source populations were also combined into a single population for an overall wild Choptank River Ne estimate. Although the mixed-age adult samples can be used to estimate Ne (Waples et al. 2014), the offspring cohorts, which represent a single age-class, were used to estimate Nb (Waples 2005). To obtain offspring Ne estimates, raw offspring Nb estimates (LD method) were adjusted according to Waples et al. (2014) using three life-history traits: adult life span = 15 (10–20 years in undisturbed populations, Powell and Cummins 1985), age at maturity (α) = 2 (averaged values from Galtsoff 1964; Rothschild et al. 1994; Powell et al. 2013), and variation in age-specific fecundity CVf = 0.65 (from Mroch et al. 2012; Mann et al. 2014).

To examine the extent of genetic differentiation among the broodstock sources, hatchery-produced offspring, and restored Harris Creek sample, FST (Nei’s GST; Nei 1973) was calculated in GenoDive. Heterogeneity in allelic frequencies between pairs of samples was tested in GenoDive (G-test; 9999 permutations). Significance criteria were adjusted for the number of simultaneous tests using sequential Bonferroni corrections (Rice 1989). FST values (Weir 1996) were recalculated in FreeNA using the ‘excluding null alleles’ method (Chapuis and Estoup 2007) to correct the positive bias induced by the presence of null alleles on FST estimation. Finally, population structure was assessed among broodstock population sources and their offspring, with discriminant analysis of principle components (DAPC) in the R package ‘Adegenet’ (Jombart 2008).

Results

Genotyping results and null alleles

Across the nine cohorts examined, a total of 1299 individuals were genotyped, 846 (141 parents, 705 offspring) of which comprised the mass-spawns and 405 (45 parents, 360 offspring) of which comprised the controlled-spawns (Table 1). A total of 12 547 genotypes were scored with an average error rate of 1.9% across all samples. One locus (Cvi6) was excluded from cohort M6 because of inconsistencies with allele scoring. Null alleles were detected in five (RUCV3, RUCV114, Cvi2i23, Cvi2i4, and Cvi6) out of the nine loci (Table S1), which is consistent with previous studies using the same markers (Rose et al. 2006; Arnaldi et al. 2018). Five out of the nine microsatellite loci showed Hardy–Weinberg disequilibrium with an excess of homozygotes in most of the studied populations (Supplementary Table S1). All population genetic analyses were performed with and without these loci and the two data sets provided similar results.

Parentage results, variance in reproductive success, and changes in diversity

Parentage assignment of offspring to parental pairs in CERVUS ranged from 74 to 89% (mean 83.2%) at the 95% confidence cutoff level (Supplementary Table S2). We observed no difference in assignment success between mass- and controlled-spawns (Wilcoxon signed-rank test; P > 0.05). Parentage assignment results were then used to assess variance in reproductive success and skew among parents within mass-spawned and controlled-spawn cohorts.

For the mass-spawned cohorts, reproductive success varied widely among individuals––individual males produced 0–58.1% of assigned offspring (mean = 8.7%, mode = 5%) and individual females produced 0–32.6% of assigned offspring (mean = 8.3%, mode = 0%). The standardized variance in reproductive success was higher in males than females in all mass-spawned cohorts (64% higher mean Vk compared with females; F-test, F0.05,(5,5) = 0.143, P= 0.03; Fig. 2). Among individuals that were observed to have spawned, 5.8% of males and 5.6% of females effectively produced no offspring (averaged over all six cohorts). Male reproductive contribution differed significantly from the expectation of equal contribution, indicating evidence of significant reproductive skew in all six mass-spawns (Supplementary Figure S1). Female reproductive contribution was also highly variable and differed significantly from the expectation of equality in five of the six mass-spawned cohorts; however, the skew was less than that of males (Supplementary Figure S1, i.e., less significant Chi-square deviations). Male reproductive contribution appeared to be affected by male spawning order, and males that spawned later contributed more offspring (P= 0.03, Supplementary Figure S2). Overall, high variance in reproductive success among parents resulted in a substantial reduction in NbI (average 45%) and relatively low NbI/N ratios (Table 2). Nb estimates varied depending on the method used, but all NbS estimates were lower than NbI (NbI ranged from 7.62 to 17.58, whereas NbS ranged from 4.13 to 9.85; Table 2). The magnitude of reduction in NbI appeared to be affected by the sex ratios of each cohort, with a marginally significant trend (P= 0.09, compared with alpha = 0.05) of decreasing variance for sex ratios closer to one (Table 2). For example, the highest (0.84) NbI/N ratio was observed in cohort M3, where the sex ratios were closest to one, whereas the lowest (0.36) NbI/N ratio was observed in cohort M2, where the sex ratios were farthest from one (0.4). Finally, statistically significant reductions in allelic richness (Ar) from parents to offspring were found in all the mass-spawned cohorts (Wilcoxon signed-rank test P< 0.03, Table 2), while reductions in Ho between parents and offspring were not significant (P> 0.05, Table 2). Results for the controlled-spawns were generally similar to the mass-spawns with high variance in reproductive success among individuals. Individual males produced 3.3–42.5% of offspring (average = 12.5%, mode = 3.26%), whereas individual females produced 0–57.5% of offspring (average = 14.3%, mode = 1.1%). Patterns of Vk between males and females followed the opposite trend of the mass-spawns, with higher variance for females compared with males (65% higher than males; F-test, F 0.05, (2,2) = 0.375, P> 0.05; Fig. 2). Among individuals that were observed to have spawned, 100% of males had offspring assigned to them vs. 95.2% of females, both of which are higher than in the mass-spawned cohorts. Again, the high Vk among parents led to a reduction in NbI (average = 47%) and relatively low NbI/N ratios (Table 2). NbS was lower than NbI in all controlled-spawned cohorts (Table 2). We focus on NbI for the remainder of the study because it can directly be compared with the initial number of spawning adults. Statistically significant reductions in Ar between parents and offspring were found in all the controlled-spawn cohorts (Wilcoxon signed-rank test P< 0.03, Table 2), whereas Ho was maintained in the offspring of cohorts C1 and C3 and increased in the offspring of cohort C3 (P> 0.05, Table 2).

Fig. 2
figure 2

Standardized variance in reproductive success for males and females across C. virginica cohorts. M1–M6 correspond to the mass-spawned cohorts; M6 corresponds to the LoLA aquaculture line, and C1–C3 correspond to the controlled-spawn cohorts

Table 2 Summary of genetic diversity results for parents and offspring of each C. virginica cohort

Global relatedness was higher in all hatchery-produced offspring (Supplementary Figure S3; average increase = 0.065). Relatedness in wild broodstock sources ranged from 0.002 to 0.011, whereas the mean relatedness of wild hatchery-produced offspring ranged from 0.03 to 0.129. As expected, relatedness in LoLA broodstock parents (0.117) was higher than wild broodstock sources and also increased in the hatchery-produced offspring (0.208). Offspring from the controlled-spawned cohort broodstock source (TB) had the smallest relative increase in relatedness (0.028), whereas offspring within Chlora Point (CLP; mass-spawned cohort) exhibited the greatest relative increase in relatedness (0.118). Relatedness in the Harris Creek restored sample was comparable to that in wild broodstock sources and was relatively low overall (0.012).

Genetic diversity and differentiation among wild broodstock and restored oysters

Genetic diversity (Ar and Ho) of wild adult broodstock ranged from 9.33 to 14.38 and 0.51 to 0.68, respectively (Table 2). The LoLA (aquaculture) adults displayed the lowest allelic richness overall (8.56; Table 2), but had comparable levels of heterozygosity (0.68). Overall, the restored Harris Creek sample had the second highest allelic richness (14.19) and comparable levels of heterozygosity (0.59) to wild broodstock populations. Pairwise FST analyses (Nei’s GST) revealed low genetic differentiation among most broodstock sources. All pairwise FST estimates between wild broodstock source populations were small, ranging from − 0.001 to 0.022 (Table 3)—none were significant after Bonferonni correction. Recalculating FST excluding null alleles in FreeNA produced very similar estimates to un-corrected values (Supplementary Table S4). Pairwise FST estimates between the Harris Creek and wild broodstock source populations were small and non-significant, ranging from − 0.001 to 0.019 (Table 3). Pairwise FST estimates between the LoLA broodstock adults and Chesapeake Bay wild populations were higher, ranging from 0.054 to 0.079, and all comparisons (adult broodstock: BBY, CLP, HC, TB, SH1, SH2, and SH3) were statistically significant (G-test P< 0.001). Interestingly, pairwise FST estimates between hatchery-produced offspring (i.e. the cohorts produced from wild broodstock) and wild broodstock parent sources were typically higher than comparisons among only adult broodstock (0.011 < FST < 0.148; Supplementary Table S5). Similar to the FST results, analyses of population structure via DAPC revealed two or three major clusters, with LoLA broodstock adults (and offspring) grouping distinctly from Choptank River wild broodstock and the restored Harris Creek sample (Figs. 3a, b). More structure was evident among the offspring cohorts (e.g., SH2 and SH1 vs. SH3; Supplementary Figure S4), and broodstock sources and offspring tended to group together, as expected (Fig. 3a). More subtle genetic differences were observed between Sandy Hill (SH) broodstock samples (e.g., SH2 adults vs. others) despite being sampled from a common population (Fig. 3b). The Harris Creek sample clustered in the center of the five other wild Choptank River broodstock sources (Fig. 3b).

Table 3 FST estimates between all C. virginica wild and broodstock source populations (below diagonal) and G-test P value (above diagonal)
Fig. 3
figure 3

Discriminant analysis of principle components (DAPC) among wild and restored C. virginica populations. Results are shown for adult samples (broodstock and restored populations) only a and for all individuals (adults and offspring b. Number of PCs to retain for each set of analyses was determined via the a-spline optimization approach in Adegenet (29, and 23 PCs retained for all, and parents only, respectively). Letters at the end of population names/codes represent parent (p) or offspring (o) groups

Effective population sizes of wild broodstock sources and the Harris Creek population

Contemporary effective population size (Ne) estimates varied among the different wild (broodstock source) populations but were fairly consistent overall, ranging from 75.3 to 129.6 with a mean of 102. Most of the estimates were bounded at the 95% confidence limits (95% confidence interval range 52.1–119.2; Table 4). Two Ne estimates were unbounded––Black Buoy and Chlora Point, but these had relatively small sample sizes (N = 34 and N = 22, respectively). The Ne estimate for the pooled wild broodstock sources was highest at 366.8 and had the largest 95% confidence interval range (200.1–1424.3). The Ne estimate for the Harris Creek sample was similar to the range of values estimated for wild populations at 68.3—confidence limits for the wild populations and the restored sample overlapped substantially (Table 4). As expected, Ne estimates were systematically higher in wild populations compared to the representative aquaculture line (LoLA), and effective sizes were systematically higher in wild broodstock parents compared to their hatchery-reared offspring (Table 4).

Table 4 Effective population sizes (and confidence intervals (CIs)) of wild, hatchery-produced and restored C. virginica populations based on the linkage disequilibrium method (Waples and Do 2008)

Discussion

Changes in genetic diversity resulting from hatchery propagation and spawning protocol

Across all hatchery-produced cohorts of oysters (mass- and controlled-spawns), offspring displayed lower genetic diversity, higher global relatedness, and reduced Nb and Ne compared with their wild adult progenitors. The reduction in Nb was the most substantial among the diversity metrics examined, reflecting the high variance in reproductive success observed among parents. As expected, patterns of diversity loss differed between the two spawning protocols, but the controlled-spawns did not reduce overall reproductive variance (see below). Indeed, controlled fertilizations of oysters may not necessarily reduce diversity loss during hatchery cultivation. For example, Boudry et al. (2002) found high Vk of cohorts of Pacific oysters despite controlled fertilizations. Post-spawning genotype-dependent larval mortality may also contribute to high Vk among parents (e.g., Plough 2016; Plough 2018).

In the mass-spawned cohorts, where we expected Vk to be greatest, we observed higher skew in males compared with females, which is consistent with results from previous studies of fish and shellfish (e.g., Bekkevold et al. 2002; Boudry et al. 2002). The high male variance can possibly be explained by differences in sperm quality or quantity among males in the competitive, pooled fertilization environment of a mass-spawn. Sperm competition can influence the proportion of eggs fertilized and differences in sperm traits (quality), such as sperm motility and velocity, can lead to variable contributions among males (Gaffney et al. 1993; Withler and Beacham 1994; Wedekind et al. 2007). However, male reproductive variance may not be attributed to sperm traits alone (e.g., Linhart et al. 2005; Kaspar et al. 2007). Fertilization rates may also depend on gametic compatibility (e.g., Gaffney et al. 1993), which was not tested in this study. Other characteristics of the mass-spawn protocol, such as the timing of gamete release, may also lead to imbalanced reproductive contributions. For example, we found that males that released gametes later in the mass-spawns contributed more offspring (Supplementary Figure S2). Fertilizing with pooled sperm from natural spawns is a common practice in shellfish restoration hatcheries (e.g., in Maryland, HPL Oyster Hatchery; in Martha’s Vineyard, Massachusetts, Emma Green-Beach, Martha’s Vineyard Shellfish Group Inc., personal communication; and in New York Harbor, Rebecca Resner, Billion Oyster Project, personal communication), but the inability to control the timing of gamete release or gamete output may increase variance in reproductive success among parents.

In contrast to the mass-spawn results, the controlled-spawns effectively reduced male reproductive variance, but increased female reproductive variance, resulting in a similar reduction in Nb (low ratios of Nb/N), and effectively negating any benefits from reducing the male variance component. The inadvertent increase in female variance likely resulted from several factors associated with the controlled-spawning design, which relied on natural spawns within self-contained vessels. First, the number of eggs per female was not normalized across individuals before fertilization (they were pooled and then divided evenly); thus, differences in fecundity or effective fecundity among females could have contributed to female reproductive variance. Second, egg quality or viability may have varied among females, with reduced egg viability at the time of fertilization for females that spawned earlier compared with females that spawned closer to the time of time of fertilization. Eggs from some females were not fertilized for 2.5 h after the first observation of spawning (see Supplementary Table S7 for detailed information on timing of gamete release), and thus, they may have been less viable at the time of fertilization. Though we lack detailed information on temporal changes to gamete viability in C. virginica, an experimental study of gamete viability in the broadcast spawning bivalve Cerastoderma edule showed that percent fertilization decreased 50% after 2 h (André and Lindegarth 1995), indicating that a 2.5-hour window could reduce fertilization success during spawning. Natural spawning protocols are clearly less ideal than directed fertilizations via strip-spawning of adults. However, the goal of these experiments was to control fertilizations (and sperm competition) within the framework of typical HPL Oyster Hatchery protocols that use natural spawning for oyster restoration. Employing strip-spawning would make large numbers of pairwise fertilizations logistically easier, but this is not always preferred or possible, especially if programs are broodstock limited, and differential survival of larvae can still result in skewed contributions and high Vk among parents (Boudry et al. 2002; Lallias et al. 2010). Overall, natural, individual spawns are challenging to execute on a small-scale, and thus, this practice is not likely to be viable for large-scale restoration hatcheries. Overcoming large variance in family sizes remains a major obstacle towards maximizing genetic diversity and Ne in hatchery-propagated individuals for restoration. The findings from this study, that there is an average loss of ~ 45% of the Nb between parents and offspring in the mass-spawns, could help to inform decisions about spawning practices and the number of broodstock used for eastern oyster or other shellfish restoration programs.

Genetic differentiation between wild, hatchery-produced, and restored oysters

Wild broodstock source populations within the Choptank River showed no genetic differentiation, which is consistent with previous studies reporting high gene flow among oyster populations in Chesapeake Bay (e.g., Rose et al. 2006) and Delaware Bay (He et al. 2012). Despite low FST values overall, Rose et al. (2006) observed a weak pattern of isolation by distance among oyster populations in Chesapeake Bay, indicating some barriers to gene flow over 100 s of km. High gene flow among mid-Bay populations may also be driven by a long history of human-mediated adult and juvenile oyster movement, planting, and transplantation throughout the Chesapeake Bay (Kennedy and Breisch 1983; Mann and Powell 2007). Wild Choptank River and restored Harris Creek reef oysters differed genetically (statistically significant pairwise FST) from the LoLA aquaculture line, which is expected for a selectively bred, “closed” hatchery line that was developed with genetically divergent wild progenitors from Louisiana (Appleyard and Ward 2006; Aquaculture Genetics and Breeding Technology Center 2009; Champagnon et al. 2012). Finally, the restored Harris Creek oyster population showed no genetic differentiation from the local broodstock populations (e.g., Table 3 and Fig. 3), which is consistent with the fact that these local populations were used to produce oysters that are planted in Harris Creek. However, the power provided by eight microsatellite markers is unlikely to be sufficient to resolve fine-scale population structure among reefs or restored populations over such a small geographic scale.

Comparisons of contemporary effective population sizes between wild and restored reefs

Contemporary Ne estimates in this study agree generally with the magnitude of values reported for eastern oyster populations in the Delaware Bay (37–437) by He et al. (2012), but are lower than those reported for the James River (535–1 516) by Rose et al. (2006), and higher than that reported for the Delaware Bay (33.8) by Hedgecock et al. (1992). Although there is a rather wide range of Ne estimates reported across locations, most estimates are fairly low ( < 1000). Ne estimates for two of the broodstock sources in this study were unbounded (BBY and CLP; sample sizes < 34, which is low for Ne estimation via the LD method; Waples and Do 2010), but estimates for the remaining populations were bounded, and thus provide insight into the range of possible Ne for the wild broodstock populations in the Choptank River (75.3–129.6). Compared with estimates for wild oysters in the Choptank River, the restored oyster sample from Harris Creek had similar levels of genetic diversity and effective population size (Ne = 68.3), which suggests that the Harris Creek restoration program has not caused significant declines in genetic diversity, at least based on the single reef sampled and metrics examined.

Though we did not detect major declines in diversity for this restoration program, genetic analyses of similar restocking or enhancement efforts in marine fish have yielded mixed results in terms of the severity of associated genetic changes. Although some studies show that stock enhancement produced effectively no change to population genetic diversity (e.g., Heggenes et al. 2006; Gow et al. 2011; Katalinas et al. 2017), other studies have shown rapid declines in diversity and severe Ryman-Laikre effects after only a few generations (Gold et al. 2008; Karlsson et al. 2008; Christie et al. 2012). Disparate results among studies may be driven by specific characteristics of the program (i.e., husbandry practices), the initial status of wild populations, or possibly the design of the monitoring study itself (e.g., number of samples, type and number of markers used). In the current study, a lack of salient genetic decline may reflect specific HPL hatchery broodstock management and spawning practices that are implemented to minimize declines in genetic diversity. For example, the HPL hatchery uses 100–1000 s of wild broodstock each year while past broodstock are rotated out and retired after 3–4 years of use (Stephanie Alexander, HPL oyster hatchery, personal communication). Moreover, deliberate out-planting of many distinct cohorts of spat (i.e., produced from spawns with different sets of parents), ensures that a given restored reef will comprise multiple spawns, from multiple years, from hundreds of parents. Of course, caution must be exercised when inferring the genetic impact of oyster restoration in Harris Creek based on the sampling of a single restored reef. The analysis of additional restored populations with variable planting histories, and the use of larger marker data sets (e.g., 1000 s of SNPs) is ongoing and should provide a more conclusive picture of the genetic impacts of hatchery propagation and planting on in these populations.

Effective population size (Ne) estimates in marine animals tend to be much smaller than the census population size (N) (e.g., Hedrick 2005; Palstra and Ruzzante 2008), and extremely low Ne/N ratios (10−3–10−6) have been reported in many marine invertebrates and fishes (Hedgecock 1994; Hauser et al. 2002; Hedrick 2005; Hauser and Carvalho 2008), perhaps suggesting high variation in reproductive contribution among adults. So, how should the Ne estimates of oysters in this and previous studies be considered in a conservation or management context? Applying relatively cautious conservation thresholds (e.g., the 50/500 rule; Franklin 1980; Franklin and Frankham 1998), Ne estimates of oysters are on the order of magnitude of what is required for inbreeding avoidance, but are less than the prescribed target for preserving long-term evolutionary viability or quantitative genetic variation (Ne > 500). However, concerns about the genetic risks for populations with small Ne may be slightly overblown if there is frequent gene flow (i.e. replenishing genetic variation despite low Ne), which is typical for populations of marine animals like oysters that have a highly dispersive larval stage (e.g., Gaffney 2006; Hauser and Carvahlo 2008; Palstra and Ruzzante 2008).

Although low Ne estimates may overstate the level of genetic risk to some marine populations, it has also been argued that estimates of Ne/N ratios in marine species may be artificially low (downwardly biased) when true Ne is actually quite high (Palstra and Ruzzante 2008; Waples 2016). Using simulations, Waples (2016) examined the range of life-history characteristics (e.g., longevity, age at maturity, fecundity, and variance in reproductive success that increase with age) that would be required to generate tiny Ne/N ratios (e.g., < 0.001) observed in empirical studies of fish and shellfish, and determined that unless very large sample sizes were used (e.g., 10 s of thousands), Ne estimates could be seriously biased. This does not mean that all low Ne estimates are wrong, but that we should be very cautious about the interpretation of the Ne estimates in this and previous studies using relatively few markers and small sample sizes. If estimates across samples or markers are bimodal in distribution (very low and high) and/or are typically unbounded, a very large Ne cannot be rejected out of hand. Given that the Ne estimates in this study are generally consistent with previous Ne estimates of oyster populations in the Chesapeake Bay (and most estimates were bounded), it seems likely that they are reflective of truly low Ne for populations in the Choptank River.

Conclusions

This study provides an initial examination of the genetic impacts of a large-scale eastern oyster restoration program, and the overarching findings are relevant for future eastern oyster restoration programs as well as for the restoration of other marine shellfish with similar life-history features. Overall, hatchery-based propagation of oysters led to substantial reductions in diversity and the Nb from parents to offspring (a decline of ~ 45% in Nb on average), which can be explained by the high variance in reproductive success (Vk) among adults in mass-spawns and hatchery-based larval culture. Experiments employing controlled, natural spawn protocols with independent fertilizations of pooled eggs failed to reduce Vk compared with mass-spawns. Despite high Vk and diversity loss in the offspring of individuals cohorts, estimates of genetic diversity metrics and Ne from field-based samples did not indicate major losses of genetic diversity in hatchery-planted oysters in Harris Creek, MD, at least based on the single restored population examined. Additional restored populations will need to be analyzed before any definitive conclusions can be made about the genetic impact of this restoration program. The use of a large number of rotated, wild broodstock, from which mixed larval batches are planted and re-planted over time may have helped to minimize diversity loss within restored reef sites. Based on the results of this study, a number of standard recommendations could be made for oyster (or other shellfish) restoration programs, including using large numbers of rotated, wild broodstock, and implementing pair-cross matings (Camara and Vadopalas 2009). However, pair-cross matings and/or strip-spawning may not be feasible for some restoration programs, especially those that employ mass-spawns or are broodstock limited.

Data archiving

Genotype data available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.h3kv180.