Introduction

Approximately 13,000 years ago, humans began to domesticate plants and animals1. Worldwide cultivation of seaweeds, however, has a much shorter history, increasing substantially over the last 50 years2. Seaweed aquaculture has been developed primarily in Asia2,3, and to date, fewer than 20 species have been domesticated4. More than 90% of worldwide farming activities concentrate on five taxa2,5,6: the brown kelps Saccharina and Undaria and the red algae Porphyra, Gracilaria and Eucheuma.

Plant domestication not only modifies economic and agronomic phenotypes but also leaves a genetic signature that affects both the population structure and the genetic diversity of the domesticated species7,8. For marine crops, previous studied indicated that an extended period of inbreeding and intensive selection during the domestication process may reduce the genetic diversity, narrow the germplasm base and promote adaptive divergence between the domesticated seaweeds and their wild counterparts8,9,10,11. An understanding of how domestication affects the domesticated species provides insights into general mechanisms of adaptions and diversification and can guide the genetic improvement of crops in breeding programs1.

Saccharina japonica (Aresch) Lane, Mayes, Druehl & Saunders is native to cold-temperature coasts along northern Japan, northwestern Korea and Far Eastern Russia and was introduced into China in the 1930s12. S. japonica is a commercially and ecologically important seaweed that is mainly cultivated in China, Japan and Korea. China, in particular, ranks first in the world for the cultivation of this species with a wet weight production of 5,941,658 t in 2013 (FAO: http://www.fao.org/fishery/statistics/global-aquaculture-production). S. japonica was initially cultivated in the north of China using conventional methods (e.g., tying sporelings to rocks) as early as the 1930ā€‰s, but modern cultivation started in the 1950s12,13 and is successfully conducted in China on a large scale today (Supplementary Fig. S1). The expansion of the kelp cultivation industry in China can mainly be attributed to the development and implementation of the summer sporelings method13,14,15, the floating raft cultivation technique and fertilizer application13,15,16,17. To expand the cultivation area, a considerable effort was made to transplant kelp from the northern areas (Shandong and Liaoning, 35ā€“39Ā°N in latitude) to the south of China (Zhejiang and Fujian, 24ā€“31Ā°N in latitude)13,14,18.

Early in the domestication process, wild S. japonica populations from Japan or wild Chinese introduced populations were cultivated rather than selected strains14. Since the 1960ā€‰s, selective breeding has been applied to S. japonica, the first variety to be produced being ā€œHaiqing No. 1ā€19. Subsequently, various kelp breeding strategies including gametophyte cloning, intra- and inter-specific hybridization methods and heterosis have been used to produce elite northern cultivars with high yield and improved resistant to light irradiance such as 901, Rongfu, Dongfang No. 2 and Dongfang No. 314,20,21,22,23. Most southern cultivars have been bred using continuous selection strategies based on local cultivar germplasm, such as Huangguan No. 124.

There is evidence that domestication has led to a reduction in genetic diversity for several cultivated seaweeds, including Undaria pinnatifida, Ulva prolifera and Porphyra yezoensis9,10,11. Some previous reports have assessed the genetic diversity and population structure of S. japonica populations in China and indicated that extensive selection and inbreeding for multiple generations are likely to also have reduced the genetic diversity of cultivated S. japonica populations25,26,27,28. Both kelp germplasm introduction and inter-specific hybridization were expected to increase the genetic diversity by introducing the new genetic variations into the cultivated strains. However, little is known about the effects of these two processes on the genetic structure of the populations of S. japonica that are cultivated at present. In addition, some studies showed that the escaped cultivated kelp from the farms might cross-hybridize with wild populations, resulting in genetic introgression of cultivar genetic material into wild populations3,5,11,29. It is not known whether a similar gene flow from cultivated to wild populations has occurred for S. japonica. Estimating the existence and extent of such a gene flow might be helpful for designing breeding practices and conservation strategies.

Using selected microsatellite markers, we conducted a population genetic analysis of 28ā€‰S. japonica populations (17 cultivated populations, 3 wild introduced populations and 8 wild indigenous populations). The objectives of our study were 1) to examine if Chinese populations after domestication are less genetically diverse than wild indigenous populations, a pattern expected given the known history of the introduction of S. japonica in China, 2) to test the prediction that kelp germplasm introduction and inter-specific hybridization will increase the genetic diversity and enlarge the genetic divergence of cultivated S. japonica populations by introducing the new genetic variations, and 3) to evaluate the genetic diversity of wild introduced populations and investigate whether escapes from cultivated populations influence the wild introduced populations. The long-term objective is to improve selection and breeding and sustainable utilization of S. japonica seed stocks.

Results

Selection and characteristics of the SSR loci

A total of 766 individuals from 28 populations (Fig. 1; Supplementary Table S1) were genotyped at 23 nuclear SSRs derived from genomic data30. The percentage of missing genotypes from the whole dataset was 0.085% (30 missing genotypes for 35,236 attempted). The percentage of successful amplifications per locus ranged from 96% to 100%. The number of alleles (A) ranged between 7 for SJ93 and 50 for SJ136 (Supplementary Table S2). The SSR markers exhibited polymorphism with high polymorphic information content (PIC) (PICā€‰>ā€‰0.5) for 22 SSR markers and low PIC (0.22) for SJ31 (Supplementary Table S2). The mean expected heterozygosity across populations ranged from 0.18ā€‰Ā±ā€‰0.05 to 0.72ā€‰Ā±ā€‰0.04, and the mean observed heterozygosity ranged from 0.16ā€‰Ā±ā€‰0.04 to 0.68ā€‰Ā±ā€‰0.04. microchecker detected no genotyping error due to stuttering and large allele dropout, but null alleles were detected at several loci: SJ13, SJ21, SJ125 and SJ136. We used the software program freena to estimate the average frequency of null alleles per locus, and this varied from 0.00ā€‰Ā±ā€‰0.00 for SJ3 to 0.16ā€‰Ā±ā€‰0.02 for SJ136. Only four loci (SJ13, SJ21, SJ125 and SJ136) had high frequencies of null alleles (>0.06) (Supplementary Table S2). The global FST across all loci without correction for null alleles (0.342, 95% CI: 0.307ā€“0.378) was slightly higher than the corrected FST values (0.333, 95% CI: 0.299ā€“0.368), and the pairwise FST per locus without correction was also higher than the pairwise FST with correction (data not shown). High frequencies of null alleles have the potential to influence the estimation of genetic differentiation. Consequently, we excluded these four loci (SJ13, SJ21, SJ125 and SJ136) from this study and reported only the results based on 19 SSRs.

Figure 1: Geographic locations of the domesticated and wild Saccharina japonica populations used in this study.
figure 1

The geographic figure was created using the MATLAB software package (R2012b) (http://cn.mathworks.com/products/matlab/). Red indicates northern cultivated populations; Purple shows southern cultivated populations; Orange indicates wild introduced populations; Dark green shows Russian wild populations; Light green represents Japanese wild populations.

After false discovery rate (FDR) correction for multiple tests, linkage disequilibrium (LD) tests for each pair of loci indicated that 171 pairs (3.5%) were significantly in disequilibrium. Given that these loci did not share corresponding disequilibria in all samples, we assumed that none of the loci were physically linked. No consistent pattern of linkage disequilibrium was observed, so the 19 loci were used for all subsequent analyses.

Summary statistics for 28ā€‰S. japonica populations

Genetic diversity was evaluated for 28ā€‰S. japonica populations at the population and group levels (Table 1). At the population level, the mean number of alleles across loci (NA) varied from 1.79ā€‰Ā±ā€‰0.10 for XP (one southern cultivated population from China) to 9.11ā€‰Ā±ā€‰1.24 for SA (one wild indigenous population from Shiriya, Aomori pref, Japan). Allelic richness (AR) based on 20 samples per population was highest (8.19ā€‰Ā±ā€‰1.07) in the SA population and lowest (1.78ā€‰Ā±ā€‰0.10) in the XP population. There were no private alleles in the cultivated populations, but private alleles existed in all wild introduced/indigenous populations (WIā€‰+ā€‰WRā€‰+ā€‰WJ) except one wild introduced population (YM). The mean observed heterozygosity across loci (HO) ranged from 0.25ā€‰Ā±ā€‰0.05 for XP to 0.68ā€‰Ā±ā€‰0.04 for HA (one wild indigenous population from Hakodate, Japan), and expected heterozygosity across loci (HE) ranged from 0.25ā€‰Ā±ā€‰0.04 for XP to 0.71ā€‰Ā±ā€‰0.04 for HA.

Table 1 Genetic diversity analysis of 28 Saccharina japonica populations.

We compared the mean values of all genetic diversity indices (NA, AR, NP, HO and HE) at the group level (Table 1) and found that all parameters were highest in the wild indigenous populations in Japan (WJ) and lowest in southern cultivated populations (SC). In addition, the genetic diversity of wild indigenous populations (WJ and WR, HSā€‰=ā€‰0.539) was significantly higher (Pā€‰=ā€‰0.003; Supplementary Table S3) than the genetic diversity of cultivated populations (NC and SC, HSā€‰=ā€‰0.390). The genetic diversity indices in northern cultivated populations (NC, HSā€‰=ā€‰0.415) were higher than in wild introduced populations (WI, HSā€‰=ā€‰0.386) and southern cultivated populations (SC, HSā€‰=ā€‰0.328) (Supplementary Table S3).

FIS values showed significant deviation from zero (Pā€‰<ā€‰0.01, Table 1) in the ten populations, indicating departures from Hardyā€“Weinberg equilibrium. Five cultivated populations (901, DF2, PL, HG and XP) had significantly negative FIS values, indicating heterozygosity excess, whereas five wild populations (ZD, AW, KA, HA and SA) had significantly positive FIS values, indicating heterozygote deficiency. After FDR correction, the DF2 population showed significant departures from HWE, involving 16 of the 19 loci, all due to heterozygote excess.

Recent changes in effective population size were detected based on the Wilcoxon signed rank-test under the infinite allele model (IAM), the stepwise-mutation model (SMM), and the two-phase mutation model (TPM). Eight cultivated populations and two wild introduced populations showed significant heterozygote excess compared to the expected equilibrium in the IAM after FDR correction (Supplementary Table S4). The TPM model has been reported to be the most conservative and powerful model31, and the results obtained with the TPM model indicated that the cultivated population DF2 showed significant heterozygosity excess compared to the expected equilibrium after FDR correction (Supplementary Table S4). However, hybridization can severely influence the outcome of the bottleneck tests, so the significant heterozygosity excess of the DF2 population is possibly due to hybridization during the breeding of this variety. The mode-shift test detected characteristic mode-shift distortion in the typical L-shape distribution of allele frequencies caused by bottlenecks in DF2 and XP.

Two major genetic groups: a wild indigenous group and a cultivated/introduced group

Discriminant analysis of principal components (DAPC) partitioned the S. japonica populations into two genetic clusters (Fig. 2). This partitioning was supported by the lowest Bayesian information criteria (BIC) in the DAPC analyses when Kā€‰=ā€‰2. DAPC analyses revealed two genetic clusters: a wild indigenous cluster, including Russian and Japanese wild populations (WRā€‰+ā€‰WJ), and a cultivated/introduced cluster, which included all populations from China (NCā€‰+ā€‰SCā€‰+ā€‰WI) except for DF3 (Fig. 2). A neighbour-joining (NJ) analysis calculated with Neiā€™s pairwise genetic distance (Da) indicated that the wild indigenous cluster of populations was clearly distinct from the cultivated/introduced cluster (except DF3) (Fig. 3). Bayesian clustering of the 28 populations was carried out with STRUCTURE, and consistent results were obtained from the 20 runs to test for each K value. The natural logarithm of the likelihood of the data increased sharply from Kā€‰=ā€‰1 to Kā€‰=ā€‰2 (Supplementary Fig. S2A). The metric delta K peaked at Kā€‰=ā€‰2, indicating that this was the highest hierarchical level of the genetic structure (Supplementary Fig. S2B; delta Kā€‰=ā€‰9903.50). Consistent with the NJ tree and DAPC results, the cultivated/introduced group and wild indigenous group (Kā€‰=ā€‰2) could be distinguished in the STRUCTURE analysis and showed very little admixture between them (Fig. 4). A second hierarchical level of genetic division was identified with Kā€‰=ā€‰3 (Supplementary Fig. S2B; delta Kā€‰=ā€‰87.90), indicating that the cultivated/introduced group (except DF3) could be divided into two genetic subgroups: one containing the northern cultivated populations (NC) and the other the southern cultivated populations (SC) and the wild introduced populations (WI) (Fig. 4). Wild introduced populations (WI) and southern cultivated populations (SC) clustered together, and this was also supported by the NJ tree. The southern and northern cultivated kelp populations could be clearly separated, but three cultivated populations (ZK1, RF, HG) had high genetic admixture (Fig. 4). Most of the populations clustered according to geographical distribution in DAPC plots, NJ tree and the STRUCTURE analysis, except for the DF3 population. From the NJ tree, seven cultivated populations (TJ, ZK1, ZK2, AL, LJ, YZ and PL) bred from intra-specific crosses clustered together and were genetically separated from other cultivated populations (901, RF, DF2 and DF3) derived from inter-specific crosses (Fig. 3).

Figure 2: Scatterplot output from a discriminant analysis of principal components (DAPC) for the genetic structure of Saccharina japonica individuals based on 19 microsatellites.
figure 2

Dots represent individuals from the 28 populations, and different populations are depicted with different colours and symbols. Abbreviations correspond to populations presented in Table 1. The bar graph inset exhibits the variance explained by the two discriminant eigenvalues used for plotting. The 67% inertia ellipses are drawn for each population, representing the variance of the two principal components.

Figure 3
figure 3

Neighbour-joining tree constructed based on Neiā€™s standardized genetic distance (Da) for 28 Saccharina japonica populations.

Figure 4: Bayesian estimates of population structure based on SSR data for the whole dataset.
figure 4

The results (Kā€‰=ā€‰2 and Kā€‰=ā€‰3) obtained in STRUCTURE are shown for comparison. The vertical bar indicate individuals, and colours correspond to specific clusters.

Patterns of genetic differentiation

Pairwise FST values with and without correction for null alleles showed some slight variation with differences ranging from 0.000 to 0.026 (data not shown). Of the 378 pairwise tests conducted among 28 kelp populations, 370 were highly significant (Pā€‰<ā€‰0.0001), and 6 were significant (Pā€‰<ā€‰0.05) after FDR correction (Supplementary Table S5). Therefore, most of the 28 populations have significant genetic differentiation from each other. FST values ranged from 0.009 (ZK1 and ZK2) to 0.695 (XP and EP) (Supplementary Table S5). The Chinese populations (except DF2 and DF3) were genetically divergent from wild indigenous populations (FST: 0.277ā€“0.695; Pā€‰<ā€‰0.0001). However, pairwise FST values between DF3 and three Japanese populations (KA, HA and SA) varied from 0.152 to 0.240, which indicated moderate genetic differentiation among those populations (Supplementary Table S5). There was not a high level of genetic differentiation among most of the cultivated/introduced populations (NCā€‰+ā€‰SCā€‰+ā€‰WI) (0.009ā€“0.330), but DF2 and DF3 had higher pairwise FST values (0.224ā€“0.486) compared with other cultivated populations. In addition, two northern cultivated populations (901 and NJ) were highly differentiated from three southern cultivated populations (GW, XP and PT) with FST values ranging from 0.252 to 0.330. When FST values among populations within each group were calculated independently, the highest value was found within the WR group (FSTā€‰=ā€‰0.305) and the lowest value (FSTā€‰=ā€‰0.063) within the WI group (Supplementary Table S3). In addition, the genetic differentiation between wild indigenous populations (WRā€‰+ā€‰WJ, FSTā€‰=ā€‰0.247) was significantly larger than for cultivated populations (NCā€‰+ā€‰SCā€‰+ā€‰WI, FSTā€‰=ā€‰0.199) (Supplementary Table S3).

We detected significant genetic differentiation between wild indigenous and cultivated/introduced groups, and this separation was confirmed by an estimation of 30.49% genetic variance between these two groups based on AMOVA (Š¤CTā€‰=ā€‰0.30; Pā€‰<ā€‰0.001) (Table 2). The genetic variance partitioned between populations within groups was 14.67% (Š¤SCā€‰=ā€‰0.21; Pā€‰<ā€‰0.001), while 54.85% of the variation existed within populations. AMOVA results based on northern and southern cultivated populations showed significant Š¤-statistics among populations (Š¤STā€‰=ā€‰0.23; Pā€‰<ā€‰0.001), among populations within groups (Š¤SCā€‰=ā€‰0.17; Pā€‰<ā€‰0.001) and among groups (Š¤CTā€‰=ā€‰0.06; Pā€‰<ā€‰0.001). All variance components were statistically significant (Pā€‰<ā€‰0.05, Table 2).

Table 2 Analysis of molecular variance (AMOVA) for different groups in Saccharina japonica.

Discussion

Generally, domestication tends to reduce allelic variation and genetic diversity inducing genetic erosion in cultivated seaweeds8. This phenomenon has been observed for U. pinnatifida, U. prolifera and P. yezoensis9,10,11. In the present study, analysis of mean values of NA, HE, HO, NP, AR and HS in each group indicated that genetic diversity was higher in the Japanese wild populations than in the domesticated kelp populations. This difference may be due to continuous selection having reduced the effective population size and increased genetic drift and hitchhiking during the domestication process8,32,33. Continuous inbreeding and directional selection were usually adopted for the breeding of S. japonica in China19,26. Although there were no obvious signs of inbreeding in cultivated populations, we cannot exclude that inbreeding and selfing during the breeding process caused genetic erosion in the cultivated populations, as was reported in previous studies26,27,34.

Departures from Hardyā€“Weinberg expectations due to heterozygosity excess were detected in five S. japonica cultivated populations (901, DF2, PL, HG and XP) (Table 1), possibly due to a human-mediated non-random mating strategy (small reproductive population size, existence of heterosis and effect of gametophytic self-incompatible system). One possible explanation of heterozygote excess in these cultivated populations may be the small reproductive population size during the breeding process. When only a few parental kelp contribute to the next generation, allelic frequencies can differ between male and female parents, and lead to a significant deviation from random mating35,36. Another potential explanation was that selection for linked heterozygotes or associated heterosis effect could cause the heterozygote excess in these cultivated populations. To fully utilize heterosis to breed the seedlings for enhancing the quality and quantity of the next yearā€™s cultivation, intraspecific hybridization of divergent lineages or interspecific hybridization could be used. Such an effect of heterosis is exemplified by the DF2 population: the cultivar Dongfang NO. 2 (DF2) was the first filial generation obtained by hybridizing a male gametophyte clone of S. longissima with a female one of S. japonica20, and in our study, it showed particularly significant heterozygosity excess (FISā€‰=ā€‰0.96; Pā€‰<ā€‰0.01). Therefore, we presumed that overdominant selection in the breeding cultivars favoured heterozygote survival and caused the heterozygote excess with hitch-hiking selection. One common explanation for observing heterozygote excess is the self-incompatibility system effect37; however, we could not conclude that the excess of heterozygotes in cultivated S. japonica was caused by active avoidance of selfing and full-sibling mating.

In contrast to cultivated populations, five wild populations showed significant departure from HWE with heterozygote deficiencies. An excess of homozygosity within wild populations can be due to at least three causes: the presence of null alleles, inbreeding and population subdivision. Inbreeding could be common in wild populations, possibly due to the limited dispersal of the kelp gametes, favouring mating within the kelp siblings or close relatives38,39. Our present data are not sufficient to resolve whether there was a substructure within our populations that could result in a Wahlund effect40. However, in a previous study of local substructure and gene flow in wild S. japonica indigenous populations41, we found evidence to support a within-population structure as a cause of heterozygote deficiency.

In this study, we observed that wild introduced populations (WI) had lower genetic diversity than wild indigenous populations (WRā€‰+ā€‰WJ). This was consistent with our previous results that wild introduced populations have lower haplotype and nucleotide diversity than wild indigenous populations41. The reduction of genetic diversity in wild introduced populations was possibly due to founder effects and genetic drift following introduction in the 1930ā€‰s. This is supported by the fact that two wild introduced populations (YM and ZD) showed signs of a genetic bottleneck (Supplementary Table S4). Moreover, some wild introduced populations were transplanted from Shandong (35ā€“36Ā°N) to Zhejiang (27ā€“31Ā°N) and to Fujian (24ā€“27Ā°N) in China in the 1960s14,16,18. The southward transplantation had a profound influence on the genetic structure of the cultivated S. japonica populations. DAPC plots, an NJ tree and STRUCTURE analyses indicated that the clustering of the southern cultivated populations (SC) and the wild introduced populations (WI) was principally driven by these transplantations (Supplementary Fig. S1).

The northern cultivated populations (NC) and the wild introduced populations (WI) all grow in the north of China, but they did not cluster together in our analyses. We believe that the northern cultivated populations (NC) are genetically different and diverse as a result of interspecific hybridization and/or multiple introductions of wild indigenous germplasm12,20,21,23,42, while the wild introduced populations (WI) are comparatively more isolated genetically, with less germplasm introduction. There was evidence for a low level of gene admixture between cultivated populations and one wild introduced population (XS) (8.4% wild introduced individuals belong to cultivated populations), while no evidence was found for gene admixture among the other wild introduced populations (YM and ZD) and the cultivated populations (Fig. 4). These results imply that the wild populations of S. japonica have not been markedly impacted by gene flow from cultivated populations.

Several earlier studies reported that northern and southern cultivated kelp populations lacked any significant genetic differentiation26,27. Although the degree of differentiation between northern and southern cultivated populations is globally mild (only 6% variance explained by northern vs. southern differences), our analyses indicated significant genetic differentiation among these cultivated populations (Š¤CTā€‰=ā€‰0.06; Pā€‰<ā€‰0.001) (Table 2). The difference between the populations is principally due to the northern cultivated populations (NC) representing a genetically mixed gene pool with repeated introductions of wild indigenous germplasm (from the same species or from congeneric species). Southern cultivated populations (SC) seem to have retained the original genetic composition of the wild introduced populations (WI), with less germplasm introduction.

Most northern cultivated populations from inter-specific and intra-specific crosses have distinct genetic structure patterns. Indeed, most cultivated populations bred from descendants of S. japonica clustered together and were separated from descendants of inter-specific crosses in the NJ tree (Fig. 3). Liu et al. reported two cultivars (901 and DF3) that were derived from hybridization between S. japonica and S. longissima clustered together26. In our study, it was surprising that 901 clustered with RF, not DF3 (Fig. 3), although cultivar ā€œRongfuā€ (RF) contained wild germplasm from S. latissima22. We presumed that these two cultivars might have been genetically mixed in the farms, due to deliberate hybridization by farmers or uncontrolled genetic mixing of cultivars during the summer sporeling-rearing27.

Based on the STRUCTURE analyses and NJ tree, the northern cultivated population DF3 clustered with wild indigenous populations. We suspected that this cultivar might have been unconsciously or deliberately mixed with wild kelp germplasm by farmers during the breeding or cultivation processes. To verify whether there is contamination and degradation in these DF3 varieties, a detailed study of the DF3 strains cultivated in different farms should be further conducted.

In conclusion, the overall genetic structure of the S. japonica strains analysed in this study suggest that this cultivated kelp represents a single complex gene pool within which historical movement of germplasm, recent introductions, interspecific hybridizations and human selection are shaping the genetic structure. In our present study, the expected reduced diversity of cultivated populations was confirmed, and genetically distant populations or geographically isolated populations should be used for enhancing diversity and improving productivity. Diverse wild S. japonica resources are essential for the restoration and selection processes necessary for Chinese kelp production. Further genome-wide analysis of the domesticated kelps will provide more information to understand in detail the micro-evolution processes that have occurred during the domestication and thus to improve kelp breeding strategies.

Methods

Sample collection and identification

We sampled 28ā€‰S. japonica populations (766 individuals), including twelve northern cultivated populations (NC), five southern cultivated populations (SC), three wild introduced populations (WI) and eight wild indigenous populations from Russia and Japan (WR and WJ) (Supplementary Table S1; Fig. 1). Seventeen of these populations (458 individuals) had already been used in previous studies including seven cultivated populations26 (174 individuals) and ten wild introduced/indigenous populations41 (284 individuals) (Supplementary Table S1). Six of the cultivated cultivars have been examined and approved by the Chinese Approving Committee of Aquacultural Stock Seeds and Elite Varieties: ā€œHuangguan No.1ā€ (HG)22,24, ā€œRongfuā€ (RF)17, ā€œAilunwanā€ (AL), ā€œ90-1ā€ (901)23,34, ā€œDongfang No.2ā€ (DF2)20 and ā€œDongfang No.3ā€ (DF3)21, and the remaining 11 cultivated populations corresponded to production cultivars that are commonly employed in kelp production. Wild introduced populations corresponded to populations from China that are able to complete their life histories in the wild, whereas the wild indigenous populations corresponded to Russian (3 populations) and Japanese populations (5 populations) that might have been the original source of strains cultivated in China.

SSR analysis

Twenty-three simple sequence repeat (SSR) markers were screened from a list of 76 SSRs previously developed by Li et al.30 and analysed using three sets of multiplex PCR reactions in this study (Supplementary Table S6). Each multiplex set was carefully assembled based on the compatibility of SSRs during the PCR and on the molecular size of their amplicons. Forward primers of SSR markers were labelled with one of the following fluorescent dyes: carboxy fluorescein (FAM), carboxytetrame thylrhodamine (TAMRA) or hexachloro-6-carboxy fluorescein (HEX). The reaction mixtures (10ā€‰Ī¼L) contained 0.2ā€‰Ī¼L template DNA, 0.1ā€‰Ī¼L of each primer, and 5ā€‰Ī¼L Multiplex PCR Master Mix (QIAGEN Multiplex PCR Kit, Qiagen, Germany). The PCR reactions were performed in a Life Pro thermocycler with an initial denaturation step of 5ā€‰min at 95ā€‰Ā°C, followed by 20 cycles of 95ā€‰Ā°C for 30ā€‰s, 55ā€“58ā€‰Ā°C for 30ā€‰s and 72ā€‰Ā°C for 30ā€‰s, and a final extension at 72ā€‰Ā°C for 10ā€‰min. For PCR fragment size determinations, 0.5ā€‰Ī¼L of an internal size standard (Liz-500, LIZ) was mixed with 0.3ā€‰Ī¼L of PCR product and 9.5ā€‰Ī¼L formamide. The mixture was heated to 95ā€‰Ā°C for 5ā€‰min, then cooled down on ice, and finally subjected to fragment analysis on an ABI PRISM 3730 sequencer (Applied Biosystems, Unites States of America). The alleles were sized using genemarker v2.2.0 (SoftGenetics).

Statistical analysis

Micro-checker 2.2.3 was used to check for potential genotyping errors caused by stuttering or large allele dropout and the presence of null alleles43. Null allele frequencies across the populations were estimated using the expectation maximization (EM) algorithm by the program freena44. Linkage disequilibrium (LD) and deviations from Hardyā€“Weinberg equilibrium (HWE) were estimated for each SSR locus and individual kelp populations using genepop 4.2.245 with 10,000 dememorization and in 20 batches with 5,000 iterations per batch. Multiple tests in the detection of LD and HWE were corrected using the false discovery approach46 in the R-package qvalue (R Development Core Team 2013). For polymorphism evaluation of each SSR locus, allele numbers (A), expected and observed heterozygosity (HE and HO), and polymorphic information content (PIC) were calculated using popgene 1.3.147. For measures of genetic diversity in each population, the mean number of alleles across all loci (NA) and allelic richness (AR) across all loci based on the minimal sample size with the rarefaction method were calculated in fstat 2.9.3.248. The mean observed and expected heterozygosity (HE and HO) and numbers of private alleles (NP) were calculated using genalex 6.4149 . The significance of departures from Hardyā€“Weinberg equilibrium, as evidenced by deviation of inbreeding coefficient (FIS) from zero, was tested with 5,000 randomizations using fstat 2.9.3.250. Microsatellite data files were converted into the formats for the various analysis software using pgdspider 2.0.1.051 and convert 1.3.152.

Genetic bottleneck analysis

Screens for signatures of genetic bottlenecks were carried out for cultivated and wild populations using bottleneck 1.2.0231. A one-tailed Wilcoxon signed-rank test was applied to determine the significance of heterozygosity excess, and 10,000 replicates were run under three microsatellite models: infinite allele model (IAM), stepwise-mutation model (SMM) and two-phase mutation model (TPM) incorporated with 90% IAM and 10% SMM53,54,55. A test for mode shift was also used to detect genetic bottlenecks. The expected distributions of allele frequencies exhibit a normal L-shaped distribution, but population bottlenecks could cause a characteristic mode-shift distortion in the distribution of allele frequencies54.

Genetic differentiation analysis

Global and pairwise FST across all loci was investigated with correction for null alleles, using freena with 10,000 bootstrap resampling, to avoid the impact of null alleles on the estimation of genetic differentiation44. Pairwise genetic differentiation among the 28 populations was estimated over 100,000 random permutations with arlequin 3.5.1.356.

Analysis of molecular variance (AMOVA) was conducted in arlequin 3.5.1.3 to partition genetic variation across nested levels: within populations, between populations within groups, and between groups56. The different groups were defined as follows (Table 1): the cultivated/introduced populations grouped 20 populations (NCā€‰+ā€‰SCā€‰+ā€‰WI); the wild indigenous populations grouped 8 populations (WRā€‰+ā€‰WJ1ā€‰+ā€‰WJ2); the northern cultivated populations (NC) included 12 populations; the southern cultivated populations (SC) possessed 5 populations.

Analysis of population structure

To infer genetic clusters in S. japonica populations, multivariate analyses were carried out using discriminate analyses of principal components (DAPC) using the adegenet 1.3.1 package57,58 in the R environment (R Development Core Team 2013). The optimal number of clusters was selected based on the lowest Bayesian Information Criterion (BIC). Pairwise genetic distances between populations were calculated using Neiā€™s59 standardized genetic distance Da, and the non-rooted tree was generated using the neighbour-joining (NJ) algorithm with 1,000 bootstrap replicates in poptree60. Tree topologies were viewed and adjusted with figtree 1.4.2 (available at http://tree.bio.ed.ac.uk/software/figtree/). Bayesian analysis of the population genetic structure with structure 2.3.1 was applied with the admixture model and correlated allele frequencies61,62. To determine the optimal number of clusters, K, the population structure was tested at K values ranging from 1 to 15 with 20 replicates based on 1,000,000 Markov chain Monte Carlo iterations following a burn-in period of 500 000 steps, and we evaluated the log-likelihood [lnP(k)] for each K and estimated Delta (K) using structure harvester63. The graphical results were displayed using distruct64 .

Additional Information

How to cite this article: Zhang, J. et al. Effect of domestication on the genetic diversity and structure of Saccharina japonica populations in China. Sci. Rep. 7, 42158; doi: 10.1038/srep42158 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.