Identifying the genetic diversity, genetic structure and a core collection of Ziziphus jujuba Mill. var. jujuba accessions using microsatellite markers

Xu, Chaoqun; Gao, Jiao; Du, Zengfeng; Li, Dengke; Wang, Zhe; Li, Yingyue; Pang, Xiaoming

doi:10.1038/srep31503

Download PDF

Article
Open access
Published: 17 August 2016

Identifying the genetic diversity, genetic structure and a core collection of Ziziphus jujuba Mill. var. jujuba accessions using microsatellite markers

Chaoqun Xu¹,
Jiao Gao¹,
Zengfeng Du²,
Dengke Li³,
Zhe Wang¹,
Yingyue Li¹ &
…
Xiaoming Pang¹

Scientific Reports volume 6, Article number: 31503 (2016) Cite this article

4832 Accesses
37 Citations
Metrics details

Subjects

Abstract

Ziziphus is a genus of spiny shrubs and small trees in the Rhamnaceae family. This group has a controversial taxonomy, with more than 200 species described, including Chinese jujube (Ziziphus jujuba Mill. var. jujuba) and Indian jujube (Z. mauritiana), as well as several other important cultivated fruit crops. Using 24 SSR markers distributed across the Chinese jujube genome, 962 jujube accessions from the two largest germplasm repositories were genotyped with the aim of analyzing the genetic diversity and structure and constructing a core collection that retain high genetic diversity. A molecular profile comparison revealed 622 unique genotypes, among which 123 genotypes were genetically identical to at least one other accessions. STRUCTURE analysis and multivariate analyses (Cluster and PCoA) roughly divided the accessions into three major groups, with some admixture among groups. A simulated annealing algorithm and a heuristic algorithm were chosen to construct the core collection. A final core of 150 accessions was selected, comprising 15.6% of the analyzed accessions and retaining more than 99.5% of the total alleles detected. We found no significant differences in allele frequency distributions or in genetic diversity parameters between the chosen core accessions and the 622 genetically unique accessions. This work contributes to the understanding of Chinese jujube diversification and the protection of important germplasm resources.

Construction of a core collection of native Perilla germplasm collected from South Korea based on SSR markers and morphological characteristics

Article Open access 13 December 2021

Kyu Jin Sa, Dong Min Kim, … Ju Kyong Lee

Genetic diversity and population structure of ridge gourd (Luffa acutangula) accessions in a Thailand collection using SNP markers

Article Open access 28 July 2021

Grimar Abdiel Perez, Pumipat Tongyoo, … Paweena Chuenwarin

DNA fingerprinting, fixation-index (Fst), and admixture mapping of selected Bambara groundnut (Vigna subterranea [L.] Verdc.) accessions using ISSR markers system

Article Open access 15 July 2021

Md Mahmudul Hasan Khan, Mohd Y. Rafii, … Jamilu Halidu

Introduction

Chinese jujube (Ziziphus jujuba Mill. var. jujuba), which belongs to the buckthorn family (Rhamnaceae), is an important deciduous fruit tree that is typically grown in temperate and subtropical areas. It is indigenous to the middle and lower reaches of the Yellow River of China and was first domesticated 7,700 years ago¹. Chinese jujube is both consumed as a fruit and used in herbal medicine because it has high vitamin C, cyclic AMP and mineral content (particularly potassium and iron) as well as biologically active compounds². In addition, it is considered an ideal cash crop for arid and semi-arid areas due to its high tolerance to drought and salinity¹. Historically, Chinese jujube has a large-scale commercial production in China and South Korea, recently, it has been gradually gaining prominence in Australia, the USA and other countries³. Chinese jujube has been cultivated throughout China except the northernmost province, Heilongjiang and the far southwestern province of Tibet, with a cultivation area of two million ha¹. The leading provinces in Chinese jujube production are Xinjiang followed by Shannxi, Shanxi, Hebei, Shandong and Henan, accounting for approximately 90% of the entire yield in China.

Chinese jujube originates from its wildtype, sour jujube (Z. jujuba Mill. var. spinosa (Bunge) Hu ex H. F. Chow)¹, mainly propagated by grafting and suckering, but it can also be propagated using seeds. Crossbreeding of Chinese jujube has proven difficult because of the small flower and low fruit and kernel production in the seed⁴. New cultivars are mainly being developed based on selection from spontaneous somatic mutants (‘sports’) and occasional seedlings. The number of cultivars has increased over time and more than 900 documented cultivars currently exist in China, with these cultivars having been selected by farmers and breeders. Although most of the Chinese jujube cultivars are diploid (2n = 2× = 24), a few triploids exist. ‘Zanghuangdazao’ was the first triploid cultivar found in nature and it exhibits considerable genetic variation⁵. ‘Pingguozao’ (recently certified as ‘Jingling No. 1’) was also shown to be a triploid using flow cytometry and chromosome counting⁶.

The two largest Chinese jujube collections are housed at the National Chinese Jujube Germplasm Repository, located in Taigu County, Shanxi Province and the National Foundation for Improved Cultivar of Chinese Jujube, located in Cang County, Hebei Province. Several other smaller local collections also exist. Due to the ease of asexual propagation and the frequent transport of cultivars between regions, there are a large number of synonyms for some cultivar names. Moreover, mislabeling may also occur in the germplasm collections, which hinders cultivar identification, exploitation, evaluation and use. Therefore, a noteworthy goal is to characterize current Chinese jujube collections to improve management and utilization.

Traditionally, plant cultivar differentiation was based on morphological characteristics and pedigree information. However, morphological descriptions can possess limitations, as morphology can be influenced by environmental factors and requires skilled assessment⁷. With the advent of molecular marker techniques, DNA fingerprinting has become an important tool used to identify and delineate cultivars and quantify variation within the germplasm. Different types of molecular markers, including random amplified polymorphic DNA (RAPD), amplified fragment length polymorphic DNA (AFLP), sequence-related amplified polymorphism (SRAP) and simple sequence repeats (SSR), have been applied to identify Chinese jujube cultivars, evaluate genetic diversity and conduct QTL mapping⁴. Among these markers, SSRs have become the genetic marker of choice due to a high reproducibility and ability to identify high levels of genetic polymorphism, co-dominance, broad genome distributions and genetic diversity. A great amount of SSR markers have been developed for Z. jujuba^8,9,10 and sour jujube¹¹ (Ziziphus jujuba var. spinosa), which have been used for linkage map construction (unpublished data) and genetic diversity estimations^4,5. More recently, our research group reported that 76 major cultivars employed in Chinese jujube production exhibited comparatively high genetic diversity based on 31 SSR primer pairs compared with fruit crops like grape and apple⁴. We found that the recorded location distributions of many Chinese jujube cultivars may not represent their actual origin. The genetic diversity of 174 Chinese jujube genotypes was also evaluated using AFLP and SRAP markers¹². Therefore, considering the large number of germplasm resources, it warrants a more comprehensive understanding of the diversity within the germplasm collections.

Management of large germplasm collections is often costly, time-consuming and labor-intensive, limiting the breeding capacity and in-depth explorations of the germplasms. Because these collections often contain redundant accessions, so it is urgent to build a core collection, which, as the representative germplasm resources of the entire germplasm collection, preserves the maximum genetic diversity and minimum repetition of a crop species¹³. Therefore, a core collection can improve germplasm selection and evaluation for curators and breeders, while maintaining a core set that representative of the genetic diversity of the entire germplasm collection. This strategy would allow for allelic gene varieties and genotype-phenotype associations to be efficiently mined and assessed.

Here we aimed to (1) develop molecular fingerprints and determine the genetic redundancy in the germplasm accession collections, (2) identify the genetic relationships among these accessions, (3) evaluate the level of genetic diversity in the collections and (4) construct a suitable core collection to be used as a germplasm resource.

Results

Chinese jujube germplasm identity analysis

Twenty-four SSR loci were employed to identify unique genotypes among the 947 diploid accessions. In total, 622 distinct SSR genotypes were detected in the collections (Table S1). In total, 499 accessions had unique multi-locus genotypes (Table S1) and each was represented by only one accession in the collections. The remaining 448 accessions, which accounted for 47.3% of the collections, possessed non-unique SSR profiles and were represented by 123 different genotypes (Table 1). Hereafter, accessions that were genetically identical to at least one other accession are defined as a duplicate set. The 123 duplicate sets were found in 198 and 250 accessions for Cangzhou and Taigu, respectively (Table S2). Within the 123 duplicate sets were three distinct types. The first type represented different strains of the same cultivar in Cangzhou (e.g., ‘Dongzao-103’ and ‘Dongzao-100’). The second type corresponded to an identical name with different ‘-C’ or ‘-T’ suffixes, representing the accessions from Cangzhou or Taigu, respectively (e.g., ‘Mayizao-C’ and ‘Mayizao-T’). The third type was the synonym that contained the largest group listed in group 18, with as many as 37 accessions, including several ‘Xiaozao’ accessions (Table S2). Separate identity analyses for Cangzhou and Taigu are also included in Table S2, which revealed 43 and 64 duplicate sets, respectively.

Table 1 List of diploid accessions information and genetic parameters for Cangzhou and Taigu.

Full size table

Genetic diversity of the subset with unique genotypes

All 24 SSR loci successfully amplified polymorphic and reproducible alleles in 622 genotypes. The 24 SSRs yielded high discriminating capacity, as deduced from the low cumulative identity probability (PI) of 8.6E-19 (Table 2). Private alleles were investigated among the sour jujube, diploid accessions and triploid species. Sour jujube accessions revealed 59 alleles and1 private allele in 24 SSRs, whereas no private alleles were identified for the triploid species.

Table 2 Observed probability of identity calculated from 622 unique genotypes using GenAlEx 6.5 software on 24 SSR loci.

Full size table

A total of 215 alleles were detected in the 622 unique diploid genotypes, ranging from 3 alleles at BFU0584 and BFU0733 to 21 alleles at BFU0308, with a mean of 8.96 alleles per locus. The Ne value ranged from 1.35 alleles at locus BFU0521 to 8.97 alleles at locus BFU0308, with an average of 3.15 alleles per locus (Table 3). The allele size varied from 104 bp at locus BFU0574 to 316 bp at locus BFU0377. Of the 215 alleles, 128 were considered rare alleles, occurring at a low frequency (<0.05) in the entire germplasm collections and representing 59.5% of the total alleles. A minimum allele frequency of 0.1% was found for all 24 loci, except BFU0263, BFU0733, BFU0478, BFU1279, BFU0277, BFU0501 and BFU1178. The maximum allele frequency (0.86) was observed for allele 240 at BFU0521. In addition, the mean Ho and He values per locus ranged from 0.25 at BFU0479 to 0.91 at BFU0308 and from 0.26 at BFU0521 to 0.89 at BFU0308. Moreover, the important genetic diversity estimator, polymorphic information content (PIC), revealed high diversity levels for all genotypes, averaging 0.56. Sixteen microsatellite loci were highly polymorphic (PIC > 0.5), ranging from 0.51 at BFU0479 to 0.88 at BFU0308. Eight loci exhibited moderate polymorphic trends (0.25 < PIC < 0.5), ranging from 0.25 at BFU0521 to 0.42 at BFU0478 (Table 3).

Table 3 Genetic diversity statistics for 24 SSR loci in 622 unique genotypes.

Full size table

Comparison of genetic diversity between the Cangzhou and Taigu repositories

Of the 947 diploid accessions, 479 were from the Cangzhou repository and 468 were from the Taigu repository. As for the 622 distinct genotypes, Cangzhou and Taigu repositories possessed 362 and 315 genotypes, respectively (Table 1). Thus, Cangzhou had more unique genotypes than did Taigu.

The Cangzhou and Taigu repositories exhibited 196 and 176 alleles, with averages of 8.17 and 7.33 alleles, respectively (Table 1). The number of common alleles was 157. The Cangzhou and Taigu repositories exhibited 39 and 19 private alleles, respectively (Table S3). ‘Cuizaohong’ exhibited four loci (BFU0586, BFU0377, BFU0521 and BFU0564) with five private alleles, followed by ‘Henan-12’ and ‘Cuzao’, which each had three private alleles for the Cangzhou repository. ‘Kashixiaozao’ exhibited the largest number of private alleles, followed by ‘Hanguohongyan’ and ‘Chaoyangmopanzao,’ each of which possessed two loci (BFU0539, BFU0614 and BFU1205, BFU0574, respectively) with private alleles in the Taigu repository. However, the frequencies of the private alleles from both the Cangzhou and Taigu repositories were very low, with average values of 0.003 and 0.002, respectively (Table S3). Accessions with one or more private alleles are listed in Table S4, with Cangzhou and Taigu exhibiting 57 and 23 private alleles, respectively.

The mean expected heterozygosity (He) and observed heterozygosity (Ho) were 0.60 and 0.64 for Taigu, whereas the average values were 0.58 and 0.62 for Cangzhou. The mean PIC in Taigu (0.56) was similar to that of Cangzhou (0.53) (Table 1).

Population Structure and Principal Coordinate Analysis

In the absence of clear-cut origins of the accessions, a non-stratified strategy was adopted for the genetic structure analysis. Our results showed a clear peak for ΔK at K = 3 (Fig. 1), where all the accessions were roughly divided into three major groups, with some admixture among groups (Fig. 2). About 80% of accessions belonged to each group, which showed strong ancestry values averaging >0.80 (data not shown). Group 3 contained the highest number of accessions (351), followed by group 2 (140) and group 1 (131). Group 1 was comprised almost all of the ‘Dongzao’ accessions, such as ‘Dongzao-40,’ ‘Dongzao-103,’and ‘Chengwudongzao-T.’ Only ‘Gansudongzao’ was included in group 3. All five of the sour jujube accessions were assigned to group 3, albeit in two different nodes. Notably, four accessions from Korea (‘Hanguohongyan,’. ‘Hanguowudeng,’. ‘Hanguojinxiu,’. and ‘Hanguoyuechu’), which were highly adaptable to cold climates, were included in group 2, ‘Hanguofuzao’ from Korea, however, was assigned to group 3, suggesting that it may have a unique ancestry type.

Statistical analysis indicated that the percentage of genotypes with a membership coefficient ≥90% was 63.83%. A total of 83.28% of genotypes exhibited a membership coefficient ≥80% and only 3.38% of the accessions exhibited a membership coefficient of 5% or less. Based on standard permutation tests of the full data set, the groups defined by Structure suggest moderate genetic differentiation, as indicated by the global Fst value of 0.11 (P < 0.01).

A principal coordinate analysis (PCoA) roughly divided the 622 unique accessions into three clusters (Fig. 3). Principal coordinates (PCo) 1 and 2 explained 12.9% and 6.4% of the variance in the genotype data, respectively (Fig. 3). More than 50% of the accessions were assigned to cluster 3, whose accessions were much more scattered than those in clusters 1 and 2.

The dendrogram divided the 622 diploid accessions into three major clades (Fig. 4). Overall, the dendrogram corroborated the Structure results, with the exception ofclade 2, in which a few accessions were assigned to group 3.

Selection of core collections

To determine the optimal core size, 21 sampling percentages from the whole collection were designed, combined with two sampling strategies. As illustrated in Fig. 5, Curve 2 exhibited inferior efficiency compared to Curve 1, especially for core sets with smaller sample collections, which demonstrated a larger allele retention gap. For instance, the value represented in Curve 1 was approximately 50% higher than the cases in Curve 2 when the core selection size was 20–150 and when the core set reached 400, Curve 2 only captured 85.8% of the total alleles. In contrast, Curve 1 had plateaued as the core set reached 150 and the allelic retention nearly equaled the total alleles in the 947 diploid accessions. Ultimately, the simulated annealing algorithm (represented by Curve 1) is considered the preferred strategy for constructing the core selection and the core size of 150, which accounted for 15.6% of the total accessions and captured 99.5% of the total alleles, was ultimately defined.

In the present study, we constructed an integrated applied core collection for Chinese jujube that includes 20 retained accessions with applications to genetic research and breeding programs, all of which were chosen based on fruit cracking, fruit size, fruit shape and commercial importance in the jujube industry (Table S5). Then, a total of 77 and 79 accessions were identified using PowerMarker and PowerCore software at the genotypic level, respectively. All of the 156 accessions were subjected to the relationship test based on the cluster analysis. Finally, a core set of 150 was constructed after deleting 6 duplicates.

The mean values of Na, Ne, Ho, He and PIC from the core collection were greater than or equal to the 622 diploid accessions (Table 4). Heterozygosity and alleles of all loci in the 150 core collections were 0.64 and 214, respectively, while the 622 diploid accessions yielded values of 0.61 and 215. No significant differences were observed for Na, Ne, Ho, He and PIC between the core and the 622 diploid accessions, as indicated by Levene’s test for equality of variance and t-tests for equality of means (Table 4). The frequency of alleles in the core collection and the 622 unique genotypes was highly correlated (R = 0.9453) (Fig. 6).

Table 4 Comparison of the genetic diversity and significance test of the differences between the 622 unique diploid genotypes collection (215 alleles) and core collection (214 alleles).

Full size table

Discussion

The aim of this work was to identify the genetic diversity, genetic structure and a core collection of Ziziphus jujuba Mill. var jujuba accessions. Now, we interpret our results with regard to genetic diversity and the causes of the genetic redundancy. The present status of genetic structure is briefly discussed. In addition, we further explain the efficiency of the strategy used to construct the core collection. Genetic redundancy is an important issue in plant genetic resource management. The identification of duplicates is important in germplasm repositories, particularly when considering the construction of core collections.

Different rates of duplication have been extensively reported in soybean¹⁴, lychee¹⁵, grape¹⁶ and melon¹⁷. In the present study, a genetic characterization of Chinese jujube found 123 genotypes that were genetically identical to at least one other accession, which accounts for about half of the collection (47.3%, 947 accessions) (Table S2). This suggests that duplicates may frequently occur in Chinese jujube. Some duplicates (79 groups, e.g., Changmuzao-C’/‘Changmuzao-T’; Table S2) correspond to an accession with an identical name with different suffixes, ‘-C’ and ‘-T’, which represent the accessions from Cangzhou and Taigu, respectively. This may indicate common accessions in both repositories. Other duplicates appear in either of the two germplasm repositories (For example, ‘Yuanlizao-C1’/‘Yuanlizao-C2,’ ‘Jinsi No. 3-T1’/‘Jinsi No. 3-T2,’ etc., Table S2). Some other duplicates may have occurred as a result of incorrect origin identification, as the recorded location of many Chinese jujube cultivars may not represent their actual origin⁴. Olive and grape crops, which have long cultivation histories, have faced similar origin identification issues^18,19. In general, the redundant genotypes are consistent with expectations. This is probably because the sports (spontaneous somatic mutants) or clonal selections are hard to differentiate from their original cultivar based on a limited number of molecular markers¹⁶. Moreover, considering the high genetic similarity level, as indicated by the propagation characteristics of Chinese jujube, the redundant accessions with distinct phenotypes are suitable for functional genomic studies. For example, ‘Hupingzao,’ which is identical to ‘Junzao’ based on the SSR genotype (Table S2), is a selected cultivar from ‘Junzao’ with a different fruit shape. Other identical pairs identified in the study can be phenotypically differentiated based on various traits such as fruit size (e.g., ‘Zhanpudazao’ vs. ‘Xiaozao-C2’) and shape (e.g., ‘Lelingmopanzao’ vs. ‘Yuanling’). As Emanuelli et al.¹⁶ highlight, duplicate accessions with differing phenotypic traits could be especially valuable material for further studies of the regulation of important traits. Thus, it is necessary for the accessions with identical SSR genotypes to be further evaluated morphologically or to be investigated by more molecular markers before being considered for elimination from the collection.

The study evaluated the genetic diversity of a large Chinese jujube collection (622 unique diploid genotypes), representing the largest and most extensive study of this species to date. The number of alleles per locus (mean = 8.96) was much higher than that detected in 76 major Chinese jujube cultivars (mean = 5.70)⁴. The high level of allele variation may be due to the large number of accessions analyzed. The present study showed that the alleles are not evenly distributed in both repositories, partially due to the existence of many low frequency alleles (<0.05). This is verified by levels of 58.2% and 50.6% for Cangzhou and Taigu, respectively. Somatic mutation is important for the breeding of Chinese jujube, which has been kept by the vegetative method, producing an excess of low-frequency variants²⁰. As a consequence, it is necessary to strengthen the protection of rare alleles, especially in the Cangzhou collection.

In accordance with our previous studies, which revealed the mean values of Ho (0.678) and He (0.621) using 31 SSRs⁴, high heterozygosity levels (Ho of 0.64 and He of 0.60) were detected in the present study. The results also agree with a recent report from an assessment of the entire genome sequence⁹. Several studies have shown that cash trees, such as Citrus²¹, Diospyros kaki Thumb²² and Castanea crenata²³, also exhibited high genetic heterozygosity. The results may be explained by cross-pollination arising in fruit trees, including Chinese jujube, which are propagated vegetatively. It is necessary to plant different varieties together to ensure cross-pollination in order to overcome the prevalence of self- and cross-incompatibility. Other causes may be related to long-term natural selection, the mixed nature of the accessions or the historic mixing of strains from different populations²⁴.

Structure and cluster analyses are effective means for studying genetic relationships related to germplasm resources^25,26. Structure analysis showed that the grouping was largely consistent with the UPGMA clustering (Fig. 4). Considering the higher genetic diversity levels in groups 1 and 3, a higher percentage of mixed ancestry rate derived from the genotypes in group 2 may have occurred. The low proportion of the variance explained by the first two axes of the PCoA indicated that the planar graph may not efficiently represent a large number of variables. Similar results have been previously reported by Belaj et al.²⁷ and Leigh et al.²⁸. Despite the loss of geographical origin information, the cluster differentiation is evident. The first axis separated the majority of accessions in cluster 2 from those in clusters 1 and 3, whereas the second axis separated the majority of the accessions in cluster 1 from those in clusters 2 and 3. However, a small degree of admixture existed in the first two axes, suggesting that no strict distinction exists among the three clusters. The results can also be explained by the low molecular variance among the clusters (0.85%), indicating limited differentiation.

The taxonomic controversy between sour jujube and Chinese jujube is worth noting²⁹. It was long considered that jujube was domesticated from wild jujube^30,31,32. Some have classified Chinese jujube and sour jujube as two independent species based on the morphology, habitat, anatomy and other differences³³. Others have treated sour jujube as a subspecies based on a SRAP analysis and ITS sequence data³⁴. However, these studies failed to make convincing arguments to defend their positions. In the present study, sour jujube accessions were clustered into two different clades (Fig. 4). Structure analysis also divided the sour jujube accessions into two different groups, indicating that different groups may have been independently domesticated, which agrees with the results of a cpDNA analysis³⁵. Thus, the present study supports the view that sour jujube should not be recognized as a unique species.

The major consideration in constructing a core collection from a very large germplasm collection is to develop reliable classification criteria. However, the problem does not exist within Chinese jujube accessions. Groups of accessions defined by the growing region, cultivar origin, species or subspecies proposed in previous studies cannot be effectively applied to Chinese jujube^15,36. Consequently, a non-group strategy is adopted to construct a core collection. Previous studies have proposed differing criteria regarding the relative sizes of core collections^37,38,39,40. Most researchers believe that 5–20% of the sampled size should encompass the genetic diversity of the entire collection. In the present study, we designed a large scale core set (2–40% of the 947 diploid accessions). A comparison of the sampling efficiency (i.e., the ability to capture allele numbers) supports the simulated annealing algorithm as the favorable approach (Fig. 5).

Large plant collections are expensive to maintain. Thus, a minimum number of samples that represent maximum genetic diversity is recommended⁴¹. The results suggest that the subset with a 15.6% sampling ratio yielded the largest allelic retention (99.5%). Similar studies have reported allelic retention values of 95.74% and 100% in pear⁴² and melon¹⁷, with sampling ratios reaching 24.2% and 19.4%, respectively. Only 4% of the sampling proportion sufficiently captured the entire genetic diversity of analyzed grapevine collections, which may be due to the high heterozygosity level and redundancy in the collection⁴³.

Twenty accessions listed in Table S5 were retained to assure representation of the characteristics in the core collection. The retained accessions, representing each of the unique phenotypic characteristics not included in the core, could easily be added to the core collection. Moreover, the complicated relationships shown in Fig. 4 reflect the admixture within Chinese jujube and pre-selection can avoid rejection of high-quality accessions. Larger subsets analyzed in the study guarantee full allelic coverage and maximum genetic diversity, especially considering the abundance of low-frequency alleles (59.5%). Different core collection strategies have been reported^{15,17,36,37,42}. The simulated annealing algorithm implemented in PowerMarker software ensured a high allelic coverage, while PowerCore software yielded maximum allele diversity with the lowest sampling intensity. The combination of the two strategies is useful for conservation purposes. No significant difference was observed in the variability parameters and allele frequency distribution between the core and entire unique collections, indicating that the core collection developed in the present study effectively represents the 622 unique genotypes. Due to the lack of a proper characterization method and/or a large number of germplasms, reliable data related to important plant traits, such as the disease tolerance (Witches Broom, fruit shrink disease, etc.), freezing, fruit cracking, fruit quality and other factors, may not be available. The genetic marker data are limited and morphological diversity may be lost if they are used solely to determine the core collection. Therefore, it is crucial to characterize the accessions morphologically. The development of a core collection can facilitate the enhanced characterization of important jujube traits.

A valuable core collection should be dynamic and periodically revised to incorporate additional accessions⁴⁴. Furthermore, we can determine new core collections that are suitable for other users. Core collections can provide a rational framework for intensive natural variation surveys linked to complex traits, such as fruit cracking resistance in Chinese jujube, which can improve utilization and breeding.

Methods

Plant material and genomic DNA extraction

In total, 962 Ziziphus accessions were collected from the National Chinese Jujube Germplasm Repository located in Taigu County, Shanxi Province (Taigu) and the National Foundation for Improved Cultivar of Chinese Jujube, Cang County, Hebei Province (Cangzhou) (Table S6). The samples consist of 942 diploid accessions and 15 triploid accessions of Z. jujuba var. jujuba, as well as five accessions of sour jujube (Table S7). All collected fresh young leaves for each accessions were immediately flash frozen in liquid nitrogen and stored at −70 °C until use. Total genomic DNA was extracted from the leaves following described methods⁴. DNA quality was tested using 1.0% agarose gel electrophoresis and the DNA was diluted to 10 ng/μL.

SSR markers and PCR reaction

A set of 24 SSR loci scattered throughout the genome were selected on the basis of their polymorphism and reproducibility⁴ (Table 2). PCR amplifications were performed on 10 μL volumes containing 1 μL of template DNA (10 ng/μL), 5 μL of 2x Taq mix, 0.4 μL of the forward primer (1 μM), 1.6 μL of the reverse primer (1 μM), 1.6 μL of M13 primer (1 μM) with a fluorescent label (FAM, HEX, ROX, or TAMRA) and 0.4 μL of ddH2O. The thermal cycling program consisted of pre-denaturation at 94 °C for 5 min, 30 cycles at 94 °C for 30 s, 55 °C for 30 s and 72 °C for 40 s, followed by 8 cycles at 94 °C for 30 s, 53 °C for 30 s, 72 °C for 40 s and a final extension at 72 °C for 10 min⁴⁵.

PCR products were analyzed via capillary electrophoresis using an ABI 3730XL DNA Sequencer (Applied Biosystems, Foster City, CA, USA). Alleles were identified using the GeneMarker v 1.75 software package (SoftGenetics LLC, State College, PA, USA).

Genetic diversity analysis

Microsatellite alleles were corrected using FlexiBin v 2⁴⁶ and GeneMarker v 1.75 (SoftGenetics LLC, USA). The Microsatellite toolkit v 3.1.1⁴⁷ was used to identify the duplicate sets. The remaining accessions with unique SSR genotypes were used to estimate the following parameters using GenAlEx 6.5⁴⁸: the number of alleles (Na), the effective number of alleles (Ne), observed heterozygosity (Ho), expected heterozygosity (He), the probability of identity (PI) and polymorphic information content (PIC).

Population structure analysis

A Bayesian clustering analysis was implemented in Structure 2.3.3^49,50 to evaluate population genetic structure. An admixture model and correlated allele frequencies were applied to estimate the ancestry fractions of each cluster attributed to each accession. For each value of K (range 1–20), twenty independent runs were performed with a burn-in period of 200,000 followed by 500,000 MCMC (Markov chain Monte Carlo) repetitions. Parameters were set to the default values and all accessions were treated as having unknown origins. The delta K method⁵¹ was implemented in Structure Harvester⁵² to determine the most probable value of K. The accessions with membership probabilities greater than or equal to 0.50 were considered to belong to the same group. Diversity statistics were calculated in PowerMarker v 3.25⁵³ based on the genetic clusters identified by Structure, including the genotype numbers of each cluster, major allele frequency, number of alleles, genetic diversity and polymorphic information content. An unweighted pair group method with an arithmetic mean (UPGMA) dendrogram was constructed using PowerMarker v 3.25⁵³. A principal coordinate analysis (PCoA), based on the standardized covariance of genetic distances was performed using GenAlEx v 6.5⁴⁸.

Core collection development

Given the absence of detailed genetic information about the accessions, a non-group-based strategy was adopted. Four steps were executed: (1) Twenty-one core collections corresponding to different scales were constructed to identify the optimal core collection size (the sampling scale increased from 20 to 400 in increments of 5,10, 50 and 100 when the scale ranging from 20 to 50, 50 to 150, 150 to 300 and 300 to 400, respectively). Five independent runs were then repeatedly performed for each core selection by using two algorithms, including simulated annealing algorithm and random algorithm, implemented in PowerMarker to determine the optimal core collection size. (2) Several accessions that are agronomically (cracking resistance, fruit size, etc.) and commercially important (e.g., ‘Huanghuadongzao,’ ‘Xinzhenghuizao’ and ‘Junzao’) were selected as the retained accessions (listed in Table S5). (3) PowerMarker software⁵³ and PowerCore software⁵⁴ selected accessions based on the allele number and genetic diversity. The PowerCore software used a heuristic algorithm. A total of 1,000 independent runs were conducted based on the PowerMarker software⁵⁴. Accessions with occurrence numbers above 500 were then retained in the core collection. The results from the analysis by the two softwares were combined for further screening. (4) Based on the dendrogram, one of the accessions with a close relationship was removed until the optimal core size was reached.

SPSS v18.0 (SPSS, Chicago, IL, USA) was used to assess the final core collection by performing Levene’s test and T-test for Na, Ne, Ho, He and PIC values between the core and the entire unique collection. The comparison of allele frequency were carried out with Microsoft Excel (Microsoft, Washington, USA).

Additional Information

How to cite this article: Xu, C. et al. Identifying the genetic diversity, genetic structure and a core collection of Ziziphus jujuba Mill. var. jujuba accessions using microsatellite markers. Sci. Rep. 6, 31503; doi: 10.1038/srep31503 (2016).

References

Liu, M. J. Chinese Jujube: Botany and Horticulture. Hortic Rev (Am Soc Hortic Sci) 32, 229 (2010).
Google Scholar
Gao, Q. H., Wu, C. S. & Wang, M. The jujube (Ziziphus jujuba Mill.) fruit: a review of current knowledge of fruit composition and health benefits. J Agr Food Chem 61(14), 3351–3363 (2013).
Article CAS Google Scholar
Crawford, R., Shan, F. C. & McCarthy, A. Chinese jujube: A developing industry in Australia. Acta Hortic 993, 29–36 (2013).
Article Google Scholar
Wang, S. et al. Isolation and Characterization of Microsatellite Markers and Analysis of Genetic Diversity in Chinese Jujube (Ziziphus jujuba Mill.). Plos ONE. 9(6), e99842 (2014).
Article ADS PubMed PubMed Central Google Scholar
Zhang, C. M. et al. Genetic diversity and population structure of sour jujube, Ziziphus acidojujuba. Tree Genet Genomes 11, 809 (2015).
Article Google Scholar
Liu, X. S. et al. Discovery and Identification of Natural Triploid Ploidy of Chinese Jujube Cultivar ‘Pingguozao’. Acta Horti Sin 40(3), 426–432 (in Chinese with English abstract) (2013).
Google Scholar
Alcaraz, M. L. & Hormaza, J. I. Molecular characterization and genetic diversity in an avocado collection of cultivars and local Spanish genotypes using SSRs. Hereditas 144(6), 244–253 (2007).
Article CAS PubMed Google Scholar
Xiao, J. et al. Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability. Plos ONE. 10(5), e0127812 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. et al. A Chinese jujube (Ziziphus jujuba Mill.) fruit-expressed sequence tag (EST) library: Annotation and EST-SSR characterization. Sci Hort-AMSTERDAM 165, 99–105 (2014).
Article CAS Google Scholar
Li, Y. et al. De Novo Assembly and Characterization of the Fruit Transcriptome of Chinese Jujube (Ziziphus jujuba Mill.) Using 454 Pyrosequencing and the Development of Novel Tri-Nucleotide SSR Markers. Plos ONE. 9(9), e106438 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, C. et al. Development and characterization of microsatellite markers for sour jujube (Ziziphus jujuba var. spinosa). Indian J Genet Pl Br 73, 338–341 (2013).
Article CAS Google Scholar
Bai R. X. Studies on genetic diversity and core collection construction of Ziziphus jujuba germsplasm resources using AFLP and SRAP marker. Ph.D. thesis, Agricultural University of Hebei (2008).
Frankel, O. H. & Brown, A. H. D. Plant genetic resources today: a critical appraisal (eds Holden, J. H. W., Williams, J. T. ) 249–257 (George Allen & Unwin, 1984).
Kuroda, Y. et al. Genetic diversity of wild soybean (Glycine soja Sieb. et Zucc.) and Japanese cultivated soybeans [G. max (L.) Merr.] based on microsatellite (SSR) analysis and the selection of a core collection. Genet Resour Crop Evol 56(8), 1045–1055 (2009).
Article CAS Google Scholar
Sun, Q. et al. Developing a core collection of litchi (Litchi chinensis Sonn.) based on EST-SSR genotype data and agronomic traits. Sci Hort-AMSTERDAM 146, 29–38 (2012).
Article ADS Google Scholar
Emanuelli, F. et al. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape. BMC Plant Biol 13(1), 39 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hu, J. et al. Microsatellite Diversity, Population Structure and Core Collection Formation in Melon Germplasm. Plant Mol Biol Rep 1–9 (2014).
Cipriani, G. et al. The SSR-based molecular profile of 1005 grapevine (Vitis vinifera L.) accessions uncovers new synonymy and parentages and reveals a large admixture amongst varieties of different geographic origin. Theor Appl Genet 121(8), 1569–1585 (2010).
Article PubMed Google Scholar
Marra, F. P. et al. Genetic relationships, structure and parentage simulation among the olive tree (Olea europaea L. subsp. europaea) cultivated in Southern Italy revealed by SSR markers. Tree Genet Genomes 9(4), 961–973 (2013).
Article Google Scholar
Arnaud-Haond, S. et al. Standardizing methods to address clonality in population studies. Molecular Ecology, 16(24), 5115–5139 (2007).
Article CAS PubMed Google Scholar
Barkley, N. A. et al. Assessing genetic diversity and population structure in a citrus germplasm collection utilizing simple sequence repeat markers (SSRs). Theor Appl Genet 112(8), 1519–1531 (2006).
Article CAS PubMed Google Scholar
Del Mar Naval, M. et al. Analysis of genetic diversity among persimmon cultivars using microsatellite markers. Tree Genet Genomes 6(5), 677–687 (2010).
Article Google Scholar
Tanaka, T. et al. Genetic diversity of Castanea crenata in northern Japan assessed by SSR markers. Breed Sci 55(3), 271–277 (2005).
Article CAS Google Scholar
Kotze, A. & Muller, G. H. In Proceedings, 5th World Congress on Genetics Applied to Livestock Production 21, 413–416 (University of Guelph, 1994).
Goossens, B. et al. Measuring genetic diversity in translocation programmes: principles and application to a chimpanzee release project. Anim Conserv 5, 225–236 (2002).
Article Google Scholar
Beaumont, M. et al. Genetic diversity and introgression in the Scottish wildcat. Mol Ecol 10, 319–336 (2001).
Article CAS PubMed Google Scholar
Leigh, F. J. et al. A comparison of molecular markers and statistical tools for diversity and EDV studies (eds Tuberosa, R. et al.) 349–363 (Bologna: Avenue Media, 2005).
Belaj, A. et al. Developing a core collection of olive (Olea europaea L.) based on molecular markers (DArTs, SSRs, SNPs) and agronomic traits. Tree Genet Genomes 8, 365–378 (2012).
Article Google Scholar
Akhter, C. et al. Ziziphus jujuba Mill. subsp. spinosa (Bunge) Peng, Li & Li: a New Plant Record for the Indian Subcontinent. Taiwania. 58(2), 132–135 (2013).
Google Scholar
Yan, G. J. Cytological study of Chinese Jujube, Master’s Thesis, Agricultural University of Hebei, 1984 (in Chinese).
Qu, Z. Z. et al. Application of isozyme in classification of Chinese jujube cultivars. J Agric Univ Hebei 13(4), 1–7 (in Chinese) (1990).
Google Scholar
Peng, J. Y. et al. Study on the relationship and evolution of Chinese jujube and sour jujube through the pollen morphology. Fruit tree Hebei 3, 25 (in Chinese) (1992).
Google Scholar
Liu, M. J. & Cheng, J. R. A taxonomic study on Chinese jujube and wild jujube. J Agric Univ Hebei 17(4), 1–10 (1994).
Google Scholar
Li, L. et al. Analysis of the genetic relationships in Chinese Ziziphus with SRAP markers. Agric Sci China 9(9), 1278–1284 (2010).
Article Google Scholar
Huang, J. et al. Development of Chloroplast Microsatellite Markers and Analysis of Chloroplast Diversity in Chinese Jujube (Ziziphus jujuba Mill.) and Wild Jujube (Ziziphus acidojujuba Mill.). PloS ONE. 10(9), e0134519 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Construction and evaluation of a primary core collection of apricot germplasm in China. Sci Hort-AMSTERDAM 128(3), 311–319 (2011).
Article Google Scholar
Brown, A. H. D. The case for core collections. in the use of plant genetic resources (eds Brown, A. H. D. et al.) 136–156 (Cambridge University Press, 1989).
van Hintum, T. H. J. L. et al. Core collections of plant genetic resources. IPGRI Technical Bullletin No. 3 (International Plant genetic Resources Institute, 2000).
Yonezawa, K. et al. In Core Collections of Plant Genetic Resources (eds Hodgkin, T. et al.) 35–54 (John Wiley and sons, 1995).
Charmet, G. & Balfourier, F. The use of geostatistics for sampling a core collection of perennial ryegrass populations. Genet Resour Crop Evol 42, 303–309 (1995).
Article Google Scholar
Frankel, O. H. In Genetic manipulation: Impact on man and society (eds Arber, W. K. et al.) 161–170 (Cambridge University Press, 1984).
Song, Y. et al. Identifying genetic diversity and a preliminary core collection of Pyrus pyrifolia cultivars by a genome-wide set of SSR markers. Sci Hort-AMSTERDAM 167, 5–16 (2014).
Article CAS Google Scholar
Le Cunff, L. et al. Construction of nested genetic core collections to optimize the exploitation of natural diversity in Vitis vinifera L. subsp. sativa. BMC Plant Biol 8(1), 31(2008).
Article CAS PubMed PubMed Central Google Scholar
Jaradat, A. A. et al. The dynamics of a core collection (eds Hodgkin, T. et al.) 179–186 (John Wiley and sons, 1995).
Schuelke, M. An economic method for the fluorescent labeling of PCR fragments. Nat Biotechnol 18(2), 233–234 (2000).
Article CAS PubMed Google Scholar
Amos, W. et al. Automated binning of microsatellite alleles: problems and solutions. Mol Ecol Notes. 7, 10–14 (2007).
Article CAS MathSciNet Google Scholar
Park, S. D. E. Trypanotolerance in west African cattle and the population genetic effects of selection, PhD thesis, University of Dublin (2001).
Peakall, R. & Smouse, P. E. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research – an update. Bioinformatics 28, 2537–2539 (2012).
Article CAS PubMed PubMed Central Google Scholar
Falush, D. et al. Inference of population structure using multi-locus genotype data: linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003).
CAS PubMed PubMed Central Google Scholar
Pritchard, J. K. et al. Inference of population structure using multi-locus genotype data. Genetics 155, 945–959 (2000).
CAS PubMed PubMed Central Google Scholar
Evanno, G. et al. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14(8), 2611–2620 (2005).
Article CAS PubMed Google Scholar
Earl, D. A. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4(2), 359–361 (2012).
Article Google Scholar
Liu, K. & Muse, S. PowerMarker: new genetic data analysis software. Version 3.23 Bioinformatics 21(9), 2128–2129 (2005).
Article CAS PubMed Google Scholar
Kim, K. W. et al. PowerCore: a program applying the advanced M strategy with a heuristic search for establishing core sets. Bioinformatics 23, 2155–2162 (2007).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This research was supported by Projects in the National Science & Technology Pillar Program (2013BAD14B0302), the National Natural Science Foundation of China (31372019, 31400578), the National Science and Technology Basic Works Project of China (2013FY111700, 2011FY110200) and the National S&T Infrastructure Program of China for Crop (Chinese jujube) Germplasm Resources.

Author information

Authors and Affiliations

National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, 100083, Beijing, China
Chaoqun Xu, Jiao Gao, Zhe Wang, Yingyue Li & Xiaoming Pang
National Foundation for Improved Cultivar of Chinese Jujube, Cangzhou, 061000, Heibei, China
Zengfeng Du
Pomology Institute, Shanxi Academy of Agricultural Science, 030815, Taigu, Shanxi, China
Dengke Li

Authors

Chaoqun Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jiao Gao
View author publications
You can also search for this author in PubMed Google Scholar
Zengfeng Du
View author publications
You can also search for this author in PubMed Google Scholar
Dengke Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhe Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yingyue Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoming Pang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.X., Y.L. and X.P. contributed to the design of experiments in the study. C.X., J.G., Z.D., D.L. and Z.W. conducted the experiments. C.X., J.G., Y.L. and X.P. analyzed the experimental data. C.X., J.G., Y.L., X.P., Z.D., D.L. and Z.W. contributed to the preparation and writing of the manuscript. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Xu, C., Gao, J., Du, Z. et al. Identifying the genetic diversity, genetic structure and a core collection of Ziziphus jujuba Mill. var. jujuba accessions using microsatellite markers. Sci Rep 6, 31503 (2016). https://doi.org/10.1038/srep31503

Download citation

Received: 14 January 2016
Accepted: 21 July 2016
Published: 17 August 2016
DOI: https://doi.org/10.1038/srep31503

This article is cited by

Genetic diversity and population structure of Robinia pseudoacacia from six improved variety bases in China as revealed by simple sequence repeat markers
- Qi Guo
- Sen Cao
- Yun Li
Journal of Forestry Research (2022)
SSR-based population structure, molecular diversity and identity cards of Ziziphus species from Pakistan and China
- Nisar Uddin
- Niaz Ali
- Inayat Ur Rahman
Genetic Resources and Crop Evolution (2021)
Determination of Chromosome Number and Genetic Diversity using SSR and RAPD Markers in Ziziphus jujuba Mill.
- Saeid Daghighi
- Zohreh Alizadeh
- Homa Habibi
Iranian Journal of Science and Technology, Transactions A: Science (2021)
Characterization of EST-SSR markers in Curcuma kwangsiensis S. K. Lee & C. F. Liang based on RNA sequencing and its application for phylogenetic relationship analysis and core collection construction
- Yuanjun Ye
- Yechun Xu
- Jinmei Liu
Genetic Resources and Crop Evolution (2021)
Construction of an anchoring SSR marker genetic linkage map and detection of a sex-linked region in two dioecious populations of red bayberry
- Yan Wang
- Hui-Min Jia
- Zhong-Shan Gao
Horticulture Research (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Chinese jujube germplasm identity analysis

Genetic diversity of the subset with unique genotypes

Comparison of genetic diversity between the Cangzhou and Taigu repositories

Population Structure and Principal Coordinate Analysis

Selection of core collections

Discussion

Methods

Plant material and genomic DNA extraction

SSR markers and PCR reaction

Genetic diversity analysis

Population structure analysis

Core collection development

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links