Abstract
The cultivated peanut, A. hypogaea L., is an important oil and food crop globally.High-density genetic linkage mapping is a valuable and effective method for exploring complex quantitative traits. In this context, a recombinant inbred line (RIL) of 146 lines was developed by crossing Huayu28 and P76. We developed 433,679 high-quality SLAFs, of which 29,075 were polymorphic. 4,817 SLAFs were encoded and grouped into different segregation patterns. A high-resolution genetic map containing 2,334 markers (68 SSRs and 2,266 SNPs) on 20 linkage groups (LGs) spanning 2586.37 cM was constructed for peanut. The average distance between adjacent markers was 2.25 cM. Based on phenotyping in seven environments, QTLs for oleic acid (C18:1), linoleic acid (C18:2) and the ratio of oleic acid to linoleic acid (O/L) were identified and positioned on linkage groups A03, A04, A09, B09 and B10. Marker2575339 and Marker2379598 in B09 were associated with C18:1, C18:2 and O/L in seven environments, Marker4391589 and Marker4463600 in A09 were associated with C18:1, C18:2 and O/L in six environments. This map exhibits high resolution and accuracy, which will facilitate QTL discovery for essential agronomic traits in peanut.
Similar content being viewed by others
Introduction
The cultivated peanut (A. hypogaea L.) is an allotetraploid (2n = 4× = 40) legume that is mostly grown in semi-arid tropic regions in the world as an important oil and food crop. During 2010–2014, the global average production was 42.27 million tons from an area of 25.27 million ha1. A genetic map constructed from a population segregation for a trait of interest is required for QTL (quantitative trait loci) identification. Currently, genetic linkage maps commonly include restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), simple sequence repeat (SSR), and single-nucleotide polymorphism (SNP) markers. A genetic map, especially a high-density genetic map, provides an important foundation for QTL mapping and anchoring sequence scaffolds, and the utility of genetic linkage maps depends on the types and numbers of markers used. Since 2001, 27 previously tetraploid peanut linkage maps were published, 23 of which were cultivated peanut species2,3,4,5,6,7,8,9,10,11,12,13,14,15, four of which were from crosses of cultivated and synthetic tetraploid peanut species16,17,18,19. There were four linkage maps which had more than 1,000 markers including SSR, transposon and SNP. Using the F2 population from A. hypogaea ‘Satonoka’ and A. hypogaea ‘Kintoki’, Shirasawa et al. (2012) constructed a high-density genetic linkage map which had 1,114 markers and spanned 2,166.4 cM9. Huang et al. (2016) constructed a high-density genetic linkage map of cultivated peanut which had 1,219 markers including 1,175 SSR and 42 transposon polymorphic markers, and spanned 2,038.75 cM15. Zhou et al. (2014) constructed one linkage map which was comprised of 1,685 marker loci, including 1,621 SNP and 64 SSR markers. The map displayed a distribution of the markers into 20 linkage groups (LGs), spanning a distance of 1, 446.7 cM10. Shirasawa et al. (2013) constructed a genetic linkage map of cultivated peanut with 1,469 markers19.
SNPs are the most abundant and stable form of genetic variation in most genomes and have become the marker type of choice in many genetic studies. Recently, SLAF-seq (specific length amplified fragment sequencing), a high-resolution strategy, has been developed for large-scale de novo SNP discovery20. Combining NGS, SLAF-seq is a very time- and cost-effective method. The efficiency of SLAF-seq was tested on data from rice21 and soybean22. This approach has been successfully applied to high-density genetic map construction, QTL analysis and mapping genes for many plants and animals. Since 2013, more than 30 high-density genetic maps have been constructed using SLAF markers, including SLAF-seq map for sesame (Sesamum indicum)23, kiwifruit (Actinidia chinensis)24, soybean (Glycine max)25, mei (Prunus mume)26, cucumber (Cucumis sativus)27,28,29, orchard grass (Dactylis spp.)30, and red sage (Salvia miltiorrhiza)31. This approach was also used to detect QTL for the isoflavone content of soybean32, growth-related QTL of the Chinese mitten crab Eriocheir sinensis33, fruit-related QTL of cucumber31, flowering time QTLs in orchard grass29 and adzuki bean (Vigna angularis)34. Guo et al. used SLAF markers and the BSA (bulked segregant analysis) method to discover QTLs controlling CMV (cucumber mosaic virus) resistance in pepper (Capsicum frutescens), and then identified the gene CA02g19570 as a possible candidate gene for resistance to CMV in pepper35. These results show that SLAF sequencing is a powerful high-throughput technique for the efficient development of a large number of polymorphic markers in a short time and is effective for linkage map construction and QTL analysis.
In this study, SLAF-seq was used for the rapid discovery of SNPs in the RIL population. Subsequently, we constructed a high-density genetic map of A. hypogaea L., which contained 2,334 markers (68 SSRs and 2,266 SNPs) on the 20 linkage groups and spanned 2,586.37 cM, with an average distance of 2.25 cM between adjacent markers. QTLs for oleic acid (C18:1), linoleic acid (C18:2) and the ratio of oleic acid to linoleic acid (O/L) were analysed based on phenotyping in seven environments. This map exhibited high resolution and accuracy and is the first map based on SLAF-seq in peanut. It provides a new method of constructing a peanut genetic map whereby SLAF-seq is applied to peanut. It will facilitate the identification of genes and QTLs underlying essential agronomic traits in peanut.
Results
High-throughput SLAF sequencing and genotyping
After SLAF library construction and Illumina sequencing, a total of 64.2 Gb of data containing 322.29 M reads were obtained. The average GC (guanine-cytosine) content was 42.91%, and Q30 ratio (bases with a quality score of 30, indicating a 1% chance of an error and thus 99% confidence) was 87.00%. In the maternal inbred line (Huayu28), the number of reads produced for 398,870 SLAFs was 12,310,352, and the average coverage for each SLAF marker was 21.81-fold. In the paternal line (P76), 11,979,609 reads and 404,089 SLAFs were generated, with an average coverage of 20.68-fold for each SLAF. For the analysis of the RIL mapping population, an average of 2,041,121 reads were generated for the development of 262,928 SLAF markers for each line, and the average coverage was 5.53-fold (Table 1).
After correcting or discarding low-depth SLAF tags, 433,679 SLAFs were identified, among which 29,075 were polymorphic with a polymorphism rate of 6.70% (Supplementary Dataset 1). The parental lines were given with different alphabets than genotypes to determine segregation patterns, and 23,673 from the 29,075 polymorphic SLAFs were successfully encoded and grouped into eight segregation patterns (ab × cd, ef × eg, hk × hk, lm × ll, nn × np, aa × bb, ab × cc and cc × ab) following a genotype encoding rule (Fig. 1). Since the two parents (Huayu28 and P76) are homozygous inbred lines with genotypes of aa and bb, only the 7,949 markers that fell into the aa × bb segregation pattern were used in linkage analysis.
Genetic linkage map
Comparing with the reference genome (http://peanutbase.org/home), HighMap assigned 2,334 of 2,384 markers to 20 LGs of peanut including 2,266 SNPs and 68 SSRs. The total genetic length of the molecular linkage map was 2,586.37 cM in twenty linkage groups with a mean marker distance of 2.25 cM between adjacent markers (Figure S1; Table 2 and Supplementary Dataset 2). The number of markers in each LG ranged from 13 to 369, with an average of 116.7 markers per LG. The LGs ranged from 68.79 to 191.24 cM in length, with average inter-marker distances of 0.3–6.09 cM, and nine linkage groups contained over 100 marker loci. A09 was the shortest, with 156 loci spanning 68.79 cM, while B04 was the longest group, with 240 loci spanning 191.24 cM. B01 contained the fewest markers with 13 loci, whereas B05 had the highest density with 369 loci.
In the map, 700 (29.99%) markers showed a skewed segregation pattern (p < 0.05; Table 2). The segregation distortion markers were distributed in 14 LGs. There were no segregation distorted markers in A01, A05, A09, A10, B01, B08. The ratio of segregation distorted markers in A02, B04, B06, B07, B08, B09 and B10 were lower than 8.06%. The ratios of segregation distorted markers were extremely high in A04, B03 and B05, at 95.56%, 90.42% and 73.44%, respectively.
The SSR markers were distributed across 20 LGs (Figure S1; Table 2, Supplementary Dataset 2 and Supplementary Dataset 3). 15 SSR markers were mapped to the same linkage group by comparing our map and other previous maps (Huang et al., 2016 and Peanutbase; Supplementary Dataset 3). For example, GM1992 and IPAHM288, located in A01 and A03 of those previous maps, were grouped in A01 and A03. 9 SSR markers showed different results to those previous maps. IPAHM_356, GM2259, and IPAHM_82, located in A05 and A09 of Peanutbase, respectively, were grouped in A01, A06 and B10 in this study. Twelve SSR markers showed confused positions; for example, TC13E05 was mapped to A02 in this study, while it was located in B02 of Peanutbase. Ah1TC6H03, IPAHM_659 and Ah1TC5D06 showed similar results.
Validation of the SNP-based genetic map
The quality of this peanut genetic map was evaluated by the haplotype and heat maps. The haplotype maps, which reflect the double exchange of the population, were developed for parental controls and 146 RILs using 2,334 markers (Additional file S1). Most of the recombination blocks were distinctly defined. The heat maps directly reflected recombination relationships among markers in each LG (Additional file S2). Each cell represented a recombination rate between two adjacent markers, the level of which was visualized by different colors ranging from yellow to purple (yellow indicated a lower recombination rate; purple indicated a higher rate). Heat maps indicated SNP markers in most LGs were well ordered. The collinearity of each LG with the peanut reference genome was also analyzed. As shown in Figure S2, a relatively high collinearity was observed between 20 LGs and the reference genome.
Phenotypic variation
Phenotypic data for four seasons were generated for oleic acid, linoleic acid and O/L (the ratio of oleic acid (C18:1) to linoleic acid (C18:2)) on two parents and the complete set of RIL. Phenotypic analysis revealed that the fitted curves were found to be multimodal (oleic acid, linoleic acid and O/L). The oleic acid content (C18:1), linoleic acid content (C18:2) and the ratio of oleic acid (C18:1) to linoleic acid (C18:2) (O/L) of maternal parent, Huayu28, varied between 0.404~0.468, 0.321~0.362 and 1.146~1.426. The RIL showed high phenotypic variation for C18:1, C18:2 and O/L in seven environments. The C18:1 of RIL ranged between 0.401~0.819, 0.426~0.821, 0.404~0.818, 0.387~0.836, 0.405~0.843, 0.377~0.843, and 0.383~0.824 during 2010, 2011, 2012 and 2014. The C18:2 of RIL ranged between 0.024~0.390, 0.021~0.364, 0.021~0.362, 0.020~0.400, 0.018~0.368, 0.025~0.413, and 0.019~0.384 during 2010, 2011, 2012 and 2014. The O/L of RIL ranged between 1.028~33.871, 1.170~38.371, 1.123~38.451, 0.967~40.360, 1.100~45.966, 0.927~34.147, and 1.061~40.803 during 2010, 2011, 2012 and 2014 (Fig. 2).
QTL analysis
Multi-season phenotypic data together with genotypic data generated on RIL were used for conducting QTL analysis. An interval mapping model with a logarithm of odds (LOD) score of 2.5 for potential QTLs was used for QTL detection. In total, significant QTLs for C18:1, C18:2 and O/L were found to be distributed on A03, A04, A09, B03, B09 and B10 of the peanut map (Table 3). For oleic acid (C18:1), two main QTLs were identified, which were located in A09 and B09 and showed consistent high contributions in seven environments. Phenotypic variation explained (PVE) of QTL in A09 ranged from 7.91% to 13.12% in seven environments, while PVE of QTL in B09 ranged from 46.15% to 57.60% in seven environments. QTL in A03 was identified in two environments (2012 Laixi and 2014 Laixi), and QTL in B10 was identified in two environments (2010 Jining and 2011 Laixi). For linoleic acid (C18:2) and O/L, similar results were found in seven environments. For all QTL for oleic acid, linoleic acid and O/L, the ‘P76’ parent contributed to high oleic acid (C18:1) and high O/L, while the ‘Huayu28’ parent contributed to high linoleic acid (C18:2).
Discussion
The SLAF-seq strategy, a combination of locus-specific amplification and high-throughput sequencing, has been subjected to a series of critical trials to guarantee its high efficiency, accuracy and density20. This approach has been successfully applied to high-density genetic map construction, QTL analysis and mapping genes for many plants and animals23,24,25,26,27,28,29,30,31,32,33,34,35. Based on a careful analysis of the genomic GC content, repeat conditions and genome length of Arachis duranensis and Arachis ipaensis, HaeIII was selected to digest the genomic DNA with a digestion rate of 94.02%. Subsequently, SLAFs (314–394 bp) were selected in a pilot experiment for further paired-end sequencing. A pre-designed scheme and a pilot experiment were conducted to ensure the density, uniformity and efficiency of the marker development. The average genotype quality score of all SLAF markers reached the cut-off value of 30, which was sufficient to filter the reads with low sequencing depth. Thus, the combination of sequence depth and genotype quality scores sufficiently enhanced the genotyping accuracy. Using high-throughput SLAF sequencing, we developed 433,679 high-quality SLAFs, of which 29,075 were polymorphic. A total of 2,266 polymorphic SLAFs were identified for linkage map construction. The sequencing average depth of these SLAFs in the parents and progenies were 54.32-fold, 57.72-fold and 9.83-fold, respectively. Our results clearly demonstrate that SLAF-seq is efficient for large-scale genotyping and rapid development of a large number of genetic markers. DNA marker distribution is not random with some genomic regions highly populated with markers whereas others are under-represented. In the present map, inter-marker distances varied in sizes. The inter-marker distances on A01, A02, A04, A05, A07, A09, B03, B04, B05 and B08 were more than 90%, which were less than and/or equal to 5.0 cM. The inter-marker distances less than and/or equal to 5.0 cM on other chromosomes were less than 90%, and these chromosomes comprised of less than 74 markers. Inter-marker distances larger than 10.0 cM were found on all chromosomes except A05, A09 and B05, suggesting that such inter-marker distances are not restricted to a particular chromosome. The longest one was 26.24 cM on the distal end of B02. The presence of these inter-marker distances may have negative effects on the application of mapped DNA markers; for example, genomic regions that lack DNA markers will make detection of quantitative trait loci (QTL) difficult36. Therefore, more comparable markers between different peanut maps are needed to fill in the inter-marker distances to obtain a more complete coverage of the peanut genome. A comparison between the high-density genetic linkage map and the maps from Huang et al. and Peanutbase showed that 15 SSR markers were mapped to the same linkage group, 9 SSR markers showed different results to those of previous maps and 12 SSR markers showed confused positions. For example, TC13E05 was mapped in A02 in this study, while it was located in B02 of Peanutbase. IPAHM_93 was mapped in A03 of both the maps in this study and Huang et al., but was mapped in B03 of Peanutbase. Ah1TC6H03, IPAHM_659 and Ah1TC5D06 showed similar results. The AA and BB subgenomes are highly similar19 which may contribute to the difficulty of marker assignment to the subgenomes.
Segregation distortion is a common biological phenomenon and is one of the engines driving evolutionary processes. It can be observed in almost all types of hybrid segregating populations. In general, the skewed segregation ratio of RIL populations is higher than that of backcross populations (BC) and doubled haploid populations (DH). F2 populations show the lowest skewed segregation ratio37. The genetic basis of segregation distortion is still under debate, and gametophyte and/or zygotic selection and chromosomal rearrangements may be the main causes38. Studies have demonstrated a large number of segregation distortions in many species, such as maize (Zea mays)39, barley (Hordeum vulgare)40, potato (Solanum tuberosum)41, sesame23, and sorghum (Sorghum bicolor)38,42,43. Segregation distorted markers are common in peanut linkage maps. Hong et al. (2010) used three RIL populations to construct a composite linkage maps with 175 SSR markers. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the three populations was 23.0%, 13.5% and 7.8% of the markers, respectively4. Zhou et al. (2014) constructed one linkage map which was comprised of 1,685 marker loci, including 1,621 SNPs and 64 simple sequence repeat (SSR) markers, of which 659 markers showed significant segregation distortion and were distributed among every LG10. Huang et al. (2016) used a RIL population to construct a high-density genetic linkage map with 1,219 mapped loci covering a total map length of 2,038.75 cM. Chi-square (χ2) analysis revealed significant segregation distortion (p < 0.05) for 152 loci (12.5%)15. In the present study, a RIL mapping population was used to construct a linkage map, and 700 markers (29.99%) of the 2,334 assigned markers showed significant segregation distortion. Most skewed loci were located on A04, B03, and B05. Although the molecular mechanism of segregation distortion remain undiscovered, numerous studies have shown that segregation distortion markers for linkage map construction could increase the quantity of markers on the map, the genome coverage of the map, and help to improve the detection of linked QTLs23,44,45,46. Xu showed that distorted markers can be used for QTL mapping with no detrimental effect on the result and can be beneficial if used properly44. Zhang et al. showed that segregation distortion could result in higher genetic variance than non-distortion and help to improve the detection of linked QTLs because distortion markers do not have a large effect on the position or effect estimations of QTL analysis46. Zhang et al. (2013) mapped 205 markers of segregation distortion onto the final map with a distribution on every LG of sesame, similar to the distribution of all markers23.
Oleic acid (C18:1) and linoleic acid (C18:2) were the major fatty acids and accounted for about 80% of peanut oil. Oleic acid (C18:1) is about 47% in normal and up to 80% in high oleic peanut lines and is associated with several human health benefits47,48. Linoleic acid (C18:2) is about 40% in normal and down to 2% in high oleic peanut lines. Manish et al. (2014) used RIL populations derived from normal and high oleic peanut varieties to locate FAD2 genes for C18:1, C18:2 and oleic/linoleic acid ratio (O/L)11. The mapping positions for ahFAD2A (A sub-genome) and ahFAD2B (B sub-genome) genes were assigned on A09 and B09 linkage groups. The PVE of ahFAD2B for C18:1, C18:2 and O/L were 26.54%, 25.59% and 41.02%, and the PVE of ahFAD2A for C18:1, C18:2 and O/L were 8.08%, 6.86% and 3.78%. In this study, we detected the two main QTLs for C18:1, C18:2 and O/L being located in A09 and B09. PVE of QTLs for C18:1, C18:2 and O/L in B09 ranged from 46.15% to 57.60%, 46.33% to 57.56% and 23.87% to 43.41%, respectively, while in A09 ranged from 7.19% to 13.12%, 7.36% to 12.95% and 6% to 11.13%, respectively. Genomic approaches such as high-throughput sequencing and large-scale genotyping technologies have been used in genetic linkage mapping. The SLAF-seq method provided significant advantages to generate enough polymorphic markers for high-density genetic map construction in peanut. The high density map is sufficient to ensure adequate polymorphic marker coverage in regions of interest and can be used as a reference map for peanut genetic studies.
Conclusions
In this study, we used the SLAF-seq method to develop 433,679 high-quality SLAFs, of which 29,075 were polymorphic. To our knowledge, we have constructed the first high-density genetic map of SLAF for the cultivated peanut (A. hypogaea L.), which consisted of 2,334 markers (68 SSRs and 2,266 SNPs) on the 20 linkage groups spanning 2,586.37 cM. The average distance between adjacent markers was 2.25 cM. The analysis of the SLAFs and their sequence information identified that SLAF-seq is an effective strategy for large-scale genotyping applied to the construction of a high-density map of peanut. Based on phenotyping in seven environments, QTLs for oleic acid (C18:1), linoleic acid (C18:2) and the ratio of oleic acid to linoleic acid (O/L) were identified and positioned on linkage groups A03, A04, A09, B09 and B10. Marker2575339 and Marker2379598 in B09 were associated with C18:1, C18:2 and O/L in seven environments, and Marker4391589 and Marker4463600 in A09 were associated with C18:1, C18:2 and O/L in six environments. This map exhibits high resolution and accuracy. It will facilitate QTL discovery for essential agronomic traits in peanut.
Material and Methods
Plant materials and DNA extraction
An F2:11 population of 146 RILs was derived from a cross between ‘Huayu28’ and ‘P76’. ‘Huayu28’ was early-mature, normal oleic content and small-seed peanut variety. ‘P76’ was lately-mature, high oleic content and medium-seed peanut variety. Seedlings of progeny and parents were planted in the experiment field of Shandong Peanut Research Institute in Laixi, Shandong Province, China, in 2014. Young healthy leaves from two parents and 146 RILs (F2:11) were collected and frozen in liquid nitrogen, then transferred to a −70 °C freezer. Total genomic DNA was extracted from each leaf sample by Plant Genomic DNA Kit (Tiangen Biotech (Beijing) Co., Ltd.). The concentration and quality of DNA were examined by electrophoresis in 0.8% agarose gels with a standard lambda DNA, and an ND-1000 spectrophotometer (NanoDrop, Wilmington, DE, USA).
SLAF library construction and high-throughput sequencing
SLAF-seq was used to genotype the 146 RILs and the two parents, as previously described13, with a few modifications. In brief, genomic DNA from each sample was treated with HaeIII (NEB, Ipswich, MA, USA), T4 DNA ligase (NEB), ATP (NEB), and HindIII adapter at 37 °C. These restriction-ligation reaction solutions were diluted and mixed with dNTP, Taq DNA polymerase (NEB) and MseI primer containing barcode 1 for PCR reactions. The E.Z.N.A Cycle Pure Kit (Omega, London, UK) was used to purify the PCR products. The purified PCR products were pooled and incubated at 37 °C with MseI, T4 DNA ligase, ATP, and Solexa adapter. After incubation, the reaction products were then purified using a Quick Spin column (Qiagen, Venlo, Netherlands), and electrophoresed on a 2% agarose gel. SLAFs of 314–394 bp (including adapter sequence indexes and adaptors) in size were selected for paired-end sequencing on a Gel Extraction Kit (Qiagen). The gel-purified product was sequenced on the Illumina HiSeq. 2500 system (Illumina, Inc; San Diego, CA, U.S.) according to the manufacturer’s recommendations. Real-time monitoring was performed for each cycle during sequencing, and the ratio of raw high-quality reads with quality scores greater than Q30 (a quality score of 30 indicates a 0.1% chance of obtaining an error, and thus 99.9% confidence) and the guanine-cytosine (GC) content were calculated for quality control. All sequences clustered together were defined as a SLAF loci. In each of the SLAF, we found polymorphism loci between the parents, and most of these were SNPs. All polymorphism SLAFs loci were genotyped with consistency in the offspring and parental SNP loci.
SLAF and SSR data analysis and genotyping
SLAF-seq data was operated using the software developed by Sun et al.20, and the genotyping methods with reference to Sun et al. (2013) and Wei et al.27. According to sequence similarity, the generated pair-end reads from SLAF-seq were clustered, and the reads could be inferred from one-to-one alignment by BLAT (-tileSize = 10-stepSize = 5). Identical reads were merged, and the reads with over 90% similarity sequences were grouped into one SLAF locus20. In each SLAF locus, minor allele frequency (MAF) evaluation was used to define alleles.
In order to ensure the quality of the genetic map, the following rules were applied to filter SLAFs: (1) removal of SLAFs with parents sequence depth of less than 10×; (2) removal of SLAFs with complete degree below 70%; (3) removal of SLAFs with serious distorted segregation (p-value < 0.01); (4) removal of SLAFs with more than eight SNPs. SLAFs that passed the four-step filtering process were considered as potential markers. Those polymorphic SLAF markers were then assorted into eight segregation patterns as follows: ab×cd, ef×eg, hk×hk, lm×ll, nn×np, aa×bb, ab×cc, and cc×ab (Table 4). Since the RIL mapping populations were derived from two homozygous peanut varieties with a genotype of aa or bb, only the SLAF markers which had segregation patterns of aa×bb were used in map construction.
SSR primers (Supplementary Supplementary Dataset 3) were selected from several previous articles. The PCR reactions conditions used were as follows: 3 min denaturation at 94 °C; 35 cycles of 1 min at 94 °C, 30 s at 55 °C, and 90 s at 72 °C; and then a final extension of 10 min at 72 °C and storage at 4 °C. The PCR products were separated on 8% PAGE gel. The segregation data for SSR markers in the same population were detected, and 68 of them were used for construction of the genetic map.
High-density genetic map construction
Since next-generation sequencing data may include many genotyping errors and deletions, which could reduce the quality of the high-density linkage maps, High Map Strategy was used to order the SLAF and SSR markers, and to correct the genotyping errors in the linkage groups49. After genotyping of the 146 RILs, 2-point linkage analysis was performed for efficient markers. All high-quality SLAF and SSR markers were allocated to 20 LGs on the basis of their locations on chromosomes. A detailed MST map algorithm was used to order the SLAF and SSR markers50, and the SMOOTH algorithm was used to correct the genotyping errors as per the marker ordering51. All LGs were processed as follows: primary markers was used to order the LGs by their location on chromosomes; according to the relationship between the ordered markers, genotyping errors or deletions were corrected using the SMOOTH algorithm; the minimum spanning tree map was used to order the map; and the SMOOTH algorithm was used to correct the newly ordered genotypes. After four or more cycles of this processing, 20 high-quality maps were obtained. The Kosambi mapping function was used to estimate the map distances52.
QTL analysis
Based on the integrated map, significant loci associated with oleic acid, linoleic acid and the ratio of oleic acid to linoleic acid were identified based on LOD scores larger than the 5% cutoff value determined through 1,000 permutation tests using the CIM method from the ‘qtl’ package of R.MapQTL6.053, which was used to conduct logarithm of odds and percentage of phenotypic variance explained analysis, and interval mapping (IM)54 was used to detect QTLs for target traits. According to this method55, the fatty acids content was detected by gas chromatography (GC). Oleic and linoleic acids were investigated in 2010 (Laixi and Jining), 2011(Laixi and Baoding), 2012 (Laixi and Dongying) and 2014 (Laixi).
References
Food and Agriculture Organization of the United Nations. http://faostat3.fao.org/download/Q/QC/E (2016).
Herselman, L. et al. Identification and mapping of AFLP markers linked to peanut (Arachis hypogaea L.) resistance to the aphid vector of groundnut rosette disease. Theor Appl Genet 109, 1426–1433 (2004).
Varshney, R. K. et al. The first SSR-based genetic linkage map for cultivated groundnut (Arachis hypogaea L.). Theor Appl Genet 118, 729–739 (2009).
Hong, Y. et al. A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome. BMC Plant Biol 10, 17 (2010).
Sujay, V. et al. Quantitative trait locus analysis and construction of consensus genetic map for foliar disease resistance based on two recombinant inbred line populations in cultivated groundnut (Arachis hypogaea L.). Mol Breed 30, 773–788 (2012).
Gautami, B. et al. An international reference consensus genetic map with 897 marker loci based on 11 mapping populations for tetraploid groundnut (Arachis hypogaea L.). PLoS One 7, e41213 (2012).
Qin, H. et al. An integrated genetic linkage map of cultivated peanut (Arachis hypogaea L.) constructed from two RIL populations. Theor Appl Genet 124, 653–664 (2012).
Wang, H. et al. Development and characterization of BAC-end sequence derived SSRs, and their incorporation into a new higher density genetic map for cultivated peanut (Arachis hypogaea L.). BMC Plant Biol 12, 10 (2012).
Shirasawa, K. et al. In silico polymorphism analysis for the development of simple sequence repeat and transposon markers and construction of linkage map in cultivated peanut. BMC Plant Biol 12, 80 (2012).
Zhou, X. et al. Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genomics 15, 351 (2014).
Pandey, M. K. et al. Identification of QTLs associated with oil content and mapping FAD2 genes and their relative contribution to oil quality in peanut (Arachis hypogaea L.). BMC Genet 15, 133 (2014).
Chen, W. et al. Quantitative trait locus analysis for pod- and kernel-related traits in the cultivated peanut (Arachis hypogaea L.). BMC Genet 17, 25 (2016).
Zhao, Y. et al. QTL mapping for bacterial wilt resistance in peanut (Arachis hypogaea L.). Mol Breed 36, 13 (2016).
Huang, L. et al. Quantitative trait locus analysis of agronomic and quality-related traits in cultivated peanut (Arachis hypogaea L.). Theor Appl Genet 128, 1103–1115 (2015).
Huang, L. et al. Development and deployment of a high-density linkage map identified quantitative trait loci for plant height in peanut (Arachis hypogaea L.). Sci Rep 6, 39478 (2016).
Burow, M. D., Simpson, C. E., Starr, J. L. & Paterson, A. H. Transmission Genetics of Chromatin From a Synthetic Amphidiploid to Cultivated Peanut (Arachis hypogaea L.): Broadening the Gene Pool of a Monophyletic Polyploid Species. Genetics 159, 823 (2001).
Fonceka, D. et al. Genetic mapping of wild introgressions into cultivated peanut: a way toward enlarging the genetic basis of a recent allotetraploid. BMC Plant Biol 9, 103 (2009).
Shirasawa, K. et al. Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume genomes. DNA Res 20, 173–184 (2013).
Bertioli, D. J. et al. The use of SNP markers for linkage mapping in Diploid and Tetraploid peanuts. Genes Genomes and Genetics 4, 89–96 (2014).
Sun, X. et al. SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS One 8, e58700 (2013).
Mao, D. et al. Multiple cold resistance loci confer the high cold tolerance adaptation of Dongxiang wild rice(Oryza rufipogon) to its high-latitude habitat. Theor Appl Genet 128, 1359–1371 (2015).
Zhao, X. et al. Loci and candidate gene identification for resistance to Sclerotinla sclerotiorum in soybean (Glycine max L. Merr.) via association and linkage maps. The Plant Journal 82, 245–255 (2015).
Zhang, Y. et al. Construction of a high-density genetic map for sesame based on large scale marker development by specific length amplified fragment (SLAF) sequencing. BMC Plant Biol 13, 141 (2013).
Huang, S. et al. Draft genome of the kiwifruit Actinidia chinensis. Nat Commun 4, 2640 (2013).
Qi, Z. et al. A high-density genetic map for soybean based on specific length amplified fragment sequencing. PLoS One 9, e104871 (2014).
Zhang, J. et al. High-density genetic map construction and identification of a locus controlling weeping trait in an ornamental woody plant (Prunus mume Sieb. et Zucc). DNA Res 22, 183–191 (2015).
Wei, Q. et al. An SNP-based saturated genetic map and QTL analysis of fruit-related traits in cucumber using specific-length amplified fragment (SLAF) sequencing. BMC Genomics 15, 1158 (2014).
Xu, X. et al. A high-density genetic map of cucumber derived from Specific Length Amplified Fragment sequencing (SLAF-seq). Front Plant Sci 5, 768 (2014).
Zhu, W. Y. et al. A High-Density Genetic Linkage Map for Cucumber (Cucumis sativus L.): Based on Specific Length Amplified Fragment (SLAF) Sequencing and QTL Analysis of Fruit Traits in Cucumber. Front Plant Sci 7, 437 (2016).
Zhao, X. et al. Construction of high-density genetic linkage map and identification of flowering-time QTLs in orchardgrass using SSRs and SLAF-seq. Sci Rep 6, 29345 (2016).
Liu, T. et al. Construction of the first high-density genetic linkage map of Salvia miltiorrhiza using specific length amplified fragment (SLAF) sequencing. Sci Rep 6, 24070 (2016).
Li, B. et al. Construction of a high-density genetic map based on large-scale markers developed by specific length amplified fragment sequencing (SLAF-seq) and its application to QTL analysis for isoflavone content in Glycine max. BMC Genomics 15, 1086 (2014).
Qiu, G. F. et al. A second generation SNP and SSR integrated linkage map and QTL mapping for the Chinese mitten crab Eriocheir sinensis. Sci Rep 7, 39826 (2017).
Liu, C. et al. Development of a high-density genetic linkage map and identification of flowering time QTLs in adzuki bean (Vigna angularis). Sci Rep 6, 39523 (2016).
Guo, G. et al. Rapid identification of QTLs underlying resistance to Cucumber mosaic virus in pepper (Capsicum frutescens). Theor Appl Genet 130, 41–52 (2017).
Cregan P, J. T. et al. An integrated genetic linkage map of the soybean. Crop Sci 39, 1464–1490 (1999).
Wang, W. et al. Construction and analysis of a high-density genetic linkage map in cabbage (Brassica oleracea L. var. capitata). BMC Genomics 13, 523 (2012).
Menz, M. A. et al. A high-density genetic map of Sorghum bicolor (L.) Moench based on 2926 AFLP, RFLP and SSR markers. Plant Mol Biol 48, 483–499 (2002).
Lu, H., Romeroseverson, J. & Bernardo, R. Chromosomal regions associated with segregation distortion in maize. Theor Appl Genet 105, 622–628 (2002).
Li, H. et al. Construction of a high-density composite map and comparative mapping of segregation distortion regions in barley. Mol Genet Genomics 284, 319–331 (2010).
Tai, G. C. C., Seabrook, J. E. A. & Aziz, A. N. Linkage analysis of anther-derived monoploids showing distorted segregation of molecular markers. Theor Appl Genet 101, 126–130 (2000).
Mace, E. S. et al. A consensus genetic map of sorghum that integrates multiple component maps and high-throughput Diversity Array Technology (DArT) markers. BMC Plant Biol 9, 13 (2009).
Ji, G. et al. Construction of a high-density genetic map using specific-locus amplified fragments in sorghum. BMC Genomics 18, 51 (2017).
Xu, S. Z. Quantitative trait locus mapping can benefit from segregation distortion. BMC Genetics 180, 2201–2208 (2008).
Xu, S. & Hu, Z. Mapping Quantitative Trait Loci Using DistortedMarkers. J Plant Genomics 2009, 410825 (2009).
Zhang, L. et al. Effects of missing marker and segregation distortion on QTL mapping in F2 populations. Theor Appl Genet 121(6), 1071–1082 (2010).
Wang, M. L. et al. Assessment of Oil content and fatty acid composition variability in different peanut subspecies and botanical varieties. Plant Genetic Resources 8, 71–73 (2009).
Pan, L. J. et al. Effect of high-oleate peanut on serum lipids level in rats. Journal of Peanut Science 38(3), 6–9 (2009).
Liu, D. et al. Construction and analysis of high-density linkage map using high-throughput sequencing data. PLoS One 9, e98855 (2014).
Wu, Y., Bhat, P. R., Close, T. J. & Lonardi, S. Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph. Plos Genetics 4, e1000212 (2008).
Os, H. V., Stam, P., Visser, R. G. F. & Eck, H. J. V. SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor Appl Genet 112, 187–194 (2005).
Kosambi, D. D. The estimation of map distances from recombination values. Annals of Human Genetics 12, 172–175 (1943).
Ooijen, J. W. V. MapQTL, Software for the mapping of quantitative trait loci in experimental populations. Kyazma BV, Wageningen 63 (2004).
Jansen, R. C. Interval mapping of multiple quantitative trait loc. Genetics 135, 2005–211 (1993).
Chen, J. et al. Identification of SSR markers linked to high oleic content in peanut (Arachis hypogaea L.) by RIL Population. Journal of Peanut Science 45, 1–7 (2016).
Acknowledgements
We would like to thank Xuelin Luo for carrying out phenotype data analysis and Ye Chu for providing some suggestions for modifications. This research was partially supported by the Breeding Project from the Department of Science & Technology of Shandong Province (2017LZGC003); Achievements Programs of Shandong Academy of Agricultural Sciences; Visiting Scholar Project of Shandong Academy of Agricultural Sciences; Agricultural Scientific and Technological Innovation Project of Shandong Academy of Agricultural Sciences (CXGC2016-02); the Shandong Key Research and Development Program (2016ZDJS10A02); and the National Ten Thousand Youth Talents Plan of 2014 (W02070268).
Author information
Authors and Affiliations
Contributions
J.C. and X.X.H. conceived of the study, designed the experiments and finished the final version of the manuscript. X.X.H., S.Z.Z. and H.R.M. established the RIL mapping population, carried out data analysis and drafted the manuscript. F.G.C. and W.Q.Y. participated in data collection and prepared the genomic DNA for sequencing. Y.S., T.T.X., N.C., X.Y.C. and Z.M.Z. helped collect statistics and draft the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hu, X.H., Zhang, S.Z., Miao, H.R. et al. High-Density Genetic Map Construction and Identification of QTLs Controlling Oleic and Linoleic Acid in Peanut using SLAF-seq and SSRs. Sci Rep 8, 5479 (2018). https://doi.org/10.1038/s41598-018-23873-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-23873-7
This article is cited by
-
Detection of two homologous major QTLs and development of diagnostic molecular markers for sucrose content in peanut
Theoretical and Applied Genetics (2024)
-
Designing future peanut: the power of genomics-assisted breeding
Theoretical and Applied Genetics (2024)
-
Genetic analysis and exploration of major effect QTLs underlying oil content in peanut
Theoretical and Applied Genetics (2023)
-
Discovery of common loci and candidate genes for controlling salt-alkali tolerance and yield-related traits in Brassica napus L.
Plant Cell Reports (2023)
-
Development of molecular resources for the genetic improvement of noug (Guizotia abyssinica (L.f) Cass): a mini review
CABI Agriculture and Bioscience (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.