Construction of a high density linkage map in Oil Palm using SPET markers

Herrero, Javier; Santika, Baitha; Herrán, Ana; Erika, Pratiwi; Sarimana, Upit; Wendra, Fahmi; Sembiring, Zulhermana; Asmono, Dwi; Ritter, Enrique

doi:10.1038/s41598-020-67118-y

Download PDF

Article
Open access
Published: 19 June 2020

Construction of a high density linkage map in Oil Palm using SPET markers

Javier Herrero¹,
Baitha Santika²,
Ana Herrán¹,
Pratiwi Erika²,
Upit Sarimana²,
Fahmi Wendra²,
Zulhermana Sembiring²,
Dwi Asmono² &
…
Enrique Ritter¹

Scientific Reports volume 10, Article number: 9998 (2020) Cite this article

2091 Accesses
13 Citations
1 Altmetric
Metrics details

Subjects

Abstract

A high-density genetic linkage map from a controlled cross of two oil palm (Elaeis guineensis) genotypes was constructed based on Single Primer Enrichment Technology (SPET) markers. A 5K panel of hybridization probes were used for this purpose which was derived from previously developed SNP primers in oil palm. Initially, 13,384 SNPs were detected which were reduced to 13,073 SNPs after filtering for only bi-allelic SNP. Around 75% of the markers were found to be monomorphic in the progeny, reducing the markers left for linkage mapping to 3,501. Using Lep-MAP3 software, a linkage map was constructed which contained initially 2,388 markers and had a total length of 1,370 cM. In many cases several adjacent SNP were located on the same locus, due to missing recombination events between them, leading to a total of 1,054 loci on the 16 LG. Nevertheless, the marker density of 1.74 markers per cM (0.57 cM/marker) should allow the detection of QTLs in the future. This study shows that cost efficient SPET markers are suitable for linkage map construction in oil palm and probably, also in other species.

Linkage-based genome assembly improvement of oil palm (Elaeis guineensis)

Article Open access 29 April 2019

Effect of marker segregation distortion on high density linkage map construction and QTL mapping in Soybean (Glycine max L.)

Article Open access 31 May 2019

Construction of a dense genetic map of the Malus fusca fire blight resistant accession MAL0045 using tunable genotyping-by-sequencing SNPs and microsatellites

Article Open access 01 October 2020

Introduction

One of the most productive oil crops in the world is the oil palm. According the USDA (United States Department of Agriculture), the total world vegetable oil production of 2019/2020 (until December 2019) was 207.06 million MT, with palm oil in the first place (75.69 million MT or 36.6%), followed by soybean oil (56.73 Million MT) and rapeseed oil (27.04 Million MT). Oil palm (Elaeis guineensis Jacq.) is a perennial monocotyledonous tropical crop species that belongs to the family of Arecaceae, which originate from the tropical rain forest of Central and West Africa.

Conventional breeding based on phenotypic observations in the progenies is generally applied in oil palm breeding. However, it needs more space and time for selecting promising crosses, particularly when increasing parental biodiversity. Since the land issues are spreading¹, breeders are starting slowly to implement molecular breeding techniques for improving the oil palm, both production and quality, without enlarging more the land use.

Several possibilities for genetic material selection and improvement in E. guineensis using molecular breeding have been proposed by several authors^2,3,4. One way to overcome these issues consists of optimizing the molecular breeding through marker assisted selection^5,6. Selection of promising parental palms can be done already at seedling stage, without waiting for the palm to give information about production. Marker assisted selection (MAS) can also reduce the land use based on progeny performance predictions and earlier selection during the nursery step, reducing in this way the number of progenies for evaluations. MAS usually requires prior knowledge on the distribution of quantitative trait loci (QTL) for a targeted trait in the genome and the underlying candidate genes⁷.

Large number of genomic resources has been generated in the last decades for this purpose in oil palm, including the whole genome sequence. These resources can be accessed at the National Center of Biotechnology (NCBI, USA) in the Taxonomy database (http://www.ncbi.nlm.nih.gov/Taxonomy/) or at PalmXplore from Malaysian Palm Oil Board (MPOB)⁸ (http://palmxplore.mpob.gov.my/palmXplore). Also different molecular and statistical techniques have been applied in oil palm in molecular breeding, such as Genome wide Association studies (GWAS) based on GBS “genotyping by sequencing” for candidate gene detection⁹ or Genomic Preselection based on GBS for breeding¹⁰. These techniques were applied on whole germplasm collections.

In the past, a more classical way of molecular breeding represented the construction of genetic linkage maps in controlled crosses and posterior QTL analyses. Markers close to the QTL locations could be applied for MAS, and if the map was anchored to a physical map, QTL locations could be searched for potential co-located candidate genes with a relevant biological meaning influencing a particular trait.

Various examples are available in oil palm. For example, Billotte et al.¹¹ performed QTL detection for different productive and oil quality traits by multi-parent linkage mapping in oil palm. Montoya et al.¹² identified 19 QTL related to fatty acid composition in an interspecific pseudo-backcross between E.oleifera and E. guineensis and Seng et al.¹³ mapped QTLs for oil yield components in an elite oil palm cross.

Despite more sophisticated analysis methods which are nowadays available, classical linkage mapping and QTL analyses in a controlled cross represents still a valid and useful tool in the case of rare genotypes with exceptional trait expression, as for example a rare resistance to a particular disease, for which no collection of genotypes with varying resistance levels are available.

Different marker types have been used widely in oil palm for constructing linkage map which include RFLPs, AFLPs, SSRs, and SNPs. Restriction associated DNA tagging (RAD) or double digestion RAD (ddRAD) have identified large amount of SNPs and produced remarkable maps^14,15. These are low cost approaches for linkage mapping, relying on target sequences between restriction sites all over the genome, including coding and non-coding regions. Recently, specific sequence capture (probe based) methods have become also a cost effective alternative. Since only a small amount of gDNA from each sample has to be provided beside target genome and SNP information, affordable sequencing is also possible for modest laboratories. A 5K probe panel should ensure a good coverage of target loci and the detection of polymorphisms in the mapping population¹⁶. The single primer enrichment technology (SPET), recently developed by NUGEN, is a targeted sequencing technology which has been used up to now for biomedical applications^17,18 and in plants for genotyping in monocot (Z. mays) lines or in a natural black poplar (P. nigra) population¹⁹, as well as for characterizing large germplasm sets from tomato and egg plant²⁰. According to Barchi et al.²⁰, SPET represents a valid alternative to GBS and micro arrays, and it allows users to customize the panel of target markers and provides reliable fingerprinting of accessions maintained in gene banks. Also for linkage mapping, specific capture ensures the detection of existing population polymorphisms at all targeted positions. The scalable probe design can maximize the number of target locations and the sequencing of all SNPs in the genomic regions for which probes have been designed. Thus, this technique appears to be especially well suited for the construction of dense genetic maps¹⁶.

Our main objective was to provide a cost-effective alternative for obtaining a sufficiently saturated linkage map using own genetic resources, a small sample number and considering a limited laboratory infrastructure. This study tests the suitability of the novel SPET technology for constructing a high-density linkage map which has been never used before in oil palm and can be further exploited for QTL analyses and breeding in the future.

Results

The service provider delivered a VCF file with a total of 13,384 SNP markers located on all 16 chromosomes of oil palm. The SNP distribution over chromosomes and their correspondence to individual loci was analysed. Table 1 summarizes the results. The SNP numbers per chromosome varied from 443 to 1634 with an average of 836.5 SNP per chromosome. They corresponded to a total of 4308 (out of 5000) loci, where each locus represented a designed SPET probe. The corresponding loci varied from 150 to 537 with an average of 269.3 loci per chromosome. In accordance, the average numbers of detected SNP per probe ranged from 2.89 to 3.44, and on average 3.12 SNP per probe were observed. However, in many cases 5 or more SNP per locus were detected. For the probe design of each SNP the OPGP reference genome sequence of dura DELI genotype PO4906D was used. The frequently observed additional SNP resulted from quite different sequences in the more exotic parental genotypes from Cameroon and Nigeria at the SNP locations.

Table 1 Distribution of SNP over chromosomes and number of probe loci.

Full size table

SNPs were filtered for a maximum of 20% missing values. A total of 13,256 SNPs remained. SNPs were also filtered for bi-allelic states. A total of 183 SNPs showed more than 2 allelic states, leading to 13,073 remaining SNPs.

After imputing missing values with Beagle 5 Software, we analysed the expected segregations of SNP markers. Table 2 shows the expected segregation ratios depending on the parental configuration (PC). In theory, nine unphased PC are possible. In four cases, no segregation was expected (both parents were homozygous for one SNP level), while in four cases 1:1 segregations occurred (only one parent was homozygous for one SNP allele, the other parent was heterozygous) and in one case a 1:2.1 segregation was expected (both parents were heterozygous). Table 2 shows also the observed numbers of SNP cases for each parental configuration.

Table 2 Expected Segregations depending on the parental SNP configurations and observed SNP numbers for each parental configuration.

Full size table

With respect to the classification of SNP depending on the parental configuration, the highest numbers of SNP were homozygous for the reference allele in both parents (P1:0/0-P2:0/0). This was the case for 8,652 SNP (66.2%) for which no segregation was expected. In contrary, for the PC with homozygous states for the alternative allele (1/1-1/1) only 852 cases were found. Considering other PC without expected segregations, remarkable low numbers of cases were observed for the PC: 1/1-0/0 and 0/0-1/1 which occurred in only 31 and 37 cases, respectively. Thus, a total of 9,572 SNP (73.2%) revealed a non-segregating PC. In general, there was a good agreement of the observed and the expected segregations for the segregating PC. The other PC frequencies with expected segregations varied between 234 and 1,063 SNP.

Non-segregating SNPs were discarded from the imputed VCF file and a total of 3,501 SNPs (26.8%) remained for further processing by Lep-MAP. Although several SNP showed high distortions in their segregation ratios, they were kept since Lep-MAP would filter them automatically during the process of linkage map construction. Table 3 summarizes the results obtained by Lep-MAP. Initially 20 linkage groups were obtained. The first 16 linkage groups corresponded to the 16 chromosomes of the oil palm genome. They contained a total of 2,388 markers and had a total length of 1,370 cM. Individual linkage groups contained between 71 and 280 SNP markers each and varied between 57.1 and 154.7 cM in length. This corresponded to an average marker density of 1.74 marker per cM (0.57 cM/marker). In addition, four smaller linkage groups were obtained with a total of 27 markers. These markers belonged to other chromosomes, but could not be placed on the corresponding linkage groups.

Table 3 Characteristics of the Linkage Map obtained by Lep-MAP Software.

Full size table

The number of SNPs which were supposed to map to other chromosomes on the reference map were determined. A total of 90 “misplaced” markers (3.8%) were identified in this way, varying between 0 and 14 SNPs for individual linkage groups. These markers did not affect the total lengths of the linkage groups, since they were not located at the distal ends. However, the remaining marker numbers were slightly reduced to 2298 as shown in Table 3 (column “NoM”). The remaining markers had all distinct locations on the physical map, but revealed in many cases for adjacent markers identical loci on the linkage map, since no recombination was detected between them. Table 3 shows also the number of loci which were obtained in this way for each LG. The total number of loci on the linkage map was 1,054 and they varied between 35 and 122 for the individual linkage groups (Table 3).

We also analysed the expected order of markers within linkage groups. Frequently we observed smaller rearrangements between adjacent markers, but occasionally also some larger ranking differences for certain markers. Figure 1 visualizes the differences between observed and expected marker orders for each LG considering all 2,298 SNP markers. In general, there was a good agreement since markers followed a putative regression line on each chromosome. However, also some outliers could be detected, which were located clearly apart from these regression lines. This was for example the case for one isolated individual SNP on LG1, three SNP on LG8, LG9 and LG10 and several other ones on different LG. Also larger gaps can be observed in Fig. 1: more vertical gaps such those in LG1 or LG15 represented an increase in recombination events for markers which were closely located on the physical map, while more horizontal gaps, like those which could be seen in LG4, LG5 and other LG indicate a low SNP coverage in the particular genomic regions.

Figure 2 presents the obtained linkage map considering only one marker per locus. Since the SNP were numbered consecutively according to their genomic location on the physical map, their SNP numbers should increase along the chromosomes. Smaller rearrangements can be identified, if consecutive SNP numbers decrease (for example SNP185/176 on LG1 at 12.7 and 13.2 cM, respectively).

Adjacent markers with over 1 million bp differences in their physical orders are indicated in italics and not in bold. The number of markers deviated in this way from their expected order are also indicated in Table 3 in the column “devM” for each LG. The first number indicates the number of markers deviated by over 1 million bp. A total number of 115 markers (11%) showed this deviation. They range from 0 markers for LG14 to 19 markers for LG8. The numbers in brackets in the column “devM” indicate the marker numbers which were deviated by over 2 million bp. The total number was reduced to almost 50% (62 markers, 5.9%) and nine LG had less than four deviated markers.

Discussion

The SPET approach has been used up to now successfully for biomedical applications, for genotyping in maize and black poplar and for characterizing large germplasm collections of tomato and eggplant. This is the first study which applies this technology for constructing a linkage map in oil. Genetic linkage mapping in controlled crosses is based on recombination frequencies (RF) between adjacent markers. Linked markers are also expected to be in linkage disequilibrium (LD) used generally in population analyses, according to the similarity of the formulas to calculate RF and LD.

A missing data threshold of 20% was used in our study, but the values were imputed using Beagle 5.0 Software which has a very low error rate <1%²¹. Obviously the quality of a map will be better with less missing values, but also Astorkia et al.^22,23 published association mapping studies in oil palm using a missing value rate of 20%.

According to Scheben et al.²⁴, the single primer enrichment technology combines in a single approach both, targeted analysis of SNPs, thus being comparable with genotyping arrays, and complexity reduction typical of GBS approaches. Furthermore, SPET provides the ability of multiplexing thousands of samples in a single sequencing run, which can be genotyped with tens thousands of probes and with a good coverage at target sites. Finally, thanks to the sequencing of the genomic regions around the target SNPs, SPET allows the discovery of thousands of novel SNPs not originally included in the panel²⁰. Also, in our study, on average 2.9 SNPs were detected per designed probe.

Other Next Generation Sequencing (NGS) technologies have been applied for linkage map construction previously. Carrasco et al.²⁵ reported the construction of high-density linkage maps of Japanese plum (Prunus salicina Lindl.) using SNP markers, obtained with a GBS strategy. The consensus map was built using 732 SNPs which spanned 617 cM with an average of 0.96 cM between adjacent markers. Using SNP-GBS markers, Su et al.²⁶ published a similar approach for maize and Gutierrez-Gonzalez²⁷ for wheat. Tang et al.²⁸ presented a high-quality genetic linkage map of 2572 SNP markers for Stylosanthes guianensis using the amplified-fragment single nucleotide polymorphism and methylation (AFSM) approach. Yi et al.²⁹ used a similar technology, specific length amplified fragment sequencing (SLAF-seq) to construct an SNP-based high-density linkage map for flax (Linum usitatissimum L.), and Zhang et al.³⁰ used RAD markers to construct a high-density map in channel catfish. Compared to these approaches, SPET markers have the great advantage that the specific SNP probes can be designed exclusively in coding regions as in our case. Moreover, specific genomic regions of interests can be considered in detail for targeting, for example promoters or other regulatory regions of a gene.

In the case of the oil palm, Bai et al.¹⁴ reported the largest saturated linkage map with 10,023 genome-wide SNP markers obtained by RAD-seq. A total of 5,727 SNPs were located in genes. More recently, Xia et al.¹⁵ mapped even 249,457 SLAF tags, representing an optimized version of ddRADseq, to the reference oil palm genome from MPOB using 200 individuals, but only 5064 SNPs were located within genic regions.

According to Machida-Hirano et al.³¹ a major limitation of restriction enzymes based methods is that they scan the genome at random loci. However, for many applications fine mapping, genotyping at specific loci, genes, or genomic regions is more critical. In this sense the valuable resources generated in the studies above can be exploited for fine mapping in specific QTL regions for traits of interest in the future.

Probe-based targeted sequencing represents a promising alternative to RADseq.³², and according to Mamanova et al.³³ target enrichment is a feasible way to bring the field of genomics into smaller laboratories. Previous technologies for genotyping at specific target regions were not as cost effective as RAD-seq techniques, since library preparation was more laborious and time consuming, and only provided limited multiplexing options^31,32. However, SPET is mainly PCR based, reducing considerably the costs in this way²⁰.

Capture techniques are useful for the construction of dense genetic maps since sequencing will reveal all SNPs in the contig area for which the probes have been designed¹⁶. The SPET approach resembles a custom-made SNP array which has been used for example by Cui et al.³⁴ to produce a high-density genetic map based on a Wheat 660K SNP array or by Joshi et al.³⁵ for constructing a high-density linkage Map in Nile Tilapia using a 58K SNP array. Compared to these array technologies, PCR based SPET does not require the development of a new, custom-made hardware device such as a chip which is used for a reduced sample size of only 96 genotypes as in our study.

A large portion of non-segregating probes were observed in our mapping population (75%). This finding is not surprising since the SNPs were obtained from shot-gun sequencing of a panel of different Elaeis guineensis and E. oleifera genotypes. Thus, many detected SNPs could be specific for E. oleifera and would not segregate in E. guineensis. This reduced polymorphism is in part compensated by many additional SNP which were found around the target SNPs. Nevertheless, 2388 out of 3267 segregating SNP (73%) could be placed on the linkage map in this study. We have used Lep-MAP3 Software to construct our linkage map. The algorithms of Lep-MAP3 can analyse large marker numbers, but also low-coverage datasets and reduce data filtering and curation on any data. This yields more markers in the final maps with less manual work even on problematic datasets³⁶. Bai et al.¹⁴ obtained a linkage map of 1,480 RAD markers used to identify interesting QTL for oil content traits. Comparing techniques and the number of markers (3,501 SPET markers), we believe that the SPET approach should be sufficient for our purposes.

In this study we have used only one panel of 5K loci for the analyses. However, it is worth to mention that these analyses could be extended to additional 5K pools, depending on the number of available SNPs. Also, other oil palm genome assemblies and SNPs such as available at Malaysian Palm Oil board^2,37 could be considered. In this sense, our study represents more a proof of concept approach about the suitability of SPET markers for linkage map construction. It is independent of the particular reference genome used and the concrete target genes identified by the approach and could probably also be extended to other species. One straightforward application of the obtained linkage map is the exploitation for QTL analyses of new traits of interest. However, the progeny has been planted in 2018 in the field and flowering is expected for 2021, when trait recording could be initiated.

In our study we observed a total of 3.8% of markers which were misplaced on our linkage map, according to their genomic location on the physical map. Also, for some other SNPs the expected marker order deviated considerably from the expected locations on the linkage groups. It is difficult to say if this was due to scoring errors of the molecular data for linkage map construction, or due to errors in the assembly of the genome sequences. In any case, these discrepancies should be exploited for revising carefully potential alternatives in genome assemblies leading to a comprehensive validation and refinement of de novo genome assemblies. Unlike simply genotyping individual accessions, the expected segregations in a controlled cross are known for a given parental configuration as shown in Table 2. Thus, potential scoring errors are directly visible, and in fact we have observed this in several cases in our study. No technology is perfect and data analyses have to scope with such errors, as done by the algorithms of Lep-MAP3.

Nevertheless, even for the results obtained with only one single 5K panel, there should be sufficient markers to detect QTLs for traits of interest, in order to identify in their neighbourhood potential underlying genes with a relevant biological meaning which could explain a specific QTL. This is particularly the case when applying interval mapping techniques for QTL detection³⁸, since problem of miss orders of markers can be circumvented with this more local approach. Programs such as FastQTL are able to handle large amounts of markers for this purpose³⁹. Considering that often several SNP markers with different genomic locations map to the same locus in the obtained genetic linkage map, this information should be used to revise the whole genomic region for potential candidate gene detection. Besides, the density of the bait design nearby known QTL can be very easily adapted¹⁶. This “capture-assisted QTL mapping” of important phenotypic traits is a parallel utility⁴⁰ and represents an advantage for breeding purposes.

Our results demonstrate that SPET represents a valid alternative to random complexity reduction methods such as GBS and micro arrays; it allows users to customize the panel of target markers and provides also a reliable technique for producing genetic linkage maps from controlled crosses. This study differs from other linkage map construction studies in the oil palm, due to the novel idea that implies the use of targeted high-throughput SNP markers for map construction which could be also affordable for modest laboratories with limited infrastructure. The map can be the first step for future analysis of quantitative trait locus of interesting traits for crop breeding. Once QTL regions have been identified, the map can be saturated with new specific probes, localized in these QTL regions for fine mapping of loci controlling specific traits.

Materials and Methods

Plant material

The genetic linkage map was derived from a controlled cross between a wild Dura accession from East Cameroon (Cam08) and an advanced Pisifera breeding clone from Nigeria (P320/23). A total of 94 Tenera palms from this cross as well as the two parents were genotyped using SPET Technology.

Generation of SPET markers

Genomic DNA was obtained from around 50 mg of young leaves following the protocol of innuPREP Plant DNA extraction kit (Analytic Jena, Germany). Quality and quantity of DNA from parents and progeny samples were verified via agarose gel and Qubit fluorometer (Life Technologies). Library preparation and sequencing was performed by IGA Technology Services (IGATech Udine, Italy), using a NextSeq. 500 sequencing platform (Illumina, San Diego, CA, USA) in single-end mode (150 bp).

SNP data were retrieved from the Oil Palm Research Project (OPGP)⁴¹ and the whole genome sequence of the dura DELI genotype PO4906D from the OPGP project was used to design the probes for the single primer enrichment technology (NuGen. San Carlos CA, United States). A 5K panel of intragenic SNPs was selected, thoroughly covering the sixteen chromosomes of oil palm, since the main purpose was to construct a linkage map. This panel was employed to mine alleles of the genotyped parents and progeny genotypes using the Allegro Targeted Genotyping V2 procedure (IGATech Udine, Italy).

Sequence data processing and linkage map construction

VCFtools⁴² was used for filtering the VCF file delivered by the service provider for missing value ratios and biallelic single nucleotide polymorphism (SNP) states. Beagle 5 Software⁴³ was applied for imputing missing values in the filtered VCF file. For linkage map construction, Lep-MAP3 Software³⁶ was applied which can handle large number of markers. The imputed VCF file was used for this purpose, together with a “pedigree file” indicating the parents of the controlled cross. The analyses were performed on a UNIX computer. The process involved several steps starting with ParentCall2 which was used to call (possible missing or erroneous) parental genotypes, followed by the Filtering2 step which filtered markers based on high segregation distortion. A p-value of 0.001 was used as threshold for the Chi-square tests. The next module was SeparateChromosomes2 which assigned markers into linkage groups (LGs) by computing all pair-wise LOD scores between markers and joined markers with LOD score higher than a user given parameter. The commonly used LOD score of 5 was used for this purpose. The JoinSingles2All module assigned singular markers to existing LGs by computing LOD scores between each single marker and markers from the existing LGs. This module generated a new map file with additional markers assigned to linkage groups. OrderMarkers2 ordered the markers within each LG by maximizing the likelihood of the data for alternative orders. A total of 100 iterations per LG were used to obtain the final map. Marker distances were expressed in cM using the Kosambi Mapping function⁴⁴. Office software was used to extract and combine the information generated by the different output files of Lep-Map3.

Marker locations on the genetic map in cM and on the physical genome in base pairs (bp) were compared based on the original locations of the SNP markers on the 16 pseudomolecules of the mentioned OPGP genome.

References

Wich, S. A. et al. Will oil palm’s homecoming spell doom for Africa’s great apes? Current Biology. 24(14), 1659–1663, https://doi.org/10.1016/j.cub.2014.05.077 (2014).
Article CAS PubMed Google Scholar
Singh et al. Oil palm genome sequence reveals divergence of interfertile species in Old and New worlds. Nature. 500(7462), 335–339, https://doi.org/10.1038/nature12309 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Babu, B. K. & Mathur, R. K. Molecular breeding in oil palm (Elaeis guineensis): Status and Future perspectives. Progressive Horticulture 48(2), 123–131, https://doi.org/10.5958/2249-5258.2016.00051.8 (2016).
Article Google Scholar
Low et al. The Oil Palm Genome Revolution. Journal of Oil Palm Research 29(4), 456–468, https://doi.org/10.21894/jopr.2017.00018 (2017).
Article CAS Google Scholar
Xu, Y. et al. Whole-genome strategies for marker-assisted plant breeding. Molecular breeding. 29(4), 833–854, https://doi.org/10.1007/s11032-012-9699-6 (2012).
Article CAS Google Scholar
Seng, T. Y. et al. Marker-assisted selection and its application in breeding for high yielding short palms: the FGV approach. Planter 93, 255–268 (2017).
Google Scholar
Ong, A. L. et al. Linkage-based genome assembly improvement of oil palm (Elaeis guineensis). Sci. Rep 9(1), 1–9, https://doi.org/10.1038/s41598-019-42989-y (2019).
Article CAS Google Scholar
Sanusi, N. S. N. M. et al. PalmXplore: oil palm gene database. Database (Oxford). 2018, https://doi.org/10.1093/database/bay095 (2018).
Babu, B. K. et al. Genome wide association study (GWAS) and identification of candidate genes for yield and oil yield related traits in oil palm (Eleaeis guineensis) using SNPs by genotyping-based sequencing. Genomics 112(1), 1011–1020, https://doi.org/10.1016/j.ygeno.2019.06.018 (2020).
Article CAS PubMed Google Scholar
Cros, D. et al. Genomic preselection with genotyping-by-sequencing increases performance of commercial oil palm hybrid crosses. BMC Genomics 18, 839, https://doi.org/10.1186/s12864-017-4179-3 (2017).
Article PubMed PubMed Central Google Scholar
Billotte, N. et al. QTL detection by multi-parent linkage mapping in oil palm (Elaeis guineensis Jacq.). Theor Appl Genet 120(8), 1673–1687, https://doi.org/10.1007/s00122-010-1284-y (2010).
Article CAS PubMed PubMed Central Google Scholar
Montoya, C. et al. Quantitative trait loci (QTLs) analysis of palm oil fatty acid composition in an interspecific pseudo-backcross from Elaeis oleifera (H.B.K.) Cortés and oil palm (Elaeis guineensis Jacq.). Tree Genet. Genomes. 9, 1207–1225, https://doi.org/10.1007/s11295-013-0629-5 (2013).
Article Google Scholar
Seng, T. Y. et al. QTLs for oil yield components in an elite oil palm (Elaeis guineensis) cross. Euphytica 212(3), 399–425, https://doi.org/10.1007/s10681-016-1771-6 (2016).
Article Google Scholar
Bai, B. et al. Developing genome-wide SNPs and constructing an ultrahigh-density linkage map in oil palm. Sci Rep 8, 691, https://doi.org/10.1038/s41598-017-18613-2 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Xia, W. et al. Development of high-density SNP markers and their application in evaluating genetic diversity and population structure in Elaeis guineensis. Front. Plant Sci 10, 130, https://doi.org/10.3389/fpls.2019.00130 (2019).
Article PubMed PubMed Central Google Scholar
Holtz, Y. et al. Genotyping by Sequencing using specific allelic capture to build a high-density genetic map of durum wheat. Plos One 11, 5, https://doi.org/10.1371/journal.pone.0154609 (2016).
Article CAS Google Scholar
Scolnick, J. A. et al. An efficient method for identifying gene fusions by targeted RNA sequencing from fresh frozen and FFPE samples. Plos One, 10(7), https://doi.org/10.1371/journal.pone.0128916 (2015).
Nairismägi, M. L. et al. JAK-STAT and G-protein-coupled receptor signaling pathways are frequently altered in epitheliotropic intestinal T-cell lymphoma. Leukemia 30(6), 1311–1319, https://doi.org/10.1038/leu.2016.13 (2016).
Article CAS PubMed PubMed Central Google Scholar
Scaglione, D. et al. Single primer enrichment technology as a tool for massive genotyping: a benchmark on black poplar and maize. Annals of botany 124(4), 543–552, https://doi.org/10.1093/aob/mcz054 (2019).
Article PubMed PubMed Central Google Scholar
Barchi, L. et al. Single Primer Enrichment Technology (SPET) for high-throughput genotyping in tomato and eggplant germplasm. Front. Plant Sci. 10, 1005, https://doi.org/10.3389/fpls.2019.01005 (2019).
Article PubMed PubMed Central Google Scholar
Pook, T. et al. Improving Imputation Quality in BEAGLE for Crop and Livestock Data. G3: Genes, Genomes, Genetics 10(1), 177–188, https://doi.org/10.1534/g3.119.400798 (2020).
Article Google Scholar
Astorkia, M. et al. Association mapping between candidate gene SNP and production and oil quality traits in interspecific oil palm hybrids. Plants 8(10), 377, https://doi.org/10.3390/plants8100377 (2019).
Article CAS PubMed Central Google Scholar
Astorkia, M. et al. Detection of significant SNP associated with production and oil quality traits in interspecific oil palm hybrids using RARSeq. Plant Science 291, 110366, https://doi.org/10.1016/j.plantsci.2019.110366 (2020).
Article CAS PubMed Google Scholar
Scheben, A., Batley, J. & Edwards, D. Genotyping‐by‐sequencing approaches to characterize crop genomes: choosing the right tool for the right application. Plant biotechnology journal 15(2), 149–161, https://doi.org/10.1111/pbi.12645 (2017).
Article CAS PubMed PubMed Central Google Scholar
Carrasco, B. et al. Construction of a highly saturated linkage map in Japanese plum (Prunus salicina L.) using GBS for SNP marker calling. Plos One, 13(12), https://doi.org/10.1371/journal.pone.0208032 (2018).
Su, C. et al. High density linkage map construction and mapping of yield trait QTLs in maize (Zea mays) using the genotyping-by-sequencing (GBS) technology. Front. Plant Sci 8, 706, https://doi.org/10.3389/fpls.2017.00706 (2017).
Article PubMed PubMed Central Google Scholar
Gutierrez-Gonzalez, J. J., Mascher, M., Poland, J. & Muehlbauer, G. J. Dense genotyping-by-sequencing linkage maps of two Synthetic W7984× Opata reference populations provide insights into wheat structural diversity. Sci. Rep. 9(1), 1–15, https://doi.org/10.1038/s41598-018-38111-3 (2019).
Article CAS Google Scholar
Tang, Y. Q. et al. Construction of a high-density linkage map and QTL mapping for important agronomic traits in Stylosanthes guianensis (Aubl.) Sw. Sci. Rep 9(1), 1–7, https://doi.org/10.1038/s41598-019-40489-7 (2019).
Article ADS CAS Google Scholar
Yi, L. et al. Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology. Plos One, 12(12), https://doi.org/10.1371/journal.pone.0189785 (2017).
Zhang, S. et al. Construction of a high-density linkage map and QTL fine mapping for growth and sex related traits in channel catfish (Ictalurus punctatus). Frontiers in genetics 10, 251, https://doi.org/10.3389/fgene.2019.00251 (2019).
Article CAS PubMed PubMed Central Google Scholar
Machida-Hirano, R. & Niino, T. Potato genetic resources. In The Potato Genome, 11–30. Springer, Cham., https://doi.org/10.1007/978-3-319-66135-3_2 (2017).
Andrews, K. R. et al. Harnessing the power of RADseq for ecological and evolutionary genomics. Nature Reviews Genetics 17(2), 81, https://doi.org/10.1038/nrg.2015.28 (2016).
Article CAS PubMed PubMed Central Google Scholar
Mamanova, L. et al. Target-enrichment strategies for next-generation sequencing. Nature methods 7(2), 111, https://doi.org/10.1038/nmeth.1419 (2010).
Article CAS PubMed Google Scholar
Cui, F. et al. Utilization of a Wheat660K SNP array-derived high-density genetic map for high-resolution mapping of a major QTL for kernel number. Sci. Rep 7(1), 1–12, https://doi.org/10.1038/s41598-017-04028-6 (2017).
Article CAS Google Scholar
Joshi, R. et al. Development and validation of 58K SNP-array and high-density linkage map in Nile tilapia (O. niloticus). Frontiers in genetics 9, 472, https://doi.org/10.3389/fgene.2018.00472 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rastas, P. Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data. Bioinformatics 33(23), 3726–3732, https://doi.org/10.1093/bioinformatics/btx494 (2017).
Article CAS PubMed Google Scholar
Ong, P. W. et al. Development of SNP markers and their application for genetic diversity analysis in the oil palm (Elaeis guineensis). Genetics and Molecular Research 14(4), 12205–12216, https://doi.org/10.4238/2015.October.9.9 (2015).
Article CAS PubMed Google Scholar
Zobaer, A. M. et al. A Comparison on Some Interval Mapping Approaches for QTL Detection. Bioinformation 15(2), 90, https://doi.org/10.6026/97320630015090 (2019).
Article Google Scholar
Vanderzande, S. et al. High-quality, genome-wide SNP genotypic data for pedigreed germplasm of the diploid outbreeding species apple, peach, and sweet cherry through a common workflow. Plos One, 14(6), https://doi.org/10.1371/journal.pone.0210928 (2019).
Jones, M. R. & Good, J. M. Targeted capture in evolutionary and ecological genomics. Molecular ecology 25(1), 185–202, https://doi.org/10.1111/mec.13304 (2016).
Article PubMed Google Scholar
UMR - Joint Research Unit – AGAP (France). Report of Oil Palm Genome Project (OPGP) International Consortium, https://umr-agap.cirad.fr/recherche/equipes-scientifiques2/oil-palm-genome-projects-opgp (2016).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27(15), 2156–2158, https://doi.org/10.1093/bioinformatics/btr330 (2011).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L. et al. A one-penny imputed genome from next-generation reference panels. The American Journal of Human Genetics 103(3), 338–348, https://doi.org/10.1016/j.ajhg.2018.07.015 (2018).
Article CAS PubMed Google Scholar
Kosambi, D. D. The estimation of map distances from recombination values. Springer, New Delhi, 125–130, https://doi.org/10.1007/978-81-322-3676-4_16 (2016).

Download references

Acknowledgements

Authors would like to appreciate the work of Field and Molecular Breeding departments in the samples management and collection. The present study was fully supported by Sampoerna Agro TBK (Indonesia) as part of DAMASO collaboration project with NEIKER research centre (Spain).

Author information

Authors and Affiliations

NEIKER-Basque Institute for Agricultural Research and Development - Basque Research and Technology Alliance (BRTA). Campus Agroalimentario de Arkaute s/n, 01192, Arkaute, Spain
Javier Herrero, Ana Herrán & Enrique Ritter
Department of Research & Development, PT Sampoerna Agro Tbk., Jl. Basuki Rahmat No. 788, Palembang, 30127, Indonesia
Baitha Santika, Pratiwi Erika, Upit Sarimana, Fahmi Wendra, Zulhermana Sembiring & Dwi Asmono

Authors

Javier Herrero
View author publications
You can also search for this author in PubMed Google Scholar
Baitha Santika
View author publications
You can also search for this author in PubMed Google Scholar
Ana Herrán
View author publications
You can also search for this author in PubMed Google Scholar
Pratiwi Erika
View author publications
You can also search for this author in PubMed Google Scholar
Upit Sarimana
View author publications
You can also search for this author in PubMed Google Scholar
Fahmi Wendra
View author publications
You can also search for this author in PubMed Google Scholar
Zulhermana Sembiring
View author publications
You can also search for this author in PubMed Google Scholar
Dwi Asmono
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Ritter
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.H., B.S. and E.R. were involved in the study design, laboratory analysis, results evaluation, data analysis and the main manuscript writing. B.S., A.H., P.E., U.S. and F.W. were in charge of field work, samples management and contributed in parts of the manuscript. E.R., Z.S. and D.A. are responsible for the project direction, they were involved in the manuscript revision and the contribution of applicability objectives.

Corresponding author

Correspondence to Javier Herrero.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Herrero, J., Santika, B., Herrán, A. et al. Construction of a high density linkage map in Oil Palm using SPET markers. Sci Rep 10, 9998 (2020). https://doi.org/10.1038/s41598-020-67118-y

Download citation

Received: 28 March 2020
Accepted: 01 June 2020
Published: 19 June 2020
DOI: https://doi.org/10.1038/s41598-020-67118-y

This article is cited by

Genetic dissection of fruit maturity date in apricot (P. armeniaca L.) through a Single Primer Enrichment Technology (SPET) approach
- Irina Baccichet
- Remo Chiozzotto
- Marco Cirilli
BMC Genomics (2022)
Genome properties of key oil palm (Elaeis guineensis Jacq.) breeding populations
- Essubalew Getachew Seyum
- Ngalle Hermine Bille
- David Cros
Journal of Applied Genetics (2022)
Targeted genome-wide SNP genotyping in feral horses using non-invasive fecal swabs
- Stefan Gavriliuc
- Salman Reza
- Jocelyn Poissant
Conservation Genetics Resources (2022)
Oil palm in the 2020s and beyond: challenges and solutions
- Denis J. Murphy
- Kirstie Goggin
- R. Russell M. Paterson
CABI Agriculture and Bioscience (2021)
Molecular approaches for improving oil palm for oil
- Gen Hua Yue
- Bao Qing Ye
- May Lee
Molecular Breeding (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.