Insights into opium poppy (Papaver spp.) genetic diversity from genotyping-by-sequencing analysis

Hong, Uyen Vu Thuy; Tamiru-Oli, Muluneh; Hurgobin, Bhavna; Okey, Christopher R.; Abreu, Artur R.; Lewsey, Mathew G.

doi:10.1038/s41598-021-04056-3

Download PDF

Article
Open access
Published: 07 January 2022

Insights into opium poppy (Papaver spp.) genetic diversity from genotyping-by-sequencing analysis

Uyen Vu Thuy Hong^1,2,
Muluneh Tamiru-Oli^1,2,
Bhavna Hurgobin^1,2,
Christopher R. Okey³,
Artur R. Abreu³ &
…
Mathew G. Lewsey^1,2

Scientific Reports volume 12, Article number: 111 (2022) Cite this article

17k Accesses
18 Citations
36 Altmetric
Metrics details

Subjects

Abstract

Opium poppy (Papaver somniferum) is one of the world’s oldest medicinal plants and a versatile model system to study secondary metabolism. However, our knowledge of its genetic diversity is limited, restricting utilization of the available germplasm for research and crop improvement. We used genotyping-by-sequencing to investigate the extent of genetic diversity and population structure in a collection of poppy germplasm consisting of 91 accessions originating in 30 countries of Europe, North Africa, America, and Asia. We identified five genetically distinct subpopulations using discriminate analysis of principal components and STRUCTURE analysis. Most accessions obtained from the same country were grouped together within subpopulations, likely a consequence of the restriction on movement of poppy germplasm. Alkaloid profiles of accessions were highly diverse, with morphine being dominant. Phylogenetic analysis identified genetic groups that were largely consistent with the subpopulations detected and that could be differentiated broadly based on traits such as number of branches and seed weight. These accessions and the associated genotypic data are valuable resources for further genetic diversity analysis, which could include definition of poppy core sets to facilitate genebank management and use of the diversity for genetic improvement of this valuable crop.

Genetic diversity, population structure, and relationships of apricot (Prunus) based on restriction site-associated DNA sequencing

Article Open access 01 May 2020

Assessment of genetic diversity and SNP marker development within peanut germplasm in Taiwan by RAD-seq

Article Open access 25 August 2022

Population structure and genetic diversity in red clover (Trifolium pratense L.) germplasm

Article Open access 20 May 2020

Introduction

Opium poppy (Papaver somniferum L.) is one of the oldest cultivated plant species. Archaeological evidence shows that poppy has been cultivated and used for thousands of years, dating back to the earliest Neolithic ages^1,2,3,4. However, its origin and domestication history has remained unclear until recently. Domestication traits are poorly defined, making it difficult to distinguish domesticated and wild forms especially in archaeological records⁵. Several lines of evidence, based mainly on archaeological data and geographical distribution of cultivated and wild species, suggest the Mediterranean as the centre of poppy origin and domestication^5,6. Changes in capsule and seed sizes and capsule indehiscence, which is the retention of seed in the capsules, are believed to be amongst poppy domestication-related traits⁷.

Currently, poppy is widely cultivated as both a licit and illicit crop in Asia, Europe, Oceania and South America^8,9,10. It is a source of several benzylisoquinoline alkaloids (BIAs) including morphine, codeine, thebaine, papaverine and noscapine for the pharmaceutical industry and for the clandestine production of heroin. Poppy seeds are also used in the food industry for baking and extraction of edible oil, whilst the plant is grown for ornamental purposes in some countries due to its attractive flowers¹¹. Poppy cultivars used in food applications are required to contain no or negligible amounts of alkaloids¹². The availability of commercial cultivars with specific alkaloid profiles is also vital to meet the needs of the pharmaceutical industry and subsequent consumers¹². Obtaining codeine and thebaine from morphine-free plants can also contribute to preventing illicit production of the morphine-derived heroin.

Considerable poppy genetic diversity has been reported in several countries including India, Turkey, Czech Republic, and Australia^{13,14,15,16,17}. Germplasm collections of varying sizes exist in some of these countries^14,15,18,19. Additionally, a substantial number of poppy genetic resources are currently maintained as seeds in global genebanks. The Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) genebank in Germany has over 1100 accessions of poppy that were collected worldwide²⁰. A collection of similar size is maintained at the Institute of Protection of Biodiversity and Biological Safety in the Slovak University of Agriculture²¹. Germplasm collections provide the genetic and phenotypic diversity used in crop breeding and development. They are also vital resources for research aimed at dissecting the genetic and molecular basis of essential plant processes including secondary/specialized metabolism. However, these resources are underexploited in poppy partly because the genetic diversity of the available germplasm has not been studied in detail. Additionally, the legal opium poppy industry is strictly regulated under international law, restricting the movement of germplasm between countries. Most available reports of opium poppy genetic diversity are based on studies that either assessed small collections of germplasm from a single country or used a limited number of classical DNA markers such as amplified fragment length polymorphisms (AFLPs), random amplified polymorphic DNAs (RAPDs) and simple sequence repeat markers (SSRs)^13,14,22,23.

Molecular markers have been instrumental for the study of genetic diversity and population structure of germplasm collections. Such studies generate information key to both germplasm conservation and use of the resources for genetic improvement of crops. As the most abundant types of sequence variation in plant genomes, single nucleotide polymorphisms (SNPs) are suitable for several applications that require high-density and genome-wide markers including genetic diversity and population structure analyses, QTL mapping and map-based cloning^24,25. Recent advances in next generation sequencing have greatly reduced the cost of genome sequencing, allowing the generation of very large numbers of molecular markers. However, reduced representation sequencing (RRS) is still widely used when analyzing large number of samples in species with large genomes, such as opium poppy. Genotyping-by-sequencing (GBS) is a low-cost and fast RRS method for SNP discovery and mapping, which reduces the complexity of genomes by generating smaller fragments via restriction digestion²⁶. A draft of the P. somniferum genome that captured 94.8% of the estimated genome size, with 81.6% of the sequences assigned to individual chromosomes, has been sequenced recently^27,28. Given the genome is an estimated 2.8 Gb and comprised of 70% repetitive elements, RRS methods are an appropriate, cost-effective method for poppy diversity analyses. Such studies could facilitate the development of poppy core sets, enabling the mapping and isolation of genes or genomic regions associated with traits of interest.

In this work, we applied GBS analysis to characterize 91 poppy accessions obtained from the IPK genebank in Germany. The accessions consisted of two Papaver species, P. somniferum L. subsp. somniferum (P. somniferum hereafter) and P. somniferum L. subsp. setigerum (DC.) Arcang. (P. setigerum hereafter). P. setigerum is commonly thought to be the direct ancestor of P. somniferum and historically has also been used for alkaloid production²⁹. The accessions originated from diverse geographic regions that encompassed 30 countries of Europe, North Africa, America, and Asia. We provide a genome-wide assessment of the genetic diversity and population structure of the accessions using GBS. The data generated here provides a resource that will allow detailed analysis of poppy germplasm and the definition of poppy core set for effective management of poppy genetic resources. It may also facilitate the genetic improvement of opium poppy through the generation of mapping populations, the identification of useful traits and development molecular markers for marker-assisted selection.

Results

Optimization of a poppy GBS protocol for SNP discovery

To generate a SNP dataset sufficient for genetic diversity analysis, we first optimized a GBS protocol for opium poppy. GBS remains a method of choice for genome-wide SNP detection in non-model species and species with large genomes. It is however yet to be utilized for opium poppy. A critical step in GBS protocols is the reduction of genome complexity using restriction enzymes (REs), and the two-enzyme GBS protocol uses a combination of a rare- and common-cutting REs³⁰. Although some RE combinations are often used in plants, the optimal combinations need to be determined for the genome of each species³¹. To select the optimal enzyme combination for opium poppy, we prepared eight double digested libraries from a pool of 3 representative samples using the enzyme combinations PstI/MspI, PstI/MseI, PstI/NlaIII, PstI/HpyCH4IV, EcoRI/MspI, EcoRI/MseI, EcoRI/NlaIII and EcoRI/HpyCH4IV. We then compared the absence of visible repeat regions within the size selection area (280–375 bp) and level of amplifications to select EcoRI/NlaIII as the optimal combination for opium poppy GBS library preparation (Fig. S1).

The multiplexed pool of EcoRI/NlaIII-based GBS libraries for 91 accessions was sequenced to generate a total of 103,802,122 raw 150-bp single end reads (15.57 Gb data, average 1.14 million reads per sample, Tables S1, 2). We aligned the 103,601,279 reads to the poppy reference genome after filtering, with a read alignment rate of 97–99% (Table S3). A total of 165,363 SNPs was identified at 76,407 loci and present in ≥ 90% of accessions, which is significantly higher than the number of SNPs we were able to call using a reference-free pipeline (Table S4). These SNPs were evenly distributed across the 11 chromosomes and unplaced scaffolds of the draft opium poppy reference genome (Fig. 1a,b). The 165,363 SNPs were predicted to have 165,786 effects, of which 149,536 (90.2%) were found in intergenic regions, while the remaining 16,250 (9.8%) corresponded to genic regions (Table S5). This optimized protocol will allow researchers to rapidly apply GBS to unlimited number of poppy accessions at reduced cost, allowing detailed characterization of the available germplasm.

Assessing genetic relatedness of 91 Papaver accessions

Next, we set out to determine the relationship amongst the 91 Papaver accessions from a broad geographic range (Table S1). The accessions, which originated from 30 countries in four continents, were primarily P. somniferum (88 accessions) but included 3 P. setigerum accessions (Table S1). We first used pairwise comparisons of the 165,363 filtered SNPs for hierarchical clustering based on the identity-by-state algorithm³². Four clusters plus a single accession (PAP 400) far from all others were identified (Fig. 2). Cluster 1 contained the three P. setigerum accessions, while the P. somniferum accessions, except PAP 400, were grouped into three distinct clusters. Although PAP 400 was labelled as a P. somniferum accession, our result suggests that it is neither P. somniferum nor P. setigerum. We assessed the morphological characteristics of the accessions when grown under controlled glasshouse conditions. We found that a range of morphological traits were quite diverse between accessions, including seed and capsule characteristics (Fig. 3; Table S6; Fig. S2–S4). PAP 400 was morphologically distinct from both Papaver species (Fig. 4a).

To study the relationships between P. somniferum, P. setigerum and PAP 400, we determined genome sizes and ploidy levels of PAP 400 and representative P. somniferum and P. setigerum accessions by flow cytometry. We estimated the genome size of P. somniferum to be ~ 3.04 Gb, which is slightly bigger than a previous ~ 2.87 Gb estimate²⁷ (Fig. 4b). The genome of P. setigerum was estimated to be ~ 4.9 Gb, indicating the P. setigerum genome is close to twice the genome size of P. somniferum (Fig. 4c,f). This was similar to previous genome size estimates and supports reports that P. setigerum is tetraploid (2n = 44) with chromosomes smaller in size compared with the diploid (2n = 22) P. somniferum^33,34,35. PAP 400 had a slightly smaller genome size than the diploid P. somniferum (Fig. 4d,e). Taken together, our results suggest PAP 400 may be a different species and a case of mislabelling, which can occur in seedbanks during plant cultivation and storage³⁶. This is possible given that IPK holds seeds of other Papaver species in its collection. However, the dataset we present is relatively small, including only three P. setigerum accessions. Consequently, accurate classification of PAP 400 and the accessions in general would require a larger dataset, in particular covering more P. setigerum accessions and the other known Papaver species.

Population structure and genetic diversity amongst Papaver accessions

Understanding the genetic structure of populations is useful for germplasm conservation and plant breeding. To infer population structure, we analysed the 90 Papaver accessions, removing the outlier PAP 400 and associated data (131,039 SNPs remaining). Five clusters (hereafter termed subpopulations) were inferred at the lowest Bayesian information criterion (BIC) score (Fig. 5a). To understand the genetic relationships between the five subpopulations, we carried out DAPC. Ten principal components (PCs) were retained (with 47.75% of the variance conserved) by the cross-validation function, which gave four discriminant eigenvalues (Fig. 5b). Subpopulation 1 (SP1) was comprised of all three P. setigerum accessions, while SPs 2–5 consisted of 12, 4, 21, and 50 P. somniferum accessions, respectively. The wide separation between SP1 and the other subpopulations (SPs 2–5) on the DAPC plot illustrates the extensive genetic difference between P. somniferum and P. setigerum.

We investigated population structure in greater detail. We detected admixture amongst the 90 poppy accessions by applying the admixture model in STRUCTURE using 49,166 unlinked SNPs³⁷. Based on ΔK values, the most optimum K value detected was four to eight (Fig. S5). Bar plots for each optimal K value, with the accessions sorted following the DAPC result, illustrated that the pattern of subpopulation assignment did not change significantly across the different K values and was consistent with the five subpopulations determined by DAPC (Fig. 5c). At K = 5, the three P. setigerum accessions making up the SP1 from DAPC are genetically distinct with no admixture from the other subpopulations (Fig. 5c). Interestingly, the 12 accessions of SP2 were a less genetically diverse group representing broad geographic origins extending from North Africa to East Asia, indicative of germplasm exchanges in the past. The four accessions of SP3, which were all from North Korea, had moderate level of admixtures from SPs 2 and 5. The 21 accessions making up SP4 were highly diverse with a high level of admixtures from SPs 2, 3 and 5. Most of the accessions in SP4 originated from western and Mediterranean regions of Europe. Considering that opium poppy was domesticated in the western Mediterranean, from where it spread to north and central Europe, this admixture might be due to ongoing gene flow between wild and domesticated forms. SP5 contained 50 accessions with low level of admixtures from all the other subpopulations^5,6. Subpopulations were significantly differentiated, as shown by pairwise calculation of genetic differentiation or fixation index (F_ST) that ranged from 0.235 to 0.627, suggesting either low levels of allele sharing or differences in allele frequencies between subpopulations (Fig. 5d; Table 1). SP1, comprised of P. setigerum accessions, was strongly differentiated from sub-populations of P. somniferum accessions, supporting the results of DAPC and STRUCTURE analysis (Fig. 5b,c).

Table 1 Pairwise genetic differentiation (F_ST) between five subpopulations of opium poppy accessions calculated from 49,166 single nucleotide polymorphism loci.

Full size table

There were noticeable differences in genetic diversity between the five DAPC subpopulations (Table 2). The number of private/unique alleles (AP) ranged from 790 (SP3) to 37,157 (SP1; P. setigerum), calculated from the 131,039 SNP dataset. This further confirmed the genetic distinctiveness of P. setigerum. The percentage of polymorphic loci varied from 3.93 (SP3) to 25.05% (SP5). The level of observed heterozygosity (H_O) was highest for SP1 (0.166) and lowest for SP2 (0.009). H_O was lower than the expected heterozygosity (H_E) for all subpopulations except SP1 (P. setigerum), indicating high levels of inbreeding in P. somniferum. This finding was supported by the higher inbreeding coefficients (F_IS) for the P. somniferum subpopulations (0.061 to 0.377). F_IS was negative for SP1, possibly a consequence of an excess of the observed heterozygotes. The highest nucleotide diversity (π) was observed in SP1 (0.219) and the lowest in SP3 (0.040). The genetic variations were both due to differences between (41.6%) and within (44.8%) subpopulations, determined using analysis of molecular variance (AMOVA; Table 3).

Table 2 Measures of diversity for 90 Papaver accessions from five subpopulations calculated from 49,166 single nucleotide polymorphism loci.

Full size table

Table 3 Analysis of molecular variance for 90 opium poppy accessions based on 49,166 single nucleotide polymorphism markers.

Full size table

We further investigated patterns of accession groupings with regard to their country of origin by combining the hierarchical clustering and population structure analyses with information on the country of origin of each accession (Fig. 6). The North Korean accessions formed a distinct subpopulation within this analysis. Additionally, some accessions obtained from the same countries were grouped together as genetically similar. Examples included accessions from Japan, Morocco, Belgium, Switzerland, Germany, Mongolia, Russia, Australia, Bulgaria, and Czechoslovakia (Fig. 6). We interpret these results as reflecting the highly restricted movement of poppy germplasm between different countries and the controlled circumstances under which poppies are cultivated.

Variability in alkaloid content across genetically distinct poppy accessions

Papaver cultivars and accessions exhibit notable variability in the quantity and composition of their alkaloid contents^38,39. Understanding the genetic basis of this variability would improve knowledge of alkaloid biosynthesis and may provide tools for breeding and synthetic biology. We examined this by quantifying both the major (morphine, codeine, thebaine) and minor (papaverine and opripavine) alkaloids in dry capsules of the 90 accessions that reached full maturity (Fig. 7). We observed considerable variation among the accessions in alkaloid content and composition (Fig. 7). Total alkaloid content ranged from 0.125 (PAP 795) to 1.610 (PAP 784) g/100 g DW, whereas morphine content varied between 0.072 (PAP 229A, P. setigerum) and 1.416 (PAP 784) g/100 g DW. Codeine and thebaine contents ranged from 0.002 (PAP 795) to 0.342 (PAP 151) and 0.000 to 0.336 (PAP 719) g/100 g DW, respectively (Fig. 7a). Eight of the accessions analysed (9%) did not have a detectable level of thebaine. Papaverine and oripavine content ranged from 0.000 to 0.077 and 0.102 g/100 g DW, respectively. We identified 18 accessions with undetectable levels of papaverine and six with undetectable oripavine. These results demonstrate that considerable diversity in total alkaloid contents exists in the accessions studied.

The relative abundance of individual alkaloids also varied across accessions. Morphine was the most abundant alkaloid in 85 of the 90 accessions (94.4%), ranging from 32.3 to 96% (of total alkaloids) (Fig. 7b). Codeine was the most abundant alkaloid (over 50% of total alkaloids) in only three accessions (PAP 150, PAP 151 and PAP 152), though 14 accessions had codeine abundances of 20% (of total alkaloids) or more. Notably, all three high codeine accessions were collected in Morocco. Thebaine, papaverine or oripavine were the most abundant alkaloid in none of the accessions. Thebaine composition ranged from 0.0 to 24.4% (of total alkaloids), with the highest in PAP 719. The proportions of papaverine ranged from 0.0 to 9.4% and oripavine from 0.0 to 29.9% (of total alkaloids).

We tested for relationships between genetic and chemical diversity by applying principal component analysis based upon the alkaloid profiles of the accessions, then compared subpopulation clustering patterns (Fig. 7c and Fig. S6). The accessions of subpopulations 4 and 5 had the most similar alkaloid composition, demonstrated by PCA (PC1 and PC2 accounting for 92.4% of variability, Fig. 7c). The P. setigerum accessions were clearly separated from all others, while accessions of subpopulation 2 had the most diverse and distinct alkaloid composition among the P. somniferum accessions. Accessions with the highest codeine proportions, including PAP 150 (53.9%), PAP 151 (52.0%), PAP 152 (50.4%), PAP 200 (42.2%), PAP 739 (35.6%), PAP 149 (33.8%) and PAP 354 (28.7%), were clearly separate from the others (Fig. 7c). These results suggested a congruent pattern of genetic and chemical diversity of the accessions. However, considering that alkaloid content and composition can be affected by environment, it is important that such data is validated in replicated field experiments³⁸. For a crop like poppy that is highly valued for its secondary metabolites, understanding patterns of metabolic variation is important both for conservation and utilization of the available germplasm.

Phylogenetic relationships of P. somniferum accessions

We explored the phylogenetic relationships between P. somniferum accessions within the collection by constructing a Neighbor-Net using 49,160 unlinked SNPs⁴⁰. Distinct genetic groups were identified, mostly reflecting the subpopulations identified by the population structure analyses (Fig. 8). Accessions of P. somniferum and P. setigerum were clearly separated. Groups 1 and 2 contained all the three and 12 accessions from SP1 and SP2 of the DAPC, respectively. The accessions of SP4 differentiated into two groups, Group 3 and 4, consistent with the STRUCTURE analysis indicating that the highest amount of admixture was in SP4 (Figs. 5b, 8). The North Korean accessions formed Group 5, while Group 6 included all 50 accessions of DAPC SP5.

We examined morphological and agronomical traits that could potentially distinguish the various subpopulations and genetic groups. We found that variations in the agronomic-related traits of number of branches per plant and 1000-seed weight were broadly consistent with the subpopulations and phylogenetic groups identified (Fig. 9a). Branch number ranged from 1.75 (PAP 184) to 11.25 (PAP 229A), and 1000-seed weight from 0.165 (PAP 255) to 0.815 g (PAP 733). The P. setigerum accessions (Group 1) and accessions in Groups 2 to 5, representing accessions of the DAPC SP1 to SP4, were generally highly branching and produced lighter seeds (Fig. 9a). Contrastingly, accessions in Group 6 (SP5) were less branching and produced heavier seeds. We tested if the association between branch number and 1000-seed weight was significant and found a significant negative correlation between the two traits (r = − 0.70, p < 0.001) (Fig. 9c). Seed size and branching habit are important characteristics to distinguish Papaver species^5,41. We also observed differences between genetic groups with respect to capsule and seed morphology (Figs. S4, S5). The loss of a wind-based seed dispersal mechanism through a transition from poricidal to indehiscent capsules is believed to be among the changes in morphological traits that occurred during poppy domestication⁷. Our study provides a preliminary analysis of this interesting topic, which could be further investigated using larger experiments that incorporate field data.

Discussion

The poppy germplasm currently available in several genebanks is a rich potential source of useful alleles. Accurate genotyping of this germplasm is crucial to dissect the available genetic diversity. This is an important step for germplasm conservation and can also facilitate the identification and deployment of promising genotypes/alleles for genetic improvement of the crop. With this in mind, we optimized a GBS protocol for poppy and applied GBS-based analysis of SNP markers for the assessment of genetic relationships in diverse poppy accessions from P. somniferum and P. setigerum. This optimized protocol provides a rapid and cost-effective method for genotyping of unlimited number of accessions.

The current P. somniferum genome is a draft with only 81.6% of the sequences assigned to individual chromosomes²⁷. However, we were able to identify significantly more markers when using it as a reference for both P. somniferum and P. setigerum SNP calling than when using a reference-free GBS analysis method (Table S4). This result was supported by a high alignment rate (> 97%) and number of private alleles (37,157) identified for the P. setigerum accessions (Table 2; Table S3). The P. somniferum and P. setigerum genomes share considerable homology and have a close phylogenetic relationship, which likely explains these results^{34,42,43,44,45,46}. The differing ploidy of P. setigerum (tetraploid) and P. somniferum (diploid) was a potential limitation of our analyses, but GBS has previously been implemented on species with different ploidy levels^{47,48,49,50,51}. However, accurate identification of SNPs from polyploids is still a challenge that requires improvement of existing software packages or development of new ones⁵².

Whole genome sequencing (WGS) is an ideal tool to capture complete information about genome variability including SNP variants. However, the RRS approaches such as GBS are practical and cost-effective methods for simultaneous SNP discovery and genotyping of species with large and complex genomes or when analysing large number of samples. Furthermore, for species like opium poppy where the available germplasm is yet to be fully characterized, GBS can be applied to characterize a large collection from which a core set could be selected to represent the genetic diversity of the entire collection. WGS can then be applied to the accessions in the core collection for further, deep analysis. This is a critical step to efficiently manage the genebank collection and promote its use. The accessions characterized in our study and the associated GBS data can significantly contribute towards this goal.

The phylogenetic and taxonomic relationships of Papaver species, particularly that of the cultivated P. somniferum and the wild species P. setigerum, is still debated^6,29,33,46. Some treat the two as separate species, while others consider P. setigerum as a subspecies of P. somniferum based on morphology and alkaloid profiles^29,46. Our findings confirmed the genetic separation of the accessions from the two named groups (Figs. 3, 4, 5). This result was expected, because it has been shown previously that P. setigerum is distinct from, but phylogenetically closer to, P. somniferum than all other Papaver species^45,46. P. setigerum is also considered the putative progenitor of cultivated P. somniferum^5,29. Our result based on genome size analysis supports previous reports that P. somniferum is diploid (2n = 22) and P. setigerum tetraploid (2n = 44) (Fig. 4)^34,35. These data do not seem to support the hypothesis of a direct origin of P. somniferum from P. setigerum. However, they raise interesting questions about the possible relationships between the two species. Considerable homology exists between the two genomes, suggesting they may have a common origin³⁴. Interspecific crosses are possible between the two species despite some meiotic abnormalities observed at the F1 generation^34,53. Notably, a diploid form of P. setigerum has been described^29,54. However, our data is based on only three P. setigerum accessions that are all tetraploids. For a detailed analysis of the genetic relationship of these two species, more samples from P. setigerum, including both the diploid and tetraploid forms, need to be studied.

Our analysis of population structure and phylogeny generated similar accession groupings. Interestingly, these groups can be broadly identified based on traits such as branching and seed weight (Fig. 9). Although this finding needs further verification with larger datasets and in replicated experiments including in the field, the data can be a useful input for studies aiming to unravel the domestication history of opium poppy. Domestication traits in poppy are poorly defined, making it difficult to investigate this process particularly in archaeological records. Changes in capsule and seed sizes are believed to be among the domestication-related traits in poppy⁷. Our preliminary observations also suggest capsule indehiscence is a useful trait for consideration in future studies.

The accessions we studied were highly diverse in their alkaloid profiles. Although morphine was the dominant alkaloid in most of the accessions, we also identified accessions with codeine levels of up to 54% (of total alkaloids). Both natural and induced mutants with altered alkaloid profiles have been instrumental to elucidate the molecular mechanisms underlying differences in alkaloid contents in poppy^{27,55,56,57,58}. The transcriptional regulation of alkaloid biosynthesis in poppy has not been studied in detail. Diverse chemotypes are potential sources of gene expression or enzyme variants with differing activities that affect the alkaloid biosynthesis pathway. These are vital resources to understand the biochemical and genomic regulation of alkaloid biosynthesis. This is also key for breeding commercial lines with specific alkaloid abundances or desired alkaloid profiles. The development of the top1 mutant, a high-thebaine and high-oripavine variety, was a significant commercial breakthrough that allowed the production of thebaine from morphine-free plants⁵⁶. Thebaine is used for the semi-synthesis of painkillers oxycodone and hydrocodone and the anti-opioid addiction drugs buprenorphine and naltrexone⁵⁹. Similarly, the development of high codeine and morphine-free poppy varieties would allow the direct plant-based production of codeine while preventing the illicit synthesis of the morphine-derived heroin⁶⁰.

Our study demonstrates the utility of GBS for genetic analyses in opium poppy. Many hundreds of poppy germplasm accessions are available from genebanks worldwide. These are an immense potential resource if fully exploited. Application of GBS to the entire poppy germplasm collection, complemented by the currently available draft reference genome sequence of poppy, could drive further studies to unravel the extent of genetic diversity in the species. Although poppy is largely self-pollinating, a considerable degree of outcrossing has been reported^61,62,63. Consequently, future genetic diversity studies need to take into consideration intra-accession variability. Based on our preliminary observations, we also suggest that traits such as seed weight, capsule size and dehiscence, and branching be considered in studies of the domestication and phylogenetic relationships of opium poppy and related species. Data generated from such studies would also enable the development a poppy core set, which is an important step towards the selection of suitable parents and development of mapping populations or genetic panels for elucidating the genetic architecture of traits of agronomic and pharmaceutical importance.

Methods

Plant materials and morphological characterization

Seeds of 95 Papaver accessions from diverse geographical origins were obtained from the global poppy germplasm collection maintained at Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Genbank in Germany (Table S1). Of these, 91 successfully germinated accessions and were used for GBS analysis. We excluded four P. somniferum accessions that failed to germinate. The accessions were morphologically diverse materials belonging to the two main Papaver species: P. somniferum (88 accessions) and P. setigerum (3 accessions) (Fig. 1). Seeds were sown in 200 mm pots using a standard potting mix. Two pots (each a replication) were used per line and plants were thinned to two per pot after germination. Plants were grown in a glasshouse under constant temperature (22 °C day and night) and photoperiod (17 h/7 h light/dark). Capsules were harvested for alkaloid profiling at the dry capsule stage.

At full plant maturity data was recorded on number of branches (main stem plus side branches), 1000-seed weight, seed colour and capsule and seed characteristics. Images of capsules were taken under normal room light using a SM-G950U1 digital camera. The seed images were captured using a Leica M205 FA stereomicroscope with a digital camera Leica DFC 420 and Leica Application Suite Software (LAS 4.0, Leica) with 10% light and 50% zoom in condition. To measure seed weight, we first removed debris then counted 1000 clean seeds using a Contador seed counter fitted with feed container no. 3 for fine seeds (Pfeuffer GmbH, Germany). To determine seed colour, images of the seeds were taken using a Cannon EOS-600D camera together with a ColorChecker passport (X-rite) for camera calibration and colour correction. The original Red, Green, Blue (RGB) colour channels of seed digital images were calibrated in Lightroom Classic CC (Adobe Creative Cloud). The RGB values were then converted to Hex colour codes using RGB Color code chart (https://www.rapidtables.com/web/color/RGB_Color.html). The corresponding approximate colours were determined from Hex colour codes using HTML CSS Color Picker (https://www.htmlcsscolor.com/hex/).

DNA isolation, GBS library preparation and sequencing

For DNA extraction, ~ 50–100 mg fresh young leaves were harvested in liquid nitrogen and ground to a fine powder using a TissueLyser II system (Qiagen). Genomic DNA was extracted using the CTAB (cetyl trimethylammonium bromide) method⁶⁴ with the following minor modification. Following grinding of the samples but prior to addition of the extraction buffer, samples were washed using a 0.1 M HEPES (N-2-hydroxyetylpiperazine-N′-ethanesulphonic acid) buffer (pH 8.0) containing 1% polyvinylpyrrolidone, 0.9% L-ascorbic acid, and 2% 2-mercaptoethanol to remove polysaccharides and phenolic compounds. DNA quality was checked by agarose gel electrophoresis and quantified with a NanoDrop spectrophotometer ND-1000 version (Thermo Fisher Scientific, Wilmington, DE, USA). For GBS library construction, the double digest RAD-seq (ddRAD) based library preparation protocol was used⁶⁵. The protocol included the following steps: DNA digestion with two restriction enzymes, ligation of barcoded adapters compatible with restriction sites overhang, size selection of pooled digested-ligated fragments using Blue Pippin, and amplification of library via PCR using indexed primers. For protocol optimization, eight double digested libraries were prepared from a pool of 3 representative samples (PAP 630, PAP 696 and a commercial cultivar) using the restriction enzyme combinations PstI/MspI, PstI/MseI, PstI/NlaIII, PstI/HpyCH4IV, EcoRI/MspI, EcoRI/MseI, EcoRI/NlaIII and EcoRI/HpyCH4IV. The library generated using EcoRI and NlaIII was sequenced on the Illumina NextSeq500 platform (Illumina, San Diego, CA, USA) with the standard protocol for single-end reads in 150-cycle mid-output mode at the Australian Genome Research Facility (Melbourne, Australia).

Mapping and SNP calling

Raw sequence data were demultiplexed and sorted using the “process_radtags” function in STACKS v2.41 with the default parameters: “--inline-index” for barcode option and “--renz_1 EcoRI --renz_2 NlaIII” for enzymes option⁶⁶. After trimming barcode sequences, all trimmed reads (150 bp) were checked for quality, and low-quality reads (with quality score of less than 10) and “no RadTag” reads were removed. The filtered reads were aligned to the draft poppy genome sequence retrieved from NCBI (GCA_003573695.1_ASM357369v1) using BWA v0.7.17 and SAMtools v1.9^27,67,68. SNP calling was performed using the refmap.pl pipeline of STACKS v2.41. All the 91 accessions were treated as a single population. SNP loci that were mapped with reads from less than 90% of the accessions sequenced (--min-samples-per -pop 0.9) were excluded from further analysis. The same parameter was applied for the non-reference-based SNP calling using denovo.pl pipeline of STACKS. The density and distribution of the filtered SNPs across poppy chromosomes were determined using BEDTools v2.26.0 and SnpEff v4.1l^69,70. To visualize SNP density, the “MVP.report” function with 1 Mb non-overlapping window from rMVP package was performed in R studio^71,72,73.

Population structure analysis

The population structure of the accessions was assessed using four methods. First, the accessions were clustered using identity-by-state (IBS) to determine the relationship between accessions based on the proportion of shared alleles between pairs of individuals in PLINK v1.07³². Second, for elucidation of the genetic structure and identification of the optimum number of subgroups, we removed the outlier PAP 400 and applied the Bayesian information criterion (BIC) analysis to the remaining 90 accessions. BIC analysis as a nonparametric method was performed with the adegenet package v2.1.3 in R studio based on 131,039 SNP dataset^71,74,75. The best number of subgroups/subpopulations was determined as the K-means corresponding to the lowest BIC score using the “find.clusters” function. To determine the relationship between the subpopulations, we carried out discriminant analysis of principal components (DAPC). A cross validation function “Xval.dapc” was used to determine the optimal number of PCs to be retained.

Third, to investigate the population structure in detail, the admixture within the accessions was determined using Bayesian clustering based on a Bayesian Markov Chain Monte Carlo model (MCMC) implemented in STRUCTURE v2.3.4³⁷. With the assumption that SNPs at the same RAD locus are linked, --write_single_snp flag (in STACKS) was applied to ensure that only one SNP per RAD locus was used for STRUCTURE analysis. To determine the most likely number of subpopulations (clusters), four independent runs with 500,000 iterations and a 150,000-step burn-in period were performed for each K from 1 to 10. The output was obtained by structureHarvester v0.6.93 using the maximum estimated log-likelihood [log(P(X|K)] model and the highest ΔK in Evanno method^37,76,77. After determining the most probable K values, ten runs of 500,000 iterations followed by a 150,000 step burn-in were performed using STRUCTURE for each K. Additionally, for each optimal K, CLUMPP was used to generate individual and population Q matrices from the membership coefficient matrices of the ten replicates obtained from STRUCTURE⁷⁸. Bar plots were generated using DISTRUCT software⁷⁹. Forth, principal component analysis (PCA) was conducted using PLINK v1.07 and plotted using ggplot2 package in R studio^32,71,80.

Genetic diversity and differentiation

Common measures of genetic diversity including private allele number (AP), percentage of polymorphic loci (%Poly), observed and expected heterozygosity (H_O and H_E), nucleotide diversity (π) and inbreeding coefficient (F_IS) were calculated for the five subpopulations using the “populations” function in STACKS v2.41⁶⁶. The genetic differentiation between the subpopulations was calculated based on pairwise population differentiation (F_ST) values from GENODIVE v3.05⁸¹. Significance levels (α = 0.05) of the F_ST values were determined by running 999 permutations and assessing this against a Bonferroni-adjusted P-value to account for multiple testing. The correlation matrix was visualized using the corrplot package in R studio^73,82. To determine the distribution of genetic variation, analysis of molecular variance (AMOVA) was performed using GENODIVE v3.05⁸¹. Significance level was tested using 999 permutations.

Phylogenetic analysis

To explore the phylogenetic relationships between P. somniferum accessions, we constructed a phylogenetic network using 49,160 unlinked SNPs. The SNPs were exported in PHYLIP format (--phylip-var) from the “populations” function in STACKS v2.41 to the SplitTree4 software⁴⁰. The split network was created with the “uncorrectedP” method, which ignores ambiguous sites, and visualized using “Neightbornet” network. One thousand bootstrap values were used.

Alkaloid profiling

Alkaloid content of 90 accessions was measured according the protocol used by Dittbrenner and colleagues³⁸. Only accessions that produced capsules samples sufficient for analysis were included. Alkaloids were also analysed in four accessions that had failed to germinate during the first trial and subsequently lacked GBS data (Table S1). For testing the relationship between alkaloid and genetic diversity, we conducted principal component analysis (PCA) using TASSELv5.2.73 and plotted the graph using ggplot2 package in R studio^73,80,83.

Genome size and ploidy analysis

The genome size and ploidy level of selected accessions was estimated using flow cytometry analysis of propidium iodide (PI)-stained nuclei isolated from poppy leaves. Nuclei of tomato (Solanum lycopersicum), which has genome size of ~ 900 Mb⁸⁴, were simultaneously isolated from leaves, stained and analysed with poppy nuclei as an internal reference standard. Nuclei were isolated with the Galbraith lysis buffer using a modified protocol from Gutzat and Scheid as follows: (1) Chopping fresh young leaf tissue (0.5 g) using double-sided razor blades in 2 mL ice-cold lysis buffer, (2) filtering the homogenate through a 40 µm nylon mesh, (3) adding 2.5 µL of RNase (10 mg/mL) to 500 µL filtered homogenate, then incubating on ice for 10 min, (4) centrifugation at 400 g for 3 min, removing the supernatant and resuspending the pellet gently in 1 mL lysis buffer, then incubating on ice for 15 min, and (5) filtering the homogenate again through a 40 µm nylon mesh^85,86. For nuclei staining, 25 µL PI (1 mg/ml) was added into 500 µL nuclei solution to get a final concentration of 50 µg/mL. Samples were screened on a CytoFLEX S flow cytometer (Beckman Coulter, Brea, CA, USA). The PerCP-A fluorescence intensity of G1 and G2 phase cells of internal standard and samples was used to estimate genome size and ploidy level of the samples.

Ethical standards

Seeds of all poppy accessions were obtained from a public seedbank and transferred in accordance with international legislation. All experimental research in this study was conducted in compliance with the relevant institutional, national, and international guidelines and legislation.

References

Kritikos, P. G. & Papadaki, S. The History of the Poppy and of Opium and Their Expansion in Antiquity in the Eastern Mediterranean Area (UN, 1967).
Google Scholar
Askitopoulou, H., Ramoutsaki, I. A. & Konsolaki, E. Archaeological evidence on the use of opium in the Minoan world in International Congress Series. 23–29 (Elsevier, 2002).
Bernáth, J. & Németh, É. Poppy in Oil crops (eds. Vollmann, J. & Rajcan, I.) 449–468 (Springer, 2009).
Salavert, A., Martin, L., Antolín, F. & Zazzo, A. The opium poppy in Europe: Exploring its origin and dispersal during the Neolithic. Antiquity 92 (2018).
Jesus, A. et al. A morphometric approach to track opium poppy domestication. Sci. Rep. 11, 1–11 (2021).
Google Scholar
Salavert, A. et al. Direct dating reveals the early history of opium poppy in western Europe. Sci. Rep. 10, 1–10 (2020).
Google Scholar
Zohary, D., Hopf, M. & Weiss, E. Domestication of Plants in the Old World: The Origin and Spread of Domesticated Plants in Southwest Asia, Europe, and the Mediterranean Basin (Oxford University Press on Demand, 2012).
Google Scholar
Beaudoin, G. A. & Facchini, P. J. Benzylisoquinoline alkaloid biosynthesis in opium poppy. Planta 240, 19–32 (2014).
CAS PubMed Google Scholar
INCB/UN. Report of the International Narcotics Control Board 2020. (United Nations Publications, 2021).
Tamiru‐Oli, M., Premaratna, S. D., Gendall, A. R. & Lewsey, M. G. Biochemistry, Genetics, and Genomics of Opium Poppy (Papaver somniferum) for Crop Improvement. Annu. Plant Rev.Online, 1177–1219 (2018).
Labanca, F., Ovesna, J. & Milella, L. Papaver somniferum L. taxonomy, uses and new insight in poppy alkaloid pathways. Phytochem. Rev. 17, 853–871 (2018).
CAS Google Scholar
EFSA Panel on Contaminants in the Food Chain. Update of the Scientific Opinion on opium alkaloids in poppy seeds. EFSA J. 16, e05243 (2018).
Google Scholar
Saunders, J. A., Pedroni, M. J., Penrose, L. D. & Fist, A. J. AFLP analysis of opium poppy. Crop Sci. 41, 1596–1601 (2001).
CAS Google Scholar
Celik, I. et al. Molecular genetic diversity and association mapping of morphine content and agronomic traits in Turkish opium poppy (Papaver somniferum) germplasm. Mol. Breed. 36, 46 (2016).
Google Scholar
Lahiri, R., Lal, R., Srivastava, N. & Shanker, K. Genetic variability and diversity in Indian germplasm of opium poppy (Papaver somniferum L.). J. Appl. Res. Med. Aromat. Plants 8, 41–46 (2018).
Google Scholar
Srivastava, A. et al. Genetic diversity in Indian poppy (P. somniferum L.) germplasm using multivariate and SCoT marker analyses. Ind. Crops Prod. 144, 112050 (2020).
CAS Google Scholar
Svoboda, P., Vašek, J., Vejl, P. & Ovesná, J. Genetic features of Czech blue poppy (Papaver somniferum L.) revealed by DNA polymorphism. Czech J. Food Sci. 38, 198–202 (2020).
CAS Google Scholar
Bajpai, S., Gupta, A., Gupta, M. & Kumar, S. Inter-relationships between morphine and codeine in the Indian genetic resources of opium poppy. J. Herbs Spices Med. Plants 8, 75–81 (2001).
Google Scholar
Prajapati, S. et al. Alkaloid profiles of the Indian land races of the opiumpoppy Papaver somniferum L. Genet. Resour. Crop Evol. 49, 183–188 (2002).
Google Scholar
Nutr, H. Börner, A. Preservation of plant genetic resources in the biotechnology era. Biotechnol. J. Technol. 1, 1393–1404 (2006).
Google Scholar
Brezinova, B., Macak, M. & Eftimova, J. The morphological diversity of selected traits of world collection of poppy genotypes (genus Papaver). J. Cent. Eur. Agric. 10, 183–192 (2009).
Google Scholar
Acharya, H. S. & Sharma, V. Molecular characterization of opium poppy (Papaver somniferum) germplasm. Am. J. Infect. Dis. 5, 148–153 (2009).
CAS Google Scholar
Celik, I., Gultekin, V., Allmer, J., Doganlar, S. & Frary, A. Development of genomic simple sequence repeat markers in opium poppy by next-generation sequencing. Mol. Breed. 34, 323–334 (2014).
CAS Google Scholar
Batley, J. & Edwards, D. SNP applications in plants in Association mapping in plants (eds. Oraguzie et al) 95–102 (Springer, 2007).
Kumar, S., Banks, T. W. & Cloutier, S. SNP discovery through next-generation sequencing and its applications. Int. J. Plant Genomics 2012 (2012).
Elshire, R. J. et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6, e19379 (2011).
CAS PubMed PubMed Central ADS Google Scholar
Guo, L. et al. The opium poppy genome and morphinan production. Science 362, 343–347 (2018).
CAS PubMed ADS Google Scholar
Li, Q. et al. Gene clustering and copy number variation in alkaloid metabolic pathways of opium poppy. Nat. Commun. 11, 1–13 (2020).
ADS Google Scholar
Hammer, K. Problems of Papaver somniferum-classification and some remarks on recently collected European poppy land-races. Die Kulturpflanze 29, 287–296 (1981).
Google Scholar
Poland, J. A., Brown, P. J., Sorrells, M. E. & Jannink, J.-L. Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS ONE 7, e32253 (2012).
CAS PubMed PubMed Central ADS Google Scholar
Glaubitz, J. C. et al. TASSEL-GBS: A high capacity genotyping by sequencing analysis pipeline. PLoS ONE 9, e90346 (2014).
PubMed PubMed Central ADS Google Scholar
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
CAS PubMed PubMed Central Google Scholar
Srivastava, S. & Lavania, U. Evolutionary DNA variation in Papaver. Genome 34, 763–768 (1991).
CAS Google Scholar
Malik, C., Mary, T. & Grover, I. Cytogenetic studies in Papaver. V. Cytogenetic studies on P. somniferum x P. setigerum hybrids and amphiploids. Cytologia 44, 59–69 (1979).
Google Scholar
Wakhlu, A. & Bajwa, P. Cytological analysis in embryogenic callus cultures and regenerated plants of Papaver somniferum L.(opium poppy). Cytologia 52, 631–638 (1987).
Google Scholar
Akpertey, A., Padi, F. K., Meinhardt, L. & Zhang, D. Effectiveness of Single Nucleotide Polymorphism markers in genotyping germplasm collections of Coffea canephora using KASP assay. Front. Plant Sci. 11, 2300 (2020).
Google Scholar
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
CAS PubMed PubMed Central Google Scholar
Dittbrenner, A., Mock, H., Börner, A. & Lohwasser, U. Variability of alkaloid content in Papaver somniferum L. J. Appl. Bot. Food Qual. 82, 103–107 (2009).
CAS Google Scholar
Shukla, S., Yadav, H. K., Rastogi, A., Mishra, B. K. & Singh, S. P. Alkaloid diversity in relation to breeding for specifi c alkaloids in opium poppy (Papaver somniferum L.). Czech J. Genet. Plant Breed. 46, 164–169 (2010).
CAS Google Scholar
Huson, D. H. & Bryant, D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254–267 (2006).
CAS PubMed Google Scholar
Hrishi, N. J. Cytogenetical studies on Papaver somniferum L. and Papaver setigerum DC. and their hybrid. Genetica 31, 1–130 (1960).
CAS PubMed Google Scholar
Lavania, U. C. & Srivastava, S. Quantitative delineation of karyotype variation in Papaver as a measure of phylogenetic differentiation and origin. Curr. Sci. 77, 429–435 (1999).
Google Scholar
Hammer, K. & Fritsch, R. The question of ancestral species of cultivated poppy (Papaver somniferum L.). Kulturpflanze XXV, 113–124 (1977).
Google Scholar
Dittbrenner, A., Lohwasser, U., Mock, H.-P. & Börner, A. Molecular and phytochemical studies of Papaver somniferum in the context of infraspecific classification in V International Symposium on the Taxonomy of Cultivated Plants 799. 81–88 (2007).
Lane, A. K. et al. Phylogenomic analysis of Ranunculales resolves branching events across the order. Bot. J. Linn. Soc. 187, 157–166 (2018).
Google Scholar
Liu, L. et al. The complete chloroplast genome of Papaver setigerum and comparative analyses in Papaveraceae. Genet. Mol. Biol. 43 (2020).
VanWallendael, A., Alvarez, M. & Franks, S. J. Patterns of population genomic diversity in the invasive Japanese knotweed species complex. Am. J. Bot. 108, 857–868 (2021).
PubMed Google Scholar
Ogden, R. et al. Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing. Mol. Ecol. 22, 3112–3123 (2013).
CAS PubMed Google Scholar
Qi, Z.-C. et al. Phylogenomics of polyploid Fothergilla (Hamamelidaceae) by RAD-tag based GBS-insights into species origin and effects of software pipelines. J. Syst. Evol. 53, 432–447 (2015).
Google Scholar
Yang, X. et al. Constructing high-density genetic maps for polyploid sugarcane (Saccharum spp.) and identifying quantitative trait loci controlling brown rust resistance. Mol. Breed. 37, 1–12 (2017).
Google Scholar
Wang, N. et al. Genome sequence of dwarf birch (Betula nana) and cross-species RAD markers. Mol. Ecol. 22, 3098–3111 (2013).
CAS PubMed Google Scholar
Clark, L. V., Lipka, A. E. & Sacks, E. J. polyRAD: Genotype calling with uncertainty from sequencing data in polyploids and diploids. GG3: GenesGenom. Genet. 9, 663–673 (2019).
CAS Google Scholar
Singh, S., Shukla, S., Khanna, K., Dixit, B. & Banerji, R. Variation of major fatty acids in F8 generation of opium poppy (Papaver somniferum× Papaver setigerum) genotypes. J. Sci. Food Agric. 76, 168–172 (1998).
CAS Google Scholar
Hammer, K. & Fritsch, R. Zur Frage nach der Ursprungsart des Kulturmohns Papaver somniferum L. Die Kulturpflanze 25, 113–124 (1977).
Google Scholar
Chaturvedi, N. et al. Comparative analysis of Papaver somniferum genotypes having contrasting latex and alkaloid profiles. Protoplasma 251, 857–867 (2014).
CAS PubMed Google Scholar
Millgate, A. G. et al. Morphine-pathway block in top1 poppies. Nature 431, 413–414 (2004).
CAS PubMed ADS Google Scholar
Winzer, T. et al. A Papaver somniferum 10-gene cluster for synthesis of the anticancer alkaloid noscapine. Science 336, 1704–1708 (2012).
CAS PubMed ADS Google Scholar
Pathak, S. et al. Comparative transcriptome analysis using high papaverine mutant of Papaver somniferum reveals pathway and uncharacterized steps of papaverine biosynthesis. PLoS ONE 8, e65622 (2013).
CAS PubMed PubMed Central ADS Google Scholar
Chen, X. et al. A pathogenesis-related 10 protein catalyzes the final step in thebaine biosynthesis. Nat. Chem. Biol. 14, 738–743 (2018).
CAS PubMed Google Scholar
Hagel, J. M. & Facchini, P. J. Dioxygenases catalyze the O-demethylation steps of morphine biosynthesis in opium poppy. Nat. Chem. Biol. 6, 273–275 (2010).
CAS PubMed Google Scholar
Bhandari, M. Out-crossing in opium poppy Papaver somniferum L. Euphytica 48, 167–169 (1990).
Google Scholar
Miller, J. et al. Pollination biology of oilseed poppy, Papaver somniferum L. Aust. J. Agric. Res. 56, 483–490 (2005).
Google Scholar
Nyman, U. & Hall, O. Some varieties of Papaver somniferum L. with changed morphinane alkaloid content. Hereditas 84, 69–76 (1976).
CAS PubMed Google Scholar
Murray, M. & Thompson, W. F. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 8, 4321–4326 (1980).
CAS PubMed PubMed Central Google Scholar
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S. & Hoekstra, H. E. Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS ONE 7, e37135 (2012).
CAS PubMed PubMed Central ADS Google Scholar
Catchen, J., Hohenlohe, P. A., Bassham, S., Amores, A. & Cresko, W. A. Stacks: An analysis tool set for population genomics. Mol. Ecol. 22, 3124–3140 (2013).
PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
CAS PubMed PubMed Central Google Scholar
Team, R. C. R: A language and environment for statistical computing. R. foundation for stastical computing, Vienna (2013). http://www.R-project.org/
Yin, L. et al. rMVP: A Memory-efficient, Visualization-enhanced, and Parallel-accelerated tool for Genome-Wide Association Study. Genom. Proteom. Bioinform. (2021) In press. https://doi.org/10.1016/j.gpb.2020.10.007.
Team, R. RStudio: Intergrated development for R. RStudio, PBC, Boston, MA (2020). http://www.rstudio.com/
Jombart, T. adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).
CAS PubMed Google Scholar
Jombart, T. & Ahmed, I. adegenet 1.3–1: New tools for the analysis of genome-wide SNP data. Bioinformatics 27, 3070–3071 (2011).
CAS PubMed PubMed Central Google Scholar
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol. Ecol. 14, 2611–2620 (2005).
CAS PubMed Google Scholar
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conser. Genet. Resour. 4, 359–361 (2012).
Google Scholar
Jakobsson, M. & Rosenberg, N. A. CLUMPP: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
CAS PubMed Google Scholar
Rosenberg, N. A. Distruct: A program for the graphical display of population structure. Mol. Ecol. Notes 4, 137–138 (2004).
Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer, 2016). https://doi.org/10.1007/978-3-319-24277-4.
Book MATH Google Scholar
Meirmans, P. G. Genodive version 3.0: Easy-to-use software for the analysis of genetic data of diploids and polyploids. Mol. Ecol. Resour. 20, 1126–1131 (2020).
CAS PubMed PubMed Central Google Scholar
Wei, T. & Simko, V. R package 'corrplot': Visualization of a Correlation Matrix (Version 0.90) (2021). https://github.com/taiyun/corrplot.
Bradbury, P. J. et al. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 (2007).
CAS PubMed Google Scholar
Consortium, T. G. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635 (2012).
ADS Google Scholar
Galbraith, D. W. et al. Rapid flow cytometric analysis of the cell cycle in intact plant tissues. Science 220, 1049–1051 (1983).
CAS PubMed ADS Google Scholar
Gutzat, R. & Scheid, O. M. Preparing chromatin and RNA from rare cell types with fluorescence-activated nuclear sorting (FANS). In Plant Epigenetics and Epigenomics: Methods and Protocols (eds Spillane, C. & McKeown, P.) 95–105 (Springer, 2020).
Google Scholar

Download references

Acknowledgements

UTVH received a PhD scholarship from La Trobe University Graduate Research School. Work in the Lewsey lab is funded by the Australian Research Council Industrial Transformation Hub in Medicinal Agriculture (IH180100006) and a Commonwealth Scientific and Industrial Research Organisation SIEF STEM+ Fellowship with Palla Pharma Ltd. We thank La Trobe’s Bioimaging Platform for support with genome size analysis.

Author information

Authors and Affiliations

La Trobe Institute for Agriculture and Food, La Trobe University, AgriBio Building, Bundoora, VIC, 3086, Australia
Uyen Vu Thuy Hong, Muluneh Tamiru-Oli, Bhavna Hurgobin & Mathew G. Lewsey
Australian Research Council Research Hub for Medicinal Agriculture, La Trobe University, AgriBio Building, Bundoora, VIC, 3086, Australia
Uyen Vu Thuy Hong, Muluneh Tamiru-Oli, Bhavna Hurgobin & Mathew G. Lewsey
Palla Pharma Ltd, Docklands, VIC, 3008, Australia
Christopher R. Okey & Artur R. Abreu

Authors

Uyen Vu Thuy Hong
View author publications
You can also search for this author in PubMed Google Scholar
Muluneh Tamiru-Oli
View author publications
You can also search for this author in PubMed Google Scholar
Bhavna Hurgobin
View author publications
You can also search for this author in PubMed Google Scholar
Christopher R. Okey
View author publications
You can also search for this author in PubMed Google Scholar
Artur R. Abreu
View author publications
You can also search for this author in PubMed Google Scholar
Mathew G. Lewsey
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T.O., M.G.L., U.V.T.H. designed the study. U.V.T.H., M.T.O., C.R.O., A.R.A. conducted the experiments. U.V.T.H., M.T.O., B.H. analysed the data. C.R.O., A.R.A. provided research materials. U.V.T.H., M.T.O., M.G.L. interpreted the results and wrote the paper. All co-authors read and approved the final manuscript.

Corresponding authors

Correspondence to Muluneh Tamiru-Oli or Mathew G. Lewsey.

Ethics declarations

Competing interests

UVTH, MTO, BH and MGL declare no competing interests. CRO and ARA are employees of Palla Pharma Ltd.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hong, U.V.T., Tamiru-Oli, M., Hurgobin, B. et al. Insights into opium poppy (Papaver spp.) genetic diversity from genotyping-by-sequencing analysis. Sci Rep 12, 111 (2022). https://doi.org/10.1038/s41598-021-04056-3

Download citation

Received: 27 September 2021
Accepted: 14 December 2021
Published: 07 January 2022
DOI: https://doi.org/10.1038/s41598-021-04056-3

This article is cited by

Evaluation of chloroplast DNA barcoding markers to individualize Papaver somniferum for forensic intelligence purposes
- Kari Graham
- Rachel Houston
International Journal of Legal Medicine (2024)
Partially defatted rather than native poppy seeds beneficially alter lipid metabolism in rats fed a high-fat diet
- Jarosław Koza
- Adam Jurgoński
Scientific Reports (2023)
Genetic structure and geneflow of Malus across the Korean Peninsula using genotyping-by-sequencing
- Young-Ho Ha
- Hee-Young Gil
- Joo-Hwan Kim
Scientific Reports (2022)
Alkaloid binding to opium poppy major latex proteins triggers structural modification and functional aggregation
- Natali Ozber
- Samuel C. Carr
- Peter J. Facchini
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Optimization of a poppy GBS protocol for SNP discovery

Assessing genetic relatedness of 91 Papaver accessions

Population structure and genetic diversity amongst Papaver accessions

Variability in alkaloid content across genetically distinct poppy accessions

Phylogenetic relationships of P. somniferum accessions

Discussion

Methods

Plant materials and morphological characterization

DNA isolation, GBS library preparation and sequencing

Mapping and SNP calling

Population structure analysis

Genetic diversity and differentiation

Phylogenetic analysis

Alkaloid profiling

Genome size and ploidy analysis

Ethical standards

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links