Abstract
Allium macrostemon Bunge, commonly referred to as "no-biru" in Japan, is a widespread wild onion species found across the country. Despite being deeply entwined in ancient Japanese culture, it remains an underutilized crop in Japan. Determining the origins of its domestic populations and understanding their genetic composition is crucial to highlighting the plant's historical significance in Japan. This study aims to bridge this knowledge gap by examining the genetic diversity of 47 A. macrostemon samples from various regions in Japan using RAD-Seq. Our analyses distinguished unique population structures, dividing the samples into three distinct groups: A, B, and C. Notably, groups A and B showed clear evidence of bulb propagation, while group C did not. Group C formed four subgroups: C1, C2, C3, and C4. Hybridization between subgroup C1 and either group A, B, or both, resulted in the emergence of subgroups C2, C3, and C4. Thus, groups A, B, and C1 are posited as the ancestral populations. Additionally, our morphological observations indicated distinct differences among these three groups. Our findings also suggest that human migration may have influenced the plant's distribution, hinting at active usage in the past that later waned, causing its current underutilized status.
Similar content being viewed by others
Introduction
Wild Japanese garlic (Allium macrostemon Bunge), also known as "Chinese garlic" in English and "no-biru" in Japanese, belongs to the Amaryllidaceae family1. The plant is naturally distributed across eastern Russia, eastern Mongolia, mainland China, Taiwan, Korea, and Japan2,3,4,5. It has been utilized in traditional East Asian medicine for centuries1, 6, 7, in addition to serving as a food source. While A. macrostemon is notably popular as an edible Allium species in Korea and northeastern China4, Russia primarily uses its bulbs for pickling8. However, Japan's use of this plant deviates from these patterns. It is mostly harvested from the wild and occasionally sold in local markets, with cultivation limited to small scales. Hence, despite its extensive distribution, A. macrostemon is currently considered a neglected crop in Japan, possibly due to its low yield leading to underutilization9.
Historical references suggest that this plant was used medicinally in Japan10. Its mentions in classical Japanese literature point to active utilization in the past. Waka poems referencing this plant appear in the Kojiki ("Records of Ancient Matters"), compiled in 712, and the Manyoshu ("Collection of Ten Thousand Leaves"), an anthology of various waka poems written until 759. One waka poem in the Manyoshu mentions the plant's use in marinated food, while another describes the use of the bulbs for medicinal treatment. Despite these clues indicating the prior widespread use of this neglected crop, supportive genetic evidence is lacking. Therefore, studying the genetic diversity of A. macrostemon in Japan could shed light on this enigma.
The plant possesses significant medicinal properties, renowned for its analgesic, immune-boosting, and anti-asthmatic capabilities in traditional East Asian medicine1, 6, 7. Recent investigations into the chemical compounds found in A. macrostemon have revealed impressive potential for medical applications. Its bulb is rich in steroidal saponins, which show promise in treating conditions like acute myocardial ischemia11, hyperglycemia, hyperlipidemia, and visceral obesity12. These steroidal saponins also exhibit cytotoxic activities against human cancer cell lines13. Moreover, an aqueous extract from the dried bulbs reportedly shows antidepressant-like and analgesic effects14, 15. The leaves of A. macrostemon contain valuable constituents, such as flavonoids16, caffeic acid17, and phenolic compounds18, 19, suggesting potential utility in herbal medicine. Additionally, the plant's volatile organic compounds possess medicinal properties, including lipid-lowering and anti-atherosclerosis effects20. These discoveries underscore the remarkable potential of A. macrostemon as a herbal medicine.
Despite numerous studies exploring the medicinal components of A. macrostemon, its genetic diversity remains understudied. However, understanding genetic diversity is crucial as it provides a vital tool for monitoring and evaluating populations, which aids in the conservation planning of A. macrostemon. Moreover, knowledge of this plant's genetic diversity allows plant breeders to develop and improve new cultivars with desirable traits, such as pest and disease resistance, high yield potential, and superior characteristics21.
Compared to traditional DNA markers, single nucleotide polymorphisms (SNPs) are more abundant in plant genomes, making them a superior tool for identifying plant diversity22. Advances in high-throughput sequencing technologies have significantly reduced the cost and improved the efficiency of obtaining SNP markers, particularly through restriction site-associated DNA sequencing (RAD-Seq)23. This method facilitates the reduced representation of individual genomes. A RAD-Seq variation, termed double-digest RAD-Seq (ddRAD-Seq), utilizes two restriction enzymes to cut the genome24. Compared to traditional RAD-Seq methods, ddRAD-Seq library preparation is less costly, quicker, and requires less genomic material25. Consequently, ddRAD-Seq has become the preferred choice for species lacking genome information, such as A. macrostemon. No reports exist on the genetic relationships and intraspecific diversity of A. macrostemon using either traditional or high-throughput methods. Encouraged by our research group's previous successes with RAD-Seq or ddRAD-Seq in analyzing various species, including loquat26, 27, Japanese pepper28, citrus29,30,31, and firefly32, we are inspired to apply this knowledge to the study of A. macrostemon.
In this study, we explore the genetic diversity and population structure of wild Japanese garlic, A. macrostemon. We collected 47 samples from various regions in Japan, including Honshu, Kyushu, Shikoku, and Okinawa Island, with Hokkaido excluded. Subsequently, we conducted an in-depth analysis using genome-wide SNP markers obtained via RAD-Seq. Our objective is to reveal the historical context of A. macrostemon in Japan, particularly its association with past human activities.
Results
Sample collection of Allium macrostemon
We collected bulbs from 47 A. macrostemon plants from different locations in Japan, including fields and riverbanks. Figure 1 and Table 1 show details regarding the plants, their collection dates, and places of origin. We further created three technical replicate samples (samples 30 and 31, samples 14 and 15, as well as samples 17 and 32), resulting in a total of 50 samples subjected to ddRAD-Seq.
Variants detection using RAD-Seq data
The ddRAD-Seq of the 50 samples, as shown in Table 1, generated more than 10 Gbp of data, comprising a total of 196.2 million raw, single-end 51-bp reads. Quality-based filtering yielded an average of 0.9 million reads (ranging from a minimum of 0.2 million to a maximum of 3 million) across the 50 samples (Supplementary Table S1). The Stacks program-built loci de novo with an average coverage depth of 16.78-fold (Supplementary Table S2). We identified a total of 5848 variant sites for subsequent analysis.
The plant consists of three groups
Principal component analysis (PCA) of these variant sites effectively partitioned the 50 samples into three groups, labeled A, B, and C (Fig. 2). The division into these three groups was primarily driven by the principal components 1 and 2, which contribute 16.5% and 9.86% respectively. Analysis of principal components 3, 4, and 5 did not reveal additional meaningful features (Supplementary Fig. S1). Multidimensional scaling (MDS) analysis classified the samples into three groups, with the same members as identified by the PCA (Fig. 3).
Members of group A originated from three districts in Japan: the plants in Chugoku (35 and 42), Shikoku (26, 34, and 36), and Kyushu (39). Thus, they were found throughout Western Japan, excluding the Okinawa district. Members of group B, on the other hand, came from five districts in Japan: plants from Tohoku (21), Kanto (2, 12, 25, and 30, with 30's technical replicate 31), Chubu (8), Kinki (27), and Shikoku (14, and its technical replicate 15). These were absent from the Chugoku, Kyushu, and Okinawa districts, indicating that they were not found in Western Japan, except for the presence in the Shikoku district. Members of group C were from all districts in Japan: plants in Tohoku (5, 10, 22, 24, and 33), Kanto (29 and 48), Chubu (7, 11, 18, 23, and 49), Kinki (1, 6, 9, 13, and 47), Chugoku (19 and 20), Shikoku (3 and 36), Kyushu (16, 28, 37, 38, 40, 41, 43, 44, 45, 46, and 50), and Okinawa (17 and its technical replicate 32).
Cluster analysis was performed using a pairwise distance matrix (Fig. 4), which identified the same three groups as those identified in the PCA and MDS analyses: groups A and B each formed a tight cluster, while group C was an outgroup of groups A and B. The cluster analysis detected several subclusters, but, with two exceptions, it did not detect any region-specific subclusters. The two exceptions are the subclusters to which plants 5, 10, 22, and 33 from the Tohoku district and plants 16 and 46 from the Kyushu district belong.
Gene flow from group A and/or group B to group C was detected, but not from group C to group A or group B
We performed an admixture analysis to estimate the degree of genetic mixing. We used values from 1 to 10 as possible K values (symbolizing the number of ancestral populations), computed the cross-validation (CV) errors, and estimated the most likely K value, i.e., the K value with the lowest CV (Supplementary Fig. S2). The CV error was minimized at K = 3, indicating that the most likely number of ancestral populations is three, aligning with the number of groups identified above. The results of the admixture analysis are displayed (Fig. 5, Supplementary Fig. S3). The admixture analysis detected no gene flow into groups A and B from other groups. Conversely, this analysis identified gene flow into group C from either group A, group B, or both.
Group C consists of four subgroups
Analysis of the results at K = 3, the highest possible value, divided the individuals into four subgroups: subgroup C1, with no detected gene flow from either group A or group B (plants 4, 5, 9, 10, 22, 33, 43, 44, 48, and 49); subgroup C2, with detected gene flow only from group A (plants 7, 11, 23, and 24); subgroup C3, with detected gene flow only from group B (plants 1, 3, 16, 18, 20, 37, 41, 45, and 46); and subgroup C4, with detected gene flow from both groups A and B (plants 6, 13, 17 (and its technical replicate 32), 19, 28, 29, 38, 40, 47, and 50). Subgroups C1, C2, and C3 were also found in the PCA and MDS analyses. Cluster analysis demonstrated that subgroups C1, C2, and C3 formed one or two clusters. Additionally, subgroup C1 was the most distant from groups A and B in the PCA and MDS analyses. Within each subgroup, we did not identify characteristics specific to particular regions. The admixture analysis also displayed more intense gene flow into group C from group B than from group A.
Statistical interpretations of the classifications
We performed a statistical analysis on the six populations. The Fst value analysis (Table 2) exposed a high level of genetic differentiation between groups A and B, yet a comparatively low average degree of genetic differentiation between group A and subgroups C1, C2, C3, or C4, as well as between group B and any of these subgroups. Notably, the genetic differentiation between group B and either C1 or C2 was higher than that between group A and either C1 or C2. Conversely, the Fst values between group B and either C3 or C4 were lower compared to those between group A and either C3 or C4.
For each population, we conducted a statistical analysis. The average values of nucleotide diversity (π) and mean expected heterozygosity (He) were higher in the subgroups of group C; these values in groups A and B were similar and lower than in group C (Table 3, Supplementary Table S3). A negative Fis value indicates avoidance of inbreeding in groups A, B, and C2, while the relatively high Fis values in groups C1, C3, and C4 imply a moderate degree of inbreeding (Table 4).
We calculated the ratio of heterozygous loci for each individual (Supplementary Table S4). Ten individuals exhibited a relatively low ratio of heterozygous loci, with a value below 0.2. All ten plants (5, 7, 17, 28, 29, 33, 38, 43, 47, and 50) belonged to group C.
Both groups A and B propagated asexually, while group C propagated sexually
A. macrostemon reproduces asexually via bulbs, particularly with human intervention. We detected asexual reproduction in the plant by examining the conservation of heterozygous loci positions via pairwise alignments. The degrees of conservation for the technical replicates, plants 14 and 15, 30 and 31, and 17 and 32, were 49.0%, 47.6%, and 43.9%, respectively. De novo analysis of RAD-Seq data, which contains many missing values, is likely to result in 40–50% conservation between somatic clones. Consequently, we determined the presence or absence of asexual reproduction by setting the value of conservation at nearly 40% as the criterion. It should be noted that a similar value of conservation was also used as a criterion for clonal propagation in a study of Japanese pepper that used de novo analysis of RAD-Seq data28.
We investigated the conservation levels among plants in each group, both within and across districts (Table 4, Supplementary Table S5). The conservation level in group A was high, with over half of the pairs showing above 40% (40.8–52.1%). Some pairs had conservation levels slightly below 40% (36.6–39.7%). However, given the somatic mutations (including loss of heterozygosity) that occur during asexual reproduction, it is likely that these plants with slightly lower conservation values also propagated from the same ancestor via asexual reproduction. The conservation level in group B was high, with all pairs of plants showing an average of more than 40% (40.7–49.7%) except for one pair which showed 39.6%. Conversely, nearly all the pairs observed in group C showed a low level of conservation, with only six exceptions. Thus, groups A and B propagated from a single plant by asexual reproduction, whereas the members of group C propagated almost exclusively without asexual reproduction.
Morphological differences exist among ancestral populations A, B, and C1
Out of the 47 individuals subjected to RAD-Seq, 22 were cultured and analyzed for morphological data using nine distinct measurements (Table 5). Subsequently, we conducted a principal component analysis (PCA) on the 22 cultured individuals, using the acquired morphological data (Fig. 6a). Given that groups A, B, and C1 represent ancestral populations, a separate PCA was performed exclusively on the ten individuals from these groups (Fig. 6b). The first PCA that included all 22 individuals accounted for 41.28% of the total variance, with the second principal component contributing 25.8%. Conversely, in the PCA exclusive to the ten selected individuals, the first and second principal components accounted for 58.4% and 23.3% of the variance, respectively. Notably, while the graphical representation in Fig. 6a did not clearly delineate genetic or regional grouping, Fig. 6b distinctly showcased genetic groupings. These observations underscore the presence of morphological disparities among the ancestral populations A, B, and C1.
Based on the results of the PCA, we further investigated which morphological measurements primarily influenced the groupings observed in Fig. 6b. Among the variables, leaf width, maximum weight of bulbs, lateral diameter of the maximum bulb, leaf sheath diameter, and the number of plant tillers were characterized by elevated values in group B. Conversely, the number of daughter bulbs demonstrated higher values within group A. Leaf length, the number of leaves, and the total weight of bulbs displayed enhanced values in both group A and group B, indicating shared morphological features among these two groups. Remarkably, none of the measurements presented high values for group C1, suggesting distinct morphological characteristics within this group.
We examined the preceding study33 that analyzed the concentration of phenolics in A. macrostemon collected from various regions across Japan. Using data extracted from this study33, we conducted a PCA, employing the quantitative values of functional components as variables. However, similar to the results in Fig. 6a, this PCA did not facilitate the delineation of samples based on genetic or regional clusters.
Discussion
The key revelation of this study is the historical relevance of A. macrostemon, a plant currently overlooked in Japan but widely used in earlier times. With minimal exceptions, no distinct regional clusters were discernible across all groups. Previous reports have noted the lack of a clear correlation between the plant's collection sites and the content of flavonoids and ferulic acid glycosides18, 19, 33. This absence of phenotypic differentiation between various collection sites aligns with the genetic findings of our study. These results suggest that although the plant's inter-regional migration is currently inactive, it was once actively transported, with its bulbous nature potentially facilitating this movement. Its historical significance is underscored by its wide usage as both a vegetable and medicinal resource.
The previously active utilization of this plant influence our interpretation of classical Japanese literature. Mentions of the plant are found in Japanese literary works such as the Kojiki and Manyoshu. Modern interpretations often suggest that individuals in the past had a fondness for wild plants. However, given the active use of A. macrostemon as a food source, it is plausible that emperors and poets from these historical periods may have appreciated it not merely as a wild plant but as a cultivated vegetable. This potential reevaluation offers a more nuanced understanding of cultural and lifestyle patterns in historic Japan.
Furthermore, this study revealed that A. macrostemon exhibits two distinct reproductive strategies. Groups A and B, which were clearly established as discrete groups, demonstrated evidence of clonal propagation based on the analysis of the conservation of heterozygous loci positions. This suggests that each group represents a population formed primarily through asexual reproduction. The negative Fis values observed in groups A and B are also likely related to this clonal propagation (Table 3). Therefore, the principal mode of reproduction within groups A and B appears to be through bulb propagation, highlighting a notable asexual reproduction strategy.
Conversely, group C exhibited a more complex population structure and demonstrated a largely non-clonal mode of propagation, with a few exceptions (Table 4). This suggests that group C primarily employs an alternative, bulb-independent mode of reproduction, most likely through hybridization leading to seed formation. Within group C, several members demonstrated notably low heterozygosity. This shift towards homozygosity is likely related to the reproductive dynamics brought about by hybridization.
Our study subdivided group C into four subgroups. Among them, subgroup C1 showed no signs of gene flow from either group A or group B. Furthermore, there was no evidence of clonal propagation within subgroup C1. Thus, group A, group B, and subgroup C1 represent distinct ancestral populations. Gene flow from group A, group B, or both, was observed in other subgroups within group C, contributing to increased genetic diversity within this group (Fig. 5). The possibility of gene flow due to human intervention cannot be discounted, but natural hybridization seems more probable. Human activities, including the movement of plants and bulbs, might have heightened the chances of natural hybridization between groups. This implies that human-induced translocation of this plant might have facilitated the augmentation of its genetic diversity. Consequently, this observation suggests that incidental human intervention may escalate the genetic diversity of a plant species, even one that can also propagate in the wild.
We conducted a cultivation trial to ascertain the morphological traits of this plant's genetic resources in Japan. Although we observed morphological variations, a discernible pattern remained elusive, prompting us to withhold the publication of these results. Nevertheless, the RAD-Seq analysis uncovered that groups A, B, and C1 formed the ancestral populations. Consequently, we re-evaluated only these three groups based on morphological characteristics, resulting in their partition into three distinct clusters. Essentially, excluding the population resulting from hybridization, Japanese A. macrostemon was genetically and morphologically divided into three groups. It was not possible to classify the three ancestral groups based on differences in their functional components. Still, we remain open to the possibility that future research may uncover componential differences among three groups. If ongoing research finds no compositional characteristics distinguishing the three ancestral groups, and if historical records indicate distinct recognition of each group, it might suggest that past populations valued this plant more for its vegetable properties than its medicinal benefits.
Given its widespread presence in Japan, A. macrostemon might have originated locally or been introduced from overseas, either in its entirety or in parts. Notably, groups A and B, which primarily reproduce through bulb propagation—a method well-suited for human-assisted migration, might have been introduced from foreign regions. In this scenario, the most desirable varieties might have been selected overseas and then imported into Japan. Conversely, group C, which displays a bulb-independent mode of reproduction, could be endemic to Japan. This group's unique reproductive strategy may have evolved to adapt to the specific environmental conditions prevalent in Japan, suggesting a native origin. However, more comprehensive research is required to conclusively determine the plant's origin and the pathways of its spread.
Interestingly, the Japanese word 'no-biru' also has another name, 'chosen no-biru'. 'Chosen' means Korea, suggesting that part of this plant may have been introduced to Japan from Korea34. However, 'chosen' also has other meanings. 'Chosen' is used as a prefix for closely related organisms, like different species within the same genus. It is therefore possible that the plant was previously differentiated into two species. Although it remains unclear whether 'chosen' refers to Korea or to a closely related organism, the possibility that people in the past distinguished at least two species of this plant is intriguing and suggests a link with the grouping in this study. The existence of morphological differences among ancestral populations A, B, and C1 also supports this possibility. Further studies analyzing individuals from China and Korea, which would allow us to speculate on the evolution of this plant and the route of its arrival in Japan, would provide further clarification.
A critical reference for discussing the potential foreign introduction of A. macrostemon is our prior study on the scallion mosaic virus (ScaMV) in the family Potyviridae that infects this plant species35. This earlier research revealed that the genome types of ScaMV found in Japan are dispersed among various genomic patterns, suggesting multiple introductions into Japan via different routes. Furthermore, it was shown that the parent ScaMV genomes recombined relatively recently in Japan and were subsequently spread across various districts by human activities. This supports the speculation that the plant might have been established in Japan prior to the introduction of the virus. Nevertheless, it does not entirely exclude the possibility that the plant and the virus were introduced simultaneously as part of the same event. In essence, this underlines the complex historical and ecological interplays between A. macrostemon, its associated viruses, and human activities.
In conclusion, our findings provide valuable insights into the genetic resources of A. macrostemon collected across Japan. Through genetic-level investigations, we have provided evidence suggesting past human-induced migration of this species. In this study, the use of RAD-Seq technology was instrumental in analyzing the genetic diversity of a species without prior genomic information, similar to what has been done previously with Japanese pepper28. Looking forward, conservation efforts for this plant's genetic resources will require attention to not only genetic diversity but also key traits such as the chemical compounds present in the plant's bulbs and leaves, which possess potential value for culinary uses.
Methods
Plant materials
A total of 47 wild Japanese garlic (A. macrostemon) samples were collected from various sites along riverbanks and fields across Japan. We searched for wild Japanese garlic plants on foot and by car. All methods involving plants were carried out in accordance with relevant guidelines and regulations. Plants were collected with permission from their owners. All wild Japanese garlics were maintained at the Center for Education and Research in Agricultural Innovation, Saga University (Saga, Japan). Detailed information about each plant sample, its origin, and the respective collection date is provided in Table 1. Certain samples served as technical replicates of the same plant, samples 30 and 31, samples 14 and 15, as well as samples 17 and 32.
Cultivation trial and measurement
The cultivation trial of A. macrostemon was conducted in a field (grey lowland soil, pH 6.1, EC 0.19 mS cm−1) located at Saga University (latitude 33° 30′ 91.308″ N; longitude 130° 33′ 28.119″ W). The compound of fertilizer (N:P2O5:K2O = 14:14:14%) was applied at 8 kg of N, phosphate, and potassium each at 10 a−1, and the field was covered with black mulch. The bulbs were planted on 9 October 2015 and 12 October 2016. The planting density was set with a 150 cm row width and 25 cm spacing between plants both lengthwise and widthwise. Five medium-scale plants of the isolate were arranged per row, with one plant per hole. We recorded various morphological parameters over a 2-year period (2016 and 2017), including the number of leaves, plant number of tillers, leaf length and width, leaf sheath diameter, the number of daughter bulbs, the total and maximum weight of bulbs, and the lateral diameter of the maximum bulb.
DNA extraction and double-digest restriction site amplified DNA sequencing (ddRAD-Seq)
The extraction and purification of DNA were performed using the CTAB method36, followed by RNase treatment. The quality of the isolated genomic DNA was validated through 0.8% agarose gel electrophoresis. The DNA concentration was quantified using a Qubit dsDNA BR Assay Kit (Invitrogen, MA, USA). The library for ddRAD-Seq was prepared based on the method of Sakaguchi et al.37, with some adaptations from the original protocol25. The libraries were sequenced using 51-bp single-end reads at Macrogen (Seoul, Korea) on one lane of a HiSeq 2000 (Illumina, San Diego, CA, USA).
De novo mapping and variant calling
The denovo_map.pl script from the Stacks package (version 2.60)38, a wrapper for ustacks, sstacks, tsv2bam, and gstacks, was used to map reads de novo without using a reference genome sequence. The options for denovo_map.pl were -M 4 (number of mismatches allowed between stacks within individuals [for ustacks]), -n 4 (number of mismatches allowed between stacks between individual [for cstacks]), and -m 3 (number of identical reads required to initiate a new putative allele [for ustacks]). Genotyping data were created by assigning one individual per population to specify all the samples. After performing denovo_map.pl, the populations program of the Stacks package (version 2.60) created the vcf (variant call format)39, plink40, and phylip files using the -R 0.5 (minimum percentage of individuals across populations required to process a locus), -write-single-snp (restrict data analysis to only the first SNP per locus), -min-maf 0.05 (minimum minor allele frequency required to process a nucleotide site at a locus), -vcf, -plink, and -phylip-var-all options.
Principal component analysis, multidimensional scaling analysis, and cluster analysis
PCA and MDS analyses were performed using the vcf file generated by the populations program. The SNPRelate program41 in the R software environment (version 4.1.2) was employed to convert the vcf file into a gds (genomic data structure) file and generate PCA and MDS plots. The contribution ratio for each principal component was also computed. Only bi-allelic loci were included in this analysis. The SNPRelate package was further used to construct a dendrogram, calculating identity by state (IBS) pairwise distances. Image outputs were produced using the basic functions within the R software environment.
Principal component analysis of morphological test and functional component analysis
PCA of morphological tests and functional component analysis were performed using R software (version 4.1.3). Input data for these analyses were sourced from CSV files containing tables of morphological tests or functional component results. The R software (version 4.1.3) was used to calculate important component and generate PCA plots. The contribution ratio for each principal component was also computed. The diagram outputs were produced using the basic functions within the R software environment.
Admixture analysis
The plink program (plink 2, version 1.90p)40 was used to generate the input files required for the admixture program. The admixture program (version 1.3.0)42 was utilized to determine the admixture history and cross-validation (CV) error for the hypothetical runs from K (number of ancestral populations) = 1–10. The CV error plot was drawn using the received log data to ascertain the optimal K value. The R software (version 4.1.2) was utilized to create admixture plots using the Q estimate files generated by the admixture program.
Statistical analysis
Groups differentiated by the afore mentioned analyses were designated as separate populations in the population map data required for the Stacks package (version 2.60)38. The denovo_map.pl script was re-executed following data modifications in the population map. The parameters used to run the populations program included -R 0.7, -min-maf 0.05, -write-single-snp, -fstats, and -fst-correction.
Analysis of asexual reproduction
The occurrence of asexual reproduction was verified by examining the conservation of heterozygous loci positions using pairwise alignments. The populations program was also used to create pairwise alignment between the two individuals. In this study, the -R option was set to one. The percentage of conserved heterozygous loci was calculated by dividing the number of conserved sites by the total number of variant sites (provided by the populations program).
English writing
The chatGPT was used solely for the purpose of improving English writing.
Data availability
Sequences are available at the DNA Data Bank of Japan Sequence Read Archive (https://ddbj.nig.ac.jp/resource/sra-submission/DRA013735; Accession no. DRA013735).
References
Fritsch, R. M. & Friesen, N. Evolution, domestication and taxonomy. In Allium Crop Science: Recent Advances (eds Rabinowitch, H. D. & Currah, L.) 5–30 (CABI Publishing, 2002). https://doi.org/10.1079/9780851995106.0005.
Chukhina, I.G. Allium macrostemon. http://www.agroatlas.ru/en/content/related/Allium_macrostemon/ (2008). Accessed 10 Feb 2023.
Oyuntsetseg, B., Beket, U. & Munkhjargal, B. Allium macrostemon. In Mongolian Red List and Conservation Action Plans of Plants, Ulaanbataar. Mongolia (Jamsran, Ts., Sanchir, Ch., Bachman, S., Soninkhishig, N., Gombobaatar, S., Baillie, J. E. M. & Tsendeekhuu, Ts., eds) (2011).
Choi, J. H. & Oh, B. U. A partial revision of Allium (Amaryllidaceae) in Korea and north-eastern China. Bot. J. Linnean Soc. 167(2), 153–211. https://doi.org/10.1111/j.1095-8339.2011.01166.x (2011).
Wu, Z. Y., Raven, P. H., & Hong, D. Y. Flora in China. Beijing/St.Louise: Science Press/Missousi Botanical Garden Press. http://www.efloras.org/florataxon.aspx?flora_id=3&taxon_id=200027501 (1994).
Duke, J. A. & Ayensu, E. S. Medicinal plants of China. Reference publications. Algonac. J. Bot. Taxon. Geobot. https://doi.org/10.1002/fedr.19870980707 (1985).
Zhang, Z. et al. A comparative study of the neutral and acidic polysaccharides from Allium macrostemon Bunge. Carbohydr. Polym. 117, 980–987. https://doi.org/10.1016/j.carbpol.2014.10.019 (2015).
Hanelt, P. Alliaceae. In Mansfeld’s Encyclopedia of Agricultural and Horticultural Crops 3rd edn, Vol. 4 (ed. Hanelt, P.) 2250–2269 (Springer, 2001).
Rhodes, L. & Maxted, N. Allium macrostemon. The IUCN Red List of Threatened Species 2016: e.T201505A47600689. https://doi.org/10.2305/IUCN.UK.2016-3.RLTS.T201505A47600689.en (2016). Accessed 29 Dec 2022.
Fujii, Y. Historical background and dissemination of Baker’s garlic in Japan. Bull. Faculty Agric. Tottori Univ. 37, 159–168 (1985) (In Japanese).
Li, F., Xu, Q., Zheng, T., Huang, F. & Han, L. Metabonomic analysis of Allium macrostemon Bunge as a treatment for acute myocardial ischemia in rats. J. Pharm. Biomedic. Anal. 88, 225–234. https://doi.org/10.1016/j.jpba.2013.09.002 (2014).
Xie, W. et al. Novel effects of macrostemonoside A, a compound from Allium macrostemon Bung, on hyperglycemia, hyperlipidemia, and visceral obesity in high-fat diet-fed C57BL/6 mice. Eur. J. Pharm. 599, 159–165. https://doi.org/10.1016/j.ejphar.2008.09.042 (2008).
Chen, H. F., Wang, G. H., Luo, Q., Wang, N. L. & Yao, X. S. Two new steroidal saponins from Allium macrostemon Bunge and their cytotoxity on different cancer cell lines. Molecules 14, 2246–2253. https://doi.org/10.3390/molecules14062246 (2009).
Lee, S. et al. Antidepressant-like activity of the aqueous extract of Allium macrostemon in mice. J. Ethnoparmacol. 131, 386–395. https://doi.org/10.1016/j.jep.2010.07.015 (2010).
Yang, X. et al. Allium macrostemon Bunge. exert analgesic activity by inhibit NaV17 channel. J. Ethnopharmacol. 281, 114495. https://doi.org/10.1016/j.jep.2021.114495 (2021).
Nakane, R. & Iwashina, T. Flavonol glycosides from the leaves of Allium macrostemon. Nat. Product Com. 10, 1381–1382. https://doi.org/10.1177/1934578X1501000817 (2015).
Nakayama, H. et al. Glucose esters of caffeic acid from Allium macrostemon Bunge. Jpn. J. Food Chem. Saf. 27(19), 28–32. https://doi.org/10.18891/jjfcs.27.1_28 (2020).
Usui, A. et al. Ferulic acid esters of oligo-glucose from Allium macrostemon. Nat. Product Com. 12, 89–91. https://doi.org/10.1177/1934578X1701200125 (2017).
Usui, A. et al. Phenolic constituents from Allium macrostemon Bunge. J-Stage. 71, 51–52. https://doi.org/10.24684/jspharm.71.1_51 (2017).
Qin, Z. et al. Characterization of volatile organic compounds with anti-atherosclerosis effects in Allium macrostemon Bge. and Allium Chinese G. Don by head space solid phase microextraction coupled with gas chromatography tandem mass spectrometry. Front. Nutr. 10, 996675. https://doi.org/10.3389/fnut.2023.996675 (2023).
Govindaraj, M., Vetriventhan, M. & Srinivasan, M. Importance of genetic diversity assessment in crop plants and its recent advances: An overview of its analytical perspectives. Hindawi Publs. Corp. 2015, 1–14. https://doi.org/10.1155/2015/431487 (2015).
Yu, Q. et al. RAD-seq as an effective strategy for heterogenous variety identification in plants-a case study in Italian Ryegrass (Lolium multiflorum). BMC Plant Biol. 22, 331. https://doi.org/10.1186/s12870-022-03617-6 (2022).
Andrews, K. R., Good, J. M., Miller, M. R., Luikart, G. & Hohenlohe, P. A. Harnessing the power of RADseq for ecological and evolutionary genomics. Genetics 17, 81–92. https://doi.org/10.1038/nrg.2015.28 (2016).
Yang, G. Q. et al. Development of a universal and simplified ddRAD library preparation approach for SNP discovery and genotyping in angiosperm plants. Plant Methods 12, 39. https://doi.org/10.1186/s13007-016-0139-1 (2016).
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S. & Hoekstra, H. E. Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS One 7, e37135. https://doi.org/10.1371/journal.pone.0037135 (2012).
Fukuda, S. et al. Construction of a high-density linkage map for bronze loquat using RAD-Seq. Sci. Hortic. 251, 59–64. https://doi.org/10.1016/j.scienta.2019.02.065 (2019).
Nagano et al. Genetic diversity of Loquat (Eriobotrya japonica) revealed using RAD-Seq SNP markers. Sci. Rep. 12, 10200. https://doi.org/10.1093/bioinformatics/bts606 (2022).
Premarathne, M. D. G. P. et al. Elucidation of Japanese pepper (Zanthoxylum piperitum De Candolle) domestication using RAD-Seq. Sci. Rep. 11, 6464. https://doi.org/10.1038/s41598-021-85909-9 (2021).
Penjor, T., Mimura, T., Matsumoto, R., Yamamoto, M. & Nagano, Y. Characterization of limes (Citrus aurantifolia) grown in Bhutan and Indonesia using high-throughput sequencing. Sci. Rep. 4, 4853. https://doi.org/10.1038/srep04853 (2014).
Penjor, T. et al. RAD-Seq analysis of typical and minor Citrus accessions, including Bhutanese varieties. Breed. Sci. 66, 797–807. https://doi.org/10.1270/jsbbs.16059 (2016).
Nagano, Y. et al. Phylogenetic relationships of Aurantioideae (Rutaceae) based on RAD-Seq. Tree Genet. Genomes 14, 6. https://doi.org/10.1007/s11295-017-1223-z (2018).
Kato, D. et al. Evaluation of the population structure and phylogeography of the Japanese Genji firefly, Luciola cruciata, at the nuclear DNA level using RAD-Seq analysis. Sci. Rep. 10, 1533. https://doi.org/10.1038/s41598-020-58324-9 (2020).
Ishimaru, K., Usui, A., Ohshima, K. & Fukuda, S. Phenolic concentration of Allium macrostemon Bunge collected in various area of Japan. Jpn. J. Food Chem. Saf. 25(1), 25–32 (2018) (In Japanese).
Kajita, T & Yonekura, K. Botanical Japanese name-scientific index YList. http://ylist.info/ (2021). Accessed 22 May 2023.
Ohshima, K. et al. Genomic epidemiology and evolution of Scallion mosaic potyvirus from asymptomatic wild Japanese garlic. Front. Microbiol. 12, 789596. https://doi.org/10.3389/fmicb.2021.789596 (2021).
Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 19, 11–15 (1987).
Sakaguchi, S. et al. High-throughput linkage mapping of Australian white cypress pine (Callitris glaucophylla) and map transferability to related species. Tree Genet. Genomes 11, 121. https://doi.org/10.1007/s11295-015-0944-0 (2015).
Rochette, N. C., Rivera-Colón, A. G. & Catchen, J. M. Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics. Mol. Ecol. 28, 4737–4754. https://doi.org/10.1111/mec.15253 (2019).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158. https://doi.org/10.1093/bioinformatics/btr330 (2011).
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. https://doi.org/10.1086/519795 (2007).
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328. https://doi.org/10.1093/bioinformatics/bts606 (2012).
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. https://doi.org/10.1101/gr.094052.109 (2009).
Acknowledgements
This work was partially supported by a Japanese Government (MEXT) scholarship, a KAKENHI Grant-in-Aid for Scientific Research(C) (18KT0092) from the Japan Society for the Promotion of Science (JSPS) and grant from the United Graduate School of Agricultural Sciences, Kagoshima University, Japan. This research was part of the dissertation submitted by the first author in partial fulfillment of a Ph.D. degree. All authors have provided consent.
Author information
Authors and Affiliations
Contributions
K.O., K.I., Y.N., and S.F. designed the study. K.O. and S.F. collected samples. K.H. performed DNA extraction. W.P., S.K., Y.N. and S.F. performed bioinformatics analysis. A.J.N. performed RAD-Seq analysis. K.H. and S.F. performed cultivation test. W.P., Y.N., and S.F. analyzed the data; W.P., Y.N., and S.F. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Probowati, W., Koga, S., Harada, K. et al. RAD-Seq analysis of wild Japanese garlic (Allium macrostemon Bunge) growing in Japan revealed that this neglected crop was previously actively utilized. Sci Rep 13, 16354 (2023). https://doi.org/10.1038/s41598-023-43537-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-43537-5
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.