Complete chloroplast genomes shed light on phylogenetic relationships, divergence time, and biogeography of Allioideae (Amaryllidaceae)

Namgung, Ju; Do, Hoang Dang Khoa; Kim, Changkyun; Choi, Hyeok Jae; Kim, Joo‑Hwan

doi:10.1038/s41598-021-82692-5

Article
Open access
Published: 05 February 2021

Ju Namgung¹^na1,
Hoang Dang Khoa Do^1,2^na1,
Changkyun Kim¹,
Hyeok Jae Choi³ &
…
Joo‑Hwan Kim¹

Scientific Reports volume 11, Article number: 3262 (2021)

Abstract

Allioideae includes economically important bulb crops such as garlic, onion, leeks, and some ornamental plants in Amaryllidaceae. Here, we report the complete chloroplast genome (cpDNA) sequences of 17 species of Allioideae, five of Amaryllidoideae, and one of Agapanthoideae. These cpDNA sequences represent 80 protein-coding, 30 tRNA, and four rRNA genes, and range from 151,808 to 159,998 bp in length. Loss and pseudogenization of multiple genes (i.e., rps2, infA, and rpl22) appear to have occurred multiple times during the evolution of Allioideae. Additionally, eight mutation hotspots, including rps15-ycf1, rps16-trnQ-UUG, petG-trnW-CCA, psbA upstream, rpl32-trnL-UAG, ycf1, rpl22, matK, and ndhF, were identified in the studied Allium species. We also present the first phylogenomic analysis among the four tribes of Allioideae based on 74 cpDNA coding regions of 21 species of Allioideae, five species of Amaryllidoideae, one species of Agapanthoideae, and five species representing selected members of Asparagales. Our molecular phylogenomic results strongly support the monophyly of Allioideae, which is sister to Amaryllidoideae. Within Allioideae, Tulbaghieae was sister to Gilliesieae-Leucocoryneae whereas Allieae was sister to the clade of Tulbaghieae-Gilliesieae-Leucocoryneae. Molecular dating analyses revealed the crown age of Allioideae in the Eocene (40.1 mya) followed by differentiation of Allieae in the early Miocene (21.3 mya). The split of Gilliesieae from Leucocoryneae was estimated at 16.5 mya. Biogeographic reconstruction suggests an African origin for Allioideae and subsequent spread to Eurasia during the middle Eocene. Cool and arid conditions during the late Eocene led to isolation between African and Eurasian species. African Allioideae may have diverged to South American taxa in the late Oligocene. Rather than vicariance, long-distance dispersal is the most likely explanation for intercontinental distribution of African and South American Allioideae species.

Introduction

Allioideae Herbert, a subfamily of Amaryllidaceae (Asparagales), comprises four tribes, 13 genera and over 900 species¹. The subfamily is widely distributed in temperate and subtropical regions of the Northern Hemisphere and South America, and occurs locally in South Africa². Most Allioideae are economically important plants used in traditional medicine, in horticulture, and as ornamentals. Within Amaryllidaceae, Allioideae can easily be distinguished from the other subfamilies by its superior ovary and solid styles. These subfamilies are further characterized by possession of unique chemical compounds³. Molecular phylogenetic studies have demonstrated the monophyly of each subfamily of Amaryllidaceae using chloroplast (cp) DNA sequence data^4,5,6. Despite the morphological, anatomical, chemical, and molecular distinctiveness of Allioideae, its sister group is controversial. Meerow et al.⁴ suggested that Allioideae is sister to the Agapanthoideae–Amaryllidoideae clade based on two cpDNA regions (rbcL and trnL-F), a relationship also reported by Costa et al.⁷ from a four-locus dataset. A more recent analysis of four cpDNA genes by Chen et al.⁶ found support for a sister relationship between Allioideae and Amaryllidoideae, which is in agreement with the results of Steele et al.⁵ and Xie et al.⁸. Although these studies shed light on the molecular systematics of Amaryllidaceae, a comprehensive phylogenetic analysis using complete cp genome sequences has not yet been conducted to resolve the systematic position of Allioideae within the family.

Within Allioideae, four tribes (Allieae, Gilliesieae, Leucocoryneae, and Tulbaghieae) are recognized based on the presence or absence of corona, flower symmetry, style position, and the presence or absence of septal nectaries^1,9. For example, Allieae, comprising a single genus (Allium) classified in 15 subgenera⁹, is defined by a combination of traits including a gynobasic style, actinomorphic flowers, and absence of corona and septal nectaries. Previous molecular phylogenetic analyses of Allioideae have shown that each tribe forms a well-supported clade^10,11. However, disagreement about the relationships among tribes Gilliesieae, Tulbaghieae, and Leucocoryneae within Allioideae continues. Souza et al.¹¹ examined phylogenetic relationships within Allioideae using combined nuclear ribosomal internal transcribed spacer (nrITS) and single cpDNA marker (trnG intron) sequences, and revealed that Gilliesieae is closely related to the Tulbaghieae–Leucocoryneae clade. Later, Sassone and Giussani² revealed that Tulbaghieae is sister to the Gilliesieae-Leucocoryneae clade based on nrITS and two cpDNA (ndhF and matK) sequences, which agrees with the results of Costa et al.⁷. These studies focused on the tribe and genus levels and used limited DNA regions. Thus, the phylogenetic relationships among tribes within Allioideae could be clarified by including more DNA regions.

Understanding the disjunct distribution pattern of plant groups has long been a major focus of biogeography^7,12. Within the molecular phylogenetic framework, biogeographic origin and migration routes leading to the present disjunct distributions of a variety of plant taxa have been inferred¹³. In particular, study of the biogeographic history of plants with disjunct distributions between the Northern and Southern Hemispheres is informative, as it can provide information about global biodiversity. Two main migration routes between the Northern and Southern Hemispheres have been recognized: one between North and South America and the other between Europe and Africa, several times in the Tertiary^14,15. Migration between Asia and Australia in the Miocene and later has been reported, but less commonly¹⁶. In Allioideae, Allieae is widely distributed in the Northern Hemisphere including Eurasia and North America, while all other tribes are endemic to South Africa or South America^2,10. Tulbaghieae is endemic to South Africa; Gilliesieae and Leucocoryneae are restricted to South America, with one exception, Nothoscordum bivalve (L.) Britton, which has expanded its range as far as the southern half of the USA. Therefore, the distribution pattern of Allioideae offers an ideal opportunity for understanding the biogeographic origins and migration routes of plant groups showing disjunct distribution between the two hemispheres. To determine the migration patterns of these Northern–Southern Hemisphere disjunct species, the most efficient way is to estimate their divergence times using DNA sequences and resolved phylogenies¹⁵. Several studies have estimated the divergence time of the major clades of Allioideae using DNA sequences^2,6,7,8,17. Chen et al.⁶ used four cpDNA coding regions to estimate the divergence times of families and major subfamilies of Asparagales. They suggested that the crown node of Allioideae occurred 37 million years ago (mya) in the late Eocene. Li et al.¹⁷ and Sassone and Giussani² estimated the crown nodes of Allioideae tribes between 18 (Gilliesieae) and 34 mya (Allieae). However, these studies did not determine the divergence times of taxa that are distributed disjunctly between the Northern and Southern Hemispheres. The latest study on the evolutionary history of Allioideae revealed an ‘out of India’ origin of Allieae before a widespread distribution in the Northern Hemisphere based on the nrITS and three cpDNA (matK, ndhF, and rbcL) sequence data⁷.

The chloroplast genome (cpDNA), being inherited maternally (> 85%), paternally, or biparentally and containing coding genes necessary for photosynthesis, provides useful data for phylogenetic studies, biogeographical analyses, and reconstruction of the evolutionary history of angiosperms^18,19,20. Numerous studies have been conducted on the complete cpDNA of Allioideae. However, these investigations focused on few species or small groups within Allioideae^8,21,22. Therefore, the uncertainties of Allioideae phylogeny have not been fully resolved with regard to subgeneric, tribal, and subfamilial relationships. Here, we sequenced 23 chloroplast genomes representing four tribes of Allioideae using next-generation sequencing (NGS) technology. Using these results together with published cpDNA data for Asparagaceae, Xanthorrhoeaceae, and Iridaceae, we aim to (1) explore cpDNA evolution in Allioideae; (2) clarify the tribal and subfamilial relationships of Allioideae and related taxa; (3) estimate the divergence times of Allioideae; and (4) reconstruct the biogeographic history of the subfamily.

Results

Comparative analysis of cpDNA features in Allioideae and related taxa

The cpDNA genomes of Allioideae have a quadripartite structure that includes a large single copy (LSC), a small single copy (SSC), and two inverted repeat (IR) regions (Fig. 1). However, cpDNA genome size varies from 145,819 to 157,735 bp (Table 1). Among the four tribes of Allioideae, Allieae species have the smallest cpDNA (Allium paradoxum; 145,819 bp) and the largest cpDNA (Allium tuberosum; 157,735 bp). Most cpDNA of Allioideae is smaller than those of Agapanthoideae (157,055 bp) and Amaryllidoideae (ranging from 158,355 to 159,998 bp). Additionally, the GC content of cpDNA sequence in Allium species (generally ≤ 37.1%) is lower than those of other examined taxa (Table 1).
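The GC percentages compared above can be recomputed from any assembled plastome sequence. The following is a minimal Python sketch, not the authors' pipeline; the function name `gc_content` and the choice to ignore ambiguous symbols such as N are assumptions made here for illustration.

```python
def gc_content(seq: str) -> float:
    """Return the GC percentage of a DNA sequence.

    Assumption for this sketch: symbols other than A, C, G, T
    (e.g. N or gap characters) are excluded from the denominator.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G, or T")
    return 100.0 * gc / (gc + at)


if __name__ == "__main__":
    # Toy sequence, not real plastome data.
    print(gc_content("ATGCATTTAAGCNNAT"))
```

Applied to a full genome string read from a FASTA file, the same function gives a whole-plastome GC value that can be set against the figures in Table 1.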

Table 1 Some features of chloroplast genomes in Allioideae and related species.

Although cpDNA size is variable, its gene content and order are quite stable among Allioideae and related taxa; it includes 80 protein-coding genes, 30 tRNAs, and four rRNAs (Fig. 1, Table 1, and Table S1). The loss of infA was observed in Allium monanthum, A. karataviense, A. ampeloprasum, A. macleanii, and A. spicatum, whereas complete deletion of rpl22 and rps16 was recorded in A. monanthum and A. platyspathum, respectively (Table 1). In addition to the loss of protein-coding regions, pseudogenization was annotated in some regions of the examined species, including rps2 in most examined species of Allieae (excluding A. fistulosum, A. macleanii, A. caeruleum, A. chinense, A. prattii, and A. pskemense), matK (A. karataviense, A. spicatum, and A. siculum), infA (A. monanthum, A. ochotense, A. tricoccum, A. siculum, A. victorialis, A. prattii, A. nanodes and Tulbaghia violacea), rps16 (A. neriniflorum, A. schoenoprasum, A. ampeloprasum, A. chinense, and A. obliquum), rpl23 (A. spicatum and T. violacea), accD (A. nigrum, A. cepa, and T. violacea), cemA (T. violacea), ycf2 (A. neriniflorum), rpl36 (A. caeruleum), atpB and rbcL (A. prattii and A. nanodes) and ycf1 (Gilliesia graminea). Notably, A. paradoxum showed complete deletion of rpl22, ndhF, ndhG, and rps2, and pseudogenization of infA, ndhJ, ndhK, ndhC, ndhD, ndhE, ndhI, ndhH, and ndhA (Table 1). Also, pseudogenization of ycf15 was observed in all examined chloroplast genomes of Amaryllidaceae and outgroups.

The boundaries between the LSC and IR regions are quite similar among Allioideae and other examined species, located in the coding region of rpl22 (Table 1). However, the expansion lengths are variable, ranging from 3 to 39 bp. By contrast, the LSC-IR junction is within rps19 (10 bp) in A. karataviense and A. spicatum. In A. monanthum, the LSC-IR border is in the intergenic spacer (IGS) between rps19 and rpl22, which was also observed in Agapanthus coddii, Lycoris radiata, Asparagus officinalis, Yucca filamentosa, Xanthorrhoea preissii, and Iris koreana. Notably, Nothoscordum bonariense (Leucocoryneae) has a unique LSC-IR junction within the IGS between trnH-GUG and rps19. Similar to the variation of the LSC-IR border, three types of junction between SSC and IR regions were observed, including overlap, adjunction, and gap between ycf1 and ndhF. Notably, adjunction was only found in Lycoris radiata. By contrast, the overlap and gap boundaries are common in Allioideae and related taxa (Table 1).

The Pi values of nucleotide diversity range from 0 to 0.08718 in Allium species and reach 0.09956 in Allioideae (Table S2) with the variation of noncoding regions being greater than that of coding sequences. In Allioideae, hotspot regions include rps15-ycf1, rps16-trnQ-UUG, petG-trnW-CCA, psbA upstream, rpl32-trnL-UAG, ycf1, rpl22, matK, and ndhF. Similarly, high variation in the DNA sequences of examined Allium species was found in rps15-ycf1, petD-rpoA, petG-trnW-CCA, psbA upstream, rpl32-trnL-UAG, ycf1, infA, rps2, and ndhF.
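To make the Pi statistic concrete, here is a small Python sketch of nucleotide diversity computed as the mean proportion of pairwise differences, plus a sliding-window scan of the kind commonly used to locate hotspots. The source does not state which software or window settings produced Table S2, so the function names, the window and step parameters, and the pairwise exclusion of gaps and ambiguous bases are all assumptions of this sketch.

```python
from itertools import combinations


def nucleotide_diversity(aligned):
    """Mean proportion of differing sites over all sequence pairs (pi).

    Assumption: sites where either sequence has a gap or ambiguous
    base are skipped for that pair (pairwise deletion).
    """
    if len(aligned) < 2:
        raise ValueError("need at least two aligned sequences")
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must have equal aligned length")
    per_pair = []
    for a, b in combinations(aligned, 2):
        compared = diffs = 0
        for x, y in zip(a.upper(), b.upper()):
            if x in "ACGT" and y in "ACGT":
                compared += 1
                diffs += x != y
        if compared:
            per_pair.append(diffs / compared)
    return sum(per_pair) / len(per_pair) if per_pair else 0.0


def sliding_pi(aligned, window, step):
    """Return (window start, pi) for each full window along the alignment."""
    length = len(aligned[0])
    return [
        (start, nucleotide_diversity([s[start:start + window] for s in aligned]))
        for start in range(0, length - window + 1, step)
    ]
```

Windows whose Pi stands well above the genome-wide background are the candidates one would report as hotspot regions.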

Analysis of repeats revealed 21 repeated regions in the cpDNA of Allioideae (Table S3). Most repeats are forward, aside from a palindromic repeat found only in Allium koreanum and A. obliquum. Additionally, repeats were abundant in noncoding regions. Some repeats were found in the ycf2 and tRNA coding sequences (i.e., trnF-GAA, trnA-UGC, trnfM-CAU-trnP-UGG, trnS-GCU, and trnS-UGA). In addition to the shared repeats among Allioideae, unique repeats were found in A. koreanum, A. cepa, A. obliquum, A. cyathophorum, A. nigrum, A. senescens, and A. ursinum (Table S3).

A total of 72 regions containing simple sequence repeats (SSRs) were detected in the cpDNA of Allioideae, with lengths ranging from 10 to 20 bp (Table S4). Most SSRs are mononucleotide repeats made up of A and T nucleotides. These SSRs are located mostly in noncoding regions, except for repeats found in ycf1, ycf2, rpoC1, rpoC2, ndhF, rps16, and cemA. The number and lengths of SSRs varied among Allioideae species (Table S4).
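Because most of the reported SSRs are A/T mononucleotide runs, their detection can be illustrated with a regular expression. This is a sketch only: the source does not name the SSR tool or its thresholds, so the function `find_mono_ssrs` and the default minimum length of 10 bp (chosen to match the lower end of the reported length range) are assumptions.

```python
import re


def find_mono_ssrs(seq, min_len=10):
    """Find mononucleotide repeats of at least `min_len` bases.

    Returns a list of (0-based start, repeated base, run length).
    The 10 bp default is an assumption for this sketch.
    """
    if min_len < 2:
        raise ValueError("min_len must be at least 2")
    # One base followed by at least (min_len - 1) copies of itself.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)))
        for m in pattern.finditer(seq.upper())
    ]


if __name__ == "__main__":
    toy = "GC" + "A" * 12 + "CG" + "T" * 9 + "G"
    print(find_mono_ssrs(toy))
```

Intersecting the reported start positions with gene annotations would then separate SSRs in noncoding regions from those inside genes such as ycf1 or rpoC2.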

Phylogenetic relationships of Allioideae

Maximum Parsimony (MP) and Bayesian Inference (BI) analyses using 74 protein-coding regions of cpDNA produced trees with identical topology. The strict consensus tree obtained from the MP analysis is shown in Fig. 2 (Tree length = 20,479; Consistency index (CI) = 0.7; Retention index (RI) = 0.8; Homoplasy index (HI) = 0.3). The monophyly of Allioideae was strongly supported. Amaryllidoideae was found to be sister to Allioideae with the highest support. Within Allioideae, Allieae (clade I) was sister to the clade II consisting of the remaining tribes. Within clade II, Tulbaghieae was sister to Gilliesieae-Leucocoryneae.
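For reference, the parsimony statistics quoted with Fig. 2 follow the standard ensemble definitions sketched below. The symbols are notation introduced here, not the authors': M is the minimum possible number of character-state changes summed over all characters, S is the observed tree length, and G is the maximum possible number of changes on any tree.

```latex
\[
\mathrm{CI} = \frac{M}{S}, \qquad
\mathrm{RI} = \frac{G - S}{G - M}, \qquad
\mathrm{HI} = 1 - \mathrm{CI}
\]
```

The reported homoplasy index is consistent with this relation: 1 − 0.7 = 0.3.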

In Allium, three subclades (a-c) were recognized: “a” included A. nigrum through A. cyathophorum; “b” included A. spicatum through A. tricoccum; and “c” comprised A. cernuum through A. siculum. Although monophyly of three subgenera: Cepa, Anguinum, and Anguinum was supported, our results clearly demonstrate that two subgenera, Cyathophora and Melanocrommyum, represented by multiple species, were not monophyletic (Fig. 2).

Molecular dating analyses

Divergence time estimates for the tribes of Allioideae based on the combination of 74 coding gene sequences in the chloroplast genome are shown in Fig. 3 and Table 2. Our BEAST dating analysis resulted in estimates for the crown node of Allioideae (clade I) of 40.1 mya (95% highest posterior density [HPD] = 28.5–55.3 mya; node 1) in the Eocene. Within Allioideae, the age estimate for the crown node of Allieae in the Northern Hemisphere was 21.3 mya (95% HPD = 14.4–28.8 mya; node 2) in the early Miocene. The age estimate for the crown node of clade II, including the other tribes of Allioideae, was dated to 25.3 mya (95% HPD = 11.5–39.1 mya; node 3) in the late Oligocene. The divergence time between the Gilliesieae and Leucocoryneae was estimated at 16.5 mya (95% HPD = 5.0–28.5 mya; node 4) at the interface of the early and middle Miocene.
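The 95% HPD intervals quoted above are the narrowest intervals containing 95% of the posterior samples for each node age. A minimal Python sketch of that calculation follows; it illustrates the concept and is not the routine used by BEAST or its companion tools. The function name `hpd_interval` and the sample values in the usage lines are hypothetical.

```python
import math


def hpd_interval(samples, mass=0.95):
    """Return the narrowest interval holding `mass` of the samples.

    Sketch under the assumption of a unimodal posterior; ties are
    resolved in favor of the lowest-valued interval.
    """
    if not samples:
        raise ValueError("no samples given")
    if not 0 < mass <= 1:
        raise ValueError("mass must be in (0, 1]")
    xs = sorted(samples)
    n = len(xs)
    k = max(1, math.ceil(mass * n))  # number of samples inside the interval
    # Slide a window of k consecutive sorted samples and keep the narrowest.
    best = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    return xs[best], xs[best + k - 1]


if __name__ == "__main__":
    # Hypothetical node-age samples in mya, for illustration only.
    ages = [38.2, 41.0, 39.5, 44.7, 36.9, 40.3, 52.1, 30.4]
    print(hpd_interval(ages, mass=0.75))
```

Unlike an equal-tailed credible interval, the HPD need not be symmetric about the mean, which is why a mean of 40.1 mya can sit closer to the lower bound of 28.5–55.3 mya.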

Table 2 Posterior age distributions of major nodes of Allioideae using BEAST, with results of ancestral area reconstruction using BBM and S-DIVA analyses.

Ancestral area reconstruction

The ancestral ranges of the nodes of clades in Allioideae inferred using the Bayesian binary method (BBM) and statistical dispersal-vicariance analysis (S-DIVA) are summarised in Fig. 4 and Table 2. The BBM reconstruction suggests that Africa (C) is the most probable ancestral area of Allioideae (node 1, 82%), whereas the S-DIVA range reconstruction for this node is Eurasia + Africa (AC, 50%) or Eurasia + Africa + South America (ACD, 50%). Both methods suggest Eurasia (A) as the ancestral area for Allieae (Clade I; node 2). S-DIVA suggests Africa + South America (CD) as the most probable ancestral area for clade II (node 3), which includes the remaining tribes of Allioideae, whereas BBM indicated Africa (C) with 87% marginal probability. BBM and S-DIVA reconstructions both suggest that South America (D) is the most probable ancestral area for the node of Gilliesieae–Leucocoryneae (node 4).

Discussion

Chloroplast genome evolution in Allioideae

The newly sequenced chloroplast genomes revealed a highly conserved genome structure in terms of GC content and gene composition and order among Allioideae and related taxa (Table 1 and Fig. 1). In comparison to the GC content of Amborella trichopoda (38.34%), Nicotiana tabacum (37.85%), and Oryza sativa (39%), that of Allioideae was lower, especially in members of Allieae (generally ≤ 37.1%). However, only some representatives of over 1000 species in Allioideae were used in this study. Therefore, a larger number of Allioideae samples is needed to clarify the fluctuation of GC content, which contributes to RNA editing and stability of genome structure^23,24,25. Gene content varied among species due to pseudogenization and loss of genes in some Allioideae (i.e., Tulbaghieae, Gilliesieae, and Allieae; Table 1). The size of the chloroplast genome is affected by the reduction and expansion of IR regions and by gene loss and duplication²³. Among Allioideae species, chloroplast genome size fluctuation was caused by pseudogenization and loss of genes (Table 1). For example, the smallest cpDNA in Allioideae was found in Allium paradoxum, in which three and nine genes were lost and pseudogenized, respectively. Further examination revealed that gene loss and pseudogenization do not correspond to the recognized clades, indicating parallel evolution of these events in Allium (Table S5). For example, the loss of infA was recorded in representative species of three evolutionary lines in Allium. A similar trend was found in the pseudogenization of rps2, for which intact sequences were also recorded (Table S5). In monocots, the parallel loss or pseudogenization of genes has been reported. For instance, in Liliales, representatives of both photosynthetic and mycoheterotrophic groups show gene loss and pseudogenization of rps16, infA, and cemA²⁶. Previously, the loss of infA was surveyed in angiosperms, revealing that the loss of infA from cpDNA can be mitigated by infA in the nuclear genome²⁷. Various gene deletions have been reported in Allium (section Daghestanica)²¹. However, the mechanisms leading to these events and their outcomes have not been studied in Allioideae species. In the present study, the sequences of the lost genes were not found in the current raw NGS data, suggesting that these genes were not transferred to nuclear or mitochondrial genomes. However, to confirm the final destination of the lost genes, NGS data of nuclear and mitochondrial genomes among Allium species should be generated. Additionally, only 13 out of over 800 species of Allium were examined in the present study; therefore, further studies that cover all members of Allium should be conducted to provide a comprehensive understanding of the evolution of gene loss and pseudogenization in Allieae and related taxa.

Aside from the loss and pseudogenization of genes, which affect genome size, the expansion and contraction of IR regions resulted in differing junctions among LSC-IR-SSC regions and thus caused length variations in the cpDNA of Allioideae (Table 1). Previously, Wang et al.²⁸ described different junction types in monocot species, ranging from trnH-GUG to rpl22. The LSC-IR junctions of basal angiosperms and monocots were also reported and divided into five types²⁶. In the present study, the LSC-IR junction varied from trnH-GUG (type II, Nothoscordum bonariense) to rpl22 (type IV, most of Allium; Table 1). Notably, type III of LSC-IR junction (located in the IGS between rps19-rpl22) was found in Allium monanthum (Table 1), suggesting high variability of this boundary in Allioideae. Similar to the LSC-IR junction, the SSC-IR border feature is variable among Allioideae species, which may show overlap, adjunction, or a gap between ycf1 and ndhF as described in a previous study²⁶ (Table 1). This junction is located within ycf1 in cpDNA due to its long length. These characteristics of the LSC-IR-SSC junction have also been reported in other monocot groups^28,29, suggesting similar patterns of structural variation among the cpDNA of monocots.

Analysis of nucleotide diversity and repeats in cpDNA sequences provides useful information for identifying molecular markers, reconstructing phylogenetic relationships, and exploring population genetics in angiosperms^30,31. In this study, different SSRs were identified among Allioideae that may be useful for studies of molecular markers and population genetics of Allium in particular and Allioideae in general (Tables S3 and S4). Furthermore, eight hotspot regions of cpDNA were identified, which can be used in future studies of interspecies relationships among Allium species (Table S2). Another study on the complete plastomes of Allium revealed different genes with high nucleotide diversity (including ndhK, ndhE, ndhA, rps16, psaI, rpl22, rpl32, and trnK-UUU) in comparison with the present study¹⁵. These various findings might be caused by different taxon sampling and an insufficient number of samples among the studies. However, these results provided preliminary data on nucleotide diversity of plastomes for further studies that include all Allium taxa to identify the common hotspot regions across Allium.

Phylogenetic relationships of Allioideae

Our MP and BI analyses consistently recovered Allioideae as sister to Amaryllidoideae (Fig. 2). This result is in line with previous molecular phylogenetic studies of Amaryllidaceae^5,6. By contrast, Allioideae was found to be sister to a clade of Amaryllidoideae and Agapanthoideae inferred from data of nuclear ITS and plastid matK, ndhF, and rbcL⁷. Although Allioideae has a superior ovary and solid style (vs. inferior ovary and hollow style in Amaryllidoideae), these characteristics are homoplasious in Asparagales³². Our phylogenomic study recovered Allieae as sister to the remaining tribes of Allioideae (Fig. 2). The unique position of Allieae is also corroborated by its synapomorphic gynobasic style (vs. terminal in other tribes). Tulbaghieae, sister to Leucocoryneae-Gilliesieae, can be distinguished by the presence of a corona in the flower. Moreover, the pseudogenization of the cemA gene was only detected in Tulbaghieae. Gilliesieae and Leucocoryneae were strongly supported as sister in agreement with Sassone and Giussani². This relationship is supported by several morphological characteristics such as terminal style position and absence of corona in the flower. In addition, both tribes are distributed in South America. In particular, Gilliesieae is restricted to Chile and Patagonia in Argentina, while Leucocoryneae occurs in Argentina, Chile, Bolivia, Peru, Paraguay, Uruguay, and Brazil. Therefore, molecular phylogenetic relationships among tribes of Allioideae were supported by morphological and geographical evidence.

In the present study, Allium subg. Melanocrommyum and A. subg. Cyathophora were found to be non-monophyletic although 74 protein-coding genes were used (Fig. 2). Previous molecular phylogenetic studies of Allium revealed the non-monophyly of some subgenera^8,9,33. For example, Li et al.³³ reported paraphyly of the subgenera Anguinum, Cepa, Allium, Reticulatobulbosa, and Polyprason inferred from ITS and rps16 sequences. Similarly, the monophyly of subgenera Rhizirideum, Polyprason, and Cyathophora was not corroborated by ITS and external transcribed spacer sequences⁹. Additionally, the phylogeny of Allium based on whole plastome sequences revealed the polyphyly of subgenera Cepa and Polyprason⁸. Although different molecular datasets have yielded non-monophyletic relationships, Allium species are always placed into three distinct clades, and accordingly, the hypothesis of three evolutionary lineages was proposed^10,33. Among members of the genus Allium, the basic chromosome numbers are x = 7, 8, 9, 10, 14^7,34,35. Additionally, natural interspecific hybridization has been reported in Allium³⁵. The high chromosome diversity and hybridization in this genus might hinder a clear classification of Allium. Although 74 protein-coding genes were used in the present study, subgeneric relationships within Allium were not fully resolved. Therefore, further studies using more Allium samples and more molecular data (i.e., coding sequences in nuclear and mitochondrial genomes, and hotspot regions) should be conducted to provide a better subgeneric classification of this complex genus of Allioideae and an explanation for the three distinct groups of Allium.

Divergence time and biogeographic origins of Allioideae

Accurate estimation of divergence time in a given plant group is important for understanding its biogeographic history. However, like most plant groups, the fossil record in Allioideae is sparse. When paleontological data are lacking, molecular estimates provide the only means for inferring the age of lineages, and multiple DNA regions are used to ensure the accuracy of divergence time estimates. Here, we used 74 cpDNA coding regions to estimate the divergence times of major clades in Allioideae. Previous studies also analyzed divergence times of Allioideae and obtained different results (Table S6). Our molecular dating analysis suggests that Allioideae diverged from its sister clade in the early Eocene (mean = 47.7 mya; 95% HPD = 40.8–56.5 mya). A similar divergence time of Allioideae (41.9 mya, 95% HPD = 34.5–47.6 mya) was estimated based on 48 shared chloroplast genes among 19 monocot families⁸. The diversification of Allioideae, which resulted in the formation of two major lineages, is estimated to have occurred in the middle Eocene (40.1 mya, 95% HPD = 28.5–55.3 mya; node 1 in Fig. 3). This estimate of the crown age of Allioideae is similar to that obtained in a previous study (37.0 mya, 95% HPD = 27.8–44.5 mya)⁶. This result is also supported by the fossil genus Paleoallium, which is similar to extant Allium, recently reported from the Eocene³⁶. Thus, we believe that this is the most reliable estimate of the divergence time for Allioideae to date. However, Costa et al.⁷ presented an older divergence time of Allioideae (Table S6). In particular, Allioideae diverged in the Paleocene (63.2 mya, 95% HPD = 67.5–53.7 mya) followed by splits of Allieae (52.2 mya, 95% HPD = 58.1–44.4 mya) and Tulbaghieae and Gilliesieae (54.1 mya, 95% HPD = 65.1–37.11 mya)⁷. In comparison to the results of the current study, the older ages might be caused by a different sequence data matrix (four loci, with missing data accounting for 20.5% of the matrix) and different calibration points (fossil leaf of Amaryllidaceae)⁷.

The species in the four tribes of Allioideae are distributed discontinuously, with complete separation between the Northern and Southern Hemispheres (Allieae, Eurasia, and North America; Tulbaghieae, Africa; Gilliesieae and Leucocoryneae, South America). In contrast to Dubouzet and Shinoda³⁷, who suggested that the major lineages of Allioideae originated in the Northern Hemisphere, our biogeographic reconstructions based on BBM analysis suggest that this subfamily originated in Africa with high marginal probability, while S-DIVA suggests Eurasia + Africa or Eurasia + Africa + South America as the origin of Allioideae (Fig. 4, Table 2). The deepest branches of the topology originate in Africa, including the sister groups of subfamilies Amaryllidoideae and Agapanthoideae. Moreover, the age of the crown node of clade II (mean = 25.3 mya), which includes Tulbaghieae, Gilliesieae, and Leucocoryneae from the Southern Hemisphere, is older than that of Allieae (mean = 21.3 mya) from the Northern Hemisphere (Fig. 3). The initial diversification of Allioideae likely occurred due to climatic conditions. During the late Paleocene and early Eocene, a warming period occurred, producing a pronounced climate optimum that favored the diversification of major Allioideae lineages in Africa. The ancestor of Allioideae is believed to have originated in Africa, with the Allieae lineage then migrating towards warmer areas of the Northern Hemisphere when the global climate shifted to cooler conditions around 50–34 mya³⁸. Dispersal from Africa to Europe is common among land plants with disjunct distributions in both regions³⁹.

The ancestral range of the crown node of clade II, which includes Tulbaghieae, Gilliesieae, and Leucocoryneae, is in Africa according to our BBM analysis (Fig. 4 and Table 2). Two mechanisms have been proposed to explain the intercontinental distribution of this clade in the Southern Hemisphere, attributing it to either dispersal or vicariance (continental drift). We observed disjunct populations in Africa and South America. Our age estimate for the divergence of these two regions is 25.3 mya, followed by diversification approximately 16.5 mya and the subsequent emergence of the monophyletic Gilliesieae–Leucocoryneae lineage in South America. Thus, continental drift does not appear to have played a role in the disjunct distribution of the Allioideae species in Africa and South America, as the great southern continent of Gondwanaland is thought to have broken up in the early Cretaceous. Biological exchange between Africa and South America since the late Oligocene occurred too recently to support a vicariance explanation based on continental drift. Instead, long-distance dispersal may explain the intercontinental distribution of African and South American Allioideae species. Similar origins have been postulated for Caricaceae⁴⁰ and Canellaceae⁴¹. The latest study on the biogeography of Allioideae suggested an “Out-of-India” hypothesis for the colonization of the Northern Hemisphere by Allieae from the Indian tectonic plate⁷. However, the absence of Allieae species in India calls into question the reliability of the “Out-of-India” hypothesis, although the authors argued that aridification during the collision of India and Eurasia caused the extinction of Allium in India.

The present study provides the most detailed molecular phylogenetic and biogeographic information available to date for Allioideae and illustrates the need to investigate relationships at the tribe level more thoroughly, especially Gilliesieae–Leucocoryneae. Givnish et al.⁴² recently suggested an “out of Gondwana” origin for Liliales and emphasized the importance of vicariance in the ancient past for determining its current distribution. However, the biogeographic origin and distribution pattern of Asparagales in the Southern Hemisphere have not yet been addressed. Thus, future work should include additional sampling to establish the biogeographic history of Asparagales in Southeast Asia, India, South America, Australia, and Africa.

Conclusions

This study provides new data on the evolution of chloroplast genomes in Allioideae. Specifically, parallel events of gene loss (infA, rps16, ndhF, ndhG, and rpl22) and pseudogenization (rps2, ycf15, rps16, and matK) occurred across Allieae despite the division of Allium into evolutionary lines. The phylogeny inferred from 74 protein-coding genes supported the monophyly of the tribes of Allioideae; however, the subgeneric classification of Allium was polyphyletic, which calls for further phylogenetic studies of Allium with more samples and molecular data (e.g., single-copy genes from the nuclear and mitochondrial genomes and non-coding regions). Divergence time estimation and biogeographic analysis indicated that Allioideae originated in Africa in the Eocene and that its expansion to the Northern Hemisphere may be attributed to long-distance dispersal.

Materials and methods

Taxon sampling, DNA extraction, genome assembly, and annotation

Allioideae samples were collected from various sources (Table S7). Samples were dried with silica gel and used for extraction of total genomic DNA with a modified 2 × cetyltrimethylammonium bromide (CTAB) method⁴³. High-quality DNA samples (> 200 ng/µL) were sequenced on the MiSeq platform with the MiSeq Reagent Kit v3 following the manufacturer’s instructions (Illumina, Korea). The raw reads (2 × 300 bp paired-end reads) were trimmed to remove regions with error probabilities greater than 0.01% per base using Geneious v.7.1.9⁴⁴, and adapter sequences were removed using the “Trim Ends” function of Geneious v.7.1.9. The paired-end reads (300 bp) were assembled against the reference chloroplast genomes of Allium cepa (GenBank no. KM088013), Allium obliquum (GenBank no. NC037199), Allium sativum (GenBank no. NC031829), Allium ursinum (GenBank no. MH157875), and Allium victorialis (GenBank no. MF687749) with a minimum similarity of 95% to the reference. The reads isolated in this way were then subjected to de novo assembly in Geneious to complete the chloroplast genome sequences. The numbers of total and assembled reads and the coverage (over 15×) are summarised in Table S7. To confirm the newly completed Allium chloroplast genome sequences, NOVOPlasty was used following the manual instructions⁴⁵. Where gaps remained after assembly, specific primer pairs were designed using Primer3 and the PCR products were sequenced using the Sanger method to close the gaps⁴⁶. The newly completed chloroplast genome sequences were annotated in Geneious using the previously published Allium cpDNA sequences listed above as references. The protein-coding regions were then checked and manually adjusted so that each region began with a start codon and ended with a stop codon. The tRNA sequences were confirmed using tRNAscan-SE⁴⁷. A circular chloroplast genome map was obtained using the OGDRAW program⁴⁸.
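The start- and stop-codon check on protein-coding regions described above can be illustrated with a short sketch. This is not the Geneious workflow used in the study; the function name `check_cds`, the default start codon set (ATG only), and the toy sequences are assumptions introduced here for illustration.

```python
# Hypothetical helper: flags coding regions that would need manual adjustment.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard/plastid code


def check_cds(seq, start_codons=("ATG",)):
    """Return a list of problems found in a coding sequence (empty if none)."""
    seq = seq.upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length not a multiple of 3")
    if seq[:3] not in start_codons:
        problems.append("no start codon at 5' end")
    if seq[-3:] not in STOP_CODONS:
        problems.append("no stop codon at 3' end")
    # Codons before the final one; a stop here suggests a pseudogene or frame error.
    internal = [seq[i:i + 3] for i in range(0, len(seq) - 3, 3)]
    if any(codon in STOP_CODONS for codon in internal):
        problems.append("internal stop codon")
    return problems


print(check_cds("ATGGCTTAA"))      # clean toy CDS
print(check_cds("ATGTAAGCTTAA"))   # toy CDS with a premature stop
```

Alternative start codons reported for some plastid genes could be allowed through the `start_codons` argument; which codons to accept is a judgment left to the annotator.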

Comparative genomic analyses in Allioideae

The new complete cpDNA sequences of Allioideae species were used along with published cpDNA sequences from NCBI (including Allium cepa [GenBank no. KM088013], A. obliquum [GenBank no. NC037199], A. sativum [GenBank no. NC031829], A. ursinum [GenBank no. MH157875], and A. victorialis [GenBank no. MF687749]) for comparative analysis (Table S7). The DnaSP v5 program was used to calculate the nucleotide diversity (Pi values) of noncoding and coding cpDNA regions among Allioideae species⁴⁹. The REPuter program was used to identify repeats with a minimum length of 19 bp in the cpDNA of Allioideae⁵⁰. The Phobos program embedded in Geneious was used to identify simple sequence repeats, including mono-, di-, tri-, tetra-, penta-, and hexa-nucleotides with minimum repeat numbers of 10, 5, 4, 3, 3, and 3, respectively [http://www.rub.de/ecoevo/cm/cm_phobos.htm].
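The repeat-number thresholds for mono- to hexa-nucleotide repeats can be made concrete with a small sketch. It detects perfect repeats only and does not reproduce the Phobos algorithm; the names `MIN_REPEATS`, `is_primitive`, and `find_ssrs` and the toy sequence are assumptions introduced here.

```python
# Minimum repeat counts per motif length, as given in the text.
MIN_REPEATS = {1: 10, 2: 5, 3: 4, 4: 3, 5: 3, 6: 3}


def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq):
    """Return (start, motif, repeat_count) for perfect tandem repeats."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        i = 0
        while i + k <= len(seq):
            motif = seq[i:i + k]
            if set(motif) <= set("ACGT") and is_primitive(motif):
                n = 1
                # Extend while the next k bases repeat the motif exactly.
                while seq[i + n * k:i + (n + 1) * k] == motif:
                    n += 1
                if n >= min_rep:
                    hits.append((i, motif, n))
                    i += n * k
                    continue
            i += 1
    return sorted(hits)


toy = "CG" + "A" * 12 + "CGC" + "AT" * 6 + "GC"
print(find_ssrs(toy))  # → [(2, 'A', 12), (17, 'AT', 6)]
```

Skipping non-primitive motifs keeps a poly-A run from being reported a second time as an "AA" dinucleotide repeat.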

Phylogenetic analysis

Twenty-seven species of Amaryllidaceae were subjected to phylogenetic analysis, including Allioideae (21 species), Amaryllidoideae (5), and Agapanthoideae (1). Within Allioideae, all four tribes (Allieae [18 species], Gilliesieae [1], Leucocoryneae [1], and Tulbaghieae [1]) recognized in the most recent accounts of the subfamily were sampled. For rooting, five species of Asparagaceae, Xanthorrhoeaceae, and Iridaceae were included based on previous phylogenetic studies⁶. Taxa sampled, voucher information, and GenBank accession numbers for the cp genome data are listed in Table S4. Of the 80 coding genes in the chloroplast genome, six (rpl22, infA, ycf15, rps2, rps16, and accD) were excluded from the data matrix because of pseudogenization and loss events. Thus, the phylogenetic analyses were performed on a dataset of 74 coding genes of the cp genome. Multiple-sequence alignment was performed using MAFFT v.6⁵¹ with the default alignment parameters. Gaps were treated as missing data.
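Building the combined matrix from per-gene alignments can be sketched as below. This is an illustration under stated assumptions rather than the procedure used in the study: the function name `concatenate`, the input layout (gene → taxon → aligned sequence), and the toy sequences are hypothetical, and recoding gap characters as `?` is only one way to express that gaps are treated as missing data.

```python
def concatenate(gene_alignments, excluded=()):
    """Join per-gene alignments into one matrix, skipping excluded genes.

    gene_alignments: dict mapping gene name -> {taxon: aligned sequence}.
    Gaps ('-') and taxa absent from a gene are written as '?' (missing data).
    """
    genes = [g for g in sorted(gene_alignments) if g not in excluded]
    taxa = sorted({t for g in genes for t in gene_alignments[g]})
    matrix = {t: "" for t in taxa}
    for g in genes:
        aln = gene_alignments[g]
        length = len(next(iter(aln.values())))
        if any(len(s) != length for s in aln.values()):
            raise ValueError(f"{g}: sequences are not aligned to equal length")
        for t in taxa:
            matrix[t] += aln.get(t, "?" * length).replace("-", "?")
    return matrix


# Toy data; rps2 stands in for one of the six excluded genes.
toy = {
    "matK": {"sp1": "ATG-", "sp2": "ATGC"},
    "rbcL": {"sp1": "GG", "sp2": "GA"},
    "rps2": {"sp1": "TTT"},
}
print(concatenate(toy, excluded={"rps2"}))
```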

Phylogenetic reconstructions based on the combined sequences of 74 coding genes were performed using the maximum parsimony (MP) method in the program PAUP* 4.0b10⁵². All characters and character states were weighted equally and unordered. The most parsimonious trees were identified with a heuristic search comprising tree bisection-reconnection (TBR) branch swapping, the MULPARS function, and the alternative character state. Bootstrap analyses (1000 pseudoreplicates) were conducted to examine the relative level of support (BP) for individual clades on each of the resulting cladograms.
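A bootstrap pseudoreplicate is obtained by resampling alignment columns with replacement; the tree search is then repeated on each resampled matrix, and BP is the percentage of pseudoreplicates that recover a clade. The sketch below shows only the resampling step and is not PAUP* code; the function name `bootstrap_replicate` and the toy alignment are assumptions.

```python
import random


def bootstrap_replicate(alignment, rng):
    """Resample alignment columns with replacement (one pseudoreplicate)."""
    n_sites = len(next(iter(alignment.values())))
    columns = [rng.randrange(n_sites) for _ in range(n_sites)]
    return {taxon: "".join(seq[c] for c in columns)
            for taxon, seq in alignment.items()}


rng = random.Random(0)  # fixed seed so the sketch is reproducible
toy = {"sp1": "ACGTAC", "sp2": "ACGAAC"}
replicates = [bootstrap_replicate(toy, rng) for _ in range(1000)]
print(len(replicates), replicates[0])
```

The same column indices are applied to every taxon, so each resampled site remains a homologous column of the original matrix.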

Phylogenetic analysis of the combined cpDNA dataset was also conducted using Bayesian inference (BI) in MrBayes v.3.12⁵³. Under the Akaike information criterion, jModelTest v.2.1.7⁵⁴ selected the GTR + I + Γ model of molecular evolution for the combined dataset. Four MCMC chains were run simultaneously and sampled every 1000 generations for a total of 20 million generations. We plotted the log-likelihood scores of the sample points against generation time using Tracer v.1.5 and checked whether the log-likelihood values had reached a stable equilibrium; this confirmed that stationarity was achieved after the first 2 million generations. In addition, we used the AWTY graphical system⁵⁵ to compare split frequencies among runs and plot the cumulative split frequencies to ensure that stationarity was reached. The first 1000 (10%) sample trees from each run were discarded as burn-in, as determined using Tracer v.1.5. A maximum a posteriori tree was constructed by summarising the remaining trees from parallel runs into a majority-rule consensus tree, yielding posterior probability (PP) values for each clade.

Molecular dating analysis

To estimate the divergence times of tribes in Allioideae, we used BEAST v.1.8⁵⁶ based on 74 cpDNA coding regions. The BEAUti interface was used to generate input files for BEAST, in which the GTR + I + Γ model, the Yule speciation tree prior, and the uncorrelated lognormal molecular clock model were applied. Two runs of 200 million generations were set for the MCMC chains, sampling every 1000 generations. Convergence to the stationary distribution was checked by visual inspection of the plotted posterior estimates using Tracer v.1.6. After discarding the first 20,000 (10%) trees as burn-in, the samples were summarised in a maximum clade credibility tree in TreeAnnotator v.1.6.1 using a PP limit of 0.50 and summarising the mean node heights. The mean and 95% HPD of each age estimate were obtained from the combined outputs using Tracer. The results were visualized using FigTree v.1.4.2 [http://tree.bio.ed.ac.uk/software/figtree/].
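The sampling figures above follow from simple arithmetic, sketched here. The function name `mcmc_samples` is hypothetical, the initial state logged at generation 0 is ignored, and applying the burn-in to each of the two runs separately is an assumption.

```python
def mcmc_samples(generations, sample_every, burnin_percent):
    """Return (total samples, burn-in samples, retained samples) for one run."""
    total = generations // sample_every
    burnin = total * burnin_percent // 100
    return total, burnin, total - burnin


total, burnin, kept = mcmc_samples(200_000_000, 1000, 10)
print(total, burnin, kept)   # → 200000 20000 180000
print(2 * kept)              # retained trees over two runs → 360000
```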

Age calibration was constrained to the phylogeny of Allioideae and its close relatives. The crown node (C1 in Fig. 3) of Yucca-Hosta was constrained with a uniform distribution from 20.7 to 37.5 mya following McKain et al.⁵⁷, who estimated the divergence time of Agavoideae using 69 cpDNA coding genes. Three further calibrations were applied, each as a uniform distribution: from 50.0 to 67.4 mya for the stem group of Amaryllidaceae (C2); from 42.0 to 61.7 mya for the crown group of Amaryllidaceae (C3); and from 38.1 to 56.5 mya for the stem node of Allioideae (C4).

Ancestral area reconstruction

Biogeographic data for species within Allioideae were compiled from their distributions described in the literature and herbarium specimens. The distribution range of Allioideae species and outgroups was divided into five areas: (A) Eurasia, (B) North America, (C) Africa, (D) South America, and (E) Australia. We coded each species based on its entire range, regardless of the geographic source of the sample. Ancestral area reconstruction and estimation of spatial patterns of geographic diversification within Allioideae were inferred using the BBM and S-DIVA methods as implemented in RASP v.2.1b (Reconstruct Ancestral State in Phylogenies, formerly S-DIVA)⁵⁸. The BBM was run using the fixed state frequencies model (Jukes-Cantor) with equal among-site rate variation over two million generations, 10 chains each, and two parallel runs. In S-DIVA, the frequencies of ancestral ranges at a given node in ancestral reconstructions are averaged over all trees. For these analyses, we used all post burn-in trees obtained from the BEAST analysis. The consensus tree used to map the ancestral distribution of each node was obtained from the stored trees using the Compute Condense option in RASP. The maximum number of ancestral areas was set to five.
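With five areas and a maximum of five areas per ancestral range, every non-empty combination of areas is a candidate range. The sketch below only enumerates that state space and does not reproduce how RASP handles it internally; the names `AREAS` and `candidate_ranges` are assumptions.

```python
from itertools import combinations

# Area codes as defined in the text.
AREAS = {
    "A": "Eurasia",
    "B": "North America",
    "C": "Africa",
    "D": "South America",
    "E": "Australia",
}


def candidate_ranges(areas, max_areas):
    """List every range made of 1..max_areas area codes, e.g. 'C' or 'CD'."""
    codes = sorted(areas)
    return ["".join(combo)
            for size in range(1, max_areas + 1)
            for combo in combinations(codes, size)]


ranges = candidate_ranges(AREAS, max_areas=5)
print(len(ranges))     # → 31, i.e. 2**5 - 1
print("CD" in ranges)  # Africa + South America → True
```

Lowering `max_areas` shrinks the state space; with a limit of two areas, for example, only 15 ranges remain.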

References

The Angiosperm Phylogeny Group et al. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181, 1–20 (2016).
Sassone, A. B. & Giussani, L. M. Reconstructing the phylogenetic history of the tribe Leucocoryneae (Allioideae): reticulate evolution and diversification in South America. Mol. Phylogenet. Evol. 127, 437–448 (2018).
Kubitzki, K., Rohwer, J. G. & Bittrich, V. The Families and Genera of Vascular Plants (Springer, Berlin, 1990).
Meerow, A. W. et al. Systematics of Amaryllidaceae based on cladistic analysis of plastid sequence data. Am. J. Bot. 86, 1325–1345 (1999).
Steele, P. R. et al. Quality and quantity of data recovered from massively parallel sequencing: Examples in Asparagales and Poaceae. Am. J. Bot. 99, 330–348 (2012).
Chen, S., Kim, D.-K., Chase, M. W. & Kim, J.-H. Networks in a large-scale phylogenetic analysis: reconstructing evolutionary history of Asparagales (Lilianae) based on four plastid genes. PLoS ONE 8, e59472 (2013).
Costa, L. et al. Divide to conquer: evolutionary history of Allioideae Tribes (Amaryllidaceae) is linked to distinct trends of karyotype evolution. Front. Plant Sci. 11, 1–15 (2020).
Xie, D. F. et al. Insights into phylogeny, age and evolution of Allium (Amaryllidaceae) based on the whole plastome sequences. Ann. Bot. 125, 1039–1055 (2020).
Nguyen, N. H., Driscoll, H. E. & Specht, C. D. A molecular phylogeny of the wild onions (Allium; Alliaceae) with a focus on the western North American center of diversity. Mol. Phylogenet. Evol. 47, 1157–1172 (2008).
Friesen, N., Fritsch, R. & Blattner, F. Phylogeny and new intrageneric classification of Allium (Alliaceae) based on nuclear ribosomal DNA ITS sequences. Aliso 22, 372–395 (2006).
Souza, G., Crosa, O., Speranza, P. & Guerra, M. Phylogenetic relations in tribe Leucocoryneae (Amaryllidaceae, Allioideae) and the validation of Zoellnerallium based on DNA sequences and cytomolecular data. Bot. J. Linn. Soc. 182, 811–824 (2016).
Cox, C. B., Moore, P. D. & Ladle, R. Biogeography: An Ecological and Evolutionary Approach (Wiley-Blackwell, New York, 2016).
Kim, C., Kim, S.-C. & Kim, J.-H. Historical biogeography of Melanthiaceae: a case of out-of-North America through the Bering land bridge. Front. Plant Sci. 10, 396 (2019).
Morley, R. J. Interplate dispersal paths for megathermal angiosperms. Perspect. Plant Ecol. Evol. Syst. 6, 5–20 (2003).
Nie, Z.-L. et al. Evolution of the intercontinental disjunctions in six continents in the Ampelopsis clade of the grape family (Vitaceae). BMC Evol. Biol. 12, 17 (2012).
McLoughlin, S. The breakup history of Gondwana and its impact on pre-Cenozoic floristic provincialism. Aust. J. Bot. 49, 271 (2001).
Li, Q.-Q., Zhou, S.-D., Huang, D.-Q., He, X.-J. & Wei, X.-Q. Molecular phylogeny, divergence time estimates and historical biogeography within one of the world’s largest monocot genera. AoB Plants 8, plw41 (2016).
Choi, J. W. et al. Organelle inheritance and genome architecture variation in isogamous brown algae. Sci. Rep. 10, 2048 (2020).
Crosby, K. & Smith, D. R. Does the mode of plastid inheritance influence plastid genome architecture? PLoS ONE 7, e46260 (2012).
Givnish, T. J. et al. Orchid historical biogeography, diversification, Antarctica and the paradox of orchid dispersal. J. Biogeogr. 43, 1905–1916 (2016).
Xie, D.-F. et al. Phylogeny of Chinese Allium species in section Daghestanica and adaptive evolution of Allium (Amaryllidaceae, Allioideae) species revealed by the chloroplast complete genome. Front. Plant Sci. https://doi.org/10.3389/fpls.2019.00460 (2019).
Huo, Y. et al. Complete chloroplast genome sequences of four Allium species: comparative and phylogenetic analyses. Sci. Rep. 9, 12250 (2019).
Xiao-Ming, Z. et al. Inferring the evolutionary mechanism of the chloroplast genome size by comparing whole-chloroplast genome sequences in seed plants. Sci. Rep. 7, 1555 (2017).
Smith, D. R. Unparalleled GC content in the plastid DNA of Selaginella. Plant Mol. Biol. 71, 627–639 (2009).
Ravi, V., Khurana, J. P., Tyagi, A. K. & Khurana, P. An update on chloroplast genomes. Plant Syst. Evol. 271, 101–122 (2008).
Do, H. D. K., Kim, C., Chase, M. W. & Kim, J. Implications of plastome evolution in the true lilies (monocot order Liliales). Mol. Phylogenet. Evol. 148, 106818 (2020).
Millen, R. S. et al. Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. Plant Cell 13, 645–658 (2001).
Wang, R.-J. et al. Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol. Biol. 8, 36 (2008).
Dong, W.-L. et al. Molecular evolution of chloroplast genomes of orchid species: insights into phylogenetic relationship and adaptive evolution. Int. J. Mol. Sci. 19, 716 (2018).
Wang, X. et al. The USDA cucumber (Cucumis sativus L.) collection: genetic diversity, population structure, genome-wide association studies, and core collection development. Hortic. Res. 5, 64 (2018).
Zhou, L. et al. Developing single nucleotide polymorphism markers for the identification of pineapple (Ananas comosus) germplasm. Hortic. Res. 2, 15056 (2015).
Pires, C. et al. Phylogeny, genome size, and chromosome evolution of Asparagales. Aliso 22, 287–304 (2006).
Li, Q.-Q. et al. Phylogeny and biogeography of Allium (Amaryllidaceae: Allieae) based on nuclear ribosomal internal transcribed spacer and chloroplast rps16 sequences, focusing on the inclusion of species endemic to China. Ann. Bot. 106, 709–733 (2010).
Peruzzi, L., Carta, A. & Altinordu, F. Chromosome diversity and evolution in Allium (Allioideae, Amaryllidaceae). Plant Biosyst. Int. J. Deal Asp. Plant Biol. 151, 212–220 (2017).
Smirnov, S., Skaptsov, M., Shmakov, A., Fritsch, R. M. & Friesen, N. Spontaneous hybridization among Allium tulipifolium and A. robustum (Allium subg. Melanocrommyum, Amaryllidaceae) under cultivation. Phytotaxa 303, 155 (2017).
Pigg, K. B., Bryan, F. A. & DeVore, M. L. Paleoallium billgenseli gen. et sp. nov.: Fossil Monocot Remains from the Latest Early Eocene Republic Flora, Northeastern Washington State, USA. Int. J. Plant Sci. 179, 477–486 (2018).
Dubouzet, J. G. & Shinoda, K. Relationships among Old and New World Alliums according to ITS DNA sequence analysis. Theor. Appl. Genet. 98, 422–433 (1999).
Zachos, J. Trends, rhythms, and aberrations in global climate 65 Ma to Present. Science 292, 686–693 (2001).
Désamoré, A. et al. Out of Africa: north-westwards Pleistocene expansions of the heather Erica arborea. J. Biogeogr. 38, 164–176 (2011).
Antunes Carvalho, F. & Renner, S. S. A dated phylogeny of the papaya family (Caricaceae) reveals the crop’s closest relatives and the family’s biogeographic history. Mol. Phylogenet. Evol. 65, 46–53 (2012).
Müller, S. et al. Intercontinental long-distance dispersal of Canellaceae from the New to the Old World revealed by a nuclear single copy gene and chloroplast loci. Mol. Phylogenet. Evol. 84, 205–219 (2015).
Givnish, T. J. et al. Phylogenomics and historical biogeography of the monocot order Liliales: out of Australia and through Antarctica. Cladistics 32, 581–605 (2016).
Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 19, 11–15 (1987).
Kearse, M. et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. https://doi.org/10.1093/nar/gkw955 (2016).
Untergasser, A. et al. Primer3—new capabilities and interfaces. Nucleic Acids Res. 40, e115–e115 (2012).
Schattner, P., Brooks, A. N. & Lowe, T. M. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 33, W686–W689 (2005).
Greiner, S., Lehwark, P. & Bock, R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 47, W59–W64 (2019).
Librado, P. & Rozas, J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25, 1451–1452 (2009).
Kurtz, S. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29, 4633–4642 (2001).
Katoh, K. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066 (2002).
Cummings, M. P. PAUP* [Phylogenetic Analysis Using Parsimony (and Other Methods)]. in Dictionary of Bioinformatics and Computational Biology (Wiley, 2004). https://doi.org/10.1002/0471650129.dob0522.
Ronquist, F. et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542 (2012).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772–772 (2012).
Nylander, J. A. A., Olsson, U., Alström, P. & Sanmartín, I. Accounting for phylogenetic uncertainty in biogeography: a Bayesian approach to dispersal-vicariance analysis of the thrushes (Aves: Turdus). Syst. Biol. 57, 257–268 (2008).
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 7, 214 (2007).
McKain, M. R. et al. A phylogenomic assessment of ancient polyploidy and genome evolution across the Poales. Genome Biol. Evol. https://doi.org/10.1093/gbe/evw060 (2016).
Yu, Y., Harris, A. J., Blair, C. & He, X. RASP (Reconstruct Ancestral State in Phylogenies): a tool for historical biogeography. Mol. Phylogenet. Evol. 87, 46–49 (2015).

Acknowledgments

This work was supported by the Scientific Research (KNA 1-1-13, 14-1) of Korea National Arboretum and the National Research Foundation (NRF-2017R1D1A1B06029326).

Author information

These authors contributed equally: Ju Namgung and Hoang Dang Khoa Do.

Authors and Affiliations

Department of Life Science, Gachon University, Seongnam, 13120, Republic of Korea
Ju Namgung, Hoang Dang Khoa Do, Changkyun Kim & Joo‑Hwan Kim
Nguyen Tat Thanh Hi-Tech Institute, Nguyen Tat Thanh University, Ho Chi Minh City, Vietnam
Hoang Dang Khoa Do
Department of Biology and Chemistry, Changwon National University, Gyeongsangnamdo, 51140, Republic of Korea
Hyeok Jae Choi

Contributions

J.-H.K. and H.J.C. conceived the experiments; J.N., C.K., and H.D.K.D. conducted the experiments, analyzed the data, and wrote the draft manuscript; J.-H.K. and H.J.C. revised the draft manuscript. All authors agreed to the final form of this manuscript.

Corresponding author

Correspondence to Joo‑Hwan Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Namgung, J., Do, H.D.K., Kim, C. et al. Complete chloroplast genomes shed light on phylogenetic relationships, divergence time, and biogeography of Allioideae (Amaryllidaceae). Sci Rep 11, 3262 (2021). https://doi.org/10.1038/s41598-021-82692-5

Received: 26 May 2020
Accepted: 18 January 2021
Published: 05 February 2021
DOI: https://doi.org/10.1038/s41598-021-82692-5
