Buxus and Tetracentron genomes help resolve eudicot genome history

Chanderbali, Andre S.; Jin, Lingling; Xu, Qiaoji; Zhang, Yue; Zhang, Jingbo; Jian, Shuguang; Carroll, Emily; Sankoff, David; Albert, Victor A.; Howarth, Dianella G.; Soltis, Douglas E.; Soltis, Pamela S.

doi:10.1038/s41467-022-28312-w

Download PDF

Article
Open access
Published: 02 February 2022

Buxus and Tetracentron genomes help resolve eudicot genome history

Nature Communications volume 13, Article number: 643 (2022) Cite this article

6945 Accesses
20 Citations
28 Altmetric
Metrics details

Subjects

Abstract

Ancient whole-genome duplications (WGDs) characterize many large angiosperm lineages, including angiosperms themselves. Prominently, the core eudicot lineage accommodates 70% of all angiosperms and shares ancestral hexaploidy, termed gamma. Gamma arose via two WGDs that occurred early in eudicot history; however, the relative timing of these is unclear, largely due to the lack of high-quality genomes among early-diverging eudicots. Here, we provide complete genomes for Buxus sinica (Buxales) and Tetracentron sinense (Trochodendrales), representing the lineages most closely related to core eudicots. We show that Buxus and Tetracentron are both characterized by independent WGDs, resolve relationships among early-diverging eudicots and their respective genomes, and use the RACCROCHE pipeline to reconstruct ancestral genome structure at three key phylogenetic nodes of eudicot diversification. Our reconstructions indicate genome structure remained relatively stable during early eudicot diversification, and reject hypotheses of gamma arising via inter-lineage hybridization between ancestral eudicot lineages, involving, instead, only stem lineage core eudicot ancestors.

Nested whole-genome duplications coincide with diversification and high morphological disparity in Brassicaceae

Article Open access 30 July 2020

Genome structure-based Juglandaceae phylogenies contradict alignment-based phylogenies and substitution rates vary with DNA repair genes

Article Open access 04 February 2023

Prickly waterlily and rigid hornwort genomes shed light on early angiosperm evolution

Article Open access 24 February 2020

Introduction

Flowering plants (angiosperms), with nearly 400,000 species and a fossil record that dates to the Early Cretaceous, have a complex evolutionary history marked by early and rapid lineage divergences^1,2,3. Whole-genome duplication (WGD) events have also been frequent in angiosperms, and indeed all extant species are ancient polyploids descended from a common ancestor that experienced at least one WGD^4,5. Subsequent polyploidy events have been identified throughout angiosperm phylogeny, often coinciding closely with the origin and/or radiation of major clades^6,7,8,9,10. Notably, the core eudicots (Gunneridae¹¹), nested in the eudicot clade, descend from an ancient hexaploid formation, termed gamma^12,13,14,15, and account for ~70% of extant angiosperm species. Moreover, a novel suite of floral features, ‘whorled pentamery’ with flower parts arranged in concentric whorls of five^16,17,18, evolved shortly after the origin of the core eudicots^11,19 and could be genetically linked to this ancient hexaploidy event, e.g., through multiplications or rearrangements of floral transcriptional regulators¹⁵. Such a causal relationship between gamma and whorled pentamery, although still speculative, is consistent with the widely acknowledged role of gene and genome duplications providing the genetic raw material for evolutionary innovation^9,20.

The phylogenetic timing and mechanism of gamma hexaploidy are currently unresolved. Hypotheses on the topic mostly envision a two-step process, in which the product of an initial WGD fused with a third genome in a second polyploidization, possibly via a wide cross after an extended period of random fractionation (loss of either copy of duplicated genomic regions following WGD) in the tetraploid intermediate²¹. The breadth of this putative wide cross is also unclear and possibly includes extant early-diverging eudicot lineages^13,15,22. Alternatively, one of the gamma subgenomes may have been more resistant to fractionation, and all three subgenomes may have been joined rapidly in evolutionary time²¹, perhaps in an autohexaploidy event²³. It has also been argued that gamma hexaploidy derives from an initial tetraploidy shared by all eudicots^24,25. Further still, the lack of clear evidence of gamma outside of the core eudicots may be due to stochastic gene loss over more than 100 million years of independent evolution²³. Efforts to evaluate evolutionary scenarios of gamma origins have been hampered by limited data and unsettled sister-group relationships to the core eudicots. Plastome sequence data support either Buxales^19,26,27 or Trochodendrales^28,29 as immediate sisters to the core eudicots, while single-copy nuclear (SCN) genes from transcriptome data sets have recovered a Buxales+Trochodendrales clade placed sister to the core eudicots^15,30. Thus, despite considerable research interest, the timing and mechanism of gamma formation have remained unresolved.

We here provide genome assemblies for Buxus sinica (Buxales) and Tetracentron sinense (Trochodendrales), which represent, either individually or collectively, the sister lineage of core eudicots¹⁵. These two genome assemblies complement those available for other early-diverging eudicot lineages^22,31,32,33 and permit evaluations of eudicot phylogeny and gamma origins based on phylogenomics, molecular evolution, and synteny. In addition, we employ the RACCROCHE³⁴ pipeline of algorithms to infer the ancestral genomes at three sequential nodes of the eudicot radiation.

Results and discussion

Genome assembly, annotation, and structure

Chromosome-scale nuclear genome assemblies for Buxus and Tetracentron were produced from PacBio long-read contigs assembled with the FALCON/FALCON-unzip pipeline³⁵ and scaffolded by Hi-C technology³⁶ (Fig. 1; Supplementary Data 1). The Buxus assembly totals 764 Mb (90% of the estimated genome size of 850 Mb), with 7180 contigs (N50 = 164 kb) in 63 scaffolds (N50 = 56 Mb), of which 14 contain 763 Mb (99.8%) of the assembly. The Tetracentron assembly totals 908 Mb (93% of the estimated genome size of 975 Mb), with 6178 contigs (N50 = 238 kb) in 662 scaffolds (N50 = 54 Mb), of which 19 contain 856 Mb (94.5%) of the assembly. The largest 14 and 19 scaffolds of the Buxus and Tetracentron assemblies, respectively, correspond with the known chromosome numbers of these taxa^37,38. Benchmarking Universal Single-Copy Orthologs (BUSCO) analyses^39,40 estimate 96.3% and 93.5% completeness for the Buxus and Tetracentron genomes, respectively (Supplementary Data 2). Transposable elements and other repeat sequences account for 76.4% and 78.5% of the Buxus and Tetracentron assemblies, respectively (Supplementary Data 3). In Buxus, LTR retrotransposons (26.8%), followed by LINEs (4.9%) and DNA transposable elements (2.8%), are most abundant, with Ty3/Gypsy and Ty1/Copia retrotransposons accounting for 87.2% and 13.0% of the LTRs, respectively. LTRs (27.4%), LINES (4.6%), and DNA transposable elements (2.9%) account for most of the Tetracentron repeats, with Ty3-Gypsy (62.6%) and Ty1/Copia (36.6%) retrotransposons best represented among the LTRs. Annotation of the repeat-masked assemblies yielded 27,027 and 30,704 protein-coding gene models, including 86.9% and 80.5% of the BUSCO genes, in Buxus and Tetracentron, respectively (Supplementary Data 2). Our Tetracentron assembly is similar to one produced for another individual of this species³³ in terms of BUSCO statistics and annotation metrics, but differs in size (908 vs 1170 Mb) and the number of chromosome-size scaffolds (19 vs 24). We are unable to account for these differences, but our assembly closely matches the genome size measured by flow cytometry, and the only reported chromosome count of n = 24⁴¹ for Tetracentron has been discredited³⁷.

**Fig. 1: Habit and genome assembly features of *Buxus* and *Tetracentron*.**

Analyses of synonymous changes per synonymous site (Ks) and intragenomic synteny indicate that Buxus and Tetracentron are both paleopolyploids, with one and two rounds of WGDs in their respective evolutionary histories. Buxus syntenic paralogs (paleologs) constitute extensive blocks of colinear genome sequence across pairs of chromosomes and are characterized by Ks values close to 1.0 (Fig. 1c). Ks values for Tetracentron paleologs are concentrated near Ks = 0.5, but colinear genome sequences are distributed among four chromosomes (Fig. 1d), together suggesting two WGDs in close succession. The two Buxus subgenomes are highly conserved, with synteny blocks that often extend across much of the whole chromosomes, while the four subgenomes of Tetracentron appear to be highly rearranged at the chromosomal level (Fig. 1). The extent to which this structure reflects genome reshuffling, which is a prominent mechanism of post-polyploid diploidization (PPD) after WGDs⁴², or artifacts of genome assembly, is unclear. In favor of PPD processes, the Tetracentron genome is appreciably downsized compared to its sister species, and the only other living member of Trochodendraceae, Trochodendron aralioides (0.9 versus 1.6 GB), which shares two WGDs with Tetracentron³³ but exhibits more extensive blocks of inter-chromosomal synteny (Supplementary Fig. 1).

Phylogenetic positions of Buxus and Tetracentron

To reconstruct the branching sequence of the early eudicot radiation, we analyzed phylogenetic data sets for representative angiosperms composed of hundreds of BUSCO genes⁴³, the Angiosperms353 loci⁴⁴, and orthogroups identified de novo by the Orthofinder pipeline⁴⁵. Coalescence-based analyses of all three data sets place Ranunculales as sister to all other living eudicot lineages, with Proteales (including Sabiaceae) diverging next, and a Buxales+Trochodendrales clade as sister to the core eudicot clade (Fig. 2a; left panel, Supplementary Figs. 2a and 3). Concatenated data sets of the SCN loci, whether analyzed in Maximum Likelihood (Fig. 2a, right panel, Supplementary Fig. 2b) or Bayesian Inference (Supplementary Fig. 4) frameworks, recover Buxales alone as the core eudicot sister group, with Trochodendrales as sister to this Buxales+core eudicot clade. Although this branching sequence receives maximal statistical support in both Maximum Likelihood (bootstrap) and Bayesian Inference (posterior probability) analyses, incomplete lineage sorting (ILS) is a potential confounding factor in phylogenetic analyses of concatenated data sets in the face of rapid radiations⁴⁶, as is the case for the eudicots. Indeed, the quartet-support values associated with the Buxales+Trochodendrales clade in the coalescence tree indicate considerable gene tree discordance with respect to the positions of these taxa. Further exploration of conflicts affecting the eudicot clade, visualized as a cloudogram of gene trees (Fig. 2b), however, reveals that ~30% of the gene trees support the Buxales+Trochodendrales clade, while only ~18% support either Buxales or Trochodendrales as the core eudicot sister group (Supplementary Data 4). We also estimated the branching sequence of early-diverging eudicots using the ‘Trees in the Peaks’ method, which reconstructs speciation and polyploidization events from Ks and similarity score distributions of syntenic homologs^47,48 (Fig. 2c). This method, which requires that ancestral Ks and similarity scores and/or their ranges must precede (greater Ks or lower similarity) or overlap those in the descendants, was applied to evaluate each of all possible binary rooted phylogenies. The only branching sequence that satisfies these conditions is one in which Buxales and Trochodendrales are collectively sister to the core eudicots. Specifically, the peak Ks value of syntenic orthologs that diverged via the Buxus/Tetracentron speciation is younger than those derived from the phylogenetic divergence of Vitis (a core eudicot) from Buxus or from Tetracentron.

**Fig. 2: Phylogenetic relations of *Buxus* and *Tetracentron*.**

Phylogenomics of eudicot subgenomes

Synteny-guided phylogenomic analyses of eudicot subgenomes were conducted to assess the several hypothesized scenarios for the origin of gamma hexaploidy (Fig. 3). Pairwise analyses of inter-genomic collinearity (macrosynteny) and fractionation patterns identify extensive regions of early-diverging eudicot genomes shared with the gamma-derived hexaploid genome of Vitis, and each other (Supplementary Figs. 5–9). The ratios of syntenic depths (the number of times a genomic region is syntenic to regions in another genome) in these comparisons reflect the number of subgenomes, or level of ploidy, for the respective species. Thus, we see 2:3 syntenic depth between Buxus and Vitis, and 4:3 syntenic depth between Tetracentron and Vitis, while Tetracentron to Buxus is 4:2 in syntenic depth. Likewise, as previously reported, Aquilegia and Nelumbo each exhibit 2:3 syntenic depth with Vitis, and 2:2 with each other. Collectively, these macrosyntenic alignments approximate the modern distribution of the seven ancestral eudicot chromosomes (Fig. 3a, Supplementary Data 5, and see below), the evolutionary histories of which we have estimated through phylogenetic analyses of 1932 gene trees populated with 15872 genes (Fig. 3b). For example, syntenic blocks descended from ancestral chromosome 4 (purple tracks in Fig. 3a) occupy regions of Vitis chromosomes 5, 7, and 14, as well as portions of chromosomes 2 and 5 of Aquilegia, 1 and 2 of Nelumbo, 5 and 6 of Buxus, and 1, 2, 3, and 18 of Tetracentron. Microsynteny (gene level) alignments within these major synteny blocks comprise 235 homologous loci and a total of 1837 syntelogs (genes derived from the same ancestral genomic region) useful for inferring the evolutionary history of ancestral chromosome 4 (see Supplementary Data 5 for the modern distribution and representation of each ancestral chromosome). The coalescent solution of phylogenetic trees for these 235 loci shows that duplicated blocks of ancestral chromosome 4 now present in Aquilegia, Nelumbo, Buxus, and Tetracentron constitute lineage-specific clades (Fig. 3b), indicating that ancestral chromosome 4 was duplicated independently in each of the respective stem lineages of these four modern genomes. Indeed, the duplicated blocks of all seven ancestral chromosomes in Aquilegia, Nelumbo, Buxus, and Tetracentron constitute lineage-specific groupings (Fig. 3b), providing consensus that their respective WGDs are independent events and, importantly, exclusively involved genome donors that belonged to their respective clades, i.e., their stem lineage ancestors.

Phylogenetic alliances of the seven ancestral chromosomes occupying the modern, gamma-derived, Vitis genome are less clear. Of the three copies of ancestral chromosome 4, the syntenic blocks preserved on Vitis chromosome 5 and 7 form a well-supported sister group, but the block on Vitis chromosome 14 is placed as an earlier branch, albeit with low support. Vitis-specific clades were also not recovered for ancestral chromosomes 1 and 3, although again without high statistical support for non-monophyly. However, triplicated copies of ancestral chromosomes 2, 5, 6, and 7 in the Vitis genome group together as each other’s closest relatives. Although clade support is strong only for the copies of ancestral chromosome 7 currently preserved on Vitis chromosomes 6, 8, and 13, the phylogenies of these four sets of genomic regions suggest they uniquely share a common ancestor, one that evolved separately from the other, earlier-diverged, eudicot lineages. Altogether, we recover Vitis-specific groupings for duplicates of four of the ancestral eudicot chromosomes, albeit as a well-supported clade only once. The relationships of the other three ancestral chromosomes may best be described as phylogenetically unresolved. Importantly, these findings are inconsistent with evolutionary scenarios of gamma formation through an extremely wide cross between a core eudicot and an early-diverging eudicot lineage, as has been previously proposed²². An initial tetraploidy event in the common ancestor of the eudicots²⁴ is also inconsistent with our finding that paralogous genomic blocks in Aquilegia, and all other basal eudicots, constitute lineage-specific clades. The only evolutionary scenario consistent with our analyses is one in which gamma hexaploidy exclusively involved stem lineage ancestors of extant core eudicot species as genome donors. As such, if hexaploidy was attained via a two-step process of sequential WGDs, the third of the gamma genomes must have been donated from a now extinct lineage that branched off the core eudicot ancestral line before the initial tetraploidy event (Fig. 3c).

Ancestral genomes

The independence of each of the WGD events associated with each of the early-diverging eudicot lineages implies unduplicated ancestral genomes leading all the way from the ancestral angiosperm up to gamma and the core eudicots. We explore this key inference through ancestral genome reconstruction. We reconstructed ancestral genomes at three nodes of the eudicot phylogeny (Fig. 4a): the common ancestor of the core eudicot clade (ancestor 3), two sequentially older nodes ancestral also to Buxus and Tetracentron (ancestor 2), and Nelumbo (ancestor 1).

All three of these ancestral genomes are reconstructed as seven putative protochromosomes, each with between 700 and 1600 protogenes, totaling more than 8000 protogenes, arranged in their ancestral order (Fig. 4b). Our ancestral genome reconstructions include ~2000 more (ca. 25%) ordered protogenes than previous reconstructions of an ancestral eudicot genome⁴⁹. To understand the early evolution of eudicot genome structure, we partitioned the modern eudicot chromosomes into sets of syntenic regions and painted each of these according to its corresponding protochromosomes (Fig. 4c; Supplementary Fig. 10). These projections relate modern eudicot genomes to successive ancestral precursors and provide insights into the relative timing of any structural changes during eudicot genome evolution. Projections of the three ancestral genome reconstructions onto Vitis chromosomes (Fig. 4c) are globally similar, indicating genome structure remained relatively stable during early eudicot diversification. Inconsistent with the hypothesis of one ancestral eudicot tetraploidy²⁴, these projections indicate that fusion of the two ancestral chromosomes now combined in Vitis chromosome 7 and Aquilegia chromosome 5 (juxtaposed purple and green blocks in Fig. 4c and Supplementary Fig. 10, respectively) did not occur prior to the origin of the eudicot ancestor. Were this the case, both sections of these Vitis and Aquilegia chromosomes would be painted with a common color representing one ancestral chromosome whose ‘chimeric’ origin would be invisible to our methods. Instead, these, and other, chromosomal fusions appear to be independent, lineage-specific events that post-date ancestral genome arrangements. Several other genomic rearrangements, as measured by the ‘choppiness’ of chromosomal paintings (Supplementary Data 6), emerge from our reconstructions. In the case of Vitis, the modern genome has accumulated 41 inter-chromosomal exchanges relative to ancestors 1 and 2, and 31 after ancestor 3. The reduced number of inter-chromosomal exchanges indicates greater similarity of Vitis to the core eudicot ancestor (ancestor 3) relative to the more ancient ancestors 1 and 2. A similar reduction of inter-chromosomal exchanges, from 67 (relative to ancestor 2) to 56 (relative to ancestor 3), was also observed for Amaranthus tuberculatus, the other core eudicot genome in our analyses. As such, we can reject the occurrence of any single WGD in the eudicot stem lineage and instead firmly resolve independent WGDs in each modern eudicot lineage, including the core eudicots with their unique gamma hexaploid structure.

Our Buxus and Tetracentron genome assemblies have facilitated rigorous assessments of alternative hypothesized scenarios for the origin of gamma, a key hexaploidy associated with a major event in the history of terrestrial life, the origin of core eudicots, which comprise the vast majority of flowering plants. We have presented and analyzed several lines of evidence, including Ks distributions, genomic synteny, fractionation bias, phylogenomics, and ancestral genome reconstruction, that bear relevance to the phylogenetic and WGD history of the early-diverging eudicot angiosperms. These analyses reconstruct the sequential branching order of the initial eudicot radiation and show that each of the early-diverging eudicot lineages is characterized by its own independent duplication event(s). We find no evidence to support hypotheses that a single polyploidy event might have been formative for eudicot diversification as a whole. Instead, our analyses place gamma hexaploidy on the stem lineage of core eudicots and rule out a role for other living early-diverging eudicots as genome donors, a possibility that was consistent with the results of previous analyses^13,14,15,22. Without a single, linking WGD common to all eudicots, an argument that one polyploidy event may have helped spur the massive eudicot diversification (via adaptive, alternative deployments of duplicate genes), even following a time lag, is not supported by our data. Instead, each independent WGD among the early-diverging eudicot lineages, other than gamma, underlies relatively species-poor lineages that show limited fossil or living evidence for extensive radiation. Thus, with the genomes of all living early-diverging eudicot lineages now examined for a possible genomic contribution to gamma, the origin of gamma remains another abominable angiosperm mystery despite intensive study.

Methods

DNA extraction, sequencing, and assembly

Buxus sinica and Tetracentron sinense tissues were obtained from individuals cultivated at the University of Wisconsin-Madison (accession no. UW 136) and the University of Washington Arboretum, Seattle (accession no. 385-62), respectively. Genome sizes for these accessions were estimated using flow cytometry with BD CellQuest Pro software (Supplementary Data 7) by the Benaroya Research Institute (Seattle, WA). High-molecular-weight genomic DNA was isolated from young leaf tissue using modified nuclei-preparation and cetyltrimethylammonium bromide (CTAB) DNA extraction methods. Briefly, leaf tissue was ground to a fine powder under liquid nitrogen and mixed with nuclear isolation buffer (15 mM Tris, 10 mM EDTA, 130 mM KCl, 20 mM NaCl, 1 mM Spermine, 1 mM Spermidine, 8% PVP-10, 0.1% Triton X-100, and 7.5% 2-mercaptoethanol), passed sequentially through 100 and 40 µm mesh filters, treated with 1% Triton X-100, and centrifuged at 2000 × g for 10 min at 4 °C to pellet the nuclei. The pellet resuspended for 1 h at 65 °C in lysis buffer (100 mM Tris-HCl, 100 mM NaCl, 50 mM EDTA, 2% CTAB, 1% PEG 6000), and high-molecular-weight DNA was isolated from the lysate via 24:1 chloroform/isoamyl alcohol and purified with the QIAGEN Genomic kit. SMRTbell 20-kb libraries were generated and sequenced on the PacBio RSII platform to ~160x genomic coverage. In addition, Hi-C libraries were prepared and sequenced to coverage depths of ~40x by Phase Genomics (Seattle, WA). PacBio reads were assembled using the pb-assembly suite of programs which includes the FALCON/FALCON-unzip assembly pipeline and performs contig phasing and polishing³⁵. The polished assemblies were deduplicated with Purge Haplotigs⁵⁰ and scaffolded using Proximity Guided Assembly (PGA) and Hi-C reads by Phase Genomics (Seattle, WA).

RNA-seq data

Transcriptome assemblies were produced for Buxus sinica and Tetracentron sinense to aid annotation of their genome assemblies. We also produced transcriptome assemblies for six additional early-diverging eudicots (Buxus sempervirens, Meliosma dillenifolia, Nelumbo lutea, Sabia emarginata, Sabia swinhonei, Trochodendron aralioides), as well as the core eudicot (Gunnera manicata), to improve taxon sampling in phylogenetic analyses. Paired-end RNA-seq libraries were constructed from polyA selected total RNA extracted from floral and/or leaf tissues (Supplementary Data 8), and sequenced using the Illumina HiSeq 3000 system. Reads were trimmed with Trimmomatic⁵¹ and assembled using Trinity⁵². Coding DNA (CDS) and protein sequences were predicted with TransDecoder (http://transdecoder.github.io).

Annotation

Genomes were annotated using the MAKER pipeline⁵³. De novo transcriptome assemblies for Buxus and Tetracentron, along with proteomes for four publicly available eudicot genomes—Arabidopsis thaliana, Aquilegia coerulea, Nelumbo nucifera, and Vitis vinifera (Supplementary Data 8)—were provided as evidence. Custom repeat libraries for genome masking were produced according to the MAKER-P advanced protocol⁵⁴ using LTRharvest⁵⁵, LTRdigest⁵⁶, MITE-Hunter⁵⁷, RepeatModeler⁵⁸, and RepeatMasker⁵⁹. Gene models were predicted from the masked assemblies using the SNAP⁶⁰ and Augustus⁶¹ ab initio predictors after three rounds of training on interim high-quality (AED <= 0.25; length >= 50 amino acids) and BUSCO gene models, respectively.

Phylogenetic analyses

Three phylogenetic data sets were compiled from translated transcriptomes or genome-annotated proteomes for 40 angiosperms (Supplementary Data 8). Conserved single-copy land plant genes were identified by BUSCO⁴³ analyses with the embryophyta_odb10 data set, orthologs of the Angiosperms353 loci⁴⁴ were collected by BLAST searches seeded with Amborella trichopoda proteins, and orthogroups were circumscribed by Orthofinder⁴⁵. For all data sets, protein sequences were aligned using MAFFT⁶² and converted to codon alignments using PAL2NAL⁶³, which were refined in three successive rounds of sequence filtering and trimming using trimAl⁶⁴. Initially, sequences with less than 50% residue overlap over >70% of their length were removed to discard any potentially spurious homologs. The passing sequences were next trimmed with trimAl’s heuristic automatic method (-automated1) and filtered again as above to remove sequences that might contribute extensive missing data to the phylogenetic matrix. Alignments with fewer than 4 sequences, and missing representatives of either Buxales or Trochodendrales, were discarded. After all filtering steps, 1248 BUSCOs, 346 Angiosperms353 loci, and 2573 orthogroups were retained for phylogenetic analyses. Maximum likelihood (ML) trees for the single-copy data sets were inferred from alignments of individual loci as well as concatenations of these, produced with FASconCAT⁶⁵, using RAxML⁶⁶ with the GTR + gamma model of nucleotide evolution and 1000 bootstrap replicates. Concatenated alignments were analyzed using a partition scheme that defines individual genes as units for parameter optimization. Partitioned Bayesian Inference analyses were run with MrBayes with the GTR + I + G model for all partitions. Two independent parallel runs of four Metropolis-coupled Monte Carlo Markov Chains were run for 10 million generations with sampling every 1000 generations. Majority rule consensus trees and posterior probabilities of bipartitions were computed after discarding the first 25% of the sampled trees as burn-in. Orthogroup trees were inferred with IQ-Tree⁶⁷ with the best substitution model selected from among those implemented in RAxML and 1000 ultrafast bootstrap replicates. ASTRAL-III⁶⁸ and ASTRAL-Pro⁶⁹ were used to infer the species trees from single- and multi-copy gene trees, respectively, under the multi-species coalescent. DensiTree⁷⁰ was used for visualizations of discordance among a subset of single-copy gene trees without missing taxa.

Comparative genomics of polyploidy

CoGe’s SynMap and FractBias programs were used to perform genome alignments and fractionation bias calculations. FractBias analyses were conducted using all genes in the target genomes and syntenic depth settings in accordance with ploidy levels of respective genomes, as revealed by SynMap plots. All analyses can be regenerated on the CoGe platform (see Code availability below). For synteny-guided phylogenomic analyses, inter-genomic alignments were produced and screened to identify all syntenic homologs (syntelogs) present in ratios of up to 3:2:2:2:4 in Vitis, Aquilegia, Buxus, Nelumbo, and Tetracentron, respectively, using MCscan⁷¹. This collection of syntenic homologs was divided into seven pools in accordance with the major synteny blocks conserved across these eudicot genomes (as identified by SynMap and FractBias mappings, and which correspond with ancestral eudicot chromosomes). Unique identifiers for individual loci were replaced by ‘Species_chromosome’ codes to create comparable phylogenetic matrices and trees for coalescence-based phylogenetic analyses as outlined above.

Ancestral genomes

To build the three ancestral genomes indicated in Fig. 4, we use the RACCROCHE pipeline³⁴. Briefly, RACCROCHE uses all the syntenically validated homolog pairs generated by SynMap and builds disjoint gene families based on the principle that a gene homologous (orthologous or paralogous) with any gene in a family must also be a member of that family. For each genome, RACCROCHE extracts a set of ‘generalized’ adjacencies, namely all oriented pairs of genes within the same window containing seven consecutive genes. The pairs are represented by the non-adjacent ends of the two genes. The genes in these pairs are then labeled according to the gene families to which they belong. Each ancestor node has three incident branches, partitioning the tree into three subtrees defined by the one incoming edge (its ancestor) and two outgoing edges (its descendants). If an adjacency is found anywhere in any of the genomes in two or three of these subtrees, it is considered a candidate adjacency. With candidate adjacencies weighted as 2 or 3 according to the number of occurrences in subtrees, a maximum weight matching (MWM) of gene ends constructs the highest weight sets of compatible contiguous adjacencies (ancestral contigs). A gene end can only be matched to one end of another gene, so that these ancestral contigs are guaranteed to be linearly, or very occasionally circularly, ordered. Inversions with breakpoints within windows of seven consecutive genes will preserve common adjacencies between two genomes, but not reading directions within the window. Common adjacencies are our primary concern, so we do not use reading direction information in MWM. Circular contigs were linearized by breaking an adjacency of lowest weight. The ancestral contigs from MWM solutions were then aligned to chromosomes of modern genomes, and co-occurring contigs were clustered to assemble ancestral chromosomes. A complete-linkage clustering was applied to the correlations of contigs’ co-occurrence to assemble ancestral chromosomes⁷². To aid in future studies of the genomic organization of gene function, a GO-term enrichment analysis of the members of each gene family was implemented to produce a functional annotation for the inferred ancestral genes. The functional annotations of ancestral genomes can be downloaded from https://git.cs.usask.ca/buxus/buxus-tetra.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All raw sequence reads used in this study have been deposited in NCBI under the BioProject accession numbers PRJNA549075, PRJNA547721, and PRJNA548936. In addition, the Buxus and Tetracentron genome assemblies, associated annotation files, and predicted CDS and protein sequences, along with all phylogenetic data sets analyzed here, and ancestral genome reconstructions have been deposited in the Dryad Digital Repository [https://doi.org/10.5061/dryad.cjsxksn6d]⁷³. Source data are provided with this paper.

Code availability

Custom scripts and command-line arguments have been deposited in GitHub [https://github.com/andrechanderbali/Buxus-Tetracentron-Genomes].

References

Govaerts, R. How many species of seed plants are there? TAXON 50, 1085–1090 (2001).
Article Google Scholar
Friis, E. M., Pedersen, K. R. & Crane, P. R. Cretaceous angiosperm flowers: Innovation and evolution in plant reproduction. Palaeogeogr., Palaeoclimatol., Palaeoecol. 232, 251–293 (2006).
Article Google Scholar
Magallón, S., Gómez-Acevedo, S., Sánchez-Reyes, L. L. & Hernández-Hernández, T. A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. N. Phytol. 207, 437–453 (2015).
Article Google Scholar
Jiao, Y. et al. Ancestral polyploidy in seed plants and angiosperms. Nature 473, 97–100 (2011).
Article ADS CAS PubMed Google Scholar
Amborella Genome Project. The Amborella genome and the evolution of flowering plants. Science 342, 1241089 (2013).
Article Google Scholar
Schranz, M. E., Mohammadin, S. & Edger, P. P. Ancient whole genome duplications, novelty and diversification: the WGD Radiation Lag-Time Model. Curr. Opin. Plant Biol. 15, 147–153 (2012).
Article PubMed Google Scholar
Vanneste, K., Maere, S. & Peer, deY. V. Tangled up in two: a burst of genome duplications at the end of the Cretaceous and the consequences for plant evolution. Philos. Trans. R. Soc. B 369, 20130353 (2014).
Article Google Scholar
Tank, D. C. et al. Nested radiations and the pulse of angiosperm diversification: increased diversification rates often follow whole genome duplications. N. Phytologist 207, 454–467 (2015).
Article Google Scholar
Soltis, P. S. & Soltis, D. E. Ancient WGD events as drivers of key innovations in angiosperms. Curr. Opin. Plant Biol. 30, 159–165 (2016).
Article PubMed Google Scholar
Landis, J. B. et al. Impact of whole-genome duplication events on diversification rates in angiosperms. Am. J. Bot. 105, 348–363 (2018).
Article PubMed Google Scholar
Cantino, P. D. et al. Towards a phylogenetic nomenclature of Tracheophyta. Taxon 56, 1E–44E (2007).
Article Google Scholar
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
Article ADS CAS PubMed Google Scholar
Jiao, Y. et al. A genome triplication associated with early diversification of the core eudicots. Genome Biol. 13, R3 (2012).
Article PubMed PubMed Central Google Scholar
Vekemans, D. et al. Gamma paleohexaploidy in the stem-lineage of core eudicots: significance for MADS-box gene and species diversification. Mol. Biol. Evol. https://doi.org/10.1093/molbev/mss183 (2012).
Chanderbali, A. S., Berger, B. A., Howarth, D. G., Soltis, D. E. & Soltis, P. S. Evolution of floral diversity: genomics, genes and gamma. Philos. Trans. R. Soc. Lond., B, Biol. Sci. 372, 20150509 (2017).
Soltis, D. E. et al. Gunnerales are sister to other core eudicots: implications for the evolution of pentamery. Am. J. Bot. 90, 461–470 (2003).
Article PubMed Google Scholar
Endress, P. K. In Advances in Botanical Research 44: Developmental Genetics of the Flower (eds. Soltis, D. E., Leebens-Mack, J. H. & Soltis, P. S.) 1–61 (Elsevier, 2006).
Endress, P. K. Flower structure and trends of evolution in eudicots and their major subclades. Ann. Mo. Botanical Gard. 97, 541–583 (2010).
Article Google Scholar
Soltis, D. E. et al. Angiosperm phylogeny: 17 genes, 640 taxa. Am. J. Bot. 98, 704–730 (2011).
Article PubMed Google Scholar
Ohno, S. Evolution by Gene Duplication (Springer-Verlag, 1970).
Lyons, E., Pedersen, B., Kane, J. & Freeling, M. The value of nonmodel genomes and an example using SynMap within CoGe to dissect the hexaploidy that predates the rosids. Tropical Plant Biol. 1, 181–190 (2008).
Article CAS Google Scholar
Ming, R. et al. Genome of the long-living sacred lotus (Nelumbo nucifera Gaertn.). Genome Biol. 14, R41 (2013).
Article PubMed PubMed Central Google Scholar
Tang, H. et al. Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. Genome Res. 18, 1944–1954 (2008).
Article CAS PubMed PubMed Central Google Scholar
Aköz, G. & Nordborg, M. The Aquilegia genome reveals a hybrid origin of core eudicots. Genome Biol. 20, 256 (2019).
Article PubMed PubMed Central Google Scholar
Velasco, R. et al. A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS ONE 2, e1326 (2007).
Article ADS PubMed PubMed Central Google Scholar
Worberg, A. et al. Phylogeny of basal eudicots: insights from non-coding and rapidly evolving DNA. Org. Diversity Evolution 7, 55–77 (2007).
Article Google Scholar
Ruhfel, B. R., Gitzendanner, M. A., Soltis, P. S., Soltis, D. E. & Burleigh, J. G. From algae to angiosperms–inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes. BMC Evolut. Biol. 14, 23 (2014).
Article Google Scholar
Moore, M. J., Soltis, P. S., Bell, C. D., Burleigh, J. G. & Soltis, D. E. Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc. Natl Acad. Sci. USA 107, 4623–4628 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Sun, Y. et al. Phylogenomic and structural analyses of 18 complete plastomes across nearly all families of early-diverging eudicots, including an angiosperm-wide analysis of IR gene content evolution. Mol. Phylogenet. Evol. 96, 93–101 (2016).
Article PubMed Google Scholar
Leebens-Mack, J. H. et al. One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679–685 (2019).
Article Google Scholar
Filiault, D. L. et al. The Aquilegia genome provides insight into adaptive radiation and reveals an extraordinarily polymorphic chromosome with a unique history. Elife 7, e36426 (2018).
Strijk, J. S., Hinsinger, D. D., Zhang, F. & Cao, K. Trochodendron aralioides, the first chromosome-level draft genome in Trochodendrales and a valuable resource for basal eudicot research. Gigascience 8, giz136 (2019).
Liu, P.-L. et al. The Tetracentron genome provides insight into the early evolution of eudicots and the formation of vessel elements. Genome Biol. 21, 291 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xu, Q., Jin, L., Zheng, C., Leebens-Mack, J. & Sankoff, D. in Lecture Notes in Bioinformatics Vol. 12686 (2021).
Chin, C.-S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050–1054 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ghurye, J. et al. Integrating Hi-C links with assembly graphs for chromosome-scale assembly. PLoS Comput. Biol. 15, e1007273 (2019).
Yang, X., Lu, S. & Peng, H. Cytological studies on the eastern Asian family Trochodendraceae. Bot. J. Linn. Soc. 158, 332–335 (2008).
Article Google Scholar
Van Laere, K., Hermans, D., Leus, L. & Van Huylenbroeck, J. Genetic relationships in European and Asiatic Buxus species based on AFLP markers, genome sizes and chromosome numbers. Plant Syst. Evolution 293, 1–11 (2011).
Article Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Seppey, M., Manni, M. & Zdobnov, E. M. in Gene Prediction: Methods and Protocols (ed. Kollmar, M.) 227–245 (Springer, 2019).
Ratter, J. A. & Milne, C. in Notes from the Royal Botanic Garden Edinburgh (UK) (1976).
Dodsworth, S., Chase, M. W. & Leitch, A. R. Is post-polyploidization diploidization the key to the evolutionary success of angiosperms?. Botanical J. Linn. Soc. 180, 1–5 (2016).
Article Google Scholar
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543–548 (2018).
Article CAS PubMed Google Scholar
Johnson, M. G. et al. A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering. Syst. Biol. 68, 594–606 (2019).
Article CAS PubMed Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
Article PubMed PubMed Central Google Scholar
Mirarab, S. & Warnow, T. ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes. Bioinformatics 31, i44–i52 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sankoff, D., Zheng, C., Lyons, E. & Tang, H. in Algorithms for Computational Biology (eds. Botón-Fernández, M., Martín-Vide, C., Santander-Jiménez, S. & Vega-Rodríguez, M. A.) 3–14 (Springer International Publishing, 2016).
Sankoff, D. & Zheng, C. in Comparative Genomics: Methods and Protocols (eds. Setubal, J. C., Stoye, J. & Stadler, P. F.) 291–315 (Springer, 2018).
Murat, F., Armero, A., Pont, C., Klopp, C. & Salse, J. Reconstructing the genome of the most recent common ancestor of flowering plants. Nat. Genet 49, 490–496 (2017).
Article CAS PubMed Google Scholar
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinforma. 19, 460 (2018).
Article CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinforma. 12, 491 (2011).
Article Google Scholar
Campbell, M. S. et al. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 164, 513–524 (2014).
Article CAS PubMed Google Scholar
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinforma. 9, 18 (2008).
Article Google Scholar
Steinbiss, S., Willhoeft, U., Gremme, G. & Kurtz, S. Fine-grained annotation and classification of de novo predicted LTR retrotransposons. Nucleic Acids Res. 37, 7002–7013 (2009).
Article CAS PubMed PubMed Central Google Scholar
Han, Y. & Wessler, S. R. MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences. Nucleic Acids Res. 38, e199 (2010).
Article PubMed PubMed Central Google Scholar
Hubley, R. & Smit, A. RepeatModeler Open-1.0. http://www.repeatmasker.org/RepeatModeler/ (2008).
Smit, A. F., Hubley, R. & Green, P. RepeatMasker (2013).
Korf, I. Gene finding in novel genomes. BMC Bioinforma. 5, 59 (2004).
Article Google Scholar
Stanke, M. & Waack, S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19, ii215–ii225 (2003).
Article PubMed Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evolution 30, 772–780 (2013).
Article CAS Google Scholar
Suyama, M., Torrents, D. & Bork, P. PAL2NAL: Robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 34, W609–W612 (2006).
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Article PubMed PubMed Central Google Scholar
Kück, P. & Meusemann, K. FASconCAT: convenient handling of data matrices. Mol. Phylogenet. Evol. 56, 1115–1118 (2010).
Article PubMed Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Zhang, C., Rabiee, M., Sayyari, E. & Mirarab, S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinforma. 19, 153 (2018).
Article Google Scholar
Zhang, C., Scornavacca, C., Molloy, E. K. & Mirarab, S. ASTRAL-Pro: quartet-based species-tree inference despite paralogy. Mol. Biol. Evolution 37, 3292–3307 (2020).
Article CAS Google Scholar
Bouckaert, R. R. DensiTree: making sense of sets of phylogenetic trees. Bioinformatics 26, 1372–1373 (2010).
Article CAS PubMed Google Scholar
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).
Article ADS CAS PubMed Google Scholar
Xu, Q., Jin, L., Leebens-Mack, J. & Sankoff, D. Validation of automated chromosome recovery in the reconstruction of ancestral gene order. Algorithms 14, 160 (2021).
Chanderbali, A. S. et al. Data from: Buxus and Tetracentron genomes. Dryad, Dataset. https://doi.org/10.5061/dryad.cjsxksn6d (2021).

Download references

Acknowledgements

This work was supported by National Science Foundation grants DEB-1455601 to A.S.C., DEB-1457440 to D.G.H., DEB-2030871 to V.A.A., and Discovery grants to L.J. and to D.S. from the Natural Sciences and Engineering Research Council of Canada. D.S. holds the Canada Research Chair in Mathematical Genomics. We thank Brent Berger, Ray Larson, and Veronica Di Stilio for aid in plant collection and DNA extraction and Hanqi Ye for bioinformatics discussion.

Author information

Authors and Affiliations

Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
Andre S. Chanderbali, Douglas E. Soltis & Pamela S. Soltis
Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada
Lingling Jin
Department of Mathematics and Statistics, University of Ottawa, Ottawa, ON, Canada
Qiaoji Xu, Yue Zhang & David Sankoff
Department of Biological Sciences, St. John’s University, Queens, NY, USA
Jingbo Zhang & Dianella G. Howarth
South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
Shuguang Jian
Department of Biological Sciences, University at Buffalo, Buffalo, NY, USA
Emily Carroll & Victor A. Albert
Department of Biology, University of Florida, Gainesville, FL, USA
Douglas E. Soltis
Biodiversity Institute, University of Florida, Gainesville, FL, USA
Douglas E. Soltis & Pamela S. Soltis
Genetics Institute, University of Florida, Gainesville, FL, USA
Douglas E. Soltis & Pamela S. Soltis

Authors

Andre S. Chanderbali
View author publications
You can also search for this author in PubMed Google Scholar
Lingling Jin
View author publications
You can also search for this author in PubMed Google Scholar
Qiaoji Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yue Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jingbo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shuguang Jian
View author publications
You can also search for this author in PubMed Google Scholar
Emily Carroll
View author publications
You can also search for this author in PubMed Google Scholar
David Sankoff
View author publications
You can also search for this author in PubMed Google Scholar
Victor A. Albert
View author publications
You can also search for this author in PubMed Google Scholar
Dianella G. Howarth
View author publications
You can also search for this author in PubMed Google Scholar
Douglas E. Soltis
View author publications
You can also search for this author in PubMed Google Scholar
Pamela S. Soltis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S.C., D.E.S., D.G.H., D.S., P.S.S., and V.A.A. conceived and designed the study. A.S.C. generated the whole-genome and transcriptome assemblies, performed phylogenetic and comparative genomics analyses, and drafted the primary manuscript. D.S., L.J., and Q.X. generated and analyzed the ancestral genome reconstructions. Y.Z., E.C., and V.A.A. analyzed data. S.J. provided data. Additional text and discussion were provided by D.E.S., D.G.H., D.S., L.J., J.Z., P.S.S., and V.A.A. All authors approved the final manuscript.

Corresponding author

Correspondence to Andre S. Chanderbali.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chanderbali, A.S., Jin, L., Xu, Q. et al. Buxus and Tetracentron genomes help resolve eudicot genome history. Nat Commun 13, 643 (2022). https://doi.org/10.1038/s41467-022-28312-w

Download citation

Received: 29 June 2021
Accepted: 14 January 2022
Published: 02 February 2022
DOI: https://doi.org/10.1038/s41467-022-28312-w

This article is cited by

Karyotype and LTR-RTs analysis provide insights into oak genomic evolution
- Rui-Bin Cao
- Ran Chen
- Xiao-Long Jiang
BMC Genomics (2024)
High quality genomes produced from single MinION flow cells clarify polyploid and demographic histories of critically endangered Fraxinus (ash) species
- Steven J. Fleck
- Crystal Tomlin
- Victor A. Albert
Communications Biology (2024)
Stomata variation in the process of polyploidization in Chinese chive (Allium tuberosum)
- Peng-Qiang Yao
- Jian-Hua Chen
- Shi-Ping Cheng
BMC Plant Biology (2023)
From comparative gene content and gene order to ancestral contigs, chromosomes and karyotypes
- Qiaoji Xu
- Lingling Jin
- David Sankoff
Scientific Reports (2023)
Subgenome dominance shapes novel gene evolution in the decaploid pitcher plant Nepenthes gracilis
- Franziska Saul
- Mathias Scharmann
- Kenji Fukushima
Nature Plants (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and discussion

Genome assembly, annotation, and structure

Phylogenetic positions of Buxus and Tetracentron

Phylogenomics of eudicot subgenomes

Ancestral genomes

Methods

DNA extraction, sequencing, and assembly

RNA-seq data

Annotation

Phylogenetic analyses

Comparative genomics of polyploidy

Ancestral genomes

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links