Chromosome-scale assembly of the Kandelia obovata genome

Hu, Min-Jie; Sun, Wei-Hong; Tsai, Wen-Chieh; Xiang, Shuang; Lai, Xing-Kai; Chen, De-Qiang; Liu, Xue-Die; Wang, Yi-Fan; Le, Yi-Xun; Chen, Si-Ming; Zhang, Di-Yang; Yu, Xia; Hu, Wen-Qi; Zhou, Zhuang; Chen, Yan-Qiong; Zou, Shuang-Quan; Liu, Zhong-Jian

doi:10.1038/s41438-020-0300-x

Download PDF

Article
Open access
Published: 02 May 2020

Chromosome-scale assembly of the Kandelia obovata genome

Min-Jie Hu¹^na1,
Wei-Hong Sun^2,3^na1,
Wen-Chieh Tsai⁴,
Shuang Xiang^2,3,
Xing-Kai Lai⁵,
De-Qiang Chen^2,3,
Xue-Die Liu²,
Yi-Fan Wang²,
Yi-Xun Le²,
Si-Ming Chen^2,6,
Di-Yang Zhang ORCID: orcid.org/0000-0001-7548-4378³,
Xia Yu³,
Wen-Qi Hu³,
Zhuang Zhou³,
Yan-Qiong Chen³,
Shuang-Quan Zou^2,3 &
…
Zhong-Jian Liu^3,7

Horticulture Research volume 7, Article number: 75 (2020) Cite this article

4627 Accesses
39 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The mangrove Kandelia obovata (Rhizophoraceae) is an important coastal shelterbelt and landscape tree distributed in tropical and subtropical areas across East Asia and Southeast Asia. Herein, a chromosome-level reference genome of K. obovata based on PacBio, Illumina, and Hi-C data is reported. The high-quality assembled genome size is 177.99 Mb, with a contig N50 value of 5.74 Mb. A large number of contracted gene families and a small number of expanded gene families, as well as a small number of repeated sequences, may account for the small K. obovata genome. We found that K. obovata experienced two whole-genome polyploidization events: one whole-genome duplication shared with other Rhizophoreae and one shared with most eudicots (γ event). We confidently annotated 19,138 protein-coding genes in K. obovata and identified the MADS-box gene class and the RPW8 gene class, which might be related to flowering and resistance to powdery mildew in K. obovata and Rhizophora apiculata, respectively. The reference K. obovata genome described here will be very useful for further molecular elucidation of various traits, the breeding of this coastal shelterbelt species, and evolutionary studies with related taxa.

A high-quality chromosome-scale assembly of the centipedegrass [Eremochloa ophiuroides (Munro) Hack.] genome provides insights into chromosomal structural evolution and prostrate growth habit

Article Open access 01 September 2021

Chromosome-level genome assembly and annotation of the prickly nightshade Solanum rostratum Dunal

Article Open access 01 June 2023

Chromosome-level genome assembly of Zizania latifolia provides insights into its seed shattering and phytocassane biosynthesis

Article Open access 11 January 2022

Introduction

Mangrove forests are coastal ecosystems with unique biodiversity that provides many ecosystem services and functions¹. Mangrove loss will increase the threat of coastal hazards (i.e., erosion, storm surges, and tsunamis) to human safety and shoreline development². Specifically, this will reduce coastal water quality and biodiversity and threaten adjacent coastal habitats, thereby weakening the main resources on which the human community relies, including a large number of products and services provided by mangroves^3,4. Therefore, detailed studies and analyses of the genome and evolution of mangroves are urgently required, especially in the context of frequent human disturbance and inevitable sea-level rise.

The mangrove species Kandelia obovata belongs to Rhizophoraceae, which is called “Qiuqie” in Chinese, with the Latin name of K. candel in “Flora Reipublicae Popularis Sinicae”⁵. Later, in 2008, its Latin name was changed to K. obovata in the “Flora of China”⁶. K. obovata is a woody plant predominantly found in tropical and subtropical tidal salt wetlands distributed from East Asia to Southeast Asia⁷. K. obovata adapts to transitional ecosystems where the land and ocean meet by overcoming periodic and aperiodic tidal effects, which induce high salinity, severe erosion, and anaerobic conditions⁸. K. obovata plays a crucial role in protecting biodiversity and combating erosion^9,10. Specifically, the mangrove K. obovata can protect the embankment, accelerate the natural deposition of the beach, filter organic matter and pollutants from inland areas, and provide an ideal habitat for the marine flora and fauna¹¹. At the same time, due to its beautiful shape, unique floral pattern and fragrance, K. obovata is an excellent coastal wetland landscape plant and horticultural ornamental plant (Fig. 1).

**Fig. 1: Morphological features of the flower and fruit of K. obovata.**

Here, the genome of the mangrove K. obovata was sequenced using PacBio sequencing as well as the Illumina next-generation sequencing platform. These data can help clarify the history of mangrove colonization and mangrove adaptation mechanisms in intertidal zones. Furthermore, this study will provide a basis for the conservation of mangrove diversity and in-depth development of genetic resources for mangroves, as well as the development and utilization of coastal horticultural plants.

Results and discussion

Genome sequence and assembly

K. obovata contains 36 chromosomes (2n = 2x = 36)⁶. To assess genome size, survey sequencing was performed, and 65.27 Gb of clean data was obtained (Supplementary Table 1). The survey analysis indicated that the K. obovata genome size is 211.86 Mb and has a low level of heterozygosity of approximately 0.38% (Supplementary Fig. 1). The assembled genome is 178.44 Mb in size, with a scaffold N50 value of 279.55 kb obtained by using Illumina sequencing (Table 1). To improve K. obovata assembly quality, we conducted Pacific Biosciences RSII sequencing and obtained 25 Gb of single-molecule real-time long reads (average read length of 11.9 kb; Supplementary Fig. 2, Supplementary Table 1). The final assembled genome is 177.99 Mb in size, with a contig N50 value of 5.74 Mb (Table 1). The quality of the assembly was evaluated using Benchmarking Universal Single-Copy Orthologs (BUSCO)¹². The results showed that the gene set completeness of the assembled genome is 97.3%, indicating that the K. obovata genome assembly is very complete and of high quality (Table 1). Finally, high-throughput/resolution chromosome conformation capture (Hi-C) technology was adopted to assess the chromosome-level diploid genome. The results showed that the lengths of the chromosomes ranged from 5.03 to 13.8 Mb (Supplementary Table 2), with a total length of 178.01 Mb and a scaffold N50 of 10.03 Mb (Fig. 2, Table 1).

Table 1 The statistical results of Hi-C assembly

Full size table

Gene prediction and annotation

We confidently annotated 19,138 protein-coding genes in K. obovata (Supplementary Fig. 3, Supplementary Table 3), of which 19,136 (99.17%) were supported by de novo prediction, transcriptome data, and homolog prediction (Supplementary Table 4). The genome of Rhizophora apiculata, also belonging to Rhizophoreae, has 26,640 protein-coding genes, which is 7502 more than observed in K. obovata¹³. The BUSCO¹² assessment indicated that the completeness of the gene set of the annotated genome was 90% for K. obovata (Supplementary Table 5). In addition, 105 microRNAs, 307 transfer RNAs, 167 ribosomal RNAs, and 199 small nuclear RNAs were identified in the K. obovata genome (Supplementary Table 6).

Using homology-based and de novo approaches to identify transposable elements (TEs), we estimated that 24.07% of the K. obovata genome consists of repetitive sequences (Supplementary Figs. 4 and 5 and Supplementary Tables 7 and 8) and 29% of the R. apiculata genome consists of repetitive sequences¹³. Compared with those of closely related nonmangrove plant genomes, the repetitive portions of the R. apiculata genome, comprising predominantly TE families, are significantly reduced, and the decrease in TE number largely resulted in a general decrease in genome size among true mangroves¹³. The small repetitive sequences may be one reason for the small genome of K. obovata. In addition, 18,266 genes were functionally annotated, among which 11,124 and 14,401 were annotated to Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes terms, respectively, and 12,491 genes were functionally annotated in all five databases (Supplementary Fig. 6, Supplementary Table 9).

Evolution of gene families

We constructed a phylogenetic tree and estimated the divergence times of K. obovata and nine other plant species based on genes extracted from a total of 1095 single-copy families (Supplementary Figs. 7 and 8, Supplementary Table 10). As expected, K. obovata was sister to R. apiculata (Supplementary Fig. 9). The estimated Rhizophoreae divergence time was 83.15 Mya, and the divergence time between K. obovata and R. apiculata was 24.63 Mya (Supplementary Fig. 9). Next, using CAFÉ 3 (ref. ¹⁴), we found that 1110 gene families were expanded in the lineage leading to the Rhizophoreae, whereas 1368 families were contracted (Fig. 3). Four hundred and ninety-five gene families were expanded in K. obovata, compared with the 1098 in R. apiculata (Fig. 3). At the same time, 1604 gene families were contracted in K. obovata, compared with the 659 in R. apiculata. K. obovata has more contracted gene families than R. apiculata and fewer expanded gene families than R. apiculata, which may be the reason that the genome of K. obovata is smaller than that of R. apiculata. For the expanded gene families, we conducted GO enrichment analysis and found enrichment for the GO terms “structural constituent of cytoskeleton” and “structural constituent of ribosome” (Supplementary Table 11). For the contracted gene families, enrichment was detected for the GO terms “protein kinase activity”, “terpene synthase activity”, “oxidoreductase activity”, “nutrient reservoir activity”, “defense response”, and “sulfotransferase activity” (Supplementary Table 12). Gene families with K. obovata-specific expansion and contraction might relate to adaptation to K. obovata-specific coastal niches. Further research is required to validate the function of these genes.

**Fig. 3: The expansion and contraction of gene families.**

Synteny analysis and an ancient polyploidization event

Whole-genome polyploidization events are a feature of many taxa and an efficient mechanisms of genome expansion¹⁵. To detect the occurrence of polyploidization events in Rhizophoreae, we used the default parameters of JCVI v0.9.14 (ref. ¹⁶) to analyze the protein sequences of K. obovata, R. apiculata, and Vitis vinifera and obtained the gene pairs in the collinear regions. The results showed that there were 11,010 collinear gene pairs between K. obovata and R. apiculata, 10,893 collinear gene pairs between K. obovata and V. vinifera, 3,840 collinear gene pairs within K. obovata and 4,646 collinear gene pairs within R. apiculata (Supplementary Table 13).

We estimated the distributions of synonymous substitutions per synonymous site (Ks) values to more precisely infer the timing of polyploidization events in the K. obovata genome. The distributions of Ks for paralogous K. obovata genes showed two peaks, one at Ks = 0.38 and the other at Ks = 1.5–1.9 (Fig. 4, Supplementary Fig. 10a). The Ks distribution of R. apiculata also had two peaks, one at Ks = 0.32 and the other at Ks = 1.5–1.9 (Fig. 4, Supplementary Fig. 10b). The results suggested that K. obovata and R. apiculata experienced two polyploidization events. To confirm these two polyploidization events, we further analyzed the Ks distribution of K. obovata and R. apiculata and that of K. obovata and V. vinifera. We observed that the Ks distribution of K. obovata and R. apiculata had one peak, at Ks = 0.1–0.16, which was smaller than the first peak in the Ks distributions within K. obovata (Ks = 0.38) and R. apiculata (Ks = 0.32) (Fig. 4). The first peak in the K. obovata Ks distribution (Ks = 0.38) indicates that K. obovata shares a whole-genome duplication (WGD) event with other Rhizophoreae. In addition, we found that the Ks distribution of K. obovata and V. vinifera had one peak, at Ks = 0.9–1.4, which was also smaller than the second peak in the Ks distributions within K. obovata (Ks = 1.5–1.9) and R. apiculata (Ks = 1.5–1.9) (Fig. 4). The second peak in the K. obovata Ks distribution (Ks = 1.5–1.9) indicates that the common ancestor of K. obovata and V. vinifera experienced an ancient polyploidization event. This event was shared by most eudicots, called the γ event, which is an ancient whole-genome triplication event¹⁷. Finally, we provide direct evidence of gene collinearity, as shown in Fig. 5; the purple peak corresponds to the first peak of the K. obovata Ks distribution (Ks = 0.38) and R. apiculata Ks distribution (Ks = 0.32) (Fig. 5b, d), and the green peak corresponds to the second peak of the K. obovata Ks distribution (Ks = 1.5–1.9) and R. apiculata Ks distribution (Ks = 1.5–1.9) (Fig. 5a, c). The purple collinear region is an extra copy of the genomes of K. obovata and R. apiculata, and the green collinear region is also an extra copy of the genes in the genomes of K. obovata and R. apiculata (Fig. 5). These copies correspond to two polyploidization events of K. obovata and R. apiculata. Therefore, our study verified that K. obovata experienced two polyploidization events: one WGD event shared with Rhizophoreae and one shared with most eudicots (γ event).

**Fig. 4: Ks distributions between *K. obovata* and *R. apiculata* and *K. obovata* and *V. vinifera* and within *K. obovata* and *R. apiculata*.**

**Fig. 5: Collinear point diagram and Ks values corresponding to the collinear blocks.**

MADS-box gene family analysis

MADS-box genes play a key role in many important processes during plant development, especially during flower development¹⁸. We evaluated the MADS-box genes in K. obovata and R. apiculata. The K. obovata and R. apiculata genomes encode 43 and 65 MADS-box genes, respectively. There are 12 type I and 31 type II MADS-box genes in the K. obovata genome and 31 type I and 34 type II genes in the R. apiculata genome (Table 2, Supplementary Table 14). Interactions among type I MADS-box genes promote the initiation of endosperm development¹⁹. The type I genes of R. apiculata were approximately three times more numerous than those of K. obovata (Fig. 6a, Table 2). In addition, only 1 pseudogene type I genes were found in the K. obovata genome (Supplementary Table 14), suggesting that the type I MADS-box genes of K. obovata experienced a lower gain rate and higher loss rate than type II MADS-box genes.

Table 2 MADS-box genes in Arabidopsis thaliana, Oryza sativa, Phalaenopsis equestris, K. obovata, and R. apiculata

Full size table

**Fig. 6: Phylogenetic analysis of MADS-box genes from *A. thaliana*, *O. sativa*, *P. equestris*, *K. obovata*, and *R. apiculata*.**

Type II MADS-box genes include two types: MIKC^C and MIKC*²⁰. MIKC*-type gene regulation has a major impact on pollen gene expression^21,22. Plant MIKC^C-type genes are the most widely studied MADS-box genes because they are essential for plant growth and development^23,24. The K. obovata genome has four MIKC*-type genes and 27 MIKC^C-type genes, while the R. apiculata genome has three MIKC*-type genes and 31 MIKC^C-type genes (Fig. 6b, Table 2). Fewer C/D-class and AGL6 genes were found in K. obovata and R. apiculata than in rice, whereas more B-AP3-class and E-class genes were found in K. obovata than in rice (Fig. 6b). A-class, B-class, C/D-class, and E-class gene clades are well known for their roles in the specification of floral organ identity²⁵, notably, the ABCDE flowering model^26,27,28. K. obovata and R. apiculata have the same number of A-class and B-class genes (five members). K. obovata (six members) has more E-class genes than R. apiculata (four members), and R. apiculata (one member) has fewer C-class genes than K. obovata (three members) (Fig. 6b). The AGL12 gene is involved in root cell differentiation²⁹, and the ANR1 gene is involved in the regulation of lateral root development³⁰. Furthermore, the loss of the AGL12 gene may result in the loss of the ability to develop true roots for terrestrial growth²⁹. K. obovata and R. apiculata each contain one AGL12-clade gene and one ANR1-clade gene (Fig. 6b), which may be because mangrove roots have adapted to environments at the interface of land and sea. SOC1, SVP, FLC, and AGL15 regulate flowering time^31,32,33,34. SOC1 integrates multiple flowering signals related to photoperiod, temperature, hormones, and age³⁴. Notably, we found that SOC1-like genes were expanded in both K. obovata (five members of SOC1) and R. apiculata (seven members of SOC1) (Fig. 6b). Sequence variation among these SOC1-like genes could be associated with the functional diversification of the SOC1 clade in K. obovata and R. apiculata.

Disease resistance-related genes

Plant resistance genes (R genes) exist in large families and usually contain a nucleotide-binding site (NBS) domain and a leucine-rich repeat (LRR) domain, denoted NLR³⁵. According to the presence or absence of different domains in the N-terminal region, resistance genes encoding NBS domains can be divided into the TNL (TIR-NBS-LRR), CNL (CC-NBS-LRR), and RNL (RPW8-NBS-LRR) groups³⁶. A total of 165 and 292 nucleotide-binding site (NBS)-containing R genes were identified in K. obovata and R. apiculata, respectively; this might be because the distribution of R. apiculata is wider than that of K. obovata (Fig. 7, Supplementary Table 15).

**Fig. 7: Phylogenetic reconstruction of the NLR proteins in *K. obovata* and *R. apiculata*.**

We selected NLR candidate genes from K. obovata and R. apiculata with complete domains to construct a phylogenetic tree. The results showed that these candidate genes were divided into the TNL, RNL, and CNL families (Fig. 7). RPW8 is a family of genes with highly specifically expressed characteristics, including resistance to powdery mildew³⁷. The phylogenetic tree showed that RPW8 genes were significantly separated from all other CNL genes (Fig. 7). The RPW8 clade contained two K. obovata and three R. apiculata genes and clustered with two ADR1 genes from Arabidopsis, indicating that RPW8 genes might be associated with resistance to powdery mildew (Fig. 7).

Conclusion

Although K. obovata is well known as a coastal shelterbelt and landscape tree in tropical and subtropical areas, research on this species has been hampered by a lack of genetic data. We obtained a chromosome-level reference genome of K. obovata, assembled a 177.99 Mb genome, and annotated 19,136 protein-coding genes. A large number of contracted gene families and a small number of expanded gene families, as well as a small number of repeated sequences, resulted in a smaller genome in K. obovata than in R. apiculata. Ks analysis revealed that K. obovata experienced two polyploidization events, namely, the recent WGD shared with other Rhizophoreae and the ancient polyploidization event shared with most eudicots (γ event). The Rhizophoreae divergence time was 83.15 Mya, and the divergence time between K. obovata and R. apiculata was 24.63 Mya. We identified MADS-box and RPW8 genes in K. obovata, which might be related to flowering and resistance to powdery mildew, respectively. The genomic sequence analysis of the mangrove K. obovata helped reveal its mechanisms of adaptation to the intertidal zone; this knowledge is critical for understanding its genetic evolution and reproduction.

Materials and methods

DNA preparation and sequencing

Fresh K. obovata tissues were collected from the Quanzhou Estuary Wetland Provincial Nature Reserve, Fujian Province, China. Genomic DNA was isolated from the fresh leaves of K. obovata for de novo sequencing and assembly. Paired-end libraries (500 bp) were constructed according to the Illumina protocol. Genome size and heterozygosity were measured using KmerFreq and GCE based on a 17-K-mer distribution. In addition, a 20 kb insert library was constructed according to the PacBio RSII protocol and subsequently sequenced on the PacBio platform (Supplementary Table 1). The transcriptomes of different tissues of K. obovata were sequenced on the Illumina platform.

Genome assembly

De novo assembly of the PacBio reads was performed. FALCON (https://github.com/PacificBiosciences/FALCON)³⁸ was used to correct errors in the original data. Then, SMARTdenovo v1.0 was used to assemble the corrected data³⁹, and Arrow software (https://github.com/PacificBiosciences/GenomicConsensus) was used to polish the assembly results. To further eliminate Indel and SNP errors in the assembly sequence, we compared the second-generation small-fragment data to the assembly results and corrected the assembly results again with Pilon v1.22 (ref. ⁴⁰). To confirm the quality of the genome assembly, we performed a BUSCO v3 (ref. ¹²) (http://busco.ezlab.org/) assessment using single-copy orthologous genes.

Hi-C library construction and assembly of the chromosome

Fresh leaves of K. obovata were used to construct a Hi-C sequencing library, which was sequenced on the NovaSeq platform. SOAPnuke v1.5.3 (ref. ⁴¹) was used to filter the original data (filtration parameter: filter -n 0.01 -l 20 -q 0.4 -d -M 3 -A 0.3 -Q 2 -i -G --seqType 1) to obtain clean reads. Then, the clean data were compared with the genome using Juicer software⁴². The results were filtered, and misaligned reads were removed. The genome sequence was preliminarily clustered, sequenced, and directed using 3D-DNA⁴³. Juicer-box⁴² was again used to adjust, reset, and cluster the genome sequence. Finally, we evaluated genome integrity using BUSCO v3 software¹².

Identification of repetitive sequences

TEs contribute to genome dynamism in terms of both size and structure through insertions and eventual loss⁴⁴. Tandem Repeats Finder (http://tandem.bu.edu/trf/trf.html, v4.07) was used to predict tandem repeats across the genome⁴⁵. TEs were first identified using RepeatMasker v3.3.0 (http://www.repeatmasker.org) and RepeatProteinMask based on Repbase v21.12 (http://www.girinst.org/repbase)⁴⁶. Then, two de novo prediction software programs, RepeatModeler (http://www.repeatmasker.org/RepeatModeler/)⁴⁷ and LTR_FINDER v1.06 (http://tlife.fudan.edu.cn/ltr_finder/)⁴⁸, were used to identify TEs in the genomes. Finally, repeat sequences with identities ≥50% were grouped into the same classes.

Gene prediction and annotation

Homology-based, de novo, and transcriptome-based predictions were integrated to predict high-quality protein-coding genes. For homology-based prediction, homologous proteins from five available whole-genome sequences, namely, those of Arabidopsis thaliana, Linum usitatissimum, Populus trichocarpa, Ricinus communis, and Salix purpurea, were aligned to the K. obovata genome sequence using Exonerate v2.0 (https://www.ebi.ac.uk/Tools/psa/genewise/)⁴⁹. Gene structures were generated using GeneWise v2.4.1 (ref. ⁵⁰). Three ab initio prediction software programs, namely, Augustus v3.0.2 (http://bioinf.uni-greifswald.de/augustus/)⁵¹, Fgenesh (https://omictools.com/fgenesh-tool)⁵², and GlimmerHMM⁵³, were employed for de novo gene prediction. Then, the homology-based and ab initio gene structures were merged into a nonredundant gene model using Maker v2.31.8 (ref. ⁵⁴). TopHat v2.0.11 was used to map RNA-seq reads to the assembly⁵⁵, and Cufflinks v2.2.1 (ref. ⁵⁶) was applied to combine the mapping results for transcript structural predictions.

The protein sequences of the consensus gene set were aligned to seven protein databases, including GO (The Gene Ontology Consortium)⁵⁷, KEGG (http://www.genome.jp/kegg/)⁵⁸, InterPro (https://www.ebi.ac.uk/interpro/)⁵⁹, Swiss-Prot (http://www.uniprot.org)⁶⁰, and TrEMBL (http://www.uniprot.org/)⁶⁰, for predicted gene annotation. The rRNAs were identified by aligning the rRNA template sequences from the Rfam⁶¹ database against the genome using the BLASTN algorithm with an E-value cutoff of 1E–5. The tRNAs were predicted using tRNAscan-SE v1.3.1 (http://lowelab.ucsc.edu/tRNAscan-SE/)⁶², and other ncRNAs were predicted by Infernal software (http://infernal.janelia.org/) against the Rfam database.

Phylogenetic analysis

Genes from whole-genome sequences of ten species (K. obovata, Amborella trichopoda, Arabidopsis thaliana, Dimocarpus longan, Morus notabilis, Populus trichocarpa, Rhizophora apiculata, Ricinus communis, Vitis vinifera, and Oryza sativa) were used for gene-family clustering analysis. OrthoMCL v2.0.9 (ref. ⁶³) was used to identify orthologous groups among the ten species. Pairwise similarities between all protein sequences were calculated using BLASTP with an E-value cutoff of 1E–5. To obtain reliable single-copy orthologous groups, we filtered out single-copy orthologous groups containing proteins of length <200 bp. MUSCLE v3.8.31 (ref. ⁶⁴) was used to perform multisequence alignment of the protein sequences of the filtered single-copy orthologous group, and nucleotide alignment results were obtained by the corresponding relationship between protein sequences and nucleotide sequences. Finally, the nucleotide sequences of the single-copy orthologous group were connected to form a supergene, and then the data set was employed to construct a phylogenetic tree by using the GTR + gamma model in MrBayes⁶⁵.

Estimation of divergence time

The Markov chain Monte Carlo algorithm for Bayesian estimation was employed to infer the divergence time of each tree node using the MCMCTree module of PAML v4.7 (ref. ⁶⁶). The nucleic acid replacement model used was the GTR model, and the molecular clock model used was the independent rate model. The MCMC process included 100,000 burn-in iterations and 1,000,000 sampling iterations (with a sample taken every 100 iterations). To obtain a more stable result, the same parameter was executed twice. Calibration times were obtained from TimeTree (http://www.timetree.org).

Gene family expansion and contraction

We measured the expansion and contraction of orthologous gene families using CAFÉ 3 (https://github.com/hahnlab/CAFE)¹⁴. Based on maximum likelihood modeling of gene gain and loss, we analyzed gene families for signs of expansion or contraction using genomic data from the ten species.

Collinearity analysis

Within collinear segments, genes are conserved in function and sequence and remain highly conserved during the evolution of species. We used the default parameters of JCVI v0.9.14 (https://pypi.org/project/jcvi/)¹¹ to analyze the protein sequences of K. obovata, R. apiculata, and V. vinifera and obtained the gene pairs in collinear regions. Then, we used COGE (https://genomevolution.org/coge/) for online analysis, examined the relationship between Ks peaks and collinear regions, and verified the WGD event experienced by the common ancestor of K. obovata and R. apiculata.

Whole-genome duplication

We used Ks distribution analysis to infer WGD events of K. obovata and R. apiculata. Diamond v0.9.24 (ref. ⁶⁷) was used to conduct self-alignment of the protein sequences of the two species and then extract the mutual optimal alignment in the alignment results. Finally, Codeml in the PAML package was used to calculate the Ks values^39,68.

MADS-box analysis

The hidden Markov model (HMM) profile of the MADS-box gene family (PF00319) was obtained from Pfam (http://pfam.xfam.org). MADS-box gene family proteins were separately searched with HMMER 3.1 (with the default parameters)⁶⁹. InterProScan v 5.19 (ref. ⁷⁰) was used to identify MADS-box gene family candidates in the genomes of K. obovata and R. apiculata. The genomic data of R. apiculata were downloaded from http://evolution.sysu.edu.cn/Sequences.html. MADS-box gene candidates were further confirmed with the 60 amino acid domains available from SMART⁷¹ and online BLAST analysis (https://www.ncbi.nlm.nih.gov). Specifically, the protein sequence set for the MADS-box gene candidates was subjected to BLAST analysis against the assembled transcriptomes of the roots, stems, leaves, flowers, and fruits of K. obovata with the TBLASTN program. A phylogenetic tree was then constructed using MEGA5 (ref. ⁷²) with the default parameters.

Disease resistance genes

Predicted proteins from the K. obovata and R. apiculate genomes were scanned using HMMER v3.1 (E-value cut-off of 1 × 10⁻⁵)⁶⁹ using the HMM corresponding to the Pfam NLR protein family (NB-ARC: PF00931; TIR: PF01582; RPW8: PF05659; LRR: PF00560, PF07723, PF07725 and PF12799). To remove false-positive NB-ARC domain hits, InterProScan v5.19 was used to check the protein domains of the extracted sequences⁷⁰. The NBS domains of the genes confirmed by both HMMER and InterProScan were extracted according to InterProScan annotation and aligned using MAFFT v7.310 (ref. ⁶³); the alignment was then input into FastTree⁷³ with the JTT model and visualized using EvolView⁷⁴.

Data availability

Genome sequences have been submitted to the National Genomics Data Center (NGDC). PacBio whole-genome sequencing data and Illumina data have been deposited in BioProject/GSA (https://bigd.big.ac.cn/gsa.)⁷⁵ under accession codes PRJCA002330/CRA002395 and the whole-genome assembly and annotation data have been deposited in BioProject/GWH (https://bigd.big.ac.cn/gwh)⁷⁶ under accession codes PRJCA002330/GWHACBH00000000.

References

Kauffman, J. B. et al. Shrimp ponds lead to massive loss of soil carbon and greenhouse gas emissions in northeastern Brazilian mangroves. Ecol. Evol. 8, 5530–5540 (2018).
Article PubMed PubMed Central Google Scholar
Gilman, E. L., Ellison, J. C., Duke, N. C. & Field, C. D. Threats to mangroves from climate change and adaptation options: a review. Aquat. Bot. 89, 237–250 (2008).
Article Google Scholar
Nagelkerken, I. et al. The habitat function of mangroves for terrestrial and marine fauna: a review. Aquat. Bot. 89, 155–185 (2008).
Article Google Scholar
Walters, B. B. et al. Ethnobiology, socio-economics and management of mangrove forests: a review. Aquat. Bot. 89, 220–236 (2008).
Article Google Scholar
Wight et al. in Flora Reipublicae Popularis Sinicae (ed Delectis florae Reipublicae Popularis Sinicae agenda academiae sinicae) Vol. 52, 133–135 (Sciences Press, Beijing, 1983).
Qin, H. N. & David, E. B. in Flora of China (eds Wu, Z. Y., Peter, R.H. & Hong, D.) Vol. 13, 295–299 (Sciences Press, Beijing, 2009).
Sheue, C. R., Liu, H. Y. & Yong, J. W. H. Kandelia obovata (Rhizophoraceae), a new mangrove species from Asia. Taxon 52, 287–294 (2003).
Article Google Scholar
Giri, C. et al. Status and distribution of mangrove forests of the world using earth observation satellite data. Glob. Ecol. Biogeogr. 20, 154–159 (2011).
Article Google Scholar
Wardiatno, Y., Mardiansyah, Prartono, T. & Tsuchiya, M. Possible food sources of macrozoobenthos in the manko mangrove ecosystem, Okinawa (Japan): a stable isotope analysis approach. Trop. Life Sci. Res. 26, 53–65 (2015).
PubMed PubMed Central Google Scholar
Zhou, Q. et al. Characteristics and distribution of microplastics in the coastal mangrove sediments of China. Sci. Total Environ. 31, 134807 (2019).
Google Scholar
Rogers, A. & Mumby, P. J. Mangroves reduce the vulnerability of coral reef fisheries to habitat degradation. PLoS Biol. 17, e3000510 (2019).
Article CAS PubMed PubMed Central Google Scholar
Simao, F. A. et al. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article CAS PubMed Google Scholar
Xu, S. H. et al. The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing. Natl. Sci. Rev. 4, 721–734 (2017).
Article CAS PubMed Google Scholar
Han, M. V., Thomas, G. W. C., Lugo-Martinez, J. & Hahn, M. W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol. Biol. Evol. 30, 1987–1997 (2013).
Article CAS PubMed Google Scholar
McGrath, C. L. & Lynch, M. Evolutionary significance of whole-genome duplication. in Poly-ploidy and Genome Evolution (eds Soltis, P. S. & D. E., Soltis) 1–20 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2012).
Tang, H. B. et al. JCVI v0.9.14, https://pypi.org/project/jcvi/ (2014).
Wu, S. D., Han, B. C. & Jiao, Y. N. Genetic contribution of Paleopolyploidy to adaptive evolution in angiosperms. Mol. Plant 13, 59–71 (2019).
Article PubMed CAS Google Scholar
Zhang, L. et al. Genome-wide identification, characterization of the MADS-box gene family in Chinese jujube and their involvement in flower development. Sci. Rep. 7, 1025 (2017).
Article PubMed PubMed Central CAS Google Scholar
Masiero, S., Colombo, L., Grini, P. E., Schnittger, A. & Kater, M. M. The emerging importance of type I MADS box transcription factors for plant reproduction. Plant Cell 23, 865–872 (2011).
Article CAS PubMed PubMed Central Google Scholar
Henschel, K. et al. Two ancient classes of MIKC-type MADS-box genes are present in the moss physcomitrella patens. Mol. Biol. Evol. 19, 801–804 (2002).
Article CAS PubMed Google Scholar
Adamczyk, B. J. & Fernandez, D. E. MIKC* MADS domain heterodimers are required for pollen maturation and tube growth in Arabidopsis. Plant Physiol. 149, 1713–1723 (2009).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. Functional conservation of MIKC*-Type MADS box genes in Arabidopsis and rice pollen maturation. Plant Cell 25, 1288–1303 (2013).
Article CAS PubMed PubMed Central Google Scholar
Theissen, G. & Melzer, R. Molecular mechanisms underlying origin and diversification of the angiosperm flower. Ann. Bot. 100, 603–609 (2007).
Article PubMed PubMed Central Google Scholar
Li, C. et al. Genome-wide characterization of the MADS-box gene family in radish (Rahpanus sativus L.) and assessment of its roles in flowering and floral organogenesis. Front. Plant Sci. 7, 1390 (2016).
PubMed PubMed Central Google Scholar
Sheng, X. G. et al. Genome wide analysis of MADS-box gene family in Brassica oleracea reveals conservation and variation in flower development. BMC Plant Biol. 19, 106 (2019).
Article PubMed PubMed Central Google Scholar
Coen, E. S. & Meyerowita, E. M. The war of the whorls: genetic interactions controlling flower development. Nature 353, 31–37 (1991).
Article CAS PubMed Google Scholar
Zahn, L. M., Feng, B. & Ma, H. Beyond the ABC-model: regulation of floral homeotic genes. Adv. Bot. Res. 44, 163–207 (2006).
Article CAS Google Scholar
Silva, C. S. et al. Evolution of the plant reproduction master regulators LFY and the MADS transcription factors: the role of protein structure in the evolutionary development of the flower. Front. Plant Sci. 6, 1193 (2015).
PubMed Google Scholar
Ibarra-Laclette, E. et al. Architecture and evolution of a minute plant genome. Nature 498, 94–98 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zhang, H. & Forde, B. G. An Arabidopsis MADS box gene that controls nutrient-induced changes in root architecture. Science 279, 407–409 (1998).
Article CAS PubMed Google Scholar
Searle, I. et al. The transition factor FLC confers a flowering response to vernalization by repressing meristem competence and systemic signaling in Arabidopsis. Genes Dev. 20, 898–912 (2006).
Article CAS PubMed PubMed Central Google Scholar
Reeves, P. A. et al. Evolution conservation of the FLOWERING LOCUS C mediated vernalization response: evidence from the sugar beet (Bsta vulgaris). Genetics 176, 295–307 (2007).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. H. et al. Role of SVP in the control of flowering time by ambient temperature in Arabidopsis. Genes Dev. 21, 397–402 (2007).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. & Lee, I. Regulation and function of SOC1, a flowering pathway integrator. J. Exp. Bot. 61, 2247–2254 (2010).
Article CAS PubMed Google Scholar
Lozano, R., Hamblin, M. T., Prochnik, S. & Jannink, J. L. Identification and distribution of the NBS-LRR gene family in the Cassava genome. BMC Genomics 16, 360 (2015).
Article PubMed PubMed Central CAS Google Scholar
Xiang, L. X. et al. Genome-wide comparative analysis of NBS-encoding genes in four Gossypium species. BMC Genomics 18, 292 (2017).
Article PubMed PubMed Central CAS Google Scholar
Xiao, S. et al. The atypical resistance gene, RPW8, recruits components of basal defence for powdery mildew resistance in Arabidopsis. Plant J. 42, 95–110 (2005).
Article CAS PubMed Google Scholar
Chin, C. S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050–1054 (2016).
Article CAS PubMed PubMed Central Google Scholar
Blanc, G. & Wolfe, K. H. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 16, 1667–1678 (2004).
Article CAS PubMed PubMed Central Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
Article PubMed PubMed Central CAS Google Scholar
Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. GigaScience 7, 120 (2017).
Google Scholar
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Sys 3, 95–98 (2016).
Article CAS Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hawkins, J. S., Proulx, S. R., Rapp, R. A. & Wendel, J. F. Rapid DNA loss as a counterbalance to genome expansion through retrotransposon proliferation in plants. Proc. Natl Acad. Sci. USA 106, 17811–17816 (2009).
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem Repeats Finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
Article CAS PubMed PubMed Central Google Scholar
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, 351–358 (2005).
Article Google Scholar
Zhao, X. & Hao, W. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
Article CAS Google Scholar
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics 6, 31 (2005).
Article PubMed PubMed Central CAS Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and Genomewise. Genome Res. 14, 988–995 (2004).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M., Schoffmann, O., Morgenstern, B. & Waack, S. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics 7, 62 (2006).
Article PubMed PubMed Central CAS Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
Article CAS PubMed Google Scholar
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 12, 491 (2011).
Article PubMed PubMed Central Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq. experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Article CAS PubMed PubMed Central Google Scholar
Ogata, H. et al. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 27, 29–34 (1999).
Article CAS PubMed PubMed Central Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Article CAS PubMed PubMed Central Google Scholar
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL. Nucleic Acids Res. 31, 365–370 (2003).
Article CAS PubMed PubMed Central Google Scholar
Griffiths-Jones, S. et al. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33, D121–D124 (2005).
Article CAS PubMed Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Article CAS PubMed PubMed Central Google Scholar
Fischer, S. et al. in Current Protocols in Bioinformatics (eds Andreas, D. et al.) Vol. 6, Ch. 6 (Zhang, 2011).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000).
Article CAS PubMed Google Scholar
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article CAS PubMed Google Scholar
Benjamin, B., Chao, C. & Daniel, H. H. Fast and sensitive protein alignment using diamond. Nat. Methods 12, 59–60 (2015).
Article CAS Google Scholar
Wang, K. et al. The draft genome of a diploid cotton Gossypium raimondii. Nat. Genet. 44, 1098–1103 (2012).
Article CAS PubMed Google Scholar
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Article CAS PubMed PubMed Central Google Scholar
Finn, R. D. et al. InterPro in 2017—beyond protein family and domain annotations. Nucleic Acids Res. 45, D190–D199 (2017).
Article CAS PubMed Google Scholar
Letunic, I., Doerks, T. & Bork, P. SMART: recent updates, new developments and status in 2015. Nucleic Acids Res. 43, D257–D260 (2015).
Article CAS PubMed Google Scholar
Tamura, K. et al. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739 (2011).
Article CAS PubMed PubMed Central Google Scholar
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2-approximately maximum-likelihood trees for large alignments. PLoS ONE 10, e9490 (2010).
Article CAS Google Scholar
He, Z. et al. Evolviewv2: an online visualization and management tool for customized and annotated phylogenetic trees. Nucleic Acids Res. 44, W236–W241 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. GSA: Genome Sequence Archive. Genomics Proteomics Bioinformatics 15, 14–18 (2017).
Article PubMed PubMed Central Google Scholar
Zhang, Z. et al. Database resources of the BIG Data Center in 2019. Nucleic Acids Res. 47, D8–D14 (2019).
Article CAS Google Scholar

Download references

Acknowledgements

This research was jointly funded by the National Science Foundation of China (41801062), the Special Project for the Cultivation of Major Achievements in the Peak Discipline of Forestry Science (118/71201800709), the Special Subsidy for Leading Talents of Scientific and Technological Innovation in Fujian Province (118/KRC16006A), and the Fujian Forestry Science and Technology Research Project (Min[2019]6). The authors would like to thank Prof. Wenqing Wang (College of the Environment and Ecology, Xiamen University) and Dr. Jiafang Huang (School of Geographical Sciences, Fujian Normal University) for kindly providing the picture of K. obovata.

Author information

These authors contributed equally: Min-Jie Hu, Wei-Hong Sun

Authors and Affiliations

Key Laboratory of Humid Sub-tropical Eco-Geographical Processes of the Ministry of Education, Fujian Normal University, Fuzhou, 350007, China
Min-Jie Hu
Fujian Colleges and Universities Engineering Research Institute of Conservation and Utilization of Natural Bioresources, College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
Wei-Hong Sun, Shuang Xiang, De-Qiang Chen, Xue-Die Liu, Yi-Fan Wang, Yi-Xun Le, Si-Ming Chen & Shuang-Quan Zou
Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at the College of Landscape Architecture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
Wei-Hong Sun, Shuang Xiang, De-Qiang Chen, Di-Yang Zhang, Xia Yu, Wen-Qi Hu, Zhuang Zhou, Yan-Qiong Chen, Shuang-Quan Zou & Zhong-Jian Liu
Institute of Tropical Plant Sciences and Microbiology, National Cheng Kung University, Tainan, 701, China
Wen-Chieh Tsai
Administration of the Quanzhou Bay Estuary Wetland Nature Reserve, Quanzhou, 362000, China
Xing-Kai Lai
Ocean College, Minjiang University, Fuzhou, 350002, China
Si-Ming Chen
Henry Fok College of Biology and Agriculture, Shaoguan University, Shaoguan, 512005, China
Zhong-Jian Liu

Authors

Min-Jie Hu
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Hong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Chieh Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Shuang Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Xing-Kai Lai
View author publications
You can also search for this author in PubMed Google Scholar
De-Qiang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xue-Die Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Fan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Xun Le
View author publications
You can also search for this author in PubMed Google Scholar
Si-Ming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Di-Yang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xia Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Qi Hu
View author publications
You can also search for this author in PubMed Google Scholar
Zhuang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yan-Qiong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shuang-Quan Zou
View author publications
You can also search for this author in PubMed Google Scholar
Zhong-Jian Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Shuang-Quan Zou or Zhong-Jian Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Supplementary information

Chromosome-scale assembly of the Kandelia obovata genome

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hu, MJ., Sun, WH., Tsai, WC. et al. Chromosome-scale assembly of the Kandelia obovata genome. Hortic Res 7, 75 (2020). https://doi.org/10.1038/s41438-020-0300-x

Download citation

Received: 27 November 2019
Revised: 11 March 2020
Accepted: 16 March 2020
Published: 02 May 2020
DOI: https://doi.org/10.1038/s41438-020-0300-x

This article is cited by

SOS1 gene family in mangrove (Kandelia obovata): Genome-wide identification, characterization, and expression analyses under salt and copper stress
- Chenjing Shang
- Li Sihui
- Jackson Nkoh Nkoh
BMC Plant Biology (2024)
Expansion and adaptive evolution of the WRKY transcription factor family in Avicennia mangrove trees
- Xiao Feng
- Guohong Li
- Ziwen He
Marine Life Science & Technology (2023)
In silico analysis of NAC gene family in the mangrove plant Avicennia marina provides clues for adaptation to intertidal habitats
- Shiwei Song
- Dongna Ma
- Hai-Lei Zheng
Plant Molecular Biology (2023)
Description and genomic characterization of Gallaecimonas kandeliae sp. nov., isolated from the sediments of mangrove plant Kandelia obovate
- Meng Long
- Shaoshuai Tang
- Yishan Lu
Antonie van Leeuwenhoek (2023)
Catalytic innovation underlies independent recruitment of polyketide synthases in cocaine and hyoscyamine biosynthesis
- Tian Tian
- Yong-Jiang Wang
- Sheng-Xiong Huang
Nature Communications (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and discussion

Genome sequence and assembly

Gene prediction and annotation

Evolution of gene families

Synteny analysis and an ancient polyploidization event

MADS-box gene family analysis

Disease resistance-related genes

Conclusion

Materials and methods

DNA preparation and sequencing

Genome assembly

Hi-C library construction and assembly of the chromosome

Identification of repetitive sequences

Gene prediction and annotation

Phylogenetic analysis

Estimation of divergence time

Gene family expansion and contraction

Collinearity analysis

Whole-genome duplication

MADS-box analysis

Disease resistance genes

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links