Coriander Genomics Database: a genomic, transcriptomic, and metabolic database for coriander

Song, Xiaoming; Nie, Fulei; Chen, Wei; Ma, Xiao; Gong, Ke; Yang, Qihang; Wang, Jinpeng; Li, Nan; Sun, Pengchuan; Pei, Qiaoying; Yu, Tong; Hu, Jingjing; Li, Xinyu; Wu, Tong; Feng, Shuyan; Li, Xiu-Qing; Wang, Xiyin

doi:10.1038/s41438-020-0261-0

Download PDF

Article
Open access
Published: 01 April 2020

Coriander Genomics Database: a genomic, transcriptomic, and metabolic database for coriander

Xiaoming Song¹^na1,
Fulei Nie¹^na1,
Wei Chen^1,2^na1,
Xiao Ma³,
Ke Gong¹,
Qihang Yang¹,
Jinpeng Wang ORCID: orcid.org/0000-0002-2253-7752¹,
Nan Li¹,
Pengchuan Sun ORCID: orcid.org/0000-0002-8999-4894¹,
Qiaoying Pei¹,
Tong Yu¹,
Jingjing Hu¹,
Xinyu Li¹,
Tong Wu¹,
Shuyan Feng¹,
Xiu-Qing Li⁴ &
…
Xiyin Wang ORCID: orcid.org/0000-0003-3454-0374^1,2

Horticulture Research volume 7, Article number: 55 (2020) Cite this article

4599 Accesses
33 Citations
Metrics details

Subjects

Abstract

Coriander (Coriandrum sativum L.), also known as cilantro, is a globally important vegetable and spice crop. Its genome and that of carrot are models for studying the evolution of the Apiaceae family. Here, we developed the Coriander Genomics Database (CGDB, http://cgdb.bio2db.com/) to collect, store, and integrate the genomic, transcriptomic, metabolic, functional annotation, and repeat sequence data of coriander and carrot to serve as a central online platform for Apiaceae and other related plants. Using these data sets in the CGDB, we intriguingly found that seven transcription factor (TF) families showed significantly greater numbers of members in the coriander genome than in the carrot genome. The highest ratio of the numbers of MADS TFs between coriander and carrot reached 3.15, followed by those for tubby protein (TUB) and heat shock factors. As a demonstration of CGDB applications, we identified 17 TUB family genes and conducted systematic comparative and evolutionary analyses. RNA-seq data deposited in the CGDB also suggest dose compensation effects of gene expression in coriander. CGDB allows bulk downloading, significance searches, genome browser analyses, and BLAST searches for comparisons between coriander and other plants regarding genomics, gene families, gene collinearity, gene expression, and the metabolome. A detailed user manual and contact information are also available to provide support to the scientific research community and address scientific questions. CGDB will be continuously updated, and new data will be integrated for comparative and functional genomic analysis in Apiaceae and other related plants.

Chromosome-level genome assembly and annotation of Zicaitai (Brassica rapa var. purpuraria)

Article Open access 03 November 2023

Comparative transcriptomics provides a strategy for phylogenetic analysis and SSR marker development in Chaenomeles

Article Open access 12 August 2021

The clove (Syzygium aromaticum) genome provides insights into the eugenol biosynthesis pathway

Article Open access 09 July 2022

Introduction

Coriandrum sativum L. (2n = 2× = 22) is an Apiaceae species whose young plants before flowering are usually referred to as cilantro or Chinese parsley, whereas its flowering plants and seeds are generally referred to as coriander. The Apiaceae family exhibits more than 3700 species in 434 genera (https://en.wikipedia.org/wiki/Apiaceae), including several well-known crops such as carrot (Daucus carota) and celery (Apium graveolens). Coriander is a globally important vegetable crop. Its production tripled from 1994 to 2016, and Asia accounts for 71.4% of its total production worldwide (http://faostat3.fao.org/).

Coriander exhibits high nutrient levels, and is rich in vitamin C and carotene¹. Coriander leaves and stems are edible vegetables, and its seeds can be used as a spice. In addition, coriander contains several medicinal ingredients and presents important medicinal value. Interestingly, Coriander contains many volatile oils that are responsible for its special aroma, which arouses an unpleasant reaction in 4–14% of people (https://en.wikipedia.org/wiki/Coriander). Scientists have found that most people who dislike coriander share a common olfactory receptor gene, OR6A2, whose product absorbs the odor of aldehyde chemicals².

With the rapid development of sequencing technologies, an increasing number of genomes are being sequenced and released^3,4. Therefore, many plant genome databases have been constructed to allow researchers to search and download omics data sets. Although these statistics are incomplete, there are more than 40 available genome databases for single plants or families such as Arabidopsis (http://www.arabidopsis.org/index.jsp), rice (http://rice.plantbiology.msu.edu), maize (https://www.maizegdb.org)⁵, cotton⁶, carrot⁷, Chinese cabbage⁸, pineapple⁹, and strawberry¹⁰ (Table 1). In comparison with other families, few genome sequences have been released for the Apiaceae family, and only the carrot genome has been sequenced thus far¹¹. However, there is no such database available for coriander, which limits the ability of researchers to obtain genomic and other types of omics data for coriander. Although a carrot genome database has been built, the genome size, gene number, and number of repeat sequences are significantly greater in coriander than in carrot. Therefore, it is necessary to build a genome data analysis platform for coriander together with carrot genome data.

Table 1 Summary of main genome databases for genome-sequenced plants

Full size table

Here, we report the construction of the Coriander Genomics Database (CGDB), which contains a high-quality genome assembly of coriander obtained with the most recent sequencing technologies and novel bioinformatics methods. Moreover, this database contains RNA-seq and metabolic data sets from four tissues and three periods for coriander and carrot. The CGDB also provides a large amount of comparative genomic analysis data, such as information on collinear blocks, transcription factors, repeat sequences, and gene annotations. All users can readily access the data using the browser and query a variety of data types from the CGDB, including genomic, annotation, TF, RNA-seq, and metabolic data sets. We hope that CGDB will become an important platform for the plant research community to conduct comparative and functional genomic analyses of coriander and other related plants.

Materials and methods

Collection of genome, expression, and metabolic data sets

The genomic, expression, and metabolic data sets included in our CGDB database were obtained from standard experiments and bioinformatics analyses. The genomic data sets mainly contained genome sequence, gene sequence, protein sequence, gff, gene annotation, and repeat sequence data. The expression data sets mainly contained gene expression data for coriander and carrot from three different growth stages and four tissues. The metabolic data sets included information on metabolism genes from coriander and carrot from three different growth stages. All the analysis results for these data sets are easily navigable and available in the CGDB.

Transcription factor family identification

Transcription factors (TFs) play very important roles in regulating the expression of genes related to plant growth and development and various abiotic stresses. Here, we used the Pfam_Scan.pl program implemented at the Pfam (http://pfam.sanger.ac.uk) database (e-value < 1e-4) to predict TFs in coriander and carrot whole genomes. Then, we conducted the TF classification according to the Pfam results by using Perl scripts.

TUB gene family identification

The grape (Genoscope 12×) and carrot (v2.0) genome data used in this study were downloaded from the Phytozome website (https://phytozome.jgi.doe.gov/pz/portal.html)^11,12. Coriander (v1.0) genomics data were derived from our CGDB database. The Pfam database (http://pfam.sanger.ac.uk) was used to perform a domain search on the amino acid sequences, and the genes containing a “TUB” domain (PF01167) were extracted by using a Perl program. ClustalW was used to perform multiple alignments of the sequences of the tubby (TUB) gene family (https://www.genome.jp/tools-bin/clustalw). PhyML 3.0 was employed to construct ML trees with the Jones, Taylor, and Thorton (JTT) model and 1000 nonparametric bootstrap replicates¹³. The pheatmap package of R was used to draw expression clusters (https://cran.r-project.org/web/packages/pheatmap/index.html).

Genome collinearity detection

To clarify the evolutionary relationships and whole-genome duplication (WGD) or triplication (WGT) events between coriander and five other representative plant species (carrot, lettuce, tomato, grape, and Arabidopsis), we performed a whole-genome comparative analyses. First, the whole-genome protein sequences from all the species were searched against their own sequences using BLASTP with an E-value cutoff of 1 × 10⁻⁵. Then, MCScanX software (-k 50, -s 5, -m 25) was used to detect the duplicate types and collinearity blocks based on a previous report¹⁴. Finally, the collinear regions between any two of these plant species were visualized using the Circos and TBtools software packages.

Database architecture and implementation

The CGBD was implemented by applying a variety of common software packages in a Windows system, including internet information services (IIS), MySQL database management, ASP.NET, and HTML. The data were processed and analyzed by using C#, HTML, and JavaScript, and bioinformatics tools were employed for interpreting biological significance. The CGBD consists of relational databases storing processed data in MySQL. An interactive Web interface was constructed to enable users to conveniently access the CGBD and obtain information needed for either basic research applications or biological analysis through any modern browser on their devices. C#, HTML, and JavaScript were implemented to transmit user query information and rapidly extracted data from the MySQL database management system to generate report pages (Fig. 1). The genome browser (JBrowse) was constructed to visualize the genomic data and gene structure¹⁵. For the interactive alignment of genome sequences, BLAST searches were performed by using BLAST-2.7.1 + , an independent web server for flexible queries of similar nucleotide and amino acid sequences.

Database construction

Main interface

The CGDB structure consists of seven main modules: Home, Browse, Search, Tools, Download, Contact, and Help (Figs. 2 and 3a). In the browse module, we provide the transcription factor data for coriander and carrot, information on the main Apiaceae germplasm resources, and links to the major public genome data resources (Fig. 3c). In the search module, the CGDB allows the users to search the gene sequences, protein sequences, and annotations of the coriander genome by gene names (Fig. 3b). More importantly, users can search the novel transcriptome and metabolome data of coriander and carrot in our database (Fig. 3d, e). Users can obtain gene FPKM values from different tissues and development periods by inputting a gene name from coriander or carrot. Furthermore, the users can search the molecular weights and contents of metabolites determined in coriander plants after 30, 60, and 90 days of growth and in carrot by using the metabolite formula.

**Fig. 3: The main functions of the browse and search module.**

Furthermore, we provide the syntenic genes and figures illustrating synteny among coriander and five other representative plants (Fig. 4). The online access tool provides the BLAST tool to users, which can search similar sequences on the basis of nucleotide or amino acid sequences. Furthermore, we built a genome browser to interactively display coriander genomic data. In addition, the help module provides data statistics and FAQs to help users to quickly and conveniently use this database. In summary, the CGDB will benefit both comparative genomic and functional genomic studies in plants, especially for coriander and other Apiaceae species.

**Fig. 4: Gene synteny search function of CGDB.**

Genome sequences and annotation resources

Recently, we sequenced and assembled the coriander genome using PacBio, Illumina, 10X Genomics, and HiC sequencing technology. The assembled genome was 2118.31 Mb, accounting for 99.44% of the estimated genome. The contig N50 length was 604.13 Kb, and the scaffold N50 reached 160.99 Mb. Therefore, we obtained a high-quality assembled coriander genome. To facilitate the convenient use of these data by other researchers, we constructed the CGDB database, and all of the genome and related analytical data sets can be easily downloaded and retrieved from the CGDB by all users.

The CGDB contains not only genome sequences but also 40,747 gene and protein sequences for searching and downloading. Among these genes, 37,772 (92.7%) coriander genes annotated at four databases (Swiss-Prot, TrEMBL, KEGG, and InterPro) were also provided on our CGDB website. In addition, this database includes the annotation of specific repeat sequences, which accounted for 70.59% of the coriander genome. A total of 7233 noncoding RNAs, including 339 miRNAs, 780 tRNAs, 5440 rRNAs, and 674 snRNAs, can also retrieved by a researcher studying noncoding RNAs.

Expression and metabolic data resources

In addition to the genome data, the CGDB provided the expression and metabolic data for functional genomic studies. The expression data sets were obtained from three different stages and four tissues of coriander by using RNA-Seq. The different growth stages were 30, 60, and 90 days after sowing, and the four tissues were the roots, stems, leaves, and flowers of coriander. Each sample was included three biological replications. In total, 39,225 (90.74%) genes were shown to be actively expressed in at least one tissue, whereas 4005 genes exhibited no expression in the four examined tissues of coriander. A total of 35,759 (83.93%) genes were detected in the transcriptome in at least one stage, while 6848 genes presented no expression among the three developmental stages in coriander. In addition, the expression data for three different growth stages of carrots were included in our database for comparative analyses. In total, 28,667 (82.10%) genes were detected in the transcriptome of at least one growth stage, and 6251 genes exhibited no expression among the three developmental stages of carrots.

Metabolomics is a relatively new discipline and an emerging omics technology that emerged after genomics, metabolomics, and transcriptomics, and has become an important component of systems biology research. Our database integrated the metabolic data sets of coriander and carrots from three different growth stages, including 30, 60, and 90 days after sowing. All of these expression and metabolic data sets are easily navigable and available in the CGDB for all users.

Transcription factor family data

Transcription factors (TFs) regulate the transcription of downstream genes by binding to their specific DNA sequences. For user convenience, 2908 and 2330 genes were classified into 63 and 61 TF families from the whole-genome sequences of coriander and carrot, respectively, and all of them can be searched in the browse module of the CGDB (Fig. 5a). Two families, EIL and NZZ/SPL, presented six and one genes, respectively, in coriander, while no such genes were detected in the carrot genome. The MYB gene family was the largest among all these predicted TF families, followed by the AP2/ERF and NBS gene families. There were 335/269, 228/202, and 189/148 genes in the coriander/carrot genome for these three large gene families, respectively.

**Fig. 5: Transcription factor (TF) analyses of coriander and carrot genomes.**

Furthermore, we compared the number of TFs for each gene family between the coriander and carrot genomes. The total gene number in coriander (40,747) is ~1.27 times that (32,118) in the carrot genome. Given the difference in the total number of genes in the two species, we first divided the number of TFs in coriander by 1.27 and then calculated the difference in TFs between the two species. The results showed that seven gene families were larger (ratio > 1.5) in coriander than in carrot (Fig. 5b, Table 2). The largest family was the MADS TFs, showing a ratio greater than 3.15, followed by TUB and HSF TFs. These results indicated that these gene families have expanded and might play important roles in coriander development. In addition, five gene families showed lower expression (ratio <0.7) in the coriander genome than in the carrot genome (Fig. 5b, Table 2). All the TFs of coriander and carrot were added to our databases, and TFs from other Apiaceae species will also be provided in the future. All of these DNA/protein sequences and their domain information can be easily searched in the browse module of the CGDB. Therefore, the CGDB provides rich resources for users to conduct gene family analyses. Here, we take the TUB gene family as an example to conduct evolutionary and expression analyses (Fig. 6).

Table 2 The number of significantly different transcription factors (TFs) and the ratio of their numbers between coriander and carrot

Full size table

Example of CGDB application: the TUB gene family

A total of 17 TUB transcription factor family members were identified in the whole genome of coriander via the browse module of the CGDB. To investigate the evolutionary relationships of this gene family, we identified 6 and 13 TUB family genes from carrot and grape, respectively (Fig. 6b).

Most genome-wide duplication events, including WGD and WGT events, a accompanied by gene loss or retention^16,17. To elucidate the evolution of the TUB gene family in coriander, we performed gene loss and replication retention analyses. Comparison with grape showed that two WGD events occurred in coriander and carrot after their divergence from grape¹¹. Here, 13 TUB family genes were identified in grape. In theory, there should be 52 TUB family genes in coriander and carrot (13 × 2 × 2), but only 17 and 6 TUB family genes were identified in coriander and carrot, respectively. Thus, these two WGD events did not result in the expansion of the TUB gene family, and significant gene loss occurred in the coriander and carrot genomes.

We detected the quantitative changes in the number of TUB family genes based on the phylogenetic reconstruction obtained for coriander, carrot, and grape. We divided the tree into nine groups (A to F) according to the grape sequence (Fig. 6b). In theory, one grape gene should correspond to four genes of coriander and carrot. However, we found that most groups exhibited gene loss, which ranged from 1 to 4 genes, except in group A of the coriander genome. In carrots, there was more gene loss among all groups, with the number of lost genes ranging from 3 to 4. In particular, there were no genes in the coriander and carrot genomes corresponding to the five grape genes in group F. These results indicated that the loss of TUB family genes occurred in both the coriander and, especially, the carrot genomes.

We also conducted expression analyses of these TUB families using the RNA-seq data deposited at our CGDB website (Fig. 6c, Table 3). Interestingly, we found that the number of TUB family genes was lower in carrot than in coriander in most groups, but their expression was higher in carrots than in coriander, as observed for groups A, E, H, and I. These findings suggest that there may be a dose compensation effect of gene expression in coriander. However, the carrot TUB family gene from group D, DCAR 001134, did not show expression in any of the three developmental stages, while two coriander genes from the same group presented a higher expression level than that in coriander.

Table 3 TUB gene expression (FPKM) in coriander and carrot at 30, 60, and 90 days after sowing

Full size table

In conclusion, all of these gene family and expression data sets can be searched and downloaded from the CGDB database. Therefore, similar comparative and functional genomic analyses can be conducted by numerous scientific and agricultural researchers or breeders.

Integration of comparative genomics data

Coriander, carrot, and celery are Apiaceae species. The carrot genome and a related database have been reported, and its chromosome number (2n = 18) is different from that of coriander (2n = 22). Therefore, our database provides rich resources for the comparative genomic analysis of Apiaceae plants. From carrot and coriander genome analyses, we found that two WGD events are shared by Apiaceae species (Fig. 6a). The findings from these genomes and the observed duplication events can led us to conduct genome analyses of evolution, polyploidy, and comparative genomics, examining characteristics such as gene collinearity and gene family expansion and contraction.

Functionality for comparative genomic analyses is available in the CGDB database for users who want to conduct homology comparisons and evaluate systematic evolution and duplication events between coriander and other species. The CGDB provides collinear gene pairs and figures illustrating collinearity between any pair of the species C. sativum, A. thaliana, D. carota, L. sativa, S. lycopersicum, and V. vinifera. The collinear regions between the coriander genome and these species are provided by the collinear region search function, and users can query detailed information about collinear genes in the search module of the CGDB website.

BLAST server and genome browser

The BLAST tool was embedded in the CGDB database using the BLAST-2.7.1+ program to help users perform sequence alignment¹⁸. We provide a user-friendly graphic interface based on Web forms. The BLAST database contained the whole-genome sequences, gene sequences, and protein sequences of coriander and nine other related species, including carrot, lettuce, Arabidopsis, rice, tomato, potato, grape, kiwifruit, and Amborella. All users can perform similarity searches of genes against each type of sequence by using various BLAST search programs, such as BLASTn, BLASTp, BLASTx, tBLASTn, and tBLASTx. Amino acid or nucleotide acid sequences in FASTA format can be directly submitted by copying the data to the frame or uploading a FASTA file. Some parameters, such as the e-value, score, and output format, can be modified or simply set to the default parameter values by the users according to the research aims before performing a BLAST search. Finally, clicking the search icon provides the link to the results interface for users. The query sequences along with their position in the whole-genome sequences of coriander are produced and ordered according to the expected values, and the user can browse or download the BLAST results.

A genome browser was developed to display the coriander genomic data set and the features and structure of coriander genes. In addition, users can query the genomic sequences of each chromosome, which enables users to enlarge and view relevant information for specific genes, such as gene names, types, positions, lengths, and sequences.

Data download

The download page was provided to allow users to download entire data sets, including the whole coriander and carrot genomic sequences, gene sequences, protein sequences in FASTA format, and gene structure according to gff results. The gene annotation data set of coriander contains gene functional descriptions from several protein databases, including InterPro, Swiss-Prot, NR, and KEGG, which is provided for users on the download page and search page. Coriander transcriptome data from four tissues (roots, stems, leaves, and flowers) and three different developmental stages of leaves are available to provide expression information for the related genes. In addition, carrot transcriptome data from the three different developmental stages of leaves are available to provide expression information on the related genes. We also provide coriander and carrot metabolome data from the three different developmental stages of leaves.

Help and contact

In the help module, statistical information and the answers to frequently asked questions (FAQs) are provided for users. The statistical data are provided for three main types of data: genomic data, RNA-seq data, and metabolic data. The genomics data mainly included genome survey data, the data output, Hi-C statistics, chromosome lengths, sequence classifications, and gene annotations. The RNA-seq data included a summary of the data for four tissues in coriander and three different periods in both coriander and carrot. The metabolic data included the formula, molecular weight, and contents of metabolites at the developmental stages of 30, 60, and 90 days after the sowing of coriander and carrot.

In addition, we provide a detailed FAQ manual, which includes the ten main questions that might arise among the users, such as how to download and cite the CGDB. All the main frequent questions and answers can be found in this document. We also provide information such as our contact e-mail, mobile phone numbers, and addresses in the contact module to help users to contact us conveniently and quickly.

Limitations and future development

Due to the lack of phenotypic trait information for coriander samples, few studies and reports on trait-related markers have been reported. Based on the existing genomics data, a large number of phenotype-related molecular markers, such as SSRs and single-nucleotide polymorphisms (SNPs), will be added to this database in the near future.

Coriander is a dicot and belongs to the Apiaceae plants phylogenetically; the coriander genome is the best genome reported to date for studying the gene family evolution in Apiaceae combined with the carrot genome. The CGDB database will be updated with new data and information in a timely manner. With increasing research on coriander, large quantities of data on transcriptome, metabolome, and proteome sequences and phenomics data will emerge in in the future. We will continuously collect these omics data sets and store them in the CGDB to allow users to conduct further comparative genomic and functional analyses.

Due to the rapid development of software and bioinformatics methods, plus the variety of omics data sets for genes, the structure, and function of the current coriander annotation will be improved in the future. In addition, we will identify molecular markers such as SSRs and SNPs in various genotypes of coriander, which will allow users to conveniently conduct genome-wide association studies (GWAS) and molecular marker-assisted selection (MAS) for coriander. Moreover, we welcome and encourage all users to send us feedback for the further improvement of this database. We believe that the CGDB will be a useful and user-friendly database for coriander researchers and breeders.

Conclusion

We developed the Coriander Genomics Database (CGDB), which includes a large amount of genomic data and other omics data sets, for research on the coriander genome, transcriptome, and metabolome. The popular, powerful BLAST search tool was implemented, which allows users to search for their target genes in coriander and several other related species in this database. The genome browser can be easily used to search the detailed information of each gene. This site is intended to be the major database for coriander research. The intuitive browser of the CGDB enables searches, visualization, downloading, and the observation of cross-species collinearity, providing the most recent, unrestricted access to various omics data sets among users. We integrated all of these resources into a portal and provided useful coriander and comparative genomic resources. This database will help to establish an active global community of coriander researchers and facilitate comparative genomic evolution studies in Apiaceae.

Data availability

The genome assembly and annotation of coriander are available in the CGDB database. The CGDB can be freely accessed at http://cgdb.bio2db.com/ via the internet. All data in this database are freely available for academic research purposes. Newly released data for coriander will be updated at this website. All materials and related data from this study are available upon reasonable request. Enquiries concerning the CGDB database can be directed by email to songxiaoming116@163.com.

References

Prachayasittikul, V., Prachayasittikul, S., Ruchirawat, S. & Prachayasittikul, V. Coriander (Coriandrum sativum): a promising functional food toward the well-being. Food Res. Int. 105, 305–323 (2018).
Article CAS Google Scholar
Eriksson, N. et al. A genetic variant near olfactory receptor genes influences cilantro preference. Flavour 1, 22 (2012).
Article Google Scholar
Chen, F. et al. Genome sequences of horticultural plants: past, present, and future. Hortic. Res. 6, 112 (2019).
Article Google Scholar
Chen, F. et al. The sequenced angiosperm genomes and genome databases. Front. Plant Sci. 9, 418 (2018).
Zhu, G. et al. PPIM: a protein-protein interaction database for maize. Plant Physiol. 170, 618–626 (2016).
Article CAS Google Scholar
Yu, J. et al. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res. 42, D1229–D1236 (2014).
Article CAS Google Scholar
Xu, Z. S., Tan, H. W., Wang, F., Hou, X. L. & Xiong, A. S. CarrotDB: a genomic and transcriptomic database for carrot. Database 2014, 1–8 (2014).
Article Google Scholar
Cheng, F. et al. BRAD, the genetics and genomics database for Brassica plants. BMC Plant Biol. 11, 136 (2011).
Article CAS Google Scholar
Xu, H. et al. PGD: pineapple genomics database. Hortic. Res. 5, 66 (2018).
Article Google Scholar
Hirakawa, H. et al. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species. DNA Res. 21, 169–181 (2014).
Article CAS Google Scholar
Iorizzo, M. et al. A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution. Nat. Genet. 48, 657–666 (2016).
Article CAS Google Scholar
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
Article CAS Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS Google Scholar
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).
Article CAS Google Scholar
Buels, R. et al. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol. 17, 66 (2016).
Article Google Scholar
Mun, J. H. et al. Genome-wide comparative analysis of the Brassica rapa gene space reveals genome shrinkage and differential loss of duplicated genes after whole genome triplication. Genome Biol. 10, R111 (2009).
Article Google Scholar
De Bodt, S., Maere, S. & Van de Peer, Y. Genome duplication and the origin of angiosperms. Trends Ecol. Evol. 20, 591–597 (2005).
Article Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinforma. 10, 421 (2009).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (31801856 to X.S.), the Hebei Province Higher Education Youth Talents Program (BJ2018016 to X.S.), China-Hebei 100 Scholars Supporting Project (E2013100003 to X.W.), and the Natural Science Foundation of Hebei (C2017209103 to X.S.).

Author information

The authors contributed equally: Xiaoming Song, Fulei Nie, Wei Chen

Authors and Affiliations

Center for Genomics and Biocomputing/College of Life Sciences, North China University of Science and Technology, Tangshan, Hebei, 063210, China
Xiaoming Song, Fulei Nie, Wei Chen, Ke Gong, Qihang Yang, Jinpeng Wang, Nan Li, Pengchuan Sun, Qiaoying Pei, Tong Yu, Jingjing Hu, Xinyu Li, Tong Wu, Shuyan Feng & Xiyin Wang
School of Genomics and Bio-Big-Data, Chengdu University of Traditional Chinese Medicine, Chengdu, 610075, China
Wei Chen & Xiyin Wang
Library, North China University of Science and Technology, Tangshan, Hebei, 063210, China
Xiao Ma
Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, New Brunswick, E3B 4Z7, Canada
Xiu-Qing Li

Authors

Xiaoming Song
View author publications
You can also search for this author in PubMed Google Scholar
Fulei Nie
View author publications
You can also search for this author in PubMed Google Scholar
Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Ma
View author publications
You can also search for this author in PubMed Google Scholar
Ke Gong
View author publications
You can also search for this author in PubMed Google Scholar
Qihang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jinpeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Nan Li
View author publications
You can also search for this author in PubMed Google Scholar
Pengchuan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Qiaoying Pei
View author publications
You can also search for this author in PubMed Google Scholar
Tong Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jingjing Hu
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Li
View author publications
You can also search for this author in PubMed Google Scholar
Tong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Shuyan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xiu-Qing Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiyin Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.S. and X.W. conceived the project and were responsible for the project initiation. X.S., F.N., and W.C. supervised and managed the project and research. Bioinformatics analyses were led by X. S., K.G., J.W., N.L., T.W., and P.S. Database construction was led by F.N., X.S., Q.Y., Q.P., X.M., and T.Y. The paper was organized, written, and revised by X.S., X.W., X.L., J.H., X.L., T.W., and S.F. All authors read and revised the paper.

Corresponding author

Correspondence to Xiyin Wang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Song, X., Nie, F., Chen, W. et al. Coriander Genomics Database: a genomic, transcriptomic, and metabolic database for coriander. Hortic Res 7, 55 (2020). https://doi.org/10.1038/s41438-020-0261-0

Download citation

Received: 14 November 2019
Revised: 17 January 2020
Accepted: 26 January 2020
Published: 01 April 2020
DOI: https://doi.org/10.1038/s41438-020-0261-0

This article is cited by

Improving coriander yield and quality with a beneficial bacterium
- Xiaoxuan Wu
- Yu Yang
- Huiming Zhang
Molecular Horticulture (2024)
Transcriptomics, proteomics, and metabolomics interventions prompt crop improvement against metal(loid) toxicity
- Ali Raza
- Hajar Salehi
- Rajeev K. Varshney
Plant Cell Reports (2024)
Flower-like meristem conditions and spatial constraints shape architecture of floral pseudanthia in Apioideae
- Jakub Baczyński
- Ferhat Celep
- Regine Claßen-Bockhoff
EvoDevo (2022)
Agri-biotechnology of coriander (Coriandrum sativum L.): an inclusive appraisal
- Saikat Gantait
- Amit Baran Sharangi
- Narottam Kumar Meena
Applied Microbiology and Biotechnology (2022)
Development and characterization of genic SSR-FDM for stem gall disease resistance in coriander (Coriandrum sativum L.) and its cross species transferability
- Sharda Choudhary
- Mahantesha B. N. Naika
- R. D. Meena
Molecular Biology Reports (2021)

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Collection of genome, expression, and metabolic data sets

Transcription factor family identification

TUB gene family identification

Genome collinearity detection

Database architecture and implementation

Database construction

Main interface

Genome sequences and annotation resources

Expression and metabolic data resources

Transcription factor family data

Example of CGDB application: the TUB gene family

Integration of comparative genomics data

BLAST server and genome browser

Data download

Help and contact

Limitations and future development

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links