PGD: Pineapple Genomics Database

Xu, Huimin; Yu, Qingyi; Shi, Yan; Hua, Xiuting; Tang, Haibao; Yang, Long; Ming, Ray; Zhang, Jisen

doi:10.1038/s41438-018-0078-2

Download PDF

Article
Open access
Published: 17 September 2018

PGD: Pineapple Genomics Database

Huimin Xu^1,2,
Qingyi Yu ORCID: orcid.org/0000-0001-5393-5764³,
Yan Shi¹,
Xiuting Hua¹,
Haibao Tang ORCID: orcid.org/0000-0002-3460-8570¹,
Long Yang⁴,
Ray Ming^1,5 &
…
Jisen Zhang¹

Horticulture Research volume 5, Article number: 66 (2018) Cite this article

3862 Accesses
27 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Pineapple occupies an important phylogenetic position as its reference genome is a model for studying the evolution the Bromeliaceae family and the crassulacean acid metabolism (CAM) photosynthesis. Here, we developed a pineapple genomics database (PGD, http://pineapple.angiosperms.org/pineapple/html/index.html) as a central online platform for storing and integrating genomic, transcriptomic, function annotation and genetic marker data for pineapple (Ananas comosus (L.) Merr.). The PGD currently hosts significant search tools and available datasets for researchers to study comparative genomics, gene expression, gene co-expression molecular marker, and gene annotation of A. comosus (L). PGD also performed a series of additional pages for a genomic browser that visualizes genomic data interactively, bulk data download, a detailed user manual, and data integration information. PGD was developed with the capacity to integrate future data resources, and will be used as a long-term and open access database to facilitate the study of the biology, distribution, and the evolution of pineapple and the relative plant species. An email-based helpdesk is also available to offer support with the website and requests of specific datasets from the research community.

Kiwifruit Genome Database (KGD): a comprehensive resource for kiwifruit genomics

Article Open access 01 August 2020

Coriander Genomics Database: a genomic, transcriptomic, and metabolic database for coriander

Article Open access 01 April 2020

Reference transcriptomes and comparative analyses of six species in the threatened rosewood genus Dalbergia

Article Open access 20 October 2020

Introduction

Pineapple (Ananas comosus (L.)) is an important tropical fruit displaying crassulacean acid metabolism (CAM) with high water-use efficiency. Due to its pleasant taste and desirable nutritional characteristics, pineapple has a great economic value and industrial usage, as well as medicinal properties. Genetically, pineapple originated when the Bromeliaceae family separated from Poaceae early in the history of Poales offering an evolutionary reference for comparative genomics analysis of cereal genomes. In comparison with Poaceae such as rice (Oryza sativa), maize (Zea mays), sorghum (Sorghum bicolor)^1,2, the genome sequence of pineapple were released much later, when the challenges of its highly heterozygous genome were recently solved and a reference genome assembled by Ming et al.³ The genome assembly of pineapple was published, the samples used to assemble genome were from variety F153. Because of the pivotal phylogenetic position of pineapple at the base of the Poales³, making its much smaller genome convenient for evolutionary studies. At present, many sequenced plant genomes have a genomics database for researchers to manage the biological information, such as TAIR (http://www.arabidopsis.org) for arabidopsis, MaizeGDB (http://www.maizeGDB.org) for maize, and RGAP (http://rice.plantbiology.msu.edu) for rice, but the current absence of such a database for pineapple, limits the ability of researchers in obtaining genomic data for pineapple.

Previously, a pineapple EST database was only developed with the bioinformatics resource hosting the fruit, root, and nematode-infected gall-expressed sequences⁴. In addition to the general plant databases, there are no genomic databases specifically designed for pineapple. However, with the advances of genomic sequencing technologies and the newly available assembled pineapple genome by Ming et al.³, an integrated genomics database storing gene information resources of pineapple is essential for the research community to explore the molecular biology of pineapple and the evolution of pineapple and also for Bromeliaceae studies. In this study, these gene sets were included in the pineapple genome database that we constructed. Here, we constructed an integrated functional genomics database for pineapple named PGD (http://pineapple.angiosperms.org/pineapple/html/index.html), in which users can readily access data using the browser and query a variety of data types from PGD including genetics, genomics, functional annotations, RNA-seq expression dataset, and molecular marker information. In addition, PGD also includes several online visualization tools such as JBrowse and ViroBlast that make it accessible from any device. The integrated gene search, dataset download function, genetic marker database, and help manual, will be improved constantly, and we hope that PGD will become a fundamental comprehensive genomics database for pineapple functional genomics research.

Construction and content

Data sources and processing

Genome assemblies and gene annotations

The pineapple reference genome was sequenced and assembled by Ming et al.³ using three main approaches, whole-genome shotgun sequenced with Illimina, 454 (Roche), PacBio-sequencing technologies, and BAC pools sequenced with Illumina sequencing³. The assembled genome included 8986 contigs with N50 of 126.5 kb and 3,133 scaffold with N50 of 11.8 Mb, respectively, accounting for 71.3% and 72.6% of assembled genome³ (Table 1). The transcriptome was assembled by de novo Trinity⁵ and reference-guided Trinity was constructed by PASA⁶ with the nearly full-length pineapple transcript, which was identified using BLASTP. Subsequently each transcript were trained by SNAP⁷, GENEMARK⁸, and AUGUSTUS⁹. This result was combined with gene annotation produced by MAKER, which could be considered as messenger RNA evidence³. A total of 27,024 gene models were obtained by MAKER annotation without redundancy, which include 24,063 complete gene models and 2,961 classified as partial.

Table 1 Summary of genome assembly of pineapple variety “F153” in PGD

Full size table

Gene function annotation

Based on three main protein databases, SWISS-PROT, TrEMBL, and TAIR10, the protein descriptions were defined using AHRD with optimum parameter³. For functional annotation, conserved domain regions and interrelated gene information, Gene Ontology (GO)¹⁰, and InterPro¹¹ domain of the predicted pineapple proteins were annotated using InterProScan¹² with default parameters. In addition, the relevant Kyoto Encyclopedia of Genes and Genomes (KEGG)¹³ pathway-associated pineapple protein was annotated using KOBAS¹⁴. The above data were publicly available and are now accessible on PGD for users (Table 2).

Table 2 Summary of gene annotation of pineapple variety “F153” in PGD

Full size table

Genetic marker annotation: SSR, SNP, and IP markers

To develop the resources of simple sequence repeats (SSRs) of the pineapple genome, three main procedures in this execution make it possible to identify more SSRs markers. The first step is the fact that sequences of SSR loci were extracted from the pineapple genome using the customer/user Perl script. Secondly, SSR primers were designed based on 60 bp each side of target loci on the coding sequence (CDS) and genomic sequence by Primer3¹⁵. These primers were ultimately tested by e-PCR¹⁶ and the optimal results of CDS-SSR and genomic-SSR markers were selected as the reference resource to deposit in the PGD. As a result, a total of 4,629 CDS-SSR and 46,860 genomic-SSR markers were identified and made available in pineapple genome database with detailed information for the both types for users.

Regarding single-nucleotide polymorphism (SNP), a total of 89 genome resequencing Ananas accessions were collected, and paired-end resequencing reads were mapped to the pineapple F153 reference genome with BWA (version: 0.7.12-r1039)¹⁷ using the default parameters. To convert mapping results into the BAM format and to filter duplicated reads, SAMtools (vesion:1.3)¹⁸ and Picard package were used, respectively. The Genome Analysis Toolkit (GATK, version 3.7-0-gcfedb67)¹⁹ was performed to detect SNPs. The neighbor-joining tree was constructed using SNPhylo²⁰ software with bootstrap value 100. We identified 7,252,423 SNPs and 923,469 indels.

To develop intron polymorphism (IP) markers, a customized Perl script was used to search IP loci in the pineapple genome, and the primer design and tests were similar to the pressure of SSR maker developments. The PGD collected 17,540 IP loci, which are also used to establish whether introns exist in the querying sequences using the IP development page.

Expression data

We collected and downloaded 45 RNA-seq samples from the public platform (https://de.iplantcollaborative.org/de/?type=data&folder=/iplant/home/cmwai/coge_data/Pineapple_tissue_RNAseq), these RNA-seq samples include the photosynthetic (green tip) and non-photosynthetic (white base) leaf tissue at 2-h intervals over a 24-h period during the growth period of the field from A. comosus cultivar MD2 (26 samples), the different leaf segments at 12:00 and 10:00 from each individual MD2 plants (12 samples), the fruits from cultivar MD2 (5 sample), the flowers (1 samples), and root (1 sample) tissues from A. comosus var. F153³. The sample information is shown in Table 3. The clean reads were obtained using Trimmomatic²¹, and were subsequently aligned to reference genomes using HISAT2 (v2.0.5)²². The fragments per kilobase of exon per million fragments mapped (FPKM) of the annotated genes were normalized using Cufflinks (v2.2.1) (http://cole-trapnell-lab.github.io/cufflinks/releases/v2.2.1/) with default parameters.

Table 3 Summary of RNA-seq samples in PGD

Full size table

Gene-to-gene co-expression

In order to identify co-regulated genes in A. comosus, both Spearman and Pearson methods²³ were used to calculate pairwise expression correlation co-efficiency based on RNA-seq data among 15 different tissues, which included 13 different leaves, one root, and one flower. A total of 7,228 informative genes (with FPKM >5 in at least one tissue and a variance >1) were obtained and gene pairs with absolute similarity of expression correlation >0.65 were used as the final dataset. All datasets are easily navigable and available in PGD.

Comparative genomics analysis

To clarify the evolutionary relationship and whole-genome duplication (WGD) events between pineapple and eight representative plant species, including Oryza sativa, Vitis vinifera, Spirodela polyrhiza, Asparagus aofficinalis, Elaeis guineensis, Phoenix dactylifera, S. bicolor, and Musa acuminata, we performed whole-genome comparative analyses. The collinear regions between pineapple and these eight plant species were visualized based on dot-plot using MCscan²⁴.

Database architecture and implementation

The PGD was implemented by performing a variety of several common software packages in the LINUX system, including PHP, Apache web server, MySQL database management, and Perl FastCGI. The data was processed and analyzed by the pipelines of Perl script, and bioinformation tools for interpreting biological significance. The PGD consists of some relational databases storing the processed data in MySQL. An interactive Web interface was constructed to enable users to conveniently access the PGD and obtain the information needed either for basic research applications or biological analysis through any modern browser on their devices. PHP script was implemented to transmit user query information and rapidly extracted data from MySQL databases management to generate report pages (Fig. 1). In addition, the genome visualization tool was implemented by the genetic genome browser (JBrowser)²⁵. For interactive alignment of genome sequences, BLAST²⁶ was performed by ViroBLAST²⁶, an independent web server for flexible queries of similar nucleotide and amino acid sequences.

Content

The overview of the PGD’s structure is shown in Fig. 2, it consists of three main modules: search, molecular markers, and online tools. In the search module, we provided four methods to search for pineapple genes: searching specific gene information by gene properties such as gene names, term ID, and expression value based on FPKM. The relationship of gene co-expression was searched by value and gene ID. In the molecular markers module, the search page of PGD molecular marker was developed based on available public data resources containing a large number of IP, SSR, and SNP markers. Almost all the genetic markers available could be used to construct the genetic map based on microsatellites²⁷ for A. comosus species, which can benefit both biological traits and genetic divergence studies in pineapple. In addition, online access tools provide two major functions for users, browsing gene structure by genomic regions, and searching by nucleotide and amino acid sequence similarity.

Utility and discussion

Search function: search genes, gene expression, and gene co-expression

The search module allows researchers to search gene information from the pineapple genome, and to do so several modes to search data are provided, including by gene ID, GO ID, KEGG ID, and InterPro ID. This module provides an interactive and user-friendly interface that also includes examples. The related term information of focused genes was provided by searching, where several hyperlinks are rendered in search results page, users can obtain interrelated core information of focused genes by clicking those hyperlinks.

In the gene-expression page, users can query the expression level based on FPKM of specified genes by inputting a gene or gene loci. In addition, the input is the identifier of target genes and the output is co-expression genes and relative correlation coefficient with the cutoff and type of coefficient users inputted above in the gene co-expression page (Fig. 3).

**Fig. 3: Schematic structure of the PGD.**

Comparative genomics

Pineapples are monocotyledonous and phylogenetically related to Poaceae (including maize, wheat, rice, and sorghum), its genome is considered as a reference for comparative analysis of monocotyledons based on its well-conserved karyotype. The critical phylogenetic position on the Poales level result in the revision of the date of the cereal genome duplication event ρ, which was initially thought to have occurred between 9.5 and 11.5 millions years ago (MYA), This duplication event does not exist in pineapples, and the previous σ WGD was revised to 100–120 MYA³. For example, a phylogenetic analysis for the SWEETs (Sugar Will Eventually be Exported Transporters) gene family based on representative monocotyledon (A. comosus, S. bicolor, O. sativa) plant species suggested that the SWEET gene family is ancient and its evolutionary history can be traced in duplicated order (Fig. 4). Here, SWEETs expansion in Poales were also assumed to be mainly caused by ρ WGD, one of the pineapple SWEET genes (Aco003627.1) where the recent gene duplications contributed to SWEET expansion in rice. But in another group, the Poales plant likely retained the SWEETs inherited from σ WGD (for example: Aco004463.1), and one of the ancestors subsequently acquired ρ WGD in the lineages leading to rice and sorghum, which generated two SWEETs. The Poales lineage separated from the lineages leading to banana and the palms 100–120 million years ago²⁸, the earlier σ WGD event occurred in Poales lineage before ρ WGD. Pineapple lacking the ρ WGD represents the closest sequenced lineage to the grasses, which makes it an excellent outgroup for grass comparative genomic studies³.

**Fig. 4: Comparison of gene structure and phylogenetic analysis of the eight members of *SWEET* gene family.**

This function is available for users who want to explore homology comparison, and evolution, and WGD events between pineapple and other species. Here, homologous regions from the pineapple genome to other species are provided by the collinear region search function, and users can also query detailed information about genes of orthologous pairs (Fig. 5). This function can help researchers to understand the collinear and evolutionary relationships between the same genome and the corresponding species.

**Fig. 5: Colinear regions of *A. comosus* compared with corresponding genomes.**

The molecular marker in PGD

SNP marker, SSR marker, and IP marker

Regardless of self-incompatibility, pineapple has high levels of heterozygosity resulting from clonal propagation. F153 had a combined heterozygosity rate of 1.89%, with 1.54% SNPs and 0.35% indels, whereas MD2 had a heterozygosity rate of 1.98%, with 1.71% SNPs and 0.27% indels. The wild A. bracteatus CB5 accession had a higher heterozygosity rate of 2.93%, with 2.53% SNPs and 0.40% indels³. About SNP from pineapple, users can query all variation type of very sequenced variety of pineapple by browsing JBrowse. PGD also provided an interactive interface for users to search molecular markers data by filling out search boxes and selecting special items. Meanwhile, there are two types of query page for CDS-SSR and genome-SSR for every species where users can obtain detailed information (e.g., start and end loci, forward sequence, reverse sequence, forward length, reverse length, forward GC, and reverse GC and so on) by clicking submit bottom on the query pages.

In addition, for IP molecular markers, the data here can be searched including: chromosome number, intron start, intron end, intron length, forward sequence, reverse sequence, forward Tm, reverse Tm, forward GC, and reverse GC.

BLAST server and genome visualization

BLAST was implemented by using ViroBlast for sequence homology searches. User can search sequences of pineapple including genomic scaffolds, coding sequences, or proteins. BLASTN, TBLASTX, and BLASTX can be conducted to search the sequences of scaffolds, unigenes, and gene CDS. In addition, users can also search against databases of protein sequences by inputting protein and nucleotide sequences within BLASTP and BLASTX, respectively. Besides, users can query protein sequences using TBLASTX, a tool of translating nucleotide sequences entered and nucleotide databases into protein sequences (Fig. 6a).

**Fig. 6: Visualization Tools Interface.**

JBrowse was developed for displaying the annotation and structure of our genes, functionally annotated unigenes, SNP, and RNA-seq data. Users can query genomic scaffolds, which enables users to view distinctly the relevant information for gene annotation and structure when assessing gene models. (Fig. 6b).

Data download, statistical information, and user manual

The download page was provided for users to download entire datasets as needed, including genomic scaffolds and predicted gene sequences in the FASTA format and gene structure in the gff3 format. The dataset of gene annotation contains gene functional descriptions, KEGG, GO, and InterPro domain, which was provided for users in a download page. Pineapple transcriptome data from leaf, the different development stages of fruits, flower, and root are available for expression information of the corresponding genes. In addition, the general statistics data of genome assemblies, gene expression, homology, gene models, gene co-expression, and genetic molecular marker in each species are displayed in interactive page readily. Additionally, we also provide a detailed user manual, including data resources, sequence processing methodology and parameters, and user documents for users in PGD, as well as manually correction documents for users generating useful and practical recommendations.

Limitations and future development

Some functional genomics information that cannot be fully accessed in all species in PGD remain due to the limitations of current assemblies and annotations. This information includes for example, alternative spliced events and non-coding RNA genes that are not annotated in most of the varieties of pineapple.

Pineapples are monocotyledonous and phylogenetically related to Poaceae plants (including maize, wheat, rice, and sorghum) and are the best genomes for studying the gene family evolution in Poaceae and monocot genomes. In addition, PGD will explore these two directions in future developments. Due to the rapid development of genome analysis, the variety of datasets for gene expression and sequence and the structure and function of the current annotation will be improved in the future. Besides, the novel functional genomics data resources will be displayed in PGD in the future, based on the recently released public data and data from our research group with respect to pineapple, including information about non-coding RNA, and comparative genomics.

Conclusion

We developed the Pineapple Genomic Database (PGD), which includes the large amount of set of genomic data and several online visualization tools for future research on genomics, molecular marker, and transcriptome of pineapple. Several powerful search tools were implemented, which allow users to analyze their target genes. This site aims to be the database-to-go for pineapple, thanks to intuitive search options, visualization, downloading, mining literature, and cross-species searches, providing the latest, unrestricted access to genomic data to end-users. Integrating all of these resources in a portal, and providing useful Ananas and comparative genomic specific resources, will help fostering a global and active Ananas research community and genome evolution study of Poales.

Availability and requirements

The PGD can be freely accessed at http://pineapple.angiosperms.org/pineapple/html/index.html via the World Wide Web. A reliable data management system has been developed and all newly released information will be updated on this website. Enquiries concerning the database should be directed by email to rayming@illinois.edu or zjisen@126.com.

References

Jiao, Y., Li, J., Tang, H. & Paterson, A. H. Integrated syntenic and phylogenomic analyses reveal an ancient genome duplication in monocots. Plant Cell 26, 2792 (2014).
Article CAS PubMed PubMed Central Google Scholar
Amborella Genome Project. The Amborella genome and the evolution of flowering plants. Science 342, 1241089 (2013).
Ming, R. et al. The pineapple genome and the evolution of CAM photosynthesis. Nat. Genet. 47, 1435 (2015).
Article CAS PubMed PubMed Central Google Scholar
Moyle, R. L., Crowe, M. L., Ripi-Koia, J., Fairbairn, D. J. & Botella, J. R. PineappleDB: an online pineapple bioinformatics resource. BMC Plant Biol. 5, 21 (2005).
Article PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 9, R7 (2008).
Article PubMed PubMed Central Google Scholar
Korf, I. Gene finding in novel genomes. BMC Bioinformatics 5, 59 (2004).
Article PubMed PubMed Central Google Scholar
Lomsadze, A., Ter-Hovhannisyan, V., Chernoff, Y. O. & Borodvsky, M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 33, 6494–6506 (2005).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M., Schöffmann, O., Morgenstern, B. & Waack, S. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics 7, 62 (2006).
Article PubMed PubMed Central Google Scholar
Gene Ontology Consortium. Gene Ontology Consortium: going forward. Nucleic Acids Res. 43, D1049 (2015).
Finn, R. D. et al. InterPro in 2017—beyond protein family and domain annotations. Nucleic Acids Res. 45, D190 (2017).
Article CAS PubMed Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 27, 29–34 (2000).
Google Scholar
Xie, C. et al. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39, W316 (2011).
Article CAS PubMed PubMed Central Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365 (2000).
CAS PubMed Google Scholar
Schuler, G. D. Sequence mapping by electronic PCR. Genome Res. 7, 541 (1997).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H., Handsaker, B., Wysoker, A., Fennell, T. & Ruan, J. The Sequence Alignment-Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Mckenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Lee, T. H., Guo, H., Wang, X., Kim, C. & Paterson, A. H. SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data. BMC Genomics 15, 162 (2014).
Article PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114 (2014).
Article CAS PubMed PubMed Central Google Scholar
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ma, C. & Wang, X. Application of the Gini correlation coefficient to infer regulatory relationships in transcriptome analysis. Chin. Nurs. Manag. 160, 192 (2008).
Google Scholar
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).
Article CAS PubMed Google Scholar
Skinner, M. E., Uzilov, A. V., Stein, L. D., Mungall, C. J. & Holmes, I. H. JBrowse: a next-generation genome browser. Genome Res. 19, 1630 (2009).
Article CAS PubMed PubMed Central Google Scholar
Deng, W., Nickle, D. C., Learn, G. H., Maust, B. & Mullins, J. I. ViroBLAST: a stand-alone BLAST web server for flexible queries of multiple databases and user’s datasets. Bioinformatics 23, 2334–2336 (2007).
Article CAS PubMed Google Scholar
Bindler, G. et al. A high density genetic map of tobacco (Nicotiana tabacum L.) obtained from large scale microsatellite marker development. Theor. Appl. Genet. 123, 219 (2011).
Article PubMed PubMed Central Google Scholar
D’Hont, A. et al. The banana (Musa acuminata) genome and the evolution of monocotyledonous plants. Nature 488, 213–217 (2012).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank Irene Lavagi for editing the language. The study was supported by grants from the 863 program (2013AA102604), NSFC (31201260), Program for New Century Excellent Talents in Fujian Province and Science and Technology Major Project of Fujian Province (2016NZ0001).

Author information

Authors and Affiliations

Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, 350002, Fuzhou, China
Huimin Xu, Yan Shi, Xiuting Hua, Haibao Tang, Ray Ming & Jisen Zhang
College of Resource and Environment, Fujian Agriculture and Forestry University, 350002, Fuzhou, China
Huimin Xu
Department of Plant Pathology and Microbiology, Texas A&M AgriLife Research, Texas A&M University System, Dallas, TX, 75252, USA
Qingyi Yu
Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, 271018, Taian, China
Long Yang
Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
Ray Ming

Authors

Huimin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Qingyi Yu
View author publications
You can also search for this author in PubMed Google Scholar
Yan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Xiuting Hua
View author publications
You can also search for this author in PubMed Google Scholar
Haibao Tang
View author publications
You can also search for this author in PubMed Google Scholar
Long Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ray Ming
View author publications
You can also search for this author in PubMed Google Scholar
Jisen Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Long Yang, Ray Ming or Jisen Zhang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xu, H., Yu, Q., Shi, Y. et al. PGD: Pineapple Genomics Database. Hortic Res 5, 66 (2018). https://doi.org/10.1038/s41438-018-0078-2

Download citation

Received: 02 May 2018
Revised: 17 June 2018
Accepted: 06 August 2018
Published: 17 September 2018
DOI: https://doi.org/10.1038/s41438-018-0078-2

This article is cited by

The pear genomics database (PGDB): a comprehensive multi-omics research platform for Pyrus spp.
- Shulin Chen
- Manyi Sun
- Jun Wu
BMC Plant Biology (2023)
Genome-wide characterization and expression profiling of B3 superfamily during ethylene-induced flowering in pineapple (Ananas comosus L.)
- Cheng Cheng Ruan
- Zhe Chen
- Zhi Li Zhang
BMC Genomics (2021)
The pineapple MADS-box gene family and the evolution of early monocot flower
- Juan Hu
- Xiaojun Chang
- Liangsheng Zhang
Scientific Reports (2021)
Genome-wide Identification and Expression Pattern Analysis of the HD-Zip Transcription Factor Family in Pineapple (Ananas Comosus)
- Qiao Zhou
- Liping Liu
- Yuan Qin
Tropical Plant Biology (2021)
The role of cis-elements in the evolution of crassulacean acid metabolism photosynthesis
- Li-Yu Chen
- Yinghui Xin
- Ray Ming
Horticulture Research (2020)

Subjects

Abstract

Similar content being viewed by others

Introduction

Construction and content

Data sources and processing

Genome assemblies and gene annotations

Gene function annotation

Genetic marker annotation: SSR, SNP, and IP markers

Expression data

Gene-to-gene co-expression

Comparative genomics analysis

Database architecture and implementation

Content

Utility and discussion

Search function: search genes, gene expression, and gene co-expression

Comparative genomics

The molecular marker in PGD

SNP marker, SSR marker, and IP marker

BLAST server and genome visualization

Data download, statistical information, and user manual

Limitations and future development

Conclusion

Availability and requirements

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links