The persimmon (Diospyros oleifera Cheng) genome provides new insights into the inheritance of astringency and ancestral evolution

Zhu, Qing-gang; Xu, Yang; Yang, Yong; Guan, Chang-fei; Zhang, Qiu-yun; Huang, Jing-wen; Grierson, Don; Chen, Kun-song; Gong, Bang-chu; Yin, Xue-ren

doi:10.1038/s41438-019-0227-2

Download PDF

Article
Open access
Published: 18 December 2019

The persimmon (Diospyros oleifera Cheng) genome provides new insights into the inheritance of astringency and ancestral evolution

Qing-gang Zhu ORCID: orcid.org/0000-0002-8786-2169^1,2^na1,
Yang Xu³^na1,
Yong Yang⁴,
Chang-fei Guan⁴,
Qiu-yun Zhang^1,2,
Jing-wen Huang^1,2,
Don Grierson ORCID: orcid.org/0000-0002-2238-8072^1,5,
Kun-song Chen^1,2,
Bang-chu Gong³ &
…
Xue-ren Yin^1,2

Horticulture Research volume 6, Article number: 138 (2019) Cite this article

7067 Accesses
36 Citations
28 Altmetric
Metrics details

Subjects

Abstract

Persimmon (Diospyros kaki) is an oriental perennial woody fruit tree whose popular fruit is produced and consumed worldwide. The persimmon fruit is unique because of the hyperaccumulation of proanthocyanidins during fruit development, causing the mature fruit of most cultivars to have an astringent taste. In this study, we obtained a chromosome-scale genome assembly for ‘Youshi’ (Diospyros oleifera, 2n = 2x = 30), the diploid species of persimmon, by integrating Illumina sequencing, single-molecule real-time sequencing, and high-throughput chromosome conformation capture techniques. The assembled D. oleifera genome consisted of 849.53 Mb, 94.14% (799.71 Mb) of which was assigned to 15 pseudochromosomes, and is the first assembled genome for any member of the Ebenaceae. Comparative genomic analysis revealed that the D. oleifera genome underwent an ancient γ whole-genome duplication event. We studied the potential genetic basis for astringency development (proanthocyanidin biosynthesis) and removal (proanthocyanidin insolublization). Proanthocyanidin biosynthesis genes were mainly distributed on chromosome 1, and the clustering of these genes is responsible for the genetic stability of astringency heredity. Genome-based RNA-seq identified deastringency genes, and promoter analysis showed that most of their promoters contained large numbers of low oxygen-responsive motifs, which is consistent with the efficient industrial application of high CO₂ treatment to remove astringency. Using the D. oleifera genome as the reference, SLAF-seq indicated that ‘Youshi’ is one of the ancestors of the cultivated persimmon (2n = 6x = 90). Our study provides significant insights into the genetic basis of persimmon evolution and the development and removal astringency, and it will facilitate the improvement of the breeding of persimmon fruit.

The apricot (Prunus armeniaca L.) genome elucidates Rosaceae evolution and beta-carotenoid synthesis

Article Open access 18 November 2019

A chromosome-scale genome sequence of pitaya (Hylocereus undatus) provides novel insights into the genome evolution and regulation of betalain biosynthesis

Article Open access 06 July 2021

The chromosome-level genome of dragon fruit reveals whole-genome duplication and chromosomal co-localization of betacyanin biosynthetic genes

Article Open access 10 March 2021

Introduction

Diospyros L. belongs to the Ebenaceae, a plant genus that includes over 500 species that are distributed worldwide¹ and is one of the largest angiosperm genera². Some of its species (e.g., Diospyros kaki Thunb) produce edible fruit, and others are sources of ebony wood, stock, medicinal materials or ornamental plants³. The most well-known species is Diospyros kaki, which originated from China⁴. One fossil leaf of wild persimmon (Diospyros miokaki) that is ~25 million years old was found in the Shandong Province of China and stored in the Shanwang Paleontological Museum⁵. In ancient China, persimmon was first described in ‘Li Ji·Nei Ze’ (circa 450 BC, written by Ji Kong, grandson of Confucius), and it began to be considered an important food during the Warring States Period (475–221 B.C.)⁶. The persimmon was widely cultivated in China during the Tang Dynasty (618–907 A.D.) due to the development of grafting and technological approaches for removing astringency, and it was introduced to Japan and spread to the Korean peninsula in the 15th century and to Europe and then America in the 18th–20th centuries³. Persimmon has been cultivated mainly in oriental countries (e.g., China, Japan, and Korea) but is widely distributed in other areas (e.g., China, Japan, and Korea). Australia, Brazil, Israel, Spain, and the United States)⁷. The most recent statistical data indicate that persimmon fruit production in China has reached 400 million tons annually (FAO, 2016).

Persimmon fruit are unique in their accumulation of proanthocyanidins (PAs, also known as condensed tannins, CTs), and soluble CTs (SCTs) cause fruit astringency in most cultivars, even at commercial maturity^8,9, reducing consumer appeal. Thus, persimmon is a characteristic material for studying PA synthesis, accumulation and metabolism in fruit. PAs also prevent benefits such as strong oxidation resistance, and they are widely used in cosmetics as antioxidants and anti-aging ingredients¹⁰. PAs are also widely used in wastewater treatment because of their scavenging ability towards radioactive and heavy metals^{10,11,12,13,14}. The biosynthesis of PAs is controlled by a series of structural genes in the shikimate pathway¹⁵; however, the molecular mechanisms and evolution of PA accumulation in persimmon remain elusive¹⁶. In addition to the study of PA metabolism, in recent years, persimmon has been used as a model to understand fruit anaerobic environment interactions^17,18 and the evolution of sex chromosome systems in higher plants¹⁹. However, most of these studies were conducted based on the de novo assembly of transcriptomic sequences due to the absence of complete persimmon genome sequences for either Diospyros species or other Ebenaceae. This deficiency has inhibited research on evolution and heredity in persimmon and other plants in the Ebenales.

Within Diospyros, there are three main species cultivated for fruit production: Diospyros lotus, Diospyros oleifera, and Diospyros kaki, and all three of which produce astringent fruit. D. kaki is hexaploid (2n = 6x = 90), while D. lotus and D. oleifera are diploid (2n = 2x = 30)²⁰. D. kaki accounts for the greatest number of commercial fruit tree cultivars, and D. lotus is widely used as a rootstock. Compared to D. kaki and D. lotus, D. oleifera exhibits two additional characteristics: the generation of trichomes and the excretion of oils on the fruit surface (Supplementary Fig. 1). Here, the material ‘Youshi’ (the traditional common name in China for D. oleifera Cheng) was used; Youshi means ‘oil persimmon’ in Chinese. Another Ebenales plant, avocado (Persea americana), is also well known for containing high levels of lipids²¹. Fu et al.²² sequenced the complete chloroplast genomes from D. kaki, D. lotus, D. oleifera, D. glaucifolia and D. ‘Jinzaoshi’ and found that D. kaki was more closely related to D. oleifera, whereas D. lotus exhibited a closer relationship with D. glaucifolia. Kanzaki⁴ proposed that the commercialized persimmon (D. Kaki) originated from southern China through polyploidization from diploid ancestors. The complete genome sequence for Diospyros would be tremendously useful for clarifying these issues.

In this study, D. oleifera (‘Youshi’) was chosen as the material for genome sequencing. The PacBio RSII and HiSeq sequencing techniques were used to generate a high-quality genome assembly and annotation for this diploid persimmon. Due to the importance of astringency for the persimmon industry, the genetic basis for astringency development and removal was also investigated. Based on the obtained genome sequence, the first genetic linkage map was constructed for persimmon using SLAF-seq, which provided information about the evolution of persimmon.

Results

Genome assembly and annotation

The ‘Youshi’ genome was sequenced using the Illumina HiSeq (Illumina, USA) and PacBio Sequel platforms, and the assembled scaffolds were ordered using the Hi-C technique. The detailed assembly process is illustrated in Supplementary Fig. 2. The project generated ~86 gigabases (Gb) of high-quality sequences (Supplementary Tables 1 and 2), representing ~100× coverage of this diploid ‘Youshi’ persimmon genome. The final genome assembly was 849.53 Mb in size, very close to the predicted size of 853.3 Mb based on nuclear weight measurements performed via flow cytometry (Supplementary Fig. 3), and it consisted of 4728 scaffolds (≥1 kb) with an N50 of 42.43 Mb and 5919 contigs (≥1 kb) with an N50 of 890.84 kb (Supplementary Table 3). The GC content of the assembled persimmon genome was 37.40% (Table 1). Notably, 94.14% (799.71 Mb) of the genome was anchored to 15 pseudochromosomes (Supplementary Table 4), and a total of 556.36 Mb (64.96%, Supplementary Table 5) of repetitive sequences were identified.

Table. 1 Statistics of persimmon genome assembly and annotation.

Full size table

The quality of the assembly was further assessed by three independent methods. First, it was verified that it contained a majority of the core eukaryotic genes (98.03% and 77.02%, respectively, Supplementary Table 6) and genes in the BUSCO (Benchmarking Universal Single-Copy Orthologs) datasets²³ (89.86%, Supplementary Table 7). Second, the high-throughput sequencing (Illumina HiSeq) data were aligned to the genome sequence using BWA software²⁴^, and the results indicated that more than 98.58% of the sequences could be mapped to the assembled genome (Supplementary Table 8). Third, a Hi-C intrachromosomal contact map suggested that all bins could be allocated to 15 pseudochromosomes (Supplementary Fig. 4). Thus, all of these results support the conclusion that this assembled ‘Youshi’ persimmon genome is of high quality at the chromosome scale.

A total of 32,516 putative genes were predicted, with an average gene length of 6773 bp; 80.53% of these genes shared homology with known genes, and 95.95% of these genes were functionally annotated (Supplementary Tables 9 and 10; Supplementary Fig. 5). Among these predicted genes, only 25,379 (78.05%) could be anchored to the 15 pseudochromosomes (Supplementary Fig. 6). A total of 1540 noncoding RNAs, including miRNAs, rRNAs, and tRNAs, were identified by in silico prediction (Supplementary Table 11). In addition, 4381 pseudogenes were predicted in the persimmon genome (Supplementary Table 12). Motif and domain annotation analysis based on the amino acid sequences of 32,516 predicted genes indicated a total of 2802 motifs and 36,198 domains (Supplementary Table 13). There were 137 gene syntenic blocks and 3612 paralogous gene groups identified based on the self-alignment of the 25,379 chromosome-anchored genes, indicating that the persimmon genome has undergone frequent interchromosome fusions and segmental duplications during its evolutionary history (Fig. 1).

**Fig. 1: Homologous genome blocks in persimmon.**

Comparative genomic and genome evolutionary analysis

A gene family cluster analysis of the complete gene sets of persimmon (D. oleifera), apple (M. domestica), Arabidopsis (A. thaliana) and grape (V. vinifera) was performed. A total of 25,199 gene families in the persimmon genome were grouped into 13,406 gene clusters, with 7567 gene clusters being shared by all four species (Fig. 2a). Furthermore, 3644 persimmon-specific genes in 1251 clusters were identified (Supplementary Table 14), which are annotated in Supplementary Table 15. An expanded analysis was performed with 13 other sequenced plant genomes, and single-copy genes were used for phylogenetic tree construction. The results showed that D. oleifera is relatively closely related to Actinidia chinensis (kiwifruit) and Solanum lycopersicum (tomato) (Fig. 2b). The phylogenetic tree also indicated that D. oleifera diverged phylogenetically from A. chinensis approximately 77.80 million years ago (Mya), after the divergence of S. lycopersicum at 104.47 Mya (Fig. 2b).

**Fig. 2: Comparative genomic analysis of persimmon and other species.**

The 4DTv (fourfold synonymous third-codon transversion) value peaked at ~1.51 for D. oleifera (Fig. 2c), indicating that only one ancient γ whole-genome duplication (WGD) occurred in the D. oleifera genome lineage, which was further confirmed by ks analysis (Supplementary Fig. 7). There were 328 and 493 gene families (annotations are listed in Supplementary Table 16) showing expansion or contraction, respectively, after divergence from kiwifruit (Fig. 2d), suggesting that more D. oleifera gene families have experienced contraction than expansion during adaptive evolution.

A total of 414 and 3637 genes in the expanded families were annotated (Supplementary Tables 17 and 18) to kyoto encyclopedia of genes and genomes (KEGG) pathways and gene ontology (GO) terms, respectively. KEGG analysis showed that most of the expanded genes were involved in plant-pathogen interactions, starch and sucrose metabolism and phenylpropanoid biosynthesis. GO analysis revealed that in addition to primary metabolic processes, the expanded orthogroups were involved in the defense response, sexual reproduction and flavonoid biosynthetic processes. Forty-seven of the genes from contracted gene families were clustered in 28 KEGG pathways, including plant hormone signal transduction, sesquiterpenoid, and triterpenoid biosynthesis and beta-alanine metabolism (Supplementary Table 19). The GO terms of the genes from contracted gene families were mainly related to oxidation-reduction processes, protein phosphorylation and pentacyclic triterpenoid biosynthetic processes (Supplementary Table 20). The functional annotations of the genes in expanded and contracted gene families highlighted various traits of persimmon, including its high contents of sugar and flavonoids, dioecy, low levels of terpenes, and strong adaptability.

Characterization of putative genes in proanthocyanidin biosynthesis pathways

A unique feature of persimmon is that it accumulates a high content of PAs during fruit development. In the genome, we found 57 genes involved in PA biosynthesis (Supplementary Table 20), all of which were mapped to the D. oleifera genome. The results indicated that 33.96% of the genes involved in PA biosynthesis were located on chromosome 1 (Fig. 3) and formed a gene cluster enriched in these genes (Supplementary Fig. 8).

Genome-wide RNA-seq analysis of astringency removal in persimmon fruit by artificial high CO₂ treatment

Most commercialized persimmon cultivars are of the astringent type; thus, understanding PA biosynthesis (astringency formation) and removal (deastringency) is important. A few candidate genes related to alcoholic fermentation and transcription factors that regulate their expression have been previously identified and shown to contribute to astringency removal^9,17,18. However, because of the absence of a complete genome sequence, many of the genes potentially involved in astringency removal were not identified.

High CO₂ treatment (95% CO₂ + 4% N₂ + 1% O₂) is the most effective method for tannin removal. This treatment results in elevated ADH and PDC activities and triggers acetaldehyde metabolism^9,18,25,26. The acetaldehyde that is produced converts the soluble condensed tannins into insoluble condensed tannins^27,28, so that they no longer contribute to astringency. Here, 53 astringent-type cultivars of persimmon fruit (listed in Supplementary Table 22) were collected and subjected to high CO₂ treatment for 24 h. Four representative cultivars, two showing rapid deastringency after treatment (D. kaki cv. Sigoushi, SGS and cv. Luoyangfangtianshengshi, LYFTSS) and two showing slow deastringency (D. kaki cv. Laopige, LPG and cv. Shijiazhuanglianhuashi, SJZLHS), were selected according to quantitative soluble tannin analysis. After high CO₂ treatment, SGS and LYFTSS exhibited lower soluble tannin contents (<0.1 FW%), while LPG and SJZLHS exhibited higher soluble tannin contents (0.92 and 0.66 FW%, respectively) (Fig. 4b).

**Fig. 4: Persimmon fruit deastringency analysis and WGCNA analysis.**

When these four cultivars were used for RNA-seq, the differentially expressed genes (DEGs) were clustered into 21 coexpression modules (Fig. 4c) through WGCNA (weighted gene coexpression network analysis), and tannin contents were correlated with the module eigengenes (Fig. 4d). Three significantly negative correlation modules (<−0.6) were found (tan, −0.75; brown, −0.77; yellow, −0.64), and the genes within these modules are listed in Supplementary Table 23. As expected, genome-wide transcriptomic analysis provided a more comprehensive overview of astringency removal. The previously characterized genes related to alcoholic fermentation, DkPDC1 (EVM0027273), DkPDC2 (EVM0022732), and DkADH1 (EVM0007501), were simultaneously clustered in the brown or yellow module. Moreover, a key enzyme that catalyzes the phosphoenolpyruvate (PEP)-to-pyruvate transition under low-oxygen conditions (Supplementary Fig. 9 ²⁹), the pyruvate kinase (PK) gene (EVM0008535, DkPK1), was found to be one of the hub genes (Supplementary Fig. 10). The expression in fruit of members of the DkPK family was verified in ‘Jingmianshi’ fruit (D. kaki, astringent type), and the transcript levels of DkPK1 were found to be higher than those of any other members of this gene family (Fig. 4e). Furthermore, dual-luciferase assays indicated that two previously characterized hypoxia/high-CO₂-responsive transcription factors (DkERF18 and DkERF24) could trans-activate the promoter of DkPK1 (Fig. 4f).

To understand the genetic basis of the failure of astringency removal during fruit development in astringent-type persimmons, the putative promoter regions (−2000 bp) of all of the previously characterized hypoxia/high-CO₂-responsive transcription factors (Supplementary Table 24) and candidate genes included in Supplementary Fig. 9 were obtained (Supplementary Fig. 11 and 12). Motif analysis indicated that anaerobic-responsive elements (GC-rich motif: GCC[C/G]C, GT-rich motif: GGTTT)³⁰ were present in all of these promoters (Supplementary Figs. 11 and 12). Each of these genes exhibited more than one anaerobic-responsive element within a single promoter (Supplementary Figs. 11 and 12), and these results confirm the genetic and mechanistic basis of the effectiveness of low oxygen (resulting from high CO₂ treatment) in causing astringency removal.

D. oleifera genome-based genetic map analysis

Two commercial persimmon cultivars, ‘Taishuu’ (Diospyros kaki cv. Taishuu, male) and ‘Luotiantianshi’ (Diospyros kaki cv. Luotiantianshi, female), were used as parents for hybridization. Due to the high rate of abortion in hybrid seeds, even after using the embryo rescue culture technique³¹, only 77 F1 progeny were obtained. Then, an SLAF-seq (specific-locus amplified fragment sequencing) library was constructed for the two persimmon parents and their F1 offspring. The evaluation of the D. kaki SLAF DNA library indicated that the efficiency of RsaI and HaeIII digestion was 86.80%. The percentage of the mapping of the paired-end reads in the control (Oryza sativa L. japonica) was 87.92%. The average sequencing depth of the SLAF markers was 20.40-fold (Supplementary Table 25). After filtering, for the F1 population, 20,405 SLAFs with three segregation patterns were used for genetic map construction, and 11,204 markers were used (Supplementary Table 26). After linkage analysis, 11,204 markers were anchored to 45 high-density linkage groups of D. kaki, with a minimum LOD score (MLOD) of 7.0. The final map was 7168.74 cM long, with an average intermarker distance of 0.64 cM (Supplementary Table 27). The largest LG (linkage group) was LG45, which contained 222 markers with a length of 224.67 cM; the smallest was LG35, which contained 118 markers that spanned 56.23 cM (Supplementary Table 27). The percentage of gaps ≤5 cM between adjacent markers of the 45 LGs ranged from 92.74 to 99.35% (average of 97.13%), and the largest gap was 19.77 cM in LG17 (Fig. 5a). After correcting genotyping errors using SMOOTH algorithms, 1140 segregation distortion markers with P < 0.01 were retained to increase genomic coverage in the final genetic map, which was further evaluated by the haplotype (Supplementary Fig. 13) and heat maps (Supplementary Fig. 14).

**Fig. 5: Genetic map analysis of persimmon.**

Based on the D. oleifera genome, the colinear relationship between the D. kaki genetic map and the D. oleifera genome was analyzed. The SLAF sequences of the 11,204 mapped markers were aligned to the genomes using BLASTN analysis with an e-value cutoff of 1e−5, revealing that 26.86% (n = 3009) of the markers were assigned to the D. oleifera genome (Supplementary Table 20). Moreover, 45 LGs were clearly assigned to the 15 chromosomes (Fig. 5b and Supplementary Table 28). Different numbers of LGs were assigned to different chromosomes, such as chromosomes 1/3/5/7/9/12/15, each with two LGs, chromosomes 4/8/10/13/14 were each assigned three LGs, and four, five and seven LGs were assigned to chromosomes 11, 2 and 6, respectively (Fig. 5b). Using the Spearman correlation coefficient approach, the mapped markers (n = 3009) of the D. kaki genetic map were found to show significant collinearity with the D. oleifera genome, and the correlation between the genetic and physical positions of the SLAF markers is shown in Fig. 5b.

Discussion

Persimmon is a widely grown woody tree, and its fruit are valued by consumers because of their high concentrations of antioxidant compounds (especially PAs) and edible fiber, and they serve as a primary source of medicines^3,32,33. Although persimmon is economically and culturally important in oriental countries, its genome sequence has not been published. Here, using PacBio and Hi-C technology, the genome sequence of persimmon was assembled from diploid ‘Youshi’ (D. oleifera), which provides a basic resource for the future persimmon research and industrial applications. A genome sequence of 849.53 Mb was assembled with a contig N50 of 0.89 M, and 94.14% of the genome was anchored to 15 pseudochromosomes. The quality of the assembly was further assessed by using core genetic integrity assessment and RNA-seq data. There were 32,516 predicted gene locations in the persimmon genome, which is similar to results from other plant species, such as tomato (N = 34,879)³⁴ and pomegranate (N = 30,903)³⁵. A total of 556.36 Mb (64.96%, Table S7) of repeat sequences were identified, which is similar to the amount in sorghum (62%)³⁶ and higher than that in apple (42%)³⁷, grapevine (41.4%)³⁸ and rice and Arabidopsis (14 and 17%, respectively)^39,40. Additionally, a total of 4012 simple sequence repeats were evenly distributed across the pseudochromosomes, which are beneficial for molecular breeding (Supplementary Table 5). The obtained D. oleifera genome not only provides basic information for persimmon research and utilization but also represents the first reported complete genome for Ebenales. Thus, the D. oleifera genome could be further used for better understanding genome evolution and the relationships of plants in Ebenales.

The phylogenetic tree compiled from the whole-genome analyses showed that D. oleifera diverged from kiwifruit approximately 77.80 Mya, ago and the estimated times of the separation from the lineages with A. thaliana and V. vinifera were ~124.20 and 124.63 years ago, respectively. Following the ancient hexaploidization event (γ) shared by core eudicots, including kiwifruit, there were two additional independent WGD events⁴¹. For D. oleifera, the 4DTv analysis indicated that persimmon underwent an ancient γ WGD event. Whether this WGD event is shared by other members of Ebenales will have to be determined through further analysis when more Ebenales genome sequences become available.

The KEGG and GO analyses indicated that the expanded gene families are mainly involved in starch and sucrose metabolism, which may have conferred greater attractiveness on the persimmon fruit during evolution. Some other expanded genes were found to participate in phenylpropanoid biosynthesis, flavonoid biosynthesis, and anthocyanin biosynthesis, and all of these pathways, as well as PA biosynthesis, are derived from the shikimate pathway¹⁵. The phenylpropanoids are considered to be key mediators involved in plant resistance to biotic and abiotic stress responses⁴². These expanded gene families are responsible for the accumulation of PAs in persimmon but also cause fruit astringency, which is not desirable to humans and unexpected animal and insect invasions. In recent years, a non-astringent persimmon that originated from spontaneous mutants of astringent persimmon has increased in importance in fruit industry⁷. Experiments indicate that astringent persimmons are more resistant to cold than non-astringent persimmons³. Thus, the expansion of the relevant gene families may have provided an important way to generate abiotic and biotic resistance in persimmon, which would have been critical for the evolution of persimmon and the expansion of its cultivation.

The characteristics of D. kaki persimmon and ‘Youshi’ (D. oleifera) related to astringency development and removal were selected for further genome-based investigation. The fruit of hexaploid persimmon cultivars such as ‘Youshi’ also exhibit active PA biosynthesis. The basis for this characteristic was investigated via the genome-wide analysis of PA biosynthesis genes. Most of the PA biosynthesis genes were located on chromosome 1, where they formed a gene cluster. In the plant genome, it is common to find highly homologous genes involved in secondary metabolite synthesis located in a cluster on a chromosome⁴³. These gene clusters may originate from duplication and divergence, and clustering may also improve gene function. Furthermore, related traits are likely to be stably inherited, and the genes in clusters could be regulated simultaneously. Most metabolites synthesized by gene clusters are defense or tolerance related^44,45,46,47. Arabidopsis, apple, and grape are predicted to exhibit 18, 73, and 23 genes involved in PA biosynthesis, respectively, but these genes are widely distributed on different chromosomes^37,38,39. Thus, the clustering of PA biosynthesis genes provides the genetic basis for high PA content in persimmon. On the other hand, genome-based RNA-seq provided new insights regarding astringency removal by identifying the previously characterized PDC and ADH genes⁹ and new hypoxia-responsive genes (e.g., DkPK1 and its transcriptional regulation). In Arabidopsis, a genome-wide investigation in seedlings predicted 49 core hypoxia-responsive genes⁴⁸. Here, the assembled genome allowed the rapid searching of low-oxygen-responsive motifs in the putative promoter regions of these candidate genes. Our findings suggested that all of the candidate genes exhibited multiple low-oxygen-responsive motifs in their promoters (Supplementary Figs. 10 and 11), which provided genetic evidence of their role and effectiveness in postharvest astringency loss in response to a low oxygen environment.

High-density linkage maps are the basis for QTL fine mapping of traits, map-based gene cloning and marker-assisted breeding^49,50,51. In persimmon, germplasm genetic diversity and marker-trait associations have been explored using different types of molecular markers^{52,53,54,55,56}. However, none of this research has led to the construction of a high-quality genetic map for persimmon or for any member of the Diospyros genus, due to the limited number of hybrid offspring produced because of the abortion of hybrid seeds. SLAF-seq provides an economical and efficient method for the linkage mapping of nonmodel species with complex genomes⁵⁷. Here, we obtained 77 F1 offspring from ‘Taishuu’ × ‘Luotiantianshi’ via the embryo rescue culture technique³¹ and constructed the first D. kaki genetic linkage map, with 11,204 markers spanning 7,168.74 cM. The average intermarker distance was 0.64 cM, which is comparable to the resolution reported for high-density linkage maps of Vigna unguiculata (0.35 cM)⁵⁸, Arachis hypogaea (0.58 cM;⁵¹ 0.7 cM⁵⁹), Pleurotus tuoliensis (1.0 cM)⁵⁰ and Gossypium barbadense (1.09 cM)⁴⁹. This genetic map should be useful for persimmon breeding as a genome reference and for establishing marker-trait associations, providing a basis for understanding the persimmon genome and its inheritance. These markers will be useful for screening important biological properties of persimmon, especially related to the accumulation of tannins and other factors affecting taste.

As the genome of hexaploid cultivated persimmon remains unpublished, the availability of the D. oleifera genome provides an alternative comparable reference for D. kaki. Based on the D. oleifera genome, 26.86% of the 11,204 markers (n = 3009) in the D. kaki genetic linkage map were assigned to and showed high colinearity with the newly completed D. oleifera genome sequences. Thus, it can be proposed that D. oleifera was one of the diploid ancestors of hexaploid D. kaki, and this evidence supports the previous hypothesis of Kanzaki⁴ and the preliminary predictions resulting from genomic in situ hybridization (GISH) studies⁶⁰ and the comparison of chloroplast genome sequences^22,61. The relationship between chromosomes and LGs suggested that D. oleifera may also be closely related to other ancestors of D. kaki, and chromosome breakage and rearrangement may have occurred during the evolution of D. kaki from D. oleifera.

In summary, the first genome of Ebenaceae, from D. oleifera, was sequenced and found to be approximately 849.53 Mb in size (N50 > 0.8 M), with 94.14% (799.71 Mb) of the sequences anchored onto 15 pseudochromosomes. Comparative genomics analysis revealed the genetic basis of astringency development (PA synthesis) and removal in persimmon via precipitation, caused by the production of acetaldehyde during the low-oxygen response. Furthermore, the first genetic map of persimmon was constructed and indicated that D. oleifera was one of the ancestors of cultivated persimmon. The persimmon genome represents an invaluable resource for the genetic improvement of the fruit and for better understanding of its genome evolution. Genetic markers can be developed based on this genome sequence, which will help to accelerate persimmon breeding.

Materials and methods

Genome size estimation

Young leaves of D. oleifera from three branches were used as the sample for genome sequencing (with each tree as one repetition). Nuclear DNA was isolated from fresh leaf tissue using a protocol developed by Doležel et al.⁶², and flow cytometry (BD FACScalibur, BD Biosciences, USA) was applied to estimate the nuclear DNA content. The reference standard was Zea mays ‘B73’ (2.35 pg/1C)⁶³. The genome size of D. oleifera was calculated as the ratio between the sample and standard peaks multiplied by the genome size of the standard.

De novo genome assembly and quality assessment

Mature leaves of D. oleifera ‘Youshi’ were collected from the National Persimmon Germplasm Resources Nursery (Northwest A&F University, Shaanxi, China), and the CTAB method⁶⁴ was applied to extract the genomic DNA from the leaves. Library construction, sequencing, assembly and quality assessment for the ‘Youshi’ genome were conducted by Biomarker Technologies Corporation (Beijing, China). According to standard Illumina protocols, two paired-end (PE) libraries with insert sizes of 220 and 500 bp and 14 mate-pair (MP) libraries with insert sizes of 3, 4, 5, 8, 10, 15, and 17 kb were constructed. The Illumina HiSeq 2500 platform (Illumina, San Diego, CA, USA) was used to sequence these libraries and produced 88.35 Gb of raw sequencing data, representing 100X genome coverage. Reads were trimmed for adaptor removal and quality enhancement. NxTrim was used to collapse the duplicated reads of the MP libraries. A deduplication step was performed on the MP libraries.

The genome assembly and contigs were constructed and analyzed by ALLPATHS-LG⁶⁵ with default parameters. SSPACE⁶⁶ was used to link the contigs with support from the paired MP reads. Gaps were filled with GapCloser⁶⁷. In preparation for the assembly, scaffolds <1000 bp were removed. PacBio library construction included DNA purification, damage repair, hairpin adaptor ligation, and digestion with exonucleases to remove damaged DNA and fragments without adaptors. The sizes of the templates (≥15 Kb) were selected via Blue Pippin electrophoresis, and the RS II platform (Pacific Biosciences, California, USA) using P6-C4 sequencing chemistry was then applied to sequence the library. The correction of the draft genome assembly was performed with the Canu v1.5⁶⁸, WTDBG v1.2.8 (available at https://github.com/ruanjue/wtdbg) and Falcon⁶⁹ (Supplementary Tables 31 and 32) assemblers. Finally, the assembly-merging method was utilized⁷⁰ to optimize the assembly results.

Another assembly method for scaffold anchoring using Hi-C sequencing technology was applied. The gDNA was digested with HindIII. The 5′ overhangs were filled with biotinylated nucleotides; free blunt ends were ligated; and the DNA was purified and sheared to 300–700 bp fragments after biotin was removed. PCR was applied to enrich the biotin-tagged fragments, and the DNA was then used to construct an Illumina library, which was on the Illumina HiSeq 2500 platform. A total of 55.41 Gb of clean data (65 × the genome size) were obtained, and 62.20 M valid interaction pairs were detected to assist in genome assembly. The evaluation of the alignment efficiency and insert length distribution for valid pair fragments and valid interaction pairs was performed with HiC-Pro⁷¹. The genome sequence contigs and scaffolds were divided into subgroups and sorted and oriented into super scaffolds using LACHESIS⁷².

Final assembly and validation

To improve the persimmon genome assembly and chromosome anchoring accuracy, the Illumina data, PacBio long reads and Hi-C data were combined and mutually corrected. Finally, the persimmon genome was assembled into 856.41 Mb of scaffold sequences. We found that 99.99% (856.35 Mb) of the genome consisted of the contig sequences. The assembly contained 4728 scaffolds, with a scaffold N50 of 42.42 Mb, composed of 5919 contigs, with a contig N50 of 890.84 kb. The maximum scaffold and contig lengths were 57.59 Mb and 9.38 Mb, respectively.

The Hi-C contact heatmap was inspected to confirm the accuracy of the Hi-C assembly. To facilitate the formation of Hi-C contacts, the sequences were proximate in three dimensions if they were adjacent to one another in one dimension. The number of Hi-C links between 100-kb windows on the pseudochromosomes of the final assembly is represented by the intensity of red coloration. More pronounced contact frequencies indicate more Hi-C links between the two bins. An elevated link frequency was observed with a diagonal pattern within individual pseudochromosomes, indicating increased interaction contacts between adjacent regions. The second-generation short sequencing reads from the RNAseq data were also used to validate the final assembly. Among the 250,587,312 reads, 98.60% were mapped successfully back to the final persimmon genome, with a 93.78% properly paired mapping rate.

CEGMA (version 2.5), with 248 conserved core eukaryotic genes, and BUSCO (version 2.0)²³, with 1440 genes in embryophyta odb9, were used to assess the completeness and accuracy of the assembled persimmon genome.

Genome annotation

A pipeline that integrated de novo gene prediction, homology-based gene models and RNAseq-based gene models was used to predict protein-coding genes. For de novo prediction, the genes in the genome were predicted with four predictors: Genscan⁷³, Augustus (version 2.4)⁷⁴, GlimmerHMM (version 3.0.4)⁷⁵, GeneID (version 1.4)⁷⁶ and SNAP⁷⁷. For homology prediction, GeMoMa (version 1.3.1)⁷⁸ was used, with protein sequences from A. thaliana, O. sativa, Z. jujuba and C. sinensis and gene models with a minimum 50% coverage and 50% query/hit coverage.

Two complementary methods, one de novo based and the other homology based, were used to identify and classify transposable elements (TEs). LTR FINDER (version 1.05)⁷⁹, MITE-Hunter⁸⁰, RepeatScout (version 1.0.5)⁸¹ and PILER-DF (version 2.4)⁸² were run to construct the de novo repeat libraries. RepeatMasker (version open-4.0.6)⁸³ with WUBlast searches against the RepBase database (RepBase 19.06)⁸⁴ was applied to build the homology-based repeat library. To obtain a consensus library, PASTEClassifer⁸⁵ was used to process and classify the two libraries. To identify the TEs in the persimmon genome, RepeatMasker was employed, using the consensus library to mask the genome; the masked genome was used for gene prediction.

For RNA-seq prediction, processed reads were aligned to the reference genome using HISAT2⁸⁶. HISAT (version 2.0.4)⁸⁷ and Stringtie (version 1.2.3)⁸⁷ were used to assemble the transcripts. Trinity (version 2.1.1)⁸⁸ was applied for the de novo assembly of the RNA-seq reads, and the RNA-seq reads were analyzed with PASA (version 2.0.2)⁸⁹ for the prediction of unigenes. TransDecoder (version 2.0) (Haas, http://transdecoder.github.io) and GeneMarkS-T (version 5.1)⁹⁰ were used to predict open reading frames (ORFs). Finally, an integrated gene set was produced by EVM (version 1.1.1)⁹¹. The RNA-seq assemblies were employed to determine untranslated regions (UTRs). The longest transcripts for each locus were retained, and regions outside of the ORFs were designated as UTRs.

The Rfam database (version 12.0)⁹² and MirBase database⁹³ were searched with Infernal (version 1.1)⁹⁴ to detect microRNAs and rRNAs. tRNAscan-SE (version 1.3.1)⁹⁵ with eukaryote parameters was applied to predict transfer RNAs (tRNAs).

GenBlastA (version 1.0.4)⁹⁶ was used to identify homologous sequences in the genome by using the integrated gene set as the query, and GeneWise (version 2.4.1)⁹⁷ was used to define pseudogenes containing premature stop codons or frameshift mutations with 60% identity and coverage.

Protein sequences were compared against several databases, including NR⁹⁸, KOG⁹⁹, KEGG¹⁰⁰, and TrEMBL¹⁰¹, using BLAST (version 2.2.31)¹⁰² with an E-value cutoff of 1E-5 to annotate the functions of the predicted genes. GO annotations were assigned by BLAST2GO (version 2.5);¹⁰³ InterProScan (version 5.8)¹⁰⁴ and Hmmscan (HMMER, version 3.0)¹⁰⁵ were used for searching; and annotatemotifs and domains were annotated by searching the Pfam database¹⁰⁶, PROSITE database¹⁰⁷ and HAMAP database¹⁰⁸.

Comparative genomic analysis

The available protein sequence sets were collected from 14 sequenced plant species: Vitis vinifera (Vvinifera_145_Genoscope), Actinidia chinensis (v1.0), Musa acuminata, Solanum lycopersicum (SL2.40), Diospyros oleifera, Citrus sinensis, Glycine max (Soybean_Williams82), Carica papaya, Arabidopsis thaliana (TAIR10), Juglans regia, Fragaria vesca, Prunus persica (v2.0), Pyrus bretschneideri, and Malus domestica (v1.0). OrthoMCL (version 2.0; mcl inflation factor of 1.5)¹⁰⁹ was used to identify orthologous genes. Pairwise sequence similarities were calculated by all-against-all BLASTP (Blast + version 2.3.0)¹¹⁰ with a P-value cutoff of 1e−5 and a minimum match length of 50%.

MUSCLE¹¹¹ was used to perform the phylogenetic analysis and alignment using the coding sequences of common single-copy genes. The jModelTest output was used to select the protein model, and the evolutionary tree was constructed via the TIM2+I+G model. The divergence time was estimated with MCMCTREE of PAML (version 4.7a)¹¹². Gene family expansion and contraction were analyzed by using CAFÉ (version 4.2)¹¹³ with a maximum-likelihood model. The gene family clustering results and estimated divergence times between species were used. The value of the birth and death parameter (λ) was 0.02, and the P-value was 0.01.

MCscan was applied to detect the 4DTV gene pairs calculated using the HKY substitution model. MCscan with the YN00 program of the PAML package was used to detect the synonymous mutation rate (ks) of gene pairs.

Persimmon fruit deastringency treatment and transcriptome sequencing

Commercially mature persimmon fruit from 53 astringent types were harvested from the National Persimmon Germplasm Resources Nursery (Northwest A&F University, Shaanxi, China) in 2017. To remove the astringency from mature fruit, high CO₂ (95% CO₂ + 4% N₂ + 1% O₂, 1 d) treatment was applied as described by Zhu et al.^17,18, and fruit treated in air were used as the control (CK). The treatments were carried out with three biological replicates. At each sampling time, flesh samples were bulked and frozen in liquid nitrogen and stored at −80 °C for further use.

Fruit astringency was evaluated by two different methods. First, soluble tannins were visualized via the tannin printing method according to Min et al.¹¹⁴. The fruit were cut lengthwise immediately after treatment and then pressed onto 5% FeCl₂-soaked filter paper for 5 s. The intensity of the resulting black color after the removal of fruit indicated the soluble tannin content, and the filter paper was photographed. The soluble tannin contents of frozen samples were also quantified with the Folin-Ciocalteau reagent according to the method described by Yin et al.¹¹⁵. The results were calculated using a standard curve of tannin acid equivalents g⁻¹ fresh weight. All measurements were conducted with three biological replicates.

Four cultivars showing obvious differences in the rates of deastringency (e.g., rapid deastringency, Sigoushi and Luoyangfangtianshengshi; slow deastringency, Laopige, and Shijiazhuanglianhuashi) were selected for RNA-seq analysis. A total of 36 samples (0 d, 1 d for CK and CO₂-treated fruit) with three biological replicates were used for total RNA extraction via the CTAB protocol described by Chang et al.⁶⁴. An Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Cara, California, USA), a NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA), and a Qubit 2.0 fluorometer (ThermoFisher Scientific, Waltham, MA, US) were used to assess RNA integrity, purity, and concentrations, respectively. A TruSeq RNA Sample Preparation Kit was used to prepare RNA-seq libraries following the manufacturer’s instructions, and the libraries were sequenced on the Illumina HiSeq 2500 platform (Illumina, San Diego, CA, USA). Raw reads were trimmed to remove adaptors and increase quality. Reads <100 bp were discarded after trimming. Overall, each sample produced an average of 7.28 Gb clean data.

The clean reads were mapped to the genome by using HISAT2⁸⁶ with the default parameters. StringTie¹¹⁶ was used to assemble the transcripts. The fragments per kilobase of transcript per million fragments mapped (FPKM) values were used to measure gene expression. DEseq¹¹⁷ was applied to identify differentially expressed genes (DEGs). The false discovery rate was used to adjust the P-values. Genes with significant differences in expression (i.e., log₂ foldchange >1 and adjusted P-value < 0.01) were considered DEGs and were annotated with GO terms and KEGG pathways.

Coexpression networks were constructed using the WGCNA (v1.66) package in R (v3.5.2)¹¹⁸. Genes with expression values (FPKM) in any variable equal to or higher than 1 were selected for WGCNA. The automatic network construction function blockwiseModules () with the following settings: soft power 16, minModulesSize 30, mergeCutHeight 0.25 and default parameters for other settings, was used to construct the models. The soft power was chosen by the function pickSoftThreshold () in WGCNA. The correlations between module eigengenes, the first principle of each module, and the content of tannins were calculated with function cor () in R with default settings. Candidate hub genes in “tan”, “brown” and “yellow” were picked by thresholding at a value of 0.6. The coexpression network of 295 selected genes was visualized with Cytoscape (v3.7.1).

Real-time PCR

For real-time PCR, gene-specific oligonucleotide primers were designed, which are described in Supplementary Table 32. Melting curves and product resequencing were used to assess the quality and specificity of each pair of primers. Real-time PCR was carried out on a Bio-Rad CFX96 Real-Time PCR System using SsoFast^TM EvaGreen Supermix (Bio-Rad, USA) following the manufacturer’s instructions. The housekeeping gene DkACT (GenBank No. AB473616)¹⁵ was chosen as the internal control, and the 2^−ΔCt method was used to calculate the relative expression¹¹⁹.

Dual-luciferase assay

The in vivo regulatory effects of the previously characterized high-CO₂/hypoxia-responsive transcription factors on the newly identified deastringency-related gene (DkPK1) were investigated via dual-luciferase assays. The promoter of DkPK1 was cloned into the pGreen II 0800-LUC vector (LUC) (primers are listed in Supplementary Table 28), and the transcription factors were previously cloned into the pGreen II 002962-SK vector^17,18. All constructs were electroporated into Agrobacterium tumefaciens GV1301. The constructed SK and LUC plasmids were transiently expressed in tobacco (Nicotiana benthamiana) leaves as described by Min et al.⁹. The Dual-luciferase® Reporter Assay System 10-Pack kit (Promega, USA) was employed to analyze firefly luciferase and Renilla luciferase in tobacco leaves at 3 d after infiltration by using a GLOMAX^TM 96 Microplate Luminometer (Promega).

SLAF library construction and high-throughput sequencing

As mentioned above, it is difficult to obtain hybrid persimmon offspring due to the abortion of hybrid embryos in seeds. In this study, a population of 77 F1 plants derived from ‘Luotiantianshi’ × ‘Taishuu’ via the embryo rescue culture technique was used to construct the Diospyros kaki (2n = 6x = 90) genetic map. Young leaves were collected for DNA extraction. The cetyltrimethylammonium bromide method¹²⁰ was used to extract the total genomic DNA. DNA concentrations and qualities were estimated using a NanoDrop 2000 spectrophotometer (Thermo Scientific, Milan, Italy) and electrophoresis in agarose gels.

SLAF-seq was used for the rapid and effective discovery of SNP markers, employing an improved SLAF-seq strategy as described by Sun et al.⁵⁷. Two restriction enzymes (i.e., RsaI and HaeIII) were tested to identify the best enzyme for GBS (genotyping by sequencing) library construction by investigating the length and distribution of the resulting digested fragments of the genomic DNA of the two parents and the F1 population. Fragments of 314–414 bp were selected to generate paired-end reads (PE125 bp) on the Illumina HiSeq-Xten sequencing platform (Illumina, San Diego, CA, USA) by Biomarker Technologies, Beijing, China.

Genotyping and linkage map construction

SLAF marker identification and genotyping were performed using procedures described by Sun et al.⁵⁷. Briefly, low-quality reads (quality score <20) were filtered out, and then filtered reads were clustered according to sequence similarity. Identical reads were merged to avoid repeat-computing requirements, and sequences with over 90% identity were grouped into a single SLAF locus, as described by Sun et al.⁵⁷. Differences in high-depth fragments were defined as SNPs or indels. To construct a high-quality genetic map, we filtered the SLAFs using five criteria: (1) removal of SLAFs from parents where the sequencing depth was less than 10X ; (2) removal of SLAFs with more than three SNPs; (3) removal of SLAFs with an aa × bb segregation pattern; (4) removal of SLAFs missing more than 30% of offspring; and (5) removal of segregation-distorted markers (p < 0.01). A high-density genetic map was constructed by using the SLAFs that passed the five-step filtering process. SLAFs that followed the 1:1 Mendelian segregation pattern were used for genetic map construction.

The map was constructed with HighMap software¹²¹, which performs four procedures: module-linkage grouping, marker ordering, error genotyping correction, and map evaluation. The single-linkage clustering algorithm was used to cluster the markers into linkage groups. The error correction strategy by using SMOOTH was then conducted according to the parental contribution of genotypes¹²², and a k-nearest neighbor algorithm was applied to impute missing genotypes¹²³. Skewed markers were then added to this map by applying a multipoint method of maximum likelihood. Map distances were estimated using the Kosambi mapping function¹²⁴. Based on the D. oleifera genome, the SLAF sequences of the 11,204 mapped markers were aligned to the D. oleifera genome using BLASTN analysis with an e-value cutoff of 1e−5.

Accession numbers

The raw genomic sequence, transcriptome, and genetic map data have been deposited in the NCBI Sequence Read Archive under accession numbers PRJNA562043, PRJNA562975, and PRJNA563228.

References

Duangjai, S., Wallnöfer, B., Samuel, R., Munzinger, J. & Chase, M. W. Generic delimitation and relationships in Ebenaceae sensu lato: evidence from six plastid DNA regions. Am. J. Bot. 93, 1808–1827 (2006).
Article CAS PubMed Google Scholar
Frodin, D. G. History and concepts of big plant genera. Taxon 53, 753–776 (2004).
Article Google Scholar
Wang, R. Z., Yang, Y. & Li, G. C. Chinese persimmon germplasm resources. Acta Hortic. 436, 43–50 (1997).
CAS Google Scholar
Kanzaki, S. The origin and cultivar development of Japanese Persimmon (Diospyros kaki Thunb.). J. Jpn. Soc. Food Sci. Technol. 63, 328–330 (2016).
Article Google Scholar
Wu, Z. X. The Shanwang fossils with the passage of time. Openings 7, 74–79 (2005).
Google Scholar
Xin, S. Chinese Fruit History Research. pp 142–143 (Agricultural Publishing House: Beijing, 1983).
Yonemori, K., Sugiura, A. & Yamada, M. Persimmon genetics and breeding. Plant Breed. Rev. 19, 191–225 (2000).
Google Scholar
Yamada, M., Yamane, H., Sato, A., Hirakawa, N. & Wang, R. Z. Variations in fruit ripening time, fruit weight and soluble solids content of oriental persimmon cultivars native to Japan. J. Jpn. Soc. Hortic. Sci. 63, 485–491 (1994).
Article Google Scholar
Min, T. et al. Ethylene-responsive transcription factors interact with promoters of ADH and PDC involved in persimmon (Diospyros kaki) fruit de- astringency. J. Exp. Bot. 63, 6393–6405 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nakajima, A. & Sakaguchi, T. Uptake and recovery of gold by immobilized persimmon tannin. J. Chem. Technol. Biotechnol. 57, 321–326 (1993).
Article CAS PubMed Google Scholar
Sakaguchi, T. & Nakajima, A. Accumulation of uranium by immobilized persimmon tannin. Sep. Sci. Technol. 29, 205–221 (1994).
Article CAS Google Scholar
Nakajima, A. & Baba, Y. Mechanism of hexavalent chromium adsorption by persimmon tannin gel. Water Res. 38, 2859–2864 (2004).
Article CAS PubMed Google Scholar
Xiong, Y. et al. Selective recovery of precious metals by persimmon waste chemically modified with dimethylamine. Bioresour. Technol. 100, 4083–4089 (2009).
Article CAS PubMed Google Scholar
Inoue, K. et al. Hydrometallurgical recovery of precious metals and removal of hazardous metals using persimmon tannin and persimmon wastes. Metals 5, 1921–1956 (2015).
Article CAS Google Scholar
Akagi, T. et al. DkMyb4 is a Myb transcription factor Involved in proanthocyanidin biosynthesis in persimmon fruit. Plant Physiol. 151, 2028–2045 (2009).
Article CAS PubMed PubMed Central Google Scholar
Akagi, T., Ikegami, A. & Yonemori, K. Proanthocyanidin biosynthesis of persimmon (Diospyros kaki Thunb.) fruit. Sci. Hortic. 130, 373–380 (2011).
Article CAS Google Scholar
Zhu, Q. G. et al. A transcription factor network responsive to high CO₂/hypoxia is involved in deastringency in persimmon fruit. J. Exp. Bot. 69, 2061–2070 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Q. G. et al. High-CO₂/hypoxia-responsive transcription factors DkERF24 and DkWRKY1 interact and activate DkPDC2 promoter. Plant Physiol. 180, 6211–6633 (2019).
Article CAS Google Scholar
Akagi, T., Henry, I. M., Tao, R. & Comai, L. A Y-chromosome-encoded small RNA acts as a sex determinant in persimmons. Science 346, 646–650 (2014).
Article CAS PubMed Google Scholar
Guo, D. L. & Luo, Z. R. Genetic relationships of the Japanese persimmon Diospyros kaki (Ebenaceae) and related species revealed by SSR analysis. Genet. Mol. Res. 10, 1060–1068 (2011).
Article CAS PubMed Google Scholar
Tan, C. X. Virgin avocado oil: an emerging source of functional fruit oil. J. Funct. Foods 54, 381–392 (2019).
Article CAS Google Scholar
Fu, J. M. et al. Five complete chloroplast genome sequences from Diospyros: genome organization and comparative analysis. PLoS ONE 11, e0159566 (2016).
Article PubMed PubMed Central CAS Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed CAS Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Ikegami, A., Eguchi, S., Kitajima, A., Inoue, K. & Yonemori, K. Identification of genes involved in proanthocyanidin biosynthesis of persimmon (Diospyros kaki) fruit. Plant Sci. 172, 1037–1047 (2007).
Article CAS Google Scholar
Salvador, A. et al. Physiological and structural changes during ripening and deastringency treatment of persimmon fruit cv. ‘Rojo Brillante’. Postharvest Biol. Technol. 46, 181–188 (2007).
Article CAS Google Scholar
Tanaka, T., Takahashi, R., Kouno, I. & Nonaka, G. Chemical evidence for the de-astringency (insolubilization of tannins) of persimmon fruit. J. Chem. Soc. 20, 3013–3022 (1994).
Google Scholar
Taira, S., Ikeda, K. & Ohkawa, K. Comparison of insolubility of tannins induced by acetaldehyde vapor in fruit of three types of astringent persimmon. J. Jpn. Soc. Hortic. Sci. 48, 684–687 (2001).
CAS Google Scholar
Lasanthi-Kudahettige, R. et al. Transcript profiling of the anoxic rice coleoptile. Plant Physiol. 144, 218–231 (2007).
Article CAS PubMed PubMed Central Google Scholar
Kyozuka, J., Olive, M., Peacock, W. J., Dennis, E. S. & Shimamoto, K. Promoter elements required for developmental expression of the maize Adh1 gene in transgenic rice. Plant Cell 6, 799–810 (1994).
CAS PubMed PubMed Central Google Scholar
Xu, Y. et al. Optimization of immature embryo rescue technique system for persimmon. J. Biol. 34, 110–115 (2017).
Google Scholar
Peyrat, L. et al. Chemical diversity and antiviral potential in the pantropical Diospyros genus. Fitoterapia 112, 9–15 (2016).
Article CAS PubMed Google Scholar
Fan, R. Y. et al. Plant tannin immobilized Fe₃O₄@SiO₂ microspheres: a novel and green magnetic bio-sorbent with superior adsorption capacities for gold and palladium. J. Hazard Mater. 364, 780–789 (2019).
Article CAS PubMed Google Scholar
The Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).
Article CAS Google Scholar
Yuan, Z. H. et al. (2018). The pomegtanate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology. Plant Biotechnol. J. 16, 1363–1374 (2018).
Article CAS PubMed PubMed Central Google Scholar
Paterson, A. H. et al. The Sorghum bicolor genome and the diversification of grasses. Nature 457, 551–556 (2009).
Article CAS PubMed Google Scholar
Velasco, R. et al. The genome of the domesticated apple (Malus X domestica Borkh.). Nat. Genet. 42, 833–841 (2010).
Article CAS PubMed Google Scholar
The French-Italian Public Consortium for grapevine Genome Characterization. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–468 (2007).
Article CAS Google Scholar
The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000).
Article Google Scholar
International Rice Genome Sequencing Project. The map-based sequence of the rice genome. Nature 436, 793–800 (2005).
Article CAS Google Scholar
Huang, S. X. et al. Draft genome of the kiwifruit Actinidia chinensis. Nat. Commun. 4, 2640 (2013).
Article PubMed CAS Google Scholar
Vogt, T. Phenylpropanoid biosynthesis. Mol. Plant 3, 2–20 (2010).
Article CAS PubMed Google Scholar
Richard, G. F., Kerrest, A. & Dujon, B. Comparative genomics and molecular dynamics of DNA repeats in Eukaryotes. Microbiol. Mol. Biol. Rev. 72, 686–727 (2008).
Article CAS PubMed PubMed Central Google Scholar
Field, B. & Osbourn, A. E. Metabolic diversification-independent assembly of operon-like gene clusters in different plants. Science 320, 543–547 (2008).
Article CAS PubMed Google Scholar
Rocha, E. P. C. The organization of the bacterial genome. Annu. Rev. Genet. 42, 211–233 (2008).
Article CAS PubMed Google Scholar
Osbourn, A. Secondary metabolic gene clusters: evolutionary toolkits for chemical innovation. Trends Genet. 26, 449–457 (2010).
Article CAS PubMed Google Scholar
Takos, A. M. & Rook, F. Why biosynthetic genes for chemical defense compounds cluster. Trends Plant Sci. 17, 383–388 (2012).
Article CAS PubMed Google Scholar
Mustroph, A. et al. Profiling translatomes of discrete cell population resolves altered cellular priorities during hypoxia in Arabidopsis. Proc. Natl Acad. Sci. USA 106, 18843–18848 (2009).
Article CAS PubMed PubMed Central Google Scholar
Fan, L. P. et al. A high-density genetic map of extra-long staple cotton (Gossypium barbadense) constructed using genotyping-by-sequencing based single nucleotide polymorphic markers and identification of fiber traits-related QTL in a recombinant inbred line population. BMC Genomics 19, 489 (2018).
Article PubMed PubMed Central Google Scholar
Gao, W. et al. A genetic linkage map of Pleurotus tuoliensis integrated with physical mapping of the de novo sequenced genome and the mating type loci. BMC Genomics 19, 18 (2018).
Article PubMed PubMed Central CAS Google Scholar
Wang, Z. H. et al. Development of a high-density genetic map based on specific length amplified fragment sequencing and its application in quantitative trait loci analysis for yield-related traits in cultivated peanut. Front. Plant Sci. 9, 827 (2018).
Article PubMed PubMed Central Google Scholar
Badenes, M. et al. Genetic diversity of introduced and local Spanish persimmon cultivars revealed by RAPD markers. Genet. Resour. Crop Evol. 50, 579–585 (2003).
Article CAS Google Scholar
Yonemori, K. et al. Relationship of European persimmon (Diospyros kaki Thunb.) cultivars to Asian cultivars, characterized using AFLPs. Genet. Resour. Crop Evol. 55, 81–89 (2008).
Article CAS Google Scholar
Du, X. Y., Zhang, Q. L. & Luo, Z. R. Development of retrotransposon primers and their utilization for germplasm identification in Diospyros spp. (Ebenaceae). Tree Genet. Genomes 5, 235–245 (2009).
Article Google Scholar
Kanzaki, S. et al. SCAR markers for practical application of marker-assisted selection in persimmon (Diospyros kaki Thunb.) Breeding. J. Jpn. Soc. Hortic. Sci. 79, 150–155 (2010).
Article CAS Google Scholar
Onoue, N., Kobayashi, S., Kono, A. & Sato, A. SSR-based molecular profiling of 237 persimmon (Diospyros kaki Thunb.) germplasms using an ASTRINGENCY-linked marker. Tree Genet. Genomes 14, 28 (2018).
Article Google Scholar
Sun, X. W. et al. SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS ONE 8, e58700 (2013).
Article CAS PubMed PubMed Central Google Scholar
Huang, H. H. et al. High-density genetic map construction and comparative genome analysis in asparagus bean. Sci. Rep. 8, 4836 (2018).
Article PubMed PubMed Central CAS Google Scholar
Gabay, G. et al. High-resolution genetic linkage map of European pear (Pyrus communis) and QTL fine-mapping of vegetative budbreak time. BMC Plant Biol. 18, 175 (2018).
Article PubMed PubMed Central CAS Google Scholar
Choi, Y. A., Tao, R., Yonemori, K. & Sugiura, A. Genomic in situ hybridization between persimmon (Diospyros kaki) and several wild species of Diospyros. J. Jpn. Soc. Hortic. Sci. 72, 385–399 (2003).
Article CAS Google Scholar
Yonemori, K. et al. Sequence analyses of the ITS regions and the matK gene for determining phylogenetic relationships of Diospyros kaki (persimmon) with other wild Diospyros (Ebenaceae) species. Tree Genet. Genomes 4, 149–158 (2008).
Article Google Scholar
Doležel, J., Greilhuber, J. & Suda, J. Estimation of nuclear DNA content in plants using flow cytometry. Nat. Protoc. 2, 2233–2244 (2007).
Article PubMed CAS Google Scholar
Schnable, P. S. et al. The B73 maize genome: complexity, diversity, and dynamics. Science 326, 1112–1115 (2009).
Article CAS PubMed Google Scholar
Chang, S., Puryear, J. & Cairney, J. A simple and efficient method for isolating RNA from pine trees. Plant Mol. Biol. Rep. 11, 113–116 (1993).
Article CAS Google Scholar
Gnerre, S. & Jaffe, D. B. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc. Nat. Acad. Sci. USA 108, 1513–1518 (2011).
Article CAS PubMed Google Scholar
Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011).
Article CAS PubMed Google Scholar
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18 (2012).
Article PubMed PubMed Central Google Scholar
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chin, C. S. et al. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Meth. 13, 1050–1056 (2016).
Article CAS Google Scholar
Chakraborty, M., Baldwin-Brown, J. G., Long, A. D. & Emerson, J. J. Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. Nucleic Acids Res. 44, e147 (2016).
Article PubMed PubMed Central CAS Google Scholar
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
Article PubMed PubMed Central CAS Google Scholar
Burton, J. N. et al. Chromosome-scale of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 31, 1119–1125 (2013).
Article CAS PubMed PubMed Central Google Scholar
Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997).
Article CAS PubMed Google Scholar
Stanke, M. & Waack, S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19(Suppl 2), ii215–ii225 (2003).
Article PubMed Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
Article CAS PubMed Google Scholar
Blanco, E., Parra, G. & Guigó, R. Using geneid to identify genes. Curr. Protoc. Bioinforma. Chapter 4, Unit 4.3 (2007).
Korf, I. Gene finding in novel genomes. BMC Bioinforma. 5, 59 (2004).
Article Google Scholar
Keilwagen, J. et al. Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 44, e89 (2016).
Article PubMed PubMed Central CAS Google Scholar
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 35, W265–W268 (2007).
Article PubMed PubMed Central Google Scholar
Han, Y. & Wessler, S. R. MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences. Nucleic Acids Res. 38, e199 (2010).
Article PubMed PubMed Central CAS Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21(Suppl 1), i351–i358 (2005).
Article CAS PubMed Google Scholar
Edgar, R. C. & Myers, E. W. PILER: identification and classification of genomic repeats. Bioinformatics 21(Suppl 1), i152–i158 (2005).
Article CAS PubMed Google Scholar
Tarailo-Graovac, M. & Chen, N. S. Using repeatmasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinforma. 25, 4.10.1–4.10.14 (2009).
Article Google Scholar
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Hoede, C. et al. PASTEC: an automatic transposable element classification tool. PLoS ONE 9, e91929 (2014).
Article PubMed PubMed Central CAS Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Trinity: reconstructing a full-lentgh transcriptome without a genome from RNA-Seq data. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Campbell, M. A., Haas, B. J., Hamilton, J. P., Mount, S. M. & Buell, C. R. Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics 7, 327 (2006).
Article PubMed PubMed Central CAS Google Scholar
Tang, H. et al. ALLMAPS: robust scaffold ordering based onmultiple maps. Genome Biol. 16, 3 (2015).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).
Article PubMed PubMed Central CAS Google Scholar
Kozomara, A. & Griffithsjones, S. MiRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42, D68–D73 (2014).
Article CAS PubMed Google Scholar
Griffiths-Jones, S., Grocock, R. J., van Dongen, S., Bateman, A. & Enright, A. J. MiRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 34, D140–D144 (2006).
Article CAS PubMed Google Scholar
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lowe, T. M. & Eddy, S. R. TRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Article CAS PubMed PubMed Central Google Scholar
Rong, S., Chu, J. S. C., Ke, W., Jian, P. & Chen, N. GenBlastA: enabling BLAST to identify homologous gene sequences. Genome Res. 19, 143–149 (2009).
Google Scholar
Birney, E., Clamp, M. & Durbin, R. Genewise and genomewise. Genome Res. 14, 988 (2004).
Article CAS PubMed PubMed Central Google Scholar
Marchler-Bauer, A. et al. CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 39, D225–D229 (2011).
Article CAS PubMed Google Scholar
Tatusov, R. L. et al. The COG database: new development in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29, 22–28 (2001).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676 (2005).
Article CAS PubMed Google Scholar
Zdobnov, E. & Apweiler, R. InterProScan- an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17, 847–848 (2001).
Article CAS PubMed Google Scholar
Mistry, J., Finn, R. D., Eddy, S. R., Bateman, A. & Punta, M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e121 (2013).
Article CAS PubMed PubMed Central Google Scholar
Finn, R. D. et al. Pfam: clans, web tools and services. Nucleic Acids Res. 34, 247–251 (2006).
Article CAS Google Scholar
Bairoch, A. PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res. 19, 2241–2245 (1991).
Article CAS PubMed PubMed Central Google Scholar
Lima, T. et al. HAMAP: a database of completely sequenced microbial proteome sets and manually curated microbial protein families in UniProtKB/Swiss-Prot. Nucleic Acids Res. 37, D471–D478 (2009).
Article CAS PubMed Google Scholar
Li, L., Stoeckert, C. J. & Roos, D. S. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13, 2178–2189 (2003).
Article CAS PubMed PubMed Central Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinforma. 10, 421 (2009).
Article CAS Google Scholar
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article CAS PubMed Google Scholar
De Bie, T., Cristianini, N., Demuth, J. & Hahn, M. CAFÉ: a computational tool for the study of gene family evolution. Bioinformatics 22, 1269–1271 (2006).
Article PubMed CAS Google Scholar
Min, T. et al. Isolation and expression of NAC genes during persimmon fruit postharvest astringency removal. Int. J. Mol. Sci. 16, 1894–1906 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yin, X. R. et al. Expression of ethylene response genes during persimmon fruit astringency removal. Planta 235, 895–906 (2012).
Article CAS PubMed Google Scholar
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Article CAS PubMed PubMed Central Google Scholar
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinfomatics 9, 559 (2008).
Article CAS Google Scholar
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2^−ΔΔCT method. Methods 25, 402–408 (2001).
Article CAS PubMed Google Scholar
Doyle, J. & Doyle, J. L. Isolation of plant DNA from fresh tissue. Focus 12, 13–15 (1990).
Google Scholar
Liu, D. Y. et al. Construction and analysis of high-density linkage map using high-throughput sequencing data. PLoS ONE 9, e98855 (2014).
Article PubMed PubMed Central CAS Google Scholar
van Os, H., Stam, P., Visser, R. G. F. & van Eck, H. J. SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor. Appl. Genet. 112, 187–194 (2005).
Article CAS PubMed Google Scholar
Huang, X. H. et al. Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nat. Genet. 44, 32–39 (2012).
Article CAS Google Scholar
Kosambi, D. D. The estimation of map distances from recombination values. Ann. Eugen. 12, 172–175 (1943).
Article Google Scholar

Download references

Acknowledgements

This research was supported by the National Key Research and Development Program (2016YFD0400102), the National Natural Science Foundation of China (31672204, 31722042), the Fok Ying Tung Education Foundation, China (161028), Key Agricultural New Varieties Breeding Projects funded by the Zhejiang Province Science and Technology Department (2016C02052-10), and Fundamental Research Funds of CAF (CAFYBB2017SY015).

Author information

These authors contributed equally: Qing-gang Zhu, Yang Xu

Authors and Affiliations

Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Zhejiang University, Zijingang Campus, Hangzhou, 310058, PR China
Qing-gang Zhu, Qiu-yun Zhang, Jing-wen Huang, Don Grierson, Kun-song Chen & Xue-ren Yin
State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Zijingang Campus, Hangzhou, 310058, PR China
Qing-gang Zhu, Qiu-yun Zhang, Jing-wen Huang, Kun-song Chen & Xue-ren Yin
Research Institute of Subtropical Forestry, Chinese Academy of Forestry, Hangzhou, 311400, PR China
Yang Xu & Bang-chu Gong
College of Horticulture, Northwest A&F University, Yangling, PR China
Yong Yang & Chang-fei Guan
Plant & Crop Sciences Division, School of Biosciences, University of Nottingham, Sutton Bonington Campus, Loughborough, UK
Don Grierson

Authors

Qing-gang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yang Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yong Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chang-fei Guan
View author publications
You can also search for this author in PubMed Google Scholar
Qiu-yun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jing-wen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Don Grierson
View author publications
You can also search for this author in PubMed Google Scholar
Kun-song Chen
View author publications
You can also search for this author in PubMed Google Scholar
Bang-chu Gong
View author publications
You can also search for this author in PubMed Google Scholar
Xue-ren Yin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Y. conceived the project and designed the experiments. Q.g.Z., B.G., and Y.X. performed most of the experiments and analyzed the data. Y.Y. and C.G. collected the samples of different persimmon cultivars. Q.y.Z. and J.H. analyzed the RNA-seq data. Q.g.Z. and Y.X. drafted the paper with contributions from all the authors. D.G., K.C., and X.Y. revised the paper.

Corresponding authors

Correspondence to Bang-chu Gong or Xue-ren Yin.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Supplementary information

Supplemental figures

Supplemental Tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhu, Qg., Xu, Y., Yang, Y. et al. The persimmon (Diospyros oleifera Cheng) genome provides new insights into the inheritance of astringency and ancestral evolution. Hortic Res 6, 138 (2019). https://doi.org/10.1038/s41438-019-0227-2

Download citation

Received: 02 November 2019
Accepted: 27 November 2019
Published: 18 December 2019
DOI: https://doi.org/10.1038/s41438-019-0227-2

This article is cited by

Full-length transcriptome profiling for fruit development in Diospyros oleifera using nanopore sequencing
- Yang Xu
- Cui-yu Liu
- Bang-chu Gong
BMC Genomic Data (2023)
Allele-aware chromosome-level genome assembly of the autohexaploid Diospyros kaki Thunb
- Huawei Li
- Peng Sun
- Jianmin Fu
Scientific Data (2023)