Chromosome-scale genome assembly of sweet cherry (Prunus avium L.) cv. Tieton obtained using long-read and Hi-C sequencing

Wang, Jiawei; Liu, Weizhen; Zhu, Dongzi; Hong, Po; Zhang, Shizhong; Xiao, Shijun; Tan, Yue; Chen, Xin; Xu, Li; Zong, Xiaojuan; Zhang, Lisi; Wei, Hairong; Yuan, Xiaohui; Liu, Qingzhong

doi:10.1038/s41438-020-00343-8

Download PDF

Article
Open access
Published: 01 August 2020

Chromosome-scale genome assembly of sweet cherry (Prunus avium L.) cv. Tieton obtained using long-read and Hi-C sequencing

Jiawei Wang ORCID: orcid.org/0000-0002-5598-2115¹^na1,
Weizhen Liu ORCID: orcid.org/0000-0002-0780-0284²^na1,
Dongzi Zhu¹^na1,
Po Hong ORCID: orcid.org/0000-0001-6238-4554¹,
Shizhong Zhang³,
Shijun Xiao^2,4,
Yue Tan¹,
Xin Chen¹,
Li Xu¹,
Xiaojuan Zong¹,
Lisi Zhang¹,
Hairong Wei¹,
Xiaohui Yuan² &
…
Qingzhong Liu¹

Horticulture Research volume 7, Article number: 122 (2020) Cite this article

5488 Accesses
43 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Sweet cherry (Prunus avium) is an economically significant fruit species in the genus Prunus. However, in contrast to other important fruit trees in this genus, only one draft genome assembly is available for sweet cherry, which was assembled using only Illumina short-read sequences. The incompleteness and low quality of the current sweet cherry draft genome limit its use in genetic and genomic studies. A high-quality chromosome-scale sweet cherry reference genome assembly is therefore needed. A total of 65.05 Gb of Oxford Nanopore long reads and 46.24 Gb of Illumina short reads were generated, representing ~190x and 136x coverage, respectively, of the sweet cherry genome. The final de novo assembly resulted in a phased haplotype assembly of 344.29 Mb with a contig N50 of 3.25 Mb. Hi-C scaffolding of the genome resulted in eight pseudochromosomes containing 99.59% of the bases in the assembled genome. Genome annotation revealed that more than half of the genome (59.40%) was composed of repetitive sequences, and 40,338 protein-coding genes were predicted, 75.40% of which were functionally annotated. With the chromosome-scale assembly, we revealed that gene duplication events contributed to the expansion of gene families for salicylic acid/jasmonic acid carboxyl methyltransferase and ankyrin repeat-containing proteins in the genome of sweet cherry. Four auxin-responsive genes (two GH3s and two SAURs) were induced in the late stage of fruit development, indicating that auxin is crucial for the sweet cherry ripening process. In addition, 772 resistance genes were identified and functionally predicted in the sweet cherry genome. The high-quality genome assembly of sweet cherry obtained in this study will provide valuable genomic resources for sweet cherry improvement and molecular breeding.

The complex polyploid genome architecture of sugarcane

Article Open access 27 March 2024

A. L. Healey, O. Garsmeur, … A. D’Hont

A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range

Article Open access 11 April 2024

Qichao Lian, Bruno Huettel, … Raphael Mercier

Technology-enabled great leap in deciphering plant genomes

Article 20 March 2024

Lingjuan Xie, Xiaojiao Gong, … Longjiang Fan

Introduction

Sweet cherry (Prunus avium), originating from the Caspian and Black Sea regions, is an economically significant fruit species worldwide. Annual global sweet cherry production is ~2.2 million tons¹. Sweet cherry was first introduced to China in the 1870s, and its widespread cultivation began in the 1990s. In the last year, the total planting area of sweet cherry in China reached 233,300 ha, with production of ~1.2 million tons². In addition, there have been dedicated breeding efforts for this crop. The traditional breeding cycle of a sweet cherry cultivar is more than 15 years long, and marker-assisted selection and genomic selection are current candidate strategies for speeding up the breeding cycle³. Although sweet cherry has a simple, compact genome (2n = 2x = 16), only one draft genome assembly has been reported to date⁴. Owing to the use of short-read sequencing technology, the draft genome assembly exhibited a small scaffold N50 of 219.6 Kb and a low genome coverage of 77.8%.

Within the Prunus genus, the Chinese plum (Mei, Prunus mume) genome was first sequenced in 2012⁵. Since then, several other important fruit species have been sequenced, such as peach (Prunus persica)^6,7, sweet cherry (Prunus avium)⁴, almond (Prunus dulcis)⁸, flowering cherry (Prunus yedoensis)⁹, a flowering cherry interspecific hybrid (‘Somei-Yoshino’, Cerasus × yedoensis)¹⁰, apricot (Prunus armeniaca)¹¹ and European plum (Prunus domestica)¹². All the sequenced genomes exhibit high coverage, ranging from 84.6% in Chinese plum⁵ to 126% in flowering cherry (P. yedoensis)⁹, with the exception of the genome assembly of sweet cherry cv. Satonishiki (77.8%)⁴. In other assemblies, including those of peach, almond, apricot, and Chinese plum, chromosome-level scaffolds with fewer gaps than in the assembly of sweet cherry cv. Satonishiki have also been constructed.

In this study, we aimed to assemble a high-quality chromosome-scale reference genome for sweet cherry using Oxford Nanopore technology (ONT) combined with short Illumina sequencing reads and Hi-C scaffolding. The high-quality genome assembly of sweet cherry will facilitate molecular breeding of sweet cherry and advance our understanding of the genetics and evolution of the Prunus genus.

Results

Sequencing and assembly of the sweet cherry cv. Tieton genome

The sweet cherry cultivar ‘Tieton’ (cv. Tieton) was used for whole-genome sequencing and chromosome-scale assembly. After filtering low-quality reads, a total of 65.05 Gb of Oxford Nanopore long reads and 46.24 Gb of Illumina short reads were obtained. The sequencing details are provided in Table S1.

Several approaches were applied to assemble the genome of sweet cherry cv. Tieton. The assembly statistics and Benchmarking Universal Single-Copy Orthologs (BUSCOs, v3.0.2)¹³ analysis results are shown in Table 1. The NECAT-medaka-NextPolish-Purge_Haplotigs assembly was chosen for further analysis because it showed the best quality. The genome assembly of sweet cherry cv. Tieton exhibited a total size of 344.29 Mb, consisting of 610 contigs with a contig N50 size of 3.25 Mb and a largest contig size of 13.6 Mb. The statistics for our sweet cherry cv. Tieton genome assembly are shown in Table 2.

Table 1 Statistics of different assemblies and BUSCO analysis results for the sweet cherry cv. Tieton genome

Full size table

Table 2 Statistics of the genome assembly for sweet cherry (Prunus avium) cv. Tieton

Full size table

Hi-C scaffolding generated a total of 134,298,049 read pairs (40.19 Gb) with a Q30 of 92.38%. After mapping the Hi-C reads against the assembly of sweet cherry cv. Tieton, 38.91 million valid interaction pairs, accounting for 82.12% of the unique mapped read pairs, were used for the Hi-C analysis. The statistics of the mapping details for the Hi-C data are shown in Table S2. ALLHiC v0.9.12¹⁴ was applied to construct the chromosomal-level scaffolds, and eight chromosomal-level scaffolds were generated, with lengths ranging from 30.63 to 62.32 Mb, accounting for 99.59% of the assembly. The statistics for the final pseudochromosomes of the sweet cherry cv. Tieton genome are shown in Table 2, and the Hi-C contact map is shown in Fig. 1. To the best of our knowledge, this is the first chromosomal-level genome assembly for sweet cherry.

**Fig. 1: Hi-C interaction heatmap for the sweet cherry (*Prunus avium*) cv. Tieton genome.**

Genome assembly quality evaluation

The LTR assembly index (LAI), which was first introduced by Ou et al.¹⁵, is a newly developed reference-free genome metric that evaluates genome assembly continuity using long-terminal-repeat retrotransposons (LTR-RTs). The LAI is used to compare assembly quality between different species independent of genome size and total scaffold size, and its reliability has been proven in more than 40 plant genomes¹⁵. The LAI and contig N50 results for different genome assemblies within Prunus are shown in Table S3. Among the genome assemblies of sweet cherry cv. Tieton (this study), sweet cherry cv. Satonishiki⁴, peach v2.0⁶, flowering cherry (P. yedoensis)⁹, a flowering cherry interspecific hybrid (‘Somei-Yoshino’, Cerasus × yedoensis)¹⁰, Chinese plum⁵, almond⁸, apricot¹¹, and European plum¹², our sweet cherry cv. Tieton genome assembly shows the highest raw LAI (24.45) and LAI (19.68), nearly reaching the “gold standard” of genome assembly proposed by Ou et al.¹⁵. Among other Prunus species, only peach v2.0 (LAI = 18.79)⁶ and the newly assembled apricot genome (LAI = 16.29)¹¹ exhibit similar scores, in agreement with their high assembly qualities. However, the previous genome assembly of sweet cherry cv. Satonishiki presented a raw LAI score of only 2.98 and failed to receive an LAI score due to its fragmented assembly.

The BUSCO analysis result, as shown in Table 1, implies the completeness of our sweet cherry cv. Tieton genome assembly. The NECAT-medaka-NextPolish-Purge_Haplotigs assembly identified 97.4% of the complete BUSCOs, corresponding to the greatest number of BUSCOs among the post-Purge_Haplotigs assemblies. Although the two MaSuRCA assemblies identified more complete BUSCOs, they also identified more duplicated BUSCOs.

As shown in Fig. S1, at kmers = 37, the sweet cherry cv. Tieton genome was estimated to be 340.05 Mb in size, with a heterozygosity of 0.409%, which is very close to the genome size of 338 Mb estimated by flow cytometry¹⁶. The slightly larger genome size of our assembly might be due to the heterozygosity of the sweet cherry genome. In addition, this result indicates the completeness of our cv. Tieton genome assembly.

Annotation of the sweet cherry cv. Tieton genome

We used homology-based, de novo and RNA-seq methods for protein-coding gene prediction and functional annotation. A total of 38,275 gene models encoding 40,338 proteins were predicted in the sweet cherry cv. Tieton genome (Table 3 and Fig. 2). A total of 30,416 of 40,338 proteins (75.40%) were annotated by using the EggNOG v4.5.1¹⁷, Pfam v32.0¹⁸, UniProt v2019_10¹⁹, KEGG²⁰, Gene Ontology²¹, COG²², BUSCO v2.0¹³, MEROPS v12.0²³, Phobius v1.01²⁴, SignalP v4.1²⁵, and CAZyme v8.0²⁶ databases or pipelines.

Table 3 Statistics of gene prediction and functional annotation for the sweet cherry (Prunus avium) cv. Tieton genome

Full size table

**Fig. 2: Characterization of sweet cherry cv. Tieton and its genome synteny with cv. Satonishiki.**

Compared with the former genome annotation of sweet cherry cv. Satonishiki⁴, our genome annotation of cv. Tieton predicted fewer gene models but with a longer total coding sequence (CDS) length. The CDS size distributions in the two genome annotations are shown in Fig. S2, and the statistics of the annotation profiles are shown in Table S4. According to Fig. S2, the genome annotation of sweet cherry cv. Satonishiki predicted more gene models with a length shorter than 650 bp, but our genome annotation of sweet cherry cv. Tieton predicted more gene models with a length longer than 650 bp. There were also more genes annotated by GO and KEGG pathways in our genome annotation of sweet cherry cv. Tieton (Table 3). These results suggest a better genome assembly and annotation of sweet cherry cv. Tieton than of cv. Satonishiki.

Repetitive sequence annotation showed a total content of 204.55 Mb, indicating that 59.40% of the sweet cherry cv. Tieton genome was repetitive (Table S5 and Fig. 2). Among these repetitive elements, LTR retrotransposons (19.71%) were the predominant component (8.50% Copia, followed by 7.59% Gypsy), whereas EnSpm (1.83%) was the most abundant class of DNA transposons. Our sweet cherry cv. Tieton genome exhibits a higher repetitive rate than the genome assembly of cv. Satonishiki (43.8%)⁴. The high repetitive rate in the sweet cherry genome may contribute to the poor genome assembly of sweet cherry cv. Satonishiki because short-read technology is limited in resolving highly repetitive sequences²⁷.

The noncoding RNA prediction annotated 3825 noncoding RNAs, including 621 ribosomal RNAs, 1905 transfer RNAs, 1114 small nuclear RNAs, and 131 microRNAs, in the sweet cherry cv. Tieton genome (Table S6).

Synteny analysis between the genome of sweet cherry cv. Tieton and the former draft genome of cv. Satonishiki

We performed a synteny analysis between the genomes of sweet cherry cv. Tieton and cv. Satonishiki, and the results are shown in Fig. 2. The genome of sweet cherry cv. Tieton shows good synteny with the genome of cv. Satonishiki. However, because of the fragmented genome assembly of cv. Satonishiki, many genes are not anchored to pseudomolecules, especially in high-repeat-content regions. This result also suggests the good quality of our genome assembly of sweet cherry cv. Tieton.

Evolution of the sweet cherry genome

A series of evolutionary analyses were conducted by comparing the sweet cherry cv. Tieton genome with the genomes of other representative species in Prunus, including flowering cherry (P. yedoensis)⁹, peach v2.0⁶, almond⁸, apricot¹¹, and Chinese plum⁵. Gene family clustering analysis assigned 172,683 proteins to 25,768 orthogroups, accounting for 92.6% (186,437) of the total proteins. The statistics for the orthology analysis results are shown in Table S7. There were 461 species-specific orthogroups in the sweet cherry genome, consisting of 2797 proteins. In addition, 816 orthogroups, accounting for 2850 proteins, were found to be shared between sweet cherry and flowering cherry. GO enrichment analysis of these genes revealed that catalytic and binding processes in the cellular component category and cellular and metabolic processes in the biological process category were the most enriched GO terms. The GO enrichment results are shown in Fig. S3.

The gene duplication event analysis revealed that sweet cherry and flowering cherry exhibited the largest numbers of gene duplication events, which were 10,618 and 9659, respectively (Fig. S4). However, 1329 gene duplication events were identified in peach, 4611 in apricot, and 3139 in Chinese plum. Although only 18,169 genes were identified in the almond genome⁸, 2989 gene duplication events were detected.

We also explored the genome synteny between the sweet cherry cv. Tieton and the other representative species. As shown in Fig. 3, our genome assembly of sweet cherry cv. Tieton exhibits a high level of genome synteny with all the other Prunus genomes, especially peach v2.0⁶. Only a few blocks were assigned to different chromosomes between our sweet cherry assembly and the peach assembly. Since peach is a genetically well-characterized model for research in Prunus species²⁸ and high-level genome synteny is expected in this genus²⁹, the highest genome synteny with peach v2.0⁶ suggests the good quality of our sweet cherry genome assembly.

**Fig. 3: Chromosome-level collinearity patterns between sweet cherry cv. Tieton, peach, almond, apricot and Chinese plum.**

The expansion and contraction of the gene families in sweet cherry cv. Tieton were analyzed by comparing the gene families with those in the other representative species, and the results are shown in Fig. S4. Compared with those in the other Prunus species, 1489 gene families in the sweet cherry cv. Tieton genome were expanded, and 1909 gene families were contracted. The sweet cherry and flowering cherry clade showed more gene family expansions and fewer gene family contractions than the other species in Prunus, with 1157 expanded gene families and 599 contracted gene families.

Of the 1489 expanded gene families in the sweet cherry cv. Tieton genome, 108 were significantly expanded, consisting of 1125 genes (Table S8). These expanded genes exhibit diverse functions. The 121 genes (10.76%) were annotated in the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database and were assigned to 37 pathways (Table S9). The five largest pathway categories were alpha-Linolenic acid metabolism (26 genes, ko00592), Ribosome (24 genes, ko03010), Spliceosome (21 genes, ko03040), Plant hormone signal transduction (11 genes, ko04075), and ABC transporters (7 genes, ko02010). In the alpha-linolenic acid metabolism pathways, 26 genes were identified as salicylate carboxymethyltransferase genes in the genome of sweet cherry cv. Tieton. These 26 genes were clustered with another 3 genes in sweet cherry into one gene family, which were identified as salicylic acid carboxyl methyltransferase (SAMT)/jasmonic acid carboxyl methyltransferase (JMT) genes. Only four genes in flowering cherry⁹, 11 genes in peach v2.0⁶, three genes in almond⁸, six genes in apricot¹¹, and seven genes in Chinese plum⁵ were clustered in this gene family. Its expansion in the sweet cherry genome was mainly caused by two gene duplicate events occurring on chromosome 8. As shown in Fig. 4a, five of eight SAMTs in the peach genome were duplicated two times, corresponding to 20 of 24 SAMTs in the sweet cherry genome. SAMT converts salicylic acid to methyl salicylate, which acts as a long-distance mobile signal for systemic acquired resistance (SAR) in the uninfected part of the plant³⁰. The expansion of SAMTs in sweet cherry suggests that salicylic acid may play important roles in the defense response against biotic and abiotic stresses. It is interesting that downstream nonexpressor of pathogenesis-related (NPR) genes in the salicylic acid signaling pathway were also identified in these significantly expanded gene families. In the plant hormone signal transduction pathways, 11 genes were annotated as ankyrin repeat-containing (ANK)-NPR-like proteins, which were clustered with 42 other ANKs in the genome of sweet cherry. ANKs constitute a large multigene family in higher plants and play important roles in protein–protein interactions in many developmental processes and abiotic and biotic stress resistance³¹. These 53 ANKs in the sweet cherry genome were clustered with 30 ANKs in flowering cherry⁹, 42 ANKs in peach v2.0⁶, 19 ANKs in almond⁸, 20 ANKs in apricot¹¹, and 50 ANKs in Chinese plum⁵. The expansion of ANKs was probably caused by gene duplication on chromosome 2 because 35 ANKs in the sweet cherry genome and 19 ANKs in the peach v2.0 genome were detected on chromosome 2 (Fig. 4b).

**Fig. 4: Collinearity patterns between the peach v2.0 (*Prunus persica*) and sweet cherry (*Prunus avium*) genomes.**

Fruit development-related genes in the sweet cherry genome

Using our previously published transcript sequencing data for cv. Tieton fruit development³², we reanalyzed the sequencing data based on the assembled genome. A total of 1044 genes were identified as differentially expressed genes (DEGs, P-values < 0.01) during fruit development. The KEGG pathway enrichment analysis results of these DEGs are shown in Table S10, and the 43 largest categories are shown in Fig. 5. Among these pathways, metabolic pathways (pavi01100), photosynthesis (pavi00195), biosynthesis of secondary metabolites (pavi01110), photosynthesis-antenna proteins (pavi00196), carbon metabolism (pavi01200), carbon fixation in photosynthetic organisms (pavi00710), cysteine and methionine metabolism (pavi00270), plant hormone signal transduction (pavi04075), biosynthesis of amino acids (pavi01230), and Isoquinoline alkaloid biosynthesis (pavi00950) were the 10 largest categories (with the lowest P-values), indicating that these pathways play important roles in the fruit development of sweet cherry cv. Tieton.

**Fig. 5: KEGG enrichment analysis of the differentially expressed genes during Tieton fruit development.**

Among all the DEGs, 26 genes were involved in plant hormone signal transduction pathways. Twelve of them were involved in the auxin signaling pathway, and 9 of them were identified as abscisic acid (ABA)-responsive genes. In comparison with the well-known function of ABA-responsive genes in the fruit development of sweet cherry³³, how auxin-responsive genes are involved in this process is still unclear. Of these auxin-responsive genes, six were identified as AUX/IAA (auxin-responsive protein IAA), two were identified as GH3 (indole-3-acetic acid-amido synthetase GH3), two were identified as SAUR (small auxin-up RNA), and two were identified as ARF (auxin response factor). The expression patterns of these genes are shown in Fig. 6. It is known that the application of exogenous auxin can delay fruit development, and the endogenous auxin level decreases during the fruit ripening process in sweet cherry^34,35,36. The gene expression patterns of the AUX/IAAs and ARFs were consistent with the endogenous auxin level during fruit development. However, the increased expression of GH3s and SAURs in the late stage of fruit development reveals a complicated interaction between auxin and the sweet cherry ripening process. Further studies are needed to explore the function of auxin in fruit development in sweet cherry. Three members of TCH4 (xyloglucan endotransglucosylase/hydrolase protein) involved in the brassinosteroid signal transduction pathway were identified as DEGs, which were mainly expressed in the middle stage of fruit development in sweet cherry. One jasmonate ZIM domain-containing protein (identified as TIFY 9) was increased during the fruit ripening process, which confirms that jasmonic acid is involved in this process in sweet cherry^37,38,39. One AHP (histidine-containing phosphotransfer protein 1-like) was decreased during this process, consistent with the endogenous cytokinin level changes in sweet cherry fruit development³⁴.

**Fig. 6: Expression patterns of plant hormone signal transduction pathway genes involved in sweet cherry fruit development.**

Resistance-related genes in the sweet cherry genome

The sweet cherry cv. Tieton genome consisted of 772 resistance-related (R) genes with leucine-rich repeat (LRR) domains and/or nucleotide-binding sites (NBSs). These genes were classified into five groups: coiled-coil-NBS-LRR (CNL), TIR-NBS-LRR (TNL), NBS-LRR (NL), LRR, and NBS. LRR was the largest group, with 506 genes, whereas CNL included 148 genes, TNL included 31 genes, NL included 229 genes, and NBS included 44 genes. Compared to the peach genome, with 437 R genes, the sweet cherry genome had more R genes (772). The sweet cherry genome had more LRR domain-containing proteins, whereas the peach genome had more TNL-encoding genes (146 genes vs. 31 genes).

We also conducted chromosome-level genome synteny analysis of the R genes in the sweet cherry and peach genomes. As shown in Fig. S5, the majority of the R genes in the sweet cherry genome showed good synteny with those in the peach genome. In addition, these R genes were unevenly distributed along the eight chromosomes of sweet cherry. Their hotspots resided on chromosomes 1, 2, and 8, which are clearly shown in Fig. S6.

Discussion

Using the combination of Oxford Nanopore technology, Illumina short reads and Hi-C scaffolding, we successfully assembled a high-quality reference genome for sweet cherry cv. Tieton and anchored 99.59% of the sequences to eight pseudochromosomes. According to the LAI-based method for assessing genome assembly quality, our sweet cherry cv. Tieton genome assembly shows better assembly quality than all the documented genomes within Prunus. This genome assembly can serve as a high-quality reference genome for sweet cherry to support studies of molecular breeding, genetics and evolution in sweet cherry and the Prunus genus.

Compared with the previous draft assembly of sweet cherry cv. Satonishiki, our genome assembly of cv. Tieton provides higher coverage and better continuity. First, the assembly size for sweet cherry cv. Tieton has been increased to 344.29 Mb, representing 101.2% of the estimated genome size. Second, our assembly of cv. Tieton exhibits a 15.47x longer contig N50 than cv. Satonishiki. We also annotated more genes than were included in the former annotation of cv. Satonishiki, suggesting the better continuity of our assembly.

The repetitive rate in the genome of sweet cherry cv. Tieton (59.40%) is higher than that in all the other sequenced species of Prunus, such as almond (34.6%)⁸, peach (37.14%)⁷, apricot (38.28%)¹¹, Chinese plum (45.0%)⁵, and flowering cherry (P. yedoensis, 47.2%)⁹. There are also more gene duplication events in sweet cherry and flowering cherry (P. yedoensis) than in peach, almond, apricot, and Chinese plum. These results may explain why sweet cherry has a larger genome than other diploid Prunus species.

Two gene duplication events were identified in the genome of sweet cherry compared with peach v2.0, which were involved in the salicylic acid synthesis and salicylic acid signal transduction pathways. We also identified some auxin-responsive genes that were differentially expressed during the fruit development of sweet cherry. This finding suggests that auxin plays a complicated role in the fruit ripening process of sweet cherry. Our well-assembled reference genome of sweet cherry cv. Tieton will support studies on gene function characterization and benefit the research community in the future.

Materials and methods

DNA extraction and sequencing

Leaf samples from sweet cherry cv. Tieton grown in the experimental orchard of Shandong Institute of Pomology, Taian, China, were collected and frozen in liquid nitrogen. Genomic DNA (gDNA) was extracted, size selected and sequenced on an Oxford Nanopore PromethION system by BioMarker Technology Co., Ltd. (Beijing, China). Low-quality reads, reads with adapters and reads shorter than 2000 nt were filtered out. Before assembly, 2000 reads were randomly selected and subjected to BLAST searches against the NT database using BLAST v2.9.0+, and no obvious external contamination was found. For Illumina sequencing, a paired-end library with an insert size of 350 bp was constructed and sequenced by CapitalBio Technology Inc. (Beijing, China) in accordance with the manufacturer’s protocol using the Illumina HiSeq X Ten platform with a 150-nt layout. A total of 48.27 Gb of Illumina raw sequencing data was generated for the survey, correction and evaluation of the genome.

Genome assembly

Several approaches were applied to assemble the sweet cherry cv. Tieton genome using MaSuRCA v3.3.4⁴⁰, Canu v1.8⁴¹, wtdbg2 v2.5, and NECAT v0.01⁴². To facilitate reading, a complete set of parameters for each tool is listed in Supplementary Table S11. BUSCO v3.0.2⁴³ was used to assess all the assemblies. The preliminary assemblies of Canu, wtdbg2, and NECAT were first polished with medaka (https://github.com/nanoporetech/medaka) using Nanopore long reads and then polished three times with NextPolish v1.0.5 using Illumina short reads. The three draft assemblies were subsequently processed with Purge Haplotigs v1.1.0⁴⁴ to investigate the proper assignment of contigs.

Hi-C analysis and pseudochromosome construction

Fresh young leaves collected from a sweet cherry cv. Tieton tree were fixed using formaldehyde at a concentration of 1%. The chromatin was cross-linked and digested using the restriction enzyme HindIII. The 5′ overhangs were labeled with a biotinylated tag and end repaired. After ligation, the DNA was extracted, sheared, and subjected to selection for fragments with lengths between 300 and 700 bp. The biotin-containing fragments were captured for library construction. Finally, the quality of the purified library was evaluated with a Qubit 3.0 fluorometer (Thermo Fisher Scientific, Inc.), quantitative PCR (qPCR), and an Agilent 2100 instrument. The qualified library was sequenced using the Illumina HiSeq X Ten platform with a 150-bp PE layout. To obtain the unique mapped read pairs, we first truncated the paired reads at the putative Hi-C junctions and then aligned them to the sweet cherry cv. Tieton genome assembly using the Arima-HiC Mapping Pipeline (https://github.com/ArimaGenomics/mapping_pipeline). We then used the partition, rescue, optimize, and build functions of the ALLHiC v0.9.12¹⁴ pipeline to construct the chromosomal-level scaffolds of the sweet cherry cv. Tieton genome. After ALLHiC scaffolding, pseudomolecules and Hi-C contact maps were built by using Juicer v1.6.2⁴⁵ and the 3D de novo assembly pipeline v180114⁴⁶. Juicebox Assembly Tools v1.9.8 were used to visualize and manually correct the large-scale inversions and translocations to obtain the final pseudochromosomes.

Genome assembly quality evaluation

Following the approach of Ou et al.¹⁵, LTRharvest v1.6.1 and LTR_retriever v2.8 were first used to detect LTR-RTs in the genome assemblies, and then the LAIs of each genome were calculated using LTR_retriever. The genome size of sweet cherry cv. Tieton was estimated by using Jellyfish v2.2.10⁴⁷ and GenomeScope v1.0⁴⁸ based on a k-mer counting method.

Repetitive sequence annotation

RepeatModeler v.2.0.1⁴⁹ was applied as the de novo method to identify repetitive elements in the sweet cherry cv. Tieton genome. Then, the homology-based tool RepeatMasker v.4.0.9⁵⁰ was applied against Dfam v3.1⁵¹ and Repbase library v20170127 using the Embryophyta settings.

Noncoding RNA prediction

Infernal v1.1.2⁵² was used to identify noncoding RNAs (ncRNAs) in the sweet cherry cv. Tieton genome by comparison with the RFAM database v14.1⁵³. tRNAscan-SE v2.0.5 was used to identify tRNAs. RNAmmer v1.1.2⁵⁴ was used to search for rRNAs. The results of the three pipelines were combined, and the overlapping annotations were removed.

Protein-coding gene prediction and functional annotation

Funannotate v1.7.2⁵⁵, which employs GeneMark-ES v4.48_3.60_lic⁵⁶, Augustus v3.3.2⁵⁷, CodingQuarry v2.0⁵⁸, SNAP v2006-07-28⁵⁹, and GlimmerHMM v3.0.4⁶⁰, was used to perform protein-coding gene prediction in the repeat-masked sweet cherry cv. Tieton genome. Briefly, we first trained the gene prediction model using our previously published transcript sequencing data for cv. Tieton fruit³². Then, protein-coding genes were predicted for the whole genome using the Funannotate prediction pipeline. All the predicted gene models, transcript alignments generated by PASA v2.4.1⁶¹, and protein sequences of the peach v2.0 annotation⁶ were input into EVidenceModeler v1.1.1 to obtain the final prediction of gene models for the sweet cherry cv. Tieton genome. The statistics of the prediction results and the weights used for EVidenceModeler are shown in Table S12. InterProScan5 v5.0⁶² was used to annotate the predicted proteins against the InterPro database v77.0⁶³. EggNOG-mapper v1.0.3¹⁷ was used locally against emapperdb v4.5.1 for fast functional annotation with Gene Ontology²¹ and COG functional categories²². KAAS (KEGG Automatic Annotation Server)⁶⁴ was used to annotate the predicted protein sequences according to KEGG Orthology. The results of the InterProScan, EggNOG-mapper, and KEGG Orthology analyses were combined and fitted to the Funannotate annotation pipeline. UTRs were added by the Funannotate update pipeline using the transcript data.

Gene orthologs and gene family analysis

Gene orthologs and gene duplication events in the sweet cherry genome compared with the genomes of other Prunus plants were identified by OrthoFinder v2.3.11⁶⁵. The GO enrichment analysis of these proteins was performed by using WEGO v2.0⁶⁶. CAFÉ v4.21⁶⁷ was employed to explore gene family size expansion and contraction following the provided protocol.

RNAseq analysis and KEGG pathway enrichment of DEGs

RNAseq analysis was performed following the protocol of Thomas W. Battaglia (https://github.com/twbattaglia/RNAseq-workflow). TCC-GUI was used to normalize the expression data and detect the DEGs⁶⁸. KOBAS 3.0 was used to perform the KEGG enrichment analysis⁶⁹.

Genome synteny analysis

A Python version of MCscan⁷⁰ was employed to analyze the synteny between the sweet cherry cv. Tieton genome and the cv. Satonishiki genome or other genomes within Prunus following the approach of Haibao Tang. Circos v0.69-8⁷¹ was used to generate the synteny plot between sweet cherry cv. Tieton and cv. Satonishiki.

Data availability

Raw sequencing reads have been deposited in the NCBI SRA (Sequence Read Archive) database under BioSample accession number SAMN13640536. This whole-genome shotgun project has been deposited in GenBank under the accession JAAOZG000000000. The version described in this paper is version JAAOZG010000000.

References

Quero-García, J., Iezzoni, A., Puławska, J. & Lang, G. Cherries: Botany, Production and Uses 1–533 (CABI PUBLISHING, Boston, MA, 2017).
Xuwei, D. et al. Fruit scientific research in New China in the past 70 years: cherry. J. Fruit. Sci.36, 1339–1351 (2019).
Google Scholar
Garcia, J. Q. Cherry breeding in the world: current analysis and future perspectives. Italus Hortus26, 9–20 (2019).
Google Scholar
Shirasawa, K. et al. The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding. DNA Res.24, 499–508 (2017).
CAS PubMed PubMed Central Google Scholar
Zhang, Q. et al. The genome of Prunus mume. Nat. Commun.3, 1318 (2012).
PubMed Google Scholar
Verde, I. et al. The Peach v2.0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics.18, 225 (2017).
PubMed PubMed Central Google Scholar
International Peach Genome, I. et al. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat. Genet.45, 487–494 (2013).
Google Scholar
Sanchez-Perez, R. et al. Mutation of a bHLH transcription factor allowed almond domestication. Science364, 1095–1098 (2019).
CAS PubMed Google Scholar
Baek, S. et al. Draft genome sequence of wild Prunus yedoensis reveals massive inter-specific hybridization between sympatric flowering cherries. Genome Biol.19, 127 (2018).
PubMed PubMed Central Google Scholar
Shirasawa, K. et al. Phased genome sequence of an interspecific hybrid flowering cherry, ‘Somei-Yoshino’ (Cerasus × yedoensis). DNA Res.26, 379–389 (2019).
CAS PubMed PubMed Central Google Scholar
Jiang, F. et al. The apricot (Prunus armeniaca L.) genome elucidates Rosaceae evolution and beta-carotenoid synthesis. Horticulture Res.6, 128 (2019).
Google Scholar
Zhebentyayeva, T. et al. Genetic characterization of worldwide Prunus domestica (plum) germplasm using sequence-based genotyping. Hortic. Res.6, 12 (2019).
CAS PubMed PubMed Central Google Scholar
Seppey, M., Manni, M. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness. Methods Mol. Biol.1962, 227–245 (2019).
CAS PubMed Google Scholar
Zhang, X., Zhang, S., Zhao, Q., Ming, R. & Tang, H. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nat. Plants5, 833–845 (2019).
CAS PubMed Google Scholar
Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res.46, e126 (2018).
PubMed PubMed Central Google Scholar
Arumuganathan, K. & Earle, E. D. Nuclear DNA content of some important plant species. Plant Mol. Biol. Report.9, 208–218 (1991).
CAS Google Scholar
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Biol. Evol.34, 2115–2122 (2017).
CAS PubMed PubMed Central Google Scholar
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res.47, D427–D432 (2019).
CAS PubMed Google Scholar
UniProt, C. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res.47, D506–D515 (2019).
Google Scholar
Kanehisa, M. et al. Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res.42, D199–D205 (2014).
CAS PubMed Google Scholar
Harris, M. A. et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res.32, D258–D261 (2004).
CAS PubMed Google Scholar
Koonin, E. V. et al. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol.5, R7 (2004).
PubMed PubMed Central Google Scholar
Rawlings, N. D. et al. The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database. Nucleic Acids Res.46, D624–D632 (2018).
CAS PubMed Google Scholar
Kall, L., Krogh, A. & Sonnhammer, E. L. A combined transmembrane topology and signal peptide prediction method. J. Mol. Biol.338, 1027–1036 (2004).
CAS PubMed Google Scholar
Almagro Armenteros, J. J. et al. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol.37, 420–423 (2019).
CAS PubMed Google Scholar
Lombard, V., Golaconda Ramulu, H., Drula, E., Coutinho, P. M. & Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res.42, D490–D495 (2014).
CAS PubMed Google Scholar
Paajanen, P. et al. A critical comparison of technologies for a plant genome sequencing project. Gigascience8, giy163 (2019).
PubMed PubMed Central Google Scholar
Yu, Y. et al. Genome re-sequencing reveals the evolutionary history of peach fruit edibility. Nat. Commun.9, 5404 (2018).
CAS PubMed PubMed Central Google Scholar
Aranzana, M. J. et al. Prunus genetics and applications after de novo genome sequencing: achievements and prospects. Hortic. Res.6, 58 (2019).
PubMed PubMed Central Google Scholar
Li, Y. X. et al. Salicylic acid in Populus tomentosa is a remote signalling molecule induced by Botryosphaeria dothidea infection. Sci. Rep.8, 14059 (2018).
PubMed PubMed Central Google Scholar
Sharma, M. & Pandey, G. K. Expansion and function of repeat domain proteins during stress and development in plants. Front Plant Sci.6, 1218 (2015).
PubMed Google Scholar
Wei, H. et al. Comparative transcriptome analysis of genes involved in anthocyanin biosynthesis in the red and yellow fruits of sweet cherry (Prunus avium L.). PLoS ONE10, e0121164 (2015).
PubMed PubMed Central Google Scholar
Shen, X. J. et al. A role for PacMYBA in ABA-regulated anthocyanin biosynthesis in red-colored sweet cherry cv. Hong Deng (Prunus avium L.). Plant Cell Physiol.55, 862–880 (2014).
CAS PubMed Google Scholar
Teribia, N., Tijero, V. & Munne-Bosch, S. Linking hormonal profiles with variations in sugar and anthocyanin contents during the natural development and ripening of sweet cherries. N. Biotechnol.33, 824–833 (2016).
CAS PubMed Google Scholar
Wang, Y. P. et al. Transcriptional regulation of PaPYLs, PaPP2Cs and PaSnRK2s during sweet cherry fruit development and in response to abscisic acid and auxin at onset of fruit ripening. Plant Growth Regul.75, 455–464 (2015).
CAS Google Scholar
Stern, R. A., Flaishman, M., Applebaum, S. & Ben-Arie, R. Effect of synthetic auxins on fruit development of ‘Bing’ cherry (Prunus avium L.). Sci. Horticulturae.114, 275–280 (2007).
CAS Google Scholar
Berni, R., Cai, G., Xu, X., Hausman, J. F. & Guerriero, G. Identification of jasmonic acid biosynthetic genes in sweet cherry and expression analysis in four ancient varieties from tuscany. Int. J. Mol. Sci.20, 10 (2019).
Google Scholar
Kondo, S., Motoyama, M., Michiyama, H. & Kim, M. Roles of jasmonic acid in the development of sweet cherries as measured from fruit or disc samples. Plant Growth Regul.37, 37–44 (2002).
CAS Google Scholar
Kondo, S., Tomiyama, A. & Seto, H. Changes of endogenous jasmonic acid and methyl jasmonate in apples and sweet cherries during fruit development. J. Am. Soc. Horticultural Sci.125, 282–287 (2000).
CAS Google Scholar
Zimin, A. V. et al. The MaSuRCA genome assembler. Bioinformatics29, 2669–2677 (2013).
CAS PubMed PubMed Central Google Scholar
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res.27, 722–736 (2017).
CAS PubMed PubMed Central Google Scholar
Chen, Y. et al. Fast and accurate assembly of Nanopore reads via progressive error correction and adaptive read selection. bioRxiv. https://doi.org/10.1101/2020.02.01.930107 (2020).
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol35, 543–548 (2017).
PubMed Central Google Scholar
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinforma.19, 460 (2018).
CAS Google Scholar
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst.3, 95–98 (2016).
CAS PubMed PubMed Central Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science356, 92–95 (2017).
CAS PubMed PubMed Central Google Scholar
Marcais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics27, 764–770 (2011).
CAS PubMed PubMed Central Google Scholar
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics33, 2202–2204 (2017).
CAS PubMed PubMed Central Google Scholar
Flynn, J. M. et al. RepeatModeler2: automated genomic discovery of transposable element families. bioRxiv. https://doi.org/10.1101/856591 (2019).
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinformatics25, Unit 4–10 (2009).
Hubley, R. et al. The Dfam database of repetitive DNA families. Nucleic Acids Res.44, D81–D89 (2016).
CAS PubMed Google Scholar
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics29, 2933–2935 (2013).
CAS PubMed PubMed Central Google Scholar
Kalvari, I. et al. Non-coding RNA analysis using the Rfam database. Curr. Protoc. Bioinforma.62, e51 (2018).
Google Scholar
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res.35, 3100–3108 (2007).
CAS PubMed PubMed Central Google Scholar
Love, J. et al. nextgenusfs/funannotate: funannotate v1.7.2. Zenodo. https://doi.org/10.5281/zenodo.3594559 (2019).
Lomsadze, A., Ter-Hovhannisyan, V., Chernoff, Y. O. & Borodovsky, M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res.33, 6494–506 (2005).
CAS PubMed PubMed Central Google Scholar
Hoff, K. J. & Stanke, M. Predicting genes in single genomes with AUGUSTUS. Curr. Protoc. Bioinforma.65, e57 (2019).
Google Scholar
Testa, A. C., Hane, J. K., Ellwood, S. R. & Oliver, R. P. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts. BMC Genomics.16, 170 (2015).
PubMed PubMed Central Google Scholar
Korf, I. Gene finding in novel genomes. BMC Bioinforma.5, 59 (2004).
Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics20, 2878–2879 (2004).
CAS PubMed Google Scholar
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res.31, 5654–5666 (2003).
CAS PubMed PubMed Central Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics30, 1236–1240 (2014).
CAS PubMed PubMed Central Google Scholar
Mitchell, A. L. et al. InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucleic Acids Res.47, D351–D360 (2019).
CAS PubMed Google Scholar
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res.35, W182–W185 (2007).
PubMed PubMed Central Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol.20, 238 (2019).
PubMed PubMed Central Google Scholar
Ye, J. et al. WEGO 2.0: a web tool for analyzing and plotting GO annotations, 2018 update. Nucleic Acids Res.46, W71–W75 (2018).
CAS PubMed PubMed Central Google Scholar
Han, M. V., Thomas, G. W., Lugo-Martinez, J. & Hahn, M. W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol. Biol. Evol.30, 1987–1997 (2013).
CAS PubMed Google Scholar
Su, W., Sun, J., Shimizu, K. & Kadota, K. TCC-GUI: a Shiny-based application for differential expression analysis of RNA-Seq count data. BMC Res Notes12, 133 (2019).
PubMed PubMed Central Google Scholar
Xie, C. et al. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res.39, W316–W322 (2011).
CAS PubMed PubMed Central Google Scholar
Tang, H. et al. Synteny and collinearity in plant genomes. Science320, 486–488 (2008).
CAS PubMed Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res.19, 1639–1645 (2009).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was supported by the Shandong Provincial Key Laboratory for Fruit Biotechnology Breeding, the Special Fund for Innovation Teams of Fruit Trees in Agricultural Technology System of Shandong Province (SDAIT-06-04), the Agricultural scientific and technological innovation project of Shandong Academy of Agricultural Science (CXGC2018F03), the Fundamental Research Funds for the Central Universities (WUT: 2020IVA026), and the start-up grant from Wuhan University of Technology (grant no. 104-40120526).

Author information

These authors contributed equally: Jiawei Wang, Weizhen Liu, Dongzi Zhu

Authors and Affiliations

Shandong Key Laboratory of Fruit Biotechnology Breeding, Shandong Institute of Pomology, Taian, Shandong, 271000, China
Jiawei Wang, Dongzi Zhu, Po Hong, Yue Tan, Xin Chen, Li Xu, Xiaojuan Zong, Lisi Zhang, Hairong Wei & Qingzhong Liu
School of Computer Science and Technology, Wuhan University of Technology, Wuhan, Hubei, 430070, China
Weizhen Liu, Shijun Xiao & Xiaohui Yuan
State Key Laboratory of Crop Biology, Shandong Agricultural University, Taian, Shandong, 271018, China
Shizhong Zhang
Gooal Gene, Wuhan, Hubei, 430070, China
Shijun Xiao

Authors

Jiawei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Weizhen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Dongzi Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Po Hong
View author publications
You can also search for this author in PubMed Google Scholar
Shizhong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shijun Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Yue Tan
View author publications
You can also search for this author in PubMed Google Scholar
Xin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Li Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaojuan Zong
View author publications
You can also search for this author in PubMed Google Scholar
Lisi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hairong Wei
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohui Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Qingzhong Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.W., W.L., and Q.L. conceived the project. J.W. collected the samples. J.W., W.L., S.Z., S.X., D.Z., P.H., Y.T., X.Z., X.Y., and H.W. performed the genome assembly and data analysis. J.W., W.L., L.X., L.Z., X.C., S.X., X.Y., and Q.L. wrote the paper. All authors read and approved the final version of the paper.

Corresponding authors

Correspondence to Jiawei Wang, Weizhen Liu or Qingzhong Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Supplementary information

Supporting Information

Supporting Information 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, J., Liu, W., Zhu, D. et al. Chromosome-scale genome assembly of sweet cherry (Prunus avium L.) cv. Tieton obtained using long-read and Hi-C sequencing. Hortic Res 7, 122 (2020). https://doi.org/10.1038/s41438-020-00343-8

Download citation

Received: 26 March 2020
Revised: 29 April 2020
Accepted: 12 May 2020
Published: 01 August 2020
DOI: https://doi.org/10.1038/s41438-020-00343-8

This article is cited by

A chromosome-scale assembly of the early-flowering Prunus campanulata and comparative genomics of cherries
- Yuxi Hu
- Chao Feng
- Ming Kang
Scientific Data (2023)
Construction of high density linkage maps in almond and synteny analysis with peach, Japanese apricot and sweet cherry genomes
- Habibullah Tevfik
- Harun Karcı
- Salih Kafkas
Euphytica (2023)
MYB transcription factor family in sweet cherry (Prunus avium L.): genome-wide investigation, evolution, structure, characterization and expression patterns
- Irfan Ali Sabir
- Muhammad Aamir Manzoor
- Caixi Zhang
BMC Plant Biology (2022)
Genome-wide identification of cysteine-rich receptor-like kinases in sweet cherry reveals that PaCRK1 enhances sweet cherry resistance to salt stress
- Xiaohui Zhao
- Dehui Qu
- Hongyan Su
Plant Cell Reports (2022)
Upregulation of defense-related gene expressions associated with lethal growth failure in the hybrid seedlings of Japanese flowering cherry
- Momi Tsuruta
- Chunlan Lian
- Yuzuru Mukai
Tree Genetics & Genomes (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sequencing and assembly of the sweet cherry cv. Tieton genome

Genome assembly quality evaluation

Annotation of the sweet cherry cv. Tieton genome

Synteny analysis between the genome of sweet cherry cv. Tieton and the former draft genome of cv. Satonishiki

Evolution of the sweet cherry genome

Fruit development-related genes in the sweet cherry genome

Resistance-related genes in the sweet cherry genome

Discussion

Materials and methods

DNA extraction and sequencing

Genome assembly

Hi-C analysis and pseudochromosome construction

Genome assembly quality evaluation

Repetitive sequence annotation

Noncoding RNA prediction

Protein-coding gene prediction and functional annotation

Gene orthologs and gene family analysis

RNAseq analysis and KEGG pathway enrichment of DEGs

Genome synteny analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links