Transcriptional dynamics of the developing sweet cherry (Prunus avium L.) fruit: sequencing, annotation and expression profiling of exocarp-associated genes

Alkio, Merianne; Jonas, Uwe; Declercq, Myriam; Van Nocker, Steven; Knoche, Moritz

doi:10.1038/hortres.2014.11

Download PDF

Article
Open access
Published: 12 March 2014

Transcriptional dynamics of the developing sweet cherry (Prunus avium L.) fruit: sequencing, annotation and expression profiling of exocarp-associated genes

Merianne Alkio¹,
Uwe Jonas¹,
Myriam Declercq¹,
Steven Van Nocker² &
…
Moritz Knoche¹

Horticulture Research volume 1, Article number: 11 (2014) Cite this article

6396 Accesses
60 Citations
Metrics details

Subjects

Abstract

The exocarp, or skin, of fleshy fruit is a specialized tissue that protects the fruit, attracts seed dispersing fruit eaters, and has large economical relevance for fruit quality. Development of the exocarp involves regulated activities of many genes. This research analyzed global gene expression in the exocarp of developing sweet cherry (Prunus avium L., ‘Regina’), a fruit crop species with little public genomic resources. A catalog of transcript models (contigs) representing expressed genes was constructed from de novo assembled short complementary DNA (cDNA) sequences generated from developing fruit between flowering and maturity at 14 time points. Expression levels in each sample were estimated for 34 695 contigs from numbers of reads mapping to each contig. Contigs were annotated functionally based on BLAST, gene ontology and InterProScan analyses. Coregulated genes were detected using partitional clustering of expression patterns. The results are discussed with emphasis on genes putatively involved in cuticle deposition, cell wall metabolism and sugar transport. The high temporal resolution of the expression patterns presented here reveals finely tuned developmental specialization of individual members of gene families. Moreover, the de novo assembled sweet cherry fruit transcriptome with 7760 full-length protein coding sequences and over 20 000 other, annotated cDNA sequences together with their developmental expression patterns is expected to accelerate molecular research on this important tree fruit crop.

RNA-seq-based transcriptome profiling of early fruit development in Chieh-qua and analysis of related transcription factors

Article Open access 12 June 2024

Transcriptome analysis of five different tissues of bitter gourd (Momordica charantia L.) fruit identifies full-length genes involved in seed oil biosynthesis

Article Open access 13 September 2022

Genome-wide identification and characterization of long noncoding RNAs during peach (Prunus persica) fruit development and ripening

Article Open access 30 June 2022

Introduction

The exocarp, or skin, of fleshy fruit is a highly specialized tissue that forms the interface between the fruit flesh and the environment, and supports the mechanical integrity of the fruit. It is the outermost part of the fruit wall (pericarp), and typically comprises the epidermis and sub-epidermal cell layers.¹ The exocarp protects the developing fruit against uncontrolled water loss or uptake, ultraviolet radiation, mechanical damage, pathogens and herbivores.^2,3 Reflecting the diverse functions, cell walls in the exocarp are characterized through thickenings and other modifications, the most prominent one being the cuticle, or the cuticular membrane⁴ (CM). The cuticle is a complex layer of lipophilic materials such as cutin and waxes. It is localized in the outer periclinal and anticlinal cell walls of the epidermis, and in some fruit such as apple and tomato, may extend into the hypodermal cell walls. Furthermore, exocarp cells usually contain high levels of phenolic compounds, pigments, volatile esters and defense-related proteins.⁵ Several horticulturally relevant traits are also associated with the exocarp, such as color, resistance to pathogen infections and to fruit cracking. During ripening, the fruit becomes attractive to frugivores through changes in color, smell, flavor and texture. These changes also affect the exocarp. Further developmental changes affect the composition and load of the cuticle^6–8 and the mechanical properties of the exocarp. For instance, the peel of tomato and grape berry stiffens towards maturity, which has been proposed to control fruit growth.^9,10

The dynamic functions of the exocarp require a developmentally changing repertoire of biosynthetic activities and expressed genes. So far, global transcriptome studies of the whole fruit have uncovered a wealth of developmentally regulated genes in important fleshy fruit crops such as tomato,¹¹ apple,^12–14 grape berry,^15,16 peach^17,18 and date palm.¹⁹ Only a few studies on temporal changes of gene expression profiles in the developing exocarp have been published, and these have focused on tomato^20–22 or grape berry.²³ Other transcriptome analyses of the exocarp have concentrated on a single developmental stage (peach,²⁴ grape berry,²⁵ tomato,²⁶ citrus²⁷) or pooled developmental stages (pomegranate²⁸).

Sweet cherry is an important fleshy, non-climacteric tree fruit crop grown in temperate regions worldwide. Like in other stone fruit such as the peach, the edible, fleshy part of the sweet cherry fruit consists of exocarp and mesocarp, while the endocarp develops to a hard shell enclosing the seed, together constituting the ‘stone’ or pit. So far, relatively little data on gene expression are available for sweet cherry fruit. This limits the ability to identify genes with important roles in developmental processes, which in turn limits breeding and production of superior fruit.

Recently, developmental and tissue-specific expression patterns were studied for 22 sweet cherry genes, 13 of which are potentially involved in cuticle formation and/or function in fruit.²⁹ The present study is a comprehensive report of transcriptional changes that take place in the sweet cherry fruit exocarp over the developmental course from flowering to maturity. The objectives of this research were to (i) identify genes expressed in the sweet cherry exocarp; (ii) resolve temporal expression patterns of the exocarp-associated genes; and (iii) identify developmentally regulated processes characteristic to the exocarp on the basis of coregulated genes. Much of our past research has been on the development and function of fruit cuticles^6,30,31 and fruit skin.^32,33 In the present work, our particular interest was on genes involved in cuticle deposition. Other key processes we focused on included cell wall metabolism during fruit growth, maturation and softening; sugar transport; and transcriptional regulation. Together, the results reveal a complex chronology of simultaneous, consecutive and overlapping processes taking place in the developing sweet cherry exocarp.

Materials and methods

Sampling of fruit and determination of fresh mass, fruit surface area and fruit cuticle mass

Fruit from sweet cherry (Prunus avium L. ‘Regina’) trees grown in a commercial orchard in Gleidingen, Germany (09°84′E, 52°27′N), were used for all analyses. Flawless fruits, free of visible defects, were sampled, weighed and the fruit surface area calculated weekly as described previously.²⁹ Cuticular membranes were isolated enzymatically; the mass of CM per unit surface area and per fruit and the CM thickness were determined as described previously.²⁹

Total RNA extraction

Tissue samples for extraction of total RNA were collected weekly from full bloom to 94 days after full bloom (DAFB) (i.e., maturity) and flash frozen in liquid nitrogen within 10 min from removal of the fruit from the tree as described previously.²⁹ Total RNA was isolated using the InviTrap Spin Plant RNA Mini Kit (Stratec Molecular GmbH, Berlin, Germany) with the lysis buffer RP.

Library preparation for RNA-Seq and sequencing

A total of 24 RNA samples isolated from following sweet cherry fruit tissues were used for transcriptome sequencing in one or two biological replicates as indicated in Figure 1c: whole ovaries (other floral organs removed) 3 DAFB; exocarp-enriched tissue (0.5–2 mm thin slices from the outermost pericarp) 10, 17, 24, 31, 38, 45, 52, 59, 66, 73, 80, 87 and 94 DAFB; mesocarp tissue 24 and 80 DAFB. The mesocarp samples served for detecting differential expression between exo- and mesocarp. Each biological replicate contained tissue from at least ten fruits collected from at least three trees.

Aliquots of total RNA were treated with DNase I (Fermentas, Schwerte, Germany), purified using MinElute RNA purification Kit (Qiagen, Hilden, Germany) and the integrity of the RNA was assessed using capillary electrophoresis (Bioanalyzer; Agilent, Santa Clara, CA, USA); the RNA integrity numbers of the samples were 8.2–10. Directional complementary DNA (cDNA) libraries were prepared at the GenXPro GmbH (Frankfurt am Main, Germany). Briefly, total RNA samples were retreated with DNase (BaselineZero; Epicentre Biotechnologies, Madison, WI, USA); mRNA was captured using Dynabeads mRNA Purification Kit (Invitrogen, Darmstadt, Germany) and decapped using tobacco acid pyrophosphatase (Epicentre Biotechnologies). RNA was fragmented using RNA fragmentation reagents (Ambion, Darmstadt, Germany). Fragment ends were repaired using T4 polynucleotide kinase (NEB, Frankfurt am Main, Germany), and adapters RA3 and RA5 (Illumina Inc., San Diego, CA, USA; Supplementary Table S1) were ligated to 3′ and 5′ ends using truncated T4 RNA Ligase 2 (NEB) respective T4 RNA Ligase 1 (NEB). First-strand cDNA was synthesized using SuperScript III reverse transcription kit (Invitrogen) and RTP primers (Illumina; Supplementary Table S1). The cDNA was amplified in 13–15 cycles of PCR using Phusion DNA polymerase (Finnzymes, Schwerte, Germany) with RP1 and RPI1–24 primers with Index 1-24 (Illumina; Supplementary Table S1). Amplified cDNA libraries were separated via polyacrylamide gel electrophoresis and fragments of 200–250 bp were excised and eluted from the gel. Concentration of the cDNA was quantified using the RiboGreen assay kit (Invitrogen). Aliquots of four indexed cDNA samples each were pooled. The pools were spiked with 1% of PhiX174 DNA (Illumina) for sequencing quality control and the samples were sequenced using the Solexa/Illumina platform (HiSeq2000; Illumina Sequencing Kit TruSeq SBS v5) yielding 100 bp reads in the 5′ to 3′ direction of the mRNA. Base calling was done using the HiSeq Control Software packages v1.4.5, RTA v1.12.4, CASAVA v1.7.0 and OLB v1.9.0. Indexed reads were deconvoluted using a custom Perl script (GenXPro GmbH).

Pre-processing and de novo assembly of Illumina reads

The pre-processing and assembly pipelines are outlined below; details on hardware, operating systems, parameter settings and other specifics are given in Supplementary Method S1.

Deconvoluted reads were pre-processed locally. The quality of Illumina reads was assessed using the program FastQC 0.9.4 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Adapter sequences were removed from 5′ and 3′ ends using the program ea-utils.³⁴ Poor quality bases at the 3′ end were removed using the tool FASTQ Quality Trimmer (FASTX-Toolkit; fastx-tools_0.0.13_binaries_Linux_2.6_amd64; http://hannonlab.cshl.edu/fastx_toolkit/). Error correction of the remaining reads was done using Coral 1.3³⁵ and the standard Linux programs split and cat. To reduce data input for the de novo assembly, identical reads were merged using the FASTQ collapser of the FASTX-Toolkit and reads identical in sequence but different in length were merged using the program USEARCH 5.0.144.³⁶

The pre-processed reads were assembled at the RRZN cluster system at the Leibniz Universität Hannover, Germany in a multistep process. Filtered reads were first assembled to contiguous sequences (contigs) using Velvet 1.1.05³⁷ with k-mer sizes of 95–23 in increments of four and two different coverage cutoff setting (auto or 2); 38 assemblies in total. The contigs from Velvet assembly were cleaned of homopolymers, sequences of low complexity and adapter remnants using the program seqclean_x86_64 (http://compbio.dfci.harvard.edu/tgi/software/). Twenty-two published (GenBank JU090712-JU090733), three unpublished sweet cherry contigs from the same assembly²⁹ and the complete genome sequence of the phage phiX174 (GenBank J02482.1) were added to the Velvet contigs, which were then assembled via decremental clustering and assembly using the programs PAVE 3.0³⁸ and CAP3³⁹ in five rounds with decreasing stringency. Throughout the pre-processing, data were managed via the MySQL server.

Read mapping and BLAST searches

Transcript abundance was estimated by mapping the Illumina reads to the assembled contigs using the program Bowtie.⁴⁰ To enable correct alignment of as many reads as possible the raw reads were trimmed to 65 bp by removing 5 bp from the 5′-end and 30 bp from the 3′-end of each read and one mismatch per mapped read was allowed. Reads that mapped to two or more contigs were randomly assigned to one contig. Contigs were first sorted in two groups based on contig length and number of mapped reads. Group 1, high abundance contigs of ≥200 bp length and total read count ≥30 per contig in each sample and ≥75 per contig in all 24 samples together; Group 2, low abundance contigs of <200 bp length or total read count per contig <30 reads/sample and <75 reads/all 24 samples. Homology analyses were performed using the BLAST program.⁴¹ To identify contigs likely originating from other sources than the sweet cherry mRNA, the high abundance contigs (Group 1) were aligned against Prunus persica chloroplast (NCBI RefSeq NC_014697.1), mitochondrial (NCBI RefSeq release 51, directory: mitochondrion), ribosomal RNA (http://bioinformatics.psb.ugent.be/webtools/rRNA/rRNA.tar), plant pathogen (ftp://ftp.plantbiology.msu.edu/pub/data/CPGR/all.plant_pathogen.wgs_and_complete_genome.fasta.gz), yeast (http://downloads.yeastgenome.org/sequence/S288C_reference/genome_releases/S288C_reference_genome_R64-1-1_20110203.tgz) and viral (NCBI RefSeq release 51, directory: viral) sequences. Contigs with highly significant BLASTN matches (e-value <10⁻¹⁰⁰) in these searches were removed from Group 1 as a new Group 3. BLAST searches for Group 1 contigs were further performed in the PAVE platform. To identify highly similar sequences within the assembled contigs, each contig was used as query in a BLASTN search against all contigs with an e-value cutoff 10⁻¹⁰⁰. BLASTX searches for significant hits with an e-value <10⁻¹⁵ were conducted against predicted peptides of peach (Peach v1.0⁴²), apple (Malus × domestica Genome v1.0⁴³), grape vine (Vitis vinifera 12X March 2010 release of the draft genome⁴⁴), Arabidopsis (TAIR 10.0⁴⁵) and 25 previously assembled sweet cherry sequences²⁹ (GenBank JU090712 to JU090733). Results of these BLAST analyses were also referred to as ‘PAVE hits’. The sequence data were downloaded from Phytozome v.8.0⁴⁶ except for the sweet cherry sequences. Contigs were estimated to contain a complete open reading frame based on the comparison with the predicted peptide sequence of its best PAVE hit. Group 1 contigs were additionally annotated based on the best hits (e-value <10⁻¹⁰) retrieved in a BLASTX search performed via the Blast2GO platform⁴⁷ against the SwissProt database⁴⁸ (as of August 2012). Contigs with no hits were further queried against the NCBI RefSeq and nr databases as of August 2012. Results of these BLAST analyses were also referred to as ‘B2G hits’.

InterProScan and gene ontology analyses

Contigs in group 1 were functionally annotated assigning InterPro matches⁴⁹ and gene ontology (GO) terms via Blast2GO platform.⁴⁷ InterPro scan was performed on the translated amino-acid sequences of the longest open reading frame (ORF) of each contig if ≥150 nucleotides in length. GO terms were extracted from the BLASTX matches (e-value <10⁻¹⁰, GO annotation threshold 55) against the Swiss-Prot database⁴⁸ (as of August 2012). Contigs with no hits were further queried against the NCBI RefSeq and nr protein databases. Visualization and statistical analyses of the GO terms were performed using Blast2GO. Alternative GO annotation was conducted by extracting the GO terms assigned to the best P. persica and Arabidopsis hits (e-value <10⁻¹⁵) of each contig via the AgriGO platform.⁵⁰

Determination of expression levels and clustering of co-expressed genes

Expression levels were estimated by calculating the numbers of expected fragments (reads) per kilobase of contig per million fragments mapped (FPKM).⁵¹ The data were visualized and analyzed using the Mayday workbench.⁵² Reproducibility of the FPKM values between two replicate samples was assessed from MA plots and by calculating the correlation between the FPKM values of all contigs in the two biological replicates. We made use of the replicate samples and removed from Group 1 all contigs that differed in their normalized FPKM values between any two replicates by 10% points or more. This filtered subset of 29 955 contigs was termed ‘Group 1F’ and used for most of the bio-informatic analyses. Clusters of co-expressed genes were identified using the neural gas and quality threshold clustering algorithms in Mayday workbench. Quality threshold cluster diameters were adapted to the data using the ‘search diameter for QT clustering’ option.

Estimating transcript levels using reverse transcription-quantitative real-time PCR

Reverse transcription-quantitative real-time PCR (RT-qPCR) was carried out for selected genes as described earlier.²⁹ Primer sequences are given in Supplementary Table S1.

GenBank accessions

The Transcriptome Shotgun Assembly project has been deposited at DDBJ/EMBL/GenBank under the accession GAJZ00000000 (contigs with predicted full-length ORFs were deposited) and the assembly name P.avium_v2.0. The version described in this paper is the first version, GAJZ01000000. The data are part of BioProject PRJNA73727.

Results and discussion

Characterization of the fruit, cDNA sequencing and de novo assembly of the sweet cherry transcriptome

Fruit growth exhibited three stages typical for stone fruit⁵³ (Figure 1a). During Stage I—here, approximately from 0 to 30 DAFB—growth occurs by cell division and expansion in the entire pericarp. During Stage II (here, 31–45 DAFB) the endocarp (the innermost pericarp layer) and the seed develop, while the fruit mass and surface area remain essentially constant. Stage II was identified based on the second lag phase in the growth curve (Figure 1a) and on the beginning and completion of hardening of the endocarp (not shown). In Stage III (here, 52–94 DAFB) fruit mass and surface area increase rapidly, primarily due to the expansion of mesocarp cells. Here, maxima of surface area expansion rate occurred in Stage I at approximately 24 DAFB (39.0 mm² day⁻¹) and in Stage III at approximately 66 DAFB (60.0 mm² day⁻¹), as calculated from the data of surface area development in Figure 1a. As previously observed,⁶ cuticle mass per fruit increased rapidly during Stage I, the rate of CM deposition reaching a maximum of approximately 117 µg per fruit per day about 24 DAFB in the present study. Thereafter, the CM deposition rate steeply decreased in Stage II and further Stage III, averaging only 5.4 µg per fruit per day at 66 DAFB (Figure 1b), the time of the highest surface expansion rate.

Tissue samples for RNA-seq analyses were collected in one or two biological replicates as indicated in Figure 1c. In total, cDNA generated from 24 individual samples representing 14 developmental time points was sequenced. The procedure and outcome of the sequencing, pre-processing, and assembly are outlined in Figure 2. Details on the pre-processing and assembly protocols, and the raw read and pre-processing results per sample are available as Supplementary Method S1, Supplementary Table S2 and Supplementary Fig. S1. Directional sequencing of 24 indexed samples in pools of four yielded a total of 665 million raw reads of 100 bp length each. After pre-processing and de novo assembly we obtained a final set of 68 101 assembled contigs of 107–14 149 bp length.

Mapping of trimmed reads to the de novo assembled contigs

To estimate the abundance of the mRNA represented by each contig in each sample, raw reads were trimmed to 65 nucleotides and remapped to the 68 101 contigs. Contigs were then separated into three groups based on the mapped read counts, results of BLASTn analyses and contig lengths. Group 1 (34 695 contigs or 51% of all contigs) consists of ‘high abundance’ contigs, or sequences represented by at least 30 reads per sample or 75 reads in all 24 samples; contigs shorter than 200 bp were excluded. Group 2 (32712 contigs or 48%) consists of ‘low-abundance’ contigs, or sequences with read counts below the set threshold of 30 reads per sample or 75 reads in all 24 samples. Group 3 (694 contigs or 1%) consists of probable contamination from viral, bacterial or other sources (data not shown). The assembled sequences in Groups 1 and 2 are available as Supplementary Data S1 and S2. The mapping results for all groups and samples are presented in Supplementary Table S3. For Group 1, the complete data on individual contigs including contig lengths, total read counts and BLAST results are available as Supplementary Table S4.

Table 1 summarizes statistics on contig length, GC content and other characteristics for Groups 1–3. Compared to other published plant transcriptomes that were assembled de novo from Illumina reads, the N50 value (1413 bp) and the average length (740 bp) of the Group 1 contigs are among the highest values reported.⁵⁴ Considering that we used single reads instead of paired-end reads, this indicates that the chosen assembly strategy (pre-processing of the raw reads followed by multi-k-mer assembly using Velvet and super-assembly using CAP3) was successful. However, as with any de novo assembly, our assembly has its shortcomings. For instance, sometimes two or more contigs without a sequence overlap were most likely derived from a single transcript, as suggested by the alignment of the contigs to their shared best hit.

Table 1 Summary of all contigs. Contigs were assembled de novo from Illumina sequenced cDNA fragments generated from sweet cherry ‘Regina’ fruit sampled at different developmental stages. Group 1 and 3 were termed ‘high abundance’ and Group 2 ‘low abundance’ contigs, based on the number of mapped reads per contig; threshold 30 mapped reads per sample or 75 reads total in all 24 samples. Group 3 consists of contigs with BLASTn hits (e-value <1×⁻¹⁰⁰) to bacterial, viral, rRNA or other sources as described in the section on ‘Materials and methods’

Full size table

The length distribution of contigs in Group 1 was somewhat biased towards the contigs shorter than 600 bp, although a considerable number of the contigs were 2000 to 5000 bp long or longer (Figure 3 and Table 1). This length distribution is similar to that of the P. persica predicted transcripts (Figure 3). A total of 330 million reads (49.6% of all reads) mapped to Group 1 contigs, 99.8% of them being unique matches; 16.0% mapped to Group 3 and only 0.4% to Group 2 (Table 1 and Supplementary Table S3). Averaging over the 24 samples, 75% (s.d. 8%) of the mappable reads mapped to Group 1 contigs, 24% (s.d. 8%) to Group 3 and 0.5% (s.d. 0.1%) to Group 2 (Figure 4).

Gene expression levels were estimated by calculating FPKM values (fragments per kilobase of transcript per million mapped reads⁵¹), which normalize the raw read counts against the length of the contig and the total number of mappable reads in each sample. Pearson’s correlation coefficients between the FPKM values for all Group 1 contigs in two replicate samples, available for eight developmental stages, ranged from 0.900 (38 DAFB) to 0.996 (66 DAFB), meeting the proposed standard for RNA-seq experiments.⁵⁵ MA plots, created in the Mayday workbench using the FPKM values analogous to the method proposed by Wang et al.,⁵⁶ indicated only minor differences between replicates (Supplementary Fig. S2).

Annotation of the assembled contigs: BLAST searches, gene ontology analyses and InterProScan

To detect similarities between the sweet cherry contigs and proteins in other plants, two independent BLAST searches were performed. First, the contigs were aligned against sweet cherry, peach, apple, grape vine and Arabidopsis predicted peptides via the platform PAVE.³⁸ The second round of BLAST searches were performed against the SwissProt, RefSeq and nr databases via the platform Blast2GO (B2G).⁴⁷ The results of these BLAST searches against the two sets of databases are referred to as ‘PAVE hits’ and ‘B2G hits’, respectively. Results are shown for contigs that were detected at similar abundance levels in both biological replicates, where applicable. This subset of contigs covered 86% of all Group 1 contigs and was termed Group 1F (see also the section on ‘Materials and methods’). In Group 1F, 23 603 contigs (79%) had PAVE hits, while 74% of the contigs had B2G hits (Table 2 and Supplementary Table S4). Notably, several contigs without significant hits to P. persica predicted peptides and complete ORFs sometimes had highly significant hits against the P. persica genomic sequence. Although potentially functional,⁵⁷ such non-coding transcripts were not analyzed further.

Table 2 Annotation summary of all Group 1F contigs and of the Group 1F contigs containing a full-length ORF. Group 1F consists of Group 1 contigs differing less than 10% points in the relative expression levels between two biological replicates. Similarity searches were performed via the platform PAVE against P. persica, M. domestica, V. vinifera, A. thaliana and P. avium peptides and via Blast2GO (B2G) against Swiss-Prot, NCBI RefSeq and nr databases as described in the section on ‘Materials and methods’

Full size table

To estimate how well the assembled contigs represent expression of real genes and not just random sequences, a brief summary is given on the Group 1F contigs containing full-length ORFs. A full-length ORF was identified in 7628 contigs (25% of all), when the presence of a full-length ORF was predicted based on the alignment of the contig to its most similar PAVE hit: complete coverage from start to stop codon indicated a full-length ORF. More contigs are likely to be predicted to include full-length ORFs, if a different ORF finding procedure is applied. Here, the contigs with predicted full-length ORFs in Group 1F were 247–12 130 bp long (median 1563 bp; Supplementary Table S4). Ninety-six percent of them had best hits against P. persica, 77% had SwissProt hits, 97% had InterProScan matches and 82% had GO terms assigned (Table 2 and Supplementary Table S4). The InterProScan matches and GO terms assigned were largely consistent with each other and with the functions suggested by the best hits (Supplementary Table S4).The assigned GO terms covered a wide range of processes, functions and cellular compartments from ‘cell division’ (171 contigs) and ‘chloroplast’ (1042 contigs) to ‘transcription factor activity’ (300 contigs) and ‘response to stimulus’ (2424 contigs) (Figure 5 and Supplementary Table S5).

Based on sequence comparison with 1521 predicted transcription factors of P. persica genome,⁵⁸ 1009 contigs representing transcription factors in Group 1F were identified, 410 of which contained a full-length ORF (Supplementary Table S4). Of the 58 transcription factor families in P. persica, 54 were represented in Group 1F, and full-length ORFs were identified from 50 families. Among the contigs with full-length ORF, the largest transcription factor family was bZIP (33 contigs, or 8.0% of the 410 predicted transcription factors). Assuming that the distribution of TFs in sweet cherry genome is similar to the one in peach genome, the bZIP family is over-represented in fruit (51 genes or 3.4% of all TFs in peach). In peach genome, the largest transcription factor family is bHLH (130 members or 8.5% of all transcription factors); in the sweet cherry fruit transcriptome, bHLH family ranks fourth (26 contigs or 6.3%). Other TF families with at least 20 contigs with predicted full-length ORFs in Group 1F were MYB (30), C2H2 (26), WRKY (24), ERF (22) and C3H (20).

Exocarp-specific transcripts are related to epidermis development and stress responses

To systematically identify exocarp-specific transcripts, contigs with normalized FPKM values in the mesocarp samples below 1% were considered as ‘exocarp-specific’. Normalized FPKM values per sample were calculated after setting the sum of FPKM values in all 24 samples to 100%. According to this criterion, 1162 contigs in Group 1F were exocarp-specific, 219 of which contained a full-length ORF (Supplementary Table S4). Gene ontology term enrichment analyses assigned several known epidermis-related processes and functions to the exocarp-specific contigs. For instance, in the category ‘biological process’, the most over-represented GO terms (false discovery rate <0.05) were ‘wax biosynthetic process, ‘cutin biosynthetic process’ and ‘response to salicylic acid stimulus’ (Table 3). Wax and cutin biosyntheses are essential for cuticle formation, whereas salicylic acid signaling plays a major role in pathogen defense. The specific, over-represented biological processes assigned to the exocarp-specific contigs may be summarized as follows: (i) epidermis development; (ii) responses to biotic and abiotic stresses; (iii) hormone responses; and (iv) secondary metabolism (Table 3 and Supplementary Table S6). In the category ‘molecular function’, the most specific GO terms assigned to the contigs with full-length ORF were ‘carboxylesterase’, ‘transcription factor’, ‘monooxygenase’, ‘fatty acid elongase’ and ‘polyol transmembrane transporter’ activities (Table 3 and Supplementary Table S6). In the category ‘cellular component’, the only over-represented GO term was ‘apoplast’. Collectively, the GO term enrichment analyses confirmed that our ‘exocarp-specific’ contigs with full-length ORFs realistically reflect genes with preferential or specific expression in the fruit skin. The GO term enrichment analysis was repeated on all 1162 exocarp-specific contigs identified in Group 1F (Supplementary Table S6). The results were in line with the results for the contigs with predicted full-length ORFs. Expectedly though, the number of over-represented GO terms were larger for the 1162 contigs (253) than for the 219 contigs (91) (Supplementary Table S6). The most specific over-represented GO terms not identified among the 219 full-length ORF contigs included ‘protein autophosphorylation’, ‘glucosinolate’,’ cyanide’ and ‘terpenoid biosynthetic’ processes.

Table 3 Over-represented GO terms in 219 Group 1F contigs with predicted full-length open reading frame and preferential expression in the exocarp (FPKM values in the mesocarp <1% of total FPKM values in all 24 samples). GO term enrichment was determined in a Fisher’s exact test and the result reduced to most specific terms; 20 most over-represented terms are shown. Complete results including contig identifiers available as Supplementary Table S6

Full size table

Forty most abundant sweet cherry contigs and their expression patterns

Next, we examined the forty most highly represented contigs in the sweet cherry skin transcriptome (Table 4; details on the sequences in Supplementary Table S4). The most abundant contig by far was Pa_24402 with a FPKM sum of 523 781 over the 24 samples, or 4.2 million mapped reads total. Its best B2G hit was GenBank accession P50694, a glucan endo-1,3-β-glucosidase or thaumatin-like protein, the most abundant soluble protein in the ripe sweet cherry fruit.⁵⁹ Five other ‘Top 40’ contigs also had B2G hits to published P. avium sequences: Pa_11846 (P. avium allergen 1), Pa_29501 (non-specific lipid transfer protein), Pa_00414 (phenylalanine ammonia-lyase 1), Pa_21625 (thaumatin-like protein) and Pa_03818 (anthocyanidin 3-O-glucosyltransferase). Altogether, based on the best hit descriptions, 11 of the 40 most abundant contigs represent proteins likely related to abiotic and biotic stresses, such as oxidative stress and responses to pathogens. Also, three further contigs described as putative transport proteins may also be associated with defense responses such as metallothionein-like proteins⁶⁰ and lipid transport protein.⁶¹ The latter are predominantly expressed in surface tissues,⁶¹ including citrus and tomato fruit.^26,27 While relevant in constitutive and acute stress responses, many stress defense proteins were proposed to have other, developmental functions as well. For instance, Pa_10497 is a tentative S-adenosylmethionine decarboxylase proenzyme, the key enzyme in the biosynthesis of polyamines, essential for stress responses and growth.⁶² The other Top 40 transcripts encode cell wall structural proteins and expansins (eight contigs), proteins involved in protein biosynthesis or turnover (three contigs), enzymes of the flavonoid biosynthesis pathway (two contigs) and proteins related to cytoskeleton, gluconeogenesis, photosynthesis or unknown functions (one contig each) (Table 4). Nine transcripts produced no hits in the BLASTx searches against SwissProt, RefSeq or nr databases with the applied settings. In addition to the above mentioned lipid transfer proteins, many of these Top 40 contigs from sweet cherry fruit were homologous to abundant transcripts in other fruits. For instance, abundant metallothionein expression was also reported in other fruits including banana⁶³ and pineapple.⁶⁴ Furthermore, several best hit descriptions were similar to the annotations of the most abundant transcripts in the peel transcriptome of pomegranate.²⁸ Common best hit descriptions included metallothionein, tubulin, lipid transfer protein, catalase, cysteine protease and S-adenosylmethionine decarboxylase.

Table 4 Forty most abundant mRNA sequences (contigs) in the transcriptome of developing sweet cherry ‘Regina’ fruit. Additional information about these sequences is given in Supplementary Table S4

Full size table

Functions of the Top 40 contigs in sweet cherry transcriptome were further predicted by means of GO analyses. Of the 136 GO terms assigned to the Top 40 contigs, three cell wall-related terms were over-represented as compared to all Group 1F contigs (Supplementary Table S7). Other relevant GO terms in the category ‘cellular component’ were associated with vacuole, plasma membrane or plastid. In the category ‘biological process’, the most relevant GO terms included ‘response to stimulus’ and ‘developmental processes’, while in the category ‘molecular function’, the most relevant GO terms were ‘binding’ and ‘hydrolase activity’ (Supplementary Table S7).

Global expression patterns in developing sweet cherry fruit: detection of five main patterns of gene expression

We identified expression patterns or clusters on the whole transcriptome scale and focused on the exocarp-specific genes within selected clusters. Two partitional clustering algorithms implemented in the Mayday workbench⁵² were used: The neural gas (NG) algorithm was applied to separate Group 1F contigs in five large clusters, followed by quality threshold (QT) clustering for more detailed expression patterns (see the section on ‘Materials and methods’). Neural gas clusters NG1 to NG5 contained 3053 to 8881 contigs each (Supplementary Fig. S3a). Each NG cluster had a characteristic average or centroid expression pattern: elevated expression during Stages I and II (NG1), in the ovary shortly after flowering (0–10 DAFB; NG2), during Stage II and early Stage III (NG3), from mid-Stage II to maturity (NG4) and mainly in the mesocarp (NG5) (Supplementary Fig. S3a and Fig. 6a). Several over-represented GO terms were detected in each NG cluster (Supplementary Table S8). For instance, in cluster NG2, the most specific, over-represented GO terms were related to translation, transcription and cell division. These functions are consistent with the high RNA levels in the tissue collected 3 and 10 DAFB (data not shown), reflecting the high cell division activity characteristic for this early stage of fruit development. Based on the GO analysis and the expression patterns, we expected to find the most exocarp-relevant contigs in clusters NG1, NG3 and NG4. Therefore, the following analyses concentrated on these three NG clusters. Only the GO terms in the category biological process will be considered below; the complete results are available in Supplementary Table S8.

Cluster NG1 contained 285 exocarp-specific contigs. Here, the most specific, over-represented terms were wax biosynthetic process, cutin biosynthetic process and fatty acid elongation. Other significantly over-represented, specific GO terms were (also) related to cuticle development, lipid metabolism, suberin biosynthetic process, responses to abiotic and biotic stimuli, cell-to-cell signaling and trichome differentiation. Suberin is typically present in endodermis and periderm, but not in the young fruit. Possibly, contigs annotated as related to suberin biosynthesis in fact participate in cutin biosynthesis, as suberin and cutin contain similar aliphatic monomers such as ω-hydroxy acids.⁶⁵ Cluster NG3 contained 493 exocarp-specific contigs. The most specific, over-represented GO terms here were response to salicylic acid stimulus, anthocyanin accumulation in tissues in response to ultraviolet light and response to abscisic acid stimulus (Supplementary Table S8). Other over-represented GO terms included responses to biotic and abiotic factors, cell wall thickening and transmembrane transport, as well as terpenoid, polysaccharide and phenylpropane metabolism. In cluster NG4, the most specific, over-represented GO terms assigned to the 307 exocarp-specific contigs were response to salicylic acid stimulus, response to ethylene stimulus and response to bacterium. Other GO terms were related to protein autophosphorylation, response to biotic and abiotic factors, secondary metabolism and polyol transport.

To summarize these GO term enrichment results for the three NG clusters, the most typical, exocarp-specific processes during Stages I, II, and III of the sweet cherry fruit development were cuticle formation and deposition, response to abscisic acid (ABA) and response to ethylene stimuli, respectively. Throughout the development, the exocarp is specifically engaged in defense responses to various biotic and abiotic factors. The expression data support the view that constitutive expression of defense genes in the sweet cherry exocarp protects the developing fruit and contributes to developmentally regulated resistance to pathogens.^66,67 The results are also consistent with the view that the defense-related proteins may have other developmental roles as well.⁶⁸

Global expression patterns in developing sweet cherry fruit: identification of distinct patterns of coregulated genes

For fine resolution of expression patterns, the clusters NG1, NG3 and NG4 were subjected to QT clustering (minimum cluster size 20 contigs), resulting in 44, 51 and 71 QT clusters, respectively (Supplementary Fig. S3a). Selected QT clusters are shown in Figure 6b. Selected contigs are listed in Table 5 and their expression patterns are shown in Supplementary Fig. S3b. To estimate the reliability of the expression patterns obtained using RNA-seq, we additionally determined expression levels for selected contigs in Table 5 using RT-qPCR (Supplementary Fig. S4). For the seven contigs, representing transcripts putatively relevant for cuticle formation, the correlation coefficients between the expression levels determined using the two methods ranged from 0.94 to 0.99 (Supplementary Fig. S4).

Table 5 Selected contigs representing sweet cherry genes with predicted functions in cell wall modification, lipid metabolism, regulation of transcription, transport or other processes

Full size table

GO analysis detected over-represented GO terms assigned to the contigs in most of the large NG-QT clusters when compared to all Group 1F contigs. In cluster NG1-QT44 (Figure 6b), with expression maxima from 3 to 24 DAFB, over-represented GO terms were related to the cellular components chloroplast, ribosome and cell wall, and to the biological processes translation, protein-chromophore linkage and GDP-L-fucose biosynthetic process (Supplementary Table S8). Contigs homologous to cell wall modifying enzymes in this cluster included Pa_03454 (endo-1,4-β-glucanase), Pa_06233 (polygalacturonase), Pa_07062 (pectate lyase) and Pa_07115 (pectin methylesterase). Twenty-three of the 1363 contigs in cluster NG1-QT44 were exocarp-specific. Most of the over-represented GO terms associated with the exocarp-specific contigs were related to fatty acid metabolism. An example is the contig Pa_11433, whose best B2G hit was the long-chain acyl-CoA synthetase gene LACS1, involved in cutin and wax biosynthesis in Arabidopsis⁶⁹ (Supplementary Table S4). Among the exocarp-specific contigs was also Pa_22147, a tentative MYB transcription factor gene with the highest similarity to the subgroup 9 of R2R3 MYB genes, known as regulators of epidermal cell differentiation and development.⁷⁰ Its expression peaked at 10–17 DAFB. A similar temporal expression pattern has Pa_11565, a contig present in mesocarp at a very low level (FPKM value 24 DAFB in exocarp 10.3%±0.3%, in mesocarp 1.4% of total FPKM). Its best B2G hit was SNAKIN-1, a constitutive defense protein⁷¹ affecting cell division and cell wall composition in potato.⁶⁸ A SNAKIN-1 homolog was also identified in developing tomato fruit, where it was 9.87-fold more abundant in the exocarp than in the mesocarp at the end of cell division stage.²⁰

In cluster NG1-QT43, with average expression pattern similar to NG-QT44 but the expression maximum shifted to 24 DAFB (Figure 6b), the most specific, over-represented GO terms in the category biological process were folic acid metabolic process, photosynthesis and cutin biosynthetic process (Supplementary Table S8). An example of the photosynthesis-related genes in this cluster was the Top 40 contig Pa_16626, whose best B2G hit was ribulose bisphosphate carboxylase small chain. Cluster NG1-QT43 contained 56 exocarp-specific contigs. The most specific, over-represented GO terms assigned to the exocarp-specific contigs were related to lipid metabolism, in particular, cutin and wax biosyntheses (Supplementary Table S8). An example is the contig Pa_06204, which is a homolog of LACS2 in Arabidopsis and involved in the cutin biosynthesis in sweet cherry (Declercq, 2013, unpublished data). Other exocarp-specific contigs were also homologous to genes related to cuticle formation such as ATT1 (Pa_01043), HTH (Pa_10336), GPAT8 (Pa_00817), GDSL lipase (Pa_01940), KCS10 (Pa_01742) and the AP2/ERF family transcription factor WIN1 (Pa_19618) (Table 5). Most of these genes (except for the homologs of HTH and KCS10) were also among the ones identified previously as putatively involved in CM biogenesis in sweet cherry fruits, and their expression patterns here (Figure 6 and Supplementary Fig. S4B) were very similar to the ones reported.²⁹ A highly abundant contig in this cluster was Pa_02691, an ERF transcription factor homolog. With FPKM values of 1451 (±39) in exocarp and 150 in mesocarp 24 DAFB, the sequence was much more abundant in the exocarp than in mesocarp, but it was not strictly exocarp-specific. In the functional category cell wall modifying enzymes, the contig Pa_06731, homologous to pectate lyase, was somewhat more expressed in the exocarp than in the mesocarp, with a maximum at 24 DAFB (exocarp, 1466±14 FPKM; mesocarp 852 FPKM). The corresponding pectate lyase may be involved in cell wall loosening and rearrangement during the rapid cell fruit growth occurring at this time. The finding is in line with the expression peak of the apple pectate lyase MdPL1 in growing apple.⁷² In our transcriptome, a total of eight contigs contained the pectate lyase/Amb allergen motif (IPR002022), four of which were predicted to represent full-length ORFs. Three of these contigs were in cluster NG1: The above mentioned Pa_07062 (NG1-QT44), Pa_06731, and, in cluster NG1-QT18, Pa_03006. The latter one had two prominent expression maxima, 24 and 66 DAFB, while the former ones were not expressed during Stage III. Interestingly, contig Pa_03006 was the best hit of a pectate lyase from sour cherry (PcPEL1, GenBank BG319625; 98% identity), which was highly expressed during the Stage III, and only slightly upregulated during Stage I in developing sour cherry.⁷³

In cluster NG1-QT42, with expression maxima between 3 and 17 DAFB, the most specific, over-represented terms were related to ribosomes and translation. This is consistent with the high cell division activity characteristic for Stage I. Over-represented GO terms among the 29 exocarp-specific contigs of this cluster were carboxylesterase and tyrosine decarboxylase activities (Supplementary Table S8, Supplementary Table S4). All four sequences annotated with the GO term carboxylesterase activity were homologous to GDSL esterase/lipase and peaked 17 DAFB.

In cluster NG1-QT39 with expression maxima 17–24 DAFB the most specific, over-represented terms were hexose biosynthetic process, fatty acid biosynthetic process and Golgi apparatus. Among the 50 exocarp-specific contigs in this cluster was one specific, over-represented GO term: transferase activity, transferring acyl groups other than amino acyl group. The seven contigs annotated with this term included Pa_14662 (best B2G hit long-chain-alcohol O-fatty-acyltransferase WSD1, wax ester synthase/acyl-CoA:diacylglycerol acyltransferase) and Pa_00867 (best B2G hit BAHD acyltransferase DCR, DEFECTIVE IN CUTICULAR RIDGES), the former one possibly involved in wax and the latter one in cutin biosynthesis. Notable contigs in cluster NG1-QT39 further included two putative transcription factors (an AP2 family member, Pa_00973 and an MYB family member, Pa_23194, best B2G hit MYB66), a putative (hemi)cellulase (Pa_10720) and a lipid transport protein (Pa_22556, best BG2 hit uncharacterized GPI-anchored protein). The best PAVE hit of the latter contig was another assembled sweet cherry cDNA encoding a putative lipid transport protein, PaLTPG1 (GenBank JU090722), tentatively involved in cuticle deposition in sweet cherry fruit.²⁹ Other exocarp-specific, putatively cuticle-related contigs in cluster NG1-QT39 included Pa_14044 and Pa_04572. The best BG2 hit of Pa_14044 was the putative aminoacrylate hydrolase RutD from Vitis vinifera, while the contig is also the best sweet cherry hit of the Arabidopsis protein BODYGUARD (BDG) (e-value 10×10⁻¹⁶⁹, 61% amino-acid identity), an extracellular α/β superfamily hydrolase essential for cuticle formation.⁷⁴ Pa_14044 was the only contig in the assembled sweet cherry fruit transcriptome to contain the multi domain PLN03087, ‘BODYGUARD 1 domain containing hydrolase’ (NCBI conserved domains search⁷⁵). Contig Pa_04572, with the best B2G hit GDSL esterase/lipase LTL1, was the second best sweet cherry hit (e-value 10⁻¹⁵⁷, 86% similarity) of the tomato CD1 protein, a GDSL lipase and putative cutin polymerase in tomato fruit epidermis.^76,77 However, the best hit of tomato CD1 was Pa_16669 (e-value 10⁻¹⁵⁹, 86% similarity), a member of cluster NG1-QT36 (Supplementary Fig. S3a). Cluster NG-QT36 also contained Pa_22209, another exocarp-specific GDSL lipase/esterase homolog with a full-length ORF.

In cluster NG1-QT37, over-represented GO terms were related to chloroplast. No over-represented GO terms were identified in the six exocarp-specific contigs of this cluster. A putatively cuticle-relevant, exocarp-specific contig in this cluster is Pa_08841 with the best B2G hit the ERF transcription factor SHINE2 from Arabidopsis, and the best PAVE hit the previously identified SHINE family transcription factor PaWINB, potentially involved in CM formation in the sweet cherry fruit.²⁹ In total, we identified three contigs in the sweet cherry fruit transcriptome containing a full-length ORF, which most likely represent ERF family transcription factors from the SHINE clade, previously associated with regulation of cutin and wax deposition:^78,79 Pa_08841, Pa_02691 and Pa_19618 (the latter two were in cluster NG-QT43). The recently characterized SlSHINE3,⁸⁰ a regulator of cuticle development in tomato fruit and ortholog of Arabidopsis SHINE3 was most similar to Pa_08841 at the peptide level (69% positives, e-value 3×10⁻⁵⁷) and to Pa_19618 at the nucleotide level (85% identity, e-value 3×10⁻³²).

Smaller QT clusters in NG1 also contained several interesting contigs. For instance, potentially relevant for the cell wall architecture, Pa_13423 is a putative pectin methyl esterase inhibitor, highly abundant in the fruit with a maximum expression during 24–31 DAFB (NG1-QT32). A number of other contigs also shared high sequence similarity with pectin methyl esterase inhibitors, Pa_13423 being the most abundant one (total FPKM sum 25363). Indeed, its expression was much higher than that of any pectin methyl esterase homologs in the transcriptome. The unclustered contig Pa_00883 represents a putative (hemi)cellulase, present at a high level especially, while the rate of expansion growth was high (24 and 66 DAFB). Exocarp-specific contigs with probable relevance for wax biosynthesis included Pa_00478 (best hit WAX2, or CER1), Pa_06166 (3-ketoacyl-CoA synthase 6) and the Top 40 contig Pa_29501 (non-specific lipid-transfer protein, allergen Pru av 3) in cluster NG1-QT28, and the unclustered contig Pa_08907 (3-ketoacyl-CoA synthase 19). Contigs Pa_08907 and Pa_06166 were, together with Pa_01742 (NG1-QT43), the only three exocarp-specific contigs in Group 1F, which contained a full-length ORF and the InterPro motif very-long-chain 3-ketoacyl-CoA synthase (IPR012392), essential in wax biosynthesis. Pa_08907 was present at much lower levels than the other two but, interestingly, its expression peaked at equal levels during Stage I and during Stage III, while the two major KCS transcripts were abundant mainly during Stage I.

Based on the expression patterns, cluster NG1 was likely to contain most of the contigs with potential relevance to the process of our key interest, cuticle formation. Therefore, clusters NG3 and NG4 will be described in less detail than cluster NG1.

In the largest QT cluster of NG3 (NG3-QT51) the most specific, over-represented GO terms included photorespiration (or oxidative photosynthetic carbon pathway), terpene biosynthetic process and ABA 8′-hydroxylase activity. The most abundant contig with the latter GO term was Pa_06858, with an expression maximum from 38 to 45 DAFB. The contig is best hit of PacCYP707A2, the most abundant ABA 8′-hydroxylase transcript in the mesocarp of developing sweet cherry fruit.⁸¹ The expression patterns of the gene from Stage II to maturity in the exocarp of the cultivar Regina (this work) and in the mesocarp of cultivar Hongdeng⁸¹ were very similar. The consistent expression patterns underline the importance of this ABA degrading enzyme for the development of the sweet cherry fruit. Tightly regulated expression of genes in ABA turnover may account for the previously reported low and high ABA levels before and after onset of sweet cherry fruit ripening, respectively.^81,82

The second largest QT cluster of NG3 (NG3-QT50) had a striking centroid expression pattern with maximum expression levels at the beginning of growth Stage II (31 DAFB). An exocarp-specific contig in this cluster, Pa_01687 (best B2G hit polyol transporter 5), with maximal expression from 31 to 45 DAFB (Supplementary Fig. S3b), might be useful as a molecular marker for Stage II. This contig is interesting for its putative function in sugar transport. Pa_01687 is the best hit of PcSOT2 (97% similarity, e-value 0.0), a sorbitol transporter from sour cherry fruit, also highly expressed during Stage II.⁸³ Another putative sorbitol transporter sequence was nearly absent until 31 DAFB, but abundant thereafter to maturity (Pa_05395 in NG4-QT66). This developmental expression pattern is similar to the one of PcSOT1, the major sorbitol transporter accounting for sugar accumulation in maturing sour cherry.⁸³ Like Pa_01687, the other major sorbitol transporter contig, Pa_05395 was exocarp specific (Supplementary Fig. S3b). The predicted peptide sequences of Pa_05395 and PcSOT1 were to 81% similar (e-value 0.0).

In the functional category cell wall metabolism, cluster NG3 includes contig Pa_03056 (NG3-QT46), homologous to a xyloglucan endotransglucosylase/hydrolase protein, XTH33. The abundance of the contig increased sharply 45 DAFB, and stayed at a high level until 59 DAFB, after which it was detected at a very low level only (Supplementary Fig. S3b). The expression pattern suggests a specific role for the XTH gene in the cell wall rearrangements that preceed and accompany the increase in fruit growth rate (Figure 1a) and softening⁸⁴ in Stage III. Exocarp-specific contigs with putative roles for cuticle formation in cluster NG3 included Pa_04929 (best hit cytochrome P450 716B2; with a total FPKM sum of 6030 a very highly abundant sequence), Pa_05584 (transhomeobox-leucine zipper protein ANL2), Pa_05276 (WAX2), Pa_06635 (ABC-transporter, WBC11) and Pa_07137 (ABC transporter). Pa_05584 was the best hit of the Arabidopsis HD-ZIP IV transcription factor gene ANL2 (ANHOCYANINLESS2), and of the tomato ANL2 homolog, CD2 (CUTIN DEFICIENT2). In Arabidopsis and tomato, these HD-ZIP IV factor are essential regulators of cutin, wax and anthocyanin levels.^85–87 The expression pattern of Pa_05584 in sweet cherry fruit—exocarp-specific, high expression levels during stages II and III, lower but constant expression levels during Stage I—is consistent with roles in cuticular lipid and anthocyanin biosyntheses also for the sweet cherry fruit ANL2/CD2 homolog.

In cluster NG4, with expression maxima from mid-Stage II through Stage III, the largest cluster, QT71, contained 1956 contigs with mostly increasing expression levels from 31 DAFB to maturity and highest expression levels after 66 DAFB. Interesting sequences in this cluster included the putative cell wall related contigs Pa_01792 (best hit pectin methylesterase) and Pa_07321 (xyloglucan endotransglucosylase/hydrolase), tentatively active in the ripening associated softening of the fruit during Stage III.⁸⁴ In this cluster, 79 contigs represent transcription factors, such as Pa_02092 (best B2G hit NAC domain containing protein) and Pa_06479 (best B2G hit MYB75). The former contig was highly abundant throughout Stage III, while the expression of the latter one increases sharply 66 DAFB and is at maximum from 80 to 87 DAFB. Pa_06479 is the most highly abundant MYB family transcription factor sequence in the sweet cherry fruit transcriptome (total FPKM sum 6415). The predicted peptide sequence is identical with GenBank ADY15304.1, defined as R2R3-MYB transcription factor from P. avium, and highly similar (85% similarity) with PavMYB10, another previously identified R2R3 MYB factor sequence from sweet cherry.⁸⁸ PavMYB10 was identified as a main positive regulator of anthocyanin biosynthesis in the sweet cherry fruit.^88,89 Consistent with such a function, the abundance of Pa_06479 increased with reddening of the fruit (Supplementary Fig. S3b and Figure 1a and 1c). The differences between the predicted amino acid sequences of Pa_06479 and PavMYB10 are similar to the differences in varying amino-acid residues among the MYB10 protein sequences from several rosaceous species.⁸⁸

Cluster NG4-QT67, with centroid expression level elevated from 38 DAFB to maturity and an expression peak 66 DAFB, contains a putative pectate lyase, Pa_14097. With Pa_06731 (NG1-QT43), it is the most abundant pectate lyase transcript in the sweet cherry fruit transcriptome. It was the best hit of a ripening associated pectate lyase from apple (GenBank AY376878¹⁴) and the third best hit of the ripening associated pectate lyase PcPEL1 from sour cherry⁷³ (best hit of PcPEL1 was Pa_03006, NG1-QT18; see above). Contig Pa_14097 was more expressed in the exocarp than in the mesocarp (80 DAFB exocarp, 1161±43 FPKM; mesocarp, 245 FPKM), suggesting that in sweet cherry fruit, this ripening associated pectate lyase is mostly relevant for the expansion growth of the exocarp cells. Altogether, the pectate lyase/Amb allergen motif (IPR002022) was identified in eight contigs, four of which were also predicted to contain a full-length ORF. Other contigs relevant for the cell wall architecture in cluster NG4-QT67 include Pa_09347 (best hit expansin) and Pa_44853 (probable rhamnose biosynthetic enzyme). Both contigs share with Pa_14097 the sharp peak in expression level 66 DAFB, consistent with functions in cell wall re-arrangements that enable the rapid expansion growth at this developmental stage.⁹⁰ In addition to these cell wall-related contigs in cluster NG4-QT67, contig Pa_02283 (best hit cytidine triphosphate (CTP) synthase 1) also had a sharp expression peak 66 DAFB, with lower but still elevated expression from 73 to 87 DAFB. In eukaryotic cells, CTP is required for phospholipid biosynthesis and CTP synthase is the key enzyme in the process.^91,92 A function of the putative CTP synthase encoded by Pa_02283 in phospholipid biosynthesis in sweet cherry fruit is consistent with a high demand for membrane lipids during the maximum rate of fruit surface expansion 66 DAFB.

Further contigs from cluster NG4 selected for Table 5 encode enzymes involved in cuticular lipid biosynthesis, transcription factors and a sugar transporter. The unclustered, exocarp specific contig Pa_14729 encodes a putative glycerol-3-phosphate acyltransferase (GPAT); its best B2G hit was GPAT6. The contig has two expression maxima, 17–24 DAFB and 66–87 DAFB. In Arabidopsis, GPAT6 functions in cutin biosynthesis in petals and sepals.⁹³ The exocarp specific expression pattern with the peak during the highest rate of cuticle deposition (24 DAFB) in sweet cherry fruit is consistent with a function in the cutin biosynthetic pathway. Along these lines, the second maximum during Stage III is consistent with the low but detectable cuticle deposition rate at this late stage (Figure 1b) and with the increased amounts of the main cutin constituents, 9(10)-hydroxy-hexadecane-1,16-dioic acid and 9(10),16-dihydroxy-hexadecanoic acid, at maturity as compared to 22 DAFB.⁸

Conclusions

The sweet cherry fruit transcriptome database P.avium_v2.0 consists of a comprehensive set of assembled, annotated sequences representing genes that contribute to the fruit skin development and their detailed expression patterns over time (the sequences, annotation and expression data for all contigs are available as Supplementary Data S1 and Supplementary Table S4). The genome-wide analyses indicate tight developmental regulation of genes functioning in diverse processes such as transcriptional regulation, sugar transport, lipid metabolism and cell wall rearrangements. The high temporal resolution of the expression patterns presented here reveals finely tuned developmental specialization of different members of gene families. Moreover, the de novo assembled sweet cherry fruit transcriptome with 7760 full-length protein coding sequences and over 20 000 other, annotated cDNA sequences together with their developmental expression patterns are expected to accelerate molecular research on this important tree fruit crop with up to now only little genomic resources.

Conflict of interest

The authors declare no conflicts of interests.

References

Esau K . Anatomy of Seed Plants. 2nd ed. New York: John Wiley & Sons, 1977.
Google Scholar
Baur P, Stulle K, Schönherr J, Uhlig B . [Absorption of radiation in the UV-B to blue light range by leave cuticles of selected crop plants].Gartenbauwissenschaft 1998; 63: 145–152. German
Google Scholar
Martin JT, Juniper BE . The Cuticles of Plants. Edinburgh: Edward Arnold Ltd, 1970.
Google Scholar
Kerstiens G . Plant Cuticles—An Integrated Functional Approach. Oxford: BIOS Scientific Publishers, 1996.
Book Google Scholar
Lalel HJD, Singh Z, Tan SC . Distribution of aroma volatile compounds in different parts of mango fruit. J Hort Sci Biotechnol 2003; 78: 131–138.
Article CAS Google Scholar
Knoche M, Beyer M, Peschel S, Oparlakov B, Bukovac MJ . Changes in strain and deposition of cuticle in developing sweet cherry fruit. Physiol Plant 2004; 120: 667–677.
Article CAS PubMed Google Scholar
Leide J, Hildebrandt U, Reussing K, Riederer M, Vogg G . The developmental pattern of tomato fruit wax accumulation and its impact on cuticular transpiration barrier properties: effects of a deficiency in a beta-ketoacyl-coenzyme A synthase (LeCER6). Plant Physiol 2007; 144: 1667–1679.
Google Scholar
Peschel S, Franke R, Schreiber L, Knoche M . Composition of the cuticle of developing sweet cherry fruit. Phytochemistry 2007; 68: 1017–1025.
Article CAS PubMed Google Scholar
Thompson DS, Davies WJ, Ho LC . Regulation of tomato fruit growth by epidermal cell wall enzymes. Plant Cell Environ 1998; 21: 589–599.
Article CAS Google Scholar
Bargel H, Neinhuis C . Tomato (Lycopersicon esculentum Mill.) fruit growth and ripening as related to the biomechanical properties of fruit skin and isolated cuticle. J Exp Bot 2005; 56: 1049–1060.
Article CAS PubMed Google Scholar
The Tomato Genome Consortium . The tomato genome sequence provides insights into fleshy fruit evolution. Nature 2012; 485: 635–641.
Article CAS Google Scholar
Park S, Sugimoto N, Larson MD, Beaudry R, van Nocker S . Identification of genes with potential roles in apple fruit development and biochemistry through large-scale statistical analysis of expressed sequence tags. Plant Physiol 2006; 141: 811–824.
Article CAS PubMed PubMed Central Google Scholar
Janssen BJ, Thodey K, Schaffer RJ et al. Global gene expression analysis of apple fruit development from the floral bud to ripe fruit. BMC Plant Biol 2008; 8: 16.
Article CAS PubMed PubMed Central Google Scholar
Costa F, Alba R, Schouten H et al. Use of homologous and heterologous gene expression profiling tools to characterize transcription dynamics during apple fruit maturation and ripening. BMC Plant Biol 2010; 10: 1–17.
Article CAS Google Scholar
Da Silva FG, Iandolino A, Al-Kayal F et al. Characterizing the grape transcriptome. Analysis of expressed sequence tags from multiple Vitis species and development of a compendium of gene expression during berry development. Plant Physiol 2005; 139: 574–597.
Article CAS PubMed Google Scholar
Pilati S, Perazzolli M, Malossini A et al. Genome-wide transcriptional analysis of grapevine berry ripening reveals a set of genes similarly modulated during three seasons and the occurrence of an oxidative burst at vèraison. BMC Genomics 2007; 8: 428.
Article PubMed PubMed Central Google Scholar
Trainotti L, Bonghi C, Ziliotto F et al. The use of microarray µPEACH1.0 to investigate transcriptome changes during transition from pre-climacteric to climacteric phase in peach fruit. Plant Sci 2006; 170: 606–613.
Article CAS Google Scholar
Manganaris GA, Rasori A, Bassi D et al. Comparative transcript profiling of apricot (Prunus armeniaca L.) fruit development and on-tree ripening. Tree Genet Genomes 2011; 7: 609–616.
Article Google Scholar
Yin YX, Zhang XW, Fang YJ et al. High-throughput sequencing-based gene profiling on multi-staged fruit development of date palm (Phoenix dactylifera, L.). Plant Mol Biol 2012; 78: 617–626.
Article CAS PubMed PubMed Central Google Scholar
Lemaire-Chamley M, Petit J, Garcia V . Changes in transcriptional profiles are associated with early fruit tissue specialization in tomato. Plant Physiol 2005; 139: 750–769.
Article CAS PubMed PubMed Central Google Scholar
Mintz-Oron S, Mandel T, Rogachev I et al. Gene expression and metabolism in tomato fruit surface tissues. Plant Physiol 2008; 147: 823–851.
Article CAS PubMed PubMed Central Google Scholar
Yeats T, Howe K, Matas A . Mining the surface proteome of tomato (Solanum lycopersicum) fruit for proteins associated with cuticle biogenesis. J Exp Bot 2010; 61: 3759–3771.
Article CAS PubMed PubMed Central Google Scholar
Ali MB, Howard S, Chen SW et al. Berry skin development in Norton grape: distinct patterns of transcriptional regulation and flavonoid biosynthesis. BMC Plant Biol 2011; 11: 1–23.
Article CAS Google Scholar
Vecchietti A, Lazzari B, Ortugno C et al. Comparative analysis of expressed sequence tags from tissues in ripening stages of peach (Prunus persica L. Batsch). Tree Genet Genomes 2009; 5: 377–391.
Article Google Scholar
Grimplet J, Deluc LG, Tillett RL et al. Tissue-specific mRNA expression profiling in grape berry tissues. BMC Genomics 2007; 8: 184.
Article CAS Google Scholar
Matas AJ, Yeats TH, Buda GJ et al. Tissue- and cell-type specific transcriptome profiling of expanding tomato fruit provides insights into metabolic and regulatory specialization and cuticle formation. Plant Cell 2011; 23: 3893–3910.
Article CAS PubMed PubMed Central Google Scholar
Matas AJ, Agusti J, Tadeo FR, Talon M, Rose JK . Tissue-specific transcriptome profiling of the citrus fruit epidermis and subepidermis using laser capture microdissection. J Exp Bot 2010; 61: 3321–3330.
Article CAS PubMed PubMed Central Google Scholar
Ono NN, Britton MT, Fass JN, Nicolet CM, Lin DW, Tian L . Exploring the transcriptome landscape of pomegranate fruit peel for natural product biosynthetic gene and SSR marker discovery. J Integr Plant Biol 2011; 53: 800–813.
Article CAS PubMed Google Scholar
Alkio M, Jonas U, Sprink T, van Nocker S, Knoche M . Identification of putative candidate genes involved in cuticle formation in Prunus avium (sweet cherry) fruit. Ann Bot 2012; 110: 101–112.
Article CAS PubMed PubMed Central Google Scholar
Knoche M, Peschel S, Hinz M, Bukovac MJ . Studies on water transport through the sweet cherry fruit surface: II. Conductance of the cuticle in relation to fruit development. Planta 2001; 213: 927–936.
Article CAS PubMed Google Scholar
Khanal BP, Grimm E, Finger S, Blume A, Knoche M . Intracuticular wax fixes and restricts strain in leaf and fruit cuticles. New Phytol 2013; 200: 1–10.
Article CAS Google Scholar
Grimm E, Peschel S, Knoche M . Mottling on sweet cherry fruit is caused by exocarp strain. J Am Soc Hort Sci 2013; 138: 18–23.
Article Google Scholar
Grimm E, Peschel S, Becker T, Knoche M . Stress and strain in the sweet cherry skin. J Am Soc Hort Sci 2012; 137: 383–390.
Article Google Scholar
Aronesty E . ea-utils: Command-line tools for processing biological sequencing data. 2011. Available at http://code.google.com/p/ea-utils. (accessed July 2011)
Salmela L, Schroeder J . Correcting errors in short reads by multiple alignments. Bioinformatics 2011; 27: 1455–1461.
Article CAS PubMed Google Scholar
Edgar RC . Search and clustering orders of magnitude faster than BLAST. Bioinformatics 2010; 26: 2460–2461.
Article CAS PubMed Google Scholar
Zerbino DR, Birney E . Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 2008; 18: 821–829.
Article CAS PubMed PubMed Central Google Scholar
Soderlund C, Johnson E, Bomhoff M, Descour A . PAVE: program for assembling and viewing ESTs. BMC Genomics 2009; 10: 400.
Article CAS PubMed PubMed Central Google Scholar
Huang XQ, Madan A . CAP3: A DNA sequence assembly program. Genome Res 1999; 9: 868–877.
Article CAS PubMed PubMed Central Google Scholar
Langmead B, Trapnell C, Pop M, Salzberg SL . Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 2009; 10: R25.
Article CAS PubMed PubMed Central Google Scholar
Altschul SF, Madden TL, Schäffer AA et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997; 25: 3389–3402.
Article CAS PubMed PubMed Central Google Scholar
Verde I, Abbott AG, Scalabrin S et al. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat Genet 2013; 45: 487–494.
Article CAS PubMed Google Scholar
Velasco R, Zharkikh A, Affourtit J et al. The genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet 2010; 42: 833–839.
Article CAS PubMed Google Scholar
Jaillon O, Aury JM, Noel B et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 2007; 449: 463–467.
CAS PubMed Google Scholar
Swarbreck D, Wilks C, Lamesch P et al. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res 2008; 36: D1009–D1014.
Article CAS PubMed Google Scholar
Goodstein DM, Shu S, Howson R et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res 2012; 40: D1178–D1186.
Article CAS PubMed Google Scholar
Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M . Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005; 21: 3674–3676.
Article CAS PubMed Google Scholar
The Uniprot Consortium . Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic Acids Res 2012; 40: D71–D75.
Article CAS Google Scholar
Zdobnov E, Apweiler R . InterProScan—an integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001; 17: 847–848.
Article CAS PubMed Google Scholar
Du Z, Zhou X, Ling Y, Zhang ZH, Su Z . agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res 2010; 38: W64–W70.
Article CAS PubMed PubMed Central Google Scholar
Trapnell C, Williams BA, Pertea G et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 2010; 28: 511–5.
Article CAS PubMed PubMed Central Google Scholar
Battke F, Symons S, Nieselt K . Mayday—integrative analytics for expression data. BMC Bioinformatics 2010; 11: 1–10.
Article CAS Google Scholar
Lilleland O, Newsome L . A growth study of the cherry fruit. Proc Am Soc Hortic Sci 1934; 32: 291–299.
Google Scholar
Schliesky S, Gowik U, Weber APM, Bräutigam A . RNA-Seq assembly—are we there yet? Front Plant Sci 2012; 3: 1–12.
Article Google Scholar
The ENCODE Consortium. Standards, Guidelines and Best Practices for RNA-Seq V1.0. 2011. Available at http://encodeproject.org/ENCODE/protocols/dataStandards/ENCODE_RNAseq_Standards_V1.0.pdf (accessed July 2011).
Wang L, Feng Z, Wang X, Wang X, Zhang X . DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics 2010; 26: 136–138.
Article CAS PubMed Google Scholar
St Laurent G, Shtokalo D, Tackett MR et al. Intronic RNAs constitute the major fraction of the non-coding RNA in mammalian cells. BMC Genomics 2012; 13: 504.
Article CAS PubMed PubMed Central Google Scholar
Zhang H, Jin J, Tang L et al. PlantTFDB 2.0: update and improvement of the comprehensive plant transcription factor database. Nucleic Acids Res 2011; 39: D1114–D1117.
Article CAS PubMed Google Scholar
Fils-Lycaon B, Wiersma P, Kenneth C, Sautiere P . A cherry protein and its gene, abundantly expressed in ripening fruit, have been identified as thaumatin-like. Plant Physiol 1996; 111: 269–273.
Article CAS PubMed PubMed Central Google Scholar
Leszczyszyn OI, Imam HT, Blindauer CA . Diversity and distribution of plant metallothioneins: a review of structure, properties and functions. Metallomics 2013; 5: 1146–1169.
Article CAS PubMed Google Scholar
Kader JC . Lipid-transfer proteins in plants. Annu Rev Plant Phys 1996; 47: 627–654.
Article CAS Google Scholar
Kusano T, Berberich T, Tateda C, Takahashi Y . Polyamines: essential factors for growth and survival. Planta 2008; 228: 367–381.
Article CAS PubMed Google Scholar
Liu P, Goh CJ, Loh CS, Pua EC . Differential expression and characterization of three metallothionein-like genes in Cavendish banana (Musa acuminata). Physiol Plant 2002; 114: 241–250.
Article CAS PubMed Google Scholar
Moyle R, Fairbairn DJ, Ripi J, Crowe M, Botella JR . Developing pineapple fruit has a small transcriptome dominated by metallothionein. J Exp Bot 2005; 56: 101–112.
Article CAS PubMed Google Scholar
Pollard M, Beisson F, Li Y, Ohlrogge JB . Building lipid barriers: biosynthesis of cutin and suberin. Trends Plant Sci 2008; 13: 236–246.
Article CAS PubMed Google Scholar
Broekaert WF, Cammue BP, DeBolle MF, Thevissen K, DeSamblanx GW, Osborn RW . Antimicrobial peptides from plants. CRC Crit Rev Plant Sci 1997; 16: 297–323.
Article CAS Google Scholar
Develey-Riviere MP, Galiana E . Resistance to pathogens and host developmental stage: a multifaceted relationship within the plant kingdom. New Phytol 2007; 175: 405–416.
Article PubMed Google Scholar
Nahirñak V, Almasia NI, Fernandez PV et al. Potato snakin-1 gene silencing affects cell division, primary metabolism, and cell wall composition. Plant Physiol 2012; 158: 252–263.
Article CAS PubMed Google Scholar
Lü S, Song T, Kosma DK, Parsons EP, Rowland O, Jenks MA . Arabidopsis CER8 encodes LONG-CHAIN ACYL-COA SYNTHETASE 1 (LACS1) that has overlapping functions with LACS2 in plant wax and cutin synthesis. Plant J 2009; 59: 553–564.
Article CAS PubMed Google Scholar
Baumann K, Perez-Rodriguez M, Bradley D et al. Control of cell and petal morphogenesis by R2R3 MYB transcription factors. Development 2007; 134: 1691–1701.
Google Scholar
Segura A, Moreno M, Madueno F, Molina A, Garcia-Olmedo F . Snakin-1, a peptide from potato that is active against plant pathogens. Mol Plant Microbe Interact 1999; 12: 16–23.
Article CAS PubMed Google Scholar
Goulao LF, Santos J, de Sousa I, Oliveira CM . Patterns of enzymatic activity of cell wall-modifying enzymes during growth and ripening of apples. Postharvest Biol Technol 2007; 43: 307–318.
Article CAS Google Scholar
Yoo SD, Gao ZF, Cantini C, Loescher WH, van Nocker S . Fruit ripening in sour cherry: Changes in expression of genes encoding expansins and other cell-wall-modifying enzymes. J Am Soc Hort Sci 2003; 128: 16–22.
Article CAS Google Scholar
Kurdyukov S, Faust A, Nawrath C . The epidermis-specific extracellular BODYGUARD controls cuticle development and morphogenesis in Arabidopsis. Plant Cell 2006; 18: 321–339.
Article CAS PubMed PubMed Central Google Scholar
Marchler-Bauer A, Zheng CJ, Chitsaz F et al. CDD: conserved domains and protein three-dimensional structure. Nucleic Acids Res 2013; 41: D348–D352.
Article CAS PubMed Google Scholar
Yeats T, Martin L, Viart H . The identification of cutin synthase: formation of the plant polyester cutin. Nat Chem Biol 2012; 8: 8–10.
Article CAS Google Scholar
Girard AL, Mounet F, Lemaire-Chamley M et al. Tomato GDSL1 is required for cutin deposition in the fruit cuticle. Plant Cell 2012; 24: 3119–3134.
Article CAS PubMed PubMed Central Google Scholar
Aharoni A, Dixit S, Jetter R . The SHINE clade of AP2 domain transcription factors activates wax biosynthesis, alters cuticle properties, and confers drought tolerance when overexpressed in Arabidopsis. Plant Cell 2004; 16: 2463–2480.
Article CAS PubMed PubMed Central Google Scholar
Shi JX, Malitsky S, de Oliveira S et al. SHINE transcription factors act redundantly to pattern the archetypal surface of Arabidopsis flower organs. PLoS Genet 2011; 7: e1001388.
Article CAS PubMed PubMed Central Google Scholar
Shi JX, Adato A, Alkan N et al. The tomato SlSHINE3 transcription factor regulates fruit cuticle formation and epidermal patterning. New Phytol 2013; 197: 468–480.
Article CAS PubMed Google Scholar
Ren J, Sun L, Wu J et al. Cloning and expression analysis of cDNAs for ABA 8′-hydroxylase during sweet cherry fruit maturation and under stress conditions. J Plant Physiol 2010; 167: 1486–1493.
Article CAS PubMed Google Scholar
Kondo S, Inoue K . Abscisic acid (ABA) and 1-aminocyclopropane-1-carboxylic acid (ACC) content during growth of “Satohnishiki” cherry fruit, and the effect of ABA and ethephon application on fruit quality. J Hort Sci Biotechnol 1997; 72: 221–228.
Article CAS Google Scholar
Gao ZF, Maurousset L, Lemoine R, Yoo SD, van Nocker S, Loescher W . Cloning, expression, and characterization of sorbitol transporters from developing sour cherry fruit and leaf sink tissues. Plant Physiol 2003; 131: 1566–1575.
Article CAS PubMed PubMed Central Google Scholar
Kondo S, Danjo C . Cell wall polysaccharide metabolism during fruit development in sweet cherry “Satohnishiki” as affected by gibberellic acid. J Jpn Soc Hort Sci 2001; 70: 178–184.
Article CAS Google Scholar
Kubo H, Peeters AJ, Aarts MG, Pereira A, Koornneef M . ANTHOCYANINLESS2, a homeobox gene affecting anthocyanin distribution and root development in Arabidopsis. Plant Cell 1999; 11: 1217–1226.
Article CAS PubMed PubMed Central Google Scholar
Isaacson T, Kosma DK, Matas AJ et al. Cutin deficiency in the tomato fruit cuticle consistently affects resistance to microbial infection and biomechanical properties, but not transpirational water loss. Plant J 2009; 60: 363–377.
Article CAS PubMed Google Scholar
Nadakuduti SS, Pollard M, Kosma DK, Allen C, Ohlrogge JB, Barry CS . Pleiotropic phenotypes of the sticky peel mutant provide new insight into the role of CUTIN DEFICIENT2 in epidermal cell function in tomato. Plant Physiol 2012; 159: 945–960.
Article CAS PubMed PubMed Central Google Scholar
Lin-Wang K, Bolitho K, Grafton K et al. An R2R3 MYB transcription factor associated with regulation of the anthocyanin biosynthetic pathway in Rosaceae. BMC Plant Biol 2010; 10: 50.
Article CAS PubMed PubMed Central Google Scholar
Sooriyapathirana SS, Khan A, Sebolt AM et al. QTL analysis and candidate gene mapping for skin and flesh color in sweet cherry fruit (Prunus avium L.). Tree Genet Genomes 2010; 6: 821–832.
Article Google Scholar
Cosgrove DJ . Growth of the plant cell wall. Nat Rev Mol Cell Biol 2005; 6: 850–861.
Article CAS PubMed Google Scholar
Kent C . Eukaryotic phospholipid biosynthesis. Annu Rev Biochem 1995; 64: 315–343.
Article CAS PubMed Google Scholar
Chang YF, Carman GM . CTP synthetase and its role in phospholipid synthesis in the yeast Saccharomyces cerevisiae. Prog Lipid Res 2008; 47: 333–339.
Article CAS PubMed PubMed Central Google Scholar
Li-Beisson Y, Pollard M, Sauveplane V, Pinot F, Ohlrogge J, Beisson F . Nanoridges that characterize the surface morphology of flowers require the synthesis of cutin polyester. Proc Natl Acad Sci USA 2009; 106: 22008–220013.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank the Deutsche Forschungsgemeinschaft for financial support (Grant AL1373_1-1) and Thorben Sprink for sampling of the fruit and isolation of RNA for the RNA-seq analyses.

Author information

Authors and Affiliations

Institute of Horticultural Production Systems, Leibniz Universität Hannover, D-30419 Hannover, Germany
Merianne Alkio, Uwe Jonas, Myriam Declercq & Moritz Knoche
Department of Horticulture, Michigan State University, East Lansing, MI 48824-1325, USA
Steven Van Nocker

Authors

Merianne Alkio
View author publications
You can also search for this author in PubMed Google Scholar
Uwe Jonas
View author publications
You can also search for this author in PubMed Google Scholar
Myriam Declercq
View author publications
You can also search for this author in PubMed Google Scholar
Steven Van Nocker
View author publications
You can also search for this author in PubMed Google Scholar
Moritz Knoche
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Merianne Alkio.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Supplementary information

Supplementary Table S1 (XLSX 11 kb)

Supplementary Table S2 (XLSX 50 kb)

Supplementary Table S3 (XLSX 25 kb)

Supplementary Table S4 (XLSX 22360 kb)

Supplementary Table S5 (XLSX 631 kb)

Supplementary Table S6 (XLSX 590 kb)

Supplementary Table S7 (XLSX 34 kb)

Supplementary Table S8 (XLSX 5104 kb)

Supplementary Fig. S1 (PDF 709 kb)

Supplementary Fig. S2. (PDF 110 kb)

Supplementary Fig. S3. (PDF 3412 kb)

Supplementary Fig. S4. (PDF 97 kb)

Supplementary Method S1 (PDF 47 kb)

Supplementary Data S1 (TXT 34364 kb)

Supplementary Data S2 (TXT 9729 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/

Reprints and permissions

About this article

Cite this article

Alkio, M., Jonas, U., Declercq, M. et al. Transcriptional dynamics of the developing sweet cherry (Prunus avium L.) fruit: sequencing, annotation and expression profiling of exocarp-associated genes. Hortic Res 1, 11 (2014). https://doi.org/10.1038/hortres.2014.11

Download citation

Received: 20 November 2013
Accepted: 17 January 2014
Published: 12 March 2014
DOI: https://doi.org/10.1038/hortres.2014.11

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Sampling of fruit and determination of fresh mass, fruit surface area and fruit cuticle mass

Total RNA extraction

Library preparation for RNA-Seq and sequencing

Pre-processing and de novo assembly of Illumina reads

Read mapping and BLAST searches

InterProScan and gene ontology analyses

Determination of expression levels and clustering of co-expressed genes

Estimating transcript levels using reverse transcription-quantitative real-time PCR

GenBank accessions

Results and discussion

Characterization of the fruit, cDNA sequencing and de novo assembly of the sweet cherry transcriptome

Mapping of trimmed reads to the de novo assembled contigs

Annotation of the assembled contigs: BLAST searches, gene ontology analyses and InterProScan

Exocarp-specific transcripts are related to epidermis development and stress responses

Forty most abundant sweet cherry contigs and their expression patterns

Global expression patterns in developing sweet cherry fruit: detection of five main patterns of gene expression

Global expression patterns in developing sweet cherry fruit: identification of distinct patterns of coregulated genes

Conclusions

Conflict of interest

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links