The grapevine (Vitis vinifera L.) floral transcriptome in Pinot noir variety: identification of tissue-related gene networks and whorl-specific markers in pre- and post-anthesis phases

Vannozzi, Alessandro; Palumbo, Fabio; Magon, Gabriele; Lucchin, Margherita; Barcaccia, Gianni

doi:10.1038/s41438-021-00635-7

Download PDF

Article
Open access
Published: 01 September 2021

The grapevine (Vitis vinifera L.) floral transcriptome in Pinot noir variety: identification of tissue-related gene networks and whorl-specific markers in pre- and post-anthesis phases

Horticulture Research volume 8, Article number: 200 (2021) Cite this article

3622 Accesses
5 Citations
5 Altmetric
Metrics details

Subjects

Abstract

The comprehension of molecular processes underlying the development and progression of flowering in plants is a hot topic, not only because that often the products of interest for human and animal nutrition are linked to the development of fruits or seeds, but also because the processes of gametes formation occurring in sexual organs are at the basis of recombination and genetic variability which constitutes the matter on which evolution acts, whether understood as natural or human driven. In the present study, we used an NGS approach to produce a grapevine flower transcriptome snapshot in different whorls and tissues including calyx, calyptra, filament, anther, stigma, ovary, and embryo in both pre- and post-anthesis phases. Our investigation aimed at identifying hub genes that unequivocally distinguish the different tissues providing insights into the molecular mechanisms that are at the basis of floral whorls and tissue development. To this end we have used different analytical approaches, some now consolidated in transcriptomic studies on plants, such as pairwise comparison and weighted-gene coexpression network analysis, others used mainly in studies on animals or human’s genomics, such as the tau (τ) analysis aimed at isolating highly and absolutely tissue-specific genes. The intersection of data obtained by these analyses allowed us to gradually narrow the field, providing evidence about the molecular mechanisms occurring in those whorls directly involved in reproductive processes, such as anther and stigma, and giving insights into the role of other whorls not directly related to reproduction, such as calyptra and calyx. We believe this work could represent an important genomic resource for functional analyses of grapevine floral organ growth and fruit development shading light on molecular networks underlying grapevine reproductive organ determination.

Floral transcriptomes reveal gene networks in pineapple floral growth and fruit development

Article Open access 10 September 2020

Transcriptome profiling and weighted gene co-expression network analysis of early floral development in Aquilegia coerulea

Article Open access 12 November 2020

Uncovering the molecular mechanisms of russet skin formation in Niagara grapevine (Vitis vinifera × Vitis labrusca)

Article Open access 19 March 2024

Introduction

Grapevine (Vitis vinifera L.) is indisputably one of the most popular crops in the world. According to the most updated data of the International Organization of Vine and Wine¹, vines are spread over an area of more than 7 million hectares and produce annually about 75 million tons of grapes, destined to production of wine, table grapes, juices, and raisins (OIV). Therefore, in a sector that alone is worth billions of dollars, a deep comprehension of each of the thorny phases characterizing berry development is pivotal and several approaches have been applied to elucidate the changes that take place from flowering to ripe berry.

From a molecular point of view, analyses of large gene expression datasets represent a key tool to decipher the biological processes underlying the development of a specific tissue or organ. This would explain the exponential increase in the number of expression atlases developed in recent years. An expression atlas should be meant as a snapshot of the genes expressed in one or more tissues, organs, or even cells at a given phenological stage or throughout a developmental kinetics². According to the main databases dedicated to this purpose—the Bio-Analytic Resource (BAR) for plant biology³ and the Expression Atlas platform²—expression atlases are available for at least 29 plant species. Among the monocots species used to produce the most recent and complete atlases—in terms of tissues and time point analyzed—stand out Zea mays⁴, Sorghum bicolor⁵, and Hordeum vulgare⁶. Instead, for dicots species, in addition to Arabidopsis thaliana⁷ a remarkable effort was done in Glycine max^8,9, Solanum lycopersicum¹⁰, Solanum tuberosum¹¹, and Vitis vinifera L. As regards this latter, several RNA-seq-based experiments have been conducted in the last 10 years. Considering the “time” variable, transcriptional profiles in temporal kinetics are available for berry as a whole (Corvina cv.)¹², grape skin (Cabernet Sauvignon cv.)¹³, and leaf (Summer Black cv.)¹⁴. Considering the “cultivar” variable, the grape berry transcriptomes of ten different cultivars were compared to identify cultivar specific-splicing events¹⁵, while Ghan et al.¹⁶ applied the same approach in 7 wine grape cultivars to identify the common transcriptional subnetworks underlying the berry skin in the late stages of ripening. Also, the comparison among berry transcriptomic profiles of 5 Italian cultivars, each sampled at 4 progressive phenological phases¹⁷, led to the identification of common switch genes, that seem to regulate the phase transition during berry ripening. In an exhaustive study, Dal Santo et al.¹⁸ performed an RNA-seq analysis in two genotypes (Cabernet Sauvignon and Sangiovese) at two developmental stages and cultivated in three different environments over two vintages, in order to elucidate the contribution of genotype, the influence of environment and the effect of their interaction (G×E) on the berry transcriptome. Finally, the most comprehensive atlas so far produced in grapevine is based on 54 samples representing green and woody organs at different developmental phases and it was developed to infer the specific metabolic pathways characterizing each of the samples¹⁹.

The abundance of transcriptomic data relating to grape berry and its sub-tissues offers an in-depth insight into the molecular processes underlying berry growth and ripening, but it leaves unclarified most of the upstream aspects related to flower development. In fact, except for Fasoli et al.¹⁹, who considered some of the floral tissues, transcriptional data straddling the anthesis process and related to single grape whorls are lacking. Since the flower gene expression regulation at both temporal and spatial level is the cornerstone for achieving the specification of morphology and physiology of the berry, we attempted to fill this gap by dissecting the transcriptome profiles of six (calyptra, calyx, anther, filament, stigma, and ovary) and four floral tissues (calyx, stigma, ovary, stigma, and embryo) before and after anthesis, respectively. Making use of analytical tools such as the weighted-gene network coexpression analysis (WGCNA), and the analysis tau (τ) and crossing obtained results with already available data, we tried to identify those hub genes and molecular networks that specifically characterize different floral organs or tissues. This analysis made it possible to identify enriched ontological categories in the different tissues under examination and to isolate specific transcription factors expressed exclusively or predominantly in a given whorl. Furthermore, limited to genes expressed exclusively in anther or in stigma, we carried out a de novo cis-regulatory elements (CRE) analysis at the promoter level, in order to identify those motifs linked to the tissue-specific expression of selected genes. This original grapevine atlas could represent an important genomic resource for functional analyses of grapevine floral organ growth and fruit development.

Results and discussion

Global RNA-seq analysis of grapevine flower tissues

To obtain gene expression profiles for different whorls and tissues of the grapevine flower, RNA sequencing (RNA-seq) data were generated from 10 different tissues collected from flowers of V. vinifera cv Pinot noir plants grown in open fields. Six tissue samples were collected from pre-anthesis (“Before Anthesis”—BA) flowers (EL-18), namely calyptra, calyx, filament, anther, ovary, and stigma, and four tissue samples were collected after anthesis (AA; EL-26): calyx, ovary, stigma, and embryo (Fig. 1A). The HiSeq 2500 sequencing run produced a total output of 336 M of 2 × 250 bp reads (on average 12 M reads per sample), while, after filtering steps, ~312 M of reads were retained. Given the low number of reads obtained for Stigma BA and Filament BA, the third replicate from both tissues was excluded from further analyses. The cluster dendrogram analysis based on raw counts (Supplementary Fig. 1) showed a good correlation among the biological replicates of each sample, except for one replicate of Calyx post-anthesis (Calyx AA) grouped with the Embryo samples, which may be due to the high proximity of the tissues which reside very close to each other. The filtered reads deriving from the 28 samples were combined and assembled into a reference catalog, composed by 210,674 transcripts and annotated based on the PN40024 12X v1²⁰. After Transcript Per Million (TPM) calculation, 7802 genes were filtered out while 22,094 genes, corresponding to the 73.8% of the total number of genes predicted in the PN40024 12X v1 grapevine reference genome, were retained for further analyses, being expressed in at least one tissue with TPM > 1. Pre-anthesis anther (Anther BA) had the lowest number of expressed genes (15,843), whereas Calyx AA had the highest number (19,320). Finally, 13,087 genes scored a TPM value >1 in all tissues considered (Fig. 1B, Supplementary Table 1).

**Fig. 1: Overview of the *V. vinifera* cv Pinot noir floral samples used for RNA-seq analysis.**

To isolate tissue-specific genes providing evidence about the molecular mechanisms occurring in different whorls analyzed, we took advantage of different analytical approaches. Some of them are commonly used in plant genomics, such as pairwise comparison and weighted-gene coexpression network analysis (WGCNA), some others have been mainly exploited in animals or human genomics, such as the tau (τ) analysis. For the sake of clarity in Fig. 2, we reported the logical workflow of all analyses performed in this study.

**Fig. 2: Logical workflow of analyses performed in this study.**

Weighted-gene coexpression network analysis identified gene modules highly associated with specific grapevine flower whorls/tissues

WGCNA is a systems biology approach aimed at understanding networks of highly correlated genes instead of individual genes, which has been successfully applied in various genomics studies in many plant species including pineapple²¹, strawberry²², pear²³, and grapevine²⁴. In this study, coexpression networks were constructed based on pairwise correlations of gene expression trends across all sampled tissues. The 22,094 genes resulting from TPM normalization and filtering, were analyzed in order to identify gene coexpression modules using the R-package WGCNA. Genes showing variance lower than 1 were removed leaving a total number of 19,658 genes. The matrix was raised to a soft-thresholding power β = 12 to ensure a scale-free network (Fig. 3A, B). Modules are defined as clusters of highly interconnected genes, such that genes belonging to the same module, highly correlated with each other. For the present analysis, the minimum module size was set to 30, and modules with highly correlated eigengenes (based on a threshold of 0.25) were merged. The eigengene represents the first principal component of a given module and can be considered as a representative of the expression profiles of genes within that module.

**Fig. 3: Weighted-gene coexpression network analysis for grapevine RNA-seq data.**

Twenty distinct modules were identified (Fig. 3C). The modules are labeled by color and are shown in a hierarchical clustering dendrogram, in which each tree branch constitutes a module and each leaf in the branch is one gene (Fig. 3D). Next, we performed a correlation analysis between the 20 distinct modules and the 10 tissues/whorls under study (Fig. 4A).

**Fig. 4: Module-tissue association analysis.**

For each organ, at least one highly specific module was identified (r > 0.8; correlation p-value < 0.01; Fig. 4B), although, in some cases, multiple modules showed significant correlations with the same tissue and/or more tissues showed correlations with the same module. For example, the sienna3, cyan, green, and midnightblue modules were specifically correlated with Stigma BA, Calyx BA, Calyptra BA, and Embryo, respectively. The best correlations between module eigengene (ME) and tissue were between the blue module and Anther BA (r = 1, p = 2e−10), the black module and Filament BA (r = 0.97, p = 5e−06) and finally the magenta module and Calyx AA (r = 0.97, p = 4e−06) (Fig. 4B, Supplementary Table 2). To further investigate the gene constitution of the 10 modules showing the best correlation with tissues under study, two network unique properties such as gene significance (GS) and module membership (MM) were carried out. The module membership (MM) is a measure of the correlation between the expression profile of a given gene with the considered module eigengene. The gene significance (GS) is an additional network parameter, that can be also defined by the minus log of a p-value and give an estimation of the biological significance of a gene. The higher the absolute value of GSi, the more biologically significant is the i-th gene.

Abstractly speaking, if a gene has higher GS and MM, it is more meaningful with the phenotypical trait^25,26. Thus, a specific module whose MM or GS were significantly connected and associated with the anther tissue may play a more important biological role on anther determination or functionality²⁶. Although all 10 modules considered showed extremely significant correlations between GS and MM, the blue (Anther BA), black (Filament BA), and sienna3 (Stigma BA) ones showed the best correlations between MM and GS (Fig. 5). Overall, module blue was observed as the best meaningful module by its strongly positive correlations (r = 1, p < E−200 in GS vs. MM) indicating its strict involvement in anther specific molecular mechanisms. In order to understand if modules associated with different tissues were enriched in genes belonging to determined ontological categories, we conducted a gene set enrichment analysis (GSEA) on those genes showing a MM >0.9 (Fig. 6), which were considered as WGCNA hub genes.

**Fig. 5: Scatterplots of gene significance (GS) vs. module membership (MM).**

**Fig. 6: Gene set enrichment analysis.**

The analysis allowed to identify the most relevant biological networks for all the tissues except for Filament BA and Calyx AA. This could be due to both the small size of the modules and the absence of specific processes occurring in these whorls. In Calyptra BA we identified a consistent number of biological networks associated with “photosynthesis” (GO:0015979) and “photosynthesis light reaction” (GO:0019684). Within these GO categories, two genes encoding light-harvesting chlorophyll A-binding proteins (LHCA1 and LHCA4) stand out. Their expression resulted sensibly higher in calyptra (1843 and 1153 TPM, respectively; Supplementary Table 3) compared to any other tissue, in accordance with the active role of this green tissue in the photosynthesis process²⁷. The most enriched GO categories in Anther BA were “localization” (GO:0051179), “transport” (GO:0006810), and establishment of localization (GO:0051234). Something similar was observed in the Corvina expression atlas, where pollen and stamen were characterized by the strong expression of genes related to transport and cell wall structure¹⁹. In addition, “anther dehiscence” (GO:0009901), “fruit dehiscence” (GO:0010047), and “dehiscence” (GO:0009900) were the most enriched categories in Anther BA. Five polygalacturonase (PG) encoding genes (Supplementary Table 3) were included in all the categories abovementioned and exhibited the highest expression levels in Anther BA. VIT_13s0064g00760, a member of the polygalacturonase GH28 sub-family, showed an impressive transcripts accumulation (3310 TPM) in the male reproductive organ whereas its levels were always very low (from 0 to 17 TPM) in all the other tissues. The expression of PG genes in tapetum, pollen grains, stigmas, and pollinated pistils has been described in various species and implies their role in tapetum degradation, pollen maturation, pollen tube growth, and pollination²⁸. In Arabidopsis, suppressing the expression of QRT3 interferes with microspore separation after the tetrad stage²⁹, whereas in Chinese cabbage BcMF6 silencing determines smaller floral organs and a lower pollen germination rate caused by the disruption of microspore maturation³⁰. In the same species, BcMF2 and BcMF9 RNA antisense lines showed disturbed development of the pollen wall intine layer and of the pollen tube wall^31,32 and the downregulation of BcMF2 caused pollen deformity and balloon-like swelling in the pollen tube tip, together with premature tapetum formation³¹. When a soybean PG is heterologously overexpressed in Arabidopsis, inflorescence mortality is over 50%, and siliques and seeds significantly decrease in number³³. With regards to post-anthesis tissues, “recognition of pollen” (GO:0048544), “pollen-pistil interaction” (GO:0009875) and “cell recognition” (GO:0008037) were unarguably the most interesting terms enriched in Stigma AA. Most of the genes highly expressed in this tissue (and scarcely or no detected within the other tissues), as expected, were linked to the self-incompatibility locus (S-locus) and mainly represented by kinases. In Stigma AA, of interest was also the detection of 99 genes included in the “protein phosphorylation” (GO:0006468) and “phosphorylation” (GO:0016310) categories, both involved in pollen/stigma interaction processes^34,35. Calyx AA showed a significant number of genes covering categories such as “plant-type secondary cell wall biogenesis” (GO:0009834) and “cell wall biogenesis” (GO:0042546). Among them, most noteworthy is the almost exclusive expression of three distinct genes (VIT_06s0004g03050, VIT_08s0040g01970, and VIT_08s0040g02030) all belonging to the fasciclin arabinogalactan-proteins (FLA11) family. In this regard, the Arabidopsis orthologous (AT5G03170) seems to be pivotal for tensile strength and tensile modulus of elasticity, two features that match with the mechanical sustain role of calyx after fertilization and fruit set. Finally, the midnightblue module, the one that best correlates with Embryo AA, included genes mainly involved in cell ontogenesis such as “cellular component organization or biogenesis” (GO:0071840), “organelle organization” (GO:0006996), and “chromosome organization” (GO:0051276), coherently with the intense embryonal development activity (Supplementary Table 3).

WGCNA and tau analyses identified whorl-specific transcriptional regulators

In order to identify transcriptional regulators specifically expressed in different tissues and whorls analyzed in this study, hub genes belonging to the different tissue-specific modules showing MM > 0.9, were screened based on functional annotation reported in the Plant Transcription Factor database (Plant TFDB)³⁶. Considering specific genes identified by WGCNA we selected 251 TF genes representing 36 TF families. In absolute terms, the Stigma AA, Embryo AA, and Anther BA were the tissues with the largest number of specific TF-coding genes (83, 40, and 31, respectively). In contrast, Ovary AA and Calyptra BA were the tissues with the lowest number of specific TFs (7). Overall, the MYB-R2R3 (41 genes), WRKY (24), bHLH (20), NAC (16), MICK-MADS (12), and ERF (12) families were the most abundant, although with differences depending on the tissue considered (Fig. 7A; Supplementary Table 4). For example, whether in Stigma AA there was a consistent number of MYB-R2R3 (18) and WRKY (16), in Ovary BA they were lacking, leaving room to genes belonging to minor TF families, such as B3 or G2-like. While in Supplementary Table 4 we listed all tissue-specific TFs identified, in Table 1 we reported the most relevant ones based on module membership (MM > 0.9) and expression. Overall, a rather large number of transcription factors identified appeared to be related to roles in flower development, determination, or identity. Amongst them it is worth mentioning VviMYB108A (VIT_05s0077g00500)³⁷, highly expressed in Anther BA, whose Arabidopsis orthologous is a JA-inducible TF gene with an important role in stamen development and male fertility, being involved in three main aspects: filament elongation, anther dehiscence, and pollen viability³⁸. One of the most expressed TFs in stigma BA, the bHLH gene BEE1, is orthologous of the BR-responsive gene AtBEE1 that regulates stigmatic cell development in Arabidopsis³⁹. The occurrence of VIT_17s0000g03580, another BR-responsive bHLH, in stigma best ranked TFs, raises questions about the involvement of these hormones in the processes of flower development and fertilization.

**Fig. 7: Tissue-specific transcription factors based on WGCNA.**

Table 1 Top-5 tissue-specific TFs identified by WGCNA analysis (MM > 0.9) ordered by descending TPM values

Full size table

Vogler et al. hypothesized that the growth-promoting properties of the reproductive tract of Arabidopsis depend, at least partly, on BR compounds, which are provided by the cells of the reproductive tract to promote pollen germination on the stigmatic papillae, and to boost pollen tube growth for rapid double fertilization⁴¹. Another TF, VviAG1 (VIT_12s0142g00360), was highly expressed in stigma BA and its orthologous in Arabidopsis, the MADS-box gene SHATTERPROOF 2, was demonstrated to be involved in promoting stigma, style, and medial tissue development⁴². Certainly, a gene family of particular interest when it comes to floral identity is represented by floral homeotic genes, which are the basis of the ABCDE model and are well-studied genes involved in flower development⁴³. These genes, belonging to the MICK_MADS family, already characterized at the genomic level by Grimplet et al.⁴⁴, have recently been characterized from their transcriptional sub-functionalization by Palumbo et al.⁴⁰, who selected 18 MADS boxes belonging to different classes (A, B, C, D, E) and analyzed their expression in different whorls during Pinot noir flower development. Within the tissue-specific TF obtained in this study, we identified 12 TF belonging to the MICK_MADS box family. Amongst them, only 5 genes fall within those homeotic genes considered by Palumbo et al.⁴⁰, namely VviAG1 (VIT_12s0142g00360), a class C gene listed in the Stigma BA related module (sienna3), VviSEP1 (VIT_14s0083g01050), a class E gene belonging to the Ovary BA module (plum1), VviAP1 (VIT_01s0011g00100), a class A gene belonging to the Calyx BA module (cyan), VviABS1 and VviABS2 (VIT_10s0042g00820 and VIT_01s0011g01560), both belonging to the B-sister class and detected in the Embryo module (midnightblue). The absence of the other homeotic genes studied by Palumbo et al.⁴⁰ from those identified in this study is likely due to the approach used to analyze the data. By filtering by module membership, most of those genes expressed simultaneously in more than one tissue, an intrinsic characteristic of some homeotic genes on which the ABCDE model is based, were in fact set aside. Nonetheless, we identified some other MADS-box genes which appear to be expressed in different tissues and which deserve a thought. Amongst these are VviAGL17c (VIT_00s0211g00180), VviSVP1 (VIT_00s0313g00070), VviMADSD1a (VIT_07s0031g01140), VviTM8a (VIT_17s0000g01230), VviSVP2 (VIT_18s0001g07460), VviFLC2 (VIT_14s0068g01800), and VIT_15s0048g01240, an AGL20-like gene. Although some of these genes showed limited expression, some others, such as VviSVP1, VviTM8a, and VviFLC2 were significantly induced in specific whorls. VvSP1 belongs to the Anther BA-specific module (blue) and showed a transcript accumulation ~16 times higher with respect to the mean transcript accumulation of all other tissues. VviTM8a was the first ranked TF gene for expression in the calyx AA module (magenta), being ~20 times more expressed than in all other tissues. Finally, VviFLC2 was expressed preferentially in stigma AA (fold change = 9 compared to other tissues). To provide a global view of the behavior of all homeotic genes of interest, the heatmap in Fig. 7B reports all the genes analyzed by Palumbo et al.⁴⁰, together with those that have emerged from the WGCNA in this study.

VviAP1, a class A homeotic gene, was highly expressed in Calyx (both BA and AA) and in Calyptra BA, in agreement with what was observed by Palumbo et al.⁴⁰ and with previous observations in other plant species such as Arabidopsis⁴⁵, Camellia japonica⁴⁶, and Medicago trucantula⁴⁷. VviFUL1 (VIT_17s0000g04990) and VviFUL2 (VIT_14s0083g01030), the two grapevine orthologues of Arabidopsis FRUITFULL (FUL)⁴⁴, showed distinctive expression patterns. VviFUL2 was expressed in ovary and calyx before anthesis, whereas VviFUL1, was expression was generally much lower, was expressed in Calyx BA, Calyptra BA and, most intriguing, in Filament BA. Class B genes, namely VviPI, VviAP3a, and VviAP3b confirmed their role in petal and stamen identity with the highest transcript accumulation detected in Calyptra BA and stamen tissues (anther and/or filament). Amongst the B-sister genes, VviABS1 and VviABS2 perfectly matched what was expected based on Palumbo et al.⁴⁰, being exclusively expressed in Ovary BA and in embryo. For what concerns the class C genes, VviAG2 and VviAGL6a were preferentially expressed in stamen, at the level of filament, whereas VviAG1 and VviAGL6b were expressed in Stigma BA and in Calyx BA and AA, respectively. VvAG3, a class D gene, was switched off in all tissue except for embryo, as previously described by Palumbo et al.⁴⁰ and Boss et al.⁴⁸. Finally, amongst the class E genes, VvSEP1-4 transcripts were accumulated preferentially in Calyx BA and AA, with the exclusion of VviSEP1 which was only detected in pre-anthesis phase. Moreover, a relevant expression in Ovary BA was detected for VviSEP1 and VviSEP4.

Isolation of whorls/tissue-specific gene markers using the tau (τ) analysis

The WGCNA analysis, based on the identification of clusters of highly correlated genes sharing similar expression patterns across all samples, allowed the determination of groups of genes closely associated with a specific phenotypic character, represented in this study by a determined floral tissue or whorl.

This analysis has proved particularly effective in numerous studies^21,22,23, nevertheless, the fact that a particular gene is highly expressed in one tissue compared to others is a relative parameter and in some cases, it may be more useful to identify genes that are exclusively expressed in one organ and not in others: in other words, specific tissue/organ gene markers. For this purpose, we applied an algorithm generally used in transcriptomic studies on animals or humans. Such an algorithm, defined as tau (τ) algorithm⁴⁹, can determine the tissue-specificity level of each predicted gene of a given genome.

After the quantile normalization of 22,094 genes (selected because showing TPM values equal or higher than 1 in at least one of the 10 samples) and the creation of BIN profiles, the implementation of the τ algorithm led to the assignment of a value ranging from 0 (constitutively expressed in all or most of the tissues) to 1 (absolutely specific for a given tissue) to each gene. The uneven occurrence of the τ values throughout the gene set is illustrated in Fig. 8A and is coherent with what is expected based on Kryuchkova-Mostacci and Robinson-Rechavi⁴⁹. Overall, 3575 genes proved to be highly specific (HSG, tau >0.85) and, among them, 1514 resulted absolutely specific (ASG, τ = 1). The tau value only defined the “specificity” of a gene, whereas to determine which tissue the gene is specific for, the tau expression fractions (τ_ef) were calculated.

**Fig. 8: Tissue-specific gene distribution.**

Anther BA was the tissue displaying the highest number of HSG (805) and ASG (307) while, on the contrary, the lowest HSG and ASG values (155 and 57, respectively) were identified in Ovary AA (Fig. 8B). This observation is intriguing since anther BA represents the tissue that showed the lowest number of expressed genes (TPM> 1; Fig. 1), whereas Ovary AA was one of those tissues with the highest number of expressed genes. Nevertheless, this observation is partially confirmed in the Corvina expression atlas, where, out of 516 genes identified as specific flower, 229, equal to 44%, were specific for stamens and pollen¹⁹. For each tissue, the list of HSG and ASG is available as Supplementary Table 5. Similar to what was done for the genes obtained through WGCNA, also in this case the highly specific genes (HSG) resulting from the tau analysis were subjected to a GSEA to verify the presence of enriched ontological categories and the possible overlap with those obtained in the WGCNA analysis. As a matter of fact, many enriched terms identified in HSG were common with those highlighted in the WCGNA analysis (Fig. 6). This observation conferred greater robustness to the results obtained and laid the foundations for a subsequent step aimed at further narrowing down the list of key genes of interest.

Comparison between tau and WGCN analyses and determination of best optimum tissue-specific genes

The comparison between the highly specific genes (HSG) identified by the tau analysis and the tissue-specific modules identified by WGCNA (MM > 0.9) led to the identification of 1513 genes shared by the two approaches. Looking at the specific tissues under study, the percentage of common genes found its maximum in Anther BA (47.4%; Supplementary Table 2; Supplementary Fig. 2). This result is not surprising, considering that the modules isolated by WGCNA contain genes that can show high levels of expression even in those tissues that are not associated with the module itself, whereas highly specific genes (HSG) identified by (τ) algorithm represent genes that are almost exclusively expressed in that specific tissue and not in others. Nevertheless, although the results provided by the two analyses have different biological meanings, we considered those genes shared between the two approaches to be of particular interest, defining them as key hub genes. Ranking of genes by both expression and specificity is useful for anyone working on a single tissue wanting to identify a set of genes that are highly specific to the tissue, that are expressed in high enough quantities (facilitating bench work in the laboratory), and with minimal expression in other tissues (limiting off-target effects). With this aim, we retrieved the quantile normalized expression and tissue-specificity of every key hub gene detected by the WGCNA and tau analyses comparison and we used this information to create a score column. Each gene’s score was between 0 and 2 and was the sum of its tau expression fraction value (τ_ef) and its 0–1 ranged normalized expression value. For each tissue, we then considered the top 10 ranking genes based on score values, in other words, those genes with both the highest expression and specificity. While the complete list of these genes is available as Supplementary Table 6 and it is graphically represented in Supplementary Fig. 3, in Table 2 we reported the first ranking best optimum specific gene for each different tissue.

Table 2 Best optimum gene for each tissue under study based on tau analysis

Full size table

Among the most interesting optimum genes in pre-anthesis, the best optimum gene in filament was a Phospholipase A1 (VIT_15s0021g01510; τ_ef = 1 and no expression in all the other tissues). Although no information on this gene is available in grapevine, its Arabidopsis orthologous, namely AT2G44810, encodes DEFECTIVE ANTHER DEHISCENCE 1 (DAD1), a protein located in the chloroplast⁵⁰ whose phospholipase A1 activity catalyzes the late phase of the jasmonate biosynthetic pathway⁵¹. DAD1 expression appears to be restricted to stamen filaments immediately before flower opening and dad1 mutants show defects in flower opening as well as anther dehiscence and pollen maturation⁵². The massive occurrence of transcript encoding for fatty acid elongase (VIT_07s0141g00030) in anther BA (TPM = 1726, compared to an average TPM = 3 in all the others tissues) is in agreement with the grapevine expression atlas published by Fasoli et al.¹⁹, where this gene was highly expressed in stamen, considering both anther and filament tissues, and in pollen grain. Transcripts of A. thaliana orthologous AT2G26640 encoding a 3-ketoacyl-CoA-synthase 11 (KCS11) are accumulated, above all the other tissues, in stamens and pollen⁵³. It is also worth noting that AT1G68530, another Arabidopsis orthologous encoding KCS6 another protein belonging to the ketoacyl-CoA-synthase family, is the major condensing enzyme involved in stem wax and pollen coat lipid biosynthesis⁵⁴ and is highly expressed in the tapetum of anthers near maturity⁵⁵. Kcs6 mutants are male sterile⁵⁶. VIT_18s0001g14760 transcript was found to be the best optimum gene marker in stigma BA. This observation is consistent with what was observed by Fasoli et al.¹⁹ in the Corvina expression atlas. The related Arabidopsis orthologous AT1G75900 encodes a GDSL esterase/lipase EXL3, a protein belonging to the extracellular lipases (EXLs). This class of proteins is abundant in pollen coat, and its combination with lipids can interact with stigma cells, bringing the recognition signal and triggering a mechanical conduit that leads to pollen hydration⁵⁷. It is interesting to note that EXL4, another GDSL esterase/lipase protein, is localized in small granules in the tapetal cells of pollen coat⁵⁸, required for its formation and is involved in male fertility. Mutants show reduced pollen fertility, underdeveloped pollen grain coat as well as impaired water absorption and germination capacities. Amongst the best optimum genes detected in post-anthesis phase, it is noteworthy the presence of a laccase encoded by VIT_18s0164g00100 in the ovary, as confirmed in the Corvina atlas by Fasoli et al.¹⁹, where a high expression of this gene was detected in the pericarp and in the pulp of all berry developmental stages, above all in fruit set and post fruit set. The upregulation of VIT_18s0164g00100 was associated with processes directly involved in berry ripening²⁴. Finally, VIT_13s0019g01520 transcript was highly and specifically accumulated in embryo, again in agreement with the observation of the massive occurrence of this gene mRNA in fruit set and post fruit set seed made by Fasoli et al.¹⁹. The specific accumulation of a transcript coding for a stilbene synthase, VvSTS36 (VIT_16s0100g01100)⁵⁹, in the stigma AA is curious both for the fact that the induction of this gene and its paralogues had never previously been observed in this tissue, and because generally STSs are expressed in response to biotic or abiotic stresses. It is conceivable that in the post-anthesis phase, once fertilization has taken place, the stigma undergoes a senescence process. Several studies reported the accumulation of STS transcript and, consequently, of basic and complex stilbenes, in the senescence phase, probably as a response to the oxidative processes and the production of ROS which characterize senescing tissues but also in response to the accumulation of one of the main hormones linked to this process: ethylene, closely linked to the transcriptional activation of these genes. Ultimately, although the discussion of the results obtained only touched on the best optimum genes identified through the tau approach, it seems evident that most of the genes identified found confirmation in the literature. On this basis, through this tool, we believe we have made available a pertinent list of specific tissue/organ genes that may be of interest to the scientific community.

Integration of the flower and the Corvina expression atlases and Identification of enriched cis-regulating elements in flower-specific genes

Although WGCNA and tau analyses provided lists of genes of interest related to specific floral tissues or expressed exclusively in one whorl rather than another, based on samples considered in this study it was not possible to rule out the fact that these genes are also expressed in other tissues of the plant. In order to further circumscribe the number of genes of interest at the floral level, we did an additional step, excluding those transcripts whose expression was reported also in other tissues based on the V. vinifera cv Corvina expression atlas¹⁹. Based on this analysis, carried out using microarray technology, the number of flower-specific genes (FSG) was estimated 516¹⁹. We then crossed these data with those resulting from the tau and WGCN analyses, considering all HSG genes (tau >0.85) identified in this study and all those genes belonging to the 10 tissue-specific modules identified in the WGCNA (MM >0.9). Overall, 145 transcripts were found to be common between the three datasets (~28.1% of the specific flower genes identified in the Corvina atlas; Fig. 9A, Supplementary Table 7), representing genes that are expressed only in flower based on the Corvina atlas and specifically expressed in one whorl rather than another based on our data. Two-hundred-eighty-eight genes were expressed only in flower based on the Corvina atlas but did not show specificity for any whorl based on the tau/WGCNA analyses, representing genes that are homogeneously expressed in all flower tissues considered. Of the 145 genes shared by the 3 datasets, the majority was highly specific for Anther BA (113 genes; 78%) and post-anthesis stigma (22 genes; 15%). The remaining ones were HSG for Ovary BA (4 genes), Filament BA (2 genes), Stigma BA (2 genes), Calyptra BA (1 gene), and Calyx AA (1 gene) (Fig. 9B). The heatmaps in Fig. 9 report the behavior of these genes across the 54 tissues considered in the Corvina atlas (Fig. 9C) and in the different floral tissues/whorls considered in this study (Fig. 9D) whereas Supplementary Table 7 provides a list of these key hub genes for each tissue considered.

**Fig. 9: Comparison between flower-specific genes in the Corvina atlas and highly specific genes detected in this study.**

In order to study the structure of co-expressed genes promoters and to identify genetic determinants of the tissue-specific expression of tissue-specific genes, we conducted a de novo motifs discovery analysis considering the 2 kb sequences upstream the 113 and 22 found to be expressed only in anther and stigma, respectively, based on the WCGNA, tau analysis, and Corvina expression atlas. The analysis was carried out exclusively in these two tissues because they were the only ones having enough genes shared between the WGCNA analysis, the Corvina atlas and the present flower atlas, to allow a reliable de novo motifs discovery. For this purpose, we retrieved the promoter sequences of selected genes previously isolated from the 12x V1 prediction of the PN40024 genome. Using the DECOD software, we screened these sequences for novel enriched motifs with k-mer equal to 8 with respect to 10,000 2 kb promoter sequences randomly retrieved from the grapevine genome. We then used STAMP⁶⁰ to match the motifs discovered against PLACE, a database of motifs found in plant cis-acting regulatory DNA elements collected from previously published studies⁶¹. For both tissues analyzed, 10 motifs enriched in the promoters of tissue-specific genes were identified. The comparison with cis-element already deposited in the database via STAMP highlighted the 5 best matches. Among these, we only considered those CREs with a biological meaning.

In Anther BA (Fig. 9E) the first ranked motif based on the score, motif 1 (score = 4.41E−04), was similar to motifs TL1ATSAR (E-value 7.38E−07) and NODCON2GM (E-value 5.54E−05), both enriched in promoter regions of genes involved in reproductive processes. TL1ATSAR was identified in Arabidopsis and is associated with male meiocyte-expressed genes⁶² whereas NODCON2GM is a cis-regulatory element of the gene TM6, directly involved in stamens petaloidy and flower shape formation In Peonia spp.⁶³. Motif 2 (3.26E−04) was associated with POLASIG2 and GMHDLGMVSPB. POLASIG2 is highly related to a large number of flowering genes of Arabidopsis and GMHDLGMVSPB was found to be highly and specifically related to pollen tapetum-expressed genes in rice (Oryza sativa)⁶⁴. Something similar was observed for O2F2BE2S1, the best match of Motif 3. The ABREBZMRAB28 motif was found to be enriched in the promoter region of bZIP genes involved in floral development in six strawberry species and it is presumed that they also play important role in the male gametophyte⁶⁵. Moreover, in Arabidopsis, several AtbZIP genes were selected for their putative involvement in pollen development⁶⁵. Moving from anther to stigma AA, the other tissue that allowed, given the number of specific tissue genes, to carry out the analysis of the promoters, among the identified motifs whose biological significance has been confirmed by the similarity with already characterized CREs stand out SEF3MOTIFGM, NODCON2GM, and RYREPEATBNNAPA. SEF3MOTIFGM was found in the promoter region of SEP3. SEP3 sequence of Platanus acerifolia was heterologously expressed in tobacco and through a GUS assay observed to be expressed in reproductive tissues, among them also stigma⁶⁶. NODCON2GM motif is enriched in the promoter region of the poplar gene PtaRHE1, coding for a RING-H2 protein, ectopically expressed in tobacco and observed highly expressed in the stigma through a GUS assay⁶⁷. Finally, RYREPEATBNNAPA in Arabidopsis, was found to be enriched in ABA-related differentially expressed genes which were observed to be highly affected by the overexpression of MINI ZINC FINGER 1 (MIF1), a putative zinc finger protein⁶⁸. For mutants, a lot of defects were observed in floral whorls, including stigma, showing also reduction in fertility⁶⁸.

Ultimately, many of the regions identified through this approach found confirmation in the literature. An interesting investigation would be to evaluate the possible relationships existing between the transcription factors identified through WGCNA analysis (for example MYB108A, VviSVP1, and VvNAC34) and the cis-elements identified here. To date, there are several NGS approaches that could shed light on the physical interaction of these trans- and cis-factors, first the DNA Affinity Purification Protocol (DAPseq)⁶⁹. This could validate the possible interaction between these specific TFs and specific tissue/organ genes and, in association with further RNA-seq analyses, could provide numerous information about the tissue-specific transcriptional networks of the flower, as well as about the cistrome landscape of candidate TFs.

Isolation of novel housekeeping genes based on the floral and Corvina expression atlases

Looking at constitutive (housekeeping) genes, we identified 662 genes that exhibited a τ value = 0 and TPM values >100 in all tissues. The complete list (in ascending order based on standard deviation of TPM values), is available as Supplementary Table 8, while Supplementary Fig. 4 depicts the main biological networks resulting from the GO enrichment analysis of the 662 housekeeping genes here identified. The GO categories more represented were “positive regulation of translational termination” (GO:0045905), “proteasomal ubiquitin-independent protein catabolic” (GO:0010499), and “assembly of large subunit precursor of pre-ribosome” (GO:1902626) that scored, respectively, fold enrichment values of 34.15, 27.32, and 22.77. It is not surprising that the most represented categories (and the related genes) resulting from the GO enrichment analysis are involved in molecular processes common to all tissues and strictly required for cellular survival (protein synthesis and degradation). A further step was taken to verify whether the 662 constitutive floral genes found in all the tissues here analyzed, were also constitutively expressed in the 54 tissues considered in the Corvina atlas¹⁹: in this way, we managed to ascertain the qualitative expression of the entire gene list also in the Corvina atlas and to reclassify therefore these loci as “whole plant housekeeping genes”. Due to their crucial role in cellular survival and to their constitutive expression, housekeeping genes or control genes play a decisive role for mRNA levels normalization in qPCR studies. At this aim, several studies attempted the identification of optimal housekeeping genes to be used for specific grapevine tissues⁷⁰, developmental stages⁷¹, and variable physiological conditions^72,73. Some of these genes, obtained by cross-checking our RNA-seq data with the Corvina atlas, matched with the findings already available in the scientific literature regarding the ubiquitous expression of genic loci such as ACTIN 7 (VIT_04s0044g00580)⁷², RIBOSOMAL PROTEIN 60S (VIT_18s0001g06410)⁷⁴, ELONGATION FACTOR 1-alpha 1 (VIT_06s0004g03220)⁷¹, GAPDH (VIT_17s0000g10430)⁷¹, and AQUAPORIN PIP2B (VIT_13s0019g04280)⁷¹. On the other side, an exhaustive and novel list of candidate control genes never considered before is presented here. Among the best 10 optimum housekeeping genes (in terms of transcripts abundance and comparable expression levels in all tissues of this study and in the Corvina atlas too; Supplementary Fig. 4 and Supplementary Table 8) stand out four proteasome-related genes, genes involved in exocytosis (RAB GTPase ARA3), RNA transcription (CTV.22), and protein synthesis (EIF-3E and EIF-4A3) stands out a member of the Pollen Ole e 1 allergen and extensin (AtPOE1) family, initially identified as a group of allergens and recently recognized as developmental regulators in many plant tissues³⁸. A further qPCR validation step is needed to evaluate whether these genes could represent a valid alternative to those commonly used in expression analyses.

Conclusions

Although grapevine is a plant species mainly propagated by agamic way, nowadays, the understanding of the molecular mechanisms that lead to the ontogenetic determination of the different organs within the flower is a topic of great interest. The reasons behind this statement are many: (i) the importance of conventional genetic improvement of varieties as well as rootstocks is increasingly affirming, in contrast to the historical and cultural legacies that see viticulture as an extremely conservative discipline; (ii) flowering represents the first step of the reproductive phase that will lead to the development of the main vine product, the berry; (iii) the structure of the flower and the architecture of the inflorescence influence the organization of the bunch, with many consequences at the production level; (iv) the flower represents the main source of tissues (anthers and filaments) used for in vitro regeneration and, indirectly, for genetic improvement through new breeding techniques. In this study, we generated 30 grapevine RNA-seq datasets for different whorls and tissue of V. vinifera cv Pinot noir flowers in pre-anthesis (E–L 18, 8 days before anthesis) and post-anthesis (EL-26, 6 days after anthesis) stages and integrated them with a previously published grapevine cv Corvina expression atlas, which included several flower samples at different development stages¹⁹. Both WGCNA and tau analysis were used to analyze the RNA-seq data and identify tissue-specific gene modules or marker genes and many of these were identified by both methods. Both analyses have advantages and disadvantages, depending on the objectives of the work and the biological questions to be answered. In this study, which has as its main objective the development of a transcriptomic reference for functional studies on flower-specific genes in grapevine, we tried to combine both approaches, identifying key hub genes which specifically characterize different flower organs before and after anthesis. Although this work is configured as a descriptive study at a wide genome level, the provision of numerous data relating to specific genes for every single tissue or whorls considered represents an important resource for the scientific community of the vine.

Materials and methods

Plant material and sample collection

Flower materials (V. vinifera L. cv Pinot noir, clone 115, grafted onto Kober 5BB rootstock) were retrieved on May 2018 from a germplasm collection established in 2009 in the experimental farm “Lucio Toniolo” in Legnaro (University of Padova, Padova, Italy; 45°21′5,68″N 11°57′2,71″E). The soil texture was as follows: 46% sand, 24% clay, and 30% loam; pH = 7.9; electric conductivity, 112 μS; and organic carbon, 1.1%. Specifically, in May 14th (E–L 18, 8 days before anthesis) and May 28th (E–L 26, 6 days after anthesis), three inflorescences, each of which collected from an individual plant (1 inflorescence × 3 plants) were sampled and snap-frozen in liquid nitrogen. Each flower was then rapidly dissected in cold conditions into the relative whorls with the aid of a stereoscope and a scalpel. From pre-anthesis flowers, calyptra (or cap), calyx, anther, filament, ovary, and stigma were collected whereas after anthesis—being the cap and the male tissues released—the inflorescence was dissected into calyx, ovary, stigma, and embryo. Considering both stages, ten tissues were isolated, each in three biological replicates (n = 30).

RNA purification, library preparation, and sequencing

For each sample, ~50 mg of tissue were ground in liquid nitrogen, and total RNA was purified using the “Spectrum Plant Total RNA Kit” (Sigma-Aldrich, St. Louis, MO, USA) following the instruction provided by the manufacturer. The integrity of total RNA was checked on 1% (w/v) agarose gel (Life Technologies, Carlsbad, CA, USA) stained with 1 x SYBR Safe DNA Gel Stain (Life Technologies) while the quality (in terms of 260/280 and 260/280 ratios) and the quantity were spectrophotometrically evaluated using NanoDrop-1000 (Thermo Scientific, Wilmington, MA, USA). RNA was stored at −80 °C until use.

cDNA libraries construction and sequencing were performed as described by Chitarrini et al.⁷⁵. Briefly, 1 µg of total RNA was used to construct stranded mRNA-seq libraries (KAPA Stranded mRNA-Seq Kit, Kapa Biosystems, Woburn, MA, USA), that were later barcoded using the KAPA Dual-Indexed Adapter Kit (Kapa Biosystems). Libraries were then quantified (KAPA Library Quantification Kit, Kapa Biosystems) using a LightCycler 480 (Roche, Mannheim, Germany), checked in terms of correct size (250–280 bp) with a Tapestation 2200 (Agilent Technologies, Santa Clara, CA, USA) and High Sensitivity D1000 ScreenTape assay (Agilent Technologies) and, finally, multiplexed. Sequencing was performed on an Illumina HiSeq 2500 platform (Rapid Run Mode, Illumina, Inc., San Diego, CA, USA) to generate 2 × 250 bp reads. All raw reads were deposited in the NCBI SRA database with accession numbers SRR14777742–SRR14777769.

RNA-seq analysis

FastQC software v.0.11.9⁷⁶ was used to summarize analysis results and to verify the overall quality of the sequencing output while fastp v.0.36⁷⁷ was used to trim the Illumina adapters, merge the reads and filter the sequences based on phred quality score (removed if Q<30). Trinity software v2.8.5⁷⁸ was used in de novo mode to assemble raw reads deriving from the 28 samples (two samples, one from stigma and one from filament were excluded) into a single reference catalog as demanded by Salmon⁷⁹ for quantifying transcript abundance from RNA-seq reads. Trinity was run with default parameter by setting the minimum contig length to 200 and k-mer value at 25. The resulting catalog was then annotated based on the PN40024 12X v1 grapevine reference genome assembly (29,971 genes⁸⁰) and using the BLASTn algorithm⁸¹. Since the average number of raw reads produced per sample (~12 million) did not reach the minimum threshold required to estimate possible alternative transcripts (i.e., 30–60 million per samples⁸²), all the putative isoforms (e.g., i1, i2, i3) produced by Trinity and therefore deriving from the same gene locus (e.g., VIT_18s0166g00210), were annotated under the same transcript name (e.g., VIT_18s0166g00210.01). To quasi-map and quantify RNA-seq reads with Salmon software v.0.14.1⁷⁹, we built an index based on the newly assembled catalog of flower transcripts. The ‘decoys’ option was used to build a decoy-aware index by employing the entire genome as the decoy sequence. The RNA-seq reads of each sample were then quantified and their abundance in terms of transcripts per million (TPM) was calculated. As recommended when using the decoy-aware index, we used the ‘validateMappings’ option to mitigate potential spurious mapping of reads arising from unannotated genomic loci sequence-similar to annotated transcriptomic loci.

Weighted-gene correlation-network analysis (WGCNA)

In order to identify clusters (modules) of highly correlated genes attributable to a specific tissue, coexpression networks were constructed using the WGCNA 1.70–3 (https://horvath.genetics.ucla.edu/html/CoexpressionNetwork/Rpackages/WGCNA/) package⁸³ in R-studio Version 1.3.1093, R version 4.0.3⁸⁴. The analysis was performed on 19,658 genes showing a mean TPM equal to or greater than 1 in at least one tissue and variance higher than 1, while the remaining 10,230 genes were filtered out. Parameters used in the analysis were set as follows: weighted network, signed; hierarchical clustering tree, Dynamic Hybrid Tree Cut algorithm; power = 12; minModuleSize = 30. As a first step in the analysis, a matrix of pairwise correlations between all genes across the 10 tissues was built. Then, the matrix was raised to a given soft-thresholding power based on the criterion of approximate scale-free topology and pickSoftThreshold function (R² > 0.9) to obtain an adjacency matrix. In order to identify modules of co-expressed genes, the topological overlap-based dissimilarity was constructed^85,86 and used as input to perform the average linkage hierarchical clustering. Modules with highly correlated eigengenes were then merged (mergeCutHeight = 0.25). The association between merged modules and tissues/organs was tested calculating each module eigengene, defined as the first principal component of a PCA on the gene expression of all genes within the module. For each gene, total and intramodular connectivity (function softConnectivity), kME (for modular membership, also known as eigengene-based connectivity), and kME-P value were calculated, resulting in 20 tissue-specific modules. Genes belonging to each module were subjected to a GO enrichment analysis using the online tool ShinyGO⁸⁷.

Identification of tissue-specific genes and genes constitutively expressed in all floral tissues

The tissue-specificity level of each gene was calculated according to the tau (τ) algorithm⁸⁸, which was demonstrated to be the best performing method to measure expression specificity in a benchmark study by Kryuchkova-Mostacci and Robinson-Rechavi⁴⁹. Tau, whose values vary from 0 (broadly expressed) to 1 (tissue-specific), was calculated using the tispec R-package (https://rdrr.io/github/roonysgalbi/tispec). Data were first normalized removing all genes whose expression was <1 TPM in any tissue and then, in order to make cross-tissue comparisons possible, a quantile normalization on the entire dataset was accomplished. Thereafter, for each tissue, a BIN value ranging from 0 (lowest expression) to 10 (highest expression) was attributed to each gene. The specificity of each gene (considering all tissues) was calculated implementing the τ algorithm:

$$\uptau = \frac{{\mathop {\sum }\nolimits_{i = 1}^N (1 - x_i)}}{{N - 1}}$$

where N is the number of tissues and x_i is the expression value normalized by the highest expression.

Absolutely specific genes (ASGs) were defined as genes expressed in a single tissue only and indicated by a τ value of 1; highly specific genes (HSGs) were genes with relatively highly enriched expression in a few tissues and defined by a τ value of at least 0.85. Finally, genes were considered constitutively expressed in all floral tissues if τ value was <0.2. The plotDensity function was then used to plot the tau value of every gene and visualize which tau values occur most often. Finally, for each tissue, the specificity of each gene was calculated as τ expression fraction (τ_ef):

$${\uptau}_{{{{\mathrm{ef}}}}} = {\uptau}\frac{{qn}}{{{\mathrm{max}}}}$$

where qn is the quantile normalized expression and max is the highest quantile normalized expression. The function getTissue was used to retrieve the quantile normalized expression and tissue-specificity of every gene in each tissue and to create a score value between 0 and 2. This value represents the sum of its τ_ef value and its 0–1 ranged normalized expression value. Ranking of genes by both expression and specificity was used to identify a set of 10 optimal genes that were highly specific for a given tissue, highly expressed, and with minimal expression in other tissues. The online tool VENNY 2.1 (https://bioinfogp.cnb.csic.es/tools/venny/) was used to highlight any possible overlapping among HSGs (resulting from the τ_ef analysis) of a given tissue and genes belonging to the cluster (resulting from the WGCNA analysis) significantly most associated with the tissue. Finally, a list of constitutive genes was drawn up retaining those loci that—in all tissues—scored a τ_ef value =0 and a TPM value >100.

Identification of cis- and trans-regulating factors in genes of interest

To identify genes coding for transcription factors, gene IDs identified by WGCNA and tau analysis were screened against the Plant TFDB³⁶. To identify cis-regulatory element (CRE), the promoter sequences (2 kb) of the specific genes selected for anther BA and stigma AA were retrieved from the 12x V1 annotation of PN40024²⁰. The de novo identification of motifs enriched in the promoters of these genes was carried out using the DECOD software⁸⁹, using as a background a collection of 10,000 promoter sequences obtained randomly from the grape genome. The analysis was performed using k-mers of 8 nucleotides (default parameter), a maximum of 10 motifs identified and 50 iterations. Once the enriched motifs were identified, they were submitted to the online tool STAMP⁶⁰, in order to identify any CRE already characterized in previous studies.

Data availability

Raw Illumina sequence data were deposited in the National Center for Biotechnology Information (NCBI) and can be accessed in the sequence read archive (SRA) database (https://www.ncbi.nlm.nih.gov/sra). The accession number is PRJNA736298 and includes 28 accession items (SRR14777742–SRR14777769). All data generated or analyzed during this study are included in this published article and its supplementary information files.

References

OIV Statistical. Report on World Vitiviniculture, produced by the OIV (International Organization of Vine and Wine), https://www.oiv.int/public/medias/6371/oiv-statistical-report-on-world-vitiviniculture-2018.pdf (2018). Last accessed 30-06-2021.
Papatheodorou, I. et al. Expression Atlas: gene and protein expression across multiple studies and organisms. Nucleic Acids Res. 46, D246–D251 (2018).
Article CAS PubMed Google Scholar
Provart, N. The Bio-Analytic Resource: Gene expression and protein tools, Centre for the Analysis of Genome Evolution and Function (Canada Foundation for Innovation to NJP) & Arabidopsis Research Group (Department of Cell and Systems Biology, University of Toronto), http://www.bar.utoronto.ca/ (2020). Last accessed 30-06-2021.
Stelpflug, S. C. et al. An expanded maize gene expression atlas based on RNA sequencing and its use to explore root development. Plant Genome 9, 1–16 (2016).
Article CAS Google Scholar
Wang, B. et al. A comparative transcriptional landscape of maize and sorghum obtained by single-molecule sequencing. Genome Res. 28, 921–932 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mayer, K. F. X. et al. A physical, genetic and functional sequence assembly of the barley genome. Nature 491, 711–716 (2012).
Article CAS PubMed Google Scholar
Cheng, C. Y. et al. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. Plant J. 89, 789–804 (2017).
Article CAS PubMed Google Scholar
Severin, A. J. et al. RNA-Seq Atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol. 10, 160 (2010).
Article PubMed PubMed Central CAS Google Scholar
Libault, M. et al. An integrated transcriptome atlas of the crop model Glycine max, and its use in comparative analyses in plants. Plant J. 63, 86–99 (2010).
CAS PubMed Google Scholar
Sato, S. et al. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).
Article CAS Google Scholar
Xu, X. et al. Genome sequence and analysis of the tuber crop potato. Nature 475, 189–195 (2011).
Article CAS PubMed Google Scholar
Venturini, L. et al. De novo transcriptome characterization of Vitis vinifera cv. Corvina unveils varietal diversity. BMC Genomics 14, 41 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. & Li, R. Grape skin transcriptome to reveal genes related with resveratrol accumulation. (Northwest A&F University: Xianyang, Cina, 2015). https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA306731 Last accessed: 30-06-2021.
Pervaiz, T. et al. Transcriptomic analysis of grapevine (cv. Summer Black) Leaf, using the illumina platform. PLoS ONE 11, e0147369 (2016).
Article PubMed PubMed Central CAS Google Scholar
Potenza, E. et al. Exploration of alternative splicing events in ten different grapevine cultivars. BMC Genomics 16, 706 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ghan, R. et al. The common transcriptional subnetworks of the grape berry skin in the late stages of ripening. BMC Plant Biol. 17, 94 (2017).
Article PubMed PubMed Central CAS Google Scholar
Palumbo, M. C. et al. Integrated network analysis identifies fight-club nodes as a class of hubs encompassing key putative switch genes that induce major transcriptome reprogramming during grapevine development. Plant Cell 26, 4617–4635 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dal Santo, S. et al. Grapevine field experiments reveal the contribution of genotype, the influence of environment and the effect of their interaction (G×E) on the berry transcriptome. Plant J. 93, 1143–1159 (2018).
Article CAS PubMed Google Scholar
Fasoli, M. et al. The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program. Plant Cell 24, 3489–3505 (2012).
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
Article CAS PubMed Google Scholar
Wang, L. et al. Floral transcriptomes reveal gene networks in pineapple floral growth and fruit development. Commun. Biol. 3, 500 (2020).
Shahan, R. et al. Consensus coexpression network analysis identifies key regulators of flower and fruit development in wild strawberry. Plant Physiol. 178, 202–216 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, X. et al. Comparative transcriptomic analysis provides insight into the domestication and improvement of pear (P. pyrifolia) fruit. Plant Physiol. 180, 435–452 (2019).
Article CAS PubMed PubMed Central Google Scholar
Guo, D. L., Wang, Z. G., Pei, M. S., Guo, L. L. & Yu, Y. H. Transcriptome analysis reveals mechanism of early ripening in Kyoho grape with hydrogen peroxide treatment. BMC Genomics 21, 1–18 (2020).
Article Google Scholar
Langfelder, P., Mischel, P. S. & Horvath, S. When is hub gene selection better than standard meta-analysis? PLoS ONE 8, e61505 (2013).
Lou, Y. et al. Characterization of transcriptional modules related to fibrosing-NAFLD progression. Sci. Rep. 7, 1–12 (2017).
Article CAS Google Scholar
Lebon, G., Brun, O., Magné, C. & Clément, C. Photosynthesis of the grapevine (Vitis vinifera) inflorescence. Tree Physiol. 25, 633–639 (2005).
Article CAS PubMed Google Scholar
Yang, Y., Yu, Y., Liang, Y., Anderson, C. T. & Cao, J. A profusion of molecular scissors for pectins: classification, expression, and functions of plant polygalacturonases. Front. Plant Sci. 9, 1–16 (2018).
Article Google Scholar
Rhee, S. Y., Osborne, E., Poindexter, P. D. & Somerville, C. R. Microspore separation in the quartet 3 mutants of arabidopsis is impaired by a defect in a developmentally regulated polygalacturonase required for pollen mother cell wall degradation. Plant Physiol. 133, 1170–1180 (2003).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Q., Huang, L., Liu, T., Yu, X. & Cao, J. Functional analysis of a pollen-expressed polygalacturonase gene BcMF6 in Chinese cabbage (Brassica campestris L. ssp. chinensis Makino). Plant Cell Rep. 27, 1207–1215 (2008).
Article CAS PubMed Google Scholar
Huang, L. et al. The polygalacturonase gene BcMF2 from Brassica campestris is associated with intine development. J. Exp. Bot. 60, 301–313 (2009).
Article CAS PubMed Google Scholar
Huang, L. et al. BcMF9, a novel polygalacturonase gene, is required for both Brassica campestris intine and exine formation. Ann. Bot. 104, 1339–1351 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wang, F. et al. A global analysis of the polygalacturonase gene family in soybean (glycine max). PLoS ONE 11, 1–23 (2016).
Google Scholar
Rundle, S. J., Nasrallah, M. E. & Nasrallah, J. B. Effects of inhibitors of protein serine/threonine phosphatases on pollination in Brassica. Plant Physiol. 103, 1165–1171 (1993).
Article CAS PubMed PubMed Central Google Scholar
Hiscock, S. J., Doughty, J. & Dickinson, H. G. Synthesis and phosphorylation of pollen proteins during the pollen-stigma interaction in self-compatible Brassica napus L. and self-incompatible Brassica oleracea L. Sex. Plant Reprod. 8, 345–353 (1995).
Article Google Scholar
Jin, J. et al. PlantTFDB 4.0: Toward a central hub for transcription factors and regulatory interactions in plants. Nucleic Acids Res. 45, D1040–D1045 (2017).
Article CAS PubMed Google Scholar
Wong, D. C. J. et al. A systems-oriented analysis of the grapevine R2R3-MYB transcription factor family uncovers new insights into the regulation of stilbene accumulation. DNA Res. 00, dsw028 (2016).
Google Scholar
Mandaokar, A. & Browse, J. MYB108 acts together with MYB24 to regulate jasmonate-mediated stamen maturation in Arabidopsis. Plant Physiol. 149, 851–862 (2009).
Article CAS PubMed PubMed Central Google Scholar
Crawford, B. C. W. & Yanofsky, M. F. Half filled promotes reproductive tract development and fertilization efficiency in Arabidopsis thaliana. Development 138, 2999–3009 (2011).
Article CAS PubMed Google Scholar
Palumbo, F., Vannozzi, A., Magon, G., Lucchin, M. & Barcaccia, G. Genomics of flower identity in grapevine (Vitis vinifera L.). Front. Plant Sci. 10, 1–15 (2019).
Article Google Scholar
Vogler, F., Schmalzl, C., Englhart, M., Bircheneder, M. & Sprunck, S. Brassinosteroids promote Arabidopsis pollen germination and growth. Plant Reprod. 27, 153–167 (2014).
Article CAS PubMed Google Scholar
Colombo, M. et al. A new role for the SHATTERPROOF genes during Arabidopsis gynoecium development. Dev. Biol. 337, 294–302 (2010).
Article CAS PubMed Google Scholar
Krizek, B. A. & Fletcher, J. C. Molecular mechanisms of flower development: an armchair guide. Nat. Rev. Genet. 6, 688–698 (2005).
Article CAS PubMed Google Scholar
Grimplet, J., Martínez-zapater, J. M. & Carmona, M. J. Structural and functional annotation of the MADS-box transcription factor family in grapevine. BMC Genomics 1–23. https://doi.org/10.1186/s12864-016-2398-7 (2016).
Bowman, J., Alvarez, J., Weigel, D., Meyerowitz, E. & Smyth, D. R. Control of flower development in Arabidopsis thaliana by APETALA1 and interacting genes. Development 119, 721–743 (1993).
Article CAS Google Scholar
Sun, Y., Fan, Z., Li, X., Li, J. & Yin, H. The APETALA1 and FRUITFUL homologs in Camellia japonica and their roles in double flower domestication. Mol. Breed. 33, 821–834 (2014).
Article CAS Google Scholar
Roque, E., Gómez-Mena, C., Ferrándiz, C., Beltrán, J. & Cañas, L. in Functional Genomics in Medicago truncatula - Methods and Protocols (eds. Cañas, L. A. & Beltrán, P. J.) 273–290 (Humana Press, 2018).
Boss, P. K., Buckeridge, E. J., Poole, A. & Thomas, M. R. New insights into grapevine flowering. Funct. Plant Biol. 30, 593–606 (2003).
Article CAS PubMed Google Scholar
Kryuchkova-Mostacci, N. & Robinson-Rechavi, M. A benchmark of gene expression tissue-specificity metrics. Brief. Bioinform. 18, 205–214 (2017).
CAS PubMed Google Scholar
Seo, Y. S., Kim, E. Y., Kim, J. H. & Kim, W. T. Enzymatic characterization of class I DAD1-like acylhydrolase members targeted to chloroplast in Arabidopsis. FEBS Lett. 583, 2301–2307 (2009).
Article CAS PubMed Google Scholar
Hyun, Y. et al. Cooperation and functional diversification of two closely related galactolipase genes for jasmonate biosynthesis. Dev. Cell 14, 183–192 (2008).
Article CAS PubMed Google Scholar
Ishiguro, S. et al. SHEPHERD is the Arabidopsis GRP94 responsible for the formation of functional CLAVATA proteins. EMBO J. 21, 898–908 (2002).
Article CAS PubMed PubMed Central Google Scholar
Joubès, J. et al. The VLCFA elongase gene family in Arabidopsis thaliana: phylogenetic analysis, 3D modelling and expression profiling. Plant Mol. Biol. 67, 547–566 (2008).
Article PubMed CAS Google Scholar
Kunst, L. & Samuels, A. L. Biosynthesis and secretion of plant cuticular wax. Prog. Lipid Res. 42, 51–80 (2003).
Article CAS PubMed Google Scholar
Hooker, T. S., Millar, A. A. & Kunst, L. Significance of the expression of the CER6 condensing enzyme for cuticular wax production in Arabidopsis. Plant Physiol. 129, 1568–1580 (2002).
Article CAS PubMed PubMed Central Google Scholar
Millar, A. A. et al. CUT1, an Arabidopsis gene required for cuticular wax biosynthesis and pollen fertility, encodes a very-long-chain fatty acid condensing enzyme. Plant Cell 11, 825–838 (1999).
Article CAS PubMed PubMed Central Google Scholar
Preuss, D. Sexual signaling on a cellular level: lessons from plant reproduction. Mol. Biol. Cell 13, 1803–1805 (2002).
Article CAS PubMed PubMed Central Google Scholar
Mayfield, J. A., Fiebig, A., Johnstone, S. E. & Preuss, D. Gene families from the Arabidopsis thaliana pollen coat proteome. Science 292, 2482–2485 (2001).
Article CAS PubMed Google Scholar
Vannozzi, A., Dry, I. B., Fasoli, M., Zenoni, S. & Lucchin, M. Genome-wide analysis of the grapevine stilbene synthase multigenic family: genomic organization and expression profiles upon biotic and abiotic stresses. BMC Plant Biol. 12, 130 (2012).
Mahony, S. & Benos, P. V. STAMP: a web tool for exploring DNA-binding motif similarities. Nucleic Acids Res. 35, 253–258 (2007).
Article Google Scholar
Higo, K., Ugawa, Y., Iwamoto, M. & Korenaga, T. Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 27, 297–300 (1999).
Article CAS PubMed PubMed Central Google Scholar
Li, J., Yuan, J. & Li, M. Characterization of putative cis-regulatory elements in genes preferentially expressed in arabidopsis male meiocytes. Biomed Res. Int. 2014, 708364 (2014).
Shu, Q. et al. Analysis of the formation of flower shapes in wild species and cultivars of tree peony using the MADS-box subfamily gene. Gene 493, 113–123 (2012).
Article CAS PubMed Google Scholar
Hobo, T. et al. Various spatiotemporal expression profiles of anther-expressed genes in rice. Plant Cell Physiol. 49, 1417–1428 (2008).
Article CAS PubMed PubMed Central Google Scholar
Liu, H. et al. Genome-wide analysis and evolution of the bZIP transcription factor gene family in six Fragaria species. Plant Syst. Evol. 303, 1225–1237 (2017).
Lu, S. et al. Isolation and functional characterization of the promoter of SEPALLATA3 gene in London plane and its application in genetic engineering of sterility. Plant Cell. Tissue Organ Cult. 136, 109–121 (2019).
Article CAS Google Scholar
Mukoko Bopopi, J. et al. Ectopic expression of PtaRHE1, encoding a poplar RING-H2 protein with E3 ligase activity, alters plant development and induces defence-related responses. J. Exp. Bot. 61, 297–310 (2010).
Article PubMed CAS Google Scholar
Hu, W. & Ma, H. Characterization of a novel putative zinc finger gene MIF1: Involvement in multiple hormonal regulation of Arabidopsis development. Plant J. 45, 399–422 (2006).
Article CAS PubMed Google Scholar
Bartlett, A. et al. Mapping genome-wide transcription-factor binding sites using DAP-seq. Nat. Protoc. 12, 1659–1672 (2017).
Article CAS PubMed PubMed Central Google Scholar
González-Agüero, M. et al. Identification of two putative reference genes from grapevine suitable for gene expression analysis in berry and related tissues derived from RNA-Seq data. BMC Genomics 14, 878 (2013).
Reid, K. E., Olsson, N., Schlosser, J., Peng, F. & Lund, S. T. An optimized grapevine RNA isolation procedure and statistical determination of reference genes for real-time RT-PCR during berry development. BMC Plant Biol. 6, 1–11 (2006).
Article CAS Google Scholar
Selim, M. et al. Identification of suitable reference genes for real-time RT-PCR normalization in the grapevine-downy mildew pathosystem. Plant Cell Rep. 31, 205–216 (2012).
Article CAS PubMed Google Scholar
Luo, M. et al. Selection of reference genes for miRNA qRT-PCR under abiotic stress in grapevine. Sci. Rep. 8, 1–11 (2018).
Google Scholar
Gamm, M. et al. Identification of reference genes suitable for qRT-PCR in grapevine and application for the study of the expression of genes involved in pterostilbene synthesis. Mol. Genet. Genomics 285, 273–285 (2011).
Article CAS PubMed Google Scholar
Chitarrini, G. et al. Two-omics data revealed commonalities and differences between Rpv12- and Rpv3-mediated resistance in grapevine. Sci. Rep. 10, 1–15 (2020).
Article CAS Google Scholar
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central CAS Google Scholar
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Article CAS PubMed Google Scholar
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Article CAS PubMed PubMed Central Google Scholar
Canaguier, A. et al. A new version of the grapevine reference genome assembly (12X.v2) and of its annotation (VCost.v3). Genomics Data 14, 56–62 (2017).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Illumina Inc. Considerations for RNA-Seq read length and coverage. Different RNA-Seq experiment types require different sequencing read lengths and depth. https://emea.support.illumina.com/bulletins/2017/04/considerations-for-rna-seq-read-length-and-coverage-.html (2021).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9, 559 (2008).
R. Core Team. R: a language and environment for Statistical computing (2020).
Zhang, B. & Horvath, S. A general framework for weighted gene co-expression network analysis. Stat. Appl. Genet. Mol. Biol. 4, 17 (2005).
Ravasz, E., Somera, A. L., Mongru, D. A., Oltvai, Z. N. & Barabási, A. L. Hierarchical organization of modularity in metabolic networks. Science 297, 1551–1555 (2002).
Article CAS PubMed Google Scholar
Ge, S. X., Jung, D., Jung, D. & Yao, R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 36, 2628–2629 (2020).
Article CAS PubMed Google Scholar
Yanai, I. et al. Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics 21, 650–659 (2005).
Article CAS PubMed Google Scholar
Huggins, P. et al. DECOD: fast and accurate discriminative DNA motif finding. Bioinformatics 27, 2361–2367 (2011).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to thank Dr. Sara Sgubin who was of great help in the collection of the flower samples and in the dissection of the whorls and flower tissues on which the analyses were conducted and Dr. Luca Meloni who has kindly contributed to the drafting of Fig. 1 and Fig. 6. The authors are also grateful to Prof. Sara Zenoni, which provided transcriptional data from the V. vinifera cv Corvina atlas.

Funding

This study was partially supported by the research project VANN_BIRD2020_01 financed by the University of Padova.

Author information

These authors contributed equally: Alessandro Vannozzi, Fabio Palumbo

Authors and Affiliations

Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padova, Campus of Agripolis, V. le dell’Università 16, 35020, Legnaro, Padova, Italy
Alessandro Vannozzi, Fabio Palumbo, Gabriele Magon, Margherita Lucchin & Gianni Barcaccia

Authors

Alessandro Vannozzi
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Palumbo
View author publications
You can also search for this author in PubMed Google Scholar
Gabriele Magon
View author publications
You can also search for this author in PubMed Google Scholar
Margherita Lucchin
View author publications
You can also search for this author in PubMed Google Scholar
Gianni Barcaccia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.B. and M.L. designed the research. A.V., F.P., and G.M. conducted and controlled the experiments and analyzed the data. A.V. and F.P. carried out the bioinformatics analyses. G.M. performed G.O. and CRE analyses. F.P. and A.V. wrote the manuscript. All authors contributed to editing the manuscript.

Corresponding author

Correspondence to Gianni Barcaccia.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Supplementary information

Supplementary Figure 1

Supplementary Figure 2

Supplementary Figure 4

Supplementary Figure 3

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Table 4

Supplementary Table 5

Supplementary Table 6

Supplementary Table 7

Supplementary Table 8

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vannozzi, A., Palumbo, F., Magon, G. et al. The grapevine (Vitis vinifera L.) floral transcriptome in Pinot noir variety: identification of tissue-related gene networks and whorl-specific markers in pre- and post-anthesis phases. Hortic Res 8, 200 (2021). https://doi.org/10.1038/s41438-021-00635-7

Download citation

Received: 12 March 2021
Revised: 13 July 2021
Accepted: 17 July 2021
Published: 01 September 2021
DOI: https://doi.org/10.1038/s41438-021-00635-7