Small RNA profiling in Pinus pinaster reveals the transcriptome of developing seeds and highlights differences between zygotic and somatic embryos

Rodrigues, Andreia S.; Chaves, Inês; Costa, Bruno Vasques; Lin, Yao-Cheng; Lopes, Susana; Milhinhos, Ana; Van de Peer, Yves; Miguel, Célia M.

doi:10.1038/s41598-019-47789-y

Download PDF

Article
Open access
Published: 05 August 2019

Small RNA profiling in Pinus pinaster reveals the transcriptome of developing seeds and highlights differences between zygotic and somatic embryos

Andreia S. Rodrigues^1,2,
Inês Chaves^1,2,
Bruno Vasques Costa^1,2,3,
Yao-Cheng Lin ORCID: orcid.org/0000-0002-9390-795X^4,5,
Susana Lopes^1,2,
Ana Milhinhos^1,2,
Yves Van de Peer^5,6,7 &
…
Célia M. Miguel ORCID: orcid.org/0000-0002-1427-952X^1,2,8

Scientific Reports volume 9, Article number: 11327 (2019) Cite this article

2877 Accesses
24 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Regulation of seed development by small non-coding RNAs (sRNAs) is an important mechanism controlling a crucial phase of the life cycle of seed plants. In this work, sRNAs from seed tissues (zygotic embryos and megagametophytes) and from somatic embryos of Pinus pinaster were analysed to identify putative regulators of seed/embryo development in conifers. In total, sixteen sRNA libraries covering several developmental stages were sequenced. We show that embryos and megagametophytes express a large population of 21-nt sRNAs and that substantial amounts of 24-nt sRNAs were also detected, especially in somatic embryos. A total of 215 conserved miRNAs, one third of which are conifer-specific, and 212 high-confidence novel miRNAs were annotated. MIR159, MIR171 and MIR394 families were found in embryos, but were greatly reduced in megagametophytes. Other families, like MIR397 and MIR408, predominated in somatic embryos and megagametophytes, suggesting their expression in somatic embryos is associated with in vitro conditions. Analysis of the predicted miRNA targets suggests that miRNA functions are relevant in several processes including transporter activity at the cotyledon-forming stage, and sulfur metabolism across several developmental stages. An important resource for studying conifer embryogenesis is made available here, which may also provide insightful clues for improving clonal propagation via somatic embryogenesis.

Spatial co-transcriptomics reveals discrete stages of the arbuscular mycorrhizal symbiosis

Article Open access 08 April 2024

Karen Serrano, Margaret Bezrutczyk, … Benjamin Cole

The complex polyploid genome architecture of sugarcane

Article Open access 27 March 2024

A. L. Healey, O. Garsmeur, … A. D’Hont

Single-cell and spatial RNA sequencing reveal the spatiotemporal trajectories of fruit senescence

Article Open access 10 April 2024

Xin Li, Bairu Li, … Robert Henry

Introduction

Zygotic embryogenesis is a crucial phase of the life cycle of seed plants, growth axes are established and accumulation of reserves takes place for later use during germination¹. The study of zygotic embryogenesis has been challenging due to the small size of the embryo, particularly during early embryogenesis, and due its location deep within the seed tissues².

Somatic embryogenesis, whereby a whole plant is derived from somatic cells that become competent and enter into the embryogenic pathway in vitro, is often the model system of choice to study the molecular regulation of plant embryogenesis². Simultaneously, it has been used as a clonal propagation technology in several conifer species where somatic embryogenesis is induced mainly from immature and mature zygotic embryo (ZE) explants³. The study of the molecular regulation of zygotic embryogenesis is therefore very important in providing fundamental knowledge to improve the success of somatic embryogenesis protocols which are still largely empirical⁴.

Small non-coding RNAs (sRNAs) are important regulators of the expression of genes involved in plant development and in responses to both abiotic and biotic stresses⁵. The two main classes of sRNAs described in plants are microRNAs (miRNAs), derived from single stranded hairpin RNA precursors, and short interfering RNAs (siRNAs), derived from double stranded RNA precursors. Both classes of sRNAs are processed by DICER-LIKE (DCL) proteins. Plant miRNAs are typically 20–22-nt and act on targets through post-transcriptional gene silencing (PTGS), either by transcript cleavage or by translational repression. siRNAs may play their role by PTGS or transcriptional gene silencing (TGS) through alterations in DNA methylation. siRNAs are divided in different subclasses, the most common of which are the 21–22-nt secondary siRNAs, involved in PTGS and TGS, and the 24-nt heterochromatic siRNAs (het-siRNAs), involved in TGS⁶.

A survey of sRNA sequence and size profiles in 3 green algae and 31 vascular plant species revealed variability among plant species⁷. Overall, the 21-nt and 24-nt sequences prevail in the sRNA transcriptomes of angiosperms⁷, and during the seed development phase, diverse sRNAs size profiles seem to be present⁸. In the gymnosperm clade, including the conifers, contradictory results relating to the expression of 24-nt sRNAs have been reported^{7,9,10,11,12,13,14,15,16}. The differences observed, as well as the distinct features of zygotic embryogenesis in angiosperms versus gymnosperms, show that Arabidopsis thaliana is not the best model for studies on somatic embryogenesis in non-angiosperm species.

The miRNAs are by far the better characterized class of sRNAs. In v.22 of miRBase¹⁷, a total of 8485 plant miRNAs have been deposited, however only 669 have been identified in conifer species. Compared with angiosperms, there is little information on sRNAs available for conifers. Nonetheless, many miRNAs appear to be specific for conifers.

Reports on high-throughput small RNA sequencing (RNA-Seq) in conifers have covered pools of different tissues, including seeds from Cunninghamia lanceolata¹⁴, seeds from Picea glauca^18,19, needles from Pinus contorta¹¹, over 20 different tissues from Picea abies, including male and female cones^7,13,20, male and female cones from Pinus tabuliformis¹², and different pooled tissues from Larix leptolepis, including somatic embryos (SEs), or seedling tissues^15,16.

The current knowledge on sRNAs expression and function in seed development has been recently reviewed⁸, pointing to a very limited number of miRNAs whose function during this phase of the plant life cycle has been experimentally characterized (namely miR156, miR159, miR160, miR164, miR165/166, miR172, miR394, and miR397).

Indirect evidence of the relevance of sRNAs throughout Pinus pinaster (maritime pine) embryogenesis has been highlighted in a genome-wide transcriptomic analysis of consecutive stages of embryo development²¹. This study revealed that sRNA-associated functions such as siRNA and miRNA binding, and gene silencing by miRNA, are differentially regulated across P. pinaster zygotic embryogenesis. Also, miRNA functions appeared more represented in mid to late embryogenesis²¹. In addition to conserved miRNAs that are expected to be involved in conifer and angiosperm embryogenesis, we hypothesize that the function of conifer-specific sRNAs may underlie some of the characteristic differences between the developing embryos of conifers versus those of angiosperm species. In this work, we provide an overview of the sRNA transcriptome of P. pinaster, an important conifer species widely spread throughout the Mediterranean region where it has a significant economic relevance. A set of miRNAs, including conserved and newly discovered miRNAs, have been identified, some of which show evident differences in their expression throughout embryo development. This dataset represents an important resource for the future characterization of the functional roles of sRNAs in conifer embryogenesis and advances the current knowledge of the molecular regulation of this process. Finally these results provide clues for how to improve clonal propagation via somatic embryogenesis.

Results

Small RNA-Seq data

In order to obtain a comprehensive, dynamic sRNA transcriptome of the P. pinaster seed throughout embryo development, sRNA libraries of ZEs at five stages of development from early to late embryogenesis (ZE0, ZE3, ZE4B, ZE5, ZE7), and corresponding megagametophytes (MGs) at three stages of development (MG0, MG4B, MG7), were sequenced using Illumina technology. In addition, sRNA libraries from SEs were also sequenced to characterize putative differences between zygotic and somatic embryogenesis during mid and late embryogenesis (see Fig. 1). Small RNA-Seq data was obtained from 16 libraries (Supplementary Table S1A–C), yielding between 13,295,829 and 33,047,491 raw reads per library and almost 330 M raw reads in total. After filtering (to remove low complexity and t/rRNA sequences) about 110 M reads of 18-nt to 26-nt and an absolute abundance of > = 5 were retained. These sequences represent 19–41% of the initial raw reads in each sRNA library. From these, approximately 76 M had a perfect match to the genome of Pinus taeda, which corresponds to about 808 K unique (distinct) reads. The percentages of filtered reads and of reads aligned to the genome were consistent between biological replicates.

The size profiles of reads expressed per biological sample were analysed (Fig. 2) focusing on 21-nt and 24-nt reads. After filtering and mapping, the 21-nt reads are clearly more highly expressed in all samples, but the 24-nt reads are also clearly present and are more represented in the later stages of embryogenesis (from embryo stages ZE4B and SE4B onwards).

Annotation of the miRNA transcriptome of P. pinaster embryogenesis

Genome-aligned reads were initially mapped against mature miRNAs deposited in miRBase v.21 to annotate conserved miRNAs. Then, both the annotated conserved miRNAs and the remaining unidentified sRNAs were further analysed to: (1) predict putative novel miRNAs and (2) identify miRNA precursor and miRNA-star (sequence complementary to the mature miRNA in the single stranded hairpin RNA precursor) sequences for annotated miRNAs. Furthermore, miRNA-star sequences were also annotated. Identified conserved, novel and star miRNAs can be found in Supplementary Table S2A–C.

A total of 215 conserved miRNAs were annotated, corresponding to 40 conserved miRNA families (MIRs), each containing between 1 and 28 isoforms (Fig. 3). The conserved miRNA families with the highest number of isoforms were MIR166 (28), MIR159 (22), MIR396 (20) and MIR319 (17). Many of the conserved miRNA families were present across all of the libraries, namely MIR1311, MIR1312, MIR156, MIR162, MIR166, MIR167, MIR168, MIR172, MIR319, MIR3711, MIR390, MIR396, MIR482, MIR946, MIR947 and MIR951. A total of 8266 novel miRNAs were predicted of which 212 have a miRNA-star sequence and are thus considered as high-confidence P. pinaster novel miRNAs (see Supplementary Table S2A–C).

The first 5′-end nucleotide of the sRNA may indicate preferential association to a specific ARGONAUTE (AGO) protein²². In our study, a total of 4042 (46%) mature miRNAs begin with nucleotide A, 3713 (42%) with U, 530 (6%) with C and 472 (5%) with G. Conserved miRNAs were predominantly found with a 5′-end U, whereas the majority of novel miRNAs displayed a 5′-end A or a 5′-end U. This suggests that most P. pinaster miRNAs play their regulatory roles upon being selectively loaded by AGO1 (5′-end U miRNAs) or AGO2 (5′-end A miRNAs)²³.

A principal component analysis (PCA) of all libraries, using conserved and novel miRNAs, demonstrates that the three types of tissues can be distinguished based on their miRNA profiles (Fig. 4). Approximately 48% of the variance can be explained by the two principal components. Distinct developmental stage samples from the same tissue are highly correlated and therefore cluster together. It should be noted that hierarchical cluster analysis (HCL) showed consistency between biological replicates (Supplementary Fig. S1).

A comparison of the miRNA transcriptomes of ZEs, SEs and MGs at stages T4B and T7 showed a very similar number of total annotated miRNAs in stages T4B and T7 (Fig. 5) with 1071 miRNAs and 1098 miRNAs, respectively. Only miRNAs which have been annotated in both biological replicates were considered in this comparison, thereby excluding those unique to SEs. The number of conserved miRNAs annotated in each type of tissue, either at stage T4B or stage T7, is very similar and most conserved miRNAs are detected in all of the three tissues. On the contrary, many annotated novel miRNAs seem to be present in only one of the tissues, with 653–680 novel miRNAs annotated in ZEs, 437–452 in MGs and 199–249 in SEs. The total number of miRNAs simultaneously present in ZEs and MGs decreased from stage T4B to stage T7, while miRNAs detected in both SEs and MGs increased.

Overall expression patterns of conserved and novel miRNAs

An overall analysis of the expression patterns of conserved miRNA families across all samples demonstrates that, with the exception of MIR166 and MIR947, these families are expressed at low to moderate levels in all libraries (Fig. 6). Note however that large variations in the expression levels of different isoforms within the same miRNA family across development might be found (Supplementary Fig. S2).

Given that SE development closely resembles zygotic embryogenesis but it occurs in vitro without a surrounding MG, it was interesting to note that some miRNA families, such as MIR397 and MIR408, were moderately expressed in SEs and MGs but were greatly reduced in ZEs. Other families, such as MIR394, MIR159 and MIR171, exhibited moderate expression in embryos (ZEs and SEs) but were greatly reduced in MGs (Fig. 6). The majority of the novel miRNAs also exhibited low to moderate expression levels, and similarly to conserved miRNAs, there are several examples of tissue specific expression (Supplementary Table S2A–C).

In order to highlight those miRNAs that are highly expressed during embryo development, a cut-off of 100 counts per million (CPM) was applied to miRNAs expressed either by ZEs or MGs, at stages T4B and T7. A shortlist containing 20 conserved miRNAs and 21 novel miRNAs (Supplementary Table S2A–C) that passed this criterium, was generated. Within this list, four conserved miRNAs were simultaneously present in ZEs and SEs at stages T4B and T7 (Fig. 5), namely miR166-TCTCGGACCAGGCTTCATTCC, miR159-TTTGGTTTGAAGGGAGCTCT, miR159-TTTGGTTTGAAGGGAGCTCTA and miR159-TTGGATTGAAGGGAGCTCCA. The shortlist also includes miR166-TCGGACCAGGCTTCATTCC and miR319-CTTGGACTGAAGGGAGCTCCC, simultaneously present in MGs and SEs at stages T4B and T7 (Fig. 5). miR162-TCGATAAACCTCTGCATCCGG which is described as targeting DCL transcripts²⁴, is less expressed in SEs. Eleven of the novel miRNAs in this list are high-confidence, and their expression is generally lower in SEs than in ZEs or MGs.

Differentially expressed miRNAs in zygotic embryos and megagametophytes

A total of 565 differentially expressed (DE) miRNAs were detected among the 1710 miRNAs present across P. pinaster ZE development, including 36 conserved miRNA isoforms from 17 miRNA families, 504 novel miRNAs and 25 miRNA-stars. The DE miRNAs in ZE samples were grouped into six distinct clusters according to their expression profiles (Fig. 7; Supplementary Table S3A–F).

The expression profiles can be associated with a specific ZE stage(s): cluster-1 (15 miRNAs) presents two peaks of expression, at ZE0 and ZE5; cluster-2 (87 miRNAs) shows a single expression peak at ZE3; cluster-3 (54 miRNAs) has a peak at ZE4B; cluster-4 (181 miRNAs) a peak at ZE5; cluster-5 (24 miRNAs) has two peaks of expression, at ZE4B and ZE7; and in cluster-6 (204 miRNAs) the expression steadily increases up to ZE7. These results show that the majority of DE miRNAs throughout ZE development reach their peak of expression late in embryogenesis, at the cotyledonary (ZE5) and mature (ZE7) embryo stages.

Another 41 miRNAs were DE between MG4B and MG7 samples, where 25 are up-regulated at MG7 (cluster-1) and 16 miRNAs are up-regulated at MG4B (cluster-2), thereby grouping into two distinct expression profiles (Fig. 7; Supplementary Table S4A,B). Among the 41 DE miRNAs are four miRNA isoforms from four conserved miRNA families (MIR319, MIR396, MIR482 and MIR950) and 37 novel miRNAs. Similarly to the ZE samples, the largest number of DE miRNAs in MGs appear to reach their peak of expression late in embryogenesis.

Gene enrichment analysis of predicted miRNA targets

To link miRNA differential expression patterns with putative functions in ZE and MG development, their target transcripts were predicted against the P. pinaster transcriptome (Supplementary Table S5A–F and Supplementary Table S6A,B), and associated GO terms were retrieved for each expression profile cluster (Supplementary Table S7).

In the ZE samples, the enrichment analysis of the predicted targets retrieved a total of 167 overrepresented Gene Ontology (GO) terms, namely 92 Biological Process (BP), 55 Molecular Function (MF), and 20 Cellular Component (CC) terms. The targets of miRNAs in clusters 1 and 5 are mostly associated with sulfur compound metabolism and sulfur compound biosynthesis. In cluster 2, most miRNA target genes are related to aspartate family amino acid metabolism, including organic acid metabolism and carboxylic acid metabolism, both also found overrepresented in cluster 4 (Supplementary Table S7 and Supplementary Fig. S3). Interestingly, the majority of the targets of cluster 3 miRNAs are associated with transporter activity. An overrepresentation of targets related with branched-chain amino acid biosynthesis, cell communication and signaling is exclusive to miRNAs in cluster 4, while regulation of phosphate metabolism, biological regulation and phosphorus metabolism are overrepresented in the targets of miRNAs in clusters 4 and 6 (Supplementary Table S7, Supplementary Fig. S4 and Supplementary Fig. S5). It should be noted that the highest number of enriched BP terms were associated with the miRNA targets of cluster 6, including cellular metabolism, cellular process, protein phosphorylation, cellular localization, cellular carbohydrate metabolism or cell cycle (Supplementary Fig. S5).

The enrichment analysis of the predicted targets in the MG samples retrieved a total of 21 overrepresented GO terms (ten BP, nine MF, and two CC). The results show an overrepresentation of targets associated with the MF term protein binding in both clusters 1 and 2. Cluster 2 contains several overrepresented BP terms such as response to stimulus, response to organic substance, cell communication, signaling, nucleocytoplasmic transport and glucose metabolism (Supplementary Table S7 and Supplementary Fig. S6).

MiRNAs expression validation by RT-qPCR

Six conserved miRNAs that were highly and/or differentially expressed in ZE development, were selected for RT-qPCR validation (Fig. 8). The sequences of the TaqMan miRNA probes selected for RT-qPCR validation fully matched the miRNA sequences found in the DE analysis. The RT-qPCR results showed a good agreement with expression profiles from small RNA-Seq data. All validated miRNAs, with the exception of miR319, had higher expression in ZEs than in MGs.

Discussion

We have surveyed the sRNA transcriptome of the developing ZE, SE and MG of P. pinaster. Only four conifer species have their miRNAs deposited in miRBase, the reference miRNA database, namely Cunninghamia lanceolata, Picea abies, Pinus densata and Pinus taeda (http://www.mirbase.org/cgi-bin/browse.pl, accessed on 08-June-2018). For Picea abies, the first conifer with a publicly available reference genome¹³, 594 miRNA precursors and 600 mature miRNAs are deposited, including many distinct isoforms from the same miRNAs family. Here, we have identified and annotated a total of 215 conserved miRNAs and 212 high-confidence putative novel miRNAs in embryos and in the MG of P. pinaster, which represent an important resource for the study of embryogenesis in conifer species and significantly enriches the current repertoire of published and annotated plant miRNAs.

To our knowledge, this is the most complete report disclosing the sRNA transcriptome of seed tissues of a conifer species across a developmental time course (obtained through next generation sequencing). Both 21-nt and 24-nt sRNAs increase towards the later stages of development, peaking at embryo maturation (ZE7). The considerable and diverse fraction of 24-nt sequences expressed by the analysed P. pinaster tissues consists of many unique sequences expressed at low levels. Therefore, the 21-nt sRNAs become the predominant fraction in the sRNA population as a result of the strong reduction of the 24-nt sRNAs after filtering out sequences with an absolute abundance lower than five. It should also be noted that the 24-nt reads are more abundant in ZEs and SEs than they are in MGs. Since the work published by Dolgosheina et al.⁹, the ability of conifers to express significant numbers of 24-nt sRNAs has been a topic of debate. The major plant sRNA classes which include 24-nt sequences, are the hairpin-derived siRNAs (hp-siRNAs), the natural antisense siRNAs (natsiRNAs), and the het-siRNAs⁶. Het-siRNAs are considered the most abundant siRNAs expressed by plants and, unlike hp-siRNAs and natsiRNAs, are solely composed of 24-nt sequences and associated with TGS via RNA-directed DNA methylation (RdDM). Het-siRNAs seem to be particularly important during plant meiosis, gametogenesis and embryogenesis, playing a role in the silencing of transposons and repetitive sequences in order to maintain genome integrity⁶.

In Larix leptolepis, a significant fraction of non-redundant 24-nt reads was also detected in somatic embryogenesis tissues, whereas non-redundant 21-nt sRNAs predominated in seedlings^15,16,25. Similarly to our results, the 24-nt sRNAs exhibited lower redundancy, i.e. many diverse 24-nt sRNAs with low individual expression levels¹⁶. In Cunninghamia lanceolata, a single sRNA library prepared from seeds, calli, seedlings, adult leaves and stems, exhibited a higher abundance of the 24-nt sRNAs in both unique and redundant sRNAs size profiles¹⁴. In Picea glauca, 24-nt sRNAs were amongst the most abundant found at seed set¹⁹, although a decrease in expression of 24-nt sRNAs was observed as seed set progressed into maturation¹⁸. Moreover, there were more unique 24-nt sRNAs than 21-nt sRNAs, which was particularly evident at early stages of seed set¹⁸. Significant levels of 24-nt sRNAs were also found in reproductive tissues of Picea abies, in particular in male cones, which were found to be largely associated with repeats¹³. In Pinus tabuliformis immature male and female cones, both total and unique sRNA length distributions peaked at 21-nt, which together with the expression of sRNA biogenesis associated genes, pointed to the predominance of the miRNA pathway in these tissues¹².

While in P. pinaster embryos the abundance of total 24-nt sRNAs increases towards maturation (ZE7 and SE7), being even higher in SEs than in ZEs, the opposite was observed during seed development of angiosperms such as canola, barley, rice and wheat⁸. Interestingly, a previous study reported overrepresentation of putative AGO9 transcripts in the coding transcriptome of late P. pinaster embryos²¹. AGO9 belongs to a phylogenetic clade (AGO4/6/8/9) known to preferentially associate with 24-nt siRNAs and acting to silence transposons and repetitive sequences at the transcriptional level²². In Picea abies, a correlation was established between a much higher level of 24-nt siRNAs and increased CHH DNA methylation in somatic embryogenesis culture cells when compared with needles, which showed mostly 21-nt siRNAs²⁶. Although no functional evidence is available, a putative role in epigenetic silencing of transposons and repetitive elements seems likely, which would be in agreement with the more frequent epigenetic/genetic instability often associated with in vitro culture²⁷. However, it should be noted that although genetic and epigenetic instability has been detected in conifer embryogenic cultures, especially when these are maintained in vitro during long periods of time^26,28,29, this risk has been considered limited when compared to angiosperm species^30,31. This further suggests that 24-nt sRNAs might have a particularly relevant role in maintaining such stability in conifers.

It is now clear that conifer species are able to express considerable levels of 24-nt sRNAs but it remains to be clarified whether their expression dynamics are directly associated with the species, the stage of the life cycle and/or the type of tissue. The case for conifers might be similar to that of the Selaginella lineage, in which the het-siRNAs pathway was found to be active only in specific tissues³².

We have annotated a high number of novel and conserved miRNAs from P. pinaster embryos and MGs. Many miRNA families detected here are conserved across land plants, such as MIR156, MIR159, MIR160, MIR162, MIR166, MIR167, MIR168, MIR169, MIR171, MIR172, MIR319, MIR390, MIR394, MIR396, MIR397, MIR529 and MIR535. Other families such as MIR482, which was found to be poorly enriched in monocots, or MIR395, MIR399 and MIR408, which are enriched in angiosperms⁷, are also present in P. pinaster embryos and MGs. Thirteen miRNA families are conifer-specific (MIR946, MIR947, MIR950, MIR951, MIR1311, MIR1312, MIR1313, MIR1314, MIR1315, MIR1316, MIR3699, MIR3701 and MIR3711).

It has been documented that the more conserved the miRNA sequence is, the more abundant it is⁷. In this work, miR166-TCGGACCAGGCTTCATTCCCC with 21-nt is the most abundant sequence across all libraries but there are also several highly abundant putative novel miRNAs such as novel-TCCAACGAAGATCAGAAGGCTT with 22-nt. However, in general, the average expression of conserved miRNAs is higher than that of novel miRNAs.

The analysis of DE conserved and novel miRNAs suggests that the biological functions of miRNAs are particularly important in late P. pinaster embryogenesis. This supports the importance of posttranscriptional regulation by miRNAs during seed maturation. Furthermore, it is in close agreement with previous work highlighting that sRNA associated processes are differentially expressed during P. pinaster zygotic embryogenesis and provides further evidence of miRNA functions in mid to late embryogenesis²¹.

The function of a few conserved miRNAs annotated here had already been experimentally validated in seed development⁸. MIR159, MIR171 and MIR394 were found to be present in both ZEs and SEs, but greatly reduced in MGs. MIR159 is one of the largest miRNA families present in our data. Its increasing expression across embryogenesis, with several DE miR159 isoforms, points to an important function during late embryogenesis (ZE5 and ZE7). MiR159 has been identified as having a possible involvement in the plastid fatty acid biosynthesis pathway during seed maturation in Brassica napus and, a negative correlation between the expression of miR159 and its predicted target KASII (also known as FAB1) was confirmed³³. Also in P. pinaster embryos, we found a homolog of KASII (sp_v3.0_unigene24638), encoding “3-ketoacyl-ACP synthase II”, as a predicted target of miR159. A P. pinaster homolog of MYB101, sp_v3.0_unigene11630, was also predicted as target of the DE isoforms of miR159, expression of which increase towards embryo maturation. These results are consistent with previous work reporting that Arabidopsis miR159 cleaves MYB33 and MYB101 transcripts during seed germination, contributing to block ABA signaling associated with seed dormancy³⁴. In the conifer Larix leptolepsis LaMYB33 was confirmed as target of miR159 in SEs. A negative correlation was observed between expression of LaMYB33 and the expression of miR159 during the late stages of SE maturation³⁵.

Both MIR159 and MIR171 have been pointed out as potential markers of embryogenecity after somatic embryogenesis induction in conifers⁴. The GRAS-family transcription factors SCARECROW-LIKE (SCL) are well known miR171 targets, although functional diversification was already described for some members of the MIR171 that are predicted to target non-SCL6 genes^36,37. MIR171 is a promising candidate to understand initial embryo-specific molecular processes and their relevance in zygotic versus somatic embryogenesis since, like MIR159, it is also exclusively expressed by ZEs and SEs of P. pinaster. In Citrus sinensis, miR171 could be detected in embryogenic calli but not in non-embryogenic calli. Based on its SCL targets, it was suggested that miR171 inactivates postembryonic growth to maintain normal somatic embryogenesis³⁸. Amongst the predicted targets of the DE miR171 isoform in P. pinaster are the C-terminus sequence of a putative SCL and several genes encoding Mitogen activated protein kinase (MAP kinase, MPK).

MIR394, detected only during the late stages of P. pinaster embryo development (ZE5, ZE7 and SE7), has been detected as early as the 16-cell stage and up to the torpedo stage in wild-type Arabidopsis embryogenesis³⁹. In Arabidopsis, miR394 repression of the LEAF CURLING RESPONSIVENESS (LCR) transcript was shown to be necessary to maintain the stem cell competence of the shoot apical meristem³⁹. Finally, in Brassica napus the repression of LCR by miR394 was shown to play a role in seed development, where it impacts the seed content of storage oil, protein and glucosinolates⁴⁰.

Due in part to the difficulties of isolating ZEs, especially at very early stages of development, somatic embryogenesis has been used as a model system for studying ZE development. However, somatic embryogenesis in conifers is usually artificially induced in the presence of high levels of auxin, which can lead to altered expression of many genes. From this work it is clear that SEs have different sRNA profiles when compared to their zygotic counterparts, although they express roughly the same repertoire of conserved and novel miRNAs. The miR482/miR2118 superfamily, found to be up-regulated in P. pinaster SEs, has been recently characterized in Picea abies. It was suggested that the miR482/miR2118 superfamily has dual functions in gymnosperms, regulating not only the production of phased secondary siRNAs from NB-LRR genes, but also triggering siRNA production in reproductive tissues⁴¹. Interestingly, these functions have been divergently retained in eudicots and monocots^42,43,44,45. We do not know what roles this miRNA family is playing in SEs, but it may be involved in triggering siRNA production, which would be consistent with the higher amounts of 24-nt sRNAs observed for SEs when comparing to ZEs. On the other hand, the reported involvement of NB-LRR genes in hormonal responses to environmental stress⁴⁶ would be in agreement with the stress inducing conditions used in vitro for SEs.

Other miRNA families present in SEs and MGs but greatly reduced in ZEs, are MIR397 and MIR408. Several studies on miR397 have reported roles in the abiotic stress response⁴⁷ and MIR408 plays important roles during vegetative development in Arabidopsis^48,49,50. The function of MIR408 in reproductive development remains unclear. MIR408 targets transcripts encoding copper-containing proteins and sucrose was shown to be an important regulator of such proteins through miR408 and miR398, which are induced by SPL7 in response to high sucrose^51,52. Overexpression of MIR408 in Arabidopsis also altered various morphological traits including flower size and silique length, resulting in enhanced biomass and seed yield⁵³. In maize endosperm the expression of miR408 was reported to be significantly altered following treatment with sucrose⁵⁴.

Development of SEs can occur under high concentrations of sucrose in several conifer species, including P. pinaster. Moreover, the artificial conditions that are imposed can be considered stressful for the plant cells. In this context, and according to previous reports, it seems likely that the higher levels found for miR397 and miR408 in SEs are related, at least partially, to such conditions. The complete absence of a MG tissue in somatic embryogenesis is also a distinctive characteristic relative to zygotic embryogenesis, and we may speculate that some of the overrepresented functions in SEs are balancing the lack of a MG tissue. MIR3711 is one of the few miRNA families up-regulated only in MGs. This family of miRNAs has been also detected in Picea abies SEs but the target genes and putative functions are unknown⁵⁵.

Regarding the putative novel miRNAs identified here, there are several interesting candidates for further characterization, for instance novel-TGAGATTGTTGGAGAGGTTCA and novel-AATGGGTTGACTGGAAAGACC (Supplementary Table S2A–C) expressed by both ZEs and SEs, but expressed lowly or entirely absent in MGs. Although these two examples refer to high-confidence novel miRNAs, the high number of predicted targets for most of them makes it difficult to hypothesize about their roles during embryogenesis, and functional studies are required.

Overall, from the cluster analysis of DE miRNAs, each stage of development seems to be associated with regulation by a characteristic population of miRNAs, which peak at defined time points. Gene enrichment analysis of the predicted targets of DE miRNAs, both conserved and novel, revealed enriched GO terms associated to different expression profiles. For instance, in stage T4B, transporter activity is highlighted, possibly in relation to the appearance of the cotyledons which requires a very active transport of auxin and nutrients to the developing tissues. Sulfur metabolism also featured in relation to DE miRNAs with varying expression profiles (clusters 1 and 5). Developing seeds are an important sink for oxidized or reduced sulfur, and the relevance of glutathione (transport metabolite for reduced sulfur) metabolism during embryogenesis has been highlighted in previous transcriptomic analysis of P. pinaster embryogenesis²¹ and functional studies in other species⁵⁶. In addition to its importance in situations of high protein turnover, a link between sulfur metabolism and the synthesis and steady levels of ABA, a major regulator of embryo maturation, has been described in Arabidopsis⁵⁷. Thus, it is not surprising to find enrichment of sulfur metabolism processes in our data, given the requirement of ABA across embryo development. In fact, ABA is an essential component of protocols for promoting SE development in conifers⁵⁸.

Material and Methods

Plant material

All biological samples have been derived from open-pollinated P. pinaster Ait. trees of clone 49, which is part of the Portuguese breeding program population⁵⁹, located in a clonal orchard at Escaroupim National Forest, Portugal (longitude 8°44’W, latitude 39°4’N). The biological samples included zygotic embryos (ZEs), megagametophytes (MGs) and somatic embryos (SEs) (Fig. 1 and Supplementary Table S8). Collection of cones for ZE isolation occurred between mid June and end of July 2012 to 2014. ZEs were isolated as previously reported²¹, and categorized according to the staging system described in Gonçalves et al.⁶⁰. Five different groups of developing embryos were considered, as follows: ZE0 included the early embryo stages T0, T1 and T2; ZE3 included the pre-cotyledonary embryo stages T3 and T4; ZE4B included the early cotyledonary embryo stage T4B; ZE5 included the cotyledonary embryo stage T5; and ZE7 included the mature embryo stage T7. Several biological replicates containing between 20–60 embryos were prepared for each of the five groups. The MGs surrounding ZE0 (MG0), ZE4B (MG4B) and ZE7 (MG7) embryos were also isolated and collected in pools of 10 MGs per biological replicate. The SE biological samples were isolated in 2014 from the embryogenic line 49/34/11 (derived from immature embryo tissues of clone 49) submitted to a previously described SE maturation protocol⁶¹, upon morphological evaluation based on the staging system for ZE⁶⁰. Sixteen early cotyledonary SEs, equivalent to zygotic counterparts ZE4B, were pooled together in SE4B biological sample. Seven late SEs equivalent to ZE7, with a minimum of four cotyledons, were pooled together in SE7 biological sample. All samples were immediately frozen in liquid nitrogen upon collection and stored at −70 °C.

Equipment and settings

Stereomicroscope observations were performed with a Nikon SMZ800 and images were captured using an Olympus SC30 camera and software.

RNA isolation and small RNA-Seq

Total RNA samples were extracted using the “Plant/Fungi Total RNA Purification Kit” (NORGEN BIOTEK CORP.), according to manufacturer instructions and with minor modifications: (1) between 600–1000 uL of Lysis Buffer C were added to the biological sample, depending on its complexity; (2) after incubation at 55 °C, the lysate was vortexed twice for 3 min at maximum speed and the clear lysate transferred into a new Eppendorf; (3) total RNA samples were eluted using water. DNA contamination was eliminated using DNase TURBO (Ambion). Cleaned RNA samples were quantified using the QuBit® 3.0 fluorometer and sent to the sequencing service provider who prepared and sequenced the sRNA libraries using Illumina technology.

Two biological replicates were sequenced per tissue/stage, with exception of ZE0, MG0, and SE, for which one biological sample was sequenced.

MiRNAs identification and annotation

The raw sequencing data files were automatically processed by the sRNA analysis pipeline miRPursuit⁶², with the user-defined criteria described as follows: after a standard pre-processing step, reads were firstly filtered excluding those outside the 18–26 nt range and with an absolute abundance lower than 5; secondly, only reads perfectly mapping (0 mismatches) in the genome of Pinus taeda v1.01-masktrim⁶³, defined as the Genome⁺DB, were considered to be candidate reads in following steps. Conserved miRNAs were identified/annotated by comparison with mature miRNAs deposited in miRBase v.21, allowing up to two mismatches¹⁷. Non-conserved reads and conserved miRNAs were processed in order to identify (1) putative novel miRNAs and (2) precursor sequences for all annotated miRNAs, including miRNA-star sequences whenever present in the sRNA library.

MiRNAs expression analysis

The expression values of annotated miRNAs were normalized against the total number of reads in each library and multiplied by a factor of 10⁶ (CPM or Counts Per Million).

A Principal Component Analysis (PCA) was performed using the “prcomp” R function with the annotated conserved and novel miRNAs and their expression levels after ln(CPM) transformation. The principal components were calculated through correlation matrix, which uses normalized data.

Heatmaps of the conserved miRNA families were built using pheatmap in R after ln(CPM) transformation.

Differentially expressed (DE) miRNAs were determined in ZE samples with an A-NOVA statistical analysis (p-value = 0.05 without alpha correction; assuming ZE0 as an out-group) and in MGs samples with a t-test statistical analysis (p-value = 0.05 without alpha correction, assuming MG0 as an out-group).

MiRNA target prediction

The targets of the DE miRNAs were predicted using the freely available online tool psRNAtarget^64,65, against the reference transcriptome P. pinaster⁶⁶, and the predicted targets with expectation 3-to-5 were selected for further analysis.

The putative target transcripts of DE miRNAs were subject to a gene enrichment analysis performed with BiNGO plugin from Cytoscape^67,68, and with the following settings: the Hypergeometric Test as statistical test; the Benjamini & Hochberg’s FDR correction as multiple testing correction; a significance level of 0.05; and whole annotation as the reference set. The Gene Ontology (GO) terms were further summarized by REViGO to remove the redundant ones, using the default options⁶⁹.

Validation by RT-qPCR

Six conserved miRNAs were selected for expression profile validation based on their high expression levels in ZE samples and on the availability of specific ‘ready to order’ TaqMan miRNA Assays (Applied Biosystems®), namely miR159 (TTGGATTGAAGGGAGCTCCA; Assay ID: 008363_mat), miR162 (TCGATAAACCTCTGCATCCAG; Assay ID: 000342), miR167 (TGAAGCTGCCAGCATGATCTG; Assay ID: 003037_mat), miR171 (TTGAGCCGCGCCAATATCACT; Assay ID: 005375_mat), miR319 (TTGGACTGAAGGGAGCTCCC; Assay ID: 000361), and miR390 (AAGCTCAGGAGGGATAGCGCC; Assay ID: 001409). An absolute quantification of miRNA expression was performed. To this end, oligonucleotides identical to the six selected miRNAs were ordered (Biomers.net), and used to prepare standard curves for each of the selected miRNAs. The cDNA synthesis was performed from 10 ng of DNase-treated total RNA using the TaqMan® MicroRNA Reverse Transcription Kit (Applied Biosystems®) and the miRNA-specific RT primer provided with the TaqMan® MicroRNA Assay (Applied Biosystems®), according to the manufacturer’s instructions. All qPCR experiments were performed in a LightCycler 480 (Roche Diagnostics) with 96-well white plates (Roche Diagnostics). Each 20 uL qPCR reaction mixture included 1X TaqMan® Universal PCR Master Mix II, No UNG (Applied Biosystems®), 1X TaqMan® MicroRNA Assay (Applied Biosystems®) and the cDNA, prepared according to manufacturer’s instructions. Three biological replicates, each with three technical replicates, were used for miRNA expression quantification in ZE and MG samples. For comparative analysis, the small RNA-Seq mean counts of each miRNA in the different biological replicates, and the mean RT-qPCR quantification value were considered.

Data Availability

The data were deposited in the European Nucleotide Archive (ENA) under the study PRJEB27796, with the run accessions ERS2608100 to ERS2608104 (MG), ERS2608105 to ERS2608113 (ZE), ERS2616519 to ERS2616520 (SE) [http://www.ebi.ac.uk/ena/data/view/PRJEB27796].

References

Goldberg, R. B., de Paiva, G. & Yadegari, R. Plant embryogenesis: zygote to seed. Science 266, 605–614 (1994).
Article ADS CAS Google Scholar
Zimmerman, J. L. Somatic embryogenesis: a model for early development in higher plants. Plant Cell 5, 1411–1423 (1993).
Article Google Scholar
Guan, Y., Li, S.-G., Fan, X.-F. & Su, Z.-H. Application of somatic embryogenesis in woody plants. Front. Plant Sci. 7, 938 (2016).
PubMed PubMed Central Google Scholar
Miguel, C. M., Rupps, A., Raschke, J., Rodrigues, A. S. & Trontin, J. F. Impact of molecular studies on somatic embryogenesis development for implementation in conifer multi-varietal forestry In Vegetative Propagation of Forest trees (eds Park, Y. S., Bonga, J. M. & Moon, H. K.) 373–421 (National Institute of Forest Science, 2016).
Li, S., Castillo-González, C., Yu, B. & Zhang, X. The functions of plant small RNAs in development and in stress responses. Plant J. 90, 654–670 (2017).
Article CAS Google Scholar
Borges, F. & Martienssen, R. A. The expanding world of small RNAs in plants. Nat. Rev. Mol. Cell Biol. 16, 727–741 (2015).
Article CAS Google Scholar
Chávez Montes, R. A. et al. Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs. Nat. Commun. 5, 3722 (2014).
Article ADS Google Scholar
Rodrigues, A. S. & Miguel, C. M. The pivotal role of small non-coding RNAs in the regulation of seed development. Plant Cell Rep. 36, 653–667 (2017).
Article CAS Google Scholar
Dolgosheina, E. V. et al. Conifers have a unique small RNA silencing signature. RNA 14, 1508–1515 (2008).
Article CAS Google Scholar
Lee, E. K. et al. A functional phylogenomic view of the seed plants. PLoS Genet. 7, e1002411 (2011).
Article CAS Google Scholar
Morin, R. D. et al. Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa. Genome Res. 18, 571–584 (2008).
Article CAS Google Scholar
Niu, S.-H. et al. Identification and expression profiles of sRNAs and their biogenesis and action-related genes in male and female cones of Pinus tabuliformis. BMC Genomics 16, 693 (2015).
Article Google Scholar
Nystedt, B. et al. The Norway spruce genome sequence and conifer genome evolution. Nature 497, 579–584 (2013).
Article ADS CAS Google Scholar
Wan, L.-C. et al. Identification and characterization of small non-coding RNAs from Chinese fir by high throughput sequencing. BMC Plant Biol. 12, 146 (2012).
Article CAS Google Scholar
Zhang, J. et al. Deciphering small noncoding RNAs during the transition from dormant embryo to germinated embryo in larches (Larix leptolepis). PLoS ONE 8, e81452 (2013).
Article ADS Google Scholar
Zhang, J. et al. Dynamic expression of small RNA populations in larch (Larix leptolepis). Planta 237, 89–101 (2013).
Article CAS Google Scholar
Kozomara, A. & Griffiths-Jones, S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42, D68–73 (2014).
Article CAS Google Scholar
Liu, Y. & El-Kassaby, Y. A. Landscape of fluid sets of hairpin-derived 21-/24-nt-long small RNAs at seed set uncovers special epigenetic features in Picea glauca. Genome Biol. Evol. 9, 82–92 (2017).
CAS PubMed PubMed Central Google Scholar
Liu, Y. & El-Kassaby, Y. A. Global analysis of small RNA dynamics during seed development of Picea glauca and Arabidopsis thaliana populations reveals insights on their evolutionary trajectories. Front. Plant Sci. 8, 1719 (2017).
Article Google Scholar
Källman, T., Chen, J., Gyllenstrand, N. & Lagercrantz, U. A significant fraction of 21-nucleotide small RNA originates from phased degradation of resistance genes in several perennial species. Plant Physiol. 162, 741–754 (2013).
Article Google Scholar
de Vega-Bartol, J. J. et al. Transcriptomic analysis highlights epigenetic and transcriptional regulation during zygotic embryo development of Pinus pinaster. BMC Plant Biol. 13, 123 (2013).
Article Google Scholar
Fang, X. & Qi, Y. RNAi in plants: an argonaute-centered view. Plant Cell 28, 272–285 (2016).
Article CAS Google Scholar
Mi, S. et al. Sorting of small RNAs into Arabidopsis argonaute complexes is directed by the 5′ terminal nucleotide. Cell 133, 116–127 (2008).
Article CAS Google Scholar
Xie, Z., Kasschau, K. D. & Carrington, J. C. Negative feedback regulation of Dicer-like1 in Arabidopsis by microRNA-guided mRNA degradation. Curr. Biol. 13, 784–789 (2003).
Article CAS Google Scholar
Zhang, J. et al. Genome-wide identification of microRNAs in larch and stage-specific modulation of 11 conserved microRNAs and their targets during somatic embryogenesis. Planta 236, 647–657 (2012).
Article CAS Google Scholar
Ausin, I. et al. DNA methylome of the 20-gigabase Norway spruce genome. Proc. Natl. Acad. Sci. USA 113, E8106–E8113 (2016).
Article CAS Google Scholar
Miguel, C. & Marum, L. An epigenetic view of plant cells cultured in vitro: somaclonal variation and beyond. J. Exp. Bot. 62, 3713–3725 (2011).
Article CAS Google Scholar
Tremblay, L., Levasseur, C. & Tremblay, F. M. Frequency of somaclonal variation in plants of black spruce (Picea mariana, Pinaceae) and white spruce (P. glauca, Pinaceae) derived from somatic embryogenesis and identification of some factors involved in genetic instability. Am. J. Bot. 86, 1373–1381 (1999).
Article CAS Google Scholar
Marum, L., Rocheta, M., Maroco, J., Oliveira, M. M. & Miguel, C. Analysis of genetic stability at SSR loci during somatic embryogenesis in maritime pine (Pinus pinaster). Plant Cell Rep. 28, 673–682 (2009).
Article CAS Google Scholar
Egertsdotter, U. Plant physiological and genetical aspects of the somatic embryogenesis process in conifers. Scandinavian Journal of Forest Research 34, 360–369 (2018).
Article Google Scholar
Isabel, N. et al. Occurrence of somaclonal variation among somatic embryo-derived white spruces (Picea glauca, Pinaceae). Am. J. Bot. 83, 1121–1130 (1996).
Article Google Scholar
Axtell, M. J. Classification and comparison of small RNAs from plants. Annu. Rev. Plant Biol. 64, 137–159 (2013).
Article CAS Google Scholar
Wang, J. et al. Identification of microRNAs actively involved in fatty acid biosynthesis in developing Brassica napus seeds using high-throughput sequencing. Front. Plant Sci. 7, 1570 (2016).
PubMed PubMed Central Google Scholar
Reyes, J. L. & Chua, N.-H. ABA induction of miR159 controls transcript levels of two MYB factors during Arabidopsis seed germination. Plant J. 49, 592–606 (2007).
Article CAS Google Scholar
Li, W.-F. et al. Regulation of LaMYB33 by miR159 during maintenance of embryogenic potential and somatic embryo maturation in Larix kaempferi (Lamb.) Carr. Plant Cell Tiss. Organ Cult. 113, 131–136 (2013).
Article ADS CAS Google Scholar
Ariel, F. et al. Two direct targets of cytokinin signaling regulate symbiotic nodulation in Medicago truncatula. Plant Cell 24, 3838–3852 (2012).
Article CAS Google Scholar
Zhu, X. et al. Discovery of conservation and diversification of genes by phylogenetic analysis based on global genomes. Plant Genome 8 (2015).
Article Google Scholar
Wu, X.-M. et al. Genomewide analysis of small RNAs in nonembryogenic and embryogenic tissues of citrus: microRNA- and siRNA-mediated transcript cleavage involved in somatic embryogenesis. Plant Biotechnol. J. 13, 383–394 (2015).
Article Google Scholar
Knauer, S. et al. A protodermal miR394 signal defines a region of stem cell competence in the Arabidopsis shoot meristem. Dev. Cell 24, 125–132 (2013).
Article CAS Google Scholar
Song, J. B. et al. Altered fruit and seed development of transgenic rapeseed (Brassica napus) over-expressing microRNA394. PLoS ONE 10, e0125427 (2015).
Article Google Scholar
Xia, R., Xu, J., Arikit, S. & Meyers, B. C. Extensive families of miRNAs and PHAS loci in Norway spruce demonstrate the origins of complex phasiRNA networks in seed plants. Mol. Biol. Evol. 32, 2905–2918 (2015).
Article CAS Google Scholar
Fei, Q., Xia, R. & Meyers, B. C. Phased, secondary, small interfering RNAs in posttranscriptional regulatory networks. Plant Cell 25, 2400–2415 (2013).
Article CAS Google Scholar
Johnson, C. et al. Clusters and superclusters of phased small RNAs in the developing inflorescence of rice. Genome Res. 19, 1429–1440 (2009).
Article CAS Google Scholar
Song, X. et al. Roles of DCL4 and DCL3b in rice phased small RNA biogenesis. Plant J. 69, 462–474 (2012).
Article CAS Google Scholar
Jeong, D.-H. et al. Parallel analysis of RNA ends enhances global investigation of microRNAs and target RNAs of Brachypodium distachyon. Genome Biol. 14, R145 (2013).
Article Google Scholar
Sarazin, V. et al. Arabidopsis BNT1, an atypical TIR-NBS-LRR gene, acting as a regulator of the hormonal response to stress. Plant Sci. 239, 216–229 (2015).
Article CAS Google Scholar
Sunkar, R. & Zhu, J.-K. Novel and stress-regulated microRNAs and other small RNAs from Arabidopsis. Plant Cell 16, 2001–2019 (2004).
Article CAS Google Scholar
Zhang, H. et al. Genome-wide mapping of the HY5-mediated gene networks in Arabidopsis that involve both transcriptional and post-transcriptional regulation. Plant J. 65, 346–358 (2011).
Article CAS Google Scholar
Zhang, H. et al. MicroRNA408 is critical for the HY5-SPL7 gene network that mediates the coordinated response to light and copper. Plant Cell 26, 4933–4953 (2014).
Article CAS Google Scholar
Zhang, H. & Li, L. SQUAMOSA promoter binding protein-like7 regulated microRNA408 is required for vegetative development in Arabidopsis. Plant J. 74, 98–109 (2013).
Article CAS Google Scholar
Ren, L. & Tang, G. Identification of sucrose-responsive microRNAs reveals sucrose-regulated copper accumulations in an SPL7-dependent and independent manner in Arabidopsis thaliana. Plant Sci. 187, 59–68 (2012).
Article CAS Google Scholar
Yamasaki, H., Hayashi, M., Fukazawa, M., Kobayashi, Y. & Shikanai, T. SQUAMOSA Promoter Binding Protein-Like7 Is a Central Regulator for Copper Homeostasis in Arabidopsis. Plant Cell 21, 347–361 (2009).
Article CAS Google Scholar
Song, Z. et al. Constitutive expression of miR408 improves biomass and seed yield in Arabidopsis. Front. Plant Sci. 8, 2114 (2017).
Article Google Scholar
Huang, H. et al. Identification and characterization of microRNAs in maize endosperm response to exogenous sucrose using small RNA sequencing. Genomics 108, 216–223 (2016).
Article CAS Google Scholar
Yakovlev, I. A. & Fossdal, C. G. In silico analysis of small RNAs suggest roles for novel and conserved miRNAs in the formation of epigenetic memory in somatic embryos of Norway spruce. Front. Physiol. 8, 674 (2017).
Article Google Scholar
Stasolla, C. Glutathione redox regulation of in vitro embryogenesis. Plant Physiol. Biochem. 48, 319–327 (2010).
Article CAS Google Scholar
Cao, M.-J. et al. Sulfate availability affects ABA levels and germination response to ABA and salt stress in Arabidopsis thaliana. Plant J. 77, 604–615 (2014).
Article CAS Google Scholar
Montalbán, I. A., García-Mendiguren, O. & Moncaleán, P. Somatic embryogenesis in Pinus spp. Methods Mol. Biol. 1359, 405–415 (2016).
Article Google Scholar
Aguiar, A., Almeida, M. H. & Borralho, N. Genetic control of growth, wood density and stem characteristics of Pinus pinaster in Portugal. Silva Lusitana 11, 131–139 (2003).
Google Scholar
Gonçalves, S., Cairney, J., Maroco, J., Oliveira, M. M. & Miguel, C. Evaluation of control transcripts in real-time RT-PCR expression analysis during maritime pine embryogenesis. Planta 222, 556–563 (2005).
Article Google Scholar
Morel, A. et al. Cotyledonary somatic embryos of Pinus pinaster Ait. most closely resemble fresh, maturing cotyledonary zygotic embryos: biological, carbohydrate and proteomic analyses. Planta 240, 1075–1095 (2014).
Article CAS Google Scholar
Chaves, I., Costa, B. V., Rodrigues, A. S., Bohn, A. & Miguel, C. M. miRPursuit-a pipeline for automated analyses of small RNAs in model and nonmodel plants. FEBS Lett. 591, 2261–2268 (2017).
Article CAS Google Scholar
Wegrzyn, J. L. et al. Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation. Genetics 196, 891–909 (2014).
Article CAS Google Scholar
Dai, X., Zhuang, Z. & Zhao, P. X. psRNATarget: a plant small RNA target analysis server (2017 release). Nucleic Acids Res. 46, W49–W54 (2018).
Article CAS Google Scholar
Dai, X. & Zhao, P. X. psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 39, W155–9 (2011).
Article CAS Google Scholar
Canales, J. et al. De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology. Plant Biotechnol. J. 12, 286–299 (2014).
Article ADS CAS Google Scholar
Maere, S., Heymans, K. & Kuiper, M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21, 3448–3449 (2005).
Article CAS Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS Google Scholar
Supek, F., Bošnjak, M., Škunca, N. & Šmuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE 6, e21800 (2011).
Article ADS CAS Google Scholar

Download references

Acknowledgements

Isabel Carrasquinho and Alexandre Aguiar from INIAV are acknowledged for provision of plant material. James Yates from ITQB NOVA is acknowledged for English language revision of the manuscript. This work was supported through projects funded by (1) the European Commission Seventh Framework Programme (FP7, Grant Agreement N°289841-PROCOGEN), and (2) Fundação para a Ciência e a Tecnologia (FCT), through grants Biodata.pt (ID 22231) co-financed by FEDER and FCT/MEC (PIDDAC), GREEN-it (UID/Multi/04551/2013), IF/01168/2013, DL 57/2016/CP1351/CT0003 and the doctoral fellowships SFRH/BD/79779/2011 (to ASR) and PD/BD/114359/2016 (to SL).

Author information

Authors and Affiliations

iBET, Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2781-901, Oeiras, Portugal
Andreia S. Rodrigues, Inês Chaves, Bruno Vasques Costa, Susana Lopes, Ana Milhinhos & Célia M. Miguel
Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa (ITQB NOVA), Av. República, 2780-157, Oeiras, Portugal
Andreia S. Rodrigues, Inês Chaves, Bruno Vasques Costa, Susana Lopes, Ana Milhinhos & Célia M. Miguel
INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Rua Alves Redol 9, Lisboa, 1000-029, Portugal
Bruno Vasques Costa
Biotechnology Center in Southern Taiwan and Agricultural Biotechnology Research Center, Academia Sinica, Tainan, Taiwan
Yao-Cheng Lin
VIB-UGent Center for Plant Systems Biology, Ghent, Belgium
Yao-Cheng Lin & Yves Van de Peer
Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
Yves Van de Peer
Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
Yves Van de Peer
BioISI – Biosystems & Integrative Sciences Institute, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
Célia M. Miguel

Authors

Andreia S. Rodrigues
View author publications
You can also search for this author in PubMed Google Scholar
Inês Chaves
View author publications
You can also search for this author in PubMed Google Scholar
Bruno Vasques Costa
View author publications
You can also search for this author in PubMed Google Scholar
Yao-Cheng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Susana Lopes
View author publications
You can also search for this author in PubMed Google Scholar
Ana Milhinhos
View author publications
You can also search for this author in PubMed Google Scholar
Yves Van de Peer
View author publications
You can also search for this author in PubMed Google Scholar
Célia M. Miguel
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

This study was conceived and directed by C.M.M. A.S.R., S.L. and A.M. did the experimental work, including preparation of total RNA for sequencing and gene expression validation by RT-qPCR. I.C., B.V.C. and Y.-C.L. performed the bioinformatics analysis. A.S.R., I.C., B.V.C., Y.-C.L., Y.V.P. and C.M.M. participated in the analysis of results and their biological interpretation. A.S.R., I.C. and C.M.M. wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Célia M. Miguel.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures

Supplementary Tables (datasets)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rodrigues, A.S., Chaves, I., Costa, B.V. et al. Small RNA profiling in Pinus pinaster reveals the transcriptome of developing seeds and highlights differences between zygotic and somatic embryos. Sci Rep 9, 11327 (2019). https://doi.org/10.1038/s41598-019-47789-y

Download citation

Received: 03 October 2018
Accepted: 24 July 2019
Published: 05 August 2019
DOI: https://doi.org/10.1038/s41598-019-47789-y

This article is cited by

miRNome profiling reveals differential miRNAs associated with embryogenic potential in the somatic embryogenesis of Araucaria angustifolia
- Leandro Francisco de Oliveira
- Amanda Rusiska Piovezani
- Eny Iochevet Segal Floh
Plant Cell, Tissue and Organ Culture (PCTOC) (2023)
Spatiotemporal expression profile of novel and known small RNAs throughout rice plant development focussing on seed tissues
- Anikó Meijer
- Tim De Meyer
- Tina Kyndt
BMC Genomics (2022)
Integrated mRNA and miRNA Expression Analyses of Pinus massoniana Roots and Shoots in Long-Term Response to Phosphate Deficiency
- Fuhua Fan
- Xianwen Shang
- Jianhui Tan
Journal of Plant Growth Regulation (2022)
Functional characterization of the MiR171a promoter and endogenous target mimics identification in Lilium pumilum DC. Fisch. during somatic embryogenesis
- Hongyu Li
- Jing Wang
- Hongmei Sun
Plant Cell, Tissue and Organ Culture (PCTOC) (2021)
Identification of microRNAs and their target genes related to needle discoloration of evergreen tree Chinese cedar (Cryptomeria fortunei) in cold winters
- Yingting Zhang
- Junjie Yang
- Jin Xu
Planta (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.