An improved de novo assembling and polishing of Solea senegalensis transcriptome shed light on retinoic acid signalling in larvae

Córdoba-Caballero, José; Seoane, Pedro; Jabato, Fernando M.; Perkins, James R.; Manchado, Manuel; Claros, M. Gonzalo

doi:10.1038/s41598-020-77201-z

Download PDF

Article
Open access
Published: 26 November 2020

An improved de novo assembling and polishing of Solea senegalensis transcriptome shed light on retinoic acid signalling in larvae

José Córdoba-Caballero¹^na1,
Pedro Seoane^1,2^na1,
Fernando M. Jabato¹,
James R. Perkins^1,2,3,
Manuel Manchado⁴ &
…
M. Gonzalo Claros^1,2,3,5

Scientific Reports volume 10, Article number: 20654 (2020) Cite this article

1616 Accesses
5 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Senegalese sole is an economically important flatfish species in aquaculture and an attractive model to decipher the molecular mechanisms governing the severe transformations occurring during metamorphosis, where retinoic acid seems to play a key role in tissue remodeling. In this study, a robust sole transcriptome was envisaged by reducing the number of assembled libraries (27 out of 111 available), fine-tuning a new automated and reproducible set of workflows for de novo assembling based on several assemblers, and removing low confidence transcripts after mapping onto a sole female genome draft. From a total of 96 resulting assemblies, two “raw” transcriptomes, one containing only Illumina reads and another with Illumina and GS-FLX reads, were selected to provide SOLSEv5.0, the most informative transcriptome with low redundancy and devoid of most single-exon transcripts. It included both Illumina and GS-FLX reads and consisted of 51,348 transcripts of which 22,684 code for 17,429 different proteins described in databases, where 9527 were predicted as complete proteins. SOLSEv5.0 was used as reference for the study of retinoic acid (RA) signalling in sole larvae using drug treatments (DEAB, a RA synthesis blocker, and TTNPB, a RA-receptor agonist) for 24 and 48 h. Differential expression and functional interpretation were facilitated by an updated version of DEGenes Hunter. Acute exposure of both drugs triggered an intense, specific and transient response at 24 h but with hardly observable differences after 48 h at least in the DEAB treatments. Activation of RA signalling by TTNPB specifically increased the expression of genes in pathways related to RA degradation, retinol storage, carotenoid metabolism, homeostatic response and visual cycle, and also modified the expression of transcripts related to morphogenesis and collagen fibril organisation. In contrast, DEAB mainly decreased genes related to retinal production, impairing phototransduction signalling in the retina. A total of 755 transcripts mainly related to lipid metabolism, lipid transport and lipid homeostasis were altered in response to both treatments, indicating non-specific drug responses associated with intestinal absorption. These results indicate that a new assembling and transcript sieving were both necessary to provide a reliable transcriptome to identify the many aspects of RA action during sole development that are of relevance for sole aquaculture.

Emx2 underlies the development and evolution of marsupial gliding membranes

Article Open access 24 April 2024

A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain

Article Open access 13 December 2023

Genomes of multicellular algal sisters to land plants illuminate signaling network evolution

Article Open access 01 May 2024

Introduction

Flatfish refers to a diverse group of high valuable fish species worldwide belonging to the order Pleuronectiformes. Some of these species have been intensively studied in recent years due to their potential to be produced in aquaculture or as attractive biological models to understand the molecular mechanisms that govern the severe morphological and physiological transformations that occur during metamorphosis. In spite of the significant advances in the aquaculture of some species, most of them have to face new challenges related to growth performance, larval production, fish quality and pigmentation and disease resistance. As a result, abundant genomic resources including genomes and transcriptomes were reported and have been successfully used to understand flatfish asymmetry in Japanese flounder¹, metamorphosis regulatory pathways in Atlantic halibut², sex chromosome evolution and sex determination in half-smooth tongue sole³ and gene mapping in turbot⁴. It fact, fish genomics has been called to support genetic improvement of production traits and breeding programmes in aquaculture⁵.

Solea senegalensis (Senegalese sole) is the most important flatfish cultivated in Southern Europe by market price. Although its culture has faced specific challenges and some bottlenecks still persist, such as the lack of reproduction success for males cultured in captivity, the high dispersion of sizes or the high incidence of malformations. Nevertheless, more than 1600 tonnes are currently produced in Europe under advanced recirculation systems⁶. Key advances in hatchery and nursery protocols occurred in parallel with the development of important genomic resources including transcriptomes⁷ and genome drafts^8,9. All this information enabled the design of useful mid- and high-density arrays and successful high-throughput transcript sequencing (RNA-seq) studies providing relevant information about lipid physiology in larvae¹⁰, olfactory communication between breeders¹¹, immune response associated to experimental diets¹², osmoregulation⁷ or pigmentation disorders in post-larvae¹³. However, the complex regulatory pathways that regulate metamorphosis in sole are not yet fully understood. The impressive plasticity of the individuals during this period is evidenced by their ability to imprint new properties as a function of the environmental factors. In fact, behavioral and metabolic changes lead to the acquisition of a functional digestive tract, competent immunity responses, mature neuro-muscular skeletal system, skin and pigmentation development, visual performance, and gonad development during this period^10,14,15,16. Metamorphosis is triggered by a surge of thyroid hormones (TH)¹⁷ that govern a TH-responsive asymmetric centre in the anterior head region for eye migration and head remodeling¹⁸. Even so, these signalling cascades are highly modulated by environmental factors such as vitamin A that can interact with TH levels to modify skeletal morphogenesis leading to skeletal deformities¹⁹. In the Japanese flounder, retinoic acid (RA), the active metabolite of vitamin A, also appears to be critical in establishing asymmetric pigmentation and in modulating eye migration¹. In sole, a few key genes involved in RA metabolism have been described²⁰, and further research is required to understand the action of retinoids during metamorphosis in this species.

High-quality assembled and annotated transcriptomes are essential to achieve a comprehensive picture of cell or tissue physiology²¹, which is especially critical when genome sequence is not established. Transcriptomes are highly dynamic and show important differences depending on the cell genetic background, regulatory programming during organism development and cell differentiation, and post-transcriptional modifications²². As a result, assembling a de novo transcriptome, an essential approach for high-throughput gene expression studies, is prone to overestimate the number of transcripts due to immature mRNAs, intermediary spliced forms, sequencing errors, fragmented transcripts, insufficient sequencing depth and biological variability²³. Previous assembled transcriptomes in sole reported a high number of transcripts^7,8,24,25 that clearly exceeded the expected number of predicted genes as reported in closely related flatfish (about 21,000 protein-coding genes^1,3,26). Accurate transcript quantification for gene expression studies is hindered by over-represented transcriptomes²⁷, resulting in biased transcript discovery, over-estimation of family-collapsed contigs, and under-estimation of redundant contigs^11,27. Bioinformatic strategies to reduce artefacts and redundancy have been implemented in sole⁷, but the result was still far from being optimal, and further polishing to increase tissue representativity, transcript completeness and annotations is required.

The release of better de novo transcriptomes is challenging that evolves quite fast^22,28,29. The current study describes a new and improved version of the S. senegalensis transcriptome named SOLSEv5.0 using only a representative selection of sequencing libraries as input to a fine-tuned de novo assembling workflow based on TransFlow²⁸ that combines multiple assemblers. The best assembly was then polished to remove low confidence transcripts. Finally, its suitability for RNA-seq studies was demonstrated using an experimental study that investigates the role of RA on metamorphic larvae by using a specific RA-receptor agonist or a RA synthesis blocker. The bioinformatic analysis was performed using an improved version of DEGenes Hunter³⁰.

Results

De novo assembling for new “raw” transcriptomes

It is well established that more input reads do not produce better assemblies^31,32. According to this observation, 30 Illumina and 1 GS-FLX libraries were adopted as the most informative ones from the original dataset of 111 libraries⁷ following criteria detailed in “Methods” section. These libraries accounted for 454 million Illumina pair-end reads and 5.66 millions GS-FLX single-end reads that after pre-processing rendered 361 and 3.1 millions of useful reads, respectively (Supplementary File 1). To validate species-specific reads, useful reads were then mapped onto a draft genome of S. senegalensis^8,9,33. Supplementary File 1 shows that GS-FLX reads presented the lowest mapping rate (10.59 %, maybe due to their long length or the presence of too many polymorphisms), while Illumina libraries H634, H638 and H639 have distinctly higher mapping rates ($ >70 $ % maybe due to an overrepresentation of immature transcripts containing introns). Since libraries H634, H638 and H639, together with H632, belonged to the same experimental batch, they were discarded, resulting in a subset of 26 Illumina (275,501,704 paired-end reads) and 1 GS-FLX libraries for assembling the new transcriptome.

“Raw” transcriptomes were assembled using a fine-tuned TransFlow workflow detailed in “Methods” section. Since transcriptome evaluation was independent of the TT sequence but based on 15 quality parameters calculated by TransFlow, zebrafish transcriptome was selected as reference because it is the best known fish transcriptome. The 15 assemblies with the lowest PCA distance to the zebrafish reference transcriptome are presented in Table 1. scOases_k45 (Table 1), where only Illumina reads were assembled using Oases with k-mer = 45, was the first choice, and its name was simplified for convenience as “Oases_k45” from now on. Since GS-FLX libraries were prepared using a different set of tissues⁷, aaMin2/scOases_cat_cd/454Cap3 in Table 1 (named as “Min2_Oases_Cap3” from now on for convenience) was also considered. It was built using the following steps: (i) assembling of GS-FLX reads using Mira3 and Euler-SR, and the resulting contigs were combined with CAP3; (ii) assembling Illumina reads with Oases using k-mers of 45 and 55, and redundancy decreasing using CDHIT-EST; and (iii) reconciliation of non-redundant contigs from step (ii) with CAP3 contigs from step (i) using Minimus2. Interestingly, the Oases_k45 was produced in step (ii), which makes it a part of Min2_Oases_Cap3.

Table 1 The 15 assembling approaches with the lowest PCA distance to the reference transcriptome of D. rerio (zebrafish) generated after fine-tunning TransFlow are presented.

Full size table

Both Oases_k45 and Min2_Oases_Cap3 were functionally and structurally annotated with full-length proteins from Actinopterigii taxon and then completed with full-length proteins from the vertebrate division, and were then compared to the published transcriptome (referred as v4.0)⁷. Table 2 shows that new “raw” transcriptomes have less tentative transcripts (TTs), smaller overall transcriptome length (but similar between Oases_k45 and Min2_Oases_Cap3), a two- to four-fold increase of N50 and N90, and less artefacts than v4.0 obtained with the same reads. Therefore, the proposed fine-tuned assembling workflow generated more accurate transcriptomes.

Table 2 Main features of “raw” transcriptomes (the original v4.0 as well the new Min2_Oases_Cap3 and Oases_k45) after functional and structural annotation.

Full size table

The second half of Table 2 displays the functional annotation and coding statuses of the three transcriptomes. It was striking that the new “raw” transcriptomes had less TTs ‘Without orthologue’ and without predictable coding region (‘Unknown nature’ in Table 2) than v4.0. Furthermore, Min2_Oases_Cap3 had 7 353 additional orthologues (‘With orthologue’ in Table 2), 8574 additional unique othologues and 3546 additional unique complete transcripts compared to Oases_k45. Therefore, Min2_Oases_Cap3 appeared as a more comprehensive “raw” transcriptome than Oases_k45 despite having a high rate of unknown TTs, which indicated that further polishing was advisable.

Transcriptome polishing

The first cleansing step was conducted by mapping the TTs from the three “raw” transcriptomes (see Table 2) on the Senegalese sole genome, where v4.0 was included as a baseline control. As a result, Min2_Oases_Cap3 and Oases_k45 transcriptomes contained less unmapping, atypical and low quality TTs than v4.0 (see Table 3 for values and definitions). Hence, the three kinds of low confidence TTs were removed from their respective “raw” transcriptome to produce “definitive” transcriptomes that were then evaluated for intron-exon patterns. A higher proportion of introns per TT were found in Min2_Oases_Cap3 and Oases_k45, particularly for TTs with $> 3$ exons (Table 3). Most removed TTs were very short, or the consequence of chimaeric or misassembled contigs (Supplementary File 2). Therefore, polishing steps improved “definitive” transcriptomes, even though it hardly removes low confidence TTs in v4.0 due to a suboptimal assembling approach.

Table 3 Low confidence TTs identified in “raw” transcriptomes that were removed to constitute “definitive” transcriptomes whose TTs were counted by the number of exons identified in their sequence.

Full size table

To confirm that polishing did not remove informative TTs, structural statuses of “raw” and “definitive” transcriptomes, as well as the set of “raw” quality” TTs, were compared (Fig. 1). ‘Unknown’ status was always the most abundant and accounted for huge TT numbers in “raw” transcriptomes and was markedly decreased after polishing (Table 3). Artefacts—abundant in “raw” v4.0 (125,606)—were nearly absent in Min2_Oases_Cap3 and Oases_k45 (841 and 90, respectively), and most of them (99.17%: 25,939 not mapping onto the genome draft and 98,625 considered atypical) were removed after polishing. Therefore, any TT tagged with ‘Artifact’ status was removed. All of this produced a consistent number of genes between Min2_Oases_Cap3 and Oases_k45 (51,348 and 40,950, respectively) compared to the inflated number of v4.0 (118,436 TTs). Hence, the results supported that polishing removed TTs of limited interest.

Intriguingly, the decrease in the other structural statuses after polishing was moderated (Fig. 1). This was explained by the TT redundancy (Fig. 2), where v4.0 was the most redundant compared to Oases_k45 and Min2_Oases_Cap3. TT decreases from “low quality” to “definitive” transcriptomes may be explained by redundancy too. Overall, these data demonstrate that “definitive” Min2_Oases_Cap3 was less redundant and contained more different orthologue IDs than the other transcriptomes and hence it should be selected as SOLSEv5.0.

The SOLSEv5.0 transcriptome

Usefulness of SOLSEv5.0 transcriptome for gene expression analyses was demonstrated by mapping 18 Illumina libraries corresponding to a study of RA signalling in Senegalese sole larvae (Supplementary File 3) onto the three “definitive” transcriptomes and the “raw” v4.0 as a baseline control (Fig. 3). Columns “Mapping reads” and “Mapping TTs/[v4.0]” indicated that the amount of reads mapping on a transcriptome decreased with respect to the number of TTs in the transcriptome. The ratio of mapping reads in “definitive” Oases_k45 was lower (50 %) than in Min2_Oases_Cap3 (75 %), that could be due to the tissue-specific TTs provided by the GS-FLX reads, suggesting that they should be retained in SOLSEv5.0. The fact that most of the TTs ($>80$ %) in “definitive” Min2_Oases_Cap3 and Oases_k45 were supported by at least one pair of reads (“Mapped TTs” columns in Fig. 3) reinforced again that polishing was not associated with information loss but a redundancy reduction, as suggested above (Fig. 2). Finally, “definitive” Min2_Oases_Cap3 showed the highest amount of “Reads per TT” (Fig. 3), strongly suggesting that it should be the Senegalese sole transcriptome SOLSEv5.0³⁴.

When SOLSEv5.0³⁴ and “definitive” Oases_k45 (Table 4) were compared with their “raw” transcriptome versions (Table 2, Fig. 1), it is noticeable the strong reduction of TTs without orthologue and, more interestingly, that most of them were now predicted as coding sequences by Transdecoder (which is included in our Full-LengtherNext annotation tool) instead of unknown nature. It is also observed an increase in N50 and N90 of each assembly, and a decrease in artefacts, in spite of the decrease in the number of orthologues and unique complete transcripts due to polishing (even if this reduction is mainly due to redundancy, as observed in Fig. 2). The main features of SOLSEv5.0 when compared to “definitive” Oases_k45 (Table 4) revealed that (i) the aggregate length of SOLSEv5.0 was shorter in spite of having more TTs; (ii) artefactual TTs (Supplementary File 2, Fig. 1) were absent; (iii) SOLSEv5.0 had more orthologues, more unique IDs and more complete and unique complete TTs in spite of having shorter N50 and N90; and (iv) although SOLSEv5.0 had 6088 more TTs without orthologue, a higher proportion (5098, 64.2 %) was predicted to code for a protein that can be the result of new genes, genes with highly divergent sequences, or new splicing isoforms that should be taken into account. Note that the longest TT observed in Min2_Oases_Cap3 in Table 2 disappeared in Table 4, and both “definitive” transcriptomes have the same longest transcript. Since it is reported that the overall quality of assemblies must rely on more than one statistics^35,36, results about mapping of Fig. 3 as well as the increase of N50/N90, the contrast in number of orthologues, unique orthologues, complete transcripts, and even the ratio of coding sequences among the TTs without orthologue (Table 4) are also reinforcing the choice of the SOLSEv5.0 transcriptome.

Table 4 Main features of “definitive” transcriptomes based on Min2_Oases_Cap3 (SOLSEv5.0) and Oases_k45.

Full size table

Differential expression of RA signalling in sole larvae

To obtain insights about RA-mediated regulatory cascades in sole metamorphosing larvae, counts of mapped reads of Supplementary File 3 were analysed using a new version of DEGenes Hunter³⁰. It was designed for dealing with de novo assembled transcriptomes with improved functionalities for differential expression and new capabilities for functional interpretation. A highly conservative approach applying default expression thresholds based on “prevalent” DETs (differentially expressed TTs) was followed. Untreated control (CTRL) larvae were compared to those conditioned with DEAB (a blocker of RA synthesis that inhibits the enzyme retinaldehyde dehydrogenase) and TTNPB (a specific RA-receptor agonist that triggers a RA-signalling response) at 24 and 48 h, resulting in four sets of DETs (folders starting by CTRL_vs in Supplementary File 4).

A total of 1963 and 70 DETs were identified in DEAB-treated larvae and 3203 and 421 in the TTNPB-treated larvae with respect to CTRL at 24 and 48 h after treatment, respectively (Fig. 4A). When DETs were compared between sampling points within each treatment, 79 % matched between 24 h and 48 h in DEAB while only 25 % coincided in TTNPB-treated larvae at 24 and 48 h (Fig. 4A). Interestingly, both drug treatments modified the expression of a common set of 755 transcripts (mainly at 24 h, 650 DETs). The PCA (principal component analysis) using the 4 730 “prevalent” DETs across the four comparisons displayed a transient effect more intense at 24 h than at 48 h. The three experimental groups (CTRL, DEAB and TTNPB) showed separation at 24 h (Fig. 4B, circles). However, at 48 h (Fig. 4B, squares) the DEAB-treated larvae appeared closer and overlapping with the CTRL group although TTNPB still remained slightly separated, suggesting a more sustained response with TTNPB than with DEAB.

To assess the robustness of SOLSEv5.0 regarding differential expression, the same analysis was performed using “raw” v4.0 as transcriptome reference and the results are given Supplementary File 5. In agreement with Fig. 3, the average number of normalized counts per TT for SOLSEv5.0, ranging from 40 % (48 h) to 90 % (24 h), is higher than for “raw” v4.0 (Supplementary File 6A). As expected, more DETs were identified in “raw” v4.0 (Supplementary File 6B) due to redundancy (Fig. 2) and the total number of TTs (697 125 for “raw” v4.0 (Table 2) and 51 348 for SOLSEv5.0 [Table 4]). The mapping of 48.4 % of TTs in “raw” v4.0 produce more common DETs than using SOLSEv5.0 (compare Supplementary File 6B with Fig. 4A). However, sample clustering by PCA in Supplementary File 6C was similar to Fig. 4B. Taken together, it can be stated that the highly polished SOLSEv5.0 was less redundant, less confounding and at least as informative than the original “raw” v4.0 transcriptome.

Functional analysis of DEAB and TTNPB effects on RA signalling

The new version of DEGenes Hunter extended and facilitated the functional data interpretation based on zebrafish orthologues (given as functional_report.html in every CTRL_vs folder in Supplementary File 4). Overall, main GOs of biological pathways modified by drug treatments are depicted in Fig. 5. DEAB and TTNPB treatments shared eight over-represented pathways related to “lipid metabolism”, “transport and homeostasis” and “carboxylic acid metabolism”. Pathways more represented in DEAB-treated larvae were “regulation of cholesterol esterification” and those associated with immune system (“antigen processing and presentation”, “antigen processing and presentation of peptide antigen via MHC class I”, “antigen processing” and “presentation of peptide antigen”, the latter was observed both at 24 and 48 h). In TTNPB-treated larvae, specific over-represented pathways were mainly related to morphogenesis at 24 h and those related to retinoids and other metabolic pathways at 48 h.

To gain more insights about RA-signalling, the set of shared DETs and those specific for DEAB and TTNPB treatments were separately analyzed for enrichment of GO categories. In shared DETs, the “terpenoid metabolic process” pathway that is directly involved in RA metabolism was significantly over-represented and mainly up-regulated. Moreover, genes in categories previously identified and associated with “lipids”, “immune system”, “carbohydrate homeostasis” and “proteolysis” appeared mainly down-regulated (Fig. 6, ‘Shared DETs’). For DEAB-specific DETs (1220), main over-represented pathways contain down-regulated genes at 24 h (1023 DETs). Some of them were previously observed in the set of shared DETs (those related to “peptidase activity”, “immune system” and “lipids”) and only “mitotic cell cycle” and “organelle fission” pathways were specific (Fig. 6, ‘DEAB-specific 24h’). When the set of TTNPB-specific transcripts was analyzed at 24 h (2496 specific DETs, 83.5% down-regulated) main over-represented GO categories were related to “morphogenesis”, “phosphorus and organonitrogen metabolism”, “vesicle-mediated transport”, “dephosphorylation”, “enzyme activator activity” and “synaptic vesicle cycle” (Fig. 6, ‘TTNPB-specific 24h’). At 48 h (327 DETs, 84.7% up-regulated) there was a shift in significantly enriched categories with the “regulation of carbohydrate catabolism”, “extracellular matrix organization”, “isoprenoid catabolic process”, “intracellular receptor signalling pathway” and “retinol metabolic process”, among others, pathways (Fig. 6, ‘TTNPB-specific 48 h’).

The same functional interpretation was carried out using the results Supplementary File 5 obtained with “raw” v4.0 transcriptome as reference. In spite of the different number and DETs and zebrafish orthologues (Supplementary File 7, Venn diagrams), with about 60.4 % matching in zebrafish IDs in both sets of DETs), the GO analysis clearly demonstrated that categories specially related with RA metabolism of TTNPB at 48 h were missed (Supplementary File 8). In fact, “raw” v4.0 transcriptome was biased toward metabolic pathways with a high redundancy through the different comparisons. These categories were also observable using SOLSEv5.0 (Fig. 6), supporting that SOLSEv5.0 can provide a clearer and more comprehensive functional interpretation, avoiding the negative consequences of high redundancy.

Table 5 Representative DETs between DEAB and TTNPB with respect to CTRL at 24 and 48 h after RNA-seq analysis.

Full size table

A detailed analysis of DETs from Supplementary File 4 identified several genes related to RA and TH metabolism that are indicated in Table 5. RA receptors, such as rargb, raraa and rxrba, that mediated RA actions were differentially expressed only in the TTNPB-treated larvae. The rbp4 (retinol binding protein) was down-regulated and crpb1a (cellular RA binding protein) was up-regulated in both DEAB and TTNPB treatments, whereas crabp2b was up-regulated only in TTNPB treatments. When enzymes implicated in RA biosynthesis and degradation were monitored, cytochromes P450 26A1 (cyp26a1) and 26B1 (cyp26b1) as well as lecithin retinol acyltransferases (lrata and lratb.1) were strongly up-regulated in TTNPB treatments (cytochromes were down-regulated in DEAB treatments but not significantly). Also, dehydrogenase/reductase 3 (dhrs3b) was significantly up-regulated in TTNPB and down-regulated in DEAB treatments. Enzymes aldehyde dehydrogenase 1 family member A2 (aldh1a2) and trans-retinol 13,14-reductase (retsat) appeared only significantly down-regulated in TTNPB group. Enzymes all-retinol dehydrogenase 7 (rdh7) and $\beta ,\beta$-carotene 9’,10’-oxygenase-like (bco2l) were down-regulated in both treatments and the $\beta ,\beta$-carotene 15,15’-dioxygenase-like (bco1) and $\beta ,\beta$-carotene 9’,10’-oxygenase (bco2a) were up-regulated in DEAB treatments. In addition to these genes, Table 5 also presents a wide set of DETs related to thyroid axis, phototransduction signalling in retina, pigmentation-related genes, and matrix-related genes. Moreover, several transcription factors were also differentially expressed with DEAB and TTNPB treatments (Supplementary File 4).

Discussion

The wide use of RNA-seq and the increasing number of studies focused on transcript-discovery or expression profiles are paving the way to obtain better transcriptomes and new assembling pipelines combining different bioinformatics tools, such as CAFE²², TransFlow²⁸ or RefShannon²⁹. These new pipelines have to deal with errors due to the high number of reads, weakly expressed or ‘uncommon’ transcripts, circular RNAs, gene fusions, and isoform discrimination. It was well established that assembly quality improves with longer paired-end reads²⁸ but not increasing the input sequencing dataset³¹ due to, for example, the increase in low abundance transcripts and a plethora of unannotated transcripts, most of them containing intronic regions³². Hence, small sequencing datasets are desirable to decrease computational requirements and produce robust transcriptomes, since manual curation is excluded, even though it has been proved to render the best results³⁷. Unfortunately, there is no consensus on the most efficient RNA-seq analysis protocols for characterizing and quantifying transcripts, which can result in a high number of TTs, such as the case of the Senegalese sole transcriptome⁷, with nearly 700,000 TTs and mapping rates higher than 90%^7,11. These figures indicate an over-estimation of the real number of transcripts provoked by the huge amounts of reads in datasets used as input in the pipeline^31,32 and the suboptimal assembling approach (Table 2). A partial solution to the challenging de novo assembling arrived from using only 27 high-quality sole libraries (Supplementary File 1) instead of all (111) replicates⁷, that represent about 23 % of the total number of reads, reduction that will not have any significant impact on the representativity of transcript sequences, even though a small overall decrease in biological variability might occur. Then, de novo assembling reduced RAM demands and time requirements, and reduced to 96 the former 180 assemblies produced by TransFlow workflow²⁸ after considering only scaffolded sequences with a minimal coverage of 10 paired-end reads and increasing the k-mer sizes. The resulting transcriptomes had lower amounts of TTs with a lower proportion of “Unknown nature” (Table 2) in agreement with a recently published study where depth effects on de novo RNA-seq assembling was demonstrated³². The resulting high-quality, annotated transcriptome named SOLSEv5.0³⁴ contained Illumina and GS-FLX libraries from different tissues (Table 1) and was consistent with other species where the presence of GS-FLX reads improved the final transcriptome²⁸, as can be seen in Fig. 3 and Table 4. Therefore, the new assembling pipelines with less input datasets contributed to obtain more accurate “raw” transcriptomes than the original v4.0 approach (Table 2)^7,22,28.

The numbers shown in Table 2 indicated that additional polishing was required besides published criteria⁷ of sequence similarity, gene orthology or coding prediction that improved the original v4.0^10,11. In the present study, removal of low confidence TTs based on structural annotation and their mapping patterns (Table 3) diminished the huge amount of artefactual single-exon and uninformative TTs (Fig. 1), as well as TT redundancy (Fig. 2), which resulted in more robust mapping when using “definitive” Min2_Oases_Cap3 (Fig. 3). The new transcriptome SOLSEv5.0 (Table 4) contained 22,684 TTs coding for 17,429 different proteins, a reasonable number compared with phylogenetically near species such as the 21,516 different proteins described on Cynoglossus semilaevis genome³ or the 21,787 in Japanese flounder¹. Based on SOLSEv5.0 features, it is proposed that any de novo transcriptome should rely on selecting a reasonable number of paired-end reads to be assembled with an optimised workflow, and a final removal of low confidence TTs (Fig. 3). In fact, a suboptimal assembly presents many low confidence TTs that are difficult to remove (Fig. 1), and a good assembling approach without polishing still retains many redundant TTs (Fig. 2).

But, even if de novo assembled transcriptomes are acceptable, bioinformatic tools for RNA-seq are not specifically designed for them due to the low annotation rates and biases in FDR corrections than are exacerbated by redundant transcripts²⁷. This is why improved R scripts of DEGenes Hunter were developed in this study to analyse differential expression patterns using SOLSEv5.0³⁴ as reference. The analysis relied on “prevalent” DETs (those TTs predicted as differentially expressed in all algorithms applied) that were then used for subsequent functional analyses (Supplementary File 4). The new features for functional interpretation in DEGenes Hunter provide dependable and accurate information about samples and functional changes (Figs. 4, 5, Supplementary File 4).

RA synthesis, degradation and cellular transport are highly conserved in vertebrates³⁸. A coordinated action of these three pathways have revealed as essential to fine-tune RA levels gradients that in turn control morphogenesis and many other developmental processes. A previous study in sole using DEAB an TTNPB drugs demonstrated that RA levels modify the expression of some enzymes involved in retinol-retinal-RA conversion and RA degradation by establishing a negative feedback mechanism³⁹. The present study demonstrates that acute exposure of larvae to these two drugs triggered an intense, specific and transient response at 24 h but with hardly any differences after 48 h. Although these treatments modified the expression of a common set of transcripts, the overall number of DETs were representative of drug-specific regulatory activities as demonstrated by PCA analysis (Fig. 4). Remarkably, both DEAB an TTNPB treatments activated a homeostatic response related to disturbance of retinoid metabolism (Figs. 5, 6). Key enzymes such as dhrs3b (Table 5), involved in the biosynthetic pathway and controlling the reduction of retinaldehyde back to retinol, appeared regulated through a negative feedback, promoting the synthesis of retinal in DEAB and retinol storage in TTNPB treatments. Moreover, the RA-receptor activation by TTNPB modified several pathways related to RA biosynthesis and degradation, retinol storage, carotenoid metabolism and visual cycle (Table 5, Fig. 7). Cytochromes cyp26a1 and cyp26b1 mRNAs were highly up-regulated at 24 and 48 h, concomitant with the induction of cellular RA binding proteins 1 and 2 (genes crabp1 and crabp2a in Table 5, Fig. 7) to prevent excessive amounts of cellular all-trans retinoic acids (atRAs)^39,40,41. Moreover, down-regulation of aldh1a2 expression (Table 5, Fig. 7) indicated a switch-off of the second step of RA biosynthesis. Simultaneously with controlling the balance between RA production and degradation, other closely related pathways were modified to promote the retinol storage by increasing lrat, reducing adipogenesis by retsat (Table 5, Fig. 7)⁴², shifting $\beta$-carotene transformation toward apocarotenals instead of retinal production, and promoting 11-cis retinoids synthesis^43,44. Unlike TTNPB, DEAB treatments modified specifically only those pathways related to retinal supply (carotenoids) and transformation (visual cycle). On the whole, TTNPB and DEAB treatments triggered a wide homeostatic response beyond biosynthesis and degradation, as a feedback mechanism to control atRA-mediated actions.

Concerning morphological aberrations, skeletogenesis disorders and malpigmentation, they were systemically associated with an excess of dietary vitamin A levels and external treatments with atRA. The present study indicates that increased RA signalling activated some pathways related to morphogenesis and collagen fibril organisation (Table 5). This is consistent with the fact that several transcription factors increased (hoxb1, hoxb5, Hox-B6b, Hox-C5a, hoxc6, intestine-specific homeobox and six2) or reduced (hoxa13, lhx2, hnf1b, hmbox1, cux1, zfhx3 and meis2) their expression levels (Supplementary File 4). Going into detail, RA-mediated dysregulation of hox genes in zebrafish was teratogenic due to a modification of the anteroposterior developmental patterning^45,46. High levels of RA that induced overexpression of hox5b provoked severe cardiac abnormalities^46,47. Moreover, a zebrafish defective in aldh1a2 that phenotypically lacks forelimbs (pectoral fins) and posterior branchial arches reduced the expression of hoxb5a, hoxb6a and hoxb6b along the entire length of its spinal cord expression domain, decreased hoxb4a expression in the hindbrain and failed to express dlx2, an early marker of apical ectodermal ridge activity in the fin bud⁴⁸. In addition to transcription factors, TTNPB also modified the expression of 11 collagen-encoding genes (Table 5) and osteocalcin-2 (bglap). Although our experimental design was intended to evaluate the effects under a short-time drug exposure, this wide response of regulatory transcriptional factors and collagens does not preclude major structural alterations and agrees with the morphological aberrations found in larvae fed high levels of vitamin A^19,49.

RA has also been associated with malpigmentation in flatfish during metamorphosis, such as in Japanese flounder where high doses of 9-cis-RA promoted development of adult chromatophores when supplied at the beginning of metamorphosis⁵⁰. More recent studies demonstrated that an RA gradient is required for the generation of asymmetric pigmentation in flatfish with higher concentrations of atRA and 9-cis-RA in the ocular side than blind side¹. In sole, previous studies failed to demonstrate an association between dietary levels of vitamin A and pseudoalbinism^49,51, more related to dietary arachidonic acid levels⁵². In the current study, TTNPB results in up-regulation of GTP cyclohydrolase I (gch1; Table 5), the first and rate-limiting enzyme of pteridine synthesis and specifically expressed in xanthoblasts and to some extent in melanoblasts^53,54. Moreover, TTNPB also modified enzymes modulating hypoxanthine biosynthesis (the main pigment in iridophores) such as nt5c2 and pnp5a, suggesting disrupted pigmentation patterns in sole by modifying xanthophores and iridophores physiology^13,55. High expression of gch1 and disruption of iridophores, contrary to the pattern observed in pseudoalbino¹³, could be behind of the high rates of ambicolouration disorders currently found in the sole industry. Further experiments with longer evaluation periods will be necessary to demonstrate this hypothesis.

RA and TH establish a cross talk to regulate morphogenesis and cell development and differentiation in vertebrates mediated by the competition of the thyroid receptor (TR) and RA receptor (RAR) to form a heterodimer with the retinoid X receptor (RXR) for binding to gene promoters^1,56. This interaction is crucial in flatfish metamorphosis governed mostly by THs^17,18,25. Recent molecular findings in Japanese flounder have demonstrated that disorders in eye migration during metamorphosis are mediated by the RA-TH interaction: atRA reduced the proliferation of cells in the suborbital area of the blind side eye by up-regulating rara and down-regulating thraa and thrb1¹. In sole, dietary vitamin A levels also modified their thyroid follicle number and size, T3 and T4 immunoreactive staining, skeletogenesis and mineralisation^19,49. Moreover, thrb and thrab expression was reduced in metamorphic larvae fed with high levels of vitamin A¹⁹, and hormonal treatments demonstrated that atRA and TTNPB increased thrb mRNA levels and reduced thrab transcript levels in post-metamorphic larvae but not in pre-metamorphosis³⁹. In the present study, metamorphic larvae exposed to TTNPB down-regulated the expression of tg and thrab (Table 5), but did not modify the expression of dio2 and thrb, two genes that play a pivotal role in the asymmetric remodelling of sole head¹⁸. These data indicate a time-sensitivity window to hormonal treatments that might explain the minor effects on settlement and eye migration in sole, and the differences in gene expression patterns^19,49.

DEAB and TTNPB modified the expression of a common set of 755 TTs (Fig. 4A), most of them at 24 h after drug treatments. These TTs were mainly related to “lipid metabolic process”, “lipid transport” and “lipid homeostasis”, processes that play a key role in the absorption and distribution of vitamin A. It is already known that high levels of retinal or RA inhibit adipogenesis when administered at early stages of adipocyte development⁵⁷. External supply of retinal or Raldh1$^{-/-}$ mice mutants that accumulate retinal down-regulate the expression of adipogenic target genes and the transport protein rbp4 (also observed in the present study) through inhibiting the PPAR-$\gamma$:RXR dimer as a defensive mechanism to limit a RA excess in tissues⁵⁷. Similarly, liganded RA-receptor or agonists inhibit adipogenesis by blocking C/EBP$\beta$-mediated induction of downstream genes⁵⁸. Involvement of PPAR-$\gamma$ and C/EBP in adipogenesis regulation seems to be a conserved setting in fish^59,60. The regulation of lipid transport, including several apolipoproteins involved in chylomicrons and very-low-density lipoprotein formation and lipases, is a key regulatory mechanism in the intestine and liver of pelagic and metamorphosing sole larvae to fit nutrient levels^{10,61,62,63,64}. Therefore, our data suggest that DEAB and TTNPB treatments provided sensing signals to activate mechanisms that limit the availability of this fat-soluble vitamin, high levels of retinal in DEAB and enhanced RA-signalling in TTNPB treatments, which is consistent with the capacity of sole larvae to mobilize and store dietary vitamin A surplus²⁰. They also support a negative feedback for vitamin A intestinal absorption and storage as previously suggested⁴⁴ and highlight non-specific RA effects associated with lipid metabolism in DEAB and TTNPB treatments that should be taken into consideration when these drugs are used in fish trials.

In conclusion, even if new sequencing data using long read technologies (Oxford Nanopore or PacBio Sequel) would benefit any transcriptome assembling, re-analysis of already existing data⁶⁵ is essential to provide optimised genomic tools for gene expression estudies and to exploit previous research investments. Hence this study demonstrates the usefulness of in silico strategy to generate the new SOLSEv5.0 as reference transcriptome for RNA-seq studies in sole. Also, new DEGenes Hunter capabilities have provided full relevant information for gene expression analyses. Both of them facilitated investigation into RA signalling and metabolism in larvae, revealing that DEAB and TTNPB drugs trigger a wide, coordinated and specific homeostatic response to maintain physiological RA levels. Moreover, expression changes of transcripts involved in morphogenesis and pigmentation also support a RA role in tissue remodeling that could be behind the morphological abnomalities associated with the supply of vitamin A in sole larvae. The identification of DETs shared by both drug treatments demonstrate non-specific drug effects related to lipid metabolism and gut absorption that it is relevant for the design of future studies using these drugs.

Methods

Computational infrastructure

Picasso supercomputer of University of Malaga was used for all bioinformatic tests, implementations and executions. It consists of an OpenSUSE LEAP 42.3 with Slurm queue system and Infiniband network (54/40 Gbps) containing 216 nodes with Intel E5-2670 2.6 GHz cores for a total of 3 456 cores and 22 TB of RAM. The required software was already installed, and all workflows and pipelines developed here were based on the workflow manager AutoFlow ⁶⁶. Assembling workflow required the installation of TransFlow and its dependencies²⁸, such as Oases, SOAPdenovotrans, Minimus2, MIRA4, EULER-SR, CAP3, BUSCO, CD-HIT-EST, and FactoMineR.

Sequencing data and mapping

A total of 111 Illumina RNA-seq libraries of $2\times 75$ nt per read ($>1800$ million reads) and 5,663,225 GS-FLX RNA-seq single-end reads of Senegalese sole were available from BioProjects PRJNA255461, PRJNA241068 and PRJNA261151⁷. These datasets comprised nine experiments that included negative control and drug treatment in triplicate libraries, or even time-course responses. To optimize the number of reads to assemble, only one out of each biological replicate was randomly selected by experiment, and when a time series was available, only the first and the last sampling points were chosen (Supplementary File 1). Libraries containing GS-FLX reads were pre-processed using SeqTrimNext (based on SeqTrim⁶⁷), while SeqTrimBB (based on BBmap suite⁶⁸) was used for Illumina reads. Default parameters were applied in both cases. Illumina reads shorter than 60 bp were discarded, whereas the threshold for GS-FLX reads was set in 90.

Pre-processed reads (Supplementary File 1) were mapped onto the S. senegalensis genome draft³³ using Bowtie2⁶⁹ with the –no-mixed parameter to discard reads from unpaired alignments. SAMtools⁷⁰ was used for mapping manipulation and read counts.

De novo assembling and annotations

Pre-processed (useful) reads from libraries in Supplementary File 1 were assembled using the automated and modular framework TransFlow²⁸ customised as follows: (i) increased k-mer length to 45 and 55 for Illumina reads (TransFlow variable $kmers = [45;55]), and to 29 for GS-FLX reads; (ii) removal of Illumina scaffolds with low coverage ($<10\times$, new TransFlow variable $NT_COVERAGE_IN_CONTIG=10); (iii) primary, intermediary assemblies are not considered, retaining only scaffolded assemblies; and (iv) additional CD-HIT-EST⁷¹ execution at the end of module 3 was included using options -c 0.95 -s 0.7 to reduce contig/scaffold redundancy at 95 % identity and 70 % overlap. The customised workflow is available at GitHub (https://github.com/JoseCorCab/TransFlow) and resulted in up to 96 assemblies. The Danio rerio GRCz11 build was used as reference transcriptome to select the best Senegalese sole “raw” transcriptome based on PCA distances. Sequencing reads ERR216329 from the SRA repository were used for quality assessment of the D. rerio transcriptome within TransFlow.

Structural status and functional assignment were annotated using Full-LengtherNext (P. Seoane, N. Fernández-Pozo and M.G. Claros, in preparation; http://www.scbi.uma.es/fulllengthernext for web execution, http://www.scbi.uma.es/site/scbi/downloads/313-full-lengthernext for instructions and off-line installation) configured to use UniProtKB full-length protein sequences for Actinopterygii taxon as in 24-09-2018 for first annotation database, and the Vertebrata division as in 24-09-2018 for the main database.

Removal of low confidence transcripts

The rationale of the following processes is to sieve abnormal TTs. Firstly, a “raw” transcriptome was mapped against the preliminary S. senegalensis genome draft³³ developed in our laboratory^8,9 using Minimap2⁷² in splice mode, including option -uf for finding canonical splicing GT-AG sites, and option -C5 for a penalty of 5 for non-canonicals. The resulting SAM (Sequence Alignment Map) file was processed using SAMtools view -h -F 2052 to retain the best mapped position of multimapping transcripts without unmapped ones. According to Patterson et al.³², unspliced, fragmented, or missasembled, transcripts, as well as contaminant DNA, were then removed³², as well as TTs with ungapped mapping with $>90$ % identity.

The last polishing step was intended to retain those TTs having $>70$ % identity covering $>70$ % of its length to avoid fragmented or chimeric TTs containing too many intronic sequences. Since coverage and identity are not given in SAM files, SAM format was converted to PAF (Pairwise mApping Format; https://github.com/lh3/miniasm/blob/master/PAF.md) using sam2paf option of paftools.js (a script provided by Minimap2 authors in https://github.com/lh3/minimap2/tree/master/misc). CIGARs (Concise Idiosyncratic Gapped Alignment Report) were also transfered using an in-house script written in Ruby (v2.4.1) merge_paf_cigar.rb (this and other postprocessing scripts are available at GitHub https://github.com/JoseCorCab/TransFlow_postprocessing). Minimap2 artefacts produced when aligning transcripts to genomes were trimmed using paf_report.rb, which also calculates coverage, identity and exon number for each transcript.

Fish trial and drug treatments

To investigate the effects of retinoic acid (RA) on sole larvae, fertilized eggs were obtained from naturally spawning Senegalese sole broodstock (IFAPA Centro El Toruño). The eggs were collected in the morning (8.00 am) and separated by buoyancy to get the fecundated fraction. Eggs were incubated in 15 L cylindro-conical tanks using a flow-through sea water circuit with gentle aeration at an initial density of 3000 embryos $\hbox {L}^{-1}$ and full exchange of sea water every hour. Newly hatched larvae (1 day post-hatch (dph)) were transferred to a 400 L tanks at an initial density of 45 to 50 larvae $\hbox {L}^{-1}$ and maintained in the dark until the onset of external feeding (3 dph). Detailed protocol for larval rearing was as previous described⁷³. Shortly, larvae were supplied rotifers (Brachionus plicatilis) enriched for 3 h with Tisochrysis lutea (T-iso strain) from 3 till 9 dph. Moreover, Artemia metanauplii enriched for 24 h with T. lutea were supplied from 7 dph until the end of the experiment. Live microalgae Nannochloropsis gaditana and T. lutea were also added directly to water to enriched the live preys in the tank. During the trial, a 16 h:8 h light:dark photoperiod was used (light intensity 500 lux) and the mean water temperature and salinity was $21.1 \pm 0.3~^{\circ }$C and $34.2 \pm 0.2$ ppt, respectively. When larvae had just started the metamorphosis (12 dph), they were distributed into nine 15 L cylindro-conical tanks at the same initial larval density. Four days later (16 dph), when larvae were at the metamorphosis climax (metamorphic stages S2–S3)¹⁷, the following treatments were applied: (i) control group (CTRL) with dimethylsulfoxide (DMSO) (Sigma-Aldrich, Ref. 472301); (ii) TTNPB group with 4-[(E)-2-(5,6,7,8-tetrahydro-5,5,8,8-tetramethyl-2-naphthalenyl)-1-propenyl]benzoic acid (25 nM in DMSO, Sigma-Aldrich, Ref. T3757), a RA analog which acts as a selective agonist of RA-receptor; and (iii) DEAB group with 4-diethylaminobenzaldehyde (50 µM in DMSO, Sigma-Aldrich, Ref. 31830), an inhibitor of aldehyde dehydrogenase (ALDH) enzymes that prevents RA synthesis. Doses were selected according to previous data in sole^39,74. Larvae were sampled at 24 and 48 h after the drug treatments, euthanized using MS-222, washed using diethylpyrocarbonated water, frozen in liquid nitrogen, and stored at $-\,80~^{\circ }$C until analysis.

The study has been carried out in accordance with EC Directive 86/ 609/EEC for animal experiments and Spanish regulations on animal welfare. All procedures were approved by the Animal Ethics Committee of IFAPA.

RNA preparation

Pools of larvae (n = 5) from each tank (n = 3) were homogenised using the Fast-prep FG120 instrument (Bio101) and Lysing Matrix D (Q- Bio-Gene) for 40 s at speed setting 6. The numbers of embryos/larvae in the pools were always similar between conditions and replicates. Total RNA was isolated using the RNeasy Mini Kit (Qiagen) and treated twice with DNase I using an RNase-Free DNase kit (Qiagen) for 30 min to avoid genomic DNA contamination. Illumina libraries were constructed using mRNA-Seq sample preparation kit and sequenced using TruSeq SBS Kit v3-HS, in paired end mode, $2 \times 76$ bp in a fraction of a lane (1/6) of a HiSeq2000 sequencing system (Illumina, Inc) following the manufacturer’s protocol as previously reported⁷.

Gene expresion analyses

Useful reads from Supplementary File 3 were mapped onto a reference transcriptome using Bowtie2 with default parameters and –no-unal –no-mixed options to only retain appropriately mapped paired-end reads in the SAM file. The SAM file was sorted and converted to BAM by SAMtools sort and reads mapped per transcript were counted by sam2counts (https://github.com/vsbuffalo/sam2counts). Then, differential expression analysis was performed using an improved version of DEGenes Hunter³⁰ available at https://github.com/seoanezonjic/DEgenesHunter. The first script degenes_Hunter.R is for differential expression using edgeR, DEseq2, limma and NOISeq, and generates files with quality control, expression data and a final report (DEG_report.html) where analysis details and plots are given. A DET is qualified as “prevalent” when it is differentially expressed in all algorithms used, and “possible” when not all algorithms qualify it as DET. By default, thresholds for a DET are absolute fold-change (FC) $>2$ and false discovery rate (FDR) $<0.05$ in every algorithm. The output includes a mean logFC and a combined FDR. More detail is given at its GitHub page.

Functional interpretation is launched with script functional_Hunter.R based mainly on topGO and clusterProfiler⁷⁵ and then a final report (functional_report.html) is generated. Gene Ontology terms, and Reactome and KEGG databases for pathways are inspected to graphically display KEGG over-representation and clustering, GO over-representation analysis for the three hierarchies, and Reactome over-representation. SOLSEv5.0 DETs were interpreted using their zebrafish orthologues. Supplementary File 4 contains the most relevant results produced by DEGenes Hunter for the RA signalling study.

Additional biological interpretation of functional information based on Gene Ontology terms and KEGG/BioCarta pathways of D. rerio orthologues were carried out using the ClueGO plug-in⁷⁶ for Cytoscape⁷⁶ to visualise functionally grouped terms. PCA analyses with normalised, prevalent DETs in at least one of the four comparisons were performed using ClustVis⁷⁷. Venn diagrams were performed with InteractiveVenn at http://www.interactivenn.net.

Code availability

RNA-seq datasets are available at accession numbers SRR4897845, SRR1030352, SRR1282039, SRR2072478, DRR003148, SRR954861, SRR1282039, SRR100067, SRR2005826 as well as BioProjects 392999, PRJNA287107, 392587 and PRJNA392589. The genome draft of a female Senegalese sole is available at FigShare³³; it includes five files: Sosen1_genome_scaffolds.fasta containing every contig and scaffold identifier and sequence in fasta format; Sosen1_genome_annotation.gff3 corresponding to a tentative annotation of genome contigs and scalffolds using MAKER2 and transcript sequences in SOLSEv5.0³⁴; Sosen1_maker.transcripts.fasta containing the deduced transcripts from the gff3 annotation file; Sosen1_maker.proteins.fasta containing the deduced amino acid sequence for all deduced transcripts; and Sosen1_maker.proteins_annotation.tsv containing more annotations for deduced transcripts and proteins, such as transcript and protein lengths, best UniProtKB orthologue with identity % and E-value, structural status, open reading frame location in the transcript, description, GOs, KEGG codes, InterPro IDs, Pfam, EC and Unipathway, as tab-separated values (tsv format). Full-LengtherNext can be executed at http://www.scbi.uma.es/fulllengthernext; instructions and off-line installation can be obtained in http://www.scbi.uma.es/site/scbi/downloads/313-full-lengthernext. TransFlow can be downloaded from https://github.com/seoanezonjic/TransFlow; the customisation for this work can be downloaded from https://github.com/JoseCorCab/TransFlow. The new DEGenes Hunter version is available from https://github.com/seoanezonjic/DEgenesHunter. Other in-house scripts can be downloaded from https://github.com/JoseCorCab/TransFlow_postprocessing. SOLSEv5.0 transcriptome is available at FigShare³⁴, including three files: SOLSEv5.0.fasta containing every TT identifier and its sequence in fasta format; SOLSEv5.0_ORTH_Drerio.tsv including the zebrafish orthologue (Subject_id) for every TT (Query_id) and its description as tab-separated values (tsv); and SOLSEv5.0_annot.tsv containing more annotations for each TT, such as its length, UniProtKB orthologue with identity % and E-value, structural status, open reading frame location in the TT, description, GOs, KEGG codes, InterPro IDs, Pfam, EC and Unipathway, in tsv format.

References

Shao, C. et al. The genome and transcriptome of Japanese flounder provide insights into flatfish asymmetry. Nat. Genet. 49, 119–124. https://doi.org/10.1038/ng.3732 (2017).
Article CAS PubMed Google Scholar
Alves, R. N. et al. The transcriptome of metamorphosing flatfish. BMC Genomics 17, 413. https://doi.org/10.1186/s12864-016-2699-x (2016).
Article CAS PubMed PubMed Central Google Scholar
Chen, S. et al. Whole-genome sequence of a flatfish provides insights into ZW sex chromosome evolution and adaptation to a benthic lifestyle. Nat. Genet. 46, 253–260. https://doi.org/10.1038/ng.2890 (2014).
Article CAS PubMed Google Scholar
Figueras, A. et al. Whole genome sequencing of turbot (Scophthalmus maximus; Pleuronectiformes): a fish adapted to demersal life. DNA Res. 23, 181–92. https://doi.org/10.1093/dnares/dsw007 (2016).
Article CAS PubMed PubMed Central Google Scholar
Houston, R. D. et al. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet.https://doi.org/10.1038/s41576-020-0227-y (2020).
Article PubMed Google Scholar
APROMAR. La Acuicultura en España 2019 v 1.3. Tech. Rep., Asociación Empresarial de Acuicultura de España (2019).
Benzekri, H. et al. De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray. BMC Genomics 15, 952. https://doi.org/10.1186/1471-2164-15-952 (2014).
Article CAS PubMed PubMed Central Google Scholar
Manchado, M., Planas, J. V., Cousin, X., Rebordinos, L. & Claros, M. G. Genomics in Aquaculture, chap. Current Status in Other Finfish Species: Description of Current Genomic Resources for the Gilthead Seabream (Sparus aurata) and soles (Solea senegalensis and Solea solea) 195–221 (Academic Press, San Diego, 2016).
Google Scholar
Manchado, M., Planas, J. V., Cousin, X., Rebordinos, L. & Claros, M. G. The Biology of Sole, Chap. Genetic and Genomic Characterization of Soles 375–394 (CRC Press, Boca Raton, 2019).
Google Scholar
Hachero-Cruzado, I. et al. Characterization of the genomic responses in early senegalese sole larvae fed diets with different dietary triacylglycerol and total lipids levels. Comp. Biochem. Physiol. D Genomics Proteomics 12, 61–73. https://doi.org/10.1016/j.cbd.2014.09.005 (2014).
Article CAS PubMed Google Scholar
Fatsini, E., Bautista, R., Manchado, M. & Duncan, N. J. Transcriptomic profiles of the upper olfactory rosette in cultured and wild Senegalese sole (Solea senegalensis) males. Comp. Biochem. Physiol. D Genomics Proteomics 20, 125–135. https://doi.org/10.1016/j.cbd.2016.09.001 (2016).
Article CAS PubMed Google Scholar
Montero, D. et al. Dietary vegetable oils: effects on the expression of immune-related genes in Senegalese sole (Solea senegalensis) intestine. Fish Shellfish Immunol. 44, 100–8. https://doi.org/10.1016/j.fsi.2015.01.020 (2015).
Article CAS PubMed Google Scholar
Pinto, P. I. S. et al. Understanding pseudo-albinism in sole (Solea senegalensis): a transcriptomics and metagenomics approach. Sci. Rep. 9, 13604. https://doi.org/10.1038/s41598-019-49501-6 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Campos, C., Valente, L. M. P., Conceição, L. E. C., Engrola, S. & Fernandes, J. M. O. Temperature affects methylation of the myogenin putative promoter, its expression and muscle cellularity in senegalese sole larvae. Epigenetics 8, 389–97. https://doi.org/10.4161/epi.24178 (2013).
Article PubMed PubMed Central Google Scholar
Firmino, J. et al. Phylogeny, expression patterns and regulation of dna methyltransferases in early development of the flatfish, Solea senegalensis. BMC Dev. Biol. 17, 11. https://doi.org/10.1186/s12861-017-0154-0 (2017).
Article CAS PubMed PubMed Central Google Scholar
Carballo, C. et al. Microalgal extracts induce larval programming and modify growth and the immune response to bioactive treatments and lcdv in senegalese sole post-larvae. Fish Shellfish Immunol. 106, 263–272. https://doi.org/10.1016/j.fsi.2020.07.020 (2020).
Article CAS PubMed Google Scholar
Manchado, M., Infante, C., Asensio, E., Planas, J. V. & Cañavate, J. P. Thyroid hormones down-regulate thyrotropin beta subunit and thyroglobulin during metamorphosis in the flatfish Senegalese sole (Solea senegalensis Kaup). Gen. Comp. Endocrinol. 155, 447–55. https://doi.org/10.1016/j.ygcen.2007.07.011 (2008).
Article CAS PubMed Google Scholar
Campinho, M. A. et al. A thyroid hormone regulated asymmetric responsive centre is correlated with eye migration during flatfish metamorphosis. Sci. Rep. 8, 12267. https://doi.org/10.1038/s41598-018-29957-8 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Fernández, I. et al. Vitamin a affects flatfish development in a thyroid hormone signaling and metamorphic stage dependent manner. Front. Physiol. 8, 458. https://doi.org/10.3389/fphys.2017.00458 (2017).
Article PubMed PubMed Central Google Scholar
Boglino, A. et al. Commercial products for Artemia enrichment affect growth performance, digestive system maturation, ossification and incidence of skeletal deformities in Senegalese sole (Solea senegalensis) larvae. Aquaculture 324–325, 290–302. https://doi.org/10.1016/j.aquaculture.2011.11.018 (2012).
Article Google Scholar
Morillon, A. & Gautheret, D. Bridging the gap between reference and real transcriptomes. Genome Biol. 20, 112. https://doi.org/10.1186/s13059-019-1710-7 (2019).
Article PubMed PubMed Central Google Scholar
You, B.-H., Yoon, S.-H. & Nam, J.-W. High-confidence coding and noncoding transcriptome maps. Genome Res. 27, 1050–1062. https://doi.org/10.1101/gr.214288.116 (2017).
Article CAS PubMed PubMed Central Google Scholar
Smith-Unna, R., Boursnell, C., Patro, R., Hibberd, J. M. & Kelly, S. TransRate: reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 26, 1134–1144. https://doi.org/10.1101/gr.196469.115 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ferraresso, S. et al. Exploring the larval transcriptome of the common sole (Solea solea L.). BMC Genomics 14, 315. https://doi.org/10.1186/1471-2164-14-315 (2013).
Article CAS PubMed PubMed Central Google Scholar
Louro, B., Marques, J. P., Manchado, M., Power, D. M. & Campinho, M. A. Sole head transcriptomics reveals a coordinated developmental program during metamorphosis. Genomics 112, 592–602. https://doi.org/10.1016/j.ygeno.2019.04.011 (2020).
Article CAS PubMed Google Scholar
Kettleborough, R. N. W. et al. A systematic genome-wide analysis of zebrafish protein-coding gene function. Nature 496, 494–497. https://doi.org/10.1038/nature11992 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Hsieh, P.-H., Oyang, Y.-J. & Chen, C.-Y. Effect of de novo transcriptome assembly on transcript quantification. Sci. Rep. 9, 8304. https://doi.org/10.1038/s41598-019-44499-3 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Seoane, P. et al. TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms. BMC Bioinform.https://doi.org/10.1186/s12859-018-2384-y (2018).
Article Google Scholar
Mao, S., Pachter, L., Tse, D. & Kannan, S. Refshannon: a genome-guided transcriptome assembler using sparse flow decomposition. PLoS ONE 15, e0232946. https://doi.org/10.1371/journal.pone.0232946 (2020).
Article CAS PubMed PubMed Central Google Scholar
González Gayte, I., Bautista Moreno, R., Seoane Zonjic, P. & Claros, M. G. DEgenes hunter—a flexible R pipeline for automated RNA-seq studies in organisms without reference genome. Genomics Comput. Biol. 3, e31. https://doi.org/10.18547/gcb.2017.vol3.iss3.e31 (2017).
Article Google Scholar
Hayer, K. E., Pizarro, A., Lahens, N. F., Hogenesch, J. B. & Grant, G. R. Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data. Bioinformatics 31, 3938–45. https://doi.org/10.1093/bioinformatics/btv488 (2015).
Article CAS PubMed PubMed Central Google Scholar
Patterson, J. et al. Impact of sequencing depth and technology on de novo RNA-Seq assembly. BMC Genomics 20, 604. https://doi.org/10.1186/s12864-019-5965-x (2019).
Article CAS PubMed PubMed Central Google Scholar
Claros, M. G., Seoane, P. & Manchado, M. Sequences and annotations of a provisional genome draft of a Senegalese sole female. Figsharehttps://doi.org/10.6084/m9.figshare.12472100 (2020).
Article Google Scholar
Claros, M. G., Córdoba-Caballero, J., Seoane, P. & Manchado, M. Sequences and annotations of SOLSEv5.0 transcriptome. Figsharehttps://doi.org/10.6084/m9.figshare.12296171 (2020).
Article Google Scholar
Bradnam, K. R. et al. Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2, 10. https://doi.org/10.1186/2047-217X-2-10 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wajid, B. & Serpedin, E. Do it yourself guide to genome assembly. Brief Funct. Genomics 15, 1–9. https://doi.org/10.1093/bfgp/elu042 (2016).
Article CAS PubMed Google Scholar
Kanitz, A. et al. Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data. Genome Biol. 16, 150. https://doi.org/10.1186/s13059-015-0702-5 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rhinn, M. & Dollé, P. Retinoic acid signalling during development. Development 139, 843–58. https://doi.org/10.1242/dev.065938 (2012).
Article CAS PubMed Google Scholar
Boglino, A., Ponce, M., Cousin, X., Gisbert, E. & Manchado, M. Transcriptional regulation of genes involved in retinoic acid metabolism in Senegalese sole larvae. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 203, 35–46. https://doi.org/10.1016/j.cbpb.2016.08.007 (2017).
Article CAS PubMed Google Scholar
Hernandez, R. E., Putzke, A. P., Myers, J. P., Margaretha, L. & Moens, C. B. Cyp26 enzymes generate the retinoic acid response pattern necessary for hindbrain development. Development 134, 177–87. https://doi.org/10.1242/dev.02706 (2007).
Article CAS PubMed Google Scholar
Adams, M. K., Belyaeva, O. V., Wu, L. & Kedishvili, N. Y. The retinaldehyde reductase activity of dhrs3 is reciprocally activated by retinol dehydrogenase 10 to control retinoid homeostasis. J. Biol. Chem. 289, 14868–80. https://doi.org/10.1074/jbc.M114.552257 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schupp, M. et al. Retinol saturase promotes adipogenesis and is downregulated in obesity. Proc. Natl. Acad. Sci. U.S.A. 106, 1105–10. https://doi.org/10.1073/pnas.0812065106 (2009).
Article ADS PubMed PubMed Central Google Scholar
Harrison, E. H. Mechanisms involved in the intestinal absorption of dietary vitamin A and provitamin A carotenoids. Biochim. Biophys. Acta 70–7, 2012. https://doi.org/10.1016/j.bbalip.2011.06.002 (1821).
Article CAS Google Scholar
Widjaja-Adhi, M. A. K., Lobo, G. P., Golczak, M. & Von Lintig, J. A genetic dissection of intestinal fat-soluble vitamin and carotenoid absorption. Hum. Mol. Genet. 24, 3206–19. https://doi.org/10.1093/hmg/ddv072 (2015).
Article CAS PubMed PubMed Central Google Scholar
Marlétaz, F., Holland, L. Z., Laudet, V. & Schubert, M. Retinoic acid signaling and the evolution of chordates. Int. J. Biol. Sci. 2, 38–47. https://doi.org/10.7150/ijbs.2.38 (2006).
Article PubMed PubMed Central Google Scholar
Waxman, J. S. & Yelon, D. Increased hox activity mimics the teratogenic effects of excess retinoic acid signaling. Dev. Dyn. 238, 1207–1213. https://doi.org/10.1002/dvdy.21951 (2009).
Article CAS PubMed PubMed Central Google Scholar
Waxman, J. S., Keegan, B. R., Roberts, R. W., Poss, K. D. & Yelon, D. Hoxb5b acts downstream of retinoic acid signaling in the forelimb field to restrict heart field potential in zebrafish. Dev. Cell 15, 923–934. https://doi.org/10.1016/j.devcel.2008.09.009 (2008).
Article CAS PubMed PubMed Central Google Scholar
Grandel, H. et al. Retinoic acid signalling in the zebrafish embryo is necessary during pre-segmentation stages to pattern the anterior-posterior axis of the CNS and to induce a pectoral fin bud. Development 129, 2851–2865 (2002).
CAS PubMed Google Scholar
Fernández, I. et al. Effect of dietary vitamin A on Senegalese sole (Solea senegalensis) skeletogenesis and larval quality. Aquaculture 295, 250–265. https://doi.org/10.1016/j.aquaculture.2009.06.046 (2009).
Article CAS Google Scholar
Miwa, S. & Yamano, K. Retinoic acid stimulates development of adult-type chromatophores in the flounder. J. Exp. Zool. 284, 317–324 (1999).
Article CAS PubMed Google Scholar
Fernández, I. & Gisbert, E. Senegalese sole bone tissue originated from chondral ossification is more sensitive than dermal bone to high vitamin A content in enriched Artemia. J. Appl. Ichthyol. 26, 344–349. https://doi.org/10.1111/j.1439-0426.2010.01432.x (2010).
Article Google Scholar
Villalta, M., Estévez, A. & Bransden, M. P. Arachidonic acid enriched live prey induces albinism in Senegal sole (Solea senegalensis) larvae. Aquaculture 245, 193–209. https://doi.org/10.1016/j.aquaculture.2004.11.035 (2005).
Article CAS Google Scholar
Braasch, I., Schartl, M. & Volff, J.-N. Evolution of pigment synthesis pathways by gene and genome duplication in fish. BMC Evol. Biol. 7, 74. https://doi.org/10.1186/1471-2148-7-74 (2007).
Article CAS PubMed PubMed Central Google Scholar
Nord, H., Dennhag, N., Muck, J. & von Hofsten, J. Pax7 is required for establishment of the xanthophore lineage in zebrafish embryos. Mol. Biol. Cell 27, 1853–62. https://doi.org/10.1091/mbc.E15-12-0821 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhou, L. et al. Genetic characteristic and RNA-Seq analysis in transparent mutant of carp-goldfish nucleocytoplasmic hybrid. Genes (Basel) 10, 704. https://doi.org/10.3390/genes10090704 (2019).
Article CAS Google Scholar
Li, H. et al. Ectopic cross-talk between thyroid and retinoic acid signaling: a possible etiology for spinal neural tube defects. Gene 573, 254–260. https://doi.org/10.1016/j.gene.2015.07.048 (2015).
Article CAS PubMed Google Scholar
Ziouzenkova, O. et al. Retinaldehyde represses adipogenesis and diet-induced obesity. Nat. Med. 13, 695–702. https://doi.org/10.1038/nm1587 (2007).
Article CAS PubMed PubMed Central Google Scholar
Schwarz, E. J., Reginato, M. J., Shao, D., Krakow, S. L. & Lazar, M. A. Retinoic acid blocks adipogenesis by inhibiting c/ebpbeta-mediated transcription. Mol. Cell Biol. 17, 1552–61. https://doi.org/10.1128/mcb.17.3.1552 (1997).
Article CAS PubMed PubMed Central Google Scholar
Wafer, R., Tandon, P. & Minchin, J. E. N. The role of peroxisome proliferator-activated receptor gamma (pparg) in adipogenesis: applying knowledge from the fish aquaculture industry to biomedical research. Front. Endocrinol. (Lausanne) 8, 102. https://doi.org/10.3389/fendo.2017.00102 (2017).
Article Google Scholar
Salmerón, C. Adipogenesis in fish. J. Exp. Biol.https://doi.org/10.1242/jeb.161588 (2018).
Article PubMed Google Scholar
Roman-Padilla, J., Rodríguez-Rua, A., Claros, M. G., Hachero-Cruzado, I. & Manchado, M. Genomic characterization and expression analysis of four apolipoprotein A-IV paralogs in Senegalese sole (Solea senegalensis Kaup). Comp. Biochem. Physiol. B Biochem. Mol. Biol. 191, 84–98. https://doi.org/10.1016/j.cbpb.2015.09.010 (2016).
Article CAS PubMed Google Scholar
Román-Padilla, J., Rodríguez-Rúa, A., Manchado, M. & Hachero-Cruzado, I. Molecular characterization and developmental expression patterns of apolipoprotein A-I in Senegalese sole (Solea senegalensis Kaup). Gene Expr. Patterns 21, 7–18. https://doi.org/10.1016/j.gep.2016.05.003 (2016).
Article CAS PubMed Google Scholar
Román-Padilla, J., Rodríguez-Rúa, A., Ponce, M., Manchado, M. & Hachero-Cruzado, I. Effects of dietary lipid profile on larval performance and lipid management in Senegalese sole. Aquaculture 468, 80–93. https://doi.org/10.1016/j.aquaculture.2016.10.005 (2017).
Article CAS Google Scholar
Roman-Padilla, J., Rodríguez-Rúa, A., Carballo, C., Manchado, M. & Hachero-Cruzado, I. Phylogeny and expression patterns of two apolipoprotein E genes in the flatfish Senegalese sole. Gene 643, 7–16. https://doi.org/10.1016/j.gene.2017.11.078 (2018).
Article CAS PubMed Google Scholar
Kovalevskaya, N. V. et al. Dnadigest and repositive: connecting the world of genomic data. PLoS Biol. 14, e1002418. https://doi.org/10.1371/journal.pbio.1002418 (2016).
Article CAS PubMed PubMed Central Google Scholar
Seoane, P. et al. AutoFlow, a versatile workflow engine illustrated by assembling an optimised de novo transcriptome for a non-model species, such as faba bean (Vicia faba). Curr. Bioinform. 11, 1–11. https://doi.org/10.2174/1574893611666160212235117 (2016).
Article CAS Google Scholar
Falgueras, J. et al. SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinform. 11, 38. https://doi.org/10.1186/1471-2105-11-38 (2010).
Article CAS Google Scholar
Bushnell, B. BBmap Suite (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359. https://doi.org/10.1038/nmeth.1923 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/Map format and SAM tools. Bioinformatics 25, 2078–2079. https://doi.org/10.1093/bioinformatics/btp352 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, W. & Godzik, A. CD-HIT: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659. https://doi.org/10.1093/bioinformatics/btl158 (2006).
Article CAS PubMed Google Scholar
Li, H. Minimap2: fast pairwise alignment for long DNA sequences. arXiv (2017). http://arXiv.org/1708.01492.
Fernández-Díaz, C. et al. Growth and physiological changes during metamorphosis of Senegal sole reared in the laboratory. J. Fish Biol. 58, 1086–1097. https://doi.org/10.1111/j.1095-8649.2001.tb00557.x (2001).
Article Google Scholar
Ponce, M. et al. Genomic characterization, phylogeny and gene regulation of g-type lysozyme in sole (Solea senegalensis). Fish Shellfish Immunol. 31, 925–37. https://doi.org/10.1016/j.fsi.2011.08.010 (2011).
Article CAS PubMed Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–7. https://doi.org/10.1089/omi.2011.0118 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bindea, G. et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–3. https://doi.org/10.1093/bioinformatics/btp101 (2009).
Article CAS PubMed PubMed Central Google Scholar
Metsalu, T. & Vilo, J. ClustVis: a web tool for visualizing clustering of multivariate data using principal component analysis and heatmap. Nucleic Acids Res. 43, W566–W570. https://doi.org/10.1093/nar/gkv468 (2015).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work would not have been possible without the computer resources and the technical support provided by the Plataforma Andaluza de Bioinformática of the University of Málaga. The authors are grateful to people that prepared larval samples within the Interreg Sudoe project AQUAGENET and deposited the dataset in the NCBI. This study was amenable by grants RTA2017-00054-C03-01, RTA2017-00054-C03-03 funded from MCIU/AEI(INIA)/ERDF-UE and by grant AGL2017-83370-C3-3-R funded from MCIU/AEI/ERDF-UE. Publication costs were funded by the mentioned RTA2017-00054-C03-01 grant.

Author information

These authors contributed equally: José Córdoba-Caballero and Pedro Seoane.

Authors and Affiliations

Department of Molecular Biology and Biochemistry, Universidad de Málaga, Málaga, 29071, Spain
José Córdoba-Caballero, Pedro Seoane, Fernando M. Jabato, James R. Perkins & M. Gonzalo Claros
CIBER de Enfermedades Raras (CIBERER), Málaga, 29071, Spain
Pedro Seoane, James R. Perkins & M. Gonzalo Claros
Institute of Biomedical Research in Malaga (IBIMA), IBIMA-RARE, Málaga, 29010, Spain
James R. Perkins & M. Gonzalo Claros
Consejería de Agricultura, Ganadería, Pesca y Desarrollo Sostenible, IFAPA Centro El Toruño, El Puerto de Santa María, Cádiz, 11500, Spain
Manuel Manchado
Instituto de Hortofruticultura Subtropical y Mediterránea (IHSM-UMA-CSIC), Málaga, 29010, Spain
M. Gonzalo Claros

Authors

José Córdoba-Caballero
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Seoane
View author publications
You can also search for this author in PubMed Google Scholar
Fernando M. Jabato
View author publications
You can also search for this author in PubMed Google Scholar
James R. Perkins
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Manchado
View author publications
You can also search for this author in PubMed Google Scholar
M. Gonzalo Claros
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.C.C. and P.S. collected and prepared the reads, defined the assemblies and customised the workflows. F.M.J. and J.R.P. updated the DEGenes Hunter scripts. J.C.C., P.S. and M.G.C. designed the polishing processes. M.M. carried out the RA-signalling analyses and interpretations. M.M. and M.G.C. wrote the main manuscript and figures. All authors have contributed, reviewed and approved the submitted version of the manuscript.

Corresponding author

Correspondence to M. Gonzalo Claros.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary files 1-8.

Supplementary file 4.

Supplementary file 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Córdoba-Caballero, J., Seoane, P., Jabato, F.M. et al. An improved de novo assembling and polishing of Solea senegalensis transcriptome shed light on retinoic acid signalling in larvae. Sci Rep 10, 20654 (2020). https://doi.org/10.1038/s41598-020-77201-z

Download citation

Received: 19 June 2020
Accepted: 06 November 2020
Published: 26 November 2020
DOI: https://doi.org/10.1038/s41598-020-77201-z

This article is cited by

Chromosome anchoring in Senegalese sole (Solea senegalensis) reveals sex-associated markers and genome rearrangements in flatfish
- Israel Guerrero-Cózar
- Jessica Gomez-Garrido
- Manuel Manchado
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Emx2 underlies the development and evolution of marsupial gliding membranes

A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain

Genomes of multicellular algal sisters to land plants illuminate signaling network evolution

Introduction

Results

De novo assembling for new “raw” transcriptomes

Transcriptome polishing

The SOLSEv5.0 transcriptome

Differential expression of RA signalling in sole larvae

Functional analysis of DEAB and TTNPB effects on RA signalling

Discussion

Methods

Computational infrastructure

Sequencing data and mapping

De novo assembling and annotations

Removal of low confidence transcripts

Fish trial and drug treatments

RNA preparation

Gene expresion analyses

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary files 1-8.

Supplementary file 4.

Supplementary file 5.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Chromosome anchoring in Senegalese sole (Solea senegalensis) reveals sex-associated markers and genome rearrangements in flatfish

Comments

Search

Quick links