Abstract
Teleost fishes display the largest array of sex-determining systems among animals, resulting in various reproductive strategies. Research on sex-related genes in teleosts will broaden our understanding of the process, and provide important insight into the plasticity of the sex determination process in vertebrates in general. Crimson seabream (Parargyrops edita Tanaka, 1916) is one of the most valuable and abundant fish resources throughout Asia. However, little genomic information on P. edita is available. In the present study, the transcriptomes of male and female P. edita were sequenced with RNA-seq technology. A total of 388,683,472 reads were generated from the libraries. After filtering and assembling, a total of 79,775 non redundant unigenes were obtained with an N50 of 2,921 bp. The unigenes were annotated with multiple public databases, including NT (53,556, 67.13%), NR (54,092, 67.81%), Swiss-Prot (45,265, 56.74%), KOG (41,274, 51.74%), KEGG (46,302, 58.04%), and GO (11,056, 13.86%) databases. Comparison of the unigenes of different sexes of P. edita revealed that 11,676 unigenes (9,335 in females, 2,341 in males) were differentially expressed between males and females. Of these, 5,463 were specifically expressed in females, and 1,134 were specifically expressed in males. In addition, the expression levels of ten unigenes were confirmed to validate the transcriptomic data by qRT-PCR. Moreover, 34,473 simple sequence repeats (SSRs) were identified in SSR-containing sequences, and 50 loci were randomly selected for primer development. Of these, 36 loci were successfully amplified, and 19 loci were polymorphic. Finally, our comparative analysis identified many sex-related genes (zps, amh, gsdf, sox4, cyp19a, etc.) and pathways (MAPK signaling pathway, p53 signaling pathway, etc.) of P. edita. This informative transcriptomic analysis provides valuable data to increase genomic resources of P. edita. The results will be useful for clarifying the molecular mechanism of sex determination and for future functional analyses of sex-associated genes.
Introduction
Sex determination is the process that establishes the sex of an individual and regulates the differentiation of sex characteristics, and its mechanisms are highly conserved in many animals (e.g. mammals and birds)1,2. Many of the genes involved in sex differentiation in distantly related species are the same, suggesting that the gene regulation mechanisms underlying sex differentiation and sex determination might be similar3,4. However, the mechanisms of sex determination in teleost fishes are remarkably diverse5,6. Among animals, teleost fishes showed the most diverse systems of sex determination, resulting in numerous reproductive strategies7,8. In contrast to sex in birds, humans and other higher vertebrates, which is relatively stable, sex in teleost fishes is unstable or alterable9,10,11. Research on sex determination mechanisms in teleosts has aroused widespread interest in an increasing number of investigators. It will provide deep insight into the plasticity of the sex determination process and broaden understanding of the process. In addition, as teleosts are an important protein resource for humans, it is important to understand the reproductive biology of teleosts to enable prediction of potential impacts and effective management.
Crimson seabream (Parargyrops edita Tanaka, 1916), a demersal fish belonging to the family Sparidae, is naturally distributed from the southern coast of Japan to the coast of Indonesia12. P. edita is one of the most valuable and popular fish resources throughout Asia, but catches dropped from ca. 49,000 t in 1970 to 21,000 t in 2008 due to overfishing in the Beibu Gulf13. With high overfishing, P. edita populations have shown miniaturization, prematurity and structural change. Given the important economic status and population decline of the species, many studies have been performed to access the stock and to examine the population genetic variation and biology of P. edita12,14. However, the genomic resources of P. edita are insufficient. Only mitochondrial gene sequences in the National Center for Biotechnology Information (NCBI) database can be used. Therefore, screening genes related to sex and identifying the functions of these genes may not only enable characterization of the mechanisms of the gene network underlying sex determination and differentiation in P. edita, but also enlarge the genomic database for P. edita and support future molecular biology studies.
Benefiting from the rapid development of next generation sequencing (NGS) technologies, high-throughput RNA sequencing (RNA-seq) is becoming readily available for non-model organisms15,16. The transcriptome contains almost all protein-coding genes, representing an essential part of the genome17. For species whose genomes are still not available or that have complex genomes, sequencing the transcriptome is an attractive option for detecting genes, especially in economically valuable fish. In addition, transcriptomic analysis enables simultaneous analyses of multiple processes, including protein homeostasis, metabolism and other regulatory cellular processes8,18,19. Recently, transcriptomics has been applied to identify sex-determining genes in teleost fishes20,21,22. In the present study, RNA-seq analysis was performed on pooled samples of multiple tissues from P. edita of different sexes. In addition, expression profiles were generated to detect differentially expressed genes (DEGs) between male and female P. edita. Subsequently, the candidate sex-related genes were determined. Furthermore, abundant simple sequence repeats (SSRs) were detected among deep-coverage sequence region reads. The results will provide a foundation for discovering regulatory mechanisms associated with sex-related genes and a valuable transcriptomic resource, which will facilitate research on the molecular regulatory mechanisms of teleost fishes.
Materials and methods
Ethics statement
All the experimental procedures were approved by the ethics committee of Laboratory Animal Welfare and Ethics of South China Sea Fisheries Research Institute (project identification code: nhdf 2019-02, date of approval: 13th Oct. 2019). The methods involving animals in this study were conducted in accordance with the Laboratory Animal Management Principles of China.
Fish materials
Six mature P. edita (three females and three males) were collected from the Beibu Gulf of China (N 19°56.552, E 108°02.841). To eliminate confounding effects, the P. edita were temporarily acclimated for three days in an aerated seawater tank. Then, all fish were anaesthetized and killed by severing the spinal cord. Then, tissues (muscle, gonad, liver and heart tissues) from each P. edita specimen were collected and frozen in liquid nitrogen for total RNA extraction.
RNA extraction, cDNA library building, and sequencing
Total RNA was extracted from each tissue of the six fish using TRIzol reagent. RNA integrity was assessed by using an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA), and the samples with RNA integrity numbers (RINs) ≥ 7 were subjected to subsequent analysis23. RNA from tissues from every individual was pooled in equal amounts.
Then, mRNA (RNA with a poly A tail) was extracted from the total RNA using magnetic beads with Oligo (dT) probes and purified. Fragmentation buffer was applied to break the mRNA into fragments of suitable size, and the fragmented mRNA was reverse-transcribed into double-stranded cDNA with the N6 random primer. Then, the cDNA fragments were repaired with phosphate at the 5′end, and an “A” base was added to the 3′ end, after which adapters were ligated to the cDNA fragments. PCR was performed to amplify the ligation products. After heat denaturation, the single-stranded DNA was cyclized with a splint oligo and DNA ligase. Finally, the prepared cDNA libraries were sequenced on the BGISEQ-500 platform using a 150-bp paired-end strategy (BGI, Shenzhen, China; http://bgitechsolutions.com/technologies/64).
De novo assembly and functional annotation
Trimmomatic was used to remove adaptors as well as low-quality sequences from the raw reads generated from the BGISEQ-500 platform24. First, the parameter in ILLUMINACLIP was set to 2:30:10 to remove the adaptor, leading and trailing bases with a Phred quality score below 3 were trimmed, reads were scanned with a 4 bp sliding window and trimmed when average Phred quality dropped below 20, and reads with a length lower than 50 bp were also discarded. Subsequently, the Trinity software package (trinityrnaseq, version: 2.0.6; https://github.com/trinityrnaseq/trinityrnaseq/releases/tag/v2.0.6) was applied to de novo assembly of the high-quality clean reads (min_kmer_cov: 3)25. The k-mer value was set to 25. TGICL clustering software (version: 2.1) was then used to remove the redundancies and acquire maximum-length nonredundant unigenes with a minimum of 70% similarity26. Benchmarking Universal Single-Copy Orthologs (BUSCO) was then used to evaluate transcriptome completeness, using a < 15% missing genes in the BUSCO Eukaryota dataset as a benchmark for analysis27. Finally, all unigenes were annotated by comparison against different public databases, including the NCBI non-redundant protein (NR) database, the NCBI nucleotide (NT) database, the Swiss-Prot protein database, the Gene Ontology (GO) databases, the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and the euKaryotic Orthologous Group (KOG) database (BLASTx 2.5.0; the E-value threshold was 1 × 10−5).
Analysis of potential sex-related genes
In order to characterize gene expression variation in the samples from different sexes, we used the short-read alignment tool Bowtie2 to map all clean reads to the assembled transcriptome, with end-to-end alignment, a 22-base seed length, two reseed attempts with no mismatches allowed in the seed, a seed interval of 17 bases, and all other settings at the default values 28. To determine the fragments per kilobase of exon model per million mapped fragments (FPKM) values, RSEM was applied to normalize the expression levels of all unigenes29. To identify DEGs, the DEGseq package was applied30. The threshold for DEGs was defined as an absolute log2(fold change) value ≥ 4 and a Q-value ≤ 0.001. Among the DEGs, unigenes specifically expressed in female or male crimson seabream were regarded as specifically expressed genes (SEGs). Furthermore, enrichment analyses (KEGG and GO) of the DEGs were performed based on the hypergeometric distribution test, and a significance test was applied. A P value ≤ 0.05 indicated a significantly enriched term.
Potential SSR marker detection and validation
The MIcroSAtellite identification tool (MISA, version 1.0) was applied to identify the simple sequence repeat (SSR) sequences in the assembled transcriptome. The minimum repeat number for the various unit types were as follows: twelve for mono-nucleotide, six for dinucleotide, five for tri-nucleotide and tetra-nucleotide, and four for penta-nucleotide and hexa-nucleotide microsatellites. Then, primers for every SSR locus were designed on flanking sequences by using Primer 331.
In order to assess the polymorphism of the SSR markers, we used a wild P. edita population sampled in the Beibu Gulf, genomic DNA was isolated from 24 randomly selected P. edita specimens using a marine animal DNA kit (Tiangen, Beijing, China). Then, 1% agarose gel electrophoresis was used to access the integrity of the DNA samples. Fifty pairs of primers were initially selected and tested. The annealing temperatures of the primers were initially tested for amplification using pooled DNA samples. PCR amplifications were carried out using a Mastercycler gradient thermal cycler (Eppendorf) in a final volume of 25 μl, and the mixture and conditions of PCR followed the methods of Shan et al32. Subsequently, genomic DNA from 24 samples were amplificated under appropriate annealing temperatures, and the products were resolved in 6% denaturing polyacrylamide gels and visualized by silver staining.
Validation of the transcriptome data by quantitative real-time PCR
To validate the accuracy of the crimson seabream transcriptomic data, ten genes were selected for quantitative real-time PCR (qRT-PCR) validation. All the selected genes exhibited significantly different expression between female samples and male samples; there were five female-biased genes and five male-biased genes. Primer design was carried out using Primer Premier 6.0 (PREMIER Biosoft International, Palo Alto, California). In addition, 18 s and β-actin were chosen as reference genes for internal standardization, and the PCR procedure and amplification reaction system were the same as those described by Lou et al16. ABI7300 SDS software (Applied Biosystems) was applied to analyze the data after the PCR program was run. We calculated the relative expression levels of the ten target unigenes based on the 2−ΔΔCt method (ΔCT = CTtarget unigene − CTreference gene, − ΔΔCT = ΔCTfemale − ΔCTmale) and the log2(fold change) was then used for a comparison with the log2(fold change) of RNA-seq.
Results
Transcriptome sequencing and de novo assembly
Six cDNA libraries were constructed and subsequently sequenced on the BGISEQ-500 platform. In total, 388,683,472 reads were generated, and 373,965,788 (96.21%) clean reads were screened after removal of adaptor sequences, N-containing sequences and low-quality sequences. The statistics of the six cDNA libraries are shown in Table 1. All clean reads from the females and males were assembled to generate unigenes (Table 1). After clustering and removal of redundancies with Trinity and TGICL, 79,775 unigenes were generated with an N50 length of 2,921 bp and a mean length of 1,523 bp (Table 2).
The raw reads of the present study were uploaded to the SRA databases of NCBI under BioProject PRJNA591405, with accession numbers SRR10568165-SRR10568170. This Transcriptome Shotgun Assembly project has been deposited at DDBJ/EMBL/GenBank under the accession number GICI00000000. The version described in this paper is the first version, GICI01000000.
Functional annotation of the P. edita transcriptome
All the unigenes were functionally characterized based on the descriptions of their similar sequences from databases. The numbers and percentages of unigenes that matched to various databases are shown in Table 3. In summary, 53,556 unigenes (67.13%) were annotated to the NCBI nucleotide NT database A total of 54,255 unigenes (68.01%) were annotated to at least one of the protein databases, while 8,876 unigenes (11.13%) were annotated to all the protein databases that were searched in this study, including the NR, Swiss-Prot, KEGG, KOG and GO databases (Fig. 1). Subsequently, the completeness of the assembly and annotations was assessed with BUSCO. The results showed that 99% of the unigenes (51% as single genes and 48% as duplicated genes) were found to be complete, and 1% were found to be fragmented (Figure S1).
A total of 41,274 unigenes were annotated to the KOG database and classified into 25 subcategories. Among the 25 subcategories, ‘general function prediction only’ (6,963 unigenes) and ‘signal transduction mechanisms’ (5,992 unigenes) were the top two subcategories (Fig. 2). In addition, 11,056 unigenes were classified into three major GO categories (cellular component, biological process and molecular function) (Fig. 3). The terms ‘binding’ (5,395 unigenes), ‘catalytic activity’ (4,054 unigenes) and ‘cell’ (3,148 unigenes) were dominant in each functional category, respectively. In total, 46,302 unigenes were assigned to pathways based on the KEGG pathways analysis (Fig. 4); the top four pathways were ‘human diseases’ (24,232), ‘organism system’ (19,108), ‘metabolism’ (13,717) and ‘cellular processes’ (10,149). Furthermore, the predominant subcategories of pathways were ‘signal transduction’ (7406), ‘global and overview maps’ (5,246) and ‘immune system’ (4,972). These annotations offer a valuable resource for investigating specific functions, pathways and processes in teleost research.
Discovery of molecular markers
SSRs have been widely applied in genetic linkage mapping, quantitative trait locus (QTL) mapping and population genetic diversity analyses for aquaculture species due to their high variability, abundance and co-dominant mode of inheritance33. Our search revealed 34,473 SSRs in the transcriptomic dataset, of which 41.00% were di-nucleotide repeats, 27.49% were tri-nucleotide repeats, and 24.66% were mono-nucleotide repeats (Fig. 5). Our findings are consistent with the conclusion that the di-nucleotide repeat type of SSR is the most abundant repeat type in animals17,34.
In the present study, fifty pairs of primers were randomly selected for synthesis and validation, and 36 primer pairs gave clean amplification product at the appropriate annealing temperatures. Among these, 19 primer pairs (38.00%) amplified at least four allele from all samples assessed on 6% denaturing polyacrylamide gels, indicating that these identified SSRs were at least moderately polymorphic (Table S1, Figure S2).
Potential sex-related genes and functional enrichment analysis results
To analyse the potential sex-related genes, unigenes with |log2(fold change)| values ≥ 4 and Q-values < 0.001 were determined to be DEGs between sexes. After filtration, 11,676 DEGs were obtained between different sexes of P. edita; these genes might be associated with sex determination (Fig. 6). Among these genes, 2,341 unigenes were male-biased DEGs and 9,335 unigenes were female-biased DEGs. In addition, sex-specific genes were detected, including 1,134 male SEGs and 5,463 female SEGs. All the DEGs between the two sexes were then used as input to perform GO and KEGG enrichment. Consequently, 2,543 unigenes (2,237 female-biased genes and 306 male-biased genes) were assigned to 1,911 GO terms (Figure S3). The results regarding the GO terms showed that sex-biased genes were predominantly associated with ‘integral component of membrane’ (GO:0016021), ‘ATP binding’ (GO:0005524), ‘nucleus’ (GO:0005634) and other terms. The top 20 statistically significant KEGG classifications of sex-biased genes are shown in Table S2. The results suggested that the three largest KEGG classifications of sex-biased genes were ‘Metabolic pathways’ (ko01100), ‘Epstein-Barr virus infection’ (ko05169) and ‘Small cell lung cancer’ (ko05200).
Sex determination is a complex trait that is controlled by multiple genes. Among the DEGs, genes involved in regulating sex control and gonadal development were predicted based on the results of sequence annotation, including genes encoding zona pellucida sperm-binding proteins (zps), anti-Müllerian hormone (amh), P450 aromatase (cyp19a), SRY-box containing protein 4 (sox4), and gonadal soma derived factor 1 (gsdf) and other candidate genes (Table 4).
Transcriptomic data validation
To validate the transcriptomic data, ten selected genes (five female-biased genes and five male-biased genes) were subjected to qRT-PCR to investigate the expression profiles in all female and male samples. The specific primers are shown in Table S3. Similar up- or down-regulation patterns of these genes were observed in qRT-PCR and RNA-seq results, although few genes showed differentiation between the values (Fig. 7). These differentiations may be due to the different calculated methods used between RNA-seq and qRT-PCR. Thus, the results of qRT-PCR validated the transcriptomic data.
Discussion
It is obvious that genes involved in gonad development and related to sex differentiation play critical roles in controlling the sex ratios of teleosts17. It is crucial to clarify the mechanisms of gonad differentiation and sex determination. However, due to insufficient detection of sex-related genes and explicit determination of biological pathways, research in this area has advanced very slowly. Parargyrops edita, an economically important marine fish, has rarely been studied with regard to its sex-related and sex-determining genes. To explore the mechanisms of sex determination, sex-related genes, and their biological pathways should be screened. In the present study, the transcriptomes of different sexes of P. edita were sequenced by the RNA-seq technique to screen sex-related genes between females and males. Our results detected numerous sequences that could be annotated to well-known sex-related genes and that were enriched in sex-related biological pathways. After transcriptome sequencing and assembly, 79,775 unigenes were obtained. After annotation, 54,255 (68.01%) unigenes were significantly matched in the protein databases. The annotation information of these unigenes might provide basic resources for further research on the molecular mechanisms of biological processes of P. edita. In this study, 11,676 DEGs were obtained between different sexes of P. edita. Some of these genes may play crucial roles in the sex determination of P. edita. Our results could provide fundamental resources for further studies on teleost.
According to the annotations, many sex-related genes were identified from the P. edita transcriptome. Zona pellucida proteins are a major class of female-specific glycoproteins related to reproduction in female vertebrates35. Zona pellucida proteins are detected in the mammalian egg envelope, and they are also in the inner layer of the fish chorion. Zona pellucida proteins are encoded by multiple genes, including zp3, zp4 and potentially other zona pellucida proteins genes36,37. Recently, multiple copies of zp genes have been detected in some teleost fishes (e.g., medaka and carp)38,39. The zona pellucida sperm-binding protein 3 and zona pellucida sperm-binding protein 4 function as the primary sperm receptor and induces the acrosome reaction35. In this study, compared to male P. edita, female P. edita showed higher zp3 and zp4 expression levels, showing that these sex-specific genes may play critical roles in reproduction.
The sex-determining region Y (SRY) box, or SOX, is characterized as a conserved signature sequence in the high-mobility group (HMG) DNA-binding domain40. SOX genes have been identified in most vertebrates (mammals, birds and fish), and the orthologous genes in different species are highly similar41. It has been suggested that the SOX4 protein plays critical roles in many developmental processes, nervous system development, thymocyte development and embryonic cardiac development42,43,44,45. In this study, compared to female P. edita, male P. edita showed higher sox4 expression levels.
Gonadotropins (GTHs) are important modulators of gonadal steroidogenesis and gametogenesis in teleosts, and GTH-II is important for final maturation and spermiation or ovulation46,47. In red seabream (Pagrus major), GTH-II may be involved in regulation of gametogenesis in both males and females and may be important for final maturation and ovulation or spermiation48. Tian et al. found that GTH-II subunit genes were expressed at higher levels in ovaries than in testes in Sillago sihama22. Similarly, GTH-II showed higher expression levels in female P. edita than in male P. edita, indicating that GTH-II may be important for final maturation and ovulation. Research has suggested that transforming growth factor beta (TGF-β) superfamily members play important roles in cellular homeostasis, cell growth and cell differentiation. An important member, anti-Müllerian hormone (amh) has been identified in some teleost fishes (e.g., Oreochromis niloticus, Larimichthys crocea, and Cynoglossus semilaevis); this gene plays a critical role in gonad development in several fish species49,50. Furthermore, another important member of the TGF-β family, gonadal soma-derived factor (gsdf), was first identified in Oncorhynchus mykiss and plays a critical role in the proliferation of spermatogonia in male rainbow trout51. However, it has been reported that gsdf is also expressed in the follicle cells surrounding oocytes in trout and salmon 51,52. In the present study, gsdf and amh showed higher expression levels in males and females, respectively, than in the opposite sexes indicating that these genes may mainly participate in the proliferation of spermatogonia and oogonia, respectively. Furthermore, the kinesin superfamily, a group of microtubule-dependent motors, has been suggested to play important roles in nuclear reshaping, acrosome biogenesis and flagellum formation during spermiogenesis53,54. KIFC3 is a C-terminal kinesin motor protein in the superfamily that is expressed mainly in cell lines and proliferative tissues55. It has been reported that KIFC3 genes play complementary roles in nuclear morphogenesis and acrosome formation during spermiogenesis of Eumeces chinensis 56. However, there is no evidence to support a correlation between KIFC3 genes and fish sex differentiation. Our study seems to indicate that KIFC3 genes are expressed mainly in male P. edita. The expression of KIFC3 genes should be studied in different organs of fish.
Doublesex and mab-3-related transcription factor 1 (Dmrt1), a sex-determining gene, has been identified in many vertebrate species57,58,59,60. Dmrt1 is highly conserved and has been reported as the first conserved gene involved in sex differentiation found from invertebrates to humans57. In addition, Dmrt1 has been indentified in many teleost fish, including Sillago sihama21, Scophthalmus maximus6, and Cynoglossus semilaevis 61. However, our data for Dmrt1 was not consistent with data for this gene from other species. In addition, there were some other known sex-related genes whose expression was not detected in P. edita. This lack of detection could have occurred because the threshold for DEG determination was too high or too low or because these genes are not actually sex-regulators in P. edita.
In the present study, some enriched KEGG pathways related to sex determination were identified, including the ‘mitogen-activated protein kinase’ (MAPK) signaling pathway, the ‘neuroactive ligand-receptor interaction’ and the p53 signaling pathway.
The MAPK signaling pathway exists in all eukaryotic organisms and plays a wide variety of cellular roles in growth, differentiation, and stress responses62. Previous studies have shown that the MAPK signaling pathway mainly participates in the movement of primary spermatocytes across the blood-testis barrier and the regulation of gonadotropin subunit gene expression63. In the present study, the pathway was significantly enriched in male P. edita. It was also enriched in female P. edita but without statistical significance, which might indicate that the pathway plays different roles in the gonadal development of P. edita.
Previous studies have shown that the neuroactive ligand-receptor interaction signaling pathway is associated with lactation performance in mice. Furthermore, associated genes of the pathway are also upregulated64. In teleost fish, it has been suggested that this pathway plays important roles in teleost reproduction and gonadal development, such as in Oplegnathus punctatus21, Paralichthys olivaceus20, Nile tilapia65 and Perca flavescens66. Hence, we speculated that the pathway might serve as a critical modulator in both the reproductive system and the nervous system to control sex hormone production. Our results showed that the pathway might be correlated with gonadal development and sexual maturity in P. edita by regulating the production and neuro-distribution of sex steroids.
Conclusions
In conclusion, this was the first transcriptomic study of P. edita and it provided significant information to enrich the genetic resources. Through comparison of male and female P. edita transcriptomes, numerous DEGs known to be involved in sex-related processes were identified. Through GO and KEGG enrichment analyses, several potential gonadal development- and gametogenesis-related genes and pathways were identified. These findings will be useful for clarifying the molecular mechanism of sex-determination and for future functional analyses of sex-associated genes. Moreover, many SSRs were identified in the transcriptome, providing tools for future molecular biology studies on this important species.
References
Yang, L., Wang, Y., Zhang, Z. & He, S. Comprehensive transcriptome analysis reveals accelerated genic evolution in a Tibet Fish Gymnodiptychus pachycheilus. Genome Biol. Evol. 7, 251–261 (2015).
Chen, H. et al. Transcriptome analysis of sex-related genes in the blood clam Tegillarca granosa. PLoS ONE 12, e0184584 (2017).
Ottolenghi, C. et al. Loss of Wnt4 and Foxl2 leads to female-to-male sex reversal extending to germ cells. Hum. Mol. Genet. 16, 2795–2804 (2007).
Smith, C. A. et al. The avian Z-linked gene DMRT1 is required for male sex determination in the chicken. Nature 461, 267–271 (2009).
Devlin, R. H. & Nagahama, Y. Sex determination and sex differentiation in fish: an overview of genetic physiological and environmental influences. Aquaculture 208, 191–364 (2002).
Ribas, L. et al. Comprehensive transcriptomic analysis of the process of gonadal sex differentiation in the turbot (Scophthalmus maximus). Mol. Cell Endocrinol. 422, 132–149 (2016).
Kobayashi, Y., Nagahama, Y. & Nakamura, M. Diversity and plasticity of sex determination and differentiation in fishes. Sex Dev. 7, 115–125 (2013).
Casas, L. et al. Sex change in clownfish: molecular insights from transcriptome analysis. Sci. Rep. 6, 35461 (2016).
Barske, L. A. & Capel, B. Blurring the edges in vertebrate sex determination. Curr. Opin. Genet. Dev. 18, 499–505 (2008).
Pan, Q. et al. Vertebrate sex-determining genes play musical chairs. C. R. Biol. 339, 258–262 (2016).
Chen, Y. et al. Discovery and identification of candidate sex-related genes based on transcriptome sequencing of Russian sturgeon (Acipenser gueldenstaedtii) gonads. Physiol. Genomics 48, 464–476 (2016).
Hou, G., Feng, B., Lu, H. & Zhu, F. Age and growth characteristics of crimson sea bream Paragyrops edita Tanaka in Beibu Gulf. J. Ocean. Univ. China 7, 457–465 (2008).
Feng, B., Hou, G., Lu, H. & Chen, X. Management recommendation for Paragyrops edita in Beibu Gulf based per-recruitment analysis. Syst. Sci. Compr. Stud. Agric. 25, 114–119 (2009).
Xia, J., Xia, K., Gong, J. & Jiang, S. Complete mitochondrial DNA sequence gene organization and genetic variation of control regions in Parargyrops edita. Fish. Sci. 73, 1042–1049 (2007).
Chatchaiphan, S., Srisapoome, P., Kim, J., Devlin, R. H. & Na-Nakorn, U. D. novo transcriptome characterization and growth-related gene expression profiling of diploid and triploid bighead catfish (Clarias macrocephalus Günther 1864). Mar. Biotechnol. 19, 36–48 (2017).
Lou, F., Yang, T., Han, Z. & Gao, T. Transcriptome analysis for identification of candidate genes related to sex determination and growth in Charybdis japonica. Gene 677, 10–16 (2018).
Ma, D. et al. Transcriptome analysis for identification of genes related to gonad differentiation growth immune response and marker discovery in the turbot (Scophthalmus maximus). PLoS ONE 11, e0149414 (2016).
Goncalves, P. et al. Transcriptomic profiling of adaptive responses to ocean acidification. Mol. Ecol. 26, 5974–5988 (2017).
Yang, X. et al. Comparative transcriptome analysis provides insights into differentially expressed genes and long non-coding RNAs between ovary and testis of the mud crab (Scylla paramamosain). Mar. Biotechnol. 20, 20–34 (2018).
Zhang, W. et al. Transcriptome analysis of the gonads of olive flounder (Paralichthys olivaceus). Fish. Physiol. Biochem. 42, 1581–1594 (2016).
Du, X. et al. Comparative transcriptome analysis of ovary and testis reveals potential sex-related genes and pathways in spotted knifejaw Oplegnathus punctatus. Gene 637, 203–210 (2017).
Tian, C. et al. Transcriptome analysis of male and female mature gonads of silver sillago (Sillago sihama). Genes 10, 129 (2019).
Schroeder, A. et al. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol. Biol. 7, 3 (2006).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Grabherr, M. G. et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat. Biotechnol. 29, 644–652 (2011).
Pertea, G. et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19, 651–652 (2003).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357 (2012).
Li, B. & Dewey, C. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011).
Wang, L., Feng, Z., Wang, X., Wang, X. & Zhang, X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics 26, 136–138 (2010).
Untergasser, A. et al. Primer3-new capabilities and interfaces. Nucl. Acids Res. 40, e115 (2012).
Shan, B. et al. Genetic diversity and population structure of black sea bream (Acanthopagrus schlegelii) based on mitochondrial control region sequences: the genetic effect of stock enhancement. Reg. Stud. Mar. Sci. 5, 101188 (2020).
Liu, Z. & Cordes, J. DNA marker technologies and their applications in aquaculture genetics. Aquaculture 238, 1–37 (2004).
Wang, J. et al. EST clustering error evaluation and correction. Bioinformatics 20, 2973–2984 (2004).
Litscher, E. S. & Wassarman, P. M. Egg extracellular coat proteins: from fish to mammals. Histol. Histopathol. 22, 337–347 (2007).
Burdette, S. C., Frederickson, C. J., Bu, W. & Lippard, S. J. ZP4 an improved neuronal Zn2+ sensor of the Zinpyr family. J. Am. Chem. Soc. 125, 1778–1787 (2003).
Liu, X., Wang, H. & Gong, Z. Tandem-repeated zebrafish zp3 genes possess oocyte-specific promoters and are insensitive to estrogen induction. Biol. Reprod. 74, 1016–1025 (2006).
Chang, Y., Hsu, C., Wang, S., Tsao, C. & Huang, F. Molecular cloning structural analysis and expression of Carp ZP2 gene. Mol. Reprod. Dev. 46, 258–267 (1997).
Kanamori, A., Naruse, K., Mitani, H., Shima, A. & Hori, H. Genomic organization of ZP domain containing egg envelope genes in medaka (Oryzias latipes). Gene 305, 35–45 (2003).
Wilson, M. & Koopman, P. Matching SOX: partner proteins and co-factors of the SOX family of transcriptional regulators. Curr. Opin. Genet. Dev. 12, 441–446 (2002).
Bowles, J., Schepers, G. & Koopman, P. Phylogeny of the SOX family of developmental transcription factors based on sequence and structural indicators. Dev. Biol. 227, 239–255 (2000).
Van de Wetering, M., Oosterwegel, M., van Norren, K. & Clevers, H. Sox-4 an Sry-like HMG box protein is a transcriptional activator in lymphocytes. EMBO J. 12, 3847–3854 (1993).
Schilham, M. W. et al. Defects in cardiac outflow tract formation and pro-B-lymphocyte expansion in mice lacking Sox-4. Nature 380, 711–714 (1996).
Schilham, M. W., Moerer, P., Cumano, A. & Clevers, H. C. Sox-4 facilitates thymocyte differentiation. Eur. J. Immunol. 27, 1292–1295 (1997).
Cheung, M., Abu-Elmagd, M., Clevers, H. & Scotting, P. J. Roles of Sox4 in central nervous system development. Mol. Brain. Res. 79, 180–191 (2000).
Van der Kraak, G., Suzuki, K., Peter, R. E., Itoh, H. & Kawauchi, H. Properties of common carp gonadotropin I and gonadotropin II. Gen. Comp. Endocrinol. 85, 217–229 (1992).
Okada, T. et al. Purification and characterization of gonadotropin I and II from pituitary glands of tuna (Thunnus obesus). Int. J. Pept. Protein Res. 43, 69–80 (1994).
Gen, K. et al. Unique expression of gonadotropin-I and-II subunit genes in male and female red seabream (Pagrus major) during sexual maturation. Biol. Reprod. 63, 308–319 (2000).
Pfennig, F., Standke, A. & Gutzeit, H. O. The role of Amh signaling in teleost fish-Multiple functions not restricted to the gonads. Gen. Comp. Endocrinol. 223, 87–107 (2015).
Zhai, G. et al. Androgen signaling regulates the transcription of anti-Müllerian hormone via synergy with SRY-related protein SOX9A. Sci. Bull. 62, 197–203 (2017).
Sawatari, E., Shikina, S., Takeuchi, T. & Yoshizaki, G. A novel transforming growth factor-b superfamily member expressed in gonadal somatic cells enhances primordial germ cell and spermatogonial proliferation in rainbow trout (Oncorhynchus mykiss). Dev. Biol. 301, 266–275 (2007).
Luckenbach, J. A., Iliev, D. B., Goetz, F. W. & Swanson, P. Identification of differentially expressed ovarian genes during primary and early secondary oocyte growth in coho salmon Oncorhynchus kisutch. Reprod. Biol. Endocrinol. 6, 2 (2008).
Hoang, E. H., Whitehead, J. L., Dose, A. C. & Burnside, B. Cloning of a novel C-terminal kinesin (KIFC3) that maps to human chromosome 16q 13-q 21 and thus is a candidate gene for Bardet–Biedl syndrome. Genomics 52, 19–22 (1998).
Miki, H., Setou, M., Kaneshiro, K. & Hirokawa, N. All kinesin superfamily protein KIF genes in mouse and human. Proc. Natl. Acad. Sci. USA 98, 7004–7011 (2001).
Yang, Z. et al. Molecular cloning and functional analysis of mouse C-terminal kinesin motor KifC3. Mol. Cell. Biol. 21, 3765–3770 (2001).
Hoang, E. H., Whitehead, J. L., Dosé, A. C. & Burnside, B. Gene expression pattern of KIFC3 during spermatogenesis of the skink Eumeces chinensis. Gene 556, 206–212 (2015).
Marchand, O. et al. DMRT1 expression during gonadal differentiation and spermatogenesis in the rainbow trout Oncorhynchus mykiss. BBA-Gene Struct. Expr. 1493, 180–187 (2000).
Smith, C. A., Katz, M. & Sinclair, A. H. DMRT1 is upregulated in the gonads during female-to-male sex reversal in ZW chicken embryos. Biol. Reprod. 68, 560–570 (2003).
Matsuda, M. et al. DMY is a Y-specific DM-domain gene required for male development in the medaka fish. Nature 417, 559–563 (2002).
Yoshimoto, S. et al. A W-linked DM-domain gene DM-W participates in primary ovary development in Xenopus laevis. Proc. Natl. Acad. Sci. USA 105, 2469–2474 (2008).
Cui, Z. et al. Genome editing reveals dmrt1 as an essential male sex-determining gene in Chinese tongue sole (Cynoglossus semilaevis). Sci. Rep. 7, 42213 (2017).
Jung, K. W., Kim, S. Y., Okagaki, L. H., Nielsen, K. & Bahn, Y. S. Ste50 adaptor protein governs sexual differentiation of Cryptococcus neoformans via the pheromone-response MAPK signaling pathway. Fungal. Genet. Biol. 48, 154–165 (2011).
Di, A. S., Rossi, P., Geremia, R. & Sette, C. The MAPK pathway triggers activation of Nek2 during chromosome condensation in mouse spermatocytes. Development 129, 1715–1727 (2002).
Wei, J. et al. Identification of gene sets and pathways associated with lactation performance in mice. Physiol. Genomics 45, 171–181 (2013).
Sun, L. et al. Global DNA methylation changes in Nile tilapia gonads during high temperature-induced masculinization. PLoS ONE 11, e0158483 (2016).
Li, Y. et al. De novo transcriptome sequencing and analysis of male pseudo-male and female yellow perch Perca flavescens. PLoS ONE 12, e0171187 (2017).
Funding
This research was funded by the China-Vietnam Joint Survey on Fish Stocks in the Common Fishing Zone of the Beibu Gulf; the Central Public-Interest Scientific Institution Basal Research Fund, South China Sea Fisheries Research Institute, CAFS (Nos. 2020TD01, 2018YB05) and the China-ASEAN Maritime Cooperation Fund (China-Vietnam Fishery Stock Enhancement and Conservation in Beibu Gulf).
Author information
Authors and Affiliations
Contributions
Conceptualization, B.S. and D.S.; writing-original draft preparation, B.S. and Y.L.; investigation and methodology, B.S. and C.Y.; project administration and resources, D.S.; investigation and methodology, Y.Z. and Y.L.; writing-review and editing, B.S., C.Y. and D.S.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Supplementary Materials
: Figure S1: Completeness of the assembly and annotations, Figure S2: Polyacrylamide gel electrophoresis for 19 polymorphic SSR markers in 24 individuals, Figure S3: GO terms for the DEGs in the biological process, cellular component, and molecular function categories, Table S1: Characterization of 19 polymorphic SSR loci, Table S2: Top 20 KEGG pathways, Table S3: Specific primers for the selected unigenes and reference genes.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Shan, B., Liu, Y., Yang, C. et al. Comparative transcriptomic analysis for identification of candidate sex-related genes and pathways in Crimson seabream (Parargyrops edita). Sci Rep 11, 1077 (2021). https://doi.org/10.1038/s41598-020-80282-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-80282-5
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.