Integrated transcriptome catalog of Tenualosa ilisha as a resource for gene discovery and expression profiling

Chowdhury, Md. Arko Ayon; Islam, Md. Rakibul; Amin, Al; Mou, Sadia Noor; Ullah, Kazi Newaz; Baten, Abdul; Shoyaib, Mohammad; Ali, Amin Ahsan; Chowdhury, Farhana Tasnim; Rahi, Md. Lifat; Khan, Haseena; Amin, M Ashraful; Islam, Mohammad Riazul

doi:10.1038/s41597-023-02132-z

Download PDF

Data Descriptor
Open access
Published: 17 April 2023

Integrated transcriptome catalog of Tenualosa ilisha as a resource for gene discovery and expression profiling

Md. Arko Ayon Chowdhury ORCID: orcid.org/0000-0002-7437-7287^1,2,
Md. Rakibul Islam^1,2,
Al Amin^1,2,
Sadia Noor Mou^1,2,
Kazi Newaz Ullah^2,3,
Abdul Baten⁴,
Mohammad Shoyaib⁵,
Amin Ahsan Ali²,
Farhana Tasnim Chowdhury¹,
Md. Lifat Rahi⁶,
Haseena Khan¹,
M Ashraful Amin² &
…
Mohammad Riazul Islam¹

Scientific Data volume 10, Article number: 214 (2023) Cite this article

2343 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The silver pride of Bangladesh, migratory shad, Tenualosa ilisha (Hilsa), makes the highest contribution to the total fish production of Bangladesh. Despite its noteworthy contribution, a well-annotated transcriptome data is not available. Here we report a transcriptomic catalog of Hilsa, constructed by assembling RNA-Seq reads from different tissues of the fish including brain, gill, kidney, liver, and muscle. Hilsa fish were collected from different aquatic habitats (fresh, brackish, and sea water) and the sequencing was performed in the next generation sequencing (NGS) platform. De novo assembly of the sequences obtained from 46 cDNA libraries revealed 462,085 transcript isoforms that were subsequently annotated using the Universal Protein Resource Knowledgebase (UniPortKB) as a reference. Starting from the sampling to final annotation, all the steps along with the workflow are reported here. This study will provide a significant resource for ongoing and future research on Hilsa for transcriptome based expression profiling and identification of candidate genes.

Evolution of tissue-specific expression of ancestral genes across vertebrates and insects

Article 15 April 2024

A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain

Article Open access 13 December 2023

Emx2 underlies the development and evolution of marsupial gliding membranes

Article Open access 24 April 2024

Background

The migratory shad Hilsa (Tenualosa ilisha) is the national fish of Bangladesh and very famous for its unique taste and texture. It is an anadromous fish and spends most of its life cycle in the marine environment but mature individuals migrate to the upstream freshwater river systems for spawning. Post spawning, larvae grow up to the juvenile stage (locally known as Jatka) that perform downstream migration to the sea for further growth and maturation. This indicates that Hilsa regularly experiences broad spectrum salinity fluctuation throughout its life cycle. Therefore, this species possesses the capability to rapidly adapt to large scale salinity change including full strength seawater (salinity level 35.0%) to absolute freshwater (salinity level 0%) and vice versa¹. Due to this sharp salinity fluctuation, Hilsa faces severe challenges including osmotic stress, differences in nutrient availability, changing immunity and disease susceptibility, hormonal imbalance, etc. Thus, Hilsa must display diverse adaptive strategies to rapidly cope with these biological changes during their migration pathways. Differential expression of a number of candidate genes is thought to be the key underlying mechanism to rapidly cope with these changing conditions.

Hilsa is nutritionally enriched with high quality proteins, vitamins, minerals, polyunsaturated fatty acids (PUFA) and Omega-3 fatty acids². It is considered to be the most important commercial and cultural species across the entire Indian sub-continent. In the financial year 2017–18, total 0.517 million Metric Ton of Hilsa was harvested in Bangladesh alone, which constitutes 12% of total national fish production³. Since, Hilsa is of central importance as a primary fisheries species in Bangladesh, more in-depth molecular work is needed to understand their physiology and adaptation mechanism in both marine and fresh aquatic environments. At present, 3 draft genome assemblies of Hilsa are publicly available^4,5,6. The publicly available de novo transcriptome assemblies for Hilsa are limited to either muscle or liver tissues^7,8. In a recent study, tissue-specific diversity of the alpha-2-macroglobulin splice isoforms in liver, gill, testes, and ovary have been reported but no annotated transcripts were made publicly available⁹. Here we report an annotated transcriptome catalog of Hilsa using five different tissues (gill, kidney, liver, brain and muscle) with sequential workflow (Figs. 1,2). These five tissues were selected based on diverse functional roles of the respective tissues. Brain and liver constitute most of the major regulatory functions of body and thus most of the genes are expressed in these two tissues. Different genes responsible for inhabiting heterogeneous environmental conditions are expressed in the gill and kidney tissues. Genes associated with growth, developmental processes, metabolic and physiological activities are normally expressed in muscle tissue. Therefore, these five tissues provide ideal samples for capturing almost all of the expressed genes in a fish. This annotated de novo transcriptome assembly from in silico normalised RNA-Seq reads is expected to serve as a complementary resource and reference gene sets, which will accelerate Hilsa research on gene discovery and genome annotation using experimental evidence, gene expression profiling, etc¹⁰.

In-depth sequencing (at least 80 million paired end reads per sample) has enhanced the chance of including the low abundant transcripts in the count¹¹.

Methods

Sample collection

At least, four live Hilsa samples were collected from each sampling sites using seine net with the help of local fishermen (Fig. 3). Live fish samples were euthanized using dry ice (1:1 wt/wt) and immediately dissected in the field to obtain fresh tissue samples. Gill, kidney, liver, brain and muscle tissues were dissected and immediately preserved in RNAlater. The preserved Hilsa tissue samples were brought to the laboratory and maintained at −80 °C for subsequent use. All the required paper works for animal ethics clearance and field work were approved by authority prior to the starting of this study (Ref. No.: KUAEC-2021/09/20).

RNA extraction

RNA was isolated from the RNAlater preserved tissue samples using TRIzol® reagent (ThermoFisher Scientific, USA)¹². The manufacturer’s protocol was optimized with slight modifications. Tissue samples taken from RNAlater were washed with ice-cold DEPC-treated water and transferred immediately in mortars containing liquid nitrogen. Tissue samples were crushed into fine powder using pestles. Finely crushed tissue samples were subsequently transferred to 1 mL of ice-cold TRIzol solution and maintained on ice for 10 minutes. After a short vortex and centrifugation (16,000 × g for 15 minutes at 4 °C temperature), the supernatant (pink aqueous solution) was transferred to another microcentrifuge tube. 300 μL of chloroform was added to each tube and mixed well by inverting. The mixture was incubated on ice for 10 minutes prior to centrifugation at 16,000 × g. After centrifugation, 450 μL of the aqueous layer was pooled carefully and transferred to fresh microcentrifuge tubes without disturbing the other layer. 45 μL of 3M sodium-acetate (pH~5.5) and 495 μL of isopropanol were added in the tubes and incubated overnight at −20 °C.

The supernatant was discarded after centrifugation at 16,000 × g for 15 minutes at 4 °C. A slightly white clear pellet was formed at the bottom of each tube. 1 mL of 75% ethanol was added to each tube for washing and then centrifuged (16,000 × g for 15 minutes at 4 °C). The supernatant was discarded and pellets were air dried for 20 minutes and resuspended in 30 μL of DEPC treated water. RNA concentration was measured using a Nanodrop One UV-vis spectrophotometer. Integrity of the extracted RNA was checked using agarose gel electrophoresis and Agilent Technologies 2200 TapeStation RNA ScreenTape¹³. Only the high quality RNA samples were sent to Macrogen Inc, SouthKorea for subsequent steps including library preparation and sequencing.

cDNA Library preparation and sequencing

Removal of contaminating genomic DNA by DNase-l treatment, mRNA isolation from total RNA samples, cDNA library preparation and quality assessment were performed at Macrogen Inc, South Korea. TruSeq stranded mRNA library (Illumina, San Diego, USA) preparation kit was used to prepare Hilsa cDNA libraries following the manufacturer’s protocol. Constructed cDNA libraries were then assessed for quality using Bioanalyzer. Equimolar quantities of each good quality cDNA libraries (Table 1) were used for sequencing in the Illumina based platform, NovaSeq 6000.

Table 1 Number of source specific cDNA libraries used in sequencing.

Full size table

In silico normalisation and de novo assembly and dataset annotation

The software package FastP (version 0.12.4) was used to remove extraneous (first 12 bases) sequences and a default sliding window size 4 with a mean quality score below Q20 was trimmed prior to de novo assembly¹⁴. After filtering the initial 4,560,499,900 reads, total 4,319,117,376 high quality sequencing reads were remained. The quality of all the trimmed reads was evaluated individually using FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and aggregated using MultiQC¹⁵. Clean paired-end reads from 46 samples were first normalised in silico and further assembled into 527,646 transcript isoforms using Trinity (version-2.13.2)^16,17. GC content of the transcriptome was 44.56%, median and average transcript lengths were 358 and 681.61 bases respectively. Redundant transcripts were removed by clustering the sequences using the CD-HIT-EST program with a default (0.95) similarity match threshold that resulted in 462,456 transcript isoforms¹⁸. Subsequently, the transcript sequences were submitted to NCBI Transcriptome Shotgun Assembly (TSA) Database. NCBI screened the transcriptome assembly for foreign contaminating transcripts. The contaminating transcripts were removed and final number of transcripts were 462,085. Coding regions within the transcripts were predicted and extracted using the TransDecoder (version-5.5.0) (https://github.com/TransDecoder/TransDecoder) tool with the default parameters. BLASTx and BLASTp programs (version-2.12.0+) were used for homology-based similarity search of the transcripts and predicted proteins against the latest UniProtKB protein database with a maximum e-value of 1e⁻⁵ ^19,20,21. The BLAST results were integrated by Trinotate (http://trinotate.github.io).

Data Records

Raw RNA-Sequence reads of different tissues of Tenualosa ilisha have been deposited in the NCBI Sequence Read Archive (SRA) database under the NCBI Bioproject (https://www.ncbi.nlm.nih.gov/bioproject/) with accession PRJNA850620²². Final set of transcripts have been submitted to NCBI Transcriptome Shotgun Assembly (TSA) database (https://www.ncbi.nlm.nih.gov/genbank/tsa/) under the accession no GKAU00000000²³. Raw assembly data, annotation file and Transdecoder predicted peptide files have been deposited in Figshare²⁴.

Technical Validation

Samples with low quality RNA and cDNA library were removed from the transcriptome catalog. From raw reads to final annotation, the quality of data was scrutinised at several checkpoints. The quality of the raw and trimmed sequence reads was assessed in terms of sequence quality, presence of adaptors, GC content, overrepresented k-mers using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc). Subsequently QC of all the samples were aggregated and analysed altogether utilizing MultiQC (Fig. 4). An appropriate reference genome of Hilsa with chromosomal information is not available^4,5,6. Thus, transcriptome was reconstructed from the RNA-Seq data using the Trinity de novo RNA-Seq assembler which outperforms others in terms of overall reads representation of the assembly, completeness, mismatch and mis-assembly, etc²⁵. The transcriptome data were subjected to BUSCO (Benchmarking Universal Single-Copy Orthologs) analysis to examine their completeness^26,27. The analysis performed on transcriptome module against eukaryota_odb10, metazoa_odb10, vertebrata_odb10 and actinopterygii_odb10 ortholog gene sets showed 98.40%, 99.60%, 92.30% and 90.70% of completeness respectively. Also, percentages of missing BUSCOs for eukaryota_odb10, metazoa_odb10, vertebrata_odb10 and actinopterygii_odb10 datasets were found to be 0.4%, 0.2%, 2.5% and 5.0% respectively (Fig. 5)²⁸.

Both the BLASTp and BLASTx programs for annotation were used with a stringent e-value cut-off of 1e⁻⁵ against the UniprotKB Protein Database. The workflow and the links of the deposited data are provided. These datasets expand the indispensable transcriptomic resources for further research on Hilsa functional genomics, gene characterisation and expression profiling etc.

Code availability

Fastp (version 0.12.4)

fastp -i Read_1.fastq.gz -f 12 -o Read_1_fastp_trim.fastq -I Read_2.fastq.gz -F 12 -O Read_2_fastp_trim.fastq

Trinity (version 2.13.2)

Trinity–seqType fq–max_memory 96 G–samples_file Hilsa_samples–SS_lib_type RF–CPU 20–no_bowtie

Hilsa_samples file is provided in Figshare repository²³.

CD-HIT (version 4.8.1)

cd-hit-est –I Hilsa_RNA_Trinity.fasta –o Hilsa_TSA.fasta –T 20 –M 1400 –c 0.95

TransDecoder (version 5.5.0)

TransDecoder.LongOrfs -t Hilsa_TSA_v2.fasta

Then,

TransDecoder.Predict -t HILSA_TSA_v2.fasta–retain_blastp_hits Hilsa_peptides_BLASTp.txt

BLAST (version 2.12.0+)

Database Preparation:

makeblastdb –in uniprot_sprot.fasta -dbtype prot -parse_seqids -out uniprot_sprot_fasta.db

BLASTp RUN-1:

blastp -query Hilsa_peptide_transdecoder.fasta -db uniprot_sprot_fasta.db -outfmt 6

-max_target_seqs 1 -num_threads 16 -evalue 1e-5 -out Hilsa_peptides_BLASTp.txt

The output file ‘Hilsa_peptides_BLASTp.txt’ was used to run ‘TransDecoder.Predict’ program.

BLASTp RUN-2:

blastp -query HILSA_TSA_v2.fasta.transdecoder. fasta -db uniprot_sprot_fasta.db -outfmt 6 -max_target_seqs 1 -num_threads 16 -evalue 1e-5 -out Hilsa_final_BLASTp.txt

BLASTx RUN:

blastx -db uniprot_sprot_fasta.db -query Hilsa_TSA.fasta -max_target_seqs 1 -outfmt 6

-num_threads 16 -evalue 1e-5 > Hilsa_Transcripts_Blastx.txt

Trinotate

Build_Trinotate_Boilerplate_SQLite_db.pl Trinotate

Then,

Trinotate Trinotate.sqlite init–gene_trans_map Hilsa_TSA_v2.fasta.gene_trans_map –

–transcript_fasta Hilsa_TSA_v2.fasta–transdecoder_pep Hilsa_peptide_transdecoder.fasta

Loading BLASTx and BLASTp results

Trinotate Trinotate.sqlite LOAD_swissprot_blastx Hilsa_Transcripts_Blastx.txt

And then,

Trinotate Trinotate.sqlite LOAD_swissprot_blastp Hilsa_ final _BLASTp.txt

Generating Annotation Report

Trinotate Trinotate.sqlite report > Annotation_report.xls

References

Ahsan, D. A., Naser, M. N., Bhaumik, U., Hazra, S. & Bhattacharya, S. B. Migration, Spawning Patterns and Conservation of Hilsa Shad (Tenualosa ilisha) in Bangladesh and India. Publ. by Acad. Found. India, New Delhi Int. Union Conserv. Nat. Nat. Resour. 95 (2014).
De, D. et al. Nutritional profiling of hilsa (Tenualosa ilisha) of different size groups and sensory evaluation of their adults from different riverine systems. Sci. Rep. 9, (2019).
DoF. Fisheries statistics of Bangladesh 2017–2018. Fish. Resour. Surv. Syst. (FRSS), Dep. Fish. Bangladesh Minist. Fish. 35, 129 (2018).
Google Scholar
Das, A. et al. Genome of Tenualosa ilisha from the river Padma, Bangladesh. BMC Res. Notes 11 (2018).
Mohindra, V. et al. Draft genome assembly of Tenualosa ilisha, Hilsa shad, provides resource for osmoregulation studies. Sci. Rep. 9 (2019).
Mollah, M. B. R., Khan, M. G. Q., Islam, M. S. & Alam, M. S. First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal. F1000Research 8 (2019).
Divya, B. K. et al. Muscle transcriptome resource for growth, lipid metabolism and immune system in Hilsa shad, Tenualosa ilisha. Genes and Genomics 41, 1–15 (2019).
Article CAS PubMed Google Scholar
Ganguly, S., Mitra, T., Mahanty, A., Mohanty, S. & Mohanty, B. P. A comparative metabolomics study on anadromous clupeid Tenualosa ilisha for better understanding the influence of habitat on nutritional composition. Metabolomics 16 (2020).
Mohindra, V., Dangi, T., Chowdhury, L. M. & Jena, J. K. Tissue specific alpha-2-Macroglobulin (A2M) splice isoform diversity in Hilsa shad, Tenualosa ilisha (Hamilton, 1822). PLoS One 14, (2019).
Ding, L. et al. EAnnot: A genome annotation tool using experimental evidence. Genome Res. 14, 2503–2509 (2004).
Article CAS PubMed PubMed Central Google Scholar
Conesa, A. et al. A survey of best practices for RNA-seq data analysis. Genome Biology 17 (2016).
Rio, D. C. et al. Purification of RNA Using TRIzol (TRI Reagent) Purification of RNA Using TRIzol (TRI Reagent) https://doi.org/10.1101/pdb.prot5439 (2012).
Article Google Scholar
Liu, M. H. et al. Automated Assessment of Next Generation Sequencing Library Preparation Workflow for Quality and Quantity Using the Agilent 2200 TapeStation System Automated RNA Sample Quality Control Rapid DNA-Seq to Achieve High Coverage Libraries from 1ng-1 g in 2 Hours Sequencing Single Human and Bacterial Cells at Low Coverage for Aneuploidy, CNV, and Genotyping Applications. ABRF 2014 POSTER ABSTRACTS S18 JOURNAL OF BIOMOLECULAR TECHNIQUES 25 (2014).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Article CAS PubMed Google Scholar
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: Accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bateman, A. UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
Article Google Scholar
Altschup, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic Local Alignment Search Tool. J. Mol. Biol 215 (1990).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinformatics 10, (2009).
NCBI Sequence Read Archive. https://identifiers.org/insdc.sra:SRP385023 (2022).
Chowdhury, M. A. A. et al. TSA: Tenualosa ilisha, transcriptome shotgun assembly. GenBank https://identifiers.org/nucleotide:GKAU00000000 (2023).
Chowdhury, M. A. A. et al. Hilsa Transcriptome Datasets. Figshare https://doi.org/10.6084/m9.figshare.20391168 (2022).
Hölzer, M. & Marz, M. De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers. Gigascience 8, (2019).
Waterhouse, R. M. et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 35, 543–548 (2018).
Article CAS PubMed Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Kriventseva, E. V. et al. OrthoDB v10: Sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs. Nucleic Acids Res. 47, D807–D811 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This project is funded by a grant from the Independent University, Bangladesh (IUB) and the ICT division, Government of Bangladesh.

Author information

Authors and Affiliations

Molecular Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, 1000, Bangladesh
Md. Arko Ayon Chowdhury, Md. Rakibul Islam, Al Amin, Sadia Noor Mou, Farhana Tasnim Chowdhury, Haseena Khan & Mohammad Riazul Islam
Center for Computational and Data Sciences (CCDS), Independent University, Bangladesh (IUB), Dhaka, Bangladesh
Md. Arko Ayon Chowdhury, Md. Rakibul Islam, Al Amin, Sadia Noor Mou, Kazi Newaz Ullah, Amin Ahsan Ali & M Ashraful Amin
Department of Zoology, Jagannath University, Dhaka, 1100, Bangladesh
Kazi Newaz Ullah
Institute of Precision Medicine and Bioinformatics, Sydney Local Health District, Royal Prince Alfred Hospital, Camperdown, Australia
Abdul Baten
Institute of Information Technology (IIT), University of Dhaka, Dhaka, 1000, Bangladesh
Mohammad Shoyaib
Fisheries and Marine Resource Technology (FMRT) Discipline, Khulna University, Khulna, 9208, Bangladesh
Md. Lifat Rahi

Authors

Md. Arko Ayon Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar
Md. Rakibul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Al Amin
View author publications
You can also search for this author in PubMed Google Scholar
Sadia Noor Mou
View author publications
You can also search for this author in PubMed Google Scholar
Kazi Newaz Ullah
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Baten
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Shoyaib
View author publications
You can also search for this author in PubMed Google Scholar
Amin Ahsan Ali
View author publications
You can also search for this author in PubMed Google Scholar
Farhana Tasnim Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar
Md. Lifat Rahi
View author publications
You can also search for this author in PubMed Google Scholar
Haseena Khan
View author publications
You can also search for this author in PubMed Google Scholar
M Ashraful Amin
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Riazul Islam
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.K., M.R.I. and M.A.A. started and led the project. H.K., M.R.I., M.A.A., A.A., M.A.A.C., S.N.M., A.B., F.T.C., M.L.R. designed the overall project. M.L.R. collected the samples. M.A.A.C., A.A., S.N.M., K.N.U. processed the samples including RNA extraction, Q.C. assessment. M.A.A.C. processed, analysed and wrote the draft manuscript. S.N.M. and A.B. assisted in analysis pipeline development. M.R.I. (Rakibul) and A.A. assisted in data management and draft writing.

Corresponding authors

Correspondence to M Ashraful Amin or Mohammad Riazul Islam.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chowdhury, M.A.A., Islam, M.R., Amin, A. et al. Integrated transcriptome catalog of Tenualosa ilisha as a resource for gene discovery and expression profiling. Sci Data 10, 214 (2023). https://doi.org/10.1038/s41597-023-02132-z

Download citation

Received: 04 August 2022
Accepted: 03 April 2023
Published: 17 April 2023
DOI: https://doi.org/10.1038/s41597-023-02132-z