Chromosome-level genome assembly of humpback grouper using PacBio HiFi reads and Hi-C technologies

Liu, Jinxiang; Sun, Huibang; Tang, Lei; Wang, Yujue; Wang, Zhigang; Mao, Yunxiang; Huang, Hai; Zhang, Quanqi

doi:10.1038/s41597-023-02907-4

Download PDF

Data Descriptor
Open access
Published: 09 January 2024

Chromosome-level genome assembly of humpback grouper using PacBio HiFi reads and Hi-C technologies

Jinxiang Liu ORCID: orcid.org/0000-0002-6091-5558^1,2,3^na1,
Huibang Sun¹^na1,
Lei Tang¹,
Yujue Wang¹,
Zhigang Wang¹,
Yunxiang Mao⁴,
Hai Huang⁴ &
…
Quanqi Zhang^1,2,3

Scientific Data volume 11, Article number: 51 (2024) Cite this article

1142 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The humpback grouper (Cromileptes altivelis), a medium-sized coral reef teleost, is a naturally rare species distributed in the tropical waters of the Indian and Pacific Oceans. It has high market value, but artificial reproduction and breeding remain limited and need to be improved. Here, we assembled the genome with 1.08 Gb, with a contig N50 of 43.78 Mb. A total of 96.59% of the assembly anchored to 24 pseudochromosomes using Hi-C technology. It contained 24,442 protein-coding sequences, of which 99.3% were functionally annotated. The completeness of the assembly was estimated to be 97.3% using BUSCO. The phylogenomic analysis suggested that humpback grouper should be classified into the genus Epinephelus rather than Cromileptes. The comparative genomic analysis revealed that the gene families related to circadian entrainment were significantly expanded. The high-quality reference genome provides useful genomic tools for exploiting the genomic resource of humpback grouper and supports the functional genomic study of this species in the future.

Chromosome-level haplotype-resolved genome assembly for Takifugu ocellatus using PacBio and Hi-C technologies

Article Open access 11 January 2023

Chromosome-level genome assembly of Nibea coibor using PacBio HiFi reads and Hi-C technologies

Article Open access 03 November 2022

The first high-quality chromosome-level genome of Eretmochelys imbricata using HiFi and Hi-C data

Article Open access 09 September 2023

Background & Summary

Groupers, as a series of important commercial and ecological reef fish, are distributed in tropical and subtropical waters worldwide. On present understanding, groupers consist of 165 species in 16 genera and vary considerably in terms of lifestyle, growth rate, and body appearance¹. The humpback grouper is a naturally rare species that is widely distributed in the tropical waters of the Indian and Pacific Oceans². The term “humpback grouper” is because its body is relatively higher than its head, which gives a humpback aspect. The humpback grouper is a medium-sized fish, which grows up to 70 cm. As a protogynous hermaphroditic species, all humpback grouper individuals are born female and can transform into male when they grow up and experience 2–5 spawning seasons. This fish has high market value and is exceedingly favored by consumers due to their high nutritional value, tasty flesh, and beautiful appearance. In recent years, overfishing has led to a sharp decrease in the wild humpback grouper population, whereas the market demand has increased rapidly. Its relatively slow growth rate, unique sex-change strategy, and susceptibility to various pathogenic diseases during cultivation severely restrict the development of artificial culture. Previous studies of humpback grouper focused on immunology, the establishment of cell lines, classification, and feed supplement^3,4,5,6. The decoding of a high-quality reference genome could support more information on molecular biology, genetics, breeding, and conservation biology.

Recently, several types of grouper genomes have been assembled, such as giant grouper (Epinephelus lanceolatus), leopard coral grouper (Plectropomus leopardus), and red-spotted grouper (Epinephelus akaara)^7,8,9. Traditionally, grouper identification was primarily dependent on the surface profile and phenotype. Actually, it could cause errors and challenges in taxonomy. The groupers had a close relationship in evolution. To better understand the evolutionary relationship and taxonomy, it was necessary to acquire a specific solution by molecular biology. Besides, a high-quality reference genome resource could also provide an effective tool for genetic improvement and germplasm conservation. At present, the long-read and short-read sequencing technologies have been applied to the assembled genome. It was able to obtain highly integrated genome assemblies, especially circular consensus sequencing (CCS) improved the accuracy of PacBio SMRT sequencing. The HiFi sequence updated the genome assembly between read length and base quality significantly.

In 2021, a humpback grouper genome was constructed with the assembly of 1.013 Gb (contig N50 of 18.09 Mb)¹⁰. In this study, we represent a chromosome-scale genome assembly and annotation of humpback grouper with the PacBio HiFi and Hi-C sequencing technologies. Approximately 1.08 Gb genome was assembled with the contig N50 43.78 Mb. BUSCO analysis showed that 97.3% of the final assembly was complete BUSCOs. Overall, this high-quality reference genome provides a valuable basis for further genetic improvement and understanding the functional genes and molecular mechanisms in humpback grouper

Methods

DNA sample collection, library construction, and sequencing

A female humpback grouper was collected from Hainan Chenhai Aquatic Co., Ltd. The muscle tissue was collected for DNA extraction and library construction. Genomic DNA was extracted by the QIAamp DNA purification kit (Qiagen, USA). The short fragment library was generated using the Truseq Nano DNA HT Sample Preparation Kit (Illumina, USA) with an insert size of 350 bp and the Illumina NovaSeq 6000 platform. For the HiFi read generation, DNA fragment > 30 kb was selected using BluePippin Systerm (Sage Science, USA). The library was generated using the SMRTbell Template PrepKit 2.0 (PacBio, USA), and the library was sequenced in CCS on the PacBio Sequel II platform. The Hi-C library was constructed following the standard protocol described previously with certain modifications¹¹, and it was sequenced using the Illumina NovaSeq 6000 platform. A total of 53.1 Gb of Illumina data, 21.5 Gb PacBio of PacBio data, and 96 Gb of Hi-C data after trimming the low-quality reads and adaptor sequences from the raw data.

RNA sample collection, library construction, and sequencing

The samples of eight embryonic development stages (one cell, morula, high blastula, low blastula, gastrula, somite, neurula, and before the hatching stage) were collected for RNA extraction using TRIzol reagent (Invitrogen, USA). RNA-seq libraries were constructed using Illumina TruSeq Stranded mRNA Library Prep Kit (Illumina, USA) and sequenced by the Illumina NovaSeq 6000 platform. Further, RNA extracted from embryonic samples was mixed for Iso-seq. The Iso-seq library was constructed and sequenced on the PacBio Sequel II platform. The clean data was obtained by removing reads containing adapters, reads containing poly-N and low-quality reads from the raw data. Around 55.6 Gb of RNA-seq data and 69.1 Gb of Iso-seq data were generated for genome annotation.

Genome assembly and quality assessment

The characterization of the genome was estimated using the Illumina short-read data, and the 17 bp k-mer analysis was applied for estimation. The estimated genome size was 1,091.59 Mb, the heterozygosity rate was approximately 0.19%, and the repeated content was 45.81%. The genome was assembled using SOAPdenovo2 with k-mer set at 41 bp¹². The gaps were filled with GapCloser. Then, the draft genome was corrected and re-assembled using HiFi long reads by Hifiasm 0.12-r304 with the parameters “-t 30 -D 10”¹³. The genome assembly was 1.08 Gb, with a contig N50 size of 43.78 Mb (Fig. 1A). To obtain the chromosome-level genome, we applied ALLHiC pipeline to link the mapped contigs to 24 pseudochromosomes¹⁴. Finally, 96.59% of scaffolds were mapped to 24 chromosomes (Fig. 1B).

To evaluate the assembled genome, BUSCO was applied to evaluate the completeness of genome assembly. A total of 3,345 BUSCO genes were identified, with 3,263 complete genes, 3,230 single-copy genes, 33 multi-copy genes, 47 fragmented genes, and 44 missing genes accounting for 97.3%, 96.3%, 1.0%, 1.4%, and 1.3% of the whole genome, respectively (Table 1).

Table 1 BUSCO evaluation result of humpback grouper genome.

Full size table

Repeat and noncoding RNA annotation

Repeat sequences of the humpback grouper genome were identified using a combination of homology-based and de novo approaches. For the ab initio method, the RepeatModeler (v2.0.1)¹⁵, RepeatScout (v1.0.5)¹⁶, and LTR_finder (v1.0.6)¹⁷ were used to build the humpback grouper custom repeat database. In the homology-based method, the Repbase database¹⁸ was used to identify repeats with the RepeatMasker and RepeatProteinMask. The total length of the repetitive elements accounted for 44.38% of the humpback grouper genome (Fig. 2C). DNA transposons represented the most abundant class of repeats (17.85% of the genome) followed by long interspersed elements (LINEs, 15.20%), long terminal repeats (LTRs, 5.38%), and short interspersed elements (SINEs, 1.11%) (Table 2).

Table 2 Statistic results of different types of annotated repeat content.

Full size table

Noncoding RNAs, including rRNAs, snRNAs, miRNAs, and tRNAs, were identified by adopting INFERNAL (v1.1.2) through the Rfam database (release 13.0) for the humpback grouper genome using BLASTN (E-value ≤ 1e−5)^19,20,21. Transfer RNA was predicted using tRNAscan (v1.3.1)²² with default parameters for eukaryotes. Ribosome RNAs and their subunits were predicted using the RNAmmer (v1.2)²³. For non-coding RNA annotation, a total of 1,905 miRNA, 2,107 tRNA, 3,360 rRNA, and 1,637 snRNA were identified (Table 3).

Table 3 Summary statistics of noncoding RNA.

Full size table

Gene prediction and annotation

Firstly, three strategies were used for gene structure prediction, including de novo prediction, homology-based, and RNA-seq data-based prediction. Augustus (v2.5.5)²⁴, Glimm erHMM (v3.01)²⁵, SNAP²⁶, Geneid²⁷, and Genescan²⁸, were used for de novo gene prediction with default settings. Protein sequences of giant grouper, black rockfish (Sebastes schlegelii), stickleback (Gasterosteus aculeatus), large yellow croaker (Larimichthys crocea), grass carp (Ctenopharyngodon idella), Japanese flounder (Paralichthys olivaceus), and red-spotted grouper were downloaded from Ensembl and NCBI databases. These sequences were aligned to the humpback grouper genome with TBLASTN (E-value ≤ 10−5), and homologous genome sequences were then aligned against matching proteins by GeneWise (v2.4.0)²⁹ to generate a gene structure based on the alignment. Furthermore, the RNA-seq data from different embryonic development stages were assembled using Trinity (v2.1.1)³⁰ and mapped to the humpback grouper genome by using the Cufflinks (v2.1.1)³¹. Gene prediction from the above methods was merged to a consensus gene set using the EVM (v1.1.1)³². The functional annotation of the predicted genes of humpback grouper was performed by alignment to the SwissProt³³, NR³⁴, KEGG³⁵, Interpro³⁶, GO³⁷, and Pfam databases³⁸. A total of 24,442 protein-coding genes were predicted (Table 4), of which 24, 268 (99.3%) genes were annotated (Fig. 2B). The lengths of average transcript and CDS were 19,080.10 and 1,607.91 bp, respectively (Fig. 2A).

Table 4 Summary statistics of predicted protein-coding genes in the assembled genome.

Full size table

Data Records

The genome assembly and raw reads of the genome and transcriptome sequencing for humpback grouper were deposited under the Sequence Read Archive SRP322594³⁹. The genome assembly was deposited at GenBank with the accession number GCA_019925165.1⁴⁰. Besides, the assembled genome, predicted peptide, CDS, and GO term files were available in the figshare database with the DOI number: https://doi.org/10.6084/m9.figshare.24145230.v2⁴¹.

Technical Validation

Evaluation of the genome assembly and annotation

To evaluate the integrity and accuracy of the genome assembly, the completeness of the final genome assembly was assessed using BUSCO (v4.0)⁴² with the lineage database vertebrata_odb10 and CEGMA (v2.5)⁴³. It was shown that the assembly contained 97.3% complete and 1.4% fragmented conserved single copy orthologue genes, and 94.35% of the 248 core eukaryotic genes. By aligning Illumina sequencing reads to the genome using BWA (v0.7.8)⁴⁴, the reads mapping rate and the coverage rates were 99.68% and 99.91%, respectively. It was indicating high mapping efficiency and comprehensive coverage. Thus, all of the above results indicated that we obtained the high-quality genome of humpback grouper.

Code availability

No specific code was used in this study. The data analyses used standard bioinformatic tools specified in the methods.

References

Ma, K. Y., Craig, M. T., Choat, J. H. & van Herwerden, L. The historical biogeography of groupers: Clade diversification patterns and processes. Molecular Phylogenetics and Evolution 100, 21–30 (2016).
Article PubMed Google Scholar
Ortega-Recalde, O., Goikoetxea, A., Hore, T. A., Todd, E. V. & Gemmell, N. J. The Genetics and Epigenetics of Sex Change in Fish. Annual Review of Animal Biosciences 8, 47–69 (2020).
Article PubMed Google Scholar
Ketut, M., Zafran, Asami, Y. & Teruo, M. Susceptibility of juvenile humpback grouper Cromileptes altivelis to grouper sleepy disease iridovirus (GSDIV). Diseases of Aquatic Organisms 59, 1–9 (2004).
Article Google Scholar
Wang, L. et al. Establishment and characterization of a new cell line from the muscle of humpback grouper (Cromileptes altivelis). Fish Physiology and Biochemistry 46, 1897–19075 (2020).
Article CAS PubMed Google Scholar
Qin, J., Hu, D., Yang, W. & Xiao, J. Complete mitochondrial genome of the humpback grouper Cromileptes altivelis. Mitochondrial DNA 25, 200–201 (2014).
Article CAS PubMed Google Scholar
Sun, Y. et al. Evaluation of Lactococcus lactis HNL12 combined with Schizochytrium limacinum algal meal in diets for humpback grouper (Cromileptes altivelis). Fish & Shellfish Immunology 94, 880–888 (2019).
Article CAS Google Scholar
Zhou, Q. et al. A chromosome-level genome assembly of the giant grouper (Epinephelus lanceolatus) provides insights into its innate immunity and rapid growth. Molecular Ecology Resources 19, 1322–1332 (2019).
Article CAS PubMed Google Scholar
Wang, Y. et al. Chromosome Genome Assembly of the Leopard Coral Grouper (Plectropomus leopardus) With Nanopore and Hi-C Sequencing Data. Frontiers in Genetics 11, (2020).
Ge, H. et al. De novo assembly of a chromosome-level reference genome of red-spotted grouper (Epinephelus akaara) using nanopore sequencing and Hi-C. Molecular Ecology Resources 19, 1461–1469 (2019).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. Chromosome Genome Assembly of Cromileptes altivelis Reveals Loss of Genome Fragment in Cromileptes Compared with Epinephelus Species. Genes 12 (2021).
Belton, J.-M. et al. Hi–C: A comprehensive technique to capture the conformation of genomes. Methods 58, 268–276 (2012).
Article CAS PubMed Google Scholar
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience 1 (2012).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nature Methods 18, 170–175 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, X., Zhang, S., Zhao, Q., Ming, R. & Tang, H. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nature Plants 5, 833–845 (2019).
Article CAS PubMed Google Scholar
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences. Current Protocols in Bioinformatics 25, 4.10.11–14.10.14 (2009).
Article Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005).
Article CAS PubMed Google Scholar
Xu, Z. & Wang, H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Research 35, W265–W268 (2007).
Article PubMed PubMed Central Google Scholar
Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile. DNA 6, 11 (2015).
Google Scholar
Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics (Oxford, England) 29, 2933–2935 (2013).
CAS PubMed Google Scholar
Kalvari, I. et al. Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Research 46, D335–D342 (2018).
Article CAS PubMed Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
Article PubMed PubMed Central Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research 25, 955–964 (1997).
Article CAS PubMed PubMed Central Google Scholar
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research 35, 3100–3108 (2007).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research 34, W435–W439 (2006).
Article CAS PubMed PubMed Central Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
Article CAS PubMed Google Scholar
Leskovec, J. & Sosič, R. SNAP: A General Purpose Network Analysis and Graph Mining Library. ACM Transactions on Intelligent Systems and Technology 8, 1 (2016).
Article PubMed PubMed Central Google Scholar
Blanco, E., Parra, G. & Guigó, R. Using geneid to Identify Genes. Current Protocols in Bioinformatics 18, 4.3.1–4.3.28 (2007).
Article Google Scholar
Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA11Edited by F. E. Cohen. Journal of Molecular Biology 268, 78–94 (1997).
Article CAS PubMed Google Scholar
Doerks, T., Copley, R. R., Schultz, J., Ponting, C. P. & Bork, P. Systematic identification of novel protein domain families associated with nuclear functions. Genome Research 12, 47–56 (2002).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ghosh, S. & Chan, C.-K. K. Plant Bioinformatics: Methods and Protocols (pp. 339–361. Springer, New York, 2016).
Book Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biololgy 9, R7 (2008).
Article Google Scholar
Bairoch, A. et al. The Universal Protein Resource (UniProt). Nucleic Acids Research 33, D154–D159 (2005).
Article CAS PubMed Google Scholar
Marchler-Bauer, A. et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Research 39, D225–D229 (2011).
Article CAS PubMed Google Scholar
Ogata, H. et al. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research 27, 29–34 (1999).
Article CAS PubMed PubMed Central Google Scholar
Finn, R. D. et al. InterPro in 2017—beyond protein family and domain annotations. Nucleic Acids Research 45, D190–D199 (2017).
Article CAS PubMed Google Scholar
The Gene Ontology Consortium The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Research 47, D330–D338 (2019).
Article Google Scholar
Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Research 42, D222–D230 (2014).
Article CAS PubMed Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP322594 (2020).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_019925165.1 (2021).
Liu, J. The humpback grouper genome. Figshare https://doi.org/10.6084/m9.figshare.24145230.v2 (2023).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Parra, G., Bradnam, K. & Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067 (2007).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the Project of Sanya Yazhou Bay Science and Technology City (SCKJ-JYRC-2023-63, SCKJ-2023-01-004), and the Hainan Special PhD Scientific Research Foundation of Sanya Yazhou Bay Science and Technology City (HSPHDSRF-2022-02-009). This work was supported by the High-performance Computing Platform of YZBSTCACC. We appreciate the help from Novogene company for the sequencing.

Author information

These authors contributed equally: Jinxiang Liu, Huibang Sun.

Authors and Affiliations

MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences/Key Laboratory of Tropical Aquatic Germplasm of Hainan Province, Sanya Oceanographic Institution, Ocean University of China, Qingdao/Sanya, China
Jinxiang Liu, Huibang Sun, Lei Tang, Yujue Wang, Zhigang Wang & Quanqi Zhang
Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
Jinxiang Liu & Quanqi Zhang
Hainan Yazhou Bay Seed Laboratory, Sanya, China
Jinxiang Liu & Quanqi Zhang
MOE Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources, Hainan Tropical Ocean University, Sanya, China
Yunxiang Mao & Hai Huang

Authors

Jinxiang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Huibang Sun
View author publications
You can also search for this author in PubMed Google Scholar
Lei Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yujue Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhigang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yunxiang Mao
View author publications
You can also search for this author in PubMed Google Scholar
Hai Huang
View author publications
You can also search for this author in PubMed Google Scholar
Quanqi Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.X.L., H.B.S., Y.X.M., and Q.Q.Z. conceived the study. J.X.L. and Y.J.W. interpreted the data. L.T. and H.H. prepared the material. J.X.L., H.B.S., and Z.G.W. drafted the manuscript. J.X.L., H.H., and Q.Q.Z. contributed to the final manuscript editing.

Corresponding authors

Correspondence to Hai Huang or Quanqi Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, J., Sun, H., Tang, L. et al. Chromosome-level genome assembly of humpback grouper using PacBio HiFi reads and Hi-C technologies. Sci Data 11, 51 (2024). https://doi.org/10.1038/s41597-023-02907-4

Download citation

Received: 15 September 2023
Accepted: 29 December 2023
Published: 09 January 2024
DOI: https://doi.org/10.1038/s41597-023-02907-4