A chromosome-level genome assembly of the yellow-throated marten (Martes flavigula)

Mei, Xuesong; Liu, Guangshuai; Yan, Jiakuo; Zhao, Chao; Wang, Xibao; Zhou, Shengyang; Wei, Qinguo; Zhao, Shihu; Liu, Zhao; Sha, Weilai; Zhang, Honghai

doi:10.1038/s41597-023-02120-3

Download PDF

Data Descriptor
Open access
Published: 17 April 2023

A chromosome-level genome assembly of the yellow-throated marten (Martes flavigula)

Xuesong Mei ORCID: orcid.org/0000-0001-9087-2053¹^na1,
Guangshuai Liu¹^na1,
Jiakuo Yan¹,
Chao Zhao¹,
Xibao Wang¹,
Shengyang Zhou¹,
Qinguo Wei¹,
Shihu Zhao ORCID: orcid.org/0000-0003-0139-7743¹,
Zhao Liu¹,
Weilai Sha¹ &
…
Honghai Zhang¹

Scientific Data volume 10, Article number: 216 (2023) Cite this article

1288 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The yellow-throated marten (Martes flavigula) is a medium-sized carnivore that is widely distributed across much of Asia and occupies an extensive variety of habitats. We reported a high-quality genome assembly of this organism that was generated using Oxford Nanopore and Hi-C technologies. The final genome sequences contained 215 contigs with a total size of 2,449.15 Mb and a contig N50 length of 68.60 Mb. Using Hi-C analysis, 2,419.20 Mb (98.78%) of the assembled sequences were anchored onto 21 linkage groups. Merqury evaluation suggested that the genome was 94.95% complete with a QV value of 43.75. Additionally, the genome was found to comprise approximately 39.74% repeat sequences, of which long interspersed elements (LINE) that accounted for 26.13% of the entire genome, were the most abundant. Of the 20,464 protein-coding genes, prediction and functional annotation was successfully performed for 20,322 (99.31%) genes. The high-quality, chromosome-level genome of the marten reported in this study will serve as a reference for future studies on genetic diversity, evolution, and conservation biology.

The chromosome-level genome assembly of the giant dobsonfly Acanthacorydalis orientalis (McLachlan, 1899)

Article Open access 08 April 2024

Chromosome-level genome assembly and annotation of eel goby (Odontamblyopus rebecca)

Article Open access 02 February 2024

A high-quality chromosomal-level genome assembly of Greater Scaup (Aythya marila)

Article Open access 04 May 2023

Background & Summary

The yellow-throated marten belongs to the genus Martes of the family Mustelidae and is named after its conspicuous yellow pelage on its chest and throat¹. It is a voracious predator that feeds on several types of vertebrates, invertebrates, fruit, nectar, and food residue². Unlike many mustelids, the marten generally moves in groups of two to three individuals², which enables increased access to resources and reduces the risk of predation³. Given its preference for forested areas, it rarely appears in non-wooded environments⁴, as a consequence of which it may serve as a good indicator of forest ecosystem health. The marten performs several key roles in maintaining ecological balance, including spreading seeds⁵, and controlling the herbivore population size⁶, as a top-level predator in certain ecosystems⁴. The risk of extinction faced by the marten is low and the International Union for the Conservation of Nature (IUCN) classifies it as “Least Concern”⁷. However, rampant hunting, habitat loss, and other human activities pose substantial danger to the gradually decreasing marten populations⁷. Fortunately, certain protective measures, including legislation to counter these trends have been implemented in several countries, such as Myanmar⁸, Thailand⁹, South Korea⁶, and China¹⁰.

At present, research on the marten primarily focuses on its physical characteristics, behaviour, geographic range, and habitat. However, progress in molecular characterization, albeit slowly, has resulted in complete elucidation of its mitochondrial genome^11,12, established phylogenetic relationships between species on the basis of mitochondrial and/or partial nuclear gene sequences^13,14,15, and enabled population genetics analyses based on microsatellite markers^16,17. Genetic and evolutionary studies on the marten have been limited by the sparse nature of available genomic resources. For instance, the marten is the only extant species of the genus Martes that is adapted for survival in areas spanning from boreal to equatorial regions and from sea-level to an altitude of 4,510 m⁷. The likelihood is that there is some genetic variation among populations of the marten occupying different habitats. Therefore, a meaningful analysis of population structure, and the molecular mechanisms of adaptive evolution among different marten populations at the genomic level will be highly valuable. We applied Oxford Nanopore and Hi-C technologies to generate a chromosome-level genome assembly of the marten, which will serve as a useful resource in evolutionary and population genetics studies on this animal, as well as in chromosome evolution studies on Carnivora.

Methods

Sampling and sequencing

The yellow-throated marten sample used for DNA and RNA sequencing was obtained from Chengdu, China. Muscle tissue was stored at −80 °C and used to construct Illumina, Nanopore, and Hi-C libraries. High molecular weight genomic DNA was extracted from muscle tissues using a Blood & Cell Culture DNA Midi Kit.

Short-insert-size (~400 bp) paired-end sequencing libraries were constructed using the Truseq Nano DNA HT Sample Preparation Kit and sequenced on the Illumina HiSeq X Ten platform to generate 150 bp paired-end reads. These yielded 1.58 billion reads, 236.83 Gb of raw sequence data, which covered 96.70-fold of the genome assembly (Table 1, Table S1). Nanopore libraries were constructed and sequenced on the PromethION sequencer. In total, 27.76 million reads, 264.40 Gb of raw sequence data were obtained, which was 107.96-fold coverage of the genome assembly (Table 1, Table S2). The mean read length and the N50 length were 9.53 kb and 17.43 kb, and the longest read covered 204.65 kb (Table S2). Hi-C libraries were constructed using MboI restriction enzyme and sequenced on the Illumina NovaSeq6000 platform in 150 bp PE mode. As a result, 257.31 Gb of Hi-C reads were obtained, which covered 105.07-fold of the genome assembly (Table 1, Table S3).

Table 1 Statistics of sequencing data generated in this study.

Full size table

Additionally, RNA was extracted from seven tissues of the marten, including testis, stomach, kidney, pancreas, heart, spleen, and intestine. Transcriptome sequencing was performed on the Illumina Novaseq6000 platform, which yielded a total of 60.43 Gb of raw reads (Table 1, Table S4).

Genome size and heterozygosity estimation

Raw genomic Illumina sequencing reads were filtered using Fastp v0.12.6¹⁸ to remove adaptors, duplications, and low-quality reads. The clean reads were subsequently used to estimate genome size, heterozygosity, and repeat content based on 21-mer frequency distribution analysis using Jellyfish v2.3.0¹⁹ and GenomeScope v2.0²⁰. This resulted in the identification of 205,236,235,649 21-mers with a depth of 77 (Table S5). We therefore estimated that the genome of the marten is approximately 2,224.23 Mb in size, with a heterozygosity of 0.40% and a repeat content of 13.16% (Fig. 1, Table S5).

De novo assembly of the marten genome

Sequencing data generated from the Nanopore platform were corrected (parameters: “reads_cutoff: 1k,seed_cutoff: 19k”) and assembled (parameters: default) using NextDenovo v2.0-beta.1 (https://github.com/Nextomics/NextDenovo). Further improvement in the accuracy of the assembly was ensured by performing four rounds of self-correction and three rounds of consensus correction using ONT reads and Illumina short reads with Nextpolish v1.0.5²¹. The finally assembled genome was 2449.15 Mb in size with 215 contigs and a contig N50 of 68.60 Mb (Table 2). These findings closely mirror the genome size of Martes zibellina (2,420.68 Mb), a closely related species of the marten²². Further genome assembly summary statistics were computed using Gfastats v1.3.3²³ (Table 2).

Table 2 Summary statistics of the genome assembly.

Full size table

Chromosomal-level scaffolding

Chromosome-level scaffolding was performed by Hi-C analysis at the Genome Center of Grandomics (Wuhan, China). The raw Hi-C data were primarily filtered using Hi-C-Pro v2.8.0²⁴. Subsequently, post quality control with Fastp, the clean Hi-C data were mapped to the genome assembly of the marten using Bowtie2 v2.3.2²⁵ to get the unique mapped paired-end reads. As a result, 608.63 million uniquely mapped pair-end reads were obtained (Table S6), of which 83.19% were valid interaction pairs (Table S7). Combined with the valid Hi-C data, LACHESIS²⁴ was applied to produce a chromosomal-level genome. We further adjusted the misassembled contigs manually based on the interaction strength among the contigs and a linkage map using Juicebox²⁶. The final outcome entailed 2,419.20 Mb (98.78%) of assembled sequences that were anchored and orientated onto 21 chromosomes, ranging from 3.97 Mb to 219.65 Mb in length (Fig. 2, Table 3). Subsequently, the software ggplot2 in the R package was used to generate a genome-wide Hi-C heatmap to evaluate the quality of the chromosomal-level genome. The heatmap of chromosome crosstalk illustrated that the chromosomal-level genome was complete and robust (Fig. 3).

Table 3 Statistics of the chromosomal-level genome.

Full size table

Genome quality assessment

Complementary methods were employed to evaluate the quality of genome assembly. First, the Illumina reads and Nanopore reads were aligned to the marten genome using BWA v0.7.12-r1039²⁷ and Minimap2 v2.17²⁸, respectively. The results showed that 99.85% of the Illumina reads and 99.74% of the Nanopore reads could be mapped to the genome, with a coverage rate of 99.87% and almost 100%, respectively (Table S8, Table S9). Second, the completeness of the genome was evaluated by BUSCO v4.0.5 with a core gene set, referring to mammalia_odb10²⁹. As a result, 93.37% (8,614 of 9,226) of the complete BUSCO genes were identified, of which, 93.03% (8,583 of 9,226) were single copy and 0.34% (31 of 9,226) were duplicated (Fig. S1). Third, Merqury v1.3³⁰ was used to assess the consensus quality value (QV) and k-mer completeness of the genome assembly, which were found to be 43.75 and 94.95%, respectively (Table S10, Fig. S2).

Repeat annotation

Homology-based and ab initio prediction methods were used to identify repetitive sequences in the marten genome. The homology-based analysis was performed using RepeatMasker v4.1.0³¹ with the Repbase database³². For ab initio prediction, RepeatModeler v2.0.1³³ was utilized to construct a de novo repeat library, which was subsequently employed to predict repeats with RepeatMasker. We identified 973.18 Mb of repetitive sequences, accounting for 39.74% of the marten genome (Table 4). Among these, long interspersed elements (LINE) that accounted for 26.13% of the whole genome were the most abundant (Table 4). These results are supported by similar findings in published mustelids genomes^22,34,35.

Table 4 Statistics of repetitive elements in the marten genome.

Full size table

Prediction and functional annotation of protein-coding genes

We predicted protein-coding genes in the marten genome through integrating three different strategies: ab initio prediction, homology-based prediction, and transcriptome-based prediction. First, Augustus v2.5.5³⁶, GlimmerHMM v3.0.4³⁷, Geneid v1.4.4³⁸, and Genscan v1.0³⁹ were adopted to ab initio gene prediction with internal gene models. Second, protein sequences of seven species including, Bos taurus, Canis lupus familiaris, Enhydra lutris, Homo sapiens, Mustela erminea, Mustela putorius furo, and Mus musculus, as the templates of protein homology-based prediction were downloaded and aligned against the marten genome using TblastN v2.2.26⁴⁰ with an E-value ≤ 1e−5. The potential gene structure of each alignment was then predicted by GeneWise v2.4.1⁴¹. Third, transcriptome data were aligned to the marten genome with TopHat v2.1.1⁴² and the gene structures were predicted by Cufflinks v2.2.1⁴³. Finally, a non-redundant gene set was generated via integration of the three respective annotation files that were assigned different weights (ab initio prediction was “1”, homology-based prediction was “5”, and transcriptome-based prediction was “10”) in EVidenceModeler v1.1.1. PASA v2.3.3 was used to update the gene models by identifying untranslated regions to generate a final annotation⁴⁴.

Functional annotation of the protein-coding genes was accomplished using eggNOG-Mapper v2⁴⁵, a tool that enables rapid functional annotations of novel sequences on the basis of pre-computed orthology assignments, against the EggNOG v5.0 database⁴⁶.

Overall, we obtained 20,464 protein-coding genes in the marten genome, of which, 20,322 (99.31%) were successfully annotated. Additionally, we compared the distribution of mRNA length, coding DNA sequence (CDS) length, exon length, intron length and exon number in the marten genome with that of seven other mustelids, including, Enhydra lutris, Lontra canadensis, Lutra lutra, Mustela erminea, Meles meles, Mustela putorius furo, and Neovison vison (Table 5, Fig. 4). The results revealed a higher percentage of shorter mRNA in the marten genome than that in the genomes of the seven other mustelids (Fig. 4a). Further, short intronic lengths (about 0~75 bp) in the marten genome had a distribution pattern that was distinct from the seven other mustelids (Fig. 4d). One of the possible reasons is that there is slight deviation in the results of genome assembly and/or annotation between different species.

Table 5 The comparisons of gene elements in the marten genome with seven other mustelids.

Full size table

Data Records

The genomic Illumina sequencing data were deposited in the Sequence Read Archive at NCBI SRR21452075⁴⁷. The genomic Nanopore sequencing data were deposited in the Sequence Read Archive at NCBI SRR21426791⁴⁸. The transcriptome Illumina sequencing data were deposited in the Sequence Read Archive at NCBI SRR21460068-SRR21460074^{49,50,51,52,53,54,55}. The Hi-C sequencing data were deposited in the Sequence Read Archive at NCBI SRR21430408⁵⁶. The final chromosome assembly were deposited in the GenBank at NCBI JAODOS000000000⁵⁷. The final chromosome assembly, gene structure annotation, repeat annotation, and gene functional prediction were deposited in the Figshare database⁵⁸.

Technical Validation

DNA quantification and qualification

DNA degradation and contamination was monitored on 1% agarose gels. DNA purity was detected using NanoDrop One UV-Vis spectrophotometer. DNA concentration was measured by Qubit Fluorometer.

RNA quantification and qualification

RNA degradation and contamination was monitored on 1% agarose gels. RNA concentration was measured by Qubit Flurometer. RNA integrity was assessed using Agilent 2100 Bioanalyzer.

Quality filtering of Illumina data

To make sure the reads reliable in the following analyses, we used Fastp to elevate the quality of raw reads generated from the Illumina platform. The data were filtered out as follows:

(1)
removing the reads with more than 10% of Ns;
(2)
removing the reads with a quality score less than 20 for 20% of bases;
(3)
removing the reads with adapter sequences;
(4)
removing the reads with duplications.

Code availability

No specific code or script was used in this work. The commands used in the processing were all executed according to the manuals and protocols of the corresponding bioinformatics software.

References

Grassman, L. I., Tewes, M. E. & Silvy, N. J. Ranging, habitat use and activity patterns of binturong Arctictis binturong and yellow-throated marten Martes flavigula in north-central Thailand. Wildlife Biology 11, 49–57 (2005).
Article Google Scholar
Parr, J. W. K. & Duckworth, J. W. Notes on diet, habituation and sociality of Yellow-throated Martens Martes flavigula. Small Carnivore Conservation 36, 27–29 (2007).
Google Scholar
Twining, J. P. & Mills, C. Cooperative hunting in the yellow-throated marten (Martes flavigula): Evidence for the not-so-solitary marten? Ecosphere 12, e03398 (2021).
Article Google Scholar
Lee, H. et al. Current distribution and habitat models of the yellow-throated marten, Martes flavigula, in South Korea. Mammal Research 66, 429–441 (2021).
Article Google Scholar
Zhou, Y., Slade, E., Newman, C., Wang, X. & Zhang, S. Frugivory and seed dispersal by the yellow-throated marten, Martes flavigula, in a subtropical forest of China. Journal of Tropical Ecology 24, 219–223 (2008).
Article Google Scholar
Lee, S., Lee, S., Song, W. & Lee, M. Habitat Potential Mapping of Marten (Martes flavigula) and Leopard Cat (Prionailurus bengalensis) in South Korea Using Artificial Neural Network Machine Learning. Applied Sciences 7, 912 (2017).
Article Google Scholar
Chutipong, W. et al. Martes flavigula. The IUCN Red List of Threatened Species 2016, e.T41649A45212973 (2016).
Zaw, T. et al. Status and distribution of small carnivores in Myanmar. Small Carnivore Conservation 38, 2–28 (2008).
Google Scholar
Chutipong, W. et al. Current distribution and conservation status of small carnivores in Thailand: a baseline review. Small Carnivore Conservation 51, 96–136 (2014).
Google Scholar
Yiming, L., Zenxiang, G., Li, X., Sung, W. & Niemelä, J. Illegal Wildlife Trade in the Himalayan Region of China. Biodiversity and Conservation 9, 901–918 (2000).
Article Google Scholar
Xu, C., Zhang, H. & Ma, J. The complete mitochondrial genome of Martes flavigula. Mitochondrial DNA 24, 240–242 (2013).
Article CAS PubMed Google Scholar
Jang, K. H. & Hwang, U. W. Complete mitochondrial genome of Korean yellow-throated marten, Martes flavigula (Carnivora, Mustelidae). Mitochondrial DNA 27, 1–2 (2014).
Google Scholar
Stone, K. D. & Cook, J. A. Molecular evolution of Holarctic martens (genus Martes. Mammalia: Carnivora: Mustelidae). Molecular Phylogenetics & Evolution 24, 169–179 (2002).
Article CAS Google Scholar
Yu, L. et al. On the phylogeny of Mustelidae subfamilies: analysis of seventeen nuclear non-coding loci and mitochondrial complete genomes. BMC Evolutionary Biology 11, 92 (2011).
Article PubMed PubMed Central Google Scholar
Law, C. J., Slater, G. J. & Mehta, R. S. Lineage diversity and size disparity in Musteloidea: testing patterns of adaptive radiation using molecular and fossil-based methods. Systematic Biology 67, 127–144 (2018).
Article CAS PubMed Google Scholar
Lee, M. et al. Characterization of 33 microsatellite markers and development of multiplex PCR for yellow-throated marten (Martes flavigula). Genetics and Molecular Research 16, 1–8 (2017).
Article Google Scholar
Lee, S. M. et al. A core set of microsatellite loci for yellow-throated marten, Martes flavigula: a case of inferences of family relationships. Genes & Genomics 41, 1457–1465 (2019).
Article CAS Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Marcais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–70 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ranallo-Benavidez, T. R., Jaron, K. S. & Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nature Communications 11, 1432 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hu, J., Fan, J., Sun, Z. & Liu, S. NextPolish: A fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2019).
Article Google Scholar
Liu, G. et al. First draft genome of the sable, Martes zibellina. Genome Biology and Evolution 12, 59–65 (2020).
Article PubMed PubMed Central Google Scholar
Formenti, G. et al. Gfastats: conversion, evaluation and manipulation of genome sequences using assembly graphs. Bioinformatics 38, 4214–4216 (2022).
Article CAS PubMed PubMed Central Google Scholar
Burton, J. N. et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nature Biotechnology 31, 1119–1125 (2013).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nature Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Durand, N. C. et al. Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. Cell Systems 3, 99–101 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and Accurate Short Read Alignment with Burrows-Wheeler Transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: Reference-free quality, completeness, and phasing assessment for genome assemblies. Genome biology 21, 245 (2020).
Article CAS PubMed PubMed Central Google Scholar
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Current Protocols in Bioinformatics 4, 1–14 (2009).
Google Scholar
Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mobile DNA 6, 11 (2015).
Article PubMed PubMed Central Google Scholar
Flynn, J. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proceedings of the National Academy of Sciences 117, 9451–9457 (2020).
Article ADS CAS Google Scholar
Derežanin, L. et al. Multiple types of genomic variation contribute to adaptive traits in the mustelid subfamily Guloninae. Molecular Ecology 31, 2898–2919 (2022).
Article PubMed Google Scholar
Ekblom, R. et al. Genome sequencing and conservation genomics in the Scandinavian wolverine population. Conservation Biology 32, 1301–1312 (2018).
Article PubMed Google Scholar
Stanke, M. & Morgenstern, B. AUGUSTUS: A web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic acids research 33, W465–W467 (2005).
Article CAS PubMed PubMed Central Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: Two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
Article CAS PubMed Google Scholar
Parra, G., Blanco, E. & Guigo, R. GeneID in Drosophila. Genome Research 10, 511–515 (2000).
Article CAS PubMed PubMed Central Google Scholar
Burge, C. & Karlin, S. Prediction of Complete Gene Structures in Human Genomic DNA. Journal of molecular biology 268, 78–94 (1997).
Article CAS PubMed Google Scholar
Gertz, E. M., Yu, Y., Agarwala, R., Schaffer, A. A. & Altschul, S. F. Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST. BMC biology 4, 1–14 (2006).
Article Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and GenomeWise. Genome research 14, 988–995 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kim, D. et al. Tophat2: Accurate Alignment of Transcriptomes in the Presence of Insertions, Deletions and Gene Fusions. Genome biology 14, R36 (2013).
Article PubMed PubMed Central Google Scholar
Ghosh, S. & Chan, C. K. K. Analysis of RNA-Seq Data Using TopHat and Cufflinks. Methods in Molecular Biology 1374, 339–361 (2016).
Article CAS PubMed Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biology 9, R7 (2008).
Article PubMed PubMed Central Google Scholar
Cantalapiedra, C. P., Hernández-Plaza, A., Letunic, I., Bork, P. & Huerta-Cepas, J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Molecular biology and evolution 38, 5825–5829 (2021).
Article CAS PubMed PubMed Central Google Scholar
Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic acids research 47, D309–D314 (2018).
Article PubMed Central Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21452075 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21426791 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460068 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460069 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460070 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460071 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460072 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460073 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21460074 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21430408 (2022).
Mei, X.S. Martes flavigula isolate Sichuan, whole genome shotgun sequencing project. GenBank https://identifiers.org/ncbi/insdc:JAODOS000000000 (2022).
Mei, XS. et al. A chromosome-level genome assembly of the yellow-throated marten (Martes flavigula), figshare, https://doi.org/10.6084/m9.figshare.c.6203869.v1 (2023).
Chen, C. et al. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Molecular Plant 13, 1194–1202 (2020).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 32070405; No. 31872242; No. 32170530; No. 31900311; No. 32000291).

Author information

These authors contributed equally: Xuesong Mei, Guangshuai Liu.

Authors and Affiliations

School of Life Sciences, Qufu Normal University, Qufu, 273165, Shandong, China
Xuesong Mei, Guangshuai Liu, Jiakuo Yan, Chao Zhao, Xibao Wang, Shengyang Zhou, Qinguo Wei, Shihu Zhao, Zhao Liu, Weilai Sha & Honghai Zhang

Authors

Xuesong Mei
View author publications
You can also search for this author in PubMed Google Scholar
Guangshuai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jiakuo Yan
View author publications
You can also search for this author in PubMed Google Scholar
Chao Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Xibao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shengyang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Qinguo Wei
View author publications
You can also search for this author in PubMed Google Scholar
Shihu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Weilai Sha
View author publications
You can also search for this author in PubMed Google Scholar
Honghai Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Mei X.S., Liu G.S. and Zhang H.H. designed the study; Yan J.K. and Zhao C. collected the samples; Mei X.S., Wang X.B., Zhou S.Y., Wei Q.G., Zhao S.H., Liu Z. and Sha W.L. performed the data analysis; Mei X.S. wrote the paper. All authors read, edited, and approved the final manuscript.

Corresponding author

Correspondence to Honghai Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mei, X., Liu, G., Yan, J. et al. A chromosome-level genome assembly of the yellow-throated marten (Martes flavigula). Sci Data 10, 216 (2023). https://doi.org/10.1038/s41597-023-02120-3

Download citation

Received: 03 October 2022
Accepted: 29 March 2023
Published: 17 April 2023
DOI: https://doi.org/10.1038/s41597-023-02120-3