A chromosome-scale reference genome for Spironucleus salmonicida

Xu, Feifei; Jiménez-González, Alejandro; Kurt, Zeynep; Ástvaldsson, Ásgeir; Andersson, Jan O.; Svärd, Staffan G.

doi:10.1038/s41597-022-01703-w

Download PDF

Data Descriptor
Open access
Published: 24 September 2022

A chromosome-scale reference genome for Spironucleus salmonicida

Scientific Data volume 9, Article number: 585 (2022) Cite this article

1362 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Spironucleus salmonicida is a diplomonad causing systemic infection in salmon. The first S. salmonicida genome assembly was published 2014 and has been a valuable reference genome in protist research. However, the genome assembly is fragmented without assignment of the sequences to chromosomes. In our previous Giardia genome study, we have shown how a fragmented genome assembly can be improved with long-read sequencing technology complemented with optical maps. Combining Pacbio long-read sequencing technology and optical maps, we are presenting here this new S. salmonicida genome assembly in nine near-complete chromosomes with only three internal gaps at long repeats. This new genome assembly is not only more complete sequence-wise but also more complete at annotation level, providing more details into gene families, gene organizations and chromosomal structure. This near-complete reference genome will aid comparative genomics at chromosomal level, and serve as a valuable resource for the diplomonad community and protist research.

Measurement(s)	genomic_DNA • sequence_assembly • sequence feature annotation
Technology Type(s)	SMRT Sequencing • sequence assembly process • sequence annotation
Sample Characteristic - Organism	Spironucleus salmonicida

Nanopore sequencing technology, bioinformatics and applications

Article 08 November 2021

The variation and evolution of complete human centromeres

Article Open access 03 April 2024

Single-cell RNA-seq of the rare virosphere reveals the native hosts of giant viruses in the marine environment

Article 11 April 2024

Background & Summary

Spironucleus salmonicida (“the salmonid killer”) causes systemic infections in farmed Atlantic salmon, Chinook salmon and Arctic char^1,2, thus poses a threat to sustainable aquaculture. Outbreaks of spironucleosis in farmed Atlantic salmon, Salmo salar, is a recurring problem and causes mass mortality and economical loss in for example Northern Norway. Salmon infected with S. salmonicida develops internal haemorrhaging, splenomegaly and granulomatous lesions in the liver and spleen, and drug treatment is not possible. This makes studies of the parasite important in order to develop alternative strategies to manage the parasite³.

S. salmonicida belongs to diplomonads, a group of unicellular protists with two diploid nuclei bearing different life styles. There are parasitic diplomonads like S. salmonicida, for example Giardia species which cause diarrhoea in various animals including humans⁴. All members of the Giardia genus are strictly intestinal parasites, while S. salmonicida is a well-adapting pathogen that can colonize different sites in the host⁵. It was shown in our previous study that S. salmonicida possesses an extended metabolic repertoire and more extensive gene regulation, probably making it more adapted to cope with environmental fluctuations⁶. There are also free-living diplomonads like Trepomonas sp. PC1, and comparative genomics have shown that the free-living life style most likely is a secondary adaptation and evolved from its parasitic ancestor^7,8.

We recently published two Giardia genome assemblies, Giardia intestinalis WB⁹ and Giardia muris¹⁰, in near-complete chromosomes using Pacbio reads alone or in combination with optical maps. With similar sequencing and assembly strategy, we obtain a high-quality reference genome of S. salmonicida in near-complete chromosomes. Diplomonad genomes in near-complete chromosomes make it possible to study gene organization at the chromosomal level, and provide ground for studying chromosomal evolution.

Methods

DNA preparation and sequencing

S. salmonicida (ATCC 50377), previously known as Spironucleus barkhanus², was isolated from a muscle abscess in Atlantic salmon grown in Vesterålen Sea in northern Norway. Cells were obtained from American Type Culture Collection (ATCC) in 2008 and grown axenically in slanted polypropylene tubes using a modified liver digest yeast extract (LYI) medium¹¹ at 16 °C. Stocks of the S. salmonicida ATCC isolate were cryopreserved in liquid nitrogen. DNA extraction was performed in 2015, new batch of cells were taken up from the cryopreserved stock, cultured until confluent and DNA extracted directly to ensure minimal accumulation of mutations. Around 10⁸ cells were harvested for DNA extraction using a phenol-chloroform extraction. The DNA was then purified using the Qiagen Genomic Tip 100/G according to the manufacturer protocol. The purified DNA was quantified using a Qubit fluorometer and quality checked using a NanoDrop and agarose gel electrophoresis. DNA was then stored at −20 °C and 40 μg of gDNA was sent directly for sequencing at the Uppsala Genome Center hosted by the Science for Life Laboratory (Uppsala University). A 10 kbp PacBio library was generated following the standard SMRT bell construction protocol according to the manufacturer recommended protocol. The library was sequenced on 6 SMRT cells of the Pacbio RS II instrument using the P6-C4 chemistry, which generated 267,495 reads in 2.6 billion bases (Table 1) with an N50 length of 14.6 kbp.

Table 1 Comparison of the old and the new S. salmonicida genome assemblies.

Full size table

Genome assembly

The genome was assembled using the same method described in the G. intestinalis WB genome publication⁹. HGAP¹² was used for de novo genome assembly, followed by consensus sequence calling with Quiver¹². Both programs are part of SMRT Analysis (v2.3.0) pipeline¹² from Pacbio. This yielded 61 contigs.

The contigs were then mapped to the optical maps of the nine chromosomes (Fig. 1, obtained for the old genome assembly in 2011⁶) using MapSolver (v3.2.0) from OpGen, and the mapping information was used to stitch together neighboring contigs into scaffolds, as described in the G. intestinalis WB genome publication⁹. To further close the gaps in the nine scaffolds, PBJelly (v15.8.24)¹³ was run using the Pacbio reads, and the resulting scaffolds were polished with Quiver¹². Canu (v1.4)¹⁴ was used to assemble reads that failed to map to the scaffolds, which generated 33 contigs with sizes < = 36 kbp. These small contigs do not map to the optical maps because they are below the size limitation for a sequence to uniquely map to an optical map. Another round of Quiver polishing was applied on Canu contigs combined with the nine scaffolds. To further improve per base quality, DNA Illumina paired-end reads (SRR948594¹⁵) were mapped to the draft assembly using BWA-MEM (v0.7.15)¹⁶ and the resulting BAM file was fed to Pilon (v1.21)¹⁷ to update indels and SNPs. The final assembly has a size of 14.7 Mbp, and is distributed in 42 scaffolds with nine major ones representing 96.5% of the total size (Table 1).

Heterozygosity estimation

DNA Illumina reads (SRR948594¹⁵,) were re-mapped to the base-corrected sequences using BWA (v0.7.15)¹⁶ and samtools (v1.8)¹⁸ mpileup result was generated and parsed. Sites with at least 20X base coverage and an alternative base in at least 10% of the reads were called as SNP (or allelic sequence heterozygosity (ASH)) sites.

Genome annotation

Annotation from the old S. salmonicida genome assembly⁶ was transferred to the new one using RATT (v0.95)¹⁹. 541 RATT transferred genes were selected to train GlimmerHMM (v3.0.1)²⁰, which together with Prodigal (v2.6.3)²¹ were used for gene structural prediction. Functional annotation was performed using a combination of similarity information from BLASTP²² search against NR database and domain information from Conserved Domain (CD) search²³. Transferred and predicted genes were then merged, and the inconsistent ones were manually inspected. RNA-seq reads (SRR948595²⁴) mapped to the genome assembly were used as a guideline for structural annotation during manual examination, and the mapping was carried out with BWA (v0.7.15) since the old genome assembly contained only four introns. Searching for the conserved AC-repeat intron motif⁶ revealed no new intron in the new genome assembly.

The functional annotation was further improved by mining metabolic genes in Pathway Tools v21.5²⁵ as described in G. muris genome publication¹⁰. An updated and curated G. intestinalis WB genome assembly⁹ has been published since the previous S. salmonicida genome was annotated. Genes shared with this assembly (BLASTP e-value <1e-10) but annotated differently were double checked to incorporate the updated annotations when applicable.

There are in total 8,661 protein-coding genes (plus an additional six partial and 194 pseudo genes) annotated in the new genome assembly (Table 1). Although the number of genes increased compared to the old genome assembly, with a bigger genome size and a bigger mean intergenic region size, coding density of the new genome turns out to be slightly smaller (Table 1).

5 S ribosomal RNAs (rRNAs) were predicted using rnammer (v1.2)²⁶, and there are 40 copies of them in tandem array located on chromosome 9 (Fig. 2). 18 S, 28 S and 5.8 S rRNA annotations were transferred from the old genome assembly, and there is one copy of each, all found in a single small contig just like in the old genome assembly⁶. In agreement with the old assembly, ribosomal RNA contig has a 15 times higher Pacbio read coverage compared to the genomic average, indicating the contig is most likely a collapse of repetitive reads. tRNAs were identified using tRNAscan-SE (v1.23)²⁷, and there are more duplicated copies of tRNAs annotated in the new genome assembly (Table 1).

No sign of contamination has been observed at assembly nor annotation level.

Data Records

This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession AUWU00000000. The version described in this paper is version AUWU02000000²⁸, and the GenBank assembly accession is GCA_000497125.2²⁹. Raw DNA sequence reads from Pacbio are deposited at NCBI Sequence Read Archive (SRA) under accession number SRP028565³⁰. Pacbio reads mapped CRAM file (BAM file converted to CRAM file using samtools view command to reduce file size) is provided in the Figshare database under Digital Object Identifier (DOI) code³¹, and Illumina reads mapped BAM file is available in Figshare database under DOI code³².

Technical Validation

Genome completeness

Restriction enzyme (NheI) maps of the nine chromosomes align well with the genomic sequences digested with NheI in silico (Fig. 1). In fact, 95.2% of the optical maps are covered by the assembled genomic sequences. Interestingly, the right ends of chromosome 2 and 3 were reported as incomplete maps because of low coverage. However, long-read assembly extends well beyond the incomplete right ends of the optical maps, and the right end of chromosome 3 ends in telomeric repeats (TAGG)n. In fact, ten out of 18 chromosome ends are assembled into telomeric repeats, specially, chromosome 1, 5, 7 and 8 have both telomeric ends complete (Fig. 2). There are terminal gaps (Ns) at four out of the eight chromosome ends missing telomeric repeats, and the sizes were determined by the alignment to the optical maps. Three internal gaps also present in the new genome assembly, and the largest of all seven gaps is the internal gap located on chromosome 2 at a size of 87.9 kbp (Fig. 2). The new genome assembly is highly contiguous compared to the old fragmented genome assembly with 232 gaps distributed in 233 scaffolds, and preserves the synteny when compared to the old genome sequences (Fig. 3).

The new genome assembly is also better at resolving duplicated regions, which is defined as BLASTN matches against itself with > = 2000 bp and 95% identity. In fact, 2.7 Mbp (19.2%) of the new genome sequences are characterized as duplicated regions compared to 1.5 Mbp of the old genome sequences. Regions involved in duplications are most commonly observed in short stretches of sequences with just one gene. However, we see also multi-gene regions involved in duplications up to 64 kbp in size (Fig. 2, ribbons highlighted in red in the innermost circle), which might be the largest repetitive regions the assembly could resolve due to the limitation of the read length. The largest BLASTN match is on chromosome 9, which is 33 kbp in size overlapping itself for 2 kbp resulting in a tandem repeat of 64 kbp in size (Fig. 2, red ribbon at chromosome 9 in the innermost circle). Manual examination of the Pacbio long read pileup and coverage at the two largest regions involved in duplications revealed the read coverage is in line with the neighbouring regions as well as the average coverage, indicating repetitive regions were correctly resolved. Genes found in duplicated regions are often members of multi-gene families. Duplicated regions also maintain higher GC contents with a mean GC level at 44.7%, while the rest of the genome sequence has a mean GC level at 30.7% and the whole genome sequence has a mean GC level at 33.5% (Table 1). Better resolved duplicated regions probably contribute to the observation of lower ASH level in the new genome assembly (Table 1).

Annotation improvement

With this near-complete reference genome, we are able to present here a more complete annotation. Among the 8,661 full length protein-coding genes, 5,016 genes have exact copies as in the old genome assembly, 1,273 genes have the same sequences but updated descriptions, and 287 genes are the same but with SNPs. Start codons of 976 genes were adjusted, based on the alignments of orthologous genes and A-rich motifs at the start codons⁶. 3′ end sequences of 156 genes were updated due to differences of short insertions and deletions between the two genome sequences. 303 genes share sequence similarity with genes already annotated, but no distinct orthologous gene could be assigned. There are 650 genes which are unique to the new genome assembly, most of them are hypothetical proteins, but there are also genes with putative functional annotations. The most interesting unique gene is Histone H4, which was completely missing in the old genome assembly, but is presented in ten copies (plus five pseudo copies) in the new genome assembly with nine (plus the five pseudo copies) of them arranged in tandem array interspersed by four copies of reverse transcriptase (RT) on chromosome 3 (Fig. 2).

Better genome assembly leads also to better annotation of protein families. S. salmonicida harbors a large group of cysteine-rich membrane proteins, which were divided into cysteine-rich membrane protein 1 (CRMP1) and cysteine-rich membrane protein 2 (CRMP2)⁶. CRMP1s resemble variant-specific surface proteins (VSPs) in G. intestinalis structurally with a transmembrane domain at the 3′ end followed by a pentapeptide motif⁶, and VSPs are known to be expressed on the cell surface of the parasite to assist the parasite to bypass host immune system³³. We find more CRMP1s and CRMP2s in the new genome assembly, now 138 CRMP1s (plus 22 pseudo and 1 partial) compared to 125 in the old genome assembly and 248 CRMP2s (plus 30 pseudo and 1 partial) compared to 195.

With nine near-complete chromosomes, we could visualize how these cysteine-rich membrane proteins are organized along the chromosomes (Fig. 2). The gene families are enriched in arrays on certain chromosomes, especially chromosome 2 (42 CRMP1s and 99 CRMP2s) and chromosome 9 (59 CRMP1s and 41 CRMP2s). This organization of gene families is different from what has been observed in the Giardia genomes. Cysteine-rich protein family specifically VSPs are enriched at terminal ends in G. muris genome¹⁰, while they are all over the chromosomes in G. intestinalis WB genome⁹. We also noticed that the regions around the cysteine-rich membrane proteins as well as the chromosomal ends tend to be gene-poor (Fig. 2, green dots in coding% track), and these regions tend to have higher allelic sequence variation (Fig. 2, red dots in SNPs track) and GC level. Similar patterns have been observed in the G. intestinalis WB genome⁹, suggesting that gene conversion mechanisms might be involved in the expansion of the antigenic surface protein repertoire.

Code availability

Software including their version information were listed in the method section. Custom scripts were provided at personal GitHub (https://github.com/feifei/scripts_to_share), including scripts to scaffold the contigs with optical maps (scaffolding_with_maps.py), to identify indel and SNPs using mapping information from Illumina DNA reads (base_change_from_pileup.py) and update reference sequence accordingly (base_change_incorporation.py), to call SNPs (snps.py), to analyze duplicated regions (duplicated_regions.py) and to generate the figures (optical_maps.R, circos.R, dotplot.R).

References

Kent, M. L. et al. Systemic hexamitid (Protozoa, Diplomonadida) infection in seawater pen-reared Chinook salmon Oncorhynchus tshawytscha. Dis. Aquat. Organ. 14, 81–89 (1992).
Article Google Scholar
Jørgensen, A. & Sterud, E. The marine pathogenic genotype of Spironucleus barkhanus from farmed salmonids redescribed as Spironucleus salmonicida n. sp. J. Eukaryot. Microbiol. 53, 531–541 (2006).
Article Google Scholar
Williams, C. F. et al. Spironucleus species: economically-important fish pathogens and enigmatic single-celled eukaryotes. J. Aquac. Res. Dev. S2, 002 (2011).
Google Scholar
Monis, P. T., Caccio, S. M. & Thompson, R. C. A. Variation in Giardia: towards a taxonomic revision of the genus. Trends Parasitol. 25, 93–100 (2009).
Article Google Scholar
Sterud, E., Poppe, T. & Bornø, G. Intracellular infection with Spironucleus barkhanus (Diplomonadida: Hexamitidae) in farmed Arctic char Salvelinus alpinus. Dis. Aquat. Organ. 56, 155–61 (2003).
Article Google Scholar
Xu, F. et al. The genome of Spironucleus salmonicida highlights a fish pathogen adapted to fluctuating environments. PLoS Genet. 10, e1004053 (2014).
Article Google Scholar
Xu, F. et al. On the reversibility of parasitism: adaptation to a free-living lifestyle via gene acquisitions in the diplomonad Trepomonas sp. PC1. BMC Biol. 14, 62 (2016).
Article Google Scholar
Jiménez-González, A. & Andersson, J. O. Metabolic reconstruction elucidates the lifestyle of the last Diplomonadida common ancestor. mSystems 5, e00774–20 (2020).
Article Google Scholar
Xu, F., Jex, A. & Svärd, S. G. A chromosome-scale reference genome for Giardia intestinalis WB. Sci. Data 7, 38 (2020).
Article CAS Google Scholar
Xu, F. et al. The compact genome of Giardia muris reveals important steps in the evolution of intestinal protozoan parasites. Microb. Genom. 6, e000402 (2020).
Google Scholar
Jerlström-Hultqvist, J., Einarsson, E. & SvÃ¤rd, S. G. Stable transfection of the diplomonad parasite Spironucleus salmonicida. Eukaryotic cell 11, 1353–1361 (2012).
Article Google Scholar
Chin, C.-S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2013).
Article CAS Google Scholar
English, A. C. et al. Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE 7, e47768 (2012).
Article ADS CAS Google Scholar
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR948594 (2015).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
Article ADS Google Scholar
Li, H. et al. The Sequence alignment/map (SAM) format and SAMtools. Bioinformatics 25, 2078–9 (2009).
Article Google Scholar
Otto, T. D., Dillon, G. P., Degrave, W. S. & Berriman, M. RATT: Rapid Annotation Transfer Tool. Nucleic Acids Res. 39, e57 (2011).
Article CAS Google Scholar
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
Article CAS Google Scholar
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
Article Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2008).
Article MathSciNet Google Scholar
Marchler-Bauer, A. & Bryant, S. H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–31 (2004).
Article CAS Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR948595 (2015).
Karp, P. D. et al. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology. Briefings in Bioinformatics 17, 877–890 (2015).
Article Google Scholar
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35, 3100–3108 (2007).
Article ADS CAS Google Scholar
Lowe, T. M. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
Article CAS Google Scholar
Xu, F. Spironucleus salmonicida whole genome shotgun sequencing project. Genbank https://identifiers.org/ncbi/insdc:AUWU02000000 (2021).
NCBI Assembly https://identifiers.org/ncbi/insdc.gca:GCA_000497125.2 (2021).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP028565 (2020).
Xu, F. spiro.pacbio.sorted.cram, Figshare, https://doi.org/10.6084/m9.figshare.20410980 (2022).
Xu, F. spiro.illumina.sorted.bam, Figshare, https://doi.org/10.6084/m9.figshare.20288733 (2022).
Krakovka, S., Ribacke, U., Miyamoto, Y., Eckmann, L. & Svärd, G. S. Characterization of metronidazole-resistant Giardia intestinalis lines by comparative transcriptomics and proteomics. Front. Microbiol. 13, 834008 (2022).
Article Google Scholar
Gu, Z., Gu, L., Eils, R., Schlesner, M. & Brors, B. Circlize implements and enhances circular visualization in R. Bioinformatics 30, 2811–2812 (2014).
Article CAS Google Scholar
Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome Biol. 5, R12 (2004).
Article Google Scholar
Chakraborty, M. et al. Hidden genetic variation shapes the structure of functional elements in. Drosophila. Nat. Genet. 50, 20–25 (2018).
Article CAS Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. Springer (2016).

Download references

Acknowledgements

We thank Uppsala Genome Center for sequencing the genome.

Funding

Open access funding provided by Uppsala University.

Author information

Feifei Xu
Present address: Department of Medical Cell Biology, BMC, Box 571, Uppsala University, SE-751 23, Uppsala, Sweden
Ásgeir Ástvaldsson
Present address: Department of Microbiology, National Veterinary Institute, SE-751 89, Uppsala, Sweden

Authors and Affiliations

Department of Cell and Molecular Biology, BMC, Box 596, Uppsala University, SE-751 24, Uppsala, Sweden
Feifei Xu, Alejandro Jiménez-González, Zeynep Kurt, Ásgeir Ástvaldsson, Jan O. Andersson & Staffan G. Svärd

Authors

Feifei Xu
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Jiménez-González
View author publications
You can also search for this author in PubMed Google Scholar
Zeynep Kurt
View author publications
You can also search for this author in PubMed Google Scholar
Ásgeir Ástvaldsson
View author publications
You can also search for this author in PubMed Google Scholar
Jan O. Andersson
View author publications
You can also search for this author in PubMed Google Scholar
Staffan G. Svärd
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.S., J.A. and F.X. conceived the study, Á.Á. prepared the DNA for sequencing. F.X. assembled and validated the assembly, F.X. and A.J. annotated the genome. F.X. and Z.K. analyzed the genome. F.X. drafted the paper and all authors wrote the paper.

Corresponding author

Correspondence to Feifei Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xu, F., Jiménez-González, A., Kurt, Z. et al. A chromosome-scale reference genome for Spironucleus salmonicida. Sci Data 9, 585 (2022). https://doi.org/10.1038/s41597-022-01703-w

Download citation

Received: 11 April 2022
Accepted: 15 September 2022
Published: 24 September 2022
DOI: https://doi.org/10.1038/s41597-022-01703-w

This article is cited by

Single molecule real-time sequencing data sets of Hypericum perforatum L. plantlets and cell suspension cultures
- Rajendran K. Selvakesavan
- Maria Nuc
- Gregory Franklin
Scientific Data (2024)