Genome survey, high-resolution genetic linkage map construction, growth-related quantitative trait locus (QTL) identification and gene location in Scylla paramamosain

Zhao, Ming; Wang, Wei; Chen, Wei; Ma, Chunyan; Zhang, Fengying; Jiang, Keji; Liu, Junguo; Diao, Le; Qian, Heng; Zhao, Junxia; Wang, Tian; Ma, Lingbo

doi:10.1038/s41598-019-39070-z

Download PDF

Article
Open access
Published: 27 February 2019

Genome survey, high-resolution genetic linkage map construction, growth-related quantitative trait locus (QTL) identification and gene location in Scylla paramamosain

Ming Zhao ORCID: orcid.org/0000-0002-1221-9240¹^na1,
Wei Wang¹^na1,
Wei Chen¹,
Chunyan Ma¹,
Fengying Zhang¹,
Keji Jiang¹,
Junguo Liu¹,
Le Diao¹,
Heng Qian¹,
Junxia Zhao¹,
Tian Wang¹ &
…
Lingbo Ma¹

Scientific Reports volume 9, Article number: 2910 (2019) Cite this article

1889 Accesses
14 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Scylla paramamosain is one of the most economically important crabs in China. In this study, the first genome survey sequencing of this crab was performed, and the results revealed that the estimated genome size was 1.21 Gb with high heterozygosity (1.3%). Then, RAD technology was used to construct a high-resolution linkage map for this species. A total of 24,444 single nucleotide polymorphism (SNP) makers were grouped into 47 linkage groups. The total length of the linkage groups was 3087.53 cM with a markers interval of 0.92 cM. With the aid of transcriptome and genome scaffold data, 4,271 markers were linked to genes, including several important growth-related genes such as transforming growth factor-beta regulator I, immune related-gene C-type lectin and ecdysone pathway gene broad-complex-like protein. Further, 442 markers, representing 279 QTLs, associated with 24 traits were identified, and of these markers, 78 were linked to genes. Some interesting genes, such as dedicator of cytokinesis protein 3, tenascin-X and DNA helicase MCM8, were believed to have important relationship with specific traits and merit further exploration. The results of this study will accelerate the genetic improvement and genome sequencing analysis of the mud crab.

Genome survey and high-resolution genetic map provide valuable genetic resources for Fenneropenaeus chinensis

Article Open access 06 April 2021

Xianhong Meng, Qiang Fu, … Jie Kong

Linkage mapping and QTL analysis of growth traits in Rhopilema esculentum

Article Open access 10 January 2022

Bailing Chen, Yulong Li, … Yunfeng Li

An ultra-high density SNP-based linkage map for enhancing the pikeperch (Sander lucioperca) genome assembly to chromosome-scale

Article Open access 18 December 2020

Lidia de los Ríos-Pérez, Julien A. Nguinkal, … Dörte Wittenburg

Introduction

From phenotype breeding and molecular marker-assisted selection (MAS) to genome-wide selection (GS), artificial breeding strategies for economic animals have developed rapidly. Along with technological progress, artificial breeding has become increasingly effective, mainly through a more accurate enrichment of economically important quantitative traits and shorter breeding intervals. In 2001, Meuwissen et al.¹ proposed the GS strategy and suggested that genome-wide dense markers should be used to predict genetic value. Further, the GS strategy has been demonstrated to be a popular tool for genetic improvement in livestock, and considerable achievements have been obtained, for example, dairy cattle², chicken³ and pig⁴. However, in aquatic economic animals, especially crustacean species, mainstream breeding strategies in many species still consist of phenotype breeding and MAS. Other than unstable breeding and farming processes (compared with those of livestock), the lack of a high-quality genome and linkage map are two of the main problems that constrain the development of this industry.

With the revolution of sequencing technology, it is more effective to obtain a vast number of markers in a specific species, and thus, a high-resolution genetic linkage map becomes possible. Recently, several high-resolution genetic linkage maps were constructed in economic crustacean species, including Penaeus monodon⁵, E. sinensis⁶, Litopenaeus vannamei⁷, Marsupenaeus japonicus⁸, Portunus trituberculatus⁹. The average marker distances of these maps were all lower than 1 cM, which greatly assists de novo genome assembly and gene location, and further promotes the animal breeding process. Currently, the only crustacean genomes available are those of the water flea (Daphnia pulex)¹⁰, sand flea (Parhyale hawaiensis)¹¹, marbled crayfish (Procambarus virginalis)¹² and Chinese mitten crab (Eriocheir sinensis)¹³. Although most of these genomes were poorly assembled, they provided important insights into the genomic architecture and basic biology of crustacean species.

The mud crab Scylla paramamosain, belongs to the genus Scylla (Family: Portunidae), and is the most common mud crab in China¹⁴. As it is one of the most commercially important crustacean species, many research teams have been drawn to investigate the genetic breeding, aquaculture and basic biology of S. paramamosain. We have been devoted to studying the genetic breeding and aquaculture industry for many years, keeping ahead of the artificial breeding of this crab. Currently, there are no genome assembly datasets for this crab, and only several transcriptome libraries have been reported^{15,16,17,18,19}. The only genetic linkage map for the mud crab was constructed by our team using 212 microsatellite and amplified fragment length polymorphism (AFLP) markers, and the mean spacing was 18.68 cM²⁰. These data cannot meet the needs of the modern breeding strategies for this important species. Therefore, a high-resolution genetic linkage map is needed both the artificial breeding industry and genomics research.

The purpose of this study was to construct a high-resolution genetic linkage map for both QTL mapping, locating genes and assisting with further genome assembly and artificial breeding processes in S. paramamosain. Twenty-six traits of 99 F1 generation individuals were measured to obtain the trait-related loci. Genome survey data and transcriptome data were used in the gene location and growth gene identification analysis.

Materials and Methods

Ethics statement

All animal experiments in this study were conducted in accordance with the relevant national and international guidelines. Our project was approved by the East China Sea Fisheries Research Institute. In China, catching wild mud crabs from seawater does not require specific permits. Our study did not involve endangered or protected species.

Genome survey sequencing and analysis

Genomic DNA of S. paramamosain was extracted from muscle for sequencing. Three pair-end DNA libraries, two with insert sizes of 400–500 bp and one with 250 bp, were constructed following standard Illumina operating procedures.

All raw data were trimmed to filter out low-quality data and adapter contaminant using NGS QC Toolkit²¹. De novo assembly was performed on the clean reads using SOAPdenovo software (http://soap.genomics.org.cn/soapdenovo.html) with the following parameters: the k value in k-mer was set at 45, unsolve repeats by reads and fill gaps in scaffolds.

Mapping family and data collection

The F1 full-sib family for linkage map construction was created by two parents from the wild population of Hainan Province, China. The mapping population was reared at Wei-Er-Si Aquafarming Company of Rudong in 2017. In total, 99 progeny were randomly selected after being reared for 100 days in the same pond. Twenty-six traits, representing sex, carapace, cheliped, pereiopod and swimming stroke, were measured for each individual according to the measurement method described by Keenan et al.²². The measurement data are provided in the Supplementary Table 1.

Muscle tissues were sampled and preserved immediately in liquid nitrogen. Total DNA was extracted using a TIANamp Marine Animal DNA Extraction kit (TIANGEN, Beijing, China). DNA concentration was determined using GeneQuant (Amersham Biosciences Ltd., Piscataway, NJ, USA), and integrity was evaluated via electrophoresis in a 1% agarose gel.

RAD library construction and Illumina HiSeq3000 sequencing

Restriction site-associated DNA marker (RAD) library construction, sample indexing and pooling followed for the natural populations²³. The restriction enzyme EcoRI was used to cut the DNA. A total of 26 multiplexed sequencing libraries were constructed, in which each DNA sample was assigned a unique nucleotide multiplex identifier (MID) for barcoding. Pair-end (125-bp) sequencing was performed using Illumina HiSeq3000 in a total throughput of six lanes.

SNP discovery, filtering, genotyping and validation

Raw sequence reads were trimmed to 110 nucleotides from the 3′ end to ensure that more than 98% of the nucleotides had a quality value above Q30 (equal to 0.1% sequencing error). The trimmed reads were clustered into read tags (hereafter, RAD-tags) by sequence similarity using ustacks²⁴ to produce unique candidate alleles for each RAD locus. A maximum base-pair mismatch of one was allowed in this step for the genetic mapping population. RAD-tags were then collapsed into clusters using ustacks under the default parameters for SNP calling.

Linkage map construction and QTL mapping

Genotype calling refers to the process of determining the genotypes of the SNP loci of each individual after SNP calling has been performed in accordance with Xu et al.²⁵. Then, customized Perl scripts were applied to generate a “.loc” format file, which was the input file for joinmap 4.0²⁶. The genetic map was grouped by joinmap 4.0 with logarithm of the odds (LOD) = 6.0 and the marker order was determined by Lep-MAP 2²⁷. Singular markers were added to the established LGs using the joinSingles module with an LOD score limit of 10 and a minimum difference of 3 between the best LG and the second best LG of each joined marker. Then, markers intervals which are larger than 30 cM were removed, except for the LG32, in which the large interval might be caused by the location of centriole.

QTL mapping of the 26 traits was performed with MapQTL 5.0²⁸. QTL region detection, the percentage of the phenotypic variance explained, and the genotypic information coefficient (GIC) were calculated with the interval QTL mapping model (IM)²⁹. In the QTL mapping step, the LOD threshold for testing the significance of the QTL peaks was calculated using 1,000 permutations for each of the trait data sets and a genome-wide significant level of 5%. For interval distances >1.0 cM, significant thresholds were estimated every 1.0 cM. In this study, a LOD value of ≥3.0 was set as the minimum threshold to indicate a QTL in the present study.

Gene location and growth-related gene identification

RAD-Tag sequences from the linkage map were BLASTed against genomic scaffolds and transcriptome unigenes, and a reference transcriptome was de novo assembled using the raw reads from our previous work^16,17 and some unpublished data using the BLASTn program via an identity value cut-off of 99% and alignment length of 50. For alignment lengths between 30 and 50, manual inspection was performed. The transcriptome data, the genome scaffold data, and tag sequences from linkage map data referred to in this article are provided in Supplementary File 1. RAD-Tag sequences were also annotated via the GenBank nr database, which is hosted by NCBI (http://www.ncbi.nlm.nih.gov/), by blastx with an E-value cut-off of 1.0 × 10⁻⁵.

Results

Genome survey of S. paramamosain

Two 400- to 500-bp and one 250-bp paired-end libraries were constructed for the genome survey analysis, and 147.18 Gb of sequencing data were generated. Raw sequencing data were submitted to the Sequence Read Archive (SRA) database of NCBI with the accession number SRP150472. After QC and filtering, a total of 78.41 Gb of high-quality reads were obtained with a 45.32% GC content. The frequency of 17-mers (nucleotide strings with a length of 17 bp) among the raw sequencing data was calculated, and a k-mer curve was constructed. k-mer analysis revealed that there was a peak at the k-mer length of 60. The genome size was estimated at 1.21 Gb with remarkably high heterozygosity (1.3%), and 55.20% of the genome was estimated to be repetitive³⁰. A total of 16,925,345 contigs with an N50 size of 145 bp were obtained, and of these sequences, 7,027,514 contained at least one simple sequence repeat (SSR), of which 3,900,204 were one-base repeats and 4,446,896 were two-base repeats.

RAD sequencing and genotyping

There were 31 million high-quality reads for the female parent, and on average, 11 million for the offspring. The male parent could not be found because the female parent was captured from the open field sea. The female parent was sequenced at a higher depth (26.90×) than the mean depth of the offspring (8.37×). The statistics for the RAD-tag are summarized in Table 1.

Table 1 The statistical results of the RAD-tag for the female parents and the offspring.

Full size table

A total of 158,138 heterozygous polymorphic SNP markers were detected initially. A missed genotype number above nine on any SNP was filtered. After filtering, 29,069 high-quality SNP markers, that conformed to the expected Mendelian ratios (P ≥ 0.001) remained and were included in the linkage analysis.

Linkage map construction

At the LOD threshold of 6.0, 24,444 SNP loci were grouped into 47 linkage groups in the merged maps (Fig. 1), and 4,625 SNP markers could not be grouped. The 24,444 SNP markers contained three segregation types, including 13,261 lm x ll (54.3%), 9,125 nn x np (37.3%) and 2,058 hk x hk (8.4%). The genotype information for each SNP locus is provided in Supplementary Table 2, and the paternal information was derived from the maternal and F1 data. Maternal and paternal maps were also constructed, with 46 and 47 linkage groups, respectively. The lengths of the maternal, paternal and merged maps were 3,230.70 cM, 3,334.42 cM and 3,087.53 cM, respectively, and the average marker intervals were 1.87 cM, 1.82 cM and 0.92 cM, respectively. The length of the linkage groups in the maternal map varied from 6.12 cM (LG47) to 279.34 cM (LG2), and the length of those in the paternal map ranged from 6.19 cM (LG47) to 312.65 cM (LG8) (Table 2). The estimated genome size of S. paramamosain is 1.21 Gb; thus, the average recombination rate across all linkage groups was 2.55 cM/Mb.

Table 2 Summary of the consensus linkage map in Scylla paramamosain.

Full size table

QTL mapping

The 26 traits of the 99 mapped filial individuals, which included 78 females and 21 males, are listed in Supplementary Table 3. In total, 442 SNP loci, with a LOD > 3, which representing 279 QTLs, corresponded to 24 traits. We divided these 24 traits into five categories: appearance size, head area, cheliped, pereiopod and swimming stroke, and reproductive ability (Fig. 2). The LOD values of these 442 loci ranged from 3.01 to 12.66. Five markers with a LOD value greater than five were associated with seven traits, including three cheliped traits (markers 11.hk_hk_10468 and 11.hk_hk_11304), three pereiopod and swimming stroke traits (markers 11.hk_hk_7558, 105.nn_np_34985 and 11.hk_hk_11304) and one size trait (11.nn_np_24679). Among the 442 markers, 36 markers were linked to more than one trait, suggesting that these traits might be under the same genetic control. Two to 58 markers distributed in one, two, three or four linkage groups were linked to 24 traits. SW had the most linked markers (58), followed by 2PML (55), 3PML (48), and FW (44), whereas CW and ML had only two markers. The markers associated with the 24 traits are summarized in Table 3.

Table 3 Summary of the 24 traits associated QTLs.

Full size table

Twelve markers were linked to sex and distributed in LG10 (7), LG12 (2) and LG9 (3) (Fig. 3a). CL (Fig. 3b), CW, 8CW (Fig. 3c) and BH, four important traits representing the appearance size of the crab, were linked to 19 markers distributed in LG11 (8), LG2 (4), LG12 (3), LG29 (2) and LG41 (2). Five traits (DL, ML, PL, PW and PD), indicating the developmental properties of the cheliped, were linked to 37 markers distributed in LG5 (15), LG26 (8), LG13 (4), LG14 (3), LG3 (3), LG11 (2), LG15 (1) and LG22 (1). Five traits (1PML, 2PML, 3PML, 5PW and 5PL) associated with pereiopod and swimming stroke had 168 markers distributed in 11 linkage groups, represented by LG25 (52), LG2 (30), LG12 (26) and LG5 (21). Six traits that might correlate with the development of the head area (FW, DFMS, FMSH, DFLS, 1DFLS and 2DFLS), had 168 markers distributed in 10 linkage groups, represented by LG25 (43), LG13 (33), LG6 (26) and LG15 (18). AW might have some correlation with reproductive ability, with four markers located in LG11 (3) and LG10 (1).

Gene location and growth gene identification

By means of BLASTing against the nr database, transcriptome and scaffold of the genome survey, 4,271 of the linkage map markers were linked to genes, and the gene numbers distributed in each linkage group ranged from 15 to 210. LG2 has the largest number of located genes, followed by LG3 (204), LG1 (198), LG11(180) and LG4 (171). LG47 has the smallest number of genes, followed by LG44 (16), LG45 (20), LG46 (32), LG41 (37) and LG43 (38) (Supplementary Table 4). Gene numbers were generally positively correlated with the length and markers of linkage groups, with correlation coefficient of 0.818 and 0.823, respectively. Some growth-related genes, including transforming growth factor-beta regulator I, insulin-like androgenic gland hormone and insulin-like growth factor 2 mRNA-binding protein, were located. C-type lectin, which participates in the immune defense process, HR4 nuclear receptor and broad-complex-like protein, which participates in the ecdysone regulation pathway, were also located.

In total, 78 markers have been linked to genes. Among them, 22 markers had annotation information from at least one public database, and others were linked to transcriptome data but lacked annotation information. Of these 22 markers, two were located in the 3′-UTR, seven were in the intron, 11 were in the open reading frame (ORF), and four led to a nonsynonymous mutation; the other two were located in a pseudogene because an abnormal termination codon existed in the translated region (Table 4). 11.hk_hk_4825, a synonymous SNP located in the ORF of a RING finger protein, was a QTL of FW and 5PW. Two RING finger proteins, nhl-1, nesprin-2-like, RNA-directed DNA polymerase from mobile element jockey-like, dedicator of cytokinesis protein 3 isoform X2, carbonic anhydrase 2-like and protein disulfide-isomerase A5-like each had one SNP marker associated with FW (four), DFMS (two) and 2DFLS (one), and these genes or their linked genes might be related to the development of head area. Furthermore, integrator complex subunit 8-like, tenascin-X-like, sialidase-like, BTB/POZ domain-containing protein 3, polypeptide N-acetylgalactosaminyl transferase 1 and cholinephosphotransferase 1 isoform or their linked genes might be related to pereiopod and swimming stroke development. Mitochondrial ribonuclease P protein 3, sedoheptulokinase-like and putative nuclease HARBI1 are three genes with SNP associated with cheliped related traits, and them or their linked genes might be related to cheliped development. In addition, ankyrin repeat and general transcription factor II-I repeat domain-containing protein 2B-like or their linked genes might be related to the appearance size of the crab. GPI mannosyltransferase 4, DNA helicase MCM8-like and kinesin II or their linked genes might be related to reproductive ability, and protein numb isoform X5 or its linked genes might be related to OCS.

Table 4 Statistics of growth-related genes.

Full size table

Discussion

Mud crabs are euryhaline and widely distributed in tropical, subtropical and temperate waters. In China, S. paramamosain is the dominant species and is mainly cultured in southeastern coastal provinces¹⁴. Mud crabs are very popular in China because of their high content of protein, unsaturated fatty acids and trace elements such as vitamins³¹.

Genome sequencing is an important step for deciphering evolutionary status and molecular mechanisms and accelerating genetic improvements in traits of interest in economically important species. The genome size of S. paramamosain was estimated to be 1.21 Gb with remarkably high heterozygosity (1.3%). This finding is inconsistent with previous studies (1.64 pg), in which flow cytometry was used to assess genome size³². Different results for one species from two methods were also found in P. trituberculatus⁹ and L. vannamei⁷. The flow cytometry method is fairly straightforward, but the accuracy is highly dependent on the internal standard and quality of the material used for DNA content measurement³³. Additionally, a comparison between flow cytometry and k-mer analysis methods in the estimation of nine insect species suggested that k-mer analysis is more accurate³⁴. Genome survey analysis will be highly useful for the formulation of sequencing strategies for the mud crab.

Currently, the only reported linkage map of this species was constructed by our team using microsatellite and AFLP markers. In Total, 212 markers were mapped, and the mean spacing of the markers was 18.68 cM²⁰, which cannot fulfill the requirements of the genetic breeding industry. This study reported a high-resolution genetic map of the mud crab with 24,444 SNP markers, a length of 3087.53 cM and a marker interval of 0.92 cM. This map has the highest number of markers compared with that of maps of other crustacean species, including P. trituberculatus (10,963 markers)⁹, M. japonicus (9289 markers)⁸ and L. vannamei (6146 markers)⁷. A large number of markers in the map were at the same location, which means they are completly linked. Therefore, the marker interval should be 0.92 cM instead of 0.13 cM (3087.53/24,444≈0.13). However, the numerous markers will be very useful in QTL mapping, gene location and genome assemble.

Forty-seven linkage groups were obtained in the merged map, and only 46 linkage groups were in the maternal map. The missing linkage group in the maternal map was LG38; this linkage group may not be the sex chromosome because alleles in this linkage were not separated significantly by sex in the F1 offspring. Forty genes were located on LG38, and no sex-determining genes were found in this linkage group. A high-density linkage map suggested a ZW sex determination system in Eriocheir sinensis; but only an ankyrin-2 gene was found on the putative sex chromosome, and the sex determination gene double-sex was located on a putative autosome⁶. This finding suggested that sex determination genes may not exist on sex-linked chromosomes or that there is a different sex determination system in crustaceans. Further, 12-sex linked QTL were identified, and verification in large numbers of wild populations is needed.

Under the conditions of high marker density (24,444 SNPs), the linkage groups should be consistent with the haploid chromosome number. However, the number of linkage groups in the present map seems to be less than the 49 haploid chromosome numbers reported by Wang et al.³⁵ and Chen et al.³⁶ using karyotype method. In fact, fewer number of linkage groups in the linkage map were also found in M. japonicus⁸. In this study, the lack of one linkage group in the maternal map and the lower numbers in the merged map might be mostly due to the relatively lower number of mapping populations or the low enzyme-digestion efficiency in the missing chromosome.

Overall, 4,271 markers were linked to genes, and the gene numbers were generally positively correlated with the length and marker numbers of linkage groups, suggesting that genes were almost evenly distributed in the crab genome. Transforming growth factor-beta regulator I, which is also called nuclear interactor of ARF and Mdm2 (NIAM), activates p53/TP53, causes G1 arrest and collaborates with ARF to suppress proliferation, acted as a growth inhibitor³⁷. Insulin-like androgenic gland hormone (IAG), which is produced by androgenic glands in male crustaceans, is regarded as a key regulator of sex differentiation. Interestingly, studies on IAG in Scylla paramamosain suggested that it is involved in regulating ovarian development and somatic growth³⁸. Insulin-like growth factor 2 mRNA-binding protein binds to the 5′-UTR of the insulin-like growth factor 2 (IGF2) mRNA to regulate IGF2 translation, and thus has an important function in animal development³⁹. C-type lectins are a superfamily of proteins that recognize a broad repertoire of ligands and regulate a diverse range of physiological functions⁴⁰. Indeed, C-type lectins of S. paramamosain were upregulated significantly by Vibrio parahaemolyticus and had an important role in the immune response^41,42. Broad-complex-like protein and HR4 nuclear receptor are two ecdysteroidogenic transcription factors that deliver ecdysone signal, which is also important for development⁴³. Knowing the location of these genes will assist the study of genetic breeding.

The head area of the crab contains many organs, such as the cerebral ganglion and antennary glands. Seven linked annotated genes were associated with this area. Among them, nesprin-2 belongs to a novel family of nuclear and cytoskeletal proteins with rapidly expanding roles as intracellular scaffolds and linkers⁴⁴. Dedicator of cytokinesis protein 3 (DOCK3) is a large protein involved in intracellular signalling networks, which function as activators of small G proteins. DOCK3 is expressed exclusively in the central nervous system of mice and plays an important role in axonal outgrowth and cytoskeleton re-organization⁴⁵.

Six genes associated with pereiopod and swimming stroke traits were identified. Among them, tenascin-X is a glycoprotein that contributes to matrix stability and is possibly involved in collagen fibril formation⁴⁶. A SNP in this gene causes a nonsynonymous mutation, which replaces a hydrophilic amino acid Ser with a hydrophobic amino acid Leu. Sialidase-like, which is a glycoside hydrolase enzyme that cleaves the glycosidic linkages of neuraminic acids⁴⁷, also possesses a nonsynonymous mutation of Ser to Asn. Future studies in large populations are needed to prove the correlation between phenotype and genotype. Polypeptide N-acetylgalactosaminyltransferase 1, which catalyses the initial reaction in O-linked oligosaccharide biosynthesis⁴⁸, and cholinephosphotransferase 1, which catalyses phosphatidylcholine biosynthesis from CDP-choline and plays a central role in the formation and maintenance of vesicular membranes⁴⁹, were also suggested to have a relationship with pereiopod and swimming stroke development.

Three linked annotated genes were suggested to associate with cheliped traits. Among them, mitochondrial ribonuclease P protein 3, which is a part of mitochondrial ribonuclease P (mt-RNase P) that cleaves tRNA molecules at their 5′ -ends⁵⁰, and a nonsynonymous mutation of Trp to Arg was found in this gene. In addition, the putative nuclease HARBI1, which might promote anti-V. alginolyticus infection by participating in regulating phagocytosis, apoptosis, superoxide dismutase activity, PO activity, and THC in Marsupenaeus japonicus, was another gene associated with cheliped traits⁵¹.

The ankyrin repeat was linked to one QTL of 8CW and appeared to be associated with appearance size. Proteins that contain the ankyrin repeat motif can mediate the protein-protein interactions⁵².

Beneath the abdominal plate is the location in which the eggs are held in female crabs and where the ejaculatory duct is found in male crabs, both of which were possibly associated with reproductive ability. Three genes were linked to QTLs of AW, among which GPI mannosyltransferase 4 is involved in glycosylphosphatidylinositol-anchor biosynthesis⁵³. DNA helicase MCM8, forms a complex with MCM9 and is involved in homologous recombination repair following DNA interstrand cross-links, thus playing a key role during gametogenesis⁵⁴.

It is likely that some of the association genes seem to have some relationship with specific traits, which indicated by their function studies in other species. However, it should be noted that the association genes do not mean that they are the causative genes, and in-depth studies are still needed.

In summary, this study performed a genome survey analysis, which revealed that S. paramamosain has an estimated genome size of 1.21 Gb with high heterozygosity (1.3%). Then, a high-resolution linkage map was constructed using 24,444 SNP makers. Forty-seven linkage groups were obtained with a total length of 3087.53 cM and a marker interval of 0.92 cM. With the aid of transcriptome and genome scaffold data, 4,271 markers were linked to genes, including some important growth-related, immune-related and hormone pathway genes. Further, 442 QTLs were identified and found to correspond to 24 traits, and of these, 78 QTLs were linked to genes. Some interesting genes, such as dedicator of cytokinesis protein 3, tenascin-X and DNA helicase MCM8, were believed to have important relationships with specific traits and are worth further exploration.

References

Meuwissen, T. H., Hayes, B. J. & Goddard, M. E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829 (2001).
CAS PubMed PubMed Central Google Scholar
Schaeffer, L. R. Strategy for applying genome-wide selection in dairy cattle. Anim. Breed. Genet. = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie 123, 218–223 (2006).
Article CAS PubMed Google Scholar
Muir, W. M. et al. Genome-wide assessment of worldwide chicken SNP genetic diversity indicates significant absence of rare alleles in commercial breeds. P Natl Acad Sci USA 105, 17312–17317 (2008).
Article ADS CAS Google Scholar
Ibáñez-Escriche, N., Forni, S., Noguera, J. L. & Varona, L. Genomic information in pig breeding: Science meets industry needs. Livest Sci 166, 94–100 (2014).
Article Google Scholar
Robinson, N. A. et al. QTL for white spot syndrome virus resistance and the sex-determining locus in the Indian black tiger shrimp (Penaeus monodon). BMC Genomics 15, 731 (2014).
Article PubMed PubMed Central Google Scholar
Cui, Z. et al. High-density linkage mapping aided by transcriptomics documents ZW sex determination system in the Chinese mitten crab Eriocheir sinensis. Heredity 115, 206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yu, Y. et al. Genome survey and high-density genetic map construction provide genomic and genetic resources for the Pacific White Shrimp Litopenaeus vannamei. Sci Rep 5, 15612 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Lu, X. et al. High-resolution genetic linkage mapping, high-temperature tolerance and growth-related quantitative trait locus (QTL) identification in Marsupenaeus japonicus. Mol Genet Genomic: MGG 291, 1391–1405 (2016).
Article CAS PubMed Google Scholar
Lv, J., Gao, B., Liu, P., Li, J. & Meng, X. Linkage mapping aided by de novo genome and transcriptome assembly in Portunus trituberculatus: applications in growth-related QTL and gene identification. Sci Rep 7, 7874 (2017).
Article ADS PubMed PubMed Central Google Scholar
Colbourne, J. K. et al. The Ecoresponsive Genome of Daphnia pulex. Science 331, 555–561 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Kao, D. et al. The genome of the crustacean Parhyale hawaiensis, a model for animal development, regeneration, immunity and lignocellulose digestion. eLife 5, e20062 (2016).
Article PubMed PubMed Central Google Scholar
Gutekunst, J. et al. Clonal genome evolution and rapid invasive spread of the marbled crayfish. Nature Ecology & Evolution 2, 567–573 (2018).
Article Google Scholar
Song, L. et al. Draft genome of the Chinese mitten crab. Eriocheir sinensis. GigaScience 5, 5 (2016).
Article PubMed Google Scholar
Ma, L.-B., Zhang, F.-Y., Ma, C.-Y. & Qiao, Z.-G. Scylla paramamosain (Estampador) the most common mud crab (Genus Scylla) in China: evidence from mtDNA. Aquac Res 37, 1694–1698 (2006).
Article CAS Google Scholar
Xie, C. P. et al. Transcriptome and Expression Profiling Analysis of the Hemocytes Reveals a Large Number of Immune-Related Genes in Mud Crab Scylla paramamosain during Vibrio parahaemolyticus Infection. PloS one 9, e114500, https://doi.org/10.1371/journal.pone.0114500 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, H. et al. Transcriptome analysis of the mud crab (Scylla paramamosain) by 454 deep sequencing: assembly, annotation, and marker discovery. PloS one 9, e102668, https://doi.org/10.1371/journal.pone.0102668 (2014).
Article ADS PubMed PubMed Central Google Scholar
Ma, H. et al. Identification of transcriptome-derived microsatellite markers and their association with the growth performance of the mud crab (Scylla paramamosain). PloS one 9, e89134, https://doi.org/10.1371/journal.pone.0089134 (2014).
Article ADS PubMed PubMed Central Google Scholar
Gao, J. et al. Transcriptome analysis of the differences in gene expression between testis and ovary in green mud crab (Scylla paramamosain). Bmc Genomics 15, 585 (2014).
Article PubMed PubMed Central Google Scholar
Zhu, F., Qian, X. & Ma, X. Comparative transcriptomic analysis of crab hemocytes in response to white spot syndrome virus or Vibrio alginolyticus infection. Fish Shellfish Immun 80, 165–179 (2018).
Article CAS Google Scholar
Ma, H. Y. et al. First genetic linkage map for the mud crab (Scylla paramamosain) constructed using microsatellite and AFLP markers. Genet Mol Res: GMR 15, gmr. 15026929 (2016).
Patel, R. K. & Jain, M. NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data. PloS one 7, e30619, https://doi.org/10.1371/journal.pone.0030619 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Keenan, C. P., Davie, P. J. & Mann, D. A revision of the genus Scylla de Haan, 1833 (Crustacea: Decapoda: Brachyura: Portunidae). Raffles B Zool 46, 217–245 (1998).
Google Scholar
Baird, N. A. et al. Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers. PloS one 3, e3376, https://doi.org/10.1371/journal.pone.0003376 (2008).
Article ADS PubMed PubMed Central Google Scholar
Catchen, J. M., Amores, A., Hohenlohe, P., Cresko, W. & Postlethwait, J. H. Stacks: Building and Genotyping Loci De novo From Short-Read Sequences. G3: Genes|Genomes|Genetics 1, 171–182 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pei, X. et al. Population genomic analyses from low‐coverage RAD‐Seq data: a case study on the non‐model cucurbit bottle gourd. Plant J 77, 430–442 (2014).
Article Google Scholar
Van Ooijen, J. W. Multipoint maximum likelihood mapping in a full-sib family of an outbreeding species. Genetics Res 93, 343–349 (2011).
Article Google Scholar
Rastas, P., Calboli, F. C. F., Guo, B., Shikano, T. & Merilä, J. Construction of Ultradense Linkage Maps with Lep-MAP2: Stickleback F 2 Recombinant Crosses as an Example. Genome Biol Evol 8, 78–93 (2016).
Article CAS Google Scholar
Ooijen, V. MapQTL, Software for the mapping of quantitative trait loci in experimental populations, in Kyazma B. V. Wageningen, (Netherlands, 2004).
Jansen, R. C. & Stam, P. High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136, 1447–1455 (1994).
CAS PubMed PubMed Central Google Scholar
Zeng, X. et al. The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan Plateau. P Natl Acad Sci USA 112, 1095–1100 (2015).
Article ADS CAS Google Scholar
Li, Y., Ai, C. & Liu, L. Mud Crab, Scylla paramamosain China’s Leading Maricultured Crab. Aquaculture in China: Success Stories and Modern Trends, Chapter 3.4 (2018).
Liu, L. et al. Flow cytometric analysis of DNA content for four commercially important crabs in China. Acta Oceanologica Sinica 35, 7–11 (2016).
Article Google Scholar
Dolezel, J. & Greilhuber, J. Nuclear genome size: are we getting closer? Cytometry. Part A: the journal of the International Society for Analytical Cytology 77, 635–642 (2010).
Article Google Scholar
He, K., Lin, K., Wang, G. & Li, F. Genome Sizes of Nine Insect Species Determined by Flow Cytometry and k-mer Analysis. Front Physiol 7, 569 (2016).
PubMed PubMed Central Google Scholar
Wang, G., Chen H., Li, S., Guan W. & Li, Q. Studies on the Chromosomal Karyotype of Mud Crab, Scylla serrata (Forskal). Marine Sci, 1, 9–13 (2002) (In Chinese with English Abstract).
Chen, X., Wang G., Chen, L. & Li, S. Methodologoical improvement and its application effect in chromosome study of mud crab, Scylla serrata. J Oceanography in Taiwan Strait, 3, 347–353 (2004). (In Chinese with English Abstract).
Tompkins, V. S. et al. A novel nuclear interactor of ARF and MDM2 (NIAM) that maintains chromosomal stability. J Biol Chem 282, 1322–1333 (2007).
Article CAS PubMed Google Scholar
Huang, X., Ye, H., Huang, H., Yang, Y. & Gong, J. An insulin-like androgenic gland hormone gene in the mud crab, Scylla paramamosain, extensively expressed and involved in the processes of growth and female reproduction. Gen Comp Endocrinol 204, 229–238 (2014).
Article CAS PubMed Google Scholar
Nielsen, J. et al. A Family of Insulin-Like Growth Factor II mRNA-Binding Proteins Represses Translation in Late Development. Mol Cell Biol 19, 1262–1270 (1999).
Article CAS PubMed PubMed Central Google Scholar
Brown, G. D., Willment, J. A. & Whitehead, L. C-type lectins in immunity and homeostasis. Nat Rev Immunol 18, 374–389 (2018).
Article CAS PubMed Google Scholar
Wei, X. et al. C-type lectin B (SpCTL-B) regulates the expression of antimicrobial peptides and promotes phagocytosis in mud crab Scylla paramamosain. Dev Comp Immunol 84, 213–229 (2018).
Article CAS PubMed Google Scholar
Jiang, K. et al. Isolation of the C-type lectin like-domain cDNAs from the mud crab Estampador, 1949, and its expression profiles in various tissues, during larval development, and under challenge. Crustaceana 85, 817–834 (2012).
Article Google Scholar
Niwa, Y. S. & Niwa, R. Transcriptional regulation of insect steroid hormone biosynthesis and its role in controlling timing of molting and metamorphosis. Dev Growth Differ 58, 94–105 (2016).
Article CAS PubMed Google Scholar
Rajgor, D. & Shanahan, C. M. Nesprins: from the nuclear envelope and beyond. Expert Reviews in Molecular Medicine 15, e5 (2013).
Article PubMed PubMed Central Google Scholar
Helbig, K. L., Mroske, C., Moorthy, D., Sajan, S. A. & Velinov, M. Biallelic loss-of-function variants in DOCK3 cause muscle hypotonia, ataxia, and intellectual disability. Clin genet 92, 430–433 (2017).
Article CAS PubMed Google Scholar
Egging, D., van den Berkmortel, F., Taylor, G., Bristow, J. & Schalkwijk, J. Interactions of human tenascin-X domains with dermal extracellular matrix molecules. Arch Dermatol Res 298, 389–396 (2007).
Article CAS PubMed Google Scholar
von Itzstein, M. The war against influenza: discovery and development of sialidase inhibitors. Nature reviews. Drug Discov 6, 967–974 (2007).
Article Google Scholar
Wandall, H. H. et al. Substrate specificities of three members of the human UDP-N-acetyl-alpha-D-galactosamine:Polypeptide N-acetylgalactosaminyltransferase family, GalNAc-T1, -T2, and -T3. J Biol Chem 272, 23503–23514 (1997).
Article CAS PubMed Google Scholar
Henneberry, A. L., Wistow, G. & McMaster, C. R. Cloning, genomic organization, and characterization of a human cholinephosphotransferase. J Biol Chem 275, 29808–29815 (2000).
Article CAS PubMed Google Scholar
Holzmann, J. et al. RNase P without RNA: identification and functional reconstitution of the human mitochondrial tRNA processing enzyme. Cell 135, 462–474 (2008).
Article CAS PubMed Google Scholar
Sun, B., Qian, X. & Zhu, F. Molecular characterization of shrimp harbinger transposase derived 1 (HARBI1)-like and its role in white spot syndrome virus and Vibrio alginolyticu sinfection. Fish Shellfish Immun 78, 222–232 (2018).
Article CAS Google Scholar
Li, J., Mahajan, A. & Tsai, M. D. Ankyrin repeat: a unique motif mediating protein-protein interactions. Biochem 45, 15168–15178 (2006).
Article CAS Google Scholar
Taron, B. W., Colussi, P. A., Wiedman, J. M., Orlean, P. & Taron, C. H. Human Smp3p adds a fourth mannose to yeast and human glycosylphosphatidylinositol precursors in vivo. J Biol Chem 279, 36083–36092 (2004).
Article CAS PubMed Google Scholar
Nishimura, K. et al. Mcm8 and Mcm9 form a complex that functions in homologous recombination repair induced by DNA interstrand crosslinks. Mol Cell 47, 511–522 (2012).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by grants from the Program of Science and Technology Commission of Shanghai (No. 15391900900), China Agriculture Research System-48 (No. CARS-48), the Basic Research Fund for State-level Nonprofit Research Institutes of ESCFRI (No. 2014Z01) and Special Program on Agricultural Aspect of Science and Technology Commission of Ningbo (2016C11003).

Author information

Ming Zhao and Wei Wang contributed equally.

Authors and Affiliations

East China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, 300 Jungong Road, Shanghai, 200090, China
Ming Zhao, Wei Wang, Wei Chen, Chunyan Ma, Fengying Zhang, Keji Jiang, Junguo Liu, Le Diao, Heng Qian, Junxia Zhao, Tian Wang & Lingbo Ma

Authors

Ming Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chunyan Ma
View author publications
You can also search for this author in PubMed Google Scholar
Fengying Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Keji Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Junguo Liu
View author publications
You can also search for this author in PubMed Google Scholar
Le Diao
View author publications
You can also search for this author in PubMed Google Scholar
Heng Qian
View author publications
You can also search for this author in PubMed Google Scholar
Junxia Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Tian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lingbo Ma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.M. conceived and supervised the project. L.M., C.M., F.Z. and K.J. supplied the experimental animals. W.W., W.C., M.Z., J.L., L.D., H.Q., J.Z. and T.W. measured the traits and sampled the tissues. M.Z., L.M., W.W. and W.C. contributed to the genome survey analysis, linkage map construction and QTL mapping. M.Z. contributed to the gene location analysis. M.Z., L.M., W.W. wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lingbo Ma.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Dataset 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhao, M., Wang, W., Chen, W. et al. Genome survey, high-resolution genetic linkage map construction, growth-related quantitative trait locus (QTL) identification and gene location in Scylla paramamosain. Sci Rep 9, 2910 (2019). https://doi.org/10.1038/s41598-019-39070-z

Download citation

Received: 19 June 2018
Accepted: 11 January 2019
Published: 27 February 2019
DOI: https://doi.org/10.1038/s41598-019-39070-z

This article is cited by

Effects of berberine hydrochloride on immune response in the crab Charybdis japonica
- Tian-heng Gao
- Ming-ming Han
- Qi-chen Jiang
BMC Genomics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

Ethics statement

Genome survey sequencing and analysis

Mapping family and data collection

RAD library construction and Illumina HiSeq3000 sequencing

SNP discovery, filtering, genotyping and validation

Linkage map construction and QTL mapping

Gene location and growth-related gene identification

Results

Genome survey of S. paramamosain

RAD sequencing and genotyping

Linkage map construction

QTL mapping

Gene location and growth gene identification

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links