Genome-wide mapping and characterization of microsatellites in the swamp eel genome

Li, Zhigang; Chen, Feng; Huang, Chunhua; Zheng, Weixin; Yu, Chunlai; Cheng, Hanhua; Zhou, Rongjia

doi:10.1038/s41598-017-03330-7

Download PDF

Article
Open access
Published: 09 June 2017

Genome-wide mapping and characterization of microsatellites in the swamp eel genome

Zhigang Li¹,
Feng Chen¹,
Chunhua Huang¹,
Weixin Zheng¹,
Chunlai Yu¹,
Hanhua Cheng¹ &
…
Rongjia Zhou ORCID: orcid.org/0000-0002-9701-8009¹

Scientific Reports volume 7, Article number: 3157 (2017) Cite this article

4129 Accesses
21 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We described genome-wide screening and characterization of microsatellites in the swamp eel genome. A total of 99,293 microsatellite loci were identified in the genome with an overall density of 179 microsatellites per megabase of genomic sequences. The dinucleotide microsatellites were the most abundant type representing 71% of the total microsatellite loci and the AC-rich motifs were the most recurrent in all repeat types. Microsatellite frequency decreased as numbers of repeat units increased, which was more obvious in long than short microsatellite motifs. Most of microsatellites were located in non-coding regions, whereas only approximately 1% of the microsatellites were detected in coding regions. Trinucleotide repeats were most abundant microsatellites in the coding regions, which represented amino acid repeats in proteins. There was a chromosome-biased distribution of microsatellites in non-coding regions, with the highest density of 203.95/Mb on chromosome 8 and the least on chromosome 7 (164.06/Mb). The most abundant dinucleotides (AC)n was mainly located on chromosome 8. Notably, genomic mapping showed that there was a chromosome-biased association of genomic distributions between microsatellites and transposon elements. Thus, the novel dataset of microsatellites in swamp eel provides a valuable resource for further studies on QTL-based selection breeding, genetic resource conservation and evolutionary genetics.

Improved high-quality reference genome of red drum facilitates the processes of resistance-related gene exploration

Article Open access 07 November 2023

Yongshuang Xiao, Jing Liu, … Yuting Ma

Genome- and transcriptome-derived microsatellite loci in lumpfish Cyclopterus lumpus: molecular tools for aquaculture, conservation and fisheries management

Article Open access 17 January 2020

Simo N. Maduna, Adam Vivian-Smith, … Snorre B. Hagen

Automating microsatellite screening and primer design from multi-individual libraries using Micro-Primers

Article Open access 07 January 2022

Filipe Alves, Filipa M. S. Martins, … Antonio Muñoz-Mérida

Introduction

Swamp eel (Monopterus albus) taxonomically belongs to teleosts, the family Synbranchidae of the order Synbranchiformes (Neoteleostei, Teleostei, and Vertebrata). The fish is distributed mainly in China, Korea, Japan, Thailand, Lao, Indonesia, Malaysia, Philippines and India. They are also found in southeastern United States and northern Australia. Swamp eel is an economically important species in southeast Asia for food production. In addition, because of its natural sex reversal characteristic from female via intersex into male during its life cycle and relative small genome size, swamp eel is an ideal model for studies of comparative genomics and sexual differentiation^{1, 2}. Recently, our group has sequenced the whole genome of swamp eel. With the availability of genome sequence resources, it poses a challenge for mining of useful genetic markers and genes in a genome-wide level and utilization of them in genetic improvement of swamp eel.

Microsatellites, also known as, simple sequence repeats (SSRs), are short tandem repeats of 1–6 nucleotides. Microsatellites are observed in almost all known eukaryotic and prokaryotic genomes, and present in both coding and non-coding regions^3,4,5. This makes microsatellite a powerful genetic marker for a variety of applications, such as genetic linkage mapping, population genetics, QTL (quantitative trait loci)-based selection breeding, molecular breeding, and evolutionary studies^6,7,8,9,10. In comparison with other genetic marker systems, such as restriction fragment length polymorphism, random amplified polymorphic DNA, amplified fragment length polymorphism, sequence-related amplified polymorphism, and target region amplification polymorphism, microsatellites are characterized by their high frequency of distribution, co-dominance, reproducibility, and high polymorphism^{11, 12}. Efforts have been made worldwide to compile and develop microsatellite databases in eukaryotes^{13,14,15,16,17}. In teleost fishes, valuable microsatellites and related genetic linkage maps have been characterized^18,19,20,21. Although a few of SSR markers in swamp eel have been reported^22,23,24, a genome-wide characterization of microsatellites remains to be identified in this species.

Recent development in high-throughput DNA sequencing technologies provides new opportunities to promote mining of molecular markers. In this study, taking advantage of the whole genome sequences of swamp eel we obtained recently, we conducted a genome-wide detection of microsatellite sequences. We analyzed distribution of microsatellite motifs (dinucleotides, trinucleotides, tetranucleotides, pentanucleotides and hexanucleotides) in the genome, characterized microsatellites in both coding and noncoding regions. We found that trinucleotide repeats were most abundant microsatellites in coding regions though their low enrichment, and microsatellites were abundant and chromosome-biased in non-coding regions. In particular, a chromosome-biased association of genomic distributions between microsatellites and transposon elements (TEs) was described. The novel set of microsatellites in swamp eel provides a valuable dataset for further studies on QTL-based selection breeding, genetic resource conservation and evolutionary genetics.

Results

Identification of microsatellites in the swamp eel genome

To screen microsatellites in the genome of swamp eel, we searched the genome for all potential microsatellite motifs from dinucleotides, trinucleotides, tetranucleotides, pentanucleotides and hexanucleotides. A total of 99,293 microsatellites were identified with average frequency of 179 microsatellites per megabase of genomic sequences (Supplementary Table 1). Of the 99,293 microsatellites, the dinucleotides were the most abundant (70,456) with a proportion of 70.95%, followed by trinucleotides (13,365; 13.46%), tetranucleotides (4,755; 4.79%), pentanucleotides (1,257; 1.27%), hexanucleotides (85; 0.09%) and compound microsatellites (9,375; 9.44%) (Fig. 1a). In the perfect matched repeats, two classes of microsatellites were divided, based on length of the repeat motifs. A total of 37,777 (42.01%) microsatellites were classified into long and hypervariable class I type (≥20 bp) and the remaining 52,141 (57.99%) microsatellites as variable class II type (12–19 bp) (Fig. 1b). The proportion of different microsatellite motifs is not uniform, particularly in the cases of dinucleotides and trinucleotides. Among dinucleotides, AC/GT motifs (55.24%) were most recurrent, followed by AG/CT (11.30%), AT/AT (4.36%) and CG/CG (0.05%) motifs (Fig. 1c). Among the trinucleotides, AAT/ATT (3.16%) motifs were most abundant followed by AAC/GTT (1.22%) and AGG/CCT (1.17%), whereas CCG/CGG (0.01%) motifs were least (Fig. 1d). Moreover, AAAT, AAAAT and AAAAAT were the most abundant repeats in each class. Analysis of physical location and density of microsatellites on the chromosomes showed that distribution of microsatellites across the chromosomes was uniform with regard to a certain motif type, whereas there were variable densities in different microsatellite types across the chromosomes, for example, 112.56–149.02/Mb in dinucleotides, 22.38–27.17/Mb in trinucleotides, 7.51–9.49/Mb in tetranucleotides, 1.79–2.60/Mb in pentanucleotides, and 0.07–0.26/Mb in hexanucleotides (Fig.1e). Finally, PCR analysis indicated that alleles of microsatellites ranged from 2 to 5 in these repeat types (Fig. 1f).

We also investigated the microsatellite motif distribution with regard to repeat numbers. For all five microsatellite types, microsatellite frequency decreased as the number of repeat units increased, which was more obvious in long than short microsatellite motifs (Fig. 2). Moreover, the mean repeat number in dinucleotides (10.33) was approximately 1.5 times of those in trinucleotides, tetranucleotides, pentanucleotides and hexanucleotides (6.16, 6.44, 6.37, and 6.99 respectively). The trends are similar to those in the human genome²⁵.

Trinucleotide repeats are most abundant microsatellites in coding regions

We investigated distribution of microsatellites in both coding and non-coding regions of the genome. Microsatellites were mainly located in non-coding regions (98,602, 99%), whereas there were approximately 1% (691) of the microsatellites located in coding regions (Fig. 3a). In coding regions, trinucleotide repeats were most abundant microsatellites (~75.5%), followed by dinucleotide repeats (20.3%), which represented amino acid repeats in proteins. For these microsatellites in coding regions, we analyzed their GO annotation by Blast2GO. A total of 375 genes were assigned to the molecular function category (Fig. 3b). Catalytic activity (35.9%) was the most dominant group followed by binding (30.3%). Metabolic process (17.6%) was the most enriched group that were annotated to the biological process category (Fig. 3c). With regard to the cellular component, 37.5% sequences were assigned to the cell part followed by organelle (27.5%), membrane (11.5%) and macromolecular complex (9.6%) (Fig. 3d). To investigate whether particular GO terms were overrepresented in microsatellite-containing genes, we performed an overrepresentation analysis (Fisher’s exact test, available through PANTHER version 11.1²⁶). No GO term was significantly enriched in microsatellite-containing genes compared to all the other genes in the genome (false discovery rate (FDR) = 0.05). In addition, no chromosome-biased distribution of GO enriched genes was detected (FDR = 0.05).

Abundant and chromosome-biased microsatellites in non-coding regions

As most of the microsatellites were located in non-coding regions, we analyzed their distribution patterns in the genome. We found that there was a chromosome-biased distribution of these microsatellites in non-coding regions, with the highest density of 203.95/Mb on chromosome 8 and followed by chromosome 12. The least density of the microsatellites was detected on chromosome 7 (164.06/Mb) (Fig. 3a). For the repeat types, dinucleotide repeats were the most abundant class of microsatellites, particularly on the chromosome 8, whereas the chromosome 7 had the least level of distribution of dinucleotide repeats (Fig. 4a). The most abundant repeat type of dinucleotides (AC)n was mainly located on chromosome 8 (Fig. 4b), whereas the most enriched type of trinucleotides was mainly located on chromosome 1, 4 and 7 (Fig. 4c). These results indicated that there were repeat type- and chromosome-biased distributions of the microsatellites in the genome.

Genomic mapping and their chromosome-biased association of microsatellites with TEs

As the association between microsatellites and TEs in genomes remains elusive^{8, 27,28,29}, we tested their distributions in the swamp eel genome. Sliding window analysis in a genome-wide, using a window of 3 Mb with a step of 100 kb, showed a distinct distribution pattern among chromosomes (Fig. 5a). Thus, we analyzed correlation between microsatellite and TE densities in individual chromosome using the same sliding window parameters. An obvious negative correlation was observed on chromosome 12 (r = −0.83, p = 1.3813E-51) (Fig. 5b) and also on chromosomes 2, 4 and 7, whereas positive correlation was only detected on chromosome 5 (r = 0.256, p = 1.6817E-8) (Fig. 5c). Notably, on the chromosome 11, there were two types of distribution patterns according to TEs numbers in 3 Mb windows. A quadratic function was observed when TEs ≥4000 (Fig. 5d), whereas a linear correlation detected when <4000 (Fig. 5e), which indicated a threshold value of TE numbers associated with microsatellite density on the chromosome 11. A similar quadratic function was also detected on chromosomes 9 and 10. No obvious association was detected on chromosomes 1, 3, 6 and 8. These results suggested a chromosome-biased association between microsatellites and TEs in the genome.

Discussion

Swamp eel is an increasingly emerging model species in biology, in addition to its economic importance in fish production^{1, 2}. Microsatellites that are widely distributed in a genome are important genetic markers for assessing genetic diversity, genetic map construction, comparative genomics, and marker-assisted selection breeding. Characterization of the genome-wide microsatellites in this study, together with SSR markers from previous reports in swamp eel^{22,23,24, 30, 31}, provides a resourceful dataset for genetic improvement of this species, and genomic and evolutionary biology studies.

The genomic data are excellent sources for SSR mining and has been utilized in various species^{9, 25, 32, 33}. In the present study, we identified a total of 99,293 microsatellites based on the whole-genome sequences of swamp eel. The distribution frequency of microsatellites (179/Mb) estimated in the genome is comparable to that documented in buffalo genome (170/Mb)³³, but lower than those in human and mouse³⁴. These differences could be due to the variation in search criteria, sizes of the databases and bioinformatics software tools used in different studies for identification of microsatellites. The most abundant dinucleotide and trinucleotide motifs are AC/GT and AAT/ATT, which are in agreement with those in human²⁵ and buffalos^{7, 33}, but different from those of cattle and goat^{35, 36}. Predominant repeats in various classes are AAT, AGG and AAC in trimers, AAAT, AAAC, and AAAG in tetramers, AAAAT and AAAAC in pentamers and AAAAAT, AAAAAC, AAAAAG and AAAAAG in hexamers. It reflects a prevalence of the A-rich repeats during genome evolution in teleost fishes. The abundance of the repeats is probably influenced by their secondary structures and the effect on DNA replication²⁵ or reflects a genetic adaptation to water environment during fish speciation. Thus, the characterization of the microsatellites in the swamp eel provides a useful resource for further studies in genome evolution in the teleost fish species.

The frequency and density of microsatellites are probably correlated with genome sizes. For example, the microsatellite density is higher in large genomes than in small genomes among mammals³⁷. However, the microsatellite frequency in plants is lower in large genomes than in small genomes³⁸. The distributions of microsatellites also vary in different regions in a genome. It is well known that noncoding regions generally contain more abundant microsatellites than coding regions^{39, 40}. There is no apparent difference of microsatellite contents between intergenic regions and introns⁴¹. In addition, the microsatellite density is higher at the end of chromosome arms than at other regions in human and mouse genomes^{42, 43}. Although the trends for different repeat types are similar between chromosomes within a genome, the density of repeats could vary among different chromosomes of the same species. The density is higher on autosomes than on X chromosomes in mammals (such as humans, mice, and rats) but with exception in Drosophila ⁴⁴. This can be expected, since different chromosomes in a genome have different organizations of genes, euchromatin, and heterochromatin. This variation is due in part to AT/GC content of genomes, with biased toward either high AT or CG. The bias is favorite for enhancement of expansion through slippage during DNA replication⁴⁵.

Transposition often generates genetic variations, and microsatellites are probably associated with relevant elements^46,47,48. Alu elements are widely distributed in the human genome, representing more than 10% of its total size. Since Alu repeats contain a poly(A) tail and a central linker region rich in adenines, there is a certain extent of association with A-rich microsatellites. A significant association was observed between the 3′ end of Alu sequences, not only with (A)n mononucleotide repeats but also with (AAC)n, (AAT)n, and A-rich tetra- to hexanucleotide repeats, moreover, this association was weaker with (AT)n dinucleotide repeats⁴⁹. The (AC)n dinucleotide repeats were preferentially associated with Alu elements, 75% of them were at the 3′ end of the elements, while the remainder were in the central linker region⁴⁶. However, a high density of transposable elements does not always coincide with a high density of microsatellites. For example, analysis in five complete plant genomes showed that microsatellites were preferentially located in unique regions of the genomes and exhibited a lack of association with transposon-rich regions³⁸. It was hypothesized that microsatellite can be derived from TEs and the opposite evolutionary direction may occur^{50, 51}. The direction of transition from TE to microsatellite might depend on transposition rate with an optimal value and the opposite transition is linked to recombination rate^{50, 51}. A chromosome-biased association between microsatellites and TEs in our study is presumably at least partially related to their distant contact and recombination behavior of chromosomes. A chromosome-biased association between microsatellites and TEs in the fish genome observed in this study provides a new layer in understanding of complexity of these repeats in genome structure and evolution.

Microsatellites are closely related to genome stability and regulations of gene expression, expansions of which are risk factors of many genetic disorders in human, such as fragile X syndrome⁵², Huntington’s disease⁵³ and myotonic dystrophy⁵⁴. In fishes, a microsatellite marker, (GT)ntt(GT)n, in the 3′ untranslated regions of rtp3 is significantly associated with nervous necrosis virus disease resistance⁵⁵. Whether there is a particular GO term enrichment in microsatellite-associated genes is an interesting issue. In our study, gene ontology annotation of microsatellite-containing genes revealed that these genes were involved in various aspects of biological activities in swamp eel. In line with this, no GO term was overrepresented in the microsatellite-containing genes compared to total genomic genes. Similar results were reported in functional annotation of microsatellite-containing genes in Acipenser fulvescens ⁵⁶ and Carcharodon carcharias ⁵⁷. In addition, GO term was not enriched in particular chromosome either. Nevertheless, both microsatellites and TEs are associated with three-dimensional chromosome architecture^{58, 59}. Some G-rich TEs and microsatellites can form structures made of four DNA strands known as G-quadruplexes contributing to change in chromatin status, transcription enhancement/inhibition and the evolution of regulatory networks^{58, 59}.

Materials and Methods

Animals and ethics statement

Swamp eels (Monopterus albus) were obtained from markets in Wuhan, China. All animal experiments and methods were performed in accordance with the relevant approved guidelines and regulations, as well as under the approval of the Ethics Committee of Wuhan University.

Screening and identification of microsatellites

Genomic sequences of swamp eels were sequenced by our lab (DDBJ/EMBL/GenBank under the accession AONE00000000). The Perl script MIcroSAtelitte (MISA, http://pgrc.ipk-gatersleben.de/misa/) was used to identify microsatellites in the genomes. The genomic sequence data were loaded into a local pool. The configuration file was written in an independent text document named as “misa.in” and was placed in the same folder with the Perl script named as “misa.pl”. The sequence of each chromosome was screened for potential motif repeats by calling the genomic sequence data file and the configuration file. To identify the presence of microsatellites, only 2 to 6 nucleotides motifs were considered, and the minimum repeat unit was defined as 6 for dinucleotide repeats, 5 for trinucleotide repeats, 4 for tetranucleotides, and 3 for pentanucleotides and hexanucleotides. Compound microsatellites were defined as ≥2 microsatellites interrupted by ≤100 bases³³.

PCR amplification

Total genomic DNA was isolated from gonad samples by previous method⁶⁰. Primers were designed by Primer3 software⁶¹. PCR amplification was conducted in 25 μl reactions containing 50 ng of template DNA, 2.5 mM MgCl₂, 2.5 μl 10 × PCR buffer, 0.5 mM each primer, 0.5 U Taq DNA polymerase, and 2.5 mM dNTPs. Primer sequences were listed in Supplementary Table 2. The PCR cycling profile was 95 °C for 5 min, 35 cycles at 94 °C for 30 s, 60 °C for 30 s, 72 °C for 30 s, and a final extension at 72 °C for 5 min.

Functional assignments of the transcripts containing microsatellites

To assign putative functions to the microsatellite-containing transcripts, Blast2go program was run locally to BLAST against a reference database that stores UniProt entries and their associated Gene Ontology (GO)⁶². The GO categorization results were expressed as three independent hierarchies for biological process, cellular component and molecular function (http://www.geneontology.org/). GO term overrepresentation was analyzed by PANTHER version 11.1²⁶ (http://www.pantherdb.org/).

TE analysis

TE elements were analyzed using previous methods⁶³. To analyze density of TEs in swamp eel genome, we combined homology-based and de novo approaches. The homology-based approach utilized database Repbase (release 22.01) with RepeatMasker (version 4.0.6)⁶⁴. The de novo approach utilized two prediction programs (RepeatModeler version 1.0.8⁶⁵ and LTR-FINDER version 1.0.5) to build the de novo repeat libraries based on the genome sequences. The multicopy genes and contaminations were removed from the libraries. Then, the RepeatMasker was used again to find repeats in these repetitive sequence libraries. Finally, we combined all the results generated by these methods and analyzed the density of TEs in the genome.

Circos program

The Circos program (http://circos.ca) was applied to draw the circos maps. Genomic sequences were assembled into chromosomes. Mapping of transposon elements (TEs) and microsatellites onto chromosomes was performed by calling “circos.conf” files containing locus information. The densities of microsatellites and TEs were described as numbers in a sliding window of 3 Mb with a step of 100 kb.

Ethics Approval

All animal experiments and methods were performed in accordance with the relevant approved guidelines and regulations, as well as under the approval of the Ethics Committee of Wuhan University.

References

Cheng, H., Guo, Y., Yu, Q. & Zhou, R. The rice field eel as a model system for vertebrate sexual development. Cytogenet. Genome Res. 101, 274–277 (2003).
Article CAS PubMed Google Scholar
Zhou, R., Cheng, H. & Tiersch, T. R. Differential genome duplication and fish diversity. Rev. Fish Biol. Fish. 11, 331–337 (2002).
Article Google Scholar
Tautz, D. & Renz, M. Simple sequences are ubiquitous repetitive components of eukaryotic genomes. Nucleic Acids Res. 12, 4127–4138 (1984).
Article CAS PubMed PubMed Central Google Scholar
Morgante, M. & Olivieri, A. M. PCR-amplified microsatellites as markers in plant genetics. Plant J. 3, 175–182 (1993).
Article CAS PubMed Google Scholar
Toth, G., Gaspari, Z. & Jurka, J. Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res. 10, 967–981 (2000).
Article CAS PubMed PubMed Central Google Scholar
Agarwal, M., Shrivastava, N. & Padh, H. Advances in molecular marker techniques and their applications in plant sciences. Plant Cell Rep. 27, 617–631 (2008).
Article CAS PubMed Google Scholar
Deng, T. et al. De Novo Transcriptome Assembly of the Chinese Swamp Buffalo by RNA Sequencing and SSR Marker Discovery. PLoS One 11, e0147132 (2016).
Article PubMed PubMed Central Google Scholar
Wang, Q. et al. Genome-wide mining, characterization, and development of microsatellite markers in gossypium species. Sci. Rep. 5, 10638 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhu, H. et al. Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis. BMC Genomics 17, 557 (2016).
Article PubMed PubMed Central Google Scholar
Stolle, E., Kidner, J. H. & Moritz, R. F. Patterns of evolutionary conservation of microsatellites (SSRs) suggest a faster rate of genome evolution in Hymenoptera than in Diptera. Genome Biol. Evol. 5, 151–162 (2013).
Article PubMed PubMed Central Google Scholar
Gupta, P. K. & Varshney, R. K. The development and use of microsatellite markers for genetic analysis and plant breeding with emphasis on bread wheat. Euphytica 113, 163–185 (2000).
Article CAS Google Scholar
Powell, W., Machray, G. C. & Provan, J. Polymorphism revealed by simple sequence repeats. Trends Plant Sci. 1, 215–222 (1996).
Article Google Scholar
Subramanian, S. et al. SSRD: simple sequence repeats database of the human genome. Comp. Funct. Genomics 4, 342–345 (2003).
Article CAS PubMed PubMed Central Google Scholar
Sakai, T. et al. Update of mouse microsatellite database of Japan (MMDBJ). Exp. Anim. 53, 151–154 (2004).
Article CAS PubMed Google Scholar
Boby, T., Patch, A. M. & Aves, S. J. TRbase: a database relating tandem repeats to disease genes for the human genome. Bioinformatics 21, 811–816 (2005).
Article CAS PubMed Google Scholar
Aishwarya, V., Grover, A. & Sharma, P. C. EuMicroSatdb: a database for microsatellites in the sequenced genomes of eukaryotes. BMC Genomics 8, 225 (2007).
Article PubMed PubMed Central Google Scholar
Nagpure, N. S. et al. FishMicrosat: a microsatellite database of commercially important fishes and shellfishes of the Indian subcontinent. BMC Genomics 14, 630 (2013).
Article CAS PubMed PubMed Central Google Scholar
Knapik, E. W. et al. A microsatellite genetic linkage map for zebrafish (Danio rerio). Nat. Genet. 18, 338–343 (1998).
Article CAS PubMed Google Scholar
Gilbey, J., Verspoor, E., McLay, A. & Houlihan, D. A microsatellite linkage map for Atlantic salmon (Salmo salar). Anim. Genet. 35, 98–105 (2004).
Article CAS PubMed Google Scholar
Liu, F. et al. A microsatellite-based linkage map of salt tolerant tilapia (Oreochromis mossambicus x Oreochromis spp.) and mapping of sex-determining loci. BMC Genomics 14, 58 (2013).
Article CAS PubMed PubMed Central Google Scholar
Nikolic, N., Feve, K., Chevalet, C., Hoyheim, B. & Riquet, J. A set of 37 microsatellite DNA markers for genetic diversity and structure analysis of Atlantic salmon Salmo salar populations. J. Fish Biol. 74, 458–466 (2009).
Article CAS PubMed Google Scholar
Zhuo, Y., Hu, H., Zhang, L. & Shu, M. Microsatellite Analysis of Genetic Diversity of Monopterus albus Along the Middle and Lower Reaches of Yangtze River Basin. Biotech. Bull. 187–192 (2011).
Lu, S., Liu, S., Liu, H., Liu, Z. & Liu, Y. Screening of microsatellite primer and its application to conservation genetics of Monopterus albus. J. Fish China 29, 612–618 (2005).
CAS Google Scholar
Liu, Z., Luo, X., Lu, S., Kuang, G. & Zhang, J. Screening of Microsatellite and Genetic Polymorphism of Monopterus albus Populations with Different Gender Phenotype. Jiangsu J. of Agr. Sci. 25, 333–338 (2009).
Google Scholar
Subramanian, S., Mishra, R. K. & Singh, L. Genome-wide analysis of microsatellite repeats in humans: their abundance and density in specific genomic regions. Genome Biol. 4, R13 (2003).
Article PubMed PubMed Central Google Scholar
Mi, H. et al. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. 45, D183–D189 (2017).
Article PubMed Google Scholar
Schlotterer, C. Evolutionary dynamics of microsatellite DNA. Chromosoma 109, 365–371 (2000).
Article CAS PubMed Google Scholar
Reichwald, K. et al. High tandem repeat content in the genome of the short-lived annual fish Nothobranchius furzeri: a new vertebrate model for aging research. Genome Biol. 10, R16 (2009).
Article PubMed PubMed Central Google Scholar
Bouneau, L. et al. An active non-LTR retrotransposon with tandem structure in the compact genome of the pufferfish Tetraodon nigroviridis. Genome Res. 13, 1686–1695 (2003).
Article CAS PubMed PubMed Central Google Scholar
Li, W. T., Liao, X. L., Yu, X. M., Cheng, L. & Tong, J. Isolation and characterization of polymorphic microsatellites in a sex-reversal fish, rice field eel (Monopterus albus). Mol. Ecol. Notes 7, 705–707 (2007).
Article CAS Google Scholar
Lei, L., Feng, L., Jian, T. R. & Yue, G. H. Characterization and multiplex genotyping of novel microsatellites from Asian swamp eel, Monopterus albus. Conservation Genetics Resources 4, 363–365 (2011).
Article Google Scholar
Wang, Y. et al. Genome-wide distribution comparative and composition analysis of the SSRs in Poaceae. BMC Genet. 16, 18 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sarika, A. V., Iquebal, M. A., Rai, A. & Kumar, D. In silico mining of putative microsatellite markers from whole genome sequence of water buffalo (Bubalus bubalis) and development of first BuffSatDB. BMC Genomics 14, 43 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sharma, P. C., Grover, A. & Kahl, G. Mining microsatellites in eukaryotic genomes. Trends Biotechnol. 25, 490–498 (2007).
Article CAS PubMed Google Scholar
Yan, Q. et al. Identification of microsatellites in cattle unigenes. J. Genet. Genomics 35, 261–266 (2008).
Article PubMed Google Scholar
Liu, H., Wang, T., Wang, J., Quan, F. & Zhang, Y. Characterization of Liaoning cashmere goat transcriptome: sequencing, de novo assembly, functional annotation and comparative analysis. PLoS One 8, e77062 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ellegren, H. Microsatellites: simple sequences with complex evolution. Nat. Rev. Genet. 5, 435–445 (2004).
Article CAS PubMed Google Scholar
Morgante, M., Hanafey, M. & Powell, W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat. Genet. 30, 194–200 (2002).
Article CAS PubMed Google Scholar
Hancock, J. M. The contribution of slippage-like processes to genome evolution. J. Mol. Evol. 41, 1038–1047 (1995).
Article CAS PubMed Google Scholar
Hancock, J. M. Simple sequences and the expanding genome. Bioessays 18, 421–425 (1996).
Article CAS PubMed Google Scholar
Weber, J. L. Informativeness of human (dC-dA)n.(dG-dT)n polymorphisms. Genomics 7, 524–530 (1990).
Article CAS PubMed Google Scholar
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article ADS CAS PubMed Google Scholar
Sequencing, M. G. C. et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
Article Google Scholar
Bachtrog, D., Weiss, S., Zangerl, B., Brem, G. & Schlotterer, C. Distribution of dinucleotide microsatellites in the Drosophila melanogaster genome. Mol. Biol. Evol. 16, 602–610 (1999).
Article CAS PubMed Google Scholar
Tian, X., Strassmann, J. E. & Queller, D. C. Genome nucleotide composition shapes variation in simple sequence repeats. Mol. Biol. Evol. 28, 899–909 (2011).
Article CAS PubMed Google Scholar
Arcot, S. S., Wang, Z., Weber, J. L., Deininger, P. L. & Batzer, M. A. Alu repeats: a source for the genesis of primate microsatellites. Genomics 29, 136–144 (1995).
Article CAS PubMed Google Scholar
Ramsay, L. et al. Intimate association of microsatellite repeats with retrotransposons and other dispersed repetitive elements in barley. Plant J. 17, 415–425 (1999).
Article CAS PubMed Google Scholar
Temnykh, S. et al. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 11, 1441–1452 (2001).
Article CAS PubMed PubMed Central Google Scholar
Nadir, E., Margalit, H., Gallily, T. & Ben-Sasson, S. A. Microsatellite spreading in the human genome: evolutionary mechanisms and structural implications. Proc. Natl. Acad. Sci. USA 93, 6470–6475 (1996).
Article ADS CAS PubMed PubMed Central Google Scholar
Biscotti, M. A., Olmo, E. & Heslop-Harrison, J. S. Repetitive DNA in eukaryotic genomes. Chromosome Res. 23, 415–420 (2015).
Article CAS PubMed Google Scholar
Meštrović, N. et al. Structural and functional liaisons between transposable elements and satellite DNAs. Chromosome Res. 23, 583–596 (2015).
Article PubMed Google Scholar
Verkerk, A. J. et al. Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell 65, 905–914 (1991).
Article CAS PubMed Google Scholar
Manley, K., Shirley, T. L., Flaherty, L. & Messer, A. Msh2 deficiency prevents in vivo somatic instability of the CAG repeat in Huntington disease transgenic mice. Nat. Genet. 23, 471–473 (1999).
Article CAS PubMed Google Scholar
Brook, J. D. et al. Molecular basis of myotonic dystrophy: expansion of a trinucleotide (CTG) repeat at the 3′ end of a transcript encoding a protein kinase family member. Cell 69, 385 (1992).
CAS PubMed Google Scholar
Liu, P. et al. Characterization of a novel disease resistance gene rtp3 and its association with VNN disease resistance in Asian seabass. Fish Shellfish Immunol. 61, 61–67 (2017).
Article CAS PubMed Google Scholar
Doyle, J. M. et al. Microsatellite analyses across three diverse vertebrate transcriptomes (Acipenser fulvescens, Ambystoma tigrinum, and Dipodomys spectabilis). Genome 56, 407–414 (2013).
Article CAS PubMed Google Scholar
Richards, V. P., Suzuki, H., Stanhope, M. J. & Shivji, M. S. Characterization of the heart transcriptome of the white shark (Carcharodon carcharias). BMC Genomics 14, 697 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kejnovsky, E., Tokan, V. & Lexa, M. Transposable elements and G-quadruplexes. Chromosome Res. 23, 615–623 (2015).
Article CAS PubMed Google Scholar
Maslova, A., Zlotina, A., Kosyakova, N., Sidorova, M. & Krasikova, A. Three-dimensional architecture of tandem repeats in chicken interphase nucleus. Chromosome Res. 23, 625–639 (2015).
Article CAS PubMed Google Scholar
He, Y. et al. Gonadal apoptosis during sex reversal of the rice field eel: implications for an evolutionarily conserved role of the molecular chaperone heat shock protein 10. J. Exp. Zool. B Mol. Dev. Evol. 314, 257–266 (2010).
PubMed Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365–386 (2000).
CAS PubMed Google Scholar
Schmid, R. & Blaxter, M. L. Annot8r: GO, EC and KEGG annotation of EST datasets. BMC Bioinformatics 9, 180 (2008).
Article PubMed PubMed Central Google Scholar
Yi, M. et al. Rapid evolution of piRNA pathway in the teleost fish: implication for an adaptation to transposon diversity. Genome Biol. Evol. 6, 1393–1407 (2014).
Article CAS PubMed PubMed Central Google Scholar
Huda, A. & Jordan, I. K. Analysis of transposable element sequences using CENSOR and RepeatMasker. Methods Mol. Biol. 537, 323–336 (2009).
Article CAS PubMed Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21(Suppl 1), i351–358 (2005).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China, National Key Technologies R&D Program and Hubei Province Science and Technology project.

Author information

Authors and Affiliations

Hubei Key Laboratory of Cell Homeostasis, Laboratory of Molecular and Developmental Genetics, College of Life Sciences, Wuhan University, Wuhan, 430072, P. R. China
Zhigang Li, Feng Chen, Chunhua Huang, Weixin Zheng, Chunlai Yu, Hanhua Cheng & Rongjia Zhou

Authors

Zhigang Li
View author publications
You can also search for this author in PubMed Google Scholar
Feng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chunhua Huang
View author publications
You can also search for this author in PubMed Google Scholar
Weixin Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Chunlai Yu
View author publications
You can also search for this author in PubMed Google Scholar
Hanhua Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Rongjia Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: R.Z. and H.C. Performed the experiments: L.Z., F.C., C.H., W.Z., C.Y. Analyzed the data: Z.L. Wrote the paper: Z.L., RZ.

Corresponding authors

Correspondence to Hanhua Cheng or Rongjia Zhou.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information.pdf

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Z., Chen, F., Huang, C. et al. Genome-wide mapping and characterization of microsatellites in the swamp eel genome. Sci Rep 7, 3157 (2017). https://doi.org/10.1038/s41598-017-03330-7

Download citation

Received: 14 February 2017
Accepted: 26 April 2017
Published: 09 June 2017
DOI: https://doi.org/10.1038/s41598-017-03330-7

This article is cited by

Cross-species Exon Capture and Whole Exome Sequencing: Application, Utility and Challenges for Genomic Resource Development in Non-model Species
- T. Jackson
- E. Ishengoma
- C. Rhode
Marine Biotechnology (2021)
Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome
- Emilio García
- Ismael Cross
- Laureana Rebordinos
Scientific Reports (2019)
Genetic diversity and population structure of the threatened chocolate mahseer (Neolissochilus hexagonolepis McClelland 1839) based on SSR markers: implications for conservation management in Northeast India
- Lata Sharma
- Shahnawaz Ali
- Veena Pande
Molecular Biology Reports (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.