Abstract
Recent improvements in next-generation sequencing technologies have enabled investigation of microsatellites on a genome-wide scale. Faced with a huge amount of candidates, the use of appropriate marker selection criteria is crucial. Here, we used the western flower thrips Frankliniella occidentalis for an empirical microsatellite survey and validation; 132,251 candidate microsatellites were identified, 92,102 of which were perfect. Dinucleotides were the most abundant category, while (AG)n was the most abundant motif. Sixty primer pairs were designed and validated in two natural populations, of which 30 loci were polymorphic, stable, and repeatable, but not all in Hardy–Weinberg equilibrium (HWE) and linkage equilibrium. Four marker panels were constructed to understand effect of marker selection on population genetic analyses: (i) only accept loci with single nucleotide insertions (SNI); (ii) only accept the most polymorphic loci (MP); (iii) only accept loci that did not deviate from HWE, did not show SNIs, and had unambiguous peaks (SS) and (iv) all developed markers (ALL). Although the MP panel resulted in microsatellites of highest genetic diversity followed by the SNI, the SS performed best in individual assignment. Our study proposes stringent criteria for selection of microsatellites from a large-scale number of genomic candidates for population genetic studies.
Similar content being viewed by others
Introduction
Molecular markers are useful tools for population genetics, linkage map construction, gene mapping, and phylogenetics analyses1. Microsatellites are widely used among available markers owing to many desirable traits, including a high mutation rate, ease of sample preparation, high applicability to genotyping, and high information content2,3,4. Until recently, the development of microsatellite markers is costly, inefficient, and laborious with conventional methods using microsatellite-enriched libraries5,6. However, current development in next-generation sequencing technologies have made the genome-scale study of non-model organisms feasible. Whole-genome data accessibility makes the development of novel microsatellite markers faster and less costly, and also makes it possible to survey and characterise microsatellites on a genome-wide scale.
The western flower thrips (WFT) Frankliniella occidentalis is an economically important invasive agricultural and horticultural pest. It is native to western North America, but has spread rapidly worldwide through the movement of horticultural material (e.g., cuttings, seedlings, and potted plants) since the late 1970s7. Microsatellite markers are commonly used in invasive species to study relations between genetic diversity and invasion success8,9,10,11,12,13, as well as in the reconstruction of invasion histories14,15. Although nearly 50 microsatellite markers have been developed for WFT16,17,18,19, most were developed from expressed sequence tag (EST) sequences. These EST-SSRs (simple sequence repeats) always have few alleles (e.g.18,19) and are possibly under selection20,21. This combination generates little information content, and negatively influences the estimation of population genetics parameters. Nonetheless, a large number of loci are necessary to obtain reliable admixture inferences22,23, which is to be expected in WFT populations because of this species’ high migration frequency7. The availability of a draft WFT genome provides opportunities to investigate microsatellites in this species’ genome and to develop potential microsatellite markers with less cost, more efficiency and variability, and greater potential success than with previous methods.
Although the development of microsatellite markers is now more accessible, the criteria for what constitutes good microsatellite markers are still controversial. Empirical studies have shown that selecting the most polymorphic markers, as many researchers often do, will generally overestimate genetic diversity24. However, selection strategy choice often has less effect on analyses of population differentiation and clustering24. In this study, we aimed to: (i) identify and characterise the microsatellites in the WFT genome; (ii) develop powerful microsatellite markers to examine the genetic structure, genetic diversity, and invasive spread pattern of WFT; and (iii) investigate the selection process for microsatellite markers used in population genetics research.
Results
WFT genome microsatellite characteristics
A total of 132,251 candidate microsatellites were identified from 4,151 WFT genome scaffolds, 92,102 of which were of the perfect type. The total relative abundance was 322.01 loci/Mb. The total relative density was 9.07 Kb/Mb, i.e., ca. 1% of the WFT sampled genome is constructed of microsatellites. Dinucleotides were the most abundant category, having highest density, followed by penta-nucleotides and tri-nucleotides (Table S1). The microsatellite counts decreased as the length of the microsatellites increased, while the rate of decrease of dinucleotides was relatively slower than so of other categories (Fig. 1). The average microsatellite length was 28.18 bp, while most microsatellites (95.1%) were shorter than 61 bp. The relationship between the length of the microsatellite and the percent of imperfect microsatellites or mismatches was next examined. Microsatellites between 20 bp and 70 bp long were used to analyse perfect versus imperfect type, as microsatellites with lengths <20 bp are always perfect under our search parameters. The results show that mismatches continuously increase as length increases for tri-, tetra-, penta-, and hexa-nucleotide motifs (Fig. 2). However, the number of mismatches in mono- and dinucleotide motifs plateaus out after 27 bp and 33 bp, respectively (the dinucleotide mismatch number increases again after 60 bp). A scatter graph of microsatellite lengths versus the percentage of imperfect microsatellites displays the data, illustrating similar interpretations (Figure S1). Mismatches and the percentage of imperfect microsatellites seem to also be associated with the type of motif. The 16 most abundant microsatellite motifs were (AG)n, (AC)n, (CCCCG)n, (AGC)n, (C)n, (CCG)n, (ACAG)n, (AGG)n, (A)n, (ACGCC)n, (ACC)n, (AGGG)n, (CCCGG)n, (AAC)n, (AGGGG)n, and (ACG)n, accounting for 77.5% of all identified microsatellites (Figure S2).
Development of microsatellite loci
The initial QDD3 program output provided 5,279 primer pairs, of which 4,175 were tri-nucleotide microsatellites and 1,104 were tetra-nucleotide. Following our stringent filter, a total of 381 primers (Appendix S1) flanking perfect microsatellites were retained. We found 305 primer pairs flanking tri-nucleotide microsatellites, and 76 pairs flanking tetra-nucleotide microsatellites. We selected and synthesised 60 primer pairs from different scaffolds for amplification (Appendix S1). In our initial test, 30 primers generated polymorphic genotypes, 23 primers failed to generate amplification in any individual, and seven primers failed in more than two individuals (Appendix S1).
The 30 positive primer pairs were characterised into two WFT populations (Table 1). Each polymorphic locus had four to 17 alleles, with an average of 9.43. The observed heterozygosities (HO) and expected heterozygosities (HE) ranged from 0.1 to 0.875 and 0.348 to 0.895, respectively (Table S2). Significant deviation (after sequential Bonferroni correction25, P < 0.05) from HWE was detected in 15 of 60 locus/population combinations, while two locus (WFT4-S31 and WFT4-S57) significantly deviated in both populations (Table S2). In addition, six pairs of loci in 435 pairs (S08 & S20, S16 & S32, S20 & S32, S16 & S36, S33 & S52, S16 & S57) from the MT population and one pair (S03 & S22) from the YQ population showed significant linkage disequilibrium (LD) (after sequential Bonferroni correction, P < 0.05), while none of the locus pairs were significant considering both populations together. In the LOSITAN analyses, one locus (WFT3-S60) fell into the candidate category for balancing selection, and the remaining loci were grouped into the neutral category (Figure S3).
Performance of four microsatellite panels
Eight loci showed single nucleotide insertions into their alleles (SNI, Table 1). These loci formed the SNI microsatellite panel, while the remaining loci were ranked according to their allele numbers to select the eight most polymorphic markers (MP). Fourteen loci were successfully amplified in all individuals, of which nine did not show SNI. Eight of those nine loci met the criteria for our most stringent strategy (SS) microsatellite panel, except locus WFT3-S01, whose alleles produced ambiguous peaks in GENEMAPPER. We investigated the genetic diversity and structure of two WFT populations using these three panels plus the ALL panel of all developed markers.
The allelic richness (AR) and expected heterozygosity (HE) values were higher for the MT population than for the YQ population in all cases, while both populations have quite similar FIS values (Fig. 3). Comparison of the microsatellite panels for AR and HE in the two populations reveals a decreasing trend in the order of MP, SNI, ALL, and SS values. The MP panel yielded significantly higher AR and HE values than the SS and ALL panels in the YQ population (MP vs. SS: AR, W = 53, P = 0.027; HE, W = 56, P = 0.012; MP vs. ALL: AR, W = 182.5, P = 0.025; HE, W = 176, P = 0.045). A significant difference was also observed in SNI versus SS in the YQ population (HE, W = 8, P = 0.012). No significant differences were found between the remaining panel pairs in the YQ populations nor in all the MT population panel pairs.
Population differentiation (FST) showed an increasing trend in order of MP, SNI, ALL, and SS values. However, the differences were not statistically significant. In the STRUCTURE analyses, all panels obtained their highest Delta K at K = 2, which suggests two distinct clusters. The average posterior probability (Q) was high in all cases (MT: SS, Q = 0.903; ALL, Q = 0.880; MP, Q = 0.779; SNI, Q = 0.885; YQ: SS, Q = 0.942; ALL, Q = 0.984; MP, Q = 0.979; SNI, Q = 0.899). However, some individuals were not correctly assigned to their respective population. There were 1, 2, 4, and 4 individuals incorrectly assigned for the SS, ALL, MP, and SNI microsatellite markers, respectively (Fig. 4). In the principal component analyses, the ALL and SS panels performed better than the MP and SNI panels in assignment of individuals (Figure S4).
Discussion
Genome characteristics
The proportional coverage of different motif types in genomes varies among species26. Dinucleotide motifs are the most abundant category in most cases, while the second most abundant are varied among species26. Mono-, tri-, and tetra-nucleotide microsatellites have higher coverage than penta-nucleotides in most species26, while the penta-nucleotide category is the second most abundant in our study. Furthermore, we found the CCCCG repeat to be the third most abundant microsatellite motif following two dinucleotides (AG, AC) in WFT. The AG repeat is the most frequent microsatellite motif in WFT, while the AC motif is the most frequent in most of other species26,27.
Mismatches are important in the evolution of microsatellites28. Point mutations within the repeat region that cause mismatches will divide the original repeat into shorter ones, which increase locus stability and accelerate the degeneration of long microsatellites27,28. In this study, mismatches showed a positive correlation with the length of the microsatellite in tri-, tetra-, penta-, and hexa-nucleotide motifs (Fig. 2), as previously reported29, although other factors might contribute to the mutation rate between loci29. However, mismatches of mono- and dinucleotide microsatellites with lengths between 30 bp and 60 bp were relatively constant. Additionally, the abundance of dinucleotide microsatellites was also relatively constant from 30 bp to 60 bp, which also had a lower percentage of imperfect microsatellites than other types (Fig. 1 and Figure S1). These long, perfect dinucleotide repeats may be less prone to point mutation than other types, or they may be later acquisitions, still in the expansion stage of their life cycle, and potentially close to confronting interruptions and degeneration30. The mechanism creating these patterns is outside the scope of this study.
Criteria for microsatellite selection
Selecting a set of appropriate markers is first step in many molecular ecology studies. Recent improvements in next-generation sequencing technologies, as well as the ready availability of genomic data, have made it possible to select an appropriate marker panel from a huge set of candidates. However, criteria of marker selection are often not mentioned in these studies, or the focus is on the polymorphism of markers24. Allelic variation is only one characteristic of microsatellite loci, and may insufficiently represent the performance of a locus under genetic study. Queiros et al. suggested that picking the appropriate microsatellite set should primarily take into account the ecological and evolutionary questions studied and selecting the most polymorphic markers will generally overestimate genetic diversity 24. In this study, we compared four marker panel selection criteria for population genetics studies.
The accuracy and stability of genotyping should first be considered prior to determining population genetics parameters. A number of genotyping problems, such as stuttering, split peaks, and low heterozygote peak height ratios4, always lead to high error rates and/or excessive manual correction. Tri- and tetra-nucleotide loci were selected for primer design in this study, because stuttering is less of a problem with them than so with dinucleotide loci. However, two loci (WFT3-S01 and WFT3-S21) showed similarly confusing genotyping profiles, and almost all of the individuals required manual correction. Both loci contain the same motif, (AAT)n, which corroborates with another report regarding genotyping profile problems associated with high A+T content31. This is probable the cause of deviations from HWE in these loci.
Only perfect loci were considered for primer design in our study. Eight of these loci exhibited an SNI. SNI mutation patterns can be complex, involving insertion/deletion (indel) events in flanking regions or may result from an excess of base substitutions32. Although these alleles can be identified manually, it is laborious and inaccurate.
Amplification success rates were considered as an important criterion in our study. Failed amplifications may relate to null alleles (Table S2), which can lower assignment power and affect FST accuracy33,34. Furthermore, complete data are expected to be more informative than deficient.
In our empirical study, the most polymorphic markers (the MP panel) resulted in higher HE and AR than the stringently selected markers (the SS panel), while the SNI panel obtained higher polymorphism levels. However, similar FST values were estimated by different marker panels, indicating that the approach of microsatellite selection will not influence the analysis of population differentiation, as previously reported24. However, the power of assignment was not enhanced in the most polymorphic markers; exactly the opposite was found. The SS panel performed better than the MP and SNI panel in both assignment tests, and even somewhat better than the ALL panel in Bayesian assignment. The low performance in assignment might be caused by high mutation rate in markers from MP and SNI panels, which either leading to genotyping error or deviation of data from the mutation models implemented in the analytical methods.
Conclusions
We examined and developed a large number of microsatellite markers from the WFT at the genome-wide scale. Our analyses revealed several distinct characteristics regarding the distribution of microsatellites in the species. High-quality primers can feasibly be designed and selected as appropriate markers for further research by taking advantage of the abundant microsatellites that can be isolated from genome sequences. We found that stringent criteria considering amplification efficiency, SNI within loci, and population genetics parameters can obviously improve individual assignment analysis, through a comparison of the performance of different marker panels. Our study provides an example for the bulk discovery of microsatellites and the development of stringent marker selection criteria for population genetics studies.
Methods
Sample collection and DNA extraction
A total of 55 adult females of WFT from eight sampled sites were used in the study. Among the specimens, one individual from each of the eight collection sites was used for an initial examination of selected primer pairs, while 47 from two of the collection sites (23 individuals from Mentougou [MT; 116.102 °E, 39.940 °N] and 24 individuals from Yanqing [YQ; 115.975 °E, 40.457 °N]) near Beijing, China, were used for a population-level survey. All samples were stored in 98% alcohol, and frozen at −80 °C. Total genomic DNA was extracted from a single female adult using a DNeasy Blood and Tissue Kit (Qiagen, Germany).
Identification of microsatellites
We used the 4,151 F. occidentalis genome scaffolds (410,700,254 bp) submitted to GenBank by “the Frankliniella occidentalis whole genome shotgun (WGS) project” (accession JMDY01000000, http://www.ncbi.nlm.nih.gov/Traces/wgs/?val = JMDY01#contigs) for microsatellite discovery. Microsatellites were scanned for and identified using the program SciRoKo, with the default fixed penalty motif criteria35. Default parameters included a minimum score of 15 and a mismatch penalty of 5, which identified microsatellites of mono-, di-, tri-, tetra-, penta- and hexa-nucleotide motifs. We also considered possible circular permutation motifs and/or reverse complement motifs as same type36,37. The length of motifs, the length of the microsatellite, the number of mismatches, the relative abundance of the microsatellite (the SSR number per Mb), and the relative density of the microsatellite (the SSR length [in bp] per Mb) were analysed, to characterise microsatellites in the WFT genome.
Primer design
The QDD3 program38 was used to extract microsatellites along with their flanking sequences (300 bp each) from the scaffolds, and to design the primers.
Genotyping accuracy and stability are associated with target region motifs. Dinucleotide microsatellites are prone to polymerase slippage during the amplification process, which may lead to mistyping39,40. This is particularly severe when the quality or concentration of the DNA sample is low41,42, as it often occurs with small insects. Tri- and tetra-nucleotide loci perform better than dinucleotides in this respect43,44. Our primer design criteria are as follows: (i) only tri- and tetra-nucleotide motifs were considered for primer design; (ii) the repeat number needed to be larger than seven; (iii) the primer needed to be between 18 bp and 24 bp long with an optimal length of 20 bp; (iv) the primer annealing temperature had to be between 58 °C and 62 °C with an optimal of 60 °C; and (v) the annealing temperature difference between pairwise primers had to be <4 °C. All other parameters were used at their default settings.
Imperfect (those with repeat region interruptions), compound, and multiple microsatellites within an amplified region may generate complex genotypes. Therefore, output primers were further filtered manually using the following, more stringent, criteria: (i) only perfect (without repeat region interruptions) loci were retained; (ii) the minimum distance between the 3′ end of two primer pairs and their target region had to be >10 bp; and (iii) primers using the ‘A’ design strategy that do not have multiple microsatellites, nanosatellites, or homopolymers in the amplicon were retained.
Polymorphic microsatellite isolation
Sixty primer pairs were selected and synthesised for validation. A universal primer (CAGGACCAGGCTACCGTG) was added to the 5′ end of the forward primers following the method of Blacket et al.45.
We initially tested for primer effectiveness and polymorphism level using one individual from each of the eight different sampling sites. The amplifications were performed using the GoTaq Green Master Mix (Promega, USA) in a final volume of 10 μl. The polymerase chain reaction (PCR) reaction mixture contained three primers: 0.08 μl forward primer, 0.16 μl reverse primer, and 0.32 μl universal primer labelled with fluorescence (FAM, HEX, and ROX sequencing dyes). Our PCR protocol follows: 5 min at 95 °C, followed by 35 cycles of 95 °C for 30 s, 56 °C for 40 s, and 72 °C for 40 s, with a final extension at 72 °C for 15 min. PCR products were analysed on an ABI 3730xl DNA Analyzer (Applied Biosystems, USA) using the GeneScan 500 LIZ size standard (Applied Biosystems, USA). Genotyping was conducted using GENEMAPPER 4.0 (Applied Biosystems, USA). Primer pairs that failed in more than two individuals, or that were monomorphic in eight individuals, or that produced more than two peaks were discarded. Those remaining primer pairs that generated polymorphic genotypes were further analysed in the next step.
For the next step, 48 female adults from two populations were used to analyse the genetic characteristics of each primer pair. The PCR reaction mixture, cycling conditions, and genotyping methods were identical to those of the previous step. We repeated failed amplifications twice to eliminate occasional technical mistakes.
Genetic diversity
The genotyping data of two populations were analysed. The macros Microsatellite Tools46 was used to calculate the number of alleles, and the observed heterozygosity (HO) and expected heterozygosity (HE). Allelic richness and the inbreeding coefficient of an individual relative to the subpopulation (FIS) were obtained using FSTAT 2.9.347. Null allele frequencies for each loci in each population were estimated using FREENA33, with 1,000 bootstrap replicates. The Hardy–Weinberg equilibrium (HWE) at each locus/population pair, the linkage disequilibrium (LD) between each pair of loci within each population, and the pairwise mean population differentiation (FST) were tested and/or calculated using GENEPOP 4.2.148. Putative loci under selection were detected based on the FST outlier method employed in the program LOSITAN49 with two options: “force mean FST” and “neutral mean FST.”.
Population structure
We examined population genetic structure using STRUCTURE 2.3.450. Admixture models with correlated allele frequencies and no prior location information were used for the analyses. The clustering test was replicated 30 times under each K value (from 1 to 5) with 200,000 Markov chain Monte Carlo iterations, after a burn-in of 100,000 iterations. The Delta (K) method51 was used to estimate optimal K values by submitting the Structure output to Structure Harvester Web 0.6.9451. The membership coefficient matrices (Q-matrices) associated with the optimal K values inferred from the previous step were processed using CLUMPP 1.1.252 with its Greedy algorithm, and then visualised with the program DISTRUCT 1.153. A posterior threshold probability (Q-value of individuals obtained with CLUMPP) of 0.60 was used to evaluate whether individuals were correctly assigned to their respective population. A principal component analysis (PCA) test was also performed using the package Adegenet 1.4–254 in R environment55, which does not require assumptions on the Hardy–Weinberg and linkage equilibrium.
Criteria for selection of marker panel
Six factors were taken into account for the selection of our microsatellite markers: HWE, LD, polymorphism, amplification fail rate, single nucleotide insertions (SNI), and peak patterns genotyped in GENEMAPPER. We constructed three microsatellite panels based on three criteria for each locus (Table 1): (i) the most polymorphic markers (MP); (ii) markers showing SNI; and (iii) stringently selected markers (SS), which were those that did not significantly deviate from HWE in all populations, generated amplification in all individuals, did not show SNIs, and had unambiguous GENEMAPPER peaks. Genetic diversity and population structure were estimated using these three microsatellite panels. Finally, the whole set of 30 markers developed in this study (ALL) were analysed. A comparison of the effect of marker selection was performed using the nonparametric Mann–Whitney–Wilcoxon test in the R environment.
Additional Information
How to cite this article: Cao, L.-J. et al. Bulk development and stringent selection of microsatellite markers in the western flower thrips Frankliniella occidentalis. Sci. Rep. 6, 26512; doi: 10.1038/srep26512 (2016).
References
Kalia, R. K., Rai, M. K., Kalia, S., Singh, R. & Dhawan, A. K. Microsatellite markers: an overview of the recent progress in plants. Euphytica 177, 309–334 (2011).
Selkoe, K. A. & Toonen, R. J. Microsatellites for ecologists: a practical guide to using and evaluating microsatellite markers. Ecol. Lett. 9, 615–629 (2006).
Abdelkrim, J., Robertson, B., Stanton, J. A. & Gemmell, N. Fast, cost-effective development of species-specific microsatellite markers by genomic sequencing. BioTechniques 46, 185–192 (2009).
Guichoux, E. et al. Current trends in microsatellite genotyping. Mol. Ecol. Resour. 11, 591–611 (2011).
Squirrell, J. et al. How much effort is required to isolate nuclear microsatellites from plants? Mol. Ecol. 12, 1339–1348 (2003).
Zane, L., Bargelloni, L. & Patarnello, T. Strategies for microsatellite isolation: a review. Mol. Ecol. 11, 1–16 (2002).
Kirk, W. D. J. & Terry, L. I. The spread of the western flower thrips Frankliniella occidentalis (Pergande). Agr. Forest. Entomol. 5, 301–310 (2003).
Genton, B. J., Shykoff, J. A. & Giraud, T. High genetic diversity in French invasive populations of common ragweed, Ambrosia artemisiifolia, as a result of multiple sources of introduction. Mol. Ecol. 14, 4275–4285 (2005).
Dlugosch, K. M. & Parker, I. M. Founding events in species invasions: genetic variation, adaptive evolution, and the role of multiple introductions. Mol. Ecol. 17, 431–449 (2008).
Glemin, S. How are deleterious mutations purged? Drift versus nonrandom mating. Evolution 57, 2678–2687 (2003).
Facon, B. et al. Inbreeding depression is purged in the invasive insect Harmonia axyridis . Curr. Biol. 21, 424–427 (2011).
Bolfikova, B., Konecny, A., Pfaffle, M., Skuballa, J. & Hulva, P. Population biology of establishment in New Zealand hedgehogs inferred from genetic and historical data: conflict or compromise? Mol. Ecol. 22, 3709–3720 (2013).
Konecny, A. et al. Invasion genetics of the introduced black rat (Rattus rattus) in Senegal, West Africa. Mol. Ecol. 22, 286–300 (2013).
Cristescu, M. E. Genetic reconstructions of invasion history. Mol. Ecol. 24, 2212–2225 (2015).
Estoup, A. & Guillemaud, T. Reconstructing routes of invasion using genetic data: why, how and so what? Mol. Ecol. 19, 4113–4130 (2010).
Brunner, P. C. & Frey, J. E. Isolation and characterization of six polymorphic microsatellite loci in the western flower thrips Frankliniella occidentalis (Insecta, Thysanoptera). Mol. Ecol. Notes. 4, 599–601 (2004).
Yang, X.-M., Sun, J.-T., Xue, X.-F., Li, J.-B. & Hong, X.-Y. Invasion genetics of the western flower thrips in China: evidence for genetic bottleneck, hybridization and bridgehead effect. PLoS ONE 7, e34567 (2012).
Yang, X.-M., Sun, J.-T., Xue, X.-F., Zhu, W.-C. & Hong, X.-Y. Development and characterization of 18 novel EST-SSRs from the western flower thrips, Frankliniella occidentalis (Pergande). Int. J. Mol. Sci. 13, 2863–2876 (2012).
Liu, J., Li, Z., Chen, X., Huang, H. & Gui, F. Development of polymorphic EST-SSR markers by sequence alignment in Frankliniella occidentalis (Pergande). J . Asia Pac. Entomol. 17, 581–585 (2014).
Bradbury, D., Smithson, A. & Krauss, S. L. Signatures of diversifying selection at EST-SSR loci and association with climate in natural Eucalyptus populations. Mol. Ecol. 22, 5112–5129 (2013).
Nie, Q., Yue, X., Chai, X., Wang, H. & Liu, B. Three vibrio-resistance related EST-SSR markers revealed by selective genotyping in the clam Meretrix meretrix. Fish shellfish Immun. 35, 421–428 (2013).
Vaughan, L. K. et al. The use of plasmodes as a supplement to simulations: a simple example evaluating individual admixture estimation methodologies. Comput. Stat. Data An. 53, 1755–1766 (2009).
Putman, A. I. & Carbone, I. Challenges in analysis and interpretation of microsatellite data for population genetic studies. Ecol. Evol. 4, 4399–4428 (2014).
Queiros, J. et al. Effect of microsatellite selection on individual and population genetic inferences: an empirical study using cross-specific and species-specific amplifications. Mol. Ecol. Resour. 15, 747–760 (2015).
Rice, W. R. Analyzing tables of statistical tests. Evolution 43, 223–225 (1989).
Meglecz, E., Neve, G., Biffin, E. & Gardner, M. G. Breakdown of phylogenetic signal: a survey of microsatellite densities in 454 shotgun sequences from 154 non model eukaryote species. PLoS ONE 7, e40861 (2012).
Bhargava, A. & Fuentes, F. F. Mutational Dynamics of Microsatellites. Mol. Biotechnol. 44, 250–266 (2010).
Buschiazzo, E. & Gemmell, N. J. The rise, fall and renaissance of microsatellites in eukaryotic genomes. BioEssays 28, 1040–1050 (2006).
Ellegren, H. Microsatellite mutations in the germline:: implications for evolutionary inference. Trends Genet. 16, 551–558 (2000).
Almeida, P. & Penha-Goncalves, C. Long perfect dinucleotide repeats are typical of vertebrates, show motif preferences and size convergence. Mol. Biol. Evol. 21, 1226–1233 (2004).
Brookes, C., Bright, J.-A., Harbison, S. & Buckleton, J. Characterising stutter in forensic STR multiplexes. Forensic. Sci. Int-Gen. 6, 58–63 (2012).
Cao, L. J. et al. Characterization of novel microsatellite markers for Hyphantria cunea and implications for other Lepidoptera. Bull. Entomol. Res. 105, 273–284 (2015).
Chapuis, M. P. & Estoup, A. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 24, 621–631 (2007).
Carlsson, J. Effects of microsatellite null alleles on assignment testing. J. Hered. 99, 616–623 (2008).
Kofler, R., Schlotterer, C. & Lelley, T. SciRoKo: a new tool for whole genome microsatellite search and investigation. Bioinformatics 23, 1683–1685 (2007).
Huang, J. et al. Genome-wide survey and analysis of microsatellites in giant panda (Ailuropoda melanoleuca), with a focus on the applications of a novel microsatellite marker system. BMC Genomics 16, 61 (2015).
Jurka, J. & Pethiyagoda, C. Simple repetitive DNA sequences from primates: compilation and analysis. J. Mol. Evol. 40, 120–126 (1995).
Meglecz, E. et al. QDD version 3.1: a user-friendly computer program for microsatellite selection and primer design revisited: experimental validation of variables determining genotyping success rate. Mol. Ecol. Resour. 14, 1302–1313 (2014).
Taberlet, P., Waits, L. P. & Luikart, G. Noninvasive genetic sampling: look before you leap. Trends Ecol. Evol. 14, 323–327 (1999).
Clarke, L., Rebelo, C., Goncalves, J., Boavida, M. & Jordan, P. PCR amplification introduces errors into mononucleotide and dinucleotide repeat sequences. Molecular Pathology 54, 351 (2001).
Pompanon, F., Bonin, A., Bellemain, E. & Taberlet, P. Genotyping errors: causes, consequences and solutions. Nat. Rev. Genet. 6, 847–846 (2005).
Taberlet, P. et al. Reliable genotyping of samples with very low DNA quantities using PCR. Nucleic Acids Res. 24, 3189–3194 (1996).
Walsh, P. S., Fildes, N. J. & Reynolds, R. Sequence analysis and characterization of stutter products at the tetranucleotide repeat locus vWA. Nucleic Acids Res. 24, 2807–2812 (1996).
Acharige, D., Terrence, S., Chand, V. & Mather, P. B. Development and characterisation of tri-and tetra-nucleotide polymorphic microsatellite markers for skipjack tuna (Katsuwonus pelamis). Ceylon J. Sci. Biol. Sci. 41, 11–17 (2012).
Blacket, M. J., Robin, C., Good, R. T., Lee, S. F. & Miller, A. D. Universal primers for fluorescent labelling of PCR fragments-an efficient and cost-effective approach to genotyping by fluorescence. Mol. Ecol. Resour. 12, 456–463 (2012).
Park, S. D. E. Trypanotolerance in West African Cattle and the Population Genetic Effects of Selection Ph.D. thesis, University of Dublin (2001).
Goudet, J. FSTAT (Version 1.2): A computer program to calculate F-statistics. J. Hered. 485–486 (1995).
Rousset, F. GENEPOP'007: a complete re-implementation of the GENEPOP software for Windows and Linux. Mol. Ecol. Resour. 8, 103–106 (2008).
Antao, T., Lopes, A., Lopes, R. J., Beja-Pereira, A. & Luikart, G. LOSITAN: a workbench to detect molecular adaptation based on a Fst-outlier method. BMC Bioinformatics 9, 323 (2008).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Earl, D. A. & Vonholdt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
Rosenberg, N. A. DISTRUCT: a program for the graphical display of population structure. Mol. Ecol. Notes. 4, 137–138 (2004).
Jombart, T., Devillard, S., Dufour, A. B. & Pontier, D. Revealing cryptic spatial patterns in genetic variability by a new multivariate method. Heredity 101, 92–103 (2008).
R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/ (2013).
Acknowledgements
We thank Zong-Jiang Kang for his help on collection of specimens. This research was supported by the National Basic Research Program (2013CB127600), the Beijing New Star Program on Science and Technology (2010B027), and the National Natural Science Foundation (31472025), all of China.
Author information
Authors and Affiliations
Contributions
S.-J.W. and L.-J.C. conceived and designed the experiments. Z.-M.L., L.-J.C., Z.-H.W. and L.Z. conducted the experiments. L.-J.C. and S.-J.W. analysed the data and wrote the paper. S.-J.W., M.-C., Y.-J.G. and L.-J.C discussed the results.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Cao, LJ., Li, ZM., Wang, ZH. et al. Bulk development and stringent selection of microsatellite markers in the western flower thrips Frankliniella occidentalis. Sci Rep 6, 26512 (2016). https://doi.org/10.1038/srep26512
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep26512
This article is cited by
-
Low levels of genetic differentiation with isolation by geography and environment in populations of Drosophila melanogaster from across China
Heredity (2021)
-
First development and characterization of 27 novel microsatellite markers in the dobsonfly Neoneuromus ignobilis (Megaloptera: Corydalidae) at genome-scale level
Applied Entomology and Zoology (2020)
-
Microsatellite development from genome skimming and transcriptome sequencing: comparison of strategies and lessons from frog species
BMC Genomics (2018)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.