The widespread use of microsatellite markers in genome-wide association (GWA) studies, as supported by Bahram and Inoko, currently faces several obstacles. These include determining genome-wide coverage, developing high-throughput genotyping platforms and analysis issues.

Coverage has been well studied for SNP sets because it determines the power of a GWA study1. Bahram and Inoko state that “...the average length of linkage disequilibrium (LD) for microsatellites is 100 kb.” This does not adequately address the coverage issue for microsatellite markers. First, the 30,000 available microsatellites are unlikely to be evenly distributed across the genome; therefore, simply by chance, some areas will not be covered. Second, LD varies across the genome, necessitating additional markers in low LD regions. Third, the measure of LD that is used by Bahram and Inoko in their estimate of 30,000 markers, its expected magnitude, and the population in which it was measured are unclear; all are needed to determine coverage and power. Fourth, it is unknown how well LD between microsatellites reflects LD among SNPs. Microsatellites have considerably higher mutation rates compared with SNPs, and could therefore represent younger alleles that might not capture older SNP alleles. To determine the ability of the 30,000 available microsatellite markers to comprehensively cover the genome, they should be genotyped on the existing HapMap samples.

Another concern is the availability of high-throughput genotyping platforms. Bahram and Inoko cite two studies that have used more than 20,000 microsatellite markers to conduct association testing, both of which used pooled DNA to facilitate the genotyping of such a large number of markers. Pooling reduces the effort and cost of genotyping, but can introduce additional errors in estimating allele frequencies. This approach reduces statistical power and limits the size of detectable allele-frequency differences2. Many SNP-based GWA studies have instead decreased genotyping costs through a multistage approach3,4. Cheaper and more reliable high-throughput microsatellite genotyping would be required to make such a multistage approach equally attractive for microsatellite markers. Furthermore, microsatellite markers are not immune to interference from copy-number polymorphisms5, a problem that is only now being addressed on large-scale SNP genotyping platforms.

Bahram and Inoko correctly point out that testing SNP haplotypes in GWA studies has several drawbacks, including the incorrect assignment of haplotype blocks. Multimarker tests can be conducted without defining blocks, and the first published GWAs reported the results of SNPs that were tested individually3,6,7. Although there is a significant multiple-testing burden when testing hundreds of thousands of SNPs or their haplotypes, testing each of the 6–10 alleles of 30,000 microsatellites will require large numbers of both tests and degrees of freedom for each test. Assuming that a set of microsatellite markers provides the same level of coverage as a dense SNP set, gains in statistical power or reductions in sample size are likely to be limited8.

Finally, extended LD complicates the analysis and fine-scale localization of an association signal9,10,11,12. A study that utilizes a sparse set of microsatellite markers instead of SNPs in the initial genotyping phase will require a larger number of SNPs to be genotyped for fine-scale mapping compared with studies that use a dense SNP platform.