Recently, attention has focused on the use of whole-genome linkage disequilibrium (LD) studies to map common disease genes. Such studies would employ a dense map of single-nucleotide polymorphisms (SNPs) to detect association between a marker and disease based on LD between the marker and a disease-risk gene variant. Construction of SNP maps is currently underway. One key question is the required marker density of such maps. I have used population simulations to estimate the extent of LD around common gene variants in the general human population as well as in isolated populations. Two main conclusions have emerged from these investigations. First, a useful level of LD is unlikely to extend beyond an average distance of approximately 3 kb in the general population, which implies that at least 500,000 SNPs will be required for whole-genome LD mapping studies in samples drawn from this population. Second, the extent of LD is similar in isolated populations unless either their history meets a set of specific criteria, such as a very narrow founding bottleneck and slow early growth, or the frequency of the variant is low (<5%). Another important question concerns the choice of strategy for SNP discovery. I will discuss the impact of different SNP discovery strategies on the properties of the resulting catalogue of SNPs.

figure a