Introduction

It has been cogently argued that population-based genetic association studies will have a greater power than linkage studies to localize genes contributing moderately or only a little to the phenotype of complex diseases.1 However, the detection of association, or linkage disequilibrium between a genetic marker and a disease locus in outbred populations is only possible over small genetic distances.2, 3, 4, 5, 6 For screening large genomic regions, or even comprehensive whole-genome association studies, this implies that hundreds or thousands of markers have to be genotyped for each subject. Such studies are barely feasible using currently available genotyping technology.

Pooling of DNA samples for genetic marker analysis is a method to reduce the amount of genotyping required in allelic association studies.7, 8, 9, 10, 11, 12, 13, 14, 15, 16 This technique involves combining equal amounts of DNA from patients and controls into separate pools, and comparing the pools for differences in allele distributions of genetic markers. In the absence of haplotype information, which is the situation encountered in a typical association study based on pooled case–control comparisons, the biallelic variation of single-nucleotide polymorphisms (SNPs) contains far less polymorphic information than microsatellite markers. Therefore, microsatellites provide a more powerful tool on a marker-by-marker basis than SNPs.17, 18 However, in the case of microsatellite markers, the overall genotype patterns of pooled samples are often distorted by PCR artefacts such as stutter and preferential amplification, which prevent an accurate determination of the allele frequencies by simple procedures. Several methods have been proposed to handle these artefacts. Some studies compared summed differences in patterns between two pools without correction for PCR artefacts, and without allotting the individual allelic contributions to the differences.8, 9, 10, 11

A fundamentally different way to compare pool patterns is to correct the pool signal for predicted PCR artefacts, in order to derive more accurate estimates of the allele frequencies. Advantages of this approach are that it allows the comparison of frequencies for individual marker alleles, and that results from different experiments can be summated and analyzed using regular statistics such as χ2 tests, since the entire pool signal is deconvoluted into individual allele counts.12 All recent correction methods use information derived from a training set of individual genotype patterns to obtain information about the stutter behavior of the marker under investigation. One approach is to build a matrix of stutter patterns for individual alleles.7, 14, 15 This requires a set of well-distributed homozygous or well-separated (nonoverlapping) heterozygous individual genotype patterns, and interpolation or extrapolation has to be invoked to complete the matrix for missing alleles. These methods are sensitive to one or more nonrepresentative patterns caused by, for example, measurement errors.

Alternatively, a stutter model can be derived from individual genotypes, which is used to correct for stutter and permits interpolation of stutter for allele sizes not encountered in the training set.12 The advantage of a model is that it partly removes the influence of aberrant patterns. On the other hand, it interprets the stutter peaks according to a fixed behavior, which can yield a less accurate description. The model approaches presented thus far also require well-distributed homozygous or well-separated heterozygous individual patterns for each marker to define the model parameters. In both types of correction procedure, a rather large set (at least 20 to 50) of individual patterns has been considered necessary7, 8 to provide sufficient data to obtain the necessary stutter information.12 The search for and analysis of informative marker data often make these approaches tedious and highly interactive.

We have developed a stutter correction method that fits a model to one small set of genotype data from 10 individuals. This standard training set is identical for all markers, and can be of any allelic composition, since it does not need to include particularly defined homozygous or heterozygous individuals. The accuracy of the stutter correction model has been tested on 34 different microsatellite markers and used in a case–control study for celiac disease.

Materials and methods

Definitions

Uncorrected pool:

allele frequencies derived from pool signals uncorrected for stutter.

Corrected pool:

allele frequencies derived from pool signals corrected for stutter.

True pool:

allele frequencies obtained by individual genotyping of all samples present in a pool, and summing the allele counts.

Preparation of DNA pools, marker selection, PCR, and analysis

Genomic DNA was obtained from peripheral blood lymphocytes using established procedures. Stock solutions were diluted to approximately 25 ng/μl, vortexed gently, and measured with Pico Green (Molecular Probes, Leiden, the Netherlands) on a Genios plate reader (Tecan, Männedorf). Subsequently, samples were diluted to 10 ng/μl and final concentrations were measured in triplicate. Each sample was tested for adequate PCR amplification. Volumes containing 100 ng of DNA from individual samples were pooled. Pools, as well as a set of 10 random individual samples, were purified by phenol extraction, and diluted with water to 10 ng/μl. Characterized microsatellite markers were obtained from the Genome Database (GDB) and Marshfield database. New microsatellite markers were identified by searching a 4 Mb ADHD linkage region on chromosome 15 for microsatellite repeats19 using the Tandem Repeat Finder program (TRF). PCR primers flanking the repeats were designed with the Primer3 program (sequences are available on request). A so-called pig-tail sequence extension was added to one of the primers in order to reduce plus-A artefact during PCR.20 The other primer was labeled with 6-FAM, HEX, or NED fluorescent dyes (Biolegio, Malden, the Netherlands, and Applied Biosystems, Foster City, CA, USA).

Individual samples and triplicate pools were amplified simultaneously as described elsewhere,19 but with 27 instead of 33 cycles. Up to three products were pooled, and analyzed on an ABI 3700 sequencer.19 Sample files were analyzed using Genescan 3.5 and Genotyper 3.6 for Windows NT and the heights of all peaks were labeled. Samples with allelic peak heights below 200 or above 6000 were not labeled. A computer program called PoolFitter (freely available from our web site), which is a user interface invoking our stutter correction algorithm, then processed the tables with allele sizes and peak heights (see below). The pool patterns were corrected for stutter by applying the model parameters derived from the individual genotypes (see below). For marker D7S2422 only, preferential amplification of shorter alleles was compensated in the PoolFitter program, by dividing the peak heights of both individual data and pooled data before model fitting by a function fitted to the corrected heterozygous patterns without compensation for preferential amplification. Estimates from corrected and uncorrected pool patterns (averages of triplicate measurements) were compared with true pools using the program CLUMP.21

The model

The basic concept is that, for pooled DNA, any electrophoretic microsatellite marker pattern (See Figure 1a) is the sum of its constituent parts comprising a mixture of homozygous and heterozygous individual patterns. Peaks may represent individual alleles, or individual alleles plus a stutter component, or only stutter. We describe a pattern by Y(a), where Y is the height of the signal at fragment length a. In all figures, the signal height has been scaled to facilitate comparison with calculated quantities later on. The length a can assume discrete values differing by multiples of the repeat length Δa. For a dinucleotide marker, Δa=2 base pairs. Looking at a pattern for a single allelic peak at length a0, one expects stutter peaks at a0−Δa, a0−2Δa, etc. and possibly ‘up-stutter’ peaks at a0a, etc. In general, peaks are located at a=a0mΔa, with m integer. The modeled peaks are described by y(a), representing the peak height at length a. This peak y(a) can have contributions from an allelic peak y0(a), and from stutter peaks of alleles located m base pairs away: y(a)=ym(a+mΔa). Thus, the index m refers to the order of the (stutter) peak: y0(.) is the main, allelic peak, y1(.) is the first stutter peak, and so on. The argument of ym(.) refers to the location of the main, allelic peak. Our aim is to describe the total set of peaks by a model with as few parameters as possible. The values of the parameters will depend on the marker, PCR conditions, settings of the electrophoresis apparatus, etc. Knowing the stutter parameters, it is possible to deconvolve the measured signal Y(a) of a DNA pool into the contributions of individual alleles and hence calculate the frequency of each allele in the pool. The ratio between allele and stutter signal appears to be marker specific, and since pool patterns do not contain enough information to obtain reliable estimates for the stutter parameters used in the model, these parameters have to be derived by fitting the model to a number of individual test patterns for each marker.

Figure 1
figure 1

(a) Typical individual electrophoretic pattern after removal of background signal (line). The marker is the dinucleotide repeat marker DRD5. The individual is homozygous for the allele of size a=153 base pairs (indicated by the arrow). The circles indicate the tops of the peaks, which are used as measures of the amount of stutter and allele signal present. The numbers refer to the indexing of the peaks: 0 is the allelic peak, 1 the first stutter peak, and so on. Since the absolute heights of the peaks are not important, Y has been scaled to a 100% scale in most figures. (b) The same electrophoretic pattern as in Figure 1a, plotted on a logarithmic scale (line and circles; the numbers refer to the indexing of the peaks). The straight line indicates the exponential relationship between successive stutter peaks. The arrow indicates the main, allelic, peak at a=153.

Stutter pattern of a homozygous individual

It can be demonstrated that the heights of stutter peaks decay exponentially with the number of stutters, as clearly shown in a logarithmic plot (Figure 1b), in which a straight line can be drawn through the tops of the stutter peaks. A few simple theoretical assumptions about the nature of DNA amplification predict this exponential behavior.22 From many of such plots we found empirically that the ratios of the heights of successive stutter peaks are roughly constant for all samples of the same marker and amplification condition, but that the constant differs between markers and conditions. We denote this constant by the ratio r. The first stutter peak is usually found to be proportionally higher compared to the main peak; in Figure 1b this is observed as a deviation of the stutter straight line with the top of the allelic peak. We therefore use a different ratio to describe the relationship between the first stutter peak and the allele peak:

with 0<r<1, and λ>1, normally. This is for the ‘normal’ downward stutter. For the upward stutter, we take only one peak into account, as it is rare to see more up-stutter peaks; however, the model can easily be extended to more, if necessary.

with 0<μ1, normally.

For all other positions, that is, positions at larger lengths:

We can combine and rewrite Equations (1a), (1b), (1c) and (1d) as follows:

The fragment length of (stutter) peak ym(a) is

It is usually observed that stutter is more severe for longer alleles than shorter alleles. This can be understood, at least qualitatively, by realizing that a larger number of repeats offers more chances for the PCR process to stutter. We therefore introduce an a dependence of the stutter ratio r:

For positive values of b1, this formula yields an increasing stutter for increasing a.

The true amount of signal at the allelic peak, that is, if no allele signal had been dissipated into stutter peaks, is represented by

We have now described the set of stutter peaks by four parameters: b0, b1, λ, μ. In the trivial case of a pattern of a homozygous individual, Equation (2) can be fitted directly to the Y(a) data, with y0(a0) as a fifth fit parameter. The length of the allele is directly read from the pattern: a0; y0(a0) is just the height of the measured main peak Y0(a0); μ is the ratio of the up-stutter peak at a0a and the main peak. The factor r is determined by the logarithm of the heights of the stutter peaks y1(a0), y2,(a0)… the heights of which are directly taken from the measured peaks Y(a0−Δa), Y(a0−2Δa),… Then λ follows from y1(a0)/y0(a0), with the value of r inserted in Equation (2). In this example, r is kept constant; in a more realistic situation involving alleles of several lengths (such as in a pooled DNA sample), b0 and b1 can be fitted instead of a constant r.

Stutter pattern of a heterozygous individual

In the case of a heterozygous individual, there are more measured peaks to fit, and there is one extra fit parameter, namely the y0 of the second allelic peak. We will refer to the two y0s as 0S and 0L, located at a0S and a0L, respectively, with a0S< a0L (see inset of Figure 2). Heterozygous patterns often overlap to a large extent, and pool patterns always do. For two alleles close together, the measured peak heights in the overlapping region are the sum of two contributing peaks, one for the (shorter) S-allele, and one for the (longer) L-allele. For instance, the peak at the left arrow in the inset of Figure 2 is made up of the first stutter peak of the S-allele and the third stutter peak of the L-allele and is represented by

with a0S=153 and a0L=157. This effect makes the fit procedure more challenging and real solution algorithms have to be invoked. We used the Levenberg–Marquardt method.23 The result of such a fit is shown in Figure 2. The model fits the data well, and the relatively large contribution of the stutter peaks of the L-allele to both the allelic peak and stutter peaks of the S-allele is clearly seen.

Figure 2
figure 2

Model fitted to a heterozygous individual electrophoretic pattern (see inset for the original pattern (line) with peaks (circles); allelic peaks 0S and 0L are indicated). The marker is DRD5. The circles represent the data peaks; the squares show the fit. The two types of triangles and the dashed lines indicate the individual peak patterns of the two alleles (y0S and y0L) that comprise the measured signal. The diamonds in the inset represent the corrected, that is, estimated frequencies of the two alleles. These values are obtained by summing all peaks for each separate allele and normalizing the total sum of the two alleles to 100.

Pattern of a pooled sample

The generalization to fitting a pattern of a pooled DNA sample containing alleles of n individuals is straightforward. At every measured fragment length a, the following peaks can contribute to y(a), depending on the presence of alleles in the pooled sample:

  1. a)

    the allelic peak y0(a) of the allele at a0=a;

  2. b)

    the up-stutter peak y−1(a−Δa) of the allele just left of it, at a0=a−Δa;

  3. c)

    the first-order stutter peak y1(aa) of the allele just right of it, at a0=aa;

  4. d)

    higher-order stutter peaks ym(a+mΔa) of alleles more to the right (m=2,3,…).

In a formula, this can be written as

with ym(a+mΔa)=y0(a+mΔa)λ exp((b0+b1a)m) for m≥1 the mth stutter peak of the allele at a0=a+mΔa, and y−1(a−Δa)=y0(a−Δa)μ the up-stutter peak of the allelic peak just left, at a0=amΔa. The arguments of y(.) in Equation (7) must lie in the measured range of allele lengths (amin,amax).

The yt(a)s are now calculated as follows:

These values are proportional to the number of individuals n contributing to that allele a. To obtain estimates of the true allelic frequencies F(a) in the pool, one calculates:

where the summation is carried out over the full range (amin, amax) of the pool pattern.

To correct a pattern of a pooled DNA sample, one has to fit Equation (7) to the measured data Y(a). Values for the four model parameters b0, b1, λ, μ could be found from fitting the model to the genotype patterns of a small number of representative individuals one at a time, and deriving ni values for each of the fit parameters. These ni values could then be averaged to obtain a good estimate for each of the parameters. A much more efficient way is to perform the model fitting to all individual patterns simultaneously. The total number of data points is nimi, with mi the average number of measured peaks per individual. The total number of fit parameters is 4 + ni(1+h), with h the calculated heterozygote frequency of the marker (0≤ h≤1). A set of ni =10 individuals, each with on average mi=7 data points and a heterozygote frequency of h =0.5, requires fitting a model with 19 parameters to a combined data set of 70 data points, which, as shown in Figure 3 for marker D6S273, yields a very stable fit.

Figure 3
figure 3

Electrophoretic pattern of a pool of 100 individuals for marker D6S273 (dashed line). The circles mark the peaks used in the analysis. Diamonds indicate the estimated true allelic frequency F after compensation for stutter, as calculated from the model. The model parameters were derived from 10 randomly chosen individual marker patterns. The fit parameters (b0, b1, λ, μ) were inserted into the pool fit, from which the yts were derived. For most alleles, quite a big difference is observed between the original and corrected peak heights. The squares represent the summed individual genotypes. The inset shows the relationship between summed individual frequencies and uncorrected (circles) and corrected (diamonds) frequencies from the pool. The straight line represents the identity line: symbols on this line represent alleles for which the frequencies estimated from the pool equal the summed individual frequencies, showing perfect agreement.

Comparison with a deconvolution method

The most robust method previously published is the deconvolution method described by Perlin et al.15 Like our method, it uses a set of individual patterns to obtain the stutter behavior. The main difference between our method and Perlin et al's is the fact that we fit a model to the data to describe the stutter behavior, which makes our method potentially much more robust, and thus requiring fewer individual patterns to train the method. This has been tested below.

Results

Minimum number of genotypes required in the training set

For marker D6S273, we investigated the influence of training set size on the reproducibility of the results by fitting models based on sets varying in size from 2 to 30 individuals, which were taken at random from the n individuals in the pool. For each chosen set size, a random selection of individuals was taken 20 times to derive the model parameters and to correct the pool data. Figure 4 shows the effect of training set size plotted against the spread in the corrected peak height of one of the alleles (a=127; see Figure 3) in the pool. We chose to show this allele because of its low frequency (3%), in which adequate correction is crucial. For sets smaller than about five individuals, the variation in the results was relatively large, but for 10 individuals or more, the gain in reproducibility was limited. The effect of training set size was also tested for two other dinucleotide markers, with similar results (not shown). A set of ni=10 was found to give reliable results with a coefficient of variation of about 1%.

Figure 4
figure 4

The peak at a=127 of Figure 3 calculated from fits with individual sets of ni=2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 20, 25, and 30 individuals. For each value of ni, 20 fits with randomly comprised sets were carried out. The values of the allele frequency F derived following correction are plotted on the vertical axis: mean (circles) and standard deviation (error bars) and the smallest and largest values (diamonds). Small sets already provide reliable frequencies with mean values of 3.2. The frequency found from summing the individual genotypes in the pool is 2.5. The frequency read from the uncorrected pool pattern was 5.8.

This test was also carried out for Perlin et al's method.15 For all alleles and values of ni, the variation in estimated frequencies from this method was at least three times as large. For ni=30, the variation was still twice as high as our method's variation for ni=10.

Robustness to atypical training sets and measurement errors

Using data for marker D6S273, the robustness of the algorithm was checked in the following way: various sets of ni=10 individuals were used as a training set to fit the model to the pooled data depicted in Figure 3: (i) a set of 10 homozygous individuals; (ii) a set of 10 heterozygous individuals; (iii) a set of 10 individuals whose alleles were closely packed together in a certain region, leaving part of the allelic range of the pool uncovered; (iv) a set of eight regular individuals plus two measurement errors: patterns containing an allele exhibiting a completely different stutter behavior to the others (but with peaks in the same molecular weight range).

All tests yielded good results that hardly differed from the ‘normal’ pool fit results of Figure 3.

A comparison of test (ii) with test (i) shows that no prechosen homozygous (or well-separated heterozygous) individual patterns are needed to derive good parameter estimates. Further, test (iii) shows that there is no need for training data to cover the full molecular range of alleles. Only in the extreme case of having only data points at one extreme of the molecular range in the training set do the pool results at the other end become less reliable. Test (iv) simulates the presence of measurement errors. If one or two of the 10 individual patterns are dissimilar to the others, for example, because of an artefact in the PCR process or a measurement error, the fit procedure does not appear to be misguided. The test showed that the fits derived from a training set of eight normal and two abnormal patterns were nearly as good as those based on 10 good patterns.

Validation of the model

For 34 different microsatellite markers, correction models were derived from the same training set of 10 individuals, and both uncorrected and corrected pools were compared with the true pools (for definitions see Materials and methods section). An example is shown in Figure 3. In total, five genotypes from four different markers could not be determined reliably, and these were discarded. Correlation coefficients of uncorrected and corrected pools vs true pools for all 34 markers are given in Table 1. A graphical representation of the data for 16 characterized markers can be found on our web site (Figure C).

Table 1 Statistical comparison of allele frequencies obtained by individual genotyping and frequency estimates from uncorrected and corrected pool patterns

The only markers in which uncorrected pools approached true pools were the four tetranucleotide markers. For the dinucleotide markers, uncorrected pools were generally very different from the true pools, whereas corrected and true pools did not differ significantly, with the exception of markers D11S1760, D7S2422, and kk9. For marker D11S1760, there was a large overestimation of the frequency of the shortest allele in both uncorrected and corrected pools. Analysis of all individual genotype patterns for this marker revealed that stutter did not increase with allele length in a regular fashion (see web Figure D), which is an underlying assumption in the correction model. Marker D7S2422 showed a systematic overestimation of the peak height of the shorter allele in heterozygotes in the PoolFitter program, which persisted after correction for stutter. This suggested preferential amplification of shorter alleles, and after applying a simple compensation in the program, the differences between corrected and true pools were no longer significant (data not shown). No evidence for preferential amplification was found in the other markers (see web Figure E). Marker kk9 had two extra alleles (together accounting for 18% of all alleles in the true pool), with a size exactly between alleles at the regular 2 bp intervals. These aberrant alleles were discarded from the analysis, since the correction method ignores alleles at irregular intervals.

Case–control study

We investigated the application of the correction method in a case–control study in celiac disease (CD). DNA from 50 CD patients and 100 healthy controls was combined into two pools. Five microsatellite markers that had previously been used in association studies of CD patients were blinded and analyzed in CD and control pools. For three markers, allele frequencies did not differ significantly between cases and control pools, in either individual genotyping or pooled analysis. The other two markers showed significant differences between cases and controls. In each marker, one allele was very strongly associated, and already detectable in uncorrected pools, but after stutter correction, both markers also showed a much weaker but significant association with a second allele (see Figure 5). Both weak and strong associations were also demonstrable in the summed individual analysis.

Figure 5
figure 5

Comparison of allele frequencies in pools of celiac disease patients (black bars) and healthy controls (white bars) for marker D6D273. (a) Uncorrected pools, (b) corrected pools, (c) true pools. X-axis: allele number (increasing size). Y-axis: frequency of individual alleles (%). Significant differences for single alleles (P<0.05, not corrected for testing multiple alleles) are indicated with (*).

Discussion

Although the use of pooled DNA enormously reduces the amount of genotyping in comparing cases and controls, it suffers from the inability to generate haplotype information. As a result, microsatellite markers, with their high information content, are much more suitable than SNPs for use in pooled DNA samples. The number of potentially polymorphic microsatellites in the genome is much higher than the number of characterized markers in public databases. For example, in 11 schizophrenia candidate genes, we have tested 19 polymorphic microsatellites, eight of which were intragenic, while flanking markers were on average at 45 kb distance from the gene (max 130 kb). In a schizophrenia candidate region, we found nearly 250 potentially polymorphic microsatellites with an average spacing of 55 kb (max 168 kb). However, the widespread application of microsatellite markers in DNA pooling may have been prevented by uncertainties induced by stutter artefacts and the consequent distortion of allele frequency estimates.

We have developed a novel method, which enables accurate extraction of allele frequencies from microsatellite pool signals. A prerequisite for the application to large studies is that the correction method does not entail much additional analysis time. Our method meets this requirement, since the same training set of only 10 independent DNA samples plus the pool samples is required to carry out an analysis for a given marker. An apparent advantage of our approach is that there is no requirement for stutter and allelic peak signals of heterozygous individuals to be clearly separated, which greatly reduces the number of individuals required. There was little gain in accuracy when more than 5-10 individual genotypes were used, and accordingly, we choose one set of the same 10 independent individuals for all analyses, to allow for occasional dropouts. Other advantages of our fit algorithm are the simultaneous fitting of all data, which decreases the sensitivity to aberrant data, and that the size distribution of alleles and stutter, or alleles with an anomalous stutter height, had little influence on the predictive accuracy of the model.

The model was tested on DNA pool patterns with 34 different microsatellite markers, 18 of which were newly defined from human sequence data, since well-characterized markers could have been selected for their accuracy in genotyping. Our results with tetranucleotide markers confirm previous reports that stutter is low in these markers (generally <5%) and that no stutter correction is required.7, 12, 24 Significantly, for the two dinucleotide markers in which correction remained inaccurate, the presence of an aberration was readily detected in the PoolFitter program, even though it could not correct the stutter distortion.

In a case–control study involving celiac disease, marker alleles that were weakly associated in individual genotyping were also found to be associated in the pool analysis, but only after stutter correction. These two exceptionally strongly associated markers in the HLA region would have been detected even without correction, but would have been missed if only the weakly associated alleles had been present. This clearly demonstrates the benefit of stutter correction in DNA pooling.

Taken together, stutter correction generally resulted in accurate estimates of true allele frequencies in DNA pools. Compared with methods that use uncorrected pool patterns, several important advantages are apparent. The recently proposed ΔAIP and ΔTAC methods compare overall differences in peak area or peak height between pool patterns.8, 9 However, both methods assume a single fixed stutter profile for all markers and simulate large numbers of pool patterns to determine what proportion by chance will deviate significantly. Since the heights as well as the number of stutter peaks can differ greatly between markers, these methods raise the question whether realistic significance levels can be calculated in this way. In any case, such an approach prevents ascribing differences between pools to single alleles and summing results from different subpools or different experiments.12 These drawbacks are not evident in our method.

We found that technical measures, such as reducing the number of PCR cycles, and adding pig-tail sequences to primers to eliminate plus-A artefacts, and separation on a capillary sequencer instead of a slab gel machine,21 consistently improved the accuracy of DNA pool measurements. However, the nature of DNA pooling will inevitably result in some loss of sensitivity compared to individual genotyping.25 Furthermore, a four-parameter model is not a perfect description of reality.

Despite these and other limitations, such as the lack of haplotype information, until cheap and rapid large-scale individual genotyping of markers for single individuals becomes technically feasible, DNA pooling methods allow efficient initial screening of candidate regions, and candidate gene systems. In pooled DNA, microsatellites are much more informative than single SNPs. In a second phase, associated microsatellites could then be followed-up by individual genotyping of high-density SNP markers, and haplotype analysis. Even if cases and controls were divided into pools of only 100 individuals each, as recently advocated,16, 26 and all amplified in triplicate, DNA pooling decreases genotyping by a factor of 30 in studies involving 500 cases and 1000 controls.

Our results confirm that the accuracy of analyzing corrected pool patterns generated from microsatellites approaches that of individual genotyping. Particularly in complex disorders, where the association of marker alleles with disease loci is likely to be only moderate or weak, a gain in sensitivity with stutter correction in pooled analyses justifies the limited amount of extra genotyping required to create a small training set.

In conclusion, we have demonstrated that accurate estimates of microsatellite allele frequencies from DNA pools are feasible with a novel stutter correction method requiring one standard training set of only 10 additional individual genotypes. This method opens the way for realistic large-scale genetic association studies using microsatellite markers.