Introduction

Genome-wide association studies (GWAS) are now widely used to search for common variants underlying complex diseases and traits.1 These studies rely on the presence of discrete blocks of linkage disequilibrium (LD) between polymorphic markers, most commonly single nucleotide polymorphisms (SNPs).2, 3, 4 By genotyping a set of SNPs (ie, tagging SNPs (tagSNPs)) that are in high LD with a large set of neighbouring (un-typed) variants, one can reduce the costs involved in a GWAS with a limited loss of power.5, 6 The International HapMap Project4, 7 has generated an increasingly dense map of the LD structure of the human genome, on the basis of selected reference panels, to optimize the selection of tagSNPs for association studies in other populations.4 The main assumption is that the LD structure of the HapMap populations and the specific population under investigation is very similar. Deviations from this assumption may lead to a decrease in the power of GWAS based on tagSNPs.

Recent studies have assessed the similarity of the LD patterns in the three reference panels from the HapMap consortium (phase I and phase II)8, 9, 10 with other populations. The general claim is that tagSNPs from one population can capture common variation in other populations of similar origin.9, 10, 11, 12, 13 However, most of these studies have evaluated only a limited number of loci at varying degrees of SNP density,8, 14 and several of them have shown that the exchangeability of tagSNPs between populations may be locus and SNP-density specific.10, 14 Another issue is the variation in the local LD block structure observed between populations.15, 16 These differences have been attributed to differences in recombination rates,17 differential demography,16 stochastic genome variation or sampling variation.18 The last issue is highly relevant when comparing local LD patterns within small sample sizes, which have been the case for most studies so far, including those for the generation of HapMap phase I and II.

In the past years, an increasing number of GWAS have been conducted using European samples.19, 20, 21 Nonetheless, large scale studies that compare LD structure between European populations and the HapMap have not been performed. In this study, we estimated genome-wide measures of pair-wise LD and LD block structure in the Dutch population using a 600K SNP chip. We compared these LD estimates with data from the CEU–HapMap reference panel (individuals from Utah with northern and western European ancestry)4 to assess similarities and differences in LD patterns between these two European population samples.

Subjects and methods

The participants and genotypes we used were derived from a recent GWAS for major depression (MDD) in The Netherlands conducted as part of the Genetic Association Information Network (GAIN) (http://www.fnih.org/). The design of the study, description of the phenotypic data, as well as the genotyping procedures and quality control, are extensively detailed in a separate manuscript (Genome-wide association for Major Depressive Disorder: a possible role for the protein PCLO 2008; Sullivan et al, 2008, in press), and in earlier reports.22 Here, we briefly highlight the information that was relevant for our analysis.

Subjects

A sample of 1860 participants who were selected as controls for a recent GWAS of MDD (GAIN) project23 was available for our study. The participants were derived from two longitudinal studies, namely: The Netherlands Twin Registry (NTR) and The Netherlands Study of Depression and Anxiety (NESDA).23 Participants were selected as controls on the basis of the following criteria: age between 18 and 65 years, place of birth for the individual and his/her parents being The Netherlands or northern Europe, and low liability for MDD.23 The participants selected for the study were, on average, 42 years old and 60% of them were female. Details of the collection and sampling procedure, as well as other demographic characteristics are described elsewhere.23 After excluding participants who did not fulfil the inclusion criteria (Sullivan et al 2008, in press), a total of 36 trios and 1766 unrelated participants were available for the study.

DNA isolation, SNP genotyping and quality control

A description of the biological sample collection and DNA isolation is presented elsewhere.23 The genotyping of approximately 600K SNPs in the Dutch samples was performed by Perlegen Sciences (Mountain View, CA, USA), as part of the GAIN project.22 The genotyping platform consists of four proprietary, high-density oligonucleotide arrays that were chosen to tag common variation in the European and Asian panels from the HapMap consortium.2 Besides the quality control of the SNP genotyping performed by Perlegen,22 additional criteria were applied before the SNP genotypes were included in the final analysis. Briefly, SNPs were excluded if they had a gross mapping problem,24 ≥2 genotype disagreements in 40 duplicated samples, a minor allele frequency (MAF) <0.01, or >0.05 missing genotypes in either the controls, or in the cases from the GWAS of MDD. For our analysis, we also excluded SNPs with ≥1 Mendelian inheritance error in the 36 trios to improve the estimation of haplotypes. In addition, we also removed SNPs with deviations from Hardy–Weinberg Equilibrium proportions (HWE) (P<0.0001) in the trio sample. There were 2402 SNPs with significant deviations from HWE proportions, but that fulfilled the genotyping quality control (based on SNP missingness and/or low MAF).

Statistical analysis

To assess the similarities and differences of pair-wise LD and LD block structure between the Dutch samples (trios and unrelated subjects) and those from the HapMap consortium, we downloaded the genotypes available from the 30 trios from CEU–HapMap panel (phase II)25 (http://www.hapmap.org). From the CEU panel, we retrieved the genotypes of the SNPs that were present in the Dutch sample. SNPs that were not genotyped as part of the HapMap project, but that were available from dbSNP26 (eg, they were genotyped in the HapMap panels as quality control for the GAIN project) were also included in the analysis (n=7633). The physical positions of the SNPs were updated using dbSNP build 36.26

The MAF distribution was estimated in the trio-founders (Dutch and CEU samples) and in the group of unrelated Dutch subjects. Pair-wise LD (r2 and D′), and LD blocks were calculated per chromosome in both the Dutch (trios and unrelated subjects) as well as in the CEU samples. HAPLOVIEW v1.427 was used to estimate the above parameters with default settings except for the MAF of SNPs that was set to ≥0.05. LD blocks were determined using the method of Gabriel et al (2000).3

We used Fisher's exact test to test for statistical differences in MAF between the Dutch and the CEU trio-founders for individual SNPs, adjusting for multiple testing. Next, to compare the similarities of pair-wise LD between the Dutch and the CEU samples (trios), Pearson correlation coefficients (r) were calculated over non-overlapping 500 kb windows (with at least 50 pair-wise r2 values). Further, we used Fisher's z-transformation to convert r values to a normally distributed z distribution.28 This allowed us to look for local variation in the correspondence of pair-wise LD between populations by testing whether the Pearson correlation in a specific window was significantly lower than the genome-wide average on all autosomes. The distributions of block-derived parameters (total number of blocks, median block size, number of markers per block, and number of singleton SNPs) were compared between the Dutch trios and unrelated subjects, and the CEU samples using non-parametric tests (Wilcoxon signed rank test). These analyses were carried-out using the R statistical package v 2.7.0.

The level of genetic differentiation between the CEU and Dutch samples was assessed by means of the identity-by-state (IBS) test.29 This test was used to assess whether an individual is genetically more or less similar to a phenotypically discordant individual, when compared with a phenotypically concordant individual. Here, the phenotype is 1 if the individual is derived from the CEU sample, and 2 otherwise. For this analysis we restricted the comparison to the Dutch and CEU trios to have comparable sample sizes. These analyses were carried out using PLINK v1.03.29

To study the impact of the sampling variation on the estimation of the block structure, we performed bootstrapping. This means that we randomly sampled the 36 Dutch trios with replacement to create 1000 replicas, each with 36 trios. Next, LD blocks were estimated using the Gabriel method for each replicated set of 36 trios. Therefore, we estimated LD blocks 1000 times per chromosome. For each LD block observed in the Dutch trios (real blocks), we derived a frequency distribution for the proportion (X) of real blocks that were retrieved from the simulated data (X is the number of simulations in which the observed blocks were present out of 1000 simulations). We considered a real block robust if it was retrieved in at least 95% of the bootstraps. The analyses of bootstrapping and LD block estimation were carried out on the genetic cluster computer (http://www.geneticcluster.org).

Results

The present analysis is on the basis of 427 853 autosomal SNPs that were genotyped in the CEU and the Dutch population samples. Table 1 presents the total number of markers analyzed per chromosome. The average inter-marker distance across all autosomes was 6.31 kb. The large variation in the estimates of the inter-marker distances shows that the markers are not evenly spaced on the physical map. This feature of our data allowed us to evaluate whether the differences in the LD parameters between populations and/or samples were dependent on the differential SNP density.

Table 1 Median block length (interquantile range) and number of haplotypes per block in the Dutch and CEU populations

The MAF distribution of SNPs in both the Dutch and CEU trio-founders, and in the unrelated subjects was estimated. There was an excess of rare variants (0<MAF<5%; approximately 5% of SNPs per chromosome) in the Dutch trio-founders, but for SNPs with MAF ≥10%, the pattern was very similar between the three samples (Figure 1). Despite large differences in the density of markers per chromosome, the distribution of MAF was very similar across all autosomes (Supplementary Figure 1). There were 31 SNPs with significant MAF differences between the Dutch and the CEU trio-founders (Fisher test; Bonferroni-adjusted P-value: <0.002). We inspected those SNPs in SNPdb (latest release) and found that the MAF for these SNPs in the CEU sample differed according to the submitter group, and for most of them there were discordant genotypes. Thus, the differences we observed are not due to population differences.

Figure 1
figure 1

Distribution of minor allele frequency (MAF) of single nucleotide polymorphisms (SNPs) across 22 autosomes in the Dutch trios (red), CEU trios (light blue) and Dutch unrelated subjects (black). The frequency distribution of MAF for SNPs was categorized in bins. A full colour version of this figure is available at the European Journal of Human Genetics online.

Figure 2a depicts the median pair-wise r2 calculated over non-overlapping 500 kb windows in the Dutch trios. The median pair-wise r2 averaged across all chromosomes was 0.015 and 0.016 for the Dutch and CEU trios, respectively. These low pair-wise r2 estimates are expected as the Perlegen SNP-chips were designed to analyze tagging markers. More interesting is the high correlation of pair-wise r2 between the two trio samples (Figure 2b). The back-transformed mean Pearson correlation was 0.925 with 95% CI from 0.921 to 0.997. We tested whether the correlation estimates per window were significantly lower than the mean value estimated for all autosomes, and found that 1527 (30%) of the 5092, 500 kb-windows had significantly lower correlation estimates. Nonetheless, the lowest observed correlation was reasonably high (r=0.6). We also estimated pair-wise r2 in the Dutch unrelated subjects. The median r2 derived from this sample was lower than those estimated from the Dutch trios. In addition, the Pearson correlation of r2 between the Dutch samples (trios vs unrelated subjects) was not 1, despite the fact that we were comparing samples from the same population. This reflects the inaccuracy in estimating haplotypes in unrelated subjects in the presence of low r2 values with certain algorithms (eg Expectation–Maximization as implemented in Haploview)30 or an overestimation of low r2 values in small samples.31

Figure 2
figure 2figure 2

(a) Pair-wise linkage disequilibrium (LD) (r2) per chromosome for the Dutch trio sample. The median r2 per 500 kb window is plotted against physical distance (kb) in each chromosome. (b) Pearson correlation between r2 estimates for the CEU and the Dutch trios. The Pearson correlation between r2 calculated over 500 kb windows is plotted against physical distance (kb).

The LD block structure between the CEU and Dutch trios was also compared. We observed a higher number of blocks per chromosome and longer blocks in the Dutch trios, while the number of SNPs that were not in blocks was reduced (Table 1). The number of SNP singletons (SNPs that were not present in blocks) was significantly larger in the HapMap sample than in the Dutch trio sample for all chromosomes.

In addition, the median length of the blocks was higher for the Dutch than for the CEU trios (genome-wide Wilcox test; P-value <0.0001), although the interquantile distribution of this parameter overlaps in the two population samples (Table 2). Differences in other parameters, such as the median number of SNPs per block and the median number of haplotypes (for haplotypes with population frequencies ≥0.05) per block, were different (Table 2). LD blocks were also calculated in the unrelated Dutch individuals. Owing to computational limitations, we restricted the analysis of LD blocks to 400 randomly chosen subjects and observed a significantly larger number of blocks when compared with the Dutch trios. However, the estimates per block, such as median number of haplotypes per blocks and median length, were not different between these Dutch samples (Table 2).

Table 2 Number of SNP analyzed and intermarker distance

We investigated whether the differences observed in the LD block structure were due to population structure. We used permutation tests for observing between-group IBS differences29 (IBS test as implemented in PLINK), which allows testing whether, on average, individuals from one population (CEU) are more similar/dissimilar than individuals from another population (Dutch). The nominal P-values of the IBS test were significant for chromosomes 14, 15, and 17 (chromosome 14; P-value=0.021, chromosome 15; P-value=0.03, and chromosome 17; P-value=0.045), although these P-values were not significant after adjusting for multiple testing (0.05/22 chromosomes). This suggests that large differences in population structure between the Dutch and the CEU samples are unlikely.

To assess the effect of sampling variation on the LD structure, we performed a bootstrapping analysis as described in the section Subjects and methods. We derived a frequency distribution of the proportion of observed blocks in the Dutch population (real blocks) that were present in the replicas (Figure 3). By defining a Dutch block as robust if it was identified in 95% or more of the simulations, we found that only between 7.14% (chromosome 8) and 12.44% (chromosome 19) of the real blocks were robust. Figure 3 shows that when the real blocks were robust, there is a 70–75% probability that the real block will be identified in the CEU population as well (blue bars).

Figure 3
figure 3

Proportion of Dutch blocks retrieved in 1000 bootstraps. Frequency distribution of the proportion of observed Dutch blocks that were retrieved from the simulated data. The proportion of Dutch blocks that were observed in the CEU sample are depicted in light blue and the proportion of Dutch blocks that were not present in the CEU sample are depicted in grey. A full colour version of this figure is available at the European Journal of Human Genetics online.

Discussion

In this study, we estimated pair-wise r2 over 427K SNP markers in a Dutch population and compared these estimates with data from the CEU–HapMap project. A high correlation (0.93) between pair-wise r2 across all autosomes was found, which suggests that tagSNPs from the HapMap project can capture common variation in the Dutch population. Our findings extend earlier studies of high similarity of pair-wise LD between populations with shared ancestry based on fewer loci. We also observed that approximately 30%, 500-kb windows showed local variation of the Pearson correlation estimates, which were significantly lower than the genome-wide estimate. This suggests that in a considerable part of the genome, the efficiency of tagSNPs from HapMap to capture common variants may decrease as proposed in earlier studies.14, 15, 32 The LD block structure between the CEU and the Dutch populations was also compared. The average statistics per block, such as the number of common haplotypes and the number of SNPs per block, were not statistically different, but there was a large variation in the block boundaries between these two populations.

In a recent study, large variation in the LD block structure, even in populations of similar geographical background, was reported.16 Gu et al32 suggested that individual population history and genetic factors, including genetic drift, may account for these differences. Here, we found that sampling variation has a strong effect on the local variation of LD. The bootstrapping analysis showed that even in the same population sample, the observed blocks are retrieved only in 60–70% of the replicas. Moreover, we did not find substantial population stratification between the CEU–HapMap and Dutch samples. Furthermore, it has been shown that genetic drift has large effects only for variants with low population frequencies,33 which in our analyses were not included to estimate LD blocks. Thus, although differential recombination and other genetic stochastic factors are likely to be involved in the differential pattern of local LD structure between populations, we found that sampling variation has a major role, which is in agreement with other studies.34

The discrepancy between the high overall correlation of pair-wise r2 and the variation in the LD blocks between the Dutch and the CEU samples can partly be explained by the fact that D statistics are sensitive to sampling distribution, whereas r2 is more robust.35, 36 We also estimated pair-wise LD using D and observed larger variation in the correlation estimates between the CEU and Dutch population (Supplementary Figure 2). This further supports the finding that small samples largely account for the observed variation of the block structure, as the Gabriel method is based on D′. It is not clear what the optimal sample size for a robust estimation of LD blocks should be. Samples of 60–100 individuals have been shown to be enough for a robust reconstruction of LD block structure,14, 16, 37 which may hold when assessing the transferability of HapMap tagSNPs to other populations. We and others16, 18 have shown that for a reliable estimation of local LD structure, larger sample sizes are needed. This has consequences for interpreting association studies performed using tagSNPs derived from small samples, as in association studies regions with different LD blocks in cases compared with controls are assumed to harbour putative functional variants. This might partly explain the lack of replication of genetic association studies.38

A limitation of this study is that the participants of this study were selected as controls for a GWAS of MDD on the basis of their low liability for MDD. As a result, this sample may not be representative of the entire Dutch population. Nevertheless, the demographic characteristics of this Dutch sample were comparable to other European (or with European ancestry) samples used as replication controls (Sullivan et al, 2008, submitted). This means that the Dutch individuals used for our analysis do not largely deviate from other European population-based samples. Another issue was the use of tagSNPs to compare LD patterns, which results in a relatively large spacing between markers. As a result of this, we were not able to relate our findings to genomic features, such as differential recombination rates, that are an important cause of local differences in LD block structure between populations.4 However, our aim was to assess whether estimates of pair-wise LD vary substantially at the genomic level. At the marker density used in this study, we observed a consistent pattern of MAF distribution (Supplementary Figure 1) and a high correlation (above 0.6) between pair-wise r2 (Figure 2a) across all autosomes. This suggests that the marker density does not have a large effect on the patterns of LD we observed.

The HapMap project has been a milestone for the design of GWAS, as it provides the most complete human-SNP catalogue to the scientific community. The SNP-chip platforms that are currently used for these studies have been designed using the HapMap data. In the past years, GWAS have identified common variants associated with a wide range of disorders and other human traits.39 Nonetheless, many GWAS have been negative, and others have not been replicated.38 Low power, selection bias and population stratification39 might explain these negative GWAS. Given our results, as well as other recent findings,32 it can be argued that local differences in LD structure between the HapMap populations and the target population may also lead to a reduced power to detect candidate variants in regions of the genome where the correlation between pair-wise r2 in the HapMap panels and the target population is decreased. This is likely to affect the efficiency of HapMap tagSNPs. Of note, we only analyzed LD patterns between Dutch and CEU–HapMap reference panel, and therefore the patterns of LD we observed between these two European samples may not extend to other settings. Nonetheless, our analysis highlights the importance of local variations of LD between populations with shared ancestry that should be taken into account in both the design of GWAS and in the interpretation of results derived from these studies.

In summary, in this large-scale comparison of LD patterns between a Dutch population and the CEU–HapMap sample, we observed a high overall correlation of pair-wise r2 across 22 autosomes, with local variation in some genomic regions. Both a denser SNP-chip platform covering a larger number of SNPs derived from the HapMap panels, as well as a higher threshold to select HapMap tagSNPs, may reduce this problem. The second message of our work is that the reliable estimation of LD blocks is still an unsolved issue that is largely affected by sample size. Given the current definition of LD blocks, there is too much stochastic variability in the estimates of the blocks, because of the small samples that are used. Hence the observed blocks are often not robust. The current phase of the HapMap project (phase III) has been extended to include more populations as well as more individuals from the three earlier reference panels, which should provide an interesting opportunity to evaluate this issue.