Genome-wide SNP analysis reveals no gain in power for association studies of common variants in the Finnish Saami

Huyghe, Jeroen R; Fransen, Erik; Hannula, Samuli; Van Laer, Lut; Van Eyken, Els; Mäki-Torkko, Elina; Lysholm-Bernacchi, Alana; Aikio, Pekka; Stephan, Dietrich A; Sorri, Martti; Huentelman, Matthew J; Van Camp, Guy

doi:10.1038/ejhg.2009.210

Download PDF

Article
Published: 25 November 2009

Genome-wide SNP analysis reveals no gain in power for association studies of common variants in the Finnish Saami

Jeroen R Huyghe¹,
Erik Fransen¹,
Samuli Hannula²,
Lut Van Laer¹,
Els Van Eyken¹,
Elina Mäki-Torkko²,
Alana Lysholm-Bernacchi³^na1,
Pekka Aikio⁴,
Dietrich A Stephan³,
Martti Sorri²,
Matthew J Huentelman³ &
…
Guy Van Camp¹

European Journal of Human Genetics volume 18, pages 569–574 (2010)Cite this article

680 Accesses
3 Citations
1 Altmetric
Metrics details

Abstract

The Saami from Fennoscandia are believed to represent an ancient, genetically isolated population with no evidence of population expansion. Theoretical work has indicated that under this demographic scenario, extensive linkage disequilibrium (LD) is generated by genetic drift. Therefore, it has been suggested that the Saami would be particularly suited for genetic association studies, offering a substantial power advantage and allowing more economic study designs. However, no study has yet assessed this claim. As part of a GWAS for a complex trait, we evaluated the relative power for association studies of common variants in the Finnish Saami. LD patterns in the Saami were very similar to those in the non-African HapMap reference panels. Haplotype diversity was reduced and, on average, levels of LD were higher in the Saami as compared with those in the HapMap panels. However, using a ‘hidden’ SNP approach we show that this does not translate into a power gain in association studies. Contrary to earlier claims, we show that for a given set of common SNPs, genomic coverage attained in the Saami is similar to that in the non-African HapMap panels. Nevertheless, the reduced haplotype diversity could potentially facilitate gene identification, especially if multiple rare variants play a role in disease etiology. Our results further indicate that the HapMap is a useful resource for genetic studies in the Saami.

FinnGen provides genetic insights from a well-phenotyped isolated population

Article Open access 18 January 2023

Population-specific long-range linkage disequilibrium in the human genome and its influence on identifying common disease variants

Article Open access 06 August 2019

The Newfoundland and Labrador mosaic founder population descends from an Irish and British diaspora from 300 years ago

Article Open access 28 April 2023

Introduction

Population isolates have proved very useful for the identification of genes for rare Mendelian diseases.¹ Their utility in elucidating the genetic basis of complex diseases, however, has been a matter of much debate in the past.^{2, 3, 4, 5} This uncertainty has now largely been resolved. During the last few years, numerous variants associated with complex diseases and traits have been identified in population isolates.⁶ One important advantage some of these populations offer, is that they show substantially higher levels of linkage disequilibrium (LD) and fewer regions of very low LD compared with outbred populations.^{7, 8} This makes them particularly suited for genome-wide association studies (GWASs), which rely on the LD between the typed markers and the untyped causative variants. LD that stretches out over longer distances, leads to a better genomic coverage for a given marker density. Equivalently, isolates would allow a more economic study design, in which a high genomic coverage can be attained using relatively fewer markers. It has been suggested that, in some isolates, an association study would require at least 30% fewer markers than a study in an outbred population.⁸

To date, gene mapping efforts in population isolates have focused on populations that were founded by a small number of individuals and that have subsequently undergone rapid exponential growth and low immigration. Recent founder populations in particular, exhibit substantially increased levels of LD as compared with outbred populations.^{7, 8} Examples of such young isolates can be found in Finland, the Central Valley of Costa Rica and on Sardinia.

Theoretical work has shown that a completely different demographic history can also lead to elevated levels of LD, in the absence of a founder effect. In small populations that remain constant in size over long time periods, new LD is generated by genetic drift.^{9, 10} This has led to the proposal of ‘drift mapping’ as a gene discovery strategy.¹⁰ Chromosomal regions implicated in a complex phenotype would then be identified in a small, ancient population of stable size, using a map of modest marker density. This initial coarse mapping would subsequently be followed by fine-scale localization in an outbred population using a denser map. Isolates with this demographic past have, however, been largely ignored by geneticists due to their small population size and low recessive disease prevalence.⁵ Consequently, elaborate genome-wide, SNP-based studies of the magnitude and distribution of LD in such populations have not been performed.

The Saami of northern Scandinavia and the Kola Peninsula exemplify an ancient population with no evidence of expansion.^{2, 11, 12, 13} Among European populations, the Saami are considered a genetic ‘outlier’ because of the relatively high genetic differentiation between them and other European populations, including their geographic and linguistic neighbors, the Finns.^{12, 14} Due to their demographic history, it has been suggested that the Saami offer great potential for ‘drift mapping’, and hence, more economic GWASs.^{2, 10, 13, 15, 16, 17} However, this has been substantiated with only very limited empirical data, most of which predate the HapMap project.

In this paper, we present the results of a first genome-wide, SNP-based survey of the extent and distribution of LD and haplotype diversity in the Finnish Saami. We compare relative power for association studies of common SNPs with that in the HapMap reference panels, and discuss the implications for GWASs for complex phenotypes.

Materials and methods

Data sets

This study was conducted within the framework of the European ARHI project (QLRT-2001-00331) that aimed to identify environmental and genetic risk factors for age-related hearing impairment.^{18, 19, 20} Due to the anticipated statistical power advantage for association-based gene mapping, we conducted a GWAS in the Finnish Saami. The results of this association scan will be published elsewhere. Here we describe the results of our evaluation of relative genomic coverage.

Blood samples from Saami subjects, aged between 50 and 75 years, were collected across the north of Finland. Since this was a quantitative trait association study, there was no ascertainment based on phenotype. The eligible subjects were recruited with the aid of the public population register through a three-stage process. In a first stage, a geographical criterion was applied: only areas with a high probability of Saami inhabitation, were considered. In a second stage, putative study participants were invited based on an evaluation of Saami communities made by an expert. Finally, Saami identity of the subject in question was confirmed in a direct interview with the subject. Written informed consent was obtained from all study participants and all the samples were anonymized, with no identification of individual subjects possible. This study has been approved by the Finnish National Advisory Board on Health Care Ethics, and by the ethics committees or the appropriate local institutional review boards at all participating institutions.

Genomic DNA from 352 subjects in total was extracted from blood and diluted to 50 ng/μl. Each sample was genotyped on the Affymetrix GeneChip 100K array pair (116 204 SNPs). Genotype calling was performed using the BRLMM algorithm in the Affymetrix GeneChip Genotyping Analysis Software (GTYPE) version 4.1. Data management and quality control were performed using the PLINK toolset²¹ (http://pngu.mgh.harvard.edu/purcell/plink/). Eight subjects were removed due to either a low sample call rate (<94%), an unintentional sample duplication event or a sample switch event. The average sample call rate in the remaining 344 subjects was 99.2%.

To evaluate the magnitude of LD, haplotype diversity and power for genetic association studies, we obtained the genotype data of the International HapMap Project (phase 2; release 23).^{22, 23} This data set contains information on 3.96 million SNPs and includes samples from 30 CEPH trios (CEU) from Utah, USA, with European ancestry; 30 Yoruban (YRI) trios from Ibadan, Nigeria; 45 unrelated Japanese subjects from Tokyo, Japan (JPT) and 45 unrelated Han Chinese from Beijing, China (CHB).

After filtering out SNPs with more than 5% missing data across samples and SNPs that were not in Hardy–Weinberg equilibrium in at least one of the analysis panels (P-value from exact test <0.001), and after removing three SNPs with allele coding errors, we obtained a subset of 102 208 SNPs that were typed in both the Saami and the HapMap samples. The median distance between adjacent SNPs was 10.1 kb and the first and third quartiles were 1.0 and 31.2 kb, respectively. Mean intermarker distance was 28.1 kb. After excluding SNPs that were monomorphic in at least one of the populations, 76 913 SNPs were shared between panels. The median intermarker distance for this map was 14.0 kb, with first and third quartiles 1.6 and 40.9 kb, respectively. Mean distance was 37.4 kb. NCBI build 36 coordinates were used throughout.

Evaluation of potential for genetic association studies

Estimation of genome-wide pairwise identity-by-descent (IBD) sharing, using a method of moments approach implemented in PLINK,²¹ revealed a substantial degree of undocumented relatedness among the Saami participants. Therefore, a subset of 100 maximally unrelated subjects was selected for the analysis with the aid of PedMine, which implements a simulated annealing algorithm²⁴ (http://www.hg.med.umich.edu/labs/douglaslab/software.html). Within this subset, the maximum estimated genome-wide proportion of alleles shared IBD was 0.045, suggesting that subjects were not more closely related than second cousins. The sample size of 100 was chosen in order to have a sample that was roughly comparable in size to each of the HapMap reference panels.

We next compared minor allele frequency (MAF) distributions between Saami and HapMap panels. Due to their genetic similarity, the two Asian HapMap panels (JPT+CHB) were merged for all analyses. We calculated the correlation between allele frequency estimates for an arbitrarily chosen reference allele for 102 208 SNPs in the Saami sample and the CEU panel, and investigated whether there were instances of SNPs with MAF<5% in one sample and >10% in the other.

To evaluate the relative extent of LD we compared genome-wide average LD decay with genomic distance between the panels for common SNPs (MAF≥5% within panel). For each distance bin we calculated the proportions of SNP pairs with r² and ∣D′∣≥0.80. In addition we used a sliding window approach: the averages of the LD measures r² and ∣D′∣ were calculated for all SNPs within 500 kb from each other in sliding windows of 1.7 Mb (1.6-Mb overlap between adjacent windows). This choice of window size and SNP distance allows comparison with other studies.⁸ For this latter analysis we used the set of SNPs that were polymorphic in all panels. The LD statistics r² and ∣D′∣ were calculated using Haploview²⁵ (http://www.broad.mit.edu/mpg/haploview/) and further calculations were performed in R (http://www.r-project.org).

To compare long-range haplotype diversity in the Saami with the HapMap panels, we used the approach described by Service et al.⁸ In brief, the genome was divided into 1-Mb segments and segments containing ∼35 SNPs (range 30–40 SNPs) were retained for analysis. For each segment we inferred haplotypes and estimated their population frequencies using Haploview. Next we ranked haplotypes from common to rare, counted the number of haplotypes accounting for a given percentage of chromosomes and subsequently averaged this number over all segments. These calculations were performed in R.

Next, using PLINK we compared the abundance and length of extended regions of homozygosity (ROHs). The criteria used to identify such regions were length>1 Mb, at least 100 consecutive homozygous SNPs, with a density of minimum one SNP per 50 kb and no gap >1 Mb allowed. Also inbreeding coefficients were estimated as described by Purcell et al²¹ using a subset of SNPs in approximate linkage equilibrium (∼50 000 SNPs).

The relative power for genetic association studies in the Saami compared with that in the HapMap panels was quantified by determining the percentage of SNPs (of all ∼100 000) that had a proxy within 200 kb and within 2 Mb. The LD measure r² was used to measure the correlation and its cut-off was varied along its entire range.

Results

A comparison of the allele frequency distribution in the Saami with those in the HapMap panels indicates that the impact of the SNP ascertainment strategies is negligible (Figure 1). The MAF distribution in the Saami is very similar to that in the CEU panel, with a marginally higher proportion of SNPs with MAF<5%. The Pearson correlation between the allele frequency estimates in the Saami and those in the CEU panel was 0.97 (Figure 2). Using this set of 102 208 SNPs typed in both the Saami and HapMap panels, we also investigated whether there were instances of SNPs with MAF<5% in one sample and >10% in the other. Of 20 221 SNPs with MAF<5% in the CEU panel, we observed 1554 SNPs (7.7%) with frequency estimates ranging from 10.1 to 54.2% in the Saami. Conversely, out of 21 944 SNPs with MAF<5% in the Saami sample, 1453 SNPs (6.6%) with frequency estimates ranging from 10.2 to 31.7% were observed in the CEU panel.

Next, using a sliding windows approach we compared the extent and patterns of LD along the entire genome between the different populations. Figure 3 shows an example comparison for chromosome 18. It can be seen that, on average, the LD measures r² and ∣D′∣ are almost consistently higher in the Saami as compared with that in the CEU sample. This was most pronounced for ∣D′∣. This same pattern was observed for the other chromosomes (not shown). The distribution of LD for the CHB+JPT sample was very similar to that for the CEU sample (not shown).

To investigate whether high LD is more frequent in the Saami, we compared the proportions of SNP pairs with r² or ∣D′∣≥0.80 for a range of genomic distance bins for common SNPs (MAF≥5% within panel) between the different panels. For r², the most relevant LD measure for genetic association studies because of its relationship with statistical power, the patterns of decay over distance are very similar for the Saami, CEU and CHB+JPT samples, the extent of high LD being marginally higher for distances <50 kb in CHB+JPT as compared with the CEU and Saami samples (Figure 4a). For statistic ∣D′∣, which measures historical recombination, the pattern was quite different. Here the proportion of marker pairs with high ∣D′∣ values was consistently higher in the Saami as compared to the other populations (Figure 4b). This observation can be ascribed to the reduced haplotype diversity as compared to the other populations. Figure 5 compares long-range haplotype diversity in the Saami with the HapMap panels. It can be seen that haplotype diversity is lowest in the Saami and that the diversity in CHB+JPT is lower as compared with that in CEU and YRI. The diversity is the highest in YRI. Among the Saami, a genome-wide average of 76 haplotypes accounted for 95% of the chromosomes, whereas 76 haplotypes accounted for 89% of the chromosomes among CHB+JPT, 79% among CEU and 69% among YRI. Considering the fact that the sample size for the Saami sample is larger than those for the HapMap panels (in particular the CEU and YRI samples, which are trios), the difference in haplotype diversity may even be more pronounced.

The reduced haplotype diversity observed in the Saami is a consequence of their smaller historical population size. This is also reflected in a higher degree of background relatedness in the Saami as evidenced by comparison of the extent of homozygosity with the HapMap reference panels. We looked for extended ROHs in the genome and calculated the total length spanned by such regions. This analysis was performed on a subset of 100 maximally unrelated Saami subjects. Supplementary Figure 1 shows the results for the four analysis panels. Clearly, the Saami population is much more extreme in this respect, apart from three (known) outliers among the HapMap samples. For the Saami, median total ROH length was 61.3 Mb, with first and third quartiles 22.6 and 117.9 Mb, respectively. For CHB+JPT, CEU and YRI, median total length (first quartile; third quartile) was 5.0 Mb (2.5 and 8.6 Mb), 3.4 Mb (1.7 and 6.0 Mb) and 1.7 Mb (0 and 3.4 Mb), respectively. As expected, for the Saami, total ROH length was highly correlated with the inbreeding coefficient (Spearman rank correlation=0.89; Supplementary Figure 2). The median and maximum inbreeding coefficient for the Saami was 0.01 and 0.14, respectively. Note that the inbreeding coefficient estimates were calculated using an estimator based on genome-wide data. Hence, these values are not estimates of the classically defined inbreeding coefficient, which is derived from pedigree information.²⁶

To quantify the relative power for genetic association studies in the Saami compared with that in the HapMap panels, we determined the percentage of SNPs (of all ∼100 000) that had a proxy within 200 kb and within 2 Mb. The LD statistic r² was used to measure the correlation and its cut-off was varied along its entire range. The results for proxies within 2 Mb and within 200 kb were very similar, indicating that there is little long-range LD. Figure 6 (results for 2 Mb) shows that the difference in power between the Saami and the CHB+JPT and CEU panels is negligible. The percentage of SNPs that are highly correlated (r²≥0.80) to one or more others within 2 Mb for the CHB+JPT, CEU, Saami and YRI samples was 46.4, 44.7, 45.5 and 26.9%, respectively.

Discussion

This paper describes the results of an elaborate, genome-wide, SNP-based evaluation of the potential for GWASs for complex traits in the Finnish Saami. We studied the impact of SNP ascertainment strategies on the SNP frequency spectrum, extent and patterns of LD, haplotype diversity, and compared the power for association studies of common variants with that in the HapMap panels.

This study shows that patterns of LD in the Finnish Saami are very similar to those in the CEU and CHB+JPT HapMap reference panels. We found that, on average, the extent of LD is slightly higher in the Saami. Disappointingly, however, for an equivalent number of markers, genomic coverage as measured by the percentage of SNPs having a highly correlated proxy, does not differ much from that in the non-African HapMap panels. These results indicate that the potential for ‘drift mapping’, anticipated based on simulations and limited empirical data, has been greatly overestimated.^{2, 9, 10, 13, 15, 16, 17}

Most of the cited empirical work on the extent of LD in the Saami was based on microsatellite markers.^{2, 10, 15, 16} These studies only report P-values from pairwise significance tests of LD. Kaessmann et al¹³ also studied 50 SNPs in five genes, but only report D′-values. Pritchard and Przeworski²⁷ clarified that r² is the most pertinent measure for genetic association studies because of its direct connection with statistical power. In order to achieve roughly the same power at the marker locus as would be reached if the causative variant itself were tested, the sample size needs to be multiplied by 1/r². The LD measures r² and D′ behave very differently, and low values of r² can be consistent with high values of D′. Indeed, our results demonstrate a relatively high extent of LD in the Saami, as measured by D′. However, this does not translate to elevated values for r², that is, a power advantage. Our findings, thus, are not inconsistent with earlier studies reporting high values for D′.

Our findings, however, are in contrast with those of Johansson et al¹⁷ who reported dramatically elevated levels of LD as measured by r² and a different basic LD structure. This latter study was based on array-based SNP discovery in a 4.4-Mb region of only 28 phased copies of chromosome 21 in the Swedish Saami. An explanation for this discrepancy could be that their region studied may not be representative for the rest of the genome. This is unlikely, however, based on the evidence that Johansson et al¹⁷ provide. Alternatively, genetic population substructure may have led to strong but artefactual LD. Recent SNP-based population genetic studies have exposed significant population substructure in Finnish early- and late-settlement subpopulations that correlates with geography.^{28, 29} Genetic substructure within the Saami is very likely given their linguistic and cultural diversity. Indeed, we found evidence for substructure within the Finnish Saami (results not shown). However, LD evaluation within subsets of our data defined by municipality, suggests that this did not appreciably affect LD estimates (results not shown). In general, genetic differentiation probably has to be extreme in order to affect LD estimates. The most plausible explanation is provided by Terwilliger and Weiss³⁰ who show that a limited sample size can lead to upward biases of LD estimates. Johansson et al¹⁷ further state that there are serious limitations in the transferability of common tagSNPs between the HapMap and the Saami. The comparison they made, however, was unfair because the evaluation was not performed on the set of SNPs from which the tagSNPs were defined, a fact they acknowledge in their discussion. Based on their results, they suggest a difference in the basic correlation structure between Saami and the CEU HapMap population. Here, we show that this is not the case.

The pioneering simulation studies on the impact of demography on LD were based on multiallelic markers and only considered pairwise significance tests of LD.^{9, 10} Therefore, based on their results, little can be concluded about a power advantage for genetic association studies. Of course, real human populations differ from the idealized hypothetical populations upon which theoretical predictions are based. The precise demographic history of the Saami is uncertain. Pritchard and Przeworski²⁷ already noted that the levels of genetic diversity at the microsatellite loci studied by Laan and Pääbo² argue against the hypothesis of a very small population size. It could, thus, be that the effect of genetic drift on LD is minimal in the contemporary Saami population. In addition, in the absence of reference data for the Finns and other neighboring populations in Fennoscandia, admixture with those populations could not be investigated. This presents a limitation of our study.

In conclusion, our results indicate that the HapMap is a useful resource for genetic studies in the Finnish Saami. Imputation of untyped common SNPs, using a scaffold of LD relationships derived from the HapMap CEU panel, should be applicable in the Saami. We found that the power to detect common susceptibility alleles for common complex diseases in the Saami is similar to that in most other non-African populations. Thus, if the aim is to identify this class of variants, it seems that not much can be gained by conducting a genome-wide association scan or a candidate gene study in the Finnish Saami. This is especially true if one considers that several thousands of subjects need to be ascertained in order to reach sufficient power for detecting the typically small effect sizes for complex diseases.³¹ It seems unrealistic to recruit this large number of subjects in the Saami population. Furthermore, a higher extent of relatedness among subjects due to the smaller population size compromises the validity of classical statistical methods and requires more dedicated methodology^{32, 33} as compared with using an outbred population.

Recent studies demonstrate that complex disease susceptibility allele frequencies range from rare to common and that multiple rare variants within a single gene may contribute to common complex traits.³⁴ Since the majority of rare SNPs are not represented on the early-generation SNP arrays, and since their study would require very large sample sizes, we can only draw very limited conclusions on the rare end of the SNP allele frequency spectrum. Our results indicate that some alleles that are rare in the CEU panel have drifted to frequencies in the Saami that would make them detectable in an association study. As usually, rare mutations arise on a different haplotypic background, the reduced haplotype diversity in the Saami may imply reduced allelic and genetic heterogeneity. Hence, in certain cases, the power for detecting associations with rare variants could well be higher in the Saami.

References

Peltonen L, Jalanko A, Varilo T : Molecular genetics of the Finnish disease heritage. Hum Mol Genet 1999; 8: 1913–1923.
Article CAS PubMed Google Scholar
Laan M, Pääbo S : Demographic history and linkage disequilibrium in human populations. Nat Genet 1997; 17: 435–438.
Article CAS PubMed Google Scholar
Wright AF, Carothers AD, Pirastu M : Population choice in mapping genes for complex diseases. Nat Genet 1999; 23: 397–404.
Article CAS PubMed Google Scholar
Eaves IA, Merriman TR, Barber RA et al: The genetically isolated populations of Finland and Sardinia may not be a panacea for linkage disequilibrium mapping of common disease genes. Nat Genet 2000; 25: 320–323.
Article CAS PubMed Google Scholar
Peltonen L, Palotie A, Lange K : Use of population isolates for mapping complex traits. Nat Rev Genet 2000; 1: 182–190.
Article CAS PubMed Google Scholar
Kristiansson K, Naukkarinen J, Peltonen L : Isolated populations and complex disease gene identification. Genome Biol 2008; 9: 109.
Article PubMed PubMed Central Google Scholar
Bonnen PE, Pe′er I, Plenge RM et al: Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia. Nat Genet 2006; 38: 214–217.
Article CAS PubMed Google Scholar
Service S, DeYoung J, Karayiorgou M et al: Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies. Nat Genet 2006; 38: 556–560.
Article CAS PubMed Google Scholar
Slatkin M : Linkage disequilibrium in growing and stable populations. Genetics 1994; 137: 331–336.
CAS PubMed PubMed Central Google Scholar
Terwilliger JD, Zöllner S, Laan M, Pääbo S : Mapping genes through the use of linkage disequilibrium generated by genetic drift: ‘drift mapping’ in small populations with no demographic expansion. Hum Hered 1998; 48: 138–154.
Article CAS PubMed Google Scholar
Sajantila A, Lahermo P, Anttinen T et al: Genes and languages in Europe: an analysis of mitochondrial lineages. Genome Res 1995; 5: 42–52.
Article CAS PubMed Google Scholar
Lahermo P, Sajantila A, Sistonen P et al: The genetic relationship between the Finns and the Finnish Saami (Lapps): analysis of nuclear DNA and mtDNA. Am J Hum Genet 1996; 58: 1309–1322.
CAS PubMed PubMed Central Google Scholar
Kaessmann H, Zöllner S, Gustafsson AC et al: Extensive linkage disequilibrium in small human populations in Eurasia. Am J Hum Genet 2002; 70: 673–685.
Article CAS PubMed PubMed Central Google Scholar
Sajantila A, Pääbo S : Language replacement in Scandinavia. Nat Genet 1995; 11: 359–360.
Article CAS PubMed Google Scholar
Laan M, Pääbo S : Mapping genes by drift-generated linkage disequilibrium. Am J Hum Genet 1998; 63: 654–656.
Article CAS PubMed PubMed Central Google Scholar
Johansson A, Vavruch-Nilsson V, Edin-Liljegren A, Sjolander P, Gyllensten U : Linkage disequilibrium between microsatellite markers in the Swedish Sami relative to a worldwide selection of populations. Hum Genet 2005; 116: 105–113.
Article CAS PubMed Google Scholar
Johansson A, Vavruch-Nilsson V, Cox DR, Frazer KA, Gyllensten U : Evaluation of the SNP tagging approach in an independent population sample – array-based SNP discovery in Sami. Hum Genet 2007; 122: 141–150.
Article PubMed Google Scholar
Fransen E, Topsakal V, Hendrickx JJ et al: Occupational noise, smoking, and a high body mass index are risk factors for age-related hearing impairment and moderate alcohol consumption is protective: a European population-based multicenter study. J Assoc Res Otolaryngol 2008; 9: 264–276.
Article PubMed PubMed Central Google Scholar
Van Laer L, Van Eyken E, Fransen E et al: The grainyhead like 2 gene (GRHL2), alias TFCP2L3, is associated with age-related hearing impairment. Hum Mol Genet 2008; 17: 159–169.
Article CAS PubMed Google Scholar
Huyghe JR, Van Laer L, Hendrickx JJ et al: Genome-wide SNP-based linkage scan identifies a locus on 8q24 for an age-related hearing impairment trait. Am J Hum Genet 2008; 83: 401–407.
Article CAS PubMed PubMed Central Google Scholar
Purcell S, Neale B, Todd-Brown K et al: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007; 81: 559–575.
Article CAS PubMed PubMed Central Google Scholar
The International HapMap Consortium: The International HapMap Project. Nature 2003; 426: 789–796.
Article Google Scholar
The International HapMap Consortium: A second generation human haplotype map of over 3.1 million SNPs. Nature 2007; 449: 851–861.
Article PubMed Central Google Scholar
Douglas JA, Sandefur CI : PedMine – a simulated annealing algorithm to identify maximally unrelated individuals in population isolates. Bioinformatics 2008; 24: 1106–1108.
Article CAS PubMed Google Scholar
Barrett JC, Fry B, Maller J, Daly MJ : Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005; 21: 263–265.
Article CAS PubMed Google Scholar
Malécot G : Les Mathématiques de l′Hérédité. Paris: Masson, 1948.
Google Scholar
Pritchard JK, Przeworski M : Linkage disequilibrium in humans: models and data. Am J Hum Genet 2001; 69: 1–14.
Article CAS PubMed PubMed Central Google Scholar
Jakkula E, Rehnstrom K, Varilo T et al: The genome-wide patterns of variation expose significant substructure in a founder population. Am J Hum Genet 2008; 83: 787–794.
Article CAS PubMed PubMed Central Google Scholar
Salmela E, Lappalainen T, Fransson I et al: Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe. PLoS ONE 2008; 3: e3519.
Article PubMed PubMed Central Google Scholar
Terwilliger JD, Weiss KM : Linkage disequilibrium mapping of complex disease: fantasy or reality? Curr Opin Biotechnol 1998; 9: 578–594.
Article CAS PubMed Google Scholar
McCarthy MI, Abecasis GR, Cardon LR et al: Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet 2008; 9: 356–369.
Article CAS PubMed Google Scholar
Yu J, Pressoir G, Briggs WH et al: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 2006; 38: 203–208.
Article CAS PubMed Google Scholar
Amin N, van Duijn CM, Aulchenko YS : A genomic background based method for association analysis in related individuals. PLoS ONE 2007; 2: e1274.
Article PubMed PubMed Central Google Scholar
Cohen JC, Kiss RS, Pertsemlidis A, Marcel YL, McPherson R, Hobbs HH : Multiple rare alleles contribute to low plasma levels of HDL cholesterol. Science 2004; 305: 869–872.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We express our most sincere gratitude to all the Saami volunteers who have participated in this study. This paper benefited from comments of two anonymous referees. This work was funded by the European Community (fifth Framework project QLRT-2001-00331), by the University of Antwerp (TOP project), by the Research Foundation – Flanders (FWO grant G.0163.09) and by the State of Arizona. JRH is a fellow of the Research Foundation – Flanders (FWO).

Author information

Alana Lysholm-Bernacchi: In loving memory

Authors and Affiliations

Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
Jeroen R Huyghe, Erik Fransen, Lut Van Laer, Els Van Eyken & Guy Van Camp
Department of Otorhinolaryngology, University of Oulu, Oulu, Finland
Samuli Hannula, Elina Mäki-Torkko & Martti Sorri
Neurogenomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
Alana Lysholm-Bernacchi, Dietrich A Stephan & Matthew J Huentelman
Thule Institute, University of Oulu, Oulu, Finland
Pekka Aikio

Authors

Jeroen R Huyghe
View author publications
You can also search for this author in PubMed Google Scholar
Erik Fransen
View author publications
You can also search for this author in PubMed Google Scholar
Samuli Hannula
View author publications
You can also search for this author in PubMed Google Scholar
Lut Van Laer
View author publications
You can also search for this author in PubMed Google Scholar
Els Van Eyken
View author publications
You can also search for this author in PubMed Google Scholar
Elina Mäki-Torkko
View author publications
You can also search for this author in PubMed Google Scholar
Alana Lysholm-Bernacchi
View author publications
You can also search for this author in PubMed Google Scholar
Pekka Aikio
View author publications
You can also search for this author in PubMed Google Scholar
Dietrich A Stephan
View author publications
You can also search for this author in PubMed Google Scholar
Martti Sorri
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J Huentelman
View author publications
You can also search for this author in PubMed Google Scholar
Guy Van Camp
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guy Van Camp.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies the paper on European Journal of Human Genetics website

Supplementary information

Supplementary Figure 1 and 2 (PDF 47 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Huyghe, J., Fransen, E., Hannula, S. et al. Genome-wide SNP analysis reveals no gain in power for association studies of common variants in the Finnish Saami. Eur J Hum Genet 18, 569–574 (2010). https://doi.org/10.1038/ejhg.2009.210

Download citation

Received: 03 June 2009
Revised: 06 October 2009
Accepted: 20 October 2009
Published: 25 November 2009
Issue Date: May 2010
DOI: https://doi.org/10.1038/ejhg.2009.210

Keywords

This article is cited by

A genome-wide analysis of population structure in the Finnish Saami with implications for genetic association studies
- Jeroen R Huyghe
- Erik Fransen
- Guy Van Camp
European Journal of Human Genetics (2011)
A genome-wide association study for age-related hearing impairment in the Saami
- Lut Van Laer
- Jeroen R Huyghe
- Guy Van Camp
European Journal of Human Genetics (2010)

Genome-wide SNP analysis reveals no gain in power for association studies of common variants in the Finnish Saami

Abstract

Similar content being viewed by others

FinnGen provides genetic insights from a well-phenotyped isolated population

Population-specific long-range linkage disequilibrium in the human genome and its influence on identifying common disease variants

The Newfoundland and Labrador mosaic founder population descends from an Irish and British diaspora from 300 years ago

Introduction