Family-based association tests for quantitative traits using pooled DNA

Bader, Joel S; Sham, Pak

doi:10.1038/sj.ejhg.5200893

Download PDF

Article
Published: 03 December 2002

Family-based association tests for quantitative traits using pooled DNA

Joel S Bader¹ &
Pak Sham^2,3

European Journal of Human Genetics volume 10, pages 870–878 (2002)Cite this article

411 Accesses
8 Citations
Metrics details

Abstract

Interest in whole-genome QTL mapping has spurred efforts to reduce the cost of studies now based primarily on individual genotyping. Pooled DNA tests are a possible solution, and understanding how measurement error affects test power could assist in study design. Here we describe pooled tests explicitly optimised for measurement error, including family-based tests robust to population stratification. Our results suggest that pooled DNA whole-genome screens may be feasible with current instruments.

Genome-wide association studies

Article 26 August 2021

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Introduction

Association tests of outbred populations may have greater power than linkage analysis to identify the genetic variants contributing to complex human diseases.^1,2,3,4 While single-nucleotide polymorphisms (SNPs) occur at sufficient density to provide a suitable marker set,^5,6,7,8,9,10 individual genotyping remains costly. One method to reduce cost is to pool DNA from individuals with extreme phenotypic values and to measure the allele frequency difference between pools.^{11,12,13,14,15,16,17} The power of pooled tests has been investigated for case–control studies.¹⁸ More recently, pooled tests have been discussed for quantitative traits. In the absence of experimental error, the optimal design for an unrelated population is to compare frequencies between pools of the most extreme 27% of individuals ranked by phenotypic value, retaining 80% of the information of individual genotyping.¹⁹ This result has been obtained more generally in the context of optimal inefficient statistics.²⁰ Experimental sources of error, primarily allele frequency measurement error, degrade the test power.²¹ Recent applications^22,23 suggest that typical absolute measurement errors are 1–4%.

Population stratification poses a second challenge to pooled tests. Genomic control methods, developed to reduce stratification effects in genotype-based association tests,^{24,25,26,27,28} are not directly applicable to pooled tests.

Here we present optimized pooled DNA test designs, including family-based tests robust to stratification. Estimates of test power explicitly consider allele frequency measurement error. This distinguishes our treatment from prior theoretical work, permits the optimization of test design as a function of known parameters, and provides a bridge to experimentalists seeking practical guidance for whether to attempt and how to perform pooled association tests.

Methods

Sampling variance and concentration variance

Let p_i represent the frequency of allele A₁ for individual i, either 0, 1/2, or 1, and c_i represent the concentration of DNA contributed by this individual to a pool of n individuals. The allele frequency p^* for the pool is

which defines the relative concentration error . The terms δp_i and are uncorrelated, and each has expectation zero. Furthermore, the sum of the terms is constrained to be zero. The variance of p^* is

We have used

with the concentration coefficient of variation defined as τ≡[Var(c_i)]^1/2/c₀ and the genotypic correlation between a pair of individuals defined as r_ij.

For the between-family design, a pool of n individuals contains n/s sibships of size s and genotypic correlation r, and

with R=(1/s)[1+(s−1)r]. Since the individuals in the upper and lower pools are unrelated, V_s+V_c=2Var(p^*).

For a within-family design, the allele frequency difference between pools is

where i and j label individuals in the upper and lower pools respectively, and

Expected allele frequency difference and non-centrality parameter

The genotype-dependent phenotype distribution is defined using a variance components model,

Family and individual effects are normally distributed with mean zero and variance

The family index is k, the sib index is i, and the individual phenotypes X_ki are the sum of Y_k, the family effect excluding the QTL, Y_ki, the individual effect excluding the QTL, and μ_ki, the QTL effect μ(G_ki) for sib i with genotype G_ki. The total phenotypic correlation between sibs is t. Both r and u relate to the genetic background shared between sibs, r being the genotypic correlation (1 for monozygotic twins, 1/2 for full sibs, 1/4 for half sibs) and u being the shared genotype expectation (1 for monozygotic twins, 1/4 for full sibs, 0 for half sibs).²⁹

The phenotypic values X_ki and QTL effects μ_ki are re-expressed as family means and individual deviations from family means,

The phenotypic variances excluding QTL effects are

When the QTL effects are small, T≈(1/s)[1+(s−1)t].

The probability that sibling 1 from family κ with genotypes G=(G₁,G₂,…,G_s) is selected for the upper pool is 1−Φ[(X′−μ_G)/σ], where Φ(z) is the cumulative normal probability. The variable X under selection (with selection threshold X′), the QTL contribution μ_G, and σ²≡Var(X−μ_G) depend on pooling design. For between-family pools, these are , , and ; for within-family pools, δX_k1, δμ_k1, and . Because the labeling of sibs is arbitrary, the fraction f of individuals selected for the upper pool is equal to the probability that sib 1 is selected,

where Pr(G) is the probability of observing the sibship genotypes G. Numerical inversion provides X′ as a function of f. When the QTL effect is small (μ_G<σ), the linear approximation

is accurate, where φ(z)=dΦ(z)/dz is the normal probability density. This approximation yields f=1−Φ(X′/σ) because the terms linear in μ_G cancel in the sum over G.

The expected allele frequency of the upper pool is

where p_G represents the allele frequency of sib 1. Using the linear expansion for Φ[(X′−μ_G)/σ] yields

An analogous expression for the lower pools gives a symmetric result, yielding

where X′/σ has been replaced by Φ⁻¹(1−f).

The expectation of the correlation between p and μ for an individual is

Similarly, the correlation between sibs i and j is E(p_iμ_j)=r_ijσ_pσ_A, where r_ij is their genotypic correlation. Summing over sibs yields either Rσ_pσ_A (between-family pools) or (1−R)σ_pσ_A (within-family pools) for E(p_Gμ_G), with R=(1/s)[1+(s−1)r] as before.

Selecting discordant-like sib-pairs is equivalent to selection based on |δX_ki|, and the within-family analytical results are directly applicable. For larger families, discordant-like families are pre-selected in decreasing rank order of the within-family phenotypic variance summed over siblings s.

We have ascertained that the analytical results for the NCP are virtually indistinguishable from exact numerical results when the QTL effect is 5% or less of the trait variance. For larger effects, roughly when the effect size σ_A² approaches the minor allele frequency, the genotype-dependent phenotype distributions become resolved, transforming a complex trait into Mendelian trait amenable to traditional linkage analysis.

Analytical fit for the optimal pooling fraction

Optimizing the pooling fraction is equivalent to maximizing the objective function I=2y²/(f+f²κ²), where y is shorthand for φ[Φ⁻¹(1−f)]. Writing f as 1-Φ(z) and optimizing using dI=dz=0 yields

We have used y=φ(z), dy/dz=−yz, and df/dz=−y.

When κ² is large, z is also large, and f may be replaced by its asymptotic expansion for large z, f=y · (z⁻¹−z⁻³). With this substitution, the optimum satisfies

Taking the natural logarithm of both sides and equating exponents,

When κ and z are both large, the term 3 ln z is asymptotically small, giving

An improved fit is obtained by perturbation theory by writing

where . Substituting this expression for z into L(z) and simplifying,

which gives the asymptotic form b=(3/B²) ln B, or

For clarity, the functional dependence of B and b on κ has been suppressed.

Since the asymptotic behavior for large κ is not affected by introducing terms of lower order in κ, the fit can be improved for small κ without degrading the fit at large κ by writing

The constants a₁, a₂, and a₃ are then selected to fit the exact numerical results at particular values of κ. Fitting the results z=0.612 at κ=0 and z=0.8047 at κ=1 provides the particular parameters

Results

Consider a population of N/s families, each a sibship of size s (N total individuals). The genotypic correlation within a sibship is denoted r, r=1/4, 1/2, and 1 for half-sibs, full-sibs, and monozygotic twins, respectively. Sibships may also represent inbred lines, r being the the genetic correlation within each line. Sibs in different families are assumed to have uncorrelated genotypes.

To conduct a pooled DNA test for association of a particular allele A₁ with a quantitative trait, individuals are selected for an upper pool, comprising higher phenotypic values, and a lower pool, comprising lower phenotypic value, similar to designs for optimizing breeding value and for QTL mapping.^30,31,32,33 We restrict attention to balanced designs: each pool has fN individuals, with f⩽0.5 defined as the pooling fraction. Balanced designs are favored when high and low phenotypes are treated symmetrically.²¹

We consider four designs: (i) unrelated individuals (s=1), in which the fN individuals having highest and lowest phenotypic values are selected for the upper and lower pools respectively; (ii) between-family, in which all s sibs from the fN/s families having highest and lowest mean phenotypic values are selected for the upper and lower pools; (iii) within-family, in which the s′ sibs having highest and lowest phenotypic values within each family are selected for the upper and lower pools, with f=s′/s; (iv) within-family with pre-selection of discordant families, in which a fraction f ′ of families with greatest within-family phenotypic variance are selected, where X_s is the phenotype of sib s and \(\overline{X}\) is the family mean, then the extreme high and low sib within each selected family are selected for the upper and lower pool, with f=f ′/N.

A suitable statistic for a two-sided test for each design is

where the estimated frequencies of allele A₁ in the upper and lower pools are denoted and . The denominator is . The sampling variance V_s represents the unavoidable error in estimating the allele frequency frequency from a finite sample. The concentration variance V_C arises from sample-to-sample DNA concentration variance within a pool. The measurement variance is V_M=2ε², where ε is the experimental allele frequency measurement error for each pool. We assume that the three sources of variation are independent, justified when DNA samples are treated uniformly. Other sources of error, for example errors arising from unequal amplification of alleles, may also be included in this statistical framework.³⁴

Under the null hypothesis, Z² has a χ² distribution with one degree of freedom. Under the alternate hypothesis, the tested marker is assumed to be a bi-allelic quantitative trait locus (QTL) with alleles A₁ and A₂ occurring at frequencies p and (1−p)≡q. For between-family tests, the alleles are also assumed to be in Hardy–Weinberg equilibrium in a random-mating population. The variance of the allele frequency per individual is , and the estimated allele frequency is . The estimated variance of the allele frequency per individual, denoted , equals .

The mean phenotypic effects for genotypes G=A₁A₁, A₁A₂, and A₂A₂ are m_G=a,d and −a, respectively. The dominance ratio d/a describes the inheritance mode with values −1, 0, and 1 for pure recessive, additive, or dominant inheritance. The proportion of trait variance accounted for by the QTL is denoted ,

The mean QTL effect is m=(p−q)a+2pqd. Phenotypic values are assumed to be normally distributed for each genotype with mean μ_G=m_G−m and residual variance arising from other genetic and environmental factors. The distribution of phenotypic values in the population is a mixture of three normal distributions with overall mean 0 and variance 1. The total phenotypic correlation between sibs from genetic factors (including the QTL) and environmental factors is denoted t.

The non-centrality parameter (NCP),

measures the information provided by a pooled DNA test. The notation is the expectation of an observable . Below we evaluate the NCP numerator, providing accurate analytical results when possible and simulation results otherwise. We calculate the NCP denominator analytically for the null hypothesis. For the alternative hypothesis, the expected allele frequencies for each pool are displaced symmetrically from p to p±δp (see Methods), and the value of the denominator decreases by a small value proportional to (δp/p)². We make a conservative approximation by ignoring this change and using the null hypothesis denominator throughout. The NCP then equals (z_α/2−z_1−β)², where α and β are the type I and II error rates for the two-sided test and z_γ≡Φ⁻¹(1−γ) with Φ the cumulative normal probability. Maximising the NCP optimises the test.

The denominator of the NCP (see Methods) is

where τ is the coefficient of variation for DNA concentration; relates family-based genotypic variance components to pairwise correlations; and J is 1 for pools of unrelated individuals, sR for the between-family design, and (1–r) for both within-family designs. Typically τ is less than 10% and τ² may be ignored relative to J. The term κ, denoted the scaled measurement error, is defined

and is independent of QTL effect.

The numerator of the NCP is (see Methods)

where φ(z) is the normal density and F is 1 for pools of unrelated individuals, R²/T for between-family pools, and (1–R)²/(1–T) for within-family pools without pre-selection. For the within-family design using discordant-like pre-selection, F=(1–r)²/2(1–t) for sib-pairs (expressions for larger sibships are unwieldy). The term R is defined above, and relates family-based phenotypic variance components to pairwise correlations.

The resulting analytical result for the NCP, valid for small QTL effect, is

The first of the three factors is identical to the NCP for an association test performed by individual genotyping on a population of N unrelated individuals; the second factor, with τ=0, is the correction for individual genotyping a population of N/s families each having s sibs and then performing either a between-family test, with F/J=R/sT, or a within-family test, with F/J=(s−1)R/s(1−T). The third factor represents the fraction of information retained when the association test is performed by pooling instead of individual genotyping, and maximising this factor with respect to f provides the optimal pooling fraction. With no measurement error, κ=0, tests are optimised with f=0.27 and 80% of the information is retained.¹⁹ As ε increases, the maximum information that can be retained is determined entirely by the single collective term κ.

Expressions for F, J, and κ² are summarised in Table 1, and we now provide examples of each family-based design. Results for between-family designs are depicted in Figure 1 for populations of sib-quads, sib-pairs, and unrelated individuals, each population having 1000 total individuals. The optimal pooling fraction, indicated by an arrow, shifts to lower values as the number of sibs per family decreases. The optimal fraction and the information retained also shift to lower values as the minor allele frequency decreases, with results shown for frequencies 0.1 and 0.01. The raw measurement error is 0.01, and the pooling fraction and information retained would decrease for larger ε (see Figure 4 for examples of changing ε).

Table 1 The non-centrality parameter for family-based pooled DNA designs^a

Full size table

For within-family designs, the optimal pooling fraction (top panel) and information retained (bottom panel) are shown in Figure 2 as a function of κ for sibship sizes of 2–5, 6, 8, 16 and 32. For sibships through 5, it is always optimal to select just the highest and lowest sib. For larger families and small measurement error, the top and bottom quarters of the sibs are pooled and 80% of the information is retained. The pooling fraction and information retained decrease as the scaled measurement error increases.

Within-family tests can be improved by pre-selection of discordant-like families. In Figure 3, the optimal fraction of families to select (top panel) and information retained (bottom panel) are displayed for sibship sizes 2 through 6 as a function of the scaled measurement error κ (results from computer simulation). The pooling fraction and information retained decrease as κ increases. Pre-selection has the greatest benefit for sib-pairs: for the smallest values of κ, only 56% of families are selected, retaining 80% of the information; had all families been used, only 60% of the information would have been retained. Pre-selection is less beneficial for trios and larger sibships.

In Figure 4, the optimal pooling fraction (top panel) and information retained (bottom panel) using between-family pools and within-family pools with discordant-like pre-selection are displayed for a population of 500 sib-pairs (1000 individuals) as a function of the raw measurement error ε. Results are shown marker frequencies 0.5 and 0.01. With no measurement error, the optimal pooling fraction of 0.27 retains 80% of the information in each case. As measurement error increases, the optimal pooling fraction and information retained both decrease.

The information loss increases for rarer alleles and is worse for the within-family test than for the between-family test. This behaviour can be deduced from the scaled error κ², which is inversely proportional to the allele frequency sampling variance. Since the sampling variance is 3× smaller within-family vs. between-family, κ² is 3× larger, 4Nε²/p(1−p) vs. 4Nε²/3p(1−p), and more information is lost. The inverse dependence of κ² on minor allele frequency explains the decrease in power for rare alleles.

Because the allele frequency difference between sibs is uncorrelated from their allele frequency mean, the between-family and within-family tests are independent estimators of σ_A even when individuals contribute their DNA under both designs. The NCP of a combined test is the sum of the NCPs for each test and it too follows a χ² distribution with 1 degree of freedom. In practice, estimates for σ_A may be obtained by inverting the expressions for provided in Table 1, then weighting each estimator by the inverse of its variance.

Population stratification may be indicated by a difference between the estimates for σ_A from a between-family and within-family test. In the absence of stratification, the difference follows a normal distribution with variance

where the ‘+’ and ‘−’ subscripts refers to the between-family and within-family designs respectively, , and V represents the total variance, V_S+V_C+V_M, for each design. When stratification is indicated, the between-family estimate of _A may be unreliable but the within-family estimate remains robust.

A universal calibration curve for pooled test design is provided in Figure 5, with the optimal pooling fraction (top panel) and information retained (bottom panel) displayed as a function of κ. An accurate analytical fit to the numerically exact results is (see Methods)

The local maxima of the pooling fraction fitting error f_fit−f_exact occur at κ=0.5 (fitting error=+0.006) and at κ=3.5 (fitting error=−0.01). The fitting error for the information retained vanishes on the scale of the figure. The experimental measurement error ε corresponding to the scaled error κ depends on the population structure and marker frequency. For example, for a population of 500 cases, 500 matched unrelated controls, and 10% marker frequency, ε=0.0067κ is the raw error corresponding to κ.

Discussion

Based on the pooled designs described above, we outline a QTL mapping study using 100 000 markers. For 80% power to detect a QTL with 1% additive variance and no more than 100 false-positives from pooled tests (the false-positives may be resolved using individual genotyping), an NCP of 17 is required. We assume pooling of discordant sib-pairs to protect against stratification effects. At the scaled error κ=1 where the pooled tests are still close to maximum power, the pooling fraction would be 21%, 65% of the information of a population would be retained, and a population of 2600 individuals would be required. The raw measurement error corresponding to κ=1 for this population size is 0.005 for an allele with 50% frequency and 0.002 for an allele with 5% frequency, 5× to 10× more precise than achieved by current-day instrumentation.

To account for lower precision, we set κ=10, which from Figure 5 is seen to retain 7.7% of the information and corresponds to a pooling fraction of 1.6%. In this case, the total population size would be 22 000; the precision required for a pooled test would be 0.017 for an allele with 50% frequency and 0.007 for an allele with 5% frequency. This is currently feasible if repeated measures are used to decrease the effective measurement error.

Pooled tests perform worse for within-family tests and rare alleles, and may therefore be difficult to apply to disease-risk variants under negative selection pressure. The loss of power may be less severe for pharmacogenetic studies of variants affecting drug response, where selection pressure is absent, and for test crosses of model organisms or agricultural species whose marker frequencies are under experimental control.

The analysis provided here for quantitative traits may be extended to threshold characters yielding dichotomous classifications of a population. For case–control classification, the disease prevalence corresponds to the pooling fraction f. When the quantitative character is available for measurement, it is approximately 4× more efficient to compare unrelated individuals with extremely high vs extremely low characters than to compare the derived cases vs controls.¹⁹

In summary, we have derived the optimal pooling fractions for within-family and between-family tests of association. With ideal instrumentation, 80% of the information is retained and the optimal pooling fraction is 27%. As allele frequency measurement error increases, the optimal pooling fraction and the information retained both decreases. The information loss is more severe for low-frequency alleles and for within-family tests. The optimal pooling fraction depends on a single parameter representing the measurement error, and a universal calibration curve provides optimised designs as a function of this parameter.

References

Risch N, Merikangas K . The future of genetic studies of complex human diseases Science 1996 273: 1516–1517
Article CAS Google Scholar
Ott J . Analysis of Human Genetic Linkage. 3rd edn Baltimore: Johns Hopkins University Press 1999 pp 306–329
Sham PC, Cherny SS, Purcell S et al. Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data Am J Hum Gen 2000 66: 1616–1630
Article CAS Google Scholar
Ardlie KG, Kruglyak L, Seielstad M . Patterns of linkage disequilibrium in the human genome Nat Rev Genet 2000 3: 299–309
Article Google Scholar
Collins FS, Guyer MS, Chakarvarti A . Variations on a theme: cataloguing human DNA sequence variation Science 1997 274: 1580–1581
Article Google Scholar
Abecasis GR, Noguchi E, Heinzmann A et al. Extent and distribution of linkage disequilibrium in three genomic regions Am J Hum Gen 2001 68: 191–197
Article CAS Google Scholar
Reich DE, Cargill M, Bolk S et al. Linkage disequilibrium in the human genome Nature 2001 411: 199–204
Article CAS Google Scholar
Patil N, Berno AJ, Hinds DA et al. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21 Science 2001 294: 1719–1723
Article CAS Google Scholar
Gabriel SB, Schaffner SF, Nguyen H et al. The structure of haplotype blocks in the human genome Science 2002 296: 2225–2229
Article CAS Google Scholar
Dawson E, Abecasis GR, Bumpstead S et al. A first-generation linkage disequilibrium map of human chromosome 22 Nature 2002 418: 544–548
Article CAS Google Scholar
Barcellos LF, Klitz W, Field LL et al. Association mapping of disease loci, by use of a pooled DNA genomic screen Am J Hum Gen 1997 61: 734–747
Article CAS Google Scholar
Daniels J, Holmans P, Williams N et al. A simple method for analysing microsatellite allele image patterns generated from DNA pools and its applications to allelic association studies Am J Hum Gen 1998 62: 1189–1197
Article CAS Google Scholar
Shaw SH, Carrasquillo MM, Kashuk C et al. Allele frequency distributions in pooled DNA samples: applications to mapping complex disease genes Genome Res 1998 8: 111–123
Article CAS Google Scholar
Stockton DW, Lewis RA, Abboud EB et al. A novel locus for Leber congenital amaurosis on chromosome 14q24 Hum Gen 1998 103: 328–333
Article CAS Google Scholar
Suzuki K, Bustos T, Spritz RA . Linkage disequilibrium mapping of the gene for Margarita Island ectodermal dysplasia (ED4) to 11q23 Am J Hum Gen 1998 63: 1102–1107
Article CAS Google Scholar
Fisher PJ, Turic D, Williams NM et al. DNA pooling identifies QTLs on chromosome 4 for general cognitive ability in children Hum Mol Gen 1999 8: 915–922
Article CAS Google Scholar
Hill L, Craig IW, Asherson P et al. DNA pooling and dense marker maps: a systematic search for genes for cognitive ability Neuroreport 1999 10: 843–848
Article CAS Google Scholar
Risch N, Teng J . The relative power of family-based and case control designs for linkage disequilibrium studies of complex human diseases I. DNA pooling Genome Res 1998 8: 1273–1288
Article CAS Google Scholar
Bader JS, Bansal A, Sham P . Efficient SNP-based tests of association for quantitative phonotypes using pooled DNA Genescreen 2001 1: 143–150
Article Google Scholar
Mosteller F . On some useful ‘inefficient’ statistics Annals of Mathematical Statistics 1946 17: 377–408
Article Google Scholar
Jawaid A, Bader JS, Purcell S et al. Optimal selection strategies for QTL mapping using pooled DNA samples Eur J Hum Gen 2002 10: 125–132
Article CAS Google Scholar
Beutow KH, Edmonson M, MacDonald R et al. High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry Proc Natl Acad Sci USA 2001 98: 581–584
Article Google Scholar
Grupe A, Germer S, Usuka J et al. In silico mapping of complex disease-related traits in mice Science 2001 292: 1915–1918
Article CAS Google Scholar
Devlin B, Roeder K . Genomic control for association studies Biometrics 1999 55: 788–808
Article Google Scholar
Pritchard JK, Rosenberg NA . Use of unliked genetic markers to detect population stratification in association studies Am J Hum Gen 1999 65: 220–228
Article CAS Google Scholar
Pritchard JK, Stephens M, Rosenberg NA et al. Inference of population structure using multilocus genotype data Genetics 2000 155: 945–959
CAS PubMed PubMed Central Google Scholar
Zhang S, Zhao H . Quantitative similarity-based association tests using population samples Am J Hum Gen 2001 69: 601–614
Article CAS Google Scholar
Satten GA, Flanders DW, Yang Q . Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model Am J Hum Gen 2001 68: 466–477
Article CAS Google Scholar
Falconer DS, MacKay TFC . Introduction to quantitative genetics Boston: Addison-Wesley 1996 pp 153
Google Scholar
Hill WG . Design and efficiency of selection experiments for estimating genetic parameters Biomerics 1971 27: 293–311
Article CAS Google Scholar
Kimura M, Crow JF . Effect of overall phenotypic selection on genetic change at individual loci Proc Natl Acad Sci USA 1978 75: 6168–6171
Article CAS Google Scholar
Darvasi A, Soller M . Selective DNA pooling for determination of linkage between a molecular marker and a quantitative trait locus Genetics 1994 138: 1365–1373
CAS PubMed PubMed Central Google Scholar
Ollivier L, Messer LA, Rothschild MF et al. The use of selection experiments for detecting quantitative trait loci Genet Res, Camb 1997 69: 227–232
Article CAS Google Scholar
Le Hellard S, Ballereau SJ, Visscher PM et al. SNP genotyping on pooled DNAs: comparison of genotyping technologies and a semi automated method for data storage and analysis Nucleic Acids Res 2002 30: e74 (electronic preprint)
Article Google Scholar

Download references

Author information

Authors and Affiliations

CuraGen Corporation, 555 Long Wharf Drive, New Haven, CT 06511, Connecticut, USA
Joel S Bader
Department of Psychological Medicine, Institute of Psychiatry, King's College, London, UK
Pak Sham
Social, Genetic and Developmental Psychiatry Research Centre, Institute of Psychiatry, King's College, London, UK
Pak Sham

Authors

Joel S Bader
View author publications
You can also search for this author in PubMed Google Scholar
Pak Sham
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joel S Bader.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bader, J., Sham, P. Family-based association tests for quantitative traits using pooled DNA. Eur J Hum Genet 10, 870–878 (2002). https://doi.org/10.1038/sj.ejhg.5200893

Download citation

Received: 04 December 2001
Revised: 27 August 2002
Accepted: 28 August 2002
Published: 03 December 2002
Issue Date: 01 December 2002
DOI: https://doi.org/10.1038/sj.ejhg.5200893

Keywords

This article is cited by

Interval mapping of quantitative trait loci with selective DNA pooling data
- Jing Wang
- Kenneth J Koehler
- Jack CM Dekkers
Genetics Selection Evolution (2007)
Design and Analysis of Association Studies using Pooled DNA from Large Twin Samples
- Jo Knight
- Pak Sham
Behavior Genetics (2006)
DNA Pooling: a tool for large-scale association studies
- Pak Sham
- Joel S. Bader
- Michael Owen
Nature Reviews Genetics (2002)

Family-based association tests for quantitative traits using pooled DNA

Abstract