Detecting non-multiplicative genotype relative risks from transmissions of parental alleles to affected children

Ayres, Karen L.; Curnow, Robert N.

doi:10.1007/s10038-004-0217-5

Download PDF

Short Communication
Published: 01 January 2005

Detecting non-multiplicative genotype relative risks from transmissions of parental alleles to affected children

Karen L. Ayres¹ &
Robert N. Curnow¹

Journal of Human Genetics volume 50, pages 46–48 (2005)Cite this article

309 Accesses
Metrics details

Abstract

The differential transmission of alleles from parents to affected children indicates that the locus under investigation is either directly involved in the occurrence of the disease or that there are allelic associations with other loci that are directly involved. Conditional logistic regression applied to a diallelic locus leads to a test with two degrees of freedom. The power of a single degree of freedom test to detect non-multiplicative allelic effects is discussed here.

Liability threshold modeling of case–control status and family history of disease increases association power

Article 20 April 2020

Genetic correlations of polygenic disease traits: from theory to practice

Article 06 June 2019

Mendelian randomization

Article 10 February 2022

The differential transmission rates of alleles from heterozygous parents to children affected by a disease can provide information about the likely location and mode of action of genes affecting susceptibility to that disease. Conditional logistic regression (CLR), first proposed by Self et al. (1991), is based on a comparison of the frequency of pairs of alleles inherited by affected children with the other combinations of the alleles that they might have inherited. Here we consider the power associated with fitting the second term in the usual CLR model after a first multiplicative term has already been fitted. Testing for a non-multiplicative effect of the alleles is needed to justify the use of a single multiplicative parameter to summarise the relationship of the alleles to the occurrence of the disease. Also, increasing interest in the biochemical pathways through which the effects of the genotypes are mediated requires knowledge of the quantitative relationship between specific genotypes and the phenotypic expression of the disease. A non-multiplicative effect at a marker locus would indicate non-multiplicative effects at an associated disease locus although, with no knowledge of the level of the allelic associations, their size would be unknown.

For generality, we shall refer to the locus as a marker locus so that a candidate gene is a special case. We denote the alleles at a diallelic marker by M₁ and M₂, and their frequencies by m and (1−m), respectively. The CLR approach involves treating the affected children as “cases” and the three genotypes formed by the un-inherited alleles as matched “controls”. If the log relative risks are assumed to be linear in variates x₁ and x₂ that represent the child’s marker genotype, then the probability of the disease p_i for child i is assumed to be related to x₁ and x₂ by the logistic regression equation (e.g. Collett 2003)

$$\log \frac{{p_i }}{{1 - p_i }} = \alpha _i + \beta _1 x_{1i} + \beta _2 x_{2i} $$

The genotype relative risks of the marker genotypes M₁M₁, M₁M₂ and M₂M₂ are denoted r₂, r₁ and 1. If the marker locus is not linked to any locus affecting susceptibility to the disease or if there is no association between the alleles at the marker locus and the alleles at the disease loci then r₂=r₁=1. The test of no differential transmission, and therefore no association between the locus and the disease, has two degrees of freedom. Many authors have studied the power of this test and the power of the single degree of freedom tests based on specific single parameter models representing dominant (r₂=r₁), recessive (r₁=1) and multiplicative (r₂=r ²₁ ) genotype relative risks (e.g. Schaid and Sommer 1993, 1994; Spielman et al. 1993; Schaid 1996; Sham 1998; Schaid 1999). The multiplicative model is thought to be the best single parameter model in terms of representing alternative models such as additive, dominant and recessive (Schaid 1996). We present results on the power of the one degree of freedom test of the non-multiplicative term in the regression, conditional on having fitted a multiplicative term.

Following Schaid (1999), we simulated families with affected children using the null, additive, multiplicative, dominant and recessive models with the genotype relative risks r₁=2 and 4, with frequencies m=0.1 and 0.5. Marker alleles positively associated with disease alleles are unlikely to have frequencies higher than 0.5. Random mating was assumed in generating the parental mating types. The total number of families was set at n=100 or 200. Only families with at least one parent heterozygous are informative. The probability P that a family with an affected child is informative is given by

$$ P=\frac{{m^2 (1 - m^2)r_2 + 2m(1 - m)(1 - m(1 - m))r_1 + (1 - m)^2 (1 - (1 - m)^2)}}{{m^2 r_2 + 2m(1 - m)r_1 + (1 - m)^2}} $$

The expected number of informative families in a study of size n is therefore nP.

For the multiplicative model, x₁ took the values 2, 1 and 0 for the genotypes M₁M_1, M₁M₂ and M₂M₂, respectively. Since a model with x₁ and x₂ is a full model, the variable x₂ can take any non-additive values; we used 1, 1 and 0 for the above genotypes, respectively.

For each of 10,000 simulations, we fitted the CLR models using the function clogit in the survival package of the statistical program R (Ihaka and Gentleman 1996). We first tested the full model with x₁ and x₂ and the reduced model with x₁ only and recorded the number of results significant at 5% using a likelihood ratio test. These results were as expected, given the work of other authors (e.g. Schaid 1996, 1999) and are omitted. However, we also performed a test based on the difference of deviances for these two models, as a test of deviations from the multiplicative models, and these results are presented here.

The regression analysis does not converge when the data do not allow the separation of the effects of the different parameters or deviate strongly from the pattern predicted by the model. Table 1 presents, for all genetic models considered, the proportion of the analyses of the full model that converged, together with the attributable risk and the expected proportion of families with an affected child that are informative, P. The attributable risk is defined as the population lifetime prevalence of the disease minus the disease penetrance for the least disease-related genotype, M₂M₂, as a proportion of the population lifetime prevalence. In our notation, the attributable risk is 1−1/(m²r₂+2m(1−m) r₁+(1−m)²).

Table 1 Properties of the nine genetic models considered, including attributable risk values (Schaid and Sommer 1993), expected proportion of informative families in a sample (P) and the percentage of simulations (out of 10,000) that reached convergence for the logistic regression model involving parameters x₁ and x₂, for n=100 and n=200 simulated families. M multiplicative, A additive, D dominance, R recessive

Full size table

When m=0.5, 75% of families are expected to be informative, but this figure is much lower for m=0.1. The convergence rate for the analysis of the full model with n=100 is at least 0.60 when m=0.1 and 0.95 when m=0.5. With larger samples, convergence is almost certain for most models with both m=0.1 and 0.5. Convergence for the model involving x₁ only is almost identical to that for the full model.

Table 1 also shows the power, calculated from the simulations that converged, of the test at a 5% significance level based on the additional variation explained by fitting x₂ having already fitted x₁, a test indicated in Table 1 by x₂|x₁. This test has the correct power, 0.05, when the null or the multiplicative model holds and has power of at least 30% when m=0.1 for all the stronger models except the additive model when n=100. The power is very low for all the weaker models. When m=0.5, the power increases to about 30% for the stronger dominance and recessive models even with n=100, but the power for the weaker additive model is again very low.

As expected, the predictions of the weak additive model are similar to those of the multiplicative model. Otherwise, there is sufficient power in the test to suggest that the regression on the multiplicative term x₁ only should always be calculated and the amount of variation explained by the full model then compared with the amount explained by this single term regression. Depending on the results of the test for non-multiplicative effects, the data can then be summarised in terms of estimates, together with confidence intervals, of either the relative risks for all three genotypes or the multiplicative effect of the allele.

References

Collett D (2003) Modelling binary data, 2nd edn. Chapman and Hall/CRC, London
Google Scholar
Ihaka R and Gentleman R (1996) R: a language for data analysis and graphics. J Comput. Graph Statist. 5:299–314
Google Scholar
Schaid DJ (1996) General score statistics for associations of genetic markers with disease using cases and their parents. Genet Epidemiol 13:423–449
Article CAS PubMed Google Scholar
Schaid DJ (1999) Likelihoods and TDT for the case-parents design. Genet Epidemiol 16:250–260
Article CAS PubMed Google Scholar
Schaid DJ, Sommer SS (1993) Genotype relative risks: methods for design and analysis of candidate-gene association studies. Am J Hum Genet 53:1114–1126
CAS PubMed PubMed Central Google Scholar
Schaid DJ and Sommer SS (1994) Comparison of statistics for candidate–gene association studies using cases and parents. Am J Hum Genet 55:402–409
CAS PubMed PubMed Central Google Scholar
Self SG, Longton G, Kopecky KJ, and Liang KY (1991) On estimating HLA–disease association with application to a study of aplastic anemia. Biometrics 47:53–61
Article CAS PubMed Google Scholar
Sham PC (1998) Statistics in human genetics. Arnold, London
Google Scholar
Spielman RS, McGinnis RE, Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am J Hum Genet 52:506–516
CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

School of Applied Statistics, The University of Reading, P.O. Box 240, Earley Gate, Reading, RG6 6FN, UK
Karen L. Ayres & Robert N. Curnow

Authors

Karen L. Ayres
View author publications
You can also search for this author in PubMed Google Scholar
Robert N. Curnow
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Robert N. Curnow.

Additional information

This paper is dedicated to the memory of Professor Steve Bennett of the London School of Hygiene and Tropical Medicine who was to be involved in the work on which it is based.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ayres, K.L., Curnow, R.N. Detecting non-multiplicative genotype relative risks from transmissions of parental alleles to affected children. J Hum Genet 50, 46–48 (2005). https://doi.org/10.1007/s10038-004-0217-5

Download citation

Received: 23 July 2004
Accepted: 25 October 2004
Published: 01 January 2005
Issue Date: January 2005
DOI: https://doi.org/10.1007/s10038-004-0217-5

Detecting non-multiplicative genotype relative risks from transmissions of parental alleles to affected children

Abstract

Similar content being viewed by others

Liability threshold modeling of case–control status and family history of disease increases association power

Genetic correlations of polygenic disease traits: from theory to practice

Mendelian randomization

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Search

Quick links

Abstract

Similar content being viewed by others

Liability threshold modeling of case–control status and family history of disease increases association power

Genetic correlations of polygenic disease traits: from theory to practice

Mendelian randomization

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links