Glass, T. A., Goodman, S. N., Hernán, M. A. & Samet, J. M. Causal inference in public health. Annu. Rev. Public Health 34, 61–75 (2013).
Rimm, E. B. et al. Vitamin E consumption and the risk of coronary heart disease in men. N. Engl. J. Med. 328, 1450–1456 (1993).
Stampfer, M. J. et al. Vitamin E consumption and the risk of coronary disease in women. N. Engl. J. Med. 328, 1444–1449 (1993).
Millen, A. E., Dodd, K. W. & Subar, A. F. Use of vitamin, mineral, nonvitamin, and nonmineral supplements in the United States: the 1987, 1992, and 2000 National Health Interview Survey results. J. Am. Diet Assoc. 104, 942–950 (2004).
Eidelman, R. S., Hollar, D., Hebert, P. R., Lamas, G. A. & Hennekens, C. H. Randomized trials of vitamin E in the treatment and prevention of cardiovascular disease. Arch. Intern. Med. 164, 1552–1556 (2004).
Imai, K., King, G. & Stuart, E. A. Misunderstandings between experimentalists and observationalists about causal inference. J. Royal Stat. Soc. A Stat. Methodol. 171, 481–502 (2008).
Jaffee, S. R. & Price, T. S. The implications of genotype-environment correlation for establishing causal processes in psychopathology. Dev. Psychopathol. 24, 1253–1264 (2012).
Deaton, A. & Cartwright, N. Understanding and misunderstanding randomized controlled trials. Soc. Sci. Med. https://doi.org/10.1016/j.socscimed.2017.12.005 (2017).
DiMasi, J. A., Grabowski, H. G. & Hansen, R. W. Innovation in the pharmaceutical industry: new estimates of R&D costs. J. Health Econ. 47, 20–33 (2016).
McGue, M., Osler, M. & Christensen, K. Causal inference and observational research: the utility of twins. Perspect. Psychol. Sci. 5, 546–556 (2010). This study is an introduction to the twin model from a causal inference perspective. It includes a discussion of concepts, estimations and limitations.
Davey Smith, G. & Ebrahim, S. What can Mendelian randomisation tell us about modifiable behavioural and environmental exposures? BMJ 330, 1076–1079 (2005).
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum. Mol. Genet. 23, R89–98 (2014).
Burgess, S., Timpson, N. J., Ebrahim, S. & Davey Smith, G. Mendelian randomization: where are we now and where are we going? Int. J. Epidemiol. 44, 379–388 (2015).
Nikpay, M. et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Hemani, G. et al. MR-Base: a platform for systematic causal inference across the phenome using billions of genetic associations. Preprint at bioRxiv 78972 (2016).
Stuart, E. A. Matching methods for causal inference: a review and a look forward. Stat. Sci. 25, 1–21 (2010).
Angrist, J. D., Imbens, G. W. & Rubin, D. B. Identification of causal effects using instrumental variables. J. Am. Stat. Assoc. 91, 444–455 (1996).
Tenesa, A. & Haley, C. S. The heritability of human disease: estimation, uses and abuses. Nat. Rev. Genet. 14, 139–149 (2013).
Speed, D., Cai, N., Johnson, M. R., Nejentsev, S. & Balding, D. J. Reevaluation of SNP heritability in complex human traits. Nat. Genet. 49, 986–992 (2017).
Hernán, M. A. A definition of causal effect for epidemiological research. J. Epidemiol. Commun. Health 58, 265–271 (2004). This study is a pedagogical introduction to the counterfactual or potential outcomes framework for causal inference. It includes mathematical notations and a discussion of key concepts, such as association, causation and exchangeability.
Imbens, G. W. & Rubin, D. B. Causal Inference for Statistics, Social, and Biomedical Sciences. (Cambridge Univ. Press, Cambridge, 2015).
Pearl, J. Causality. (Cambridge Univ. Press, Cambridge, 2009).
Rice, F. et al. Disentangling prenatal and inherited influences in humans with an experimental design. Proc. Natl Acad. Sci. USA 106, 2464–2467 (2009). This is an example of the application of the IVF design to examine the effect of smoking during pregnancy on birthweight.
Mezuk, B., Myers, J. M. & Kendler, K. S. Integrating social science and behavioral genetics: testing the origin of socioeconomic disparities in depression using a genetically informed design. Am. J. Public Health 103 (Suppl.), 145–151 (2013).
Kendler, K. S. & Gardner, C. O. Dependent stressful life events and prior depressive episodes in the prediction of major depression: the problem of causal inference in psychiatric epidemiology. Arch. Gen. Psychiatry 67, 1120–1127 (2010).
Bruder, C. E. G. et al. Phenotypically concordant and discordant monozygotic twins display different DNA copy-number-variation profiles. Am. J. Hum. Genet. 82, 763–771 (2008).
Carlin, J. B., Gurrin, L. C., Sterne, J. A., Morley, R. & Dwyer, T. Regression models for twin studies: a critical review. Int. J. Epidemiol. 34, 1089–1099 (2005).
Vitaro, F., Brendgen, M. & Arseneault, L. The discordant MZ-twin method: one step closer to the holy grail of causality. Int. J. Behav. Dev. 33, 376–382 (2009).
Fletcher, J. M. & Lehrer, S. F. Genetic lotteries within families. J. Health Econ. 30, 647–659 (2011). This paper provides a model combining family fixed effects and genetic instruments, with a discussion of important concepts, such as dynastic effects.
Kohler, H.-P., Behrman, J. R. & Schnittker, J. Social science methods for twins data: integrating causality, endowments, and heritability. Biodemogr. Soc. Biol. 57, 88–141 (2011).
Hjelmborg, J. et al. Lung cancer, genetic predisposition and smoking: the Nordic Twin Study of Cancer. Thorax 72, 1021–1027 (2017).
Böckerman, P., Hyytinen, A. & Kaprio, J. Smoking and long-term labour market outcomes. Tob. Control 24, 348–353 (2015).
Cohen-Cline, H., Turkheimer, E. & Duncan, G. E. Access to green space, physical activity and mental health: a twin study. J. Epidemiol. Commun. Health 69, 523–529 (2015).
Singham, T. et al. Concurrent and longitudinal contribution of exposure to bullying in childhood to mental health: the role of vulnerability and resilience. JAMA Psychiatry 74, 1112–1119 (2017).
Taylor, M. J. et al. Developmental associations between traits of autism spectrum disorder and attention deficit hyperactivity disorder: a genetically informative, longitudinal twin study. Psychol. Med. 43, 1735–1746 (2013).
Frisell, T., Öberg, S., Kuja-Halkola, R. & Sjölander, A. Sibling comparison designs: bias from non-shared confounders and measurement error. Epidemiology 23, 713–720 (2012).
Heath, A. C. et al. Testing hypotheses about direction of causation using cross-sectional family data. Behav. Genet. 23, 29–50 (1993).
Neale, M. C. & Cardon, L. R. Methodology for Genetic Studies of Twins and Families. (Kluwer Academic, 1992).
D’Onofrio, B. M. et al. Paternal age at childbearing and offspring psychiatric and academic morbidity. JAMA Psychiatry 71, 432–438 (2014).
Tully, E. C., Iacono, W. G. & McGue, M. An adoption study of parental depression as an environmental liability for adolescent depression and childhood disruptive disorders. Am. J. Psychiatry 165, 1148 (2008).
Duffy, D. L. & Martin, N. G. Inferring the direction of causation in cross-sectional twin data: theoretical and empirical considerations. Genet. Epidemiol. 11, 483–502 (1994).
Wood, A. C., Rijsdijk, F., Asherson, P. & Kuntsi, J. Inferring causation from cross-sectional data: examination of the causal relationship between hyperactivity-impulsivity and novelty seeking. Front. Genet. 2, 6 (2011).
Toulopoulou, T. et al. Reciprocal causation models of cognitive versus volumetric cerebral intermediate phenotypes for schizophrenia in a pan-European twin cohort. Mol. Psychiatry 20, 1386 (2015).
Katan, M. B. Apolipoprotein E isoforms, serum cholesterol, and cancer. Lancet 1, 507–508 (1986).
Davey Smith, G. Mendelian randomization for strengthening causal inference in observational studies: application to gene x environment interactions. Perspect. Psychol. Sci. 5, 527–545 (2010).
Brion, M.-J. A., Benyamin, B., Visscher, P. M. & Davey Smith, G. Beyond the single SNP: emerging developments in Mendelian randomization in the ‘omics’ era. Curr. Epidemiol. Rep. 1, 228–236 (2014).
Nitsch, D. et al. Limits to causal inference based on Mendelian randomization: a comparison with randomized controlled trials. Am. J. Epidemiol. 163, 397–403 (2006).
Davey Smith, G. et al. Genetic epidemiology and public health: hope, hype, and future prospects. Lancet 366, 1484–1498 (2005).
Davey Smith, G. et al. Association of C-reactive protein with blood pressure and hypertension: life course confounding and Mendelian randomization tests of causality. Arter. Thromb. Vasc. Biol. 25, 1051–1056 (2005).
Hartwig, F. P., Borges, M. C., Horta, B. L., Bowden, J. & Davey Smith, G. Inflammatory biomarkers and risk of schizophrenia: a 2-sample Mendelian randomization study. JAMA Psychiatry 74, 1226 (2017).
Wensley, F. et al. Association between C reactive protein and coronary heart disease: mendelian randomisation analysis based on individual participant data. BMJ 342, d548 (2011).
Bolton, C. E. et al. The CRP genotype, serum levels and lung function in men: the Caerphilly Prospective Study. Clin. Sci. 120, 347–355 (2011).
Pingault, J.-B., Cecil, C. A. M., Murray, J., Munafò, M. & Viding, E. Causal inference in psychopathology: a systematic review of Mendelian randomisation studies aiming to identify environmental risk factors for psychopathology. Psychopathol. Rev. 4, 4–25 (2017).
Manousaki, D., Mokry, L. E., Ross, S., Goltzman, D. & Richards, J. B. Mendelian randomization studies do not support a role for vitamin D in coronary artery disease. Circ. Cardiovasc. Genet. 9, 349–356 (2016).
Mokry, L. E. et al. Vitamin D and risk of multiple sclerosis: a Mendelian randomization study. PLoS Med. 12, e1001866 (2015).
Sheehan, N. A. & Didelez, V. Commentary: Can ‘many weak’ instruments ever be ‘strong’? Int. J. Epidemiol. 40, 752–754 (2011).
Visscher, P. M. & Yang, J. A plethora of pleiotropy across complex traits. Nat. Genet. 48, 707 (2016).
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015). This study introduces the use of a meta-analytical method known as Egger regression to MR analysis. Under certain assumptions, this approach enables causal estimation even when all instruments are invalid.
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314 (2016).
Rees, J. M. B., Wood, A. M. & Burgess, S. Extending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy. Stat. Med. 36, 4705–4718 (2017). This study provides the analytical framework to combine multivariable-MR and MR-Egger methods, which yields causal estimates robust to invalid genetic instruments.
Brion, M.-J. A., Shakhbazov, K. & Visscher, P. M. Calculating statistical power in Mendelian randomization studies. Int. J. Epidemiol. 42, 1497–1501 (2013).
Burgess, S. & Thompson, S. G. Bias in causal estimates from Mendelian randomization studies with weak instruments. Stat. Med. 30, 1312–1323 (2011).
Burgess, S. & Thompson, S. G. Improving bias and coverage in instrumental variable analysis with weak instruments for continuous and binary outcomes. Stat. Med. 31, 1582–1600 (2012).
Gage, S. H. et al. Assessing causality in associations between cannabis use and schizophrenia risk: a two-sample Mendelian randomization study. Psychol. Med. 47, 971–980 (2017).
Stringer, S. et al. Genome-wide association study of lifetime cannabis use based on a large meta-analytic sample of 32 330 subjects from the International Cannabis Consortium. Transl Psychiatry 6, e769 (2016).
Burgess, S. & Thompson, S. G. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am. J. Epidemiol. 181, 251–260 (2015).
Burgess, S., Freitag, D. F., Khan, H., Gorman, D. N. & Thompson, S. G. Using multivariable Mendelian randomization to disentangle the causal effects of lipid fractions. PLoS ONE 9, e108891 (2014).
Liu, D. J. et al. Exome-wide association study of plasma lipids in >300,000 individuals. Nat. Genet. 49, 1758 (2017).
Tyrrell, J. et al. Genetic evidence for causal relationships between maternal obesity-related traits and birth weight. JAMA 315, 1129–1140 (2016).
Richmond, R. C. et al. Using genetic variation to explore the causal effect of maternal pregnancy adiposity on future offspring adiposity: a Mendelian randomisation study. PLoS Med. 14, e1002221 (2017).
Zhang, G. et al. Assessing the causal relationship of maternal height on birth size and gestational age at birth: a Mendelian randomization analysis. PLoS Med. 12, e1001865 (2015). This study introduces intergenerational MR by computing allelic scores in the mother containing variants either transmitted or non-transmitted to the offspring. The method enables the estimation of the effect of maternal risk factors on the offspring free from passive gene–environment correlation.
Evans, D. M. et al. Mining the human phenome using allelic scores that index biological intermediates. PLoS Genet. 9, e1003919 (2013).
Krapohl, E. et al. Widespread covariation of early environmental exposures and trait-associated polygenic variation. Proc. Natl Acad. Sci. USA 114, 11727–11732 (2017).
Fletcher, J. M. The promise and pitfalls of combining genetic and economic research. Health Econ. 20, 889–892 (2011).
Minica, C. C., Dolan, C. V., Boomsma, D. I., de Geus, E. & Neale, M. C. Extending causality tests with genetic instruments: an integration of Mendelian randomization and the classical twin design. Preprint at bioRxiv 134585 (2017).
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 32, 1–22 (2003).
Davey Smith, G. Capitalizing on Mendelian randomization to assess the effects of treatments. J. R. Soc. Med. 100, 432–435 (2007).
Pasaniuc, B. & Price, A. L. Dissecting the genetics of complex traits using summary association statistics. Nat. Rev. Genet. 18, 117–127 (2017).
Gill, D. et al. Age at menarche and lung function: a Mendelian randomization study. Eur. J. Epidemiol. 32, 701–710 (2017).
Bush, W. S., Oetjens, M. T. & Crawford, D. C. Unravelling the human genome-phenome relationship using phenome-wide association studies. Nat. Rev. Genet. 17, 129 (2016).
O’Reilly, P. F. et al. MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS. PLoS ONE 7, e34861 (2012).
Porter, H. F. & O’Reilly, P. F. Multivariate simulation framework reveals performance of multi-trait GWAS methods. Sci. Rep. 7, 38837 (2017).
Pickrell, J. K. et al. Detection and interpretation of shared genetic influences on 42 human traits. Nat. Genet. 48, 709 (2016). This study introduces a method to detect shared genetic influences on multiple traits. It includes a test of asymmetry, which helps to identify pairs of phenotypes that are causally related and which phenotype influences the other (that is, direction of causation).
Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016). This study applies summary Mendelian randomization (SMR) methods to expression data and helps to distinguish whether an association between expression and a phenotype is due to shared causal variants or to distinct variants in LD.
Richardson, T. G. et al. Mendelian randomization analysis identifies CpG sites as putative mediators for genetic influences on cardiovascular disease risk. Am. J. Hum. Genet. 101, 590–602 (2017).
Wallace, C. Statistical testing of shared genetic control for potentially related traits. Genet. Epidemiol. 37, 802–813 (2013).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014). This paper introduces a Bayesian colocalization method to identify shared causal variants between phenotypes.
Walter, S. et al. Revisiting mendelian randomization studies of the effect of body mass index on depression. Am. J. Med. Genet. B. Neuropsychiatr. Genet. 168B, 108–115 (2015).
Hemani, G. et al. Automating Mendelian randomization through machine learning to construct a putative causal map of the human phenome. Preprint at bioRxiv 173682 (2017).
Davey Smith, G. et al. Incidence of type 2 diabetes in the randomized multiple risk factor intervention trial. Ann. Intern. Med. 142, 313–322 (2005).
Åsvold, B. O. et al. Causal associations of tobacco smoking with cardiovascular risk factors: a Mendelian randomization analysis of the HUNT Study in Norway. Int. J. Epidemiol. 43, 1458–1470 (2014).
Burgess, S., Daniel, R. M., Butterworth, A. S. & Thompson, S. G. Network Mendelian randomization: using genetic variants as instrumental variables to investigate mediation in causal pathways. Int. J. Epidemiol. 44, 484–495 (2015).
Chen, W.-M. & Abecasis, G. R. Family-based association tests for genomewide association scans. Am. J. Hum. Genet. 81, 913–926 (2007).
Dudbridge, F. Likelihood-based association analysis for nuclear families and unrelated subjects with missing genotype data. Hum. Hered. 66, 87–98 (2008).
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS Genet. 9, e1003348 (2013).
Moayyeri, A., Hammond, C. J., Valdes, A. M. & Spector, T. D. Cohort profile: TwinsUK and healthy ageing twin study. Int. J. Epidemiol. 42, 76–85 (2013).
Haworth, C. M. A., Davis, O. S. P. & Plomin, R. Twins Early Development Study (TEDS): a genetically sensitive investigation of cognitive and behavioral development from childhood to young adulthood. Twin Res. Hum. Genet. 16, 117–125 (2013).
Magnus, P. et al. Cohort profile update: the Norwegian Mother and Child Cohort Study (MoBa). Int. J. Epidemiol. 45, 382–388 (2016).
Fraser, A. et al. Cohort profile: the Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int. J. Epidemiol. 42, 97–110 (2013).
Walker, V. M., Davey Smith, G., Davies, N. M. & Martin, R. M. Mendelian randomization: a novel approach for the prediction of adverse drug events and drug repurposing opportunities. Int. J. Epidemiol. 46, 2078–2089 (2017).
Scott, R. A. et al. A genomic approach to therapeutic target validation identifies a glucose-lowering GLP1R variant protective for coronary heart disease. Sci. Transl Med. 8, 341ra76 (2016).
Lehrer, S. F. & Ding, W. Are genetic markers of interest for economic research? IZA J. Labor Policy. 6, 2 (2017).
Glymour, M. M., Tchetgen Tchetgen, E. J. & Robins, J. M. Credible Mendelian randomization studies: approaches for evaluating the instrumental variable assumptions. Am. J. Epidemiol. 175, 332–339 (2012).
Lawlor, D. A., Tilling, K. & Davey Smith, G. Triangulation in aetiological epidemiology. Int. J. Epidemiol. 45, 1866–1886 (2016).
Munafò, M. R. & Davey Smith, G. Robust research needs many lines of evidence. Nature 553, 399–401 (2018).
Fisher, R. A. Alleged dangers of cigarette-smoking. BMJ 2, 297–298 (1957).
Knopik, V. S., Neiderhiser, J. M., DeFries, J. C. & Plomin, R. Behavioral Genetics. (Worth Publishers, New York, 2016).
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016).
Kendler, K. S. & Baker, J. H. Genetic influences on measures of the environment: a systematic review. Psychol. Med. 37, 615–626 (2007).
Krapohl, E. & Plomin, R. Genetic link between family socioeconomic status and children’s educational achievement estimated from genome-wide SNPs. Mol. Psychiatry 21, 437–443 (2016).
Munafò, M. R. et al. Association between genetic variants on chromosome 15q25 locus and objective measures of tobacco exposure. J. Natl Cancer Inst. 104, 740–748 (2012).
Tobacco and Genetics Consortium. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat. Genet. 42, 441–447 (2010).
Morral, A. R., McCaffrey, D. F. & Paddock, S. M. Reassessing the marijuana gateway effect. Addiction 97, 1493–1504 (2002).
Rutter, M. Proceeding from observed correlation to causal inference: the use of natural experiments. Perspect. Psychol. Sci. 2, 377–395 (2007).
Greenland, S. Quantifying biases in causal models: classical confounding versus collider-stratification bias. Epidemiology 14, 300–306 (2003).
Sheehan, N. A., Didelez, V., Burton, P. R. & Tobin, M. D. Mendelian randomisation and causal inference in observational epidemiology. PLoS Med. 5, e177 (2008).
Didelez, V. & Sheehan, N. Mendelian randomization as an instrumental variable approach to causal inference. Stat. Methods Med. Res. 16, 309–330 (2007).
Burgess, S. & Thompson, S. G. Mendelian Randomization: Methods for Using Genetic Variants in Causal Estimation. (CRC Press, Boca Raton, 2015).
Davey Smith, G. et al. Clustered environments and randomized genes: a fundamental distinction between conventional and genetic epidemiology. PLoS Med. 4, e352 (2007).
Pierce, B. L. & Burgess, S. Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 178, 1177–1184 (2013).
Hu, J. X., Thomas, C. E. & Brunak, S. Network biology concepts in complex disease comorbidities. Nat. Rev. Genet. 17, 615–629 (2016).
Solovieff, N., Cotsapas, C., Lee, P. H., Purcell, S. M. & Smoller, J. W. Pleiotropy in complex traits: challenges and strategies. Nat. Rev. Genet. 14, 483–495 (2013).
Paaby, A. B. & Rockman, M. V. The many faces of pleiotropy. Trends Genet. 29, 66–73 (2013).
Kong, A. et al. The nature of nurture: effects of parental genotypes. Science 359, 424–428 (2018).
Bates, T. C. et al. The nature of nurture: using a virtual-parent design to test parenting effects on children’s educational attainment in genotyped families. Twin Res. Hum. Genet. 21, 73–83 (2018).
Euesden, J., Lewis, C. M. & O’Reilly, P. F. PRSice: polygenic risk score software. Bioinformatics 31, 1466–1468 (2015).
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 37, 658–665 (2013).
Hartwig, F. P., Davey Smith, G. & Bowden, J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int. J. Epidemiol. 46, 1985–1998 (2017).
Burgess, S., Zuber, V., Gkatzionis, A., Rees, J. M. B. & Foley, C. Improving on a modal-based estimation method: model averaging for consistent and efficient estimation in Mendelian randomization when a plurality of candidate instruments are valid. Preprint at bioRxiv 175372 (2017).
Bowden, J. et al. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 36, 1783–1802 (2017).