Abstract
Mendelian Randomisation (MR) is an increasingly popular approach that estimates the causal effect of risk factors on complex human traits. While it has seen several extensions that relax its basic assumptions, most suffer from two major limitations; their underexploitation of genomewide markers, and sensitivity to the presence of a heritable confounder of the exposureoutcome relationship. To overcome these limitations, we propose a Latent Heritable Confounder MR (LHCMR) method applicable to association summary statistics, which estimates bidirectional causal effects, direct heritabilities, and confounder effects while accounting for sample overlap. We demonstrate that LHCMR outperforms several existing MR methods in a wide range of simulation settings and apply it to summary statistics of 13 complex traits. Besides several concordant results with other MR methods, LHCMR unravels new mechanisms (how disease diagnosis might lead to improved lifestyle) and reveals new causal effects (e.g. HDL cholesterol being protective against high systolic blood pressure), hidden from standard MR methods due to a heritable confounder of opposite effect direction.
Introduction
The identification of frequent risk factors and the quantification of their impact on common diseases is a principal quest for public health policy makers. Epidemiological studies aim to address this issue, but they are most often based on observational data due to their abundance over the years. Despite major methodological advances, a large majority of such studies have inherent limitations and suffer from confounding and reverse causation^{1,2}. For these reasons, many of the reported associations found in classical epidemiological studies are mere correlates of disease risk, rather than causal factors directly involved in disease progression. Due to this, additional evidence is required before developing public health interventions in a bid to reduce the future burden of diseases. While welldesigned and carefully conducted randomised control trials (RCTs) remain the gold standard for causal inference, they are exceedingly expensive, timeconsuming, may not be feasible for ethical reasons, and have high failure rates^{3,4}.
Mendelian randomisation (MR), a natural genetic counterpart to RCTs, is an instrumental variable (IV) technique used to infer the strength of a causal relationship between a risk factor (X) and an outcome (Y)^{5}. To do so, it uses genetic variants (G) as instruments and relies on three major assumptions (see Supplementary Fig. 1): (1) Relevance—G is robustly associated with the exposure. (2) Exchangeability—G is not associated with any confounder of the exposureoutcome relationship. (3) Exclusion restriction—G is independent of the outcome conditional on the exposure and all confounders of the exposureoutcome relationship (i.e. the only path between the instrument and the outcome is via the exposure).
The advantage of the MR approach is that for most heritable exposures, dozens (if not hundreds) of genetic instruments are known to date thanks to wellpowered genomewide association studies (GWASs). Each instrument can provide a causal effect estimate, which can be combined with others, by using an inverse varianceweighting (IVW) scheme (e.g. Burgess et al.^{6}). However, the last assumption is particularly problematic, because genetic variants tend to be pleiotropic, i.e. exert effect on multiple traits independently. Still, it can be shown that if the instrument strength is independent of the direct effect on the outcome (InSIDE assumption) and the direct effects are on average zero, IVWbased methods will still yield consistent estimates. Methods, such as MREgger^{7}, produce consistent estimates even if direct effects are allowed to have a nonzero offset. The third assumption can be further reduced to assuming that >50% of the instruments (or in terms of their weight) are valid (medianbased estimators^{8}) or that zeropleiotropy instruments are the most frequent (modebased estimators^{9}).
The InSIDE assumption (i.e. horizontal pleiotropic effects (G → Y) are independent of the direct effect (G → X)) is reasonable if the pleiotropic path G → Y does not branch off to X. However, if there is such a branching off, the variable representing the split is a confounder of the X − Y relationship and we fall back on the violation of the second assumption (exchangeability), making it the most problematic. Therefore, in this paper, we extend the standard MR model to incorporate the presence of a latent (i.e. unmeasured) heritable confounder (U) and estimate its contribution to traits X and Y, while simultaneously estimating the bidirectional causal effect between the two traits. Standard MR methods are vulnerable to such heritable confounders, since any genetic marker directly associated with the confounder may be selected as an instrument for the exposure. However, such instruments will have a direct effect on the outcome that is correlated to their instrument strength, violating the InSIDE assumption and biasing the causal effect estimate.
In this paper, we first introduce the extended MR model and derive the likelihood function for the observed genomewide summary statistics (for X and Y). We then test and compare the method against conventional and more advanced (such as CAUSE^{10} and MRRAPS^{11}) MR approaches through extensive simulation settings, including several violations of the model assumptions. Finally, the approach is applied to association summary statistics (based on the UK Biobank and metaanalysis studies) of 13 complex traits to reassess all pairwise bidirectional causal relationships between them.
Results
Overview of the method
We set up a structural equation model (SEM) (Fig. 1) and derived how its parameters are linked to genomewide association summary statistics of two studied complex traits. We then maximised the resulting likelihood function in order to estimate bidirectional causal effects between them (for details see Methods), in addition to inferring direct heritabilities for X and Y, confounder effects, crosstrait and individual trait LDscore intercepts and the polygenicity for X and Y. All SNPs associated with the heritable confounder (U) are indirectly associated with X and Y with effects that are proportional (ratio q_{y}/q_{x}). SNPs that are directly associated with X (and not with U) are also associated with Y with proportional effects (ratio 1/α_{x→y}). Finally, SNPs that are directly Yassociated are also Xassociated with a proportionality ratio of 1/α_{y→x}. These three groups of SNPs are illustrated on the β_{x}vsβ_{y} scatter plot (Supplementary Fig. 2). In simple terms, the aim of our method is to identify the different clusters, estimate the slopes and distinguish which corresponds to the causal and confounder effects. In this paper, we focus on the properties of the maximum likelihood estimates (MLEs) (and their variances) for the bidirectional causal effects arising from our SEM.
Simulation results
We started off with a realistic simulation setting of 234,000 SNPs on chromosome 10 (LD patterns used from the UK10K panel) and 50,000 samples for both traits. Traits X, Y and confounder U had average polygenicity (π_{x} = 5 × 10^{−3}, π_{y} = 1 × 10^{−2}, π_{u} = 5 × 10^{−2}), with substantial direct heritability for X and Y (\({h}_{x}^{2}=0.25,{h}_{y}^{2}=0.2\)), mild confounding on X and Y (t_{x} = 0.16, t_{y} = 0.11, where \({t}_{x}=\sqrt{{h}_{u}^{2}\cdot {q}_{x}^{2}}\) and \({t}_{y}=\sqrt{{h}_{u}^{2}\cdot {q}_{y}^{2}}\)), and a causal effect between X and Y (α_{x→y} = 0.3, α_{y→x} = 0). Note that with these settings, SNPs associated with U would violate the InSIDE assumption but might still be used by conventional MR methods. Under this standard setting, there were no genomewide significant SNPs for standard MR methods, and estimates derived using SNPs with a pvalue < 5 × 10^{−6} showed a downward bias for all MR methods (Fig. 2a). MRRAPS using filtered SNPs (pvalue < 5 × 10^{−4}) was similarly downward biased whereas MRRAPS using the entire set of SNPs was upward biased with the least amount of variance compared to all methods including LHCMR. LHCMR in this scenario slightly overestimated the causal effect in comparison but had the smallest RMSE after MRRAPS (0.13 vs 0.06, Supplementary Data 1).
We ran all our simulation scenarios with a smaller and a larger sample size (50,000 and 500,000) and observed that the relative performance of the methods were in some cases sample size specific. Smaller sample sizes often meant that standard MR methods had little to no IVs reaching genomewide (GW) significance and hence we were forced to use IVs from less stringent thresholds ( < 5 × 10^{−4} and < 5 × 10^{−6}). Therefore, the causal effects were estimated with a substantial downward bias due to weak instrument bias (and winner’s curse). LHCMR in these cases was able to estimate the causal effect with less bias but with a larger variance compared to most standard MR methods—still outperforming them in terms of RMSE in most settings. In the larger sample size setting, standard MR methods had IVs for every threshold cutoff. However, a pattern also observed with smaller sample sizes—but to a lesser extent—emerged, where the causal estimates of some methods changed (either in mean or in variance, most noticeably observed in weighted median and IVW) as the threshold became more stringent. This is of particular concern and highlights that while in this simulation setting the 5 × 10^{−8} threshold may have optimally cancelled out the different biases for IVW (downward bias due to winner’s curse and weak instrument bias, upward bias due to genetic confounding), its estimate remains strongly settingdependent. LHCMR performed reasonably well, exhibiting lower RMSE than most other methods, except for IVW and MRRAPS for the 5 × 10^{−4} threshold (Supplementary Fig. 4a). However, we observed that the performance of MRRAPs is particularly setting and threshold dependent.
Furthermore, unequal sample sizes for the two traits showed an underestimation of the causal effects for almost all MR methods, while LHCMR remained the most accurate in the case where n_{x} (50,000) was smaller than n_{y} (500,000). However, the performances in the reverse scenario, where n_{x} was larger in size, were akin to the large sample size standard setting, where only IVW and filtered MRRAPS (<5 × 10^{−4}) showed superior performance to LHCMR both in terms of bias and variance (see Supplementary Fig. 5).
When testing scenarios in the absence of a causal or a confounder effect (imitating the classical MR assumptions), with a smaller causal effect (α_{x→y} = 0.1), or with both forward and reverse causal effects, we note that LHCMR outperforms the standard MR methods as well as MRRAPS in all these scenarios.
When there was no causal effect (α_{x→y} = 0), LHCMR had the smallest bias out of all the methods in both sample sizes (0.004 in both, Supplementary Fig. 6a and Supplementary Fig. 7a). The variance of the LHCMR estimates in the larger sample size was much lower (0.0001 vs 0.01), similarly the other methods had a smaller variance in the larger sample size and had more clearly seen upward biased estimates. The increased upward bias of standard MR methods is due to the fact that confounderassociated SNPs could only be detected in the larger sample size and those lead to positive bias (due to the concordant effect of the confounder on the two traits). Note that the variances of standard MR methods are low simply because, in these settings, we were forced to lower the instrument selection threshold, hence artificially included many (potentially invalid) instruments, which lowers the estimator variance while increasing bias. MRRAPS greatly overestimates the causal effects when the sample size is larger.
In the absence of a confounder effect, there is not much of a difference between the two sample sizes; standard MR methods have a large variance and are downward biased, LHCMR is less biased compared to them but MRRAPS performs best with the least bias and variance when all the SNPs are used as instruments (Supplementary Fig. 6b and Supplementary Fig. 7b). Trying a smaller causal effect led to an upward bias for all MR methods including both filterings of MRRAPS in the larger sample size. Alternately, when n_{x} = n_{y} = 50, 000, the MR methods are downward biased (Supplementary Figs. 6c and 7c). Lastly, when a (negative) reverse causal effect is introduced, all MR methods and MRRAPS are negatively biased in their estimation of the causal effect (see Fig. 2b). LHCMR has a much smaller bias for the forward causal effect estimate in this case, and a generally small bias for the reverse causal effect in both sample sizes (0.05 for n = 50, 000 and 0.03 for n = 500, 000, Supplementary Fig. 4b).
Increasing the indirect genetic effects, by intensifying the contribution of the confounder to X and Y (t_{x} = 0.41, t_{y} = 0.27), led to a general overestimation of the causal effects by all methods including LHCMR, though more drastically seen in standard MR methods and MRRAPS in the larger sample size, when there is sufficient power to pick up these confounderassociated SNPs. The causal effect estimates of standard MR methods in the smaller sample size were much less affected by the presence of a strong confounder compared to LHCMR and MRRAPS (Supplementary Fig. 8). The reason for this is that the confounderassociated SNPs remain undetectable at lower sample size and hence instruments will not violate the classical MR assumptions.
Further testing the effects of the confounder trait on the causal estimation, we tested the impact of confounders with opposite effects on X and Y. We observe a major underestimation of the causal effects for standard MR methods as well as MRRAPS, whereas LHCMR performs better for both sample sizes (RMSE = 0.01 and 0.1 for larger and smaller n respectively), see Fig. 2c and Supplementary Fig. 4c.
Our LHCMR method is influenced by the unlikely scenario of extreme polygenicity for traits X, Y and U, and it suffers from increased bias and variance regardless of sample size (see Supplementary Fig. 9). Standard MR methods as well as filtered MRRAPS underestimated the causal effect when n = 50, 000. Some also underestimated α_{x→y} when n = 500, 000, with the exception of IVW, Mode and filtered MRRAPS, that outperformed the rest. Decreasing the proportion of confounderassociated SNPs to 1% only, does not seem to affect our method and shows similar results to the standard setting (Supplementary Fig. 10).
Furthermore, we simulated summary statistics, where (contrary to our modelling assumptions) the X − Y relationship has two confounders, U_{1} and U_{2}. When the ratio of the causal effects of these two confounders on X and Y (\({q}_{y}^{(1)}/{q}_{x}^{(1)}\) and \({q}_{y}^{(2)}/{q}_{x}^{(2)}\), respectively) agreed in sign, the corresponding causal effects of standard MR methods were overestimated in the larger sample size and, conversely, underestimated in the smaller sample size (Supplementary Figs. 11a and 12a). LHCMR and weighted median performed better however in the larger sample size and had a bias of 0.03 and 0.07, respectively. However, when the signs were opposite (\({q}_{x}^{(1)}=0.3,{q}_{y}^{(1)}=0.2\) for U_{1} and \({q}_{x}^{(2)}=0.3,{q}_{y}^{(2)}=0.2\) for U_{2}), conventional MR methods and MRRAPS in this case almost all underestimated the causal effect regardless of sample size. LHCMR outperformed them both in the larger sample size (bias of 0.007) and in the smaller sample size (bias of −0.003), see Supplementary Figs. 11b and 12b.
Finally, we explored how sensitive our method is to different violations of our modelling assumptions. First, we simulated summary statistics when the underlying nonzero effects come from a nonGaussian distribution. Interestingly, we observed that, for the smaller sample size, the variance of the causal effect estimate was dependent on the kurtosis for most MR methods. LHCMR estimations yielded slightly more pronounced upward bias than IVW, while still exhibiting the lowest RMSE among all methods (Fig. 3a). Similar results are seen in the larger sample size with smaller variance for all methods under all degrees of kurtosis except for IVW, which showed a better performance than LHCMR (Supplementary Fig. 13a). Second, we simulated effect sizes coming from a threecomponent Gaussian mixture distribution (null/small/large effects), instead of the classical spikeandslab assumption of our model. The smaller sample size estimates mirror those of the standard setting with n also equal to 50,000 (see Fig. 3b). However, in the larger sample size, LHCMR overestimates the causal effect. This bias could be due to the merging of true effect estimates with confounder effect leading to an overestimation of α_{x→y} (Supplementary Fig. 13b). MREgger, IVW and filtered MRRAPS have the smallest RMSE in this case.
Comparing CAUSE and LHCMR
When running CAUSE on data simulated using the LHCMR model framework in order to estimate a causal effect (γ in their notation), we investigated three different scenarios, each with multiple data generations: one where the underlying model has a shared factor/confounder with effect on both exposure and outcome only, another where the underlying model has a causal effect of 0.3 only, a third where the underlying model has both a causal effect and a shared factor. The data generated using the LHCMR model was done under the standard settings (π_{x} = 5 × 10^{−3}, π_{y} = 1 × 10^{−2}, π_{u} = 5 × 10^{−2}, \({h}_{x}^{2}=0.25,{h}_{y}^{2}=0.2,{h}_{u}^{2}=0.3\), t_{x} = 0.16, t_{y} = 0.11, α_{x→y} = 0.3, α_{y→x} = 0, m = 234, 000, n_{x} = n_{y} = 50, 000). For each setting, 50 different replications were investigated.
In the case of an underlying shared effect only, CAUSE preferred the sharing model 100% of the time, and thus there was no causal estimation, however it underestimated both η and q. When there was an underlying causal effect only, CAUSE preferred the causal model only 4% of the times, where it slightly underestimated the causal effect (\(\widehat{\gamma }=0.241\)). Although the true values of η and q are null in this scenario, the sharing model returned estimates for these two parameters overestimating them both (probably driven by their priors), as seen in Supplementary Fig. 14. In the third case, and in the presence of both, CAUSE preferred the sharing model in 48 of the 50 simulations, yet it underestimated η (corresponding to t_{y}/t_{x} for our model) but overestimated q (\({t}_{x}^{2}/({t}_{x}^{2}+{h}_{x}^{2})\) in our model) (mean of 0.566 and 0.222, respectively, where the true values are 0.667 and 0.097) showing a similar estimation pattern to the second case. Interestingly, for the larger sample size, CAUSE selects the correct model 100% of the time, but still underestimates γ, as shown in Supplementary Fig. 15.
In the reverse situation, where data was generated using the CAUSE framework (with parameters h_{1} = h_{2} = 0.25, m = 97, 450, N1 = N2 = 50, 000) and LHCMR was used to estimate the causal effect, we saw the following results (see Supplementary Fig. 16). First, when we generated data in the absence of causal effect (\(\gamma =0,\eta =\sqrt{0.05},q=0.1\)), CAUSE does extremely well in estimating a null causal effect 100% of the time. Standard MR methods yield a slight overestimation of the (null) causal effect with varying degrees of variance, whereas LHCMR shows both a greater variance and an upward bias—still leading to a causal effect compatible with zero. Second, in the absence of a confounder combined with nonzero causal effect (\(\gamma \!=\!\sqrt{0.05}\!=0.22,\eta =0,q=0\)), CAUSE underestimates the causal effect (\(\widehat{\gamma }=0.18\)) compared to LHCMR which overestimates the causal effect: the mean of the estimates was 0.38 (over the 50 runs). Finally, in the presence of both a confounder and a causal effect (\(\gamma =\sqrt{0.05},\eta =\sqrt{0.05},q=0.1\)), CAUSE slightly underestimates the causal effect (\(\widehat{\gamma }=0.20\)), whereas LHCMR overestimates the effects and shows estimates reaching the boundaries 11 out of 50 times (mean of the converged \(\widehat{\gamma }=0.39\) over the 39 data simulations, see Supplementary Fig. 16c)— indicating that this setting of the CAUSE model is not compatible with the LHCMR model framework. Interestingly, classical MR methods outperform CAUSE in this case. Note that in the interest of run time we used less SNPs (than usual) for parameter estimations. The analysis of the three separate scenarios was repeated for a larger sample size of 500, 000 (Supplementary Fig. 17), with more favourable results for LHCMR. In the absence of a causal effect, we had similar results to the smaller sample size, whereas in the absence of a shared effect, LHCMR estimates the causal effect accurately with a mean of 0.22, CAUSE underestimates it and the rest of the MR methods are less biased. In the presence of both causal and shared factor, CAUSE recovers the causal effect. IVW, unlike the other MR methods and CAUSE, is more affected by the presence of the confounder, while LHCMR exhibits upward bias with a mean estimate of 0.27.
Application to association summary statistics of complex traits
We applied our LHCMR and other MR methods to estimate all pairwise causal effects between 13 complex traits (156 causal relationships in both directions). Our results are presented as a heatmap in Fig. 4 (and are detailed in Supplementary Data 2). Further, we calculated the alternate set of estimated parameters that naturally results from our model (for reference see Section The observed association summary statistics and Supplementary Methods 1.4). Among trait pairs for which the exposure had sufficient heritability (>2.5%), the alternate parameters of 102 trait pairs were within the possible ranges mentioned in methods (i.e. the confounder and the exposure are interchangeable). However, for all of these pairs, the alternative parameter optima lead to lower direct than indirect heritability, which we deem unrealistic. Therefore, we report only the primary set of estimated optimal parameters in the main results and provide the alternative parameters in the Supplementary Data 3. The comparison of the results obtained by LHCMR and standard MR methods is detailed below and more extensively in Supplementary Data 4–5. In summary, LHCMR provided reliable causal effect estimates for 132 out of 156 exposure traits (i.e. those exposures had an estimated total heritability greater than 2.5%). These estimates were compared to five different MR methods. Seventyfour causal relationships were deemed significant by LHCMR. Furthermore, for 117 out of those 132 comparable causal relationships, our LHCMR causal effect estimates were concordant (not significantly different) with at least two out of five standard MR methods’ estimates.
By simply comparing the significance status and the direction of the causal effects between the methods, we see that LHCMR agrees in sign and significance (or the lack there of) with at least 3 MR methods 77 times. For 31 relationships, LHCMR results lead to different conclusions than those of standard MR methods. For 28 of those, LHCMR identified a causal effect missed by all standard MR methods. For the other three, we observed a disagreement in sign: LDL has a negative effect on BMI according to weighted mode and weighted median, whereas we show a positive effect, HDL and LDL show a negative bidirectional causal effect for weighted mode but a positive bidirectional effect with LHCMR. Despite the conflicting evidence for the causal relationship of LDL on BMI, studies have shown that the relationship between them is nonlinear^{12}, possibly explaining the discrepancy between the results.
LHCMR agreed with most MR estimates and confirmed many previous findings, such as increased BMI leading to elevated blood pressure^{13,14}, diabetes mellitus^{15,16} (DM), myocardial infarction^{17} (MI) and coronary artery disease^{18} (CAD). Furthermore, we confirmed previous results^{19} that diabetes increases SBP (\({\hat{\alpha }}_{x\to y}=0.39\,P=1.70\times 1{0}^{9}\)).
Interestingly, it revealed that higher BMI increases smoking intensity, concordant with other studies^{20,21}. It also showed the protective effect of education against a range of diseases (e.g. CAD and diabetes^{22,23}) and risk factors such as smoking^{24,25}, in agreement with previous observational and MR studies. Probably reflecting lifestyle change recommendations by medical doctors upon disease diagnosis, statin use is greatly increased when being diagnosed with CAD, (systolic) hypertension, dislipidemia and diabetes as is shown by both LHCMR and standard MR methods.
Furthermore, causal effects of height on CAD, DM and SBP have been previously examined in large MR studies^{26,27}. LHCMR, agreeing with these claims, did not find significant evidence to support the effect of height on DM, but did find a significant protective effect on CAD and SBP. However, unlike the first two, the relationship between height and SBP also revealed the existence of a confounder with causal effects 0.14 (P = 9.2 × 10^{−11}) and 0.11 (P = 3.39 × 10^{−8}) on height and SBP respectively. Another example of a trait pair for which LHCMR found an opposite sign confounder effect is HDL and its protective effect on SBP. The confounder had a positive effect ratio of t_{y}/t_{x} = 0.84, opposing the negative causal effect of \({\hat{\alpha }}_{x\to y}=0.13\) supported by observational studies^{28}. This causal effect was not found by any other MR method.
It is important to note that while the effects of parental exposures on offspring outcomes can be seen as genetic confounding, LHCMR would not be able to distinguish parental and offspring causal effects, because the LHCMR model assumes that there is no correlation between the genetic effects on the exposure and the genetic effects on the confounder (which is not the case for parental vs offspring traits). Thus, LHCMR causal effect estimates are just as likely to reflect parental effects as any other MR method^{29}. This may be the case, for example, for the detrimental effect of increased (parental) BMI on education (supported by longitudinal studies^{30}), the positive effect of (parental) height on birth weight^{31}, or on education^{32}. There are also some associations identified only by LHCMR that might reflect parental effects: the negative causal effect of CAD on education or on birth weight, the positive impact of HDL on birth weight, or DM reducing height. All these pair associations uniquely found by LHCMR are examples of LHCMR’s use of wholegenome SNPs instead of GWsignificant SNPs only, as our estimates are of larger magnitude than those found by standard MR. Interestingly, for the CAD → birth weight relationship, LHCMR revealed a confounder of opposite causal effects, which could have masked/mitigated the causal effect of standard MR methods.
A systematic comparison between IVW and LHCMR has shown generally good agreement between the two methods, which is illustrated in Fig. 5. To identify discrepancies between our causal estimates and those of the standard MR results, we grouped the estimates into several categories, either nonsignificant pvalue for both or either, significant with an agreeing sign for the causal estimate, or significant with a disagreeing sign. The diagonal (seen in Fig. 5) representing the agreement in significance status and sign between the two methods, is heavily populated. On the other hand, 34 pairs have causal links that are significantly nonzero according to LHCMR, but are nonsignificant for IVW, while the opposite is true for seven pairs. We believe that many of these seven pairs may be false positives, since four of them are picked up by no other MR method, two are confirmed by only one other method and the last one by two methods. Further comparisons of significance between LHCMR estimates and the remaining standard MR methods can be found in Supplementary Table 2.
LHCMR identified a confounder for 16 trait pairs out of the possible 78. In order to support these findings, we used EpiGraphDB^{33,34} to systematically identify those potential confounders. EpiGraphDB could identify reliable confounders for ten out of the 16 trait pairs. Notably, for the birth weight–diabetes pair, the average epigraph confoundereffect ratio (r_{3}/r_{1}) clearly agreed in sign with our t_{y}/t_{x} ratio, indicating that the characteristics of the confounder(s) evidenced by LHCMR agree with those found in an exhaustive confounder search, and are mainly obesityrelated traits (Supplementary Fig. 18a). Six other trait pairs showed mixed signs of different confounders, indicating the possibility of having heterogeneous confounders (Supplementary Fig. 18be). Finally, three trait pairs showed a disagreement between our estimated confoundereffect ratio and the bulk of those found by epighraphDB as seen in Supplementary Fig. 18fj. However, at least one of the top ten potential confounders showed effects that are in agreement with our ratio for each of these pairs. Note that since the reported causal effects of the confounders on X and Y reported in EpiGraphDB are not necessarily on the same scale, we do not expect the magnitudes to agree.
As described in the methods (Eq. (32)), genetic correlation can be computed from our estimated model parameters. To verify that the fitted LHCMR model leads to a genetic correlation similar to the one obtained from LDscore regression^{35} (LDSC), we compared whether the two approaches produce similar genetic correlation estimates. We did this by taking the estimated parameters obtained from the 200 block jackknife to estimate the genetic correlations between traits (and their standard errors), and plotted them against LDscore regression values as seen in Fig. 6. As expected, we observe an overall good agreement between the estimates of the two methods, with only six trait pairs differing in sign. Of these six, only 2 were nominally significantly different between the two methods (LDL → Asthma and LDL → DM). Further decomposition of the genetic covariance into heritable confounderled or causal effectled covariance revealed that most of the genetic covariance between traits can be attributed to bidirectional causal effects. A reason for this could be that confounders would need to have very strong effects to substantially contribute to the genetic correlation (≈t_{x} ⋅ t_{y}) compared to the bidirectional causal effects (\(\approx\! {\alpha }_{x\to y}^{2}\cdot {h}_{x}^{2}+{\alpha }_{y\to x}^{2}\cdot {h}_{y}^{2}\)).
As for the comparison of LHCMR against CAUSE for real trait pairs, we ran CAUSE on all 156 trait pairs (bidirectional), and extracted the parameter estimates that corresponded to the methods winning model. The pvalue threshold was corrected for multiple testing and was equivalent to 0.05/156. Based on that threshold, the pvalue that compared between the causal and the sharing model of CAUSE was used to choose one of the two. Then the parameters estimated from the winning model, γ (only for causal model), η and q, were compared to their counterparts in LHCMR. A visual comparison of LHCMR’s causal estimates and those of CAUSE can be seen in Supplementary Fig. 19.
Whenever the causal effect estimates were significant both for CAUSE and LHCMR (30 causal relationships), they always agreed in sign (Supplementary Table 3) with a high Pearson correlation of 0.592. Calculating the correlation for their estimates regardless of significance yielded a smaller value of 0.377. When compared to the causal effect estimate from IVW, LHCMR was strongly correlated (0.585), whereas CAUSE had a slightly weaker correlation (0.471) using all estimates.
Similarly, the significant confoundereffect ratio of LHCMR (t_{y}/t_{x}) can be compared to the significant confoundereffect estimate of CAUSE (η) when a sharing model is chosen. These 12 confounding quantities by CAUSE and LHCMR disagreed in sign for all but one trait pair (Height → MI), with a Pearson correlation compatible with zero (−0.357 (95% CI [−0.77, 0.27])).
Discussion
We have developed a structural equation (mixedeffects) model to account for a latent heritable confounder (U) of an exposure (X)–outcome (Y) relationship in order to estimate bidirectional causal effects between the two traits (X and Y). The method, termed LHCMR, fits this model to association summary statistics of genomewide genetic markers to estimate various global characteristics of these traits, including bidirectional causal effects, confounder effects, direct heritabilities, polygenicities and population stratification.
We first demonstrated through simulations that in most scenarios, the method produces causal effect estimates with substantially less bias and variance (in the larger sample size) than other MR tools. The direction and magnitude of the bias of classical MR approaches varied across scenarios and sample sizes. This bias was mainly influenced by two often opposite forces: downward bias resulting from winner’s curse and weak instruments, and upward bias due to a positive confounder of the X − Y relationship, evident in the larger sample size. In the scenario lacking a confounder (thus respecting all MR assumptions), MR methods were distinctly underestimating the causal effect, except for LHCMR and to a better extent MRRAPS. However, under standard settings with an added small heritable confounder and no reverse causality present, all classical MR methods still slightly underestimated the causal effect in the smaller sample size, except for the MRRAPS estimate which was now overestimated. For the same standard setting scenario but in a larger sample size where confounder effects were more detectable, IVW had an estimation that was close to the true causal value chosen (α_{x→y} = 0.3) due to the opposite biases cancelling out. However, when the causal effect was set to be smaller (α_{x→y} = 0.1), the estimates of IVW became biased. More substantial violations of classical MR assumptions, such as the presence of negativeeffect confounder or a negative reverse causal effect, led to more substantial biases that impacted all methods (including MRRAPS) except LHCMR.
Interestingly, in the smaller sample size, standard MR methods showed a slight decreasing trend in the variance of the causal effect estimate as the kurtosis of the underlying effect size distribution went up from 2 to 10. On the other hand, LHCMR did not show a similar trend with growing kurtosis, and estimated the causal effect with a smaller bias. As confounder causal effects (q_{x}, q_{y}) increased, classical MR methods (except weighted ones) were prone to produce overestimated causal effects with at least twice the bias than that of LHCMR, especially in the large sample size where the confounderassociated SNPs make it to the set of GWsignificant instruments for all methods. Furthermore, modebased estimators were robust to the presence of two concordant confounders, yet their bias was still 10fold higher than LHCMR’s, and they did not perform as well in the presence of discordant confounders. In summary, LHCMR was robust to a wide range of violations of the classical MR assumptions and was less impacted than standard MR methods. Thus it outperformed all MR methods in virtually all tested scenarios, many of which violated even its own modelling assumptions.
We then applied our method to summary statistics of 13 complex traits from large studies, including the UK Biobank. We observed a general trend in our results (in agreement with epidemiological studies) that higher BMI and LDL are risk factors for most diseases such as diabetes and CAD. We also note the protective effect HDL has on these same diseases. Moreover, we observe many disease traits increasing the intake of lipidlowering medication (simvastatin), reflecting the recommendation/treatment of medical personnel following the diagnosis.
LHCMR can have discordant results compared to other MR methods for many possible reasons. The positive causal effect of smoking on MI, diabetes on asthma, the protective impact of higher birth weight on asthma, or higher education on smoking intensity, all of which were missed by standard MR could reflect the increased power of LHCMR with its use of fullgenome SNPs as opposed to genomewide significant SNPs of classical MR approaches.
Estimates from classical MR methods could also be impacted by sample overlap between the exposure and outcome datasets, whereas LHCMR takes this into account. However, when using large sample sizes, the bias due to sample overlap is expected to be very small, and therefore not sufficient to explain any discrepancy in the results^{36}. Another possible reason for the discrepancy between our findings and those of standard MR methods is the presence of a significant heritable confounder found by LHCMR with opposite effect to the estimated causal effect between the pair. These two opposite forces lead to association summary statistics that may be compatible with reduced (or even null) causal effect when the confounder is ignored. Possible examples of this scenario can be observed when (parental) traits, e.g. diabetes and CAD, act on birth weight. These pairs have a confounder of opposite effects, possibly related to (parental) obesity. Similarly, standard MR methods show little evidence for a causal effect of SBP on height, while our LHCMR estimate is −0.37 (P = 4.81 × 10^{−8}) which most probably reflects parental (maternal) effects as seen in previous studies^{37,38}. The protective effect of HDL on SBP is another example where a confounder of opposite sign to that of the causal effect allows it to be uniquely found by LHCMR. LHCMR assumes no genetic correlation between the confounder and the direct effects on the exposure, which may be violated when the confounder is the same trait as the exposure, but in the parent. Such parental effects can mislead most MR methods^{39}, including ours, and hence we may observe biased results for traits such as BMI → education and HDL → birth weight.
Sixteen trait pairs showed a strong confounder effect, in the form of significant t_{x} and t_{y} estimates. These pairs were investigated for the presence of confounders using EpiGraphDB, and 10 of them returned possible confounders. The bulk of such pairs returned confounders with both agreeing and disagreeing effect directions on X and Y, making it difficult to pinpoint a group of concordant and dominant confounders. However, for the birth weightDM pair, where LHCMR identifies a negative reverse causal effect and a confounder with effects t_{x} = 0.10(P = 6.77 × 10^{−8}) and t_{y} = 0.15(P = 3.13 × 10^{−7}) on birth weight and DM respectively, EpigraphDB confirmed several confounders related to body fat distribution and weight that matched in sign with our estimated confounder effect (Supplementary Fig. 18a). Note that EpiGraphDB causal estimates are not necessarily on the scale of SD outcome difference upon 1 SD exposure change scale, hence they are not directly comparable with the t_{y}/t_{x} ratio, but are rather indicative of the sign of the causal effect ratio of the confounder. Furthermore, if EpigraphDB does not find a causal relationship between the trait pair in either directions, then it does not return any possible confounders of the two, a reason why only 10 out of 16 confounderassociated trait pairs returned any hits.
Lastly, our comparison of the genetic correlations calculated from our estimated parameters against those calculated from LDscore regression showed good concordance, confirming that the detailed genetic architecture proposed by our model is compatible with the observed genetic covariance. The major difference between the genetic correlation obtained by LDscore regression vs LHCMR is that our model approximates all existing confounders by a single latent variable, which may be inaccurate when multiple ones exist with highly variable t_{y}/t_{x} ratios. Furthermore, LHCMR decomposed the observed genetic correlation into confounder and bidirectional causality driven components, revealing that most genetic correlations are primarily driven by bidirectional causal effects. Note that we have much higher statistical power to detect situations when the confounder effects are of opposite sign compared to the causal effects, because opposing genetic components are more distinct.
To our knowledge only two recent papers use similar models and genomewide summary statistics. The LCV approach^{40} is a special case of our model, where the causal effects are not included in the model, but they estimate the confounder effect mixed with the causal effect to estimate a quantity of genetic causality proportion (GCP). In agreement with others^{10,41}, we would not interpret nonzero GCP as evidence for causal effect. Moreover, in other simulation settings, LCV has shown very low power to detect causal effects (by rejecting GCP = 0) (Fig S15 in Howey et al.^{42}). Another very recent approach, CAUSE^{10}, proposes a structural equation mixed effect model similar to ours. However, there are several differences between LHCMR and CAUSE: (a) we allow for bidirectional causal effects and model them simultaneously, while CAUSE is fitted twice for each direction of causal effect; (b) they first use an adaptive shrinkage method to integrate out the multivariable SNP effects and then go on to estimate other model parameters, while we fit all parameters at once; (c) CAUSE estimates the correlation parameter empirically; (d) we assume that direct effects come from a twocomponent Gaussian mixture, while they allow for larger number of components; (e) their likelihood function does not explicitly model the shift between univariate vs multivariate effects (i.e. the LD); (f) CAUSE adds a prior distribution for the causal/confounder effects and the proportion π_{u}, while LHCMR does not; (g) to calculate the significance of the causal effect they estimate the difference in the expected log pointwise posterior density and its variance through importance sampling, whereas we use a simple blockjackknife method. Because of point (a), the CAUSE model can be viewed as a special case of ours when there is no reverse causal effect. We have the advantage of fitting all parameters simultaneously, while they only approximate this procedure. Although they allow for more than a twocomponent Gaussian mixture, for most traits with realistic sample sizes we do not have enough power to distinguish whether two or more components fit the data better. Therefore, we believe that a twocomponent Gaussian is a reasonable simplification. Due to the more complicated approach described in points (eg), CAUSE is computationally more intense than LHCMR, taking up to 1.25 CPUhours in contrast to our 2.5 CPUminute run time for a single starting point optimisation (which is massively parallelisable).
When we compared the performance of CAUSE and LHCMR, we found that for large sample sizes both LHCMR and CAUSE performed well not only when applied to data simulated by their own model, but also by the model of the other method. For smaller sample sizes, both methods performed poorly when applied to data generated by the other model. However, LHCMR was less biased when applied to data generated by its own model than CAUSE was on data simulated based on its own model, where it provided rather conservative estimates. This is somewhat expected, since the primary aim of CAUSE is model selection and it is less geared towards parameter estimation, especially for settings where both sharing and causal effects are present (leading to very broad estimates). Also, CAUSE parameter estimates have shown to be somewhat sensitive to the choice of the prior.
Finally, when applying both LHCMR and CAUSE to 156 complex trait pairs, we observed that the causal effects are reasonably well correlated (0.38 for all estimates, 0.59 for significant estimates) and agree in sign for trait pairs deemed significantly causal by either or both methods. In addition, LHCMR causal estimates were more similar to those of IVW than the estimates provided by CAUSE. Surprisingly, when a confounding factor was identified by both methods, the confounder effects (LHCMR t_{y}/t_{x} ratio and CAUSE η parameter) were uncorrelated. There are two possible explanations for this: (i) CAUSE may confuse/merge the confounder with the reverse causal effect, since it does not explicitly model the latter one. (ii) The two models assume different marginal effect size distributions, hence when multiple heterogeneous confounders exist, one method may detect one of the confounders, while the other method picks up the other confounder, depending on which has more similar genetic architecture to the assumed one.
Our approach has its own limitations, which we list below. Like any MR method, LHCMR provides biased causal effect estimates if the input summary statistics are flawed (e.g. not corrected for complex population stratification, parental/dynasty effects). As mentioned in the Methods section, our model is strictlyspeaking unidentifiable and two distinct sets of parameters fit the data equally well, if the alternate set of parameters fall within the parameter ranges. As opposed to classical MR methods that give a single (biased) causal effect estimate, ours can detect and calculate the competing model. Due to biological considerations, from these competing models, we chose the one which yielded larger direct heritability than confounderdriven (indirect) heritability. Additional pointers to decide which parameter optimum we choose can be to pick the one with smaller magnitude of causal effects (large causal effects are unrealistic) or pick the one that includes causal effects that agree better with those of other MR methods.
LHCMR is not an optimal solution for traits whose genetic architecture substantially deviates from a twocomponent Gaussian mixture of effect sizes. Also, for traits with low heritability (<2.5%), it is particularly important to compare the causal effect estimates to those from standard MR methods as results from LHCMR may be less robust. In addition, trait pairs with multiple confounders with heterogeneous effect ratios can violate the single confounder assumption of the LHC model and can lead to biased causal effect estimates. Finally, LHCMR, like other methods, is not immune to parental effects that are correlated with offspring effects. In such cases, the parental effect is grouped with the exposure (due to their strong genetic correlation) and not viewed as a confounder of the exposureoutcome relationship.
Methods
The underlying structural equation model
Let X and Y denote continuous random variables representing two complex traits. Let us assume (for simplicity) that there is one heritable confounder U of these traits. To simplify notation we assume that E(X) = E(Y) = E(U) = 0 and Var(X) = Var(Y) = Var(U) = 1. The genomewide sequence data for M sequence variants is denoted by G = (G_{1}, G_{2}, …, G_{M}). The aim of our work is to dissect the effects of the heritable confounding factor U from the bidirectional causal effects of these two traits (X and Y). For this we consider a model (see Fig. 1) defined by the following equations:
where \({{{{{{{{\boldsymbol{\gamma }}}}}}}}}_{{{{{{{{\boldsymbol{x}}}}}}}}},{{{{{{{{\boldsymbol{\gamma }}}}}}}}}_{{{{{{{{\boldsymbol{y}}}}}}}}},{{{{{{{{\boldsymbol{\gamma }}}}}}}}}_{{{{{{{{\boldsymbol{u}}}}}}}}}\in {{{{{{{{\mathcal{R}}}}}}}}}^{M}\) denote the (true multivariable) direct effect of all M genetic variants on X, Y and U, respectively. All error terms (e_{x}, e_{y} and e_{u}) are assumed to be independent of each other and normally distributed with variances \({\nu }_{x}^{2},{\nu }_{y}^{2}\) and \({\nu }_{u}^{2}\), respectively.
Note that we do not include in the model reverse causal effects on the confounder (X → U and Y → U). The reason for this is the following: Let s_{x} and s_{y} denote those causal effect of X and Y on U. We can see that by reparameterising the original model to \(\alpha ^{\prime} :={\alpha }_{x\to y}+{s}_{x}\cdot {q}_{y}\), \(\alpha ^{\prime} :={\alpha }_{y\to x}+{s}_{y}\cdot {q}_{x}\) and \({q}_{x}^{\prime}:={q}_{x}/(1{q}_{x}\cdot {s}_{x})\), \({q}_{y}^{\prime}:={q}_{y}/(1{q}_{y}\cdot {s}_{y})\), the genetic effects produced by the extended model with reverse causal effects on U and the simpler model (Fig. 1) with the updated parameters are indistinguishable. Thus these extra parameters are not identifiable and the reparameterisation means that α_{x→y} and α_{y→x} in our model represent the total causal effects, some of which may be mediated by U.
Note that the model cannot be represented by classical directed acyclic graphs, as the bidirectional causal effects could form a cycle. However, the equations can be reorganised to avoid recursive formulation as follows:
Regrouping the terms gives
Substituting U into the first two equations yields
with
where \({i}_{x}:=({\nu }_{x}^{2}+{\alpha }_{y\to x}^{2}{\nu }_{y}^{2}+({q}_{x}+{\alpha }_{y\to x}{q}_{y}){\nu }_{u}^{2})/{(1{\alpha }_{y\to x}{\alpha }_{x\to y})}^{2}\) and \({i}_{y}:=({\nu }_{y}^{2}+{\alpha }_{x\to y}^{2}{\nu }_{x}^{2}+({q}_{y}+{\alpha }_{x\to y}{q}_{x}){\nu }_{u}^{2})/{(1{\alpha }_{x\to y}{\alpha }_{y\to x})}^{2}\). Note that i_{x} is equivalent to the LDscore regression intercept^{43}.
We model the genetic architecture of these direct effects with a spikeandslab distribution, assuming that only 0 ≤ π_{x}, π_{y}, π_{u} ≤ 1 proportion of the genome have a direct effect on X, Y, U, respectively, and these direct effects come from a Gaussian distribution. Namely,
Here, ⊙ denotes elementwise multiplication and \({{{{{{{{\mathcal{B}}}}}}}}}_{m}(1,q)\) the m dimensional independent Bernoulli distribution. Further, we assume that all κ_{x}, κ_{y}, κ_{u}s are independent of each other and so are all ζ_{x}, ζ_{y}, ζ_{u}s. We can refer to \({h}_{x}^{2}:=M\cdot {\pi }_{x}\cdot {\sigma }_{x}^{2}\) as the direct heritability of X, i.e. independent of the genetic basis of U and Y. Similar notation is adapted for U (\({h}_{u}^{2}:=M\cdot {\pi }_{u}\cdot {\sigma }_{u}^{2}\)) and Y (\({h}_{y}^{2}:=M\cdot {\pi }_{y}\cdot {\sigma }_{y}^{2}\)). Note that when q_{x} = 0 and q_{y} ≠ 0 (or vice versa), this means that there is no confounder U present, but the genetic architecture of Y (or X) can be better described by a threecomponent Gaussian mixture distribution.
We assume that the correlation (across markers) between the direct effects of a genetic variant on X, Y and U is zero, i.e. cov(γ_{x}, γ_{y}) = cov(γ_{x}, γ_{u}) = cov(γ_{u}, γ_{y}) = 0. Note that this assumption still allows for a potential correlation between the total effect of G on X and its horizontal pleiotropic effect on Y, but only due to the confounder U and through the reverse causal effect Y → X. As we argued above, this is a reasonable assumption, since the most plausible reason (apart from outcomedependent sampling, which is out of the scope of this paper) for the violation of the InSIDE assumption may be one or more heritable confounder(s).
For simplicity, we also assume that the set of genetic variants with direct effects on each trait overlap only randomly, i.e. the fraction of the genome directly associated with both X and Y is π_{x} ⋅ π_{y}, etc. This assumption is in line with recent observation that the bulk of observed pleiotropy can be explained by extreme polygenicity with random overlap between trait loci^{44}. Note that uncorrelated effects (e.g. cov(γ_{x}, γ_{y}) = 0) do not ensure that the active variant sets overlap randomly, this is a slightly stronger assumption.
The observed association summary statistics
Let us now assume that we observe univariable association summary statistics for these two traits from two (potentially overlapping) finite samples N_{x} and N_{y} of size n_{x}, n_{y}, respectively. In the following, we will derive observed summary statistics in sample N_{x} and then we will repeat the analogous exercise for sample N_{y}. Let the realisations of X, Y and U be denoted by \({{\boldsymbol{x}}},{{\boldsymbol{y}}}\) and \({{\boldsymbol{u}}}\in {{{{{{{{\mathcal{R}}}}}}}}}^{{n}_{x}}\). The genomewide genetic data are represented by \({G}_{x}\in {{{{{{{{\mathcal{R}}}}}}}}}^{{n}_{x}\times M}\) and the genetic data for a single nucleotide polymorphism (SNP) k tested for association is \({{\boldsymbol{{g}}}_{k}}\in {{{{{{{{\mathcal{R}}}}}}}}}^{{n}_{x}}\). Note the distinction between the kth column of G_{x}, which is the kth sequence variant, in contrast to \({{\boldsymbol{{g}}}_{k}}\), which is the kth SNP tested for association in the GWAS. We assume that all SNP genotypes have been standardised to have zero mean and unit variance. The marginal effect size estimate for SNP k of trait X can then be written as \({\widehat{\beta }}_{k}^{x}={{{\boldsymbol{g}}}}_{k}^{\prime}\cdot {{\boldsymbol{x}}}/{n}_{x}\), which is a special case of univariable standard normal linear regression when both the outcome and the predictor is standardised to have zero mean and unit variance^{43}. Note that \({{{{{{{\boldsymbol{x}}}}}}}}^{\prime}\) denotes the transpose of the column vector x. This can be further transformed as
By denoting the linkage disequilibrium (LD) between variant k and all markers in the genome with \({{{{{{{{\boldsymbol{\rho }}}}}}}}}_{{{{{{{{\boldsymbol{k}}}}}}}}}={G}_{x}^{\prime}\cdot {{{\boldsymbol{g}}}}_{k}/{n}_{x}\) we get
with \({\eta }_{k}^{x}:={{{\boldsymbol{g}}}}_{k}^{\prime}\cdot {{{{{{{\boldsymbol{{\epsilon }}}}}}}_{x}}}/{n}_{x} \sim {{{{{{{\mathcal{N}}}}}}}}(0,{i}_{x}/{n}_{x})\). Given the abovedefined genetic effect size distribution the equation becomes
Similarly, assuming that the LD structures (ρ_{k}) in the two samples are comparable, for \({\widehat{\beta }}_{k}^{y}\) estimated in the other sample (N_{y}) we obtain
with \({\eta }_{k}^{y} \sim {{{{{{{\mathcal{N}}}}}}}}(0,{i}_{y}/{n}_{y})\).
Therefore, the joint effect size estimates can be written as
Following the same rational as the crosstrait LDscore regression^{45}, the noise term distribution is readily obtained
where r_{x,y} is the observational correlation between variables X and Y and n_{x∩y} is the size of the overlapping samples for X and Y. Since both n_{x∩y} and r_{x,y} cannot be estimated, we simply denote \({i}_{x,y}:={r}_{x,y}\cdot \frac{{n}_{x\cap y}}{\sqrt{{n}_{x}\cdot {n}_{y}}}\) as the only estimated parameter and parameterise the covariance term as \(\frac{{i}_{x,y}}{\sqrt{{n}_{x}\cdot {n}_{y}}}\). Note that i_{x,y} is the crosstrait LDscore regression intercept.
While the bivariate probability density function (PDF) of these summary statistics cannot be obtained analytically, we could derive its characteristic function (see Supplementary Methods 1.1), which is the product of some transformed version of the characteristic functions of \({z}_{k}^{(x)},{z}_{k}^{(u)},{z}_{k}^{(y)}\) and \(({\eta }_{k}^{x},{\eta }_{k}^{y})\), yielding
Approximating the local correlations of SNP k (ρ_{k}) by a spike and slab distribution, parameterised by the fraction of nonzero correlations (π_{k}) and the variance of the nonzero correlations (\({\sigma }_{k}^{2}\)), allows the derivation of a closed form expressions of \({\varphi }_{{z}_{k}^{(u)}}\), \({\varphi }_{{z}_{k}^{(x)}}\) and \({\varphi }_{{z}_{k}^{(y)}}\).
Derivation of the likelihood function
Given that the characteristic function can be analytically derived, we used the inversion theorem (for characteristic functions) to obtain the joint distribution of \(\left({\widehat{\beta }}_{k}^{x},{\widehat{\beta }}_{k}^{y}\right)\) as
This integral can be efficiently computed by the Fast Fourier Transformation (FFT, see ref. ^{46} and references within. Detailed derivation is found in Supplementary Methods 1.2). To speed up computation, we bin SNPs according to their π_{k} and σ_{k} values which characterise the local LD distribution for each SNP k (10 × 10 bins with equidistant centres  see Supplementary Methods 1.3) and for SNPs in the same bin the PDF function is evaluated over a fine grid (2^{7} × 2^{7} combinations) using the FFT.
To reduce the number of parameters we define t_{x}: = σ_{u} ⋅ q_{x} and t_{y}: = σ_{u} ⋅ q_{y} since σ_{u} and q_{x} are separately not identifiable, but only their product is. Extensive simulations have shown that π_{u} is unidentifiable, and hence is set to an arbitrary value of 0.1. For improved interpretability, we slightly reparameterise the likelihood function by using \({h}_{x}^{2}:={\pi }_{x}\cdot M\cdot {\sigma }_{x}^{2},{h}_{y}^{2}:={\pi }_{y}\cdot M\cdot {\sigma }_{y}^{2}\). Since different SNPs are correlated we have to estimate the overcounting of each SNP. We choose the same strategy as LDscore regression^{43} and weigh each SNP by the inverse of its restricted LD score, i.e. \({w}_{k}=1/\mathop{\sum }\nolimits_{j = 1}^{{m}_{0}}{r}_{jk}^{2}\), where r_{jk} is the correlation between GWAS SNPs k and j. The loglikelihood function is, thus, of the form
where \({f}_{k}\left({\widehat{\beta }}_{k}^{x},{\widehat{\beta }}_{k}^{y}\right)\) is the loglikelihood function value for SNP k. Parameters {n_{x}, n_{y}, m, σ_{k=1,…,K}, π_{k=1,…,K}} are known and the other 11 parameters
are to be estimated from the observed association summary statistics. In order to further speed up computation, we estimate the 11 parameters in two separate steps: we first estimate for each trait the parameters π_{x}, i_{x} and π_{y}, i_{y} (SNP polygenicity and LDscore intercept) and the total heritability (unlike the direct heritability obtained by the fullmodel of LHCMR) by using a simplified model with only the trait of interest, without a second trait or confounder, e.g. we fit only \({\pi }_{x},{h}_{x}^{2}\) and i_{x} using \({\widehat{{{{{{{{\boldsymbol{\beta }}}}}}}}}}^{{{{{{{{\boldsymbol{x}}}}}}}}}\) and assume that π_{x} and i_{x} do not change when two traits are taken into account. Note that π_{x} may change slightly (decreasing from the total to direct polygenicity), but its value has little impact on the likelihood function. The estimates from the first step are then fixed for the parameter estimation of trait pairs in the second step. Since only π_{x}, i_{x} and π_{y}, i_{y} are fixed, the remaining parameters to estimate are now:
It is key to note that our approach does not aim to estimate individual (direct or indirect) SNP effects, as these are handled as random effects. By replacing U with − U we swap the signs of both t_{x} and t_{y}, therefore these parameters are unique only if the sign of one of them is fixed. Thus, we will have the following restrictions on the parameter ranges: \({h}_{x}^{2},{h}_{y}^{2},{t}_{x}\) are in [0, 1], t_{y}, α_{x→y}, α_{y→x}, i_{x,y} are in [ − 1, 1].
Likelihood maximisation and standard error calculation
Our method, termed Latent Heritable Confounder Mendelian Randomisation (LHCMR), maximises this likelihood function to obtain the MLE. Due to the complexity of the likelihood surface, we initialise the maximisation using 50 different starting points, where they come from a uniform distribution within the parameterspecific ranges mentioned above. We then choose parameter estimates corresponding to the highest likelihood of the 50 runs. Run time depends on the number of iterations during the maximisation procedure, and is linear with respect to the number of SNPs. It takes ~0.25 CPUminute to fit the complete model to 50,000 SNPs with a single starting point.
Given the particular nature of the underlying directed graph, two different sets of parameters lead to an identical fit of the data, resulting in two global optima. The reason for this is the difficulty in distinguishing the ratio of the confounder effects (t_{y}/t_{x}) from the causal effect (α_{x→y}), as illustrated in Supplementary Fig. 2 by the slopes belonging to different SNPclusters. More rigorously, it can be show that if {h_{x}, h_{y}, α_{x→y}, α_{y→x}, t_{x}, t_{y}} is an optimum, then so will be \(\{{h}_{x}^{\prime},{h}_{y}^{\prime},{\alpha }_{x\to y}^{\prime},{\alpha }_{y\to x}^{\prime},{t}_{x}^{\prime},{t}_{y}^{\prime}\}\), where
with w = t_{y}/t_{x} (for further derivations, see Supplementary Methods 1.4). This allows us to directly obtain both optima, even if the optimisation only revealed one of them. It happens very often that one of these parameter sets are outside of the allowed ranges and hence can be automatically excluded. If not, we keep track of both parameter estimates maximising the likelihood function. Note that, we call the one for which the direct heritability is larger than the indirect one, i.e. \({h}_{x}^{2} \, > \, {t}_{x}^{2}\), the primary solution. We show that for real data application this solution is far more plausible than the alternative optimum. Finally, note that such bimodality can be observed at different levels: (i) For one given data generation, using multiple starting points leads to different optima; (ii) LHCMR applied to multiple different data generations for a fixed parameter setting can yield different optima. Both of these situations are signs of the same underlying phenomenon and most often cooccur.
We implemented the blockjackknife procedure that is also used by LDscore regression to calculate the standard errors. For this we split the genome into 200 jackknife blocks and compute MLE in a leaveoneblockout fashion yielding \({\widehat{{{{{{{{\boldsymbol{\theta }}}}}}}}}}^{(i)},i=1,\ldots ,200\) estimates. The variance of the full SNP MLE is then defined as \(Var(\widehat{\theta }):=\frac{mm\cdot (1/200)}{m\cdot (1/200)}\cdot \frac{1}{2001}\mathop{\sum }\nolimits_{i = 1}^{200}{({\widehat{{{{{{{{\boldsymbol{\theta }}}}}}}}}}^{(i)}\widehat{{{{{{{{\boldsymbol{\theta }}}}}}}}})}^{2}=\mathop{\sum }\nolimits_{i = 1}^{200}{({\widehat{{{{{{{{\boldsymbol{\theta }}}}}}}}}}^{(i)}\widehat{{{{{{{{\boldsymbol{\theta }}}}}}}}})}^{2}\).
Decomposition of genetic correlation
Given the starting equations for X and Y (Eqs. (2)–(3)) we can calculate their genetic correlation as the ratio between their genetic covariance and variance (calculated from their heritabilities) as such:
The full details of the derivation is found in Supplementary Methods 1.5. Using our estimated parameters, we first calculate the correlation based on Eq. (32) and then compare them to those obtained by LDscore regression.
Simulation settings
First, we tested LHCMR using realistic parameter settings with a mild violation of the classical MR assumptions. These standard parameter settings consisted of simulating m = 234,000 SNPs for two nonoverlapping cohorts of equal size (for simplicity) of n_{x} = n_{y} = 50,000 for each trait. X, Y and U were simulated with moderate polygenicity (π_{x} = 5 × 10^{−3}, π_{y} = 1 × 10^{−2}, π_{u} = 5 × 10^{−2}), and considerable direct heritability (\({h}_{x}^{2}=0.25,{h}_{y}^{2}=0.2,{h}_{u}^{2}=0.3\)). U had a confounding effect on the two traits as such, q_{x} = 0.3, q_{y} = 0.2 (resulting in t_{x} = 0.16, t_{y} = 0.11), and X had a direct causal effect on Y (α_{x→y} = 0.3), while the reverse causal effect from Y to X was set to null. Note that in this setting the total heritability of each of these traits is principally driven by direct effects and less than 10% of the total heritability is through a confounder and in case of Y less than an additional 8% of its total heritability is through X.It is important to note that for each tested parameter setting, we generated 50 different datasets, and each data generation underwent a likelihood maximisation of Eq. (25) using 50 starting points, and produced estimated parameters corresponding to the highest likelihood (simplified schema in Supplementary Fig. 3).
In the following simulations, we changed various parameters of these standard settings to test the robustness of the method. We explored how increased sample size (n_{x} = n_{y} = 500, 000) or differences in sample sizes ((n_{x}, n_{y}) = (50, 000, 500, 000) and (n_{x}, n_{y}) = (500, 000, 50, 000)) influence causal effect estimates of LHCMR and other MR methods. We also simulated data with no causal effect (or with no confounder) and then examined how LHCMR estimates those parameters. Next, we varied our causal effects between the two traits by lowering α_{x→y} to 0.1, and in another setting by introducing a reverse causal effect (α_{y→x} = −0.1). In addition, we tried to create extremely unfavourable conditions for all MR analyses by varying the confounding effects. We did this in several ways: (i) increasing q_{x} and q_{y} (q_{x} = 0.75, q_{y} = 0.50), (ii) having a confounder with causal effects of opposite signs on X and Y (q_{x} = 0.3, q_{y} = −0.2). We also drastically increased the proportion of SNPs with nonzero effect on traits X, Y and U (π_{x}, π_{y} and π_{u} = 0.1, 0.15, 0.2 respectively). We also simulated data whereby the confounder has lower (π_{u} = 0.01) polygenicity than the two focal traits.
Finally, we explored various violations of the assumptions of our model (see Methods Section). First, we introduced two confounders in the simulated data, once with causal effects on X and Y that were concordant (\({t}_{x}^{(1)}=0.16,{t}_{y}^{(1)}=0.11,{t}_{x}^{(2)}=0.22,{t}_{y}^{(2)}=0.16\)) in sign, and another with discordant effects (\({t}_{x}^{(1)}=0.16,{t}_{y}^{(1)}=0.11,{t}_{x}^{(2)}=0.22,{t}_{y}^{(2)}=0.16\)), while still fitting the model with only one U. Second, we breached the assumption that the nonzero effects come from a Gaussian distribution. By design, the first three moments of the direct effects are fixed: they have zero mean, their variance is defined by the direct heritabilities and they must have zero skewness because the effect size distribution has to be symmetrical. Therefore, to violate the normality assumption, we varied the kurtosis (2, 3, 5 and 10) of the distribution drawn from the Pearson’s distribution family. Third, we tested the assumption of the direct effects on our traits coming from a twocomponent Gaussian mixture by introducing a third component and observing how the estimates were effected. In this simulation scenario we introduced a large effect third component for X while decreasing the polygenicity of U (\({\pi }_{x1}=1\times 1{0}^{4},{\pi }_{x2}=1\times 1{0}^{2},{h}_{x1}^{2}=0.15,{h}_{x2}^{2}=0.1,{\pi }_{u}=1\times 1{0}^{2}\)).
Application to real summary statistics
Once we demonstrated favourable performance of our method on simulated data, we went on to apply LHCMR to summary statics obtained from the UK Biobank and other metaanalytic studies (Supplementary Table 1) in order to estimate pairwise bidirectional causal effect between 13 complex traits. The traits varied between conventional risk factors (such as low education, high body mass index (BMI), dislipidemia) and diseases (including diabetes and coronary artery disease among others). SNPs with imputation quality greater than 0.99, and minor allele frequency (MAF) greater than 0.5% were selected. Moreover, SNPs found within the human leukocyte antigen (HLA) region on chromosome 6 were removed due to the abundance of SNPs associated with autoimmune and infectious diseases as well as the complicated LD structure present in that region. For traits with total heritability below 2.5%, the outgoing causal effect estimates were ignored since instrumenting such barely heritable traits is questionable.
In order to perform LHCMR between trait pairs, a set of overlapping SNPs was used as input for each pair. The effects of these overlapping SNPs were then aligned to the same effect allele in both traits. To decrease computation time further (while only minimally reducing power), we selected every 10th QCfiltered SNP as input for the analysis. We calculated regression weights using the UK10K panel, which may be suboptimal for summary statistics not coming from the UK Biobank, but we have previously shown^{47} that estimating LD in a tentimes larger dataset (UK10K) outweighs the benefit of using smaller, but possibly bettermatched European panel (1000 Genomes^{48}).
We also ran IVW for each trait pair in both directions to estimate bidirectional causal effects as well as LDscore regression to get the crosstrait intercept term. We then added uniformly distributed (~U(−0.1, 0.1)) noise to these preestimated parameters to generate starting points for the second step of the likelihood optimisation. These closertotarget starting points did not change the optimisation results, simply sped up the likelihood maximisation and increased the chances to converge to the same (primary) optimum. The LHCMR procedure was run for each pair of traits 100 times, each using a different set of randomly generated starting points within the ranges of their respective parameters. For the optimisation of the likelihood function (Eq. (25)), we used the R function ‘optim’ from the ‘stats’ R package^{49}. Once we fitted this complete model estimating 11 parameters in two steps \(\{{i}_{x},{i}_{y},{\pi }_{x},{\pi }_{y},{h}_{x}^{2},{h}_{y}^{2},{t}_{x},{t}_{y},{\alpha }_{x\to y},{\alpha }_{y\to x},{i}_{xy}\}\), we then ran block jackknife to obtain the SE of the parameters estimated in the second step: \(\{{h}_{x}^{2},{h}_{y}^{2},{t}_{x},{t}_{y},{\alpha }_{x\to y},{\alpha }_{y\to x},{i}_{xy}\}\).
To support the existence of the confounders identified by LHCMR, we used EpiGraphDB^{33,34} to systematically identify those potential confounders. The database provided for each potential confounder of a causal relationship, a causal effect on trait X and Y (r1, and r3 in their notation), the sign of the ratio of which (sign(r_{3}/r_{1})) was compared to the sign of the LHCMR estimated t_{y}/t_{x} values representing the strength of the confounder acting on the two traits. We restricted our comparison to the sign only, since the r1, r3 values reported in EpiGraphDB are not necessarily on the same scale.
Comparison against conventional MR methods and CAUSE
We compared the causal parameter estimates of the LHCMR method to those of five conventional MR approaches (MREgger, weighted median, IVW, mode MR and weighted mode MR) using a Ztest^{50}. The ‘TwoSampleMR’ R package^{51} was used to get the causal estimates for all the pairwise traits as well as their standard errors from the abovementioned MR methods. The same set of genomewide SNPs that were used by LHCMR, were used as input for the package. SNPs associated with the exposure were selected to various degrees (for simulation we selected SNPs over a range of thresholds: absolute pvalue < 5 × 10^{−4} to < 5 × 10^{−8}), and SNPs more strongly associated with the outcome than with the exposure (pvalue < 0.05 in onesided ttest) were removed. The default package settings for the clumping of SNPs (r^{2} = 0.001) were used and the analysis was run with no further changes. We tested the agreement between the significance and direction of our estimates and that of standard MR methods, with the focus being on finding differences in statistical conclusions regarding causal effect sizes.
We compared our causal estimates from all our simulation settings to the causal estimates obtained by running MRRAPS^{11} also using the ‘TwoSampleMR’ R package, once by using the entire set of SNPs, and another by filtering for SNPs with a significance threshold of <5 × 10^{−4}. We also compared both our simulation as well as real data results against those of CAUSE^{10}. We first generated simulated data under the LHC model and used them as input to estimate the causal effect using CAUSE. We then generated simulated data using the CAUSE framework and inputted them into LHCMR (as well as standard MR methods) to estimate the causal parameters. Lastly, we compared causal estimates obtained for the 78 trait pairs (156 bidirectional causal effects) from LHCMR to those obtained when running CAUSE.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The origin of the summary statistics data used is referenced in Supplementary Table 1. The UK Biobank summary statistics data used in this study came from the Neale Lab^{52}, and can be downloaded from http://www.nealelab.is/ukbiobank. Data on coronary artery disease^{53} have been contributed by the CARDIoGRAMplusC4D and UK Biobank CardioMetabolic Consortium CHD working group who used the UK Biobank Resource (application number 9922). Data have been downloaded from http://www.cardiogramplusc4d.org/datadownloads/. The computed local LD scores described in Supplementary Methods 1.3 can be downloaded from https://wp.unil.ch/sgg/lhcmr/. We also used EpiGraphDB, an analytical platform and database to support data mining in epidemiology, to preform Phenomewide MR search. Access to EpigraphDB is free and may be done through their web application https://epigraphdb.org or their R package https://github.com/MRCIEU/epigraphdbr.
Code availability
The source code^{54} for this work can be found on https://github.com/LizaDarrous/LHCMR_v2/(https://doi.org/10.5281/zenodo.5534639), it has also been developed into an R package that can be downloaded from https://github.com/LizaDarrous/lhcMR.
References
Fewell, Z., Davey Smith, G. & Sterne, J. A. C. The impact of residual and unmeasured confounding in epidemiologic studies: a simulation study. Am. J. Epidemiol. 166, 646–655 (2007).
Pingault, J.B. et al. Using genetic data to strengthen causal inference in observational research. Nat. Rev. Genet. 19, 566–580 (2018).
Barter, P. J. et al. Effects of torcetrapib in patients at high risk for coronary events. N. Engl. J. Med. 357, 2109–2122 (2007).
Fordyce, C. B. et al. Cardiovascular drug development: is it dead or just hibernating? J. Am. Coll. Cardiol. 65, 1567–1582 (2015).
Lawlor, D. A., Harbord, R. M., Sterne, J. A. C., Timpson, N. & Davey Smith, G. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat. Med. 27, 1133–1163 (2008).
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 37, 658–665 (2013).
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through egger regression. Int. J. Epidemiol. 44, 512–525 (2015).
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314 (2016).
Hartwig, F. P., Davey Smith, G. & Bowden, J. Robust inference in summary data mendelian randomization via the zero modal pleiotropy assumption. Int. J. Epidemiol. 46, 1985–1998 (2017).
Morrison, J., Knoblauch, N., Marcus, J. H., Stephens, M. & He, X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genomewide summary statistics. Nat. Genet. 52, 740–747 (2020).
Zhao, Q., Wang, J., Hemani, G., Bowden, J. & Small, D. S. Statistical inference in twosample summarydata Mendelian randomization using robust adjusted profile score. Ann. Stat. 48, 1742 – 1769 (2020).
Laclaustra, M. et al. Ldl cholesterol rises with bmi only in lean individuals: Crosssectional u.s. and spanish representative data. Diabetes Care 41, 2195–2201 (2018).
Drøyvold, W. B., Midthjell, K., Nilsen, T. I. L. & Holmen, J. Change in body mass index and its impact on blood pressure: a prospective population study. Int. J. Obes. 29, 650–655 (2005).
Lee, M.R., Lim, Y.H. & Hong, Y.C. Causal association of body mass index with hypertension using a mendelian randomization design. Medicine (Baltimore) 97, e11252 (2018).
Corbin, L. J. et al. Bmi as a modifiable risk factor for type 2 diabetes: refining and understanding causal estimates using mendelian randomization. Diabetes 65, 3002–3007 (2016).
Narayan, K., Boyle, J. P., Thompson, T. J., Gregg, E. W. & Williamson, D. F. Effect of bmi on lifetime risk for diabetes in the u.s. Diabetes Care 30, 1562–1566 (2007).
Yusuf, S. et al. Obesity and the risk of myocardial infarction in 27,000 participants from 52 countries: a casecontrol study. Lancet 366, 1640–1649 (2005).
Riaz, H. et al. Association between obesity and cardiovascular outcomes: a systematic review and metaanalysis of Mendelian Randomization Studies. JAMA Netw. Open 1, e183788–e183788 (2018).
Sun, D. et al. Type 2 diabetes and hypertension. Circulation Res. 124, 930–937 (2019).
Tomeo, C. A., Field, A. E., Berkey, C. S., Colditz, G. A. & Frazier, A. L. Weight concerns, weight control behaviors, and smoking initiation. Pediatrics 104, 918–924 (1999).
Cawley, J., Markowitz, S. & Tauras, J. Lighting up and slimming down: the effects of body weight and cigarette prices on adolescent smoking initiation. J. Health Econ. 23, 293–311 (2004).
Cao, M. & Cui, B. Association of educational attainment with adiposity, type 2 diabetes, and coronary artery diseases: a Mendelian Randomization Study. Front. Public Health 8, 112 (2020).
Loucks, E. B. et al. Education and coronary heart disease risk associations may be affected by earlylife common prior causes: a propensity matching analysis. Ann. Epidemiol. 22, 221–232 (2012).
Gage, S. H., Bowden, J., Davey Smith, G. & Munafò, M. R. Investigating causality in associations between education and smoking: a twosample Mendelian randomization study. Int. J. Epidemiol. 47, 1131–1140 (2018).
Sanderson, E., Davey Smith, G., Bowden, J. & Munafò, M. R. Mendelian randomisation analysis of the effect of educational attainment and cognitive ability on smoking behaviour. Nat. Commun. 10, 2949 (2019).
Marouli, E. et al. Mendelian randomisation analyses find pulmonary factors mediate the effect of height on coronary artery disease. Commun. Biol. 2, 119 (2019).
Tan, L. E., Llano, A., Aman, A., Dominiczak, A. F. & Padmanabhan, S. A18709 mendelian randomization study of causal relationship of height on blood pressure and arterial stiffness. J. Hypertens. 36, e91–e92 (2018).
Laaksonen, D. E. et al. Dyslipidaemia as a predictor of hypertension in middleaged men. Eur. Heart J. 29, 2561–2568 (2008).
Davies, N. M. et al. Within family Mendelian randomization studies. Hum. Mol. Genet. 28, R170–R179 (2019).
Benson, R., von Hippel, P. T. & Lynch, J. L. Does more education cause lower bmi, or do lowerbmi individuals become more educated? evidence from the national longitudinal survey of youth 1979. Soc. Sci. Med. 211, 370–377 (2018).
Witter, F. R. & Luke, B. The effect of maternal height on birth weight and birth length. Early Hum. Dev. 25, 181–186 (1991).
Tyrrell, J. et al. Height, body mass index, and socioeconomic status: mendelian randomisation study in uk biobank. BMJ 352, i582 (2016).
MRC IEU. EpiGraphDB. http://epigraphdb.org/ (2019).
Liu, Y. et al. Epigraphdb: a database and data mining platform for health data science. Bioinformatics 37, 1304–1311 (2021).
BulikSullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Mounier, N. & Kutalik, Z. Correction for sample overlap, winner’s curse and weak instrument bias in twosample mendelian randomization. bioRxiv https://www.biorxiv.org/content/10.1101/2021.03.26.437168v1?rss=1 (2021).
Thomas, D., Strauss, J. & Henriques, M.H. How does mother’s education affect child height? J. Hum. Resour. 26, 183–211 (1991).
Warrington, N. M. et al. Maternal and fetal genetic effects on birth weight and their relevance to cardiometabolic risk factors. Nat. Genet. 51, 804–814 (2019).
Brumpton, B. et al. Withinfamily studies for mendelian randomization: avoiding dynastic, assortative mating, and population stratification biases. Nat. Commun. 11, 3519 (2020).
O’Connor, L. J. & Price, A. L. Distinguishing genetic correlation from causation across 52 diseases and complex traits. Nat. Genet. 50, 1728–1734 (2018).
Brown, B. C. & Knowles, D. A. Phenomescale causal network discovery with bidirectional mediated mendelian randomization. bioRxiv https://www.biorxiv.org/content/10.1101/2020.06.18.160176v2.full (2020).
Howey, R., Shin, S.Y., Relton, C., Smith, G. D. & Cordell, H. J. Bayesian network analysis incorporating genetic anchors complements conventional mendelian randomization approaches for exploratory analysis of causal relationships in complex data. PLoS Genet 16, e1008198 (2020).
BulikSullivan, B. K. et al. Ld score regression distinguishes confounding from polygenicity in genomewide association studies. Nat. Genet. 47, 291–295 (2015).
Jordan, D. M., Verbanck, M. & Do, R. HOPS: a quantitative score reveals pervasive horizontal pleiotropy in human genetic variation is driven by extreme polygenicity of human traits and diseases. Genome Biol. 20, 222 (2019).
BulikSullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Heideman, M. T., Johnson, D. H. & Burrus, C. S. Gauss and the history of the fast fourier transform. Arch. Hist. Exact. Sci. 34, 265–277 (1985).
Rüeger, S., McDaid, A. & Kutalik, Z. Evaluation and application of summary statistic imputation to discover new heightassociated loci. PLoS Genet. 14, e1007371 (2018).
1000 Genomes Project Consortium. A map of human genome variation from populationscale sequencing. Nature 467, 1061–1073 (2010).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2019).
Clogg, C. C., Petkova, E. & Haritou, A. Statistical methods for comparing regression coefficients between models. Am. J. Sociol. 100, 1261–1293 (1995).
Hemani, G. et al. The MRBase platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Neale Lab. UK BioBank. http://www.nealelab.is/ukbiobank/ (2018).
van der Harst, P. & Verweij, N. Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease. Circ. Res. 122, 433–443 (2018).
Darrous, L. Simultaneous estimation of bidirectional causal effects and heritable confounding from GWAS summary statistics. Software https://doi.org/10.5281/zenodo.5534639 (2021).
Acknowledgements
This research has been conducted using the UK Biobank Resource under Application Number 16389. L.D. scores were calculated based on the UK10K data resource (EGAD00001000740, EGAD00001000741). Z.K. was funded by the Swiss National Science Foundation (31003A_169929, 310030_189147 and 32003B_173092). For computations we used the CHUV HPC cluster. We would like to thank Jack Bowden, Valentin Rousson, Matthew Robinson, George Davey Smith, Thomas Richardson, Eleonora Porcu and Sven Erik Ojavee for their valuable feedback and comments on this manuscript.
Author information
Authors and Affiliations
Contributions
Z.K. devised and directed the project. Z.K., N.M. and L.D. contributed to the mathematical derivations, design and implementation of the research, the analysis of the results and to the writing of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Darrous, L., Mounier, N. & Kutalik, Z. Simultaneous estimation of bidirectional causal effects and heritable confounding from GWAS summary statistics. Nat Commun 12, 7274 (2021). https://doi.org/10.1038/s4146702126970w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s4146702126970w
This article is cited by

Partner choice, confounding and trait convergence all contribute to phenotypic partner similarity
Nature Human Behaviour (2023)

Shared genetic architecture between attentiondeficit/hyperactivity disorder and lifespan
Neuropsychopharmacology (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.