A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets

Di Scipio, Matteo; Khan, Mohammad; Mao, Shihong; Chong, Michael; Judge, Conor; Pathan, Nazia; Perrot, Nicolas; Nelson, Walter; Lali, Ricky; Di, Shuang; Morton, Robert; Petch, Jeremy; Paré, Guillaume

doi:10.1038/s41467-023-40913-7

Download PDF

Article
Open access
Published: 25 August 2023

A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets

Nature Communications volume 14, Article number: 5196 (2023) Cite this article

4544 Accesses
5 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Identification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (N = 297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 – 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data.

Quantification of the overall contribution of gene-environment interaction for obesity-related traits

Article Open access 13 March 2020

An approach to identify gene-environment interactions and reveal new biological insight in complex traits

Article Open access 22 April 2024

Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores

Article 18 September 2023

Introduction

Identifying gene-by-environment interactions (GxE) is difficult because individual interaction effects are expected to be small¹, the multiple hypothesis burden is considerable^2,3, and the sample sizes needed are correspondingly large (N > 300,000)⁴. Many previous analyses have focused on identifying interactions with variants marginally associated with a phenotype of interest^5,6. Hitherto, methods developed to estimate the overall effect of these interactions rely on variance component methods, due to the predictor (m) > observation (n) problem, where SNPs (m) vastly outnumber the participants (n)^7,8. These methods are advantageous for smaller datasets; however, they can be limiting when applied to larger datasets due to computational burden⁷. Furthermore, variance component methods depend on strong assumptions about the underlying genetic model and often require a priori specification of parameters and/or hyper-parameters, such as polygenicity, minor allele frequency (MAF), and linkage disequilibrium (LD) dependence^{9,10,11,12,13}. While never formally tested in the context of GxE, it has previously been shown that these assumptions can lead to important biases in heritability estimates^{9,10,11,14,15,16}. Novel methods are thus needed to enable fast and unbiased calculations of the variance explained (R²) by GxE in large samples, on multiple traits and without the need for genetic model assumptions.

Our proposed method is similar to the generalized random effects (GRE) model¹⁷, building on the observation that the multiple regression coefficient of determination can be used to accurately estimate heritability¹⁷. Extending this observation to include an environmental exposure variable and computing the interactions between genotypes and the environmental exposure allows us to examine the variance explained by genetic interactions with an environmental exposure. However, the large number of single nucleotide polymorphisms (SNPs) (m) compared to participants (n) presents a challenge for genome-wide analysis¹⁸. By partitioning the genome into non-overlapping regions, it becomes possible to estimate genome-wide interactions with environmental exposures by reducing m within each region to a size where m < n. Some challenges remain: First, LD spillage at the junction of blocks can theoretically inflate heritability estimates if many such junctions exist⁹. Second, any residual population stratification effects would be amplified if heritability at each region is overestimated and this effect is expected to be proportional to the number of blocks¹⁹. Third, computing prediction R² on large blocks with high dimensionality can be slow. By using the conjugate gradient method²⁰ with graphics processing unit (GPU) acceleration²¹, it is possible to perform multiple linear regression modelling efficiently on large (25,000 SNPs) blocks. Thus, the potential for residual population stratification effects and LD spills are minimized since only 60 blocks or less are needed for genome-wide analyses and the variants included are LD-pruned. Furthermore, a block size of no more than 25,000 SNPs also ensures that n > 10 m for accurate estimations.

In this work we propose MonsterLM, a method to estimate the proportion of variance explained by GxE, in a fast, accurate, efficient, and unbiased manner on biobank-scale datasets (N > 300,000). We hypothesize that GxE interactions contribute significantly to complex trait variance. Our objective is to quantify and characterize these contributions for 13 complex traits using four environmental exposures (waist-to-hip ratio [WHR], smoking, an exercise parameter, and a randomly generated exposure). We illustrate an overview of our computational analyses in Fig. 1.

**Fig. 1: Summary of gene-by-environment (GxE) analysis conducted with MonsterLM.**

Results

Validation of MonsterLM using simulations

We conducted ten genome-wide simulations for each of the 12 scenarios (Fig. 2). The true heritability (${{R}^{2}}_{{GW}}$) was set to 0.20 and the true interaction variance (${{R}^{2}}_{{GWEI}}$) was set to 0.1 or 0.0. MonsterLM accurately and precisely estimated the true ${{R}^{2}}_{{GWEI}}$ across all 12 scenarios (Fig. 2). Under the null scenarios of no GxE, the estimated ${{R}^{2}}_{{GWEI}}$ was not different from zero (p > 0.05) in all ten simulations. Furthermore, observed precision estimates (i.e. variance of estimates across the 12 simulations) did not significantly differ from the precision estimates predicted from Eqs. (8)–(11) (Supplementary Table 1).

**Fig. 2: Estimation of variance explained by GxE for 12 simulated scenarios.**

MonsterLM accurately detected G and GxE null and non-null effects when ${{R}^{2}}_{{GWEI}}$ was set to 0.0 or 0.1, with ${{R}^{2}}_{{GW}}$ fixed at 0.20 (Fig. 2; scenario 1–2). These results remained when a true causal effect of ${E}_{{sim}}$ on ${Y}_{{sim}}$ (${{R}^{2}}_{{E\; on\; Y}}$ = 0.1) was simulated (Fig. 2; scenario 3). MonsterLM also remained unbiased to varying distributions of GxE effects, where non-zero GxE effects (i.e. ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ ≠ 0) were sampled from exponential and beta (positive kurtosis) distributions (Fig. 2; scenario 6–7). Accurate GxE estimations were observed in the four scenarios including collider biases (Fig. 2; scenario 9–12). Accurate G estimations were observed in all collider scenarios except when exposure heritability was heterogeneous (Fig. 2; scenario 9–12).

To assess the robustness of MonsterLM to LD, SNPs with non-zero GxE effects were exclusively selected from SNPs in the highest quartile of LDscore or from SNPs in the highest quartile of LDscore and lowest quartile of MAF (Fig. 2; scenario 4–5). No significant bias in G or GxE was observed. We further stratified SNPs into 20 bins based on MAF and LD, and individually tested each stratum for G and GxE effects. Each stratum provided consistent estimates (after adjusting for the number of SNPs), further confirming the robustness of MonsterLM to MAF and LD (Supplementary Table 2).

Simulation estimates remained unbiased with dichotomous exposures (Fig. 2; scenario 7) and dichotomous outcomes (Supplementary Table 3) when applying MonsterLM with the modifications for dichotomous variables outlined in the methods.

Lastly, the performance of MonsterLM was tested in simulations where missing exposures or outcomes were mean imputed. Using the settings of scenario 2, the GxE estimate was biased towards the null when 20% of the exposure or 20% of the outcome in randomly chosen individuals were mean imputed (Supplementary Table 4). However, if 20% of the exposure and 20% of the outcome were missing in the same individuals and subsequently mean imputed, then the GxE estimates were inflated (Supplementary Table 4). Hence, all analyses using real data were performed on participants with no missing data.

MonsterLM power was evaluated with simulations for ${{R}^{2}}_{{GWEI}}$ varying from 0.005 to 0.50 and sample sizes ranging from 50,000 to 400,000 participants (Supplementary Fig. 1). MonsterLM reliably detects G and GxE effects with a minimum of 80% power when N > 100,000 participants and the true ${R}^{2}$ > 0.05. At a biobank sample size of 325,000, MonsterLM is well powered to detect true ${R}^{2}$ > 0.01.

Genome-wide interaction and heritability estimation in the UK Biobank

We applied MonsterLM to estimate the GxE variance between three environmental exposures (WHR, days of at least 10 minutes moderate physical activity status [M10], and smoking status) and ten cardiometabolic blood biomarkers (Apolipoprotein A, Apolipoprotein B, Total Cholesterol, CRP, Glucose, HbA1c, HDL-Cholesterol, LDL-Cholesterol, Triglycerides, Total Bilirubin), two cardiometabolic diseases (Coronary Artery Disease [CAD] and Type 2 Diabetes [T2D]), and height. Of the 13 outcomes, we observed significant GxE with WHR for 8 outcomes (${R}_{{GWEI}}^{2}$ ranging from 0.009 to 0.071), significant GxE with M10 for 7 outcomes (${R}_{{GWEI}}^{2}$ ranging from 0.010 to 0.045), and no significant GxE with smoking for any outcomes (Fig. 3a; Tables 1 and 2). The strongest GxE with WHR was observed for CRP (${R}_{{GWEI}}^{2}$ = 0.071) and strongest GxE with M10 was observed for LDL-Cholesterol (${R}_{{GWEI}}^{2}$ = 0.045). For some outcomes, interactions explained a substantial fraction of variance relative to heritability. For example, GxE with WHR explained 33% and 27% as much variance in Triglycerides and CRP as its estimated heritability, respectively. Generally, GxE with M10 results displayed consistent albeit attenuated ${R}_{{GWEI}}^{2}$ compared with GxE with WHR (Table 1). No significant GxE variance was observed for dichotomous outcomes CAD and T2D (Table 2) nor for randomly permuted exposures for all outcomes (Tables 1 and 2).

**Fig. 3: Estimates of genetic, interaction, and environment (WHR) R².**

Table 1 Real data estimates for continuous outcomes with MonsterLM

Full size table

Table 2 Real data estimates for dichotomous outcomes with MonsterLM

Full size table

Outcome heritability estimates for all 13 traits were significant and largely consistent with published estimates and other methods (BOLT, mtg2, and GRE; Tables 2 and 3). For two dichotomous outcomes, CAD and T2D (Table 2), genetic variance was estimated at 0.181 and 0.659, respectively, on the liability scale, which is consistent with the reported heritability of these diseases in literature^22,23,24,25. As MonsterLM adjusts outcomes for each specific exposure tested and this could potentially impact heritability estimates (which do not necessarily require such exposure adjustments) the analysis was repeated without adjustment, with consistent results (Supplementary Table 5).

Table 3 Benchmarking MonsterLM additive genetic variance (heritability, ${h}_{G}^{2}$) and GxE (GxE, ${{R}^{2}}_{{G{{{{{\rm{x}}}}}}E}_{{WHR}}}$)

Full size table

Follow-up analyses for GxE with WHR were performed to further observe components of the method. We observed significant directionality for interaction effects at both univariate marginal and interaction association p-values, ${P}_{G}$ and ${P}_{{GE}}$, for multiple p-value thresholds (<10⁻³, <10⁻², <10⁻¹, and <1; Fig. 3b; Supplementary Figure 2). That is, signs of ${\hat{\beta }}_{G}$ and ${\hat{\beta }}_{{GE}}$ were the same (+/+ or −/−) more often than expected in the null condition (>50%) for most outcomes when sorted by ${P}_{G}$ and ${P}_{{GE}}$. Consistent with the directionality concordance for each outcome at ${P}_{G}$ < 1 and ${P}_{{GE}}$ < 1, Pearson correlation coefficients of estimated genetic regression coefficients for each outcome, ${\hat{\beta }}_{1{x\; m}}$ (m is the number of SNPs: 1,030,579), were significant for all outcomes in Fig. 3b for ${\hat{\beta }}_{G}$ and ${\hat{\beta }}_{{GE}}$ (Supplementary Table 6). When extending the Pearson correlation tests to estimated genetic regression coefficients from WHR heritability (WHR heritability; Supplementary Table 5), ${\hat{\beta }}_{{h}_{{WHR}}^{2}}$, neither ${\hat{\beta }}_{G}$ or ${\hat{\beta }}_{{GE}}$ were significantly correlated with ${\hat{\beta }}_{{h}_{{WHR}}^{2}}$ for almost all outcomes (Supplementary Table 6). To assess the uniformity of the contribution of SNPs to both G and GxE, we stratified SNPs into twenty categories based on MAF and LDscore (Supplementary Table 2). The average SNP contribution to G and GxE did not markedly differ by MAF or LDscore, confirming the absence of large differences in contribution (Supplementary Fig. 3).

Comparison with other methods

Heritability estimates were largely consistent between MonsterLM, BOLT, mtg2, and GRE (Table 3). One notable exception was for total bilirubin, for which heritability was overestimated by GRE (${{R}^{2}}_{{GW}}$ > 0.99) relative to MonsterLM (${{R}^{2}}_{{GW}}$ = 0.40), BOLT (${{R}^{2}}_{{GW}}$ = 0.37) and mtg2 (${{R}^{2}}_{{GW}}$ = 0.43). mtg2 and LDSC heritability estimates were lower compared to MonsterLM, Bolt, and GRE for all compared outcomes except total bilirubin (with mtg2).

MonsterLM GxE estimates with WHR were compared to mtg2 and LDSC (Table 3). The mtg2 analysis was limited to 75,000 individuals due to computational constraints. GxE estimates were consistent between MonsterLM and mtg2 for cholesterol, height, and total bilirubin. However, glucose and HbA1c had considerably higher GxE estimates with mtg2 compared to MonsterLM (0.210 and 0.139 in mtg2, versus 0.00475 and 0.00884 in MonsterLM). MonsterLM heritability estimates of glucose and HbA1c were more consistent with Bolt and GRE versus mtg2 (0.105 and 0.289 in MonsterLM, versus 0.045 and 0.129 in mtg2). The LDSC GxE analysis used the full participant list, SNP set, and phenotypic data as in MonsterLM. GxE estimates were lower than MonsterLM for all outcomes.

The comparison between MonsterLM and mtg2 was further extended to simulated outcomes and exposures, with ten simulations split between true set ${{R}^{2}}_{{GWEI}}=0.10$ and ${{R}^{2}}_{{GWEI}}=0$ (Supplementary Table 7). MonsterLM accurately estimated the true interaction variance explained in all ten simulations. mtg2 was accurate in most simulations but overestimated the set interaction variance in three instances (i.e. estimate of 0.125 versus true interaction variance of 0.1 or 0.0).

Recovery of interaction and heritability variance according to SNP marginal and interaction effects

The presence of significant GxE with WHR prompted additional questions. First, do GxE interactions arise from SNPs strongly associated with the outcome of interest, as has been commonly assumed, or are the variants contributing to GxE interactions independent from those with marginal effects? To address this question, we randomly split participants into a discovery set comprising 80% of participants (260,768 individuals) with the remaining 20% of participants (65,222 individuals) comprising the validation set. Using the eight outcomes with significant GxE interaction variance (no further analyses were conducted with outcomes having non-significant GxE interaction variance), we conducted univariate linear regression on the discovery set using each outcome and a single SNP as the predictor variable, repeating this process for all SNPs (i.e. 1,030,579 SNPs) used in the study and calculating ${P}_{G}$, the association p-value. We then selected SNPs according to six association ${P}_{G}$ thresholds: <1 (i.e. all SNPs), <10⁻¹, <10⁻², <10⁻³, <10⁻⁴, <10⁻⁵. Likewise, we tested each SNP individually for interaction with WHR in the discovery set. We selected interactions based on six discovery ${P}_{{GE}}$ thresholds: <1 (i.e. all SNPs), <10⁻¹, <10⁻², <10⁻³, <10⁻⁴, <10⁻⁵. Each SNP set was then tested for variance explained with the corresponding outcome in the validation set, using the least number of blocks possible while keeping n > 10 m. We evaluated ${{R}^{2}}_{{GW}}$ and ${{R}^{2}}_{{GWEI}}$ for each of the eight significant outcomes at each of the six association ${P}_{G}$ or ${P}_{{GE}}$ thresholds in the SNP validation sets. ${{R}^{2}}_{{GW}}$ and ${{R}^{2}}_{{GWEI}}$ were then compared to the variance explained when including all SNPs (i.e. ${P}_{G}$ < 1 or ${P}_{{GE}}$ < 1) in the validation set. We estimated the proportion of ${{R}^{2}}_{{GW}}$ or ${{R}^{2}}_{{GWEI}}$ in the validation set (${{R}^{2}}_{G-{val}.}$ and ${{R}^{2}}_{{GE}-{val}.}$) recovered when including an increasing proportion of SNPs in the analysis (Fig. 4; Supplementary Figs. 4 and 5).

**Fig. 4: Proportion of ${R}_{G-val.}^{2}$ and ${R}_{GE-val.}^{2}$ as a function of P_G and P_GE.**

We observed that between 42–67% of the original ${{R}^{2}}_{G-{val}.}$ calculated in the validation set could be recovered with strongly associated marginal SNPs, defined as ${P}_{G}$ < 10⁻⁵ in the discovery set. When extending to more weakly associated SNPs (${P}_{G}$ < 10⁻¹), we observed that between 72–89% of ${{R}^{2}}_{G-{val}.}$ was recovered. Additionally, ${{R}^{2}}_{G-{val}.}$ recovery when calculating from SNPs with strong or weak interactions in the discovery sample (${P}_{{GE}}$ < 10⁻⁵ and ${P}_{{GE}}$ < 10⁻¹, respectively) was consistently lower as compared to their respective ${P}_{G}$ thresholds (Fig. 4a).

We then similarly estimated the proportion of interaction variance (${{R}^{2}}_{{GWEI}}$) recovered when including an increasing proportion of SNPs, based on discovery ${P}_{G}$ or ${P}_{{GE}}$ (Fig. 4b; Supplementary Figs. 4 and 5). We observed that between 1–2% of the original ${{R}^{2}}_{{GE}-{val}.}$ calculated in the validation set was recovered by SNPs with strong interactions in the discovery set (${P}_{{GE}}$ < 10⁻⁵). Conversely, 3–28% of the original ${{R}^{2}}_{{GE}-{val}.}$ was recovered by SNPs with strong marginal associations (${P}_{G}$ < 10⁻⁵). When extending to more weakly associated SNPs, between 15–84% of ${{R}^{2}}_{{GE}-{val}.}$ was recovered by SNPs with weak marginal associations (${P}_{G}$ < 10⁻¹); and between 30–84% of ${{R}^{2}}_{{GE}-{val}.}$ was recovered by SNPs with weak interactions (${P}_{{GE}}$ < 10⁻¹).

Polygenic scores analysis

Finally, we examined if the predictiveness of polygenic score of all eight outcomes with significant WHR interaction variance could be improved by incorporating interactions. To select SNPs and interaction effects to be included in each PS, we used both ${P}_{G}$ $a{{{{{\rm{nd}}}}}}$ ${P}_{{GE}}$ thresholds of 10⁻², 10⁻³, 10⁻⁴, and 10⁻⁵ in the discovery set when testing either each SNP individually or both a single SNP and corresponding interaction, respectively. Each PS was then tested in the validation sample for association with its corresponding outcome, with twenty total ${P}_{G}$ $a{{{{{\rm{nd}}}}}}$ ${P}_{{GE}}$ combinations. PS prediction ${R}^{2}$ was slightly improved (p < 0.05 for improvement) by incorporation of interaction effects for the outcome with the highest interaction variance, CRP (Supplementary Fig. 6), with the relative increase in prediction ${R}^{2}$ ranging from 0% to 1.3% across the outcomes analyzed (for interaction significance thresholds of 10⁻⁴, 10⁻⁵; Supplementary Data 1). The largest increases in polygenic score predictiveness tended to occur in outcomes with the largest GxE variance observed (Fig. 3; Supplementary Fig. 6).

Discussion

In this report, we developed a method, MonsterLM, to estimate variance explained by genome-wide interactions with environmental exposures. Using simulations, we verified that MonsterLM estimates the variance explained by interaction effects accurately and precisely. Analysis of UK Biobank data demonstrated the presence of significant GxE effects with WHR, a marker of metabolically deleterious adiposity. The interaction estimates for 8 of the 13 outcomes analysed were significant, ranging from 0.01 to 0.07 of overall variance, prompting further analyses into these results. The presence of significant GxE was further supported by the recovery of GxE with an exercise exposure, M10. MonsterLM was also successfully applied to dichotomous outcomes and exposures through simulations and real data. Together, real and simulated data analyses demonstrate the robustness of MonsterLM against biases such as collider effects or LD, and validates its utility as a versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets.

In benchmarking analyses, MonsterLM heritability estimates were consistent with alternative state-of-the-art methods (Table 3). When comparing MonsterLM GxE estimates to those of mtg2, MonsterLM tends to be conservative but provides accurate and consistent estimations across simulations and plausible estimates in real data. While mtg2 is also accurate and precise in most instances, it can be limited by computational burden, consistency in simulation estimates, and plausibility for some real data estimates. When comparing MonsterLM GxE estimates to LDSC GxE, MonsterLM estimated 8 of 11 outcomes to be non-null and LDSC GxE estimated 7 of 11 outcomes to be non-null. LDSC GxE estimates ranged from 0–1.8% and were lower than MonsterLM for each outcome (as was consistent with LDSC heritability estimates versus MonsterLM, Bolt, and GRE). Some LDSC GxE advantages include the fast computational speed of summary-level statistics compared to individual-level data and robustness to stratification and common environmental effects. However, the potential for LDSC underestimation is a discussed limitation in the literature. For example, Evans et al., 2018¹² conducted a heritability model comparison study where they showed a limitation of LD score regression was its potential to underestimate h² if the trait is not highly polygenic (such as in the case of total bilirubin; Table 3). Furthermore, consistently smaller LDSC h² estimates have been shown when compared to GREML in the same data set²⁶ and as the lowest estimate in a recent protocols study compared to 10 other approaches (including GREML, LDAK, threshold GRMs, and SumHer)²⁷.

When extending the comparison to the state-of-the-art, MonsterLM provides distinct advantages over current methods for GxE analysis (Table 4)^{28,29,30,31,32,33,34,35,36}. Several key advantages compared to other methods include: (i) computational efficiency (with two options in CPULS and GPULS) with biobank-scale individual-level data, (ii) no model specification beyond using additive genetic coding, (iii) extensibility to dichotomous exposures and outcomes, and (iv) and demonstrated genome-wide robustness. In many settings, inference methods for genome-wide SNP-heritability and GxE make assumptions about genetic architecture. These assumptions are parametrized by polygenicity (the number of variants with effects) and MAF/LD-dependence (the coupling of effects with MAF, LD or other functional annotations). Since the true genetic architecture of any given trait is unknown, existing heritability methods may yield vastly different estimates even when applied to the same data^10,11,12. This is also the case for the estimation of genome-wide environment interactions, where different assumptions about the structure of interactions result in a variety of different estimates^{29,30,31,32,33}. Although multi-component methods that stratify SNPs by LD/MAF can address these robustness issues, fitting multiple variance components to biobank-scale data is highly resource intensive³⁷, and this problem is compounded when considering interactions where the number of variables analyzed increases two-fold. Alternate methods that explicitly model these dependencies are also sensitive to model misspecification^{9,10,11,12,13}. Conversely, MonsterLM assumes an additive genetic model and does not apply further parametrization for underlying assumptions.

Table 4 Comparison of current methods estimating gene-by-environment contributions to MonsterLM

Full size table

Significant GxE with WHR was observed for 8 of the 13 outcomes studied. Interaction effects with WHR ranged from 0.009 to 0.071, and in all cases were of smaller magnitude than their heritability counterparts. These results have important implications for future research. First, our observations suggest that GxE can contribute significantly to complex trait variance. Second, genetic associations are likely to be heterogenous when comparing populations with dramatically different obesogenic environmental exposures. The observation that a majority of GxE effects do not come from SNPs with strong marginal effects suggests this may not impact top GWAS hits excessively. We also observed the presence of significant directionality effects for marginal effect SNPs and their associated interaction effects, which suggest an overall greater impact of genetic variation under certain environmental conditions. There are also potential clinical implications for these observations. For instance, CRP reflects low-grade inflammation and is strongly associated with risk of CVD³⁸. Our results suggest that genetic determinants of low-grade inflammation are dependent on adiposity distribution (WHR) and further research will be needed to understand the implications for CVD risk.

Our results also provide some further insights into why identification of GxE has been challenging. Many prior studies have reasonably focused the search for GxE on variants with genome-wide significant marginal effects. Our results show that a majority of GxE effects are due to variants with unremarkable marginal effects (i.e., only 3–28% of GxE variance recovered by SNPs with strong marginal effects), although variants with strong marginal effects remain preferred candidates for GxE interactions. We also show in a proof-of-concept analysis that incorporation of GxE can improve PS prediction, albeit very modestly.

Some limitations are worth mentioning. First, we quantile normalize all traits before analysis, and while this protects against potential scaling effects and is robust to nonnormal-distribution types, it could also bias results towards the null³⁹. Second, in the event of collider effects with a covariate that is heritable through additive and heterogenous elements there could be some inflation of heritability estimates (Fig. 2; scenario 12). However, the conditions for this scenario are presumed to be quite extreme and did not affect GxE estimates. Furthermore, GxE estimates have been shown to be stable when modelling collider biases whereas genetic estimates are less well-controlled⁴⁰. Third, information may be lost through LD pruning and from filtering rare and low frequency variants (MAF < 5%). Fourth, MonsterLM is susceptible to overestimating GxE variance when participant phenotypic and exposure missingness co-occurs in the same individuals. Fifth, the liability scale transformation for dichotomous outcomes could bias GxE estimates under specific conditions such as violation of the normality assumption needed for the Robertson transformation (i.e. can occur in really large interaction estimates), if substantial non-additive effects exists³⁴, or biases towards the null due to information loss in the transformation.

In this report, we have developed a robust and well-controlled method for genome-wide GxE estimation. We established the presence of GxE in cardiometabolic traits. We observed that SNPs with weak marginal and interaction effects contribute to the majority of GxE variance. MonsterLM makes minimal assumptions about genetic architecture and is well-powered for both continuous and dichotomous outcomes. It is computationally efficient, robust, and versatile and can be used as the basis for future analyses of genome-wide environment interactions.

Methods

UK Biobank

The UK Biobank is a large population-based study which includes over 500,000 participants living in the United Kingdom^41,42 (https://www.ukbiobank.ac.uk/). Men and women aged 40–69 years were recruited between 2006 and 2010, and extensive phenotypic and genotypic data were collected. Quality control of genotype data was applied for individual and SNP inclusion using PLINK version 1.9 (https://www.cog-genomics.org/plink2/). We selected 325,989 unrelated British individuals (the largest unrelated cohort; 54% female and 46% male) from the UK Biobank with both genotype and trait data for inclusion in the analysis. An unrelated set of individuals were chosen to reduce genomic prediction inaccuracies⁴³. Individual exclusion criteria included: (1) non-white British ancestry, (2) high ancestry-specific heterozygosity, (3) high genotype missingness (>0.05), (3) mismatching genetic ancestry, (4) sex chromosome aneuploidy, (5) mismatching gender sex and genetic sex, and (6) consent withdrawal at the time of analysis. Variants from the release version 3 of the UK Biobank data were used, which included those present in the Haplotype Reference Consortium and 1000 Genomes panels with imputation quality > 0.7 and had no deviation from Hardy-Weinberg equilibrium (P > 1 × 10⁻¹⁰)⁴². Our study focussed only on common variants; thus, genotypes were filtered by removing highly correlated SNPs with an LD r² > 0.9 and removing SNPs with a MAF < 0.05. SNP exclusion criteria included: (1) SNPs with low imputation quality (INFO score ≤0.30), (2) call rate <0.95, and (3) ambiguous or duplicated SNPs. After all quality control (QC) filters, 1,030,579 SNPs and 325,989 individuals remained. Genetic variants were partitioned to minimize the number of blocks on each chromosome, with each block having a maximum SNP count of 25,000. Genotypes were standardized to have a mean of zero and standard deviation of one. We examined eleven continuous traits and two (dichotomous) cardiometabolic outcomes including: Apolipoprotein A, Apolipoprotein B, Total Cholesterol, C-reactive protein (CRP), Glucose, HbA1c, HDL-Cholesterol, LDL-Cholesterol, Triglycerides, Total Bilirubin, height, coronary artery disease (CAD) and Type 2 Diabetes (T2D). WHR, an exercise parameter, smoking status, and a randomly generated exposure were used as environmental exposures (E) to quantify GxE interactions.

For follow-up analyses, and to avoid the potential for overfitting from sample overlap⁴⁴, we randomly partitioned the UK Biobank participants into two sets: a discovery set containing 80% of the participants used for model building and a validation set containing the remaining 20% of the participants.

MonsterLM estimations of variance explained by GxE effects

MonsterLM estimates heritability (G) and GxE effects in three steps outlined in Fig. 5. This can be performed using three variable-type combinations: continuous exposures and outcomes, dichotomous exposures and continuous outcomes, and continuous exposures and dichotomous outcomes. Slightly different configurations of MonsterLM are used depending on the variable-type combination. Step one processes exposure, outcome, interaction, and genotype data; step two calculates the coefficients of determination (R²); and step three calculates the estimated G and GxE with confidence intervals.

**Fig. 5: The MonsterLM method split into three steps for continuous outcomes and exposures.**

Step one: genotype and phenotype input and quality control

The standard linear model for an outcome, $Y$, when an interaction term is included can be expressed as:

$$Y={\beta }_{G}G+{\beta }_{E}E+{\beta }_{{GE}}{GE}+\epsilon$$

(1)

Where $G$ is the standardized genotype matrix, $E$ is the quantile normalized environmental exposure, ${GE}$ is the product between each genotype matrix and environmental exposure, resulting in a matrix with the same dimensionality as $G$. $G$ is coded in the additive model ({0,1,2}) and standardized so that the mean = 0 and standard deviation = 1 for each SNP. ${GE}$ is the quantile normalized product of $G$ and $E$. The betas (${{{{{\rm{\beta }}}}}}$) represent the true marginal effects associated with their respective term and $\epsilon$ represents the random error term. In practice, participants are selected to avoid missing data in Y and E. Both Y and E are first quantile normalized, then regressed for age, sex, and population stratification (first twenty genetic principal components), and quantile normalized once more. Two more transformations are applied to Y: (i) regression of the processed E then quantile normalization; (ii) and a heteroscedasticity adjustment for continuous outcomes. The aforementioned processing assumes continuous Y and E variables.

Step two: calculating the coefficients of determination

We denote the matrix of $G$ or ${GE}$ as $U$ with dimensions n×m, where n is number of participants and m is number of SNPs:

$$U=G,\, {or}\,U={GE}$$

(2)

As the environmental exposure is residualized from Y, we can leave $E$ out of the model.

The standard linear model becomes:

$$Y={{{{{{\rm{\beta }}}}}}}_{U}U$$

(3)

Given $Y$, the least squares estimate for ${\hat{\beta }}_{U}$ is:

$${\hat{\beta }}_{U}={\left({U}^{T}U\right)}^{-1}{U}^{T}Y$$

(4)

After computing ${\hat{\beta }}_{U}$, the predicted values of Y denoted as $\hat{y}$, are given by:

$$\hat{y}={\hat{\beta }}_{U}U$$

(5)

MonsterLM enables multiple linear regression on biobank-scale datasets by parallelizing the calculation of least squares regression. The calculation is done such that the only practical limitation is the speed of the inversion of the ${U}^{T}U$ matrix, without any restriction on n. This limitation is circumvented using the conjugate gradient method and GPU acceleration (henceforth referred as GPULS; Supplementary Table 8²¹). If users are constrained by GPU hardware but have adequate RAM allocation (>200 GB RAM) then a CPU-least squares method (henceforth referred as CPULS) can be used to compute traditional least squares in parallel to estimate block-wise ${R}^{2}$ (Fig. 5; Step 2). MonsterLM assumes an additive genetic model but does not make further assumptions regarding genetic architecture (such as polygenicity of effects, MAF and LD). Genotypes are partitioned into blocks with a maximal size of 25,000 SNPs (m) to minimize LD spillage between blocks and to optimize speed of the matrix calculation.

Step three: calculating total genetic and interaction estimates and confidence intervals

Once $\hat{y}$ is calculated for each block i using either $G$ or ${GE}$, both ${R}^{2}$ and adjusted ${R}^{2}$ can be derived for additive genetic effects and interaction effects, respectively. The total genome-wide contribution of additive genetic effects (${{R}^{2}}_{{GW}}$) and GxE interaction effects (${{R}^{2}}_{{GWEI}}$) is given by summing adjusted ${R}^{2}$ over all blocks:

$${{R}^{2}}_{{GW}}=(1-{{R}^{2}}_{E,Y})\mathop{\sum }_{i=1}^{j}{{R}^{2}}_{{G}_{i}}$$

(6)

$${{R}^{2}}_{{GWEI}}=(1-{{R}^{2}}_{E,Y})\mathop{\sum }_{i=1}^{j}{{R}^{2}}_{{{GE}}_{i}}$$

(7)

Where j is the number of blocks used per analysis (i.e. 60 blocks for current analyses) and ${{1-R}^{2}}_{E,Y}$ is an adjustment to account for the fact that Y is residualized for E. ${{R}^{2}}_{E,Y}$ is the coefficient of determination of Y and E.

${{R}^{2}}_{{U}_{i}}$ is the adjusted ${R}^{2}$ per block for G or GE, with n the sample size and m the number of predictors. The 95% confidence (CI) of ${{R}^{2}}_{{U}_{i}}$ can be estimated for each block by first calculating the variance of the squared multiple correlation coefficient using Kendall and Stuart’s method of variance estimation⁴⁵ available as “Variance.R2” in the MBESS⁴⁶ R package where:

$${\widehat{{Va}{r}}}_{{{\infty }}}\left({{R}^{2}}_{{U}_{i}}\right)=\left(1-{{R}^{2}}_{E,Y}\right){\left(\frac{n-1}{n-m-1}\right)}^{2}\frac{\left(n-m-1\right)(n-m+1)}{({n}^{2}-1)}\,{HyperG}({{R}^{2}}_{{U}_{i}},n,m)$$

(8)

and ${HyperG}({{R}^{2}}_{{U}_{i}},n,m)$ is the asymptotic adjustment using the hypergeometric function discussed in section 2B of Stuart, Ord, and Arnold (1999)⁴⁵. The 95% CI for a single block, i = 1, can then be derived using the Wald estimate:

$$95\%\,{CI}={{R}^{2}}_{{U}_{i}}\pm 1.96\sqrt{\widehat{{Va}{r}_{{{{{{\rm{\infty }}}}}}}}\left({{R}^{2}}_{{U}_{i}}\right)}$$

(9)

To estimate the 95% CI for our genome-wide G or GxE estimate, ${{R}^{2}}_{{GWU}}$, we calculate the total asymptotic variance as the sum of the individual variances $({{R}^{2}}_{{U}_{i}})$ for j blocks:

$$\widehat{{Va}{r}_{{{{{{\rm{\infty }}}}}}}}\left({{R}^{2}}_{{GWU}}\right)=\mathop{\sum }_{i=1}^{j}\widehat{{Va}{r}_{{{{{{\rm{\infty }}}}}}}}({{R}^{2}}_{{U}_{i}})$$

(10)

where $\widehat{{Va}{r}_{{{{{{\rm{\infty }}}}}}}}\left({{R}^{2}}_{{U}_{i}}\right)$ is each i variance estimate from Eq. (8). For the total asymptotic variance estimated, we calculate the 95% CI of ${{R}^{2}}_{{GWU}}$ as:

$$95\%\,{CI}={{R}^{2}}_{{GWU}}\pm 1.96\sqrt{\widehat{{Va}{r}_{{{{{{\rm{\infty }}}}}}}}\left({{R}^{2}}_{{GWU}}\right)}$$

(11)

MonsterLM for dichotomous outcomes and exposures

Applying MonsterLM with dichotomous exposures and continuous outcomes uses the same algorithm as with continuous variables (Fig. 5) with a few key modifications. These include: (i) no exposure modification (Step 1: E), (ii) the continuous outcome is quantile normalized in each dichotomous group separately (referred to as “E stratified QN Y”) in Step 1, and (iii) standardizing ($\mu=0,\,\sigma=1$) the interaction (GxE) terms (Step 1).

MonsterLM can also be applied to dichotomous outcome variables and continuous exposures (Fig. 5) with the following modifications: (i) the continuous exposure in each dichotomous outcome group is quantile normalized separately (referred to as “Y stratified QN E”); (ii) standardizing ($\mu=0,\,\sigma=1$) the dichotomous outcomes (Step 1; Y), and (iii) and applying a liability scale transformation⁴⁷ on the total estimates (Step 3) (Fig. 5).

Validation of MonsterLM using simulations

MonsterLM was tested using simulations under a range of scenarios. In all simulations, “real” genotypes from 325,989 UK Biobank participants (as described below) were used and outcomes and exposures were simulated. Unless otherwise stated, outcomes and exposures were simulated assuming true (unobserved) effects (${{{{{{\rm{\beta }}}}}}}_{G},\,{{{{{{\rm{\beta }}}}}}}_{E},\, {{{{{{\rm{\beta }}}}}}}_{{GE}}$) following a standard normal distribution. 20% of SNPs were randomly selected to have a marginal effect on the simulated trait of interest, Y_sim (i.e. ${{{{{{\rm{\beta }}}}}}}_{G}$ ≠ 0). We further assumed that 2% of total SNPs have an interaction effect (i.e. ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ ≠ 0). The error ($\epsilon$) was sampled from an independent and identically distributed standard normal distribution. The simulated trait $({Y}_{{sim}})$ and simulated exposure (${E}_{{sim}}$) were then computed as:

$${Y}_{{sim}}={{{{{{\rm{\beta }}}}}}}_{G}G+{{{{{{\rm{\beta }}}}}}}_{E}{E}_{{sim}}+{{{{{{\rm{\beta }}}}}}}_{{GE}}G{E}_{{sim}}+\epsilon$$

(12)

We tested 12 scenario conditions through simulations (Fig. 1; top panel). Two model types, “base” and “collider” were considered. Firstly, base model simulations considered that E was not dependent on G (i.e. E is not heritable) and the genetic and interaction effects for all SNPs were randomly generated from a standard normal distribution. Secondly, we considered models including a collider bias. A collider bias can occur when two conditions are met: a controlled-for environmental exposure is both heritable and influenced through an unobserved confounder; and that same unobserved confounder influences the outcome (Fig. 1; top panel). As shown in Aschard et al. 2015⁴⁸ and Akimova et al., 2021⁴⁰, a collider bias can potentially lead to over-estimation of GxE variance and other variance components. Collider model simulations had a set correlation ${{{{{{\rm{R}}}}}}}^{2}$ between ${Y}_{{sim}}$ and ${E}_{{sim}}$ of 0.2 and assumed that the correlation was entirely due to the simulated unobserved confounder covariate (${{UC}}_{{sim}}$), an extreme collider model (Fig. 1; top panel). In these scenarios, ${E}_{{sim}}$ was simulated to have 20% of its variance explained by additive genetic effects (i.e. heritability of ${E}_{{sim}}$), as WHR was observed to have similar heritability empirically. In one collider scenario, genetic heterogeneity (which has been explored in the context of GxE models⁴⁹), explained 20% of the genetic variance of ${E}_{{sim}}$ (with the remaining genetic variance of ${E}_{{sim}}$ explained by additive genetic effects). Genetic heterogeneity was simulated by including a genetic interaction with a randomly generated binary variable. Genome-wide simulated variances were set at ${{R}^{2}}_{{GW}}$ = 0.20 (i.e. heritability of Y), and ${{R}^{2}}_{{GWEI}}$ = 0.1 or 0.0 (i.e. interaction variance of Y). In base model simulations, observed correlations between E and Y are due to the true set value (i.e. causal effect of E on Y). In collider simulations, there is no causal effect of E on Y or Y on E such that any observed correlation is entirely due to the collider bias.

For each simulated scenario, ten simulations were performed (Fig. 2; legend). Scenarios 1–8 use the base model format and scenarios 9–12 use the collider model format. The complete set of scenarios include: (i) null GxE effect (${{R}^{2}}_{{GWEI}}$ = 0.0); (ii) non-null GxE effect (${{R}^{2}}_{{GWEI}}$ = 0.1); (iii) non-null GxE effect with non-null exposure variance (${{R}^{2}}_{{E\; on\; Y}}$ = 0.1); (iv) non-null GxE effect where all ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ effects are sampled from SNPs in the highest LDscore³⁷ quartile; (v) non-null GxE effect where all ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ effects are sampled from SNPs in both the highest LDscore quartile and lowest MAF quartile; (vi-vii) non-null GxE effect when sampled ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ effects have exponential and beta distributions (positive kurtosis); (viii) non-null GxE effect with a dichotomous exposure; (ix) non-null GxE effects in a collider scenario; (x) non-null GxE effect in a collider scenario where ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ effect SNPs were not an element (completely non-overlapping) of ${{{{{{\rm{\beta }}}}}}}_{G}$ effect SNPs; (xi) null GxE effect in a collider scenario where ${{{{{{\rm{\beta }}}}}}}_{{GE}}$ effect SNPs are a strict subset (completely overlapping) of ${{{{{{\rm{\beta }}}}}}}_{G}$ effect SNPs; and (xii) null GxE effect in a collider scenario where ${E}_{{sim}}$ is heritable through additive and heterogenous genetic effects.

MonsterLM robustness was further investigated by testing the impact of mean imputation for both the ${E}_{{sim}}$ and ${Y}_{{sim}}$. We examined scenarios with mean imputation of ${E}_{{sim}}$ only, ${Y}_{{sim}}$ only, and ${E}_{{sim}}$ and ${Y}_{{sim}}$ within the same participants to further address any biases in model design. The previously described scenario 2 conditions were used for these latter simulations.

MonsterLM performance was also validated with dichotomous outcomes in scenario 2 conditions. Genome-wide simulated variances were set at ${{R}^{2}}_{{GW}}$ = 0.20 (i.e. heritability of Y), and ${{R}^{2}}_{{GWEI}}$ = 0.0 (i.e. interaction variance of Y) to assess for type I error.

Applying MonsterLM to UK Biobank traits and exposures

The MonsterLM method was applied to 13 outcomes from the UK Biobank in 325,989 unrelated British participants. The tested outcomes were clinically pertinent blood biomarkers, major diseases, and height (a well-studied heritable outcome). MonsterLM was applied as outlined in Fig. 5. Four different exposures were tested in total: WHR, days of at least 10 minutes moderate weekly physical activity status (M10), smoking status, and a randomly permuted exposure. WHR was selected as an environmental exposure because it is a measure of central obesity linked to a wide range of adverse metabolic consequences, including diabetes and cardiovascular disease (CVD)⁵⁰. M10 was selected due to its relevance as an obesogenic risk factor but minimal to negligible estimated additive genetic variance and correlation with outcomes (${{R}^{2}}_{G,E}$ = 0.02, ${{R}^{2}}_{E,Y}$ range from 0.00 to 0.01). This serves as a suitable control to reduce the possibility that any spurious collider effects or additive genetic effects would explain any of the interaction variance. Smoking status was chosen as a dichotomous exposure^51,52. Lastly, a permuted exposure was added as a negative control of no interaction.

Directionality of effects analysis

After computing ${{R}^{2}}_{{GW}}$ and ${{R}^{2}}_{{GWEI}}$ for our 13 outcomes, we tested whether the direction of effects was concordant between marginal and interaction regression coefficients for each SNP in the significant eight outcomes. Concordant direction of effects is defined as when ${\hat{\beta }}_{G}$ has the same sign (+/+, −/−) as ${\hat{\beta }}_{{GE}}$ for a single SNP and its associated interaction. Discordant direction of effects is defined as when the ${\hat{\beta }}_{G}$ and ${\hat{\beta }}_{{GE}}$ have a different sign (+/−, −/+) for a single SNP and its associated interaction. We used a subset of ${\hat{\beta }}_{G}$ and ${\hat{\beta }}_{{GE}}$ coefficients that were in low LD (r² < 0.1) and computed the direction of effect concordance for this subset. We then plotted the sign concordance twice: first as a function of univariate ${\hat{\beta }}_{G}$ p-values $({P}_{G})$, then as a function of univariate ${\hat{\beta }}_{{GE}}$ p-values $({P}_{{GE}})$, which were computed from association of single SNPs and their respective interaction on the outcomes. Two-proportion Z-tests were used to compare the proportion of directionally concordant marginal and interaction effects for each outcome in each threshold compared to a null proportion of 0.50.

Stratification of estimates by MAF and LD

SNPs were stratified by MAF and LDscore into a total of twenty bins: five MAF bins (0.05 ≤ 0.1, 0.1 < MAF ≤ 0.2, 0.2 < MAF ≤ 0.3, 0.3 < MAF ≤ 0.4, and 0.4 < MAF ≤ 0.5) and four LDscore quantiles (0 < LD ≤ 0.25, 0.25 < LD ≤ 0.50, 0.50 < LD ≤ 0.75, and 0.75 < LD ≤ 0.9). MAF and LDscore were calculated using a subset of 5000 participants from the UK Biobank. We then computed the variance explained (${{R}^{2}}_{{GW}},\,{{R}^{2}}_{{GWEI}}$) and divided each estimate by the total number of SNPs in each bin to get an ${R}^{2}$ per SNP value that was compared between bins and to the total genetic and interaction variance estimates.

Polygenic scores analysis

We calculated polygenic scores (PS) without interactions $({{PS}}_{G})$ for outcomes with statistically significant GxE variance. We first selected SNPs based on the univariate association p-value from regression of each variant with outcomes from the discovery set (randomly chosen 80% of UK Biobank sample). We then combined the selected SNPs into a single block from the discovery set and applied MonsterLM regression to obtain the multiple linear regression coefficients ($\hat{\beta }$_G). Using these coefficients, we calculated the PS_G in the validation set as:

$${{PS}}_{G,i}=\mathop{\sum }_{j}^{O}{G}_{i,j}{\hat{\beta }}_{G,j}$$

(13)

Where ${{PS}}_{G,i}$ is the individual polygenic score of participant i, j is the SNP number and O represents the total number of SNPs included in this analysis. We then evaluated the predictiveness of each ${{PS}}_{G}$ using ${R}^{2}$ in the validation set (remaining 20% of the UKB sample). We repeated the same process for four univariate ${P}_{G}$ thresholds (10⁻², 10⁻³, 10⁻⁴, 10⁻⁵) for each outcome.

We define ${{PS}}_{{GE}}$ as the ${PS}$ with GxE interactions included. To include GxE interactions, we selected significant interactions based on the univariate association p-value from regressing each GxE interaction with outcome concentration in the discovery set. These interactions are selected from the subset of SNPs included in polygenic scores without interactions. The interactions passing the univariate ${P}_{{GE}}$ thresholds (10⁻², 10⁻³, 10⁻⁴, 10⁻⁵) were then included with the SNPs to create a single block. We applied MonsterLM regression to obtain the multiple linear regression coefficients (${\hat{\beta }}_{G}$, ${\hat{\beta }}_{{GE}}$). Using these coefficients, we calculate the ${{PS}}_{{GE}}$ as:

$${{PS}}_{{GE},i}=\mathop{\sum }_{j}^{O}{G}_{i,j}{\hat{\beta }}_{G,j}+\mathop{\sum }_{k}^{P}{\left(G{{{{{\rm{x}}}}}}E\right)}_{i,k}{\hat{\beta }}_{{GE},k}$$

(14)

Where ${{PS}}_{{GE},i}$ is the polygenic score with interactions incorporated for participant i, summed over each SNP (j) and, if included, its associated interaction (k). O represents the SNPs included in the ${{PS}}_{{GE}}$, while P represents the interactions included, a subset of O. As with the ${{PS}}_{G}$, we evaluated the predictiveness of each polygenic score using ${R}^{2}$ in the validation set. We repeated for all pairwise combinations of the four ${P}_{G}$ thresholds and the four ${P}_{{GE}}$ thresholds, resulting in 16 ${{PS}}_{{GE}}$ for each outcome in addition to four ${{PS}}_{G}$.

Comparison with other methods

We compared MonsterLM estimates to alternative methods. For heritability, MonsterLM estimates were compared to BOLT⁵³ (https://alkesgroup.broadinstitute.org/BOLT-LMM/BOLT-LMM_manual.html) and GRE¹⁷ (https://github.com/bogdanlab/h2-GRE). MonsterLM heritability and GxE estimates were compared with mtg2 IGE²⁸ (https://bio.tools/mtg2) and LDSC³⁷ (https://github.com/bulik/ldsc).

Power calculations

Statistical power was estimated using sets of 10,000 simulations. Non-central F-distributions were used to simulate the observed genetic and interaction effects at each genotype block, and genome-wide heritability and GxE estimates derived as previously described. True set adjusted R² ranged from 0.05 to 0.5. Sample size ranged from N = 50,000 to 400,000 individuals by increments of 10,000. For each condition, power was defined as the proportion of observed p-values less than 0.05 out of the 10,000 simulations.

System requirements

MonsterLM software (Supplementary Software 1) can be run on all major platforms (e.g. GNU/Linux, macOS, Windows). For biobank-scale analyses, recommended hardware requirements are a unix-like virtual environment supporting a minimum of 250 GB RAM space for in-memory operations. System GPUs are optional and can be used to speed up matrix inversion. Software requirements include the program dependencies: BASH (≥5.0), R (≥3.6.3), and GPULS (optional). Essential R dependencies include the packages: tidyverse, data.table, MBESS, and gsl.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

This research has been conducted using individual genetic and phenotypic data obtained from the UK Biobank (http://www.ukbiobank.ac.uk/), under application #15255. The UK Biobank study received approval from the National Health Service National Research Ethics Service North West. Access to the UK Biobank individual-level data is not publicly available and must be obtained via an application (https://www.ukbiobank.ac.uk/register-apply/). All other data supporting the findings described in this manuscript are available in the article and its Supplementary Information files. Source data are provided with this paper.

Code availability

The software package containing all code and relevant documentation to run MonsterLM is available in a public GitHub repository at https://github.com/GMELab/MonsterLM (Supplementary Software 1; https://zenodo.org/record/8092995)⁵⁴.

References

Aschard, H. A perspective on interaction effects in genetic association studies. Genet. Epidemiol. 40, 678–688 (2016).
Article PubMed PubMed Central Google Scholar
Dempfle, A. et al. Gene-environment interactions for complex traits: definitions, methodological requirements and challenges. Eur. J. Hum. Genet. EJHG 16, 1164–1172 (2008).
Article CAS PubMed Google Scholar
Castaldi, P. J. et al. Screening for interaction effects in gene expression data. PloS One 12, e0173847 (2017).
Article PubMed PubMed Central Google Scholar
Kim, J. et al. Joint analysis of multiple interaction parameters in genetic association studies. Genetics 211, 483–494 (2019).
Article PubMed Google Scholar
Dai, J. Y. et al. Simultaneously testing for marginal genetic association and gene-environment interaction. Am. J. Epidemiol. 176, 164–173 (2012).
Article PubMed PubMed Central Google Scholar
Patel, C. J., Chen, R., Kodama, K., Ioannidis, J. P. A. & Butte, A. J. Systematic identification of interaction effects between genome- and environment-wide associations in type 2 diabetes mellitus. Hum. Genet. 132, 495–508 (2013).
Article CAS PubMed PubMed Central Google Scholar
Almasy, L. & Blangero, J. Variance component methods for analysis of complex phenotypes. Cold Spring Harb. Protoc. 2010. 10.1101/pdb.top77
Veerman, J. R., Leday, G. G. R. & van de Wiel, M. A. Estimation of variance components, heritability and the ridge penalty in high-dimensional generalized linear models. Commun. Stat. - Simul. Comput. 0, 1–19 (2019).
MATH Google Scholar
Speed, D., Hemani, G., Johnson, M. R. & Balding, D. J. Improved heritability estimation from genome-wide SNPs. Am. J. Hum. Genet. 91, 1011–1021 (2012).
Article CAS PubMed PubMed Central Google Scholar
Speed, D. et al. Reevaluation of SNP heritability in complex human traits. Nat. Genet. 49, 986–992 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Speed, D. & Balding, D. J. SumHer better estimates the SNP heritability of complex traits from summary statistics. Nat. Genet. 51, 277–284 (2019).
Article CAS PubMed Google Scholar
Evans, L. M. et al. Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits. Nat. Genet. 50, 737–745 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gazal, S., Marquez-Luna, C., Finucane, H. K. & Price, A. L. Reconciling S-LDSC and LDAK models and functional enrichment estimates. http://biorxiv.org/lookup/doi/10.1101/256412 (2018).
Yang, J. et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat. Genet. 47, 1114–1120 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gazal, S. et al. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat. Genet. 49, 1421–1427 (2017).
Article CAS PubMed PubMed Central Google Scholar
Speed, D., Kaphle, A. & Balding, D. J. SNP-based heritability and selection analyses: Improved models and new results. BioEssays 44, 2100170 (2022).
Article Google Scholar
Hou, K. et al. Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture. Nat. Genet. 51, 1244–1251 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Mayhew, A. J. & Meyre, D. Assessing the heritability of complex traits in humans: methodological challenges and opportunities. Curr. Genomics 18, 332–340 (2017).
Article CAS PubMed PubMed Central Google Scholar
Browning, S. R. & Browning, B. L. Population structure can inflate SNP-based heritability estimates. Am. J. Hum. Genet. 89, 191–193 (2011). author reply 193-195.
Article CAS PubMed PubMed Central Google Scholar
Shewchuk, J. R. An introduction to the conjugate gradient method without the agonizing pain. Technical Report no. ICG:865018. (Carnegie-Mellon University, Departmentof Computer Science, Pittsburgh, PA, USA, 1994).
Nogueira, B. & Pinheiro, R. G. S. A GPU based local search algorithm for the unweighted and weighted maximum s-plex problems. Ann. Oper. Res. 284, 367–400 (2020).
Article MathSciNet MATH Google Scholar
Venkatesan, V. et al. Burden of Type 2 Diabetes and Associated Cardiometabolic Traits and Their Heritability Estimates in Endogamous Ethnic Groups of India: Findings From the INDIGENIUS Consortium. Front. Endocrinol. 13, 847692 (2022).
Prasad, R. B. & Groop, L. Genetics of Type 2 Diabetes—Pitfalls and Possibilities. Genes 6, 87–123 (2015).
Article CAS PubMed PubMed Central Google Scholar
McPherson, R. & Tybjaerg-Hansen, A. Genetics of Coronary Artery Disease. Circ. Res. 118, 564–578 (2016).
Article CAS PubMed Google Scholar
Nikpay, M., Stewart, A. F. R. & McPherson, R. Partitioning the heritability of coronary artery disease highlights the importance of immune-mediated processes and epigenetic sites associated with transcriptional activity. Cardiovasc. Res 113, 973–983 (2017).
Article CAS PubMed Google Scholar
Ni, G. & Moser, G. Schizophrenia Working Group of the Psychiatric Genomics Consortium, Wray, N. R. & Lee, S. H. Estimation of Genetic Correlation via Linkage Disequilibrium Score Regression and Genomic Restricted Maximum Likelihood. Am. J. Hum. Genet. 102, 1185–1194 (2018).
Article CAS PubMed PubMed Central Google Scholar
Srivastava, A. K., Williams, S. M. & Zhang, G. Heritability estimation approaches utilizing genome-wide data. Curr. Protoc. 3, e734 (2023).
Article CAS PubMed Google Scholar
Lee, S. H. & van der Werf, J. H. J. MTG2: an efficient algorithm for multivariate linear mixed model analysis based on genomic information. Bioinformatics 32, 1420–1422 (2016).
Article CAS PubMed PubMed Central Google Scholar
Moore, R. et al. A linear mixed model approach to study multivariate gene-environment interactions. Nat. Genet. 51, 180–186 (2019).
Article CAS PubMed Google Scholar
Robinson, M. R. et al. Genotype-covariate interaction effects and the heritability of adult body mass index. Nat. Genet. 49, 1174–1181 (2017).
Article CAS PubMed Google Scholar
Dahl, A. et al. A Robust Method Uncovers Significant Context-Specific Heritability in Diverse Complex Traits. Am. J. Hum. Genet. 106, 71–91 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sulc, J. et al. Quantification of the overall contribution of gene-environment interaction for obesity-related traits. Nat. Commun. 11, 1385 (2020).
Kerin, M. & Marchini, J. Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model. Am. J. Hum. Genet. 107, 698–713 (2020).
Article CAS PubMed PubMed Central Google Scholar
Shin, J. & Lee, S. H. GxEsum: a novel approach to estimate the phenotypic variance explained by genome-wide GxE interaction based on GWAS summary statistics for biobank-scale data. Genome Biol. 22, 183 (2021).
Article PubMed PubMed Central Google Scholar
Jung, H.-U. et al. Gene-environment interaction explains a part of missing heritability in human body mass index. Commun. Biol. 6, 1–11 (2023).
Article Google Scholar
Ni, G. et al. Genotype–covariate correlation and interaction disentangled by a whole-genome multivariate reaction norm model. Nat. Commun. 10, 2239 (2019).
Article ADS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Emerging Risk Factors Collaboration. C-reactive protein concentration and risk of coronary heart disease, stroke, and mortality: an individual participant meta-analysis. Lancet 375, 132–140 (2010).
McCaw, Z. R., Lane, J. M., Saxena, R., Redline, S. & Lin, X. Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics 76, 1262–1272 (2020).
Article MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Akimova, E. T., Breen, R., Brazel, D. M. & Mills, M. C. Gene-environment dependencies lead to collider bias in models with polygenic scores. Sci. Rep. 11, 9457 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLOS Med 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, S. H., Clark, S. & van der Werf, J. H. J. Estimation of genomic prediction accuracy from reference populations with varying degrees of relationship. PLOS ONE 12, e0189775 (2017).
Article PubMed PubMed Central Google Scholar
De La Vega, F. M. & Bustamante, C. D. Polygenic risk scores: a biased prediction? Genome Med. 10, 100 (2018).
Article PubMed Google Scholar
Lumley, T. Kendall’s advanced theory of statistics. Volume 2A: classical inference and the linear model. Alan Stuart, Keith Ord and Steven Arnold, Arnold, London, 1998, No. of pages: xiv+885. Price: £85.00. ISBN 0-340-66230-1. Stat. Med. 19, 3139–3140 (2000).
Article Google Scholar
Kelley, K. Methods for the Behavioral, Educational, and Social Sciences: An R package. Behav. Res. Methods 39, 979–984 (2007).
Article PubMed Google Scholar
Lee, S. H., Wray, N. R., Goddard, M. E. & Visscher, P. M. Estimating Missing Heritability for Disease from Genome-wide Association Studies. Am. J. Hum. Genet. 88, 294–305 (2011).
Article PubMed PubMed Central Google Scholar
Aschard, H., Vilhjálmsson, B. J., Joshi, A. D., Price, A. L. & Kraft, P. Adjusting for Heritable Covariates Can Bias Effect Estimates in Genome-Wide Association Studies. Am. J. Hum. Genet. 96, 329–339 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dahl, A., Cai, N., Flint, J. & Zaitlen, N. GxEMM: Extending linear mixed models to general gene-environment interactions. bioRxiv 397638 (2018) https://doi.org/10.1101/397638.
Poppitt, S. D. et al. Long-term effects of ad libitum low-fat, high-carbohydrate diets on body weight and serum lipids in overweight subjects with metabolic syndrome. Am. J. Clin. Nutr. 75, 11–20 (2002).
Article CAS PubMed Google Scholar
Rivera, N. V. et al. A Gene–Environment Interaction Between Smoking and Gene polymorphisms Provides a High Risk of Two Subgroups of Sarcoidosis. Sci. Rep. 9, 18633 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Boua, P. R. et al. Novel and Known Gene-Smoking Interactions With cIMT Identified as Potential Drivers for Atherosclerosis Risk in West-African Populations of the AWI-Gen Study. Front. Genet. 10, 1354 (2020).
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).
Article CAS PubMed PubMed Central Google Scholar
Di Scipio, M. MonsterLM v0.1.1. (2023) https://doi.org/10.5281/zenodo.8092995.

Download references

Acknowledgements

The authors are thankful for all the UK Biobank participants. We would also like to thank Dr. Andrew McArthur and his laboratory at McMaster University for providing GPU support. M.D. is supported by a Canadian Institute of Health Research Doctoral Award, a Mach-Gaensslen Foundation Award, and a McMaster Undergraduate Medical Education Research scholarship.

Author information

These authors contributed equally: Matteo Di Scipio, Mohammad Khan.

Authors and Affiliations

Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
Matteo Di Scipio, Mohammad Khan, Shihong Mao, Michael Chong, Conor Judge, Nazia Pathan, Nicolas Perrot, Ricky Lali, Robert Morton, Jeremy Petch & Guillaume Paré
Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
Matteo Di Scipio, Mohammad Khan, Nazia Pathan & Jeremy Petch
Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada
Michael Chong & Guillaume Paré
Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada
Michael Chong, Robert Morton & Guillaume Paré
Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
Walter Nelson, Shuang Di & Jeremy Petch
Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
Walter Nelson
Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
Ricky Lali & Guillaume Paré
Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
Shuang Di
Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada
Jeremy Petch

Authors

Matteo Di Scipio
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Khan
View author publications
You can also search for this author in PubMed Google Scholar
Shihong Mao
View author publications
You can also search for this author in PubMed Google Scholar
Michael Chong
View author publications
You can also search for this author in PubMed Google Scholar
Conor Judge
View author publications
You can also search for this author in PubMed Google Scholar
Nazia Pathan
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Perrot
View author publications
You can also search for this author in PubMed Google Scholar
Walter Nelson
View author publications
You can also search for this author in PubMed Google Scholar
Ricky Lali
View author publications
You can also search for this author in PubMed Google Scholar
Shuang Di
View author publications
You can also search for this author in PubMed Google Scholar
Robert Morton
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Petch
View author publications
You can also search for this author in PubMed Google Scholar
Guillaume Paré
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.D.: data curation, software, formal analysis, investigation, visualization, writing; M.K.: data curation, software, formal analysis, investigation, visualization, writing; S.M.: data curation, software, formal analysis; M.C.: data curation, analysis interpretation, writing (review and editing); C.J.: formal analysis, visualization, writing (review and editing); Na.P.: formal analysis; Ni.P.: formal analysis; WN: software; R.L.: software; S.D.: software; R.M.: visualization, writing (review and editing); J.P.: software; G.P.: conceptualization, supervision, funding acquisition, methodology, project administration, writing (review and edit).

Corresponding author

Correspondence to Guillaume Paré.

Ethics declarations

Competing interests

M.C. has received consulting fees from Bayer. G.P. has received consulting fees from Bayer, Sanofi, Amgen, and Illumina. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Sang Hong Lee and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Software 1

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Di Scipio, M., Khan, M., Mao, S. et al. A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets. Nat Commun 14, 5196 (2023). https://doi.org/10.1038/s41467-023-40913-7

Download citation

Received: 23 April 2021
Accepted: 16 August 2023
Published: 25 August 2023
DOI: https://doi.org/10.1038/s41467-023-40913-7

This article is cited by

A method to estimate the contribution of rare coding variants to complex trait heritability
- Nazia Pathan
- Wei Q. Deng
- Guillaume Paré
Nature Communications (2024)
What Causes Premature Coronary Artery Disease?
- Ann Le
- Helen Peng
- Guillaume Paré
Current Atherosclerosis Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Validation of MonsterLM using simulations

Genome-wide interaction and heritability estimation in the UK Biobank

Comparison with other methods

Recovery of interaction and heritability variance according to SNP marginal and interaction effects

Polygenic scores analysis

Discussion

Methods

UK Biobank

MonsterLM estimations of variance explained by GxE effects

Step one: genotype and phenotype input and quality control

Step two: calculating the coefficients of determination

Step three: calculating total genetic and interaction estimates and confidence intervals

MonsterLM for dichotomous outcomes and exposures

Validation of MonsterLM using simulations

Applying MonsterLM to UK Biobank traits and exposures

Directionality of effects analysis

Stratification of estimates by MAF and LD

Polygenic scores analysis

Comparison with other methods

Power calculations

System requirements

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links