Detecting heritable phenotypes without a model using fast permutation testing for heritability and set-tests

Schweiger, Regev; Fisher, Eyal; Weissbrod, Omer; Rahmani, Elior; Müller-Nurasyid, Martina; Kunze, Sonja; Gieger, Christian; Waldenberger, Melanie; Rosset, Saharon; Halperin, Eran

doi:10.1038/s41467-018-07276-w

Article
Open access
Published: 21 November 2018

Regev Schweiger¹,
Eyal Fisher²,
Omer Weissbrod³,
Elior Rahmani¹,
Martina Müller-Nurasyid ORCID: orcid.org/0000-0003-3793-5910^4,5,6,
Sonja Kunze^7,8,
Christian Gieger^7,8,
Melanie Waldenberger^6,7,8,
Saharon Rosset² &
Eran Halperin ORCID: orcid.org/0000-0002-2373-3691^9,10

Nature Communications volume 9, Article number: 4919 (2018)

Abstract

Testing for association between a set of genetic markers and a phenotype is a fundamental task in genetic studies. Standard approaches for heritability and set testing strongly rely on parametric models that make specific assumptions regarding phenotypic variability. Here, we show that resulting p-values may be inflated by up to 15 orders of magnitude, in a heritability study of methylation measurements, and in a heritability and expression quantitative trait loci analysis of gene expression profiles. We propose FEATHER, a method for fast permutation-based testing of marker sets and of heritability, which properly controls for false-positive results. FEATHER eliminated 47% of methylation sites found to be heritable by the parametric test, suggesting a substantial inflation of false-positive findings by alternative methods. Our approach can rapidly identify heritable phenotypes out of millions of phenotypes acquired via high-throughput technologies, does not suffer from model misspecification and is highly efficient.

Introduction

One of the most fundamental problems in genetics is testing whether a particular phenotype is associated with a set of markers. For instance, it is often desired to understand whether a specific phenotype is heritable. The heritability of a phenotype is defined as the proportion of the variance explained by a genetic component. In this example, the set of single-nucleotide polymorphisms (SNPs) that are tested for association with the phenotype includes the entire set of SNPs of the genome; when local heritability is tested, the set includes only the SNPs in particular regions, such as chromosomes, or those with a particular functional annotation. Furthermore, these variants are often tested against a large number of phenotypes, such as expression profiles of genes^1,2,3,4, methylation levels^5,6,7,8, or neuroimaging measurements^9,10. A similar notion to heritability has been tested in other fields as well, beyond genetics. For example, in metagenomics, it is a common practice to test for an association between a phenotype and the relative abundance vector obtained from either shotgun sequencing or targeted 16S sequencing¹¹. Heritability is commonly studied using the linear mixed model (LMM), a linear model that implicitly assumes a very small effect for each of the SNPs¹².

Naive association testing of single markers may be extremely underpowered, even in large datasets. The common technique to address this issue is set testing, which groups together markers and applies a joint test to them^13,14,15,16. Set testing is especially important when analyzing data with rare variants. Rare variants are becoming widely available, with sequencing costs rapidly decreasing, resulting in many whole-genome sequencing studies. Rare variants are particularly important since a large part of human genetic variation can be explained by these variants.

The success of the LMM depends on the degree to which it fits the data. For example, the LMM assumes the phenotype follows a normal distribution. However, phenotypes which are discrete, multimodal, bounded, truncated, or in general whose residuals, after adjusting for covariates, do not exhibit normality, might not be suitable for use with the LMM. The same argument holds for generalized LMMs (GLMMs), which replace the normality assumption with other parametric distributions. To mitigate such issues, one can attempt to pre-process the phenotypic values to make them as Gaussian as possible (see, e.g.¹⁷). However, there is no guarantee that a sufficiently good transformation exists, and the dependency on the parametric model may not be robust to other types of model deviations.

Our goal is to develop a practical and globally applicable test for existence of heritability, which will apply to the test statistic calculated via the LMM mechanism, whether or not the LMM assumptions actually fit the data well. Permutation testing is a non-parametric, assumption-free method for testing the null hypothesis of sample exchangeability¹⁸. Such exchangeability holds, for example, for any model under which non-heritable phenotypes are independent and identically distributed across all individuals, such as the LMM with a constant covariate and zero heritability. It also holds approximately with general covariates, under many realistic settings. In such permutation testing, we repeatedly permute the labels corresponding to each individual in the phenotype (along with additional covariates), re-estimate the heritability of each shuffled dataset, and compute the proportion of permutations for which we got a higher heritability value than the original estimate. This offers an intuitive notion of significance, which has little dependency on the underlying distribution of the phenotype.

Unfortunately, permutation tests are computationally demanding, requiring the calculation of the test statistic for each permuted dataset. As a rule of thumb, accurate estimation of a p-value of 1/M requires 100 × M permutations¹⁹. Contemporary QTL studies³ often carry out hundreds of thousands (or more) of tests, calling for p-values smaller than, e.g., 0.05/100,000 to establish significance after multiple testing correction. Thus, permutation-based testing of such studies will require on the order of 10⁹ permutations, a formidable task.
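The arithmetic behind this estimate can be sketched as follows. The function name and its arguments are illustrative and not part of the paper; the factor of 100 is the rule of thumb quoted above.

```python
def permutations_needed(alpha: float, n_tests: int, factor: int = 100) -> int:
    """Permutations suggested by the rule of thumb for a Bonferroni threshold.

    The per-test threshold is alpha / n_tests = 1/M, so M = n_tests / alpha,
    and the rule of thumb asks for factor * M permutations.
    """
    return round(factor * n_tests / alpha)


# 100,000 tests at alpha = 0.05 give a threshold of 5e-7 (M = 2e6).
print(permutations_needed(0.05, 100_000))   # 200000000
# 500,000 tests at alpha = 0.05 give a threshold of 1e-7 (M = 1e7).
print(permutations_needed(0.05, 500_000))   # 1000000000
```

With hundreds of thousands of tests, the count therefore lands between roughly 10⁸ and 10⁹ permutations per phenotype.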

Previous works suggest permutation procedures as a way to calibrate the distribution of a statistic of choice, in the context of LMMs. For example, permutations are used to calibrate the likelihood ratio (LR) test statistic, assuming its distribution comes from a limited family of distributions¹⁶ or without such assumptions^20,21. This approach was extended to be used in association with multiple traits²². A less restrictive permutation scheme was used in ref.²³ to test whether the distribution of survival endpoints varies among centers in an acute myeloid leukemia multicentre study, using the LMM for analysis. More generally, permutation and other nonparametric bootstrapping schemes have been used to calibrate various statistics suggested for testing for nonzero variance components in GLMMs. In ref.²⁴, an alternative statistic based on the score test is suggested, with a corresponding non-permutation-based parametric bootstrap test. Other works follow a similar permutation procedure to the one presented in this work, but apply it to a newly suggested T-based statistic²⁰ or to particular quadratic forms of the phenotype²⁵. In ref.⁹, one of the considered permutation tests is similar to ours, but the authors do not address computational issues, which limits their ability to detect small p-values.

In this paper, we study the behavior of the permutation test compared to the p-values calculated by assuming the parametric LMM. We analyze methylation measurement profiles from the longitudinal KORA study (Cooperative health research in the Region of Augsburg), and gene expression profiles from the GTEx project. We show that large discrepancies exist between the two tests. In particular, p-values from the parametric test are often much smaller than those obtained from the permutation test. We show that this likely stems from model mis-specification of the LMM, which suggests a large majority of methylation sites or expression profiles with seemingly significant heritability are in fact false positives, motivating the use of permutation tests.

We then propose a fast method for permutation testing for heritability and for set testing. To do so, we address two issues. First, for each permutation, we speed up evaluation by using the derivative of the likelihood of the permuted phenotype instead of a full estimation step. Second, we use an efficient p-value evaluation procedure²⁶ based on the Stochastic Approximation Markov Chain Monte Carlo (SAMC) algorithm^27,28, which allows us to estimate the significance of a heritability estimate with a fraction of the number of permutations required by the naive method. We apply our approach to the KORA dataset, to achieve a speed up of up to eight orders of magnitude in p-value calculation.

Results

For a phenotype y, the LMM assumes y = Xβ + g + e, where Xβ is the contribution of covariates, ${\mathbf{g}}\sim {\cal N}({\mathbf{0}},\sigma _g^2{\mathbf{K}})$ is the genetic component of the trait, and ${\mathbf{e}}\sim {\cal N}({\mathbf{0}},\sigma _e^2{\mathbf{I}})$ is the environmental component. K is a kinship matrix capturing the genetic relatedness between individuals in the sample, which can be constructed in various ways, using genotypes or known familial relations. Heritability is then defined as $h^2 = \sigma _g^2/(\sigma _g^2 + \sigma _e^2)$. Similarly, the heritability estimate is $\hat h^2 = \hat \sigma _g^2/(\hat \sigma _g^2 + \hat \sigma _e^2)$, where $\hat \sigma _g^2$ and $\hat \sigma _e^2$ are estimates for $\sigma _g^2$ and $\sigma _e^2$, respectively. These estimates are calculated using restricted maximum likelihood (REML) estimation (see Methods).

Under the LMM, the common technique for parametric p-value calculation (e.g., in the popular GCTA software package²⁹) is to calculate the generalized likelihood ratio test statistic, and to assume it is distributed as a 50:50 mixture of zero and the chi-square distribution (see Methods). Other distributions are sometimes used¹⁶, as we discuss below. An alternative test, usually used in the context of set testing, is the score test for $\sigma _g^2 = 0$, used most predominantly as implemented by the Sequential Kernel Association Test (SKAT)³⁰. The distribution of the SKAT test statistic is assumed to be a certain weighted mixture of chi-square distributions.
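As a minimal sketch of the parametric p-value described above, the following assumes the chi-square component has one degree of freedom (the usual choice when testing a single variance component; the text here does not state the degrees of freedom). The function name is illustrative and does not come from GCTA.

```python
import math


def lrt_mixture_p_value(lrt_statistic: float) -> float:
    """P-value under a 50:50 mixture of a point mass at zero and chi-square(1).

    Assumption: one degree of freedom for the chi-square component.
    """
    if lrt_statistic < 0:
        raise ValueError("the likelihood ratio statistic must be non-negative")
    if lrt_statistic == 0:
        # Every draw from the mixture is >= 0, so the p-value is 1.
        return 1.0
    # Survival function of chi-square(1): P(Z^2 > x) = erfc(sqrt(x / 2)).
    chi2_sf = math.erfc(math.sqrt(lrt_statistic / 2.0))
    return 0.5 * chi2_sf


p = lrt_mixture_p_value(3.841458820694124)  # 95% quantile of chi-square(1)
print(p)  # close to 0.025
```

The p-value is half of the plain chi-square tail probability, which is why a misjudged null distribution translates directly into miscalibrated significance.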

To avoid model mis-specification or relying on an asymptotic distribution of the test statistic, we consider the permutation test. In the context of heritability, it consists of permuting the phenotype, estimating the heritability for each permuted phenotype, and counting the proportion of permutations for which the estimate obtained was higher than that of the original phenotype (see Methods). In practice, we do not enumerate over the entire set of permutations, but rather sample random permutations (e.g., N = 1,000,000). This gives us an estimate of the p-value of the test under the model, for which we can construct accurate confidence intervals (CIs). One appealing property of the permutation test is that it is an exact test under the assumption that the true model for the data is invariant to relabeling of individuals, which holds specifically if the trait is non-heritable, and holds approximately when non-constant covariates are used, in a wide range of settings. For example, if we decide that a p-value of < 0.001 is our threshold for proclaiming a phenotype as heritable, then we will falsely label non-heritable phenotypes as heritable about 0.1% of the time.
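The naive procedure can be sketched as below. The `statistic` argument is a hypothetical stand-in for the REML heritability estimate, which is not implemented here; any function of the (permuted) phenotype can be plugged in. The p-value is the plain proportion of exceedances, as described in the text.

```python
import random
from typing import Callable, Sequence


def permutation_p_value(
    y: Sequence[float],
    statistic: Callable[[Sequence[float]], float],
    n_perm: int = 1000,
    seed: int = 0,
) -> float:
    """Proportion of random permutations whose statistic exceeds the observed one."""
    rng = random.Random(seed)
    observed = statistic(y)
    shuffled = list(y)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # relabel individuals uniformly at random
        if statistic(shuffled) > observed:
            exceed += 1
    return exceed / n_perm


def quadratic_form(kinship: Sequence[Sequence[float]]) -> Callable[[Sequence[float]], float]:
    """Illustrative statistic y^T K y; a stand-in, not the REML estimate."""
    def stat(v: Sequence[float]) -> float:
        n = len(v)
        return sum(v[a] * kinship[a][b] * v[b] for a in range(n) for b in range(n))
    return stat


K = [[1.0, 0.5, 0.0], [0.5, 1.0, 0.0], [0.0, 0.0, 1.0]]
print(permutation_p_value([2.0, 1.5, -3.0], quadratic_form(K), n_perm=200))
```

The cost is one full evaluation of the statistic per permutation, which is what makes the naive test slow when the statistic is a REML estimate.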

Large discrepancies in p-values in a methylation study

We compared the p-values of the permutation test to the p-values calculated by assuming the parametric LMM. We analyzed methylation measurement profiles from the longitudinal KORA study (Cooperative health research in the Region of Augsburg), which consists of subjects from the general population living in the region of Augsburg, southern Germany. In this dataset, both whole-blood methylation levels and genotypes are available for 1799 individuals (see Methods). The phenotype in this study is the proportion of methylated samples at a specific site, averaged across cells. These proportions are often empirically bimodal for a given site, and their values are bounded between 0 and 1, and thus it is not clear that an assumption of normality or near-normality is suitable here.

Several works have studied heritable DNA methylation effects and the role of such epigenetic variants in disease and genetic regulation (see, e.g., a recent review⁵ and references therein). For example, heritable methylation sites were previously shown⁶ to be enriched for open chromatin regions and binding sites of regulators of transcription and chromatin architecture, and to be proximal to genes enriched in several known pathways, suggesting a potential regulatory mechanism through which genetic variation can affect phenotype.

We calculated the LMM heritability estimates for 431,366 methylation sites, and calculated their p-values using two methods: the parametric test (using the generalized likelihood ratio test, e.g., as implemented in GCTA) and a permutation test with 10,000 permutations, using only an intercept covariate (see Methods). We observed that parametric p-values are often considerably smaller than the exact p-values obtained by the permutation test, frequently by several orders of magnitude, and thus they may incur false-positive findings (Supplementary Figure 1). We re-ran this analysis using age, sex, and smoking status as covariates, which are commonly used in methylation studies as known confounders. The results (Fig. 1) again show large discrepancies, indicating that they are not a result of the lack or addition of covariates in the analysis.

Applying a Bonferroni threshold of 0.05/431,366 ≈ 10⁻⁷, we further took the 3489 most significant sites that passed this threshold according to GCTA. We then estimated their p-values using 10,000,000 permutations (Supplementary Figure 2). This increased accuracy reveals that for many sites, GCTA would proclaim a site as very significant, while permutation testing does not indicate significance. For example, 395 sites have a permutation p-value of p > 10⁻⁴, but a parametric p-value of p < 10⁻¹⁰. Further inspection of phenotypes displaying such large discrepancies revealed that they are often the result of the phenotype taking disparate values for individuals who are relatively genetically distant from the rest of the sample.

Finally, to check for opposite discrepancies, we took the 6686 sites with a permutation p-value of 0/10,000, and calculated their permutation p-values using the method we present below. For 81 sites, the estimated permutation p-value was significant while the parametric p-value was not, indicating that parametric methods can suffer not only from false-positive results, but also from lack of power. For most of these sites (62/81 sites, 76%), the discrepancy was less than one order of magnitude, which can be accounted for by noise in p-value estimation. The remaining 19 sites showed discrepancies below two orders of magnitude, and 14 of them exhibit tri-modal behavior. The results are summarized in Table 1.

Table 1 Summary of p-value discrepancies

Full size table

Large discrepancies in p-values in a gene expression study

In order to show the generality of the discrepancy phenomenon, we analyzed gene expression data in the GTEx dataset (see Methods). We first performed a cis-eQTL study, where for each of 22,171 genes, we created a kinship matrix from the SNPs located within 500 kbp of the gene transcription start site (total window size of 1 Mbp). For each gene, we used both the parametric and the permutation tests to test for association between the SNPs in the window and the gene expression profile, as measured in whole-blood samples. We used the same covariates as the original GTEx study: the first three genotyping principal components (PCs), the first 15 expression PEER factors (Probabilistic Estimation of Expression Residuals)³¹, and sex. Gene expression profiles were quantile-normalized before analysis (see ref.³²). As evident in Fig. 2, despite inclusion of relevant covariates and careful preprocessing, there still remain significant discrepancies that could be a source of false positives. Indeed, in the original GTEx study, one required criterion for detection of eQTL-containing genes is a permutation p-value obtained from 10,000 permutations³². This exemplifies the need for a fast and accurate permutation testing procedure.

In addition, we performed a heritability study over the same data, where now the set of tested markers includes the entire genome. We still observed p-value discrepancies, when permutation p-values did not detect significantly heritable expression profiles, while parametric p-values did. Further analysis showed that all profiles with significant parametric p-values obtained a heritability estimate of $\hat h^2 = 1$ (Supplementary Figure 3). In such a case, the assumed LRT statistic distribution (see Methods) does not handle the maximal boundary estimate correctly. In studies of small sample sizes, such boundary estimates are likely to occur³³, so such discrepancies are expected. This presents another scenario where parametric p-values are not calibrated.

Reasons for p-value discrepancy

We proceeded to analyze possible underlying reasons for p-value discrepancies. The analysis is given in Supplementary Note 1, and is summarized here. First, we showed that the permutation test is as powerful as the parametric test under the LMM (Supplementary Figure 4), so that power differences do not explain the discrepancy. Second, we applied quantile normalization (QN) as a preprocessing step before calculating p-values, finding that it did not eliminate the discrepancies, and introduced a potential power loss (Fig. 3). Third, we considered an extended family of distributions as an alternative to the assumed distribution of the LRT statistic¹⁶, but it failed to substantially alter the results. Fourth, we verified that sites with p-value discrepancies are not limited to those with multimodal behavior. To this end, we excluded from the analysis both sites whose probes are known to contain SNPs (and are thus expected to be multimodal), and sites empirically showing multimodal behavior. However, p-value discrepancies remained. Fifth, we examined whether this discrepancy exists when using the score test instead of the LR test. Using the SKAT method³⁰ to calculate p-values, we found that it generated significantly deflated p-values throughout the dataset, indicating that the statistic distribution is not calibrated^34,35. Using RL-SKAT³⁵ to calculate calibrated p-values, we observed the same discrepancies (Supplementary Figure 5). Finally, we performed a simulation study, where non-heritable phenotypes were generated from a non-normal, heavy-tailed marginal distribution per entry. These phenotypes showed parametric p-value miscalibration, while permutation p-values remained calibrated.

We conclude that model mis-specification is the probable reason for such large discrepancies, which further motivates the use of the permutation test. We find that if the LMM is a suitable model for the data at hand, then both the parametric and the permutation tests have similar power. In the case where the LMM is not suitable, the parametric test breaks while the permutation test remains calibrated under much weaker assumptions. Therefore, from a statistical perspective, the permutation test is superior in the context of testing for heritability as defined by the LMM.

Speeding up the evaluation step per permutation

Running time is a major obstacle to performing permutation tests. First, for each permutation, finding the heritability estimate of the permuted phenotype is a computationally intensive task. Second, in order to accurately estimate small p-values, we would need to draw many permutations. In our proposed method, FEATHER (Fast pErmutAtion Testing HERitability), we address these two considerations in turn, beginning with the task of speeding up the evaluation performed for each random permutation.

For each permuted phenotype, the naive permutation test estimates its heritability and compares it to the heritability of the unpermuted phenotype, denoted H². However, we are not in fact interested in the estimated heritability value of the permuted phenotype, but rather only if it is smaller or larger than that of the unpermuted phenotype. Recall that REML obtains the heritability estimate which maximizes the (restricted) likelihood of the permuted phenotype as a function of the suggested heritability value. Consider the derivative of the likelihood function at the point H². Assuming the likelihood function is well behaved, this derivative points us to the direction of the maximum: If the derivative is positive, then the maximum is obtained at a value larger than H², and conversely if it is negative (see Supplementary Figure 6 for an illustration). Therefore, a faster approach is to simply examine the derivative of the likelihood function, rather than trying to find its maximum. We validated the assumption that the likelihood function is well behaved in practice with extensive simulations of real and permuted phenotypes (Supplementary Note 2).
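The derivative check can be illustrated with a simplified sketch. It assumes a zero-mean model without covariates and uses the ordinary (not restricted) profile likelihood of y ~ N(0, σ²(h²K + (1 − h²)I)); the paper's REML expression is not reproduced here. The inputs `rotated_y` (the phenotype multiplied by the transposed eigenvectors of K) and `eigenvalues` are assumed to be precomputed, and all names are illustrative.

```python
from typing import Sequence


def loglik_derivative(h2: float, rotated_y: Sequence[float],
                      eigenvalues: Sequence[float]) -> float:
    """Derivative w.r.t. h2 of the profile log-likelihood (simplified model).

    With xi_i = h2 * d_i + (1 - h2) and S = sum(y_i^2 / xi_i), the profile
    log-likelihood is -(n/2) * log(S) - (1/2) * sum(log(xi_i)) + const.
    Requires every xi_i > 0, e.g. 0 <= h2 < 1.
    """
    n = len(rotated_y)
    s = ds = trace_term = 0.0
    for y_i, d_i in zip(rotated_y, eigenvalues):
        xi = h2 * d_i + (1.0 - h2)
        s += y_i * y_i / xi
        ds += y_i * y_i * (d_i - 1.0) / (xi * xi)
        trace_term += (d_i - 1.0) / xi
    return 0.5 * n * ds / s - 0.5 * trace_term


def permuted_estimate_exceeds(h2_observed: float, rotated_permuted_y: Sequence[float],
                              eigenvalues: Sequence[float]) -> bool:
    """True if the derivative at the observed estimate is positive.

    Under a well-behaved (single-peaked) likelihood this means the permuted
    phenotype's maximizer lies above h2_observed.
    """
    return loglik_derivative(h2_observed, rotated_permuted_y, eigenvalues) > 0.0
```

Each call is a single pass over the n eigenvalues, so once the rotated phenotype is available, no iterative optimization is needed to decide on which side of H² the permuted estimate falls.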

Additionally, in our previous work³³ it was shown that given the eigendecomposition of the kinship matrix, the derivative of the likelihood function can be calculated in O(n²) time. Moreover, the core of the computation is a single matrix-by-vector product, an operation enjoying an efficient implementation in existing software and hardware³⁶, and thus a small constant factor (see Methods). The savings in computational complexity depend on the heritability estimation algorithm used in the naive approach. For example, when using the AI or the EM algorithms, as in GCTA²⁹, the computational complexity is O(n³), which gives our approach a speed-up factor of O(n). With approaches that utilize the eigendecomposition, such as pylmm³⁷ or FaST-LMM³⁸, the asymptotic complexity is the same; however, we still get a significant empirical speed-up as a result of avoiding many evaluations of the likelihood function. In practice, on the KORA dataset, we observed a speed-up of four orders of magnitude per permutation for the derivative-based approach compared to full estimation using the widely used GCTA tool, and an order of magnitude improvement compared to FaST-LMM (Table 2).

Table 2 Benchmarks

Full size table

Reducing the number of sampled permutations with SAMC

A major computational hurdle of permutation testing is the potentially large number of random permutations that need to be used in order to estimate small p-values accurately. To cope with this computational burden, we use an efficient p-value evaluation procedure based on the Stochastic Approximation Markov Chain Monte Carlo (SAMC) algorithm^26,27. We give an overview of the method and its properties here. For the full description, see Supplementary Note 4.

In the context of heritability testing, we utilize SAMC as follows. Given an estimate H², we want to calculate its p-value, i.e., the probability that a randomly permuted phenotype obtains a higher heritability estimate. We divide the interval [0,1] into D + 1 intervals, where the interval [0,H²] is divided into D equally sized intervals, and [H²,1] is an additional interval. This induces a partitioning of the permutation space into D + 1 subsets. Each subset is the set of permutations of the phenotype for which the estimated heritability value falls in the corresponding interval. Then, the p-value is exactly the size of the subset corresponding to [H²,1], divided by n!, the number of permutations of n individuals.

The SAMC algorithm estimates the size (i.e., probability) of each subset in the partition. Starting with an arbitrary initial permutation, each SAMC iteration consists of two steps: (1) Given the current random permutation, sample a new random permutation according to a certain target distribution, using the Metropolis-Hastings (MH) sampling algorithm; (2) Given the subset in which the new permutation falls, update the partition probability estimates, and the target distribution of step 1. The update rule for subset probability estimates follows the stochastic approximation algorithm³⁹, which ensures that the estimates can be improved continuously as the simulation goes on. Importantly, the number of random permutations required for convergence is much smaller than the reciprocal of the estimated p-value, which is the scale of the number of permutations needed by the naive test.
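The two-step iteration can be illustrated with a toy sketch of the generic SAMC update over permutations. Several choices here are assumptions and not taken from the text: the proposal swaps two positions, the desired visiting frequency is uniform over subsets, and the gain sequence is t0 / max(t0, t). The callable `region_of`, which maps a permutation to the index of its subset, is hypothetical. This is not FEATHER's C++ implementation.

```python
import math
import random
from typing import Callable, List, Sequence


def samc_subset_probabilities(
    n: int,
    region_of: Callable[[Sequence[int]], int],
    n_regions: int,
    n_iter: int = 20000,
    t0: int = 1000,
    seed: int = 0,
) -> List[float]:
    """Estimate the probability of each subset under a uniform random permutation."""
    rng = random.Random(seed)
    perm = list(range(n))
    rng.shuffle(perm)
    theta = [0.0] * n_regions      # log-weight of each subset
    desired = 1.0 / n_regions      # assumed uniform desired visiting frequency
    current = region_of(perm)
    for t in range(1, n_iter + 1):
        # Step 1: MH move with a symmetric proposal (swap two positions).
        i, j = rng.sample(range(n), 2)
        perm[i], perm[j] = perm[j], perm[i]
        proposed = region_of(perm)
        # The working target is proportional to exp(-theta[subset]).
        log_ratio = theta[current] - theta[proposed]
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            current = proposed
        else:
            perm[i], perm[j] = perm[j], perm[i]   # rejected: undo the swap
        # Step 2: stochastic approximation update of the log-weights.
        gain = t0 / max(t0, t)
        for k in range(n_regions):
            theta[k] += gain * ((1.0 if k == current else 0.0) - desired)
    # With a uniform desired frequency, probabilities are proportional to exp(theta).
    top = max(theta)
    weights = [math.exp(v - top) for v in theta]
    total = sum(weights)
    return [w / total for w in weights]


# Toy check: subset 1 holds permutations that keep item 0 in place (true probability 1/5).
print(samc_subset_probabilities(5, lambda p: 1 if p[0] == 0 else 0, 2))
```

Because rarely visited subsets have their weights lowered, the chain is pushed toward them, which is what lets small tail probabilities be estimated without sampling on the order of 1/p permutations.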

Additionally, in order to determine in which subset a permutation falls, we need not calculate the heritability estimate of the respective permuted phenotype. Instead, we can check the derivatives at the endpoints of the interval. If the derivative is positive at the left endpoint and negative at the right endpoint, then we know a maximum exists within that interval. Using the derivative allows us to avoid the heritability estimation step, as before. When the algorithm converges, its estimate for the last subset will be our estimate of the p-value.
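A minimal sketch of this subset lookup follows, assuming a single-peaked likelihood so that the derivative is positive to the left of the maximizer and non-positive to its right. The `derivative` argument is a hypothetical callable returning the likelihood derivative of the permuted phenotype at a given heritability value.

```python
from typing import Callable


def subset_index(derivative: Callable[[float], float],
                 h2_observed: float, n_bins: int) -> int:
    """Index (0..n_bins) of the interval containing the maximizer.

    Intervals 0..n_bins-1 split [0, H^2] evenly; index n_bins is [H^2, 1].
    Assumes h2_observed > 0 and a single-peaked likelihood.
    """
    index = 0
    for k in range(1, n_bins + 1):
        edge = h2_observed * k / n_bins
        if derivative(edge) > 0.0:
            index = k        # the maximizer lies to the right of this edge
        else:
            break            # all further edges are also past the maximizer
    return index


# Toy derivative of a concave function peaked at 0.33, with H^2 = 0.5 and D = 5.
print(subset_index(lambda h: -2.0 * (h - 0.33), 0.5, 5))  # 3, i.e. [0.3, 0.4]
```

The linear scan is used for clarity; since the sign pattern is monotone under the single-peak assumption, a binary search over the edges would need only about log₂(D) derivative evaluations.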

Analysis of the performance of SAMC

We implemented both the simple and the SAMC derivative-based permutation tests as an efficient, multi-threaded C++ program. The fact that only a few of the values of the random permutation change between successive iterations allows SAMC to be faster than standard permutation testing, per permutation (see Table 2).

SAMC has several parameters that need to be tuned for a successful application. The most important are the number of intervals, D; the number of iterations, N; and an additional parameter, t₀, that corresponds to the number of iterations after which the estimation will begin converging more rapidly. As described in Supplementary Note 4, we chose t₀ = 1,000 and N = 1,000,000 as suitable parameters. Here and throughout the rest of the paper, we used D = 50 intervals.

In Fig. 4 we show the SAMC p-value estimates, compared to those from a standard permutation testing with 10,000 permutations, across all methylation sites in chromosome 22. To further validate the accuracy of SAMC on smaller p-values, we ran SAMC on the sites which GCTA deemed significant, as described above. Again, SAMC appears to give accurate estimates, also for small p-values, where such accuracy is more important in practical applications, as shown in Supplementary Figure 7. Finally, we chose sites with particularly small p-values, and compared their parametric p-values to those by the permutation test with 10⁹ permutations, and with the p-values estimated by SAMC. In Table 3, we show 10 sites with permutation p-values larger than 0, which enable us to informatively examine the calibration of SAMC. The results suggest that SAMC continues to give accurate permutation p-value estimates, as far as it was possible for us to assess.

Table 3 Performance of SAMC on extreme p-values

Full size table

While SAMC is guaranteed to converge²⁷, there are no theoretical guarantees on the speed of convergence. In practice, we have observed that the number of required permutations is reduced by two to five orders of magnitude. This is in line with previous applications of SAMC²⁶. In summary, the derivative-based approach gives a speed-up of at least two orders of magnitude over full estimation approaches; additionally, for small p-values, the SAMC approach may improve by up to another six orders of magnitude. These improvements allow testing for heritability without the assumption of a parametric model, in a feasible time.

Discussion

In this work, we have discussed the merits of permutation testing for heritability, compared with parametric methods. We have presented two ways to accelerate permutation testing: First, using the derivative of the likelihood function in order to avoid finding the maximum likelihood estimator; and second, using SAMC to substantially reduce the required number of permutations. We have shown that with these modifications, the running time decreases by several orders of magnitude.

SAMC requires a minimal number of permutations before convergence. Therefore, given a large set of phenotypes for which we wish to test for heritability, we suggest the following scheme. First, perform simple (non-SAMC) permutation testing with a small (e.g., 100) number of permutations. Filter out all sites whose permutation p-value was too large, e.g., for which the lower end of a one-sided binomial or Poisson confidence interval is larger than the threshold p-value indicating significance. Continue with this gradual filtering, increasing the number of permutations in each round. Once reaching N large enough for SAMC convergence (calibrated as described above), switch to SAMC for estimating the p-value for the remaining sites.
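The filtering rule in one round of this scheme can be sketched as follows. It assumes an exact (Clopper-Pearson style) one-sided binomial interval: the lower confidence bound exceeds the threshold exactly when the binomial upper-tail probability, evaluated at the threshold, falls below the chosen level. The level `alpha` and the function names are illustrative and not specified in the text.

```python
import math


def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p), with 0 < p < 1 and 0 <= k <= n."""
    if not 0 <= k <= n:
        raise ValueError("k must satisfy 0 <= k <= n")
    if k == 0:
        return 1.0
    log_p, log_q = math.log(p), math.log1p(-p)
    lower = 0.0
    for i in range(k):  # sum P(X = i) for i < k, computed in log space
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                   + i * log_p + (n - i) * log_q)
        lower += math.exp(log_pmf)
    return max(0.0, 1.0 - lower)


def can_filter_out(exceed_count: int, n_perm: int, threshold: float,
                   alpha: float = 0.05) -> bool:
    """True if the one-sided lower confidence bound on the p-value exceeds threshold."""
    return binomial_upper_tail(exceed_count, n_perm, threshold) < alpha


# 20 exceedances out of 100 permutations rule out a p-value near 1e-7.
print(can_filter_out(20, 100, 1e-7))  # True
# Zero exceedances carry no evidence against a small p-value.
print(can_filter_out(0, 100, 1e-7))   # False
```

Sites that are filtered out need no further permutations, so the expensive later rounds are spent only on phenotypes that remain plausible candidates.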

One advantage of the permutation approach is that it allows using certain statistics while overlooking otherwise important methodological caveats. For example, it is known that using the REML estimator for ascertained case-control studies leads to incorrect estimates⁴⁰. However, under the mild conditions considered here, the probability of false positives will remain calibrated, although the test may be underpowered. Indeed, any statistic that captures a correspondence between a phenotype and genetics will be suitable here.

We note that our use of SAMC is also independent of our choice of the REML estimator as the statistic. Indeed, any other statistic, for which it is possible to determine in which region of a partition the statistic of a permuted phenotype falls, can be used. Natural candidates are the score test statistic³⁰, or the PCGC regression⁴⁰ statistic.

One critical issue that is not covered by our current approach is the limitation to one variance component. Many applications currently use LMMs using multiple variance components, specifically by dividing the genome into regions and constructing a variance component from each region. In those cases, the contribution of each variance component is then estimated or tested for being significant. Estimating multiple variance components with REML is computationally intensive, and the derivative does not appear to lend itself to a simple analytical expression as in the single variance component case. However, as PCGC regression provides an alternative, faster estimation method, using its statistic is a particularly attractive avenue of research in the context of multiple variance components.

Typically, a preliminary step in heritability estimation is the eigendecomposition of the kinship matrix. This procedure is cubic in the number of individuals, and may therefore be too computationally expensive for large datasets that include many individuals. Recently, it has been suggested to use conjugate gradient methods in order to estimate heritability⁴¹, thus avoiding the cubic complexity. A natural extension of our proposed method is to derive a procedure that calculates the derivative of the restricted likelihood function using conjugate gradient methods. Such a procedure could result in a quadratic complexity, as it avoids the eigendecomposition.

One disadvantage of SAMC is that, unlike the simple permutation test, it has to be run sequentially. However, it is possible to run multiple shorter chains simultaneously, each with a less strict convergence criterion, and then to aggregate the results in order to obtain a more accurate estimate²⁶. Preliminary results appear encouraging, but a more thorough study of this tradeoff remains a direction for future research, as do other variants of SAMC⁴².

Finally, the ideas presented here can be readily extended to the testing of genetic correlations, which determine if there is evidence that two phenotypes have common underlying genetic drivers (e.g., two diseases or two gene expression profiles)⁴³.

Methods

For clarity of presentation, we begin by defining the heritability under the LMM. We then introduce our improved method for fast permutation testing for heritability, while reviewing relevant results from the ALBI method³³ and the SAMC algorithm.

The linear mixed model

We first present the standard variance components model⁴⁴. Let n be the number of individuals (or observations, in general) and let y be an n × 1 vector of phenotype measurements, one per individual. Let X be an n × p matrix of p covariates, associated with fixed effects (possibly including an intercept vector 1_n as a first column, as well as other covariates such as sex and age). Let β be a p × 1 vector of fixed effects. Let Z be an n × m standardized (i.e., columns have zero mean and unit variance) genotype matrix containing the m SNPs we test. Finally, let K be a kinship matrix, which can be taken to be any symmetric positive-semidefinite matrix that encodes similarity between individuals, using any biomarkers, e.g., a set of SNPs. A standard choice for K is a weighted dot product¹²; formally, define K = ZWZ^T, where W is a non-negative m × m diagonal matrix assigning a weight per SNP (e.g., W_i,i = 1/m; see ref. ³⁰ for a discussion).

Then, y is assumed to follow:

$${\mathbf{y}}\sim {\cal N}\left( {{\mathbf{X}}{\beta} ,\sigma _g^2{\mathbf{K}} + \sigma _e^2{\mathbf{I}}_n} \right){\kern 1pt} ,$$

(1)

The fixed effects β and the coefficients $\sigma _g^2$ and $\sigma _e^2$ are the parameters of the model.

The narrow-sense heritability due to genotyped common SNPs is defined as the proportion of total variance explained by a genetic component⁴⁵:

$$h^2 = \frac{{\sigma _g^2}}{{\sigma _g^2 + \sigma _e^2}}.$$

Defining $\sigma _p^2 = \sigma _g^2 + \sigma _e^2$, Equation (1) becomes:

$${\mathbf{y}}\sim {\cal N}\left( {{\mathbf{X}}\beta ,\sigma _p^2{\mathbf{V}}_{h^2}} \right),$$

where ${\mathbf{V}}_{h^2} = h^2{\mathbf{K}} + (1 - h^2){\mathbf{I}}_n$.
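As an illustration of the definitions in this subsection, the following is a minimal sketch, using only the Python standard library, of how K = ZWZ^T with W_i,i = 1/m and the matrix V_{h²} could be assembled from a small genotype matrix. The function names and the toy genotypes are hypothetical and are not taken from the FEATHER code; columns are standardized with the population variance (division by n), which is an assumption of this sketch.

```python
import math

def standardize_columns(G):
    """Center each column to mean 0 and scale it to unit (population) variance."""
    n, m = len(G), len(G[0])
    Z = [[0.0] * m for _ in range(n)]
    for j in range(m):
        col = [G[i][j] for i in range(n)]
        mean = sum(col) / n
        sd = math.sqrt(sum((g - mean) ** 2 for g in col) / n)
        if sd == 0.0:
            raise ValueError("a constant SNP column cannot be standardized")
        for i in range(n):
            Z[i][j] = (col[i] - mean) / sd
    return Z

def kinship(Z, weights=None):
    """K = Z W Z^T for a diagonal W; defaults to W_ii = 1/m."""
    n, m = len(Z), len(Z[0])
    w = weights if weights is not None else [1.0 / m] * m
    return [[sum(w[k] * Z[i][k] * Z[j][k] for k in range(m)) for j in range(n)]
            for i in range(n)]

def heritability(sigma_g2, sigma_e2):
    """h^2 = sigma_g^2 / (sigma_g^2 + sigma_e^2)."""
    return sigma_g2 / (sigma_g2 + sigma_e2)

def v_matrix(K, h2):
    """V_{h^2} = h^2 K + (1 - h^2) I_n."""
    n = len(K)
    return [[h2 * K[i][j] + ((1.0 - h2) if i == j else 0.0) for j in range(n)]
            for i in range(n)]

# Hypothetical toy genotypes: 4 individuals, 3 SNPs coded as 0/1/2.
G = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [0, 2, 1]]
Z = standardize_columns(G)
K = kinship(Z)
h2 = heritability(0.3, 0.7)
V = v_matrix(K, h2)
```

Because Z is mean-centered here, every row of K sums to zero, so the constant vector is an eigenvector of K with eigenvalue 0.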

Estimation and testing of heritability with REML

The most common way of estimating h² is REML estimation. REML consists of maximizing the likelihood function associated with the projection of the phenotype onto the subspace orthogonal to that of the fixed effects of the model⁴⁶. The logarithm of the REML function is, up to additive and multiplicative constants:

$$ \ell _{\rm{REML}} ({\mathbf{y}}; h^{2},\sigma _{p}^{2},\beta ) \propto \\ - (n - p){\mathrm{log}}\sigma _{p}^{2} - {\mathrm{log}} | {\mathbf{V}}_{h^{2}}| - {\mathrm{log}}|{\mathbf{X}}^{T}{{\mathbf{V}}_{{h^{2}}}^{-1}} {\mathbf{X}}| - \frac{({{\mathbf{y}} - {\mathbf{X}} {{\beta}} } )^{T}{\mathbf{V}}_{h^{2}}^{-1}( {\mathbf{y}} - {\mathbf{X}}{{\beta}} )}{{\sigma _{p}^{2}}},$$

In practice, often some of the eigenvalues of K are zero or near-zero. This occurs, for example, when K is constructed from a mean-centered Z, in which case the constant vector 1 would be an eigenvector with the eigenvalue 0. Another example is when Z has fewer SNPs than samples, in which case K will be low rank. When this is the case, the likelihood at h² = 1 may be undefined or prone to numerical instability. To avoid this, we project both the phenotype and covariates to the subspace spanned by eigenvectors corresponding to nonzero eigenvalues. Let U be the matrix whose columns are the eigenvectors of K, and let d_i be the eigenvalues of K, for i = 1,…,n. If there are z eigenvalues larger than a sufficiently small threshold, denote by U_z the matrix with the first z eigenvectors. Effectively, this amounts to replacing $\textbf{V}_{h^2}$ in $\ell _{{\rm{REML}}}$ with ${\mathbf{U}}_z^{T}{{\mathbf{V}}}_{h^{2}}{\mathbf{U}}_{z}$.
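The projection step can be sketched as follows, assuming the eigenvectors of K are already available as a list of columns (`U_cols`) with matching eigenvalues `d`. The helper names and the tolerance value are illustrative assumptions. The sketch also uses the fact that, in the eigenbasis, U_z^T V_{h²} U_z is diagonal with entries h²d_i + (1 − h²).

```python
import math

def project_onto_nonzero_eigenspace(U_cols, d, v, tol=1e-8):
    """Return (U_z^T v, kept eigenvalues), dropping eigenvalues <= tol."""
    keep = [i for i, d_i in enumerate(d) if d_i > tol]
    coords = [sum(a * b for a, b in zip(U_cols[i], v)) for i in keep]
    return coords, [d[i] for i in keep]

def projected_variances(d_kept, h2):
    """Diagonal of U_z^T V_{h^2} U_z: h^2 * d_i + (1 - h^2)."""
    return [h2 * d_i + (1.0 - h2) for d_i in d_kept]

# Hypothetical orthonormal eigenvectors for n = 3; the constant vector has eigenvalue 0.
s2, s3, s6 = math.sqrt(2.0), math.sqrt(3.0), math.sqrt(6.0)
U_cols = [
    [1 / s2, -1 / s2, 0.0],
    [1 / s6, 1 / s6, -2 / s6],
    [1 / s3, 1 / s3, 1 / s3],
]
d = [2.0, 1.0, 0.0]
y_proj, d_kept = project_onto_nonzero_eigenspace(U_cols, d, [1.0, 2.0, 3.0])
diag_V = projected_variances(d_kept, h2=1.0)
```

With the zero eigenvalue removed, the projected covariance stays invertible even at h² = 1, which is the instability the projection is meant to avoid.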

The common way to test for the statistical significance of a nonzero heritability value is using the generalized restricted likelihood ratio test statistic

$${\mathrm{\Lambda }} = \frac{{\max _{h^2,\sigma _p^2,\beta }{\cal L}_{{\rm{REML}}}\left( {{\mathbf{y}};h^2,\sigma _p^2,\beta } \right)}}{{\max _{\sigma _p^2,\beta }{\cal L}_{{\rm{REML}}}\left( {{\mathbf{y}};0,\sigma _p^2,\beta } \right)}},$$

where $\ell _{{\rm{REML}}} = {\mathrm{log}}{\cal L}_{{\rm{REML}}}$. Asymptotically, we have the distribution

$$2\,{\mathrm{log}}\,{\mathrm{\Lambda }}\sim 0.5 \cdot \chi _0^2 + 0.5 \cdot \chi _1^2,$$

where $\chi _1^2$ is the chi-square distribution with 1 degree of freedom, and $\chi _0^2$ is the point mass at zero^47,48.
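For reference, the parametric p-value implied by this asymptotic mixture can be computed in closed form. The sketch below is a minimal illustration with hypothetical function names. It uses the identity P(χ²₁ ≥ x) = erfc(√(x/2)) and returns 1 when the statistic is zero, since the whole mixture lies at or above zero.

```python
import math

def chi2_1_sf(x):
    """Survival function of the chi-square distribution with 1 d.o.f."""
    return math.erfc(math.sqrt(x / 2.0))

def rlrt_mixture_pvalue(stat):
    """P(T >= stat) for T ~ 0.5 * chi2_0 + 0.5 * chi2_1, with stat = 2 log(Lambda)."""
    if stat <= 0.0:
        return 1.0
    return 0.5 * chi2_1_sf(stat)

# The 95% quantile of chi2_1 is about 3.8415, so the mixture p-value is about 0.025.
p_example = rlrt_mixture_pvalue(3.841458820694124)
```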

Permutation testing for heritability

Monte Carlo permutation testing: The p-value of the full permutation test is calculated by enumerating over all permutations to calculate

$$p_{{\mathrm{perm}}} = \frac{1}{{n!}}\left| {\left\{ {\pi \in S_n{\kern 1pt} |{\kern 1pt} \hat h^2(\pi ({\mathbf{y}})) \ge \hat h^2({\mathbf{y}})} \right\}} \right|,$$

where π(y) is the application of the permutation π on the phenotype y, and S_n is the set of all permutations of n elements. This is an exact test—that is, under a null hypothesis invariant to permutations, p_perm is distributed uniformly. However, since the number of permutations, n!, is huge, the common approach is to employ a Monte Carlo approximation. In detail, let π₁,…,π_N be N random permutations of n elements; the p-value of the test is:

$$p_{{\mathrm{MC}}} = \frac{1}{N}\left| {\left\{ {\pi _t{\kern 1pt} |{\kern 1pt} \hat h^2(\pi _t({\mathbf{y}})) \ge \hat h^2({\mathbf{y}})} \right\}} \right|$$

The p-value p_MC is an approximation of the required p-value p_perm. Moreover, since each permutation was chosen randomly and with replacement, p_MC can be seen as the result of a binomial experiment. Therefore, we can calculate accurate confidence intervals for p_perm given p_MC, e.g., using the Clopper–Pearson method⁴⁹.
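The Monte Carlo estimate and its Clopper–Pearson interval can be sketched as below. This is an illustrative stand-in, not the authors' implementation: the `statistic` callable is a placeholder for the heritability estimate, the interval bounds are found by bisection on the binomial CDF instead of beta quantiles, and the direct use of `math.comb` (Python 3.8+) is only suitable for a modest number of permutations N.

```python
import math
import random

def mc_permutation_pvalue(y, statistic, n_perm, rng):
    """Return (hits, p_MC), counting permutations with statistic >= observed."""
    observed = statistic(y)
    perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(perm)  # shuffling any arrangement yields a uniform permutation
        if statistic(perm) >= observed:
            hits += 1
    return hits, hits / n_perm

def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p ** i * (1.0 - p) ** (n - i) for i in range(k + 1))

def _solve_decreasing(f, target):
    """Find p in [0, 1] with f(p) = target, for f decreasing in p (bisection)."""
    lo, hi = 0.0, 1.0
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if f(mid) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def clopper_pearson(k, n, alpha=0.05):
    """Exact two-sided (1 - alpha) interval for a binomial proportion."""
    lower = 0.0 if k == 0 else _solve_decreasing(
        lambda p: binom_cdf(k - 1, n, p), 1.0 - alpha / 2.0)
    upper = 1.0 if k == n else _solve_decreasing(
        lambda p: binom_cdf(k, n, p), alpha / 2.0)
    return lower, upper

# Hypothetical toy statistic: inner product of the permuted phenotype with a fixed vector.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.5, 0.5, 3.0, 2.0, 6.0, 4.0]
hits, p_mc = mc_permutation_pvalue(
    y, lambda v: sum(a * b for a, b in zip(x, v)), 200, random.Random(0))
interval = clopper_pearson(hits, 200)
```

When no permutation reaches the observed statistic (hits = 0), the upper bound of the interval still gives a usable bound on p_perm, which is often the more informative quantity for very small p-values.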

Covariates: If there are no covariates, or the only covariate is the constant vector (i.e., X = 1_n), then the permutation test does not require taking covariates into consideration. In the general case, we apply the same permutation to each covariate vector as we do to the phenotype. We note that the permutation approach requires exchangeability of the residuals, which in general are not observed in the presence of non-constant covariates in the model. However, we verified in simulations that the test remains exact or approximately exact under various settings; this is theoretically supported in recent works on permutation tests in linear regression^50,51,52. See Supplementary Note 3 for an extended discussion.

Speeding up evaluation by using the likelihood derivative

The naive calculation above requires, for each permuted phenotype π_t(y), the estimation of its heritability using REML, $\hat h^2(\pi _t({\mathbf{y}}))$. However, instead of explicitly calculating $\hat h^2(\pi ({\mathbf{y}}))$, we are only interested in whether $\hat h^2(\pi ({\mathbf{y}})) \ge H^2$, where $H^2 = \hat h^2({\mathbf{y}})$ is the heritability estimate of the unpermuted phenotype.

In³³, it is shown that when X = 1_n, checking if $\hat h^2(\pi ({\mathbf{y}})) \ge H^2$ can equivalently be performed by computing u = U^Tπ(y), and checking if

$$\mathop {\sum}\limits_{i = 1}^n \xi _i^{H^2}u_i^2 > 0,$$

(2)

where

$$\xi _i^{H^2} = \frac{1}{{H^2(d_i - 1) + 1}}\left( {\frac{{d_i - 1}}{{H^2(d_i - 1) + 1}} - \frac{1}{{n - 1}}\mathop {\sum}\limits_{j = 1}^{n - 1} \frac{{d_j - 1}}{{H^2(d_j - 1) + 1}}} \right), {\mathrm{for}}\, {i=1},...,{n-1},$$

and $\xi _n^{H^2} = 0$. The sign of the expression in Eq. (2) is equal to the sign of $\frac{{\partial \ell _{{\mathrm{REML}}}}}{{\partial h^2}}(H^2)$, the derivative of $\ell _{{\mathrm{REML}}}$ at the point H². Therefore, assuming the restricted likelihood function is well behaved, a positive derivative indicates that the REML heritability estimate is larger than H². Similar expressions are defined for a general X in³³.

Therefore, once the eigendecomposition of K is obtained, calculating p_MC may be performed in a time complexity quadratic in n:

1. Given y and its heritability estimate H², calculate $\xi _i^{H^2}$ (complexity: O(n)).
2. Draw π₁,…,π_N $\in$ S_n (complexity: O(nN)).
3. For t = 1,…,N:
  (a) Calculate u_t = U^Tπ_t(y) (complexity: O(n²)).
  (b) Let b_t = 1 if $\mathop {\sum}\nolimits_{i = 1}^n \xi _i^{H^2}({\mathbf{u}}_t)_i^2 \ge 0$ and b_t = 0 otherwise (complexity: O(n)).
4. Return $p = \frac{1}{N}\mathop {\sum}\nolimits_{t = 1}^N b_t$ (complexity: O(N)).

The total complexity is O(n²N). For a general covariate matrix X, the only change will be in the condition checked in step (b), whose complexity is O(np² + p³) instead of O(n), resulting in a final complexity of O((n² + np² + p³)N), as detailed in³³.
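The steps listed above for the case X = 1_n can be sketched in plain Python as follows. This is a hedged illustration, not the FEATHER code: the function names are hypothetical, the eigenvectors are passed as a list of columns (`U_cols`), and the eigenvalues `d` are assumed to be ordered so that the last entry belongs to the constant eigenvector, for which the weight is zero.

```python
import math
import random

def xi_weights(d, H2):
    """Weights xi_i^{H^2} of Eq. (2); the last weight (constant eigenvector) is 0."""
    n = len(d)
    denom = [H2 * (d_i - 1.0) + 1.0 for d_i in d[: n - 1]]
    a = [(d_i - 1.0) / den for d_i, den in zip(d[: n - 1], denom)]
    a_mean = sum(a) / (n - 1)
    return [(a_i - a_mean) / den for a_i, den in zip(a, denom)] + [0.0]

def derivative_sign_stat(xi, U_cols, y_perm):
    """sum_i xi_i * u_i^2, where u = U^T y_perm; its sign is that of the derivative."""
    total = 0.0
    for xi_i, col in zip(xi, U_cols):
        u_i = sum(c * v for c, v in zip(col, y_perm))
        total += xi_i * u_i * u_i
    return total

def fast_permutation_pvalue(y, U_cols, d, H2, n_perm, rng):
    xi = xi_weights(d, H2)                      # step 1, O(n)
    perm = list(y)
    hits = 0
    for _ in range(n_perm):                     # steps 2 and 3
        rng.shuffle(perm)
        if derivative_sign_stat(xi, U_cols, perm) >= 0.0:  # O(n^2) per permutation
            hits += 1
    return hits / n_perm                        # step 4

# Hypothetical n = 3 example; the third eigenvector is the constant vector.
s2, s3, s6 = math.sqrt(2.0), math.sqrt(3.0), math.sqrt(6.0)
U_cols = [
    [1 / s2, -1 / s2, 0.0],
    [1 / s6, 1 / s6, -2 / s6],
    [1 / s3, 1 / s3, 1 / s3],
]
d = [2.0, 0.5, 0.0]
xi = xi_weights(d, H2=0.3)
p_fast = fast_permutation_pvalue([1.0, -1.0, 0.0], U_cols, d, 0.3, 100, random.Random(0))
```

No REML optimization is run inside the loop; each permutation costs one matrix-vector product and one weighted sum, which is where the O(n²N) total comes from.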

Reducing the number of sampled permutations using SAMC

To cope with the major computational hurdle of permutation testing, we use an efficient p-value evaluation procedure based on the Stochastic Approximation Markov Chain Monte Carlo (SAMC) algorithm^26,27. A description of the SAMC algorithm and its tuning is given in Supplementary Note 4.

In summary, let the proposal distribution q(π_t, τ) define the probability of choosing a new permutation τ, given that the current permutation is π_t. Let ${\mathbf{e}}_i = \left( {0, \ldots ,0,\underbrace 1_i,0, \ldots ,0} \right)$. Let D + 1 be the number of intervals in the partitioning of [0,1]. For a permutation π∈S_n, let J(π) be the index of the interval in which $\hat h^2(\pi ({\mathbf{y}}))$ falls. Let $\theta _1^{(t)} \ldots ,\theta _{D + 1}^{(t)}$ be the logarithm of our current estimates of partition sizes, up to a multiplicative (in log scale, additive) constant. The algorithm is:

1. Initialize a uniform estimate, $\theta _1^{(1)} = \ldots = \theta _{D + 1}^{(1)} = 0$.
2. Choose a random initial permutation π₁.
3. For t = 1,…,T (or until convergence):
  (a) Simulate a sample π_{t+1} by a single Metropolis-Hastings update, as follows:
    i. Generate τ according to the proposal distribution q(π_t,τ).
    ii. Calculate the ratio $r = {\mathrm{exp}}\left( {\theta _{J(\pi _t)}^{(t)} - \theta _{J(\tau )}^{(t)}} \right) \cdot q(\tau ,\pi _t)/q(\pi _t,\tau )$.
    iii. Accept the proposed move with a probability of min(1, r). If accepted, set π_{t+1} = τ. Otherwise, set π_{t+1} = π_t.
  (b) Update the estimates: For i = 1,…,D + 1, set $\theta _i^{(t + 1)} = \theta _i^{(t)} + \gamma ^{(t)}\left( {\left( {{\mathbf{e}}_{J(\pi _{t + 1})}} \right)_i - \frac{1}{{D + 1}}} \right)$, where γ^(t) is called the gain factor and is defined as γ^(t) = t₀/max(t₀, t).
4. Return $\mathrm{exp}(\theta _{D + 1}^{(t)})/\mathop {\sum}\nolimits_{i = 1}^{D + 1} \mathrm{exp}(\theta _i^{(t)})$.
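The algorithm above can be sketched for a generic permutation statistic as follows. Several choices here are assumptions made for illustration only and are not taken from the paper or from Supplementary Note 4: the proposal swaps two random positions (a symmetric proposal, so the ratio q(τ,π_t)/q(π_t,τ) equals 1), the partition is given by caller-supplied thresholds `cuts` whose last entry is the observed statistic, and the toy statistic stands in for the heritability estimate.

```python
import bisect
import math
import random

def samc_pvalue(y, statistic, cuts, n_iter, t0, rng):
    """Estimate P(statistic(pi(y)) >= cuts[-1]) over uniformly random permutations.

    cuts holds D increasing thresholds that split the statistic's range into
    D + 1 regions; the last region, [cuts[-1], inf), is the p-value region.
    """
    n_regions = len(cuts) + 1

    def region(perm):
        # Number of thresholds <= statistic, i.e. a 0-based region index J(pi).
        return bisect.bisect_right(cuts, statistic(perm))

    theta = [0.0] * n_regions                       # step 1
    current = list(y)
    rng.shuffle(current)                            # step 2
    j_cur = region(current)
    for t in range(1, n_iter + 1):                  # step 3
        # (a) Metropolis-Hastings update with a symmetric swap proposal.
        i, k = rng.sample(range(len(current)), 2)
        proposal = list(current)
        proposal[i], proposal[k] = proposal[k], proposal[i]
        j_prop = region(proposal)
        log_r = theta[j_cur] - theta[j_prop]
        if rng.random() < math.exp(min(0.0, log_r)):
            current, j_cur = proposal, j_prop
        # (b) Stochastic approximation update with gain t0 / max(t0, t).
        gamma = t0 / max(t0, t)
        for idx in range(n_regions):
            indicator = 1.0 if idx == j_cur else 0.0
            theta[idx] += gamma * (indicator - 1.0 / n_regions)
    # Step 4, computed after subtracting max(theta) for numerical stability.
    top = max(theta)
    weights = [math.exp(v - top) for v in theta]
    return weights[-1] / sum(weights)

# Hypothetical toy problem: statistic is an inner product with a fixed vector x.
x = [1, 2, 3, 4, 5, 6]
y = [2, 1, 3, 5, 4, 6]
stat = lambda perm: sum(a * b for a, b in zip(x, perm))
cuts = [70, 80, stat(y)]
p_hat = samc_pvalue(y, stat, cuts, n_iter=5000, t0=50, rng=random.Random(0))
```

Working with the acceptance ratio on the log scale avoids overflow when the θ estimates drift far apart, which tends to happen when the last region is very small.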

The KORA dataset

The KORA project studies n = 1799 individuals from the general population living in the region of Augsburg, southern Germany⁵³. The measured phenotype is the proportion of methylated samples at a specific site, averaged across DNA samples of an individual. We used whole-blood samples of the KORA F4 study, as described elsewhere⁵⁴. Briefly, DNA methylation levels were collected using the Infinium HumanMethylation450K BeadChip array (Illumina). Beta Mixture Quantile (BMIQ)⁵⁵ normalization was applied to the methylation levels. Further processing was performed as in ref.⁵⁶; briefly, genotyping was performed with the Affymetrix 6.0 SNP Array (534,174 SNP markers after quality control), with further imputation using HapMap2 as a reference panel. A total of 657,103 probes remained for the analysis. In summary, a total of 431,366 methylation site phenotypes, and 657,103 SNPs, were available for analysis. Covariates used in this study are age, sex, and smoking status.

The GTEx dataset

The Genotype-Tissue Expression (GTEx)³² Project is a US National Institutes of Health (NIH) Common Fund project that aims to collect a comprehensive set of tissues from 900 deceased donors (for a total of about 20,000 samples) and to provide the scientific community with a database of genetic associations with molecular traits such as mRNA levels. We used 22,171 gene expression profiles obtained from whole-blood samples of 338 individuals, preprocessed as described in ref. ³² and using the same covariates.

Benchmarks

We used GCTA version 1.26²⁹ and pylmm⁵⁷. We used the C++ implementation of FaST-LMM, FastLmmC v2.07.20140723³⁸ and calculated the kinship matrix and its eigendecomposition in advance using the -eigen flag. We only considered the GWAS analysis but not the data loading time, as reported by FaST-LMM.

Code availability

FEATHER is available at https://github.com/cozygene/feather.

Data availability

The informed consents given by the KORA study participants do not cover data posting in public databases. However, data are available upon request from KORA-gen (http://www.helmholtz-muenchen.de/kora-gen). Data requests can be submitted online and are subject to approval by the Steering Committee of the Research Network for Community Medicine (for SHIP data) and the KORA Board. All other relevant data are available upon request.

References

1. Price, A. L. et al. Single-tissue and cross-tissue heritability of gene expression via identity-by-descent in related or unrelated individuals. PLoS Genet. 7, e1001317 (2011).
2. Wright, F. A. et al. Heritability and genomics of gene expression in peripheral blood. Nat. Genet. 46, 430–437 (2014).
3. Lloyd-Jones, L. R. The genetic architecture of gene expression in peripheral blood. Am. J. Hum. Genet. 100, 228–237 (2017).
4. Sun, S. et al. Differential expression analysis for RNAseq using Poisson mixed models. Nucleic Acids Res. 45, e106 (2017).
5. Bell, J. T. & Spector, T. D. DNA methylation studies using twins: what are they telling us? Genome Biol. 13, 172 (2012).
6. Quon, G., Lippert, C., Heckerman, D. & Listgarten, J. Patterns of methylation heritability in a genome-wide analysis of four brain regions. Nucleic Acids Res. 41, 2095–2104 (2013).
7. McRae, A. F. et al. Contribution of genetic variation to transgenerational inheritance of DNA methylation. Genome Biol. 15, R73 (2014).
8. Van Dongen, J. et al. Genetic and environmental influences interact with age and sex in shaping the human methylome. Nat. Commun. 7, 11115 (2016).
9. Ganjgahi, H. et al. Fast and powerful heritability inference for family-based neuroimaging studies. Neuroimage 115, 256–268 (2015).
10. Ge, T. et al. Massively expedited genome-wide heritability analysis (MEGHA). Proc. Natl Acad. Sci. USA 112, 2479–2484 (2015).
11. Zhao, N. et al. Testing in microbiome-profiling studies with MiRKAT, the microbiome regression-based kernel association test. Am. J. Hum. Genet. 96, 797–807 (2015).
12. Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565–569 (2010).
13. Tzeng, J.-Y. & Zhang, D. Haplotype-based association analysis via variance-components score test. Am. J. Hum. Genet. 81, 927–938 (2007).
14. Kwee, L. C., Liu, D., Lin, X., Ghosh, D. & Epstein, M. P. A powerful and flexible multilocus association test for quantitative traits. Am. J. Hum. Genet. 82, 386–397 (2008).
15. Wu, M. C. et al. Powerful SNP-set analysis for case-control genome-wide association studies. Am. J. Hum. Genet. 86, 929–942 (2010).
16. Listgarten, J. et al. A powerful and efficient set test for genetic markers that handles confounders. Bioinformatics 29, 1526–1533 (2013).
17. Fusi, N., Lippert, C., Lawrence, N. D. & Stegle, O. Warped linear mixed models for the genetic analysis of transformed phenotypes. Nat. Commun. 5, 4890 (2014).
18. Hoeffding, W. The large-sample power of tests based on permutations of observations. Ann. Math. Stat. 23, 169–192 (1952).
19. Kimmel, G. & Shamir, R. A fast method for computing high-significance disease association in large population-based studies. Am. J. Hum. Genet. 79, 481–492 (2006).
20. Samuh, M. H., Grilli, L., Rampichini, C., Salmaso, L. & Lunardon, N. The use of permutation tests for variance components in linear mixed models. Commun. Stat. Theory Methods 41, 3020–3029 (2012).
21. Zeng, P., Zhao, Y., Li, H., Wang, T. & Chen, F. Permutation-based variance component test in generalized linear mixed model with application to multilocus genetic association study. BMC Med. Res. Methodol. 15, 37 (2015).
22. Casale, F. P., Rakitsch, B., Lippert, C. & Stegle, O. Efficient set tests for the genetic analysis of correlated traits. Nat. Methods 12, 755–758 (2015).
23. Biard, L., Porcher, R. & Resche-Rigon, M. Permutation tests for centre effect on survival endpoints with application in an acute myeloid leukaemia multicentre study. Stat. Med. 33, 3047–3057 (2014).
24. Sinha, S. K. Bootstrap tests for variance components in generalized linear mixed models. Can. J. Stat. 37, 219–234 (2009).
25. Drikvandi, R., Verbeke, G., Khodadadi, A. & Nia, V. P. Testing multiple variance components in linear mixed-effects models. Biostatistics 14, 144–159 (2013).
26. Yu, K., Liang, F., Ciampa, J. & Chatterjee, N. Efficient p-value evaluation for resampling-based tests. Biostatistics 12, 582–593 (2011).
27. Liang, F., Liu, C. & Carroll, R. J. Stochastic approximation in Monte Carlo computation. J. Am. Stat. Assoc. 102, 305–320 (2007).
28. Liang, F. An overview of stochastic approximation Monte Carlo. Wiley Interdiscip. Rev. Comput. Stat. 6, 240–254 (2014).
29. Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
30. Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).
31. Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500 (2012).
32. GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
33. Schweiger, R. et al. Fast and accurate construction of confidence intervals for heritability. Am. J. Hum. Genet. 98, 1181–1192 (2016).
34. Chen, J., Chen, W., Zhao, N., Wu, M. C. & Schaid, D. J. Small sample kernel association tests for human genetic and microbiome association studies. Genet. Epidemiol. 40, 5–19 (2016).
35. Schweiger, R. et al. RL-SKAT: an exact and efficient score test for heritability and set tests. Genetics 207, 1275–1283 (2017).
36. Guennebaud, G. et al. Eigen v3. http://eigen.tuxfamily.org (2010).
37. Furlotte, N. A. & Eskin, E. Efficient multiple trait association and estimation of genetic correlation using the matrix-variate linear mixed-model. Genetics 200, 59–68 (2015).
38. Listgarten, J. et al. Improved linear mixed models for genome-wide association studies. Nat. Methods 9, 525–526 (2012).
39. Robbins, H. & Monro, S. A stochastic approximation method. Ann. Math. Stat. 22, 400–407 (1951).
40. Golan, D., Lander, E. S. & Rosset, S. Measuring missing heritability: inferring the contribution of common variants. Proc. Natl Acad. Sci. USA 111, E5272–E5281 (2014).
41. Loh, P.-R. et al. Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat. Genet. 47, 1385–1392 (2015).
42. Liang, F., Liu, C. & Carroll, R. Advanced Markov chain Monte Carlo methods: learning from past samples (John Wiley & Sons, 2011).
43. Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
44. Searle, S. R., Casella, G. & McCulloch, C. E. Variance components (John Wiley & Sons, New Jersey, 2009).
45. Visscher, P. M., Hill, W. G. & Wray, N. R. Heritability in the genomics era: concepts and misconceptions. Nat. Rev. Genet. 9, 255–266 (2008).
46. Patterson, H. D. & Thompson, R. Recovery of inter-block information when block sizes are unequal. Biometrika 58, 545–554 (1971).
47. Chernoff, H. On the distribution of the likelihood ratio. Ann. Math. Stat. 25, 573–578 (1954).
48. Moran, P. A. Maximum-likelihood estimation in non-standard conditions. Math. Proc. Camb. Philos. Soc. 70, 441–450 (1971).
49. Clopper, C. J. & Pearson, E. S. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26, 404–413 (1934).
50. Schmoyer, R. L. Permutation tests for correlation in regression errors. J. Am. Stat. Assoc. 89, 1507–1516 (1994).
51. Anderson, M. J. & Robinson, J. Permutation tests for linear models. Aust. N.Z. J. Stat. 43, 75–88 (2001).
52. Nyblom, J. in Modern Nonparametric, Robust and Multivariate Methods 69–90 (Springer, Berlin, Germany, 2015).
53. Holle, R. et al. KORA-a research platform for population based health research. Gesundheitswesen 67, 19–25 (2005).
54. Pfeiffer, L. et al. DNA methylation of lipid-related genes affects blood lipid levels. Circ. Cardiovasc. Genet. 8, 334–342 (2015).
55. Teschendorff, A. E. et al. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics 29, 189–196 (2013).
56. Kolz, M. et al. Meta-analysis of 28,141 individuals identifies common variants within five new loci that influence uric acid concentrations. PLoS Genet. 5, e1000504 (2009).
57. Furlotte, N. A., Heckerman, D. & Lippert, C. Quantifying the uncertainty in heritability. J. Hum. Genet. 59, 269–275 (2014).

Acknowledgements

The authors would like to thank Jennifer Listgarten for valuable suggestions. R.S. is supported by the Colton Family Foundation. E.H. and E.R. were partially supported by National Science Foundation (NSF) grant 1705197. E.R. and R.S. were supported in part by the Israel Science Foundation (Grant 1425/13) and by the Edmond J. Safra Center for Bioinformatics at Tel-Aviv University. The KORA study was initiated and financed by the Helmholtz Zentrum München German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria. Furthermore, KORA research was supported within the Munich Center of Health Sciences (MC-Health), Ludwig-Maximilians-Universität, as part of LMUinnovativ.

Author information

Authors and Affiliations

Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 6997801, Israel
Regev Schweiger & Elior Rahmani
School of Mathematical Sciences, Department of Statistics, Tel Aviv University, Tel Aviv, 69978, Israel
Eyal Fisher & Saharon Rosset
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, 02115, MA, USA
Omer Weissbrod
Institute of Genetic Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, 85764, Germany
Martina Müller-Nurasyid
Department of Medicine I, Ludwig-Maximilians-Universität, Munich, 80539, Germany
Martina Müller-Nurasyid
DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, 80636, Germany
Martina Müller-Nurasyid & Melanie Waldenberger
Institute of Epidemiology II, Helmholtz Zentrum München - German Research Center for Environmental Health, 85764, Neuherberg, Germany
Sonja Kunze, Christian Gieger & Melanie Waldenberger
Research Unit of Molecular Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, 85764, Neuherberg, Germany
Sonja Kunze, Christian Gieger & Melanie Waldenberger
Los Angeles, University of California Los Angeles, Los Angeles, 90095, CA, USA
Eran Halperin
Department of Anesthesiology and Perioperative Medicine, University of California, Los Angeles, 90095, CA, USA
Eran Halperin

Contributions

R.S. led the development of the statistical method, ran the simulations, wrote the code, analyzed the simulated and real data, and led the writing of the manuscript. M.M.-N., S.K., C.G. and M.W. collected the KORA dataset. R.S., E.F., O.W., E.R., S.R. and E.H. contributed to method development and validation, and wrote the manuscript with input from all co-authors. E.H. supervised the project.

Corresponding author

Correspondence to Regev Schweiger.

Ethics declarations

Competing interests

R.S. is an employee of MyHeritage Ltd. The remaining authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Schweiger, R., Fisher, E., Weissbrod, O. et al. Detecting heritable phenotypes without a model using fast permutation testing for heritability and set-tests. Nat Commun 9, 4919 (2018). https://doi.org/10.1038/s41467-018-07276-w

Received: 07 November 2017
Accepted: 26 October 2018
Published: 21 November 2018
DOI: https://doi.org/10.1038/s41467-018-07276-w
