BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability

Hoggart, Clive J.; Choi, Shing Wan; García-González, Judit; Souaiaia, Tade; Preuss, Michael; O’Reilly, Paul F.

doi:10.1038/s41588-023-01583-9

Download PDF

Technical Report
Open access
Published: 20 December 2023

BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability

Nature Genetics volume 56, pages 180–186 (2024)Cite this article

7450 Accesses
74 Altmetric
Metrics details

Subjects

Abstract

Here we present BridgePRS, a novel Bayesian polygenic risk score (PRS) method that leverages shared genetic effects across ancestries to increase PRS portability. We evaluate BridgePRS via simulations and real UK Biobank data across 19 traits in individuals of African, South Asian and East Asian ancestry, using both UK Biobank and Biobank Japan genome-wide association study summary statistics; out-of-cohort validation is performed in the Mount Sinai (New York) BioMe biobank. BridgePRS is compared with the leading alternative, PRS-CSx, and two other PRS methods. Simulations suggest that the performance of BridgePRS relative to PRS-CSx increases as uncertainty increases: with lower trait heritability, higher polygenicity and greater between-population genetic diversity; and when causal variants are not present in the data. In real data, BridgePRS has a 61% larger average R² than PRS-CSx in out-of-cohort prediction of African ancestry samples in BioMe (P = 6 × 10⁻⁵). BridgePRS is a computationally efficient, user-friendly and powerful approach for PRS analyses in non-European ancestries.

Genome-wide association studies

Article 26 August 2021

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Genomic data in the All of Us Research Program

Article Open access 19 February 2024

Main

PRSs have typically been derived using European ancestry genome-wide association study (GWAS) data, resulting in substantially lower predictive power when applied to non-European samples, in particular those of African ancestry^1,2. The PRS trans-ancestry portability problem is well established and is caused by marked differences in linkage disequilibrium (LD), differences in allele frequency driven by genetic drift and natural selection, and gene–environment interactions affecting causal effect sizes³. Consequently, the etiological insights and clinical utility provided by PRS derived in Europeans may have limited relevance to individuals of non-European ancestries.

Increasing GWAS sample sizes for underrepresented populations is of critical importance for improving their PRS. However, optimal power will be achieved by using all GWASs available across ancestries for PRS prediction in any one ancestry; this is because causal genetic effect sizes are highly correlated globally, even between genetically distant ancestries⁴. PRS-CSx⁵, developed to tackle the PRS portability problem, makes cross-population inference on the inclusion of each single-nucleotide polymorphism (SNP) across the genome (or, more precisely, the degree of shrinkage of variant effect sizes to zero). PRS-CSx uses Bayesian modeling with a prior that strongly shrinks small effect sizes to zero, reducing the number of candidate SNPs to a minimal set. This is analogous to fine-mapping of causal variants. However, although the inclusion of causal variants in the PRS is ideal, fine-mapping approaches may not be as effective when causal variants are missing or when power is insufficient for them to be accurately identified.

We introduce BridgePRS, a novel Bayesian PRS method that also integrates trans-ancestry GWAS summary statistics. Unlike the fine-mapping approach of PRS-CSx, BridgePRS retains all variants within loci to best tag causal variants shared across ancestries. The focus is on correctly estimating causal effect sizes, which is key when the goal is prediction, rather than on estimating their location. This approach is less reliant on the inclusion and identification of causal variants. BridgePRS is most applicable to combining the information of a well-powered GWAS performed in a (discovery) population or populations not matched to the ancestry of the target sample, with a second GWAS of relatively limited power in a (target) population that is well-matched to the ancestry of the target sample.

We apply BridgePRS to simulated data and compare its performance with that of PRS-CSx and two single-ancestry PRS methods adapted to use GWAS data from multiple ancestries. The simulations demonstrate the different scenarios in which BridgePRS and PRS-CSx are optimal. We then use UK Biobank (UKB)⁶ and Biobank Japan (BBJ)^7,8 GWAS data to construct PRS for African, South Asian and East Asian ancestry samples. The resultant PRSs are validated in unseen UKB samples and in the entirely independent New York-based Mount Sinai BioMe biobank⁹, producing results consistent with the simulations.

Results

Overview of BridgePRS method

An overview of the BridgePRS modeling employed here is shown in Fig. 1. The key modeling (model 1 in Fig. 1; Methods) can be broken into two stages: (1) a PRS is trained and optimized using data from a large discovery population (for example, European) GWAS, with a zero-centered Gaussian prior distribution for SNP effect sizes (analogous to ridge regression) within putative loci; and (2) the SNP effect sizes of this PRS are treated as priors and updated in a Bayesian framework by those of the smaller target population (for example, African) GWAS. Thus, the two-stage Bayesian ridge approach of BridgePRS ‘bridges’ the PRS between the two populations.

**Fig. 1: Flow diagram describing the modeling of BridgePRS.**

The main causes of poor trans-ancestry PRS portability are differences in LD and allele frequencies between populations³. Differences in LD result in the best tag for a causal variant differing between populations. To account for the resultant uncertainty in the location of causal variants, BridgePRS averages SNP effects across SNPs within putative loci instead of selecting a single best SNP as performed by standard clumping and thresholding (C+T) PRS¹⁰. BridgePRS is first applied to the discovery population GWAS, using Bayesian modeling with zero-centered Gaussian priors, equivalent to penalized likelihood ridge regression, at putative loci. Given summary data from large GWAS in Europeans, we find that this procedure alone improves predictive accuracy in African and South Asian target data compared with choosing single best SNPs at putative loci. Thus, whereas the main BridgePRS method uses GWAS data from the discovery and target GWAS, the option of using only discovery GWAS is available in the BridgePRS software.

Stage 1 modeling results in multivariate Gaussian posterior distributions for SNP effect sizes at each locus. Stage 2 modeling integrates the (smaller) target population GWAS data into the PRS by using this posterior distribution as a prior distribution for SNP effect sizes of the target population. Stage 2 allows for different effect size estimates between the populations, caused by differences in LD, in allele frequencies driven by drift or selection, and by differences in causal effect sizes due to gene–environment interactions. Stages 1 and 2 both use conjugate prior–posterior updates, providing computationally efficient analytical solutions and enabling BridgePRS analyses to be performed rapidly.

Variation in causal allele frequencies between populations can mean that causal variants with relatively low minor allele frequency in the discovery population are estimated with large errors or missed altogether. To ameliorate this problem, PRSs are derived by applying BridgePRS stage 1 modeling to the target population data alone (model 2 in Fig. 1; Methods). Model 1 and model 2 PRSs are combined in model 3 (Fig. 1 and Methods).

Each stage of the modeling is fit across a spectrum of prior parameters and criteria to select loci for inclusion in the PRS calculation, with each combination of parameters giving rise to a unique PRS. These PRSs are then combined in a ridge regression fit using available genotype–phenotype test data, choosing the optimal ridge penalty parameters by cross-validation (Methods).

Benchmarking methods via simulation

We used the HAPGEN2 software¹¹ to simulate HAPMAP3 variants for 100,000 European, 40,000 African and 40,000 East Asian ancestry samples using 1000 Genomes Phase 3 (1000G) samples¹² as a reference. Simulations were restricted to 1,295,289 variants with minor allele frequency >1% in at least one of the three populations. Phenotypes were subsequently simulated under three models of genetic architecture in which causal variants were sampled from 1%, 5% and 10% of the available HAPMAP3 variants. Population-specific effect sizes were sampled from a multivariate Gaussian distribution with between-population correlation of 0.9. Genetic effects were combined assuming additivity, and Gaussian noise at two levels of variance was added to generate phenotypes with 25% and 50% SNP heritability. For each of the six scenarios of polygenicity and heritability, ten independent phenotypes were generated and analyses were run with and without inclusion of the causal variants.

Data were split into training for GWAS (80,000 European, 20,000 non-European), with the remainder split equally into 10,000 samples for model optimization (test data) and assessment of model performance (validation data). The performance of BridgePRS was compared with that of PRS-CSx, PRS-CS-mult and PRSice-meta. PRS-CS-mult applies the single-ancestry PRS-CS method¹³ to the populations under study and combines them by estimating weights in a linear regression using the test data. PRSice-meta applies clumping and thresholding, as implemented in PRSice¹⁴, to meta-analysis of the populations under study, selecting the LD panel from the two populations under study that optimizes prediction in the test data of the target population.

Polygenicity ranging from 1% to 10% (fraction of variants with nonzero effect sizes) is consistent with the findings of a recent study of 28 complex traits in the UKB¹⁵. Between-population correlation of causal variant effect sizes of 0.9 is consistent with the results of a multiancestry lipids GWAS in which causal variants were fine-mapped¹⁶ and with a recent study estimating a mean genetic correlation of 0.98 of causal variant effect sizes between ancestries across a range of continuous traits⁴. Approximately one-third to two-thirds of heritability is captured by common SNPs¹⁷; therefore, our simulation at 25% heritability implies a total heritability of 37.5–75.0%. The power of GWAS, and therefore PRS, is a function of sample size and heritability, such that doubling heritability is equivalent to doubling sample size in terms of power, as the standard error of a GWAS regression coefficient is the same if either the sample size or heritability is doubled (Methods). Therefore, our simulations at 50% SNP heritability and GWAS with 80,000 European samples are equivalent to 25% SNP heritability and GWAS with 160,000 European samples.

Figure 2 summarizes the results from PRS analyses performed on simulated data. Both BridgePRS and PRS-CSx outperformed the single-ancestry methods across all scenarios. BridgePRS performed better than PRS-CSx in analyses of African samples with 5% and 10% of variants assigned as causal. With 1% of variants causal, the methods had similar accuracy when causal variants were not included and at 25% heritability, and PRS-CSx performed better with causal variants included at 50% heritability. In analyses of East Asian samples, the same relative pattern was observed, but the differences were less pronounced, and PRS-CSx performed better in all scenarios in which 1% of variants were causal. Across the analyses, BridgePRS performed better compared with PRS-CSx when the causal variants were not included in the data (Extended Data Fig. 1). Overall, the simulations reveal that the performance of BridgePRS relative to that of PRS-CSx increases as the uncertainty increases: at lower heritability, higher polygenicity, greater between-population genetic diversity and when causal variants are not present in the data.

**Fig. 2: Predictive accuracy for different polygenic prediction methods in simulations.**

The theoretical proportion of heritability (h²) captured by a PRS derived by C+T, assuming independent causal variants, is ${r}^{2}/{h}^{2}={(1+m/n{h}^{2})}^{-1}$, where r² is the variance explained by the PRS, m is the number of causal variants and n is the GWAS sample size^18,19. Although BridgePRS and PRS-CSx are more sophisticated methods than C+T, the factor nh²/m in the equation, which is a measure of power to detect individual causal variant effects, is useful in describing the relative performance of the methods. Figure 2 shows results in relation to nh²/m (up to a proportionality constant): lower values favor BridgePRS, higher values favor PRS-CSx, and within the same target population the relative performance of the methods is similar for constant nh²/m. For example, results at 25% heritability and 5% causal variants showed the same relative method performance as results at 50% heritability and 10% causal variants, for both African and East Asian target samples (Fig. 2a versus Fig. 2b), as expected.

Extended Data Fig. 2 shows results for the same simulation settings as those used in the main analysis (Fig. 2) but with the GWAS training sample size halved (40,000 European, 10,000 non-European). Here, the performance of BridgePRS relative to PRS-CSx increased compared with the results with the full GWAS samples sizes at 50% heritability, and as predicted, the relative performance of the methods at 50% heritability was similar to that at 25% heritability and the full GWAS sample sizes. Extended Data Fig. 3 shows results at the original GWAS sample size and 75% heritability (equivalent to 240,000 European, 60,000 non-European GWAS training sample sizes and 25% heritability). As predicted, the performance of BridgePRS relative to PRS-CSx decreased compared with the results at 25% and 50% heritability.

These simulation analyses used 1000G data as their reference LD panel, that is, the correct LD panel. To assess the sensitivity of the methods to misspecification of LD, analyses were rerun using UKB data to estimate ancestry-specific LD. Extended Data Fig. 4 shows the performance of BridgePRS and PRS-CSx using an LD reference panel constructed from African and East Asian UKB samples relative to their performance using the 1000G reference panel. Both methods exhibited a minimal loss in predictive accuracy using UKB reference panels.

Benchmarking methods via real data

The four PRS methods were applied to UKB⁶ samples of African and South Asian ancestry across 19 continuous anthropometric and blood measure traits (for East Asian ancestry, see below). These traits were selected to maximize heritability and samples sizes of non-European individuals and to minimize their pairwise correlation (maximum r² < 0.3; Methods). For each trait, UKB samples of European, African and South Asian ancestry were split into training, test and validation sets in proportions of 2/3, 1/6 and 1/6, respectively. Sample sizes are shown in Extended Data Table 1. The training data were used to generate GWAS summary statistics, and the test data were used to select optimal model parameters. Results are shown for the resultant PRS in the unseen UKB validation data. In addition, an entirely out-of-sample validation study was performed by applying the PRS derived in the UKB to BioMe⁹ for the nine traits also available in BioMe.

Within the UKB there were 2,472 East Asian samples, which was too few to split into training (GWAS), test and validation sets as above. However, GWAS summary statistic data from BBJ were available for download^7,8. We combined these data with the European UKB GWAS summary statistics described above for 13 overlapping traits to estimate PRS for East Asian ancestry (as above). BridgePRS combines SNP effect size estimates across GWAS (as does the PRSice-meta method) and therefore requires effect sizes to be on the same scale. However, the BBJ summary statistics were generated after standardizing the trait values to have a mean of zero and a standard deviation of one, whereas the UKB GWASs were applied to raw trait data. Therefore, before applying the methods, the BBJ effect estimates and standard errors were transformed to the respective scale of the UKB measures, assuming that the BBJ and UKB trait values had the same variance. UKB East Asian samples were then split equally into test data for model optimization and validation data to assess model performance, as above. PRSs were also validated in East Asian BioMe samples across eight overlapping traits.

Trait sample sizes for each ancestral population in the UKB and BioMe cohorts are shown in Extended Data Tables 1 and 2. For all analyses, imputed genotype data were used.

Figure 3 shows boxplots of the variance explained (R²) by BridgePRS, PRS-CSx, PRS-CS-mult and PRSice-meta, for all traits analyzed, for prediction of African, South Asian and East Asian ancestry samples in the UKB and BioMe cohorts. Also shown are P values comparing the differences in within trait R², summed across all traits, between BridgePRS, PRS-CSx and PRS-CS-mult (not PRSice-meta as it was universally inferior across all comparisons). For prediction of African ancestry samples, BridgePRS had the highest median R² in UKB (0.031 versus 0.025) and a 61% higher median R² than PRS-CSx (0.044 versus 0.027) in the out-of-cohort BioMe samples (P = 6 × 10⁻⁵). For prediction of South Asian ancestry, there were no significant differences among methods. For prediction of East Asian samples, BridgePRS was inferior to both PRS-CSx and PRS-CS-mult in both UKB and BioMe, but these differences did not reach statistical significance.

**Fig. 3: Predictive accuracy of quantitative traits for different polygenic prediction methods and target populations.**

Figure 4 shows the individual results for each trait (R² with confidence intervals) analyzed in the out-of-sample prediction into the BioMe cohort. Although the methods showed similar results across many of the traits, the relative performance of the methods was highly variable, and for some traits there were distinct differences in the accuracy of the methods, especially in African ancestry samples. For example, in African ancestry samples, BridgePRS performed markedly better for mean corpuscular volume (MCV) and low-density lipoprotein (LDL), but markedly worse for eosinophil count. In both African and South Asian ancestry samples, the PRS-CSx prediction of height was highly inaccurate, possibly owing to the impact of variant nonoverlap between cohorts when applying PRS-CSx out of sample (‘Discussion’). The corresponding trait-specific results for prediction into UKB are shown in Extended Data Fig. 5, with a similar pattern of results observed. Of note, BridgePRS again performed markedly better for MCV and LDL in African ancestry samples.

**Fig. 4: Predictive accuracy of quantitative traits in BioMe samples.**

Discussion

We have introduced a trans-ancestry PRS method, BridgePRS, that leverages shared genetic effects across ancestries to increase the accuracy of PRS in non-European populations. We benchmarked BridgePRS and the leading trans-ancestry PRS method PRS-CSx, as well as single-ancestry PRS methods PRS-CS and PRSice adapted for trans-ancestry prediction, across a range of simulated and real data. In all analyses, target population PRS used GWAS summary statistics from Europeans and the target population. Results from our simulated data suggest that BridgePRS has higher performance relative to PRS-CSx when the uncertainty is greater: for lower heritability traits, for lower GWAS sample sizes, when the genetic signal is dispersed over more causal variants (higher polygenicity), for greater between-population diversity (for example, with European base and African target rather than Asian target) and when the causal variants are not included in the analyses. In all analyses of simulated data, BridgePRS and PRS-CSx had superior performance relative to the single-ancestry PRS methods.

Application of the methods to real GWAS summary statistics from the UKB and BBJ cohorts and validation in independent samples of African, South Asian and East Asian ancestry in the UKB and BioMe Biobank (recruited in the New York City area of the USA) gave results consistent with the simulations. Specifically, BridgePRS had superior average R² across the traits analyzed for samples of African ancestry, in which uncertainty was high owing to greater differences in LD between Africans and Europeans, and because of the relatively small African GWAS used. Likewise, PRS-CSx had superior average R² for samples of East Asian ancestry, for which differences in LD are smaller and the contributing East Asian GWASs are much larger (90,000–160,000). For prediction into South Asian ancestry, in which LD is relatively similar but the South Asian GWASs used are small, the methods performed similarly.

The stronger performance of PRS-CSx in the real data analysis of East Asian samples may also have been due to PRS-CSx not requiring GWAS to be on the same scale and thus being unaffected by the rescaling of the BBJ effect estimates. PRS-CSx is unaffected by GWAS scale as it combines information across ancestries on the shrinkage (to zero) of the effect estimate of each SNP and does not combine information on the effect sizes. The final PRS-CSx PRS estimate is derived by combining ancestry-specific PRS with relative weights estimated in a linear regression in the test data. Differences in scale between the base GWAS are accounted for by the linear regression weights. BridgePRS should have improved performance when the GWASs used are performed on the same scale, as it shares information on effect sizes across ancestries.

In UKB and BioMe data, we have demonstrated that BridgePRS has superior out-of-cohort predictive accuracy in genetic prediction in individuals of African ancestry. However, PRS-CSx has better accuracy when using UKB European and BBJ East Asian summary statistics to predict into individuals of East Asian ancestry. In general, in simulated and real data, BridgePRS performs better than PRS-CSx when uncertainty in mapping of causal variants is higher. Given the complementary nature of the two methods, either can be optimal depending on the trait and study characteristics; therefore, we recommend applying both methods until it is known which offers greater power in the given setting.

BridgePRS is a fully dedicated PRS tool that performs the entire PRS process, is computationally efficient based on conjugate prior–posterior updates and offers a theoretical approach to tackling the PRS portability problem, with particularly strong performance for deriving PRS in populations of African and other underrepresented ancestries.

Methods

The BridgePRS model

All modeling is performed at the locus-level, and each locus is assumed to be independent of all others. A locus is defined as a genomic region that captures all variants with r² > 0.01 within 1 Mb of a lead variant. Within loci, SNP effect sizes β are modeled by a multivariate Gaussian distribution, and we assume that the trait y of individuals with genotype data X at the locus follows a Gaussian distribution y ~ N(Xβ, ψI). Throughout, the Gaussian distribution is parameterized by its mean and precision matrix (inverse covariance matrix).

Below, we describe the BridgePRS methodology used to derive a PRS for a target population, population 2 (in our applications: African, South Asian and East Asian) for which we have summary statistics from a relatively underpowered GWAS, and GWAS summary statistics from a well-powered GWAS from a different ancestral population, population 1 (in our applications: European). We also assume that we have small datasets of individual-level genotype–phenotype data from both populations.

Stage 1: PRS informed by a single population

In stage 1 modeling, we train and optimize PRS using GWAS summary statistics and test genotype–phenotype data from a single population. To determine the PRS for population 2, this modeling stage is applied to populations 1 and 2 (model 1 in Fig. 1). Application to population 1 determines the prior distributions for population 2 SNP effect sizes used in stage 2 (see below). Application of stage 1 modeling to population 2 only (model 2 in Fig. 1) is used to identify effects specific to population 2 that are missed when using population 1 effects as a prior.

In stage 1, a zero-centered conjugate Gaussian prior is assigned for the SNP effects at each locus β ~ N(0, ψ(diag(λ))), where λ is a vector of SNP-specific shrinkage parameters. The use of a conjugate prior allows the posterior distribution of SNP effects to be determined analytically²²:

$${\boldsymbol{\upbeta}} \sim \,{\rm{N}}\,\left({({\rm{diag}}({\bf{\uplambda}}) +{X}^{T}X)}^{-1}{X}^{T}{y},\psi ({\rm{diag}}({\boldsymbol{\uplambda}}) +{X}^{T}X)\right).$$

X^Ty can be calculated from the vector of maximum likelihood marginal effects, ${\mathbf{\hat{\upbeta }}}$, available from GWAS summary statistics by ${({X}^{T}y)}_{i}=2n{{\mathbf{\uptheta}} }_{i}(1-{{\mathbf{\uptheta}} }_{i}){\hat{{\mathbf{\upbeta}} }}_{i}$, where n is the sample size, θ is the vector of allele frequencies and ${({X}^{T}y)}_{i}$ is the ith element of X^Ty, with i indexing SNPs. X^TX = nΦ; here, Φ is the pairwise genotypic covariance, which can be estimated from a reference panel representative of the population used in the GWAS. Thus, rescaling λ by n, the posterior is estimated as

$$\begin{array}{rcl}{\bf{\upbeta}} & \sim &\,{\rm{N}}\,\left(({\rm{diag}}({\bf{\uplambda}}) +{{\varPhi }})^{-1}{\mathbf{\uptheta}} (1-{\mathbf{\uptheta}} ){\mathbf{\hat{\upbeta }}},\psi ({\rm{diag}}({\bf{\uplambda}}) +{{\varPhi }})\right)\\ {\bf{\upbeta}} & \sim &\,{\rm{N}}\,\left(\tilde{\bf{\upbeta} },\psi {{\varOmega }}\right).\end{array}$$

To accommodate the effects of natural selection, we allow the prior on SNP effects to be dependent on allele frequencies such that the prior precision for the kth SNP is ${{\mathbf{\uplambda}} }^{(k)}={\lambda} ^{(0)}{({{\mathbf{\uptheta}} }_{k}(1-{{\mathbf{\uptheta}} }_{k}))}^{\alpha }$ and α ∈ [0, 1] (ref. ²³). When α = 0, allele frequencies and effect size are a priori independent. α = 1 is the value implicitly assumed by many methods²⁴ and implies a strong assumption of larger effects at SNPs of lower minor allele frequency. Multiple models are fit at each locus under priors defined by all combinations of α = (0, 0.25, 0.5, 0.75, 1) and λ⁽⁰⁾ = (0.05, 0.1, 0.2, 0.5, 1, 2, 5). Loci are ranked by the P value of their most associated SNP and assigned to subset S_k; if the top SNP P value is less than 10^−k, values of k = 1, …, 8 are considered. Multiple genome-wide PRSs are calculated for a test set of phenotype and genotype data by summing the effects across all contributing loci for all combinations of α, λ₀ and k:

$$\begin{array}{r}{{{\mbox{PRS}}}}_{ijk}={\sum }_{l\in {S}_{k}}{X}_{l}{\tilde{\bf{\upbeta} }}_{{{\lambda} }_{i}^{(0)}{\alpha }_{j}}^{(l)},\end{array}$$

where X_l is the genotype data at locus l, ${\tilde{\bf{\upbeta} }}_{{{\lambda} }_{i}^{(0)}{\alpha }_{j}}^{(l)}$ is the posterior mean at locus l with prior defined by parameters ${{\lambda} }_{i}^{(0)}$ and α_j, and S_k is the subset of loci with top SNP P value <10^−k. A single PRS is calculated by a weighted sum of the PRS across all i, j and k, with weights determined by a ridge regression fit to the test data, using leave-one-out cross-validation to select the ridge shrinkage parameter that minimizes out-of-sample deviance, as implemented in the R package glmnet²⁵.

Stage 2: PRS informed by stage 1

In stage 2 modeling, SNP effect sizes estimated by the application of stage 1 modeling to population 1 (for example, Europeans) are updated based on population 2 GWAS summary statistics and optimized using population 2 genotype–phenotype data. The prior used is taken as the posterior derived from the λ₀ and α prior parameters, which optimize prediction in the test data of population 1. As for stage 1, this prior is also a multivariate Gaussian. A parameter τ is added to the precision parameter of the Gaussian to control the contribution of population 1 to population 2; thus, the prior is specified as ${\bf{\upbeta} }_{2} \sim {\rm{N}}(\;{\tilde{\bf{\upbeta} }}_{1},\psi \tau {{{\varOmega }}}_{1})$. This is similarly a conjugate model with a Gaussian posterior²²:

$$\begin{array}{rcl}{\bf{\upbeta} }_{2}& \sim &\,{{\mbox{N}}}\,\left({\left(\tau {{{\varOmega }}}_{1}+{{{\varPhi }}}_{2}\right)}^{-1}\left(\tau {{{\varOmega }}}_{1}{\tilde{\bf{\upbeta} }}_{1}+{{\mathbf{\hat{\upbeta }}}}_{2}{{\mathbf{\uptheta}} }_{2}(1-{{\mathbf{\uptheta}} }_{2})\right),\psi \left(\tau {{{\Omega }}}_{1}+{{{\varPhi }}}_{2}\right)\right)\\ {\bf{\upbeta} }_{2}& \sim &\,{{\mbox{N}}}\,({\tilde{\bf{\upbeta} }}_{2},{{{\varOmega }}}_{2}),\end{array}$$

where Φ₂ is the SNP covariance at the locus in population 2, ${\hat{{\mathbf{\upbeta}} }}_{2}$ is the vector of marginal maximum likelihood SNP effect sizes and θ₂ is the vector of allele frequencies. Small values of τ correspond to using effect estimates close to those from population 2. As τ increases, more weight is assigned to population 1, such that as τ → ∞, β₂ → β₁.

Ranking loci in stage 2

Owing to differences in LD between populations, we do not rank loci by the P value of a single best SNP but instead aggregate information across loci by adapting the F test. We show below that the F test in a multivariate linear regression model for the null H₀: β = 0 is well approximated by:

$$\begin{array}{r}{F}_{{\mathrm{stat}}}=\frac{n-k}{kn{\sigma }^{2}}{\bf{\upbeta} }^{T}{X}^{T}X\bf{\upbeta} \end{array}$$

with degrees of freedom k and n − k, where k is the dimension of β, n is the number of observations and σ² is the phenotypic variance. The maximum likelihood estimate and X^TX are substituted by the posterior mean and precision matrix and n with n_eff = n(1 + τ), the effective number of observations accounting for the prior, giving the statistic:

$$\begin{array}{r}{F}_{{\mathrm{Bayes}}}=\frac{{n}_{\rm{eff}}-k}{k{\sigma }^{2}}{\tilde{\bf{\upbeta} }}_{2}{{{\varOmega }}}_{2}{\tilde{\bf{\upbeta} }}_{2}.\end{array}$$

The resultant tail probability is analogous to a P value, although it cannot be interpreted as such as the parameter estimates β and λ include prior information. Instead, for each τ, a locus with test statistic F is assigned to S_k if F > q_k, where q_k is the F quantile corresponding to Prob(p < 10^−k), where the values p are the locus-specific top SNP P values. This ranking ensures that the pseudo F statistic ranking assigns the same number of loci to each subset as the SNP P value ranking. As for the stage 1 single-ancestry PRS, multiple genome-wide PRSs are constructed by:

$$\begin{array}{r}{{{\mbox{PRS}}}}_{ik}={\sum }_{l\in {S}_{k}}{X}_{l}{\tilde{\bf{\upbeta} }}_{{\tau }_{i}}^{(l)},\end{array}$$

where ${\tilde{\bf{\upbeta} }}_{{\tau }_{i}}^{(l)}$ is the posterior mean at locus l with prior defined by parameter τ_i, and S_k is the subset of loci with F > q_k. Models are fit for τ = 1, 2, 5, 10, 15, 20, 50, 100, 200 and 500 and the same P value thresholds as those used in stage 1 of the modeling. A single PRS is estimated via a ridge regression fit using population 2 test data as described above using glmnet.

Supplementary Table 1 shows the average R² from BridgePRS ranking loci by the pseudo F statistic versus the P value from the European GWAS across the 19 traits analyzed here for African and South Asian UKB samples. There were broadly similar results for the pseudo F statistic versus the P value ranking: 0.0413 versus 0.0403 and 0.0683 versus 0.0688 in African and South Asian samples, respectively. Also shown in Supplementary Table 1 are equivalent results using UKB genotyped variants (rather than imputed variants); here, there was a pronounced improvement using the pseudo F statistic ranking: 0.0413 versus 0.0359 in African samples and 0.0694 versus 0.0646 in South Asian samples (P = 0.086 for the superiority of the F statistic ranking). All results presented here were obtained using the pseudo F statistic loci ranking. The BridgePRS software allows users to rank loci in stage 2 using either of the two ranking methods.

Incomplete SNP overlap between populations 1 and 2

Quality control (QC) is performed separately in each population; see below. This results in variants included in analyses differing between populations. Thus, stage 2 analyses are performed on the intersection of variants passing QC in both populations and the prior is calculated conditional on effects of nonoverlapping variants set to zero. Thus, given a prior of ${\bf{\upbeta} }_{2} \sim \,N\,({\tilde{\bf{\upbeta} }}_{1},\psi \tau {{{\varOmega }}}_{1})$, the prior on the overlapping variants is given by²²

$$\begin{array}{r}p\left({\bf{\upbeta} }_{2}^{(a)}| {\bf{\upbeta} }_{2}^{(b)}=0\right)=\,{{\mbox{N}}}\,\left({\tilde{\bf{\upbeta} }}_{1}^{(a)}+{\left({{{\varOmega }}}_{1}^{(aa)}\right)}^{-1}{{{\varOmega }}}_{1}^{(ab)}{\tilde{\bf{\upbeta} }}_{1}^{(b)},\psi \tau {{{\varOmega }}}_{1}^{(aa)}\right),\end{array}$$

where a represents the overlapping variants and b the nonoverlapping variants, and ${{{\varOmega }}}_{1}^{(aa)}$ and ${{{\varOmega }}}_{1}^{(ab)}$ are the appropriate submatrices of Ω₁. SNP overlap is taken at stage 2 to allow models fit in stage 1 to be applied to other datasets with different SNP sets.

Combining PRSs

We consider three alternative models for the PRS of population 2: (1) PRS estimated using only the two-stage European-informed PRS, that is, where the population 2 GWAS is underpowered and contributes insufficient information on its own; (2) PRS estimated using only population 2, that is, where European GWAS does not inform the PRS of population 2; and (3) the case where both the population-2-only PRS and the two-stage PRS contribute independent information. The estimation of models (1) and (2) is determined by a cross-validated ridge regression fit as described above using glmnet. Model (3) is estimated similarly by merging all single-ancestry and two-stage PRS and weighting by a cross-validated ridge regression fit.

The final PRS is a weighted sum of these three PRS, with weights determined by the estimated marginal likelihood of each. The log-marginal likelihood of a linear regression model M_i can be approximated by²⁶

$$\begin{array}{r}\log p(\;y,X| {M}_{i})=\frac{n}{2}\log {\sigma }_{i}^{2}+\kappa, \end{array}$$

where ${\sigma }_{i}^{2}$ is the residual model variance estimated from cross-validation and κ is a constant. With equal prior weight for each of the models, the posterior model weights for models M₁, M₂ and M₃ are given by:

$$\begin{array}{r}p({M}_{i}|\; y,X)=\frac{\exp \left\{n\log {\sigma }_{i}^{2}/2\right\}}{\mathop{\sum }\nolimits_{i = 1}^{3}\exp \left\{n\log {\sigma }_{i}^{2}/2\right\}}.\end{array}$$

Combining PRSs in this way can be extended to any number of contributing PRS. For example, we also combined PRSs for African ancestry samples constructed from East Asian BBJ and African UKB GWAS summary statistics to PRS constructed in our main analysis that used African and European UKB GWAS summary statistics. Supplementary Fig. 1 compares trait R² for African + European PRS with African + European + East Asian PRS for UKB and BBJ overlapping traits. Marginal improvement was observed with the addition of the BBJ East Asian data for monocyte count, BMI and height; for the other traits, R² was practically unaltered.

Definition of loci

Loci for the two-stage modeling were defined by clumping and thresholding of European GWAS summary statistics and LD estimated from UKB European samples using PLINK v.1.9 (ref. ²⁷) with the following parameters: --clump-p1 0.01, --clump-p2 0.01, --clump-kb 1,000, --clump-r2 0.01. The P value for each locus was determined by the P value of the lead SNP of the locus in the European GWAS. The ancestry-specific loci were defined similarly but used GWAS and LD data from the appropriate ancestry.

Estimating LD

BridgePRS calculates LD on the fly using genotype data supplied by the user and is therefore not restricted to any predefined subset of variants. In the simulation analyses, BridgePRS used all 1,000G samples from the appropriate ancestry to estimate LD, and in the real data analyses a subsample (between 5,000 and 6,000) of UKB samples from the appropriate ancestry was used.

Application of PRS-CSx

PRS-CSx is a Python-based software package that integrates GWAS summary statistics and LD reference data from multiple populations to estimate population-specific PRS. PRS-CSx applies a continuous shrinkage prior to SNP effects genome-wide in which the sparseness of the genetic architecture across populations is controlled by a parameter ϕ. PRS-CSx does not make any inference on ϕ but instead estimates separate PRS for each value of ϕ considered. Throughout, we followed the implementation described in Ruan et al.⁵; thus, values of ϕ = (10⁻⁶, 10⁻⁴, 10⁻² and 1) were considered. For each ϕ, PRS-CSx first estimates population-specific PRS, for example. PRS_ϕ,EUR (European) and PRS_ϕ,AFR (African), where PRS_ϕ,x is the standardized PRS for population x. For each ϕ, PRS-CSx fits the following linear regression to the target population test data y:

$$\begin{array}{r}y = {w}_{\phi ,{\mathrm{EUR}}}{{{\mbox{PRS}}}}_{\phi ,{\mathrm{EUR}}}+{w}_{\phi ,{\mathrm{AFR}}}{{{\mbox{PRS}}}}_{\phi ,{\mathrm{AFR}}}\end{array}+e.$$

where e is Gaussian error. The ϕ value and the corresponding regression coefficients for the linear combination of PRSs that maximize the coefficient of determination (R²) in the target population (for example, African) test set were used in the validation dataset to calculate the final PRS:

$$\begin{array}{r}{{{\mbox{PRS}}}}_{{\mathrm{final}}}={\hat{w}}_{\hat{\phi },{\mathrm{EUR}}}{{{\mbox{PRS}}}}_{\hat{\phi },{\mathrm{EUR}}}+{\hat{w}}_{\hat{\phi },{\mathrm{AFR}}}{{{\mbox{PRS}}}}_{\hat{\phi },{\mathrm{AFR}}}\end{array}$$

Unlike BridgePRS, PRS-CSx does not use European test data to estimate non-European PRS. Therefore, to ensure that both methods used the same data, GWASs were performed on the European test samples using PLINK v.2.0 (ref. ²⁷) and then meta-analyzed with the GWAS data from the European data METAL²⁸. The meta-analyzed European GWAS, the GWASs generated from the training samples of the target population and the LD reference panel generated by the authors of PRS-CSx were provided to PRS-CSx.

UKB genotype and sample QC

The UKB is a prospective cohort study of around 500,000 individuals recruited across the United Kingdom during 2006–2010. The genetic data comprise 488,377 samples genotyped at 805,426 SNPs. Population ancestries were defined by four-means clustering performed on the first two principal components (PCs) of the genotype data. The ancestry of each cluster was defined by the country of birth (field ID: 20115) of the majority of individuals in the cluster. Standard QC procedures were then performed on each ancestry cluster independently; any SNP with minor allele frequency <0.01, genotype missingness >0.02 or Hardy–Weinberg equilibrium test P value < 10⁻⁸ was removed. Samples with high levels of missingness or heterozygosity, with mismatching genetic-inferred and self-reported sex, or with aneuploidy of the sex chromosomes were removed as recommended by the UKB data processing team. A greedy algorithm²⁹ was used to remove related individuals, with kinship coefficient >0.044, in a way that maximized sample retention. In total, 557,369 SNPs and 387,392 individuals were retained for analysis.

Imputation

Imputed variants were extracted from imputed UKB data using PLINK v.2.0, converting the imputed data into hard-coded genotypes and retaining variants with the following filters: biallelic variants (--max-alleles 2), minor allele frequency greater than 0.001 (-maf 0.001), genotype missingness less than 1% (--geno 0.01) and MACH info score greater than 0.8 (--mach-r2-filter 0.8).

Trait selection

We extracted all continuous traits from unique samples in the UKB and performed basic filtering, discarding samples with phenotypic values six standard deviations away from the mean. Traits with more than 2,000 samples of African ancestry were extracted. For each trait, 300,000 European samples were extracted (retaining at least 10,000 samples for test and validation for each trait) and GWASs were run on the genotype data using PLINK v.2.0 with --glm. Sex (field ID: 31), age (field ID: 21003), genotyping batch, UKB assessment center (field ID: 54) and 40 PCs were included as covariates, with fasting time (field ID: 74) and dilution factor (field ID: 30897) also included for blood biochemical traits. LD score regression³⁰ was run on the resultant summary statistics and traits were further filtered, discarding those with heritability less than 1%. The remaining traits were ranked according to their heritability, and traits correlated with a more heritable trait (absolute Pearson correlation greater than 0.3) were removed, resulting in 27 traits. Results are presented for 19 traits that had an R² in Africans of greater than 1% for at least one analysis. The sample sizes for each trait and ancestry are shown in Extended Data Table 1.

Implementation

European, African and South Asian UKB samples were split into three independent groups: training data to construct the GWAS summary statistics, test data to select best-fitting parameters, and validation data to calculate out-of-sample predictive accuracy. The proportions of samples allocated to each set were 2/3 training, 1/6 test and 1/6 validation. Each GWAS was run in PLINK v.2.0 as described above. East Asian samples were split equally between test and validation sets.

For each trait, analyses were run with imputed variants. GWASs were run separately for the training samples of European, African and South Asian ancestry for each of the 19 traits using PLINK v.2.0 as described above. All PRSs were calculated using two populations: the African PRS used African and European UKB GWAS data, the South Asian PRS used South Asian and European UKB GWAS data, and the East Asian PRS used BBJ and European UKB GWAS.

Application to BioMe

BioMe samples were genotyped on the Infinium Global Screening Array v.1.0 platform. Samples were removed if they had a population-specific heterozygosity rate of greater than ±6 standard deviations of the population-specific mean, along with a call rate of <95%. In addition, samples were removed if they exhibited persistent discordance between the electronic health record and genetic sex. Variants were removed that had a call rate <95%, a Hardy–Weinburg Equilibrium P value threshold of P < 10⁻⁵ in African American and European American ancestry, or P < 10⁻¹³ in Hispanic and South Asian ancestry.

PC analysis was performed, and African, South Asian and East Asian samples were selected by clusters on PC plots corresponding to self-reported ancestry. African samples were selected as those with PC1 > 0.0075, PC2 < −0.0005 and PC3 > −0.002. South Asian samples were selected as those with −0.01 < PC3 < −0.004, −0.003 < PC4 < 0.001 and PC5 < −0.015. East Asian samples were selected as those with PC3 < −0.01, PC4 > 0.001, PC5 > −0.005 and PC6 > −0.0035. Supplementary Figs. 2–4 plot the top six PCs, with samples colored by self-reported ancestry, and show the thresholds used to select African, South Asian and East Asian ancestry samples.

Imputation was performed using IMPUTE2 (ref. ³¹) with the 1000G Phase 3 v.5 reference panel¹². Variants were first filtered by info score >0.3. Genotype data for the calculation of PRS in unique individuals were generated for in each of the two ancestry groups separately by first removing variants with minor allele frequency <1% in the respective BioMe population and then removing one of each pair of variants with duplicate genomic position. BioMe variants were mapped onto the UKB PRS by genomic position (build 37). Variants were coded by their expected allele count (dosage) for the calculation of PRS. Samples with phenotypic values three standard deviations away from the mean were excluded.

Measure of PRS accuracy

Variance explained was calculated as

$$\begin{array}{r}{R}^{2}=1-\frac{\,{{\mbox{Var}}}(\;y| {M}_{1})}{{{\mbox{Var}}}\,(\;y| {M}_{0})}\end{array},$$

where M_i is the regression model with (i = 1) and without (i = 0) the PRS, with both models including covariates for the top 40 PCs, age, sex, center and batch, fasting and dilution for the biochemical traits. Variance explained in the applications to BioMe included covariates for age, sex and the top 32 PCs. Standard errors and confidence intervals were calculated by bootstrapping in the R package boot^20,21 using 10,000 replicates.

Equivalence of sample size and heritability on GWAS power

We assume a phenotype value is given by additive genetic effects β and an environmental component e

$$Y=\mathop{\sum}\limits_{j}{X}_{j}{\beta }_{j}+e,$$

where $e \approx {{N}}(0,{\sigma }_{e}^{2})$. Therefore,

$${{{\rm{Var}}}}(Y\,)=\mathop{\sum}\limits_{j}{\beta }_{j}^{2}{{{\rm{Var}}}}({X}_{j})+{\sigma }_{e}^{2};$$

setting variance due to genetics to ${\sigma }_{g}^{2}$, we have

$$={\sigma }_{g}^{2}+{\sigma }_{e}^{2}.$$

As heritability ${h}^{2}=\frac{{\sigma }_{g}^{2}}{\textrm{Var}(Y\;)}$, for fixed genetic effects β and therefore fixed ${\sigma }_{g}^{2}$, if heritability changes by a factor of κ, Var(Y) must change by a factor of κ⁻¹. If the genetic effect β_j in a GWAS is estimated in a linear regression model, the expected variance of its maximum likelihood estimate ${\hat{{\mathbf{\upbeta}} }}_{j}$ is approximately $\frac{{{{\rm{Var}}}}(Y\;)}{n{{{\rm{Var}}}}(X\;)}$. Therefore, changing h² by a factor of κ, and thus Var(Y) by a factor of κ⁻¹, has the same effect on ${{{\rm{Var}}}}({\hat{{\mathbf{\upbeta}} }}_{j})$ as changing the sample size n by a factor of κ.

Reformulation of the F test

Without loss of generality, assume zero-centered normally distributed trait data y with variance σ². A linear regression is fitted to this data with an n × k covariate matrix X, resulting in maximum likelihood estimates ${\mathbf{\hat{\upbeta }}}$. The F statistic is defined by the residual sum of squares of the null and alternative models (RSS₀ and RSS₁) as follows:

$$\begin{array}{rcl}F&=&\frac{n-k}{k}\left(\frac{{{{\mbox{RSS}}}}_{0}-{{{\mbox{RSS}}}}_{1}}{{{{\mbox{RSS}}}}_{1}}\right)\\ &=&\frac{n-k}{k}\left(\frac{{y}^{T}y-{\left(y-X{\mathbf{\hat{\upbeta}} }\right)}^{T}\left(y-X{\mathbf{\hat{\upbeta}} }\right)}{{\left(y-X{\mathbf{\hat{\upbeta}} }\right)}^{T}\left(y-X{\mathbf{\hat{\upbeta}} }\right)}\right)\\ &=&\frac{n-k}{k}\left(\frac{n{\sigma }^{2}}{{\left(y-X{\mathbf{\hat{\upbeta}} }\right)}^{T}\left(y-X{\mathbf{\hat{\upbeta}} }\right)}-1\right)\\ &=&\frac{n-k}{k}\left(\frac{n{\sigma }^{2}}{{y}^{T}y-{{\mathbf{\hat{\upbeta }}}}^{T}{X}^{T}y-{y}^{T}X{\mathbf{\hat{\upbeta}} }+{{\mathbf{\hat{\upbeta}} }}^{T}{X}^{T}X{\mathbf{\hat{\upbeta}} }}-1\right),\end{array}$$

as $\hat{\bf{\upbeta }}={({X}^{T}X)}^{-1}{X}^{T}y$

$$\begin{array}{rcl}&=&\frac{n-k}{k}\left(\frac{n{\sigma }^{2}}{n{\sigma }^{2}-{{\mathbf{\hat{\upbeta}} }}^{T}{X}^{T}X\mathbf{\hat{\upbeta} }}-1\right)\\ &=&\frac{n-k}{k}\left(\frac{{\mathbf{\hat{\upbeta} }}^{T}{X}^{T}X{\mathbf{\hat{\upbeta }}}}{n{\sigma }^{2}-{{\mathbf{\hat{\upbeta }}}}^{T}{X}^{T}X{\mathbf{\hat{\upbeta }}}}\right)\\ &=&\frac{n-k}{kn{\sigma }^{2}}{{\mathbf{\hat{\upbeta}} }}^{T}{X}^{T}X{\mathbf{\hat{\upbeta}} }{\left(1-\frac{1}{n{\sigma }^{2}}{{\mathbf{\hat{\upbeta}} }}^{T}{X}^{T}X{\mathbf{\hat{\upbeta}} }\right)}^{-1}.\end{array}$$

$\frac{{\bf{\upbeta} }^{T}{X}^{T}X{\bf\upbeta} }{{\sigma }^{2}}$ is the variance explained by the locus; therefore, assuming this is small, a first-order Taylor approximation can be used to give

$$\approx \frac{n-k}{kn{\sigma }^{2}}{{\mathbf{\hat{\upbeta}} }}^{T}{X}^{T}X{\mathbf{\hat{\upbeta }}}.$$

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Publicly available data used to generate the simulated data are available from the following sites. 1000G Phase 3 reference panels: https://mathgen.stats.ox.ac.uk/impute/1000GP_Phase3.html; and genetic maps for each subpopulation: ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/working/20130507_omni_recombination_rates. UKB genotype and phenotype data were obtained from the UKB resource under application 18177 (https://www.ukbiobank.ac.uk/enable-your-research/approved-research/multi-trait-gwas-analyses-in-the-uk-biobank). UKB QC information (missingness, allele frequency, Hardy–Weinberg equilibrium) was obtained from UKB resource 531 (https://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=531). Recruitment and enrollment of participants into BioMe was Institutional Review Board (IRB) and Health Insurance Portability and Accountability Act 1996 (HIPAA) approved. It is a biobank linked to electronic medical records that allows the use of deidentified samples linkable to past, present and future clinical information from electronic health records at Mount Sinai. BioMe contains protected health information and is thus under controlled access. Applications to access the data can be made to biome@mountsinai.org; see also https://icahn.mssm.edu/research/ipm/programs/biome-biobank. BBJ summary statistics were downloaded from PheWeb: https://pheweb.jp. SNP weights for the polygenic risk scores estimated by BridgePRS in this paper are available on GitHub (https://github.com/clivehoggart/BridgePRS_data).

Code availability

Software, example data and a tutorial for BridgePRS are available from www.bridgeprs.net. Source code, to which www.bridgeprs.net links, is available from https://github.com/clivehoggart/BridgePRS, DOI badge https://doi.org/10.5281/zenodo.8385983, v.0.1 (ref. ³²). Scripts used for all analyses are available on GitHub: https://github.com/clivehoggart/BridgePRS_data. All other code used in this study is available from the following websites: BridgePRS: https://www.bridgeprs.net; HAPGEN2 v.2.2.0: https://mathgen.stats.ox.ac.uk/genetics_software/hapgen/hapgen2.html; IMPUTE2 v.2: https://mathgen.stats.ox.ac.uk/impute/impute_v2.html; LDSC v.1.0.1: https://github.com/bulik/ldsc; METAL v.2011-03-25: http://csg.sph.umich.edu/abecasis/metal/; PLINK v.1.9: https://www.cog-genomics.org/plink; PLINK v.2.0: https://www.cog-genomics.org/plink/2.0/; PRS-CSx v.1.0.0: https://github.com/getian107/PRScsx; PRS-CS v.1.0.0: https://github.com/getian107/PRScs; PRSice-2 v.2: https://www.prsice.info; R v.4.0.3: https://cran.r-project.org; R boot package v.1.3.25: https://cran.r-project.org/web/packages/boot/index.html; Ridge reg glmnet package v.4.0-2: https://cran.r-project.org/web/packages/glmnet/index.html.

References

Martin, A. R. et al. Human demographic history impacts genetic risk prediction across diverse populations. Am. J. Hum. Genet. 100, 635–649 (2017).
Article CAS PubMed PubMed Central Google Scholar
Duncan, L. et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations. Nat. Commun. 11, 3865 (2020).
Article PubMed PubMed Central Google Scholar
Hu, S. et al. Leveraging fine-scale population structure reveals conservation in genetic effect sizes between human populations across a range of human phenotypes. Preprint at bioRxiv https://doi.org/10.1101/2023.08.08.552281 (2023).
Ruan, Y. et al. Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580 (2022).
Article CAS PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Kanai, M. et al. Genetic analysis of quantitative traits in the japanese population links cell types to complex human diseases. Nat. Genet. 50, 390–400 (2018).
Article CAS PubMed Google Scholar
Sakaue, S. et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat. Genet. 53, 1415–1424 (2021).
Article CAS PubMed Google Scholar
Abul-Husn, N. S. & Kenny, E. E. Personalized medicine and the power of electronic health records. Cell 177, 58–69 (2019).
Article CAS PubMed PubMed Central Google Scholar
Choi, S. W., Mak, T. S.-H. & O’Reilly, P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020).
Article CAS PubMed PubMed Central Google Scholar
Su, Z., Marchini, J. & Donnelly, P. HAPGEN2: simulation of multiple disease SNPs. Bioinformatics 27, 2304–2305 (2011).
Article CAS PubMed PubMed Central Google Scholar
The 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).
Article PubMed Central Google Scholar
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Article PubMed PubMed Central Google Scholar
Choi, S. W. & O’Reilly, P. F. PRSice-2: polygenic risk score software for biobank-scale data. Gigascience 8, giz082 (2019).
Zeng, J. et al. Signatures of negative selection in the genetic architecture of human complex traits. Nat. Genet. 50, 746–753 (2018).
Article CAS PubMed Google Scholar
Graham, S. E. et al. The power of genetic diversity in genome-wide association studies of lipids. Nature 600, 675–679 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wainschtein, P. et al. Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data. Nat. Genet. 54, 263–273 (2022).
Article CAS PubMed PubMed Central Google Scholar
Daetwyler, H. D., Villanueva, B. & Woolliams, J. A. Accuracy of predicting the genetic risk of disease using a genome-wide approach. PLoS ONE 3, e3395 (2008).
Article PubMed PubMed Central Google Scholar
Wu, T., Liu, Z., Mak, T. S. H. & Sham, P. C. Polygenic power calculator: statistical power and polygenic prediction accuracy of genome-wide association studies of complex traits. Front. Genet. 13, 989639 (2022).
Canty, A. & Ripley, B. D. boot: Bootstrap R (S-Plus) Functions. R package version 1.3-28 (2022).
Davison, A. C. & Hinkley, D. V. Bootstrap Methods and Their Applications (Cambridge University Press, 1997).
Bernardo, J. M. & Smith, A. F. M. Bayesian Theory (Wiley, 1994).
Speed, D., Hemani, G., Johnson, M. R. & Balding, D. J. Improved heritability estimation from genome-wide SNPs. Am. J. Hum. Genet. 91, 1011–1021 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Article PubMed PubMed Central Google Scholar
Fong, E. & Holmes, C. C. On the marginal likelihood and cross-validation. Biometrika 107, 489–496 (2020).
Article Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Choi, S. W. GreedyRelated: script for greedily remove related samples, v.1.2. Zenodo zenodo.org/record/3697212#.Yd__oi-l3sc (2017).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article PubMed PubMed Central Google Scholar
Hoggart, C. J. BridgePRS, v.0.1. Zenodo https://doi.org/10.5281/zenodo.8385983 (2023).

Download references

Acknowledgements

We thank the participants in the UK Biobank (UKB), Biobank Japan (BBJ) and BioMe Biobank and the scientists involved in the construction of these resources. This research has been conducted using the UKB resource under application 18177 (P.F.O.). All participants gave full informed consent. This work was supported by grants to P.F.O. from the National Institute of Mental Health (R01MH122866) and the National Human Genome Research Institute (R01HG012773) and through the computational resources and staff expertise provided by Scientific Computing at the Icahn School of Medicine at Mount Sinai, in particular, the Minerva and Data Ark teams. We also thank A. Ori, B. Rowan, C. Iyegbe, H. M. (Beatrice) Wu, L. Liou, L. Sloofman and Z. Wang for helpful discussions.

Author information

Authors and Affiliations

Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York, NY, USA
Clive J. Hoggart, Shing Wan Choi, Judit García-González & Paul F. O’Reilly
Regeneron Genetics Center, Tarrytown, NY, USA
Shing Wan Choi
Department of Cellular Biology, Suny Downstate Health Sciences, Brooklyn, NY, USA
Tade Souaiaia
The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine, Mount Sinai, New York, NY, USA
Michael Preuss

Authors

Clive J. Hoggart
View author publications
You can also search for this author in PubMed Google Scholar
Shing Wan Choi
View author publications
You can also search for this author in PubMed Google Scholar
Judit García-González
View author publications
You can also search for this author in PubMed Google Scholar
Tade Souaiaia
View author publications
You can also search for this author in PubMed Google Scholar
Michael Preuss
View author publications
You can also search for this author in PubMed Google Scholar
Paul F. O’Reilly
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.J.H. and P.F.O. conceived and designed the project and methodology. C.J.H. developed the statistical modeling with input from P.F.O. C.J.H. programmed all the BridgePRS code and performed the analyses. S.W.C. preprocessed the UKB data and performed the GWAS in the UKB. S.W.C. and J.G.-G. developed the pipeline to run PRS-CSx on the data. T.S. tested the code and wrote a wrapper for the software. T.S. also developed the BridgePRS software website, with input from C.J.H. and P.F.O. M.P. preprocessed the BioMe data. C.J.H. and P.F.O. wrote the manuscript, and all authors reviewed and approved the final version.

Corresponding authors

Correspondence to Clive J. Hoggart or Paul F. O’Reilly.

Ethics declarations

Competing interests

S.W.C. is a current employee of Regeneron Genetics Center. The other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Zoltán Kutalik and Yixuan Ye for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Relative loss in removing causal variants from analysis in simulated data.

Relative loss measured by ratio of models’ variance explained (R²) without and with the causal variants included. Results are shown for BridgePRS, PRS-CSx, PRS-CS-mult and PRSice-meta across six simulation scenarios for African and East Asian ancestry samples. a SNP heritability ${{{{{\rm{h}}}}}^{2}}_{{{{\rm{SNP}}}}}=0.25$ and b SNP heritability ${{{{{\rm{h}}}}}^{2}}_{{{{\rm{SNP}}}}}=0.5$, ten simulated phenotypes per scenario. Under each set of analyses the proportion of causal variants and the relative power of the data used is shown, measured by nh²/m up to proportionality, where n is the GWAS sample size, h² heritability and m the number of causal variants. The central rectangular boxes show the interquartile range, horizontal lines inside the boxes show the median, whiskers extend to the most extreme results and points show results for each of the 10 simulated phenotypes. PRSice-meta results for East Asian analyses were unstable and removed for clarity.

Extended Data Fig. 2 Predictive accuracy for different polygenic prediction methods in simulations using half GWAS sample size as used in the primary simulation.

Sample sizes of 40K European and 10K non-European were used. Results are shown for BridgePRS, PRS-CSx, PRS-CS-mult and PRSice-meta across six simulation scenarios, with and without the causal variants included in the model for African and East Asian ancestry samples. a SNP heritability ${{{{{\rm{h}}}}}^{2}}_{{{{\rm{SNP}}}}}=0.25$ and b SNP heritability ${{{{{\rm{h}}}}}^{2}}_{{{{\rm{SNP}}}}}=0.5$, ten simulated phenotypes per scenario. Under each set of analyses the proportion of causal variants and the relative power of the data used is shown, measured by nh²/m up to proportionality, where n is the GWAS sample size, h² heritability and m the number of causal variants. The central rectangular boxes show the interquartile range, horizontal lines inside the boxes show the median, whiskers extend to the most extreme results and points show results for each of the 10 simulated phenotypes.

Extended Data Fig. 3 Predictive accuracy for different polygenic prediction methods in simulations at ${{\mathbf{h}}^{\mathbf{2}}}_{\mathbf{SNP}}{\mathbf{=0.75}}$.

Results are shown for BridgePRS, PRS-CSx, PRS-CS-mult and PRSice-meta across six simulation scenarios, with and without the causal variants included in the model for African and East Asian ancestry samples, ten simulated phenotypes per scenario. Under each set of analyses the proportion of causal variants and the relative power of the data used is shown, measured by nh²/m up to proportionality, where n is the GWAS sample size, h² heritability and m the number of causal variants. The central rectangular boxes show the interquartile range, horizontal lines inside the boxes show the median, whiskers extend to the most extreme results and points show results for each of the 10 simulated phenotypes.

Extended Data Fig. 4 Ratio of phenotypic variance explained R² using UK Biobank and 1000 Genomes LD reference panels in simulations.

Results are shown for BridgePRS and PRS-CSx across six simulation scenarios, 10 simulated phenotypes per scenario with h²=0.25 for African and East Asian ancestry samples. Data was simulated using 1000 Genomes as reference. Under each set of analyses the proportion of causal variants and the relative power of the data used is shown, measured by nh²/m up to proportionality, where n is the GWAS sample size, h² heritability and m the number of causal variants. The central rectangular boxes show the interquartile range, horizontal lines inside the boxes show the median, whiskers extend to the most extreme results and points show results for each of the 10 simulated phenotypes.

Extended Data Fig. 5 Predictive accuracy for quantitative traits in UK Biobank samples.

For each trait variance explained (R²), point estimates and 95% confidence intervals, by BridgePRS, PRS-CSx, PRS-CS-mult and PRSice-meta are shown for African, South Asian and East Asian ancestry samples. n indicates sample size. Neutro count=Neutrophil count, MCV=Mean corpuscular volume, Platelets=Platelet count, Retic count=Reticulocyte per- centage, ALP=Alkaline phosphatase, Mono count=Monocyte count, apoA1=Apolipoprotein A, BMI=Body mass index, RDW=Red blood cell distribution width, Eos count=Eosinophil count, TG=Triglycerides, Baso %=Basophil percentage, CRP=C-reactive protein. Confidence intervals were calculated by bootstrapping using 10,000 replicates.

Extended Data Table 1 GWAS and UK Biobank test and validation sample sizes

Full size table

Extended Data Table 2 BioMe Biobank sample sizes for individuals of African, South Asian and East Asian ancestry

Full size table

Supplementary information

Supplementary Information

Supplementary Table 1 and Figs. 1–4.

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hoggart, C.J., Choi, S.W., García-González, J. et al. BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability. Nat Genet 56, 180–186 (2024). https://doi.org/10.1038/s41588-023-01583-9

Download citation

Received: 18 January 2022
Accepted: 20 October 2023
Published: 20 December 2023
Issue Date: January 2024
DOI: https://doi.org/10.1038/s41588-023-01583-9

Subjects

Abstract

Similar content being viewed by others

Main

Results

Overview of BridgePRS method

Benchmarking methods via simulation

Benchmarking methods via real data

Discussion

Methods

The BridgePRS model

Stage 1: PRS informed by a single population

Stage 2: PRS informed by stage 1

Ranking loci in stage 2

Incomplete SNP overlap between populations 1 and 2

Combining PRSs

Definition of loci

Estimating LD

Application of PRS-CSx

UKB genotype and sample QC

Imputation

Trait selection

Implementation

Application to BioMe

Measure of PRS accuracy

Equivalence of sample size and heritability on GWAS power

Reformulation of the F test

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links