Mapping eGFR loci to the renal transcriptome and phenome in the VA Million Veteran Program

Hellwege, Jacklyn N.; Velez Edwards, Digna R.; Giri, Ayush; Qiu, Chengxiang; Park, Jihwan; Torstenson, Eric S.; Keaton, Jacob M.; Wilson, O. D.; Robinson-Cohen, Cassianne; Chung, Cecilia P.; Roumie, Christianne L.; Klarin, Derek; Damrauer, Scott M.; DuVall, Scott L.; Siew, Edward; Akwo, Elvis A.; Wuttke, Matthias; Gorski, Mathias; Li, Man; Li, Yong; Gaziano, J. Michael; Wilson, Peter W. F.; Tsao, Philip S.; O’Donnell, Christopher J.; Kovesdy, Csaba P.; Pattaro, Cristian; Köttgen, Anna; Susztak, Katalin; Edwards, Todd L.; Hung, Adriana M.

doi:10.1038/s41467-019-11704-w

Article
Open access
Published: 26 August 2019

Nature Communications volume 10, Article number: 3842 (2019)

Abstract

Chronic kidney disease (CKD), defined by low estimated glomerular filtration rate (eGFR), contributes to global morbidity and mortality. Here we conduct a transethnic Genome-Wide Association Study of eGFR in 280,722 participants of the Million Veteran Program (MVP), with replication in 765,289 participants from the Chronic Kidney Disease Genetics (CKDGen) Consortium. We identify 82 previously unreported variants, confirm 54 loci, and report interesting findings including association of the sickle cell allele of beta globin among non-Hispanic blacks. Our transcriptome-wide association study of kidney function in healthy kidney tissue identifies 36 previously unreported and nine known genes, and maps gene expression to renal cell types. In a Phenome-Wide Association Study in 192,868 MVP participants using a weighted genetic score we detect associations with CKD stages and complications and kidney stones. This investigation reinterprets the genetic architecture of kidney function to identify the gene, tissue, and anatomical context of renal homeostasis and the clinical consequences of dysregulation.

Introduction

Chronic kidney disease (CKD), defined by an estimated glomerular filtration rate (eGFR) lower than 60 ml/min/1.73 m² for three or more months¹, is a global health concern and is associated with premature death^2,3. CKD has a prevalence of 15% and affects 30 million people in the US^4,5. In addition to the risk of progressing to end-stage renal disease (ESRD), CKD is associated with significant cardiovascular morbidity and mortality^6,7. Patients with an eGFR of <15 ml/min/1.73 m², for example, have threefold higher mortality than those with normal renal function⁸.

Diabetes is the most common comorbidity associated with ESRD worldwide, occurring in 44–60% of ESRD cases^4,5. CKD in diabetics is multifactorial and also related to hyperglycemia, hypertension, atherosclerosis, and aging. Currently there are few therapies that slow CKD progression and life-extending treatments for ESRD are restricted to dialysis and transplantation.

There is a great need to understand the biological mechanisms that lead to CKD so that treatments that target those biological factors can be developed. Kidney function, as measured by eGFR, is a heritable trait⁹ that has been studied in genetic association studies, and over 50 eGFR loci have been definitively identified^10,11,12,13. A small proportion of variance in eGFR is explained by the subtle effects of previously reported variants, and the genetic etiology of this trait is highly complex. The inferences of what genes influence eGFR are often unclear from loci detected by genome-wide association studies (GWAS) of common variants, which comprise a large proportion of known genetic determinants. Regulatory effects may account for significant additional heritability in GWAS, and GWAS results are enriched for regulatory single-nucleotide polymorphisms (SNPs) compared with the proportion of the genome containing regulatory elements^14,15,16. In recent large-scale GWAS meta-analyses of eGFR, it was shown that many significant SNPs map into tissue-specific regulatory regions, and that gene expression may mediate many of the associations between genetic variants and eGFR^10,12. Several methods were recently developed to leverage multiple variants to perform gene-based tests of association between imputed gene expression levels and phenotypes^17,18,19,20. These tests are tissue-specific and provide effect estimates with interpretable direction and magnitude compared with studies that only evaluate associations between SNPs and traits.

Thus, to capitalize on our large electronic health record-based study of patients receiving care in a standardized setting and to leverage available analytical possibilities, we conduct a transethnic GWAS of eGFR in 280,722 participants of the U.S. Veteran’s Administration Million Veteran Program, with replication in an additional 765,289 participants from the Chronic Kidney Disease Genetics (CKDGen) Consortium. In addition, because of the important role of diabetes in nephropathy, we stratify analysis by diabetes status. We also evaluate associations between genetically predicted gene expression (GPGE) in a human healthy kidney expression quantitative trait locus (eQTL) reference panel²¹ and eGFR, followed by comparison of significant GPGE associations with gene expression profiles in murine kidney cell types²² to identify the specific cells where gene expression effects likely arise. Finally, we evaluate clinical translation by performing a phenome-wide association study (PheWAS)²³ of a weighted genetic risk score (GRS) of eGFR in the electronic health records across 192,868 MVP participants.

Results

MVP characteristics

A total of 280,722 participants were available from MVP for analyses of eGFR, of whom ~33% were diabetic (Table 1). Most participants were non-Hispanic whites (80%), male (93%), and hypertensive (70%). When the data were stratified by diabetes status, hypertension was more common among diabetics than non-diabetics within both race groups (non-Hispanic white diabetics and hypertensive 91%; non-Hispanic black diabetics and hypertensive 93%). Across both race groups, eGFR was lower in diabetics than in non-diabetics, and eGFR was higher in non-Hispanic blacks than in non-Hispanic whites.

Table 1 Characteristics of Million Veteran Program discovery sample

Transethnic GWAS

Meta-analyses of GWAS data were performed on all variants (minor allele frequency, MAF, ≥1%), with data stratified by race, diabetes status, and hypertension status. Meta-analysis of both race groups across all diabetes and hypertension strata identified 136 genome-wide significant loci, of which 122 were available and evaluated for replication in a transethnic GWAS meta-analysis of eGFR from the CKDGen Consortium (n = 765,289). Fourteen of these SNPs or proxies from the all-ancestry analyses were unavailable for replication.
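Combining per-stratum association results for a single variant can be illustrated with a fixed-effects, inverse-variance weighted meta-analysis. This is a minimal sketch under an assumption: the text here does not name the meta-analysis model used, so inverse-variance weighting is shown only as a common choice. The function name and the example effect sizes are hypothetical.

```python
import math


def inverse_variance_meta(betas, ses):
    """Fixed-effects meta-analysis of per-stratum effect estimates.

    Assumption: inverse-variance weighting; the study's actual software
    and model are not specified in this section.
    """
    weights = [1.0 / se ** 2 for se in ses]
    total_weight = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    se = math.sqrt(1.0 / total_weight)
    z = beta / se
    # Two-sided p-value under the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


# Hypothetical effects (mL/min/1.73 m^2) from two strata for one variant.
beta, se, z, p = inverse_variance_meta([-0.5, -0.4], [0.1, 0.1])
print(f"beta={beta:.3f} se={se:.4f} z={z:.2f} p={p:.2e}")
```

Strata with smaller standard errors receive more weight, so large strata dominate the pooled estimate.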

Of the 122 variants, 79 were replicated at the Tier 1 definition (p ≤ 5 × 10⁻⁸), an additional 28 were Tier 2 replicated (5 × 10⁻⁸ < p ≤ Bonferroni correction for 122 SNPs = 0.0004), and 11 were Tier 3 replicated (0.0004 < p ≤ 0.05) (Supplementary Data 1, Table 2, Fig. 1). Only four showed no evidence of association in CKDGen. Meta-analysis of p-values from MVP and CKDGen results identified all Tier 1 and seven Tier 2 results as genome-wide significant. Across the three tiers of successful replication, 2.75% of variance in eGFR was explained. Fifty-seven of these results have been previously detected in GWAS of eGFR, while six others have been associated with overt kidney disease, microalbuminuria, or other related phenotypes in prior studies. Among 64 novel variants, the most significantly associated was rs2823139 near NRIP1 (p_discovery = 1.82 × 10⁻¹⁸, effect = −0.45 mL/min/1.73 m² [standard error (SE) = 0.051]; p_replication = 5.24 × 10⁻¹⁶). Evaluation of conditionally independent signals within the discovery meta-analysis results using GCTA²⁴ identified 18 SNPs from 15 loci (five novel) that are significantly independently associated with eGFR (Supplementary Table 1).
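The three replication tiers defined above reduce to a small decision rule on the replication p-value. The sketch below is illustrative only: the function name is hypothetical, and the Bonferroni threshold is computed as 0.05/122 (about 0.00041) rather than taken as the rounded 0.0004 quoted in the text.

```python
GENOME_WIDE = 5e-8  # Tier 1 threshold stated in the text


def replication_tier(p_value, n_tests=122, nominal=0.05):
    """Assign a replication tier from a replication p-value."""
    bonferroni = nominal / n_tests  # 0.05 / 122 is roughly 0.00041
    if p_value <= GENOME_WIDE:
        return "Tier 1"
    if p_value <= bonferroni:
        return "Tier 2"
    if p_value <= nominal:
        return "Tier 3"
    return "not replicated"


# rs2823139 near NRIP1 had p_replication = 5.24e-16 in the text.
print(replication_tier(5.24e-16))
```

Passing `n_tests=105` gives the threshold of about 0.00048 used for the analysis restricted to non-Hispanic whites.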

Table 2 Summary of significant known and novel loci from analysis of common variants

We also conducted analyses limiting to non-Hispanic white individuals. Similar to the trans-ancestry analyses, the majority of variants replicated in CKDGen (n = 567,401; Supplementary Data 2). Six variants failed to replicate (p > 0.05). Of the 105 replicated SNPs, 61 were Tier 1 (p ≤ 5 × 10⁻⁸), 29 were Tier 2 (5 × 10⁻⁸ < p ≤ 0.00048), and 9 were Tier 3 (0.00048 < p ≤ 0.05).

We identified 14 variants that were significant in non-Hispanic blacks (Supplementary Table 2) within our MVP discovery population. Among these, four were novel (rs75113983 near OLFR690; rs200950799, which is intergenic on chromosome 12; rs144803907 near C15orf43; and rs10084572 near AGPAT3). Only one of these novel variants was also identified in non-Hispanic white analyses (rs13230509). An interesting finding among the non-Hispanic blacks is the strong association with rs334 (p = 1.54 × 10⁻¹⁸) in HBB, the variant that encodes the sickle cell allele of beta globin. The origins of this variant have been recently described²⁵. The derived A allele on the + strand, which encodes the sickle version of the beta globin protein, is associated with lower kidney function. This allele has also been previously associated with increased urinary albumin-to-creatinine ratio²⁶.

We compared the effect sizes for significant discovery GWAS results across non-Hispanic black and non-Hispanic white subjects in MVP (Supplementary Fig. 1, Supplementary Table 3). Most of the effects were in the same direction except for three variants. One was a novel variant (rs142314590 in LDB2) and two were known variants (rs532086 and rs2235826 near C2 and PRK1, respectively). The effect sizes across non-Hispanic black and non-Hispanic white subjects were more correlated among known loci (r² = 0.25) than among novel variants (r² = 0.13) (Supplementary Table 3). The overall allele frequencies across ancestry groups for significant variants were highly correlated (r² = 0.67).

Diabetes stratified GWAS

There were 91,279 patients with diabetes. In our GWAS for this group we identified 32 variants reaching genome-wide significance, which make up a subset of the loci discovered in combined analysis. Seventeen of these variants were in known loci and 15 were in novel loci (Supplementary Data 3, Supplementary Figs. 2 and 3). The top six hits were near UMOD (p = 2.43 × 10⁻⁸²), PRKAG2 (p = 7.89 × 10⁻²³), MPPED2/DCDC5 (p = 9.85 × 10⁻²²), GATM/SPATA5L1 (p = 2.21 × 10⁻¹⁷), SHROOM3 (p = 1.63 × 10⁻¹³), and HBB (p = 7.95 × 10⁻¹³). Comparison of the association effect sizes in the stratified analysis in subjects with (Supplementary Data 3) and without diabetes (Supplementary Data 4) demonstrated generally consistent effects, reflecting the shared pathways between diabetic and non-diabetic kidney disease (Supplementary Fig. 4).

LD score regression

Subsequently, we utilized the LD Score Regression approach²⁷ in each contributing group to ascertain whether inflation was due to residual population stratification or polygenicity. The intercepts in the MVP non-Hispanic white discovery analysis datasets were 1.02 (SE = 0.01), 1.04 (SE = 0.01), 1.03 (SE = 0.01), and 0.99 (SE = 0.01) for diabetics with hypertension, non-diabetics with hypertension, non-diabetics without hypertension, and diabetics without hypertension, respectively, suggesting that little of the observed inflation in the lambda is due to population stratification (Supplementary Table 4). Similarly, intercepts in the MVP non-Hispanic black discovery analysis datasets were 1.01 (SE = 0.003), 1.02 (SE = 0.004), 1.00 (SE = 0.003), and 1.00 (SE = 0.003) for diabetics with hypertension, non-diabetics with hypertension, non-diabetics without hypertension, and diabetics without hypertension, respectively.
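The logic behind the intercept can be sketched with the basic LD score regression model, in which the expected association statistic for SNP j is E[χ²_j] = (N·h²/M)·ℓ_j + intercept, where ℓ_j is the SNP's LD score, N the sample size, and M the number of SNPs. Polygenic signal scales with the LD score, while confounding such as population stratification lifts the intercept above 1. The code below is a simplified illustration on synthetic numbers: it uses ordinary least squares, whereas the published method uses weighted regression with block-jackknife standard errors, and all names are hypothetical.

```python
import numpy as np


def ldsc_sketch(chi2, ld_scores, n_samples, n_snps):
    """Regress chi-square statistics on LD scores (unweighted sketch).

    Returns (intercept, h2). An intercept near 1 suggests little
    confounding; the slope rescaled by M / N estimates SNP heritability.
    """
    chi2 = np.asarray(chi2, dtype=float)
    ld = np.asarray(ld_scores, dtype=float)
    design = np.column_stack([np.ones_like(ld), ld])
    (intercept, slope), *_ = np.linalg.lstsq(design, chi2, rcond=None)
    h2 = slope * n_snps / n_samples
    return float(intercept), float(h2)


# Synthetic, noise-free data generated with intercept 1.02 and h2 = 0.15.
ld = np.array([10.0, 50.0, 100.0, 200.0])
n, m = 100_000, 1_000_000
chi2 = 1.02 + (n * 0.15 / m) * ld
print(ldsc_sketch(chi2, ld, n, m))
```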

eGFR heritability estimation

SNP-based heritability was assessed using LD score regression across groups and indicated that eGFR was most heritable among non-diabetic and non-hypertensive individuals in both non-Hispanic whites (h² = 0.15, SE = 0.02) and non-Hispanic blacks (h² = 0.13, SE = 0.04) (Supplementary Table 4). In general, heritability was higher in non-Hispanic whites than in non-Hispanic blacks.

Human kidney genetically predicted gene expression (GPGE)

Common variants from transethnic meta-analyses were used to evaluate the association between eGFR and GPGE in human healthy kidney tissue using S-PrediXcan¹⁷ with the healthy kidney eQTL reference described by Ko et al.²¹ (Fig. 2, Supplementary Data 5). We identified 45 significant results for transethnic analyses; among these, 36 were novel genes that had not been identified by GWAS, either in the original reports or by the GWAS catalog mapping. The strongest result was with a known CKD locus, SPATA5L1 (effect = −2.38; p = 1.01 × 10⁻¹¹⁰). The strongest novel result was at TPRKB, a protein-coding gene for the TP53RK-binding protein (effect = −10.78; p = 3.40 × 10⁻²²). Twenty-five of the 45 genes with significant results were associated with a decrease in kidney function with increasing renal gene expression. Among all significant results, 19 also had SNPs likely to be causal for both gene expression and eGFR differences, as identified through the COLOC approach.
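The summary-statistic gene-level test can be sketched as follows. S-PrediXcan approximates the gene-level statistic as Z_g ≈ Σ_l w_lg · (σ_l / σ_g) · (β_l / se_l), where w_lg are the expression-prediction weights, σ_l is the SNP standard deviation in the reference panel, and σ_g² = wᵀΓw is the variance of predicted expression given the SNP covariance Γ. The function below is a bare illustration of that formula with hypothetical names and inputs; it is not the S-PrediXcan software, which also handles allele alignment, missing SNPs, and model selection.

```python
import numpy as np


def spredixcan_z(weights, snp_cov, gwas_beta, gwas_se):
    """Gene-level Z-score from GWAS summary statistics (sketch).

    weights   : prediction weights for the SNPs in the gene's model
    snp_cov   : covariance matrix of those SNPs in a reference panel
    gwas_beta : per-SNP GWAS effect estimates
    gwas_se   : per-SNP GWAS standard errors
    """
    w = np.asarray(weights, dtype=float)
    cov = np.asarray(snp_cov, dtype=float)
    z_snp = np.asarray(gwas_beta, dtype=float) / np.asarray(gwas_se, dtype=float)
    sigma_snp = np.sqrt(np.diag(cov))
    sigma_gene = np.sqrt(w @ cov @ w)  # SD of predicted expression
    return float(np.sum(w * sigma_snp * z_snp) / sigma_gene)


# With a single-SNP model the gene Z equals the SNP Z, signed by the weight.
print(spredixcan_z([-0.5], [[0.4]], [0.2], [0.05]))
```

A negative Z means that higher predicted expression goes with lower eGFR, which is how the direction of effect for each gene is read.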

We also conducted secondary GPGE analyses stratifying subjects by diabetes status (Supplementary Data 5). SPATA5L1 was the strongest result in analyses of diabetic and non-diabetic participants, with stronger effects estimated in the non-diabetics. In analyses of diabetic participants, we identified seven significant associations. Two of the genes (HLA-H, UBD) identified in analyses of diabetic participants were not observed in overall analyses (combining diabetics and non-diabetics) or in analyses limiting to non-diabetic subjects. Both genes were associated with a decrease in kidney function with increasing expression and neither gene has been previously associated with eGFR. HLA-H has been associated with non-albumin protein levels and UBD with blood proteins^28,29. All significant results in analyses of non-diabetics were also identified in overall analyses combining diabetics and non-diabetics.

Evaluation of GPGE results in murine kidney cells

We also evaluated homologs of genes identified in GPGE analyses using an atlas of kidney cell type-specific RNA expression from single cell sequencing of murine kidney cells²² (Fig. 3, Supplementary Table 5). Cells were clustered into 13 types that represent structural features and other cell types of the kidney. We identified six genes (NARS2, ARNT, TPRKB, RNF152, BST2, and RGS14) across five cell types (podocyte, proximal tubule, collecting duct principal cell, fibroblast, and neutrophil) that had a 1.96 or greater fold increase in gene expression, though none of these were significant after accounting for multiple tests. Cross-referencing protein expression levels in the Human Protein Atlas confirmed findings from murine kidney, including higher expression of TPRKB protein in tubules compared with glomeruli (Supplementary Table 6).

eGFR risk score PheWAS

To assess the potential pleiotropic effects of associated eGFR variants we tested the association between an eGFR weighted GRS (w-GRS) and diseases throughout the phenome using 63 SNPs with independent weights identified previously in the CKDGen Consortium^10,30 (Supplementary Table 7) and clinical phenotype data from 192,868 self-reported/administratively identified non-Hispanic white MVP individuals (Supplementary Table 8). We regressed PheWAS outcomes onto the w-GRS, adjusting for sex and the top ten principal components of ancestry. We identified nine conditions that were significantly associated with the eGFR w-GRS. The majority were diseases related to genitourinary systems (n = 6); the strongest result was with chronic renal failure (p = 3.55 × 10⁻⁵⁷, OR per SD of w-GRS = 0.88). Interesting results included calculus of kidney, calculus of ureter, and urinary calculus, which were all significant and positively associated with the weighted GRS (i.e., increased eGFR [improved kidney function] associated with increased risk of kidney stones). In addition, we observed significant associations with stage III of CKD and renal failure.
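A weighted genetic risk score of this kind is a weighted sum of allele dosages, usually standardized so that effects can be reported per standard deviation. The sketch below is illustrative: the dosages and weights are made up (the study used 63 SNPs with CKDGen-derived weights), the function names are hypothetical, and the covariate-adjusted regression step is omitted.

```python
import numpy as np


def weighted_grs(dosages, weights):
    """Weighted sum of allele dosages; rows are people, columns are SNPs."""
    return np.asarray(dosages, dtype=float) @ np.asarray(weights, dtype=float)


def standardize(scores):
    """Rescale scores to mean 0 and sample SD 1."""
    scores = np.asarray(scores, dtype=float)
    return (scores - scores.mean()) / scores.std(ddof=1)


def or_for_sd_shift(or_per_sd, n_sd):
    """Odds ratio implied by a shift of n_sd standard deviations."""
    return or_per_sd ** n_sd


# Hypothetical dosages for three people at two SNPs.
scores = weighted_grs([[0, 1], [2, 1], [1, 2]], [0.5, -0.2])
print(standardize(scores))
# OR per SD of 0.88 (chronic renal failure) implies about 0.77 for 2 SD.
print(or_for_sd_shift(0.88, 2))
```

Because the odds ratio acts multiplicatively on the odds, a two-SD difference in the score corresponds to 0.88 squared, not twice 0.88.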

Discussion

We present the results from a multi-omic transethnic GWAS of eGFR with discovery and replication in over a million participants. We identified several novel loci using multiple statistical and bioinformatics approaches and validated previously reported loci, with replicated SNPs explaining 2.75% of eGFR variance. Significant strengths of this study include our large discovery and replication populations, transethnic analyses of diverse populations, incorporation of GPGE from a healthy human kidney reference, identification of cell-specific eGFR-associated gene expression from murine kidney, and evaluation of the clinical phenome through PheWAS of an eGFR w-GRS. We also assessed heterogeneity of effect estimates at GWAS loci between diabetic and non-diabetic populations. In addition to identifying novel and known loci, our discovery analyses using the MVP population showed heritability estimates consistent with published studies of eGFR, supporting the quality of our EHR phenotyping³⁰. These data provide insight into the genetic architecture and clinical factors of eGFR.

The limitations of estimating GFR from creatinine and/or cystatin C include low sensitivity in detecting early CKD and poor prediction of the course of CKD. These limitations have been described previously in detail^31,32,33. However, despite these limitations, eGFR is the outcome most often used in genetic association studies of kidney function because of its clinical utility and the translational potential of inferences, as well as the availability of large numbers relative to alternative measures.

We identified several novel common variants associated with eGFR, some of which may tag genes implicated in Mendelian forms of kidney disease. For instance, we detected a common (and reportedly benign) missense variant in PKD1 (polycystin 1) associated with eGFR. Mutations in PKD1 can cause autosomal dominant polycystic kidney disease (OMIM #173900). Variants in NOS3 (nitric oxide synthase 3), where we also detected an association with eGFR, may act as a modifier among those with polycystic kidney disease³⁴ through the nitric oxide pathway. We also identified associations near the NRIP1 gene (nuclear receptor interacting protein 1), with supporting GPGE evidence. NRIP1 mutations have recently been implicated in congenital anomalies of the kidney and urinary tract (OMIM #610805), which are a leading cause of CKD among those under 30³⁵. It has been suggested that mutations in this gene cause CKD through disruption of retinoic acid signaling.

Novel SNP associations were found at or near genes previously reported in GWAS of other renal and urological phenotypes. Specifically, we identified loci previously associated with urate/urea/uric acid levels (KLHDC7A,³⁶ MTX1,³⁷ RREB1,³⁸ and MIR1538^28,38), urine albumin-to-creatinine ratio (C9orf3³⁹), IgA nephropathy (MTMR3⁴⁰), and frequency of urinary tract infection (ZNF165⁴¹). We also observed an association at a pharmacogenetic locus strongly related to tacrolimus dose in renal transplant patients⁴², as well as development of new-onset diabetes after transplant⁴². However, transplant patients were excluded from this analysis.

Loci implicated in platelet and red blood cell phenotypes were also represented among novel eGFR-associated SNPs, including TET2, HIST1H1C, UBE2H, TRIB1-LOC105375746, FAM53B, TPM1, NRIP1, A4GALT, DOCK7, PLCB1, and SERTAD2^28,43. It has been previously shown that mean platelet volume, platelet counts, and platelet distribution width are associated with eGFR^44,45. Higher values of mean platelet volume are observed among CKD patients⁴⁶, while platelet count and distribution width decrease along with eGFR. CKD patients also demonstrate an attenuated response to antiplatelet therapy relative to those without renal insufficiency⁴⁷.

Ischemic stroke is a common secondary occurrence in CKD patients, which may be due to shared risk factors⁴⁸. Reduced kidney function among stroke patients is a risk factor for mortality^49,50. We identified four novel eGFR-associated loci which have been previously associated with ischemic stroke in other studies, specifically USP38, ZFHX3, PMF1-BGLAP, and TTBK1^51,52,53. A previous analysis of the APOL1 high risk variants showed increased risk of ischemic stroke⁵⁴. Combined with our results, this suggests that the genetic architecture of kidney function may predispose to other small vessel disease outcomes.

We examined the APOL1 locus extensively for association with eGFR in black MVP participants, anticipating a relationship with eGFR, and did not observe an association with an additive genotype coding. Multiple studies in which participants had predominantly normal kidney function have suggested that APOL1 is a kidney disease gene, yet it has not been significantly associated with eGFR^55,56,57. This suggests that the lack of association between APOL1 genetic variants and eGFR is likely not due to low power, as has been true in previous studies of African-descent populations.

We evaluated our transethnic sentinel SNPs for consistency of effect size across racial groups and found that in general effects were of similar direction, with only three variants being of opposite effect in non-Hispanic blacks (Supplementary Fig. 1). These included two known variants: rs532086 at C2, rs2235826 at PCK2 and a novel variant, rs12646572 at LDB2. Effect sizes of these three SNPs in non-Hispanic blacks were small (<0.01) and likely represented null effects. Comparison of variants identified in non-Hispanic blacks with non-Hispanic whites demonstrated consistency of direction across known variants with effect sizes generally similar or larger in non-Hispanic blacks. When eGFR effects across the entire genome were compared between white and black MVP participants using the Popcorn software package⁵⁸, the genetic effects were highly correlated and not significantly different from no difference (r_ge = 0.94, p = 0.45). Novel variants detected in non-Hispanic blacks were not observed in non-Hispanic whites. Our analysis of eGFR in non-Hispanic whites was largely consistent with previous European-ancestry cohorts, replicating many strongly associated loci among our top hits, such as UMOD, GATM/SPATA5L1, SHROOM3, CPS1, PRKAG2, and SLC31A1^10,11,59.

An interesting finding among the non-Hispanic blacks is the strong association with rs334 (p = 1.54 × 10⁻¹⁸, SNPTEST info score = 0.75) in HBB, which encodes the sickle cell allele of beta globin. This variant has been previously associated with increased urinary albumin-to-creatinine ratio²⁶. The sickling allele was present at a frequency of 5.7% among MVP non-Hispanic blacks, despite sickle cell trait carriers being restricted from serving in certain roles in the United States armed forces⁶⁰. This observed frequency is higher than that in the African Americans from the southwestern United States in 1000 Genomes (MAF = 1.6%), but substantially lower than in continental African populations (MAF 10.1–13.9%). Screening for sickle cell in the United States military has undergone several changes since the initial policy was introduced in 1969, in which the Navy began screening all recruits^60,61. Currently universal sickle cell screening is performed prior to military accession in the U.S. Navy, Air Force, and Marine Corps, while in the U.S. Army screening is performed only in specific scenarios related to deployment and certain occupational hazards^62,63. Similarly, in other countries’ militaries, sickle cell allele carriers may be barred from diving, submarine, and aircrew service⁶⁴. A recent study found that sickle cell trait was not associated with increased risk of mortality among black soldiers in the U.S. Army, but that it was associated with increased risk of exertional rhabdomyolysis (breakdown of skeletal-muscle tissue)⁶⁵. However, the effect of sickle cell trait was less than that of recent statin use. Our results suggest that this allele also negatively affects kidney function, which may indicate that carriers should be monitored clinically for decreases in eGFR.

Associations were also identified near loci previously implicated in diabetes and insulin/glucose homeostasis (CDKAL1, SLC9B2, RREB1, RAI1, and PPARG)^28,66,67,68. CDKAL1, RREB1, and PPARG were not significant in either arm of the diabetes stratified analysis, while RAI1 and SLC9B2 were significant in the non-diabetes group. We identified 36 SNPs that reached genome-wide significance in our diabetes strata; 12 were novel and not significant in non-diabetics, but were observed in the overall transethnic analysis combining diabetic groups. We also did an exploratory comparison of the association effect size in the subjects with and without diabetes. The effects were generally consistent across groups (Supplementary Fig. 3). Uromodulin (UMOD) has been recently detected in a GWAS of eGFR that was limited to diabetic subjects⁶⁹. We observed similar associations for UMOD in separate analyses within diabetic and non-diabetic participants (sentinel SNP [A effect allele] effect size for UMOD diabetic 0.11 and non-diabetic 0.07), suggesting the association is not specific to diabetics. UMOD is known to affect kidney function and mutations in that gene cause several syndromic kidney disorders. Few variants have reached genome-wide significance for studies of eGFR in patients with diabetes, potentially due to power and heterogeneity of renal disease phenotypes in diabetics^70,71.

We leveraged the human healthy kidney eQTL reference first described by Ko et al.²¹ to investigate the relationship between GPGE and eGFR using S-PrediXcan. We detected associations with 45 genes, 18 of which are not annotated for any trait in the GWAS catalog, either by the original report or by the catalog mapping. When we restricted the GWAS catalog to genes identified by studies of eGFR, CKD, kidney disease, urinary albumin to creatinine ratio, or urinary metabolites, 36 of the 45 genes we detected were novel. The most statistically significant association with GPGE was for the spermatogenesis associated protein 5-like 1 gene (SPATA5L1). This locus has been previously reported to be associated with eGFR^10,72. However, there are several genes in the region and Köttgen et al.⁷² suggested the nearby gene glycine amidinotransferase (GATM), based on a pathway-level connection of that gene to creatinine biosynthesis. The sentinel non-coding SNP rs2467853 in this region is within the SPATA5L1 gene and is more strongly associated with SPATA5L1 expression in human kidney than with GATM expression (rs2467853 SPATA5L1 beta = 0.77, p = 8.2 × 10⁻⁸; GATM beta = 2 × 10⁻⁴, p = 0.99)⁷³. In our evaluation of RNAseq and genotypes in cis with GATM from healthy human kidney, no models could be constructed that sufficiently predict GATM expression to use in GPGE analysis. We detected an association between GPGE and eGFR for SPATA5L1 and not for GATM, as well as a COLOC signal with a high posterior probability (P4 = 0.98) of several SNPs in the region being causal for both SPATA5L1 expression levels and the eGFR association signal, and no COLOC or GPGE evidence supporting GATM. The mechanism underlying the relationship between SPATA5L1 gene product abundance and kidney function is unclear.

The most statistically significant novel gene detected in S-PrediXcan analysis was the TP53RK-binding protein (TPRKB), which is significantly highly expressed in murine proximal tubules. Consanguineous families with homozygous missense mutations in TPRKB have been diagnosed with Galloway-Mowat syndrome-5 (OMIM #617731), in which affected people exhibit early-onset nephrotic syndrome, as well as dysmorphologies and delayed psychomotor development⁷⁴. The NARS2 gene is also novel and significantly highly expressed in murine podocytes. Mutations of this gene have been observed in a phenotype similar to Alpers syndrome that included renal dysfunction (OMIM #612803)⁷⁵.

It is also notable that UMOD from the chr16p12.3 region was not associated in the GPGE association analysis. No SNPs in the UMOD region were sufficiently associated with UMOD expression in the human kidney reference data to create predictive models for S-PrediXcan analysis. This suggests that the relationship between UMOD genotypes and kidney function is mediated by a mechanism other than gene expression, or that the proportion of cells from which UMOD is expressed is small and difficult to detect in bulk RNA sequencing of whole kidney. The sentinel SNP nearby UMOD from our study (rs77924615) is a nominally significant eQTL within NephQTL⁷⁶ (a database of nephrotic syndrome human kidneys) in the kidney tubules (p = 0.0014) and has a non-significant effect in the glomerulus (p = 0.079). The conditionally independently associated SNP rs111285796 in UMOD is also nominally associated in NephQTL with UMOD expression in tubules (p = 3.6 × 10⁻⁴) and less associated with other genes at this locus (PDILT p = 0.047), and less significant for glomerulus (UMOD p = 0.021). Within a healthy kidney eQTL atlas⁷⁷ (Online Methods) rs111285796 is not associated with UMOD in tubules (p = 0.214) but is nominally associated in glomerulus (p = 0.004). This demonstrates that there are important differences between the nephrotic and healthy kidney reference data with regard to cis-eQTL effects at this locus and suggests that SNPs in this locus influence gene expression in a context-dependent manner.

We constructed a w-GRS of eGFR and performed a PheWAS using EHRs from 192,868 non-Hispanic white MVP participants. To our knowledge this is the first PheWAS of an eGFR w-GRS. We observed several associations between our w-GRS and kidney disease-related phenotypes. The most significant PheWAS association (p = 3.55 × 10⁻⁵⁷) was with chronic renal failure, followed by related renal failure phenotypes, and CKD stage III (p = 6.13 × 10⁻²⁴), all diseases defined by eGFR. Our w-GRS was associated with hypertensive kidney disease but not essential hypertension. In addition, we observed a positive association between kidney stones and our w-GRS, a relationship that has not been previously reported with eGFR. Our results are consistent with a prior study that reported an association between increased eGFR and hypercalciuria⁷⁸. It is known that some individual genes that influence eGFR are also important for kidney stone formation. For example, decreased production of UMOD is associated with kidney stone formation, as uromodulin impairs the aggregation of calcium oxalate crystals⁷⁹, and with hypertension, arterial stiffness, and CKD.

In conclusion, we identified multiple novel loci associated with eGFR levels after conducting a transethnic GWAS and several post-GWAS bioinformatic analyses. We identified several novel loci and genes with additional confirmation and refinement through GPGE analyses, including cell-specific expression. We observed consistent effects across racial groups for associated GWAS loci. Furthermore, we evaluated the clinical phenome associated with our eGFR w-GRS and identified associations among diseases related to kidney and endocrine disease phenotypes. Overall, our study leveraged a racially diverse clinical population to identify novel eGFR loci common across racial groups and remapped previously reported loci using GPGE, leading to a greater understanding of the genetic architecture of kidney function.

Methods

Ethics statement

The central Veterans Affairs (VA) institutional review board (IRB) and site-specific IRBs approved the Million Veteran Program study.

The Million Veteran Program

The Million Veteran Program (MVP) is a large cohort of fully consented participants who were recruited from the patient populations of 63 VA medical facilities. MVP recruitment was initiated in 2011 and was conducted in person after participants responded to an invitation letter. Full MVP participation includes completing baseline and lifestyle surveys, providing access to medical records, providing a blood sample, and giving permission for recontact. Informed consent is provided after counseling by research staff and access to informational materials. All study materials and protocols are approved by the VA Central Institutional Review Board. Genotyping was performed on the Affymetrix Axiom Biobank Array chip, with custom content included to provide better coverage of African and Hispanic haplotypes. All samples are de-identified for research purposes, and investigators are not permitted or able to link study data to a participant’s identity.

Blood samples were obtained from MVP participants by phlebotomists and shipped to a central biorepository in Boston, Massachusetts for biobanking. DNA was extracted and provided to two external sites for genotyping. Standardized quality control and genotype calling algorithms using the Affymetrix Power Tools Suite (v1.18) were applied to the data in batches by the MVP genomics working group. Quality control pipelines included the exclusion of duplicate samples, those with discordant reported and genotyped sex, and samples with more heterozygosity than expected. One of each pair of related individuals, as measured by the KING software⁸⁰ (halfway between 2nd and 3rd degree relatives or closer), was excluded from genetic analysis.

Prior to imputation, variants that were poorly called or that deviated from their expected allele frequency based on reference data from the 1000 Genomes Project⁸¹ were excluded. After pre-phasing using EAGLE v2⁸², genotypes from the 1000 Genomes Project phase 3, version 5 reference panel were imputed into Million Veteran Program (MVP) participants via Minimac3 software⁸³. Principal component analysis was performed using FlashPCA⁸⁴ to generate the top ten genetic principal components explaining the greatest variability.

Information on race and ethnicity (Hispanic: Yes or No) was extracted from standardized survey forms (self-report) or, when information from self-report was unavailable, from the corporate data warehouse (CDW) or observational medical outcomes partnership (OMOP) data. These sources were combined to form administratively assigned variables. Race and ethnicity categories used in this study included non-Hispanic whites and non-Hispanic blacks.

Baseline estimated glomerular filtration rate (eGFR) was determined using the creatinine measurement closest to enrollment. For the vast majority of the patients creatinine was measured using the IDMS reference method. GFR was calculated using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) serum creatinine equation⁸⁵. The eGFR CKD-EPI equation is (1):

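$$\mathrm{GFR} = 141 \times \min (\mathrm{Scr} \times \kappa ^{ - 1},1)^\alpha \times \max (\mathrm{Scr} \times \kappa ^{ - 1},1)^{ - 1.209} \times 0.993^{\mathrm{Age}} \times 1.018\;[\mathrm{if\;female}] \times 1.159\;[\mathrm{if\;Black}]$$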

where Scr is serum creatinine (mg/dL), κ is 0.7 for females and 0.9 for males, α is −0.329 for females and −0.411 for males, min indicates the minimum of Scr × κ⁻¹ or 1, and max indicates the maximum of Scr × κ⁻¹ or 1. We excluded individuals who were on dialysis, had a kidney transplant, were amputees, were on HIV medications that may increase creatinine clearance, had BMI < 18, or had Scr values less than 0.4 mg/dL, as these values may have represented lab errors. Diabetic patients were defined as those subjects on any anti-diabetic medications or those who had at least two outpatient ICD-9 codes for diabetes (ICD-9 250.*) on separate dates within 365 days prior to enrollment. Subjects lacking codes and not on anti-diabetic medications were categorized as non-diabetic. Hypertension was defined as the presence of a hypertension code, use of an antihypertensive drug, or two systolic blood pressure (SBP) measurements > 140 mmHg and/or two diastolic blood pressure (DBP) measurements > 90 mmHg. BMI was calculated as weight (kg) × (height (m))⁻², using the weight measurement closest to the GFR measure and the modal height.
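As an illustration only, Eq. (1) and the BMI formula can be written as short functions. This is a minimal sketch, not the code used in the study; the function and argument names are assumptions introduced here.

```python
def ckd_epi_egfr(scr_mg_dl: float, age_years: float, female: bool, black: bool) -> float:
    """CKD-EPI creatinine equation as given in Eq. (1); result in ml/min/1.73 m^2."""
    kappa = 0.7 if female else 0.9          # sex-specific creatinine scaling
    alpha = -0.329 if female else -0.411    # sex-specific exponent below kappa
    ratio = scr_mg_dl / kappa               # Scr x kappa^-1
    egfr = 141.0 * min(ratio, 1.0) ** alpha * max(ratio, 1.0) ** -1.209 * 0.993 ** age_years
    if female:
        egfr *= 1.018
    if black:
        egfr *= 1.159
    return egfr


def body_mass_index(weight_kg: float, height_m: float) -> float:
    """BMI as weight (kg) x height (m)^-2."""
    return weight_kg / height_m ** 2
```

For a fixed age, sex, and race, the estimate falls as serum creatinine rises, so `ckd_epi_egfr(1.5, 60, False, False)` is lower than `ckd_epi_egfr(1.0, 60, False, False)`.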

MVP GWAS analysis

For the MVP GWAS we performed linear regression association tests with additive models for untransformed eGFR. We adjusted the linear regression models for age at eGFR measure, age², sex, BMI, and the top ten genetic principal components (PCs). All primary analyses for the MVP were conducted by stratum of administratively assigned race, as well as by the presence or absence of diabetes and hypertension. All regression-based analyses were conducted in SNPTEST-v2.5.4-beta⁸⁶. Inference was limited to genotyped and imputed variants with SNPTEST Info scores of 0.4 or higher, with Hardy-Weinberg equilibrium p-value > 5 × 10⁻⁸ for common variant analysis (MAF > 0.1). Meta-analyses across race and strata were performed using fixed-effects, inverse variance-weighted meta-analysis implemented in METAL⁸⁷.
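The fixed-effects, inverse variance-weighted combination of per-stratum estimates can be sketched as follows. This shows the standard estimator for a single SNP and is not METAL's implementation; the function name and inputs are hypothetical.

```python
import math
from typing import Sequence, Tuple


def ivw_fixed_effects(betas: Sequence[float], ses: Sequence[float]) -> Tuple[float, float, float]:
    """Combine per-stratum effect estimates for one SNP.

    Each stratum is weighted by 1 / SE^2; returns (pooled beta, pooled SE, z-score).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")
    weights = [1.0 / se ** 2 for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se
```

Strata with smaller standard errors, typically the larger ones, dominate the pooled estimate.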

Genomic inflation factors were calculated, and λ_GC for the discovery from MVP were 1.11, 1.16, 1.15, and 1.01 for the diabetic hypertensive participants, non-diabetic hypertensive participants, non-diabetic non-hypertensive participants, and diabetic non-hypertensive participants in whites, respectively, 1.03, 1.03, 1.02, and 0.99 for the diabetic hypertensive participants, non-diabetic hypertensive participants, non-diabetic non-hypertensive participants, and diabetic non-hypertensive participants in blacks, respectively, and 1.36 in the overall discovery analysis (Supplementary Fig. 3).
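For readers unfamiliar with λ_GC, a minimal sketch of the usual definition is shown below: the median observed 1-df χ² statistic divided by the median of the null χ² distribution with 1 degree of freedom. It assumes per-SNP z-scores as input; the function name is an assumption, and the study's exact computation is not described beyond the values reported above.

```python
import statistics
from typing import Iterable

# Median of a chi-square distribution with 1 df, obtained as the squared
# 75th percentile of the standard normal distribution (about 0.455).
CHI2_1DF_MEDIAN = statistics.NormalDist().inv_cdf(0.75) ** 2


def genomic_inflation(z_scores: Iterable[float]) -> float:
    """lambda_GC = median(z^2) / median of the null chi-square(1) distribution."""
    chi2 = [z * z for z in z_scores]
    return statistics.median(chi2) / CHI2_1DF_MEDIAN
```

A value near 1 indicates test statistics that follow the null distribution at the median; larger values can reflect either confounding or polygenicity.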

Genome-wide significant loci were defined by one or more SNPs attaining genome-wide significance (p < 5 × 10⁻⁸) which were at least 1 Mb away from other signals, or if within that distance, were not in linkage disequilibrium (r² < 0.1). Sentinel variants and up to two proxies (r² > 0.8 and prioritized by correlation and then by distance from lead SNP) were selected for replication.

We approximated the proportion of variance explained in the transethnic meta-analysis by all independent sentinel SNPs (novel and known) and novel SNPs, separately. Variance explained by each SNP was first estimated by the following equation (2):

$$r^2 = \chi ^2n^{ - 1}.$$

Summing these per-SNP estimates over the independent common sentinel SNPs gave the proportion of eGFR variance explained by all SNPs and by novel SNPs. The transformation of the relationship between the t-statistic and r² into the relationship between the χ² statistic and r² is described in Supplementary Note 1.
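Equation (2) and the summation over independent sentinel SNPs can be sketched as below. The per-SNP sample size `n` and the input layout are assumptions made for illustration.

```python
from typing import Iterable, Tuple


def variance_explained(chi2: float, n: int) -> float:
    """Approximate r^2 for one SNP as chi-square / n, following Eq. (2)."""
    return chi2 / n


def total_variance_explained(snps: Iterable[Tuple[float, int]]) -> float:
    """Sum r^2 over independent sentinel SNPs given (chi-square, n) pairs."""
    return sum(variance_explained(chi2, n) for chi2, n in snps)
```

For example, a SNP with χ² = 100 in 10,000 participants would account for about 1% of trait variance under this approximation.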

CKDGen transethnic and European ancestry GWAS meta-analyses

The Chronic Kidney Disease Genetics (CKDGen) Consortium is a collaborative effort that includes mainly population-based studies from different ethnicities to perform GWAS of renal function traits aimed at uncovering the genetic basis of CKD. We interrogated data from the most recent CKDGen meta-analysis that included 121 GWAS, totaling 765,289 individuals of European (n = 567,401), East Asian (n = 165,726), South Asian (n = 13,359), African American (n = 13,842), and Hispanic (n = 4961) ancestries⁸⁸. Following a centralized analysis plan, participating studies estimated eGFR from serum creatinine using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI)⁸⁵ equation for adults or the Schwartz⁸⁹ equation for children aged ≤18 years. eGFR values were winsorized at 15 and 200 ml/min/1.73 m². Studies had a median mean eGFR of 89 ml/min/1.73 m² (interquartile range: 81–94) and a median mean age of 54 years. Overall, 50% of participants were female. Each study performed genotype imputation based on the Haplotype Reference Consortium (HRC) v1.1 or the 1000 Genomes Project phase 3 v5 ALL or phase 1 v3 ALL panels. Sex- and age-adjusted linear regression models were fitted to the logarithm of eGFR. GWAS were performed by regressing the residuals of the linear models on SNP dosage levels, assuming additive genetic effects. Family-based studies accounted for relatedness by including kinship estimates or genetic principal components in the linear models. After selecting SNPs with imputation quality score >0.6 and minor allele count >10, genomic control correction was applied in case of an inflation factor λ > 1. GWAS were pooled using fixed effects inverse-variance weighted meta-analysis. After meta-analysis, SNPs that were not present in at least 50% of the studies were discarded, leaving 8,221,591 variants from ≥61 GWAS across all ancestries and 8,834,748 variants from ≥42 GWAS in the European ancestry subset. No further genomic control correction was applied. Genome-wide significance level was set at 5 × 10⁻⁸.

Summary statistics for selected variants from the CKDGen Consortium were used for replication. A total of 105 variants were assessed for the non-Hispanic white-only meta-analysis in a maximum of 567,453 individuals from up to 84 studies. As many as 120 studies provided summary statistics for 122 variants for a maximum of 765,346 participants for the all-ancestry analysis. We performed a sample-size weighted z-score linear combination meta-analysis of MVP and CKDGen results (Supplementary Data 1).
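A sample-size weighted z-score combination is useful here because MVP analyzed untransformed eGFR whereas CKDGen analyzed log-transformed eGFR, so the effect sizes are on different scales. A minimal sketch of the standard weighting (each study weighted by the square root of its sample size) follows; the function name is hypothetical, and z-scores are assumed to be aligned to the same effect allele.

```python
import math
from typing import Sequence


def sample_size_weighted_z(z_scores: Sequence[float], sample_sizes: Sequence[int]) -> float:
    """Combine z-scores across studies with weights w_i = sqrt(n_i).

    Z = sum(w_i * z_i) / sqrt(sum(w_i^2))
    """
    if len(z_scores) != len(sample_sizes) or not z_scores:
        raise ValueError("z_scores and sample_sizes must be non-empty and of equal length")
    weights = [math.sqrt(n) for n in sample_sizes]
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    return numerator / math.sqrt(sum(w * w for w in weights))
```

Two equally sized studies that each report z = 2 combine to z = 2√2 ≈ 2.83, reflecting the gain in evidence from the larger total sample.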

DEPICT methods

Enrichment analyses in DEPICT⁹⁰ were conducted using significant GWAS sentinel SNPs from three separate analyses as input: (1) transethnic analyses of all MVP subjects, (2) transethnic analyses of MVP subjects with DM, and (3) transethnic analyses of MVP subjects without DM. DEPICT incorporates predefined phenotypic gene sets from multiple databases with expression microarray data (Affymetrix HGU133a2.0) from more than 37k subjects to provide gene sets with high expression for Medical Subject Heading (MeSH) tissue and cell type annotations. DEPICT output includes enrichment p-values for both tissue level and gene-set features, as well as an indicator for whether each enrichment test had an FDR q-value of <0.05.

LD score regression and Popcorn

Subsequently, we utilized the LD Score Regression approach²⁷ in each contributing group to ascertain whether inflation was due to residual population stratification or polygenicity. Among whites we used 1000 Genomes precomputed LD Scores and for non-Hispanic blacks we calculated LD Scores in 2217 African American participants from BioVU using Illumina Mega Array (Illumina, Inc) genotype data imputed to 1000 Genomes phase 3 haplotypes. The intercepts in the MVP non-Hispanic white discovery analysis datasets were 1.02 (SE = 0.01), 1.04 (SE = 0.01), 1.03 (SE = 0.01), and 0.99 (SE = 0.01) for the diabetic hypertensive participants, non-diabetic hypertensive participants, non-diabetic non-hypertensive participants, and diabetic non-hypertensive participants, respectively, suggesting that little of the observed inflation in the lambda is due to population stratification. Similarly, intercepts in the MVP non-Hispanic black discovery analysis datasets were 1.01 (SE = 0.003), 1.02 (SE = 0.004), 1.00 (SE = 0.003), and 1.00 (SE = 0.003) for the diabetic hypertensive participants, non-diabetic hypertensive participants, non-diabetic non-hypertensive participants, and diabetic non-hypertensive participants, respectively. Heritability was also assessed within each stratum of the MVP data using LD Score regression. We also used the related Popcorn⁵⁸ software to evaluate the transethnic genetic effect correlation across the entire genome.

Conditional analysis

For conditional analysis of common variants we used two parallel approaches implemented in the genome-wide complex traits analysis (GCTA) software:²⁴ (i) genome-wide joint conditional analysis; and (ii) locus-specific conditional analysis.

(i) Genome-wide joint conditional analysis

Conditional analysis was conducted within GCTA software, using the COJO method, which performs iterative conditional and joint analysis simultaneously with stepwise model selection²⁴. Summary statistics from the GWAS discovery analysis in whites were used as the input summary data, and the imputed, hard-called BioVU EA genetic data (n = 19,726), in PLINK format, were used as the reference genotype-level data. Combination of these two input data files restricted the GCTA analysis to the imputed SNPs in common to the GWAS discovery meta-analysis (which was itself restricted to MAF > 1%). Within the BioVU genetic data, LD was calculated between all pairs of SNPs. A p-value cutoff of 5 × 10⁻⁸ was used as the selection threshold within GCTA, and the collinearity threshold was set at the default value of 0.9, so that SNPs are not selected if the multiple regression with the current SNPs in the model has R² ≥ 0.9. After combining results across all 22 chromosomes, each trait-specific analysis resulted in a distinct set of jointly independent significant signals. Hence all final SNPs are pairwise-LD-independent.

(ii) Locus-specific conditional analysis

Analysis of each significant or previously reported locus was performed across all regional (1 Mb locus region centered ± 500 kb around the sentinel SNP) imputed SNPs with MAF ≥ 1%, conditioning on the sentinel SNP, using the --cojo-cond command in GCTA. As in the genome-wide approach, GWAS discovery meta-analysis results were used as the input summary data, with the BioVU EA imputed genetic data used as the reference PLINK dataset for LD computation. This approach provides conditional analysis results for each SNP within the implicated regions after conditioning on the sentinel SNP(s). These results yield a list of potential secondary SNPs which are evaluated according to the criteria below to identify those which are both significant and independent:

(a) p < 5 × 10⁻⁸ from the original (unconditioned) GWAS discovery analysis (i.e., the SNP is significantly associated with eGFR itself)

(b) p_c < 5 × 10⁻⁸ from the conditional analysis (i.e., the SNP is significantly associated with eGFR after conditioning on the sentinel/published SNPs)

(c) independent of any of the sentinel SNPs (r² < 0.1)

Significant independent SNPs meeting the above criteria, from any locus, were combined into a single list. This list is more comprehensive than that from approach (i), as it contains all possible secondary SNPs, rather than a single lead SNP at each independent signal. These secondary SNPs may be in LD with each other within a given locus.

The outputs from the two different approaches were then combined to identify those SNPs which are genome-wide significant in the discovery dataset and jointly independent on a genome-wide level. For robustness, a secondary signal was only claimed if the SNP is validated from both approaches.
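The selection logic, criteria (a) to (c) followed by the requirement that a signal appear in both approaches, can be sketched as a simple filter. The record layout, field names, and rsIDs used in any example are hypothetical; the p-values and r² values would come from the GCTA output and the LD reference, which this sketch does not compute.

```python
from dataclasses import dataclass
from typing import Iterable, List

GENOME_WIDE_P = 5e-8          # threshold for criteria (a) and (b)
MAX_R2_WITH_SENTINEL = 0.1    # threshold for criterion (c)


@dataclass(frozen=True)
class CandidateSnp:
    rsid: str
    p_unconditional: float        # p from the original GWAS discovery analysis
    p_conditional: float          # p_c after conditioning on the sentinel SNP(s)
    max_r2_with_sentinels: float  # highest r^2 with any sentinel SNP


def passes_locus_criteria(snp: CandidateSnp) -> bool:
    """Apply criteria (a), (b), and (c) from the locus-specific analysis."""
    return (
        snp.p_unconditional < GENOME_WIDE_P
        and snp.p_conditional < GENOME_WIDE_P
        and snp.max_r2_with_sentinels < MAX_R2_WITH_SENTINEL
    )


def validated_secondary_signals(
    joint_rsids: Iterable[str], locus_candidates: Iterable[CandidateSnp]
) -> List[str]:
    """Keep SNPs selected by the genome-wide joint analysis AND passing (a)-(c)."""
    locus_hits = {snp.rsid for snp in locus_candidates if passes_locus_criteria(snp)}
    return sorted(locus_hits & set(joint_rsids))
```

Requiring agreement between the two approaches is conservative: a SNP that satisfies only one of them is not reported as a secondary signal.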

S-PrediXcan analysis

Genetically predicted gene expression was evaluated for the discovery GWAS with S-PrediXcan⁹, a gene-level approach which estimates the genetically determined component of gene expression in each tissue and tests it for association with SNP-level summary statistics. We utilized a human kidney gene expression atlas published by Ko et al.²¹ to conduct our genetically predicted gene expression analyses with our eGFR outcome.

Human kidney compartment cis-eQTL analysis

Human kidney tissues (n = 151) were microdissected in RNAlater to obtain glomerular and tubular renal compartments. An unbiased review of the tissue section was performed by a renal pathologist by scoring of multiple parameters. One microgram of total RNA from each compartment was used for isolation of poly(A) purified mRNA using the Illumina TruSeq RNA Preparation Kit. Samples were sequenced in single-end 100 bp reads, and annotated RNA counts were calculated by Illumina’s CASAVA 1.8.2 to generate fastq. Alignment of trimmed reads to the Gencode human genome (GRCh37) was performed using STAR-2.4.1d⁹¹,⁹². RNA-seq data are available at Gene Expression Omnibus (GSE115098).

Compartmental eQTL data sets were generated from 121 tubule samples and 119 glomerulus samples, respectively⁷⁷. The cis expression window was defined as 1 Mb on either side of the gene transcriptional start site. eQTL analyses were performed using linear regression in FastQTL software⁹³ under an additive model and adjusted for six genetic PCs.

Murine kidney single cell sequencing analysis

Homologs of human genes detected in S-PrediXcan analyses of healthy human kidney eQTL reference and GWAS summary statistics were further investigated for kidney cell type-specific RNA expression using single cell sequencing in murine kidney cells. Cells were clustered by expression profiles into groups representing kidney structural features and additional cell types found in the kidney²². Confirmation of murine kidney findings was performed through cross-reference of protein expression levels available in the Human Protein Atlas⁹⁴.

Genetic risk score construction

We constructed a weighted genetic risk score (w-GRS) for eGFR by calculating a linear combination of weights derived from previous publications from CKDGen¹⁰,³⁰ for index SNPs at each of the 63 statistically significant replicated loci (Supplementary Table 7). GRSs were constructed for self-reported/administratively assigned non-Hispanic white individuals in the MVP only.
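A weighted genetic risk score of this kind is a weighted sum of allele dosages. The sketch below assumes that each dosage counts the eGFR-increasing allele (0 to 2) and that weights are the published per-allele effect sizes; the function name and the dictionary layout are assumptions for illustration.

```python
from typing import Mapping


def weighted_grs(dosages: Mapping[str, float], weights: Mapping[str, float]) -> float:
    """Return sum over index SNPs of weight x dosage for one individual.

    dosages: rsID -> dosage of the eGFR-increasing allele (0..2, may be fractional
             for imputed genotypes)
    weights: rsID -> per-allele effect size from the external study
    """
    missing = set(weights) - set(dosages)
    if missing:
        raise KeyError(f"missing dosages for: {sorted(missing)}")
    return sum(weights[rsid] * dosages[rsid] for rsid in weights)
```

Using externally derived weights keeps the score independent of the sample in which it is later tested.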

Phenome-wide association study analysis

We performed a phenome-wide association study (PheWAS) of GRS in MVP non-Hispanic whites (N_max = 192,868), leveraging the full catalog of ICD-9 diagnosis codes. We used logistic regression to separately model each of 1813 PheWAS traits as a function of GRS, adjusted for sex and ten PCs. We report the results from these analyses as odds ratios where the estimate is the average change in odds of the PheWAS trait per weighted eGFR-increasing allele. Multiple testing thresholds for significance were set to p ≤ 2.75 × 10⁻⁵ (0.05/1813).
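The multiple-testing threshold and the conversion of a logistic regression coefficient to an odds ratio can be illustrated as follows. This is a sketch with hypothetical function names; it does not fit the regression models themselves.

```python
import math


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests


def odds_ratio(log_odds_per_allele: float) -> float:
    """Convert a logistic regression coefficient to an odds ratio."""
    return math.exp(log_odds_per_allele)


# 0.05 / 1813 tests, reported in the text as 2.75 x 10^-5
PHEWAS_THRESHOLD = bonferroni_threshold(0.05, 1813)
```

A coefficient of zero corresponds to an odds ratio of 1, that is, no change in the odds of the trait per weighted eGFR-increasing allele.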

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Full summary statistics relating to the Million Veteran Program (MVP) studies are available at dbGAP accession phs001672.v2.p1. Statistically significant reports for S-PrediXcan results for human kidney tissues and PheWAS analyses for eGFR are made available in the supplementary data and tables. Human Kidney RNA-seq data are available at Gene Expression Omnibus (GSE115098).

References

Levey, A. S. & Coresh, J. Chronic kidney disease. Lancet 379, 165–180 (2012).
Go, A. S., Chertow, G. M., Fan, D., McCulloch, C. E. & Hsu, C. Y. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N. Engl. J. Med. 351, 1296–1305 (2004).
Tonelli, M. et al. Risk of coronary events in people with chronic kidney disease compared with those with diabetes: a population-level cohort study. Lancet 380, 807–814 (2012).
United States Renal Data System. USRDS Annual Data Report: Epidemiology of Kidney Disease in the United States. (National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, 2017).
Centers for Disease Control and Prevention. National Chronic Kidney Disease Fact Sheet. (US Department of Health and Human Services, Centers for Disease Control and Prevention, Atlanta, GA, 2017).
Matsushita, K. et al. Estimated glomerular filtration rate and albuminuria for prediction of cardiovascular outcomes: a collaborative meta-analysis of individual participant data. Lancet Diabetes Endocrinol. 3, 514–525 (2015).
Gansevoort, R. T. et al. Chronic kidney disease and cardiovascular risk: epidemiology, mechanisms, and prevention. Lancet 382, 339–352 (2013).
Matsushita, K. et al. Association of estimated glomerular filtration rate and albuminuria with all-cause and cardiovascular mortality in general population cohorts: a collaborative meta-analysis. Lancet 375, 2073–2081 (2010).
Satko, S. G., Sedor, J. R., Iyengar, S. K. & Freedman, B. I. Familial clustering of chronic kidney disease. Semin. Dial. 20, 229–236 (2007).
Pattaro, C. et al. Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun. 7, 10023 (2016).
Gorski, M. et al. Genome-wide association study of kidney function decline in individuals of European descent. Kidney Int. 87, 1017–1029 (2015).
Li, M. et al. SOS2 and ACP1 loci identified through large-scale exome chip analysis regulate kidney development and function. J. Am. Soc. Nephrol. 28, 981–994 (2017).
Wuttke, M. & Kottgen, A. Insights into kidney diseases from genome-wide association studies. Nat. Rev. Nephrol. 12, 549–562 (2016).
Lee, D. et al. A method to predict the impact of regulatory variants from DNA sequence. Nat. Genet. 47, 955–961 (2015).
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
Gusev, A. et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am. J. Hum. Genet. 95, 535–552 (2014).
Barbeira, A. N. et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9, 1825 (2018).
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245–252 (2016).
Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016).
Ko, Y. A. et al. Genetic-variation-driven gene-expression changes highlight genes with important functions for kidney disease. Am. J. Hum. Genet. 100, 940–953 (2017).
Park, J. et al. Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease. Science 360, 758–763 (2018).
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics 26, 1205–1210 (2010).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Shriner, D. & Rotimi, C. N. Whole-genome-sequence-based haplotypes reveal single origin of the sickle allele during the holocene wet phase. Am. J. Hum. Genet. 102, 547–556 (2018).
Kramer, H. J. et al. African ancestry-specific alleles and kidney disease risk in hispanics/latinos. J. Am. Soc. Nephrol. 28, 915–922 (2017).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet. 50, 390–400 (2018).
Suhre, K. et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nat. Commun. 8, 14357 (2017).
Gorski, M. et al. 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function. Sci. Rep. 7, 45040 (2017).
Alaini, A. et al. Establishing the presence or absence of chronic kidney disease: uses and limitations of formulas estimating the glomerular filtration rate. World J. Method 7, 73–92 (2017).
Warnock, D. G. Estimated glomerular filtration rate: fit for what purpose? Nephron 134, 43–49 (2016).
Levey, A. S., Inker, L. A. & Coresh, J. GFR estimation: from physiology to public health. Am. J. Kidney Dis. 63, 820–834 (2014).
Xue, C. et al. Effects of endothelial nitric oxide synthase gene on end stage renal disease progression in autosomal dominant polycystic kidney disease. Nephrology 19, 630–637 (2014).
Vivante, A. et al. A dominant mutation in nuclear receptor interacting protein 1 causes urinary tract malformations via dysregulation of retinoic acid signaling. J. Am. Soc. Nephrol. 28, 2364–2376 (2017).
Huffman, J. E. et al. Modulation of genetic associations with serum urate levels by body-mass-index in humans. PLoS One 10, e0119752 (2015).
Nagy, R. et al. Exploration of haplotype research consortium imputation for genome-wide association studies in 20,032 Generation Scotland participants. Genome Med. 9, 23 (2017).
Kottgen, A. et al. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat. Genet. 45, 145–154 (2013).
Teumer, A. et al. Genome-wide association studies identify genetic loci associated with albuminuria in diabetes. Diabetes 65, 803–817 (2016).
Gharavi, A. G. et al. Genome-wide association study identifies susceptibility loci for IgA nephropathy. Nat. Genet. 43, 321–327 (2011).
Tian, C. et al. Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections. Nat. Commun. 8, 599 (2017).
Kurzawski, M., Malinowski, D., Dziewanowski, K. & Drozdzik, M. Impact of PPARA and POR polymorphisms on tacrolimus pharmacokinetics and new-onset diabetes in kidney transplant recipients. Pharm. Genom. 24, 397–400 (2014).
Astle, W. J. et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell 167, 1415–1429.e19 (2016).
Ju, H. Y. et al. Could mean platelet volume be a promising biomarker of progression of chronic kidney disease? Platelets 26, 143–147 (2015).
Ucar, H. et al. Mean platelet volume is independently associated with renal dysfunction in stable coronary artery disease. Platelets 25, 274–278 (2014).
Verdoia, M. et al. Impact of renal function on mean platelet volume and its relationship with coronary artery disease: a single-centre cohort study. Thromb. Res. 141, 139–144 (2016).
Gremmel, T. et al. Chronic kidney disease is associated with increased platelet activation and poor response to antiplatelet therapy. Nephrol. Dial. Transpl. 28, 2116–2122 (2013).
Castro, P., Azevedo, E., Rocha, I., Sorond, F. & Serrador, J. M. Chronic kidney disease and poor outcomes in ischemic stroke: is impaired cerebral autoregulation the missing link? BMC Neurol. 18, 21 (2018).
Zorrilla-Vaca, A. et al. Acute kidney injury following acute ischemic stroke and intracerebral hemorrhage: a meta-analysis of prevalence rate and mortality risk. Cereb. Dis. 45, 1–9 (2018).
You, S. et al. Prognostic significance of blood urea nitrogen in acute ischemic stroke. Circ. J. 82, 572–578 (2018).
Malik, R. et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat. Genet. 50, 524–537 (2018).
Traylor, M. et al. Genetic variation at 16q24.2 is associated with small vessel stroke. Ann. Neurol. 81, 383–394 (2017).
NINDS Stroke Genetics Network (SiGN) & International Stroke Genetics Consortium (ISGC). Loci associated with ischaemic stroke and its subtypes (SiGN): a genome-wide association study. Lancet Neurol. 15, 174–184 (2016).
Akinyemi, R. et al. APOL1, CDKN2A/CDKN2B, and HDAC9 polymorphisms and small vessel ischemic stroke. Acta Neurol. Scand. 137, 133–141 (2018).
Chen, T. K. et al. APOL1 risk variants, incident proteinuria, and subsequent eGFR decline in blacks with hypertension-attributed CKD. Clin. J. Am. Soc. Nephrol. 12, 1771–1777 (2017).
Grams, M. E. et al. Race, APOL1 risk, and eGFR decline in the general population. J. Am. Soc. Nephrol. 27, 2842–2850 (2016).
Tin, A. et al. Patterns of kidney function decline associated with APOL1 genotypes: results from AASK. Clin. J. Am. Soc. Nephrol. 11, 1353–1359 (2016).
Brown, B. C., Ye, C. J., Price, A. L. & Zaitlen, N. Transethnic genetic-correlation estimates from summary statistics. Am. J. Hum. Genet. 99, 76–88 (2016).
Kottgen, A. et al. New loci associated with kidney function and chronic kidney disease. Nat. Genet. 42, 376–384 (2010).
Webber, B. J. & Witkop, C. T. Screening for sickle-cell trait at accession to the United States military. Mil. Med. 179, 1184–1189 (2014).
Brodine, C. E. & Uddin, D. E. Medical aspects of sickle hemoglobin in military personnel. J. Natl Med. Assoc. 69, 29–32 (1977).
Department of the Army. Army Regulation 40-501: standards of medical fitness. (ed. Department of the Army) (Department of the Army, Washington, DC, 2011).
O’Connor, F. G. et al. ACSM and CHAMP summit on sickle cell trait: mitigating risks for warfighters and athletes. Med. Sci. Sports Exerc. 44, 2045–2056 (2012).
James, C. M. Sickle cell trait and military service. J. R. Nav. Med. Serv. 76, 9–13 (1990).
Nelson, D. A. et al. Sickle cell trait, rhabdomyolysis, and mortality among U.S. Army Soldiers. N. Engl. J. Med. 375, 435–442 (2016).
Zhao, W. et al. Identification of new susceptibility loci for type 2 diabetes and shared etiological pathways with coronary heart disease. Nat. Genet. 49, 1450–1457 (2017).
Scott, R. A. et al. An expanded genome-wide association study of Type 2 diabetes in Europeans. Diabetes 66, 2888–2902 (2017).
Wood, A. R. et al. A genome-wide association study of IVGTT-based measures of first phase insulin secretion refines the underlying physiology of Type 2 diabetes variants. Diabetes 66, 2296–2309 (2017).
van Zuydam, N. R. et al. A genome-wide association study of diabetic kidney disease in subjects with Type 2 diabetes. Diabetes 67, 1414–1427 (2018).
Mooyaart, A. L. et al. Genetic associations in diabetic nephropathy: a meta-analysis. Diabetologia 54, 544–553 (2011).
Ma, R. C. & Cooper, M. E. Genetics of diabetic kidney disease-from the worst of nightmares to the light of dawn? J. Am. Soc. Nephrol. 28, 389–393 (2017).
Kottgen, A. et al. Multiple loci associated with indices of renal function and chronic kidney disease. Nat. Genet. 41, 712–717 (2009).
Qiu, C. Human Kidney eQTL Atlas. (2018).
Braun, D. A. et al. Mutations in KEOPS-complex genes cause nephrotic syndrome with primary microcephaly. Nat. Genet. 49, 1529–1538 (2017).
Sofou, K. et al. Whole exome sequencing reveals mutations in NARS2 and PARS2, encoding the mitochondrial asparaginyl-tRNA synthetase and prolyl-tRNA synthetase, in patients with Alpers syndrome. Mol. Genet. Genom. Med. 3, 59–68 (2015).
Gillies, C. E. et al. An eQTL landscape of kidney tissue in human nephrotic syndrome. Am. J. Hum. Genet. 103, 232–244 (2018).
Qiu, C. et al. Renal compartment-specific genetic variation analyses identify new pathways in chronic kidney disease. Nat. Med. 24, 1721–1731 (2018).
Moreira, D. M. et al. Association of estimated glomerular filtration rate with 24-h urinalysis and stone composition. Urolithiasis 44, 319–325 (2016).
Argade, S. et al. An evaluation of Tamm–Horsfall protein glycans in kidney stone formers using novel techniques. Urolithiasis 43, 303–312 (2015).
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Abraham, G. & Inouye, M. Fast principal component analysis of large-scale genome-wide data. PLoS One 9, e93766 (2014).
Levey, A. S. et al. A new equation to estimate glomerular filtration rate. Ann. Intern. Med. 150, 604–612 (2009).
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906–913 (2007).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Wuttke, M. et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat. Genet. 51, 957–972 (2019).
Schwartz, G. J. et al. Improved equations estimating GFR in children with chronic kidney disease using an immunonephelometric determination of cystatin C. Kidney Int. 82, 445–453 (2012).
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).


Acknowledgements

This work was supported by VA grant I01BX003360 (to A.M.H.), for the contributions of A.G., A.M.H. and D.R.V.E. This research is based on data from the Million Veteran Program, Office of Research and Development, Veterans Health Administration. Acknowledgment of VA Million Veteran Program leadership and staff contributions can be found in Supplementary Note 2. This work was supported using resources and facilities of the VA Informatics and Computing Infrastructure (VINCI), VA HSR RES 13-457. We would also like to acknowledge the grants Genetics of Cardiometabolic Diseases in the VA Population (BX-003362-01); Veterans Affairs Office of Research and Development; PIs: PS Tsao and K-M Chang and U.S. Department of Veterans Affairs IK2-CX001870 (Damrauer PI); U.S. Department of Veterans Affairs I01-BX002641 (PI Tsao and Chang). This project was supported in part by the VA Clinical Science Research and Development investigator-initiated grant CX000570-06 (Roumie). A.M.H. was further supported by 1I01CX000982. This publication does not represent the views of the Department of Veterans Affairs or the United States Government. The authors are grateful to members of the Chronic Kidney Disease Genetics (CKDGen) Consortium for providing summary data for replication. A full list of CKDGen consortium membership can be found in Supplementary Note 3. D.R.V.E. and T.L.E. received support from NIH/NHLBI (HL121429). J.N.H. was supported by the Vanderbilt Molecular and Genetic Epidemiology of Cancer (MAGEC) training program, funded by T32CA160056 (PI: X.-O. Shu). The work of J.N.H. and A.G. is supported by K12HD043483 (PI: K.E. Hartmann). J.M.K. is supported by the Vanderbilt Genomic Medicine Training Program. The Vanderbilt Genomic Medicine Training Program is supported by an institutional training grant (T32HG008341) from the National Human Genome Research Institute of the National Institutes of Health. K.S. was funded by R01DK076077, R01DK105821, and DP3DK108220. The work of A.K. was supported by KO 3598/3-1 and CRCs 1140 and 992 of the German Research Foundation. The work of M.G. was supported by DFG BO-3815/4-1. The work of M.W. was supported by DFG CRC 1140 and the Else Kroener Fresenius Forschungskolleg NAKSYS, and Y.L. was supported by DFG KO 3598/4-1. C.P.C. was supported by K23AR064768 (NIAMS) and the Rheumatology Research Foundation. C.R.-C. was supported by NIH/NIDDK K01DK109019. A dataset used for the analyses described was obtained from Vanderbilt University Medical Center’s BioVU, which is supported by institutional funding, the 1S10RR025141-01 instrumentation award, and by the CTSA grant UL1TR000445 from NCATS/NIH. Additional funding was provided by the NIH through grants P50GM115305 and U19HL065962. The authors wish to acknowledge the technical support of the VANTAGE and VANGARD core facilities, supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA068485) and Vanderbilt Vision Center (P30 EY08126).

Author information

These authors contributed equally: Jacklyn N. Hellwege, Digna R. Velez Edwards, Ayush Giri.
These authors jointly supervised this work: Katalin Susztak, Todd L. Edwards, Adriana M. Hung.

Authors and Affiliations

Biomedical Laboratory Research and Development, Tennessee Valley Healthcare System (626)/Vanderbilt University, Nashville, TN, USA
Jacklyn N. Hellwege, Digna R. Velez Edwards, Ayush Giri, Eric S. Torstenson, Jacob M. Keaton, Cecilia P. Chung, Todd L. Edwards & Adriana M. Hung
Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Jacklyn N. Hellwege
Department of Obstetrics & Gynecology, Vanderbilt Genetics Institute, Vanderbilt Epidemiology Center, Vanderbilt University Medical Center, Nashville, TN, USA
Digna R. Velez Edwards & Ayush Giri
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
Digna R. Velez Edwards
Department of Medicine, Renal Electrolyte and Hypertension Division, University of Pennsylvania, Philadelphia, PA, USA
Chengxiang Qiu, Jihwan Park & Katalin Susztak
Division of Epidemiology, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Eric S. Torstenson, Jacob M. Keaton & Todd L. Edwards
Division of Nephrology and Hypertension, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
O. D. Wilson, Cassianne Robinson-Cohen, Elvis A. Akwo & Adriana M. Hung
Divisions of Rheumatology and Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Cecilia P. Chung
Veteran Affairs Administration Tennessee Valley VA Health Care System Geriatric Research Education Clinical Center (GRECC), Nashville, TN, USA
Christianne L. Roumie
Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Christianne L. Roumie & Edward Siew
VA Boston Health Care System, Boston, MA, USA
Derek Klarin, Man Li & Christopher J. O’Donnell
Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Derek Klarin
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Derek Klarin & J. Michael Gaziano
Department of Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Derek Klarin & J. Michael Gaziano
Department of Surgery, Corporal Michael Crescenz VA Medical Center, Philadelphia, PA, USA
Scott M. Damrauer
Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Scott M. Damrauer
VA Salt Lake City Health Care System, Salt Lake City, UT, USA
Scott L. DuVall
University of Utah School of Medicine, Salt Lake City, UT, USA
Scott L. DuVall
Institute of Genetic Epidemiology, Department of Biometry, Epidemiology and Medical Bioinformatics, Faculty of Medicine and Medical Centre—University of Freiburg, Freiburg, Germany
Matthias Wuttke, Yong Li & Anna Köttgen
Department of Genetic Epidemiology, Institute of Epidemiology and Preventive Medicine, University of Regensburg, Regensburg, Germany
Mathias Gorski
Department of Nephrology, University Hospital Regensburg, Regensburg, Germany
Mathias Gorski
Division of Nephrology, Department of Internal Medicine, University of Utah School of Medicine, Salt Lake City, UT, USA
Man Li
Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
J. Michael Gaziano
Atlanta VA Medical Center, Atlanta, GA, USA
Peter W. F. Wilson
Emory Clinical Cardiovascular Research Institute, Atlanta, GA, USA
Peter W. F. Wilson
VA Palo Alto Health Care System, Palo Alto, CA, USA
Philip S. Tsao
Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
Philip S. Tsao
Section of Cardiology and Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
Christopher J. O’Donnell
Nephrology Section, Memphis VA Medical Center, Memphis, TN, USA
Csaba P. Kovesdy
Division of Nephrology, University of Tennessee Health Science Center, Memphis, TN, USA
Csaba P. Kovesdy
Institute for Biomedicine, Eurac Research, Bolzano, Italy
Cristian Pattaro
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Anna Köttgen

Authors

Jacklyn N. Hellwege
Digna R. Velez Edwards
Ayush Giri
Chengxiang Qiu
Jihwan Park
Eric S. Torstenson
Jacob M. Keaton
O. D. Wilson
Cassianne Robinson-Cohen
Cecilia P. Chung
Christianne L. Roumie
Derek Klarin
Scott M. Damrauer
Scott L. DuVall
Edward Siew
Elvis A. Akwo
Matthias Wuttke
Mathias Gorski
Man Li
Yong Li
J. Michael Gaziano
Peter W. F. Wilson
Philip S. Tsao
Christopher J. O’Donnell
Csaba P. Kovesdy
Cristian Pattaro
Anna Köttgen
Katalin Susztak
Todd L. Edwards
Adriana M. Hung

Contributions

Writing of the manuscript: J.N.H., D.R.V.E., A.G., T.L.E., A.M.H. Critical revision of the manuscript: J.N.H., D.R.V.E., A.G., C.P.C., D.K., C.P., A.K., K.S., T.L.E., A.M.H. Discovery analysis contributor: J.N.H., D.R.V.E., A.G., E.S.T., J.M.K., O.D.W., C.P.C., C.R.-C., C.L.R., E.S., E.A.A., J.M.G., C.P.K., P.W.F.W., C.J.O., T.L.E., A.M.H. Replication study contributor: M.G., M.W., M.L., Y.L., A.K., C.P. Human kidney and mouse model systems: C.Q., J.P., K.S. PheWAS: A.G., S.M.D., D.K., S.L.D., P.S.T.

Corresponding authors

Correspondence to Todd L. Edwards or Adriana M. Hung.

Ethics declarations

Competing interests

The authors declare no competing interests. This publication does not represent the views of the Department of Veterans Affairs or the United States Government.

Additional information

Peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Description of Additional Supplementary Files

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Hellwege, J.N., Velez Edwards, D.R., Giri, A. et al. Mapping eGFR loci to the renal transcriptome and phenome in the VA Million Veteran Program. Nat Commun 10, 3842 (2019). https://doi.org/10.1038/s41467-019-11704-w


Received: 15 January 2019
Accepted: 23 July 2019
Published: 26 August 2019
DOI: https://doi.org/10.1038/s41467-019-11704-w
