Polygenic analysis of the effect of common and low-frequency genetic variants on serum uric acid levels in Korean individuals

Cho, Sung Kweon; Kim, Beomsu; Myung, Woojae; Chang, Yoosoo; Ryu, Seungho; Kim, Han-Na; Kim, Hyung-Lae; Kuo, Po-Hsiu; Winkler, Cheryl A.; Won, Hong-Hee

doi:10.1038/s41598-020-66064-z

Download PDF

Article
Open access
Published: 08 June 2020

Polygenic analysis of the effect of common and low-frequency genetic variants on serum uric acid levels in Korean individuals

Scientific Reports volume 10, Article number: 9179 (2020) Cite this article

3600 Accesses
12 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Increased serum uric acid (SUA) levels cause gout and are associated with multiple diseases, including chronic kidney disease. Previous genome-wide association studies (GWAS) have identified more than 180 loci that contribute to SUA levels. Here, we investigated genetic determinants of SUA level in the Korean population. We conducted a GWAS for SUA in 6,881 Korean individuals, calculated polygenic risk scores (PRSs) for common variants, and validated the association of low-frequency variants and PRS with SUA levels in 3,194 individuals. We identified two low-frequency and six common independent variants associated with SUA. Despite the overall similar effect sizes of variants in Korean and European populations, the proportion of variance for SUA levels explained by the variants was greater in the Korean population. A rare, nonsense variant SLC22A12 p.W258X showed the most significant association with reduced SUA levels, and PRSs of common variants associated with SUA levels were significant in multiple Korean cohorts. Interestingly, an East Asian-specific missense variant (rs671) in ALDH2 displayed a significant association on chromosome 12 with the SUA level. Further genetic epidemiological studies on SUA are needed in ethnically diverse cohorts to investigate rare or low-frequency variants and determine the influence of genetic and environmental factors on SUA.

Genome-wide meta-analysis revealed several genetic loci associated with serum uric acid levels in Korean population: an analysis of Korea Biobank data

Article 01 November 2021

Jin sung Park, Yunkyung Kim & Jihun Kang

Genetics of 35 blood and urine biomarkers in the UK Biobank

Article 18 January 2021

Nasa Sinnott-Armstrong, Yosuke Tanigawa, … Manuel A. Rivas

Multi-phenotype genome-wide association studies of the Norfolk Island isolate implicate pleiotropic loci involved in chronic kidney disease

Article Open access 30 September 2021

Ngan K. Tran, Rodney A. Lea, … Lyn R. Griffiths

Introduction

Uric acid is the final product of purine metabolism in humans¹. Uric acid is produced primarily in the liver, and 70% of daily uric acid excretion occurs via the kidney¹. The uric acid transport system in the kidney plays an important role in its excretion², since only 10% of the initially filtered uric acid is excreted through urine³. The serum uric acid (SUA) level is determined by the balance of renal clearance, extrarenal clearance in the gut, and purine metabolism in the liver. The absence of the enzyme uricase and the presence of the renal uptake system contribute to higher SUA levels in humans, compared to that in other mammals^1,4. SUA levels in the general population follow a normal distribution². The physiological normal range of uric acid is 2.5-7.5 and 2.0-6.5 mg/dL for men and women, respectively⁵. The female hormones that influence the renal uptake of uric acid^6,7 and the effects of genetic variations in SLC2A9 and ABCG2^8,9 are reported as sex-related factors.

Hyperuricemia is the result of overactive hepatic metabolism and high cell turnover or renal under-excretion/extra-renal under-excretion or a combination of both¹⁰. Generally, renal and gut under-excretion account for two-thirds¹¹ and one-third¹² of hyperuricemia caused by under-excretion. In addition to environmental factors, genetic factors identified by previous genome-wide association studies (GWASs) have been implicated in the critical process of uric acid excretion and its role in hyperuricemia^{13,14,15,16,17,18}. SLC2A9 and ABCG2 were found to be the most prominent loci in a GWAS of more than 143,160 participants of European ancestry⁹ and a trans-ancestry GWAS of 457,690 individuals¹⁹.

A rare, loss-of-function variant (p.W258X) of SLC22A12 has been reported to be a founder mutation of familial renal hypouricemia (RHUC1, OMIM #220150) in the Japanese population²⁰. An earlier GWAS reported an association between common variants of the SLC22A12 locus and SUA levels¹⁵. The loss-of-function variant (p.W258X) could not be identified in the Asian Genetic Epidemiology Network (AGEN) GWAS owing to its rare occurrence in East Asian populations²¹. Recently, rare functional variants (p.R325W, p.R405C, and p.T467M) of the SLC22A12 gene have been detected in European and African-American populations, highlighting the importance of population-specific rare and common variants²².

Global research in GWAS suffers from a Eurocentric bias and, as a consequence, disproportionately underrepresents non-European populations^23,24. Although trans-ancestry GWASs have identified more than 180 loci associated with SUA^25,26, the current challenges include unearthing explanations for the remaining missing heritability of SUA by identifying additional genetic variants in diverse populations and a systematic evaluation of differential effects of common and rare variants. Recently, it has been shown that common and low-frequency variants can accurately be imputed (R² ≥ 0.8) using a large imputation panel of the Haplotype Reference Consortium (HRC) consisting of 64,976 haplotypes²⁷. Here, we performed a genome-wide association analysis of SUA levels in the Korean population and evaluated the genetic effects on SUA levels using imputed single-nucleotide polymorphism (SNP) chip data. We also calculated polygenic risk scores (PRS) to evaluate the variability of SUA in multiple Korean cohorts.

Results

Characteristics of the participants

In total, 6,881 individuals from urban and rural cohorts were included in the discovery phase, and a total of 3,194 individuals from the Ansan-Ansung and Kangbuk Samsung Hospital (KBSMC) cohorts were used for model validation (Table 1). The mean characteristics of the participants were slightly or substantially different between the cohorts. Overall, the urban and Ansan-Ansung cohorts included more males and more participants with hypertension, hyperlipidaemia, and diabetes than the rural cohort, which were included as covariates in the association analysis. Our discovery (n = 6,881) and replication (n = 3,194) sets consisted of individuals with preserved renal function (eGFR > 60 ml/min); however, we did not exclude participants with hypertension or diabetes.

Table 1 Clinical characteristics of the study cohort.

Full size table

Meta-analysis of SUA levels

We analysed a total of 6,129,701 SNPs in the discovery phase to identify the association with SUA levels in individuals using linear regression and discovered eight independent SNPs (two low-frequency and six common) that reached the threshold of statistical significance (Table 2). In the discovery phase, the inflation factor for the meta-analysis of SUA was 0.998 after performing a genomic control (1.02 without genomic control) (Supplementary Fig. S1). A Manhattan plot of the meta-analysis on SUA levels is shown in Fig. 1. A total of 1,149 SNPs that reached a level of genome-wide significance (P-value <5.0 × 10⁻⁸) for association with SUA levels are listed in Supplementary Table S1, and the 10 independent nonsynonymous SNPs that passed Bonferroni’s correction for 11,600 nonsynonymous SNPs (P-value <4.31 × 10⁻⁶) are listed in Supplementary Table S3. The regional plots for genome-wide significant association are shown in Supplementary Fig. S3. Conditional analyses identified multiple variants, including rs184521656, a low-frequency (minor allele frequency (MAF) = 0.01) intronic variant in FRMD8, that reached a level of genome-wide significance for association with SUA levels after conditioning with pre-selected lead SNPs in the flanking region (Supplementary Fig. S4a). Other regions did not show multiple independent signals at the level of genome-wide significance (Supplementary Fig. S4). The results of the main meta-analysis were similar to the simple model that used only sex, age, and the first 10 principal components of genetic variants as covariates, except for the chromosome 17 locus near the BCAS3 gene, which is a previously reported locus (Supplementary Fig. S5)⁹.

Table 2 Lead variants associated with SUA from meta-analysis of GWASs.

Full size table

Of note, a rare nonsense variant in the SLC22A12 locus showed the strongest association with SUA levels (rs121907892, P-value = 7.4 × 10⁻⁵⁴; β = −1.15 mg/dl; standard error (SE) = 0.07 mg/dl) in the discovery phase (Supplementary Fig. S2). The locus was monomorphic in most populations, but the variant was found to be present at a very low frequency in only East Asian populations. This variant has been reported in a previous exome-wide association study in the Japanese population²⁸. It is also present in very low linkage disequilibrium with most of the other neighbouring variants. Another low-frequency missense variant in SLC2A9, previously identified in GWAS studies^9,29, also showed significant associations with SUA levels (rs16890979, P-value = 5.86 × 10⁻⁷; β = −0.47 mg/dl; SE = 0.09 mg/dl) in the discovery phase. This variant is a common polymorphism observed in most tested populations. Other significant common variants were found to be located near the ABCG2, SLC2A9, NRXN2, NAA25, SLC17A3-SLC17A2, and BCAS3 genes (Supplementary Table S5). Although most of these loci were previously reported in GWASs, the lead SNPs identified in the present study are different from those reported in European populations (Supplementary Fig. S3)⁹.

Sex-stratified meta-analysis of SUA levels

In addition to the lead variants in ABCG2 (rs2231142, male; P-value = 5.9 × 10⁻¹⁴; β = 0.27 mg/dl; SE = 0.04 mg/dl, female; P-value = 4.3 × 10⁻¹⁸; β = 0.19 mg/dl; SE = 0.02 mg/dl) and SLC2A9 (rs4529048, male; P-value = 6.2 × 10⁻⁰⁶; β = 0.15 mg/dl; SE = 0.03 mg/dl, female; P-value = 2.2 × 10⁻³¹; β = 0.23 mg/dl; SE = 0.02 mg/dl) that were previously known to show sex-specific patterns^8,9, our sex-stratified analysis identified intronic variants in CDH13 on chromosome 16 that were significant in the analysis of female participants (rs8063966, P-value = 1.6 × 10⁻⁰⁸; β = −0.18 mg/dl; SE = 0.03 mg/dl) (Supplementary Fig. S7).

Alcohol intake-adjusted associations in the region on chromosome 12

We performed meta-analysis of SUA levels using the alcohol intake-adjusted association results to check for the influence of the identified loci on uric acid levels via alcohol-related pathways. The variants in the chromosome 12 region did not show significant associations after adjusting alcohol intake, whereas other loci remained significant after this adjustment (Supplementary Fig. S6a,b). In particular, the chromosome 12 region was significant in the genome-wide meta-analysis of alcohol intake (Supplementary Fig. S6c). A common, missense variant in ALDH2 in this region, known to be associated with alcohol metabolism and drinking behaviour^30,31, showed the strongest association with alcohol intake (rs671, P-value = 9.4 × 10⁻¹⁰¹; odds ratio = 0.049; 95% confidence interval = 0.037 to 0.065) and was in high linkage disequilibrium with rs116873087, the lead SNP in NAA25 (r² = 0.985). Variants in the ATXN2 and CUX2 genes associated with SUA levels also showed association with alcohol intake (Supplementary Fig. S6d).

Comparison of significant variants identified in our study and the European study

Allele frequencies, haplotype structure, and effect sizes of the SNPs identified in the Korean cohorts may be different from those reported in European populations, which may lead to differences in the proportion of variance explained by the variants identified in the Korean and European populations. We compared the effect sizes and the proportion of variance for phenotype explained by a given SNP (PVE) for the significant variant sets from our study and from the Global Urate Genetics Consortium (GUGC) for the European population. The effect sizes of the lead SNPs in our study were approximately linear to those in the GUGC study and showed the same direction of effects as the GUGC study. Of the 27 significant lead variants from the GUGC study, 22 showed significant associations (P-value <0.05) in our study and the same direction of effects between the two studies.

Spearman’s correlation coefficient of effect sizes for the GUGC GWAS hit SNPs was 0.743, and it was calculated to be 0.878 for the lead SNPs in our study (Fig. 2a,c). The Cohen’s kappa (κ) coefficients of effect sizes were also high (Supplementary Table S2). In contrast, the sum of the PVE values was different between the two cohorts. The sum of PVE for the lead SNPs in our study was larger in Koreans (0.135) than in Europeans (0.071), whereas that for GUGC GWAS hit SNPs was slightly larger in Europeans (0.055) than in Koreans (0.048) (Fig. 2b,d).

Validation of low-frequency variants and PRSs of common SNPs

The linear regression model that comprised PRSs of common variants associated with SUA levels, the two low-frequency variants, and covariates was fitted to the discovery phase cohorts and validated in the two independent Ansan-Ansung and KBSMC cohorts. The PRSs were normally distributed and the overall distribution correlated with SUA levels (Fig. 3).

The highest and lowest bins of the PRSs corresponded to the highest and lowest levels of SUA, respectively. The PRS was significant in both validation cohorts (P-value = 7.06 × 10⁻¹³, β = 0.24 mg/dL, SE = 0.03 mg/dL in the Ansan-Ansung cohort and P-value = 7.93 × 10⁻²¹, β = 0.21 mg/dL, SE = 0.02 mg/dL in the KBSMC cohort) (Table 3). Among the two low-frequency variants, the p.W258X nonsense mutation of SLC22A12 showed nominal association (P-value = 0.047). There was insufficient data to verify the significance of these low-frequency variants because of the small sample size in the cohort study.

Table 3 Estimated coefficients and standard errors obtained on modelling trends in SUA by linear regression.

Full size table

Discussion

The main findings of our study include the identification of genome-wide significant association of common and low-frequency variants with SUA levels in Koreans, and the validation of the PRS model of SUA in different cohort studies. The strength of our study is that it includes both urban and rural dwelling participants across Korea and represents an investigation on alcohol consumption. In addition, we have assessed our PRS model in independent cohorts and used available data of genotype-phenotype associations from a large European GWAS for comparison. Our study evaluates the contribution of both low-frequency and common variants to SUA levels by using an HRC-based imputation.

Our results demonstrate the benefits of using a large imputation panel such as the HRC reference panel for discovery, and characterisation of common and low-frequency variants contributing to SUA levels in the general population. We replicated several previously known genetic loci associated with SUA levels (ABCG2, SLC2A9, NRXN2, NAA25, SLC17A3-SLC17A2, SLC22A12, and BCAS3) reported for the European population and identified independent lead SNPs in two loci (SLC22A12 and NAA25) between the European GWAS and the present study. Comparison with results of the GUGC European study showed that effect sizes of common variants were comparable between East Asian and European populations, whereas allele frequencies and sum of PVE of variants were higher in the population in which the corresponding variants were identified. These results suggest that SUA levels may be affected by population-specific variants as well as shared variants at the same locus, and in particular by population-specific low-frequency variants.

The population-specific SLC22A12 p.W258X variant and the SLC2A9 p.W282I variant were accurately imputed (R² ≥ 0.8) using the HRC reference panel. We identified an association between a low-frequency, SLC22A12 loss-of-function variant (rs121907892) and SUA levels in the general population at a genome-wide significance level (P-value <5 × 10⁻⁸)^9,15,32. SLC22A12 p.W258X is a well-known variant for hypouricemia in Koreans³³. This variant has been earlier identified only by sequencing analysis^20,34 because of its rarer frequency and weaker linkage disequilibrium with other neighbouring common variants analysed in GWASs. Other common variants in this region were identified in the European population study (rs505802)¹⁵ and AGEN study (rs504915) including a combined total of ~110,347 Asian individuals²¹. The imputation in AGEN was performed on the basis of the HapMap Phase 2 reference panel. In a more recent analysis by Lee et al., who used 1000 Genomes phase 3 haplotypes as a reference panel for imputation, the founder mutation was still not identified³⁵. The observed significant association near the NAA25 gene on chromosome 12 was previously reported, although different genes (ALDH2, ATXN2, and CUX) were reported for this region. The lead SNP was in high linkage disequilibrium with rs671 in ALDH2, a common missense variant in East Asians, but mostly absent in other populations. Our results of alcohol intake-adjusted association analysis and comparison with GWAS for alcohol intake suggests that the association of this region with SUA levels is mediated by alcohol consumption or metabolism through ALDH2.

The sex-specific CDH13 variant (rs8063966) was only detected in females. This may be associated with lower levels of SUA in women since oestradiol and progesterone have been shown to be associated with CDH13 expression³⁶. Estrogen is known to exert a protective effect on SUA levels, with SUA starting to rise in the late-menopausal transition stage³⁷. Further research is needed to elucidate the mechanism underlying the interplay of CDH13 and SUA levels in women.

The missense variant SLC2A9 p.W282I (rs16890979) has also been previously reported in European and African populations²⁹. Renal hypouricemia is believed to arise owing to genetic defects in SLC22A12 and SLC2A9; this is based on the “rare disease, rare variant hypothesis”³⁸. In contrast, we tested a regression model of low-frequency variants and PRSs of common alleles associated with SUA levels. Our model showed their independent effects despite predicting a relatively small proportion of variance in SUA levels of the general population, reflecting the “combinational effects of common and rare variants”. Our linear regression model also showed that the SUA levels increased by 0.21–0.28 per 1 standard deviation (SD) increase in PRS and increased by 0.77–0.90 (rs121907892) and 0.44–0.45 (rs16890979) per risk allele of the low-frequent missense variant in the general population.

Our study has several limitations. First, since we enrolled a cohort of ethnically homogeneous Koreans, the generalisability of our results to other populations may be limited. For example, the rare SLC22A12 p.W258X variant has been found only in the East Asian population, whereas different missense SLC22A12 variants (c.1245-1253del and p.T467M) have been found in the Roma population with renal hypouricemia³⁹. Second, we were not able to analyse all the low-frequency variants due to the lack of sequencing data and population-matched reference panels for imputation. The power to detect additional rare and less frequent disease-causing variants is expected be improve with the availability of reference panels that include sequencing data of more diverse racial/ethnic (e.g., East Asians) samples. Third, we conducted the present analysis in the general population, and therefore, we could not assess the role of genetic variants in people with impaired renal functions. Our results may be different from those reported for gout patients or individuals with impaired renal function, who may exhibit distinct pathways for SUA regulation by other transporters such as ABCG2^10,40. Fourth, potentially confounding environmental factors were not considered in our analysis. A potential contribution of genetic interaction with other factors such as food and drug intake, and chronic disease conditions in the pathophysiology of SUA should be examined. The combination of all the significant genetic loci identified in this study explained approximately 9.55% of the variance in SUA levels of our study, suggesting the need for discovery of additional epigenetic and genetic factors in larger, more comprehensive cohorts.

In summary, we replicated the previously known common genetic variants associated with SUA in the Korean population and identified low-frequency variants that exerted substantial impact on reduced SUA levels. Further studies are needed to identify additional variants associated with SUA and to evaluate our model under different disease conditions (e.g., gout and chronic kidney disease).

Methods

Ethics, consent, and permissions

This study was approved by the institutional review board of Sungkyunkwan University (IRB# SKKU 2017-12-007) and Kangbuk Samsung Hospital (KBSMC 2013-01-245-008), and written informed consent was obtained from all participants. All experiments were performed in accordance with all applicable institutional and governmental regulations concerning the ethical use of human participants.

Study participants

The Korean Genome and Epidemiology Study (KoGES) is a national biobank for genomic and epidemiological studies⁴¹. The discovery-phase samples comprised two cohorts from KoGES, namely a nation-wide cohort from urban (3,585 individuals) and another from rural (3,296 individuals) areas. In total, 6,881 individuals with no missing covariates were included in the present study. Genotyping was conducted on the Affymetrix 6.0 and the Illumina Omni1 arrays for the urban and rural cohorts, respectively. We evaluated two low-frequency variants and PRS for validation in 1,167 individuals of a community-based KoGES cohort from Ansan and Ansung areas, in which SUA levels were measured, and in 2,027 individuals from the Health Screening and Examination cohort of the Kangbuk Samsung Hospital (KBSMC). Genotyping was conducted using the Affymetrix 5.0 array and Illumina HumanCore BeadChips for the Ansan-Ansung and KBSMC cohorts, respectively.

Quality control, imputation, and annotation

Sample-level and SNP-level quality controls were performed on the genotyped data using PLINK 1.9⁴². SNPs with MAF < 1%, call rate <98%, and deviation from Hardy-Weinberg equilibrium (P-value <1.0 × 10⁻⁶) were excluded. Samples were excluded based on criteria including call rate <95%, heterozygosity rate (samples with observed heterozygosity rate 3 SD away from the mean were removed), and principal components derived from the genome-wide genotype data (samples with the first and second principal components 3 SD away from the mean were removed). We excluded one of related pairs of individuals with second-degree or closer relationships using KING 2.1⁴³. After quality control, the genotype data was phased using Eagle 2.3⁴⁴ and imputed on the reference panel of the Haplotype Reference Consortium (HRC r1.1 2016) using Minimac3⁴⁵. SNPs with low imputation quality (R² < 0.8) and MAF < 0.5% were excluded from the main analysis. We annotated gene symbols closest to each SNP using ANNOVAR⁴⁶; the two nearest genes were annotated for intergenic SNPs.

Association analysis

The association of SNPs with SUA was tested using a linear regression model adjusted for sex, age, body mass index (BMI), estimated glomerular filtration rate (eGFR), diabetes, hypertension, hyperlipidaemia, systolic blood pressure, total cholesterol, high-density lipoprotein (HDL) cholesterol, and triglycerides. Blood pressure medication was measured in the rural cohort and adjusted in linear regression. The equation from the Chronic Kidney Disease Epidemiology collaboration (CKD-EPI) was used for the estimation of eGFR⁴⁷. We performed an inverse variance-weighted fixed-effects meta-analysis of SUA levels to combine the summary statistics of the association test results from the two discovery cohorts using METAL⁴⁸. The meta-analysis results were double-genomic-controlled for population structure. In the meta-analysis, SNPs that reached a level of genome-wide significance (P-value <5.0 × 10⁻⁸) for association with the SUA levels were considered significant. For nonsynonymous variants, Bonferroni’s correction threshold (P-value <4.31 × 10⁻⁶, 0.05 for a total of 11,600 nonsynonymous tested variants) was used.

Conditional analysis

We performed conditional association analysis for each significant locus using GCTA-COJO⁴⁹ to identify statistically independent SNPs, which have been listed in Table 2. The peak SNPs in each significant locus were selected as the primary associated lead SNP and the association analysis conditioning on the primary lead SNP was conducted for variants within the surrounding 2-megabase pairs region. If there were significant SNPs with a conditional P-value that reached a level of significance described in the association analysis, the peak SNP of conditional analysis was selected as the secondary associated lead SNP. The association analysis conditioning on the primary and secondary lead SNPs was conducted for variants in the surrounding region. This procedure was repeated until there were no more SNPs that reached the significant level in each locus.

Alcohol intake

We observed a high linkage disequilibrium between a lead SNP on chromosome 12 and a known functional missense variant (rs671) in the aldehyde dehydrogenase 2 (ALDH2) gene. Therefore, alcohol intake-adjusted association of SNPs with SUA was tested using a linear regression model adjusted for sex, age, the first 10 principal components derived from the genome-wide genotype data, and dummy variables for the alcohol intake groups. Several alcohol-related variables were investigated in the urban and rural cohorts, including the prevalence of a drinking habit, average intake per drink by type of alcoholic beverage, and average number of intakes per year. We calculated the daily alcohol intake based on these variables using the following equation:

$$Alcohol\,intake(g/day)=\sum \{(Alcohol\,content\,ratio\,)\times (Average\,intake\,per\,drink)\times (Average\,number\,of\,intake\,per\,year)\div365\}$$

(1)

The study participants in each cohort were then divided into four groups based on the calculated daily alcohol intake: non-drinker, light drinker (<20 g/day), moderate-heavy drinker (≥ 20 g/day and <50 g/day), and heavy drinker (≥50 g/day). To compare the GWAS results of SUA, we also performed the association analysis of alcohol intake (non-drinkers vs. moderate-heavy or heavy drinkers) using a logistic regression model adjusted for sex, age, and the first 10 principal components.

Comparison with results from the European cohort study

Summary statistics results from the GUGC were used for comparison of significant associations identified in our study with those reported for European populations. We extracted lead SNPs from the present study and GWAS hit SNPs from GUGC. Three of the GUGC GWAS hit SNPs were not found in our datasets and nine of the lead SNPs in our study were absent from the GUGC data. We set the β coefficients of these SNPs to zero. To compare the effect sizes from our meta-analysis with those of GUGC summary statistics, we used Cohen’s kappa (κ) coefficient to determine the direction of effect size and Spearman’s correlation coefficient (ρ). In addition, we used proportion of variance in phenotype explained by a given SNP (PVE) for comparison. The GUGC summary statistics provides sample size, MAF, effect size (β coefficient), and SE of effect size for each SNP above the P-value <5.0 × 10⁻⁸ threshold⁹. We estimated PVE values using the equation below⁵⁰:

$$PVE=\frac{2{\hat{\beta }}^{2}MAF(1-MAF)}{2{\hat{\beta }}^{2}MAF(1-MAF)+{(se(\hat{\beta }))}^{2}2NMAF(1-MAF)}$$

(2)

we used values of the MAF of Europeans from the Genome Aggregation Database (gnomAD) because information on allele frequency was not included in the GUGC summary statistics⁵¹. Comparative measurements were calculated after excluding the variants that were missing in our meta-analysis result or from the GUGC summary statistics.

Polygenic risk scoring and model construction

We calculated PRS values as a weighted sum of the effects multiplied by the number of alleles based on the β estimates for SUA-raising alleles for 14 common (MAF ≥ 5%) SNPs that reached a level of genome-wide significance (P < 5.0 × 10⁻⁸) for association with SUA levels and were not in a high linkage disequilibrium (r² ≤ 0.2) with the other variants. The PRS of β estimates for SUA-raising alleles was standardised to zero mean and unit variance. For nonsynonymous variants, Bonferroni’s correction threshold (P < 4.31 × 10⁻⁶, 0.05 for a total of 11,600 nonsynonymous variants) was applied to correct for the multiple testing problem. Two low-frequency SNPs (MAF ~1%), rs121907892 and rs16890979, which passed the threshold were also included in the linear regression model-based analysis of SUA levels. We constructed a linear regression model on SUA levels that comprised these PRSs and two low-frequency nonsynonymous variants. The linear regression model was adjusted for the following covariates available in all cohorts for validation: sex, age, BMI, eGFR, diabetes, hypertension, hyperlipidaemia, systolic blood pressure, total cholesterol, HDL cholesterol, and triglycerides.

References

Wu, X. W., Muzny, D. M., Lee, C. C. & Caskey, C. T. Two independent mutational events in the loss of urate oxidase during hominoid evolution. J. Mol. Evol. 34, 78–84 (1992).
Article ADS CAS PubMed Google Scholar
Riches, P. L., Wright, A. F. & Ralston, S. H. Recent insights into the pathogenesis of hyperuricaemia and gout. Hum. Mol. Genet. 18, R177–184, https://doi.org/10.1093/hmg/ddp369 (2009).
Article CAS PubMed Google Scholar
Anzai, N., Kanai, Y. & Endou, H. New insights into renal transport of urate. Curr. Opin. Rheumatol. 19, 151–157, https://doi.org/10.1097/BOR.0b013e328032781a (2007).
Article CAS PubMed Google Scholar
Bobulescu, I. A. & Moe, O. W. Renal transport of uric acid: evolving concepts and uncertainties. Adv. Chronic Kidney D. 19, 358–371, https://doi.org/10.1053/j.ackd.2012.07.009 (2012).
Article Google Scholar
Cho, S. K., Chang, Y., Kim, I. & Ryu, S. U-Shaped Association Between Serum Uric Acid Level and Risk of Mortality: A Cohort Study. Arthritis Rheumatol. 70, 1122–1132, https://doi.org/10.1002/art.40472 (2018).
Article CAS PubMed Google Scholar
Rho, Y. H., Zhu, Y. & Choi, H. K. The epidemiology of uric acid and fructose. Semin. Nephrol. 31, 410–419, https://doi.org/10.1016/j.semnephrol.2011.08.004 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yahyaoui, R. et al. Effect of long-term administration of cross-sex hormone therapy on serum and urinary uric acid in transsexual persons. J. Clin. Endocrinol. Metab. 93, 2230–2233, https://doi.org/10.1210/jc.2007-2467 (2008).
Article CAS PubMed Google Scholar
Doring, A. et al. SLC2A9 influences uric acid concentrations with pronounced sex-specific effects. Nat. Genet. 40, 430–436, https://doi.org/10.1038/ng.107 (2008).
Article CAS PubMed Google Scholar
Kottgen, A. et al. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat. Genet. 45, 145–154, https://doi.org/10.1038/ng.2500 (2013).
Article CAS PubMed Google Scholar
Ichida, K. et al. Decreased extra-renal urate excretion is a common cause of hyperuricemia. Nat. Commun. 3, 764, https://doi.org/10.1038/ncomms1756 (2012).
Article ADS CAS PubMed Google Scholar
Matsuo, H. et al. ABCG2 dysfunction causes hyperuricemia due to both renal urate underexcretion and renal urate overload. Sci. Rep. 4, 3755, https://doi.org/10.1038/srep03755 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mandal, A. K. & Mount, D. B. The molecular physiology of uric acid homeostasis. Annu. Rev. Physiol. 77, 323–345, https://doi.org/10.1146/annurev-physiol-021113-170343 (2015).
Article CAS PubMed Google Scholar
Cho, S. K., Kim, S., Chung, J. Y. & Jee, S. H. Discovery of URAT1 SNPs and association between serum uric acid levels and URAT1. BMJ open. 5, e009360, https://doi.org/10.1136/bmjopen-2015-009360 (2015).
Article PubMed PubMed Central Google Scholar
Jang, W. C. et al. T6092C polymorphism of SLC22A12 gene is associated with serum uric acid concentrations in Korean male subjects. Clin. Chim. Acta 398, 140–144, https://doi.org/10.1016/j.cca.2008.09.008 (2008).
Article CAS PubMed Google Scholar
Kolz, M. et al. Meta-analysis of 28,141 individuals identifies common variants within five new loci that influence uric acid concentrations. PLoS Genet. 5, e1000504, https://doi.org/10.1371/journal.pgen.1000504 (2009).
Article CAS PubMed PubMed Central Google Scholar
Woodward, O. M. et al. Identification of a urate transporter, ABCG2, with a common functional polymorphism causing gout. Proc. Natl Acad. Sci. U S A 106, 10338–10342, https://doi.org/10.1073/pnas.0901249106 (2009).
Article ADS PubMed PubMed Central Google Scholar
Gabrikova, D., Bernasovska, J., Sokolova, J. & Stiburkova, B. High frequency of SLC22A12 variants causing renal hypouricemia 1 in the Czech and Slovak Roma population; simple and rapid detection method by allele-specific polymerase chain reaction. Urolithiasis 43, 441–445, https://doi.org/10.1007/s00240-015-0790-4 (2015).
Article CAS PubMed Google Scholar
Hurba, O. et al. Complex analysis of urate transporters SLC2A9, SLC22A12 and functional characterization of non-synonymous allelic variants of GLUT9 in the Czech population: no evidence of effect on hyperuricemia and gout. PLoS one 9, e107902 (2014).
Article ADS PubMed PubMed Central Google Scholar
Tin, A. et al. Target genes, variants, tissues and transcriptional pathways influencing human serum urate levels. Nat. Genet. 51, 1459–1474, https://doi.org/10.1038/s41588-019-0504-x (2019).
Article CAS PubMed PubMed Central Google Scholar
Enomoto, A. et al. Molecular identification of a renal urate anion exchanger that regulates blood urate levels. Nature 417, 447–452, https://doi.org/10.1038/nature742 (2002).
Article ADS CAS PubMed Google Scholar
Okada, Y. et al. Meta-analysis identifies multiple loci associated with kidney function-related traits in east Asian populations. Nat. Genet. 44, 904–909, https://doi.org/10.1038/ng.2352 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tin, A. et al. Large-scale whole-exome sequencing association studies identify rare functional variants influencing serum urate levels. Nat. Commun. 9, 4228, https://doi.org/10.1038/s41467-018-06620-4 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591, https://doi.org/10.1038/s41588-019-0379-x (2019).
Article CAS PubMed PubMed Central Google Scholar
Lambert, S. A., Abraham, G. & Inouye, M. Towards clinical utility of polygenic risk scores. Hum Mol Genet, https://doi.org/10.1093/hmg/ddz187 (2019).
Tin, A. et al. Target genes, variants, tissues and transcriptional pathways influencing human serum urate levels. Nat. Genet. 51, 1459–1474, https://doi.org/10.1038/s41588-019-0504-x (2019).
Article CAS PubMed PubMed Central Google Scholar
Nakatochi, M. et al. Genome-wide meta-analysis identifies multiple novel loci associated with serum uric acid levels in Japanese individuals. Commun. Biol. 2, 115, https://doi.org/10.1038/s42003-019-0339-0 (2019).
Article CAS PubMed PubMed Central Google Scholar
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283, https://doi.org/10.1038/ng.3643 (2016).
Article CAS PubMed PubMed Central Google Scholar
Yasukochi, Y. et al. Identification of CDC42BPG as a novel susceptibility locus for hyperuricemia in a Japanese population. Mol. Genet. Genomics 293, 371–379, https://doi.org/10.1007/s00438-017-1394-1 (2018).
Article CAS PubMed Google Scholar
Dehghan, A. et al. Association of three genetic loci with uric acid concentration and risk of gout: a genome-wide association study. Lancet 372, 1953–1961, https://doi.org/10.1016/S0140-6736(08)61343-4 (2008).
Article CAS PubMed PubMed Central Google Scholar
Muramatsu, T. et al. Alcohol and aldehyde dehydrogenase genotypes and drinking behavior of Chinese living in Shanghai. Hum. Genet. 96, 151–154, https://doi.org/10.1007/bf00207371 (1995).
Article CAS PubMed Google Scholar
Higuchi, S., Matsushita, S., Muramatsu, T., Murayama, M. & Hayashida, M. Alcohol and aldehyde dehydrogenase genotypes and drinking behavior in Japanese. Alcohol. Clin. Exp. Res. 20, 493–497, https://doi.org/10.1111/j.1530-0277.1996.tb01080.x (1996).
Article CAS PubMed Google Scholar
van der Harst, P. et al. Replication of the five novel loci for uric acid concentrations and potential mediating mechanisms. Hum. Mol. Genet. 19, 387–395, https://doi.org/10.1093/hmg/ddp489 (2010).
Article CAS PubMed Google Scholar
Cha, D. H. et al. Contribution of SLC22A12 on hypouricemia and its clinical significance for screening purposes. Sci. Rep. 9, 14360, https://doi.org/10.1038/s41598-019-50798-6 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Ichida, K. et al. Clinical and molecular analysis of patients with renal hypouricemia in Japan-influence of URAT1 gene on urinary urate excretion. J. Am. Soc. Nephrol. 15, 164–173, https://doi.org/10.1097/01.asn.0000105320.04395.d0 (2004).
Article PubMed Google Scholar
Lee, J. et al. Genome-wide association analysis identifies multiple loci associated with kidney disease-related traits in Korean populations. PLoS one 13, e0194044, https://doi.org/10.1371/journal.pone.0194044 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fava, C. et al. A Variant Upstream of the CDH13 Adiponectin Receptor Gene and Metabolic Syndrome in Swedes. Am. J. Cardiol. 108, 1432–1437, https://doi.org/10.1016/j.amjcard.2011.06.068 (2011).
Article CAS PubMed Google Scholar
Cho, S. K., Winkler, C. A., Lee, S. J., Chang, Y. & Ryu, S. The Prevalence of Hyperuricemia Sharply Increases from the Late Menopausal Transition Stage in Middle-Aged Women. Journal of clinical medicine 8, https://doi.org/10.3390/jcm8030296 (2019).
Takahashi, T. et al. Recurrent URAT1 gene mutations and prevalence of renal hypouricemia in Japanese. Pediatr. Nephrol. 20, 576–578, https://doi.org/10.1007/s00467-005-1830-z (2005).
Article PubMed Google Scholar
Stiburkova, B. et al. Prevalence of URAT1 allelic variants in the Roma population. Nucleosides, nucleotides nucleic acids 35, 529–535, https://doi.org/10.1080/15257770.2016.1168839 (2016).
Article CAS PubMed Google Scholar
Bhatnagar, V. et al. Analysis of ABCG2 and other urate transporters in uric acid homeostasis in chronic kidney disease: potential role of remote sensing and signaling. Clin. Kidney J. 9, 444–453, https://doi.org/10.1093/ckj/sfw010 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kim, Y., Han, B. G. & Ko, G. E. S. G. Cohort Profile: The Korean Genome and Epidemiology Study (KoGES) Consortium. Int. J. Epidemiol. 46, e20, https://doi.org/10.1093/ije/dyv316 (2017).
Article PubMed Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575, https://doi.org/10.1086/519795 (2007).
Article CAS PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873, https://doi.org/10.1093/bioinformatics/btq559 (2010).
Article CAS PubMed PubMed Central Google Scholar
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448, https://doi.org/10.1038/ng.3679 (2016).
Article CAS PubMed PubMed Central Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287, https://doi.org/10.1038/ng.3656 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164, https://doi.org/10.1093/nar/gkq603 (2010).
Article CAS PubMed PubMed Central Google Scholar
Levey, A. S. et al. A new equation to estimate glomerular filtration rate. Ann. Intern. Med. 150, 604–612, https://doi.org/10.7326/0003-4819-150-9-200905050-00006 (2009).
Article PubMed PubMed Central Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191, https://doi.org/10.1093/bioinformatics/btq340 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44(369–375), S361–363, https://doi.org/10.1038/ng.2213 (2012).
Article CAS Google Scholar
Shim, H. et al. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS one 10, e0120758, https://doi.org/10.1371/journal.pone.0120758 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291, https://doi.org/10.1038/nature19057 (2016).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study was provided with biological resources from National Biobank of Korea, Centers for Disease Control and Prevention, Republic of Korea. This work was supported by the National Research Foundation of Korea (NRF) grant, funded by the Korea government (MSIT) [No. 2019R1A2C4070496], the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education [NRF-2016R1A6A3A11933380], and the Korea Health technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea [Grant Number: HI17C2372]. H.-N.K. was supported by the National Research Foundation of Korea (NRF) grant, funded by the Korea government (MSIT) [NRF-2014R1A2A2A04006291]. This research was partially supported by the Intramural Research Program of the National Institutes of Health, National Cancer Institute, Center for Cancer Research and in part with federal funds from the National Cancer Institute, National Institutes of Health under contract [HHSN26120080001E]. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does the mention of trade names, commercial products, or organisations in this publication imply endorsement by the U.S. Government.

Author information

These authors contributed equally: Sung Kweon Cho and Beomsu Kim.

Authors and Affiliations

Samsung Advanced Institute for Health Sciences and Technology (SAIHST), Sungkyunkwan University, Samsung Medical Center, Seoul, Republic of Korea
Sung Kweon Cho, Beomsu Kim & Hong-Hee Won
Molecular Genetic Epidemiology Section, Basic Research Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
Sung Kweon Cho & Cheryl A. Winkler
Department of Neuropsychiatry, Seoul National University Bundang Hospital, Seongnam-si, Republic of Korea
Woojae Myung
Center for Cohort Studies, Total Healthcare Center, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Yoosoo Chang, Seungho Ryu & Han-Na Kim
Medical Research Institute, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
Han-Na Kim
Department of Biochemistry, Ewha Womans University, Seoul, Republic of Korea
Hyung-Lae Kim
Department of Public Health & Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
Po-Hsiu Kuo

Authors

Sung Kweon Cho
View author publications
You can also search for this author in PubMed Google Scholar
Beomsu Kim
View author publications
You can also search for this author in PubMed Google Scholar
Woojae Myung
View author publications
You can also search for this author in PubMed Google Scholar
Yoosoo Chang
View author publications
You can also search for this author in PubMed Google Scholar
Seungho Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Han-Na Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hyung-Lae Kim
View author publications
You can also search for this author in PubMed Google Scholar
Po-Hsiu Kuo
View author publications
You can also search for this author in PubMed Google Scholar
Cheryl A. Winkler
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Hee Won
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.K.C. and H.-H.W. conceptualised the project and designed the study. S.K.C. and B.K. drafted the manuscript. B.K. performed data analysis. Y.C., S.R., H.-N.K. and H.-L.K. were involved in patient recruitment and sample collection. S.K.C., B.K., H.-H.W., C.A.W., W.M. and P.-H.K. participated in result interpretation and manuscript revision. H.-H.W. and C.A.W. supervised the research and finalised the manuscript with the approval of all the authors.

Corresponding authors

Correspondence to Cheryl A. Winkler or Hong-Hee Won.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Tables and Figures.

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cho, S.K., Kim, B., Myung, W. et al. Polygenic analysis of the effect of common and low-frequency genetic variants on serum uric acid levels in Korean individuals. Sci Rep 10, 9179 (2020). https://doi.org/10.1038/s41598-020-66064-z

Download citation

Received: 06 October 2019
Accepted: 05 May 2020
Published: 08 June 2020
DOI: https://doi.org/10.1038/s41598-020-66064-z

This article is cited by

Linkage analysis using whole exome sequencing data implicates SLC17A1, SLC17A3, TATDN2 and TMEM131L in type 1 diabetes in Kuwaiti families
- Prashantha Hebbar
- Rasheeba Nizam
- Fahd Al-Mulla
Scientific Reports (2023)
Genetic assessment of hyperuricemia and gout in Asian, Native Hawaiian, and Pacific Islander subgroups of pregnant women: biospecimens repository cross-sectional study
- Ali Alghubayshi
- Alison Edelman
- Youssef Roman
BMC Rheumatology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Genome-wide meta-analysis revealed several genetic loci associated with serum uric acid levels in Korean population: an analysis of Korea Biobank data

Genetics of 35 blood and urine biomarkers in the UK Biobank

Multi-phenotype genome-wide association studies of the Norfolk Island isolate implicate pleiotropic loci involved in chronic kidney disease

Introduction

Results

Characteristics of the participants

Meta-analysis of SUA levels

Sex-stratified meta-analysis of SUA levels

Alcohol intake-adjusted associations in the region on chromosome 12

Comparison of significant variants identified in our study and the European study

Validation of low-frequency variants and PRSs of common SNPs

Discussion

Methods

Ethics, consent, and permissions

Study participants

Quality control, imputation, and annotation

Association analysis

Conditional analysis

Alcohol intake

Comparison with results from the European cohort study

Polygenic risk scoring and model construction

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Tables and Figures.

Supplementary Table S1.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Linkage analysis using whole exome sequencing data implicates SLC17A1, SLC17A3, TATDN2 and TMEM131L in type 1 diabetes in Kuwaiti families

Genetic assessment of hyperuricemia and gout in Asian, Native Hawaiian, and Pacific Islander subgroups of pregnant women: biospecimens repository cross-sectional study

Comments

Search

Quick links