Evaluation of Polygenic Determinants of Non-Alcoholic Fatty Liver Disease (NAFLD) By a Candidate Genes Resequencing Strategy

Di Costanzo, Alessia; Belardinilli, Francesca; Bailetti, Diego; Sponziello, Marialuisa; D’Erasmo, Laura; Polimeni, Licia; Baratta, Francesco; Pastori, Daniele; Ceci, Fabrizio; Montali, Anna; Girelli, Gabriella; De Masi, Bruna; Angeloni, Antonio; Giannini, Giuseppe; Del Ben, Maria; Angelico, Francesco; Arca, Marcello

doi:10.1038/s41598-018-21939-0

Download PDF

Article
Open access
Published: 27 February 2018

Evaluation of Polygenic Determinants of Non-Alcoholic Fatty Liver Disease (NAFLD) By a Candidate Genes Resequencing Strategy

Alessia Di Costanzo ORCID: orcid.org/0000-0001-8359-6599¹,
Francesca Belardinilli²,
Diego Bailetti³,
Marialuisa Sponziello¹,
Laura D’Erasmo ORCID: orcid.org/0000-0002-9174-4667¹,
Licia Polimeni¹,
Francesco Baratta^1,4,
Daniele Pastori ORCID: orcid.org/0000-0001-6357-5213^1,4,
Fabrizio Ceci ORCID: orcid.org/0000-0003-4051-4922⁵,
Anna Montali¹,
Gabriella Girelli⁶,
Bruna De Masi⁶,
Antonio Angeloni²,
Giuseppe Giannini²,
Maria Del Ben¹,
Francesco Angelico⁷ &
…
Marcello Arca¹

Scientific Reports volume 8, Article number: 3702 (2018) Cite this article

4746 Accesses
58 Citations
1 Altmetric
Metrics details

Subjects

Abstract

NAFLD is a polygenic condition but the individual and cumulative contribution of identified genes remains to be established. To get additional insight into the genetic architecture of NAFLD, GWAS-identified GCKR, PPP1R3B, NCAN, LYPLAL1 and TM6SF2 genes were resequenced by next generation sequencing in a cohort of 218 NAFLD subjects and 227 controls, where PNPLA3 rs738409 and MBOAT7 rs641738 genotypes were also obtained. A total of 168 sequence variants were detected and 47 were annotated as functional. When all functional variants within each gene were considered, only those in TM6SF2 accumulate in NAFLD subjects compared to controls (P = 0.04). Among individual variants, rs1260326 in GCKR and rs641738 in MBOAT7 (recessive), rs58542926 in TM6SF2 and rs738409 in PNPLA3 (dominant) emerged as associated to NAFLD, with PNPLA3 rs738409 being the strongest predictor (OR 3.12, 95% CI, 1.8-5.5, P < 0.001). A 4-SNPs weighted genetic risk score value >0.28 was associated with a 3-fold increased risk of NAFLD. Interestingly, rs61756425 in PPP1R3B and rs641738 in MBOAT7 genes were predictors of NAFLD severity. Overall, TM6SF2, GCKR, PNPLA3 and MBOAT7 were confirmed to be associated with NAFLD and a score based on these genes was highly predictive of this condition. In addition, PPP1R3B and MBOAT7 might influence NAFLD severity.

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Ting-Hsuan Sun, Chia-Chun Wang, … Kai-Cheng Hsu

Genomic data in the All of Us Research Program

Article Open access 19 February 2024

The All of Us Research Program Genomics Investigators

Introduction

Non-alcoholic fatty liver disease (NAFLD), is a multifactorial disease characterized by an increased hepatic triglyceride content (>5.5% of liver weight) in the absence of an excess of alcohol consumption, HVC infection, familial hypobetalipoproteinemia or endocrine disorders^1,2. NAFLD, which currently represents the leading cause of liver damage in developed countries³, has well established risk factors such as insulin resistance associated with overweight, physical inactivity and type 2 diabetes mellitus (T2DM)⁴. However, epidemiological, familial and twin studies have clearly indicated that the risk of NAFLD has also a strong genetic component⁵.

In the last few years, a large number of genetic investigations, employing single candidate gene as well as genome-wide association studies (GWAS) strategies, have provided compelling evidence that several gene variants are associated with NAFLD⁵. In particular, the rs738409 C > G change in the Patatin-like Phospholipase domain-containing 3 (PNPLA3) gene, coding for the I148M protein variation, has been identified as a major determinant of inter-individual and ethnicity-related differences in hepatic fat content^6,7. The mechanism by which this substitution induces liver fat is related to an impaired hepatocellular triglycerides hydrolysis and increased lipogenesis associated to the 148 M allele^8,9. More recently, the Transmembrane 6 Superfamily Member 2 (TM6SF2) E167K variant has also been shown to increase NAFLD susceptibility¹⁰. This effect appears to be due to an impaired mobilization of neutral lipids for very low-density lipoprotein (VLDL) assembly and secretion by the liver in E167K carriers^11,12,13.

Furthermore, GWAS studies have indicated additional loci whose involvement in the pathogenesis of liver steatosis is less established. In particular, Speliotes E.K. et al.¹⁴ reported that variants in Protein Phosphatase 1 Regulatory Subunit 3B (PPP1R3B), Glucokinase Regulatory Protein (GCKR), Neurocan (NCAN) and Lysophospholipase Like 1 (LYPLAL1) genes were associated to the presence of NAFLD. Finally, based on results obtained in patients with alcoholic liver disease (ALD)¹⁵, Mancina et al.¹⁶ demonstrated an association between the rs641738 in the membrane Bound O-Acyltransferase Domain Containing 7 gene (MBOAT7) and the occurrence and progression of NAFLD. The minor allele (T) was significantly associated with high liver fat content only in European Americans, but not in African Americans and Hispanics. Moreover, this variant showed an additive effect with PNPLA3 and TM6SF2 single nucleotide polymorphisms (SNPs) in determining the risk of liver fibrosis¹⁷. Overall, these findings clearly indicate that the genetic predisposition to NAFLD results from a combination of several variants, which may influence different steps of hepatic lipid and carbohydrate metabolism¹⁸.

Despite this wealth of knowledge, the proportion of genetic risk of NAFLD explained by the identified loci remains modest (<5%). This might be because the majority of GWAS tag SNPs are common and/or lie in intergenic or intronic regions¹⁹. Moreover, GWAS did not capture rare or low frequency risk variants with moderate/strong effects, which could explain a part of this missing heritability. An effective way to overcome these limitations is to re-sequence the entire coding portion of candidate genes to capture all non-genotyped risk alleles. This strategy has already been successfully employed for various conditions where subjects with well distinct phenotypes were genotyped²⁰. In addition, the candidate genes resequencing strategy, by providing a comprehensive evaluation of the polygenic architecture of NAFLD, would allow to weighing the overall as well as the individual contribution of different variants to the risk of this complex trait.

Here we provide the evaluation of genetic determinants of NAFLD using the sequencing analysis of candidate genes emerged from GWAS. To this aim, we have re-sequenced the coding regions of GCKR, PPP1R3B, LYPLAL1, NCAN and TM6SF2 genes in NAFLD and control subjects, where PNPLA3 rs738409 and MBOAT7 rs641738 genotypes were also obtained.

The association of individual variants with NAFLD has been evaluated by using logistic regression analysis. Furthermore, following the logic of recent studies that have tested in complex disorders the combined impact of multiple genetic variants²¹, we determined a polygenic score for NAFLD based on identified risk alleles.

Results

Subjects characteristics

Baseline characteristics of study participants are reported in Table 1. Compared with controls, NAFLD subjects were older, showed higher indices of adiposity and increased plasma triglycerides (TG) and reduced HDL-C concentrations (all P < 0.001). Also fasting plasma levels of glucose, insulin and HOMA_IR values were significantly higher in NAFLD compared with control subjects (all P < 0.001). As expected, the prevalence of T2DM and metabolic syndrome (MetS) was higher in NAFLD than in controls. Moreover, subjects with NAFLD were more frequently smokers and hypertensives (P < 0.001). A statistically significant elevation of ALT (P < 0.001), AST (P = 0.008) and γGT (P < 0.001) were seen in NAFLD compared with control subjects. Among NAFLD subjects, 164 (76.3%) were classified as having moderate to severe liver steatosis according to Hamaguchi’s criteria.

Table 1 Clinical and metabolic characteristics of study subjects.

Full size table

DNA re-sequencing

Overall, 168 variants were identified, of which 100 were intronic and 68 exonic. Among exonic variants, 43 were nonsynonymous (NS), 2 nonsense, 2 frameshift and 21 synonymous (Supplementary Table S1). Thirty one (65.9%) were classifiable as rare (MAF < 0.01), 5 (10.6%) as low frequency/less common (0.01 ≤ MAF < 0.05) and 3 (6.4%) common variants (MAF ≥ 0.05). Six exonic variants (12.7%) were not been previously reported in dbSNP and thus submitted to EXAC database (http://exac.broadinstitute.org). Forty-seven variants were annotated as functional (nonsense, frameshift and nonsynonymous), 23% in GCKR, 14.8% in LYPLAL1, 34% in NCAN, 8.5% in PPP1R3B and 19.4% in TM6SF2 genes. These variants were considered for further analyses.

The list of identified variants with their in silico prediction of deleteriousness is reported in the Supplementary Table S2.

Enrichment of gene variants in NAFLD and controls

Figure 1 shows the percentage of subjects carrying at least one functional variant within each gene in study groups. Overall, 80% of subjects with NAFLD were positive for at least one variant in GCKR, 31% for LYPLAL1, 15% for NCAN, 8% for PPP1R3B and 14% for TM6SF2 genes. Among controls, 79% were positive for at least one variant in GCKR, 34% in LYPLAL1, 10% in NCAN, 4% in PPP1R3B and 8% in TM6SF2 genes. Although variants in NCAN and PPP1R3B appeared to be more frequent in cases compared with controls, only those in TM6SF2 reached the statistical significance (OR = 2.0, 95% CI, 1.0-4.0, P = 0.04). However, after correction for multiple comparisons, the association of TM6SF2 gene was no longer significant.

Association of individual variants with NAFLD

In order to investigate the effect of genetic variants on NAFLD susceptibility, each identified sequence variation was included in a stepwise regression analysis. As all study subjects were also genotyped for the PNPLA3 rs738409 and MBOAT7 rs641738, these variants were also considered. As reported in Table 2, rs1260326 C/T in GCKR, rs58542926 C/T in TM6SF2, rs738409 C/G in PNPLA3 and MBOAT7 rs641738 T allele emerged as significantly associated with the presence of NAFLD (all P ≤ 0.05).

Table 2 Genotype frequencies and Odds Ratios (ORs) of variants associated with NAFLD.

Full size table

After adjustment for covariates such as age, gender, body mass index (BMI), HOMA_IR and TG, a dominant model of inheritance best explained the association with NAFLD of rs738409 C/G PNPLA3 (OR = 3.2, 95% CI, 1.79-5.59, P_adj < 0.001). Conversely, the association of rs1260326 C/T GCKR (OR = 1.9, 95% CI, 1.12-3.46, P_adj = 0.018) fitted better with a recessive model of inheritance. A similar trend was observed for TM6SF2 rs58542926 T and MBOAT7 rs641738 T alleles, although with a borderline level of significance. However, it must be noted that a real estimation of the effect of TM6SF2 variant in the dominant or recessive model could not be provided because of the low frequency of the T-allele (167 K) (only two homozygous subjects). Thus, all calculations were based on the dominant model of inheritance.

When these 4 NAFLD-associated variants were tested together, they explained, overall, about 7% of the genetic risk of NAFLD and the rs738409 in PNPLA3 ranked as the strongest predictor (OR = 3.12, 95% CI, 1.8-5.5, P < 0.001) when adjusted for conventional risk factors (Table 3). Hosmer-Lomeshow goodness of fit test showed that the model combining both genetic and non-genetic variables explained the observed data (X²₆ = 16.45; P = 0.036) with a predictive ability of 58.2%. The results did not change even after including or excluding from the model plasma TG and glucose levels. It is worth to mention that when in this model HOMA_IR was removed from covariates, TM6SF2 T allele reemerged as a significant predictor of NAFLD (OR = 3.6, 95% CI, 1.40-9.27, P_adj = 0.008). This might be explained by the observation that carriers of this variant showed higher levels of HOMA_IR compared to non-carriers (3.2 (2.1-6.0) vs. 1.43 (0.88-1.93), P = 0.002, respectively).

Table 3 Independent associations of genetic variants with NAFLD.

Full size table

Next, we examined the proportion of risk of NAFLD conferred by gene-gene and gene-environment interaction. Although we did not identify any gene-gene synergy, PNPLA3 rs738409 showed a significant inverse interaction with TG (OR = 0.9, 95% CI, 0.97-0.99, P = 0.0021) and HOMA_IR (OR = 0.33, 95% CI, 0.17-0.62, P = 0.001) but not with age, gender or BMI. Moreover, we observed a barely significant interaction between TM6SF2 polymorphism and BMI (OR = 0.8, 95% CI, 0.63-1.01, P = 0.07). Conversely, no interactions between GCKR rs1260326 or MBOAT7 rs641738 and conventional NAFLD risk factors were observed.

Association of genetic variants with metabolic traits

After stratifying the study population by TM6SF2 rs58542926 (dominant model), GCKR rs1260326 (recessive model), PNPLA3 rs738409 (dominant model) and MBOAT7 rs641738 (recessive model) genotypes, difference in clinical, anthropometric and biochemical indices were found across groups. NAFLD individuals carrying the minor T allele of rs58542926 (167 K) (N = 25) showed lower plasma total cholesterol (TC) (P_adj = 0.021) and TG (P_adj = 0.002) and higher AST (P_adj = 0.006) and ALT (P_adj = 0.012) levels when compared with NAFLD patients with wild-type C allele (N = 193). More importantly, the association with TC and TG levels was unchanged after adjustment for BMI, T2DM or statin therapy (all P_adj < 0.05). Similarly, NAFLD patients carrying CG or GG PNPLA3 genotypes (N = 126) compared with non-carriers (N = 92) showed, lower BMI (P_adj = 0.001), lower TG levels (P_adj = 0.001) and HOMA_IR (P_adj = 0.004) and higher AST levels (P_adj < 0.001). Notably, the association of [CG + GG] genotypes with TG levels persisted even after adjustment for BMI and diabetes (P_adj = 0.01). On the contrary, no differences in clinical, anthropometric and biochemical indices were found across the GCKR rs1260326 or the MBOAT7 rs641738 genotypes in patients with NAFLD.

Genetic risk score (GRS) and the risk of NAFLD

The median values of weighted and unweighted 4-SNP GRS were significantly higher in NAFLD than in controls (median unweighted GRS: 3 (2–4) vs. 2 (2–3), P = 0.001; median weighted GRS: 0.38 (0.17–0.50) vs. 0.18 (0.12–0.44), P = 0.03, respectively). When weighted 4-SNP GRS values was distributed according to tertiles (Fig. 2, Panel a), the prevalence of NAFLD significantly increased along with increasing tertiles (χ2 = 14.9, P = 1 × 10^-4) and the risk was significantly higher for GRS values above >0.28 (corresponding to the 2^th tertile) (Fig. 2, Panel b). This trend persisted even after adjustment for age, gender, BMI, HOMA_IR and TG levels.

Association of genetic variants with severity of NAFLD

The association of genetic variants with the ultrasound-defined severity of NAFLD is reported in Table 4. A NGS-identified variant in PPP1R3B gene (rs61756425 G/T p.S41R) emerged as more frequent in NAFLD patients than in controls (χ² = 16.11, P < 0.001) and as the strongest independent genetic predictor of severe hepatic steatosis (OR = 32.6, 95% CI, 4.22-251.4, P_adj = 0.001). This association was maintained even after bootstrap correction (two-tailed P_adj = 0.001). Similarly, the rs641738 in MBOAT7 gene showed a significant effect on NAFLD severity (OR = 2.6, 95% CI, 1.10-6.28, P_adj = 0.022). As expected, age, BMI and HOMA_IR were detected as the non-genetic significant predictors of severity of NAFLD.

Table 4 Predictors of NAFLD severity in the whole cohort.

Full size table

Discussion

NAFLD is a complex trait whose genetic component has been explored by many studies using different approaches¹⁸. Although several genes and genetic variants have been identified as involved in the occurrence of the disease, not all were consistently confirmed. Moreover, their combined effect on NAFLD susceptibility has rarely been explored.

In agreement with previous studies^{6,7,10,14,16,18,22,23}, we found that PNPLA3 rs738409, GCKR rs1260326, TM6SF2 rs58542926 and MBOAT7 rs641738, but not the other GWAS identified variants¹⁴, were genetic contributors to NAFLD in our cohort.

Hepatic fat accumulation results from an imbalance between TGs acquisition, synthesis, utilization and secretion²⁴ and, as yet described, the PNPLA3 I148M, TM6SF2 E167K and GCKR P446L polymorphisms promote steatosis through interaction with distinct metabolic mechanisms. Both genotypes in PNPLA3 and TM6SF2 influence the ability to export very low-density lipoproteins (VLDLs) from the liver^{8,9,10,11,12,13,17}. In addition, p.P446L is a loss-of-function variant that results in increased phosphorylation of glucose²⁵, glycolysis and fatty acid synthesis²⁶. Similarly, recent findings indicated that the rs641738 in MBOAT7 gene, by decreasing the expression of MBOAT7 enzyme, unfavorably affects the acyl remodeling of phospholipid acyl-chain in the liver¹⁶. Taken together, these data indicate that a biologically plausible mechanism by which these gene variants directly influence the development of NAFLD exists. On the other hand, they further support the notion that the impaired lipid handling by hepatocytes has a major causal role in the pathogenesis of NAFLD.

When considered on individual basis, PNPLA3 variant ranked as the strongest genetic predictor of NAFLD followed by GCKR. In contrast, a weaker association was detected for TM6SF2 and MBOAT7. These observations could be partially explained by the low frequency of TM6SF2 risk allele as compared to PNPLA3 risk allele²⁷. In addition, the finding that NAFLD carriers of the TM6SF2 167K allele have increased levels of HOMA_IR might have masked the effect of this variant on NAFLD risk due to the predominant role of insulin resistance in the pathogenesis of NAFLD^28,29. The finding on MBOAT7 is more difficult to be interpreted. However, it must be pointed out that this gene has not been consistently associated with NAFLD in all ethnic groups¹⁶ and, when a role was demonstrated, the MBOAT7 rs641738 variant showed the smallest effect in predisposing to fatty liver²².

Another interesting aspect of our findings is that, in agreement with previous studies, they do not fully support the notion that common variants are the major contributors to NAFLD susceptibility^20,30. In fact, in our cohort the rs58542926 T allele (MAF ~ 7%) displayed a 2.5-fold risk of hepatic steatosis higher than the 1.9 fold risk associated to GCKR T allele (MAF ~ 42%). Thus, according to the hypothesis of Manolio T.A. et al.²⁰, the lower frequency of rs58542926 in the general population could be the reason why we observed a larger effect of TM6SF2 T allele differently to GCKR TT genotype in the occurrence of NAFLD. Moreover, it is important to consider that in our cohort 55% of rs1260326 TT carriers were also heterozygous or homozygous carriers for the PNPLA3 rs738409, thus suggesting that the association with GCKR might be due to genetic bias.

Nevertheless, we confirmed that these genetic variants might act in an additive fashion. In fact, by using a 4-SNP GRS, we observed that the full combination of risk alleles increased the probability of hepatic steatosis up to 5 fold and this effect was present even after adjustment for traditional risk factors. These results emphasize the importance to consider a multiplicity of potentially involved gene variants when studying the genetic epidemiology of a common complex trait as NAFLD^31,32. Of note, only four previous studies, carried out in different ethnic groups, have considered the combined effect of different genetic variants in determining fatty liver^17,22,33,34. However, these Authors only performed genotyping tests for some of all previously GWAS-identified SNPs without exploring the entire coding region of NAFLD-associated loci. In addition, Krawczyk M. et al.¹⁷ concentrated their attention in evaluating the effect of the number of risk alleles (unweighted GRS) only on the grade of steatosis and fibrosis. Although our results need to be further evaluated in larger populations, they highlight the possibility to identify individuals at high risk of NAFLD by genotyping these genetic risk factors. Notably, EASL–EASD–EASO Clinical Practice Guidelines³⁵ already suggest genotyping for TM6SF2 and PNPLA3 to select patients with higher risk of hepatic steatosis.

Our results indicated that the model including genetic and non-genetic variables accounts for the 58.2% of NAFLD heritability. These findings further confirm that the hepatic steatosis is a dynamic process that results from a constant interplay between genetic and environmental determinants and its heritability is not only due to the primary effect of PNPLA3, TM6SF2, GCKR and MBOAT7 genotypes but also by the secondary effects of non-genetic factors. In contrast with Stender S. et al.³⁶, we did not find any synergy between adiposity and genotypes. The lack of interaction with BMI could be related to the fact that we did not quantify the intrahepatic triglyceride content (IHTG) but we considered hepatic steatosis as binary outcome variable in a case-control study³⁷. Thus, while we described an inverse interaction effect between PNPLA3 genotypes HOMA_IR and TG in predicting the higher risk of hepatic steatosis, the modest sample size could be the reason why we did not identified synergy with BMI³⁷.

Finally, we found a significant association between the PPP1R3B rs61756425 and MBOAT7 rs641738 variants in predicting the severity of hepatic steatosis. Although MBOAT7 has been previously associated with progression of NAFLD^16,17,38, the observation on PPP1R3B is novel. This variant has never been identified in GWAS nor associated to severity of NAFLD. This is probably due to its very low frequency in the population, thus supporting the notion that the re-sequencing of entire coding region of candidate genes may capture non-genotyped low-frequency risk alleles^20,39. It must be, however, pointed out that the high value with a wide 95% CI of OR associating PPP1R3B rs61756425 with NAFLD severity may suggest a low level of accuracy of this estimate, mainly due to the very low number of carriers of this variant. Therefore, additional evaluations in much larger cohorts are needed. On the other hand, very recent observations have challenged the role of PPP1R3B as genetic factor predisposing to liver fat accumulation. In fact, Stender S. et al.⁴⁰ have shown in mice that the lack of PPP1R3B was associated with reduced glycogen and unchanged fat liver content; conversely, the hepatic overexpression of PPP1R3B caused accumulation of hepatic glycogen and elevated ALT levels without affecting triglycerides accumulation. Based on this evidence, the role of PPP1R3B rs61756425 variant in liver disease surely requires additional evaluation with a direct confirmation of liver histological changes associated with this variant.

Strengths and limitations of our study must be acknowledged. To the best of ours knowledge, this is the first study reporting a comprehensive evaluation of sequence variants detectable in the entire coding regions of NAFLD-associated loci. This allowed evaluating the individual as well as the cumulative contribution of identified variants to the risk of NAFLD. However, it must be recognized that the definition of NAFLD was based on ultrasound and not on direct measurement of hepatic fat content with a more accurate methods such as the liver magnetic resonance. In addition, the size of the sequencing sample was modest so that some relevant rare variant might have been missed. To this regard, it is noteworthy that the power analysis indicated that our experimental design was able to identify low frequency variants and to detect moderate effects size. More, we did not re-sequenced the PNPLA3 and the MBOAT7 genes. For the former, we have considered the 148 M variant as it represents the only variant in this gene associated to hepatic fat content⁴¹. In fact, Donati B et al. by re-sequencing a cohort of children with early-onset histological NAFLD did not find any additional predictive rare variant in the PNPLA3 gene⁴¹. In addition, we genotyped only the MBOAT7 rs641738 variant as it was reported to display the major role in NAFLD^16,42. Finally, an additional limitation of our study was the lack of replication of identified rare variants in an independent sample. However, it must be considered that in the present study we have re-sequenced well-known genetic determinants of NAFLD not requiring replication. Nevertheless, the analysis of rare variants was adjusted for multiple testing.

In conclusion, we confirmed the role of PNPLA3, TM6SF2, GCKR and MBOAT7 gene variants as genetic determinants of NAFLD and we suggested a weighted GRS based on their additive and combined effect. We believe these results point the way towards a future feasibility of creating comprehensive risk factor panels, in which applying genetic testing for the individual-level NAFLD-risk prediction. If definitely confirmed, our GRS score could offer the opportunity to exclude low-risk patients from screening tests.

Material and Methods

Study subjects

We studied 445 Caucasian, unrelated subjects, 218 with echographically defined NAFLD and 227 healthy controls. The enrollment criteria as well as the protocol for clinical and biochemical characterization of study subjects have been reported elsewhere⁴³. In brief, NAFLD subjects, were considered eligible after exclusion of secondary causes of liver steatosis such as previous viral infection, past or present history of alcohol abuse (defined as an average daily consumption >20 g/day), use of drugs known to influence the development of hepatic steatosis as well as clinical and biochemical evidence of chronic liver diseases. Healthy controls, recruited from blood donors, were selected based on the absence of advanced liver disease at ultrasound⁴³. Liver ultrasonography was performed with a GE Vivid S6 apparatus equipped with a 3.5-MHz convex-array probe. All examinations were done by the same hepatologist and steatosis was assessed semi-quantitatively on a scale of 0–6: 0, absent; 1, 2 mild; 3, 4 moderate; and 5, 6 severe according to the Hamaguchi criteria⁴⁴.

The study protocol was reviewed and approved by the Ethics Committee of Sapienza University of Rome, Policlinico Umberto I (Rome, Italy). Written informed consent was obtained from all participants in accordance with the principles of the Helsinki Declaration. All methods were carried out in accordance with the relevant guidelines and regulations.

DNA analysis

Selection of candidate genes

The genes considered for next generation sequencing (NGS) were the following: GCKR, NCAN, PPP1R3B, LYPLAL1 and TM6SF2. They were selected because reported by GWAS to be associated with NAFLD above a significance threshold of P < 10⁻⁴ for any tagging SNPs^10,14. In our screening, we have also considered the PNPLA3 rs738409 and the MBOAT7 rs641738, as previously demonstrated to be genetic determinants of NAFLD^6,16. The genotyping of these latter variants was performed in duplicate by TaqMan 5′-Nucleotidase assay having a concordance rate of 100%.

Next-generation sequencing (NGS)

A custom panel was designed with the help of the AmpliSeq designer online tool (https://www.ampliseq.com), which was employed to generate optimized primer designs for the five genes present in the human reference genome (hg19). The overall coverage of the design region was 99,9%. (Pipeline version 4.2). Amplicon library preparation was performed with the Ion Ampliseq Library kit v2.0 using 10 ng of DNA (Thermo Fisher Scientific). PCR products were partially digested using FuPa reagent, followed by the ligation of barcoded sequencing adapters (Ion Xpress Barcode Adapters kit; Life Technologies, Carlsbad, CA, USA). Final libraries were purified using Agencourt AMPure XP magnetic beads (Beckman Coulter, Brea, CA, USA) and quantified using a Qubit 3.0 Fluorometer (Thermo Fisher Scientific, Wilmington, DE). The individual libraries were diluted to a final concentration of 100 pM and were pooled and processed to library amplification using Ion PGM Template OT2 400 kit. Unenriched libraries were quality-controlled using Ion Sphere quality control measurement on a Qubit instrument. Following library enrichment (Ion OneTouch ES), libraries were processed for sequencing by using the Ion PGM Hi-Q Sequencing Kit v2.

Data filtering and analysis

Sequencing runs were analyzed using the Torrent Suite v4.4.3 analysis. SNPs and insertion/deletions were identified across the targeted subset of the reference genome (hg19) using the analysis plug-in Torrent Variant Caller with the parameter settings optimized for germline low stringency and minimal false positive calls. The output variant call format (VCF) file was then annotated through Ion Reporter (Ion Reporter™ Software 4.6) and wANNOVAR softwares (http://wannovar.wglab.org). All sequencing variants were filtered using our custom NGS pipeline. All variants with Depth Coverage (DP) ≥30, Genotype Quality (GQ) ≥30, Allele Frequency (AF) ≥33 and ≤50 or ≥70 and ≤100, with balanced Alternate Allele Observations on the forward strand (SAF) and Alternate Allele Observations on the reverse strand (SAR) were considered as high confident variants and used for further analysis. Twenty four variants in 77 subjects with moderate quality were retested by Sanger sequencing on an ABI PRISM 3130 XL Genetic Analyzer following standard protocols. Overall, 13 variants with AF <33 and DP <20 were not confirmed.

The damaging effect of identified missense variants were evaluated by in silico prediction softwares. SIFT, PolyPhen-2, Provean, SNP&GO and Mutation T@ster Prediction softwares were used. A collective predictive score, ranging from 0–10, was calculated as the sum of individual scores of the 5 tools utilized, each being 0 (Neutral/benign/Polymorphism) or 1 (possibly damaging by Polyphen) or 2 (Disease Causing/Probably Damaging). Variants were defined as damaging if reported as deleterious in at least three of five prediction tools.

All common (MAF >5%), low frequency and rare variants (MAF ≥1 or ≤5% and <1%, respectively) annotated as functional (nonsense, frameshift, splice-region and missense) were considered for the analysis.

Power Calculation

Power analysis was performed by the Genetic Association Study (GAS) Power Calculator (© 2017 Jennifer Li Johnson | University of Michigan), commonly used to compute statistical power for one-stage genetic association studies within the setting of additive or multiplicative genetic models. The prevalence of NAFLD in our population was estimated as 0.30 and the odd ratio (OR) for each risk allele of the tested genetic variants was set at approximately 2.0, as estimated in previous studies⁴⁵. Assuming an allele frequency of 0.3 (for common variants) and 0.03 (for low-frequency variants) in the general population and an additive model for disease risk, with a sample size of 218 cases and 227 controls the expected power under a significance level of 0.05 was of 100% to identify common genetic associations and 87% to identify low-frequency genetic associations. Notably our analysis had a power of 0.80 to detect genetic effects with OR of at least 1.9 for low-frequency and 1.35 for common variants.

Genetic Risk Score Computation

The Genetic Risk Score (GRS) was calculated based on the four SNPs reaching the highest levels of significance for NAFLD: rs1260326 C/T in GCKR, rs58542926 C/T in TM6SF2, rs738409 C/G in PNPLA3 and rs641738 C/T in MBOAT7 genes. Two methods were used to create the GRS: a simple count method (unweighted GRS) and a weighted method (weighted GRS)^31,32. The count method assumed that each SNP contributed equally to NAFLD risk and was calculated applying a linear weighting of 0, 1 and 2 to genotypes containing 0, 1, or 2 risk alleles, respectively. While in our population we found only two homozygous subjects for rs58542926 in TM6SF2 gene this produced a score between 0 and 7, representing the maximum total number of risk alleles. The weighted 4-SNP GRS was calculated by multiplying each β -coefficient for the NAFLD phenotype obtained from the Dallas Heart Study^6,22 by the number of corresponding risk alleles (0, 1, or 2) and then summing the products. The β–coefficient considered for each SNP were: 0.2653 (rs738409 PNPLA3), 0.2711 (rs58542926 TM6SF2), 0.0649 (rs1260326 GCKR) and 0.0575 (rs641738 MBOAT7). The 4-SNP GRS was modelled as a continuous variable and then categorized into tertiles.

Statistical analysis

The two-sample t-test (for parametric variables) or the Mann–Whitney test (for non-parametric variables) was used to compare the difference between case and control groups for quantitative traits, while Pearson’s χ² test was used to compare discrete traits. Deviations of genotype frequency from the Hardy–Weinberg assumption were assessed using a χ² test. Differences in allele and genotype frequencies between cases and controls were assessed by χ² test under either dominant or recessive model of penetrance.

The enrichment of gene variants was evaluated by counting NAFLD cases and controls positive for at least on functional variant in each candidate gene.

Logistic regression analysis (Forward-Wald Statistic or Enter method) were adopted to assess the most significant model of inheritance for each SNP, the joint effects of genes and clinical variables and to evaluate gene-gene and gene-environmental interactions. The adequacy of the final model was assessed using Hosmer-Lameshow goodness-of-fit test. Furthermore, the Nagelkerke R² was calculated to indicate how useful the explanatory variables in the model were in predicting NAFLD⁴⁶. We further analysed the association of PNPLA3 rs738409, TM6SF2 rs58542926, GCKR rs1260326 and MBOAT7 rs641738 with biochemical indices by using General Linear Model test with bootstrap correction including age and sex as covariates. TG and TC were also adjusted for BMI, diabetes and statin therapy while HOMA_IR was adjusted for BMI. For variables with skewed distributions (ALT, AST, TG, HOMA_IR), a logarithm was applied before analysis to ensure that the residuals were approximately normal and had constant variance.

The effect of studied variants as well as of additional risk factors on the degree of hepatic steatosis were analyzed in logistic regression analysis (Forward-Wald Statistic) by comparing patients with severe NAFLD as having Hamaguchi score = 5–6 (N = 71) with patients without or with mild-moderate hepatic steatosis (Hamaguchi score = 0–4) (N = 362).

Associations between NAFLD risk and 4-SNP GRS were tested using Pearson correlations or logistic regression analysis.

Multiple comparisons were adjusted by bootstrap correction based on 1000 bootstrap samples with the aim to adjust raw p-value thus obtaining more robust estimates of standard errors and confidence intervals of parameters included in the models. Statistical significance was taken at nominal P-value < 0.05 for all comparisons. All analyses were performed using SPSS package (version 22.0) (SPSS, Inc., Chicago, IL, USA).

Data availability statement

All data generated or analyzed during this study are included in this published article (and its Supplementary Information files). The datasets generated during and/or analyzed during the current study are not publicly available due to the lack of a specific patients’ consent but are made available by corresponding author based on reasonable request.

References

Chalasani, N. et al. The diagnosis and management of non-alcoholic fatty liver disease: practice guideline by the American association for the study of liver diseases, American college of gastroenterology and the American gastroenterological association. Gastroenterology. 142, 1592–1609 (2012).
Article PubMed Google Scholar
Nascimbeni, F. et al. From NAFLD in clinical practice to answers from guidelines. J. Hepatol. 59, 859–871 (2013).
Article PubMed Google Scholar
Browning, J. D. et al. Prevalence of hepatic steatosis in an urban population in United States: impact of ethnicity. Hepatology 40, 1387–1395 (2004).
Article PubMed Google Scholar
Marchesini, G. et al. Nonalcoholic fatty liver disease: a feature of the metabolic syndrome. Diabetes. 50, 1844–50 (2001).
Article CAS PubMed Google Scholar
Dongiovanni, P. & Valenti, L. Genetics of nonalcoholic fatty liver disease. Metabolism. 65, 1026–37 (2016).
Article CAS PubMed Google Scholar
Romeo, S. et al. Genetic variation in PNPLA3 confers susceptibility to nonalcoholic fatty liver disease. Nat. Gene. 40, 1461–1465 (2008).
CAS Google Scholar
Sookoian, S. & Pirola, C. J. Meta-analysis of the influence of I148M variant of patatin-like phospholipase domain containing 3 gene (PNPLA3) on the susceptibility and histological severity of nonalcoholic fatty liver disease. Hepatology 53, 1883–1894 (2011).
Article CAS PubMed Google Scholar
Smagris, E. et al. Pnpla3I148M knockin mice accumulate PNPLA3 on lipid droplets and develop hepatic steatosis. Hepatology 61, 108–18 (2015).
Article CAS PubMed Google Scholar
Pirazzi, C. et al. Patatin-like phospholipase domain-containing 3 (PNPLA3) I148M (rs738409) affects hepatic VLDL secretion in humans and in vitro. J Hepatol. 57, 1276–82 (2012).
Article CAS PubMed Google Scholar
Kozlitina, J. et al. Exome-wide association study identifies a TM6SF2 variant that confers susceptibility to nonalcoholic fatty liver disease. Nat Genet. 46, 352–6 (2014).
Article CAS PubMed PubMed Central Google Scholar
Smagris, E., Gilyard, S., BasuRay, S., Cohen, J. C. & Hobbs, H. H. Inactivation of Tm6sf2, a Gene Defective in Fatty Liver Disease, Impairs Lipidation but Not Secretion of Very Low Density Lipoproteins. J Biol Chem. 13, 10659–76 (2016).
Article Google Scholar
Dongiovanni, P. et al. Transmembrane 6 superfamily member 2 gene variant disentangles nonalcoholic steatohepatitis from cardiovascular disease. Hepatology 61, 506–14 (2015).
Article CAS PubMed Google Scholar
Mahdessian, H. et al. TM6SF2 is a regulator of liver fat metabolism influencing triglyceride secretion and hepatic lipid droplet content. Proc Natl Acad Sci USA 111, 8913–8918 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Speliotes, E. K. et al. Genome-wide association analysis identifies variants associated with non-alcoholic fatty liver disease that have distinct effects on metabolic traits. PLoS Genet. 7, e1001324 (2011).
Article CAS PubMed PubMed Central Google Scholar
Buch, S. et al. A genome-wide association study confirms PNPLA3 and identifies TM6SF2 and MBOAT7 as risk loci for alcohol-related cirrhosis. Nat Genet. 47, 1443–8 (2015).
Article CAS PubMed Google Scholar
Mancina, R. M. et al. The MBOAT7-TMC4 Variant rs641738 Increases Risk of Nonalcoholic Fatty Liver Disease in Individuals of European Descent. Gastroenterology. 150, 1219–1230 (2016).
Article CAS PubMed PubMed Central Google Scholar
Krawczyk, M. et al. Combined effects of the PNPLA3 rs738409, TM6SF2 rs58542926 and MBOAT7 rs641738 variants on NAFLD severity: a multicenter biopsy-based study. J Lipid Res. 58, 247–255 (2017).
Article CAS PubMed Google Scholar
Sookoian, S. & Pirola, C. J. Genetic predisposition in nonalcoholic fatty liver disease. Clin Mol Hepatol. 23, 1–12 (2017).
Article PubMed PubMed Central Google Scholar
Edwards, S. L., Beesley, J., French, J. D. & Dunning, A. M. Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet. 93, 779–97 (2013).
Article CAS PubMed PubMed Central Google Scholar
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Cooke Bailey, J. N. & Igo, R. P. Jr. Genetic Risk Scores. Curr Protoc Hum Genet. 91, 1.29.1–1.29.9 (2016).
Article Google Scholar
Dongiovanni, P. et al. Causal relationship of hepatic fat with liver damage and insulin resistance in nonalcoholic fatty liver. J Intern Med. https://doi.org/10.1111/joim.12719 (2017).
Wang, X., Liu, Z., Peng, Z. & Liu, W. The TM6SF2 rs58542926 T Allele Is Significantly Associated with Nonalcoholic Fatty Liver Disease in Chinese. J Hepatol. 62, 1438–1439 (2015).
Article CAS PubMed Google Scholar
Cohen, J. C., Horton, J. D. & Hobbs, H. H. Human fatty liver disease: old questions and new insights. Science. 332, 1519–23 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Rees, M. G. et al. Cellular characterisation of the GCKR P446L variant associated with type 2 diabetes risk. Diabetologia 55, 114–122 (2012).
Article CAS PubMed Google Scholar
Santoro, N. et al. Hepatic de novo lipogenesis in obese youth is modulated by a common variant in the GCKR gene. J. Clin. Endocrinol. Metab. 100, E1125–E1132 (2015).
Article PubMed PubMed Central Google Scholar
Sookoian, S. et al. Genetic variation in transmembrane 6 superfamily member 2 and the risk of nonalcoholic fatty liver disease and histological disease severity. Hepatology 61, 515–25 (2015).
Article CAS PubMed Google Scholar
Sookoian, S. & Pirola, C. J. Systematic review with meta-analysis: risk factors for non-alcoholic fatty liver disease suggest a shared altered metabolic and cardiovascular profile between lean and obese patients. Aliment Pharmacol Ther. 46, 85–95 (2017).
Article CAS PubMed Google Scholar
Benedic, M. & Zhang, X. Non-alcoholic fatty liver disease: An expanded review. World J Hepatol. 9, 715–732 (2017).
Article Google Scholar
Browning, J. D. Common genetic variants and nonalcoholic fatty liver disease. Clin Gastroenterol Hepatol. 11, 1191–3 (2013).
Article PubMed Google Scholar
Anderson, J. L. et al. Joint effects of common genetic variants from multiple genes and pathways on the risk of premature coronary artery disease. Am Heart J. 160, 250–256 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Läll, K., Mägi, R., Morris, A., Metspalu, A. & Fischer, K. Personalized risk prediction for type 2 diabetes: the potential of genetic risk scores. Genet Med. 19, 322–329 (2017).
Article PubMed Google Scholar
Leon-Mimila, P. et al. A genetic risk score is associated with hepatic triglyceride content and non-alcoholic steatohepatitis in Mexicans with morbid obesity. Exp Mol Pathol. 98, 178–83 (2015).
Article CAS PubMed Google Scholar
Wang, X. et al. Additive Effects of the Risk Alleles of PNPLA3 and TM6SF2 on Non-alcoholic Fatty Liver Disease (NAFLD) in a Chinese Population. Front Genet. 7, 140 (2016).
PubMed PubMed Central Google Scholar
European Association for the Study of the Liver (EASL), European Association for the Study of Diabetes (EASD) and European Association for the Study of Obesity (EASO). EASL–EASD–EASO. Clinical Practice Guidelines for the management of non-alcoholic fatty liver disease. J Hepatol. 64, 1388–1402 (2016).
Google Scholar
Stender, S. et al. Adiposity amplifies the genetic risk of fatty liver disease conferred by multiple loci. Nat Genet. 49, 842–847 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hernandez, L. M. & Blazer, D. G. Genes, Behavior and the Social Environment: Moving Beyond theNature/Nurture Debate. Washington (DC): National Academies Press (US). 3, 44–62 (2006).
Google Scholar
Luukkonen, P. K. et al. The MBOAT7 variant rs641738 alters hepatic phosphatidylinositols and increases severity of non-alcoholic fatty liver disease in humans. J Hepatol. 65, 1263–1265 (2016).
Article CAS PubMed Google Scholar
Mardis, E. R. The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 (2008).
Article CAS PubMed Google Scholar
Stender, S. et al. Relationship between Genetic Variation at PPP1R3B and Liver Glycogen and Triglyceride Levels. Hepatology, https://doi.org/10.1002/hep.29751 (2017).
Donati, B. et al. The rs2294918 E434K variant modulates patatin-like phospholipase domain-containing 3 expression and liver damage. Hepatology 63, 787–98 (2016).
Article CAS PubMed Google Scholar
Viitasalo, A. et al. Association of MBOAT7 gene variant with plasma ALT levels in children: the PANIC study. Pediatr Res. 80, 651–655 (2016).
Article CAS PubMed Google Scholar
Di Costanzo, A. et al. Non-alcoholic fatty liver disease and subclinical atherosclerosis: A comparison of metabolically- versus genetically-driven excess fat hepatic storage. Atherosclerosis. 257, 232–239 (2017).
Article PubMed Google Scholar
Hamaguchi, M. et al. The severity of ultrasonographic findings in nonalcoholic fatty liver disease reflects the metabolic syndrome and visceral fat accumulation. Am. J. Gastroenterol. 102, 2708–2715 (2007).
Article PubMed Google Scholar
Kahali., B., Halligan, B. & Speliotes, E. K. Insights from Genome-Wide Association Analyses of Nonalcoholic Fatty Liver Disease. Semin Liver Dis. 35, 375–391 (2015).
Article PubMed PubMed Central Google Scholar
Bewick, V., Cheek, L. & Ball, J. Statistics review 14: Logistic regression. Crit Care. 9, 112–8 (2005).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work has been partially supported by Grant C26A14HZBX from Sapienza University of Rome.

Author information

Authors and Affiliations

Departments of Internal Medicine and Medical Specialties, “Sapienza” University, Rome, Italy
Alessia Di Costanzo, Marialuisa Sponziello, Laura D’Erasmo, Licia Polimeni, Francesco Baratta, Daniele Pastori, Anna Montali, Maria Del Ben & Marcello Arca
Molecular Medicine, “Sapienza” University, Rome, Italy
Francesca Belardinilli, Antonio Angeloni & Giuseppe Giannini
Experimental Medicine, “Sapienza” University, Rome, Italy
Diego Bailetti
Anatomical, Histological, Forensic Medicine and Ortopedics Sciences, “Sapienza” University, Rome, Italy
Francesco Baratta & Daniele Pastori
Cellular Biotechnologies and Hematology, “Sapienza” University, Rome, Italy
Fabrizio Ceci
Immunohematology and Transfusion Medicine Unit, “Sapienza” University, Rome, Italy
Gabriella Girelli & Bruna De Masi
Public Health and Infectious Diseases, “Sapienza” University, Rome, Italy
Francesco Angelico

Authors

Alessia Di Costanzo
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Belardinilli
View author publications
You can also search for this author in PubMed Google Scholar
Diego Bailetti
View author publications
You can also search for this author in PubMed Google Scholar
Marialuisa Sponziello
View author publications
You can also search for this author in PubMed Google Scholar
Laura D’Erasmo
View author publications
You can also search for this author in PubMed Google Scholar
Licia Polimeni
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Baratta
View author publications
You can also search for this author in PubMed Google Scholar
Daniele Pastori
View author publications
You can also search for this author in PubMed Google Scholar
Fabrizio Ceci
View author publications
You can also search for this author in PubMed Google Scholar
Anna Montali
View author publications
You can also search for this author in PubMed Google Scholar
Gabriella Girelli
View author publications
You can also search for this author in PubMed Google Scholar
Bruna De Masi
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Angeloni
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe Giannini
View author publications
You can also search for this author in PubMed Google Scholar
Maria Del Ben
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Angelico
View author publications
You can also search for this author in PubMed Google Scholar
Marcello Arca
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The listed authors have contributed to the work as follows: A.D.C., M.A., F.A., M.D.B. designed the study, reviewed all analyses, interpreted the data and prepared the manuscript; L.D.E., F.B., D.P., G.G., B.D.M., M.D.B., F.A. recruited subjects and performed all clinical evaluations; L.P. carried out the liver ultrasound examination; A.M., A.A. and F.C. executed the biochemical analyses; A.D.C., D.B., F.B. and G.G. performed next generation sequencing and A.D.C., D.B. and M.S. carried out genetic data filtering and analysis; A.D.C. performed genotyping tests and all statistical analyses. All authors reviewed the manuscript.

Corresponding author

Correspondence to Alessia Di Costanzo.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Table S1 and TableS2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Di Costanzo, A., Belardinilli, F., Bailetti, D. et al. Evaluation of Polygenic Determinants of Non-Alcoholic Fatty Liver Disease (NAFLD) By a Candidate Genes Resequencing Strategy. Sci Rep 8, 3702 (2018). https://doi.org/10.1038/s41598-018-21939-0

Download citation

Received: 03 October 2017
Accepted: 13 February 2018
Published: 27 February 2018
DOI: https://doi.org/10.1038/s41598-018-21939-0

This article is cited by

Association between PNPLA3 rs738409 polymorphism and nonalcoholic fatty liver disease: a systematic review and meta-analysis
- Nader Salari
- Niloufar Darvishi
- Masoud Mohammadi
BMC Endocrine Disorders (2021)
Genetic and metabolic predictors of hepatic fat content in a cohort of Italian children with obesity
- Alessia Di Costanzo
- Lucia Pacifico
- Marcello Arca
Pediatric Research (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Genome-wide association studies

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Genomic data in the All of Us Research Program

Introduction

Results

Subjects characteristics

DNA re-sequencing

Enrichment of gene variants in NAFLD and controls

Association of individual variants with NAFLD

Association of genetic variants with metabolic traits

Genetic risk score (GRS) and the risk of NAFLD

Association of genetic variants with severity of NAFLD

Discussion

Material and Methods

Study subjects

DNA analysis

Selection of candidate genes

Next-generation sequencing (NGS)

Data filtering and analysis

Power Calculation

Genetic Risk Score Computation

Statistical analysis

Data availability statement

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary Table S1 and TableS2

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Association between PNPLA3 rs738409 polymorphism and nonalcoholic fatty liver disease: a systematic review and meta-analysis

Genetic and metabolic predictors of hepatic fat content in a cohort of Italian children with obesity

Comments

Search

Quick links