Introduction

Nonalcoholic fatty liver disease (NAFLD) is the most common cause of liver disease in western countries, affecting up to 20–35% of the general population1 and has emerged as a major public health issue worldwide2,3. NAFLD has a broad spectrum of manifestations and can be histologically subdivided into simple steatosis and non-alcoholic steatohepatitis (NASH), which include steatosis, lobular inflammation and hepatocyte ballooning with or without fibrosis4. Although simple steatosis is generally considered to have a benign hepatological prognosis, NASH much more frequently progresses to fibrosis, cirrhosis and hepatocellular carcinoma in later years5,6 and will be the leading cause of liver transplantation in the United States by 20207. The precise mechanism responsible for the development and progression of NAFLD has not been elucidated. Some NAFLD patients will progress into NASH with cirrhosis, whereas others do not develop beyond simple steatosis. Currently, there is increasing evidence that genetic8,9,10 as well as environmental factors11 play important roles in the progression of NAFLD.

The human patatin-like phospholipase-3 (PNPLA3) gene is localized on human chromosome 22. The PNPLA3 protein, which is also known as adiponutrin, is expressed in both adipocytes and hepatocytes12. PNPLA3 exhibits lipase activity against triglycerides and acylglycerol transacetylase activity and its expression is highly responsive in energy mobilization and the storage of lipid droplets13. The PNPLA3 gene is one of the potential candidate genes currently related to NAFLD susceptibility. In 1998, Romeo et al. noted that a single nucleotide polymorphism in residue 148 (I148 M, rs738409), which exhibits a C-to-G transition resulting in an amino acid substitution of isoleucine to methionine, was a strong genetic determinant of NAFLD10. Consistent with this result, some following studies also demonstrated an association between the rs738409 polymorphism and NAFLD risk14,15,16. However, it is unclear whether this polymorphism is associated with simple steatosis only or also associated with NASH. Further studies have also attempted to analyze the association between the rs738409 polymorphism and histological parameters of NAFLD17,18,19, but the results are not consistent, partially because only few studies with a limited number of subjects analyzed the association between the rs738409 polymorphism and NASH or simple steatosis.

There is no approved therapy for NAFLD and the diagnosis of NASH can only be proven by liver biopsy. In addition, it is important to establish whether the associations differ between different subgroups of NAFLD. To clarify the association between the rs738409 polymorphism and risk of NAFLD, we conducted a systematic review and meta-analysis of the available prospective studies with the specific aims of analyzing NAFLD subgroups, including simple steatosis or NASH, to clarify whether the association differed by histological parameters.

Methods

Search strategy

We conducted an electronic search of the PubMed, EMBASE and Web of Science databases from their inception until December 2014 to identify the association between the rs738409 polymorphism and NAFLD risk using the following search terms: PNPLA3 and (polymorphism or variant or variation) and (NAFLD or NASH or (non-alcoholic fatty liver disease) or (fatty liver) or steatohepatitis). Additional studies not captured by our database search were identified by surveying the references of the originally identified reviews and research reports and by using the MEDLINE option “Related Articles”. The search was confined to human studies without country restrictions. In addition, the publication language was restricted to English.

Inclusion and exclusion criteria

Potentially relevant studies were selected based on the following inclusion criteria: (1) studies concerning the association between the PNPLA3 rs738409 polymorphism and risk of NAFLD; (2) case-control studies based on unrelated individuals; (3) studies in which the diagnosis of NAFLD was clear; (4) studies that provide the number of NAFLD cases and controls and the frequency of the rs738409 genotypes; and (5) studies published in English. The major reasons for study exclusion were the following: (1) case-only study or overlapping data; (2) studies with a sample size less than one hundred; (3) studies with abstracts only and reports published as comment and review papers; and (4) studies with secondary causes of steatosis, including alcohol abuse, the use of drugs, surgical procedures and hepatitis B and hepatitis C virus infection.

Data extraction

Two investigators independently selected the trials and extracted the data and disagreements or uncertainties were resolved by consensus. The following data were extracted: first author, publication year, country of origin, ethnicity of studied population, sex ratio, mean age, diagnostic criteria for NAFLD, number of individuals in the case and control groups, frequency of PNPLA3 genotypes in the cases and controls; and consistency with the Hardy-Weinberg equilibrium(HWEs).

Study quality assessment

The quality of the studies was assessed independently by two investigators according to the quality assessment scores developed from the genetic association studies conducted by Thakkinstian et al. The total scores ranged from 0 (worst) to 13 (best)20. The criteria of the quality assessment used to analyze the studies in this meta-analysis are available in Table S1.

Statistical analysis

The strength of the association between the PNPLA3 polymorphism and NAFLD risk was assessed by the odds ratios (ORs) and 95% confidence interval (CI). The Chi-square test was used to assess the Hardy-Weinberg equilibrium (HWE) in order to analyze the genotype distribution in the control groups. Meta-analyses were performed for four genotype contrasts per outcome: allele contrast (G versus C), dominant model (GG+CG versus CC), recessive model (GG versus CG+CC) and additive model (GG versus CC)21,22. The Cochrane Q statistic and the inconsistency index (I2) were used to calculate the heterogeneity among the studies and a P value < 0.10 or I2 > 50% was considered to be significant23. If heterogeneity existed among the studies, the random-effect model (the Dersimonian and Laird method) was used to calculate the pooled OR. Otherwise, a fixed-effect model (the Mantel-Haenszel method) was used for outcomes without obvious heterogeneity24. Sensitivity analyses were performed to assess the stability of the results by excluding one study at a time in order to analyze the influence of each study on the overall OR. The publication bias was assessed using funnel plots and Egger's test25.

Three subgroup analyses were additionally carried out by ethnicity (Caucasian, Asian or Hispanics), mean age (pediatric or adult) and source of the controls (hospital based or population based). The statistical analysis was performed with RevMan software version 5 (Cochrane Collaboration) and STATA software version 10.0 (Stata Corporation). A P value < 0.05 was considered to be statistically significant in this trial unless otherwise specified.

Results

Literature search

The search strategy initially identified 419 potentially relevant articles and 363 articles were determined to be irrelevant after a review of the titles and abstracts. Thus, 56 trials proceeded to a full-text review and an additional 33 studies were excluded. Finally, 23 articles were ultimately selected for inclusion in the meta-analysis17,18,19,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45. A flow describing the article selection process for this meta-analysis is shown in Figure 1. Of all of the studies included, 10 studies involved Caucasians17,19,26,27,28,29,30,31,32,42, 12 studies investigated Asians18,33,34,35,36,37,38,39,40,43,44,45 and 1 study researched Hispanics41. All of the studies followed a case-control design, 8 studies used population-based controls16,18,28,33,40,41,42,43 and 15 studies used hospital-based controls17,19,26,29,30,31,32,34,35,36,37,38,39,44,45. In addition, 19 studies were conducted in adult patients16,17,18,19,26,28,29,30,31,32,34,35,36,37,38,39,42,44,45 and 4 investigated pediatric patients33,40,41,43. The distribution of genotypes in the controls was consistent with HWE in 21 studies17,18,19,26,27,29,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45 and insufficient in the 2 other studies28,30. The quality score of the included studies ranged from 7 to 11 (Table S1). The characteristics of the included studies are presented in Table 1.

Table 1 Characteristics of Studies Included in this meta-analysis
Figure 1
figure 1

Flowchart of the study selection.

Association between rs738409 and risk for NAFLD

All studies

A total of 23 studies with 6071 cases and 10366 controls reported an association between the rs738409 polymorphism and NAFLD risk17,18,19,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41. Overall, the frequency of the G allele was 49.5% in NAFLD and 34.8% in the controls. The Hispanic population bears the highest frequency of the G allele (69.0% cases vs. 41.9% controls), followed by the Asian (54.2% cases vs. 39.9% controls) and Caucasian (42.2% cases vs. 22.7% controls) populations. The distribution of the rs738409 genotypes and alleles is presented in Table 2. Strong evidence of an association between the rs738409 polymorphism and NAFLD risk was found in all genetic models: allele contrast (OR = 2.10, 95% CI = 1.78–2.48, P < 0.00001; heterogeneity test: I2 = 89%, P < 0.00001); dominant model (OR = 2.06, 95% CI = 1.75–2.43, P < 0.00001; heterogeneity test: I2 = 67%, P < 0.00001); recessive model (OR = 2.49, 95% CI = 2.01–3.08, P < 0.00001; heterogeneity test: I2 = 72%, P < 0.0001); and additive model (OR = 3.41, 95% CI = 2.57–4.52, P < 0.00001; heterogeneity test: I2 = 77%, P < 0.00001) (Figure 2). After exclusion of the two articles deviating from HWE in the cases and controls, the results of the relationship was not influenced significantly in all genetic models (Table 3).

Table 2 The distribution of alleles and genotypes of PNPLA3 in NAFLD studies
Table 3 Association between PNPLA3 polymorphism and NAFLD risk
Figure 2
figure 2

Forest plot of NAFLD susceptibility associated with rs738409 polymorphism at additive model (GG vs CC).

Subgroup analyses

Subgroup analyses were conducted to explore the differences between ethnicity, mean age and sources of the controls. In the subgroup analysis by ethnicity, significant association was found between the rs738409 polymorphism and NAFLD risk among the Caucasian, Asian and Hispanic populations. The association between rs738409 polymorphism and NAFLD was most significant in Hispanic population, which followed by Caucasian population and the association was weakest in Asian population. The analyses also showed that the risk of NAFLD was significantly increased in both adult participants and pediatric subjects. In addition, the G allele was strongly associated with NAFLD susceptibility in hospital-based controls and population-based controls. The results were consistent in all genetic models. More details are presented in Table 3.

Histological Severity of NAFLD

Five eligible studies were used to investigate the association between the rs738409 polymorphism and lobular necroinflammation, including 1978 patients. A statistically significant association was seen between carrying GG genotype and higher inflammation scores (OR = 3.13, 95% CI = 2.76–3.56, P < 0.00001; heterogeneity test: I2 = 0%, P = 0.674) with obvious publication bias (Egger test: P = 0.980) (Figure 3). The six eligible studies with 2552 patients analyze the relationship between rs738409 polymorphism and fibrosis. The analysis pointed out that the GG genotype was significantly associated with fibrosis score (OR = 3.11, 95% CI = 2.66–3.65, P < 0.00001; heterogeneity test: I2 = 18.3%, P = 0.295) and the publication bias was not significant (Egger test: P = 0.457) (Figure 3).

Figure 3
figure 3

(A)The forest plot for the association between rs738409 polymorphism and the risk of necroinflammation (additive model: GG vs CC). (B)The forest plot for the association between rs738409 polymorphism and the risk of fibrosis (additive model: GG vs CC).

Association between rs738409 and risk for simple steatosis

Overall, 7 studies with 387 cases and 2306 controls analyzed the rs738409 polymorphism and risk of simple steatosis17,18,19,28,32,34,36. Interestingly, the frequency of the risk G allele was very close between the cases (38.1%) and controls (38.0%). In Caucasian subjects, the frequency of the G allele was 34.3% in cases and 23.2% in controls and these values are lower than those found in the Asian population (44.3% cases vs. 42.3% controls) (Table 4). We analyzed the relationship between the G allele and the risk of simple steatosis. No significant association was observed between rs738409 polymorphism and simple steatosis under additive model (OR = 1.34, 95% CI = 0.82–2.20, P = 0.25; heterogeneity test: I2 = 0%, P = 0.58), dominant model and recessive model (Figure 4). However, a significant association was found in allele contrast. A further subgroup analysis based on ethnicity showed no obvious association between the rs738409 polymorphism and simple steatosis in Asian subjects, while a strong association was found in the Caucasian population under the allele contrast instead of the other three genetic models. (Table 5).

Table 4 The distribution of alleles and genotypes of PNPLA3 in SS studies and NASH studies
Table 5 Association between PNPLA3 polymorphism and simple steatosis risk
Figure 4
figure 4

Forest plot of simple steatosis susceptibility associated with rs738409 polymorphism at additive model (GG vs CC).

Association between rs738409 and NASH risk

Overall, seven studies with 1466 cases and 2306 controls reported the rs738409 polymorphism and risk of NASH17,18,19,28,32,34,36: four studies conducted in Caucasians and three studies performed in Asian populations. The pooled overall frequency of the risk G allele was 53.5% in the cases and 38.0% in the controls. The G allele varied widely between the different populations: high in the Asian populations (60.9% cases vs.42.3% controls) and lower in the Caucasian subjects (45.9% cases vs. 23.2% controls) (Table 4). Strong evidence of an association was detected between the rs738409 polymorphism and NASH risk under the additive model (OR = 4.44, 95% CI = 3.39–5.82, P < 0.00001; heterogeneity test: I2 = 0%, P = 0.49) (Figure 5). The association was also significant in the other three genetic models and no evidence of heterogeneity was observed between the studies. Evidence of a strong association between the rs738409 polymorphism and NASH susceptibility was also found in both Asian and Caucasian populations with all genetic models. In addition, Caucasian populations with rs738409 polymorphism are more easily develop into NASH than Asian populations. The results are described in Table 6.

Table 6 Association between PNPLA3 Polymorphism and NASH risk
Figure 5
figure 5

Forest plot of NASH susceptibility associated with rs738409 polymorphism at additive model (GG vs CC).

Sensitivity and Publication Bias

Sensitivity analysis was performed under additive model to evaluate the influence of a specific study on the overall estimate. The corresponding pooled ORs with 95% CIs produced similarly before and after omitting each study at a time, indicating that our results were stable and reliable (Table S2). The funnel plots of the studies were symmetric in the current meta-analysis (Figure 6). Furthermore, the results of Egger's test did not support the existence of publication bias (additive model: NAFLD: P = 0.467; SS: P = 0.611; NASH: P = 0.282).

Figure 6
figure 6

Publication bias on rs738409 polymorphism under additive model.

(A) Funnel plot of studies of the rs738409 variant and NAFLD. (B) Funnel plot of studies of the rs738409 variant and simple steatosis. (C) Funnel plot of studies of the rs738409 variant and NASH.

Discussion

The current meta-analysis provided a systematic assessment of the association between the PNPLA3 rs738409 polymorphism and susceptibility to NAFLD, including its subtypes simple steatosis and NASH. Our results suggested that rs738409 polymorphism exerted a significant influence not only on NAFLD risk, but also on histological severity of NAFLD. In addition, a further analysis showed that individuals with the rs738409 polymorphism experienced a significantly increased risk for NASH. However, our meta-analysis did not show a definite association of rs738409 polymorphism with simple steatosis.

Our results are consistent with those from a previous meta-analysis conducted by Sookoian et al.14, which showed a significant association between the rs738409 polymorphism and NAFLD (OR = 3.26, 95% CI = 2.73–3.89, P < 0.00001) and a significant association between the rs738409 polymorphism and NASH (OR = 3.26, 95% CI = 2.14–4.95, P < 0.00001), similar to the results reported in this manuscript. In the present meta-analysis, analysis of the rs738409 polymorphism revealed a significantly increased NAFLD risk in all genetic models. When the data were stratified by subject ethnicity, a significant correlation was found in all three populations, suggesting that the susceptibility genes may be a strong indicator across different races. In the population-based and hospital-based control studies, a significant correlation was also observed in all genetic models, suggesting that our results were not influenced by the source of controls. In addition, the association between the rs738409 polymorphism and NAFLD risk was also significant in both adult and children populations, indicating that the results are highly stable and not influenced by ethnicity, source of the controls and age of participants.

A large population-based study that involved 9229 multiethnic population, including African-Americans, Hispanics and European-Americans, revealed that patients with the rs738409 polymorphism are associated with a higher risk of NAFLD compared with normal controls10. These findings are generally consistent with individual published reports because 70–90% of the trials showed an association between the rs738409 polymorphism and NAFLD risk27,38,41. The underlying mechanism for how PNPLA3 genotype increases NAFLD susceptibility remains to be elucidated. The questions that have been raised are whether the I148M polymorphism increases liver damage favoring the accumulation of fatty acids in lipid droplets or increases the susceptibility to progress into NASH and fibrogenesis.

It should be noted that rs738409 polymorphism was only significantly associated with increased simple steatosis risk under allelic model, but not under the other three genetic models. When stratified by ethnicity, we only detect a significant association in the Asian subgroup under allele contrast, but failed to detect a significant association in the Caucasion population under all genetic models. This meta-analysis of the associations of the rs738409 polymorphism with NASH showed a significant relation. In the subgroup analysis stratified by ethnicity, similar correlations were observed in both Caucasian and Asian populations. The results from the allele contrast were consistent with those from the other genetic models. The sensitivity analysis revealed that no single study qualitatively changed the pooled odds ratios. These findings suggested that rs738409 polymorphism was strongly associated with NASH.

In our meta-analysis, it appears that the rs738409 polymorphism is more likely to increase the NASH risk instead of simple steatosis. Consistent with our results, animal studies have revealed that, although PNPLA3 has triglyceride lipase activity and is responsible for the transalkylation of acylglycerol, knockout of PNPLA3 has no effect on liver steatosis or insulin resistance46. Further epidemiological studies have also noted that this G allele variation did not affect the main risk factors for steatosis, including insulin resistance, LDL, HDL, total cholesterol and glucose levels29. Other polymorphisms, such as CD14 rs2569190 and GCLC rs4140528, are also regarded to increase the risk of NASH instead of simple steatosis47. There are some possible reasons to explain this phenomenon. First, the effect of the rs738409 G allele may be involved in the differential expression and function of variant PNPLA3 instead of resulting in a loss of function of the wild-type protein. Second, there may be some gene-gene interactions. It is possible that the difference in phenotypes may be caused by some other genetic variant that is strongly linked to rs738409. Third, although NASH and simple steatosis are currently regarded as two histological subtypes along the unique spectrum of NAFLD, evidence suggests that these two conditions may be not only different from the histological syndrome but also varied from pathophysiological standpoints. The results that the association between NASH risk and the rs738409 genotype is independent of simple steatosis might suggest that simple steatosis may not be the essential condition for the progressive damage. Simple steatosis and NASH are likely to be two independent conditions in the NAFLD spectrum.

Despite the inevitable limitations of this meta-analysis, we believe that our research provides useful information. First, the individual sample size of each study included in our meta-analysis was too small to obtain a definite association between rs738409 polymorphisms and NAFLD risk, but the pooled odds ratios generated from the 23 studies significantly increased the statistical power of the analysis compared to that obtained with a single study. Moreover, the protocol of this meta-analysis has been well-designed with explicit criteria and methods for study selection, data extraction and data analysis, which allowed reliable inferences about causality. Third, there was no significant publication bias in this meta-analysis and the results of the sensitivity analysis support the stability of the results.

However, some limitations of this meta-analysis should be addressed. First, the retrieved literature may not be sufficiently comprehensive. Only published case-control studies were included in this meta-analysis. Second, most of the study subjects were of Caucasian and Asian ancestry and the Hispanic subgroup was very limited in this meta-analysis. Thus, potential selective bias and publication bias may have occurred. Third, because NAFLD was a multifactor disease, the potential effects of gene-gene and gene-environment interactions should be considered. Fourth, the sample size of NASH in this meta-analysis was so small that the statistical power for making a definitive conclusion regarding the possible risk of the rs738409 polymorphism was limited.

In conclusion, results from this meta-analysis showed that the G allele at PNPLA3 gene was a risk factor for NAFLD and its subtype NASH, especially in Asian, Caucasian and Hispanic populations. However, no association was observed between the rs738409 polymorphism and simple steatosis risk. Further studies with higher quality, more participants and various ethnicities are needed to obtain a more precise estimate of the genetic effects.