Introduction

In 2018, ~25% of the world’s population was estimated to have non-alcoholic fatty liver disease (NAFLD); the prevalence of NAFLD in Asia ranges from 15% to 40% and is accompanied by a serious public and economic burden [1]. Hepatic steatosis is generally graded as minimal (<5%), mild (5%–33%), moderate (33%–66%), and severe (≥66%) steatosis [2], with the mild grade accounting for the highest proportion of NAFLD patients in population-based studies [3, 4].

Because of the high prevalence and asymptomatic presentation of NAFLD, the early diagnosis of NAFLD is clinically important. First, as it is not always benign, simple steatosis can even evolve to non-alcoholic steatohepatitis (NASH) or progress to fibrosis, cirrhosis, and hepatocellular carcinoma [5, 6]. Second, the severity of NAFLD increases the risk of impaired fasting glucose, diabetes, microvascular diabetic complications, and cardiovascular diseases [7, 8]. Third, even a low grade of NAFLD can affect adherence to interventional studies among NAFLD patients [9]. Moreover, this condition is often overlooked in healthy controls selected for clinical trials because of its asymptomatic presentation leading to misclassifications and decreased study validity [10]. Hence, much attention should be paid to the early detection of NAFLD.

Liver biopsy, the gold standard for identifying NAFLD, is invasive; thus, the uses of noninvasive alternatives, such as imaging studies, are necessary for assessing NAFLD. Ultrasonography, the most commonly used noninvasive method, is accurate and reliable for detecting moderate-to-severe fatty liver, but it is known to underestimate NAFLD with intrahepatic fat <30% [11]. Computed tomography (CT) is associated with radiation exposure, and it is also not accurate in assessing mild NAFLD [12]. Magnetic resonance spectroscopy (MRS) is highly accurate for even minimal amounts of steatosis, but its widespread application is hampered by time-consuming acquisitions and complex postprocessing procedures [13, 14].

Some biomarkers also hold promise in diagnosing NAFLD, but their roles in diagnosing mild NAFLD defined by MRS have rarely been explored or compared. Fibroblast growth factor 21 (FGF21) is a hepatokine that is often increased in obesity, metabolic syndrome, type 2 diabetes, and NAFLD [15, 16]. Serum FGF21 was positively correlated with hepatic fat content in subjects with mild/moderate NAFLD [17]. Moreover, it has been reported that the neutrophil serine proteases involved in the induction of inflammation and apoptosis play crucial roles in the pathogenesis of obesity-induced NAFLD [18,19,20,21,22]. While several studies have evaluated serum FGF21, apoptosis markers, such as cytokeratin 18 M65ED (CK18 M65ED) and neutrophil serine proteases (proteinase 3 [PR3], neutrophil elastase [NE]), and their inhibitor (alpha-1 antitrypsin [A1AT]) in relation to the development of NAFLD, no studies have focused on the role of these biomarkers in diagnosing mild NAFLD.

This study aimed to investigate the relationship between novel serum hepatic biomarkers and liver steatosis grades as measured by MRS and assess the diagnostic performance of NAFLD, including novel hepatic markers, in combination with ultrasonography for NAFLD.

Materials and methods

Study participants

The data of the 531 subjects who underwent ultrasonography and MRS from the local community in Shanghai between October 2017 and October 2018 were reviewed retrospectively to identify Chinese subjects aged 18–70 years with an intrahepatic triglyceride content (IHTC) <33%. In this project, overweight or obese subjects, as defined by body mass index (BMI) ≥24 kg/m2 or waist circumference (WC) ≥85 cm in men and ≥80 cm in women, were chosen as the high-risk subgroup from the local community. Subjects did not have any known chronic or acute liver diseases, such as autoimmune disease, celiac disease, Wilson’s disease, A1AT deficiency, hepatic malignancies, hepatobiliary infections, or biliary tract disease. All the subjects were asked to undergo the ultrasonography examination, and then were invited to quantify their IHTC with an MRS examination. Of 470 eligible subjects, 48 were excluded (12 because they had excessive alcohol consumption [men > 140 g/week, women > 70 g/week], 16 with hepatitis infection, 17 with known hyperthyroidism or hypothyroidism, and 3 without serum samples). The final study population consisted of 422 subjects. Of these, 336 individuals had an IHTC more than 5.56%, and 86 individuals had an IHTC less than 5.56%. All of the participants provided informed consent, and the local ethics committee approved the protocol of the study.

Anthropometric and biochemical measurements

Blood pressure, body weight, height, WC, and biomedical indices were measured according to our previous standardized protocols [16]. BMI (weight [kg]/height2 [m2]) was also calculated. Blood samples were collected from participants after an overnight fast of at least 10 h and were used to measure serum fasting plasma glucose (FPG), fasting insulin (FINS), fasting C peptide (FCP), alanine transaminase (ALT), aspartate aminotransferase (AST), gamma-glutamyl transferase (GGT), triglyceride (TG), total cholesterol (TC), high-density cholesterol (HDL-C), and low-density cholesterol (LDL-C). Two-hour plasma glucose was measured following a 75-g oral glucose tolerance test. The homeostasis model assessment of insulin resistance (HOMA-IR) was calculated as follows: HOMA-IR = FPG (mmol/L) × FINS (mU/L)/22.5.

Measurement of FGF21, CK18 M65ED, PR3, NE, and A1AT

Concentrations of FGF21, PR3, NE, and A1AT in serum were quantified using an enzyme-linked immunosorbent assay (ELISA) kit (Antibody and Immunoassay Services, the University of Hong Kong). Serum CK18 M65ED concentration was quantified with the M65 EpiDeath ELISA kit (Peviva AB, Bromma, Sweden). The intra-assay variations for the measurement of FGF21, PR3, NE, A1AT, and CK18 M65ED were 1.89%, 4.25%, 8.61%, 5.97%, and 0.77%, respectively, and for interassay variations, these values were 4.08%, 2.45%, 4.86%, 2.52%, and 8.23%, respectively.

Ultrasonography and MRS examination

After an overnight fast of at least 10 h, all subjects underwent ultrasonography examination in the morning by two experienced sonographers who were blinded to the clinical data. The sonographers performed abdominal ultrasonographic examinations in all subjects using a Voluson 730 Expert B-mode ultrasound machine (GE Healthcare, 5.0-MHz transducer, Waukesha, WI, USA). In accordance with the standards we used in our previous study, we had followed the guidelines for the assessment and management of NAFLD in the Asia-Pacific region [16]. Diffuse fatty liver can be defined by the presence of at least two of three abnormal findings on abdominal ultrasonography (diffusely increased echogenicity (‘bright’) in the liver with liver echogenicity greater than kidney or spleen, vascular blurring, and deep attenuation of ultrasound signal). Any disagreement between the two sonographers was resolved by consensus.

Subjects underwent liver MRS by the segmented breath-hold method using the 3.0-T Philips Ingenia medical system (Philips Healthcare, the Netherlands) and the 16-channel body coil. Single breath-hold single-voxel MRS data were acquired in mid-expiration using point-resolved single-voxel spectroscopy pulse sequence without water suppression with the following parameters: repetition time, 2000 ms; echo time, 50 ms; flip angle, 90; 2 dynamic scans, dynamic scan time, 12 s; and voxel size, 20 mm × 20 mm × 20 mm. Sagittal, coronal, and axial slices were acquired, and regions of interest in the right posterior lobe were selected by an experienced radiologist, avoiding visible blood vessels and bile ducts. All the raw MR spectrum images were processed by two experienced radiologists independently who were blinded to the clinical histories using the standard Philips curve-fitting package. Automatic processing included phase correction based on the water and 200-ms exponential filter. Manual processing included optimization of the phase and a linear baseline correction, if required. The average IHTC derived from the two analyzers was expressed as a percentage by dividing the integral of the methylene groups in the fatty acid chains of the hepatic TG by the sum of the methylene groups and water. Any disagreement between the two analyzers was resolved by consensus.

Diagnostic criteria for NAFLD, mild NAFLD, and metabolic syndrome

The diagnostic criteria for NAFLD and metabolic syndrome were in accordance with the standards we followed in our previous study [16]. All subjects were invited to confirm the diagnosis of NAFLD via an accurate quantification by MRS (IHTC ≥ 5.56%) [23]. Mild NAFLD was defined as NAFLD with IHTC less than 33% [2].

Statistical analysis

All analyses were performed with SPSS version 25.0 (Chicago, IL, USA). Normally distributed data were expressed as the mean ± SD. Data that were not normally distributed were log-transformed before analysis and were expressed as medians with interquartile ranges. Student’s unpaired t test was used for between-group comparisons. Pearson’s chi-square and partial correlation analyses were used to examine the associations between IHTC and various parameters. Multiple stepwise regression analysis was used to examine the association of IHTC with other parameters. Multiple logistic regressions were used to identify independent risk factors for mild NAFLD. The accuracy of the models was evaluated using the area under the curve (AUC) with 95% CI. Two-sided P values < 0.05 were considered significant. The sensitivity, specificity, positive predictive value, and negative predictive value were calculated to assess the diagnostic accuracy of different diagnostic models. Chi-square tests were used to compare the difference in diagnostic accuracy.

Results

Characteristics of subjects and serum levels of biomarkers

The clinical characteristics of the 422 subjects included in this study are described in Table 1. Their average age and BMI were 41.00 years and 27.06 kg/m2, respectively. No significant differences in age were observed between non-NAFLD and mild NAFLD subjects. The male to female ratio and BMI were significantly higher in the mild NAFLD subjects than in the non-NAFLD subjects (both P < 0.05). After adjustment for age, sex, and BMI, subjects with mild NAFLD had a higher WC, FCP, ALT, AST, GGT, TG, TC, LDL-C, and IHTC (all P < 0.01).

Table 1 The general characteristics of the subjects with NAFLD in 422 subjects.

Regarding changes in the six biomarkers, both FGF21 and CK18 M65ED levels were significantly higher in patients with mild NAFLD than in the non-NAFLD subjects after adjustment for age, sex, and BMI (P< 0.001 and P = 0.002, respectively). Other biomarkers, including PR3, NE, A1AT, and NE/A1AT, did not differ significantly between the two groups. Among the subjects diagnosed with non-NAFLD by ultrasonography, those with MRS-defined NAFLD had significantly higher serum levels of FGF21 than non-NAFLD subjects (196.24 [130.54–298.04] pg/mL vs. 143.12 [97.05–219.78] pg/mL, P < 0.01), with and without adjustment for BMI.

Relationship between biomarkers and IHTC

As IHTC is a quantitative index for the severity of NAFLD, we investigated the relationship between IHTC and the six biomarkers to identify an appropriate biomarker for mild NAFLD. Correlations between IHTC and biomedical parameters are presented in Table 2. After adjustment for age, sex, and BMI, IHTC was positively correlated with WC, FCP, ALT, AST, GGT, TG, TC, high-density lipoprotein cholesterol (HDL-C), LDL-C, FGF21, and CK18 M65ED (all P < 0.05). Notably, no significant associations were found between IHTC and the other four markers. Multiple stepwise regression analysis involved the above parameters; variables with significant correlations with IHTC included BMI, WC, systolic blood pressure, diastolic blood pressure, FCP, ALT, AST, GGT, TG, HDL-C, FGF21, and CK18 M65ED. FGF21 (standardized β = 0.185, P < 0.001), FCP, ALT, and TG were found to be independently associated with IHTC. However, CK18 M65ED did not demonstrate a significant association.

Table 2 Correlations of IHTC with anthropometric parameters and biochemical indexes in 422 subjects.

FGF21 is an independent risk factor for mild NAFLD diagnosed by MRS

To determine whether FGF21 was an independent risk factor for NAFLD, we performed a multiple logistic regression analysis in which the presence of mild NAFLD was designated as the dependent variable in three different models (Table 3). FGF21, ALT, AST, GGT, TG, TC, HDL-C, LDL-C, sex, age, and BMI were defined as independent variables in Model 1. The analysis identified serum FGF21 levels (per 20 pg/mL increase; odds ratio [OR] 1.082, 95% confidence interval [CI] 1.020–1.147, P = 0.008) as an independent and positive risk factor for NAFLD, along with ALT, TG, and BMI (all P < 0.05). The OR (95% CI) for the risk of mild NAFLD per 20 pg/mL increase of serum FGF21 was 1.082 (1.017–1.151; P = 0.013) when CK18 M65ED, diastolic blood pressure, and metabolic syndrome were included in the analysis, in addition to FGF21, ALT, TG, and BMI, in Model 2. To simplify the model, only FGF21, ALT, TG, and BMI, which remained significant in Models 1 and 2, were included in Model 3. Taking the presence of mild NAFLD as the dependent variable (Model 3), the OR (95% CI) for the risk of mild NAFLD per 20 pg/mL increase of serum FGF21 was 1.062 (1.010–1.118; P = 0.019). The model was named the Mild NAFLD Model, which was calculated as follows:

$${\rm{Mild}}\,{\rm{NAFLD}}\,{\rm{Model}} = x/ \left( {1\,+\,x} \right),$$
$$x\,=\,{\mathrm{e}}^{\left({-6.871+0.003\times{\mathrm{FGF}}21+0.079\times{\mathrm{ALT}}+1.151\times{\mathrm{TG}}+0.163\times{\mathrm{BMI}}} \right)}.$$
Table 3 Multivariate logistic regression for the association between serum FGF21 levels and mild NAFLD in 422 subjects.

Diagnostic effect of FGF21 for mild NAFLD diagnosed by MRS

The receiver-operating characteristic curves shown in Fig. 1 represent the predictive accuracy of the Mild NAFLD Model for mild NAFLD diagnosed by MRS. The AUC was 0.853 (95% CI 0.816–0.886) for the Mild NAFLD Model and 0.637 (95% CI 0.589–0.683) for FGF21 alone. The AUC of the Mild NAFLD Model was larger than that of FGF21 alone (P < 0.0001). The AUC of the Mild NAFLD Model was compared with that of the FGF21 Model, which was used to predict ultrasonography-diagnosed NAFLD and was calculated with FGF21 and BMI in a prospective study [15]. In our cohort, the FGF21 Model indicated NAFLD, with an AUC of 0.737 (95% CI 0.692–0.779). However, the AUC of the FGF21 Model was significantly smaller than that of the Mild NAFLD Model (P = 0.0001). Calculation of the Youden index to find the optimal cut-off value of the Mild NAFLD Model for indicating the occurrence of mild NAFLD produced a value of 0.69, with a sensitivity of 85.12% (95% CI 81.31%–88.92%), specificity of 70.93% (95% CI 61.33%–80.53%), positive predictive value of 91.96% (95% CI 88.94%–94.98%), and negative predictive value of 54.95% (95% CI 45.70%–64.21%) (Table 4).

Fig. 1
figure 1

ROC curves for predicting MRS-diagnosed mild NAFLD and the comparison of AUCs. Mild NAFLD Model (FGF21 + ALT + TG + BMI), AUC = 0.853 (95% CI 0.816–0.886); FGF21 Model (FGF21 + BMI), AUC = 0.737 (95% CI 0.692–0.779); FGF21, AUC = 0.637 (95% CI 0.589–0.683). Comparison of AUCs: Mild NAFLD Model vs. FGF21 Model, P < 0.0001; Mild NAFLD Model vs. FGF21, P < 0.0001; FGF21 Model vs. FGF21, P = 0.0042.

Combined application of ultrasonography and Mild NAFLD Model in diagnosing mild NAFLD

Table 4 summarizes the diagnostic performance of ultrasonography alone, the Mild NAFLD Model alone, and the combination of ultrasonography and the Mild NAFLD Model in diagnosing mild NAFLD using MRS as the reference standard. Ultrasonography or Mild NAFLD Model alone did not show any significant differences regarding the diagnosis of mild NAFLD via the chi-square test. We further evaluated diagnostic performance using a two-step approach in which subjects classified as non-NAFLD by ultrasonography would undergo a secondary assessment by the Mild NAFLD Model using the cutoff value of 0.69. Compared with ultrasonography or the Mild NAFLD Model alone, the two-step approach yielded an overall sensitivity of 97.32% (95% CI 95.60%–99.05%) and a negative predictive value of 85.48% (95% CI 76.72%–94.25%), revealing a significant improvement.

Table 4 Improved performance in diagnosing mild NAFLD by the two-step approach by combining ultrasonography and Mild NAFLD Model.

Discussion

In this study, elevated serum FGF21 was the only biomarker independently correlated with IHTC measured by MRS. FGF21 was the only indicator among six potential NAFLD biomarkers that was significantly associated with mild NAFLD diagnosis, and the Mild NAFLD Model based on FGF21 had an AUC of 0.853. Furthermore, the two-step method using the combination of ultrasonography and the Mild NAFLD Model was more sensitive than each method alone to diagnose NAFLD with MRS as the diagnostic reference standard. Elevated FGF21 was independently associated with IHTC in mild NAFLD and was also increased in NAFLD before ultrasonography was performed. Increased hepatic TG deposition reflects an imbalance of hepatic free fatty acid (FFA) metabolism [24]. FFAs are natural agonists of peroxisome proliferator-activated receptor-α (PPARα), and hepatic Fgf21 expression is upregulated by the transcription factor PPARα [25]. In addition, a study showed that hepatic FGF21 was increased during nutrient overload and was involved in de novo lipogenesis in the liver in response to carbohydrate load [25].

Although CK18 M65ED was elevated in mild NAFLD in our study, it was not independently associated with IHTC. CK18 M65ED, a CK18 fragment, can detect the common epitope in both uncleaved and caspase-cleaved CK18, reflects hepatocellular apoptosis and is potentially useful for diagnosing NAFLD [26, 27]. Unlike FGF21, the elevated CK18 M65ED in our study might reflect the increasing level of apoptosis-induced total cell death, which might be the result of cellular stress and hepatic injury subsequently induced by the hepatic accumulation of lipids [28]. This may explain why CK18 M65ED increased but was not correlated with IHTC in mild NAFLD in our population.

An animal study revealed that the activation of PR3 can contribute to NAFLD and insulin resistance, and PR3-deficient mice showed reduced intrahepatic lipids and were resistant to high-fat-diet-induced weight gain [18]. NE can cause cellular insulin resistance, and the deletion of NE in mice can ameliorate tissue inflammation [19]. A1AT is an endogenous inhibitor of NE, and the imbalance between NE and A1AT is associated with obesity-related metabolic inflammation [20]. Zang et al. found an elevation in serum NE and NE/A1AT and a reduction of A1AT in NASH subjects [21]. However, in our current study, PR3, NE, A1AT, and NE/A1AT were not significantly altered in patients with mild NAFLD. This might be explained by the ‘multiple hit hypothesis’ [29] where, in mild NAFLD, the first hit of rising hepatic lipid concentrations had rendered hepatocytes sensitive to an additional hit featuring hepatocyte cell death, but it had not yet driven drastic changes in hepatic inflammation.

In our study, the two-step method, which involved combining serum biomarkers with conventional ultrasonography, resulted in good sensitivity in mild NAFLD. In this study, the two-step method achieved a sensitivity of 97.32% and a negative predictive value of 85.48% for the diagnosis of mild NAFLD. Ultrasonography alone has well-known limitations in detecting IHTC, with sensitivities of 65.1% and 64.9% in the assessment of mild NAFLD patients [30] and severely obese patients [31], respectively. However, ultrasonography has some advantages. It is not influenced by liver iron [30], it is easily accessible and inexpensive, and it is particularly useful for the general screening of subjects compared with other imaging techniques (i.e., CT or MRS) [11]. In addition, ultrasonography can be performed easily in conscious patients, which is an advantage when examining pediatric patients or adults unable to tolerate magnetic resonance imaging tests without sedation [32].

The Mild NAFLD Model, which is the other component of the two-step method and was obtained using MRS as a reference standard, is composed of FGF21, ALT, TG, and BMI, which are all easily obtained indices in clinical practice. Other biomarkers, such as the fatty liver index and the hepatic steatosis index, also have moderate accuracy in diagnosing NAFLD [33,34,35]. However, the key limitation of these two indices is their use of ultrasonography, which is a suboptimal reference standard [35]. Hopefully, by adopting the two-step method in diagnosing mild NAFLD, suggestions on lifestyle interventions regarding diet and physical activity can be provided to these subjects to prevent the deterioration of NAFLD.

This study has several limitations. First, longitudinal data were lacking; thus, we could not address the causal relationship between FGF21 and mild NAFLD. Prospective studies in larger populations are needed to confirm our diagnostic method. Second, a liver biopsy was not used, and information concerning coexisting steatohepatitis or fibrosis was lacking. Third, the diagnostic accuracy of NAFLD via MRS could have been affected by hepatic iron content [36]. However, it was not possible to exclude the influence of hepatic iron because of insufficient data. In addition, one major limitation of the study is that the diagnostic performance of the Mild NAFLD Model had not been validated in another external validation population. In addition, our study population may not represent the general Chinese community population. Therefore, the conclusion needs further validation in a general population. Last, the MRS diagnosis was based on data collected from a small portion of the liver (within a voxel = 2 cm × 2 cm × 2 cm), and this might have led to a sampling error [35].

In conclusion, the present study demonstrated that elevated serum FGF21 was independently associated with IHTC as measured by MRS and was indicative of mild NAFLD compared with the other five potential NAFLD biomarkers. A two-step approach that combined ultrasonography and the Mild NAFLD Model derived from serum FGF21 further improved the diagnostic sensitivity for mild NAFLD. We believe this diagnostic model can be applied in the early diagnosis of NAFLD in clinical practice.