Introduction

As prevalence of nonalcoholic fatty liver disease (NAFLD) is increasing, it has become a major cause of chronic liver disease1,2. Nonalcoholic steatohepatitis (NASH) is considered a progressive disease of NAFLD, which is distinguished from non-alcoholic fatty liver (NAFL)3. NASH is an important factor for disease progression of NAFLD and the diagnosis of NASH requires histologic confirmation with steatosis, inflammation, and hepatocyte ballooning appearance4,5. Therefore, a liver biopsy is essential to confirm NASH, but has several limitations such as high cost, complication risks, and sampling errors6. Developing noninvasive tools for evaluating NAFLD severity is an emerging interest of hepatologists, especially for the discrimination of NASH or advanced fibrosis among NAFLD7. Several serologic and imaging biomarkers have been developed and validated for detecting advanced fibrosis such as the Fibrosis 4 index, NAFLD fibrosis score, transient elastography (TE), and magnetic resonance elastography (MRE)8. For noninvasive diagnosis of NASH, several serologic markers were investigated7, but the diagnostic accuracy of those markers was limited. Among them, cytokeratin 18 (CK18) levels are elevated during the apoptosis or necroptosis of hepatocytes in patients with NASH, and CK18 is a well-validated single marker for the diagnosis of NASH9. CK18 exhibited moderate diagnostic accuracy for NASH in a meta-analysis, but the cut-off for NASH diagnosis is variable, and its availability is limited10. Therefore, there is an urgent need to develop more accurate biomarkers for the diagnosis of NASH.

MicroRNA (miRNA) is a small-sized non-coding RNA comprising 20–25 nucleotides, and it binds to the target mRNA, resulting in translation inhibition or mRNA degradation11,12. miRNAs are synthesized in a variety of cells and participate in cell signaling13; they have important roles in regulating cell growth, proliferation, and metabolism14. In addition to their physiological role, circulating miRNAs have been studied as a candidate for diagnosis of a variety of diseases such as malignancy and cardiovascular, neurologic, and metabolic diseases, including diabetes and NAFLD15,16. In patients and animal models with NAFLD, circulating miRNAs exhibit significant differences compared with healthy controls17,18. Circulating miRNA-34a, miRNA-122, and miRNA192 consistently increase in patients with NASH compared to patients with simple steatosis19. Diagnostic accuracy to distinguish NASH from NAFL was evaluated using miR-34a, but it showed moderate accuracy [area under the receiver operating characteristic curve (AUC) = 0.78)].

In the present study, we aimed to develop a biomarker to diagnose NASH in NAFLD by analyzing the circulating miRNA expression profile in sera from patients with biopsy-confirmed NAFLD using emerging next generation sequencing (NGS).

Results

Baseline characteristics

For miRNA sequencing, sera from 24 biopsy-proven patients with NAFLD were collected between February 2019 and July 2019, including 12 patients with NAFL and 12 patients with NASH. The baseline characteristics of patients are summarized in Table 1. The NASH group was more women dominant (75%) and showed lower body mass index (BMI), hemoglobin, total bilirubin, albumin, and creatinine compared with the NAFL group. Steatosis, lobular inflammation, and fibrosis did not show significant differences in severity between the NAFL and NASH groups. Ballooning is essential for the diagnosis of NASH; there was a significant difference in the presence of ballooning between the NAFL and NASH groups. The NASH group showed significantly higher NAFLD activity score (NAS) than that observed for the NAFL group. Representative histopathological findings are presented in Supplementary Fig. 1. In the validation cohort, 37 patients with biopsy-confirmed NAFLD were enrolled, including 11 and 26 patients with NAFL and NASH, respectively (Supplementary Table 1). Clinical and laboratory characteristics were similar in both groups, except for aspartate aminotransferase (AST). We observed no significant differences for steatosis, lobular inflammation, and fibrosis between the NAFL and NASH groups in the validation cohort; however, hepatocyte ballooning and NAS were found to be significantly higher in the NASH group compared with those in the NAFL group in the validation cohort.

Table 1 Baseline characteristics.

miRNA expression profiles

In miRNA analysis, a total of 2588 miRNAs were analyzed, and each sample expressed 332 to 618 miRNAs (median 469 miRNAs). Approximately 2153 miRNAs were excluded from analysis because these miRNAs were expressed in < 50% samples; therefore, 435 miRNAs were compared between NAFL and NASH groups (Supplementary Fig. 2). A total of 38 miRNAs showed a significant difference in expression between the NAFL group and NASH group as shown in the heatmap (Fig. 1A). Twenty-six miRNAs significantly increased more than two times in the NASH group as compared to the NAFL group (P < 0.05) (Fig. 1B). In contrast, 12 miRNAs significantly decreased in the NASH group as compared to the NAFL group (P < 0.05).

Figure 1
figure 1

Heatmap (A) and volcano plot (B) of miRNA expression profile in sera of patients with NAFL and NASH. In heatmap, column annotation bar indicates two groups of disease (NAFL and NASH), and two row annotation bars indicate P-value and fold change between two groups. In row annotation bars, black and red color indicate high significance and higher expressed in NASH. Green dots indicate miRNA with decreased expression [log2(fold change) − 1] in NASH group (P < 0.05). Red dots indicate miRNA with increased expression [log2(fold change) > 1] in NASH group (P < 0.05). The dashed blue line represents where P < 0.05 and the threshold for high statistical significance.

Diagnostic accuracy of miRNAs for NASH

Eight miRNAs (miR-15b-3p, miR-21-5p, miR-29b-3p, miR-126-5p, miR-151a-3p, miR-183-5p, miR-192-5p, and miR-4449) were selected to evaluate the diagnostic accuracy for NASH. These eight miRNAs increased more than 2 times in the NASH group compared with the NAFL group and showed upper 25% normalized expression value in the NASH group among 435 miRNAs that were expressed in > 50% of 24 samples (Supplementary Fig. 2). Because 12 miRNAs that significantly increased in the NAFL group compared with the NASH group showed low detection as normalized expression values, these 12 miRNAs were excluded from further analysis. The ROC curve and comparison of normalized expression value for miRNAs between NAFL and NASH groups are shown (Supplementary Fig. 3). The AUC, sensitivity, and specificity of eight miRNAs are summarized in Table 2. Only four miRNAs (miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449) showed meaningful AUC values for NASH diagnosis (P < 0.05). When AUC values for diagnosis of NASH were compared between a combination of eight miRNAs (miR-15b-3p, miR-21-5p, miR-29b-3p, miR-126-5p, miR-151a-3p, miR-183-5p, miR-192-5p, and miR-4449) (AUC, 0.924; 95% CI 0.739–0.992) and a combination of four miRNAs (miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449) (AUC, 0.875; 95% CI 0.676–0.973), there was no significant difference (P = 0.263) (Fig. 2).

Table 2 Diagnostic accuracy of miRNA for NASH from NGS analysis.
Figure 2
figure 2

Diagnostic accuracy of miRNA combination. Receiver operating characteristic (ROC) curves for diagnostic accuracy for NASH using miRNA expression level from miRNA sequencing. The red line represents the ROC curve for a combination of eight miRNAs (miR-15b-3p, miR-21-5p, miR-29b-3p, miR-126-5p, miR-151a-3p, miR-183-5p, miR-192-5p, and miR-4449). The blue line represents the ROC curve for the combination of four miRNAs (miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449).

We also compared miRNA expression between patients with and without fibrosis to identify the confounding effect of fibrosis in NASH (Supplementary Fig. 4). Although 14 miRNAs showed significant differences in expression, there was no overlap between these 14 miRNAs and 8 miRNAs that were selected to evaluate the diagnostic accuracy for NASH in Supplementary Fig. 2.

External validation by comparing miRNA expression using qRT-PCR between NAFL and NASH

We analyzed the expression of circulating miRNAs between NAFL and NASH in an external validation cohort from another tertiary center. The expression of miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449 was higher in the NASH group than in the NAFL group (Fig. 3). For the combination of miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449, the AUC value for diagnosis of NASH was 0.874 (95% CI 0.724–0.960, P < 0.001) (Fig. 4).

Figure 3
figure 3

Expression level and diagnostic receiver operating characteristic (ROC) curve for NASH diagnosis of four miRNAs (miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449) in validation cohort. Expression level of each miRNA is normalized to small nuclear RNA U6 expression value and represents 2-∆∆Ct. *Indicates P < 0.05 compared with the corresponding control. **Indicates P < 0.01 compared with the corresponding control.

Figure 4
figure 4

Diagnostic accuracy of miRNA combination in validation cohort. Receiver operating characteristic (ROC) curve for diagnostic accuracy for NASH in validation cohort. The blue line represents the ROC curve for the combination of four miRNAs (miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449) with ∆∆Ct value.

miRNA-mRNA correlation and Kyoto Encyclopedia of Genes and Genomes (KEGG)

We compared eight miRNAs (miR-15b-3p, miR-21-5p, miR-29b-3p, miR-126-5p, miR-151a-3p, miR-183-5p, miR-192-5p, and miR-4449) with upregulated expression and 265 genes with downregulated expression in the NASH group studied previously as GSE4845220. We confirmed 26 pairs of miRNA-mRNA interactions that were predicted by mirDIP21 and integrated in a single network constructed by Cytoscape22. In the predicted interactions, five miRNAs (miR-21-5p, miR-29b-3p, miR-126-5p, miR-183-5p, and miR-192-5p) were linked to 17 mRNAs (CADM2, CTH, DNAJC12, GPAM, HSD17B11, IGF1, NAMPT, NIPAL1, PCDH20, PLSCR4, RBMS1, RNF152, SLC16A1, SLC16A10, SLC16A7, SLC19A2, and SOCS2). According to the fold changes between NAFL and NASH, each node is indicated by color keys (Supplementary Fig. 5A).

Then, we confirmed enriched KEGG pathways detected from the eight highly expressed miRNAs in the NASH group. TGF-β and Wnt signaling pathways are depicted, and red nodes represent factors inhibited by eight miRNAs (Supplementary Fig. 5B). A total of 10 pathways were associated with four or more miRNAs (Supplementary Table 2).

Discussion

As the need for noninvasive testing to determine the severity of NAFLD increases, various biomarkers have been investigated to diagnose NASH or advanced fibrosis. However, biomarkers for NASH that represent progressive inflammation of hepatocytes have shown limited accuracy and accessibility7. In this study, we found a combination of several circulating miRNAs using NGS and qRT-PCR, which could be a useful biomarker to diagnose NASH.

miRNA is a small non-coding RNA, comprising ≤ 25 nucleotides. The major role of miRNAs is post-transcriptional inhibition of gene expression by binding to the 3′-untranslated region of target mRNAs23. Circulating miRNAs are derived from specific cells and reflect the presence of disease or disease severity. The expression of miRNAs depends on the presence of NAFLD or the disease severity of NAFLD; therefore, miRNA expression has been suggested as a diagnostic biomarker for NAFLD, NASH, and advanced fibrosis19. Several circulating miRNAs, such as miR-16, miR-21-5p, miR-34a, miR-122, miR-192, and miR-375, have shown higher expression in the sera of patients with NASH than in those with NAFL. Most of these studies evaluated miRNA expression using real-time PCR quantification. More than 2600 sequences of human mature miRNAs are known in miRBase24; therefore, there is a limitation in evaluating the overall expression profile of whole human mature miRNAs. Our study analyzed a total of 2588 mature circulating miRNA expression profile using NGS. We selected four candidate miRNAs as biomarkers for NASH diagnosis and analyzed their diagnostic value using normalized values from NGS data. A combination of four miRNAs, miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449 showed significant diagnostic accuracy with an AUC of 0.875 in the normalized value from NGS data. According to external validation using qRT-PCR, we found that the diagnostic accuracy of a combination of miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449 was sufficient for them to be a biomarker for NASH diagnosis. Among human mature miRNAs, miR-122-5p is the major miRNA that is expressed in the liver, and circulating miR-122-5p increases in patients with NAFLD25. Although miR-122-5p was the second most abundant circulating miRNA in our study after miR-423-5p, the difference in expression level of circulating miR-122-5p between NAFL and NASH groups was not significant.

miR-21-5p increases in the plasma of patients with NASH26 and it is associated with hepatic metabolism, inflammation, and lipid metabolism27,28. miR-21-5p has been identified as a typical onco-miRNA in many previous studies. Our study indicated that miR-21-5p was highly expressed in NASH, and the relationship among miR-21-5p, NASH, and liver cancer could be an interesting topic for further study. Plasma level of miR-151a-3p is positively correlated with that of TNF-α, which is the classical inflammatory parameter and major factor in the progression of NAFLD29. Circulating miR-192-5p is also upregulated in patients with NASH as compared to patients with NAFL17. Exosomes from lipotoxic hepatocytes showed increased miR-192-5p, and exosomal miR-192-5p regulates disease progression of NAFLD by activating proinflammatory macrophages30,31. miR-4449 expression is rarely known in patients with NAFLD. In patients with obesity, circulating exosomal miR-4449 showed increased expression as compared to the healthy control group, and its expression was decreased after bariatric surgery32. The increasing patterns of our miRNAs exhibited consistency with other liver-related pathology states or obesity.

We applied a previous study uploaded in GEO to network analysis. From the GSE48452 dataset, we identified 265 downregulated genes in patients with NASH, and 26 pairs of miRNA–mRNA interactions were selected. Thus, bioinformatic analysis was applied to explore the correlation between miRNA–mRNA expression profiles. The target genes downregulated with upregulated miRNAs in NASH could be revealed by the public expression dataset.

This study has some limitations. First, the expression levels of miRNAs were standardized to U6 as an internal control. Although not all studies used snRNA U6 as internal control, many other studies used snRNA U6 as internal control33,34,35. Further validation studies are required to evaluate the possible clinical application of miRNAs as diagnostic biomarkers using other standardized control. Second, sequencing was conducted using a small number of patients. To overcome this limitation, we conducted external validation with patients from other centers, and we found similar diagnostic accuracy for NASH in an external validation cohort. However, further validation studies with larger populations and varying degrees of fibrosis are required. Third, we could only provide relative expression level of miRNA for the diagnosis of NASH in patients with NAFLD not absolute expression level. Further studies are required for clinical application.

In conclusion, NASH represents significant distinct miRNA expression profiles compared with NAFL. A combination of serum circulating miRNAs including miR-21-5p, miR-151a-3p, miR-192-5p, and miR-4449 could be used as a novel biomarker for the diagnosis of NASH in NAFLD.

Patients and methods

Patients and sera collection

For small RNA sequencing, 24 patients with biopsy-confirmed NAFLD, comprising 12 NAFL patients and 12 NASH patients, were enrolled from our previous study at Korea University Guro Hospital36. Another 37 patients with biopsy-confirmed NAFLD from Anam Hospital were enrolled for comparison of miRNA expression between NAFL and NASH in external validation. Patients were excluded if they had consumed excessive alcohol and had viral hepatitis, autoimmune hepatitis, primary biliary cholangitis, decompensated cirrhosis, or other severe systemic diseases.

Laboratory tests were performed before liver biopsy, and sera were collected during blood sampling for laboratory tests. Sera were stored at -80℃ and thawed just before RNA extraction. This study had been approved by the institutional review board from Korea University Guro Hospital (2016GR0302) and Anam Hospital (2018AN0129). All patients agreed to the sera collection and submitted written informed consent. All investigators conducted this study in accordance with the Declaration of Helsinki. Each author reviewed and approved the final manuscript.

Liver biopsy and histopathological evaluation

NAFLD was diagnosed in all patients by percutaneous liver biopsy via intercostal spaces. Using an 18-gauge Tru-cut needle (TSK laboratory, Tochigi, Japan), two pieces of minimum 2-cm length were obtained from the specimens. Tissues were fixed in formalin, and paraffin tissue blocks were made. Tissue slides were made with 4-μm sections and performed hematoxylin & eosin and Masson’s trichrome staining. Histological findings were analyzed based on NAS developed by NASH clinical research network37. NAFL was defined when > 5% of hepatic steatosis was presented without hepatocyte ballooning, whereas NASH was defined when > 5% of hepatic steatosis and inflammation with hepatocyte ballooning were presented4.

Small RNA sequencing and analysis

Total circulating RNA from sera was seperated using the miRNeasy Serum/Plasma Kit (Qiagen, Valencia, CA) under the manufacture’s guidelines. The concentration of extracted RNA was determined using Quant-IT RiboGreen (Invitrogen, Carlsbad, CA, USA), and RNA size was measured using an Agilent RNA 6000 Pico Kit and Small RNA Kit on an Agilent 2100 Bioanalyzer (Agilent Technologies, Böblingen, Germany). For the construction of sequencing libraries, 10 ng of RNA was used with the SMARTersmRNA-Seq Kit for Illumina according to the manufacturer’s instructions. The purification of amplified libraries was performed using 6% Novex tris–borate-EDTA polyacrylamide gel electrophoresis gels (Thermo Fisher, MA, USA) from 171 to 253 bp (18 to 100 bp cDNA plus 153 bp of adaptors) fraction. After quantification with quantitative PCR (qPCR) of guidance with the Quantification Protocol Guide (KAPA Library Quantification kits for Illumina Sequencing platforms) and qualification with the TapeStation D1000 ScreenTape (Agilent Technologies, Waldbronn, Germany), the libraries sequencing was done by an Illumina HiSeq 2500 (Illumina, San Diego, CA, USA). To eliminate adapter sequences, the raw reads of small RNAs were processed using the Cutadapt program (NBIS, Uppsala, Sweden).

Quantitative reverse transcription PCR

cDNA synthesis was done using reverse transcriptase with miRNA-specific stem-loop primers (Applied Biosystems). qRT-PCR was performed on a QuantStudio 6 Flex Real-time PCR system (Applied Biosystems) using Taqman master mixture. The relative abundance of miRNA was normalized to that of small nuclear RNA U6. The relative amount of each miRNA was measured using the 2−∆∆Ct method. The primers are summarized (Supplementary Table 3).

mRNA data collection and analysis

Data mining from Gene Expression Omnibus database (GEO; https://www.ncbi.nlm.nisteatohepatitish.gov/geo/) was performed to confirm a spectrum of differentially expressed mRNA profiles of NASH. Gene microarray expression profiles between the NAFL group and NASH group were collected from GEO using the keyword “steatosis and steatohepatitis.” GSE48452 was suitable for our analysis and downloaded to select genes differently expressed in the NASH group. Differently expressed genes were retrieved by t-test using R software and filtered when log2(fold change) > 1 or < − 1, and P < 0.05 between the NAFL group and NASH group. An miRNA–mRNA network was constructed using mirDIP21 and visualized using Cytoscape version 3.8.022.

Statistics

The demographics and laboratory results are represented as percentage and median with first quartile to third quartile. Pearson’s chi-squared test for categorical variables and the Man–Whitney U-test for continuous variables were used to compare baseline characteristics. The cut-off and AUC values for the diagnosis of NASH were determined by receiver operating characteristic (ROC) curve analysis and based on histologic diagnosis. Logistic regression analysis was performed to combine miRNA value. For statistical analysis, SPSS (version 25; IBM Corporation, Armonk, NY, USA) and MedCalc (version 17.6; MedCalc Software, Ostend, Belgium) were used.