Development and validation of a neural network for NAFLD diagnosis

Sorino, Paolo; Campanella, Angelo; Bonfiglio, Caterina; Mirizzi, Antonella; Franco, Isabella; Bianco, Antonella; Caruso, Maria Gabriella; Misciagna, Giovanni; Aballay, Laura R.; Buongiorno, Claudia; Liuzzi, Rosalba; Cisternino, Anna Maria; Notarnicola, Maria; Chiloiro, Marisa; Fallucchi, Francesca; Pascoschi, Giovanni; Osella, Alberto Rubén

doi:10.1038/s41598-021-99400-y

Download PDF

Article
Open access
Published: 12 October 2021

Development and validation of a neural network for NAFLD diagnosis

Paolo Sorino¹,
Angelo Campanella¹,
Caterina Bonfiglio¹,
Antonella Mirizzi¹,
Isabella Franco¹,
Antonella Bianco¹,
Maria Gabriella Caruso²,
Giovanni Misciagna³,
Laura R. Aballay⁴,
Claudia Buongiorno¹,
Rosalba Liuzzi¹,
Anna Maria Cisternino⁵,
Maria Notarnicola²,
Marisa Chiloiro⁶,
Francesca Fallucchi⁷,
Giovanni Pascoschi⁸ &
…
Alberto Rubén Osella¹

Scientific Reports volume 11, Article number: 20240 (2021) Cite this article

3322 Accesses
14 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Non-Alcoholic Fatty Liver Disease (NAFLD) affects about 20–30% of the adult population in developed countries and is an increasingly important cause of hepatocellular carcinoma. Liver ultrasound (US) is widely used as a noninvasive method to diagnose NAFLD. However, the intensive use of US is not cost-effective and increases the burden on the healthcare system. Electronic medical records facilitate large-scale epidemiological studies and, existing NAFLD scores often require clinical and anthropometric parameters that may not be captured in those databases. Our goal was to develop and validate a simple Neural Network (NN)-based web app that could be used to predict NAFLD particularly its absence. The study included 2970 subjects; training and testing of the neural network using a train–test-split approach was done on 2869 of them. From another population consisting of 2301 subjects, a further 100 subjects were randomly extracted to test the web app. A search was made to find the best parameters for the NN and then this NN was exported for incorporation into a local web app. The percentage of accuracy, area under the ROC curve, confusion matrix, Positive (PPV) and Negative Predicted Value (NPV) values, precision, recall and f1-score were verified. After that, Explainability (XAI) was analyzed to understand the diagnostic reasoning of the NN. Finally, in the local web app, the specificity and sensitivity values were checked. The NN achieved a percentage of accuracy during testing of 77.0%, with an area under the ROC curve value of 0.82. Thus, in the web app the NN evidenced to achieve good results, with a specificity of 1.00 and sensitivity of 0.73. The described approach can be used to support NAFLD diagnosis, reducing healthcare costs. The NN-based web app is easy to apply and the required parameters are easily found in healthcare databases.

NASHmap: clinical utility of a machine learning model to identify patients at risk of NASH in real-world settings

Article Open access 05 April 2023

Artificial intelligence outperforms standard blood-based scores in identifying liver fibrosis patients in primary care

Article Open access 21 February 2022

Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults

Article Open access 03 March 2023

Introduction

Non-alcoholic liver steatosis (NAFLD) is the leading cause of chronic liver disease in Western countries. This condition increases the risk of cardiovascular disease, type 2 diabetes mellitus and chronic kidney disease and leads to increased mortality^1,2. The condition is estimated to affect about 20–30% of the adult population in developed countries³. NAFLD is defined as an accumulation of Triglycerides in the hepatocytes (> 5% of liver volume) of patient with low alcohol intake (< 20 g/day in women or < 30 g/day in men), diagnosed once causes due to viral infections or other specific liver diseases have been excluded⁴. NAFLD is becoming more common among adults between 40 and 60 years of age, but the disease is also seen children⁵. A meta-analysis published in 2016 reported that this disease has an average prevalence of 23.71% in Europe⁶. Population-based studies conducted in our geographical area (district of Bari, Apulia Region, Italy), estimated a prevalence of NAFLD of around 30%, males and the elderly are most commonly affected⁷.

NAFLD is strongly associated with the metabolic syndrome and is considered the hepatic manifestation of the metabolic syndrome⁸. It can manifest as pure fatty liver disease (hepato-steatosis) or as non-alcoholic steatohepatitis (NASH), an evolution of the former in which steatosis is associated with inflammation and hepatocellular damage, and with fibrogenic activation that can lead to cirrhosis and the onset of hepatocarcinoma⁹. In general it has been established that early diagnosis of cirrhosis and elimination of the cause can stop further liver damage, increase the chances of transplant success and also reduce mortality rates¹⁰. According to recent EASL—EASD—EASO guidelines¹¹, at the individual level the gold standard for identifying steatosis in individual patients is magnetic resonance imaging (MRI), although ultrasound scanning (US) is considered a good alternative being more widely available and cheaper than MRI. In addition, for large-scale screening studies, serum biomarkers and steatosis score indices have been preferred because their easy availability and low cost has a substantial impact on the feasibility of screening. One of the best validated indexes is the Fatty Liver Index (FLI)¹², although other anthropometric indices or measurements work together with FLI in predicting NAFLD risk¹³.

In recent years, due to the increasing prevalence of NAFLD, there has been a research trend towards identifying low cost, diagnostic methods, and Machine Learning has been acknowledged as a valuable tool. Machine Learning (ML) is a branch of artificial intelligence aimed to enable machines to operate using intelligent "learning" algorithms¹⁴. Using the data sets supplied, the machine is able to process them through algorithms that allow it to develop its own logic in order to perform the required function or task. Machine Learning has already been used as a support tool for the diagnosis of different diseases, and for risk quantification, such as cardiovascular risk in patients with diabetes mellitus^15,16, ischemic heart disease¹⁷ and tumors¹⁸.

Nowadays, NAFLD diagnosis is made by performing Ultrasound¹⁹ and MRI with lipid content quantification²⁰. Besides some biochemical and/or anthropometric parameters alone or in combination are used to perform the diagnosis^21,22. This implies to refer patients to more specialized health center with the consequent healthcare system burden²³. Many studies have used ML for the diagnosis of NAFLD but they were predominantly focused to identify particular aspects of NAFLD such as quantification of lipid content, staging, fibrosis, etc^24,25,26,27. and no longer simply ascertain the absence of disease, for example, in a large cohort of subjects avoiding in that way the use of non-invasive diagnostics for screening and monitoring NAFLD.

As imaging technologies such as ultrasound, magnetic resonance imaging (MRI), transient elastography (TE), and computed tomography (CT) are expensive and time consuming, they are generally impractical for most serial assessments²⁸ or when large-scale population studies are considered. In addition to high cost, other limitations of imaging-based diagnosis of liver damage such as operator dependence, lower sensitivity and range, radiation exposure and limited availability need to be considered²⁹. Moreover, ML-based models have also been used to classify liver diseases into distinct categories with ~ 80% accuracy^30,31, highlighting that biomarker-based diagnostic methods meet the requirements for diagnosis³².

Then, our purpose was to develop a simple web app which permits to perform the diagnosis of absence of NALFD with high accuracy to reduce waiting list and costs for the National Health System, as. most studies on NAFLD diagnosis are based on images or laboratory parameters that are not always available^26,33.

The aim of our study was to develop and validate a simple Neural Network (NN), using easily available laboratory parameters which had been identified in our previous study³⁴, in order to build a web app incorporating the NN, trained to apply them to identify subjects at greater risk of NAFLD to be scheduled for ultrasound assessment. We also checked the performance of the trained NN by analyzing Explainability (XAI)³⁵; to evaluate its reliability and ease of use and validate the results on a randomly selected sample subset extracted from a population-based study.

In the first part of this paper the population under study the variables and formula on which the AVI parameter is built have been described, then. Next, a first analysis with the t-SNE³⁶ technique was performed and then we switched to an approach based on NN to search for optimal parameters to build the NN with the parameters identified. Subsequently, the NN performance and XAI are evaluated. Finally, we illustrate the development of a simple local web app tested on a population sample.

Methods

Population

The subjects included in the were drawn from two different cohort studies conducted at the laboratory of Epidemiology and Biostatistics of the National Institute of Gastroenterology, Research Hospital "Saverio de Bellis" (Castellana Grotte, Bari, Italy). Subjects participating in the MICOL study and NUTRIHEP study were eligible. Details on the MICOL and NUTRIHEP study populations have been published elsewhere^7,13,37. The MICOL study is an ongoing randomized study of subjects drawn from the electoral list of Castellana Grotte (aged ≥ 30 years) in 1985 and followed up in 1992, 2005–2006 and 2013–2016. The study included a total of 2970 out of 3000 selected subjects; 56.5% were male. By 1985, 2472 subjects had been enrolled. In 2005–2006, 1697 of the original cohort were still present. In 2005–2006 this cohort was added with a randomized sample of 1273 subjects (PANEL study) aged between 30 and 50 years, to compensate for the cohort aging^38,39. All subjects gave prior informed written consent to participate.

All procedures were performed in accordance with the ethical standards of the institutional research committee (IRCCS Saverio de Bellis approval for research and the ethics committee for the MICOL study (DDG-CE-347/1984; DDG-CE-453/1991; DDG-CE-589/2004; DDG-CE 782/2013) and, with the Helsinki Declaration of 1964. The NUTRIHEP study was conducted at the National Institute of Gastroenterology Saverio de Bellis (Castellana Grotte, Bari, Italy) in collaboration with 12 General Practitioners (GPs) operating in Putignano (Bari, Italy). The study period was from July 2005 to January 2007. By means of systematic random sampling of 1 of every 5 procedures, a sample from the general population aged ≥ 18 years had been obtained from the General Practitioners lists. Instead, we used records from a census design, because no significant difference was found between the age-sex distribution of the general population from Putignano and the subjects listed in the general practitioners' registers. Therefore, 2550 subjects were invited to participate in the survey and, 2301 (90%) agreed. NUTRIHEP subjects were followed-up in 2015–2017 then, 951 of them were included. All subjects provided written information and consent according to the 1964 Helsinki Declaration.

The subjects participating in the MICOL and NUTRIHEP studies underwent anthropometric measurements, blood sampling and hepatic ultrasound. They were weighed wearing underwear, on an electronic scale, SECA; weight was approximated to the nearest 0.1 kg. Height was measured with a SECA wall stadiometer, approximated to the nearest 1 cm. Blood pressure (BP) measurements were performed following international guidelines⁴⁰. The mean of 3 BP measurements was calculated.

ML algorithm development

Data acquisition and pre-processing

The initial database for the MICOL III trial contained 2970 subjects. The sample declined to 2869 as for 101 subjects there were no data on at least one of the values among Waist Circumference (WC), Hips (HP) (variables for the constitution of AVI), Gamma-Glutamyl Transferase (GGT), Glucose. These 2869 subjects constituted the new database used for training and testing the NN using a train-test-split approach. From the NUTRIHEP database, initially composed of 2301 subjects, we randomly extracted a further 100 subjects to constitute the validation sample.

Variables used

The Variables used to develop the NN were: Sex, Age, Gamma-Glutamyl Transferase (GGT), Glucose, Abdominal Volume Index (AVI)⁴¹ and NAFLD condition.

We have previously highlighted that the best model to detect the NAFLD condition is based on the above variables. These variables were identified starting from a sample of 27 variables and exploiting a subset selection approach in order to identify the model with fewer variables and better performance³⁴. Table 1 shows the formula employed to build the AVI index.

Table 1 Index formula and its structure.

Full size table

AVI is the only compound index used, and this formula is easy to compute and the component variables are easily available as they consist of anthropometric measurements.

The array composed by Sex, Age, Gamma-Glutamyl Transferase (GGT), Glucose, Abdominal Volume Index (AVI) represents the X of our algorithm and the condition of NAFLD the Y.

NAFLD diagnosis was performed using an ultrasound scanner Hitachi H21 Vision (Hitachi Medical Corporation, Tokyo, Japan). Examination of the visible liver parenchyma was performed with a 3.5 MHz transducer.

Data exploration

Data were explored by using a t-Distributed Stochastic Neighbor Embedding (t-SNE)³⁶. It is an unsupervised and nonlinear technique used primarily for data exploration and visualization of high-dimensional data; its output shows how the data are organized in a high-dimensional space. This technique has not performed in optimal way failing to clearly discriminate the two classes 0 (No NAFLD), 1 (NAFLD), Fig. 1 shows data displayed with the t-SNE.

Hyperparameter tuning for the neural network

Initially, a NN was created using the Open Source library “scikit-learn”⁴² by Python.

For the interaction with the csv file containing the database, the library “numpy” (np)⁴³ by python was used.

The NN is an MLPClassifier (Perceptron Multilayer Classifier)⁴² and a supervised machine learning algorithm⁴⁴. The first fundamental step was to split the considered database using the “Train_test_split” (function present in scikit-learn) in order to divide the sample into two subsets (80% of the data used for NN training and the remaining 20% for the testing).

GridSearchCV⁴² was used to search for optimal parameters for the NN.

The GridSearchCV is included in the scikit-learn library.

We have performed the NN optimization for the following parameters:

Activation function: searched among (‘identity’, ‘logistic’, ‘tanh’, ‘relu’)
Solver type: limited memory Broyden-Fletcher-Goldfarb-Shanno algorithm (lbfgs)⁴⁵ Stochastic Gradient Descent⁴⁶, Adam⁴⁷, (‘lbfgs’, ‘sgd’, ‘adam’). "lbfgs" is an optimizer in the family of almost Newtonian methods⁴⁸. We selected "lbfgs" because for small data sets it can converge faster and get better performance.
Learning rate: searched among (‘constant’, ‘invscaling’, ‘adaptive’)
the Maximum number of iterations looking for it in a defined range of values (max_iter': [1000,1100,1200,1300,1400,1500,1600,1700,1800,1900,2000, 3000,4000,5000,6000,7000, 8000,9000],) Maximum number of iterations. The solver iterates until convergence (determined by "tol") or until the maximum number of iterations.
The alpha value searched for in a set of defined values (alpha': 10.0 ** -np.arange(0, 10),) Penalty parameter L2⁴⁹.
The number of hidden layers of the network 'hidden_layer_sizes': np.arange(0, 20), searched in a range from 0 to 20
And the value of 'random_state': [0,1,2,3,4,5,6,7,8,9,10] searched in the range from 0 to 10 to make sure the results were replicable.

MLPClassifier performs iterative training because at each time step the partial derivatives⁵⁰ of the loss function⁵⁰ are calculated with respect to the model parameters, in order to update the parameters. It can also have a regularization term added to the loss function that reduces the Model Parameters to prevent overfitting. The values obtained at the end of the NN optimization were:

activation: 'logistic'
alpha: 1.0
hidden_layer_sizes: 19
learning_rate: 'constant'
max_iter: 9000
random_state: 10
solver: 'lbfgs'

Training session and neural network test

The algorithm was trained using as target variable the NAFLD condition and as features Sex, Age, GGT, Glucose and AVI values.

The dataset used for the training and the test of the algorithms was the MICOL subjects, subdivided into the Test and Training subsets: 80% of the dataset was dedicated to the training phase while the remaining 20% was used in the model testing phase. The output reported the accuracy during training and testing, the value of the area of the Roc curve (AUC)^51,52 in the training and testing phase, the Confusion Matrix⁵³ and the value of Precision, Recall and F1-score in the testing phase.

Results

Participants characteristics and the performance of AVI indexes in MICOL subjects are shown in Table 2. The NAFLD prevalence was 31.7%, the condition being, as expected, more prevalent among men. Subjects with NAFLD were a little older, with increased levels of Glucose and GGT.

Table 2 Sample subset characteristics by NAFLD condition.

Full size table

In Table 3 are shown Participants characteristics and the performance of AVI indexes in the NUTRIHEP study. In the original study NAFLD prevalence was 24.3% and, as expected, more prevalent among men.

Table 3 Sample subset characteristics by NAFLD condition.

Full size table

Neural network performance analysis

The first parameter considered to evaluate the performance of the NN was the accuracy defined as⁵⁴:

$$ {\text{Accuracy }} = { }\frac{{{\text{Number}}\;{\text{of}}\;{\text{correct}}\;{\text{preditions}}}}{{{\text{Total}}\;{\text{number}}\;{\text{of}}\;{\text{preditcions}}}}*{ }100{ } $$

More specifically, the accuracy of a model is calculated with the following formula⁵⁴:

$$ {\text{Accuracy }} = \frac{{{\text{TP }} + {\text{ TN}}}}{{{\text{TP}} + {\text{TN}} + {\text{FP}} + {\text{FN}}}}*100{\text{\% }} $$

where TP = True Positive, TN = True Negative, FP = False Positive and FN = False Negative.

Accuracy was measured during both the NN training and the testing phase.

Another performance index that we considered was the value of the ROC curve⁵². The area under the ROC (AUC, "Area Under the Curve") is a measure of accuracy and indicates the diagnostic power of the test.

In Figs. 2 and 3 the ROC curves with the AUC value obtained during the training phase and testing phase are shown.

In addition to the accuracy and ROC curve values, we evaluated the confusion matrix to verify the reliability of the NN. Figure 4 shows the confusion matrix values in the test phase.

In addition, the Positive (PPV) (0.57) and Negative (NPV) (0.86) predictive values were calculated. It is worth to note that the NN is able to identify subjects without the condition with a very high precision.

Table 4 shows the Accuracy and AUC values obtained during training and testing of the NN.

Table 4 Accuracy and AUC values in the training and test phase.

Full size table

The values obtained for AUC and Accuracy (both for the training phase and for the test phase) show that the NN implemented does not present overfitting or underfitting problems, because the values of the two ROC curves and the values related to the accuracy differ very slightly. Additionally, in order to validate the performance of the NN precision, recall and f1-score values during the test phase were evaluated. In Table 5 are shown values of Precision, Recall, f1-score of No NAFLD and, NAFLD subject, Macro average and Weighted average during test phase.

Table 5 Value of precision, recall and F1-score on test set.

Full size table

Evaluating Explainability using SHAP

After verifying the behavior of the NN by comparing the various indices considered, we performed with the analysis of Explainability (XAI) using LIME⁵⁵ and the SHAP⁵⁶ library of Python to compare any inconsistencies. We initially proceeded to the evaluation by performing a relevance analysis of the features in order to verify whether the anthropometric and biochemical variables considered gave a real and consistent contribution in the diagnosis of NAFLD. Figures 4 and 5 show the contribution given by each feature used in the diagnosis of NAFLD within the NN during the Training and Test.

Figures 5 and 6 shows the importance of AVI, GGT and Age as already highlighted in previous studies³⁴ are more important than sex and glucose in the diagnosis of this pathology but still combining them all together they lead to a good diagnostic result in a NAFLD diagnosis.

In Figs. 7 and 8 we report the previous graph seen in another way, more specifically we can understand:

Feature importance: variables ranked in descending order of importance.
Impact: horizontal position shows whether the effect of that value is associated with a higher or lower prediction.
Value: color shows whether that variable is high or low for that observation. Red color deducts the high value and blue for the lower value. The change in color of the dot shows the value of the feature. Correlation: Of each characteristic with the pathology being examined.

Evaluating Explainability using LIME

Subsequently exploiting the LIME library, it has been verified how the NN has reasoned in order to obtain a diagnosis verifying both the case of diagnosis of "sick subject" and that of "healthy subject".

Figure 9 shows which characteristics had a greater impact on a diagnosis of disease present and which had a greater impact on a diagnosis of disease absent with relative final diagnosis. Regarding subjects diagnosed as sick, the features that contributed most to directing the NN toward a diagnosis of sick subject were AVI, age, and GGT value demonstrating how the NN performs optimal reasoning.

Figure 10 shows what concerns the characteristics that contribute to the identification of healthy subjects, the NN took into consideration the values that from the clinical diagnosis are standard values of GGT, Glucose and a low value of the AVI index.

Also, in the diagnosis of healthy subjects the NN has produced an optimal reasoning correctly directing the diagnosis.

Export of the trained algorithm and incorporation into the web app

After the NN training and testing and the XAI analysis we exported the already trained model. In this way it is possible to avoid repeating the training every time we want to perform a new forecast. The model export was done using the “pickle” tool by Python⁵⁷, which allowed the generation of a file with the extension “.pkl”. This file is then loaded by means of another python program which can be used to make a new forecast. Another important function implemented is the creation of a web application written using the HTML languages⁵⁸, CSS⁵⁹ and JavaScript⁶⁰. This web application can interface with the trained NN to test it on new data, different from those used to train the original NN. The interface of the web application with the trained algorithm was implemented through the “flask library”⁶¹ by python. A flask object receives a request from the web and displays the HTML file that allows it to interface with the NN.

The user can fill in the form present in a web page and after clicking the submit button, the flask object receives a request, extracts the input, runs it through the template and finally displays the HTML page with the result of the prediction.

The HTML page includes various fields in which to enter variables, and a submit button to pass the input data to the NN that will perform the prediction. At the end of the prediction, the HTML page will display the NAFLD status: “NAFLD Detected” or the string “No NAFLD Detected”.

The web app also includes the automatic calculation of the AVI parameter from the values for hips Circumference and waist Circumference using the code implemented in Javascript.

Test of the web app on a sample of subjects with known NAFLD

To test the web app, the database previously formed by random extraction of 100 subjects participating in the NUTRIHEP study was used. The web app was passed the data: age, Sex, GGT, GLUCOSE, WC, HC.

After the input of the parameters and clicking the submit button, the values were sent to the NN. The web app feedback, related to the NAFLD status, was then saved in a dataset used for comparison with the true NAFLD condition, already known to us.

Using the saved dataset, we could calculate the accuracy, sensitivity and specificity of the web app.

In the sample considered, there were 50 subjects affected by NAFLD and 50 healthy subjects. The NN correctly identified all the healthy subjects but made 18 errors, all false negatives. On this result we calculated the values of Specificity and Sensitivity of the NN.

It is important to point out that many of the subjects considered healthy by the NN had anthropometric and biochemical values in the norm, but it is possible that these subjects were affected by mild NAFLD, although with values still within normal range⁶².

Table 6 shows the sensitivity and specificity values for the NN in the web app.

Table 6 Sensitivity and specificity values for the neural network in the web app.

Full size table

Discussion

In this study, a NN to support NAFLD diagnosis has been developed on a model made up of easily available variables, as already highlighted in our previous work³⁴.

In particular, in this work we trained a NN to identify patients at risk of NAFLD and, developed a local web app for use as a tool in epidemiological studies and screening. The aim was to make a prior identification of healthy patients in order to ensure that only subjects really needing it are sent on for ultrasound examination.

Today, alternative, less expensive methods of diagnosis compared to traditional tools (MRI, Ultrasound) are very important in the diagnosis of NAFLD. The reorganization of the National Health System requires close consideration of aspects related to performance together with factors related to the reduction of costs and waiting times. The objective of our study was to create a NN implementing an intuitive and easy application to support medical decisions during the diagnostic phase using simpler and cheaper tools, thus reducing both costs and waiting times related to the use of instrumental methods. We highlight that it would thereby be possible to use simple computers to make a diagnosis of NAFLD, resulting in a faster diagnosis and thus preventing disease evolution and the resulting serious consequences.

Several prediction models for NAFLD in the literature have been developed to identify healthy subjects and subjects with NAFLD. These existing NAFLD prediction models have employed clinical and laboratory parameters; however, some parameters are not always routinely measured or retrievable in health databases^63,64. This limits the use of these models in large-scale epidemiologic studies and health database research. Specifically comparing the AUC of NN (0.821) with traditional methods we could verify that the performance in terms of AUC is superior to LAP⁶⁵ (0.79), Hepatic steatosis index⁶⁶ (0.81), SteatoTest⁶⁷ (0.79), APRI⁶⁸ (0.60), NAFLD fibrosis score⁶⁹ (0.82). When considering some studies exploiting AI techniques, we could verify that a new approach using LWA (learning by abstraction) method classifies liver ultrasound images as normal or abnormal and does not classify the data unless it is confident of accurate prediction. Features were extracted from ROIs within 99 ultrasound images and were used to train NN, SVM, and LWA classifiers with fivefold cross-validation. The proposed LWA method outperformed the other classifiers with an AUROC of 0.78⁷⁰. In a second study, the prediction ability of particle swarm optimization (PSO), GA, MReg (multilinear regression), and alternative decision tree (ADT) algorithms were compared using medical data from 39,567 patients. Using uniform random sampling, the dataset was divided into training (22,690 patients) and test (16,877) sets. Four algorithms were applied for classification using tenfold cross-validation. The results evidenced that the ADT model had an AUROC between 0.73 and 0.76⁷¹. In another study factors provided by the 2005 updated ATP III clinical criteria for metabolic syndrome (MetS) along with age and gender were used to create a NAFLD prediction model. After preprocessing data from 40,637 patients they were divided into 66% and 34% for training and testing sets, respectively. The classification was performed by the J48 algorithm using hold-out cross-validation, and the AUROC of 0.731 was achieved⁷². NN also performed better in these cases.

From the described results, it can be seen that the NN, using AVI plus Glucose plus GGT plus Sex plus Age, produced few prediction errors in the test phase, whereas the accuracy percentage was not very high. However, the 18% error (18 of 100 subjects) in the test phase may be open to doubt, since it is possible that these subjects were developing NAFLD and so merely diagnosed in advance).

It has been demonstrated that the good performance of the ML algorithms used to identify NAFLD, applying common anthropometric parameters and other variables, can be a valid alternative to the classic indexes^73,74.

Moreover, the NN was able to correctly identify all the subjects without NAFLD, as evidenced by the high VPP value (0.86). This VPP satisfies our objectives to detect subjects without NAFLD to avoid referral to perform more expensive diagnostic procedures.

This type of study highlights the fact that a NN can be used to find high-risk NAFLD subjects to send on for US. In this way, 82.6% of unnecessary US tests could be avoided (this value was calculated as the ratio of the total number of subjects in the web app test set, divided by the total number of subjects in the web app test set plus the number of false predictions).

In addition, to lighten the waiting lists, our aim was to develop a machine learning algorithm that would allow savings by eliminating a number of US that would otherwise be prescribed. The NN developed is therefore useful to exclude NAFLD and may be considered a valid diagnostic support in the context of epidemiological studies, not merely a smart working replacement diagnostic tool.

In conclusion, the NN can be considered a valid support for medical decision making in regard to health policies, in the context of epidemiological studies and screening.

Study limitations

There are several limitations to this work. The most significant is that this study was conducted in a single center and so has a rather limited sample size. Deep learning models in other fields have included millions of samples. Another problem is that the NN is strongly linked to the identification of the NAFLD condition only in a Mediterranean population with the characteristics on which it was formed. A second limitation is the low sensitivity of the NAFLD diagnostic methodology, as it fails to detect a fatty liver content as low as > 25%⁷⁵. However, both databases were drawn from population-based studies and subjects were selected from electoral lists or from the physicians lists. Moreover, participants subjects did not seek medical assistance and participated on a voluntary basis. Therefore, the NAFLD diagnosis performed by US was the only diagnostic procedure that could be proposed to participants, since biopsy or H-MRS would obviously be unethical.

Future developments

In the future the NN based web app can be improved by using a SQL database where to save the entered data and, providing feedback to the app (correct or wrong prediction) in order to continue its training and make it more flexible so that it can be used on any kind of population. This could be done by leveraging a document classification system⁷⁶ to retrieve data from electronic medical records and then building an open dataset⁷⁷ in order to improve with more heterogeneous data the web app.

Conclusion

The application of ML in the diagnosis of NAFLD is an efficient approach to identify healthy subjects. The model we propose has that can be exploited to target only those subjects who have a real need for further investigation, thus leading to a reduction in waiting lists, costs and time required for instrumental examinations. In this research we have predicted the risk of developing NAFLD in individuals using biochemical and anthropometric variables in a NN. The rationale behind our approach is divided into two parts: first train, evaluate performance and validate the result in assessing NAFLD risk in an individual. Second, development of a local web app that incorporates the previously evaluated NN, compare its performance applying in this way a rapid and non-invasive methodology in order to demonstrate that the proposed technique is suitable for optimal discrimination for NAFLD risk assessment. It is worthy to note that through XAI, it is possible to identify the factors that contribute to a given diagnosis. This facilitates the physician to do informed choices about their patients management and improve the health conditions of the subjects.

Abbreviations

US:: Ultrasound scan
NAFLD:: Non-alcoholic fatty liver disease
WC:: Waist circumference
HP:: Hips
FLI:: Fatty liver index
ML:: Machine learning
AVI:: Abdominal volume index
GGT:: Gamma-glutamyl transferase
NASH:: Non-alcoholic steatohepatitis
NN:: Neural network
MRI:: Magnetic resonance imaging
BP:: Blood pressure
TP:: True positive
TN:: True negative
FP:: False positive
FN:: False negative
CSS:: Cascading style sheets
HTML:: HyperText markup language
AUC:: Area under the curve
PPV:: Positive predictive value
NPV:: Negative predictive value

References

Fazel, Y., Koenig, A. B., Sayiner, M., Goodman, Z. D. & Younossi, Z. M. Epidemiology and natural history of non-alcoholic fatty liver disease. Metabolism 65, 1017–1025 (2016).
Article CAS PubMed Google Scholar
Levene, A. P. & Goldin, R. D. The epidemiology, pathogenesis and histopathology of fatty liver disease. Histopathology 61, 141–152 (2012).
Article PubMed Google Scholar
Preiss, D. & Sattar, N. Non-alcoholic fatty liver disease: An overview of prevalence, diagnosis, pathogenesis and treatment considerations. Clin. Sci. (Lond.) 115, 141–150 (2008).
Article CAS Google Scholar
Neuschwander-Tetri, B. A. & Caldwell, S. H. Nonalcoholic steatohepatitis: Summary of an AASLD single topic conference. Hepatology 37, 1202–1219 (2003).
Article PubMed Google Scholar
Zelber-Sagi, S., Ratziu, V. & Oren, R. Nutrition and physical activity in NAFLD: An overview of the epidemiological evidence. World J. Gastroenterol. 17, 3377–3389 (2011).
Article PubMed PubMed Central Google Scholar
Younossi, Z. M. et al. Global epidemiology of nonalcoholic fatty liver disease-meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology 64, 73–84 (2016).
Article PubMed Google Scholar
Cozzolongo, R. et al. Epidemiology of HCV infection in the general population: A survey in a southern Italian town. Am. J. Gastroenterol. 104, 2740–2746 (2009).
Article PubMed Google Scholar
Marchesini, G., Marzocchi, R., Agostini, F. & Bugianesi, E. Nonalcoholic fatty liver disease and the metabolic syndrome. Curr. Opin. Lipidol. 16, 421–427 (2005).
Article CAS PubMed Google Scholar
Ratziu, V., Bellentani, S., Cortez-Pinto, H., Day, C. & Marchesini, G. A position statement on NAFLD/NASH based on the EASL 2009 special conference. J. Hepatol. 53, 372–384 (2010).
Article PubMed Google Scholar
Schuppan, D. & Afdhal, N. H. Liver cirrhosis. Lancet 371, 838–851 (2008).
Article CAS PubMed PubMed Central Google Scholar
Mahana, D. et al. Antibiotic perturbation of the murine gut microbiome enhances the adiposity, insulin resistance, and liver disease associated with high-fat diet. Genome Med. 8, 1–20 (2016).
Article CAS Google Scholar
Bedogni, G. et al. The fatty liver index: A simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 6, 33 (2006).
Article ADS PubMed PubMed Central CAS Google Scholar
Procino, F. et al. Reducing NAFLD-screening time: A comparative study of eight diagnostic methods offering an alternative to ultrasound scans. Liver Int. 39, 187–196 (2019).
Article PubMed Google Scholar
Mohammed, M., Khan, M. B. & Bashier, E. B. M. Machine Learning: Algorithms and Applications (CRC Press, 2016).
Book Google Scholar
Napoli, C., Benincasa, G., Schiano, C. & Salvatore, M. Differential epigenetic factors in the prediction of cardiovascular risk in diabetic patients. Eur. Heart J. Cardiovasc. Pharmacother. 6, 239–247 (2020).
Article PubMed Google Scholar
Dagliati, A. et al. Machine learning methods to predict diabetes complications. J. Diabetes Sci. Technol. 12, 295–302 (2018).
Article PubMed Google Scholar
Kukar, M., Kononenko, I., Groselj, C., Kralj, K. & Fettich, J. Analysing and improving the diagnosis of ischaemic heart disease with machine learning. Artif. Intell. Med. 16, 25–50 (1999).
Article CAS PubMed Google Scholar
Kourou, K., Exarchos, T. P., Exarchos, K. P., Karamouzis, M. V. & Fotiadis, D. I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 13, 8–17 (2015).
Article CAS PubMed Google Scholar
Ferraioli, G. & Monteiro, L. B. S. Ultrasound-based techniques for the diagnosis of liver steatosis. World J. Gastroenterol. 25, 6053 (2019).
Article PubMed PubMed Central Google Scholar
Schaapman, J. J., Tushuizen, M. E., Coenraad, M. J. & Lamb, H. J. Multiparametric MRI in patients with nonalcoholic fatty liver disease. J. Magn. Reson. Imaging 53, 1623–1631 (2021).
Article PubMed Google Scholar
Papatheodoridi, M. & Cholongitas, E. Diagnosis of non-alcoholic fatty liver disease (NAFLD): Current concepts. Curr. Pharm. Des. 24, 4574–4586 (2018).
Article CAS PubMed Google Scholar
Stachowska, E., Portincasa, P., Jamioł-Milc, D., Maciejewska-Markiewicz, D. & Skonieczna-Żydecka, K. The relationship between prebiotic supplementation and anthropometric and biochemical parameters in patients with NAFLD-A systematic review and meta-analysis of randomized controlled trials. Nutrients 12, 3460 (2020).
Article CAS PubMed Central Google Scholar
Cotter, T. G. et al. Nonalcoholic fatty liver disease: Impact on healthcare resource utilization, liver transplantation and mortality in a large, integrated healthcare system. J. Gastroenterol. 55, 722–730 (2020).
Article PubMed Google Scholar
Jiang, T. et al. Application of computer tongue image analysis technology in the diagnosis of NAFLD. Comput. Biol. Med. 135, 104622 (2021).
Article PubMed Google Scholar
Taylor-Weiner, A. et al. A machine learning approach enables quantitative measurement of liver histology and disease monitoring in NASH. Hepatology 74, 133–147 (2021).
Article PubMed Google Scholar
Feng, G. et al. Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD. J. Hepatobiliary Pancreat. Sci. 28, 593–603 (2021).
Article PubMed Google Scholar
Qu, H. et al. Training of computational algorithms to predict NAFLD activity score and fibrosis stage from liver histopathology slides. Comput. Methods Prog. Biomed. 207, 106153 (2021).
Article Google Scholar
Schwenzer, N. F. et al. Non-invasive assessment and quantification of liver steatosis by ultrasound, computed tomography and magnetic resonance. J. Hepatol. 51, 433–445 (2009).
Article PubMed Google Scholar
Calès, P. et al. Reproducibility of blood tests of liver fibrosis in clinical practice. Clin. Biochem. 41, 10–18 (2008).
Article PubMed Google Scholar
Fatima, M. & Pasha, M. Survey of machine learning algorithms for disease diagnostic. J. Intell. Learn. Syst. Appl. 9, 1 (2017).
Google Scholar
Vijayarani, S. & Dhayanand, S. Liver disease prediction using SVM and Naïve Bayes algorithms. Int. J. Sci., Eng. Technol. Res. (IJSETR) 4, 816–820 (2015).
Google Scholar
Hadizadeh, F., Faghihimani, E. & Adibi, P. Nonalcoholic fatty liver disease: Diagnostic biomarkers. World J. Gastrointest. Pathophysiol. 8, 11 (2017).
Article PubMed PubMed Central Google Scholar
Das, A., Connell, M. & Khetarpal, S. Digital image analysis of ultrasound images using machine learning to diagnose pediatric nonalcoholic fatty liver disease. Clin. Imaging 77, 62–68 (2021).
Article PubMed Google Scholar
Sorino, P. et al. Selecting the best machine learning algorithm to support the diagnosis of non-alcoholic fatty liver disease: A meta learner study. PLoS ONE 15, e0240867 (2020).
Article CAS PubMed PubMed Central Google Scholar
Arrieta, A. B. et al. Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 58, 82–115 (2020).
Article Google Scholar
Linderman, G. C. & Steinerberger, S. Clustering with t-SNE, provably. SIAM J. Math. Data Sci. 1, 313–332 (2019).
Article MathSciNet PubMed PubMed Central Google Scholar
Osella, A. R. et al. Overweight and obesity in southern Italy: Their association with social and life-style characteristics and their effect on levels of biologic markers. Rev. Fac. Cien. Med. Univ. Nac. Cordoba 71, 113–124 (2014).
PubMed Google Scholar
Osella, A. R., Misciagna, G., Leone, A., Di Leo, A. & Fiore, G. Epidemiology of hepatitis C virus infection in an area of southern Italy. J. Hepatol. 27, 30–35 (1997).
Article CAS PubMed Google Scholar
Misciagna, G. et al. Epidemiology of cholelithiasis in southern Italy. Part II: Risk factors. Eur. J. Gastroenterol. Hepatol. 8, 585–593 (1996).
Article CAS PubMed Google Scholar
Sever, P. New hypertension guidelines from the National Institute for Health and clinical excellence and the British hypertension society. J. Renin-Angiotensin-Aldosterone Syst. 7, 61–63 (2006).
Article PubMed Google Scholar
Guerrero-Romero, F. & Rodríguez-Morán, M. Abdominal volume index. An anthropometry-based index for estimation of obesity is strongly related to impaired glucose tolerance and type 2 diabetes mellitus. Arch. Med. Res. 34, 428–432 (2003).
Article PubMed Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Cunningham, P., Cord, M. & Delany, S. J. Supervised learning. In Machine Learning Techniques for Multimedia 21–49 (Springer, 2008).
Saputro, D. R. S. & Widyaningsih, P. Limited memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) method for the parameter estimation on geographically weighted ordinal logistic regression model (GWOLR). In AIP Conference Proceedings, Vol. 1868, 040009 (AIP Publishing LLC, 2017).
Bottou, L. Large-Scale Machine Learning with Stochastic Gradient Descent 177–186 (Physica-Verlag HD, 2010).
MATH Google Scholar
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
Grippo, L. & Sciandrone, M. Metodi quasi-Newton. In Metodi di ottimizzazione non vincolata 289–323 (Springer, 2011).
Ng, A. Y. Feature selection, L 1 vs. L 2 regularization, and rotational invariance. In Proceedings of the Twenty-First International Conference on Machine learning 78 (2004).
Ashcroft, M. Advanced Machine Learning: Training Basic Neural Networks.
ROC Curve. in Encyclopedia of Machine Learning (eds. Sammut, C. & Webb, G. I.) 875–875 (Springer, 2010).
Melo, F. Area under the ROC curve. in Encyclopedia of Systems Biology (eds. Dubitzky, W., Wolkenhauer, O., Cho, K.-H. & Yokota, H.) 38–39 (Springer, 2013).
Ting, K. M. Confusion matrix. in Encyclopedia of Machine Learning and Data Mining (eds. Sammut, C. & Webb, G. I.) 260–260 (Springer, 2017).
Biswas, A. K., Noman, N. & Sikder, A. R. Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information. BMC Bioinform. 11, 273 (2010).
Article CAS Google Scholar
Samuel, T. S. B. Comparing the Explainability of Different Crop Disease Identification Models Using LIME (2021).
Bugaj, M., Wrobel, K. & Iwaniec, J. Model explainability using SHAP values for LightGBM predictions. In 2021 IEEE XVIIth International Conference on the Perspective Technologies and Methods in MEMS Design (MEMSTECH) 102–106 (IEEE, 2021).
Rossum, G. V. The Python Library Reference: Release 3.6.4 (2018).
Patel, K. Incremental journey for World Wide Web: Introduced with web 1.0 to recent web 5.0–a survey paper. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 3, 1–9 (2013).
Google Scholar
Duckett, J. HTML & CSS: Design and Build Websites (Wiley, 2011).
Google Scholar
Flanagan, D. & Novak, G. M. Java-Script: The Definitive Guide (American Institute of Physics, 1998).
Google Scholar
Grinberg, M. Flask Web Development: Developing Web Applications with Python (O’Reilly Media Inc, 2018).
Google Scholar
Kumar, R. & Mohan, S. Non-alcoholic fatty liver disease in lean subjects: Characteristics and implications. J. Clin. Transl. Hepatol. 5, 216–223 (2017).
PubMed PubMed Central Google Scholar
Kwok, R. et al. Systematic review with meta-analysis: Non-invasive assessment of non-alcoholic fatty liver disease–the role of transient elastography and plasma cytokeratin-18 fragments. Aliment. Pharmacol. Ther. 39, 254–269 (2014).
Article CAS PubMed Google Scholar
Shen, J. et al. Assessment of non-alcoholic fatty liver disease using serum total cell death and apoptosis markers. Aliment. Pharmacol. Ther. 36, 1057–1066 (2012).
Article CAS PubMed Google Scholar
Bedogni, G., Kahn, H. S., Bellentani, S. & Tiribelli, C. A simple index of lipid overaccumulation is a good marker of liver steatosis. BMC Gastroenterol. 10, 1–8 (2010).
Article CAS Google Scholar
Lee, J. H. et al. Hepatic steatosis index: A simple screening tool reflecting nonalcoholic fatty liver disease. Dig. Liver Dis. 42, 503–508 (2010).
Article CAS PubMed Google Scholar
Poynard, T. et al. The diagnostic value of biomarkers (SteatoTest) for the prediction of liver steatosis. Comp. Hepatol. 4, 1–14 (2005).
Article CAS Google Scholar
Sebastiani, G. et al. The impact of liver disease aetiology and the stages of hepatic fibrosis on the performance of non-invasive fibrosis biomarkers: An international study of 2411 cases. Aliment. Pharmacol. Ther. 34, 1202–1216 (2011).
Article CAS PubMed Google Scholar
Angulo, P. et al. The NAFLD fibrosis score: A noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology 45, 846–854 (2007).
Article CAS PubMed Google Scholar
Hamid, K., Asif, A., Abbasi, W. & Sabih, D. Machine learning with abstention for automated liver disease diagnosis. In 2017 International Conference on Frontiers of Information Technology (FIT) 356–361 (IEEE, 2017).
Hashem, S. et al. Comparison of machine learning approaches for prediction of advanced liver fibrosis in chronic hepatitis C patients. IEEE/ACM Trans. Comput. Biol. Bioinf. 15, 861–868 (2017).
Article Google Scholar
Perveen, S., Shahbaz, M., Keshavjee, K. & Guergachi, A. A systematic machine learning based approach for the diagnosis of non-alcoholic fatty liver disease risk and progression. Sci. Rep. 8, 1–12 (2018).
Article CAS Google Scholar
Yip, T.C.-F. et al. Laboratory parameter-based machine learning model for excluding non-alcoholic fatty liver disease (NAFLD) in the general population. Aliment. Pharmacol. Ther. 46, 447–456 (2017).
Article CAS PubMed Google Scholar
Canbay, A. et al. Non-invasive assessment of NAFLD as systemic disease—A machine learning perspective. PLoS ONE 14, e0214436 (2019).
Article CAS PubMed PubMed Central Google Scholar
Saadeh, S. et al. The utility of radiological imaging in nonalcoholic fatty liver disease. Gastroenterology 123, 745–750 (2002).
Article PubMed Google Scholar
Bianchi, M., Draoli, M., Fallucchi, F. & Ligi, A. Service Level Agreement Constraints into Processes for Document Classification 545–550 (2014).
Fallucchi, F., Petito, M. & De Luca, E. in Analysing and Visualising Open Data Within the Data and Analytics Framework: 12th International Conference, MTSR 2018, Limassol, Cyprus, October 23–26, 2018, Revised Selected Papers 135–146 (2019).

Download references

Acknowledgements

Computations for this research were performed in the Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, “S de Bellis” Research Hospital, MICOL Working Group: Vittorio Pugliese (Laboratory of Epidemiology and Biostatistics), Mario Correale, Palma Iacovazzi, Anna Mastrosimini, Giampiero De Michele (Laboratory of Clinical Pathology), Osvaldo Burattini (Unit of Gastroenterology), Valeria Tutino, Benedetta D’Attoma (Laboratory of Nutritional Biochemistry), Maria R Noviello (Department of Radiology), National Institute of Gastroenterology “S de Bellis” Research Hospital, Castellana Grotte (BA), Italy.

Funding

MICOL III: This research was supported by a public Grant from the Ministry of Health, Italy (Progetto Finalizzato del Ministero della Salute, ICS 160.2/RF 2003, 2004/2006). NUTRIHEP: This research was supported by a public Grant from the Ministry of Health, Italy (Progetto Finalizzato delMinistero della Salute- Progetto no. 37-2004), NUTRIHEP FOLLOW-UP: This research was supported by a public Grant from the Ministry of Health, Italy (Ricerca Corrente DDG 045 del 24.01.2017) and by Apulia Region-D.G.R. n. 1159, 28/6/ 2018 and 2019.

Author information

Authors and Affiliations

Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, “S de Bellis” Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Paolo Sorino, Angelo Campanella, Caterina Bonfiglio, Antonella Mirizzi, Isabella Franco, Antonella Bianco, Claudia Buongiorno, Rosalba Liuzzi & Alberto Rubén Osella
Laboratory of Nutritional Biochemistry, National Institute of Gastroenterology, “S de Bellis” Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Maria Gabriella Caruso & Maria Notarnicola
Scientific and Ethical Committee, Polyclinic Hospital, University of Bari, Piazza Giulio Cesare, 11, 70124, Bari, BA, Italy
Giovanni Misciagna
Human Nutrition Research Center (CenINH), School of Nutrition, Faculty of Medical Sciences, Universidad Nacional de Córdoba, Córdoba, Argentina
Laura R. Aballay
Clinical Nutrition Outpatient Clinic, National Institute of Gastroenterology, “S de Bellis” Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Anna Maria Cisternino
San Giacomo Hospital, Largo S. Veneziani, 21, 70043, Monopoli, BA, Italy
Marisa Chiloiro
Department of Engineering Sciences, Guglielmo Marconi University, Via plinio 44, 00193, Rome, Italy
Francesca Fallucchi
Department of Electrical and Information Engineering, Polytechnic of Bari, Via Re David, 200, 70125, Bari, BA, Italy
Giovanni Pascoschi

Authors

Paolo Sorino
View author publications
You can also search for this author in PubMed Google Scholar
Angelo Campanella
View author publications
You can also search for this author in PubMed Google Scholar
Caterina Bonfiglio
View author publications
You can also search for this author in PubMed Google Scholar
Antonella Mirizzi
View author publications
You can also search for this author in PubMed Google Scholar
Isabella Franco
View author publications
You can also search for this author in PubMed Google Scholar
Antonella Bianco
View author publications
You can also search for this author in PubMed Google Scholar
Maria Gabriella Caruso
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Misciagna
View author publications
You can also search for this author in PubMed Google Scholar
Laura R. Aballay
View author publications
You can also search for this author in PubMed Google Scholar
Claudia Buongiorno
View author publications
You can also search for this author in PubMed Google Scholar
Rosalba Liuzzi
View author publications
You can also search for this author in PubMed Google Scholar
Anna Maria Cisternino
View author publications
You can also search for this author in PubMed Google Scholar
Maria Notarnicola
View author publications
You can also search for this author in PubMed Google Scholar
Marisa Chiloiro
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Fallucchi
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Pascoschi
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Rubén Osella
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R.O. and G.P. conceived and designed the study, revised it critically and approved the final version; P.S. wrote a draft of the paper and contributed for important intellectual content, analyzed and interpreted the data and Formal analysis; G.M., M.G.C., A.C. and F.F. contributed to drafting the article; M.C. performed all Ultrasound scans and contributed to drafting the article; C.B., C.Bu., I.F., A.B., A.M., L.R.A., R.L., M.N., A.M.C. worked in the acquisition of data and critically read the paper.

Corresponding author

Correspondence to Alberto Rubén Osella.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sorino, P., Campanella, A., Bonfiglio, C. et al. Development and validation of a neural network for NAFLD diagnosis. Sci Rep 11, 20240 (2021). https://doi.org/10.1038/s41598-021-99400-y

Download citation

Received: 16 April 2021
Accepted: 24 September 2021
Published: 12 October 2021
DOI: https://doi.org/10.1038/s41598-021-99400-y

This article is cited by

Noninvasive Diagnostic Technique for Nonalcoholic Fatty Liver Disease Based on Features of Tongue Images
- Rong-rui Wang
- Jia-liang Chen
- Shu-kun Yao
Chinese Journal of Integrative Medicine (2024)
Artificial Intelligence in Liver Diseases: Recent Advances
- Feifei Lu
- Yao Meng
- Xingshun Qi
Advances in Therapy (2024)
Application of multiple-finding segmentation utilizing Mask R-CNN-based deep learning in a rat model of drug-induced liver injury
- Eun Bok Baek
- Jaeku Lee
- Jae-Woo Cho
Scientific Reports (2023)
Prediction of decreased estimated glomerular filtration rate using liver fibrosis markers: a renal biopsy-based study
- Akira Mima
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

NASHmap: clinical utility of a machine learning model to identify patients at risk of NASH in real-world settings

Artificial intelligence outperforms standard blood-based scores in identifying liver fibrosis patients in primary care

Machine learning classifiers for screening nonalcoholic fatty liver disease in general adults

Introduction

Methods

Population

ML algorithm development

Data acquisition and pre-processing

Variables used

Data exploration

Hyperparameter tuning for the neural network

Training session and neural network test

Results

Neural network performance analysis

Evaluating Explainability using SHAP

Evaluating Explainability using LIME

Export of the trained algorithm and incorporation into the web app

Test of the web app on a sample of subjects with known NAFLD

Discussion

Study limitations

Future developments

Conclusion

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Noninvasive Diagnostic Technique for Nonalcoholic Fatty Liver Disease Based on Features of Tongue Images

Artificial Intelligence in Liver Diseases: Recent Advances

Application of multiple-finding segmentation utilizing Mask R-CNN-based deep learning in a rat model of drug-induced liver injury

Prediction of decreased estimated glomerular filtration rate using liver fibrosis markers: a renal biopsy-based study

Comments

Search

Quick links