Predicting three-month fasting blood glucose and glycated hemoglobin changes in patients with type 2 diabetes mellitus based on multiple machine learning algorithms

Tao, Xue; Jiang, Min; Liu, Yumeng; Hu, Qi; Zhu, Baoqiang; Hu, Jiaqiang; Guo, Wenmei; Wu, Xingwei; Xiong, Yu; Shi, Xia; Zhang, Xueli; Han, Xu; Li, Wenyuan; Tong, Rongsheng; Long, Enwu

doi:10.1038/s41598-023-43240-5

Download PDF

Article
Open access
Published: 30 September 2023

Predicting three-month fasting blood glucose and glycated hemoglobin changes in patients with type 2 diabetes mellitus based on multiple machine learning algorithms

Xue Tao¹,
Min Jiang²,
Yumeng Liu³,
Qi Hu⁴,
Baoqiang Zhu⁵,
Jiaqiang Hu¹,
Wenmei Guo¹,
Xingwei Wu¹,
Yu Xiong⁶,
Xia Shi¹,
Xueli Zhang⁷,
Xu Han⁷,
Wenyuan Li¹,
Rongsheng Tong¹ &
…
Enwu Long¹

Scientific Reports volume 13, Article number: 16437 (2023) Cite this article

1806 Accesses
3 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Fasting blood glucose (FBG) and glycosylated hemoglobin (HbA1c) are key indicators reflecting blood glucose control in type 2 diabetes mellitus (T2DM) patients. The purpose of this study is to establish a predictive model for blood glucose changes in T2DM patients after 3 months of treatment, achieving personalized treatment.A retrospective study was conducted on type 2 diabetes mellitus real-world medical data from 4 cities in Sichuan Province, China from January 2015 to December 2020. After data preprocessing, data inputting, data sampling, and feature screening, 16 kinds of machine learning methods were used to construct prediction models, and 5 prediction models with the best prediction performance were screened respectively. A total of 100,000 cases were included to establish the FBG model, and 2,169 cases were established to establish the HbA1c model. The best prediction model both of FBG and HbA1c finally obtained are realized by ensemble learning and modified random forest inputting, the AUC values are 0.819 and 0.970, respectively. The most important indicators of the FBG and HbA1c prediction model were FBG and HbA1c. Medication compliance, follow-up outcome, dietary habits, BMI, and waist circumference also had a greater impact on FBG levels. The prediction accuracy of the models of the two blood glucose control indicators is high and has certain clinical applicability.HbA1c and FBG are mutually important predictors, and there is a close relationship between them.

Early detection of type 2 diabetes mellitus using machine learning-based prediction models

Article Open access 20 July 2020

Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study

Article Open access 10 March 2020

Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods

Article Open access 11 August 2023

Introduction

Diabetes mellitus is a chronic progressive disease characterized by disorders of glucose metabolism¹. In recent years, with the social and economic development of countries around the world and the improvement of residents' living standards, the incidence rate and prevalence rate of diabetes have increased year by year, which has become a major social problem threatening people's health and has attracted the attention and attention of governments, health departments and medical workers in various countries. According to epidemiological data from the International Diabetes Federation (IDF): the global diabetes prevalence in 20–79 years old in 2021 was estimated to be 10.5% (536.6 million people), rising to 12.2% (783.2 million) in 20,452². And in 2021, almost one in two adults (20–79 years old) with diabetes were unaware of their diabetes status (44.7%; 239.7 million)³.

Type 2 diabetes mellitus (T2DM) patients accounted for more than 90.0% of the total diabetic patients⁴. With the development of the disease, most T2DM patients will have different degrees of complications, which will seriously reduce the quality of life of the patients and bring a heavy economic burden to the patients' families⁵. And the severity of complications is inseparable from glycemic control. Therefore, active, safe, and effective blood glucose control has positive significance for preventing complications, improving the quality of life of T2DM patients, and reducing the economic burden on patients and society. Fasting blood glucose (FBG) and glycosylated hemoglobin (HbA1c) are the key indicators for clinical diagnosis and evaluation of treatment effect of T2DM. They are used to detect blood glucose and both are important indicators for diagnosing diabetes and reflecting the prognosis of diabetes⁶.

With the continuous development of database and data mining technology, data mining is more and more used to mine medical databases efficiently⁷. The existing data mining technology application research shows that the model established by data mining has high accuracy⁸. There are many researches on building prediction models based on data mining technology to predict diabetes and its complications, so as to prevent diabetes^9,10. However, in addition to the prediction of diabetes, the control of blood glucose after treatment is also a matter of concern. The improvement of blood glucose in T2DM patients is related to individual differences, because the factors affecting the treatment results of T2DM involve physiology, pathology, diet structure, lifestyle and other aspects. Clinicians need to give individualized treatment plan according to the patient's own situation to ensure the best effect^11,12.

At present, prediction models based on machine learning algorithms are mainly used for the prediction of diabetes and its complications^13,14, and there are few studies on the prediction models of patients' glycemic control after medication. Therefore, it is urgent to establish an efficient, accurate and economical prediction model of T2DM treatment results, and improve the treatment rate of T2DM in medical institutions at all levels. Based on this, this study intends to establish artificial intelligence prediction models for the compliance of two blood glucose indicators in T2DM patients after 3 months of treatment through data mining, to explore potential predictive relationships between FBG and HbA1c, improve the treatment rate and control rate of T2DM, reduce the incidence of adverse reactions, and prevent and reduce the occurrence of complications.

Materials and methods

Study design and data source

The data of this study were obtained from the Public Health Service System and the Medical Record Homepage Management System of the Health Information Center of Sichuan Province, China (including personal basic information form, health check-up form, and follow-up service record form), and the overall data were derived from patients who received anti-diabetic drugs or had the International Classification of Diseases Tenth Revision (ICD-10) code¹⁵ for type 2 diabetes between January 2015 and December 2020.

A total of 375,723 T2DM patients' related diagnosis and treatment data were collected in this study, and the available data for constructing the FBG prediction model and the HbA1c prediction model were screened according to the following criteria: if the same patient had 2 or more registration data within 3 ± 1 months, the patient's data was available; If there were multiple sets of data within 3 ± 1 months, the data closest to 3 months from the baseline should be taken; if there were multiple sets of consistent data longitudinally for the same patient (a patient had multiple sets of data that met the requirements of having 2 or more data within 3 ± 1 months), a group was randomly selected for inclusion. Determination of glycemic control status: to facilitate the development of a predictive model, this study converted the continuous measurements of FBG and HbA1c into discrete categorical outcomes. According to the relevant guidelines¹⁶, the FBG threshold range was defined as [4.4–7.0]. A value of 1 was assigned to patients whose FBG values fell within the well-regulated range (FBG: 4.4–7.0), while a value of 0 was assigned to those outside this range. Similarly, the HbA1c threshold was set at 7%. A value of 1 was attributed to patients who achieved controlled HbA1c levels (HbA1c < 7), and a value of 0 was assigned to those who did not achieve the desired HbA1c control (HbA1c ≥ 7).

The data included the patient's basic information, drug use, test indicators, and living and diet, as well as the actual follow-up of the patient after treatment. This study used a unique ID to identify patient connection information, and all research operations carried out would not be traced to the individual patient, and the patient's sensitive personal information (such as name, phone number, address, work unit, responsible doctor, etc.) would be deleted. All files were encrypted during transmission and use, and documents were received by a password. This study has passed the ethical review, the approval document in Supplementary Fig. 1.

In this study, a total of 511 variables were included, which were named X1–X511 for statistical convenience (Detailed variables are shown in Supplemental Table 1). Data analysis was performed using named variables, and the variable names were restored after the model evaluation process was complete.

Table 1 Baseline characteristics of participants.

Full size table

Data cleaning

We deleted variables with a missing ratio of 90%, a single category ratio of 90%, and variables with a coefficient of variation less than 0.1. These variables had little impact on the establishment of the model, and the analysis was meaningless, so they were deleted. Two methods were used for inputting missing data: not inputting and modified random forest inputting . After the data were inputted, if there was a large difference between the positive and negative sample sizes, the data was balanced by sampling. And we modified outliers to the maximum or minimum value of the norm.

The method of “not inputting” was to delete the missing columns and the missing rows in the data in turn, and finally, we got the data without missing values. The “modified random forest inputting” meant that by continuously introducing the inputted columns into the model, as the amount of data continued to accumulate, the obtained values had a higher accuracy rate, which could achieve a more accurate prediction of missing values.

After the data were inputted, the data were divided into training and test sets for machine learning. And the number of training sets accounted for 80% of the total sample size, and the number of test sets accounted for 20% of the total sample size.

Feature screening

The data were screened using three methods: Not screening, Lasso screening, and Boruta screening. Feature screening is an important aspect of model building, which helps to exclude relevant variables, biases, and limitations of unnecessary noise, making the final analysis results closer to reality. Lasso are a useful atheoretical approach for both developing predictive models and selecting key indicators within an often substantially larger pool of available indicators by inputting all latent variables at the same time, reducing bias caused by unimportant variables, and selecting only the most important variables from a potentially large initial pool¹⁷. Boruta screening is also a popular method at present¹⁸. It uses the random forest algorithm to extract feature variables, disrupt the sequence of feature variables, and calculate the importance of feature variables¹⁹.

Model training

16 kinds of machine learning algorithms were used for model training, and the data after feature screening were modeled respectively. The specific machine learning algorithm models used included: Logistic regression, Decision Tree, Random Forest, Extra Tree, Stochastic Gradient Descent (SGD), Gaussian Naive Bayes, Bernoulli Naive Bayes, Multinomial Naive Bayes, Quadratic Discriminant Analysis (QDA), Linear Discriminant Analysis (LDA), Passive Aggressive, AdaBoost, Bagging, Gradient Boosting, XGBoost, and Ensemble Learning (The introduction and comparison of various machine learning algorithms are detailed in the references^20,21). In 2011, Tianqi Chen and Carlos Guestrin first proposed the XGBoost algorithm. It is a machine learning model that achieves stronger learning effects by integrating multiple weak learners²², and has better flexibility and scalability. Compared with general machine learning algorithms, the XGBoost model shows strong advantages. These machine learning algorithms have their strengths, among which, the ensemble learning model is an evaluation index based on the trained model, summarizing the best model and outputting according to the voting principle. The evaluation indicators of the prediction model included Area Under Curve (AUC), Accuracy, Precision, Recall, and F1 Score. According to the machine learning results, the 5 models with the best prediction performance were selected and their receiver operating characteristic curve (ROC) and P-R curves were drawn.

Model verification

Ten-fold cross-validation and bootstrapping sampling were used to verify the impact of different preprocessing algorithms and different machine learning algorithms on the prediction of building FBG and HbAlc models²³. The model with the largest AUC was selected and constructed using 10 subsets (randomly drawn 10%–100% of the total sample size) to assess the effect of different sample sizes on predictive power. Each subset was split 4:1 into a training set and a test set, and the AUC calculated from the test set was used for sample size checking. By transforming randomly sampled data, 10 independent replicates were generated for each model.

A process framework of the data flow is shown in Fig. 1. Data flowed through each node according to a predetermined schedule.

Statistical analysis

Continuous variables were expressed as mean ± standard deviation, and count variables were expressed as frequency. Differences between quantitative data were tested using a t-test and rank test. Hypothesis testing was used to investigate the influence of different data processing methods and algorithms on the model prediction performance. On the analysis results of bootstrapping sampling and validation set, hypothesis testing single factor analysis was performed. The analysis content included different data inputting methods, feature screening methods, and the corresponding mean ± standard deviation and 95% confidence interval between the three dimensions of the machine learning model and the five evaluation indicators (AUC, Accuracy, Precision, Recall, and F1 Score) and p-value.

Excel 2016 was used for summarizing data, and all statistical analyses were performed using Python 3.8.

The transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement for prediction model development is in Supplemental Table 10.

Results

Baseline characteristics

The FBG study cohort included 100,000 patients and the HbA1c study cohort included 2,169 patients. In the FBG study cohort, the sample sizes assigned to 1 and 0 are 50,000, respectively, and in the HbAlc study cohort, the sample sizes assigned to 1 and 0 are 1,027 and 1,142, respectively. Baseline demographic, clinical, laboratory and medication details are shown in Table 1. The mean ages of the two cohorts were 64.0 ± 10.1 years and 63.1 ± 10.2 years, respectively. The most common comorbidities in both cohorts were hypertension, kidney disease, and heart disease. At baseline, FBG was 8.6 ± 3.9 mmol/L and 8.9 ± 3.8 mmol/L, respectively, and HbA1c were both 7.8 ± 2.8%.

Variable and feature screening

426 and 432 variables were removed from the FBG prediction model and the HbA1c prediction model, respectively, during data cleaning, and the specific variables are shown in Supplementary Table 2. Therefore, a total of 85 and 79 variables were finally used for modeling, respectively (Supplementary Table. 3). After inputting data with missing variables, the positive and negative samples are relatively balanced, so no sampling is required.

The results of the feature screening are shown in Table 2. The results showed that the most important indicators of the FBG and HbA1c prediction model were the FBG value and HbA1c. The patient's medication compliance, follow-up, dietary habits, BMI, and waist circumference also had a greater impact on the FBG level. In addition, the feature selection results also showed that patients with hypertension or other comorbid diseases, laboratory indicators of related diseases such as Scr, serum alanine aminotransferase, blood urea nitrogen, platelets, and white blood cells, as well as age, smoking, all had a certain degree of influence on FBG. For the HbA1c prediction model, the feature screening results showed that laboratory indicators such as platelets, Scr, AST, Hb, AST, etc. accounted for a large proportion. According to the results of Not inputting and Lasso screening, the feature importance of the data is drawn (Fig. 2), and the feature importance bar chart drawn by other inputting and screening methods is shown in supplementary Figs. 2–7.

Table 2 Feature screening results.

Full size table

Model performance

The machine learning results are shown in Tables 3 and 4. The inputting methods used by all best models were all modified random forest inputting: The optimal feature screening method was Not screening and Boruta screening The optimal machine learning methods were ensemble learning and XGBoost. The AUC value of the best model for FBG was model 1 (AUC = 0.8190); the worse one was model 5 (AUC = 0.8082). The AUC value of the best model for HbA1c was model 1 (AUC = 0.9704); the worse one was model 5 (AUC = 0.9674). The AUC values of the ten best models were all greater than 0.75, indicating that the prediction model had good prediction performance, and had the possibility of certain clinical application. The ROC curves and P-R curves of the five best models are shown in Figs. 3 and 4.

Table 3 Predictive model building results.

Full size table

Table 4 Machine learning algorithm ten-fold cross-validation analysis results.

Full size table

The effect of different data processing methods on the results

The results of the ten-fold cross-validation analysis of the data inputting method for FBG are shown in Supplemental Table 4. The modified random forest inputting had the greater impact on the model effect, the AUC values for the FBG and HbA1c models were 0.749 ± 0.044 (95%CI = 0.745–0.753) and 0.901 ± 0.078 (95%CI = 0.894–0.907). The results of the ten-fold cross-validation analysis of the feature screening of the validation set showed that (Supplemental Table 5), Lasso screening had the greatest impact on the model effect, the AUC values for the FBG, and HbA1c models were 0.728 ± 0.038 (95%CI = 0.725–0.731) and 0.776 ± 0.130 (95%CI = 0.766–0.785).

The analysis results of the data inputting method in the bootstrapping sampling showed that the modified random forest inputting had a greater impact on the model effect (Supplemental Table 6), the AUC values for the FBG and HbA1c models were 0.754 ± 0.048 (95%CI = 0.754–0.755) and 0.902 ± 0.083 (95%CI = 0.902–0.903). The results of feature screening analysis in bootstrapping sampling showed that Boruta screening had the greatest impact on the FBG model effect, with an AUC value of 0.732 ± 0.040 (95%CI = 0.731–0.732). For HbA1c, the greatest impact analysis screening method in bootstrapping sampling was Lasso screening, with an AUC value of 0.772 ± 0.131 (95%CI = 0.772–0.773) (Supplemental Table 7).

The effect of different algorithms on the results

Hypothesis testing was used to examine the impact of different algorithms on the model's predictive performance. The ten-fold cross-validation results for FBG and HbA1c prediction model are shown in Supplemental Table 8. The results showed that XGBoost had the greatest impact on the model effect, the mean AUC values for the FBG and HbA1c models were 0.761 ± 0.029 (95% CI = 0.756–0.766) and 0.802 ± 0.159 (95% CI = 0.773–0.831). The results of bootstrapping sampling analysis (Supplemental Table 9) showed that among the 16 machine learning models used in this section, ensemble learning had the greatest impact on the model effect, the mean AUC values of FBG and HbA1c are respectively 0.767 ± 0.029 and 0.843 ± 0.11.

Discussion

The factors that affect the treatment results of diabetes are very complex, including age, gender, weight, course of disease, eating habits, lifestyle, organ function, etc. Therefore, the treatment of T2DM needs to consider multiple risk factors according to the actual diagnosis and treatment of patients, and provide appropriate treatment plans for different patients. Effectively evaluating and predicting the improvement of blood glucose after treatment can help clinicians better provide individualized treatment services for patients, while FBG and HbAlc are the most commonly accepted indicators to measure the improvement of blood glucose. Our research is based on machine learning methods and aims to build artificial intelligence prediction models for FBG and HbAlc in patients with T2DM. By analyzing the influencing factors of related blood glucose control indicators in T2DM patients, we established two prediction models for FBG and HbAlc, and they can assist clinical treatment and T2DM patient management, to allow early adjustment of the treatment plan and improve the treatment rate and control rate of T2DM. Our analysis results show that: (1) FBG, HbA1c, medication compliance, and dietary habits have a greater impact on both prediction models; (2) The multi-parameter predictive risk models incorporate variables from different domains, including baseline demographics, complications, and laboratory tests, and can accurately predict three-month FBG values and HbAlc values; (3) In the machine learning-driven algorithm, the optimal models both adopt the ensemble learning algorithm. The influence of different algorithms and data processing methods on the results shows that the algorithms that have the greatest impact on the model effect are ensemble learning and XGBoost, and the best inputting method is the modified random forest inputting. Ensemble learning is an advanced machine learning strategy that can improve classification performance and generalization by combining multiple models²⁴. Nemat H et al. utilized deep learning and ensemble learning to predict blood glucose levels, and compared the performance of the proposed ensemble model with the non-ensemble model, and the results showed that the developed ensemble model outperformed the non-ensemble baseline model²⁵. In our research, we combine the remaining model indicators that have been trained, and the integrated model indicators have great advantages.

From the feature screening results, this study is consistent with other studies, FBG and HbAlc are both the most important predictors. Del Parigi A et al.²⁶ used several machine learning algorithms to find predictors of glycemic control in diabetes and found that HbA1c and FPG were the strongest predictors of achieving glycemic control. This is consistent with our findings. For this result, we explain that current FBG and HbA1c values have important effects on future FBG and HbA1c values, respectively. In our study, FBG and HbAlc were mutually important predictors, indicating an important correlation between glycated hemoglobin and fasting blood glucose. The reason maybe is that once the glucose in human blood combines with hemoglobin to form glycosylated hemoglobin, it will age with the aging of red blood cells, which is the product of an irreversible glycation reaction. The contact time between blood glucose and hemoglobin and the content of blood glucose can determine the level of HbA1c, so the content of HbA1c is positively correlated with the blood glucose content of diabetic patients, which may have the ability to predict each other. Studies have shown that postprandial blood glucose and fasting blood glucose are closely related to glycosylated hemoglobin, and for poorly controlled diabetic patients, the greater the value of HbA1c, the greater the contribution of fasting blood glucose value²⁷. Wang J et al²⁸ .established a blood glucose prediction model. After feature screening, the top six indicators were: age, fasting glutamate transaminase (ALT), blood urea nitrogen (BUN), total protein (TP), uric acid (UA), and BMI. BMI is also in the top ten important features in this study, and the rest of the indicators did not enter the top ten in importance. However, our study also found that ALT and BUN will have a certain degree of influence on blood glucose. Wang YS et al. established a T2DM prediction model in western Xinjiang, China, and used Lasso screening for feature screening²⁹. The study showed that age, family history of T2DM, waist circumference, TC, TG, BMI, HDL-C, and previous history of hypertension had a significant impact on FBG. These factors are included in the feature selection results of our study. Chien KL et al³⁰ used multiple logistic regression to predict HbA1c and found that both waist circumference and BMI were associated with abnormal glycated hemoglobin levels. Age, family history of diabetes, systolic blood pressure, and biochemical markers including C-reactive protein and triglycerides were significantly associated with higher glycated hemoglobin levels. In our study, waist circumference and BMI had important effects on HbA1c, as did age and hypertension, but the study did not take into account enough variables, and our accuracy is higher. In addition, our study, based on a large sample of physical examination and follow-up data, found that patients' medication compliance, follow-up conditions, and living habits (including dietary habits and smoking) had a greater impact on blood glucose control. The results better clarify the importance of primary prevention of T2DM, which is to focus on changing environmental factors and lifestyles, reducing calorie intake, maintaining a low-salt, low-sugar, high-fiber diet, quitting smoking, limiting alcohol, and getting daily moderate exercise. At the same time, the results of this study also show the importance of follow-up for secondary and tertiary prevention of T2DM. Pourat N et al³¹ conducted an observational study and found that timely linking behavioral health patients to outpatient follow-up after hospitalization is an effective care transition strategy that may reduce readmission rates. Tong L et al³² also concluded that follow-up was associated with a reduced risk of readmission. Patients benefited the most from outpatient follow-up because face-to-face conversations allowed more information (both therapeutic and emotional) to be exchanged with patients and better individualized care for patients. In addition, adverse reaction monitoring during the follow-up process can timely detect the risk of hypoglycemia in patients, which brings greater benefits to patients.

Most studies tend to use machine learning algorithms such as decision trees, random forests, SVM, logistic regression, and neural networks to build T2DM prediction models, with AUC values ranging from 0.7 to 0.9^33,34. Wang J et al. adopted three commonly used machine learning algorithms (RF, SVM, and BP-ANN) combined with the elastic network (EN) to simulate and predict blood glucose status in China. The AUCs of RF, SVM and BP were 0.75, 0.72 and 0.72, respectively³⁵. In a study of T2DM prediction models in Australia by Zhang L et al³⁶, the model built using XGBoost had the best prediction ability, with a 3-year prediction model AUC value of 0.78 and a 10-year AUC value of 0.75. Xue M et al³⁷ established a T2DM prediction model using algorithms such as decision trees, random forests, AdaBoost with decision trees (AdaBoost), and extreme gradient boosting decision trees (XGBoost), and XGBoost had the best performance (AUC = 0.968). Usually, the AUC value is above 0.8, showing a good classification effect³⁸. The AUC values of the five optimal FBG models obtained in our study are all greater than 0.8 and the AUC values of the five optimal HbA1c models obtained in our study are all greater than 0.9. Based on incorporating 100,000 pieces of data, 85 variables, and 16 machine learning algorithms for research, we obtained the FBG prediction model with the best AUC value of 0.819 and the HbA1c prediction model with the best AUC value of 0.970, indicating that these prediction models have better performance and better clinical prediction ability. The establishment of these two models can input the current gap in the prediction model of individualized treatment of T2DM patients, provide new ideas and methods for T2DM treatment, and provide T2DM patients with efficient and accurate individualized treatment plans to solve the real health problems of patients. In addition, this study explores the T2DM prediction model based on real-world medical data mining, uses multiple classifiers for comparative research, and selects the optimal model to ensure the optimization of the model, which effectively makes up for the current shortcomings of using a single classifier. Therefore, this study comprehensively and completely demonstrated the process of predicting the outcome of T2DM drug treatment with the help of data mining technology under the background of real-world research and provided a good methodological reference for the management of other chronic diseases.

Strengths

1.
The data in this study came from the Public Health Service System and the Medical Record Homepage Management System of the Health Information Center of Sichuan Province. The data quality is reliable enough to meet the needs of modeling.
2.
In the process of data cleaning, this study is not limited to a single data preprocessing method but uses a variety of inputting methods, feature screening methods, and various data preprocessing methods and applies them to the data cleaning of each predictive indicator. The process avoids the possible impact of a certain data preprocessing method on the modeling effect.
3.
The modeling method has been improved in this study. Different from the previous use of one or several algorithms to build predictive models, our study used more than ten machine learning algorithms for modeling and selected the optimal five models. The model results are more reliable and the prediction performance is better.
4.
The information included in the study is more comprehensive, including the basic information about patients, disease-related factors, treatment factors, metabolic index factors, and lifestyle-related factors.

Limitations

1.
This study only uses medical data from Sichuan Province, China for modeling. Differences in lifestyle and ethnicity in different regions may lead to a limited scope of application of the model.
2.
Although the data set used for validation in this study is independent of the data set used for model development, the two are derived from practice records in the same database, and no more rigorous prospective external validation has been performed.
3.
In this study, the classification of some variables may be wrong because the system automatically recognizes variables with more than 10 categories as continuous variables, but this operation will not affect the prediction effect of the final prediction model.
4.
In this study, the AUC values of the two optimal prediction models differed by 0.1, possibly due to too little HbA1c data for modeling. In future studies, if the amount of data used to build the model can be increased, the difference in values may be reduced.
5.
Some predictive factors, such as the course of diabetes, are not recorded in detail in the original database, so these variables are not included in the modeling, which may have an impact on the prediction results. In the following research, we will use more extensive and detailed data for modeling to obtain a more accurate prediction model.

Conclusion

In this study, the three-month FBG prediction model and three-month HbA1c prediction model for T2DM patients were constructed using population data from Sichuan Province, China. The patient's FBG and HbA1c are both the most important predictors of two kinds of prediction models. This research can provide a methodological reference for other prediction models. The AUC values of the five best FBG prediction models finally established are all greater than 0.8, and the AUC values of the five best HbA1c prediction models finally established are all greater than 0.9, which could accurately predict the FBG and HbA1c for clinical applications.

In our future plans, we intend to establish a web-based prediction platform that will integrate the best prediction models. This platform will allow the input of various factors strongly associated with glycemic control in T2DM, such as individual characteristics, disease status, medication information, and laboratory test results. Using the prediction platform, the prediction model can be applied in clinical practice for forecasting FBG and HbA1c control in patients, and our institution conducts preliminary application studies. Actually, it's challenging to ensure the absolute correctness of model predictions. Therefore, in clinical practice, the results of predictive models can be combined with the empirical knowledge of clinicians and pharmacists to help them make decisions to improve patient outcomes.

Data availability

The data of the Public Health Service System and the Medical Record Homepage Management System of the Health Information Center of Sichuan Province, China used to support the findings of this study have not been made available because the availability of these data is only licensed under the current study and therefore not publicly available. Data are however available from the authors upon reasonable request and with permission of Sichuan Provincial Health Commission.

Abbreviations

ALT:: Alanine aminotransferase;
AST:: Aspartate aminotransferase
AUC:: Area under the curve
BMI:: Body mass index
BP-ANN:: Back-propagation artificial neural network
BUN:: Blood urea nitrogen
CI:: Confidence intervals
CPM:: Cycles per minute
DBP:: Diastolic blood pressure
EN:: Elastic network
FBG:: Fasting blood glucose
Fig:: Figure
Hb:: Hemoglobin
HbA1c:: Glycosylated hemoglobin
HBP:: High blood pressure
HDL-C:: High density lipoprotein cholesterol
LDA:: Linear discriminant analysis
P-R curve:: Precision-recall curve
QDA:: Quadratic discriminant analysis
RF:: Random forest
ROC:: Receiver operating characteristic curve
SBP:: Systolic blood pressure
Scr:: Serum creatinine
SD:: Standard deviation
SGD:: Stochastic Gradient Descent
SVM:: Support vector machine
TC:: Total cholesterol
TRIPOD:: Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis
TG:: Triglyceride
TP:: Total protein
UA:: Uric acid

References

American Diabetes Association Professional Practice Committee. 2. Classification and Diagnosis of Diabetes: Standards of Medical Care in Diabetes-2022. Diabetes care, 45(Suppl 1), S17–S38 (2022).
Sun, H. et al. IDF diabetes atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res. Clin. Pract. 183, 109119 (2022).
Article PubMed Google Scholar
Ogurtsova, K. et al. IDF diabetes Atlas: Global estimates of undiagnosed diabetes in adults for 2021. Diabetes Res. Clin. Pract. 183, 109118 (2022).
Article PubMed Google Scholar
Zheng, Y., Ley, S. H. & Hu, F. B. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 14(2), 88–98 (2018).
Article PubMed Google Scholar
Harding, J. L. et al. Global trends in diabetes complications: a review of current evidence. Diabetologia 62(1), 3–16 (2019).
Article PubMed Google Scholar
Jia, W. et al. Standards of medical care for type 2 diabetes in China 2019. Diabetes Metab. Res. Rev. 35(6), e3158 (2019).
Article PubMed Google Scholar
Xiao, W. et al. Different data mining approaches based medical text data. J. Healthcare Eng. 2021, 1285167 (2021).
Article Google Scholar
Makridakis, S., Spiliotis, E. & Assimakopoulos, V. Statistical and machine learning forecasting methods: Concerns and ways forward. PLoS ONE 13(3), e0194889 (2018).
Article PubMed PubMed Central Google Scholar
Fregoso-Aparicio, L., Noguez, J., Montesinos, L. & García-García, J. A. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol. Metab. Syndr. 13(1), 148 (2021).
Article PubMed PubMed Central Google Scholar
Zhu, T. et al. Deep learning for diabetes: A systematic review. IEEE J. Biomed. Health Inform. 25(7), 2744–2757 (2021).
Article PubMed Google Scholar
Perreault, L., Skyler, J. S. & Rosenstock, J. Novel therapies with precision mechanisms for type 2 diabetes mellitus. Nat. Rev. Endocrinol 17(6), 364–377 (2021).
Article CAS PubMed Google Scholar
Peter, P. R. & Lupsa, B. C. Personalized management of type 2 diabetes. Curr. Diab. Rep. 19(11), 115 (2019).
Article PubMed Google Scholar
Ou, S. M. et al. Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms. BioData Min. 16(1), 8 (2023).
Article PubMed PubMed Central Google Scholar
Wang, S. et al. Comparative study on risk prediction model of type 2 diabetes based on machine learning theory: a cross-sectional study. BMJ Open. 13(8), e069018 (2023).
Article PubMed PubMed Central Google Scholar
WHO. World Health Organization. ICD-10 version: 2010, 2010. Available from https://icd.who.int/browse10/2019/en. Accessed 16 March 2021.
Chinese Elderly Type 2 Diabetes Prevention and Treatment of Clinical Guidelines Writing Group; Geriatric Endocrinology and Metabolism Branch of Chinese Geriatric Society; Geriatric Endocrinology and Metabolism Branch of Chinese Geriatric Health Care Society; Geriatric Professional Committee of Beijing Medical Award Foundation; National Clinical Medical Research Center for Geriatric Diseases (PLA General Hospital). Zhonghua Nei Ke Za Zhi. 2022;61(1):12–50. https://doi.org/10.3760/cma.j.cn112138-20211027-00751
Greenwood, C. J. et al. A comparison of penalised regression methods for informing the selection of predictive markers. PLoS ONE 15(11), e0242730 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kursa, M. B. & Rudnicki, W. R. Feature selection with the Boruta package. J. Stat. Softw 36, 1–13 (2010).
Article Google Scholar
Maurya, N. S. et al. Prognostic model development for classification of colorectal adenocarcinoma by using machine learning model based on feature selection technique boruta. Sci. Rep. 13(1), 6413 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Bzdok, D., Krzywinski, M. & Altman, N. Machine learning: supervised methods. Nat. Methods 15(1), 5–6 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bi, Q., Goodman, K. E., Kaminsky, J. & Lessler, J. What is machine learning? A primer for the epidemiologist. Am. J. Epidemiol. 188(12), 2222–2239 (2019).
PubMed Google Scholar
Zhao, Z. et al. Identify DNA-binding proteins through the extreme gradient boosting algorithm. Front. Genet. 12, 821996 (2022).
Article PubMed PubMed Central Google Scholar
Kernbach, J. M. & Staartjes, V. E. Foundations of machine learning-based clinical prediction modeling: Part II-generalization and overfitting. Acta Neurochir. Suppl. 134, 15–21 (2022).
Article PubMed Google Scholar
Jin, L. P. & Dong, J. Ensemble deep learning for biomedical time series classification. Comput. Intell. Neurosci. 2016, 6212684 (2016).
Article PubMed PubMed Central Google Scholar
Nemat, H. et al. Blood glucose level prediction: Advanced deep-ensemble learning approach. IEEE J. Biomed. Health Inform. 26(6), 2758–2769 (2022).
Article PubMed Google Scholar
Del Parigi, A. et al. Machine learning to identify predictors of glycemic control in type 2 diabetes: An analysis of target HbA1c reduction using empagliflozin/linagliptin data. Pharmaceut. Med. 33(3), 209–217 (2019).
MathSciNet PubMed Google Scholar
Makris, K. et al. Relationship between mean blood glucose and glycated haemoglobin in Type 2 diabetic patients. Diabetic Med. 25(2), 174–178 (2008).
Article CAS PubMed Google Scholar
Wang, J. et al. Multiple linear regression and artificial neural network to predict blood glucose in overweight patients. Exp. Clin. Endocrinol. Diabetes 124(1), 34–38 (2016).
Article CAS PubMed Google Scholar
Wang, Y. et al. Nomogram model for screening the risk of type II diabetes in Western Xinjiang, China. Diabetes Metabolic Syndrome Obes 14, 3541–3553 (2021).
Article Google Scholar
Chien, K. L. et al. Prediction model for high glycated hemoglobin concentration among ethnic Chinese in Taiwan. Cardiovasc. Diabetol. 9, 59 (2010).
Article PubMed PubMed Central Google Scholar
Pourat, N. et al. Timely outpatient follow-up is associated with fewer hospital readmissions among patients with behavioral health conditions. J. Am. Board Fam. Med. 32(3), 353–361 (2019).
Article PubMed Google Scholar
Tong, L. et al. The association between outpatient follow-up visits and all-cause non-elective 30-day readmissions: A retrospective observational cohort study. PLoS ONE 13(7), e0200691 (2018).
Article PubMed PubMed Central Google Scholar
De Silva, K. et al. Use and performance of machine learning models for type 2 diabetes prediction in clinical and community care settings: Protocol for a systematic review and meta-analysis of predictive modeling studies. Digital Health 7, 20552076211047390 (2021).
Article PubMed PubMed Central Google Scholar
Kim, H., Lim, D. H. & Kim, Y. Classification and prediction on the effects of nutritional intake on overweight/obesity, dyslipidemia, hypertension and type 2 diabetes mellitus using deep learning model: 4–7th Korea national health and nutrition examination survey. Int. J. Environ. Res. Public Health 18(11), 5597 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, J. et al. Status of glycosylated hemoglobin and prediction of glycemic control among patients with insulin-treated type 2 diabetes in North China: a multicenter observational study. Chin. Med. J. 133(1), 17–24 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, L. et al. Predicting the development of Type 2 diabetes in a large Australian cohort using machine-learning techniques: longitudinal survey study. JMIR Med. Inform. 8(7), e16850 (2020).
Article PubMed PubMed Central Google Scholar
Xue, M. et al. Identification of potential Type II diabetes in a large-scale chinese population using a systematic machine learning framework. J. Diabetes Res. 2020, 6873891 (2020).
Article PubMed PubMed Central Google Scholar
Mandrekar, J. N. Receiver operating characteristic curve in diagnostic test assessment. J. Thoracic Oncol. 5(9), 1315–1316 (2010).
Article Google Scholar

Download references

Acknowledgements

We would like to thank Research Square for providing us with a preprint platform so that more researchers can read our manuscript (DOI:https://doi.org/10.21203/rs.3.rs-1868105/v1).

Funding

E.W.L. was funded by the Sichuan Provincial Department of Science and Technology (Grant No. 2021YFS0197). X.W.W. was supported by the National Natural Science Foundation of China (grant number: 72004020).

Author information

Authors and Affiliations

Personalized Drug Therapy Key Laboratory of Sichuan Province, Department of Pharmacy, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610072, Sichuan, China
Xue Tao, Jiaqiang Hu, Wenmei Guo, Xingwei Wu, Xia Shi, Wenyuan Li, Rongsheng Tong & Enwu Long
West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, 610044, Sichuan, China
Min Jiang
Department of Pharmacy, Daping Hospital, Army Medical University, Chongqing, 400042, China
Yumeng Liu
Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, 610072, Sichuan, China
Qi Hu
School of Pharmacy, Southwest Medical University, Luzhou, 646000, Sichuan, China
Baoqiang Zhu
Institute of Materia Medica, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, 100050, China
Yu Xiong
Sichuan Provincial Health Information Center, Chengdu, 610015, Sichuan, China
Xueli Zhang & Xu Han

Authors

Xue Tao
View author publications
You can also search for this author in PubMed Google Scholar
Min Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yumeng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Qi Hu
View author publications
You can also search for this author in PubMed Google Scholar
Baoqiang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Jiaqiang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Wenmei Guo
View author publications
You can also search for this author in PubMed Google Scholar
Xingwei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yu Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Xia Shi
View author publications
You can also search for this author in PubMed Google Scholar
Xueli Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xu Han
View author publications
You can also search for this author in PubMed Google Scholar
Wenyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Rongsheng Tong
View author publications
You can also search for this author in PubMed Google Scholar
Enwu Long
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.W.L. and X.T. contributed to the idea and design of the study. M.J. and B.Q.Z. conducted the literature search. Y.M.L. and Q.H. conducted the data analysis and created the figures. Y.M.L. wrote the manuscript with support from W.M.G., M.J., X.T., X.W.W. and Y.X., X.L.Z., X.H., J.Q.H., X.S., R.S.T. and E.W.L. provided critical feedback on the analysis and its interpretation and commented on the drafted manuscript. W.Y.L., R.S.T., X.W.W. and E.W.L. provided final approval of the manuscript for publication. XT is responsible for the integrity of the work as a whole.

Corresponding author

Correspondence to Enwu Long.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tao, X., Jiang, M., Liu, Y. et al. Predicting three-month fasting blood glucose and glycated hemoglobin changes in patients with type 2 diabetes mellitus based on multiple machine learning algorithms. Sci Rep 13, 16437 (2023). https://doi.org/10.1038/s41598-023-43240-5

Download citation

Received: 18 July 2023
Accepted: 21 September 2023
Published: 30 September 2023
DOI: https://doi.org/10.1038/s41598-023-43240-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Early detection of type 2 diabetes mellitus using machine learning-based prediction models

Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study

Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods

Introduction

Materials and methods

Study design and data source

Data cleaning

Feature screening

Model training

Model verification

Statistical analysis

Results

Baseline characteristics

Variable and feature screening

Model performance

The effect of different data processing methods on the results

The effect of different algorithms on the results

Discussion

Strengths

Limitations

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links