A prediction nomogram for the 3-year risk of incident diabetes among Chinese adults

Wu, Yang; Hu, Haofei; Cai, Jinlin; Chen, Runtian; Zuo, Xin; Cheng, Heng; Yan, Dewen

doi:10.1038/s41598-020-78716-1

Download PDF

Article
Open access
Published: 10 December 2020

A prediction nomogram for the 3-year risk of incident diabetes among Chinese adults

Yang Wu^1,3,5^na1,
Haofei Hu^2,4,5^na1,
Jinlin Cai^1,3,6,
Runtian Chen^1,3,5,
Xin Zuo⁷,
Heng Cheng⁷ &
…
Dewen Yan^1,3,5

Scientific Reports volume 10, Article number: 21716 (2020) Cite this article

2150 Accesses
21 Citations
Metrics details

Subjects

Abstract

Identifying individuals at high risk for incident diabetes could help achieve targeted delivery of interventional programs. We aimed to develop a personalized diabetes prediction nomogram for the 3-year risk of diabetes among Chinese adults. This retrospective cohort study was among 32,312 participants without diabetes at baseline. All participants were randomly stratified into training cohort (n = 16,219) and validation cohort (n = 16,093). The least absolute shrinkage and selection operator model was used to construct a nomogram and draw a formula for diabetes probability. 500 bootstraps performed the receiver operating characteristic (ROC) curve and decision curve analysis resamples to assess the nomogram's determination and clinical use, respectively. 155 and 141 participants developed diabetes in the training and validation cohort, respectively. The area under curve (AUC) of the nomogram was 0.9125 (95% CI, 0.8887–0.9364) and 0.9030 (95% CI, 0.8747–0.9313) for the training and validation cohort, respectively. We used 12,545 Japanese participants for external validation, its AUC was 0.8488 (95% CI, 0.8126–0.8850). The internal and external validation showed our nomogram had excellent prediction performance. In conclusion, we developed and validated a personalized prediction nomogram for 3-year risk of incident diabetes among Chinese adults, identifying individuals at high risk of developing diabetes.

A nomogram model for predicting 5-year risk of prediabetes in Chinese adults

Article Open access 18 December 2023

A nomogram model for screening the risk of diabetes in a large-scale Chinese population: an observational study from 345,718 participants

Article Open access 14 July 2020

Prediction model and assessment of probability of incident hypertension: the Rural Chinese Cohort Study

Article 27 February 2020

Introduction

Diabetes mellitus has become a significant public health issue all over the world. Due to the aging population and unhealthy lifestyles, the prevalence of diabetes worldwide is rapidly increasing. It was estimated that there were 451 million (age 18–99 years) people with diabetes in 2017, and the number was expected to increase to 693 million by 2045¹. The global burden of disease study identified that diabetes resulted in 1.37 million deaths in 2017². Due to its high morbidity, disability and mortality, diabetes has a major impact on society, economy, and development worldwide. China has the world’s most enormous numbers of diabetic patients, reaching up to 109.6 million³. However, more than half of Chinese adults with diabetes were undiagnosed⁴.

As a debilitating chronic epidemic, early identification and diagnosis, early treatment is an essential part of diabetes prevention and health care. The central component of diabetes preventive strategies is to identify individuals at high risk for incident diabetes⁵. Studies demonstrated that lifestyle modification and pharmacological intervention could prevent or delay the occurrence of diabetes^6,7. Moreover, for newly diagnosed diabetic patients, intensive lifestyle intervention, metabolic surgery and early short-term intensive insulin therapy can induce long-term glycemic remission without further antidiabetic medication^8,9,10,11,12. Several studies have shown that early diagnosis and timely treatment can delay the progression of diabetes, delay or even prevent the occurrence of diabetes complications^13,14,15. Therefore, it is crucial to find a feasible and accurate screening tool to identify those with undiagnosed diabetes or at high risk of the onset of diabetes, which will be beneficial for the effective implementation of diabetes prevention programs.

Risk prediction models have considerable potential to contribute to the decision-making process regarding the clinical management of a patient¹⁶. The models can screen individuals to identify at an increased risk of having an undiagnosed condition, for which diagnosis management and treatment can be initiated and ultimately improve patient outcomes. A variety of risk prediction models for screening diabetes have been established, mainly applied to western populations^{17,18,19,20,21,22,23}. These predictive models may not apply to the Chinese population due to the differences in diet, lifestyle, social environment, and genetic predisposition. The least absolute shrinkage and selection operator (LASSO) method is suitable for reducing high-dimensional data and is performed to select the most useful prediction candidates^24,25. Nomogram is an intuitive graphical prediction model that can provide accurate and individualized risk predictions for each individual. However, there were only a limited number of prediction nomogram for risk of diabetes in China^26,27,28. And the existing diabetes risk prediction models incorporate many variables, which are not convenient to apply. Besides, they are mainly single-center studies, and none of them has conduct external validation. Therefore, we aimed to introduce the LASSO method to select the least and optimal variables to predict the 3-year risk of incident diabetes. Furthermore, we sought to develop and validate a personalized diabetes prediction nomogram by more cost-effective and readily available parameters in a large cohort of Chinese adults across 32 sites and 11 cities to help clinicians accurately identify individuals at high risk for diabetes and guide them in timely diabetes screening.

Materials and methods

Study design and participants

The data was obtained from a public, non-profit computerized database established by the Rich Healthcare Group in China, namely, the ‘DATADRYAD’ database (www.Datadryad.org). We downloaded the raw data shared by Chen et al.²⁹ from: Association of body mass index and age with incident diabetes in Chinese adults: a population-based cohort study. Dryad Digital Repository. http://dx.doi.org/10.1136/bmjopen-2018-021768. And the raw data is available publicly for use. The original study enrolled 685,277 participants ≥ 20 years old with at least two routine health checks from 2010 to 2016 across 32 sites and 11 cities in China (Shanghai, Beijing, Nanjing, Suzhou, Shenzhen, Changzhou, Chengdu, Guangzhou, Hefei, Wuhan, Nantong).

Variables were extracted as follows: age, gender, smoking status, drinking status, family history of diabetes, body mass index (BMI), systolic blood pressure (SBP), diastolic blood pressure (DBP), fasting plasma glucose (FPG), total cholesterol (TC), triglyceride(TG), low-density lipoprotein cholesterol(LDL-C), high-density lipoprotein cholesterol (HDL-C), serum urea nitrogen(BUN), serum creatinine(Scr), alanine aminotransferase(ALT) at baseline, years of follow up, a censor of diabetes at follow up.

The original study initially included all study participants at least 20 years old with at least two routine health checks between 2010 and 2016. Participants were excluded at baseline in the original study, as follows:(1) no available information on weight, height and gender; (2) extreme BMI values (< 15 kg/m² or > 55 kg/m²); (3) visit intervals < 2 years; (4) no available fasting plasma glucose value; (5) participants diagnosed with diabetes at baseline (participants diagnosed by self-report or diagnosed by a fasting plasma glucose ≥ 7.0 mmol/L) and participants with undefined diabetes status at follow-up. A total of 211,833 participants remained after applying the exclusion criteria in the original study. Our study further excluded participants with the missing value of baseline variables to predict the 3-year risk of incident diabetes. Figure 1 depicted the participants' selection process. Finally, our study included 32,312 subjects (20,995 male and 11,317 female) for secondary analysis.

The study was conducted in accordance with the Declaration of Helsinki and patient consent was not required, referencing the original study article³⁰.

Variable measurement

Participants were required to do a personal questionnaire on demographic, lifestyle, medical history, and family history of chronic disease in each visit to the health check center. And trained staff conducted the baseline examination, including anthropometric measurements and laboratory biochemical measurements. Weight was measured in light clothing without shoes to the nearest 0.1 kg. The height was accurate to 0.1 cm. BMI was equal to the weight divided by the square of height, which is accurate to 0.1 kg/m². And the staff used a standard mercury sphygmomanometer to measure their blood pressure. Fasting venous blood samples were taken after fasting for at least 10 h each visit. Plasma glucose levels were measured by the glucose oxidase method. The clinical measurements of FPG, TC, TG, LDL-C, HDL-C, BUN, Scr, and ALT were performed on an autoanalyzer (Beckman 5800). The data were collected under standardized conditions and conducted following uniform procedures. Laboratory methods also were carefully standardized through stringent internal and external quality controls.

Definitions

The diabetes definitions were fasting blood glucose ≥ 7.00 mmol/L and/or self-reported diabetes during follow-up. Patients were censored either at the time of the diagnosis or at the last visit, whichever comes first.

Statistical analysis

All participants were randomly stratified into the training cohort and the validation cohort. Baseline characteristics were expressed as means ± standard deviations (normal distribution) or medians (quartiles) (skewed distribution) for continuous variables and as frequency or percentages for categorical variables. Two-sample t-tests were applied to analyze differences between training cohort and validation cohort for normally distributed continuous variables, Wilcoxon rank-sum tests for non-normally distributed continuous variables, and chi-square tests for categorical variables. Standardized differences of less than 0.10 for a given covariate indicate a relatively small imbalance³¹. We also showed the baseline characteristics of the training and validation cohort stratified by the incidence of diabetes. After collinearity screening, logistic regression models were used to assess each variable's significance to investigate the independent risk factors of developing diabetes. The risk factors reported in the literature associated with incident diabetes were candidates for the multivariate analysis^{26,27,28,32,33,34,35}.

To find a simple and reliable risk prediction model, we established four models for comparison. First, we apply all risk factors to build a full model. Second, we conducted a backward step-down selection process according to the Akaike information criterion (AIC) to establish a parsimonious model (stepwise model)³⁶. Third, according to the multivariable fractional polynomials (MFP) algorithm, we used the iterative fashion to determine the significant variables and functional form by backward elimination to establish a stable model (MFP model) in the real world³⁷. The least absolute shrinkage and selection operator (LASSO) method is suitable for reducing high-dimensional data and is applied to select the most useful prediction candidates^24,25. Candidates with non-zero coefficients are selected to establish LASSO model³⁸. Considering that fewer variables in the LASSO model and the prediction performance are relatively good, we choose the LASSO model for further analysis. To evaluate and compare the discriminatory power of these prediction models, we plotted the receiver operating characteristic (ROC) curve and calculated the area under the ROC curve (AUC) with 95% confidence intervals (CI) in the training cohort and validation cohort, respectively. We simultaneously presented the sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR) of these four models calculated according to standard definitions. Sensitivity = True positive rate (TPR) = (Σ True positive)/(Σ Condition positive), Specificity = True negative rate (TNR) = (Σ True negative)/(Σ Condition negative), Accuracy = [ (Σ True positive) + (Σ True negative)] / (Σ Total population), Positive predictive value (PPV) = (Σ True positive)/(Σ Predicted condition positive), Negative predictive value (NPV) = (Σ True negative)/(Σ Predicted condition negative), False negative rate (FNR) = (Σ False negative)/(Σ Condition positive), False positive rate (FPR) = (Σ False positive)/(Σ Condition negative), Positive likelihood ratio (PLR) = TPR/FPR, Negative likelihood ratio (NLR) = FNR/TNR, DOR = PLR/NLR. Besides, we obtained a diabetic prediction formula for the LASSO model. The nomogram is based on proportionally converting each regression coefficient in multivariate logistic regression to a 0- to 100-point scale³⁹. The effect of the variable with the highest β coefficient (absolute value) is assigned 100 points. The points are added across independent variables to derive total points, converted to predicted probabilities of developing diabetes. The nomogram score is a numeric value representing the prediction model score of the individual patient. Sensitivity and specificity for predicting diabetes at different cut-off values of nomogram scores are different. Besides, we compared the predicted risk and observed a 3-year incidence of deciles of predicted diabetes risk for the training cohort in the nomogram. The predicted and actual risks in each decile were compared by the Hosmer–Lemeshow × 2 test⁴⁰. Decision curve analysis was conducted to determine the clinical use of the risk prediction model for diabetes: the proportion of the person who showed a true positive result subtracted by the proportion of the person who showed the false positive result, and then weighed the relative hazard of the false positive and false negative results to obtain a net benefit of making a decision⁴¹. Bootstraps with 500 resample were applied to ROC curve, nomogram and decision curve analysis to decrease the overfit bias^27,42. We also performed the ROC curve to analyze each risk factor of incident diabetes' performances and optimal cut-off value in the LASSO model. What’s more, we used a cohort of 12,545 Japanese participants from the NAGALA (NAfld in the Gifu Area, Longitudinal Analysis) database for the external validation. The data were also extracted from the ‘DATADRYAD’ database (www.Datadryad.org), shared by Okamura et al.⁴³ from: Ectopic fat obesity presents the greatest risk for incident type 2 diabetes: a population-based longitudinal study. Dryad Digital Repository. https://doi.org/10.1038/s41366-018-0076-3. And we did a sensitivity analysis on the overall population of the original study (n = 211,833). Multiple imputations were used to replace the missing values. All results are reported according to the TRIPOD statement⁴⁴.

All analyses were performed with the statistical software package R (http://www.R-project.org The R Foundation) and Empower-Stats (http://www.empowerstats.com, X&Y Solutions, Inc, Boston, MA). The tests were 2‐tailed, and P < 0.05 was taken as statistically significant.

Ethical approval

In the previously published article²⁹ Ying Chen, et al. has stated the study was conducted in accordance with the Declaration of Helsinki, and the Rich Healthcare Group Review Board approved the original research, and the information was retrieved retrospectively.

Results

The present study included 32,312 eligible participants (64.98% men and 35.02% women). Figure 1 depicted the participant's selection process. The mean age of all participants was 43.12 ± 12.62 years old. During the 2.66 years of the median follow-up period, a total of 296 participants developed diabetes. The mean BMI was 23.55 ± 3.30 kg/m². The mean SBP and DBP were 119.80 ± 15.83 and 74.95 ± 10.50 mmHg, respectively. The mean FPG was 4.97 ± 0.62 mmol/L. The mean HDL-C and LDL-C were 1.34 ± 0.31 and 2.74 ± 0.69 mmol/L, respectively. We excluded TC based on collinearity screening. The mean BUN and Scr were 4.71 ± 1.17 mmol/L and 72.24 ± 15.23 umol/L, respectively. The mean follow-up period was 2.66 ± 0.42 years.

Baseline characteristics of participants

Table 1 illustrated the basic demographic, anthropological, and clinical information of the eligible participants. We divided all participants into the training cohort (n = 16,219) and the validation cohort (n = 16,093). During the 2.66 years of the median follow-up period, 155 and 141 participants developed diabetes in the training and validation cohort, respectively. As for all baseline characteristics, the difference between the training cohort and the validation cohort was not statistically significant (all P > 0.05).

Table 1 Baseline characteristics of the training and validation cohorts.

Full size table

Table 2 showed the baseline characteristics of the two cohorts by incident diabetes status. The participants with incident diabetes had higher age, BMI, SBP, DBP, FPG, TG, ALT, BUN, Scr, and higher rates of ever or current smokers in the training and validation cohort (all P < 0.05). And there was no statistically significant difference in the family history of diabetes (P > 0.05).

Table 2 Baseline characteristics for the training and validation cohorts by incident diabetes status.

Full size table

Univariate and multivariate analysis

Table 3 displayed risk predictors for incident diabetes in the univariate and multivariate logistic regression analysis. The univariate analysis showed that age (OR = 1.066), female (OR = 0.421), BMI (OR = 1.238), SBP (OR = 1.039), DBP (OR = 1.042), FPG (OR = 13.925), TG (OR = 1.304), LDL-C (OR = 1.303), ALT (OR = 1.010), BUN (OR = 1.343), Scr (OR = 1.011), ever/current smoking (OR = 2.308) and family history of diabetes (OR = 1.561) was associated with incident diabetes (all P < 0.05), HDL-C, and drinking status were not correlated with diabetes (all P > 0.05). The multivariate analysis showed that age (OR = 1.047), BMI (OR = 1.122), FPG (OR = 8.564), HDL-C (OR = 1.515), ALT (OR = 1.008), ever/current smoking (OR = 1.527), and family history of diabetes (OR = 1.902) were associated with incident diabetes (all P < 0.05). However, gender, SBP, DBP, TG, LDL-C, BUN, Scr, and drinking status was not correlated with diabetes (all P > 0.05).

Table 3 Risk predictors for incident diabetes in the univariate and multivariate analysis.

Full size table

Development and validation of risk prediction models

We established four prediction models, including the full model, stepwise model, MFP model and LASSO model. 15 risk factors were reduced to 5 potential risk predictors based on the training cohort (Fig. 2A,B) that had nonzero coefficients in the LASSO model, which were less than the other three models. These potential risk predictors were age, BMI, SBP, FPG and TG. In the training cohort, AUCs of the LASSO model, full model, stepwise model and MFP model were 0.9125, 0.9155, 0.9161 and 0.9161. In the validation cohort, AUCs of the LASSO model, full model, stepwise model and MFP model were 0.9030, 0.9146, 0.9131 and 0.9131, respectively (Table 4, Table S1). The AUC of these four models were relatively close. Given that the LASSO model incorporated fewer risk factors and could predict the 3-year diabetes risk relatively well, we choose the LASSO model as the final risk prediction model for diabetes and further construct a corresponding nomogram (Fig. 3). The total nomogram score was applied to obtain the sort of probability for predicting incident diabetes. The 3-year diabetes probability was calculated by: − 23.14183 + 0.03224* age (year) + 0.10645* BMI (kg/m²) + 0.01388* SBP (mmHg) + 2.24841* FPG (mmol/L) + 0.09444* TG (mmol/L).

Table 4 Prediction performance of the nomogram for the risk of diabetes.

Full size table

Prediction performance of the LASSO model

In the training cohort and the validation cohort, AUCs of the LASSO model were 0.9125 (95% CI, 0.8887–0.9364) and 0.9030 (95% CI, 0.8747–0.9313), respectively (Table 4). At the best threshold, the sensitivity rates were 89.03% and 85.11%, and the specificity percentages were 80.11% and 82.30% for the training cohort and the validation cohort, respectively. Notably, the AUC of the prediction nomogram was internally confirmed to be relatively stable through the bootstrap validation (AUC = 0.909) (Fig. 4). The differences in AUC, sensitivity, specificity, and accuracy between the four models were relatively small, both in the training cohort and the validation cohort. The other three models' results were shown in the Supplemental Appendix (Table 4, Table S1, Fig S1).

We also evaluated how close the predicted risk was to the observed 3-year incidence of deciles of predicted diabetes risk for the nomogram's training cohort. Figure 5 illustrates the fraction of individuals in each decile of predicted risk in the training cohort. Our nomogram underestimated the 3-year risk of diabetes. However, the Hosmer–Lemeshow × 2 test showed no statistically significant difference between the predicted diabetes risk and observed diabetes (P > 0.05).

We also showed the prediction performance of each risk predictor in the nomogram, including age, BMI, SBP, FPG, TG (Table S2, Fig S2). The AUC of the prediction nomogram was greater than the AUC of each risk factor for incident diabetes. The predictive ability of other similar risk prediction models for diabetes in China was summarized in Table S3.

Optimal cut-off value for nomogram score

Table 5 showed the sensitivity and specificity for predicting diabetes at different cut-off values. At a cut-off value of 0.05, the specificity is 95.61% and the sensitivity is 61.29%. When the cut-off value increased to 0.3, the specificity increased to 99.78%, while the sensitivity drops to 12.26%. In summary, although higher cut-off values resulted in higher specificity, the sensitivity rapidly fell to a relatively low point.

Table 5 Values of sensitivity, specificity and predictive values of the nomogram scores at different cut-off values.

Full size table

Clinical use of the nomogram

Figure 6 demonstrated the result of the LASSO model's decision curve analysis in the training and validation cohorts. The black line represents the net benefit when none of the participants are considered to develop diabetes. In contrast, the light gray line represents the net benefit when all participants are considered to develop diabetes. The area between the "no treatment line" (black line) and "all treatment line" (light gray line) in the model curve indicates the clinical utility of the model. The farther the model curve is from the black and light gray lines, the better the nomogram's clinical application. Specifically, in the training cohort, if the threshold probability of a patient was 4% in the LASSO model, the net benefit was about 50%, which was equivalent to performing 50 additional diabetes screenings (such as oral glucose tolerance test) per 100 Chinese adults when without a significant change in the incidence of diabetes.

External validation

The external validation was performed on a cohort of 12,545 Japanese participants. The mean age, BMI, SBP, and FPG of the participants were 43.56 ± 8.68 years old, 22.11 ± 3.11 kg/m², 114.42 ± 14.89 mmHg, and 5.15 ± 0.41 mmol/L, respectively. The median TG was 0.75 (0.50–1.12) mmol/L. (Table S4).The AUC of the external validation was 0.849 (Fig. 7A). At the best threshold, the specificity and sensitivity rates were 81.46% and 75.25%, respectively. (Table S5). The external validation revealed that our nomogram had excellent prediction performance.

Sensitivity analysis

To perform the LASSO model's sensitivity analysis, we used multiple imputations to replace the missing values of variables of the overall population in the original study (n = 211,833). The mean age, BMI, SBP, and FPG were 42.10 ± 12.65 years old, 23.24 ± 3.34 kg/m², 119.06 ± 16.38 mmHg, and 4.92 ± 0.61 mmol/L, respectively. The median of TG was 1.07 (0.73–1.62). (Table S4). The AUC was 0.918 (Fig. 7b). At the best threshold, the specificity and sensitivity rates were 86.17% and 83.90%, respectively. (Table S5).

Discussion

In this retrospective cohort study, we developed and validated a personalized prediction nomogram for the 3-year risk of incident diabetes by cost-effective and readily available parameters among Chinese adults, helping clinicians identify individuals with a high risk of developing diabetes. The nomogram included five parameters: age, BMI, SBP, FPG, and TG. The internal and external validation showed that our nomogram had excellent prediction performance. We also summarized the sensitivity and specificity of the nomogram for predicting diabetes at different cut-off values. Decision curve analysis illustrated the clinical use of the nomogram.

Although many diabetes risk prediction models based on demographic, anthropological, and clinical information have been established, they are mainly used in European^45,46,47and American populations^48,49,50. Only a limited number of reliable diabetes prediction models were established in the Chinese population, each of which included different risk predictors. Besides, their prediction performance and clinical usefulness varied greatly. In 2019, Zeyin Lin et al.⁵¹ performed cox proportional hazards regression analysis to develop a nomogram to predict the 5-year incidence of type 2 diabetes mellitus based on age, sex, BMI, and hypertension dyslipidemia, smoking status and family history of diabetes. The C-index of the model was 0.815 (95% CI, 0.797–0.834). However, they did not conduct a decision curve analysis to evaluate the clinical usefulness of the model. Additionally, they did not try other methods to compare and screen the most suitable risk prediction model for incident diabetes. Moreover, age, BMI, TC, TG, HDL-C, and LDL-C are continuous risk predictors, and categorizing them into categories will cause detrimental information loss and affect the ability to detect real relationships^52,53. In 2019, Kun Wang et al.⁵⁴ developed a nomogram to predict the 3-year risk of T2DM in healthy mainland China residents based on age, BMI, FPG, LDL-C, HDL-C, and TG. The AUCs were 0.847 (95% CI, 0.801–0.892) and 0.755(95% CI, 0.717–0.794) for females and males, respectively. Consistent with our nomogram, their nomogram incorporated continuous predictors. Besides, they established a full model, MFP model, and stepwise model, and chose an appropriate model after comparison. However, they did not take into account family history of diabetes, smoking, and drinking history. Although our nomogram did not include them, we have considered them in the variable selection process. Besides, they did not measure how closely the predicted risk fits the actual risk. In 2015, Carlos et al.⁵⁵ developed a simple non-laboratory- and laboratory-based risk assessment algorithms and nomogram to predict undiagnosed diabetes in Hong Kong. The AUCs were 0.686 (95% CI, 0.650–0.722) for non-laboratory-based algorithm and 0.696 (95% CI, 0.661–0.731) for laboratory-based algorithm. They produced two different nomograms based on anthropometric and biochemical assessments, respectively. And each nomogram included relatively few risk predictors, which may lead to insufficient accuracy and prediction performance of the diabetes prediction model. Thus, their model's predictive ability is relatively low (AUC = 0.686 and 0.696), which revealed that we need to incorporate relatively more risk factors in developing the risk prediction model to ensure the prediction performance. Furthermore, this was a single-center study based on a professional driver community project. The cohort's inappropriate selection and relatively small sample size made it insufficient to represent the Chinese population. It is worth mentioning that none of these studies have performed external validation. Compared with the similar studies mentioned above, our nomogram filled those gaps. Our research sample size was considerable (n = 32,312), and participants were from multiple centers, so our findings may be better applied to the Chinese population. Unlike most previous Chinese DM risk scores with integer points or segmented values in China, our nomogram uses continuous variables to provide more precise and personalized risk prediction. It is worth mentioning that we constructed four models and selected the simplest and reliable LASSO model to ensure clinical practicality. Given that a nomogram could provide accurate and individualized risk prediction for each individual. According to the LASSO model, we constructed the corresponding nomogram, which makes up for the deficiencies of many other similar Chinese studies. Notably, our nomogram has an excellent prediction performance (AUC = 0.9125, 95% CI, 0.8887–0.9364). Besides, we proved no significant difference between the predicted diabetes risk and the observed incidence of diabetes.

Diabetes can cause various complications, bring severe physical and psychological distress to patients, and bring a huge burden to the healthcare system. And it tends to be undiagnosed due to the lack of specific symptoms. However, screening for diabetes through oral glucose tolerance test may increase the yield and economic efficiency of screening⁵⁶. In this study, we used the LASSO model with relatively good predictive performance to construct the nomogram. And we provided a corresponding formula to calculate the risk of diabetes based on risk predictors, which could help clinicians accurately identify individuals at high risk for diabetes, guide them in timely diabetes screening, and avoid the costs and efforts of prevention and treatment in low-risk groups. And our nomogram underestimated the 3-year risk of diabetes, so the individuals at high risk of developing diabetes identified by our nomogram are indeed at higher risk. Our nomogram items are routine clinical variables readily available to clinicians, thus allowing the nomogram to be easily adopted in practice. Furthermore, the nomogram's predictive performance was high both in the internal and external validation, which suggests its high generalizability. Notably, there were subtle differences between the AUC of our model and that of internal and external validation models. AUC of the external validation model was slightly smaller than the AUC of our nomogram (AUC = 0.849 vs. AUC = 0.913). The difference may come from the following: (1) the study populations were different, our study was performed on the Chinese, and the validation dataset was from Japanese. (2) Participants with FPG ≥ 6.1 mmol/L were excluded from the external validation cohort. (3) The outcome of the external validation cohort was T2DM. However, we could not distinguish between type 1, type 2, and other diabetes types in our model. (4) Diabetes was diagnosed as HbA1c ≥ 6.5%, FPG ≥ 7 mmol/L, or self-reported in the external validation cohort. However, the definitions of diabetes in our nomogram did not include HbA1c ≥ 6.5%. For sensitivity analysis, the AUC for the original study's overall population was close to that of our nomogram (AUC = 0.918 vs. AUC = 0.913), which showed that our study participants could represent the general population.

The risk predictors included in our nomogram were age, BMI, SBP, FPG and TG, which were also included in previous diabetes risk prediction models. Venerable age is a nonmodifiable risk factor for developing diabetes⁵⁷. Aging pancreatic β cells result in the decline of glucose sensitivity and insulin secretory defects⁵⁸. Age-related glucose intolerance is usually accompanied by insulin resistance and β-cell dysfunction⁵⁹. Obesity could increase the fat content of the liver and pancreas, which affect the function of pancreatic β cells⁶⁰. Besides, obesity leads to metabolic derangements and adipose organ dysfunction, leading to insulin resistance⁶¹. Hypertension and diabetes are often concurrent. The substantial mediators could involve inflammation, oxidative stress, endothelial dysfunction, and insulin resistance⁶². FPG is an independent risk factor of the onset of diabetes, and people with relatively high FPG had a higher risk score of diabetes in our nomogram. It may be that FPG is closely related to insulin response and insulin sensitivity⁶³. Dyslipidemia and diabetes often co-exist in the same individual. As an endocrine organ, adipose tissue can affect glucose and lipids' metabolism, and TG is the most abundant lipid in adipose tissue⁶⁴. Excess fatty tissue can release many lipid metabolites, proinflammatory cytokines, and cellular stress, which mediate insulin resistance⁶⁵. Therefore, the application of the five risk predictors in our models is well-founded.

There are some strengths in the present study, as follows: (1) The present study has a large sample size, and participants were from multiple centers. (2) We established four prediction models, including the LASSO model, full model, stepwise, and MFP models. And we selected the simplest LASSO model with relatively good prediction performance to construct the nomogram to ensure clinical practicability. (3) We provided a formula to calculate the risk of diabetes based on risk predictors, which helps clinicians quickly and accurately calculate the individual’s risk of developing diabetes and provide external verification information for other similar studies. (4) Our decision curve analysis demonstrated the nomogram's clinical use and could avoid performing additional diabetes screenings (such as OGTT) for individuals with low-risk diabetes. (5) We performed both internal and external validation to ensure the reliability of the results. (6) As this was a retrospective cohort study, it decreased the risk of selection bias and message bias.

Although our nomogram performed well, the study still has some potential limitations. First of all, this is a secondary retrospective study. The raw data did not provide other diabetes risk factors, such as waist/hip ratio, medical history, and lifestyle factors, affecting the onset of diabetes. However, our nomogram has excellent prediction performance in both internal and external validation, suggesting that the nomogram based on the existing five risk factors has high generalizability. Second, the database did not distinguish between type 1, type 2, and other diabetes types. And the risk factors of different kinds of diabetes are somewhat different. However, type 2 diabetes is the most common kind of diabetes, accounting for over 90% of diabetes cases⁶⁶. The nomogram is approximately used to predict the 3-year risk of developing type 2 diabetes. Third, the researchers did not conduct an oral glucose tolerance test and measure glycosylated hemoglobin. A study showed that 55% of diabetic patients were diagnosed by testing fasting blood glucose alone in Asians⁶⁷. Thus, the diagnostic criteria for diabetes in our study may underestimate the true prevalence of diabetes. In other words, the development and validation datasets included only very small numbers of diabetes cases, which may be related to the diagnostic criteria for diabetes in our study. However, a 2-h oral glucose tolerance test for all participants was not feasible in such a large cohort. Fourth, we excluded participants with incomplete records for complete-case analysis to build the models, which may introduce selection bias. However, we used multiple imputations to replace missing values to do sensitivity analysis. And the results proved that our study participants could well represent the overall population. Therefore, in the future, we can consider designing our studies or cooperating with other researchers to collect as many variables as possible, reduce missing values, and distinguish the types of diabetes. Fifth, there were no interactions between the covariates included within the full model, which may cause bias in the results of the full model. However, we focused predominantly on the LASSO model, which has the fewest variables and is more convenient for clinical application, rather than the full model.

Conclusion

We developed and validated a personalized prediction nomogram for the 3-year risk of incident diabetes among Chinese adults, including age, BMI, SBP, FPG and TG. The nomogram had excellent prediction performance in both training and validation cohorts for estimating the risk of developing diabetes, and it has high generalizability. The nomogram was a simple and reliable tool to help clinicians accurately identify individuals with high diabetes risk.

Data availability

Data can be downloaded from the ‘DATADRYAD’ database (www.Datadryad.org), shared by Chen et al.²⁹ from: Association of body mass index and age with incident diabetes in Chinese adults: a population-based cohort study. Dryad Digital Repository. http://dx.doi.org/10.1136/bmjopen-2018-021768.

Abbreviations

BMI:: Body mass index
SBP:: Systolic blood pressure
DBP:: Diastolic blood pressure
FPG:: Fasting plasma glucose
TC:: Total cholesterol
TG:: Triglyceride
HDL-C:: High-density lipoprotein cholesterol
LDL-C:: Low-density lipid cholesterol
ALT:: Alanine aminotransferase
BUN:: Serum urea nitrogen
Scr:: Serum creatinine
T2DM:: Type 2 diabetes mellitus
DM:: Diabetes mellitus
LASSO:: Least absolute shrinkage and selection operator
SD:: Standardized difference
HR:: Hazard ratios
CI:: Confidence intervals
Ref:: Reference
PPV:: Positive predictive value
NPV:: Negative predictive value
PLR:: Positive likelihood ratio
NLR:: Negative likelihood ratio
DOR:: Diagnostic odds ratio
ROC:: Receiver operating characteristic
AUC:: Area under the curve

References

Cho, N. H. et al. IDF diabetes atlas: global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Res. Clin. Pract. 138, 271–281 (2018).
Article CAS PubMed Google Scholar
Global, Regional, and National Age-Sex-Specific Mortality for 282 Causes of Death in 195 Countries and Territories, 1980–2017: A Systematic Analysis for the Global Burden of Disease Study 2017. Lancet. 392, 1736–1788 (2018).
Unnikrishnan, R., Pradeepa, R., Joshi, S. R. & Mohan, V. Type 2 diabetes: demystifying the global epidemic. Diabetes 66, 1432–1442 (2017).
Article CAS PubMed Google Scholar
Wang, L. et al. Prevalence and Ethnic Pattern of Diabetes and Prediabetes in China in 2013. JAMA 317, 2515–2523 (2017).
Article PubMed PubMed Central Google Scholar
Golubnitschaja, O. & Costigliola, V. General report & recommendations in predictive, preventive and personalised medicine 2012: white paper of the European Association for predictive preventive and personalised medicine. EPMA J. 3, 14 (2012).
Article PubMed PubMed Central Google Scholar
Ley, S. H., Hamdy, O., Mohan, V. & Hu, F. B. Prevention and management of type 2 diabetes: dietary components and nutritional strategies. Lancet 383, 1999–2007 (2014).
Article CAS PubMed PubMed Central Google Scholar
le Roux, C. W. et al. 3 Years of Liraglutide versus Placebo for type 2 diabetes risk reduction and weight management in individuals with prediabetes: a randomised. Double-Blind Trial. Lancet 389, 1399–1409 (2017).
Article PubMed CAS Google Scholar
Brito, J. P., Montori, V. M. & Davis, A. M. Metabolic surgery in the treatment algorithm for Type 2 diabetes: a joint statement by international diabetes organizations. JAMA 317, 635–636 (2017).
Article PubMed PubMed Central Google Scholar
Lee, W. J. et al. Predicting success of metabolic surgery: age, body mass index, C-peptide, and duration score. Surg. Obes. Relat. Dis. 9, 379–384 (2013).
Article PubMed Google Scholar
Pucci, A. et al. Type 2 diabetes remission 2 years Post Roux-en-Y gastric bypass and sleeve gastrectomy: the role of the weight loss and comparison of DiaRem and DiaBetter scores. Diabetics Med. 35, 360–367 (2018).
Article CAS Google Scholar
Gregg, E. W. et al. Association of an intensive lifestyle intervention with remission of type 2 diabetes. JAMA 308, 2489–2496 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shi, X. et al. Effect of exenatide after short-time intensive insulin therapy on glycaemic remission maintenance in type 2 diabetes patients: a randomized controlled trial. Sci. Rep. 7, 2383 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Hostalek, U., Gwilt, M. & Hildemann, S. Therapeutic use of metformin in prediabetes and diabetes prevention. Drugs. 75, 1071–1094 (2015).
Article CAS PubMed PubMed Central Google Scholar
Vijan, S. Type 2 diabetes. Ann. Intern. Med. 171, C65–C80 (2019).
Article Google Scholar
Long-Term Effects of Metformin on Diabetes Prevention. Identification of subgroups that benefited most in the diabetes prevention program and diabetes prevention program outcomes study. Diabetes Care 42, 601–608 (2019).
Article Google Scholar
Collins, G. S., Mallett, S., Omar, O. & Yu, L. M. Developing Risk prediction models for type 2 diabetes: a systematic review of methodology and reporting. BMC Med. 9, 103 (2011).
Article PubMed PubMed Central Google Scholar
Griffin, S. J., Little, P. S., Hales, C. N., Kinmonth, A. L. & Wareham, N. J. Diabetes risk score: towards earlier detection of type 2 diabetes in general practice. Diabetes Metab. Res. Rev. 16, 164–171 (2000).
Article CAS PubMed Google Scholar
Lindstrom, J. & Tuomilehto, J. The diabetes risk score: a practical tool to predict type 2 diabetes risk. Diabetes Care 26, 725–731 (2003).
Article PubMed Google Scholar
Heikes, K. E., Eddy, D. M., Arondekar, B. & Schlessinger, L. Diabetes risk calculator: a simple tool for detecting undiagnosed diabetes and pre-diabetes. Diabetes Care 31, 1040–1045 (2008).
Article PubMed Google Scholar
Gray, L. J. et al. The leicester risk assessment score for detecting undiagnosed type 2 diabetes and impaired glucose regulation for use in a multiethnic UK setting. Diabetics Med. 27, 887–895 (2010).
Article CAS Google Scholar
Tabaei, B. P. & Herman, W. H. A multivariate logistic regression equation to screen for diabetes: development and validation. Diabetes Care 25, 1999–2003 (2002).
Article PubMed Google Scholar
Lin, Y. et al. A rule-based prognostic model for type 1 diabetes by identifying and synthesizing baseline profile patterns. PLoS ONE 9, e91095 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Lamain-de, R. M. et al. External validation of prognostic models to predict risk of gestational diabetes mellitus in one dutch cohort: prospective multicentre cohort study. BMJ 354, i4338 (2016).
Article Google Scholar
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Article PubMed PubMed Central Google Scholar
Sauerbrei, W., Royston, P. & Binder, H. Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat. Med. 26, 5512–5528 (2007).
Article MathSciNet PubMed Google Scholar
Lin, Z., Guo, D., Chen, J. & Zheng, B. A nomogram for predicting 5-year incidence of type 2 diabetes in a chinese population. Endocrine 67, 561–568 (2020).
Article CAS PubMed Google Scholar
Wang, K. et al. Nomogram prediction for the 3-year risk of type 2 diabetes in healthy mainland China Residents. EPMA J. 10, 227–237 (2019).
Article PubMed PubMed Central Google Scholar
Wong, C. K. et al. Simple Non-laboratory- and laboratory-based risk assessment algorithms and nomogram for detecting undiagnosed diabetes mellitus. J. Diabetes. 8, 414–421 (2016).
Article CAS PubMed Google Scholar
Chen, Y. et al. Association of body mass index and age with incident diabetes in chinese adults: a population-based cohort study. BMJ Open. 8, e21768 (2018).
Google Scholar
Chen, Y. et al. Association of Body mass index and age with incident diabetes in Chinese adults: a population-based cohort study. BMJ Open. 8, e21768 (2018).
Google Scholar
Normand, S. T. et al. Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. J. Clin. Epidemiol. 54, 387–398 (2001).
Article CAS PubMed Google Scholar
Gao, F. et al. Independent effect of alanine transaminase on the incidence of type 2 diabetes mellitus, stratified by age and gender: a secondary analysis based on a large cohort study in China. Clin. Chim. Acta. 495, 54–59 (2019).
Article CAS PubMed Google Scholar
Xie, Y. et al. Higher blood urea nitrogen is associated with increased risk of incident diabetes mellitus. Kidney Int. 93, 741–752 (2018).
Article CAS PubMed Google Scholar
Qin, P. et al. Dose-response associations between serum creatinine and type 2 diabetes mellitus risk: a Chinese cohort study and meta-analysis of cohort studies. J. Diabetes. 12, 594–604 (2020).
Article CAS PubMed Google Scholar
Holst, C., Becker, U., Jørgensen, M. E., Grønbæk, M. & Tolstrup, J. S. Alcohol drinking patterns and risk of diabetes: a cohort study of 70,551 men and women from the general danish population. Diabetologia 60, 1941–1950 (2017).
Article PubMed Google Scholar
Collignon, O. & Monnez, J. Clustering of the values of a response variable and simultaneous covariate selection using a stepwise algorithm. Appl. Math. 07, 1639–1648 (2016).
Article Google Scholar
Roh, J. et al. Risk Stratification Using multivariable fractional polynomials in diffuse large B-cell lymphoma. Front Oncol. 10, 329 (2020).
Article PubMed PubMed Central Google Scholar
Kidd, A. C. et al. Survival prediction in mesothelioma using a scalable lasso regression model: instructions for use and initial performance using clinical predictors. BMJ Open Respir Res. 5, e240 (2018).
Article Google Scholar
Lei, Z. et al. Nomogram for preoperative estimation of microvascular invasion risk in hepatitis B virus-related hepatocellular carcinoma within the milan criteria. JAMA Surg. 151, 356–363 (2016).
Article PubMed Google Scholar
Sun, F., Tao, Q. & Zhan, S. An accurate risk score for estimation 5-year risk of type 2 diabetes based on a health screening population in Taiwan. Diabetes Res Clin Pract. 85, 228–234 (2009).
Article PubMed Google Scholar
Fitzgerald, M., Saville, B. R. & Lewis, R. J. Decision curve analysis. JAMA 313, 409–410 (2015).
Article CAS PubMed Google Scholar
Steyerberg, E. W. & Vergouwe, Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur. Heart J. 35, 1925–1931 (2014).
Article PubMed PubMed Central Google Scholar
Okamura, T. et al. Ectopic fat obesity presents the greatest risk for incident type 2 diabetes: a population-based longitudinal study. Int J Obes (Lond). 43, 139–148 (2019).
Article Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMC Med. 13, 1 (2015).
Article PubMed PubMed Central Google Scholar
Balkau, B. et al. Predicting diabetes: clinical, biological, and genetic approaches: data from the epidemiological study on the insulin resistance syndrome (DESIR). Diabetes Care 31, 2056–2061 (2008).
Article CAS PubMed PubMed Central Google Scholar
Hippisley-Cox, J., Coupland, C., Robson, J., Sheikh, A. & Brindle, P. Predicting risk of Type 2 diabetes in England and Wales: prospective derivation and validation of QDScore. BMJ 338, b880 (2009).
Article PubMed PubMed Central Google Scholar
Gupta, A. K. et al. Determinants of new-onset diabetes among 19,257 hypertensive patients randomized in the anglo-scandinavian cardiac outcomes trial-blood pressure lowering arm and the relative influence of antihypertensive medication. Diabetes Care 31, 982–988 (2008).
Article CAS PubMed Google Scholar
Kahn, H. S., Cheng, Y. J., Thompson, T. J., Imperatore, G. & Gregg, E. W. Two risk-scoring systems for predicting incident diabetes mellitus in US adults age 45 to 64 years. Ann. Intern. Med. 150, 741–751 (2009).
Article PubMed Google Scholar
Schmidt, M. I. et al. Identifying individuals at high risk for diabetes: the atherosclerosis risk in communities study. Diabetes Care 28, 2013–2018 (2005).
Article PubMed Google Scholar
Wilson, P. W. et al. Prediction of incident diabetes mellitus in middle-aged adults: the framingham offspring study. Arch Intern Med. 167, 1068–1074 (2007).
Article PubMed Google Scholar
Lin, Z., Guo, D., Chen, J. & Zheng, B. A nomogram for predicting 5-year incidence of type 2 diabetes in a chinese population. Endocrine 67, 561–568 (2020).
Article CAS PubMed Google Scholar
Royston, P., Altman, D. G. & Sauerbrei, W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat. Med. 25, 127–141 (2006).
Article MathSciNet PubMed Google Scholar
Lagakos, S. W. Effects of mismodelling and mismeasuring explanatory variables on tests of their association with a response variable. Stat. Med. 7, 257–274 (1988).
Article CAS PubMed Google Scholar
Wang, K. et al. Nomogram prediction for the 3-Year risk of type 2 diabetes in healthy Mainland China residents. EPMA J. 10, 227–237 (2019).
Article PubMed PubMed Central Google Scholar
Wong, C. K. et al. Simple Non-laboratory- and laboratory-based risk assessment algorithms and nomogram for detecting undiagnosed diabetes mellitus. J. Diabetes. 8, 414–421 (2016).
Article CAS PubMed Google Scholar
Selph, S. et al. Screening for type 2 diabetes mellitus: a systematic review for the US preventive services task Force. Ann. Intern. Med. 162, 765–776 (2015).
Article PubMed Google Scholar
Duarte, A. A., Mohsin, S. & Golubnitschaja, O. Diabetes care in figures: current pitfalls and future scenario. EPMA J. 9, 125–131 (2018).
Article PubMed PubMed Central Google Scholar
Coordt, M. C., Ruhe, R. C. & McDonald, R. B. Aging and insulin secretion. Proc. Soc. Exp. Biol. Med. 209, 213–222 (1995).
Article CAS PubMed Google Scholar
Chang, A. M. & Halter, J. B. Aging and Insulin Secretion. Am. J. Physiol. Endocrinol. Metab. 284, E7–E12 (2003).
Article CAS PubMed Google Scholar
Taylor, R. et al. Remission of human type 2 diabetes requires decrease in liver and pancreas fat content but is dependent upon capacity for beta cell recovery. Cell Metab. 28, 667 (2018).
Article CAS PubMed Google Scholar
Barazzoni, R., Gortan, C. G., Ragni, M. & Nisoli, E. Insulin resistance in obesity: an overview of fundamental alterations. Eat Weight Disord. 23, 149–157 (2018).
Article PubMed Google Scholar
Moreno, B. et al. Glycated hemoglobin correlates with arterial stiffness and endothelial dysfunction in patients with resistant hypertension and uncontrolled diabetes mellitus. J. Clin. Hypertens (Greenwich). 20, 910–917 (2018).
Article CAS PubMed Central Google Scholar
Lorenzo, C. et al. A1C Between 5.7 and 6.4% as a marker for identifying pre-diabetes, insulin sensitivity and secretion, and cardiovascular risk factors: The Insulin Resistance Atherosclerosis Study (IRAS). Diabetes Care. 33, 2104–2109 (2010).
Article CAS PubMed PubMed Central Google Scholar
Scherer, P. E. Adipose tissue: from lipid storage compartment to endocrine organ. Diabetes 55, 1537–1545 (2006).
Article CAS PubMed Google Scholar
Boden, G. Obesity, insulin resistance and free fatty acids. Curr. Opin. Endocrinol. Diabetes Obes. 18, 139–143 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zheng, Y., Ley, S. H. & Hu, F. B. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 14, 88–98 (2018).
Article PubMed Google Scholar
Qiao, Q. et al. Age- and sex-specific prevalence of diabetes and impaired glucose regulation in 11 asian cohorts. Diabetes Care 26, 1770–1780 (2003).
Article PubMed Google Scholar

Download references

Funding

This study was supported in part by the Discipline Construction Ability Enhancement Project of Shenzhen Municipal Health Commission (SZXJ2017031).

Author information

These authors contributed equally: Yang Wu and Haofei Hu.

Authors and Affiliations

Department of Endocrinology, The First Affiliated Hospital of Shenzhen University, No.3002 Sungang Road, Futian District, Shenzhen, 518035, Guangdong Province, China
Yang Wu, Jinlin Cai, Runtian Chen & Dewen Yan
Department of Nephrology, The First Affiliated Hospital of Shenzhen University, Shenzhen, 518035, Guangdong Province, China
Haofei Hu
Department of Endocrinology, Shenzhen Second People’s Hospital, Shenzhen, 518035, Guangdong Province, China
Yang Wu, Jinlin Cai, Runtian Chen & Dewen Yan
Department of Nephrology, Shenzhen Second People’s Hospital, Shenzhen, 518035, Guangdong Province, China
Haofei Hu
Shenzhen University Health Science Center, Shenzhen, 518071, Guangdong Province, China
Yang Wu, Haofei Hu, Runtian Chen & Dewen Yan
Shantou University Medical College, Shantou, 515000, Guangdong Province, China
Jinlin Cai
Department of Endocrinology, Shenzhen Third People’s Hospital, Shenzhen, 518116, Guangdong Province, China
Xin Zuo & Heng Cheng

Authors

Yang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Haofei Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jinlin Cai
View author publications
You can also search for this author in PubMed Google Scholar
Runtian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xin Zuo
View author publications
You can also search for this author in PubMed Google Scholar
Heng Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Dewen Yan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.W. and H.H. conceived and designed the research, drafted the manuscript. J.C. and R.C. did the statistical analysis. X.Z. and H. C. took part in the discussion. D.Y. revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Dewen Yan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, Y., Hu, H., Cai, J. et al. A prediction nomogram for the 3-year risk of incident diabetes among Chinese adults. Sci Rep 10, 21716 (2020). https://doi.org/10.1038/s41598-020-78716-1

Download citation

Received: 02 June 2020
Accepted: 23 November 2020
Published: 10 December 2020
DOI: https://doi.org/10.1038/s41598-020-78716-1

This article is cited by

A machine learning screening model for identifying the risk of high-frequency hearing impairment in a general population
- Yi Wang
- Xinmeng Yao
- Liangwen Xu
BMC Public Health (2024)
Progress of the application clinical prediction model in polycystic ovary syndrome
- Guan Guixue
- Pu Yifu
- Yang Wen
Journal of Ovarian Research (2023)
Machine learning for predicting hepatitis B or C virus infection in diabetic patients
- Sun–Hwa Kim
- So–Hyeon Park
- Heeyoung Lee
Scientific Reports (2023)
A nomogram model for predicting 5-year risk of prediabetes in Chinese adults
- Yanhua Hu
- Yong Han
- Yongcheng He
Scientific Reports (2023)
Risk prediction models for incident type 2 diabetes in Chinese people with intermediate hyperglycemia: a systematic literature review and external validation study
- Shishi Xu
- Ruth L. Coleman
- Nanwei Tong
Cardiovascular Diabetology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Study design and participants

Variable measurement

Definitions

Statistical analysis

Ethical approval

Results

Baseline characteristics of participants

Univariate and multivariate analysis

Development and validation of risk prediction models

Prediction performance of the LASSO model

Optimal cut-off value for nomogram score

Clinical use of the nomogram

External validation

Sensitivity analysis

Discussion

Conclusion

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links