A simplified prediction model for end-stage kidney disease in patients with diabetes

Inoguchi, Toyoshi; Okui, Tasuku; Nojiri, Chinatsu; Eto, Erina; Hasuzawa, Nao; Inoguchi, Yukihiro; Ochi, Kentaro; Takashi, Yuichi; Hiyama, Fujiyo; Nishida, Daisuke; Umeda, Fumio; Yamauchi, Teruaki; Kawanami, Daiji; Kobayashi, Kunihisa; Nomura, Masatoshi; Nakashima, Naoki

doi:10.1038/s41598-022-16451-5

Download PDF

Article
Open access
Published: 21 July 2022

A simplified prediction model for end-stage kidney disease in patients with diabetes

Toyoshi Inoguchi^1,4^na1,
Tasuku Okui²^na1,
Chinatsu Nojiri²,
Erina Eto³,
Nao Hasuzawa⁴,
Yukihiro Inoguchi⁴,
Kentaro Ochi⁶,
Yuichi Takashi⁷,
Fujiyo Hiyama⁸,
Daisuke Nishida⁸,
Fumio Umeda⁵,
Teruaki Yamauchi⁵,
Daiji Kawanami⁷,
Kunihisa Kobayashi⁶,
Masatoshi Nomura⁴ &
…
Naoki Nakashima²

Scientific Reports volume 12, Article number: 12482 (2022) Cite this article

3094 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

This study aimed to develop a simplified model for predicting end-stage kidney disease (ESKD) in patients with diabetes. The cohort included 2549 individuals who were followed up at Kyushu University Hospital (Japan) between January 1, 2008 and December 31, 2018. The outcome was a composite of ESKD, defined as an eGFR < 15 mL min⁻¹ [1.73 m]⁻², dialysis, or renal transplantation. The mean follow-up was 5.6 $\pm$ 3.7 years, and ESKD occurred in 176 (6.2%) individuals. Both a machine learning random forest model and a Cox proportional hazard model selected eGFR, proteinuria, hemoglobin A1c, serum albumin levels, and serum bilirubin levels in a descending order as the most important predictors among 20 baseline variables. A model using eGFR, proteinuria and hemoglobin A1c showed a relatively good performance in discrimination (C-statistic: 0.842) and calibration (Nam and D’Agostino $\chi$² statistic: 22.4). Adding serum albumin and bilirubin levels to the model further improved it, and a model using 5 variables showed the best performance in the predictive ability (C-statistic: 0.895, $\chi$² statistic: 7.7). The accuracy of this model was validated in an external cohort (n = 5153). This novel simplified prediction model may be clinically useful for predicting ESKD in patients with diabetes.

Machine learning to predict end stage kidney disease in chronic kidney disease

Article Open access 19 May 2022

Machine learning models for prediction of HF and CKD development in early-stage type 2 diabetes patients

Article Open access 21 November 2022

An independent validation of the kidney failure risk equation in an Asian population

Article Open access 31 July 2020

Introduction

Diabetic kidney disease (DKD) continues to be the leading cause of end-stage kidney disease (ESKD) and accounts for $\ge$ 40% of patients receiving dialysis and renal transplantation in many countries^1,2. Early and accurate prediction of progression to ESKD is of practical benefit in patients with diabetes.

In current clinical practice, the estimated glomerular filtration rate (eGFR) and the presence of proteinuria (albuminuria) are used as the main predictors for progression of chronic kidney disease (CKD) including DKD^3,4. However, these two variables may not be sufficient for clinical decision-making, especially in DKD. In fact, other clinical variables, such as glycemic control, affect the progression of DKD^5,6. Various prediction models for the progression of CKD to ESKD have been reported^7,8,9,10, and some ESKD prediction equations are widely used through electronic applications¹⁰. However, CKD is heterogenous in the variability of progression rates and its pathogeneses. Therefore, developing a prediction model specific for DKD is important. None of the CKD models include variables related to glycemic control such as hemoglobin A1c (HbA1c) levels. Therefore, several prediction models have also been reported for predicting ESKD in people with diabetes^{11,12,13,14,15}. These models showed a moderate to good performance for predicting ESKD, but they used many predictive variables in addition to the eGFR (or creatinine) and albuminuria. For clinical use, the ideal prediction models should not only be accurate, but also easy to implement. In this study, we aimed to develop a simplified, but accurate model for predicting ESKD in patients with diabetes using a few commonly available variables that are easy to measure in the primary care setting.

In recent years, oxidative stress has been considered to be an important pathogenic factor in the development of DKD^{16,17,18,19,20,21}. Since bilirubin and albumin are potent endogenous antioxidants in serum^22,23,24, these variables may play an important role in the progression of DKD^{10,11,14,15,25,26,27,28}. Therefore, in this study, we first evaluated the relative importance of various possibly predictive variables including serum bilirubin and albumin levels for ESKD using a machine learning random forest model. The benefit of the machine learning approach is that it can use a data-driven approach to analyze a large number of variables. We then developed a final simplified prediction model using the minimum number of selected important variables and Cox proportional hazard model, which may be useful for predicting ESKD in primary care. We also validated the performance of this model in an independent external cohort.

Results

Characteristics of study subjects in the development cohort

A total of 2,549 patients (1,432 men and 1,117 women) were eligible for inclusion in the analysis. Table 1 shows the baseline characteristics of the enrolled patients. The median age was 57 years (interquartile range [IQR]: 47–63), and the mean follow-up was 5.6 $\pm$ 3.7 years. There were 176 ESKD events (6.2%) during the follow-up. The median time to ESKD was 2.5 years (IQR: 0.9–4.8).

Table 1 Baseline characteristics of study subjects (n = 2549).

Full size table

Performance of prediction models in the development cohort

The random forest model using 20 variables showed an excellent predictive ability for ESKD (Harrell’s concordance statistic [C-statistic]; 0.935), and selected eGFR, proteinuria, HbA1c, serum albumin, and serum bilirubin as the most important predictors in descending order (Table 2). The Cox proportional hazard model also showed a similar performance in predictive ability (C-statistic: 0.905) and the upper 5 variables were the same as those in the random forest model (Table 2). Therefore, we developed a sequential series of models using these 5 selected variables and Cox proportional hazard models, and then compared their performances. The hazard ratios for the variables and C-statistics for the models are shown in Table 3. The C-statistic was 0.736 for Model 1 (eGFR alone), 0.806 for Model 2 (eGFR and proteinuria), 0.841 for Model 3 (eGFR, proteinuria, and HbA1c), 0.852 for Model 4.1 (eGFR, proteinuria, HbA1c, and serum albumin), 0.881 for Model 4.2 (eGFR, proteinuria, HbA1c, and serum bilirubin), and 0.895 for Model 5 (eGFR, proteinuria, HbA1c, serum albumin, and serum bilirubin). Comparing with the basic model (Model 2, eGFR and proteinuria), the C statistic was significantly increased in Model 3 and Model 5 (P = 0.030, P = 0.012, respectively), suggesting that Model 3 and 5 have a significantly better performance in the discrimination of ESKD than Model 2. In addition, there was no significant difference in the C statistic between these Models and Model 6 (all 20 variables), suggesting that Model 3 and 5 are highly efficient in the discrimination. Next, the prediction risk scores for ESKD using Models 3 and 5 from the Cox proportional hazard model were calculated as follows:

$${\text{Risk score }}\left( {\text{Model 3}} \right)\, = \,\left( { - \,0.0{59}\, \times \,{\text{eGFR in mL min}}^{{ - {1}}} \left[ {{1}.{73} {\text{m}}} \right]^{{ - {2}}} } \right)\, + \,\left( {0.{415}\, \times \,{\text{HbA1c in }}\% } \right)\, + \,({1}.{822}\, \times \,{\text{1 if positive for proteinuria}}).$$

$${\text{Risk score }}\left( {\text{Model 5}} \right)\, = \,\left( { - \,0.0{52}\, \times \,{\text{eGFR in mL min}}^{{ - {1}}} \left[ {{1}.{73} {\text{m}}} \right]^{{ - {2}}} } \right)\, + \,\left( {0.{368}\, \times \,{\text{HbA1c in }}\% } \right)\, + \,\left( { - \,0.{972}\, \times \,{\text{albumin levels in mg}}/{\text{dL}}} \right)\, + \,\left( { - \,{1}.{41}0\, \times \,{\text{bilirubin levels in mg}}/{\text{dL}}} \right)\, + \,({1}.{27}0\, \times \,{\text{1 if positive for proteinuria}}).$$

Table 2 Relative importance of variables for predicting end-stage kidney disease (ESKD) using the random forest model and Cox proportional hazard model.

Full size table

Table 3 Hazard ratios and C-statistics of various models for predicting end-stage kidney disease (ESKD) as evaluated by the Cox proportional hazard model.

Full size table

Using these risk scores, the 5-year risk of each individual in Models 3 and 5 was estimated as follows:

$$ \begin{gathered} {\text{5 - year risk }}\left( {\text{Model 3}} \right):p\left( {{1},{825}} \right) \, = {1} - {\text{ exp }}\left( { - H_{0} \left( {{1},{825}} \right)} \right)^{{{\text{exp }}({\text{risk score in Model 3}})}} \, \hfill \\ = \,{1}{-}{\text{ exp }}\left( { - \,0.0{43}} \right)^{{{\text{exp }}({\text{risk score in Model 3}})}} \, = \,{1} \hfill \\ \hfill \\ {\text{5 - year risk }}\left( {\text{Model 5}} \right):p\left( {{1},{825}} \right)\, = \,{1}{-}{\text{exp }}\left( { - \,H_{0} \left( {{1},{825}} \right)} \right)^{{{\text{exp }}({\text{risk score in Model 5}})}} \hfill \\ = \,{1}{-}{\text{ exp }}\left( { - \,{5}.{26}0} \right)^{{{\text{exp }}({\text{risk score in Model 5}})}} \, = \,{1} \hfill \\ \hfill \\ \end{gathered} $$

Next, calibration was examined by comparing the observed vs predicted probabilities of ESKD at a 5-year risk for Models 3 and 5 (Fig. 1a,b). The predicted probabilities were calculated from the above 5-year risk equations for Models 3 and 5. The Nam and D’Agostino $\chi$² statistics were 22.4 and 7.7 for Models 3 and 5, respectively. These findings suggested that the calibration showed an appropriate agreement between observed vs predicted probabilities for both Models, but Model 5 was more accurate than Model 3. Therefore, we built the nomogram to predict the ESKD probability for each individual based on the Model 5 (Fig. 2), which may provide a practical tool for clinical application.

Performance of prediction models in the external validation cohort

A total of 5153 patients were eligible for inclusion in the validation analysis. The baseline characteristics were shown in supplementary Table S1. The baseline risk in this cohort appeared to differ from those in the development cohort. The patients in the external cohort had complete data for the presence of proteinuria, but the rate of proteinuria was still lower in the external cohort than that in the development cohort with missing data (19.1% vs 26.6%). The rate of ESKD events in the external cohort was lower than that in the development cohort (3.1% vs 6.2%, P < 0.001). Despite the difference in baseline characteristics, the predictive performance of Model 5 was excellent in discrimination (C-statistic: 0.951) in the external cohort. In calibration, its performance was moderate without recalibration ($\chi$² statistic: 36.1 at a 5-year risk) (Fig. 1c) and there was a tendency to slightly overestimate risk. This was likely to be due to the lower baseline risk in the external cohort than that in the development cohort. For the high-risk subgroup of the individuals with CKD in the external cohort (eGFR < 60 mL min⁻¹ [1.73 m]⁻² and/or positive proteinuria) (n = 1,350), its performance was good in discrimination (C-statistic: 0.862) and adequate in calibration without recalibration ($\chi$² statistic: 23.1 at a 5-year risk) (Fig. 1d), suggesting the clinical usefulness of this model for predicting ESKD.

Discussion

In this study, we first evaluated the relative importance of 20 possibly predictive variables for ESKD using a machine learning random forest model and a Cox proportional hazard model. Both models selected eGFR, proteinuria, hemoglobin HA1c, serum albumin, and serum bilirubin as the most important predictors. Then, we developed a simplified prediction model using these 5 variables and the Cox proportional hazard model. To the best of our knowledge, this is the first report to show that a 5-year risk model using these 5 commonly available variables has a good performance in the predictive ability for ESKD in patients with diabetes.

Previously, several prediction models for ESKD in patients with diabetes have also been reported^{11,12,13,14,15}. Jardine et al. reported a prediction model using 7-variables, including eGFR, urinary albumin-creatinine ratio, sex, systolic blood pressure, blood pressure-lowering agent use, presence of retinopathy, and education career from the ADVANCE trial (C-statistic: 0.847)¹². A similar prediction model using 11 variables was reported in Chinese patients with diabetes (area under the curves [AUC] of the 3-, 5-, and 8-year risk: 0.90, 0.86, and 0.81, respectively)¹⁴. However, these models lacked external validation and thereby may not be generalized well to other populations. The model reported by Elley et al. showed a good performance in the predictive ability in the development cohort and the external validation cohort (C-statistic: 0.89–0.92), but this model used 10 variables including sex, ethnicity, age, diabetes duration, albuminuria, serum creatinine, systolic blood pressure, HbA1c, smoking status, and previous cardiovascular disease status¹³. A recent study developed a machine learning based prediction model called the feed-forward neural network model¹⁵. In that model, 18 variables were used in patients with diabetes and nephropathy participating in past clinical trials, including RENAAL, IDNT and ALTITUDE studies (AUC: 0.82, 0.81, and 0.84, respectively). The machine learning approach appears to be superior to the traditional hypothesis-driven statistical methods in terms of its data-driven approach to analyze a large number of possibly predictive variables. Our random forest model using 20 variables also showed an excellent predictive ability for ESKD (C-statistic 0.935). However, the main obstacle is that many predictive variables are not readily obtainable in primary care, thus limiting their usefulness to clinicians’ managing patients with diabetes. In contrast, Keane et al. reported a simple prediction model using four variables (serum creatinine, urine albumin-creatinine ratio, serum albumin, and hemoglobin) in a cohort from the RENNAL study¹¹. They selected those four variables from 23 baseline variables using the Cox proportional hazard model with backward selection process, with P < 0.01 required for inclusion in a final model. However, our analysis using both the machine learning approach and the Cox proportional hazard model showed that HbA1c levels and bilirubin levels were more important predictors than hemoglobin levels for predicting ESKD. The mean follow-up period was much shorter in the RENNAL study than in our study (3.4 years vs. 5.6 years), and decreasing hemoglobin level is a generally late sign of renal impairment. Their model may be effective for risk prediction at a time shorter than 3 years. Thus, our simplified 5-year prediction model may be more useful than previous prediction models in clinical practice. Our model could guide clinicians in making clinical decision earlier regarding intensification of monitoring and preventive therapies or referral to specialists. Risk information helps patients to become aware of their current risk, promote motivation on improving their lifestyle. In our study, the nomogram based on our model was built to predict the absolute ESKD probability for each individual. The 5-year risk equation we showed can be also used as electronic applications. These tools may provide practical risk predictive tools for future clinical application.

The reason why serum bilirubin levels were so important among various predictive variables for predicting ESKD remained to be elucidated. Bilirubin is a product of heme catabolism by heme oxygenase, which is a major antioxidant enzyme. Bilirubin is thought to have a protective effect on oxidative stress-induced organ damage through its strong antioxidant activity²². We previously showed a lower prevalence of nephropathy and other vascular complications, as well as reduced oxidative stress, in patients with diabetes and Gilbert syndrome, which is a hereditary hyperbilirubinemia²⁵. Accumulating evidence has also shown that serum bilirubin levels are negatively associated with the progression of DKD^26,27,28. Taken together, it is most likely that serum bilirubin may prevent the progression of nephropathy via its anti-oxidative activities. This possibility is supported by an animal study, which showed that bilirubin prevented renal oxidative stress and dysfunction in type-1 diabetic rats and type 2 diabetic mice²⁹. In addition, serum bilirubin levels have been reported to be affected by oxidative stress-related factors such as smoking, obesity, hypertension, metabolic syndrome, and cardiovascular diseases in addition to genetic factors^{30,31,32,33,34}, all of which are possible risk facors for DKD. Therefore, serum bilirubin levels might represent a total susceptibility determined by such factors to the progression to ESKD. In line with these concepts, the effect of anti-oxidative properties of albumin on the progression to ESKD may be plausible, although the effect of serum albumin levels may be mainly explained by their association with the levels of albuminuria. Albumin is thought to be an important serum antioxidant in addition to serum bilirubin^24,25. In serum, free thiol groups are one of the most important scavengers of hydroxy radicals and other oxidants and are largely located on albumin³⁵. Serum albumin levels have been reported to be inversely associated with the cardiovascular disease risk and aging, supporting its possible causal relationships with the oxidative stress-related status and diseases^36,37,38. The mechanisms underlying the close associations of serum bilirubin and albumin levels with the progression to ESKD should be clarified in future studies.

There are several limitations to this study. First, we might not have obtained ideal information regarding clinical data and timely assessment of endpoint, compared with controlled clinical trials. Second, the sample size may not have been sufficient to develop prediction models. Third, we used proteinuria by a conventional urine test rather than measurements of albuminuria because the rate of albuminuria measurements was low in the electronic medical record data, although proteinuria data are much more easily obtainable than those of albuminuria measurement in primary care. Forth, a competitive risk analysis of death was not performed because only 16 death cases occurred and the relationship between each death and kidney dysfunction was unknown in this study using electronic medical records. Despite these limitations, our prediction models showed a good performance in the independent external validation as well as the development cohorts. Lastly, although the excellent performance in discrimination of our prediction model was confirmed in an external cohort, the performance in calibration should be evaluated in more various populations including different ethnicities, and cohorts outside Japan for widespread adoption of this model, because patient characteristics, healthcare system, and treatment strategies vary between health centers, regions and countries, and such heterogeneity can affect risk estimates and their calibration^39,40.

In conclusion, we developed a simplified and accurate 5-year prediction model using the commonly available clinical variables of eGFR, proteinuria, HbA1c, serum albumin and bilirubin levels, which are easy to measure. Prospective studies should be done to establish its clinical usefulness in reducing ESKD in patients with diabetes.

Methods

Study subjects in the development cohort

We obtained data from the electronic medical record system at Kyushu University Hospital (Japan) between January 1, 2008 and December 31, 2018. Eligible patients with diabetes were aged 20–69 years and had laboratory data, including the eGFR, and serum bilirubin and albumin levels at baseline. In this study, diabetes was defined as casual blood glucose levels $\ge$ 200 mg/dL, HbA1c levels $\ge$ 6.5%, or the use of glucose lowering agents. Other inclusion criteria were a follow-up $\ge$ 1 year, and at least 5 measurements of the eGFR during follow-up. Patients were excluded if they had an eGFR $\le$ 15 mL min⁻¹ [1.73 m]⁻², received dialysis or renal transplantation at baseline. Other exclusion criteria were co-occurrence of acute kidney injury or cancer during follow-up, or co-occurrence of liver cirrhosis, other hepatobiliary diseases with abnormal liver enzyme levels (alanine aminotransferase or alkaline phosphatase levels $\ge$ 2 fold of the upper limit of the normal range), or hemolytic anemia to evaluate the true contribution of serum bilirubin levels to ESKD events. Patients with missing data were also excluded in this analysis if the rate of missing data was very low (< 3%). A detailed flowchart of the patients’ selection is shown in the supplementary Fig. S1.

Study subjects in the validation cohort

For the independent external validation cohort, we obtained data from the medical record data for patients (n = 5153) who were followed up at 5 hospitals or one disease management institute from January 1, 2008 to December 31, 2020 and had laboratory data, including the eGFR, HbA1c levels, proteinuria, serum albumin levels, and bilirubin levels, at baseline. The same exclusion criteria as those in the development cohort were used.

All procedures were performed in accordance with the relevant guidelines and regulations. Informed consent was obtained from all participants and/or their legal guardians. The study was approved by the ethics committees of Kyushu University Hospital and other related institutes.

Variables

Twenty variables were used to select important variables for the prediction model for ESKD, including age, sex, body mass index (BMI), smoking status, presence of hypertension, presence of dyslipidemia, eGFR, HbA1c, serum albumin, serum bilirubin, serum uric acid, red blood cell count, white blood cell count, platelet count, presence of proteinuria, and medication use (angiotensin II converting enzyme inhibitors, angiotensin II receptor blockers, statins, fibrate-related drugs, and GLP-1 receptor agonists) (Table 1). eGFR was calculated with an equation from the Japanese Society of Nephrology⁴¹. HbA1c levels were presented as the National Glycohemoglobin Standardization Program value and the International Federation of Clinical Chemistry and Laboratory Medicine mmol/mol units converted using the National Glycohemoglobin Standardization Program converter for HbA1c, available at http://www.ngsp.org/convert1.asp. The presence of hypertension was based on the ICD code. Dyslipidemia was defined as serum low-density lipoprotein cholesterol levels $\ge$ 120 mg/dL, triglyceride levels $\ge$ 150 mg/dL, or high-density lipoprotein cholesterol levels < 40 mg/dL, in accordance with the Japan Atherosclerosis Society criteria, or the current use of lipid-lowering agents. Proteinuria was defined as a positive result using US-3500 or US-1200 analyzer and Uropaper III Eiken (Eiken Chemical, Co., Ltd., Tokyo, Japan). A positive result was urinary protein concentrations of $\ge$ 30 mg/dL, which are the same as those using the Albustix dipstick method. In the enrolled patients, there were missing data for the smoking status (20.7%), dyslipidemia (0.6%), and proteinuria (37.2%). There were no missing data for the other variables.

Clinical outcomes

The outcome of this study was a composite of ESKD events, which was defined as an eGFR < 15 mL min⁻¹ [1.73 m]⁻², chronic dialysis, or renal transplantation.

Statistical analysis

To select important variables for predicting ESKD, we evaluated the relative importance of 20 variables using a random forest model (randomForestSRC package, http://cran.r-project.org/web/packages/randomForestSRC). In this study, we also evaluated the predictive value of these variables for ESRD using the Cox proportional hazard model to confirm the results of the random forest model. In the Cox proportional hazard model, variables were standardized by subtracting the mean and dividing by the standard deviation, and they were then applied to the Cox proportional hazard model to compare the relative importance. Thus, we selected the important variables from both models, and then developed a sequential series of models using Cox proportional hazard analysis. We used Harrell’s concordance statistic (C-statistic) as a measure of discrimination for ESKD to evaluate the performance of the model⁴². Calibration was assessed using the Nam and D’Agostino $\chi$² statistic to examine how closely each model’s predicted probabilities agreed with the observed ESKD outcomes⁴³. The prediction risk calculated from a Cox proportional hazard model at time $t$ can be written as follows:

$${p}_{i}\left(t\right)=1-{S}_{i}\left(t\right)=1-{\mathrm{exp}(-{H}_{0}\left(t\right))}^{\mathrm{exp}(\sum_{p}{\beta }_{p}{x}_{ip})}$$

where ${S}_{i}\left(t\right)$ is a survivor function and ${H}_{0}\left(t\right)$ is a cumulative baseline hazard function⁴⁴. The value of $(\sum_{p}{\beta }_{p}{x}_{ip})$ (${\beta }_{p}$, coefficient for variable $p$; ${x}_{ip},$variable $p$ for patient $i$) was defined as a prediction risk score, which was developed from the linear progression equation from the Cox hazard regression model. Given that the predicted absolute ESKD probability is clinically important, a nomogram was also built using the coefficients of the prediction model.

There were missing data for the smoking status, dyslipidemia, and proteinuria in this study. For our data imputation approach, we used the random forest imputation algorithm, called missForest (missForest package version 1.2, http://cran.r-project.org/web/packages/missForest)⁴⁵. To confirm that our missForest prediction models were robust and not sensitive to missing data, we also used another imputation method, where we imputed the missing data by multiple imputation method (mice package version 3.14.0, https://cran.r-project.org/web/packages/mice/mice.pdf) in this analysis. The results in this method were similar to those in the missForest models (supplementary Table S1). Two-tailed P < 0.05 was defined as statistical significance. Data are presented as the mean ± standard deviation or the median (IQR). The significance of differences was determined by the chi-square test for categorical variables and the unpaired t test or the Mann–Whitney U test for continuous variables. We performed statistical analyses using R (version 3.6.3, https://www.r-project.org).

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

US Renal Data System 2019 Annual Data Report: Epidemiology of Kidney Disease in the United States. https://doi.org/10.1053/j.ajkd.2019.09.003 (2019).
The Japan Society for Dialysis Therapy. An Overview of Regular Dialysis Treatment in Japan, 2017 Report. http://www.jsdt.or.jp/english/2426.html (2017).
KDOQI. KDOQI clinical practice guideline and clinical practice recommendations for diabetes and chronic kidney disease. Am. J. Kidney Dis. 49, S12–S154 (2007).
Amin, A. P. et al. The synergic relationship between estimated GFR and microalbuminuria in predicting long-term progression to ESKD or death in patients with diabetes: Results from the Kidney Early Evaluation Program (KEEP). Am. J. Kidney Dis. 61, S12–S23 (2013).
Article Google Scholar
The Diabetes Control and Complications (DCCT) Research Group. Effect of intensive therapy on the development and progression of diabetic nephropathy in the diabetes control and complications trial. Kidney Int. 47, 1703–1720 (1995).
UK Prospective Diabetes Study (UKPDS) Group. Intensive blood glucose control with sulphonylureas or insulin compared with conventional treatment and risk of complications in patients with type 2 diabetes (UKPDS 33). Lancet 352, 837–853 (1998).
Tangri, N. et al. A predictive model for progression of chronic kidney disease to kidney failure. JAMA 305, 1553–1559 (2011).
Article CAS Google Scholar
Peeters, M. J. et al. Validation of the kidney failure risk equation in European CKD patients. Nephrol. Dial. Transplant. 28, 1773–1779 (2013).
Article CAS Google Scholar
Marks, A. et al. Looking to the future: predicting renal replacement outcomes in a large community cohort with chronic kidney disease. Nephrol. Dial. Transplant. 30, 1507–1517 (2015).
Article CAS Google Scholar
Tangri, N. et al. Multinational assessment of accuracy of equations for predicting risk of kidney failure. A meta-analysis. JAMA 315, 164–174 (2016).
Article CAS Google Scholar
Keane, W. F. et al. Risk scores for predicting outcomes in patients with type 2 diabetes and nephropathy: The RENNAL Study. Clin. J. Am. Soc. Nephrol. 1, 761–767 (2006).
Article Google Scholar
Jardine, M. J. et al. Prediction of kidney-related outcomes in patients with type 2 diabetes. Am. J. Kidney Dis. 60, 770–778 (2012).
Article CAS Google Scholar
Elley, C. R. et al. Derivation and validation of a renal risk score for people with type 2 diabetes. Diabetes Care 36, 3113–3120 (2013).
Article CAS Google Scholar
Lin, C. C. et al. Development and validation of a risk prediction model for end-stage renal disease in patients with type 2 diabetes. Sci. Rep. 7, 10177. https://doi.org/10.1038/s41598-017-09243-9 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Belur Nagaraj, S. et al. Machine learning based early prediction of end-stage renal disease in patients with diabetic kidney disease using clinical trials data. Diabetes Obes. Metab. 22, 2479–2486 (2020).
Article CAS Google Scholar
Ha, H. et al. DNA damage in the kidneys of diabetic rats exhibiting microalbuminuria. Free Radic. Biol. Med. 16, 271–274 (1994).
Article CAS Google Scholar
Kakimoto, M. et al. Accumulation of 8-hydroy-2’-deoxyguanosine and mitochondrial DNA deletion in kidney of diabetic rats. Diabetes 51, 1588–1595 (2002).
Article CAS Google Scholar
Koya, D. et al. Effect of antioxidants in diabetes-induced oxidative stress in the glomeruli of diabetic rats. J. Am. Soc. Nephrol. 14, S250–S253 (2003).
Article CAS Google Scholar
Craven, P. A. et al. Overexpression of Cu²⁺/Zn²⁺ superoxide dismutase protects against early diabetic glomerular injury in transgenic mice. Diabetes 50, 2114–2125 (2001).
Article CAS Google Scholar
Etoh, T. et al. Increased expression of NAD(P)H oxidase subunits, NOX4 and p22phox, in the kidney of streptozotocin-induced diabetic rats and its reversibility by interventive insulin treatment. Diabetologia 46, 1428–1437 (2003).
Article CAS Google Scholar
Inoguchi, T. et al. Protein kinase C-dependent increase in reactive oxygen species (ROS) production in vascular tissues of diabetes: Role of vascular NAD(P)H oxidase. J. Am. Soc. Nephrol. 14, S227–S232 (2003).
Article CAS Google Scholar
Stocker, R. et al. Bilirubin is an antioxidant of possible physiological importance. Science 235, 1043–1046 (1987).
Article ADS CAS Google Scholar
Roche, M. et al. The antioxidant properties of serum albumin. FEBS Lett. 582, 1783–1787 (2008).
Article CAS Google Scholar
Anraku, M. et al. Redox properties of serum albumin. Biochim. Biophys. Acta. 1830, 5465–5472 (2013).
Article CAS Google Scholar
Inoguchi, T. et al. Relationship between Gilbert syndrome and prevalence of vascular complications in patients with diabetes. JAMA 298, 1398–1400 (2007).
Article CAS Google Scholar
Fukui, M. et al. Relationship between serum bilirubin and albuminuria in patients with type 2 diabetes. Kidney Int. 74, 1197–1201 (2008).
Article CAS Google Scholar
Riphagen, I. J. et al. Bilirubin and progression of nephropathy in type 2 diabetes: a post hoc analysis of RENNAL with independent replication in IDNT. Diabetes 63, 2845–2853 (2014).
Article CAS Google Scholar
Zhu, B. et al. Effect of bilirubin concentration on the risk of diabetic complications: a meta-analysis of epidemiologic studies. Sci. Rep. 7, 41681. https://doi.org/10.1038/srep41681 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Fujii, M. et al. Bilirubin and biliverdin protect rodents against diabetic nephropathy by downregulating NAD(P)H oxidase. Kidney Int. 78, 905–919 (2010).
Article CAS Google Scholar
Van Hoydonck, P. G., Temme, E. H. & Schouten, E. G. Serum bilirubin concentration in a Belgian population: The association with smoking status and type of cigarettes. Int. J. Epidemiol. 30, 1465–1472 (2001).
Article Google Scholar
Vitek, L. The role of bilirubin in diabetes, metabolic syndrome, and cardiovascular diseases. Front. Pharmacol. 3, 55. https://doi.org/10.3389/fphar.2012.00055 (2012).
Article PubMed PubMed Central Google Scholar
Wang, L. & Bautista, L. E. Serum bilirubin and the risk of hypertension. Int. J. Epidemiol. 44, 142–152 (2015).
Article Google Scholar
Takei, R. et al. Bilirubin reduces visceral obesity and insulin resistance by suppression of inflammatory cytokines. PLoS ONE 4, e0223302. https://doi.org/10.1371/journal.pone.0223302 (2019).
Article CAS Google Scholar
Choi, Y. et al. Causal associations between serum bilirubin levels and decreased stroke risk. A two-sample Mendelian randomized study. Arterioscler. Thromb. Vasc. Biol. 40, 437–445 (2020).
Article CAS Google Scholar
Danesh, J. et al. Association of fibrinogen, C-reactive protein, albumin, or leucocyte count with coronary heart disease: Meta-analyses of prospective studies. JAMA 279, 1477–1482 (1998).
Article CAS Google Scholar
Putin, E. et al. Deep biomarkers of human aging: Application of deep neural networks to biomarker development. Aging 8, 1021–1033 (2016).
Article CAS Google Scholar
Zoanni, B. et al. Novel insights about albumin in cardiovascular diseases: Focus on heart failure. Mass Spectrum Rev. 8, e21743. https://doi.org/10.1002/mas.21743 (2021).
Article CAS Google Scholar
Fukuhara, S. et al. Clinical usefulness of human serum nonmercaptalbumin to mercaptalbumin ratio as a biomarker for diabetic complications and disability in activities of daily living in elderly patients with diabetes. Metabolism 103, 153995. https://doi.org/10.1016/j.metabol (2019).
Article PubMed Google Scholar
Riley, R. D. et al. External validation of clinical prediction models using big datasets from e-health record or IPD meta-analysis: opportunities and challenges. BMJ 353, i3140. https://doi.org/10.1136/bmj.i3140 (2016).
Article PubMed PubMed Central Google Scholar
Davis, S. E. et al. Calibration drift in regression and machine learning models for acute kidney injury. J. Am. Med. Inform. Assoc. 24, 1052–1061 (2017).
Article Google Scholar
Matsuo, S. et al. Collaborators developing the Japanese equation for estimated GFR. Revised equations foe estimated GFR from serum creatinine in Japan. Am. J. Kidney Dis. 53, 982–992 (2009).
Article CAS Google Scholar
Steyerberg, E. W. et al. Assessing the performance of prediction models: A framework for traditional and novel measures. Epidemiology 211, 128–138 (2010).
Article Google Scholar
D’Agostino, R.B. & Nam, B.H. Evaluation of the performance of survival analysis models: Discrimination and calibration measures. in Handbook of Statistics (eds. Balakrishnan, N. & Rao C.R.). Vol. 23. 1–25. (Elsevier, 2003).
Honda, T. et al. Development and validation of modified risk prediction models for cardiovascular disease and its subtypes: The Hisayama Study. Atherosclerosis 279, 38–44 (2018).
Article CAS Google Scholar
Stekhoven, D. J. & Buhlmann, P. MissForest-nonparametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118 (2012).
Article CAS Google Scholar

Download references

Acknowledgements

This study was supported by the Clinical Observational Study Support System (COS3) in the Medical Information Center (MIC), Kyushu University Hospital.

Author information

These authors contributed equally: Toyoshi Inoguchi and Tasuku Okui.

Authors and Affiliations

Fukuoka City Health Promotion Support Center, Fukuoka City Medical Association, Maizuru 2-5-1, Chuou-ku, Fukuoka, 810-0073, Japan
Toyoshi Inoguchi
Medical Information Center, Kyushu University Hospital, Fukuoka, 812-8582, Japan
Tasuku Okui, Chinatsu Nojiri & Naoki Nakashima
Department of Diabetes and Endocrinology, Saga-Ken Medical Centre Koseikan, Saga, 840-8571, Japan
Erina Eto
Division of Endocrinology and Metabolism, Department of Internal Medicine, Kurume University School of Medicine, Kurume, 830-0011, Japan
Toyoshi Inoguchi, Nao Hasuzawa, Yukihiro Inoguchi & Masatoshi Nomura
Yukuhashi Central Hospital, Yukuhashi, 824-0031, Japan
Fumio Umeda & Teruaki Yamauchi
Department of Endocrinology and Diabetes Mellitus, Fukuoka University Chikushi Hospital, Chikushino, 818-8502, Japan
Kentaro Ochi & Kunihisa Kobayashi
Department of Endocrinology and Diabetes Mellitus, School of Medicine, Fukuoka University, Fukuoka, 814-0180, Japan
Yuichi Takashi & Daiji Kawanami
Carna Health Support, Co., Ltd., Fukuoka, 810-0054, Japan
Fujiyo Hiyama & Daisuke Nishida

Authors

Toyoshi Inoguchi
View author publications
You can also search for this author in PubMed Google Scholar
Tasuku Okui
View author publications
You can also search for this author in PubMed Google Scholar
Chinatsu Nojiri
View author publications
You can also search for this author in PubMed Google Scholar
Erina Eto
View author publications
You can also search for this author in PubMed Google Scholar
Nao Hasuzawa
View author publications
You can also search for this author in PubMed Google Scholar
Yukihiro Inoguchi
View author publications
You can also search for this author in PubMed Google Scholar
Kentaro Ochi
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Takashi
View author publications
You can also search for this author in PubMed Google Scholar
Fujiyo Hiyama
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Nishida
View author publications
You can also search for this author in PubMed Google Scholar
Fumio Umeda
View author publications
You can also search for this author in PubMed Google Scholar
Teruaki Yamauchi
View author publications
You can also search for this author in PubMed Google Scholar
Daiji Kawanami
View author publications
You can also search for this author in PubMed Google Scholar
Kunihisa Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Masatoshi Nomura
View author publications
You can also search for this author in PubMed Google Scholar
Naoki Nakashima
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.I., T.O., and N.N. designed the study and conducted the data analysis. T.I., C.N., E.E., N.H., Y.I., K.O., Y.T., F.H., D.N., F.U., T.Y., D.K., K.K., and M.N. acquired data. All authors interpreted the results. T.I. drafted the manuscript. All authors critically revised the manuscript. All authors approved the final version.

Corresponding author

Correspondence to Toyoshi Inoguchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Inoguchi, T., Okui, T., Nojiri, C. et al. A simplified prediction model for end-stage kidney disease in patients with diabetes. Sci Rep 12, 12482 (2022). https://doi.org/10.1038/s41598-022-16451-5

Download citation

Received: 11 March 2022
Accepted: 11 July 2022
Published: 21 July 2022
DOI: https://doi.org/10.1038/s41598-022-16451-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.