Abstract
Fluid overload, while common in the ICU and associated with serious sequelae, is hard to predict and may be influenced by ICU medication use. Machine learning (ML) approaches may offer advantages over traditional regression techniques to predict it. We compared the ability of traditional regression techniques and different ML-based modeling approaches to identify clinically meaningful fluid overload predictors. This was a retrospective, observational cohort study of adult patients admitted to an ICU ≥ 72 h between 10/1/2015 and 10/31/2020 with available fluid balance data. Models to predict fluid overload (a positive fluid balance ≥ 10% of the admission body weight) in the 48–72 h after ICU admission were created. Potential patient and medication fluid overload predictor variables (n = 28) were collected at either baseline or 24 h after ICU admission. The optimal traditional logistic regression model was created using backward selection. Supervised, classification-based ML models were trained and optimized, including a meta-modeling approach. Area under the receiver operating characteristic (AUROC), positive predictive value (PPV), and negative predictive value (NPV) were compared between the traditional and ML fluid prediction models. A total of 49 of the 391 (12.5%) patients developed fluid overload. Among the ML models, the XGBoost model had the highest performance (AUROC 0.78, PPV 0.27, NPV 0.94) for fluid overload prediction. The XGBoost model performed similarly to the final traditional logistic regression model (AUROC 0.70; PPV 0.20, NPV 0.94). Feature importance analysis revealed severity of illness scores and medication-related data were the most important predictors of fluid overload. In the context of our study, ML and traditional models appear to perform similarly to predict fluid overload in the ICU. Baseline severity of illness and ICU medication regimen complexity are important predictors of fluid overload.
Similar content being viewed by others
Introduction
Fluid overload, a frequent and unintended consequence of the resuscitation process in critically ill adults may result in increased rates of acute kidney injury and invasive mechanical ventilation initiation, prolonged intensive care unit (ICU) stay, and mortality1,2. Timely de-resuscitation to remove excess fluid is associated with improved outcomes3,4,5,6. While the predictors of volume responsiveness are well-established7,8, particularly in patients with sepsis9,10, the predictors for ICU fluid overload remain unclear. Development of rigorous fluid overload prediction algorithms could shorten the time to the implementation of fluid overload mitigation strategies [e.g., concentration of intravenous (IV) fluid products, discontinuation of maintenance fluids, administration of diuretics] and improve outcomes.
Non-diuretic ICU medication use may affect fluid overload risk; preliminary data suggests the medication regimen complexity-ICU (MRC-ICU) score is associated with both fluid overload and fluid balance11. This score has also been shown to predict mortality and length of stay and also the medication interventions needed to optimize a patient’s pharmacotherapy regimen12,13,14,15,16,17,18,19. Therefore, quantifying patient-specific, medication-related data is likely an important consideration in the prediction of fluid overload in critically adults2,20,21.
Event prediction in the ICU remains a perennial area of research given the many challenges that exist for clinicians to accurately predict clinical outcomes in the highly complex and dynamic critical care environment22,23. Artificial intelligence and machine learning techniques have been proposed as a method to improve ICU clinical outcome prediction given their unique ability to handle multi-dimensional problems and identify novel patterns within the vast troves of continuously-generated patient data21,24,25,26. However, to some ICU clinicians, the use of artificial intelligence/machine learning approaches to predict clinical events may have a ‘black-box effect,’ which can ultimately preclude implementation. The rigorous evaluation of whether artificial intelligence-based approaches predict clinical events better than traditional regression models (or clinical expertise alone) remains a key question in critical care practice27,28,29,30,31.
In this study, we sought to compare the ability of machine learning approaches to traditional regression models to predict fluid overload and the individual predictors for its occurrence in critically ill adults. We hypothesized that advanced machine learning techniques perform better than traditional regression models to predict fluid overload and that the predictors for fluid overload identified through machine learning approaches may be different.
Methods
We conducted a retrospective, observational study of adults admitted ICUs at the University of North Carolina Health System (UNCHS), an integrated health system, who had fluid overload data available. The protocol for this study was approved with waivers of informed consent and HIPAA authorization granted by UNHCS Institutional Review Board (approval number: Project00001541; approval date: October 2021). Procedures followed in the study were in accordance with the ethical standards of the of the UNHCS Institutional Review Board and the Helsinki Declaration of 1975, as most recently amended32. The reporting of this study adheres to the STrengthening and reporting of OBservational data in Epidemiology statement33.
Population
A random sample of 1000 adults (≥ 18 years) admitted to an ICU at UNCHS between October 2015 and October 2020 was generated. Patients on their index ICU admission with fluid balance data available for the first 72 h were included (Supplemental Digital Content (SDC) Fig. S1). Patients were excluded if the admission was not their index ICU admission.
Data collection and outcomes
De-identified UNCHS electronic health record (EHR) data (Epic Systems, Verona, WI) housed in the Carolina Data Warehouse (CDW) was extracted by a trained CDW data analyst. The primary outcome was the presence of fluid overload at the 48–72 h (i.e., day 3) after ICU admission. Fluid overload was defined as a positive fluid balance in milliliters (mL) greater than or equal to 10% of the patient’s admission body weight in kilograms (kg)2,34. For example, a patient with a body weight of 100 kg at ICU admission having a positive fluid balance at 72 h of 12,000 mL (or 12 kg) would be considered to have fluid overload. A secondary outcome was the amount of fluid overload as a function of body weight. For example, the aforementioned patient would have a fluid overload amount of 12%.
Following a literature review, and through investigator consensus, potential predictor variables for fluid overload were defined2,35,36,37,38. A total of 28 potential predictors were identified: 1) ICU baseline: age ≥ 65 years, sex, admission to a medical (vs. surgical) ICU, primary ICU admission diagnosis (i.e., cardiac, chronic kidney disease, heart failure, hepatic, pulmonary, sepsis, trauma), and select co-morbidities (i.e., chronic kidney disease, heart failure); 2) 24 h after ICU admission: APACHE II and SOFA score (using worst values in the 24 h period), use of supportive care devices (i.e., renal replacement therapy, invasive mechanical ventilation), serum laboratory values (i.e., albumin < 3 mg/dL, bicarbonate < 22 mEq/L or > 29 mEq/L, chloride ≥ 110 mEq/L, creatinine ≥ 1.5 mg/dL, lactate ≥ 2 mmol/L, potassium ≥ 5.5 mEq/L, sodium ≥ 148 mEq/L or < 134 mEq/L), fluid balance (mL), and presence of acute kidney injury (as defined by need for renal replacement therapy or serum creatinine greater than or equal two times baseline); 3) Medication data at 24 h: MRC-ICU score, vasopressor use in the first 24 h, use of continuous medication infusions, and the number of continuous medication infusions.
Data analysis
Data missingness
Due to the hypothesis-generating nature of our study and the lack of published data on ICU fluid overload prediction, no attempt was made to estimate a study sample size. The 991 patients were split into training and testing datasets using a 80:20 ratio. We assumed data was missing at random (MAR) (i.e., related to observed, not unobserved values) and therefore chose Multiple Imputation by Chained Equation (MICE), rather than complete case analysis or simple imputation, as the most appropriate approach to address missingness. Ten imputations per variable were therefore applied for all missing data in the training and testing datasets to generate multiple imputed training and testing datasets (SDC Fig. S1).
Machine learning models
We employed Random Forest, SVM, and XGBoost for the task of modeling the presence of fluid overload39,40,41. During the model training on each of the ten imputed training sets, fivefold cross validation was applied for Random Forest, SVM and XGBoost, using their most appropriate R package42,43,44, to choose the hyperparameters for these machine learning models that resulted in the highest prediction accuracy. Each of these models was then fitted on the corresponding imputed training set, and predictions for probability of fluid overload were made on each of the ten imputed testing sets using the corresponding optimal model. During this phase, hyperparameters were tuned. For Random Forest, two hyperparameters were tuned (number of trees and number of variables randomly sampled as candidates at each split). For SVM, linear kernel and cost of constraints violation were tuned. For XGBoost, two hyperparameters were tuned (maximum depth of a tree and maximum number of boosting iterations). For each model, ten different predictions were generated on ten different imputed test sets. These predictions of the probability for fluid overload were averaged as the final prediction.
For the degree of fluid overload, we built models with the amount of fluid overload at 72 h. Since this is a continuous variable, we employed their regression of the above machine learning models: Random Forest regression, SVM regression, and XGBoost regression. For XGBoost, feature importance was measured as the frequency a feature was used in the trees. For Random Forest, feature importance was measured by mean decrease in node impurity. Because ten different models were used on each imputed dataset, ten different feature importance lists were generated for each. A subsequent analysis modeling fluid overload as a continuous variable (percent of net milliliters of fluid by body weight) instead of dichotomous presence or absence of fluid overload) was performed (see SDC—Additional Methods S1).
Traditional regression models
Subsequently, a full logistic regression model was built for the presence of fluid overload for each of the ten complete training sets. We then applied backward elimination to select the final model. The initial set of variables for the variable selection were determined by the significance of variables in the ten full models by multivariate Wald testing45. We built our linear regression models so that the degree of fluid overload was similar to that of the ten completed training sets. In order to compare these models with the MRC-ICU only model, we also built logistic regression and linear regression models with MRC-ICU as the sole predictor in the ten training sets. After model fitting, model fits were pooled using Rubin’s method46. Using the pooled models, odds ratios (OR) and their 95% confidence intervals (CI) were reported.
For each regression model, ten different predictions were generated on ten different imputed test sets as well. These predictions of the probability for fluid overload were averaged as the final prediction. We compared the variables selected through backward selection with the top five variables chosen by Random Forest (see SDC Additional Methods S1). To further evaluate our results in those patients with high APACHE-2 (≥ 25) and high SOFA (≥ 10) scores, we generated predictions using the backward section model (see SDC Additional Methods S1).
Ethical approval
The protocol for this study was approved with waivers of informed consent and HIPAA authorization granted by UNHCS Institutional Review Board (approval number: Project00001541; approval date: October 2021).
Results
A total of 49 (12.5%) of the 391 included patients had fluid overload on ICU day 3. The degree of day 3 fluid overload was significantly greater in the fluid overload (vs non overload) patients (16.6% vs 2.2%, p < 0.01). Overall, the mean APACHE II score was 15.7 ± 6.6, mean SOFA score was 8.3 ± 3.3, and MRC-ICU score was 11.8 ± 8.7. A significantly greater proportion of fluid overload patients (vs. those without) had an elevated serum lactate ≥ 2 mmol/L (32.7% vs. 14.9%, p = 0.01) and AKI (28.6% vs. 10.5%, p < 0.001) at 24 h and positive fluid balance (1,840 mL vs. 390 mL, p < 0.001) on ICU day 3. All model covariates are summarized in Table 1. At ICU day 3, patients with fluid overload (vs those without) were more likely to be dead (20.4% vs. 7.3%, p = 0.01), have AKI (34.7% vs. 15.8%, p < 0.001), and remain on mechanical ventilation (12.7% vs. 4.2%, p = 0.05).
Among the machine learning models, XGBoost demonstrated the highest AUROC (0.78) compared to SVM (0.69) and RF (0.76) and was associated with a PPV of 0.27 and NPV of 0.94. Notably, all models tested at relatively poor PPV. In comparison, stepwise logistic regression had an AUROC of 0.70, PPV 0.26, and NPV 0.94. Full results are reported in Table 2, and AUROC curves for all models are provided in SDC Supplemental Fig. S2. Results of the full logistic regression are reported in SDC Supplemental Table S1. Stepwise regression resulted in a more parsimonious model (7 variables vs. 31 variables) but demonstrated similar performance to the machine learning models (SDC Supplementary Table S2). In the stepwise regression, presence of sepsis, male sex, the SOFA score at 24 h, and the 24 h serum sodium and bicarbonate comprised the stepwise regression model (Table 2). In an analysis of MRC-ICU as a single predictor for fluid overload, the model had an AUROC of 0.74 (0.60–0.84), sensitivity 0.62 (0.35–0.85), specificity 0.70 (0.63–0.77), PPV 0.16 (0.08–0.27), and NPV 0.96 (0.90–0.98).
Feature importance graphs were plotted for XGBoost (Fig. 1), RF (SDC Supplemental Fig. S3) and SVM (SDC 5 Supplemental Fig. S4). Among the 10 different feature importance lists generated for each model, differences between top features were noted. For example, for two of the machine learning models, XGBoost (Fig. 2) and RF, the top five most important features were fluid balance at 24 h, SOFA score at 24 h, MRC-ICU at 24 h, APACHE II at 24 h, and the number of continuous infusions at 24 h. While the stepwise regression model found fluid balance at 24 h and APACHE II at 24 h to be top features, the SOFA score at 24 h, the MRC-ICU at 24 h and the number of continuous infusions were not found to be model features. The full regression results for predicting the amount of fluid overload at 72 h are reported in SDC Supplemental Table S3. For stepwise regression, twelve variables were included with fluid balance, laboratory values, and severity of illness being significant predictors (SDC Supplemental Table S4). All models demonstrated similar performance as measured by MSE (SDC Supplemental Table S5). Feature importance graphs are presented in SDC Supplemental Figs. S5–S7).
When variables selected through backward selection were compared with the variables chosen by the Random Forest model, we found MRC-ICU at 24 h to be highly correlated with sex-male, number of IV continuous infusions to be highly correlated with sex-male and age—≥ 65), fluid balance at 24 h (mL) to be highly correlated with admission diagnosis-sepsis/septic shock, laboratory values-serum bicarbonate, and age—≥ 65 (SDC Supplemental Tables S6 and S7). These results indicate high explanatory power exists between the backward selection and random forest variables. The vast majority of cases of fluid overload occur in patients with both high APACHE II and SOFA scores (SDC Supplemental Table S8).
Discussion
Although machine learning models have been shown to outperform traditional regression models in a variety of settings47,48, the potential benefits of machine learning in critical care remain an open field of exploration, in part due to a current lack of rigorous comparison in high quality ICU datasets29,49,50. Our analysis represents the first published comparison of machine learning approaches with traditional regression methods to predict fluid overload using a novel dataset with granular medication data.
We report that machine learning and logistic regression analyses demonstrate a similar predictive power to identify patients with fluid overload on day 3 of their ICU stay. Although use of machine learning did not appear to improve predictive performance over regression analysis, it expanded the number of variables critical to fluid overload prediction and highlights the importance of further artificial intelligence-based exploration in this area. This analysis of individual predictors may help bedside clinicians better understand how the machine learning models work and may help overcome their ‘black box’ hesitancy to trust machine learning-generated results51,52. For example, feature importance graphs for the machine learning analyses found complexity of the daily ICU medication regimen (i.e., MRC-ICU score), which includes the number of intravenous medication infusions (the primary method to administer medications in this population and a primary source of fluids to a patient), to be an important contributor to fluid overload. In comparison, in the traditional multivariable regression, the MRC-ICU score was not associated with fluid overload. This may be because machine learning analyses better account for severity of illness and the response of clinicians to respond to this severity by administering more medication infusions leading to a more complex daily medication regimen; however, the methods applied, including feature importance, preclude causal inference at this juncture. As such, our results highlight the unique power of machine learning to identify complex relationships that can be further elucidated via machine-learning based causal inference modeling and other designs aimed at causation2,20.
Optimizing fluid management (or fluid stewardship) has been previously defined by the ROSE model of Resuscitation, Optimization, Stabilization, and dE-resuscitation35. After an initial 24—48 h period characterized by overt volume resuscitation (e.g., a crystalloid bolus) and IV medication initiation (e.g., antibiotics), and the associated fluid administration, the care priority shifts from volume administration to volume removal. While comprehensive fluid stewardship management strategies including reduced fluid use and diuretic administration can effectively reduce fluid overload and its sequelae, they are often deployed too late1,2. Interestingly, some reports have indicated ‘hidden fluids’ (defined as blood products, enteral nutrition, flushes, and intravenous medications) were significantly associated with the development of fluid overload. During critical illness many of these ‘hidden fluids’ are necessary (e.g., blood products), given that intravenous medications account for over 40% of total fluid intake in this analysis, interventions such as concentrating intravenous medications, employing oral formulations when feasible, careful evaluation of maintenace fluids, and antibiotic de-escalation are potoentially still viable even in high illness severity that can reduce this complication. However, weighing risks and benefits associated with these interventions in this context may be aided by more quantitative prediction data56,57. Overall, de-resuscitation and fluid stewardship can be deceptively complex53. In a patient with shock, balancing the dueling forces of volume responsiveness assessment and timely volume resuscitation with the risks associated with fluid overload represents a highly complex Goldilocks scenario that requires clinicians to have high clinical precision, essentially pivoting ‘on a dime’, from a strategy of aggressive volume expansion to one of rapid volume removal36,54,55.
Despite the complexities of this decision process, limited prediction tools for fluid overload are available to assist clinicians at the ICU bedside. As such, real-time recognition identifying when to make the shift from resuscitation to de-resuscitation has the potential to improve bedside management. However, to go beyond the hourly assessment of ‘Ins and Outs’ would require accurate prediction of future fluid overload risk and the adverse events associated with it, in the time-dependent context of intervention delivery (e.g., diuretics). In such a scenario, an algorithm would be able to accurately interpret a septic patient who is 3 L positive 24 h after fluid resuscitation initiation as being in a ‘green zone’ (i.e., appropriately resuscitated). However, 24 h later, if the same patient is 4 L positive while off vasopressors and with down-trending sepsis markers the algorithm could alert clinicians that the patient is now in a 'yellow zone' where interventions like diuretic therapy and fluid reductions are required to reduce acute kidney injury and intubation risk. This type of real-time predictive capability could support continuous clinician decision-making but requires evaluation outside the scope of our current study.
Fluid overload also presents an important test case for exploring and adapting artificial intelligence methods to ICU problems, particularly those related to ICU medication use. Fluid overload represents a uniquely intervenable event in the ICU. Intervenable events share three key characteristics: they are predictable, preventable, and otherwise associated with poor outcomes. The results of our study, and others, indicate that fluid overload can be predicted with modeling of some kind, especially given its ability to be quantitatively defined56,57,58. Fluid overload has been associated with poor outcomes including acute kidney injury, delirium, poor respiratory outcomes, prolonged length of stay, and potentially increasing mortality2,37,59,60,61,62. Evidence demonstrates the timely recognition and management of fluid overload is feasible and is associated with reduced mortality and time in the ICU3,5,63,64. Notably, fluid stewardship has been adapted by critical care pharmacists as key component of comprehensive medication management5,6,65. As such, these results may support other investigations as they identify patients in whom it is safe to initiate de-resuscitation or importantly never needed that degree of fluid volume initially and at the bedside may prompt clinicians to be more targeted in therapies initiated or aggressive in curtailing early ‘hidden’ fluids to avoid the complications of fluid overload and/or the need for a highly interventional period of de-resuscitation (e.g., diuretics, dialysis). Artificial intelligence may be particularly well suited to bolster these efforts, and thus while feature importance analyses cannot provide foundation for causal inference, they may guide such future investigations.
Our study has limitations. Our patient sample may have been too small to demonstrate superiority of the machine learning approaches compared to traditional regression, and no validation in a separate, external dataset was undertaken at this juncture66. Future studies applying this approach to alternative, larger datasets (e.g., MIMIC-III) should be considered to examine the external validity of our findings. Although MICE is the established approach to address missingness in cohort studies that includes variables that are a composite of several individual patient-specific values (e.g. SOFA), it is possible that some of the values in the imputed datasets that represented our new ground truth may not have been accurate67. Bias may exist due to which patients had fluid balance data available. Other predictors for fluid overload not included in our models may exist68. By relying on prediction data derived in the first 24 h of ICU admission, we did not fully capture the dynamic nature of critical illness over the entire three day ICU period before fluid overload occurred. Future time-dependent evaluations of changing features employing unsupervised learning techniques may yield novel insights.
Conclusion
Fluid overload is an important, intervenable event in the ICU population. Incorporation of medication-related variables and artificial intelligence has demonstrated promise to improve prediction that may ultimately guide timely intervention and mitigation of this ICU complication; however, comparative advantages over traditional modeling techniques may remain warranted.
Data availability
The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.
References
Carr, J. R. et al. Fluid stewardship of maintenance Intravenous fluids. J. Pharm. Pract. 897, 190 (2021).
Hawkins, W. A. et al. Fluid stewardship during critical illness: A call to action. J. Pharm. Pract. 33(6), 863–873 (2020).
Bissell, B. D. et al. Impact of protocolized diuresis for de-resuscitation in the intensive care unit. Crit. Care 24(1), 70 (2020).
Jones, T. W. et al. Early diuretics for de-resuscitation in septic patients with left ventricular dysfunction. Clin. Med. Insights Cardiol. 16, 11795468221095876 (2022).
Hawkins, W. A. et al. From theory to bedside: implementation of fluid stewardship in a medical ICU pharmacy practice. Am. J. Health Syst. Pharm. 79(12), 984–992 (2022).
Bissell, B. D. et al. A narrative review of pharmacologic de-resuscitation in the critically ill. J. Crit. Care 59, 156–162 (2020).
Messmer, A. S. et al. Fluid overload phenotypes in critical illness-a machine learning approach. J. Clin. Med. 11(2), 1 (2022).
Zhang, Z., Ho, K. M. & Hong, Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute kidney injury in critical care. Crit. Care 23(1), 112 (2019).
Raghu, A., Komorowski, M., Celi, L. A., Szolovits, P., & Ghassemi, M. Continuous state-space models for optimal sepsis treatment: A deep reinforcement learning approach. In Machine Learning for Healthcare Conference 2017; pp. 147–163.
Komorowski, M., Celi, L. A., Badawi, O., Gordon, A. C. & Faisal, A. A. The artificial intelligence clinician learns optimal treatment strategies for sepsis in intensive care. Nat. Med. 24(11), 1716–1720 (2018).
Olney, W. J. et al. Medication regimen complexity score as an indicator of fluid balance in critically ill Patients. J. Pharm. Pract. 897, 190 (2021).
Sikora, A. et al. Impact of pharmacists to improve patient care in the critically ill: A large multicenter analysis using meaningful metrics with the medication regimen complexity-ICU (MRC-ICU) score. Crit. Care Med. 50(9), 1318–1328 (2022).
Newsome, A. S. et al. Multicenter validation of a novel medication-regimen complexity scoring tool. Am. J. Health Syst. Pharm. 77(6), 474–478 (2020).
Newsome, A. S. et al. Characterization of changes in medication complexity using a modified scoring tool. Am. J. Health Syst. Pharm. 76(Supplement 4), S92-s95 (2019).
Gwynn, M. E. et al. Development and validation of a medication regimen complexity scoring tool for critically ill patients. Am. J. Health Syst. Pharm. 76(Suppl 2), S34–S40 (2019).
Al-Mamun, M. A., Brothers, T. & Newsome, A. S. Development of machine learning models to validate a medication regimen complexity scoring tool for critically ill patients. Ann. Pharmacother. 55(4), 421–429 (2021).
Smith, S. E., Shelley, R. & Sikora, A. Medication regimen complexity vs patient acuity for predicting critical care pharmacist interventions. Am. J. Health Syst. Pharm. 79(8), 651–655 (2022).
Webb, A. J., Rowe, S. & Newsome, A. S. A descriptive report of the rapid implementation of automated MRC-ICU calculations in the EMR of an academic medical center. Am. J. Health Syst. Pharm. 79(12), 979–983 (2022).
Newsome, A. S. et al. Medication regimen complexity is associated with pharmacist interventions and drug-drug interactions: A use of the novel MRC-ICU scoring tool. J. Am. Coll. Clin. Pharm. 3(1), 47–56 (2020).
Sanchez, P. et al. Causal machine learning for healthcare and precision medicine. R Soc. Open Sci. 9(8), 220638 (2022).
Iwase, S. et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Sci. Rep. 12(1), 12912 (2022).
Beil, M. et al. On predictions in critical care: The individual prognostication fallacy in elderly patients. J. Crit. Care 61, 34–38 (2021).
Lovejoy, C. A., Buch, V. & Maruthappu, M. Artificial intelligence in the intensive care unit. Crit. Care 23(1), 7 (2019).
Gutierrez, G. Artificial intelligence in the intensive care unit. Crit. Care 24(1), 101 (2020).
Goh, K. H. et al. Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat. Commun. 12(1), 711 (2021).
Hyland, S. L. et al. Early prediction of circulatory failure in the intensive care unit using machine learning. Nat. Med. 26(3), 364–373 (2020).
DeGrave, A. J., Janizek, J. D. & Lee, S. I. AI for radiographic COVID-19 detection selects shortcuts over signal. medRxiv 1, 1 (2020).
Nguyen, D., Ngo, B. & vanSonnenberg, E. AI in the intensive care unit: Up-to-date review. J. Intensive Care Med. 36(10), 1115–1123 (2021).
Yoon, J. H., Pinsky, M. R. & Clermont, G. Artificial intelligence in critical care medicine. Crit. Care 26(1), 75 (2022).
Farion, K. J. et al. Comparing predictions made by a prediction model, clinical score, and physicians: Pediatric asthma exacerbations in the emergency department. Appl. Clin. Inf. 4(3), 376–391 (2013).
Feng, J. Z. et al. Comparison between logistic regression and machine learning algorithms on survival prediction of traumatic brain injuries. J. Crit. Care 54, 110–116 (2019).
World Medical Association. World Medical Association Declaration of Helsinki: Ethical principles for medical research involving human subjects. JAMA 312, 2191–2194 (2023).
Von Elm, E. A. D. et al. STROBE Initiative: Strengthening the reporting of observational studies in epidemiology (STROBE) statement: Guidelines for reporting observational studies. BMJ 335, 806–808 (2007).
Bouchard, J. et al. Fluid accumulation, survival and recovery of kidney function in critically ill patients with acute kidney injury. Kidney Int. 76(4), 422–427 (2009).
Carr, J. R. et al. Fluid stewardship of maintenance intravenous fluids. J. Pharm. Pract. 35(5), 769–782 (2022).
Malbrain, M. et al. Principles of fluid management and stewardship in septic shock: It is time to consider the four D’s and the four phases of fluid therapy. Ann. Intensive Care 8(1), 66 (2018).
Claure-Del Granado, R. & Mehta, R. L. Fluid overload in the ICU: Evaluation and management. BMC Nephrol. 17(1), 109 (2016).
O’Connor, M. E. & Prowle, J. R. Fluid overload. Crit. Care Clin. 31(4), 803–821 (2015).
Chen, T., & Guestrin, C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining [Internet]. New York, NY, USA: ACM; 2016. p. 785–94. Available from: https://doi.org/10.1145/2939672.2939785.
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20(3), 273–297 (1995).
Ho, T. K. Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition. p. 278–82 (1995).
Liaw, A. & Wiener, M. Classification and regression by Random Forest. R News 2(3), 18–22 (2002).
Meyer, D., Dimitriadou, E., & Hornik, K., et al. Miscellaneous functions of the Department of Statistics and Probability Theory Group (2023). R Package version 1.7–13. https://CRAN.R-project.org/package=e1071.
Chen, T., He, T., Benesty, M., et al. XGBoost: Extreme gradient boosting. R package versions 1.7.5.1. https://CRAN.R-project.org/package=xgboost.
Li, K. H., Raghunathan, T. E. & Rubin, D. B. Large-sample significance levels from multiply imputed data using moment-based statistics and an F reference distribution. J. Am. Stat. Assoc. 86, 1065–1073 (1991).
Rubin, D. B. Multiple imputation for nonresponse in surveys (Wiley-Interscience, Hoboken, NJ, 2004).
Topol, E. J. Deep medicine: how artificial intelligence can make healthcare human again. First edition. pp 1 online resource (Basic Books, New York, 2019).
Kahneman, D., Sibony, O., & Sunstein, C. R. Noise: A flaw in human judgment. First edition. Edition (Little, Brown Spark, New York, 2021)
D’Hondt, E. et al. Identifying and evaluating barriers for the implementation of machine learning in the intensive care unit. Commun. Med. (Lond) 2(1), 162 (2022).
van de Sande, D. et al. Moving from bytes to bedside: A systematic review on the use of artificial intelligence in the intensive care unit. Intensive Care Med. 47(7), 750–760 (2021).
Moss, L. et al. Demystifying the black box: The importance of interpretability of predictive models in neurocritical care. Neurocrit. Care 37(Suppl 2), 185–191 (2022).
The Lancet Respiratory M: Opening the black box of machine learning. Lancet Respir. Med. 6(11), 801 (2018).
Malbrain, M., Martin, G. & Ostermann, M. Everything you need to know about deresuscitation. Intensive Care Med. 48(12), 1781–1786 (2022).
Gelbart, B. et al. Fluid accumulation in mechanically ventilated, critically ill children: retrospective cohort study of prevalence and outcome. Pediatr. Crit. Care Med. 23(12), 990–998 (2022).
National Heart L, Blood Institute Acute Respiratory Distress Syndrome Clinical Trials N, Wiedemann, H. P., et al. Comparison of two fluid-management strategies in acute lung injury. N. Engl. J. Med. 354(24), 2564–2575 (2006).
Gamble, K. C. et al. Hidden fluids in plain sight: Identifying intravenous medication classes as contributors to intensive care unit fluid intake. Hosp. Pharm. 57(2), 230–236 (2022).
Branan, T. et al. Association of hidden fluid administration with development of fluid overload reveals opportunities for targeted fluid minimization. SAGE Open Med. 8, 2050312120979464 (2020).
Mitchell, K. H. et al. Volume Overload: prevalence, risk factors, and functional outcome in survivors of septic shock. Ann. Am. Thorac. Soc. 12(12), 1837–1844 (2015).
Ouchi, A. et al. Association between fluid overload and delirium/coma in mechanically ventilated patients. Acute Med. Surg. 7(1), e508 (2020).
Murphy, C. V. et al. The importance of fluid management in acute lung injury secondary to septic shock. Chest 136(1), 102–109 (2009).
Boyd, J. H. et al. Fluid resuscitation in septic shock: A positive fluid balance and elevated central venous pressure are associated with increased mortality. Crit. Care Med. 39(2), 259–265 (2011).
Woodward, C. W. et al. Fluid overload associates with major adverse kidney events in critically ill patients with acute kidney injury requiring continuous renal replacement therapy. Crit. Care Med. 47(9), e753–e760 (2019).
Silversides, J. A., Perner, A. & Malbrain, M. Liberal versus restrictive fluid therapy in critically ill patients. Intensive Care Med. 45(10), 1440–1442 (2019).
Goldstein, S. et al. Pharmacological management of fluid overload. Br. J. Anaesth. 113(5), 756–763 (2014).
Silversides, J. A. et al. Fluid management and deresuscitation practices: A survey of critical care physicians. J. Intensive Care Soc. 21(2), 111–118 (2020).
Burkov, A. The hundred-page machine learning book (Quebec City, Canada, Andriy Burkov, 2019).
O’Keefe, A. G., Farewell, D. M., Tom, B. D. M. & Farewell, V. T. Multiple imputation of missing composite outcomes in longitudinal data. Stat. Biosci. 8(2), 310–332 (2016).
Qin, X. et al. A deep learning model to identify the fluid overload status in critically ill patients based on chest X-ray images. Pol. Arch. Intern. Med. 133(2), 1 (2023).
Acknowledgements
Data acquisition were supported by NC TraCS, funded by Grant Number UL1TR002489 from the National Center for Advancing Translations Sciences at the National Institutes of Health, and Data Analytics at the University of North Carolina Medical Center Department of Pharmacy.
Funding
Funding through Agency of Healthcare Research and Quality for Drs. Devlin, Murphy, Sikora, Smith, and Kamaleswaran was provided through R21HS028485 and R01HS029009.
Author information
Authors and Affiliations
Contributions
A.S. was responsible for project execution, design, and initial manuscript writing. J.D., D.M., and R.K. provided critical revisions of manuscript, data interpretation, and senior level oversight. M.Y., T.Z, and X.C. handled data pre-processing and analysis (M.Y., T.Z.) and methodology support and data interpretation (X.C., R.K.). B.M. served as site coordinator for all data validation and procurement as well as manuscript revisions and data interpretation. S.S., M.B., and S.R. provided clinical interpretation, results interpretation, and manuscript revision.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sikora, A., Zhang, T., Murphy, D.J. et al. Machine learning vs. traditional regression analysis for fluid overload prediction in the ICU. Sci Rep 13, 19654 (2023). https://doi.org/10.1038/s41598-023-46735-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-46735-3




