Development and validation of a simple machine learning tool to predict mortality in leptospirosis

Galdino, Gabriela Studart; de Sandes-Freitas, Tainá Veras; de Andrade, Luis Gustavo Modelli; Adamian, Caio Manuel Caetano; Meneses, Gdayllon Cavalcante; da Silva Junior, Geraldo Bezerra; de Francesco Daher, Elizabeth

doi:10.1038/s41598-023-31707-4

Download PDF

Article
Open access
Published: 18 March 2023

Development and validation of a simple machine learning tool to predict mortality in leptospirosis

Scientific Reports volume 13, Article number: 4506 (2023) Cite this article

1426 Accesses
1 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Predicting risk factors for death in leptospirosis is challenging, and identifying high-risk patients is crucial as it might expedite the start of life-saving supportive care. Admission data of 295 leptospirosis patients were enrolled, and a machine-learning approach was used to fit models in a derivation cohort. The comparison of accuracy metrics was performed with two previous models—SPIRO score and quick SOFA score. A Lasso regression analysis was the selected model, demonstrating the best accuracy to predict mortality in leptospirosis [area under the curve (AUC-ROC) = 0.776]. A score-based prediction was carried out with the coefficients of this model and named LeptoScore. Then, to simplify the predictive tool, a new score was built by attributing points to the predictors with importance values higher than 1. The simplified score, named QuickLepto, has five variables (age > 40 years; lethargy; pulmonary symptom; mean arterial pressure < 80 mmHg and hematocrit < 30%) and good predictive accuracy (AUC-ROC = 0.788). LeptoScore and QuickLepto had better accuracy to predict mortality in patients with leptospirosis when compared to SPIRO score (AUC-ROC = 0.500) and quick SOFA score (AUC-ROC = 0.782). The main result is a new scoring system, the QuickLepto, that is a simple and useful tool to predict death in leptospirosis patients at hospital admission.

Using machine learning tools to predict outcomes for emergency department intensive care unit patients

Article Open access 01 December 2020

Machine learning-based prediction of in-ICU mortality in pneumonia patients

Article Open access 17 July 2023

A prediction model of outcome of SARS-CoV-2 pneumonia based on laboratory findings

Article Open access 20 August 2020

Introduction

Leptospirosis is a worldwide neglected zoonotic disease caused by pathogenic spirochetes from the genus Leptospira, mainly L. interrogans, with higher prevalence in tropical countries¹. It is a waterborne disease transmitted through rat urine, and its outbreaks occur during rainy seasons². The disease mainly affects low-income populations and individuals exposed to contaminated animals and environments, such as farmers, veterinarians, sewage workers, meat inspectors, rodent control workers, and military personnel².

Leptospirosis is a major concern for public health due to its high morbidity and mortality rates. It is estimated that 1.03 million new cases occur worldwide annually, along with 58,900 or even more deaths, mostly among young adult males³. Many cases of leptospirosis are undiagnosed or misdiagnosed as other tropical febrile illnesses, concealing its actual burden⁴. Therefore, the World Health Organization (WHO) considers leptospirosis an important neglected tropical zoonosis due to its underestimated incidence and high mortality⁵. The median mortality in a recent review was 10.05% [(range 0–33.3%)]⁶.

Most patients manifest an asymptomatic or mild disease with fever, chills, headache, and myalgia. Ocular and cutaneous manifestations have also been described^7,8.

However, 10% of affected individuals may develop a life-threatening condition, characterized by hepatic dysfunction with rubinic jaundice and acute kidney injury (AKI), known as Weil’s syndrome, in addition to pulmonary hemorrhage, gastrointestinal symptoms, coagulopathy, electrolyte disturbances and myocarditis, as well as liver failure and neurological symptoms^4,9.

The diagnosis of leptospirosis is challenging and cumbersome due to unspecific initial presentation that mimics other bacterial and viral infections, hampering early recognition, and the lack of a standard testing technique to check infection at all stages. Additionally, some endemic areas are unprovided of adequate laboratory resources and infrastructure as well as well trained staff¹⁰.

In an attempt to improve leptospirosis prognosis, previous studies have investigated risk factors for poor outcomes^{6,11,12,13,14}. However, to the best of our knowledge, only one predictor of outcomes is available, SPIRO. This tool, built with data from patients hospitalized for leptospirosis in Australia, was based on three variables obtained at any time during hospitalization: abnormal auscultatory findings on respiratory examination, hypotension and oliguria. As limitations, no external validation was performed, and assessed end-point was a composite outcome of pulmonary hemorrhage, or intensive care unit (ICU) admission, or requirement for renal replacement therapy (RRT), or intubation, or need for vasoactive drugs¹⁵. Although these variables are classically associated with death, the indication for dialysis and ICU depend on the center's routine and logistics, which may preclude extrapolation to other centers. Thus, assessing death as an outcome would be more accurate. In the absence of specific predictors for leptospirosis, the classic predictors of death in septic patients have been used, such as the quick SOFA¹⁶. However, septic patients due to leptospirosis often have peculiar organic involvements that can have a distinct impact on outcomes.

Hence, aiming to assess a hard and objectively measurable endpoint, and focus on the possibility of early intervention, we proposed a new score constructed using machine learning techniques to predict death based on admission variable.

Materials and methods

Study design, population and ethics

This was a retrospective multicenter cohort study carried out from January 2005 to December 2019, including all patients with leptospirosis consecutively admitted to three tertiary reference hospitals in Fortaleza, state of Ceara, Brazil.

Patients with confirmed diagnosis of leptospirosis were included. The criteria for leptospirosis diagnosis included the presence of a positive serology result with a microscopic agglutination test (MAT) titer higher than 1:800, or ELISA assay for the detection of immunoglobulin M (IgM) antibodies associated with an epidemiological and clinical history compatible with leptospirosis. Patients with insufficient data for the diagnosis and those with concomitant acute infectious diseases (e.g., hepatitis A, HIV, dengue, typhoid fever) were excluded.

The study protocol was conducted in agreement with the Declaration of Helsinki and with resolution 466/2012 of the National Health Council, which regulates ethics in human research in Brazil. The Local Institutional Review Boards (IRB) of the three participating hospitals (Hospital São José de Doenças Infecciosas, Hospital Universitário Walter Cantídio, and Hospital Geral Fortaleza) have approved this study (no. 65452016.2.3001.5044). Due to the observational and retrospective nature of the study, using de-identified data, the IRBs waived the obtention of informed consent.

Assessed parameters

Data were collected from the medical records, and patients were followed from hospital admission until death or hospital discharge, whichever comes first. Demographic and hospitalization characteristics, such as age, gender, the time between symptoms onset and hospital admission, and length of hospital stay were recorded. The clinical investigation included a record of clinical signs and symptoms presented at hospital admission, vital signs at admission (systolic and diastolic blood pressure, heart rate, and respiratory rate), acute kidney injury (AKI) development, and need for dialysis during hospitalization. Laboratory data collected within 24 h of hospital admission included serum urea, creatinine, sodium, potassium, direct bilirubin, indirect bilirubin, aspartate aminotransferase (AST), alanine aminotransferase (ALT), lactate dehydrogenase (LDH), creatine phosphokinase (CK), hemoglobin, hematocrit, white blood cell (WBC) count, platelet count, and arterial blood gas analysis.

AKI was defined according to the Kidney Disease Improving Global Outcomes (KDIGO) criteria¹⁷. Tachypnea was defined as a respiratory rate higher than 22 breaths per minute. Oliguria was defined as urine output < 400 mL/day after 24 h of effective hydration. Hypotension was defined as mean arterial blood pressure (MAP) < 60 mmHg, and therapy with vasoactive drugs was initiated when MAP remained lower than 60 mmHg despite the administration of parenteral fluids. Symptoms of pulmonary involvement were defined by the occurrence of coughing, crackles, or hemoptysis. Symptoms of lethargy were defined by the presence of sensory alterations, including disorientation, lethargy, and agitation.

Outcome

The main evaluated outcome was in-hospital death.

Statistical analysis

Exploratory data analysis

All variables of interest were compared between patients who survived and those who died during the hospital stay.

Predictive model—pre-processing step

We removed the variables with more than 30% of missing values (14% of the predictors) and imputed the others (Supporting information—S2-Table 2). A k-nearest neighbors (KNN) algorithm was used for the imputation method to account for missing values. All predictor variables were used to compute Gower's distance and the five nearest neighbors in the KNN imputation model. Once the nearest neighbors are determined, the model is used to impute nominal variables, and the mean is used for numerical data.

The continuous variables were standardized by subtracting their values from the mean (center) and dividing them by the standard deviation (scale). Continuous variables were transformed using Box–Cox transformation. Variables with zero or near-zero variance were removed from the model. In the feature engineering process for Lasso regression, natural splines with four degrees of freedom for age were chosen to account for the non-linearity.

For the class imbalance adjustment, the Synthetic Minority Over-sampling Technique (SMOTE) was used to create synthetic classes in the training set. The SMOTE algorithm generated new examples of the minority class using the nearest neighbors of these cases. This approach was used to balance the target class. All the pre-processing steps were performed in the training set.

Feature selection

We used the Boruta algorithm to select the most important predictors. The Boruta algorithm is a feature selection method that classifies which features are important and which are not. The Boruta algorithm uses feature importance scores, which are provided by random forest. The importance measure of an attribute is obtained as the loss of classification accuracy caused by the random permutation of attribute values between objects. It is computed separately for all trees in the forest that use a given attribute for classification. Then the average and standard deviation of the accuracy loss are computed¹⁸. The method performs a top-down search for relevant features by comparing the importance of original attributes and progressively eliminating irrelevant features¹⁹. Features considered not important by the Boruta algorithm were removed. (Supporting information—S2-Table 2). We apply the feature selection in the training set.

Model training

We split the data into derivation (training) and validation (test) datasets. To create the datasets, a random split was used, stratified by the target into training (80%) and test set (20%). In the training set (derivation cohort), bootstrap resampling was used to select the hyperparameters of the models and to reduce the bias.

We fitted gradient boosting decision trees (xgBoost), and Lasso regression to develop the candidate equations. Finally, the best hyperparameters were selected using machine learning approaches by bootstrap resampling in a training set aimed to maximize the area under the receiver operating characteristic (ROC) curve.

Assessment of accuracy

The accuracy of the derivation cohort model was tested on the data of the validation cohort. The area under the ROC curve (AUC-ROC) was used to discriminate the ability of the models in the training and test sets. The 95% confidence interval (95%CI) of the AUC-ROC was estimated by bootstrap resampling (2000 samples) to reduce overfit bias. Additionally, the balanced accuracy, sensitivity and specificity were evaluated. Additionally, we estimate the best cut-point for ROC curve using the method of maximize the metric function, and J-Index metric using 1000 bootstrap resamples.

Score fit and model visualization

The model with higher AUC-ROC in the validation cohort associated with better balanced accuracy values was used to build the new score named LeptoScore. Subsequently, a quick score (QuickLepto) was developed using the importance values of the highest coefficients of Lasso regression. For the development of QuickLepto for numerical predictors we discretized the data using the cutoff derived from a Classification and Regression Trees for Machine Learning (CART) tree.

Accuracy metrics for previously published models

The final models (LeptoScore and QuickLepto) were compared with SPIRO and quick SOFA. SPIRO predict severe disease in patients with leptospirosis (pulmonary hemorrhage, or intensive care unit (ICU) admission, or requirement for renal replacement therapy (RRT), or intubation, or need for vasoactive drugs and is based on the following variables: oliguria (urine output ≤ 500 mL/24 h), abnormal auscultatory findings on respiratory examination and hypotension (systolic blood pressure ≤ 100 mmHg)¹⁵. The quick SOFA is a three-point score broadly used to identify high-risk patients for in-hospital mortality with suspected infection outside the ICU. Altered mental status (coma Glasgow score < 15), respiratory rate ≥ 22 breaths per minute and systolic BP ≤ 100 mmHg are the predictors of this score¹⁶. Given the relevance of these scores, we applied them (SPIRO and quick SOFA) in our dataset to compare the predictive values with the new LeptoScore and QuickLepto models.

The software R, version 4.0.2 and the tidymodels packages, and the R package “glmnet” statistical software (R Foundation) were used to perform the Lasso regression.

Results

Leptospirosis patients’ characteristics at hospital admission

A total of 295 leptospirosis patients were included. Death was observed in 32 cases (11%). The population was primarily young adults, with a median age of 36 years (25–49) and 86% were males. The median time from hospital admission to symptoms was 7 (5–8) days. Fever and chills were the most frequent symptoms (93%), followed by myalgia (78%) (Supporting information—S1-Table 1). The univariate analysis is shown in Supporting information—S1-Table 1.

Predictive model

Patients were randomly grouped into two cohorts: the derivation cohort or training set (n = 235, 80%) and the internal validation cohort (test set) (n = 60, 20%).

There was a total of 63 predictors, and six were removed due to higher missing values (higher than 30%). Because there was a high number of predictors, feature selection was used, resulting in 14 possible candidate predictors. After that, three collinear predictors (Supporting information—S3-Table 3) were also removed. After that, predictive models were fitted using the final predictors (n = 11 predictors).

Several models were fitted with bootstrap resampling and the performance of these models was analyzed throughout the area under the curve of the receiver operating characteristic curves (AUC-ROC) in the derivation cohort. The AUC-ROC were 0.738, and 0.772 in the xgBoost and Lasso models, respectively. As a second step, the performance of these models was tested in the internal validation cohort. The AUC-ROC were 0.703 (0.414–0.987) and 0.776 (0.601–0.951) for the xgBoost and Lasso models, respectively (Table 1). The Lasso model had higher values of balance accuracy and specificity when compared to xgBoost (Table 1). Additionally, we plotted a confusion matrix of mortality in the derivative cohort as shown in Supporting information—S4-Fig. 1.

Table 1 Performance metrics of leptospirosis mortality models in the derivation and validation cohorts.

Full size table

Making a score-based prediction

The results of the Lasso model regression showed that older age, the lethargy symptom, pulmonary involvement symptom, higher alanine aminotransferase (ALT) values, higher direct bilirubin values, and higher leukocyte levels were related to death. In contrast, a higher hematocrit level, higher mean arterial pressure, higher urea and sodium values, and higher platelet levels were related to survival (Supporting information—S5-Fig. 2). The coefficients of the Lasso model were used to build the LeptoScore (Supporting information—S5-Fig. 2).

Quick score (QuickLepto)—fit

To create QuickLepto, continuous variables were discretized based on a CART tree (Supporting information—S6-Fig. 3). Then, we used the coefficients of Lasso regression and mapped them in round numbers considering their absolute values. For variables whose Lasso regression coefficients were above 2, it was attributed 2 points (age); for those variables whose Lasso coefficients ranged from 0.5 and 2, it was given 1 point (pulmonary involvement, lethargy, hematocrit, and MAP). Those below 0.5 we excluded from QuickLepto (serum urea, sodium, bilirubin, ALT, leucocytes, platelets) (Supporting information—S7-Table 4).

The QuickLepto uses 5 predictors (Fig. 1):

1.
Age over 40 years: 2 points
2.
Presence of the lethargy symptom: 1 point
3.
Presence of the pulmonary symptom: 1 point
4.
Mean Arterial Pressure < 80 mmHg: 1 point
5.
Hematocrit < 30%: 1 point

The AUC-ROC for QuickLepto was 0.788 [95% CI 0.693–0.883]. Accuracy, balanced accuracy, sensitivity, and specificity values using a cutoff of three or more points, are shown in Table 2, Fig. 1. The best cut-off for AUC-ROC was 0.778.

Table 2 Performance metrics of LeptoScore and QuickLepto in validation cohorts.

Full size table

Comparison of accuracy metrics with previous models

The performances of LeptoScore and QuickLepto were compared with two other models, one derived from a population with leptospirosis and another model used in septic patients. The results are shown in Table 3. The accuracy, balanced accuracy, sensitivity, and specificity values were, respectively: 0.50, 0.71, 0.43, and 1.00 for the SPIRO score, and 0.78, 0.56, 0.84, and 0.28 for the model derived from quick SOFA. Therefore, all of them resulted in low specificity and/or lower sensitivity for patients with leptospirosis, showing a lower performance than the LeptoScore and QuickLepto score. Thus, the LeptoScore and QuickLepto score had a better balance between sensitivity and specificity.

Table 3 Comparison of accuracy metrics with the previous predictive model.

Full size table

Discussion

This is the first study to predict mortality in human leptospirosis through a machine learning model in a high prevalence area, which was called LeptoScore. A quick score was also developed (QuickLepto), which could be easily applied and attained a similar performance to that of the complete model. Using age, two clinical symptoms, mean arterial pressure measure, and hematocrit values, it was possible to predict death at hospital admission with a high discriminatory power. Although many studies^11,12,14, including a systematic review⁶, had found some independent predictors of mortality, this is the first study that established a hospital admission tool that is easy to use and has the best balance between sensitivity and specificity to predict death in human leptospirosis.

Previous predictive models in leptospirosis showed high performance but used combined data obtained at admission and other moments during hospital stay. For example, the SPIRO score predicts leptospirosis severity using: oliguria (urine output ≤ 500 mL/24 h), abnormal auscultatory findings on respiratory examination and hypotension (systolic blood pressure ≤ 100 mmHg)¹⁵. The presence of oliguria must be evaluated 24 h after and not promptly at hospital admission. Aiming to focus on early intervention, the new score was developed using admission variables and evaluated a hard and objectively measurable endpoint, death.

The present study had 11% of mortality (32/295), a similar finding to that of previous studies, which was around 10% (range of 0–33.3%)⁶. Independent risk factors for mortality in leptospirosis-associated AKI reported in a systematic review were oliguria, jaundice, arrhythmia, crackles, elevated direct bilirubin level, elevated activated prothrombin time, hyperbilirubinemia and leukocytosis^6,20. We found similar mortality predictors in the present cohort, not exclusively related to the presence of AKI.

A study conducted in patients with leptospirosis in intensive care units showed that the Simplified Acute Physiology Score (SAPS) showed a worse performance in relation to mortality. The mortality of patients with leptospirosis was lower than that predicted by the SAPS score²¹. This suggests the need for a specific predictive model for patients with leptospirosis. Confirming these findings, the quick SOFA, which was a general score for septic patients, showed an inferior performance than the LeptoScore.

Three of the five predictors included in QuickLepto are similar to the parameters used in quick SOFA, but anemia (Hematocrit < 30%) and age over 40 years, the most important variables, were not included.

In line with previous studies, our results showed that anemia in leptospirosis patients was associated with poor outcomes. In a prospective observational study, hemoglobin (Hb) levels lower than 11 g/dL were associated with severe forms of the disease (70% versus 14.8%; OR = 16.2 [95% CI 3.9–66.9])²². Another study has shown that ICU patients with leptospirosis had lower levels of hemoglobin than those treated in hospital wards (10.2 ± 2.4 vs. 11.6 ± 1.9 g/dL, p < 0.0001)²³.

Daher et al. have previously shown that age is a crucial predictor of outcomes. Elderly patients with leptospirosis showed less hemodynamic impairment on admission, higher incidence of AKI (OR 2.049, 95% CI 1.207–3.477), and a higher frequency of death (OR 3.520, 95% CI 1.940–6.386) during hospital stay than younger patients¹³.

This study has some limitations that are mainly due to its retrospective design and the fact that data were collected over 14 years. Although the long period, the main treatment guidelines remain the same. Our previous article showed that the mortality rate had dropped each decade since 1985, which probably reflects early diagnosis and the provision of adequate treatment. The QuickLepto did not have an external cohort validation. Although we performed the validation metrics in an independent test set, the results of QuickLepto need further external validation cohorts. This was especially true for the patients that scored more than 3 points because the number of patients that ranked 4–5 points was lower in the present dataset. On the other hand, only basic hospital admission data were included, the statistical models used in the study are very sophisticated, and it is one of the largest samples ever studied.

In conclusion, patient age, presence of lethargy or pulmonary symptoms, arterial hypotension, and anemia were associated with death in patients with leptospirosis requiring hospitalization. These variables were selected to fit a new scoring system, the QuickLepto, a simple and useful tool to predict death in leptospirosis patients at hospital admission. Despite its good accuracy in predicting death, the LeptoScore is more complex, requiring specific calculators. Thus, we encourage physicians in the clinical setting to use QuickLepto to predict outcomes and make decisions, such as choosing the appropriate ward, allocating staff, and prescribing treatments and interventions. The next step is its validation in a prospective sample, especially in different populations, to provide overall appropriateness and demonstrate its significant usefulness in resource-limited settings with the greatest clinical burden.

Data availability

The dataset supporting the conclusions of this article is available upon reasonable request from the corresponding author (GSG).

References

Ko, A. I., Goarant, C. & Picardeau, M. Leptospira: The dawn of the molecular genetics era for an emerging zoonotic pathogen. Nat. Rev. Microbiol. 7, 736–747 (2009).
Article CAS PubMed PubMed Central Google Scholar
Pappas, G., Papadimitriou, P., Siozopoulou, V., Christou, L. & Akritidis, N. The globalization of leptospirosis: Worldwide incidence trends. Int. J. Infect. Dis. https://doi.org/10.1016/j.ijid.2007.09.011 (2008).
Article PubMed Google Scholar
Costa, F. et al. Global morbidity and mortality of leptospirosis: A systematic review. PLoS Negl. Trop. Dis. 9, e0003898 (2015).
Article PubMed PubMed Central Google Scholar
Haake, D. A. & Levett, P. N. Leptospirosis in humans. Curr. Top. Microbiol. Immunol. 387, 65–97 (2015).
CAS PubMed PubMed Central Google Scholar
Abela-Ridder, B., Sikkema, R. & Hartskeerl, R. A. Estimating the burden of human leptospirosis. Int. J. Antimicrob. Agents 36, S5–S7 (2010).
Article CAS PubMed Google Scholar
Al Hariri, Y. K., Sulaiman, S. A. S., Khan, A. H., Adnan, A. S. & Al Ebrahem, S. Q. Mortality of leptospirosis associated acute kidney injury (LAKI) & predictors for its development in adults: A systematic review. J. Infect. Public Health 12, 751–759 (2019).
Article PubMed Google Scholar
Puca, E. et al. Ocular and cutaneous manifestation of leptospirosis acquired in Albania: A retrospective analysis with implications for travel medicine. Travel Med. Infect. Dis. 14, 143–147 (2016).
Article PubMed Google Scholar
Arrieta-Bechara, C. E. & Carrascal-Maldonado, A. Y. Ocular leptospirosis: A review of current state of art of a neglected disease. Rom. J. Ophthalmol. 66, 282–288 (2022).
PubMed PubMed Central Google Scholar
Daher, E. F. et al. Different patterns in a cohort of patients with severe leptospirosis (Weil Syndrome): Effects of an educational program in an endemic area. Am. J. Trop. Med. Hyg. 85, 479–484 (2011).
Article PubMed PubMed Central Google Scholar
Karpagam, K. B. & Ganesh, B. Leptospirosis: A neglected tropical zoonotic infection of public health importance—an updated review. Eur. J. Clin. Microbiol. Infect. Dis. 39, 835–846 (2020).
Article CAS PubMed Google Scholar
Wang, H.-K., Lee, M.-H., Chen, Y.-C., Hsueh, P.-R. & Chang, S.-C. Factors associated with severity and mortality in patients with confirmed leptospirosis at a regional hospital in northern Taiwan. J. Microbiol. Immunol. Infect. 53, 307–314 (2020).
Article PubMed Google Scholar
De Francesco Daher, E. et al. Changing patterns in leptospirosis: A three-decade study in Brazil. Int. J. Infect. Dis. 60, 4–10 (2017).
Article PubMed Google Scholar
Daher, E. D. F. et al. Leptospirosis in the elderly: The role of age as a predictor of poor outcomes in hospitalized patients. Pathog. Glob. Health 113, 117–123 (2019).
Article PubMed PubMed Central Google Scholar
Goswami, R. P. et al. Predictors of mortality in leptospirosis: An observational study from two hospitals in Kolkata, eastern India. Trans. R. Soc. Trop. Med. Hyg. 108, 791–796 (2014).
Article CAS PubMed Google Scholar
Smith, S. et al. A simple score to predict severe leptospirosis. PLoS Negl. Trop. Dis. 13, 1–13 (2019).
Article Google Scholar
Singer, M. et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3) Clinical Review & Education Special Communication|caring for the critically ill patient. JAMA 315, 801–810 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kellum, J. A., Lameire, N., KDIGO AKI Guideline Work Group. Diagnosis, evaluation, and management of acute kidney injury: A KDIGO summary (Part 1). Crit. Care 17, 204 (2013).
Article PubMed PubMed Central Google Scholar
Ponce, D., de Andrade, L. G. M., Del Granado, R. C., Ferreiro-Fuentes, A. & Lombardi, R. Development of a prediction score for in-hospital mortality in COVID-19 patients with acute kidney injury: A machine learning approach. Sci. Rep. 11, 1–13 (2021).
Article Google Scholar
Kursa, M. B. & Rudnicki, W. R. Feature selection with the Boruta package. J. Stat. Softw. 36, 1–13 (2010).
Article Google Scholar
Rista, E. et al. Acute kidney injury in leptospirosis: A country-level report. Travel Med. Infect. Dis. 49, 102359 (2022).
Article PubMed Google Scholar
Delmas, B. et al. Leptospirosis in ICU: A retrospective study of 134 consecutive admissions. Crit. Care Med. 46, 93–99 (2018).
Article PubMed Google Scholar
Biscornet, L. et al. An observational study of human leptospirosis in Seychelles. Am. J. Trop. Med. Hyg. 103, 999–1008 (2020).
Article CAS PubMed PubMed Central Google Scholar
De Francesco Daher, E. et al. Risk factors for intensive care unit admission in patients with severe leptospirosis: A comparative study according to patients’ severity. BMC Infect. Dis. 16, 1–7 (2016).
Google Scholar

Download references

Acknowledgements

We are grateful to the team of clinicians, residents, medical students and nurses from Hospital São José de Doenças Infecciosas, Hospital Universitário Walter Cantídio, and Hospital Geral Fortaleza for the assistance provided to patients and for the technical support that aided in the development of this research.

Funding

This study was supported by the Brazilian Research Council for Scientific and Technological Development (CNPq) with financial support by protocol 405963/2016-5 and grants to EFD (Process Number: 302017/2018-6), GBSJ (Process Number: 310974/2020-8) and TVSF (Process Number: 305664/2021-2), and by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) with grants to GCM (Process Number: 88882.306447/2018-01). We have also received financial support from the Edson Queiroz Foundation/University of Fortaleza.

Author information

Authors and Affiliations

Medical Sciences Postgraduate Program, Federal University of Ceará, Rua Silva Jatahy 1000 ap 600, Fortaleza, Ceará, 60165-070, Brazil
Gabriela Studart Galdino, Tainá Veras de Sandes-Freitas, Gdayllon Cavalcante Meneses, Geraldo Bezerra da Silva Junior & Elizabeth de Francesco Daher
Hospital Universitário Walter Cantídio, Federal University of Ceará, Fortaleza, Ceará, Brazil
Gabriela Studart Galdino, Tainá Veras de Sandes-Freitas, Caio Manuel Caetano Adamian & Geraldo Bezerra da Silva Junior
Hospital Geral de Fortaleza, Fortaleza, Ceara, Brazil
Tainá Veras de Sandes-Freitas
Botucatu Medical School, Universidade Estadual Paulista, Botucatu, São Paulo, Brazil
Luis Gustavo Modelli de Andrade
School of Medicine, Medical Sciences and Public Health Postgraduate Programs, University of Fortaleza, Fortaleza, Ceará, Brazil
Geraldo Bezerra da Silva Junior

Authors

Gabriela Studart Galdino
View author publications
You can also search for this author in PubMed Google Scholar
Tainá Veras de Sandes-Freitas
View author publications
You can also search for this author in PubMed Google Scholar
Luis Gustavo Modelli de Andrade
View author publications
You can also search for this author in PubMed Google Scholar
Caio Manuel Caetano Adamian
View author publications
You can also search for this author in PubMed Google Scholar
Gdayllon Cavalcante Meneses
View author publications
You can also search for this author in PubMed Google Scholar
Geraldo Bezerra da Silva Junior
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth de Francesco Daher
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors confirm contribution to the paper as follows: study conception and design: G.S.G., T.V.S.F., E.F.D.; data collection: C.M.C.A.; analysis and interpretation of results: G.S.G., T.V.S.F., L.G.M.A.; draft manuscript preparation: G.S.G., T.V.S.F., C.M.C.A., G.C.M., G.B.S.J., E.F.D. All authors reviewed the results and approved the final version of the manuscript.

Corresponding author

Correspondence to Gabriela Studart Galdino.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Galdino, G.S., de Sandes-Freitas, T.V., de Andrade, L.G.M. et al. Development and validation of a simple machine learning tool to predict mortality in leptospirosis. Sci Rep 13, 4506 (2023). https://doi.org/10.1038/s41598-023-31707-4

Download citation

Received: 29 September 2022
Accepted: 16 March 2023
Published: 18 March 2023
DOI: https://doi.org/10.1038/s41598-023-31707-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Using machine learning tools to predict outcomes for emergency department intensive care unit patients

Machine learning-based prediction of in-ICU mortality in pneumonia patients

A prediction model of outcome of SARS-CoV-2 pneumonia based on laboratory findings

Introduction

Materials and methods

Study design, population and ethics

Assessed parameters

Outcome

Statistical analysis

Exploratory data analysis

Predictive model—pre-processing step

Feature selection

Model training

Assessment of accuracy

Score fit and model visualization

Accuracy metrics for previously published models

Results

Leptospirosis patients’ characteristics at hospital admission

Predictive model

Making a score-based prediction

Quick score (QuickLepto)—fit

Comparison of accuracy metrics with previous models

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links