Prediction of ciprofloxacin resistance in hospitalized patients using machine learning

Mintz, Igor; Chowers, Michal; Obolski, Uri

doi:10.1038/s43856-023-00275-z

Download PDF

Article
Open access
Published: 28 March 2023

Prediction of ciprofloxacin resistance in hospitalized patients using machine learning

Communications Medicine volume 3, Article number: 43 (2023) Cite this article

4531 Accesses
3 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Background

Ciprofloxacin is a widely used antibiotic that has lost efficiency due to extensive resistance. We developed machine learning (ML) models that predict the probability of ciprofloxacin resistance in hospitalized patients.

Methods

Data were collected from electronic records of hospitalized patients with positive bacterial cultures, during 2016-2019. Susceptibility results to ciprofloxacin (n = 10,053 cultures) were obtained for Escherichia coli, Klebsiella pneumoniae, Morganella morganii, Pseudomonas aeruginosa, Proteus mirabilis and Staphylococcus aureus. An ensemble model, combining several base models, was developed to predict ciprofloxacin resistant cultures, either with (gnostic) or without (agnostic) information on the infecting bacterial species.

Results

The ensemble models’ predictions are well-calibrated, and yield ROC-AUCs (area under the receiver operating characteristic curve) of 0.737 (95%CI 0.715–0.758) and 0.837 (95%CI 0.821–0.854) on independent test-sets for the agnostic and gnostic datasets, respectively. Shapley additive explanations analysis identifies that influential variables are related to resistance of previous infections, where patients arrived from (hospital, nursing home, etc.), and recent resistance frequencies in the hospital. A decision curve analysis reveals that implementing our models can be beneficial in a wide range of cost-benefits considerations of ciprofloxacin administration.

Conclusions

This study develops ML models to predict ciprofloxacin resistance in hospitalized patients. The models achieve high predictive ability, are well calibrated, have substantial net-benefit across a wide range of conditions, and rely on predictors consistent with the literature. This is a further step on the way to inclusion of ML decision support systems into clinical practice.

Plain language summary

Ciprofloxacin is an antibiotic commonly used to treat various infections. Due to the frequent use of ciprofloxacin, bacteria have developed high rates of resistance to it, which means they continue to grow, reducing the effectiveness of treatment. The aim of this study was to develop computer code to predict ciprofloxacin resistance in hospitalized patients. We used data from medical records and tests of whether particular bacteria could be killed by antibiotics from a large hospital in Israel to develop the computer code. The computational model accurately predicted resistance. This model could enable antibiotic treatment to be more appropriately targeted to patients that would benefit from it and reduce the amount of bacteria resistant to ciprofloxacin.

Interpretable machine learning-based decision support for prediction of antibiotic resistance for complicated urinary tract infections

Article Open access 02 November 2023

Machine learning model for predicting ciprofloxacin resistance and presence of ESBL in patients with UTI in the ED

Article Open access 25 February 2023

Personalized antibiograms for machine learning driven antibiotic selection

Article Open access 08 April 2022

Introduction

Antimicrobial resistance (AMR) has developed into a global public health crisis. AMR often emerges rapidly in bacterial populations, and the effectiveness of newly introduced antibiotics can substantially drop after a few years of clinical use^1,2. In settings of high resistance levels, such as treatment of hospitalized patients, it may become challenging to find empiric antibiotic treatments which will be effective, while minimizing collateral resistance³. Such inappropriate empirical treatment is associated with the prevalence of AMR⁴. Despite guidelines⁵, literature on collateral damage of antibiotics^5,6, and stewardship initiatives⁷, the frequency of bug-drug mismatch in empiric treatment often remains high^4,8.

A notable example of a broadly used antibiotic, with increasing concerns about its resistance frequencies, is ciprofloxacin. Ciprofloxacin is a fluoroquinolone antibiotic, which has been widely used since the early 2000s and is currently on the World Health Organization’s List of Essential Medicines⁹. Ciprofloxacin is effective against various gram-negative bacteria, and to a lesser extent gram-positive bacteria, and is used in the treatment of urinary tract, respiratory tract, bone and joint, intra-abdominal, and other infections^10,11. Hence, ciprofloxacin has been the drug of choice for many infections both in in- and out-patient settings. High consumption rates over decades inevitably increased resistance to the drug^12,13,14, with an additional indirect effect on non-consumers¹⁵, impeding effective therapy¹⁶. However, reversion to high levels of sensitivity to quinolones is rapid upon decrease in quinolone consumption¹⁷. Therefore, minimizing unnecessary ciprofloxacin use can have substantial public health impact.

The use of machine learning (ML) in the context of AMR has been rapidly increasing with the availability of electronic medical records (EMRs) and development of new algorithms. ML models are potentially nearing the point where they can support clinicians’ decisions of empiric therapy, by providing rapid predictions of resistance^18,19. Hence, constant improvement of the methodology and outcomes of such models is of high importance. In the context of ciprofloxacin, prediction models have been scarce and limited to community-acquired urinary tract infections²⁰, only to intensive care units²¹, specific site of infection²², or to specific subsets of patients²³.

In this study, we developed an ensemble ML model that predicts resistance to ciprofloxacin based on hospitalized patients’ EMRs. Importantly, we include as variables relevant frequencies of resistance within the hospital, and not solely the examined patient’s EMR. Our models are applied to two settings: assuming that the infecting bacterial species is unknown (a bacteria-agnostic dataset) or known (the bacteria-gnostic dataset), with resulting test-set AUC values of 0.737 (95%CI 0.715–0.758) and 0.837 (95%CI 0.821–0.854).

Furthermore, explainability methods are used to analyze important predictors of resistance in our ML models.

Methods

Data

Data were retrieved from Meir Medical Center, a hospital in Israel which serves approximately 600,000 residents. EMRs of patients who had positive bacterial cultures that were tested for ciprofloxacin susceptibility between the years 2016-2019 were retrieved. The data contained information regarding patients’ demographics, functional status, previous antibiotics usage and previous hospitalization within the previous year, bacterial pathogen, and susceptibility results. For gram-negative bacteria in urine or wound culture, VITEK 2 (bioMerieux, Durham, NC) was used. For all isolates from blood or for gram-positive bacteria, in urine, wounds, or blood cultures, disk diffusion with CLSI breakpoints was used. Bacterial cultures demonstrating intermediate resistance results were regarded as resistant.

Additional features related to previous infections with resistant bacteria, previous antibiotic usage, and previous hospitalizations were engineered from the patients’ EMRs. The final dataset contained 10,053 susceptibility test results of 5540 patients and 73 variables (see Supplementary Data 1). These data were used to create two data sets: bacteria gnostic (the whole data) and bacteria agnostic (without 20 features related to the bacteria). The train-test split was performed based on calendar time, rather than randomly. This minimizes chances of “data-leakage”, where training on future observations holds information on past observations. Furthermore, such a split emulates a real-world scenario where the model can be trained up to a certain point and then used in the clinic from that point onwards, and is considered a form of external validation^24,25,26,27. Each dataset was divided into a training set (75% of all samples) and a test set (25% of all samples), based on the date the culture was taken (Fig. 1). These datasets are mutually exclusive - all the presented results were obtained when training the models solely on the training set, and testing them on the independent test set.

**Fig. 1: Ciprofloxacin resistance time-trends stratified by bacterial species.**

Machine learning algorithms

We used an ensemble of several ML algorithms, which we term ‘base learners’: LASSO penalized logistic regression²⁸, random forest²⁹, gradient-boosted trees²⁹, and neural networks²⁹. The base learners’ hyperparameters were optimized using 200 random searches³⁰ with a five-fold, time series cross-validation. To improve the predictions of the four base learners, a stacking technique was applied. In this technique, the predictions of the base learners are given as inputs to a second-level learning algorithm (super learner). The super learner was a logistic regression algorithm trained to optimize the predictions³¹. We adopted a process described elsewhere³² to train the super learner on time series data (Figure S1 in the Supplementary Material). This resulted in a single ensemble model whose output is the predicted probability of the culture result to have resistance to ciprofloxacin. The tuned hyperparameters are shown in Supplementary Data 2. Model performance was evaluated using the area under the receiver operating characteristic curve (ROC-AUC) metric. Confidence intervals (CI) were calculated using 5,000 bootstrap samples of the test-set data. Model agnostic approximation of the Shapley additive explanations (SHAP) was performed with “kernel SHAP”³³, employing 300 background samples from the training data and calculating the SHAP values of the entire test set.

Decision curve analysis

A decision (also known as a utility) curve analysis, which is increasingly recognized as valuable in clinical predictive modeling²⁶, was performed using the predictions of our ensemble model on the test-set. A decision curve is a graphical representation of the trade-offs between the benefits and costs of a particular treatment or intervention, when administered according to a prognostic algorithm. It is used to evaluate the overall utility of the algorithm by considering both the magnitude of the benefits and costs of no-treatment and redundant treatment, and the likelihood of these results based on prevalence of the outcome and the algorithms’ prediction abilities. In such an analysis, the standardized net benefit (sNB) of a decision is defined by the following equation:^34,35

$${sNB}={TPR}-{FPR}\frac{1-{f}_{{res}}}{{f}_{{res}}}\frac{{p}_{t}}{1-{p}_{t}}$$

(1)

where TPR and FPR are the true- and false-positive rates, respectively; ${p}_{{t}}$ is a threshold probability; and ${f}_{{res}}$ is the frequency of resistant infections. In our case, ${p}_{{t}}$ is the threshold probability above which a decision maker (i.e., clinician) is willing to act as if the infection is resistant to ciprofloxacin. This implies that the cost of falsely deciding that an infection is susceptible to ciprofloxacin is ${p}_{t}/(1-{p}_{t})$ fold the benefit of correctly deciding it is susceptible to ciprofloxacin. Hence ${p}_{t}/(1-{p}_{t})$ is also termed the cost-benefit ratio. For example, assume clinicians will not treat an infection with ciprofloxacin when they know that the probability of ciprofloxacin resistance is above 0.2, but will treat them with ciprofloxacin otherwise. The clinicians are hence implicitly willing to inefficiently treat one patient with a ciprofloxacin resistant infection for every four patients with susceptible infections, yielding a cost-benefit ratio of 1:4.

The sNB of the model merges all the above-mentioned parameters into a single number for each threshold, and hence produces a curve. This curve is compared to two simple decision strategies: assuming that every infection is resistant (all resistant) and that no infection is resistant (all susceptible). The sNB can reach a maximum value of 1, equivalent to assuming that all resistant and susceptible cases are treated correctly (TPR = 1 and FPR = 0).

Analyses were performed with Python 3.7³⁶, using the following packages: Numpy 1.20.3³⁷, Pandas 1.3.5³⁸ and Scikit-learn 1.0.1³⁹ for data processing; Scikit-learn, XGBoost 1.5.0⁴⁰, and Tensorflow 2.4.1⁴¹ for modeling; Matplotlib 3.5.0⁴² for plotting; and SHAP 0.40.0⁴³ for variable influence.

Ethics approval

The study was approved by the Institutional Review Board (Helsinki) Committee of Meir Medical Center. Since this was a retrospective study, using archived medical records, an exemption from informed consent was granted by the Helsinki Committee.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

We trained four base learners, and an ensemble model composed of these base learners, to predict ciprofloxacin resistance for six bacterial species. The demographics and basic clinical characteristics corresponding to the cultures’ patients are shown in Supplementary Data 3. We note that K.pneumoniae and M.morganii had a higher proportion of resistant samples in the test set, which potentially may harm predictions. Regardless, our algorithms were able to generalize successfully and achieve high ROC-AUC scores.

ROC-AUC scores and calibration plots were calculated for all the base learners (Fig. 2a, b). The ensemble consistently outperformed all base learners, on both datasets, achieving high ROC-AUC scores. For the bacteria-agnostic dataset, the ROC-AUC scores were 0.716 for the neural network, 0.736 for the logistic regression (LASSO), 0.719 for the random forest, 0.729 for the XGBoost and 0.737 (95% CI 0.715-0.758) for the ensemble. On the bacteria-gnostic dataset the scores were 0.82 for the neural network, 0.835 for the LASSO, 0.812 for the random forest, 0.832 for the XGBoost and 0.837 (95% CI 0.821–0.854) for the ensemble. Furthermore, our ensemble models were well-calibrated (Fig. 2c, d).

**Fig. 2: ROC curves and calibration plots for bacteria-agnostic and bacteria-gnostic datasets.**

In an effort to improve the ensemble’s transparency and gain a better comprehension of the variables influencing its predictions, we used Kernel SHAP. This method estimates the contribution of each variable to the model’s prediction by approximating their SHAP values³³. These SHAP values allow us to understand the magnitude and direction of influence of variables, which implies variable importance (Fig. 3).

**Fig. 3: SHAP values of the ensemble model for the five most influential variables in the agnostic and gnostic datasets.**

For the agnostic dataset, the five most influential variables in the bacteria agnostic dataset, as measured by the mean absolute SHAP values (Fig. 3a), were: previous resistance to ciprofloxacin in the past 60 days, whether the patient arrived from an institution, recent resistance to any antibiotic in same type of units (e.g., internal medicine or orthopedic units), previous resistance to ciprofloxacin during the previous 61–180 days, and recent resistance to any antibiotic in the hospital. Analogously, the five most influential variables in the bacteria gnostic dataset were (Fig. 3b): average resistance of the same bacterial species to any antibiotic in the past 30 days, across the hospital; the number of previous fluoroquinolone resistant infections the patient had in the past 60 days; whether the bacterial species was P. aeruginosa; and the number of non-ciprofloxacin antibiotics that the same bacterial species had resistance to in the past 60 days, in the same patient. In both agnostic and gnostic settings, higher values of the influential variables consistently yielded positive influence on the ensemble’s prediction, as can be seen by the swarm plots of the SHAP values (Fig. 3). This is simply the result of our coding of the binary variables (i.e., deciding which variable levels are set to zero or one) as risk factors.

Finally, we have performed a decision curve analysis (see Methods). Figure 4 shows that relying on predictions of our models can be at least as beneficial as assuming that every infection is resistant to ciprofloxacin, or assuming that every infection is sensitive to ciprofloxacin, for all cost-benefit ratios.

**Fig. 4: agnostic and gnostic decision curves.**

Discussion

In this study, we developed two ensemble ML models to predict resistance to ciprofloxacin of hospitalized patients’ infections. The first model was trained on the bacteria agnostic dataset, i.e., without any knowledge of the infecting bacterial species. This represents the most common situation before the start of antibiotic treatment. The second ensemble was trained on the bacteria gnostic dataset, i.e., with primary information of the infecting bacterial species. Both models achieved high ROC-AUC metrics on an independent test set: 0.737 (95%CI 0.715–0.758) and 0.837 (95%CI 0.821–0.854) for the agnostic and gnostic datasets, respectively, and were well calibrated. Moreover, a decision curve analysis revealed that implementing our models can be beneficial in a wide range of cost-benefit considerations of withholding vs prescribing ciprofloxacin.

Our ML models include several innovative components in the field of AMR prediction. First, we use a super learner that is trained to effectively combine the outputs of several base learners. This increases our final ROC-AUC by up to 0.025 with respect to the base-learners. Second, we incorporate variables representing recent and local resistant patterns within the hospital, in addition to a specific patient’s EMR. Consequently, and despite the limited ability to compare such results between different settings, our models achieve high predictive abilities relative to previous studies^20,21. Importantly, our models perform well on a very heterogeneous dataset, comprising various bacterial species, sample sources and multiple departments of the hospital. For example, Feretzakis et al.²¹ predicted ciprofloxacin resistance using data from a single internal medicine department, conditioned on the sample’s Gram stain result, and reached an ROC-AUC of 0.726²¹. Yelin et al.²⁰ predicted ciprofloxacin resistance only in outpatients, strictly using urine samples, and limited to three bacterial species, reaching a ROC-AUC of 0.83²⁰. Other studies either did not calculate ROC-AUC^23,44 or used cultures derived from a single sample source^22,23, a single bacterial species⁴⁴, or a single hospital unit⁴⁵.

An additional advantage of our ensemble modeling approach is built-in model calibration. Due to the logistic transformation the single-model outputs undergo, we are able to provide an output of well-calibrated probabilities of resistance. Prescribing antibiotics forces the clinician to make a compromise between patient’s care and population-level consequences⁴⁶. Hence, providing clinicians with unbiased probabilities of resistance can facilitate incorporation of other considerations into their decision. However, we note that continuous outputs from antibiotic prescription decision-support systems have been suggested to promote over-prescription of antibiotics, and hence decisions on output forms should be made with caution⁴⁷.

Our models’ predictions were analyzed using SHAP values, which can aid in assessing the influence of different covariates on predictions when applying complex ML models⁴⁸. We note that SHAP values contain inherent flaws⁴⁹ in approximating the impact of variables on predictions, and certainly do not aim to estimate causal effects. Despite these drawbacks, SHAP values can be useful for validating model outcomes against prior knowledge of risk factors and increase models’ transparency. This can in turn facilitate increasing clinicians’ trust in using ML decision support systems in their practice⁵⁰.

The results of our SHAP analyses are indeed consistent with the literature. Highly influential variables on the ensemble models’ predictions were related to previous infections containing resistant bacteria, either to ciprofloxacin or other antibiotics. Previous resistance to ciprofloxacin is an obvious risk factor for current resistance^20,51,52. However, the importance of previous resistance to other antibiotics may be explained by cross-resistance^53,54,55, or confounding by the patients’ exposure to resistant bacteria or to antibiotics. Patients’ origin (home, another hospital, nursing home, medical clinic, or other) had substantial influence on predictions and was also found to be an important variable by others^22,56. This is a known risk factor, as antibiotics are administered more frequently in medical facilities and nursing homes, leading to high selection for resistance⁵⁷. Local resistance frequencies, which we introduced into the data as moving averages of resistance frequencies, were also found to be highly influential on prediction. This is consistent with previous research and clinical use of local antibiograms, representing the susceptibility patterns of different bacteria⁵⁸. Furthermore, our moving average of resistance frequencies is potentially more sensitive to resistance trends than yearly or monthly antibiograms. In the gnostic model, P. aeruginosa was selected as an influential variable. This stems from the binary encoding of the bacterial species, which defined the reference species as E. coli. Since P. aeruginosa was the second-most common bacterial species in the dataset, and was less resistant than E. coli (Supplementary Data 3), it was determined to be influential in reducing the predicted probability of a resistant infection. Finally, age was not deemed by our models as a highly important variable for ciprofloxacin resistance, in contrast to previous ML research 20 and classic retrospective studies^51,52. This could potentially be attributed to the relatively old population in our study, especially when compared to studies on outpatients, which contain more heterogeneous cohorts.

Our study has several limitations. First, our dataset lacks relevant community-related patient information, such as antibiotic consumption in the community⁵⁷, and antibiotic consumption in the patients’ surroundings, including neighborhoods¹⁵ and households⁵⁹. Our models can be easily extended to accommodate these covariates, which will likely further improve the models’ predictive abilities. Second, our models are not necessarily immediately generalizable to other settings, or even the same setting, in different time periods. Variations in patients’ demographics, antibiotic consumption, and the dynamic nature of AMR may lead to variation in risk factors over space and time^60,61. For example, as we mention above, our data has under-representation of younger patients. This may be manifested in our model and needs to be taken in consideration when predicting resistance in young patients. Retraining of the models on site-specific data will likely be required to fine-tune predictions in different settings. However, the rates of ciprofloxacin resistance and patient covariates in our dataset are comparable with those of hospitalized patients in other developed countries⁶². We therefore expect a reasonable degree of consistency in our results, if our models would have been developed on a dataset from comparable settings.

Conclusions

The models developed in this study represent a further step on the way to inclusion of ML decision support systems into clinical practice. Improvement of such models depends on advances in algorithm development, specific feature engineering, and the augmentation of the quantity and quality of EMR data. As we have shown, modern ML models can achieve high prediction while autonomously imparting high influence to risk factors that are known to be clinically relevant to AMR. Hopefully, future studies can further leverage the presented models and the vast EMR data available to improve prediction of AMR and consequently reduce antibiotic misuse.

Data availability

Raw data is proprietary but can be made available upon reasonable request from the authors: The data pertains to the patient’s electronic medical records. These are private and cannot be shared without approval from Meir Medical Center’s IRB. Upon request, the authors and the individuals interested in accessing the data can write a formal request to the aforementioned IRB and seek its approval.

For the source data used to plot the resistance trends (Fig. 1), see Supplementary Data 4. For the source data used to plot the ROC curves, calibration and net benefit (Figs. 2 and 4) see Supplementary Data 5. For the source data used to plot SHAP (Fig. 3) see Supplementary Data 6.

Code availability

The code is available at http://github.com/igormintz/cipro⁶³.

References

Smith, R. A., M’ikanatha, N. M. & Read, A. F. Antibiotic resistance: A primer and call to action. Health Commun 30, 309–314 (2015).
Article PubMed Google Scholar
Palumbi, S. R. Humans as the world’s greatest evolutionary force. Science 293, 1786–1790 (2001).
Article CAS PubMed Google Scholar
Weber, D. J. Collateral damage and what the future might hold. The need to balance prudent antibiotic utilization and stewardship with effective patient management. Int. J. Infect. Dis. 10, S17–S24 (2006).
Article CAS Google Scholar
Carrara, E., Pfeffer, I., Zusman, O., Leibovici, L. & Paul, M. Determinants of inappropriate empirical antibiotic treatment: systematic review and meta-analysis. Int. J. Antimicrob. Agents 51, 548–553 (2018).
Article CAS PubMed Google Scholar
World Health Organization. Executive summary: the selection and use of essential medicines 2019: report of the 22nd WHO Expert Committee on the selection and use of essential medicines: WHO Headquarters, Geneva, 1-5 April 2019. https://apps.who.int/iris/handle/10665/325773 (2019).
Chowers, M. et al. Estimating the impact of cefuroxime versus cefazolin and amoxicillin/clavulanate use on future collateral resistance: a retrospective comparison. J. Antimicrob. Chemother 77, 1992–1995 (2022).
Article CAS PubMed Google Scholar
Nathwani, D. et al. Value of hospital antimicrobial stewardship programs [ASPs]: a systematic review. Antimicrob. Resist. Infect. Control 8, 1–13 (2019).
Article Google Scholar
Tribble, A. C. et al. Appropriateness of antibiotic prescribing in United States children’s hospitals: a national point prevalence survey. Clin. Infect. Dis 71, e226–e234 (2020).
Article PubMed Google Scholar
eEML - Electronic Essential Medicines List. https://list.essentialmeds.org/.
Loscalzo, J. et al. Harrison’s Principles of Internal Medicine, (Vol. 1 & Vol. 2). (McGraw Hill Professional, 2022).
Sharma, P. C., Jain, A., Jain, S., Pahwa, R. & Yar, M. S. Ciprofloxacin: review on developments in synthetic, analytical, and medicinal aspects. J. Enzyme Inhib. Med. Chem. 25, 577–589 (2010).
Article CAS PubMed Google Scholar
Thomson, C. J. The global epidemiology of resistance to ciprofloxacin and the changing nature of antibiotic resistance: a 10 year perspective. J. Antimicrob. Chemother. 43, 31–40 (1999).
Article CAS PubMed Google Scholar
Organization, W. H. Global antimicrobial resistance and use surveillance system (GLASS) report: 2021. (2021).
Dalhoff, A. Global fluoroquinolone resistance epidemiology and implictions for clinical use. Interdiscip. Perspect. Infect. Dis. 2012, 976273 (2012).
Article PubMed PubMed Central Google Scholar
Low, M. et al. Association between urinary community-acquired fluoroquinolone-resistant Escherichia coli and neighbourhood antibiotic consumption: a population-based case-control study. Lancet Infect. Dis. 19, 419–428 (2019).
Article CAS PubMed Google Scholar
Eliopoulos, G. M., Cosgrove, S. E. & Carmeli, Y. The impact of antimicrobial resistance on health and economic outcomes. Clin. Infect. Dis 36, 1433–1437 (2003).
Article Google Scholar
Gottesman, B. S., Carmeli, Y., Shitrit, P. & Chowers, M. Impact of quinolone restriction on resistance patterns of Escherichia coli isolated from urine by culture in a community setting. Clin. Infect. Dis. 49, 869–875 (2009).
Article CAS PubMed Google Scholar
Anahtar, M. N., Yang, J. H. & Kanjilal, S. Applications of machine learning to the problem of antimicrobial resistance: an emerging model for translational research. J. Clin. Microbiol. 59, e01260–20 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rawson, T. M., Ahmad, R., Toumazou, C., Georgiou, P. & Holmes, A. H. Artificial intelligence can improve decision-making in infection management. Nat. Hum. Behav. 3, 543–545 (2019).
Article PubMed Google Scholar
Yelin, I. et al. Personal clinical history predicts antibiotic resistance of urinary tract infections. Nat. Med. 25, 1143–1152 (2019).
Article CAS PubMed PubMed Central Google Scholar
Feretzakis, G. et al. Using machine learning techniques to aid empirical antibiotic therapy decisions in the intensive care unit of a general hospital in Greece. Antibiotics 9, 50 (2020).
Article CAS PubMed PubMed Central Google Scholar
Dan, S. et al. Prediction of fluoroquinolone resistance in gram-negative bacteria causing bloodstream infections. Antimicrob. Agents Chemother. 60, 2265–2272 (2016).
Article CAS PubMed PubMed Central Google Scholar
Dickstein, Y., Geffen, Y., Andreassen, S., Leibovici, L. & Paul, M. Predicting antibiotic resistance in urinary tract infection patients with prior urine cultures. Antimicrob. Agents Chemother. 60, 4717–4721 (2016).
Article CAS PubMed PubMed Central Google Scholar
Binuya, M. A. E., Engelhardt, E. G., Schats, W., Schmidt, M. K. & Steyerberg, E. W. Methodological guidance for the evaluation and updating of clinical prediction models: a systematic review. BMC Med. Res. Methodol. 22, 1–14 (2022).
Article Google Scholar
Staffa, S. J. & Zurakowski, D. Statistical development and validation of clinical prediction models. Anesthesiology 135, 396–405 (2021).
Article PubMed Google Scholar
de Hond, A. A. et al. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review. Npj Digit. Med. 5, 1–13 (2022).
Google Scholar
Debray, T. P. et al. A new framework to enhance the interpretation of external validation studies of clinical prediction models. J. Clin. Epidemiol. 68, 279–289 (2015).
Article PubMed Google Scholar
Eilers, P. H. C., Boer, J. M., van Ommen G. J. & van Houwelingen, H. C. Classification of microarray data with penalized logistic regression. in Microarrays: Optical Technologies and Informatics vol. 4266 187–198 (International Society for Optics and Photonics, 2001).
Friedman, J., Hastie, T. & Tibshirani, R. The Elements of Statistical Learning. vol. 1 (Springer series in statistics New York, 2001).
Bergstra, J. & Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012).
Google Scholar
Sill, J., Takács, G., Mackey, L. & Lin, D. Feature-weighted linear stacking. ArXiv Prepr. arXiv:0911.0460 (2009).
Van der Laan, M. J., Polley, E. C. & Hubbard, A. E. Super learner. Stat. Appl. Genet. Mol. Biol. 6 (2007).
Lundberg, S. M. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. in Advances in Neural Information Processing Systems 30 (eds. Guyon, I. et al.) 4765–4774 (Curran Associates, Inc., 2017).
Vickers, A. J. & Elkin, E. B. Decision curve analysis: a novel method for evaluating prediction models. Med. Decis. Mak. 26, 565–574 (2006).
Article Google Scholar
Kerr, K. F., Brown, M. D., Zhu, K. & Janes, H. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J. Clin. Oncol. 34, 2534 (2016).
Article PubMed PubMed Central Google Scholar
Python Software Foundation. Python programming language. https://www.python.org/.
NumPy Developers. NumPy: Scientific computing with Python. https://numpy.org/doc/stable/.
Pandas Developers. Pandas: Powerful data structures for data analysis and manipulation. https://pandas.pydata.org/.
Scikit-learn developers. Scikit-learn: Machine learning in Python. https://scikit-learn.org/stable/.
XGBoost: Scalable, distributed gradient boosting. https://xgboost.readthedocs.io/en/latest/.
TensorFlow Developers. TensorFlow: An end-to-end open source machine learning platform. https://www.tensorflow.org/.
Matplotlib: A comprehensive library for static, animated, and interactive visualizations in Python. https://matplotlib.org/stable/.
SHAP Developers. SHAP: A unified approach to explain the output of any machine learning model. https://shap.readthedocs.io/en/latest/.
Gallini, A. et al. Influence of fluoroquinolone consumption in inpatients and outpatients on ciprofloxacin-resistant Escherichia coli in a university hospital. J. Antimicrob. Chemother. 65, 2650–2657 (2010).
Article CAS PubMed Google Scholar
Wang, T. et al. Predicting Antimicrobial Resistance in the Intensive Care Unit. ArXiv Prepr. ArXiv211103575 (2021).
Wojcik, G. et al. Understanding the complexities of antibiotic prescribing behaviour in acute hospitals: a systematic review and meta-ethnography. Arch. Public Health 79, 1–19 (2021).
Article Google Scholar
Diamant, M. et al. A game theoretic approach reveals that discretizing clinical information can reduce antibiotic misuse. Nat. Commun. 12, 1–13 (2021).
Article Google Scholar
Shapley, L. S. A value for n-person games. Contrib. Theory Games 2, 307–317 (1953).
Google Scholar
Kumar, I. E., Venkatasubramanian, S., Scheidegger, C. & Friedler, S. Problems with Shapley-value-based explanations as feature importance measures. in International Conference on Machine Learning 5491–5500 (PMLR, 2020).
Chen, M. et al. Physician and Medical Student Attitudes Toward Clinical Artificial Intelligence: A Systematic Review with Cross-Sectional Survey. Available SSRN 4128867.
Mulder, M. et al. Risk factors for resistance to ciprofloxacin in community-acquired urinary tract infections due to Escherichia coli in an elderly population. J. Antimicrob. Chemother. 72, 281–289 (2016).
Article PubMed Google Scholar
Arslan, H., Azap, Ö. K., Ergönül, Ö. & Timurkaynak, F. On behalf of the Urinary Tract Infection Study Group Risk factors for ciprofloxacin resistance among Escherichia coli strains isolated from community-acquired urinary tract infections in Turkey. J. Antimicrob. Chemother. 56, 914–918 (2005).
Article CAS PubMed Google Scholar
Beckley, A. M. & Wright, E. S. Identification of antibiotic pairs that evade concurrent resistance via a retrospective analysis of antimicrobial susceptibility test results. Lancet Microbe 2, e545–e554 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cherny, S. S., Chowers, M. & Obolski, U. Patterns of antibiotic cross-resistance by bacterial sample source: a retrospective cohort study. medRxiv (2022).
Cherny, S. S. et al. Revealing antibiotic cross-resistance patterns in hospitalized patients through Bayesian network modelling. J. Antimicrob. Chemother 76, 239–248 (2021).
Article CAS PubMed Google Scholar
Lewin-Epstein, O., Baruch, S., Hadany, L., Stein, G. & Obolski, U. Predicting antibiotic resistance in hospitalized patients by applying machine learning to electronic medical records. medRxiv 2020.06.03.20120535 https://doi.org/10.1101/2020.06.03.20120535. (2020)
Chatterjee, A. et al. Quantifying drivers of antibiotic resistance in humans: a systematic review. Lancet Infect. Dis. 18, e368–e378 (2018).
Article CAS PubMed Google Scholar
Truong, W. R., Hidayat, L., Bolaris, M. A., Nguyen, L. & Yamaki, J. The antibiogram: Key considerations for its development and utilization. JAC-Antimicrob. Resist. 3, dlab060 (2021).
Article PubMed PubMed Central Google Scholar
Oonsivilai, M. et al. Using machine learning to guide targeted and locally-tailored empiric antibiotic prescribing in a children’s hospital in Cambodia. Wellcome Open Res. 3, 131 (2018).
Article PubMed PubMed Central Google Scholar
Bell, B. G., Schellevis, F., Stobberingh, E., Goossens, H. & Pringle, M. A systematic review and meta-analysis of the effects of antibiotic consumption on antibiotic resistance. BMC Infect. Dis. 14, 1–25 (2014).
Article Google Scholar
Baraz, A., Chowers, M., Nevo, D. & Obolski, U. Stable temporal relationships as a first step towards causal inference: an application to antibiotic resistance. medRxiv (2022).
Fasugba, O., Gardner, A., Mitchell, B. G. & Mnatzaganian, G. Ciprofloxacin resistance in community-and hospital-acquired Escherichia coli urinary tract infections: a systematic review and meta-analysis of observational studies. BMC Infect. Dis. 15, 1–16 (2015).
Article Google Scholar
Mintz, I. igormintz/cipro. GitHub. https://doi.org/10.5281/zenodo.7632713. (2023)

Download references

Acknowledgements

This study was supported by the Israel Science Foundation (ISF 1286/21).

Author information

Authors and Affiliations

School of Public Health, Tel Aviv University, Tel Aviv, Israel
Igor Mintz & Uri Obolski
Porter School of the Environment and Earth Sciences, Tel Aviv University, Tel Aviv, Israel
Igor Mintz & Uri Obolski
Meir Medical Center, Kfar Saba, Israel
Michal Chowers
Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
Michal Chowers

Authors

Igor Mintz
View author publications
You can also search for this author in PubMed Google Scholar
Michal Chowers
View author publications
You can also search for this author in PubMed Google Scholar
Uri Obolski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.M. and UO conceived the study; I.M. implemented the analysis; I.M., M.C., and U.O. interpreted the results; I.M. and U.O. wrote the initial draft of the manuscript; all authors revised and approved the final version of the manuscript.

Corresponding author

Correspondence to Uri Obolski.

Ethics declarations

Competing interest

The authors declare no competing interests.

Peer review

Peer review information

Communications Medicine thanks Naveed Ahmed, Catherine Chen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Description of Additional Supplementary Files

Supplementary data 1

Supplementary data 2

Supplementary data 3

Supplementary data 4

Supplementary data 5

Supplementary data 6

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mintz, I., Chowers, M. & Obolski, U. Prediction of ciprofloxacin resistance in hospitalized patients using machine learning. Commun Med 3, 43 (2023). https://doi.org/10.1038/s43856-023-00275-z

Download citation

Received: 03 November 2022
Accepted: 14 March 2023
Published: 28 March 2023
DOI: https://doi.org/10.1038/s43856-023-00275-z

Subjects

Abstract

Background

Methods

Results

Conclusions

Plain language summary

Similar content being viewed by others

Introduction

Methods

Data

Machine learning algorithms

Decision curve analysis

Ethics approval

Reporting summary

Results

Discussion

Conclusions

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interest

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links