Deep learning model for personalized prediction of positive MRSA culture using time-series electronic health records

Nigo, Masayuki; Rasmy, Laila; Mao, Bingyu; Kannadath, Bijun Sai; Xie, Ziqian; Zhi, Degui

doi:10.1038/s41467-024-46211-0

Download PDF

Article
Open access
Published: 06 March 2024

Deep learning model for personalized prediction of positive MRSA culture using time-series electronic health records

Nature Communications volume 15, Article number: 2036 (2024) Cite this article

2521 Accesses
5 Altmetric
Metrics details

Subjects

Abstract

Methicillin-resistant Staphylococcus aureus (MRSA) poses significant morbidity and mortality in hospitals. Rapid, accurate risk stratification of MRSA is crucial for optimizing antibiotic therapy. Our study introduced a deep learning model, PyTorch_EHR, which leverages electronic health record (EHR) time-series data, including wide-variety patient specific data, to predict MRSA culture positivity within two weeks. 8,164 MRSA and 22,393 non-MRSA patient events from Memorial Hermann Hospital System, Houston, Texas are used for model development. PyTorch_EHR outperforms logistic regression (LR) and light gradient boost machine (LGBM) models in accuracy (AUROC^PyTorch_EHR = 0.911, AUROC^LR = 0.857, AUROC^LGBM = 0.892). External validation with 393,713 patient events from the Medical Information Mart for Intensive Care (MIMIC)-IV dataset in Boston confirms its superior accuracy (AUROC^PyTorch_EHR = 0.859, AUROC^LR = 0.816, AUROC^LGBM = 0.838). Our model effectively stratifies patients into high-, medium-, and low-risk categories, potentially optimizing antimicrobial therapy and reducing unnecessary MRSA-specific antimicrobials. This highlights the advantage of deep learning models in predicting MRSA positive cultures, surpassing traditional machine learning models and supporting clinicians’ judgments.

Predicting bloodstream infection outcome using machine learning

Article Open access 11 October 2021

Prediction of ciprofloxacin resistance in hospitalized patients using machine learning

Article Open access 28 March 2023

Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty

Article Open access 19 August 2023

Introduction

Methicillin-resistant Staphylococcus aureus (MRSA) is a common pathogenic cause of hospital-acquired and community-associated infections^1,2,3. Since this pathogen eliminates most beta-lactam class antibiotics as a treatment option, physicians often need to add an antibiotic, such as vancomycin, to empirically treat this pathogen when suspected. Considering the side effect profile of vancomycin and the antibiotic stewardship standpoint, avoiding unnecessary antimicrobial therapy is highly desirable⁴. Furthermore, a recent study showed the absolute benefit of empiric therapy against MRSA is 0.1% or less⁵. Therefore, accurately identifying high-risk patients is critical to preserve the benefit of treatment and minimize the adverse side effects of empiric therapy. Although multiple clinical factors have been proposed as risk factors for MRSA infection^6,7,8,9, there are several limitations to identifying high-risk patients. Commonly, the tested population is restricted to specific populations, such as patients with ventilator-associated pneumonia¹⁰. Due to the complex associations among risk factors, it is often difficult to discern actual risks when multiple risk factors exist simultaneously⁸. For example, previous exposure to cephalosporine and fluoroquinolone are considered risk factors^11,12. The risk seems to accumulate when multiple antibiotics are previously prescribed¹³. Furthermore, the optimal timeline between the index infection and the presence of the risk factor is not well established, and often an arbitrary duration is used^8,14. More flexible models that can integrate multiple risk factors and the timing of various risk factors are warranted for frontline physicians to safely decide the necessity of empiric antibiotic therapy.

Electronic health records (EHRs) became widely available in the United States since the Meaningful Use program was introduced as part of the 2009 Health Information Technology for Economic and Clinical Health Act¹⁵. EHRs are a rich data source for daily clinical practice and research. As the data in EHRs expand, physicians have more information to process and interpret to improve patient management. Given its computational capabilities, artificial intelligence could reveal complex relationships among numerous factors in EHRs. Artificial intelligence has been used to process genetic and imaging data and has become an attractive technology to process real-time big EHR data to facilitate personalized medicine^16,17. Although there are multiple machine learning models predict drug-resistant bacterial infections with EHR data^18,19,20, they use limited input data, such as basic demographics, previous susceptibility results, or a limited number of patients¹⁸. Furthermore, some models only predict the index culture or screening results, which may not be optimal in clinical use to guide antibiotic therapy²¹. Deep learning-based models, such as recurrent neural network (RNN) models, have a significant advantage in time-sequence events because the fundamental model structure allows sequential inputs into the model. Also, RNNs with medical code embedding can take inputs directly from a real-time EHR data stream, automatically adjust to reflect subtle changes, and provide real-time outputs²² PyTorch_EHR, a deep learning-based prediction model using time-series categorical data, has been successfully applied to predict various clinical outcomes²³. Despite the potentially high expressive power of deep learning models, deep learning models using time-series EHR data to predict drug-resistant bacteria, particularly for MRSA, are limited²⁴.

We created a deep learning-based prediction model using PyTorch_EHR for positive MRSA culture using big time-series EHR data from a local hospital system and compared it to the traditional machine learning approaches and clinician’s decisions of empirical therapy against MRSA. We also evaluated the model’s generalizability using external EHR data from a different region of the United States. PyTorch_EHR outperforms logistic regression (LR) and light gradient boost machine (LGBM) models in accuracy (Area Under Receiver Operating Characteristic Curve [AUROC] ^PyTorch_EHR = 0.911, AUROC^LR = 0.857, AUROC^LGBM = 0.892). External dataset from the Medical Information Mart for Intensive Care (MIMIC)-IV validates its superior accuracy (AUROC^PyTorch_EHR = 0.859, AUROC^LR = 0.816, AUROC^LGBM = 0.838). Our model effectively stratifies patients into high-, medium-, and low-risk categories, potentially optimizing antimicrobial therapy and reducing unnecessary MRSA-specific antimicrobials.

Results

Patient characteristics

A total of 26,233 and 152,979 patients who met our selection criteria, as described under Methods, were identified from the Memorial Hermann Hospital System (MHHS) and Medical Information Mart for Intensive Care (MIMIC)-IV databases, respectively. Those patients had 56,233 and 393,713 index culture events over time in MHHS and MIMIC-IV datasets. The aggregated patient characteristics are described in Table 1. Some patients were classified into MRSA and non-MRSA groups when they had both MRSA and non-MRSA events at different index time. Patient features were used once if the patient had two or more events in the same group. Demographic features at the time of index culture were used to describe the characteristics when patients were classified more than twice into one group. Overall, the MRSA group had a higher number of intensive care unit (ICU) admissions (MHHS: 4.3% vs. 0.7%, MIMIC-IV: 31.7% vs. 16.7%) and emergency department (ED) patients (MHHS: 66.4% vs. 13.3%, MIMIC-IV: 51.3% vs 35.0%). As MIMIC-IV was originally developed based on an ICU database, the MIMIC-IV dataset included a higher number of ICU patients. Intermediate unit (IMU) status was not included in the MIMIC-IV data. Table 2 summarizes types of antibiotics and cultures before index time. Vancomycin was the most commonly used antibiotic, followed by cefepime in the MHHS dataset, whereas ceftriaxone was the second most commonly used antibiotic in the MIMIC-IV dataset. As expected, given the origin of the EHRs (MHHS from Houston and MIMIC-IV from Boston), the MHHS dataset had more Hispanic patients compared to MIMIC-IV (10.5–10.6% vs. 3.6–3.9%). Across groups, Caucasian was the most common race, and 55–65 years was the most common age group. Gender was equally distributed in all groups. Blood and urine cultures were other common cultures taken during the study periods.

Table 1 Characteristics of Patients with and without Positive MRSA Cultures

Full size table

Table 2 Types of antibiotics and cultures which patients had before Index Time

Full size table

Types of infection and other pathogens

Table 3 summarizes the bacteria and diagnostic codes identified within the event periods. S. aureus were the most common bacteria in MRSA groups, whereas E. coli was the most common in the non-MRSA group. Bacteremia (MHHS: 6.7% vs. 2.1%, MIMIC-IV: 8.6% vs. 1.9%) and skin soft tissue infection (MHHS: 24.8% vs. 5.6%, MIMIC-IV: 13.2% vs. 2.6%) were more common in MRSA groups.

Table 3 Name of bacteria identified from cultures and types of infection based on ICD codes

Full size table

Model prediction

Table 4 shows the prediction accuracy of the models. For the MHHS dataset, the deep learning model PyTorch_EHR exhibited the highest Area Under Curve of Receiver Operating Characteristics (AUROC) of 0.911 [0.900 – 0.916] (see ROC curve in Supplementary Fig. 5-1) compared to other machine learning models (logistic regression [LR]: 0.857 [0.849–0.865] and light gradient boost machine [LGBM]: 0.892 [0.885–0.899]). Similar results were obtained for the MIMIC-IV dataset (PyTorch_EHR: 0.859 [0.849–0.869], LR: 0.816 [0.804–0.828], and LGBM: 0.838 [0.823–0.849]; see ROC curve in Supplementary Fig. 5-2). We also evaluated the AUROC in each patient group with a specific diagnosis during the event. Although the AUROC decreased by 0.50–0.10, we had acceptable accuracy in each infection in the MHHS dataset. We also evaluated confusion matrices based on our model’s high-risk and low-risk predictions (see Supplementary Table 4). In high-risk groups, Pytorch_EHR showed a specificity of 95.0% and 99.0%, and a sensitivity of 48.1% and 19.3% in MHHS and MIMIC-IV datasets, respectively, whereas LGBM showed a specificity of 95.0% and 99.0%, and a sensitivity of 44.5% and 14.9%. In low-risk groups, Pytorch_EHR had a sensitivity of 95.0% and 90.0% and a specificity of 62.9% and 58.7% in MHHS and MIMIC-IV datasets, respectively, whereas LGBM showed a sensitivity of 95.0% and 90% and a specificity of 62.8% and 57.2%.

Table 4 Outcome of Models in Overall and Subgroup Analyses

Full size table

Given the imbalanced distributions of positive events in both datasets, for high-risk patients, positive predictive values (PPV) were relatively low: 65.6% and 22.4% for Pytorch_EHR and 63.6% and 17.5% for LGBM in MHHS and MIMIC-IV datasets, respectively. However, negative predictive values (NPV) were high: 90.3% and 98.9% for Pytorch_EHR and 89.7% and 98.8% for LGBM in MHHS and MIMIC-IV datasets, respectively. For low-risk patients, PPV was low: 37.6% and 3.0% for Pytorch_EHR and 33.5% and 2.9% for LGBM in MHHS and MIMIC-IV datasets, respectively. However, NPV were particularly high: 98.6% and 99.8% for Pytorch_EHR and 98.5% and 99.8% for LGBM in MHHS and MIMIC-IV datasets, respectively.

Fig. 1 shows the cumulative incidence curve of MRSA-positive cultures over two weeks from the index culture. In both datasets, our model clearly differentiated the patients with high and low risks of MRSA-positive cultures. The cumulative incidence of MRSA-positive cultures in the MRSA group in the MHHS dataset was 61.2%, whereas the incidence in the MIMIC-IV dataset was approximately 18.2%. The low incidence in MIMIC-IV despite a high risk was likely due to the overall incidence of positive MRSA cultures in the MIMIC-IV dataset.

**Fig. 1: Cumulative Incidence Curve of Positive MRSA Over Two Weeks in the MHHS and MIMIC-IV Datasets.**

AUROC curves over multiple index events were evaluated in MHHS and MIMIC-IV test datasets. (See Supplementary Fig. 10) When evaluated on patients with only the first event in MHHS dataset, LGBM model performance was better than that of PyTorch_EHR and LR models. However, when evaluated on patients who had repeated events, i.e., a longer duration of observation in the dataset, PyTorch_EHR model performance improved significantly and sustained superiority against the LR and LGBM models. Similar results were obtained for the MIMIC-IV dataset, with a longer duration of observation providing better performance in the PyTorch_EHR model.

Potential clinical impact

Table 5 summarizes the potential clinical impact of the PyTorch_EHR model. In patients predicted as low risk, our model exhibited NPV of 98.6% and 99.8% in MHHS and MIMIC-IV datasets, respectively. In addition, among those low-risk patients who had true negative results, MRSA-specific antimicrobials were given by treating clinicians in 21.6% (1505/6975) and 2.3% (1069/45,533) of events, which translated to 7949 and 1397 doses of MRSA-specific antimicrobials in MHHS and MIMIC-IV, respectively. The main antimicrobials used for those patients were vancomycin (6833 and 1254 doses in MHHS and MIMIC-IV, respectively), followed by linezolid (852 and 88 doses) and daptomycin (264 and 55 doses). Further, 1.4% (98/6,975) and 0.2% (108/45,533) events were false negatives in our model. Among them, only 0.3% (23/6,975) and 0.04% (27/45,533) events received MRSA-specific antimicrobials, which could be missed by our model.

Table 5 Potential Clinical Impact of the PyTorch_EHR Model

Full size table

In high-risk patients, our model exhibited PPV of 65.6% and 22.4% in MHHS and MIMIC-IV datasets, respectively (Supplementary Table 4). The model predicted 12% (1437/11,922) and 1.2% (957/78,548) of events as high risk. Among high-risk groups, patients did not receive any MRSA-specific antimicrobials in 34.6% (497/1437) and 19.7% (189/957) of events in MHHS and MIMIC-IV datasets, respectively. On the contrary, with our model’s high-risk prediction, 15.8% (227/1437) and 71.1% (671/957) events may receive unnecessary MRSA-specific antimicrobials (potential harm from our model).

Finally, we evaluated the performance of our model in patients who had MRSA bacteremia. As summarized in Tables 5, 31.8% (457/1437) and 7.3% (70/957) of high-risk events in MHHS and MIMIC-IV datasets, respectively, had MRSA bacteremia. These rates were much higher than the rates in low-risk events in MHHS (0.5%; 32/6975) and MIMIC-IV (0.04%; 35/48,455). Based on these findings, high-risk group had 69.3 and 101.2 higher relative risk of MRSA bacteremia compared to low-risk patient group. In addition, our model identified 58.0% (265/457) and 50.0% (35/70) of high-risk patients with true MRSA bacteremia did not receive MRSA-specific antimicrobials, considered “optimal” antibiotics for MRSA bacteremia, within 12 h of the index cultures.

These results were also evaluated in other models and any MRSA antimicrobials (see Supplementary Table 5). Overall, PyTorch_EHR model exhibited higher net-benefits against treating clinician’s decisions compared to LGBM and LR models, except for MRSA bacteremia in MIMIC-IV dataset. LGBM model provided better net benefit compared to PyTorch_EHR model (18 vs. 10 MRSA bacteremia cases may receive early MRSA antimicrobials, respectively.)

Feature importance

We obtained the contribution scores for positive MRSA cultures in the datasets. Supplementary Fig. 7 shows the top 14 median contribution scores of admission diagnoses in our model for MHHS data. Interestingly, our model identified multiple diagnoses often related to MRSA infections, such as cutaneous abscesses or boils. Supplementary Fig. 8 shows the top 10 overall contribution scores for antimicrobial exposures before the index time in the datasets. Some common antibiotics had high scores in both datasets, but it was difficult to interpret the scores clinically.

We also present individual feature importance as a bar graph for an example patient among the patients we visualized (see Supplementary Fig. 9). The patient is female and between 45 – 54 years of age, with multiple underlying comorbidities listed on admission two days (−2 days) before the index culture (blood culture on index date). Our model identified a risk score of 0.541 (predicted as a positive patient). After the patient was admitted to the hospital, vancomycin and meropenem were initiated, and a blood culture was ordered. Subsequently, cultures identified MRSA over two weeks.

Discussion

In this study, our deep learning-based MRSA-predictive model exhibited better performance compared to other machine learning models in real-world MHHS and MIMIC-IV datasets. Traditional Machine learning, especially LGBM, also provided a great performance in predicting MRSA-positive culture. However, PyTorch_EHR model had better overall AUCROC and showed better potential clinical impact in majority of datasets. PyTorch_EHR model successfully “learned” patient-specific features, especially with time sequence events, to provide personalized risks of positive MRSA cultures over two weeks from index time. The model maintained better predictions even after transferring from the MHHS dataset to the MIMIC-IV dataset and tolerated the significantly imbalanced outcomes in the MIMIC-IV dataset. Compared to other existing models, our model successfully predicted positive MRSA cultures not only on the index day but also over two weeks from the index day. (see Fig. 2) This prediction window is better aligned with the daily clinical practice of physicians since physicians decide on empiric antibiotic therapy to treat MRSA, such as intravenous vancomycin, not only for the culture of index day but also any subsequent cultures that may be related to the episode of infection after initiation of therapy. We decided to use a two-week window in this project as the majority of infections after admission are diagnosed within the time periods. The incidence curve successfully captured any events within two weeks. Our deep learning model readily accepts the time sequence of the events in the patient history, which we believe is more consistent with the physician’s assessment in clinical practice. In addition, our model showed that accuracy improves when time-series data are used, and patients have a longer duration of observation before the index time. (see Supplementary Fig. 10) We also tested the model in different types of infection posing various MRSA risks, such as sepsis, bacteremia, and pneumonia. Although there were some decreases in the AUROC, high performance were maintained, which supports the use of this single model for multiple types of infections. Finally, our model could benefit clinical practice by reducing the number of antimicrobials used in low-risk patients and providing optimal MRSA antimicrobials when the model predicts high risk, including bacteremia. Although the difference in AUCROC was small between PyTorch_EHR and LGBM, the actual difference of possible net-benefit is substantial, especially in MHHS datasets which had high prevalence rates of positive MRSA cultures.

**Fig. 2: Schematic Structure of Deep Learning-Based Prediction Model for MRSA-Positive Cultures.**

Personalized medicine is of great interest in medical fields. Many studies on personalized medicine focus more on genetic-based predictions rather than clinical data from EHRs²⁵. EHR data have become a rich source of real-world data and provide invaluable information. Even without genetic data, we believe EHR data can be a useful source for deep learning models to achieve personalized medicine in multiple clinical settings. Furthermore, compared to traditional machine learning models, deep learning can easily integrate time-sequence data as inputs into the model, which provides significant advantages for outcome predictions requiring sequential event inputs. Although PyTorch_EHR only uses categorical data from EHRs, this model provides high performance with the advantages of relatively simpler preprocessing steps and flexible variable selections for input. This allows us to preserve model transferability and generalizability across different data sources.

Since MRSA emerged, multiple predictive models for risk factors for MRSA infections have been proposed. The models have differing degrees of accuracy but often focus on a certain type of infection, such as pneumonia, to achieve and simplify the risk factors and models. Rhodes et al. used a machine learning model to predict community-acquired MRSA pneumonia²⁶. Although the time frame and patient population differed from our study, their model achieved an AUROC of 0.775, which was lower than ours. Additionally, some risk factor-based models rely heavily on certain tests, such as the nasal MRSA PCR test from nare²⁷, which hampers the model’s generalizability due to limitations in the tests’ availability and applicability to other types of infections. Also, some of the results may not be available when starting antibiotics, which limits the usability of models in hospitals. In contrast, our model carries a significant advantage since the model can take widely available data from EHRs and predict the outcomes even with some missing certain tests. Our model can be used not only for treatment decisions but also for infection prevention to isolate the patients in high-risk groups before culture results, although the utility of contact precaution is still controversial. Our model used a two-week time window to provide more meaningful predictions in clinical settings. Some predictive models only predict the index culture rather than overall risks²¹. To be applied in a clinical setting, predicting over a two-week window can be more impactful for clinicians when they choose antimicrobial therapy at the time of the initiation. The cumulative incidence curves based on our model prediction clearly differentiated the high-risk and low-risk patients. The majority of patients had positive MRSA cultures on the index day, but approximately 15% of high-risk patients had positive cultures after the index day, which could be missed if we only predicted the positivity of the index culture. Currently, our model only predicts the positivity of cultures regardless of the source of cultures, which allows simplification of the data processing and model structure. However, providing more informative predictions, such as source of cultures, may allow physicians to decide finer selection of antimicrobial therapies. For example, when the model predicts only wound culture is positive for MRSA in stable cellulitis patients, oral antimicrobials, such as sulfamethoxazole-trimethoprim, may be adequate for the therapy.

We evaluated the potential impact of our PyTorch_EHR model in a clinical setting. MRSA-specific and any MRSA antibiotics were used to evaluate the impacts. Although linezolid and daptomycin can be used for vancomycin-resistant enterococci (VRE), the low-risk groups had positive VRE in 6 cases in MHHS test datasets and 32 cases in MIMIC-IV test datasets. The model identified a large number of potentially avoidable antimicrobials targeting MRSA used in low-risk patients (7949 and 1397 doses in MHHS and MIMIC-IV, respectively). Our model only “missed” a small number of patients (0.3% and 0.04% in MHHS and MIMIC-IV, respectively). When evaluating overall performance, our model potentially provides benefits in 1752 cases and 560 cases in MHHS and MIMIC-IV datasets. The high-risk patient population had a significantly high relative risk ratio for MRSA bacteremia. This indicates that our model predicts not only the positivity of MRSA culture but also the severity of MRSA infections in high-risk patients. Furthermore, although the absolute numbers were small, 58.0% and 38.6% of events with MRSA bacteremia did not receive “optimal” antimicrobials within 12 h of the index time. MRSA bacteremia is one of the most severe infections in the hospital. In our study, although only early antimicrobial therapy and avoidable MRSA-specific antimicrobials were evaluated, early initiation of appropriate antibiotics in critically ill patients improves their outcomes¹ and avoiding unnecessary antimicrobials reduces side effects and potential complications from those antimicrobials, such as Clostridioides difficile infection. We believe potential benefits can be larger in clinical settings.

One of the challenges of deep learning models is their explainability. Interestingly, our model successfully identified clinically important features. Although there was variability among patients, our model successfully identified MRSA-related admission diagnosis. Previous antimicrobial exposures were also visualized in the population. However, the results were difficult to interpret clinically. We also visualized the factors contributing to the model predictions at an individual level (see Supplementary Fig. 9). Since the model uses the time sequence without dichotomizing the time frame with an arbitrary cutoff, i.e., positive MRSA culture within 90 days, the contribution weight can be different depending on the patient and the timing of events. Although some of the factors seem associated with MRSA infection, those highly contributed events are not necessarily directly associated with the predictions of MRSA. The inputs could surrogate other underlying events. Caution is required to interpret the feature importance as those outputs may not be traditional risk factors we use in clinical settings.

This study has limitations. First, due to the nature of retrospective studies, potential biases are inevitable, and its findings should be confirmed in prospective studies. In addition, although the datasets we used are from hospitals in two distinct regions of the United States, the model should be validated in other patient populations and high-risk populations, such as immunocompromised patients. Second, this model predicts positive MRSA cultures rather than infections. Since some patients can have MRSA infections without positive cultures, the model should be used cautiously when there are significant concerns about MRSA when initiating antibiotics. We also included analysis for patients with MRSA bacteremia, which is usually considered a true infection. The potential clinical benefits were consistent in this cohort. Third, potential clinical impacts by our model were evaluated based on the clinician’s antibiotic prescriptions and final culture results. We used MRSA-specific antimicrobials and any MRSA antimicrobials to evaluate the different clinical scenarios. Our model consistently showed the benefits in both settings. Particularly in the evaluation of MRSA-specific antimicrobials, although vancomycin is often used to target MRSA, linezolid and daptomycin can be used for other potential pathogens, such as VRE. Although those were minor cases in the datasets, there could be uncommon situations where those antibiotics were used for other purposes. Fourth, although we included multiple variables in the model, several important variables as known MRSA risk factors, such as residence in a long-term care facility, were not included. Furthermore, vital signs or other basic laboratory results were not included in this model. Those can be considered in future studies. Finally, although we showed the generalizability of the model in this study, the transferability of the model needs to be addressed to use the deep-learning model widely.

In summary, our deep learning-based predictive model successfully predicted positive MRSA culture over two weeks from index culture. Our study revealed model superiority against other traditional machine learning models in both MHHS and MIMIC-IV datasets with high performance, even in significantly imbalanced datasets and some subgroup analyses. The model can be widely applied to various types of infections. Compared to the treating physician’s decision, our model could provide potential benefits, reducing unnecessary MRSA antimicrobial use and optimizing antimicrobial therapy. Considering the performance of our model in the datasets, the model likely provides more clinical benefits in populations with a high prevalence of MRSA infections. Studies in high-risk populations, such as immunocompromised patients, and prospective studies are warranted to validate the model.

Methods

EHR datasets

Patient data were retrospectively retrieved from two EHR databases: 1) Database at MHHS, Houston, Texas, for model training and comparison to traditional machine learning models and 2) MIMIC-IV v2.1 for external validation. MIMIC-IV is a relational de-identified EHR database containing hospital encounters from a tertiary academic medical center in Boston, Massachusetts²⁸.

From the MHHS database, EHRs from 1/2018 and 4/2021 were obtained for patients >= 18 years of age, with at least one bacterial culture during the study period. To avoid an imbalanced dataset, we randomly selected 8,164 patients with MRSA-positive cultures and 18,069 patients with other types of cultures, including cultures positive for methicillin-sensitive S. aureus (MSSA) and other types of bacteria and negative cultures. Demographic data, admission data, diagnostic and procedure codes, antibiotic administration, other infectious disease-related test results, and previous microbiological data, including the type of cultures, name of bacteria, and all antibiotic sensitivities, were obtained from the database. Microbiology data tables included cultures and other infectious disease tests, such as serologies. To avoid label leakage, we used only results reported by the index time. The laboratory orders were included without results when they were ordered by the index time. For diagnostic and procedure codes, International Classification of Disease (ICD)−9 or ICD-10 codes were used. Since other data tables, such as antibiotics, did not contain standardized codes for medications, free text, such as “vancomycin,” was used. Extracted data were cleaned and converted to categorical data to fit the PyTorch_EHR scheme. The admission ward information was converted to generalized features, such as ED, ICU, and IMU, to later map those locations to MIMIC-IV data.

Similarly, EHRs for all patients with bacterial cultures and >18 years of age were retrieved from the MIMIC-IV database. To validate the generalizability of the model, each data table was mapped with the MHHS data table. Only data mapped with MHHS data were used in the MIMIC-IV dataset. Since the MIMIC-IV dataset aggregated the ICD and procedure codes at each encounter level, only codes reported in the previous encounters were used to avoid label leakage. The microbiology event table was used to identify eligible patient events, and those data were used as part of inputs in our model. Of the total 25,599 S. aureus-positive cultures from various sources in the table of MIMIC-IV, 19,605 isolates (76.6%) had been tested for antimicrobial sensitivity for various reasons, including multiple positive cultures with S. aureus in a short period and positive wound cultures due to multiple organisms. S. aureus-positive cultures within seven days of positive MSSA or MRSA were removed, leaving 519 S. aureus isolates, which did not have any recent sensitivity to classify them as MRSA or MSSA. These isolates were classified into the non-MRSA group. The datasets were further divided as 70:10:20 (Supplementary Fig. 1). We used the data for two purposes; 1) to generate a model only trained and tested on MIMIC-IV, and 2) to fine-tune the pre-trained model with the MHHS datasets and test on the MIMIC-IV test dataset. For the results of model predictions and clinical impact, only test datasets of each database were used.

For subgroup analysis in the MHHS dataset, the ICD code was used to identify the patient with that code within the two-week period. Since the MIMIC-IV dataset only provided ICD codes at the encounter levels, we used the encounter to find the patients with the ICD codes within the encounter.

PyTorch_EHR prediction model scheme

We used the deep learning platform PyTorch_EHR to predict clinical outcomes using categorical data from EHRs. As the majority of MRSA infections or new infections are diagnosed within two weeks, we set a two-week window for the prediction, and any first culture within the window was used as an index culture (Fig. 2). This prediction window allows not only prediction at the time of culture but also cultures obtained after initiation of empiric antibiotics, which is essential for physicians to decide whether to start or continue empiric MRSA antibiotic therapy at the index time. Some patients had multiple cultures over time, including MRSA and non-MRSA cultures. Those patients were included in both MRSA and non-MRSA groups for patient characteristic description, depending on the timing of positive or negative culture the patient had during the window period.

PyTorch_EHR implements an RNN model. We chose the gated recurrent unit (GRU) RNN architecture, which is known for being an efficient sequential deep learning architecture for clinical event predictions (see Supplementary Fig. 1). The source code of this model is publicly available to enable its application and further evaluation by other researchers²⁹. In addition to categorical data, PyTorch_EHR handles the time difference between hospital visits for a better temporal representation of patient history to improve accuracy (see Supplementary Fig. 2)^30,31. We converted the interval to days from visits to accommodate predictions for more acute issues.

For binary classification tasks, we compared our model to two traditional machine learning algorithms, LR³² and LGBM³³. We elected LR as the most basic binary classification model and gradient boost machine as powerful and used in multiple classification tasks^34,35. To keep the temporal relationship between index time and each feature available for those models, we prepared the data to include the number of occurrences of each feature before index time and the distance between the most recent feature occurrence and the index time. (see Supplementary Fig. 3) After preparation of the data, we standardized the numerical values to optimize algorithms. For each model including RNN, we obtained their optimal hyperparameters using optuna (ver. 2.10.0)³⁶. After obtaining the area under the curve (AUC) for each model prediction, we use DeLong test³⁷ to obtain p-values and the 95% confidence intervals of AUC differences between the models to statistically evaluate the significance.

For survival prediction, we used the DeepSurv³⁸ architecture, replacing the multiple-layer perceptron layers with GRU layers for better sequential information modeling, similar to the way we modelled COVID-19 outcome prediction²³. Python version 3.9.7, PyTorch version 1.7.1, and Sklearn version 0.24.2 were used in this study.

Possible clinical impact of our model

To evaluate the potential clinical impact of our model, we filtered high-risk and low-risk patient cohorts based on the prediction of our model. Considering the different prevalences of MRSA-positive cultures in each dataset, we defined different cutoffs for high-risk and low-risk patients. For MHHS dataset, we used the cutoff to obtain a specificity of 95% for high-risk patients and a sensitivity of 95% for low-risk patients. For MIMIC-IV, considering significant imbalanced data, we decided to use a specificity of 99% and a sensitivity of 90%, respectively. All three models used the same cutoff and were evaluated for the model performance. After defining the cohort, we evaluated the number of patients who had positive MRSA cultures and received or did not receive empiric MRSA-related antimicrobial therapy. We used two groups of antimicrobials: MRSA-specific antibiotics and MRSA antibiotics. MRSA-specific antibiotics include vancomycin, daptomycin, linezolid, and telavancin, which are often used in the hospital when empiric therapy is necessary, or bacteremia is suspected. Any MRSA antibiotics include other intravenous and oral antimicrobials, which possess anti-MRSA activity. However, these antimicrobials are also often used for other types of bacteria, such as gram-negative bacteria. We evaluated our model with both groups of antimicrobials. MRSA bacteremia was specifically chosen to define true bacterial infections since positive cultures in other types of culture do not necessarily mean true infections, i.e., some patients may have contamination or colonization in some situations.

Model interpretation

For the mechanistic interpretation of MRSA predictions, we used the integrated gradient technique³⁹ to expose the factors contributing to the personalized model predictions. For RNN-based models, we can achieve a patient-level explanation, which shows the contribution scores for each clinical event on each day in the patient trajectory. We also obtained the medians of contribution scores of frequent features in our model to evaluate the overall importance of certain features in the cohort. However, we need to highlight that such contribution scores should be mainly used for patient-level predicted score explanation and not for inferring population-level risk factors/important features as it is different from LR coefficients or LGBM feature importance scores, such as SHAP⁴⁰. To evaluate our RNN-based model explainability, we reviewed the calculated contribution scores for each clinical event in the input of 10 patients. We visualized the contribution score per patient through an institutional Tableau interactive dashboard (Seattle, Washington), where clinicians can navigate different clinical events within various categories and across multiple visits in the patient history.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

MHHS data that support the findings of this study are not openly available due to reasons of sensitivity and are available from the corresponding author upon requests and our institutional IRB approvals. MIMIC-IV data v2.1, used in this study as an external validation, is publicly available after data use agreement on the website. (https://physionet.org/content/mimiciv/2.1/).

Code availability

The Original Pytorch_EHR code and sample codes used in this work are publicly available^29,41.

References

Liu, C. et al. Clinical practice guidelines by the Infectious Diseases Society of America for the treatment of methicillin-resistant Staphylococcus aureus infections in adults and children. Clin. Infect. Dis. 52, e18–e55 (2011).
Article PubMed Google Scholar
Fridkin, S. K., Sanza, L. T., Jernigan, J. A. & Lynfield, R. Methicillin-resistant Staphylococcus aureus disease in three communities. N. Engl. J. Med. 352, 1436–1444 (2005).
Article CAS PubMed Google Scholar
Moran, G. J., Gorwitz, R. J. & McDougal, L. K. Methicillin-Resistant S. aureus Infections among Patients in the Emergency Department. N. Engl J. Med. 355, 666–674 (2006).
Article CAS PubMed Google Scholar
Rybak, M. et al. Therapeutic monitoring of vancomycin in adult patients: a consensus review of the American Society of Health-System Pharmacists, the Infectious Diseases Society of America, and the Society of Infectious Diseases Pharmacists. Am. J. Health Syst. Pharm. 66, 82–98 (2009).
Article CAS PubMed Google Scholar
Carey, G. B. et al. Estimated mortality with early empirical antibiotic coverage of methicillin-resistant Staphylococcus aureus in hospitalized patients with bacterial infections: a systematic review and meta-analysis. J. Antimicrob. Chemother. 78, 1150–1159 (2023).
Article CAS PubMed Google Scholar
Hidron, A. I. et al. Risk factors for colonization with methicillin-resistant Staphylococcus aureus (MRSA) in patients admitted to an urban hospital: emergence of community-associated MRSA nasal carriage. Clin. Infect. Dis. 41, 159–166 (2005).
Article PubMed Google Scholar
Szumowski, J. D. et al. Methicillin-resistant Staphylococcus aureus colonization, behavioral risk factors, and skin and soft-tissue infection at an ambulatory clinic serving a large population of HIV-infected men who have sex with men. Clin. Infect. Dis. 49, 118–121 (2009).
Article PubMed Google Scholar
Wakatake, H. et al. Positive clinical risk factors predict a high rate of methicillin-resistant Staphylococcus aureus colonization in emergency department patients. Am. J. Infect. Control 40, 988–991 (2012).
Article PubMed Google Scholar
Cadena, J., Thinwa, J., Walter, E. A. & Frei, C. R. Risk factors for the development of active methicillin-resistant Staphylococcus aureus (MRSA) infection in patients colonized with MRSA at hospital admission. Am. J. Infect. Control 44, 1617–1621 (2016).
Article PubMed Google Scholar
Shorr, A. F. et al. A risk score for identifying methicillin-resistant Staphylococcus aureus in patients presenting to the hospital with pneumonia. BMC Infect. Dis. 13, 268 (2013).
Article PubMed PubMed Central Google Scholar
MacDougall, C., Powell, J. P., Johnson, C. K., Edmond, M. B. & Polk, R. E. Hospital and community fluoroquinolone use and resistance in Staphylococcus aureus and Escherichia coli in 17 US hospitals. Clin. Infect. Dis. 41, 435–440 (2005).
Article CAS PubMed Google Scholar
Asensio, A., Guerrero, A., Quereda, C., Lizán, M. & Martinez-Ferrer, M. Colonization and infection with methicillin-resistant Staphylococcus aureus: associated factors and eradication. Infect. Control Hosp. Epidemiol. 17, 20–28 (1996).
Article CAS PubMed Google Scholar
Schneider-Lindner, V., Delaney, J. A., Dial, S., Dascal, A. & Suissa, S. Antimicrobial drugs and community-acquired methicillin-resistant Staphylococcus aureus, United Kingdom. Emerg. Infect. Dis. 13, 994–1000 (2007).
Article CAS PubMed PubMed Central Google Scholar
Huang, S. S. & Platt, R. Risk of Methicillin-resistant Staphylococcus aureus infection after previous infection or colonization. Clin. Infect. Dis. 36, 281–285 (2003).
Article PubMed Google Scholar
DHHS, HITECH Act Enforcement Interim Final Rule. HHS.gov https://www.hhs.gov/hipaa/for-professionals/special-topics/hitech-act-enforcement-interim-final-rule/index.html (2009).
Anahtar, M. N., Yang, J. H. & Kanjilal, S. Applications of machine learning to the problem of antimicrobial resistance: an emerging model for translational research. J. Clin. Microbiol 59, e0126020 (2021).
Article PubMed Google Scholar
Kim, J. I. et al. Machine learning for antimicrobial resistance prediction: current practice, limitations, and clinical perspective. Clin. Microbiol Rev. 35, e00179–21 (2022).
Article PubMed PubMed Central Google Scholar
Feretzakis, G. et al. Using machine learning algorithms to predict antimicrobial resistance and assist empirical treatment. Stud. Health Technol. Inf. 272, 75–78 (2020).
Google Scholar
Hsu, C.-C., Lin, Y. E., Chen, Y.-S., Liu, Y.-C. & Muder, R. R. Validation study of artificial neural network models for prediction of methicillin-resistant Staphylococcus aureus Carriage. Infect. Control Hosp. Epidemiol. 29, 607–614 (2008).
Article PubMed Google Scholar
Lewin-Epstein, O., Baruch, S., Hadany, L., Stein, G. Y. & Obolski, U. Predicting antibiotic resistance in hospitalized patients by applying machine learning to electronic medical records. Clin. Infect. Dis. 72, e848–e855 (2021).
Article PubMed Google Scholar
Hirano, Y. et al. Machine learning approach to predict positive screening of methicillin-resistant Staphylococcus aureus during mechanical ventilation using synthetic dataset from MIMIC-IV Database. Front. Med. 8, 694520 (2021).
Article Google Scholar
Nigo, M. et al. PK-RNN-V E: A deep learning model approach to vancomycin therapeutic drug monitoring using electronic health record data. J. Biomed. Inf. 133, 104166 (2022).
Article Google Scholar
Rasmy, L. et al. Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data. Lancet Digit Health S2589-7500(22)00049–8 (2022). https://doi.org/10.1016/S2589-7500(22)00049-8.
Hernàndez-Carnerero, À. et al. Dimensionality reduction and ensemble of LSTMs for antimicrobial resistance prediction. Artif. Intell. Med. 138, 102508 (2023).
Article PubMed Google Scholar
Abul-Husn, N. S. & Kenny, E. E. Personalized medicine and the power of electronic health records. Cell 177, 58–69 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rhodes, N. J. et al. Machine learning to stratify methicillin-resistant staphylococcus aureus risk among hospitalized patients with community-acquired pneumonia. Antimicrob. Agents Chemother. 67, e01023–22 (2022).
PubMed PubMed Central Google Scholar
Baby, N. et al. Nasal Methicillin-Resistant Staphylococcus aureus (MRSA) PCR testing reduces the duration of MRSA-targeted therapy in patients with suspected MRSA Pneumonia. Antimicrob. Agents Chemother. 61, e02432-16 (2017).
Article PubMed PubMed Central Google Scholar
Goldberger, A. L. et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation 101, E215–E220 (2000).
Article CAS PubMed Google Scholar
ZhiGroup. Predictive Modeling on Electronic Health Records (EHR) using Pytorch. https://github.com/ZhiGroup/pytorch_ehr (2023).
Choi, E. et al. RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism. in Advances in Neural Information Processing Systems vol. 29 (Curran Associates, Inc., 2016).
Wu, S. et al. Modeling asynchronous event sequences with RNNs. J. Biomed. Inf. 83, 167–177 (2018).
Article Google Scholar
Scikit-learn. LogisticRegression. https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html.
LightGBM. LightGBM 3.3.2 documentation. https://lightgbm.readthedocs.io/en/v3.3.2/.
Tran Quoc, V. et al. Predicting antibiotic resistance in ICUs patients by applying machine learning in Vietnam. Infect. Drug Resist 16, 5535–5546 (2023).
Article CAS PubMed PubMed Central Google Scholar
Corbin, C. K. Personalized antibiograms for machine learning driven antibiotic selection. Commun Med (Lond). 2, 38 (2022).
Article PubMed Google Scholar
Optuna - A hyperparameter optimization framework. Optuna https://optuna.org/.
Sun, X. & Xu, W. Fast implementation of DeLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Process. Lett. 21, 1389–1393 (2014).
Article ADS Google Scholar
Katzman, J. L. et al. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med. Res. Methodol. 18, 24 (2018).
Article PubMed PubMed Central Google Scholar
Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. in Proceedings of the 34th International Conference on Machine Learning - Volume 70 3319–3328 (JMLR.org, 2017).
Lundberg, S. M. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. Neural Information Processing Systems (2017).
ZhiGroup. PyTorch_EHR for MRSA Positive Culture. https://github.com/ZhiGroup/pytorch_ehr/tree/MRSA. (2024)

Download references

Acknowledgements

M.N. receives NIH grant (NIH/NIAID R01 AI175699).

Author information

Authors and Affiliations

McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
Masayuki Nigo
McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA
Masayuki Nigo, Laila Rasmy, Bingyu Mao, Ziqian Xie & Degui Zhi
Division of Infectious Diseases, Department of Medicine, Houston Methodist Hospital, Texas Medical Center, Houston, TX, USA
Masayuki Nigo
Department of Internal Medicine, University of Arizona College of Medicine, Phoenix, AZ, USA
Bijun Sai Kannadath

Authors

Masayuki Nigo
View author publications
You can also search for this author in PubMed Google Scholar
Laila Rasmy
View author publications
You can also search for this author in PubMed Google Scholar
Bingyu Mao
View author publications
You can also search for this author in PubMed Google Scholar
Bijun Sai Kannadath
View author publications
You can also search for this author in PubMed Google Scholar
Ziqian Xie
View author publications
You can also search for this author in PubMed Google Scholar
Degui Zhi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.N. conducted data cleaning, model training, and wrote manuscripts. B.M. cleaned codes for publication and analyzed data. L.R., Z.X. and D.Z. developed model and revised manuscripts. BSK revised manuscripts and provided insights in model training and evaluation.

Corresponding author

Correspondence to Masayuki Nigo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval

This study was approved by Institutional Review Boards (IRBs) at the University of Texas Health Science Center, Houston, Texas, and MHHS (Protocol number: HSC-MS-20-0121).

Peer review

Peer review information

Nature Communications thanks Alastair Hay and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

.Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nigo, M., Rasmy, L., Mao, B. et al. Deep learning model for personalized prediction of positive MRSA culture using time-series electronic health records. Nat Commun 15, 2036 (2024). https://doi.org/10.1038/s41467-024-46211-0

Download citation

Received: 06 December 2023
Accepted: 19 February 2024
Published: 06 March 2024
DOI: https://doi.org/10.1038/s41467-024-46211-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Predicting bloodstream infection outcome using machine learning

Prediction of ciprofloxacin resistance in hospitalized patients using machine learning

Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty

Introduction

Results

Patient characteristics

Types of infection and other pathogens

Model prediction

Potential clinical impact

Feature importance

Discussion

Methods

EHR datasets

PyTorch_EHR prediction model scheme

Possible clinical impact of our model

Model interpretation

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval

Peer review

Peer review information

Additional information

.Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links