Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning

Ho, Chi-Sing; Jean, Neal; Hogan, Catherine A.; Blackmon, Lena; Jeffrey, Stefanie S.; Holodniy, Mark; Banaei, Niaz; Saleh, Amr A. E.; Ermon, Stefano; Dionne, Jennifer

doi:10.1038/s41467-019-12898-9

Article
Open access
Published: 30 October 2019

Nature Communications volume 10, Article number: 4927 (2019)

Abstract

Raman optical spectroscopy promises label-free bacterial detection, identification, and antibiotic susceptibility testing in a single step. However, achieving clinically relevant speeds and accuracies remains challenging due to weak Raman signal from bacterial cells and numerous bacterial species and phenotypes. Here we generate an extensive dataset of bacterial Raman spectra and apply deep learning approaches to accurately identify 30 common bacterial pathogens. Even on low signal-to-noise spectra, we achieve average isolate-level accuracies exceeding 82% and antibiotic treatment identification accuracies of 97.0±0.3%. We also show that this approach distinguishes between methicillin-resistant and -susceptible isolates of Staphylococcus aureus (MRSA and MSSA) with 89±0.1% accuracy. We validate our results on clinical isolates from 50 patients. Using just 10 bacterial spectra from each patient isolate, we achieve treatment identification accuracies of 99.7%. Our approach has potential for culture-free pathogen identification and antibiotic susceptibility testing, and could be readily extended for diagnostics on blood, urine, and sputum.

Introduction

Bacterial infections are a leading cause of death in both developed and developing nations, taking >6.7 million lives each year^1,2. These infections are also costly to treat, accounting for 8.7% of annual healthcare spending, or $33 billion, in the United States alone³. Current diagnostic methods require sample culturing to detect and identify the bacteria and its antibiotic susceptibility, a slow process that can take days even in state-of-the-art labs^4,5. Broad spectrum antibiotics are often prescribed while waiting for culture results⁶, and according to the Centers for Disease Control and Prevention, over 30% of patients are treated unnecessarily⁷. New methods for rapid, culture-free diagnosis of bacterial infections are needed to enable earlier prescription of targeted antibiotics and help mitigate antimicrobial resistance.

Raman spectroscopy has the potential to identify the species and antibiotic resistance of bacteria, and when combined with confocal spectroscopy, can interrogate individual bacterial cells (Fig. 1a, b). Different bacterial phenotypes are characterized by unique molecular compositions, leading to subtle differences in their corresponding Raman spectra. However, because Raman scattering efficiency is low (~10⁻⁸ scattering probability⁸), these subtle spectral differences are easily masked by background noise. High signal-to-noise ratios (SNRs) are thus needed to reach high identification accuracies⁹, typically requiring long measurement times that prohibit high-throughput single-cell techniques. Additionally, the large number of clinically relevant species, strains, and antibiotic resistance patterns require comprehensive datasets that are not gathered in studies that focus on differentiating between species^10,11, isolates (typically referred to as strains in the literature)^12,13, or antibiotic susceptibilities^{14,15,16,17,18,19}. In this work, we address this challenge by training a convolutional neural network (CNN) to classify noisy bacterial spectra by isolate, empiric treatment, and antibiotic resistance.

Results

Deep learning for bacterial classification from Raman spectra

In order to gather a training dataset, we measure Raman spectra using short measurement times on dried monolayer samples, as illustrated in Fig. 1. We ensure that the majority of individual spectra are taken over single cells and preparation conditions are consistent between samples (See Methods). We construct reference datasets of 60,000 spectra from 30 bacterial and yeast isolates for 3 measurement times — these 30 isolate classes cover over 94% of all bacterial infections treated at Stanford Hospital in the years 2016–17 and are representative of the majority of infections in intensive care units worldwide²⁰. We further augment our reference dataset with 12,000 spectra from clinical patient isolates, including MRSA and MSSA isolates (see Methods for full dataset information). Previously, the lack of large datasets prohibited the use of CNNs due to the high number of spectra per bacterial class needed for training.

In recent years, CNNs have been applied with tremendous success to a broad range of computer vision problems^{21,22,23,24,25,26,27,28,29,30}. However, while classical machine learning techniques have been applied to spectral data^{11,12,14,31,32}, relatively little work has been done in adapting deep learning models to spectral data^33,34,35,36. In particular, state-of-the-art CNN techniques from image classification such as residual connections have previously not been applied to low SNR, 1D spectral data. Our CNN architecture consists of 25 1D convolutional layers and residual connections³⁷ — instead of two-dimensional images, it takes one-dimensional spectra as input (see Methods for further detail). Unlike previous work, we do not use pooling layers and instead use strided convolutions with the goal of preserving the exact locations of spectral peaks³⁸. Empirically, we find that this strategy improves model performance.

We train the neural network on a 30-class isolate identification task, where the CNN outputs a probability distribution across the 30 reference isolates and the maximum is taken as the predicted class. The model is trained on the reference dataset and tested on an independent test dataset gathered from separately cultured samples.

A performance breakdown for individual classes is displayed in the confusion matrix in Fig. 2a. Here, we show data for 1 s measurement times, corresponding to a SNR of 4.1 — roughly an order of magnitude lower than typical reported bacterial spectra^10,11,12; classification accuracies increase with SNR, as shown in Supplementary Fig. 1. On the 30-class task, the average isolate-level accuracy is 82.2±0.3% (± calculated as standard deviation across 5 train and validation splits). Gram-negative bacteria are primarily misclassified as other Gram-negative bacteria; the same is generally true for Gram-positive bacteria, where additionally, the majority of misclassifications occur within the same genus. In comparison, our implementations of the more common classification techniques of logistic regression and support vector machine (SVM) achieve accuracies of 75.7% and 74.9%, respectively.

Identification of empiric treatments and antibiotic resistance

Species-level classification accuracy is the standard metric for bacterial identification, but in practice, the priority for physicians is choosing the correct antibiotic to treat a patient. Common antibiotics often have activity against multiple species, so the 30 isolates can be arranged into groupings based on the recommended empiric treatment if the bacterial species is known. Classification accuracies can thus be condensed into a new confusion matrix grouped by empiric antibiotic treatment (Fig. 2b), where the average accuracy of our method is 97.0±0.3%. In comparison, logistic regression and SVM achieve accuracies of 93.3% and 92.2%, respectively.
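The regrouping of isolate-level predictions by empiric treatment can be sketched as follows. This is a minimal illustration, not the authors' code: the isolate names, treatment names, and the `TREATMENT_OF` mapping are hypothetical placeholders, and the actual groupings are those shown in Fig. 2b.

```python
# Hypothetical isolate -> empiric-treatment mapping, for illustration only.
TREATMENT_OF = {
    "isolate_A": "treatment_1",
    "isolate_B": "treatment_1",
    "isolate_C": "treatment_2",
    "isolate_D": "treatment_3",
}


def accuracy(true_labels, predicted_labels):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def to_treatment(labels):
    """Replace each isolate label with its empiric-treatment group."""
    return [TREATMENT_OF[label] for label in labels]


# Made-up true and predicted isolate labels for four spectra.
true_isolates = ["isolate_A", "isolate_B", "isolate_C", "isolate_D"]
pred_isolates = ["isolate_B", "isolate_B", "isolate_C", "isolate_C"]

isolate_acc = accuracy(true_isolates, pred_isolates)
treatment_acc = accuracy(to_treatment(true_isolates), to_treatment(pred_isolates))
```

In this toy case the first spectrum is assigned to the wrong isolate but the right treatment group, so the treatment-level accuracy is higher than the isolate-level accuracy. The same effect explains why misclassifications within a genus often do not change the recommended antibiotic.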

Beyond empiric first choice antibiotics, clinicians also conduct antibiotic susceptibility tests to determine bacterial responses to drugs. As a step toward a culture-free antibiotic susceptibility test using Raman spectroscopy, we train a binary CNN classifier to differentiate between methicillin-resistant and -susceptible isolates of S. aureus. This model achieves 89.1±0.1% identification accuracy (Fig. 3a). Because the consequences for misdiagnosing MRSA as MSSA are often more severe than the reverse misdiagnosis, the binary decision can be tuned for higher sensitivity (low false negative rate), as shown in the receiver operating characteristic (ROC) curve in Fig. 3b (dotted line denotes performance of random guessing). The area under the curve (AUC) is 0.953, meaning that a randomly selected positive example (i.e., Raman sample from patient with MRSA) will be predicted to be more likely to be MRSA than a randomly selected negative example (i.e., sample from patient with MSSA) with probability 0.953.
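The probabilistic reading of the AUC given above can be checked directly by comparing every positive score with every negative score. The sketch below assumes the classifier emits one score per spectrum (the predicted probability of MRSA); the function name and the example scores are invented for illustration.

```python
def pairwise_auc(positive_scores, negative_scores):
    """Fraction of (positive, negative) pairs in which the positive example
    receives the higher score; ties count as half a correct ranking."""
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Made-up predicted MRSA probabilities, for illustration only.
mrsa_scores = [0.9, 0.8, 0.4]  # spectra from MRSA isolates (positives)
mssa_scores = [0.7, 0.3, 0.2]  # spectra from MSSA isolates (negatives)

auc = pairwise_auc(mrsa_scores, mssa_scores)  # 8 of 9 pairs ranked correctly
```

Because the AUC depends only on the ranking of scores, it is unaffected by where the decision threshold is placed, which is what allows the threshold to be tuned separately for higher sensitivity.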

Extension to clinical patient isolates

To demonstrate that this approach can be extended to new clinical settings, we test our model on two groups of 25 clinical isolates derived from patient samples, for a total of 50 patients. Within each patient group, samples include 5 isolates from each of the 5 most prevalent³⁹ empiric treatment groups (see Supplementary Table 2 and Supplementary Fig. 4). We first consider isolates from 25 patients collected from Palo Alto VA Medical Center in 2018. We augment our reference dataset with this clinical dataset, which comprises 400 spectra per clinical isolate. To account for changes in the relative prevalence of species and antibiotic resistances over time, the model may be fine-tuned on a small dataset that is representative of current patient populations. We use a leave-one-patient-out cross-validation (LOOCV) strategy for fine-tuning, where we assign 1 patient in each class to the test set (5 patients total) and use the other 4 for fine-tuning (20 patients total), fine-tuning on 10 randomly sampled spectra per patient isolate; we repeat this process 5 times, so all 25 patient isolates appear in the held-out test set once. We then use 10 randomly sampled spectra from each patient isolate in the test set to reach an infection identification for that patient isolate. The sampling procedure for identification is repeated for 10,000 trials, and we report the average accuracy and standard deviation, and display a trial representing the modal result in Fig. 4a (full experiment details can be seen in Supplementary Note 1). A CNN pre-trained on the reference dataset serves both as initialization for the fine-tuned model and as a baseline, achieving 89.0±3.6% (± calculated as standard deviation across 10,000 sampling trials) species identification accuracy, a statistically significant improvement over logistic regression and support vector machine baselines (see Methods for details). When the CNN is fine-tuned on clinical data and then evaluated on the held-out patients, the identification accuracy is improved to 99.0±1.9% (Supplementary Fig. 5). Samples for the clinical tests were prepared separately for each patient, so we conclude that the measured performance is not due to batch effects from sample preparation or measurement conditions.

Because patient samples may contain very low numbers of bacterial cells without culturing (e.g. 1 CFU/mL or fewer in blood⁴⁰), only a few individual bacterial spectra per patient may be available to make a diagnosis. As seen in Fig. 4c, just 10 cellular spectra are enough to reach high identification accuracy. The rate of correct identification using 10 spectra is 99.0%, within 1% of the performance with 400 spectra (100.0%). While acquiring spectra from 400 individual bacterial cells would likely necessitate culturing, we achieve high accuracy on spectra from 10 individual bacterial cells, commensurate with typical levels of bacterial cells present in uncultured samples^40,41.

For a proof-of-concept antibiotic susceptibility test on clinical isolates, we collect Raman spectra on 5 additional clinical MRSA isolates and test the binary MRSA/MSSA classifier that is pre-trained on the reference MRSA and MSSA isolates. Using the same LOOCV process, we fine-tune the binary classifier on the clinical spectra. A representative result is shown in Fig. 4b; any misclassifications of MSSA as MRSA are labeled as “suboptimal”, indicating that Vancomycin (prescribed for MRSA) is also effective on MSSA but is not considered optimal treatment and may introduce adverse patient effects. On average, the pre-trained binary classifier achieves 61.7±7.3% accuracy and the fine-tuned binary classifier achieves 65.4±6.3% accuracy (Supplementary Fig. 5).

Finally, to test the robustness of the fine-tuning approach over multiple clinical datasets, we use our second patient group of 25 isolates, collected from Stanford Hospital from February 2019 to March 2019. We conduct additional fine-tuning of the model that is pre-trained on the reference dataset and fine-tuned on the original clinical dataset. The treatment group identification accuracy on the new clinical dataset using only 10 spectra per patient is 99.7±1.1% (Fig. 4d, e), with improved performance for both S. aureus and P. aeruginosa, demonstrating the potential for continuous improvement of the trained model.

Discussion

In this work, we apply state-of-the-art deep learning techniques to noisy Raman spectra to identify clinically relevant bacteria and their empiric treatment. A CNN model pre-trained on our dataset can easily be extended to new clinical settings through fine-tuning on a small number of clinical isolates, as we have shown on our clinical dataset. We envision that fine-tuning processes such as the one demonstrated here could be important components for continuously evaluating and improving deployed models. Our model, applied here to the identification of clinically relevant bacteria, can be applied with minimal modification to other identification problems such as materials identification, or other spectroscopic techniques such as nuclear magnetic resonance, infrared, or mass spectrometry.

This study uses measurement times of 1 s, corresponding to SNRs that are an order of magnitude lower than typical reported bacterial spectra — while still achieving comparable or improved identification accuracy on more isolate classes than typical Raman bacterial identification studies. A common strategy for reducing measurement times is surface-enhanced Raman scattering (SERS) using plasmonic structures, which can increase the signal strength by several orders of magnitude^11,42,43. SERS spectra can be highly variable and difficult to reproduce, particularly on cell samples^8,44, making it difficult to develop a reliable diagnostic method based on SERS. However, with a dataset capturing the breadth of variation in SERS spectra, a CNN could enable a platform that processes blood, sputum, or urine samples in a few hours.

Compared to other culture-free methods⁴⁵ including single-cell sequencing^46,47,48,49 and fluorescence or magnetic tagging⁵⁰, Raman spectroscopy has the unique potential to be a technique for identifying phenotypes that does not require specially designed labels, allowing for easy generalizability to new strains.

To achieve treatment recommendations as fine-grained as those from culture-based methods, larger datasets covering more resistant and susceptible clinical isolates, greater diversity in antibiotic susceptibility profiles, cell states, and growth media and conditions would be needed. Though collecting such datasets is beyond an academic scope, requiring highly automated sample preparation and data acquisition processes, there is promise for clinical translation. Similarly, studies applying the Raman-CNN system to identify pathogens in relevant biofluids such as whole blood, sputum, and urine are a promising future direction to demonstrate the validity of the method as a diagnostic tool. When combined with such an automated system, the Raman-CNN platform presented here could rapidly scan and identify every cell in a patient sample and recommend an antibiotic treatment in one step, without needing to wait for a culture step. Such a technique would allow for accurate and targeted treatment of bacterial infections within hours, reducing healthcare costs and antibiotic misuse, limiting antimicrobial resistance, and improving patient outcomes.

Methods

Dataset

The reference dataset consists of 30 bacterial and yeast isolates, including multiple isolates of Gram-negative and Gram-positive bacteria, as well as Candida species. We also include an isogenic pair of S. aureus from the same strain, in which one variant contains the mecA resistance gene for methicillin (MRSA) and the other does not (MSSA)⁵¹ (see Supplementary Table 1 for full isolate information). The reference training dataset consists of 2000 spectra each for the 30 reference isolates plus isogenic MSSA at 3 measurement times. The reference fine-tuning and test datasets each consist of 100 spectra for each of the 30 reference isolates. The first clinical dataset consists of 30 patient isolates distributed across 5 species, with 400 spectra per isolate. The second clinical dataset consists of 25 patient isolates distributed across the same 5 species, with 100 spectra per isolate. Due to degradation in optical system efficiency, the measurement times for the reference fine-tuning and test datasets and the second clinical dataset were increased from 1 s to 2 s in order to keep SNR consistent across datasets. Antibiotic susceptibility testing was performed first genotypically, by detecting the mecA methicillin resistance gene using PCR (PMID: 19741081), and then phenotypically on the Microscan Walkaway instrument (Beckman Coulter, Brea, CA) and VITEK® 2 (Biomérieux, Inc., Durham, NC).

Dataset variance

For our datasets, we observe that intra-sample variance is high, as demonstrated by the pairwise spectral difference analysis summarized in Supplementary Fig. 2. For 19 out of 30 isolates, spectra from at least one other isolate are, on average, more similar than spectra from the same isolate. For example, when we rank isolates in order of similarity to E. faecalis 2 (Supplementary Fig. 2c), there are 8 other isolates where the average difference between a spectrum from E. faecalis 2 and a spectrum from the other isolate is smaller than the average difference between two spectra from E. faecalis 2. When intra-sample variance is high, a large number of spectra per sample may help to better represent the full data distribution and lead to higher predictive performance.
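The comparison between intra-isolate and inter-isolate differences can be illustrated with a small sketch. The difference measure is not specified in this section, so the Euclidean distance used here is an assumption, and the two-point "spectra" are toy values chosen only to show the effect.

```python
import math
from itertools import combinations, product


def l2(a, b):
    """Euclidean distance between two spectra (assumed difference measure)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_intra(spectra):
    """Average difference between two distinct spectra of the same isolate."""
    pairs = list(combinations(spectra, 2))
    return sum(l2(a, b) for a, b in pairs) / len(pairs)


def mean_inter(spectra_a, spectra_b):
    """Average difference between a spectrum of one isolate and one of another."""
    pairs = list(product(spectra_a, spectra_b))
    return sum(l2(a, b) for a, b in pairs) / len(pairs)


# Toy spectra: isolate X is widely spread, isolate Y sits in between.
isolate_x = [[0.0, 0.0], [4.0, 0.0]]
isolate_y = [[2.0, 0.0], [2.0, 0.0]]

intra_x = mean_intra(isolate_x)             # 4.0
inter_xy = mean_inter(isolate_x, isolate_y)  # 2.0
```

Here a spectrum from isolate X is, on average, closer to isolate Y than to another spectrum from isolate X, which is the situation described for 19 of the 30 isolates.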

Sample preparation

Bacterial isolates were cultured on blood agar plates each day before measurement. Plates were sealed with Parafilm and stored at 4 °C for 20 min to 12 h before sample preparation. Storage times varied to allow for multiple measurement times per day; however all other sample preparation conditions were kept consistent between samples. Differences in storage time were not found to result in spectral changes greater than spectral changes due to strain or isogenic differences. All clinical isolates were prepared in separate samples with consistent sample preparation conditions. Because test samples were prepared separately from samples used for training, we conclude that classifications are not due to batch effects such as differences in sample preparation. We prepared samples for measurement by suspending 0.6 mg of biomass from a single colony in 10 µL of sterile water (0.4 mg in 5 µL water for Gram-positive species) and drying 3 µL of the suspension on a gold-coated silica substrate (Fig. 1a, b). Substrates were prepared by electron beam evaporation of 200 nm of gold onto microscope slides that were pre-cleaned using base piranha. Samples were allowed to dry for 1 h before measurement.

Raman measurements

We measured Raman spectra across monolayer regions of the dried samples (Fig. 1a) using the mapping mode of a Horiba LabRAM HR Evolution Raman microscope. 633 nm illumination at 13.17 mW was used with a 300 l/mm grating to generate spectra with 1.2 cm⁻¹ dispersion to maximize signal strength while minimizing background signal from autofluorescence. Wavenumber calibration was performed using a silicon sample. The ×100 0.9 NA objective lens (Olympus MPLAN) generates a diffraction-limited spot size, ~1 µm in diameter. A 45 × 45 discrete spot map is taken with 3 µm spacing between spots to avoid overlap between spectra. The spectra are individually background corrected using a polynomial fit of order 5 using the subbackmod Matlab function available in the Biodata toolbox (see Supplementary Fig. 1 for examples of raw and corrected spectra). The majority of spectra are measured on true monolayers and arise from ~1 cell due to the diffraction-limited laser spot size, which is roughly the size of a bacterial cell. However, a small number of spectra may be taken over aggregates or multilayer regions. We exclude the spectra that are most likely to be non-monolayer measurements by ranking the spectra by signal intensity and discarding the 25 spectra with highest intensity, which includes all spectra with intensities greater than two standard deviations from the mean. We measured both monolayers and single cells, and found that monolayer measurements have SNRs of 2.5 ± 0.7, similar to single-cell measurements (2.4 ± 0.6), while allowing for the semi-automated generation of a large training dataset. The spectral range between 381.98 and 1792.4 cm⁻¹ was used, and spectra were individually normalized to run from a minimum intensity of 0 and maximum intensity of 1 within this spectral range. SNR values are calculated by dividing the total intensity range by the intensity range over a 20-pixel wide window in a region where there is no Raman signal.
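The per-spectrum normalization and the SNR definition described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the function names are invented, the position of the signal-free window (`noise_start`) must be chosen by the user, and the example spectrum is synthetic.

```python
def normalize(spectrum):
    """Rescale a spectrum so its minimum is 0 and its maximum is 1."""
    lo, hi = min(spectrum), max(spectrum)
    return [(v - lo) / (hi - lo) for v in spectrum]


def snr(spectrum, noise_start, window=20):
    """Total intensity range divided by the intensity range inside a
    `window`-pixel region that is assumed to contain no Raman signal."""
    noise = spectrum[noise_start:noise_start + window]
    return (max(spectrum) - min(spectrum)) / (max(noise) - min(noise))


# Synthetic spectrum: 20 signal-free pixels alternating between 0 and 1,
# followed by a single peak of height 5.
spectrum = [0.0, 1.0] * 10 + [2.0, 5.0, 2.0]

normalized = normalize(spectrum)
value = snr(spectrum, noise_start=0)  # (5 - 0) / (1 - 0) = 5.0
```

Because both the numerator and the denominator are intensity ranges, this SNR is unchanged by min-max normalization, so it can be computed before or after rescaling.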

CNN architecture & training details

The CNN architecture is adapted from the ResNet architecture³⁷ that has been widely successful across a range of computer vision tasks. It consists of an initial convolution layer followed by 6 residual layers and a final fully connected classification layer; a block diagram can be seen in Fig. 1. The residual layers contain shortcut connections between the input and output of each residual block, allowing for better gradient propagation and stable training (refer to reference 37 for details). Each residual layer contains 4 convolutional layers, so the network has 1 + 6 × 4 = 25 convolutional layers and, counting the final fully connected layer, a total depth of 26 layers. The initial convolution layer has 64 convolutional filters, while each of the hidden layers has 100 filters. These architecture hyperparameters were selected via grid search using one training and validation split on the isolate classification task. We also experimented with simple MLP (multi-layer perceptron) and CNN architectures but found that the ResNet-based architecture performed best.

We first train the network on the 30-isolate classification task, where the output of the CNN is a vector of probabilities across the 30 classes and the maximum probability is taken as the predicted class. The binary MRSA/MSSA and binary isogenic MRSA/MSSA classifiers have the same architecture as the 30-isolate classifier, aside from the number of classes in the final classification layer. We use the Adam optimizer⁵² across all experiments with learning rate 0.001, betas (0.5, 0.999), and batch size 10. Classification accuracies are reported across 5 randomly selected train and validation splits. We first pre-train the CNN on the reference training dataset, then fine-tune on the reference fine-tuning dataset to account for measurement changes due to degradation in optical system efficiency. For each of the 5 splits, we split the fine-tuning data into 90/10 train and validation splits, train the CNN on the train split, and use the accuracy on the validation split to perform model selection. We then evaluate and report the test accuracy on the test dataset which is gathered from independently cultured and prepared samples. The binary MRSA/MSSA classifier is trained and fine-tuned using the same procedure. The binary isogenic MRSA/MSSA classifier is trained using a similar procedure on data from a single measurement series.

All error values reported for tests on the reference dataset are standard deviation values across 5 splits.

While a high number of samples is good for ensuring dataset variation, deep learning approaches can still benefit from having a high number of examples per sample. When intra-sample variance is high, as we observe for our datasets, a large number of spectra per sample may better represent the full distribution and lead to higher predictive performance.

For the clinical isolates, we start by pre-training a CNN on the empiric treatment labels for the 30 reference isolates. We then use the following leave-one-patient-out cross-validation (LOOCV) strategy to fine-tune the parameters of the CNN. There are a total of 25 patient isolates across 5 species. In each of the 5 folds, we assign 1 patient in each species to the test set, 1 patient in each species to the validation set, and the remaining 3 patients in each species to the training (i.e., fine-tuning) set. We then use the clinical training set (consisting of isolates from 15 patients) to fine-tune the CNN parameters, and use accuracy on the validation set (5 patient isolates) to do model selection. The test accuracy for each fold is evaluated on the test set (5 patient isolates) using the method described below.
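The fold structure described above (per species: 1 test patient, 1 validation patient, 3 fine-tuning patients, over 5 folds) can be sketched as follows. The text does not say how the validation patient is chosen in each fold, so the rotation used here is an assumption, and the patient identifiers are placeholders.

```python
def make_folds(patients_by_species, n_folds=5):
    """Build folds in which each species contributes 1 test patient,
    1 validation patient, and the remaining patients for fine-tuning."""
    folds = []
    for k in range(n_folds):
        fold = {"train": [], "val": [], "test": []}
        for patients in patients_by_species.values():
            n = len(patients)
            test = patients[k % n]
            val = patients[(k + 1) % n]  # assumed rotation, not from the paper
            fold["test"].append(test)
            fold["val"].append(val)
            fold["train"].extend(p for p in patients if p not in (test, val))
        folds.append(fold)
    return folds


# Placeholder identifiers: 5 species with 5 patient isolates each.
patients_by_species = {
    f"species_{s}": [f"s{s}_p{i}" for i in range(5)] for s in range(5)
}
folds = make_folds(patients_by_species)
```

With 5 patients per species and 5 folds, every patient isolate appears in the test set exactly once, and each fold fine-tunes on 15 patients, validates on 5, and tests on 5.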

Clinical identification data analysis

To reach an identification for patient isolates, 400 spectra are measured across a sample from each patient isolate. 10 of these spectra are chosen at random to be classified. The most common class out of the 10 spectral classifications is then chosen as the identification for each patient isolate, with ties broken randomly. All error values reported for tests on the clinical dataset are standard deviations across 10,000 trials of random selections of 10 spectra, with an upper accuracy bound of 100%. For the second clinical dataset, we perform the same procedure, except that we choose 10 out of 100 spectra for each patient isolate, and use a model that is both pre-trained on the reference dataset and fine-tuned on the first clinical dataset.
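The voting procedure above can be sketched for a single patient isolate as follows. This is a simplified illustration: it takes per-spectrum class predictions as given (in practice they would come from the CNN), the function names are invented, and the example prediction lists are synthetic.

```python
import random
from collections import Counter


def identify(spectrum_predictions, n_spectra=10, rng=random):
    """Sample n_spectra predictions without replacement and return the most
    common class, breaking ties uniformly at random."""
    sample = rng.sample(spectrum_predictions, n_spectra)
    counts = Counter(sample)
    top = max(counts.values())
    tied = sorted(label for label, c in counts.items() if c == top)
    return rng.choice(tied)


def identification_rate(spectrum_predictions, true_label, n_trials=10000, seed=0):
    """Fraction of random 10-spectrum draws that yield the correct class."""
    rng = random.Random(seed)
    hits = sum(
        identify(spectrum_predictions, rng=rng) == true_label
        for _ in range(n_trials)
    )
    return hits / n_trials


# Synthetic example: 400 per-spectrum predictions, 300 of them correct.
predictions = ["A"] * 300 + ["B"] * 100
rate = identification_rate(predictions, true_label="A", n_trials=2000)
```

Voting over 10 spectra makes the isolate-level identification more reliable than any single spectrum: in this synthetic case only 75% of individual spectra are classified correctly, yet the large majority of 10-spectrum draws still return the correct class.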

Baselines

In all experiments where logistic regression (LR) and support vector machine (SVM) baselines were used, we first used PCA to reduce the input dimension from 1000 to 20. This hyperparameter was determined by plotting test accuracies for different settings on one training and validation split for the 30 isolate task and picking a value near where the test accuracy saturated. Using only the first 20 principal components not only decreases computation costs, but also increases accuracy by reducing the amount of noise in the data. For each fold of the cross validation procedure, we use grid search to choose the regularization hyperparameter for each model achieving the best validation accuracy and report the corresponding test accuracy. Using both the training and fine-tuning reference datasets to train the baseline models, LR and SVM achieve 57.5% and 56.8% on the 30-class task and 89.0% and 88.3% on the empiric treatment task, respectively. Using only the fine-tuning reference dataset, LR and SVM achieve 75.7% and 74.9% on the 30-class task and 93.3% and 92.2% on the empiric treatment task, respectively. The latter performance is higher because the baseline models do not benefit from additional training data as the CNN does, but rather benefit from training data that most closely matches the measurement conditions of the test data.

Two-sample test of sample means

We use the Welch’s two-sample $t$-test to test whether the differences in mean clinical accuracy for the CNN and the SVM and LR baselines were statistically significant. Welch’s $t$-test is a variation of the Student’s $t$-test that is used when the two samples may have unequal variances. In each case, we start by computing the pooled standard deviation as

$$\sigma =\sqrt{\frac{(n_{1}-1)\sigma_{1}^{2}+(n_{2}-1)\sigma_{2}^{2}}{n_{1}+n_{2}-2}}. \tag{1}$$

We then compute the standard error of the difference between the means as

$${\rm{se}}=\sigma \times \sqrt{\frac{1}{n_{1}}+\frac{1}{n_{2}}}. \tag{2}$$

Finally, we can compute the test statistic as

$$t=\frac{\mu_{1}-\mu_{2}}{{\rm{se}}}, \tag{3}$$

and then compute the p-value using the corresponding Student’s $t$-distribution. For our computations, $n_{\rm{CNN}}=n_{\rm{LR}}=n_{\rm{SVM}}=10000$, $\mu_{\rm{CNN}}=89.0$, $\mu_{\rm{LR}}=81.8$, $\mu_{\rm{SVM}}=82.9$, $\sigma_{\rm{CNN}}=3.6$, $\sigma_{\rm{LR}}=6.0$, and $\sigma_{\rm{SVM}}=5.9$. In comparing the CNN with LR, we computed a $t$-statistic of 102.9 and in comparing the CNN with SVM, we computed a $t$-statistic of 88.3. In both cases, we reject the null hypothesis that the means are equal at the $10^{-6}$ significance level.
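The three steps of this calculation (pooled standard deviation, standard error, test statistic) can be reproduced from the summary statistics quoted above. The sketch below is a direct transcription of those formulas; the function name is ours.

```python
import math


def pooled_t_statistic(mu1, sigma1, n1, mu2, sigma2, n2):
    """t-statistic for a difference in means, using the pooled standard
    deviation and the standard error of the difference."""
    pooled = math.sqrt(
        ((n1 - 1) * sigma1**2 + (n2 - 1) * sigma2**2) / (n1 + n2 - 2)
    )
    se = pooled * math.sqrt(1 / n1 + 1 / n2)
    return (mu1 - mu2) / se


# Summary statistics quoted in the text (accuracy in %, 10,000 trials each).
t_lr = pooled_t_statistic(89.0, 3.6, 10000, 81.8, 6.0, 10000)   # CNN vs LR
t_svm = pooled_t_statistic(89.0, 3.6, 10000, 82.9, 5.9, 10000)  # CNN vs SVM

print(f"{t_lr:.1f} {t_svm:.1f}")  # prints "102.9 88.3"
```

The values agree with the reported statistics. Because the two samples have equal sizes here, the pooled standard error equals the unpooled form $\sqrt{\sigma_1^2/n_1+\sigma_2^2/n_2}$, so the pooled and unequal-variance statistics coincide.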

Biological materials availability

Unique isolates are available from the authors upon reasonable request.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data needed to replicate these results are available at https://github.com/csho33/bacteria-ID.

Code availability

All code needed to replicate these results is available at https://github.com/csho33/bacteria-ID.

References

Fleischmann, C. et al. Assessment of global incidence and mortality of hospital-treated sepsis. current estimates and limitations. Am. J. Respir. Crit. Care Med. 193, 259–272 (2016).
DeAntonio, R., Yarzabal, J.-P., Cruz, J. P., Schmidt, J. E. & Kleijnen, J. Epidemiology of community-acquired pneumonia and implications for vaccination of children living in developing and newly industrialized countries: A systematic literature review. Hum. Vaccin. Immunother. 12, 2422–2440 (2016).
Torio, C.M. & Moore, B.J. National inpatient hospital costs: The most expensive conditions by payer, 2013. Tech. Rep. HCUP Statistical Brief #204., Agency for Healthcare Research and Quality (2016).
Dellinger, R. P. et al. Surviving sepsis campaign: international guidelines for management of severe sepsis and septic shock: 2012.
Chaudhuri, A. et al. EFNS guideline on the management of community-acquired bacterial meningitis: report of an EFNS task force on acute bacterial meningitis in older children and adults. Eur. J. Neurol. 15, 649–659 (2008).
American Thoracic Society. & Infectious Diseases Society of America. Guidelines for the management of adults with hospital-acquired, ventilator-associated, and healthcare-associated pneumonia. Am. J. Respir. Crit. Care Med. 171, 388–416 (2005).
Fleming-Dutra, K. E. et al. Prevalence of inappropriate antibiotic prescriptions among US ambulatory care visits, 2010-2011. JAMA 315, 1864–1873 (2016).
Butler, H. J. et al. Using raman spectroscopy to characterize biological materials. Nat. Protoc. 11, 664–687 (2016).
Stöckel, S., Kirchhoff, J., Neugebauer, U., Rösch, P. & Popp, J. The application of raman spectroscopy for the detection and identification of microorganisms. J. Raman Spectrosc. 47, 89–109 (2016).
Kloss, S. et al. Culture independent raman spectroscopic identification of urinary tract infection pathogens: a proof of principle study. Anal. Chem. 85, 9610–9616 (2013).
Boardman, A. K. et al. Rapid detection of bacteria from blood with Surface-Enhanced raman spectroscopy. Anal. Chem. 88, 8026–8035 (2016).
Article CAS Google Scholar
Schmid, U. et al. Gaussian mixture discriminant analysis for the single-cell differentiation of bacteria using micro-raman spectroscopy. Chemometrics Intellig. Lab. Syst. 96, 159–171 (2009).
Article CAS Google Scholar
Münchberg, U., Rösch, P., Bauer, M. & Popp, J. Raman spectroscopic identification of single bacterial cells under antibiotic influence. Anal. Bioanal. Chem. 406, 3041–3050 (2014).
Article Google Scholar
Novelli-Rousseau, A. et al. Culture-free antibiotic-susceptibility determination from single-bacterium raman spectra. Sci. Rep. 8, 3957 (2018).
Article ADS CAS Google Scholar
Liu, C.-Y. et al. Rapid bacterial antibiotic susceptibility test based on simple surface-enhanced raman spectroscopic biomarkers. Sci. Rep. 6, 23375 (2016).
Article ADS CAS Google Scholar
Lu, X. et al. Detecting and tracking nosocomial methicillin-resistant staphylococcus aureus using a microfluidic SERS biosensor. Anal. Chem. 85, 2320–2327 (2013).
Article CAS Google Scholar
Germond, A. et al. Raman spectral signature reflects transcriptomic features of antibiotic resistance in escherichia coli. Communications Biology 1, 85 (2018).
Article Google Scholar
Ayala, O. D. et al. Drug-Resistant staphylococcus aureus strains reveal distinct biochemical features with raman microspectroscopy. ACS Infect Dis 4, 1197–1210 (2018).
Article MathSciNet CAS Google Scholar
Kirchhoff, J. et al. Simple ciprofloxacin resistance test and determination of minimal inhibitory concentration within 2 h using raman spectroscopy. Anal. Chem. 90, 1811–1818 (2018).
Article CAS Google Scholar
Vincent, J.-L. et al. International study of the prevalence and outcomes of infection in intensive care units. JAMA 302, 2323–2329 (2009).
Article CAS Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G.E. ImageNet classification with deep convolutional neural networks. In Pereira, F., Burges, C. J. C., Bottou, L. & Weinberger, K. Q. (eds.) Advances in Neural Information Processing Systems 25, 1097-1105 (Curran Associates, Inc., 2012).
Mnih, V., Heess, N., Graves, A. & Kavukcuoglu, K. Recurrent models of visual attention. In Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N. D. & Weinberger, K. Q. (eds.) Advances in Neural Information Processing Systems 27, 2204–2212 (Curran Associates, Inc., 2014).
Karpathy, A. & Fei-Fei, L. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3128–3137 (cv-foundation.org, 2015).
Zhang, R., Isola, P. & Efros, A.A. In Computer Vision – ECCV 2016, 649-666 (Springer International Publishing, 2016).
Dong, C., Loy, C.C., He, K. & Tang, X. In Computer Vision – ECCV 2014, 184–199 (Springer International Publishing, 2014).
Wang, L., Ouyang, W., Wang, X. & Lu, H. In Proceedings of the IEEE international conference on computer vision, 3119–3127 (cv-foundation.org, 2015).
Girshick, R., Donahue, J., Darrell, T. & Malik, J. In Proceedings of the IEEE conference on computer vision and pattern recognition, 580–587 (cv-foundation.org, 2014).
Girshick, R. et al. Hierarchical deep convolutional neural networks combine spectral and spatial information for highly accurate raman microscopy based cytopathology. J. Biophotonics 11, e201800022 (2018).
Article Google Scholar
Lotfollahi, M., Berisha, S., Daeinejad, D. & Mayerich, D. Digital staining of High-Definition fourier transform infrared (FT-IR) images using deep learning. Appl. Spectrosc. 73, 556–564 (2019).
Article ADS CAS Google Scholar
Berisha, S. et al. Deep learning for FTIR histology: leveraging spatial and spectral features with convolutional neural networks. Analyst 144, 1642–1653 (2019).
Article ADS CAS Google Scholar
Kampe, B., Kloß, S., Bocklitz, T., Rösch, P. & Popp, J. Recursive feature elimination in raman spectra with support vector machines. Front. Optoelectron. 10, 273–279 (2017).
Article Google Scholar
Guo, S. et al. Model transfer for raman-spectroscopy-based bacterial classification. J. Raman Spectrosc. 49, 627–637 (2018).
Article ADS CAS Google Scholar
Gurbani, S. S. et al. A convolutional neural network to filter artifacts in spectroscopic MRI. Magn. Reson. Med. 80, 1765–1775(2018).
Malek, S., Melgani, F. & Bazi, Y. One-dimensional convolutional neural networks for spectroscopic signal regression: Feature extraction based on 1D-CNN is proposed and validated. J. Chemom. 32, e2977 (2018).
Article Google Scholar
Liu, J. et al. Deep convolutional neural networks for raman spectrum recognition: a unified solution. Analyst (2017).
Zhang, X., Lin, T., Xu, J., Luo, X. & Ying, Y. DeepSpectra: An end-to-end deep learning approach for quantitative spectral analysis. Anal. Chim. Acta 1058, 48–57 (2019).
Article ADS CAS Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778 (2016).
Dumoulin, V. & Visin, F. A guide to convolution arithmetic for deep learning. Preprint at https://arxiv.org/abs/1603.07285(2016).
Banaei, N., Watz, N., Getsinger, D. & Ghafghaichi, L. SUH antibiogram data for bacterial and yeast isolates. Tech. Rep., Stanford Healthcare Clinical Microbiology Laboratory http://med.stanford.edu/bugsanddrugs/clinical-microbiology/_jcr_content/main/panel_builder/panel_0/download_748639600/file.res/SHC/%20antibiogram/202016.pdf (2016).
Lamy, B., Dargère, S., Arendrup, M. C., Parienti, J.-J. & Tattevin, P. How to optimize the use of blood cultures for the diagnosis of bloodstream infections? a state-of-the art. Front. Microbiol. 7, 697 (2016).
Article Google Scholar
Reimer, L. G., Wilson, M. L. & Weinstein, M. P. Update on detection of bacteremia and fungemia. Clin. Microbiol. Rev. 10, 444–465 (1997).
Article CAS Google Scholar
Kögler, M. et al. Bare laser-synthesized au-based nanoparticles as nondisturbing surface-enhanced raman scattering probes for bacteria identification. J. Biophotonics 11, e201700225 (2018).
Article Google Scholar
Chen, Y., Premasiri, W. R. & Ziegler, L. D. Surface enhanced raman spectroscopy of chlamydia trachomatis and neisseria gonorrhoeae for diagnostics, and extra-cellular metabolomics and biochemical monitoring. Sci. Rep. 8, 5163 (2018).
Article ADS CAS Google Scholar
Li, J. F. et al. Shell-isolated nanoparticle-enhanced raman spectroscopy. Nature 464, 392–395 (2010).
Article ADS CAS Google Scholar
Cronquist, A. B. et al. Impacts of culture-independent diagnostic practices on public health surveillance for bacterial enteric pathogens. Clin. Infect. Dis. 54 Suppl 5, S432–S439 (2012).
Article Google Scholar
Kang, D.-K. et al. Rapid detection of single bacteria in unprocessed blood using integrated comprehensive droplet digital detection. Nat. Commun. 5, 5427 (2014).
Article ADS CAS Google Scholar
Tung, P.-Y. et al. Batch effects and the effective design of single-cell gene expression studies. Sci. Rep. 7, 39921 (2017).
Article ADS CAS Google Scholar
Wang, Y. & Navin, N. E. Advances and applications of single-cell sequencing technologies. Mol. Cell 58, 598–609 (2015).
Article CAS Google Scholar
Pallen, M. J., Loman, N. J. & Penn, C. W. High-throughput sequencing and clinical microbiology: progress, opportunities and challenges. Curr. Opin. Microbiol. 13, 625–631 (2010).
Article CAS Google Scholar
Chung, J., Kang, J. S., Jurng, J. S., Jung, J. H. & Kim, B. C. Fast and continuous microorganism detection using aptamer-conjugated fluorescent nanoparticles on an optofluidic platform. Biosens. Bioelectron. 67, 303–308 (2015).
Article CAS Google Scholar
Diep, B. A. et al. The arginine catabolic mobile element and staphylococcal chromosomal cassette mec linkage: convergence of virulence and resistance in the USA300 clone of methicillin-resistant staphylococcus aureus. J. Infect. Dis. 197, 1523–1530 (2008).
Article CAS Google Scholar
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. Preprint at https://arxiv.org/abs/1412.6980. (2014).

Acknowledgements

The authors gratefully acknowledge the assistance of Joel Jean, Chi-Min Ho, Alice Lay, Katherine Sytwu, Randy Mehlenbacher, Tracey Hong, Samuel Lee, David Zeng, Mark Winters, Marcin Walkiewicz and Andrey Malkovskiy. Raman measurements were performed at the Stanford Nano Shared Facilities (SNSF), supported by the National Science Foundation under award ECCS-1542152. The authors gratefully acknowledge support from the Alfred P. Sloan Foundation, the Stanford Catalyst for Collaborative Solutions and the Gates Foundation. N.J. acknowledges support from the Department of Defense (DoD) through the National Defense Science & Engineering Graduate Fellowship (NDSEG) Program.

Author information

These authors contributed equally: Chi-Sing Ho, Neal Jean.

Authors and Affiliations

Dept. of Applied Physics, Stanford University, Stanford, CA, USA
Chi-Sing Ho
Dept. of Materials Science and Engineering, Stanford University, Stanford, CA, USA
Chi-Sing Ho, Lena Blackmon, Amr A. E. Saleh & Jennifer Dionne
Dept. of Computer Science, Stanford University, Stanford, CA, USA
Neal Jean & Stefano Ermon
Dept. of Electrical Engineering, Stanford University, Stanford, CA, USA
Neal Jean
Dept. of Pathology, Stanford University School of Medicine, Stanford, CA, USA
Catherine A. Hogan & Niaz Banaei
Clinical Microbiology Laboratory, Stanford Health Care, Stanford, CA, USA
Catherine A. Hogan & Niaz Banaei
Dept. of Surgery, Stanford University School of Medicine, Stanford, CA, USA
Stefanie S. Jeffrey
Dept. of Medicine, Stanford University School of Medicine, Stanford, CA, USA
Mark Holodniy
VA Palo Alto Health Care System, Palo Alto, CA, USA
Mark Holodniy
Division of Infectious Diseases and Geographic Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
Mark Holodniy & Niaz Banaei
Dept. of Engineering Mathematics and Physics, Faculty of Engineering, Cairo University, Giza, Egypt
Amr A. E. Saleh

Authors

Chi-Sing Ho
Neal Jean
Catherine A. Hogan
Lena Blackmon
Stefanie S. Jeffrey
Mark Holodniy
Niaz Banaei
Amr A. E. Saleh
Stefano Ermon
Jennifer Dionne

Contributions

C.H. and N.J. conceptualized the algorithms, analyzed classification results, and fine-tuned the algorithms. C.H. developed sample preparation and data collection protocols, and collected the datasets. N.J. designed, optimized, and trained the algorithms. C.H. and L.B. prepared sample cultures. N.B., M.H. and C.A.H. developed the antibiotic groupings, collected samples, and provided input on clinical relevance. A.A.E.S., N.B. and J.A.D. conceived the initial idea and C.H. and N.J. further developed the idea. J.A.D. and A.A.E.S. supervised the project along with supervision from S.S.J., N.B., M.H. and S.E. on relevant portions of the research. All authors contributed to editing of the manuscript.

Corresponding authors

Correspondence to Chi-Sing Ho, Amr A. E. Saleh, Stefano Ermon or Jennifer Dionne.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ho, CS., Jean, N., Hogan, C.A. et al. Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning. Nat Commun 10, 4927 (2019). https://doi.org/10.1038/s41467-019-12898-9

Received: 04 December 2018
Accepted: 27 September 2019
Published: 30 October 2019
DOI: https://doi.org/10.1038/s41467-019-12898-9
