Rapid species identification of pathogenic bacteria from a minute quantity exploiting three-dimensional quantitative phase imaging and artificial neural network

Kim, Geon; Ahn, Daewoong; Kang, Minhee; Park, Jinho; Ryu, DongHun; Jo, YoungJu; Song, Jinyeop; Ryu, Jea Sung; Choi, Gunho; Chung, Hyun Jung; Kim, Kyuseok; Chung, Doo Ryeon; Yoo, In Young; Huh, Hee Jae; Min, Hyun-seok; Lee, Nam Yong; Park, YongKeun

doi:10.1038/s41377-022-00881-x

Download PDF

Article
Open access
Published: 23 June 2022

Rapid species identification of pathogenic bacteria from a minute quantity exploiting three-dimensional quantitative phase imaging and artificial neural network

Geon Kim^1,2,
Daewoong Ahn³,
Minhee Kang⁴,
Jinho Park^1,2,
DongHun Ryu^1,2,
YoungJu Jo ORCID: orcid.org/0000-0002-2364-7160^1,2,3^nAff11,
Jinyeop Song^1,2^nAff12,
Jea Sung Ryu⁵,
Gunho Choi³,
Hyun Jung Chung^5,6,
Kyuseok Kim⁷,
Doo Ryeon Chung⁸,
In Young Yoo⁹,
Hee Jae Huh¹⁰,
Hyun-seok Min³,
Nam Yong Lee¹⁰ &
…
YongKeun Park ORCID: orcid.org/0000-0003-0528-6661^1,2,3

Light: Science & Applications volume 11, Article number: 190 (2022) Cite this article

8183 Accesses
30 Citations
6 Altmetric
Metrics details

Subjects

Abstract

The healthcare industry is in dire need of rapid microbial identification techniques for treating microbial infections. Microbial infections are a major healthcare issue worldwide, as these widespread diseases often develop into deadly symptoms. While studies have shown that an early appropriate antibiotic treatment significantly reduces the mortality of an infection, this effective treatment is difficult to practice. The main obstacle to early appropriate antibiotic treatments is the long turnaround time of the routine microbial identification, which includes time-consuming sample growth. Here, we propose a microscopy-based framework that identifies the pathogen from single to few cells. Our framework obtains and exploits the morphology of the limited sample by incorporating three-dimensional quantitative phase imaging and an artificial neural network. We demonstrate the identification of 19 bacterial species that cause bloodstream infections, achieving an accuracy of 82.5% from an individual bacterial cell or cluster. This performance, comparable to that of the gold standard mass spectroscopy under a sufficient amount of sample, underpins the effectiveness of our framework in clinical applications. Furthermore, our accuracy increases with multiple measurements, reaching 99.9% with seven different measurements of cells or clusters. We believe that our framework can serve as a beneficial advisory tool for clinicians during the initial treatment of infections.

Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning

Article Open access 30 October 2019

Identification of a clonal population of Aspergillus flavus by MALDI-TOF mass spectrometry using deep learning

Article Open access 28 January 2022

Accurate and rapid antibiotic susceptibility testing using a machine learning-assisted nanomotion technology platform

Article Open access 18 March 2024

Introduction

Infections by microorganisms are a global healthcare issue that is associated with a large number of deaths and a significant amount of expenses. Notably, bacteria account for approximately half of the reported cases of infections¹, as well as a large portion of the entire healthcare spending². Hence, effectively treating this widespread and possibly deadly illness has been a long-sought goal in the clinical society.

Multiple studies indicate that an antibiotic treatment appropriate to the pathogen, during the early hours of an infection, can significantly reduce the mortality^3,4. In clinical settings, however, early antibiotic treatments are commonly empirical and imperfect, mainly due to the long turnaround time of routine microbial identification^5,6, resulting in increased mortality risk⁷.

The typical turnaround time of the routine microbial identification is over 24 h⁸. Conventional approaches including culture tests are often nonspecific as well as time-consuming, despite being relatively simple to perform⁹. Molecular diagnostic methods screen for genetic materials in a shorter duration, yet they are not scalable for arbitrary pathogens⁸. In recent days, matrix-assisted laser desorption/ionization time-of-flight mass spectroscopy (MALDI-TOF MS) serve as the gold standard of microbial identification. MALDI-TOF MS detects the molecular markers of bacteria^8,9 but only when the sample quantity is detectable, which is commonly satisfied after 24 h of culture.

Image-based methods have also been implemented to promptly detect or identify bacteria from a low quantity. Fluorescence microscopy has often been utilized in detecting and counting individual bacteria¹⁰. More recently, fluorescence in situ hybridization has allowed screening for certain types of bacteria, by specifically labeling genomic patterns^11,12. However, fluorescence imaging entails destructive chemical alteration of the sample, as well as requiring optimally manufactured probes for high specificity. Label-free alternatives including autofluorescence microscopy have been adopted for bacterial detection to circumvent the drawbacks of labeling^13,14, but at a specificity restricted to the variation in the intrinsic fluorophores.

In this study, we tackle the challenge of rapid microbial identification by exploiting three-dimensional (3D) quantitative phase imaging (QPI) and image classification based on an artificial neural network (ANN). 3D QPI is a label-free imaging technique that measures the 3D refractive index (RI) tomogram of a live cell and has been actively employed in quantitative cell profiling^{15,16,17,18,19}.

Our unprecedented utilization of 3D QPI and ANN for bacterial identification achieves 82.5% accuracy in determining the species from a single bacterial cell or cluster. The accuracy increases with 3D QPI measurements of multiple specimens, reaching 99.9% with seven different measurements. We note that this accuracy is obtained between 19 major species of bacteria that account for bloodstream infections (BSIs)^20,21,22, further underlining the potential in clinical applications. This exceptional performance from a minute quantity of bacteria suggests that the proposed method can guide the early antibiotic treatment prior to the time-consuming culture process.

Results

The workflow of the 3D QPI in the identification framework is illustrated in Fig. 1. Our 3D QPI system, which is commercialized and dubbed holotomography (HT-2H, Tomocube Inc., Daejeon, Republic of Korea), utilizes Mach-Zehnder laser interferometry equipped with a digital micromirror device (DMD) as shown in Fig. 1a. The DMD scans the illumination angle and the 3D refractive index (RI) tomogram is reconstructed from the sinogram of 2D QPI measurements under the principle of optical diffraction tomography (Fig. 1b, c)²³.

**Fig. 1: Three-dimensional (3D) QPI measurement of bacteria.**

The 3D RI tomogram is then classified into one of the 19 species, through a trained ANN. The training process involves gradient-based optimization of the network parameters, using the training dataset whose species are known. Our implementation of ANN mainly consists of 3D convolution operations for effective recognition of the 3D structure in 3D RI tomograms (Fig. 2). More specifically, the dense connections between the convolution operations induce the ANN to revisit the feature maps of the shallower layers even at the deep layers²⁴.

**Fig. 2: The structure of the ANN utilized in our framework.**

The key function of this identification framework is to identify the species of the bacteria from single to few cells. It can provide preliminary results during the early stages of infections before the diagnostic evidence from gold standard methods is available dozens of hours later. Incorporation of the proposed framework into the gold standard routine is practicable since it operates without destroying nor chemically modifying the bacteria.

3D QPI measurement of bacteria

A database of 3D RI tomograms was established from the isolates of 19 BSI-related bacterial species (Fig. 3). The database comprised a total of 10,556 3D RI tomograms, where each tomogram contained a single bacterium or several adhering bacteria. 3D QPI effectively conveyed the 3D structure of the bacteria, and some characteristic morphologies were visible in the 3D RI tomograms, e.g., cellular chains of streptococci. The species and the corresponding numbers of tomograms are as follows: Acinetobacter baumannii (664), Bacillus subtilis (515), Enterobacter cloacae (541), Enterococcus faecalis (526), Escherichia coli (600), Haemophilus influenzae (511), Klebsiella pneumoniae (525), Listeria monocytogenes (632), Micrococcus luteus (247), Proteus mirabilis (517), Pseudomonas aeruginosa (596), Serratia marcescens (519), Staphylococcus aureus (558), Staphylococcus epidermidis (559), Stenotrophomonas maltophilia (549), Streptococcus agalactiae (537), Streptococcus anginosus (644), Streptococcus pneumoniae (566), and Streptococcus pyogenes (750). The majority tomograms of bacilli, i.e., rod-shaped bacteria, contained single bacterial cells. On the other hand, most of cocci and coccobacilli, i.e., spherical and ovoid bacteria, respectively, were in the form of clusters of several adhering bacteria. For instance, the specimens belonging to the genus Streptococcus are mostly found as chains of multiple adhering bacteria; a feature that the genus is characterized with. 3D QPI also facilitates the calculation of biophysical properties of each specimen (see section 1 of Supplementary Information), owing to the quantitative contrast related to the sample composition.

**Fig. 3: Three-dimensional (3D) RI tomograms of bacterial BSI pathogens.**

Identification of pathogens using a single tomogram

With a single 3D RI tomogram, the proposed framework achieved a blind test accuracy of 82.5% in species identification. This single-measurement accuracy is comparable to the rate of correct species identification obtained using MALDI-TOF MS with a sufficient number of bacteria²⁵. The high performance was realized despite the limited amount of samples, by statistically utilizing the detailed 3D morphologies of the bacteria. Namely, each neuron in the ANN was distinctly activated based on the morphology of the input tomogram, as the result of the training process. This led the ANN output to be related to the conditional probability of the species given the input tomogram and the training data distribution (Fig. 4a).

**Fig. 4: Species identification using a single 3D RI tomogram.**

We note that this accurate single-measurement identification is the product of both 3D QPI and ANN, which rigorously measure and recognize the morphologies, respectively. To verify this, variant frameworks were implemented by altering the imaging strategy and the algorithm (see sections 2–4 of Supplementary Information). The performance of species identification dramatically decreased when 3D QPI was replaced with 2D QPI or 2D QPI sinogram, as well as when the ANN was replaced with a conventional machine learning algorithm²⁶.

The omission of the correct species could be further prevented at the expense of specificity. Namely, the correct species can be indicated at a higher rate by taking more than one species as the possible pathogen; we refer to this rate that the correct species is included in the N most likely species as the top-N accuracy. The top-2 accuracy and top-3 accuracy of the proposed framework were 94.3% and 97.1%, respectively (Fig. 4b). In clinic, although this trade-off itself is not unexpected, lowering risk with such strategies would be favorably considered whereas the loss of specificity can be buffered based on other indications, including characteristic symptoms and environmental evidence. Also, the sharp mitigation of the omission rate also underlines that the ANN robustly extracted features related to the correct species, even in the misidentified data. This robust feature extraction ability was also indicated by comparing the contrast of ANN outputs for the correctly and incorrectly identified data (see section 5 of Supplementary Information).

Error in identification using a single tomogram

To characterize the distribution of errors, the blind test result for the entire test dataset was investigated using the confusion matrix (Fig. 5a). The most frequent errors included the misidentification of A. baumannii as S. pneumoniae, K. pneumoniae as S. pneumoniae, S. agalactiae as S. aureus, and L. monocytogenes as B. subtilis. Notably, the misidentification of thick bacilli and coccobacilli as S. pneumoniae contributed to a large portion of the error. This is in consistency with the relatively elongated morphology of Streptococcus pneumoniae compared to other cocci^27,28. The overall identification performance varied among different species of bacteria. Among the 19 species, M. luteus was identified with both the highest sensitivity (95.0%) and specificity (100%). K. pneumoniae was the least sensitively identified species (62.5%), whereas S. peumoniae was the least specifically identified species (97.8%). The distribution of sensitivity and specificity in identifying each species are presented in more detail in section 6 of Supplementary Information.

**Fig. 5: Distribution of error in the species identification using a single 3D RI tomogram.**

The distribution of the second and third most likely species provided further insights regarding interspecific similarities (Fig. 5b). These plots visualize how similar the test data of different species are, concerning the features extracted by the ANN. Notably, a group of multiple species with morphological resemblance can be outlined as a cluster. The species of bacilli form a large cluster while the rest of the 19 species form another large cluster. In addition, E. cloacae, E. coli, and K. pneumoniae, namely, the species belonging to the family Enterobacteriaceae, showed a distinct clustering amidst other species of bacilli.

Apart from species identification, the proposed framework accurately performed common categorizations of bacteria from a single 3D QPI measurement. Accuracies of 94.6% and 94.2% were achieved in distinguishing between Gram-negative and positive bacteria, and between aerobic and facultatively anaerobic bacteria, respectively (Fig. 5c, d). This suggests the capability to distinguish bacteria in different standards, after training the ANN accordingly while maintaining the workflow.

Identification of pathogens using multiple tomograms

While the single-measurement performance of the proposed framework was comparable to that of the gold standard methods, securing more samples further increases the identification accuracy. The identification based on multiple measurements of 3D RI tomograms was realized by taking the average of the ANN outputs resulting from each of the individual 3D RI tomograms (Fig. 6a). The accuracy of species identification rose from 84.5% to 95.2%, 98.4%, and 99.9%, when reflecting two, three, and seven tomograms, respectively (left column, Fig. 6b). The error rate dropped more sharply than a simple reciprocal function of the sample quantity. This dramatic gain in the accuracy was attributable to the robust feature-extracting ability of the ANN. The correct species were strongly indicated in the ANN output even in the misidentified cases, as underlined in the abovementioned trade-off between the sensitivity and specificity; this can be seen from example data and outputs displayed in Fig. 6a where the multi-measurement identification is accurate even when the majority of the individual tomograms are misclassified.

**Fig. 6: Species identification based on multiple measurements of 3D RI tomograms.**

The multi-measurement strategy was also applied to the categorization between Gram-positive and negative bacteria, and between aerobic and facultatively anaerobic bacteria (center and right columns, Fig. 6b). Although a larger sample quantity led to higher performances in these categorizations as well, the gain in accuracy was not as significant as in the species identification. The two standards for categorization are not closely related to the optically accessible morphologies, and this might be why these categorizations did not benefit as profoundly from the multi-measurement strategy. Furthermore, it is indicated that the species-sensitive training drives the ANN to extract more diverse features as the multi-measurement identification of species interpreted into gram-stainability or respiratory metabolism provides higher accuracy than the direct categorization.

Discussion

We propose a bacterial identification framework that is sensitive to a few individual bacteria, using 3D QPI and ANN. The exceptionally high accuracy under a limited sample quantity is attributable to the remarkable single-cell profiling ability of 3D QPI and the feature-extracting ability of ANN. Results prove that the species-related cellular morphologies captured by 3D QPI are robustly recognized by the trained ANN, remarkably reducing the sample quantity required for identification. Recent studies leveraged ANNs to extract clinically relevant or biologically important information from QPI measurements^{26,29,30,31,32,33,34,35,36,37,38,39,40,41}. Despite these encouraging results, the capability of 3D QPI and ANN has not been assessed in diagnostic microbiology over a wide variety of species thus far.

We believe that this framework consisting of 3D QPI and ANN can effectively refine the initial antibiotic treatment. The accuracy of species identification using our framework is comparable to that of MALDI-TOF MS²⁵, even though the quantity of bacteria involved in the two approaches are single to several cells and over 10⁵ colony-forming units, respectively⁴². In addition, the risk of misidentification based on single tomograms can be strategically suppressed at the cost of specificity. Our framework also shows high single-measurement performance in distinguishing between subgroups of bacteria such as Gram-positive and negative groups. Furthermore, it achieves a nearly perfect identification within the 19 species using only seven tomograms of the bacteria, suggesting that accuracy higher than the single-measurement baseline is viable depending on the situation. Finally, we stress that our framework can be implemented along with the routine microbial identification, including MALDI-TOF MS. That is, the noninvasive property of 3D QPI allows our framework to be added to the existing identification routine without exhausting the initially obtained sample.

Future studies on sample processing will propel our framework towards a more immediate use. In practice, the enrichment of bacteria will be required for 3D QPI measurement when the ratio of bacteria in the given material is extremely small. The concentration of bacteria present in a urine sample is high, and thus the present method can be readily applicable in diagnosing urinary tract infection. On the other hand, bacteria may be scarce in blood samples as well as surrounded by a great number of blood cells. Lysis centrifugation is the common approach to enrich the bacteria from a positive blood culture⁴³. However, our sensitive framework can operate before the time-consuming blood culture, if high-throughput sample processing is introduced. A prominent and practical technique is the selective collection of particles utilizing advanced fluidic systems^44,45,46, which has successfully demonstrated enrichment of bacteria in laboratory^47,48.

In addition, validations on a larger diversity of pathogens will expand the scope of application for our method. We expect the proposed framework to be applicable to pathogens causing other classes of infections, such as urinary tract infections and lower respiratory infections, which are partially covered in this study. Moreover, achieving to screen antibiotics-resistant strains will be a crucial step in introducing this framework as a diagnostic routine. It is yet to be assessed whether this framework can distinguish resistant strains, while the need to screen out resistant strains has been highlighted over time^6,49,50. From a practical point of view, studying and improving ANN’s capability to tolerate the physiological difference is also required to further generalize our method. Although we cultured each species with a fixed protocol and a single type of growth media in this study, each species of bacteria can be cultured or found in various environments. An extreme case would be applying our framework on dead bacterial cells; while our database was collected with live and active bacteria, dead bacterial cells in clinical samples may serve as diagnostic evidence.

Further reducing the cost will encourage extensive studies based on our framework. Even though our framework does not entail an expense as large as MALDI-TOF MS, common hardware implementations of 3D QPI still involve advanced components including a coherent light source, a beam steering device, two microscopic objective lenses, and an imaging sensor with a high space-bandwidth product. Recent studies including Fourier ptychographic tomography⁵¹ or reference-free intensity-based tomography⁵², have achieved 3D QPI using relatively low-cost and simple optical systems. Despite the differences in the reconstruction process and imaging resolution, these techniques provide sufficient imaging quality for our framework.

The present bacterial species identification framework based on 3D QPI and ANN can also be combined with recently developed techniques of artificial intelligence for image processing, leading to various synergistic studies. For example, an automatic segmentation algorithm³⁴ may enable the species identification from densely distributed bacterial samples, such as biofilms⁵³ or colonies⁵⁴. Inference of molecular- or chemical-specific information^31,32,33,55 can also be exploited for correlative label-free analysis at single-cell or subcellular levels.

Lastly, we expect that the proposed framework will benefit from recent and future advances elucidating the working principle of ANNs. Investigations on ANN architectures have improved the performance of ANNs and expanded the applicability of ANNs over recent years, along with the rapid growth in the hardware capacity. On the other hand, techniques including Bayesian deep learning⁵⁶ have contributed to enhancing the interpretability, as well as offering a guideline for effective optimization. Fostering interpretability will render the proposed method more approachable for the medical industry.

Materials and methods

Preparation of bacteria

The bacterial samples were cultured in vitro from frozen glycerol stocks. The frozen stock of each species was stored at −80 °C and thawed at room temperature (25 °C) before use. After thawing, the stock was inoculated into a liquid medium and stabilized for over an hour in a shaking incubator at 35 °C. The stabilized bacteria were seeded in an agar plate containing a suitable medium. The agar plates were incubated at 35 °C for 12−24 h until colony formation was visible. A liquid subculture seeded from the agar plate was incubated at 35 °C for over 8 h in a shaking incubator. The subculture solution was diluted with a liquid medium to a concentration suitable for imaging, then sandwiched between cover glasses. Each species was inoculated in one of the following media: nutrient agar, brain heart infusion agar, tryptic soy agar, and chocolate agar. The glycerol stock or subculture was grown in nutrient broth, brain heart infusion broth, tryptic soy broth, or Giolitti-Cantoni broth.

The specimens were measured alive with no fixation nor any other chemical process; the sample can be immediately measured in the absence of a trained biologist and this is one of the main advantages of this method. A sample slide was prepared by simply sandwiching the solution of bacteria between two cover glasses, after diluting into a concentration suitable for imaging. Before optical measurement, we reduced the turbulent motion in the sample-loaded slides by placing them still on the sample stage for 5–10 min. All of the measurement was carried out within the time window of 8−24 h after inoculating the subculture in order to secure a database of active and live bacteria.

3D QPI measurement

We measured each 3D RI tomogram utilizing the 3D QPI as briefly introduced in the Results section. The DMD located on the sample illumination path can alter the illumination angle, by serving as a controllable binary grating^57,58. Using the DMD, a sinogram of 2D QPI measurements was obtained for each sample by scanning the illumination angle (Fig. 1b). The sinogram covered a total of 49 illumination angles, including a normal angle and 48 oblique angles equally spaced in the azimuthal direction. The 3D RI tomogram was reconstructed from the sinogram under the principle of optical diffraction tomography, which inversely solves the Helmholtz equation^23,59, then went through an iterative regularization to mitigate the missing cone problem⁶⁰ (Fig. 1c). The detailed procedure for the field retrieval and tomographic reconstruction can be found elsewhere^59,61.

A continuous-wave laser with a wavelength of 532 nm served as the light source. Two water-immersion objective lenses with 1.2 numerical aperture magnified and de-magnified the light, whereas the polar angle of the oblique illumination was equivalent to a numerical aperture of 0.9. The theoretical resolution of the tomograms was 110 nm in the horizontal direction and 330 nm in the vertical direction, considering the spatial frequency range of the imaging system⁶². The measurement of an entire sinogram required ~0.4 s, which was mainly limited by the camera frame rate.

Each tomogram was cropped into a field of view of 12.8 × 12.8 × 12.8 μm, and sampled at a voxel resolution of 100 × 100 × 200 nm. As a result, each tomogram contained a single bacterium or several bacteria adhering to each other, which considerably depended on the species-related physiology. For instance, specimens of the genus Streptococcus were commonly found in chains of multiple bacteria due to their nature.

A manual inspection and curation of tomograms ensured the quality of the database. The quality criteria reflected in this process included the noise level, motion artifact, and location of the specimens. Noisy tomograms, which mostly originated from objects in the oblique illumination path, were removed. Tomograms displaying motion artifacts were also excluded, as turbulent motion faster than the image acquisition rate causes distinctly blurred boundaries. The tomograms were shifted and cropped to place at least one bacterial cell in the central region of the tomogram.

ANN and optimization

The structure of the ANN in our framework was inspired by a design that outperformed most of the other designs in the benchmark tasks of 2D image analysis²⁴. This structure ensures that the feature maps in hidden layers of various depths and scales are utilized for image recognition, by concatenations of the feature maps (Fig. 2a). The elementary units composing our ANN are dense blocks. Each dense block repeats two 3D convolution operations followed by a concatenation (Fig. 2b). The feature maps are re-scaled between two adjacent dense blocks through a transition unit (Fig. 2c). Our ANN included four dense blocks containing 12, 24, 64, and 64 convolution operations, respectively. The number of feature maps after the initial convolution is set to 64, while the number of the feature maps increases by 32 through every convolution operation.

The ANN was optimized to classify the 3D RI tomograms, by minimization of the cross-entropy loss between the ground truth and the prediction. For each species, 40 tomograms were randomly chosen as the blind test dataset and another 40 tomograms were randomly chosen as the validation dataset. The remaining tomograms composed the training dataset, which was directly reflected in the loss minimization process. The loss that occurred in the training dataset was reduced using the stochastic gradient descent algorithm, at a mini-batch size of 48. The step size of the stochastic gradient descent algorithm was scheduled according to the cosine annealing method at an initial step size of 0.001 and a period of 64 epochs⁶³. During training, data augmentation took place for each tomogram, once every epoch, to prevent overfitting of the trained model. The augmentation included random processes of a horizontal crop, horizontal rotation, and Gaussian noise. During the blind test, each input tomogram was horizontally cropped around the center to provide an identical dimension. These processes resulted in an input tomogram with a field of view of 9.6 × 9.6 × 12.8 μm to be fed into the ANN. The ANN and the optimization were implemented using PyTorch 1.0.0.

The ANN was trained for ~290 h to obtain the models involved in our results. Two runs of training the ANN from scratch were carried out for ~1000 epochs each. Each training epoch required 504.3 ± 8.3 s in a server equipped with eight graphics processing units (GPUs) of GeForce GTX 1080 Ti and a central processing unit of Xeon E5–2600. The time required to infer a tomogram to a trained ANN model was 28.9 ± 2.9 ms.

Training the ANN with the identical setting can also run on a personal desktop computer, although we utilized an 8-GPU server for training at a higher rate. For instance, a single device of GeForce GTX 1080 Ti is sufficient for training the ANN under our setting, which requires 11,181 MB of graphics memory. When utilizing only a single device of GeForce GTX 1080ti in our server, each training epoch required 516.0 ± 9.6 s. In principle, an ANN of the identical design can be trained with only 1161 MB of graphics memory, by reducing the mini-batch size to 1. However, this minimal setting accompanies 3770.5 ± 67.4 s of duration for a single epoch of training, and altering the mini-batch size may cause the parameters to follow a different path of optimization. For inference using a trained ANN model, 945 MB of graphics memory are sufficient.

The final classifier for the blind test involved the predictions of multiple best-performing ANN models. The models with the highest accuracies for the training and validation datasets were chosen and integrated, to exploit a wider variety of features and prevent model-by-model variance. In search of the optimal strategy for choosing and integrating multiple models, four relevant parameters were explored. These parameters included the number of integrated models, weighting between the accuracies for the training and validation dataset, whether or not to normalize the output, and the method to integrate the predictions by the chosen models. Four options were considered as the method to integrate the predictions: taking the average, taking the exponential average, voting, and taking the maximum projection of the output. The combination of the parameters, which yielded the highest validation accuracy established the algorithm for the blind test.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Hessling, M., Feiertag, J. & Hoenes, K. Pathogens provoking most deaths worldwide. Biosci. Biotechnol. Res. Commun. 10, 1–7 (2017).
Google Scholar
Torio, C. M. & Moore, B. J. National inpatient hospital costs: the most expensive conditions by payer, 2013. In: Healthcare Cost and Utilization Project (HCUP) Statistical Briefs [Internet]. Statistical Brief# 204 (Agency for Healthcare Research and Quality (US), 2016).
Liu, V. X. et al. The timing of early antibiotics and hospital mortality in sepsis. Am. J. Resp. Crit. Care Med. 196, 856–863 (2017).
Article Google Scholar
Moehring, R. W. et al. Delays in appropriate antibiotic therapy for Gram-negative bloodstream infections: a multicenter, community hospital study. PLoS ONE 8, e76225 (2013).
Article ADS Google Scholar
García, M. S. Early antibiotic treatment failure. Int. J. Antimicrobial Agents 34, S14–S19 (2009).
Article Google Scholar
Hutchings, M. I., Truman, A. W. & Wilkinson, B. Antibiotics: past, present and future. Curr. Opin. Microbiol. 51, 72–80 (2019).
Article Google Scholar
Paul, M. et al. Systematic review and meta-analysis of the efficacy of appropriate empiric antibiotic therapy for sepsis. Antimicrobial Agents Chemother. 54, 4851–4863 (2010).
Article Google Scholar
Bizzini, A. & Greub, G. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry, a revolution in clinical microbial identification. Clin. Microbiol. Infect. 16, 1614–1619 (2010).
Article Google Scholar
Seng, P. et al. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin. Infect. Dis. 49, 543–551 (2009).
Article Google Scholar
Francisco, D. E., Mah, R. A. & Rabin, A. C. Acridine orange-epifluorescence technique for counting bacteria in natural waters. Trans. Am. Microsc. Soc. 92, 416–421 (1973).
Article Google Scholar
Müller, V. et al. Identification of pathogenic bacteria in complex samples using a smartphone based fluorescence microscope. RSC Adv. 8, 36493–36502 (2018).
Article ADS Google Scholar
Amann, R., Fuchs, B. M. & Behrens, S. The identification of microorganisms by fluorescence in situ hybridisation. Curr. Opin. Biotechnol. 12, 231–236 (2001).
Article Google Scholar
Patiño, S. et al. Autofluorescence of mycobacteria as a tool for detection of Mycobacterium tuberculosis. J. Clin. Microbiol. 46, 3296–3302 (2008).
Article Google Scholar
Bhattacharjee, A., Datta, R., Gratton, E. & Hochbaum, A. I. Metabolic fingerprinting of bacteria by fluorescence lifetime imaging microscopy. Sci. Rep. 7, 1–10 (2017).
Article Google Scholar
Park, Y., Depeursinge, C. & Popescu, G. Quantitative phase imaging in biomedicine. Nat. Photon. 12, 578–589 (2018).
Article ADS Google Scholar
Mir, M. et al. Optical measurement of cycle-dependent cell growth. Proc. Natl Acad. Sci. USA 108, 13124–13129 (2011).
Article ADS Google Scholar
Ahn, J. H. et al. Enhanced succinic acid production by Mannheimia employing optimal malate dehydrogenase. Nat. Commun. 11, 1–12 (2020).
Article ADS Google Scholar
Kemper, B. et al. Towards 3D modelling and imaging of infection scenarios at the single cell level using holographic optical tweezers and digital holographic microscopy. J. Biophoton. 6, 260–266 (2013).
Article Google Scholar
Oh, J. et al. Three-dimensional label-free observation of individual bacteria upon antibiotic treatment using optical diffraction tomography. Biomed. Opt. Express 11, 1257–1267 (2020).
Article Google Scholar
Opota, O., Croxatto, A., Prod’hom, G. & Greub, G. Blood culture-based diagnosis of bacteraemia: state of the art. Clin. Microbiol. Infect. 21, 313–322 (2015).
Article Google Scholar
Bearman, G. M. & Wenzel, R. P. Bacteremias: a leading cause of death. Arch. Med. Res. 36, 646–659 (2005).
Article Google Scholar
Lee, C.-C. et al. Beneficial effects of early empirical administration of appropriate antimicrobials on survival and defervescence in adults with community-onset bacteremia. Crit. Care 23, 1–12 (2019).
Article Google Scholar
Wolf, E. Three-dimensional structure determination of semi-transparent objects from holographic data. Opt. Commun. 1, 153–156 (1969).
Article ADS Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely Connected Convolutional Networks. Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. 2261–2269 (2017).
Drancourt, M. Detection of microorganisms in blood specimens using matrix-assisted laser desorption ionization time-of-flight mass spectrometry: a review. Clin. Microbiol. Infect. 16, 1620–1625 (2010).
Article Google Scholar
Yoon, J. et al. Identification of non-activated lymphocytes using three-dimensional refractive index tomography and machine learning. Sci. Rep. 7, 1–10 (2017).
Article ADS Google Scholar
Hoyer, J. et al. Proteomic response of Streptococcus pneumoniae to iron limitation. Int. J. Med. Microbiol. 308, 713–721 (2018).
Article Google Scholar
Pathak, A. et al. Factor H binding proteins protect division septa on encapsulated Streptococcus pneumoniae against complement C3b deposition and amplification. Nat. Commun. 9, 1–16 (2018).
Article Google Scholar
Jo, Y. et al. Quantitative phase imaging and artificial intelligence: a review. IEEE J. Sel. Top. Quantum Electron. 25, 1–14 (2018).
Article Google Scholar
Rivenson, Y., Wu, Y. & Ozcan, A. Deep learning in holography and coherent imaging. Light.: Sci. Appl. 8, 1–8 (2019).
Article Google Scholar
Rivenson, Y. et al. Virtual histological staining of unlabelled tissue-autofluorescence images via deep learning. Nat. Biomed. Eng. 3, 466–477 (2019).
Article Google Scholar
Kandel, M. E. et al. Phase imaging with computational specificity (PICS) for measuring dry mass changes in sub-cellular compartments. Nat. Commun. 11, 1–10 (2020).
Article Google Scholar
Jo, Y. et al. Label-free multiplexed microtomography of endogenous subcellular dynamics using generalizable deep learning. Nat. Cell Biol. 23, 1329–1337 (2021).
Article Google Scholar
Choi, J. et al. Label-free three-dimensional analyses of live cells with deep-learning-based segmentation exploiting refractive index distributions. Preprint at bioRxiv https://doi.org/10.1101/2021.05.23.445351 (2021).
Lee, M. et al. Deep-learning-based three-dimensional label-free tracking and analysis of immunological synapses of CAR-T cells. Elife 9, e49023 (2020).
Article Google Scholar
Kamilov, U. S. et al. Learning approach to optical tomography. Optica 2, 517–522 (2015).
Article ADS Google Scholar
Ryu, D. et al. Deep learning-based optical field screening for robust optical diffraction tomography. Sci. Rep. 9, 1–9 (2019).
Article Google Scholar
Ryu, D. et al. DeepRegularizer: rapid resolution enhancement of tomographic imaging using deep learning. IEEE Trans. Med. Imaging 40, 1508–1518 (2021).
Article Google Scholar
Chen, C. L. et al. Deep learning in label-free cell classification. Sci. Rep. 6, 1–16 (2016).
Google Scholar
Ryu, D. et al. Label-free white blood cell classification using refractive index tomography and deep learning. BME Front. 2021 (2021).
Jo, Y. et al. Holographic deep learning for rapid optical screening of anthrax spores. Sci. Adv. 3, e1700606 (2017).
Article ADS Google Scholar
Barreiro, J. R. et al. Non-culture-based identification of mastitis-causing bacteria by MALDI-TOF mass spectrometry. J. Dairy Sci. 100, 2928–2934 (2017).
Article Google Scholar
Kirn, T. & Weinstein, M. Update on blood cultures: how to obtain, process, report, and interpret. Clin. Microbiol. Infect. 19, 513–520 (2013).
Article Google Scholar
Lee, S. et al. Nanoelectrokinetic bufferchannel-less radial preconcentrator and online extractor by tunable ion depletion layer. Biomicrofluidics 13, 034113 (2019).
Article Google Scholar
Kuntaegowdanahalli, S. S., Bhagat, A. A. S., Kumar, G. & Papautsky, I. Inertial microfluidics for continuous particle separation in spiral microchannels. Lab Chip 9, 2973–2980 (2009).
Article Google Scholar
Lei, H., Zhang, Y. & Li, B. Particle separation in fluidic flow by optical fiber. Opt. Express 20, 1292–1300 (2012).
Article ADS Google Scholar
Jung, T., Jung, Y., Ahn, J. & Yang, S. Continuous, rapid concentration of foodborne bacteria (Staphylococcus aureus, Salmonella typhimurium, and Listeria monocytogenes) using magnetophoresis-based microfluidic device. Food Control 114, 107229 (2020).
Article Google Scholar
D’Amico, L., Ajami, N., Adachi, J., Gascoyne, P. & Petrosino, J. Isolation and concentration of bacteria from blood using microfluidic membraneless dialysis and dielectrophoresis. Lab Chip 17, 1340–1348 (2017).
Article Google Scholar
Shariati, A. et al. Global prevalence and distribution of vancomycin resistant, vancomycin intermediate and heterogeneously vancomycin intermediate Staphylococcus aureus clinical isolates: a systematic review and meta-analysis. Sci. Rep. 10, 1–16 (2020).
Article Google Scholar
Chamieh, A., El-Hajj, G., Zmerli, O., Afif, C. & Azar, E. Carbapenem resistant organisms: A 9-year surveillance and trends at Saint George University Medical Center. J. Infect. Public Health 13, 2101–2106 (2020).
Article Google Scholar
Horstmeyer, R., Chung, J., Ou, X., Zheng, G. & Yang, C. Diffraction tomography with Fourier ptychography. Optica 3, 827–835 (2016).
Article ADS Google Scholar
Baek, Y. & Park, Y. Intensity-based holographic imaging via space-domain Kramers–Kronig relations. Nat. Photon. 15, 354–360 (2021).
Article ADS Google Scholar
Berne, C., Ellison, C. K., Ducret, A. & Brun, Y. V. Bacterial adhesion at the single-cell level. Nat. Rev. Microbiol. 16, 616–627 (2018).
Article Google Scholar
Fenchel, T. Microbial behavior in a heterogeneous world. Science 296, 1068–1071 (2002).
Article ADS Google Scholar
Nygate, Y. N. et al. Holographic virtual staining of individual biological cells. Proc. Natl Acad. Sci. USA 117, 9223–9231 (2020).
Article ADS Google Scholar
Kendall, A. & Gal, Y. What uncertainties do we need in bayesian deep learning for computer vision? Adv. Neural Inf. Process. Syst. 30 (2017).
Shin, S., Kim, K., Yoon, J. & Park, Y. Active illumination using a digital micromirror device for quantitative phase imaging. Opt. Lett. 40, 5407–5410 (2015).
Article ADS Google Scholar
Lee, K., Kim, K., Kim, G., Shin, S. & Park, Y. Time-multiplexed structured illumination using a DMD for optical diffraction tomography. Opt. Lett. 42, 999–1002 (2017).
Article ADS Google Scholar
Kim, K. et al. High-resolution three-dimensional imaging of red blood cells parasitized by Plasmodium falciparum and in situ hemozoin crystals using optical diffraction tomography. J. Biomed. Opt. 19, 011005 (2013).
Article Google Scholar
Lim, J. et al. Comparative study of iterative reconstruction algorithms for missing cone problems in optical diffraction tomography. Opt. Express 23, 16933–16948 (2015).
Article ADS Google Scholar
Debnath, S. K. & Park, Y. Real-time quantitative phase imaging with a spatial phase-shifting algorithm. Opt. Lett. 36, 4677–4679 (2011).
Article ADS Google Scholar
Park, C., Shin, S. & Park, Y. Generalized quantification of three-dimensional resolution in optical diffraction tomography using the projection of maximal spatial bandwidths. J. Opt. Soc. Am. A 35, 1891–1898 (2018).
Article ADS Google Scholar
Loshchilov, I. & Hutter, F. Sgdr: Stochastic gradient descent with warm restarts. Preprint at https://arxiv.org/abs/1608.03983 (2016).

Download references

Acknowledgements

This work was supported by KAIST Up Program, BK21 + program, Tomocube, National Research Foundation of Korea (2015R1A3A2066550), KAIST Institute of Technology Value Creation, Industry Liaison Center (G-CORE Project) grant funded by the Ministry of Science and ICT (N11210014, N11220131), Institute of Information & communications Technology Planning & Evaluation (IITP; 2021-0-00745) grant funded by the Korea government (MSIT), and the Commercialization Promotion Agency for R&D Outcomes (COMPA; 055586) funded by the Korea government. Y.J. acknowledges support from KAIST Presidential Fellowship and Asan Foundation Biomedical Science Scholarship. The clinical isolates were obtained from Asian Bacterial Bank of the Asia Pacific Foundation for Infectious Diseases.

Author information

YoungJu Jo
Present address: Department of Applied Physics, Stanford University, Stanford, CA, 94305, USA
Jinyeop Song
Present address: Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA

Authors and Affiliations

Department of Physics, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
Geon Kim, Jinho Park, DongHun Ryu, YoungJu Jo, Jinyeop Song & YongKeun Park
KAIST Institute for Health Science and Technology, KAIST, Daejeon, 34141, Republic of Korea
Geon Kim, Jinho Park, DongHun Ryu, YoungJu Jo, Jinyeop Song & YongKeun Park
Tomocube Inc., Daejeon, 34109, Republic of Korea
Daewoong Ahn, YoungJu Jo, Gunho Choi, Hyun-seok Min & YongKeun Park
Smart Healthcare & Device Research Center, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, 06351, Republic of Korea
Minhee Kang
Graduate School of Nanoscience and Technology, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
Jea Sung Ryu & Hyun Jung Chung
Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
Hyun Jung Chung
Department of Emergency Medicine, Bundang CHA Hospital, Seongnam-si, Gyeonggi-Do, 13496, Korea
Kyuseok Kim
Division of Infectious Diseases, Department of Internal Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, 06351, Republic of Korea
Doo Ryeon Chung
Department of Laboratory Medicine, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, 06591, Republic of Korea
In Young Yoo
Department of Laboratory Medicine and Genetics, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, 06351, Republic of Korea
Hee Jae Huh & Nam Yong Lee

Authors

Geon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Daewoong Ahn
View author publications
You can also search for this author in PubMed Google Scholar
Minhee Kang
View author publications
You can also search for this author in PubMed Google Scholar
Jinho Park
View author publications
You can also search for this author in PubMed Google Scholar
DongHun Ryu
View author publications
You can also search for this author in PubMed Google Scholar
YoungJu Jo
View author publications
You can also search for this author in PubMed Google Scholar
Jinyeop Song
View author publications
You can also search for this author in PubMed Google Scholar
Jea Sung Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Gunho Choi
View author publications
You can also search for this author in PubMed Google Scholar
Hyun Jung Chung
View author publications
You can also search for this author in PubMed Google Scholar
Kyuseok Kim
View author publications
You can also search for this author in PubMed Google Scholar
Doo Ryeon Chung
View author publications
You can also search for this author in PubMed Google Scholar
In Young Yoo
View author publications
You can also search for this author in PubMed Google Scholar
Hee Jae Huh
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-seok Min
View author publications
You can also search for this author in PubMed Google Scholar
Nam Yong Lee
View author publications
You can also search for this author in PubMed Google Scholar
YongKeun Park
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.K., Y.J., and Y.P. conceived and designed the research. G.K., M.K., J.P., J.S., and J.S.R. conducted the experiments. G.K., D.A., G.C., and H.M. analyzed the data. G.K., D.R., Y.J., H.J.C., K.K., D.R.C., I.Y.Y, H.J.H., H.M., N.Y.L., and Y.P. prepared the manuscript. All authors read and discussed the results.

Corresponding authors

Correspondence to Nam Yong Lee or YongKeun Park.

Ethics declarations

Conflict of interest

D.A., D.R.,G.C., H.M., and Y.P. have financial interests in Tomocube Inc., a company that commercializes optical diffraction tomography and quantitative phase-imaging instruments and is one of the sponsors of the work.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, G., Ahn, D., Kang, M. et al. Rapid species identification of pathogenic bacteria from a minute quantity exploiting three-dimensional quantitative phase imaging and artificial neural network. Light Sci Appl 11, 190 (2022). https://doi.org/10.1038/s41377-022-00881-x

Download citation

Received: 26 November 2021
Revised: 03 June 2022
Accepted: 09 June 2022
Published: 23 June 2022
DOI: https://doi.org/10.1038/s41377-022-00881-x

This article is cited by

Single-shot quantitative phase-fluorescence imaging using cross-grating wavefront microscopy
- Baptiste Marthy
- Maëlle Bénéfice
- Guillaume Baffou
Scientific Reports (2024)
On the use of deep learning for phase recovery
- Kaiqiang Wang
- Li Song
- Edmund Y. Lam
Light: Science & Applications (2024)
Rapid and stain-free quantification of viral plaque via lens-free holography and deep learning
- Tairan Liu
- Yuzhu Li
- Aydogan Ozcan
Nature Biomedical Engineering (2023)
Three-dimensional label-free morphology of CD8 + T cells as a sepsis biomarker
- MinDong Sung
- Jong Hyun Kim
- Yu Rang Park
Light: Science & Applications (2023)
Artificial intelligence-enabled quantitative phase imaging methods for life sciences
- Juyeon Park
- Bijie Bai
- YongKeun Park
Nature Methods (2023)