Leveraging machine learning to distinguish between bacterial and viral induced pharyngitis using hematological markers: a retrospective cohort study

Jin, Zhe; Ma, Fengmei; Chen, Haoyang; Guo, Shufan

doi:10.1038/s41598-023-49925-1

Download PDF

Article
Open access
Published: 21 December 2023

Leveraging machine learning to distinguish between bacterial and viral induced pharyngitis using hematological markers: a retrospective cohort study

Zhe Jin¹,
Fengmei Ma²,
Haoyang Chen³ &
…
Shufan Guo²

Scientific Reports volume 13, Article number: 22899 (2023) Cite this article

479 Accesses
Metrics details

Subjects

Abstract

Accurate differentiation between bacterial and viral-induced pharyngitis is recognized as essential for personalized treatment and judicious antibiotic use. From a cohort of 693 patients with pharyngitis, data from 197 individuals clearly diagnosed with bacterial or viral infections were meticulously analyzed in this study. By integrating detailed hematological insights with several machine learning algorithms, including Random Forest, Neural Networks, Decision Trees, Support Vector Machine, Naive Bayes, and Lasso Regression, for potential biomarkers were identified, with an emphasis being placed on the diagnostic significance of the Monocyte-to-Lymphocyte Ratio. Distinct inflammatory signatures associated with bacterial infections were spotlighted in this study. An innovation introduced in this research was the adaptation of the high-accuracy Lasso Regression model for the TI-84 calculator, with an AUC (95% CI) of 0.94 (0.925–0.955) being achieved. Using this adaptation, pivotal laboratory parameters can be input on-the-spot and infection probabilities can be computed subsequently. This methodology embodies an improvement in diagnostics, facilitating more effective distinction between bacterial and viral infections while fostering judicious antibiotic use.

An autoantibody signature predictive for multiple sclerosis

Article 19 April 2024

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

The 5th edition of the World Health Organization Classification of Haematolymphoid Tumours: Myeloid and Histiocytic/Dendritic Neoplasms

Article Open access 22 June 2022

Introduction

Pharyngitis, defined by pharyngeal inflammation, is a predominant concern within the department of otorhinolaryngology, affecting a vast number of individuals annually¹. Plenty of infectious agents can induce pharyngitis; however, bacterial sources, especially Group A β-hemolytic streptococcus (GABHS) is most prevalent². Concurrently, viral infections continue to present significant clinical complexities³. Accurately differentiating between bacterial and viral pharyngitis is critical not only for precise therapeutic strategies but also to curb the overuse of antibiotics, a trend exacerbating the global rise of antibiotic-resistant organisms^4,5.

In the past decade, the potential of complete blood count (CBC) parameters such as Neutrophil-to-Lymphocyte ratio (NLR) and Monocyte-to-Lymphocyte ratio (MLR) has been extensively explored as biomarkers for early diagnosis in cancers^6,7,8,9. These markers have also shown promise in differentiating between bacterial and viral infections, providing a non-invasive, cost-effective approach to aid in clinical decision-making^10,11. Nonetheless, the differentiation between viral and bacterial pharyngitis still poses a significant challenge, calling for more advanced and precise diagnostic tools^12,13.

Recently, machine learning (ML) has gained traction in healthcare for its potential to revolutionize diagnosis and treatment^14,15. ML models, capable of learning from large datasets and identifying complex patterns, have shown promise in infections. Despite these advances, there is still a paucity of research exploring the utility of machine learning in differentiating between bacterial and viral pharyngitis specifically^16,17.

Therefore, this study aims to develop and validate an ML model specifically tailored to distinguish between viral and bacterial pharyngitis, improving diagnosis accuracy¹⁸ and promoting more responsible antibiotic stewardship^19,20.

Methods

Study design

This retrospective cohort study included adult patients with pharyngitis caused by different infection types. An evaluation of the diagnostic accuracy of bacterial and viral pharyngitis across various demographic groups was conducted based on a retrospective study design. A comprehensive clinical examination has been undertaken for each participant, considering their medical history and current symptoms.

Patients’ recruitment

The patients with pharyngitis were enrolled in the study through a systematic recruitment process that aimed to ensure the inclusion of eligible participants with complete and relevant data. The recruitment process followed several steps to identify and select suitable candidates:

1.
Patient Identification: Potential participants with symptoms of pharyngitis were identified from the patient population attending the Department of Otorhinolaryngology at Hebei Provincial Hospital of Traditional Chinese Medicine, between 2019 and 2023.
2.
Screening for patients with pharyngitis: This screening involved a review of their medical history and a clinical examination.
3.
Inclusion Criteria: To be included in the study, patients had to meet the following criteria:
1. 3.1.
  Confirmed diagnosis of pharyngitis based on clinical evaluation.
2. 3.2.
  Absence of severe medical conditions or comorbidities that could confound the analysis.
4.
Exclusion Criteria: Patients with the following characteristics were excluded from the study:
1. 4.1.
  Absence of essential demographic details or incomplete data pertaining to complete cell count metrics.
2. 4.2.
  Age below 18 years.
3. 4.3.
  Patients without a definitive diagnosis of infection type.

Independent variables

Several parameters, including basic demographic information, complete cell count, and novel parameters such as the NLR, platelet-to-lymphocyte ratio (PLR), monocyte-to-lymphocyte ratio (MLR), and mean platelet volume-to-lymphocyte ratio (MPVLR), were assessed to provide a comprehensive picture. These novel parameters have been log-transformed prior to analysis to manage skewness, stabilize variance, lessen the influence of outliers, and convert multiplicative relationships into more interpretable additive ones, enhancing the robustness and validity of our statistical tests.

Statistical analysis

The current investigation employed a dataset comprising diverse clinical metrics, indicative of either bacterial or viral infections. To ensure a balanced comparison between the different infection types, a 1:1 propensity score matching (PSM) was utilized. Following this matching, the dataset was randomly partitioned into a training cohort (75%) and a validation cohort (25%). Continuous variables were evaluated using two-sample t-tests, while categorical variables were assessed through chi-squared tests. The threshold for statistical significance was set at p = 0.05. Analytical computations were conducted using R (version 3.6.3) and Python (version 3.7).

Machine learning analysis

A suite of machine learning algorithms was applied to selected clinical parameters to develop predictive models. The algorithms employed included Lasso Regression, Decision Trees, Random Forest, Support Vector Machines (SVM), Neural Networks (NN), and Naive Bayes (NB). The performance of these algorithms was evaluated using metrics such as accuracy, precision, recall, F1-score, and the area under the Receiver Operating Characteristic (AUC) curve. Furthermore, the importance of each feature was assessed across various models to ascertain their contribution to the predictive power of the models.

Visual representations, including ROC curves, were crafted for each model to enable a comparative evaluation of their performances between training and validation cohorts. Additionally, violin plots illustrated the distribution of clinical metrics across the two infection types.

Model deployment

The Lasso regression model was encapsulated into a TI-84 calculator via a custom script, engineered for rapid input of laboratory parameters. Upon input, an output delineating the infection type probability was generated. The model's performance was stringently evaluated and validated using our designated validation cohort.

Ethical compliance

This study is approved by the Medical Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine, the register num is HBZY2023-KY-012-01. A waiver for the requirement of informed consent has been granted by the Medical Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine due to its retrospective nature. Strict adherence to the ethical guidelines related to human subjects in research was maintained in our study. all their privacy and confidentiality were upheld throughout the study.

Results

A total of 693 patients diagnosed with pharyngitis were initially identified. Following rigorous adherence to predefined inclusion and exclusion criteria, a cohort of 197 eligible patients was delineated. This cohort included 74 individuals diagnosed with viral infections and 123 with bacterial infections, as depicted in Fig. 1. These participants were then methodically allocated into two primary cohorts for further analysis: the training cohort, consisting of 55 individuals with bacterial infections and 56 with viral infections, and the validation cohort, comprising 19 individuals with bacterial infections and 18 with viral infections. This stratification provided a structured framework for the comparative analysis of viral and bacterial pharyngitis cases, thereby facilitating the subsequent development and validation of machine learning models.

Demographic attributes such as sex, age, and clinical status (outpatient or inpatient) were recorded. An approximately balanced distribution of males and females was observed across both cohorts, aligning with the demographic findings in related literature²¹. Age ranged from 18 to 85 years, with the most represented age group being 18–34 years. Majority of the patients were outpatients, with no significant difference in distribution between the two infection types (Table 1).

Table 1 Comparative analysis of patient characteristics and hematological indices between patients with viral and bacterial induced pharyngitis.

Full size table

Outcome measures focused on hematological indices. The observed variations included higher Red blood cell (RBC) count and Hemoglobin concentration (HGB) levels in patients with viral infections (p = 0.003 and p = 0.006, respectively) and elevated White Blood Cell (WBC) count, Neutrophil count (NEU) count, and Monocyte count (MONO) count in patients with bacterial infections (p < 0.01 for each parameter). Significant differences were noted for other parameters such as Percentage of Neutrophils (NEUp), Lymphocyte (LYM) count, Percentage of Lymphocytes (LYMp), log-transformed Monocyte to Lymphocyte ratio (logMLR), log-transformed Neutrophil to Lymphocyte ratio (logNLR), and log-transformed Platelet to Lymphocyte ratio (logPLR) between the two infection groups (Table 1).

A comparative analysis revealed significant differences in several hematological indices between the viral and bacterial infection groups in both the training and validation cohorts. Notably, in the training cohort, there were significant variations regarding HGB, WBC, NEU, NEUp, LYM, LYMp, logMLR, logNLR, logPLR, and log-transformed Platelet Volume to Lymphocyte ratio (logMPVLR) (all p < 0.05). Meanwhile, the validation cohort displayed significant differences for NEU, NEUp, LYMp, logMLR, and logNLR (all p < 0.05) (Table 2). These findings echo the inherent diagnostic challenges associated with pharyngitis, where overlapping symptoms between bacterial, primarily caused by Group A β-hemolytic streptococcus, and viral pharyngitis often complicate accurate diagnosis²². Although blood tests have been instrumental in aiding the diagnosis of acute viral and bacterial infections, their efficacy is sometimes hindered by their inability to capture the evolving inflammatory response post-symptom onset²³.

Table 2 Hematological parameters in bacterial and viral infections in training and validation cohorts.

Full size table

The violin plots demonstrated distinct trends in hematological and inflammatory parameters between bacterial and viral infections. Parameters such as WBC and MONO had overlapping distributions, while NEU, NEUp, logMLR, logNLR, and logPLR were predominantly higher in bacterial infections. LYM and LYMp leaned more towards viral infections (Fig. 2). It was observed that parameters like WBC count and MONO count exhibited overlapping distributions, hinting at a common inflammatory response irrespective of the infection type. Conversely, parameters such as NEU, NEUp, logMLR, logNLR, and logPLR demonstrated elevated levels predominantly in bacterial infections.

The employment of machine learning methodologies was directed at determining their predictive efficacy on the dataset. Performance metrics for both the training and validation cohorts, including accuracy, precision, recall, F1-score, and the AUC were computed and are displayed in Tables 3 and 4. The Random Forest notably exhibited superior performance in terms of accuracy and AUC in both cohorts, aligning with findings from past studies on bacterial and viral infections²⁴. Meanwhile, ROC curves for Lasso Regression and SVM models suggested a high degree of accuracy in infection type classification. During cross-validation on the training set, the optimized Lasso Regression model attained an AUC score of approximately 0.90. The model's robustness and generalizability were confirmed through its performance on a separate validation set, where it achieved an AUC score of approximately 0.94, demonstrating its ability to effectively distinguish between viral and bacterial infections (Fig. 3).

Table 3 Performance metrics of machine learning models on the training cohort.

Full size table

Table 4 Performance metrics of machine learning models on the validation cohort.

Full size table

The feature importance of each variable was evaluated across different machine learning models, revealing NEUp, logMLR, and logPLR to be the significant. The highest importance scores in the Lasso Regression model were found for NEUp (2.0110), logMPVLR (1.0451), and logPLR (0.6210). In the Decision Tree model, a high importance score was assigned to the NEUp variable (0.5127). Notably, the Random Forest model showed elevated scores for NEUp (0.3024) and LYMp (0.1349). The SVM model indicated WBC (0.0528) and NEU (0.0722) as most important, while in the Neural Network model, logMPVLR (0.1306) had the highest score. In the Naive Bayes model, the WBC variable scored slightly higher (0.0556), underscoring their potential utility in diagnostic algorithms (Table 5).

Table 5 Feature importance across multiple machine learning models in differentiating bacterial and viral infections.

Full size table

Following deployment, the Lasso Regression model exhibited substantial adeptness in differentiating between bacterial and viral infections. By simply inputting the selected laboratory parameters into the TI-84 calculator, healthcare professionals could expeditiously generate infection probability outcomes (Fig. 4). The model was stringently assessed. The validation cohort in our study, included data from 37 patients (19 bacterial, 18 viral infections). The consistent and effective performance emphasizes the model's robustness and reliability.

Discussion

This study has highlighted hematological disparities between bacterial and viral infections, shedding light on the pronounced inflammatory response elicited predominantly by bacterial infections. The hematological parameters, MLR, NLR, PLR, and MPVLR have been emphasized as notable biomarkers^25,26. In line with established literature, viral infections are typically characterized by augmented RBC counts and HGB levels, while bacterial infections are more likely to display heightened WBC, NEU, and MONO counts²⁷ (Table 1).

The distinct variations in key hematological parameters such as NEU, NEUp, LYMp, logMLR, and logNLR underscore the differential immunological responses between bacterial and viral infections (Table 2). It is well-established that neutrophils are the primary leukocytes engaged in immune responses during bacterial infections, while lymphocyte-mediated immune responses are predominantly observed during viral infections²⁸. The substantial feature importance score of logMLR across various machine learning models accentuates its critical role as a distinguishing hematological parameter, potentially aiding in the enhanced diagnostic differentiation between bacterial and viral infections in our study. Nevertheless, the diagnostic quandaries stemming from the overlapping distributions of WBC and NEU, as illustrated in Fig. 2, emphasize the imperative for a broader diagnostic strategy, transcending the reliance on singular markers²⁹.

From a computational viewpoint, the Random Forest emerged as the most proficient predictor for classifying infection types, albeit with Neural Networks showing close prowess³⁰. Conversely, SVM and Naive Bayes showcased diverse performances, underscoring the imperative nature of meticulous model selection tailored to specific data characteristics³¹ (Tables 3 and 4). Both Lasso Regression and Random Forest were proficient in differentiating bacterial from viral infections (Fig. 3).

In this study, Lasso Regression was utilized to create a diagnostic model for classifying infection types. The choice of Lasso Regression was predicated on its unique characteristics, which encompass both variable selection and regularization functionalities³². This makes it particularly suitable for this type of problem. Although more complex machine learning methodologies are available, Lasso Regression establishes a balance between model intricacy and interpretability³³. This equilibrium is essential in clinical environments where elucidating the relationship between predictors and outcomes is as vital as achieving prediction accuracy^34,35.

A noteworthy innovation of this work is the successful amalgamation of the Lasso model with a widely accessible computational tool, the TI-84 calculator. Although both Random Forest and Lasso Regression exhibited commendable performance in our analysis, the computational parsimony of Lasso Regression rendered it a more pragmatic choice for the TI-84 calculator, known for its ease of use, cost-effectiveness, and ubiquitous availability. This integration facilitated the rapid and efficient input of pivotal laboratory parameters including NEU, MONO, NEUp, LYM, LYMp, PLT, and MPV, consequently generating the probability of infection type. By merely inputting the selected laboratory parameters into the calculator, healthcare professionals could promptly ascertain the likelihood of bacterial or viral infections (Fig. 4). No commercial reagents or specific equipment were required in this methodology, promoting its cost-effectiveness and widespread accessibility.

Integral to this discussion is the concept of antibiotic stewardship. Given the emerging global challenge of antibiotic resistance, it is imperative to differentiate bacterial from viral infections to ensure judicious antibiotic use³⁶. These findings contribute significantly to antibiotic stewardship efforts by pinpointing potential biomarkers that might expedite accurate diagnosis, thereby minimizing unwarranted antibiotic prescriptions. Emphasizing the need for precise diagnosis and targeted therapies, this study underlines the importance of combining clinical, laboratory, and computational tools in the era of personalized medicine and antibiotic stewardship³⁷.

The prospect of amplifying diagnostic precision through the amalgamation of optimization algorithms with machine learning methodologies is indeed exhilarating. Esteemed optimization algorithms such as the refined Grey Wolf Optimizer (LGWO)³⁸, Hunger Games Search (HGS)³⁹, Shrimp and Goby Association Search algorithm (SGA)⁴⁰, Planet Optimization Algorithm (P.O.A.)⁴¹, and Runge Kutta optimizer (RUN)⁴² possess the potential to significantly enhance model efficacy. Although the current study did not delve into these optimization techniques, the future incorporation of such advanced optimization algorithms to refine the machine learning models utilized in this study is a significant direction we plan to pursue.

Limitations

This study acknowledges several limitations. The dataset, while comprehensive, encapsulates a specific patient population with unique characteristics that may influence the performance of the machine learning models. Potential confounding variables, including underlying health conditions and medication usage, were not rigorously controlled, possibly subtly impacting the outcomes. The generalizability of the findings may be contingent on the specific patient population from which the dataset was derived. Despite these considerations, the insights derived from this study are valuable, laying a groundwork for more exhaustive future investigations.

Conclusion

This study underscores the clinical necessity of accurately and swiftly distinguishing between bacterial and viral pharyngitis. By integrating traditional laboratory techniques with advanced machine learning, a new dimension to the diagnostic potential of hematological markers such as MLR was explored. The notable efficacy of the Random Forest and Lasso Regression in data prediction for this specific dataset suggests that exploring various machine learning techniques could hold promise for further diagnostic advancements.

The adaptation of a Lasso Regression model for use in a TI-84 calculator showcased a practical application of machine learning in clinical settings, enhancing accessibility and ease of use compared to traditional nomograms. These findings illuminate hematological distinctions between viral and bacterial infections in adult patients with pharyngitis, offering MLR as a potential addition to diagnostic methodologies. This not only has the potential to enhance diagnostic accuracy but also to refine therapeutic interventions.

It would be beneficial to extend the application of this model to other types of infections, and to integrate more variables and machine learning techniques, thereby enhancing its utility in infectious disease diagnosis. The results from this study mark a step towards more precise and timely diagnosis of pharyngitis, contributing to better management and treatment of this common condition.

Data availability

In adherence to privacy regulations and to ensure the confidentiality of sensitive patient information, the datasets utilized in this study are not publicly available. The research project and access to health databases are overseen by the Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine, which has set guidelines recommending against public dissemination of these datasets. Should researchers wish to request access to the data for academic purposes, they may contact the Corresponding Author who will facilitate the request in accordance with the guidelines set forth by the Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine.

Abbreviations

GABHS:: Group A β-hemolytic streptococcus
CBC:: Complete blood count
RBC:: Red blood cell count
HGB:: Hemoglobin concentration
WBC:: White blood cell count
NEU:: Neutrophil count
NEUp:: Percentage of neutrophils
MONO:: Monocyte count
MONOp:: Percentage of monocytes
LYM:: Lymphocyte count
LYMp:: Percentage of lymphocytes
PLT:: Platelet count
MPV:: Mean platelet volume
ML:: Machine learning
MLR:: Monocyte to lymphocyte ratio
NLR:: Neutrophil to lymphocyte ratio
PLR:: Platelet to lymphocyte ratio
MPVLR:: Mean platelet volume to lymphocyte ratio
IQR:: Interquartile range
SD:: Standard deviation
AUC:: Area under the curve
CI:: Confidence interval
LR:: Lasso regression
DT:: Decision trees
SVM:: Support vector machine
NN:: Neural networks
NB:: Naive bayes

References

Berkley, J. Management of pharyngitis. Circulation 138, 1920–1922. https://doi.org/10.1161/circulationaha.118.035900 (2018).
Article PubMed Google Scholar
Luo, R. et al. Diagnosis and management of group a streptococcal pharyngitis in the United States, 2011–2015. BMC Infect. Dis. 19, 193. https://doi.org/10.1186/s12879-019-3835-4 (2019).
Article PubMed PubMed Central Google Scholar
Yildiz, I., Gonullu, E., Soysal, A., Oner, C. N. & Karabocuoglu, M. The epidemiology of influenza virus infection and group A streptococcal pharyngitis in children between 2011 and 2018 in an outpatient pediatric clinic. Cureus 15, e33492. https://doi.org/10.7759/cureus.33492 (2023).
Article PubMed PubMed Central Google Scholar
Badr, A. F., Humedi, R. A., Alfarsi, N. A. & Alghamdi, H. A. Rapid antigen detection test (RADT) for pharyngitis diagnosis in children: Public and pharmacist perception. Saudi Pharm. J. 29, 677–681. https://doi.org/10.1016/j.jsps.2021.04.029 (2021).
Article PubMed PubMed Central Google Scholar
Barbieri, E. et al. Antibiotic prescriptions in acute otitis media and pharyngitis in Italian pediatric outpatients. Ital. J. Pediatr. 45, 103. https://doi.org/10.1186/s13052-019-0696-9 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wang, K., Chen, Y., Nie, Z. & Wang, J. Neutrophil-to-lymphocyte ratio to estimate colorectal cancer liver metastasis: A commentary. Int. J. Surg. https://doi.org/10.1097/js9.0000000000000535 (2023).
Article PubMed PubMed Central Google Scholar
Heymann, W. R. The neutrophil-to-lymphocyte ratio in cutaneous oncology: Simply elegant. J. Am. Acad. Dermatol. 86, 533–534. https://doi.org/10.1016/j.jaad.2021.11.060 (2022).
Article PubMed Google Scholar
Zhou, D. et al. The prognostic value of neutrophil-to-lymphocyte ratio and monocyte-to-lymphocyte ratio in metastatic gastric cancer treated with systemic chemotherapy. J. Cancer 11, 4205–4212. https://doi.org/10.7150/jca.39575 (2020).
Article CAS PubMed PubMed Central Google Scholar
Xia, L. J. et al. Significance of neutrophil-to-lymphocyte ratio, platelet-to-lymphocyte ratio, lymphocyte-to-monocyte ratio and prognostic nutritional index for predicting clinical outcomes in T1–2 rectal cancer. BMC Cancer 20, 208. https://doi.org/10.1186/s12885-020-6698-6 (2020).
Article CAS PubMed PubMed Central Google Scholar
Buonacera, A., Stancanelli, B., Colaci, M. & Malatino, L. Neutrophil to lymphocyte ratio: An emerging marker of the relationships between the immune system and diseases. Int. J. Mol. Sci. 23, 3636. https://doi.org/10.3390/ijms23073636 (2022).
Article CAS PubMed PubMed Central Google Scholar
Xu, L. et al. Role of lymphocyte-related immune-inflammatory biomarkers in detecting early progression of Guillain-Barré syndrome. J. Clin. Neurosci. 105, 31–36. https://doi.org/10.1016/j.jocn.2022.08.017 (2022).
Article MathSciNet CAS PubMed Google Scholar
Wang, N., He, S., Zheng, Y. & Wang, L. The value of NLR versus MLR in the short-term prognostic assessment of HBV-related acute-on-chronic liver failure. Int. Immunopharmacol. 121, 110489. https://doi.org/10.1016/j.intimp.2023.110489 (2023).
Article CAS PubMed Google Scholar
Regolo, M. et al. Neutrophil-to-lymphocyte ratio (NLR) is a promising predictor of mortality and admission to intensive care unit of COVID-19 patients. J. Clin. Med. 11, 2235. https://doi.org/10.3390/jcm11082235 (2022).
Article CAS PubMed PubMed Central Google Scholar
Amal, S. et al. Use of multi-modal data and machine learning to improve cardiovascular disease care. Front. Cardiovasc. Med. 9, 840262. https://doi.org/10.3389/fcvm.2022.840262 (2022).
Article CAS PubMed PubMed Central Google Scholar
Willem, T. et al. Risks and benefits of dermatological machine learning health care applications-an overview and ethical analysis. J. Eur. Acad. Dermatol. Venereol. 36, 1660–1668. https://doi.org/10.1111/jdv.18192 (2022).
Article CAS PubMed Google Scholar
Ozer, M. E., Sarica, P. O. & Arga, K. Y. New machine learning applications to accelerate personalized medicine in breast cancer: Rise of the support vector machines. Omics 24, 241–246. https://doi.org/10.1089/omi.2020.0001 (2020).
Article CAS PubMed Google Scholar
Takács, A. T., Bukva, M., Bereczki, C., Burián, K. & Terhes, G. Diagnosis of Epstein-Barr and cytomegalovirus infections using decision trees: An effective way to avoid antibiotic overuse in paediatric tonsillopharyngitis. BMC Pediatr. 23, 301. https://doi.org/10.1186/s12887-023-04103-0 (2023).
Article PubMed PubMed Central Google Scholar
Corbin, C. K. et al. Personalized antibiograms for machine learning driven antibiotic selection. Commun. Med. (Lond.) 2, 38. https://doi.org/10.1038/s43856-022-00094-8 (2022).
Article PubMed Google Scholar
Sayood, S. & Durkin, M. J. The challenge of outpatient antibiotic stewardship. JAMA Netw. Open 6, e2312996. https://doi.org/10.1001/jamanetworkopen.2023.12996 (2023).
Article PubMed Google Scholar
Pacios, E. Antibiotic stewardship in the real world. Lancet Infect. Dis. 22, 448–449. https://doi.org/10.1016/s1473-3099(22)00147-5 (2022).
Article PubMed Google Scholar
Mponponsuo, K. et al. Age and sex-specific incidence rates of group A streptococcal pharyngitis between 2010 and 2018: A population-based study. Future Microbiol. 16, 1053–1062. https://doi.org/10.2217/fmb-2021-0077 (2021).
Article CAS PubMed Google Scholar
Mustafa, Z. & Ghaffari, M. Diagnostic methods, clinical guidelines, and antibiotic treatment for group A streptococcal pharyngitis: A narrative review. Front. Cell Infect. Microbiol. 10, 563627. https://doi.org/10.3389/fcimb.2020.563627 (2020).
Article CAS PubMed PubMed Central Google Scholar
Largman-Chalamish, M. et al. Differentiating between bacterial and viral infections by estimated CRP velocity. PLoS One 17, e0277401. https://doi.org/10.1371/journal.pone.0277401 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhang, M. et al. Prediction of virus-host infectious association by supervised learning methods. BMC Bioinform. 18, 60. https://doi.org/10.1186/s12859-017-1473-7 (2017).
Article Google Scholar
Bg, S. et al. Neutrophil-to-lymphocyte, lymphocyte-to-monocyte, and platelet-to-lymphocyte ratios: Prognostic significance in COVID-19. Cureus 13, e12622. https://doi.org/10.7759/cureus.12622 (2021).
Article PubMed PubMed Central Google Scholar
Inanli, I., Aydin, M., Çaliskan, A. M. & Eren, I. Neutrophil/lymphocyte ratio, monocyte/lymphocyte ratio, and mean platelet volume as systemic inflammatory markers in different states of bipolar disorder. Nord. J. Psychiatry 73, 372–379. https://doi.org/10.1080/08039488.2019.1640789 (2019).
Article PubMed Google Scholar
Livorsi, D. J. et al. Antibiotic stewardship implementation and antibiotic use at hospitals with and without on-site infectious disease specialists. Clin. Infect. Dis. 72, 1810–1817. https://doi.org/10.1093/cid/ciaa388 (2021).
Article CAS PubMed Google Scholar
Russell, C. D. et al. The utility of peripheral blood leucocyte ratios as biomarkers in infectious diseases: A systematic review and meta-analysis. J. Infect. 78, 339–348. https://doi.org/10.1016/j.jinf.2019.02.006 (2019).
Article PubMed PubMed Central Google Scholar
Xu, H. et al. Potential blood biomarkers for diagnosing periprosthetic joint infection: A single-center, retrospective study. Antibiotics (Basel) 11, 505. https://doi.org/10.3390/antibiotics11040505 (2022).
Article CAS PubMed Google Scholar
Lewin-Epstein, O., Baruch, S., Hadany, L., Stein, G. Y. & Obolski, U. Predicting antibiotic resistance in hospitalized patients by applying machine learning to electronic medical records. Clin. Infect. Dis. 72, e848–e855. https://doi.org/10.1093/cid/ciaa1576 (2021).
Article PubMed Google Scholar
Dhiman, P. et al. Overinterpretation of findings in machine learning prediction model studies in oncology: A systematic review. J. Clin. Epidemiol. 157, 120–133. https://doi.org/10.1016/j.jclinepi.2023.03.012 (2023).
Article PubMed Google Scholar
Adamichou, C. et al. Lupus or not? SLE risk probability index (SLERPI): A simple, clinician-friendly machine learning-based model to assist the diagnosis of systemic lupus erythematosus. Ann. Rheum Dis. 80, 758–766. https://doi.org/10.1136/annrheumdis-2020-219069 (2021).
Article CAS PubMed Google Scholar
Li, Y., Lu, F. & Yin, Y. Applying logistic LASSO regression for the diagnosis of atypical Crohn’s disease. Sci. Rep. 12, 11340. https://doi.org/10.1038/s41598-022-15609-5 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Huang, J. C. et al. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Comput. Methods Programs Biomed. 195, 105536. https://doi.org/10.1016/j.cmpb.2020.105536 (2020).
Article PubMed Google Scholar
Wu, L. et al. LASSO Regression-based diagnosis of acute ST-segment elevation myocardial infarction (STEMI) on electrocardiogram (ECG). J. Clin. Med. 11, 5408. https://doi.org/10.3390/jcm11185408 (2022).
Article ADS PubMed PubMed Central Google Scholar
Cunha, C. B. & Opal, S. M. Antibiotic stewardship: Strategies to minimize antibiotic resistance while maximizing antibiotic effectiveness. Med. Clin. North Am. 102, 831–843. https://doi.org/10.1016/j.mcna.2018.04.006 (2018).
Article PubMed Google Scholar
Moore, M. Antibiotic stewardship: Where next?. Br. J. Gen. Pract. 73, 100–101. https://doi.org/10.3399/bjgp23X732033 (2023).
Article PubMed Google Scholar
Sang-To, T., Le-Minh, H., Mirjalili, S., Abdel Wahab, M. & Cuong-Le, T. A new movement strategy of grey wolf optimizer for optimization problems and structural damage identification. Adv. Eng. Softw. 173, 103276. https://doi.org/10.1016/j.advengsoft.2022.103276 (2022).
Article Google Scholar
Yang, Y., Chen, H., Heidari, A. A. & Gandomi, A. H. Hunger games search: Visions, conception, implementation, deep analysis, perspectives, and towards performance shifts. Expert Syst. Appl. 177, 114864. https://doi.org/10.1016/j.eswa.2021.114864 (2021).
Article Google Scholar
Sang-To, T., Le-Minh, H., Abdel Wahab, M. & Thanh, C.-L. A new metaheuristic algorithm: Shrimp and Goby association search algorithm and its application for damage identification in large-scale and complex structures. Adv. Eng. Softw. 176, 103363. https://doi.org/10.1016/j.advengsoft.2022.103363 (2023).
Article Google Scholar
Sang-To, T. et al. Forecasting of excavation problems for high-rise building in Vietnam using planet optimization algorithm. Sci. Rep. 11, 23809. https://doi.org/10.1038/s41598-021-03097-y (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Ahmadianfar, I., Heidari, A. A., Gandomi, A. H., Chu, X. & Chen, H. RUN beyond the metaphor: An efficient optimization algorithm based on Runge Kutta method. Expert Syst. Appl. 181, 115079. https://doi.org/10.1016/j.eswa.2021.115079 (2021).
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Medical Technology, Hebei Medical University, Shijiazhuang, 050017, People’s Republic of China
Zhe Jin
Department of Otorhinolaryngology, Hebei Provincial Hospital of Traditional Chinese Medicine, Shijiazhuang, 050011, People’s Republic of China
Fengmei Ma & Shufan Guo
Medicine-Education Coordination and Medical Education Research Center, Hebei Medical University, Shijiazhuang, 050017, People’s Republic of China
Haoyang Chen

Authors

Zhe Jin
View author publications
You can also search for this author in PubMed Google Scholar
Fengmei Ma
View author publications
You can also search for this author in PubMed Google Scholar
Haoyang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shufan Guo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.J.: Design of the work, Analysis and Draft-writing. F.M.: Data curation, Interpretation of data, Methodology. H.C.: the idea of creation of codes used in Ti 84 as in Fig. 4. S.G.: Supervision, substantively revised the work. All authors reviewed the manuscript.

Corresponding author

Correspondence to Shufan Guo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jin, Z., Ma, F., Chen, H. et al. Leveraging machine learning to distinguish between bacterial and viral induced pharyngitis using hematological markers: a retrospective cohort study. Sci Rep 13, 22899 (2023). https://doi.org/10.1038/s41598-023-49925-1

Download citation

Received: 09 August 2023
Accepted: 13 December 2023
Published: 21 December 2023
DOI: https://doi.org/10.1038/s41598-023-49925-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.