Abstract
Accurate differentiation between bacterial and viral-induced pharyngitis is recognized as essential for personalized treatment and judicious antibiotic use. From a cohort of 693 patients with pharyngitis, data from 197 individuals clearly diagnosed with bacterial or viral infections were meticulously analyzed in this study. By integrating detailed hematological insights with several machine learning algorithms, including Random Forest, Neural Networks, Decision Trees, Support Vector Machine, Naive Bayes, and Lasso Regression, for potential biomarkers were identified, with an emphasis being placed on the diagnostic significance of the Monocyte-to-Lymphocyte Ratio. Distinct inflammatory signatures associated with bacterial infections were spotlighted in this study. An innovation introduced in this research was the adaptation of the high-accuracy Lasso Regression model for the TI-84 calculator, with an AUC (95% CI) of 0.94 (0.925–0.955) being achieved. Using this adaptation, pivotal laboratory parameters can be input on-the-spot and infection probabilities can be computed subsequently. This methodology embodies an improvement in diagnostics, facilitating more effective distinction between bacterial and viral infections while fostering judicious antibiotic use.
Similar content being viewed by others
Introduction
Pharyngitis, defined by pharyngeal inflammation, is a predominant concern within the department of otorhinolaryngology, affecting a vast number of individuals annually1. Plenty of infectious agents can induce pharyngitis; however, bacterial sources, especially Group A β-hemolytic streptococcus (GABHS) is most prevalent2. Concurrently, viral infections continue to present significant clinical complexities3. Accurately differentiating between bacterial and viral pharyngitis is critical not only for precise therapeutic strategies but also to curb the overuse of antibiotics, a trend exacerbating the global rise of antibiotic-resistant organisms4,5.
In the past decade, the potential of complete blood count (CBC) parameters such as Neutrophil-to-Lymphocyte ratio (NLR) and Monocyte-to-Lymphocyte ratio (MLR) has been extensively explored as biomarkers for early diagnosis in cancers6,7,8,9. These markers have also shown promise in differentiating between bacterial and viral infections, providing a non-invasive, cost-effective approach to aid in clinical decision-making10,11. Nonetheless, the differentiation between viral and bacterial pharyngitis still poses a significant challenge, calling for more advanced and precise diagnostic tools12,13.
Recently, machine learning (ML) has gained traction in healthcare for its potential to revolutionize diagnosis and treatment14,15. ML models, capable of learning from large datasets and identifying complex patterns, have shown promise in infections. Despite these advances, there is still a paucity of research exploring the utility of machine learning in differentiating between bacterial and viral pharyngitis specifically16,17.
Therefore, this study aims to develop and validate an ML model specifically tailored to distinguish between viral and bacterial pharyngitis, improving diagnosis accuracy18 and promoting more responsible antibiotic stewardship19,20.
Methods
Study design
This retrospective cohort study included adult patients with pharyngitis caused by different infection types. An evaluation of the diagnostic accuracy of bacterial and viral pharyngitis across various demographic groups was conducted based on a retrospective study design. A comprehensive clinical examination has been undertaken for each participant, considering their medical history and current symptoms.
Patients’ recruitment
The patients with pharyngitis were enrolled in the study through a systematic recruitment process that aimed to ensure the inclusion of eligible participants with complete and relevant data. The recruitment process followed several steps to identify and select suitable candidates:
-
1.
Patient Identification: Potential participants with symptoms of pharyngitis were identified from the patient population attending the Department of Otorhinolaryngology at Hebei Provincial Hospital of Traditional Chinese Medicine, between 2019 and 2023.
-
2.
Screening for patients with pharyngitis: This screening involved a review of their medical history and a clinical examination.
-
3.
Inclusion Criteria: To be included in the study, patients had to meet the following criteria:
-
3.1.
Confirmed diagnosis of pharyngitis based on clinical evaluation.
-
3.2.
Absence of severe medical conditions or comorbidities that could confound the analysis.
-
3.1.
-
4.
Exclusion Criteria: Patients with the following characteristics were excluded from the study:
-
4.1.
Absence of essential demographic details or incomplete data pertaining to complete cell count metrics.
-
4.2.
Age below 18 years.
-
4.3.
Patients without a definitive diagnosis of infection type.
-
4.1.
Independent variables
Several parameters, including basic demographic information, complete cell count, and novel parameters such as the NLR, platelet-to-lymphocyte ratio (PLR), monocyte-to-lymphocyte ratio (MLR), and mean platelet volume-to-lymphocyte ratio (MPVLR), were assessed to provide a comprehensive picture. These novel parameters have been log-transformed prior to analysis to manage skewness, stabilize variance, lessen the influence of outliers, and convert multiplicative relationships into more interpretable additive ones, enhancing the robustness and validity of our statistical tests.
Statistical analysis
The current investigation employed a dataset comprising diverse clinical metrics, indicative of either bacterial or viral infections. To ensure a balanced comparison between the different infection types, a 1:1 propensity score matching (PSM) was utilized. Following this matching, the dataset was randomly partitioned into a training cohort (75%) and a validation cohort (25%). Continuous variables were evaluated using two-sample t-tests, while categorical variables were assessed through chi-squared tests. The threshold for statistical significance was set at p = 0.05. Analytical computations were conducted using R (version 3.6.3) and Python (version 3.7).
Machine learning analysis
A suite of machine learning algorithms was applied to selected clinical parameters to develop predictive models. The algorithms employed included Lasso Regression, Decision Trees, Random Forest, Support Vector Machines (SVM), Neural Networks (NN), and Naive Bayes (NB). The performance of these algorithms was evaluated using metrics such as accuracy, precision, recall, F1-score, and the area under the Receiver Operating Characteristic (AUC) curve. Furthermore, the importance of each feature was assessed across various models to ascertain their contribution to the predictive power of the models.
Visual representations, including ROC curves, were crafted for each model to enable a comparative evaluation of their performances between training and validation cohorts. Additionally, violin plots illustrated the distribution of clinical metrics across the two infection types.
Model deployment
The Lasso regression model was encapsulated into a TI-84 calculator via a custom script, engineered for rapid input of laboratory parameters. Upon input, an output delineating the infection type probability was generated. The model's performance was stringently evaluated and validated using our designated validation cohort.
Ethical compliance
This study is approved by the Medical Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine, the register num is HBZY2023-KY-012-01. A waiver for the requirement of informed consent has been granted by the Medical Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine due to its retrospective nature. Strict adherence to the ethical guidelines related to human subjects in research was maintained in our study. all their privacy and confidentiality were upheld throughout the study.
Results
A total of 693 patients diagnosed with pharyngitis were initially identified. Following rigorous adherence to predefined inclusion and exclusion criteria, a cohort of 197 eligible patients was delineated. This cohort included 74 individuals diagnosed with viral infections and 123 with bacterial infections, as depicted in Fig. 1. These participants were then methodically allocated into two primary cohorts for further analysis: the training cohort, consisting of 55 individuals with bacterial infections and 56 with viral infections, and the validation cohort, comprising 19 individuals with bacterial infections and 18 with viral infections. This stratification provided a structured framework for the comparative analysis of viral and bacterial pharyngitis cases, thereby facilitating the subsequent development and validation of machine learning models.
Demographic attributes such as sex, age, and clinical status (outpatient or inpatient) were recorded. An approximately balanced distribution of males and females was observed across both cohorts, aligning with the demographic findings in related literature21. Age ranged from 18 to 85 years, with the most represented age group being 18–34 years. Majority of the patients were outpatients, with no significant difference in distribution between the two infection types (Table 1).
Outcome measures focused on hematological indices. The observed variations included higher Red blood cell (RBC) count and Hemoglobin concentration (HGB) levels in patients with viral infections (p = 0.003 and p = 0.006, respectively) and elevated White Blood Cell (WBC) count, Neutrophil count (NEU) count, and Monocyte count (MONO) count in patients with bacterial infections (p < 0.01 for each parameter). Significant differences were noted for other parameters such as Percentage of Neutrophils (NEUp), Lymphocyte (LYM) count, Percentage of Lymphocytes (LYMp), log-transformed Monocyte to Lymphocyte ratio (logMLR), log-transformed Neutrophil to Lymphocyte ratio (logNLR), and log-transformed Platelet to Lymphocyte ratio (logPLR) between the two infection groups (Table 1).
A comparative analysis revealed significant differences in several hematological indices between the viral and bacterial infection groups in both the training and validation cohorts. Notably, in the training cohort, there were significant variations regarding HGB, WBC, NEU, NEUp, LYM, LYMp, logMLR, logNLR, logPLR, and log-transformed Platelet Volume to Lymphocyte ratio (logMPVLR) (all p < 0.05). Meanwhile, the validation cohort displayed significant differences for NEU, NEUp, LYMp, logMLR, and logNLR (all p < 0.05) (Table 2). These findings echo the inherent diagnostic challenges associated with pharyngitis, where overlapping symptoms between bacterial, primarily caused by Group A β-hemolytic streptococcus, and viral pharyngitis often complicate accurate diagnosis22. Although blood tests have been instrumental in aiding the diagnosis of acute viral and bacterial infections, their efficacy is sometimes hindered by their inability to capture the evolving inflammatory response post-symptom onset23.
The violin plots demonstrated distinct trends in hematological and inflammatory parameters between bacterial and viral infections. Parameters such as WBC and MONO had overlapping distributions, while NEU, NEUp, logMLR, logNLR, and logPLR were predominantly higher in bacterial infections. LYM and LYMp leaned more towards viral infections (Fig. 2). It was observed that parameters like WBC count and MONO count exhibited overlapping distributions, hinting at a common inflammatory response irrespective of the infection type. Conversely, parameters such as NEU, NEUp, logMLR, logNLR, and logPLR demonstrated elevated levels predominantly in bacterial infections.
The employment of machine learning methodologies was directed at determining their predictive efficacy on the dataset. Performance metrics for both the training and validation cohorts, including accuracy, precision, recall, F1-score, and the AUC were computed and are displayed in Tables 3 and 4. The Random Forest notably exhibited superior performance in terms of accuracy and AUC in both cohorts, aligning with findings from past studies on bacterial and viral infections24. Meanwhile, ROC curves for Lasso Regression and SVM models suggested a high degree of accuracy in infection type classification. During cross-validation on the training set, the optimized Lasso Regression model attained an AUC score of approximately 0.90. The model's robustness and generalizability were confirmed through its performance on a separate validation set, where it achieved an AUC score of approximately 0.94, demonstrating its ability to effectively distinguish between viral and bacterial infections (Fig. 3).
The feature importance of each variable was evaluated across different machine learning models, revealing NEUp, logMLR, and logPLR to be the significant. The highest importance scores in the Lasso Regression model were found for NEUp (2.0110), logMPVLR (1.0451), and logPLR (0.6210). In the Decision Tree model, a high importance score was assigned to the NEUp variable (0.5127). Notably, the Random Forest model showed elevated scores for NEUp (0.3024) and LYMp (0.1349). The SVM model indicated WBC (0.0528) and NEU (0.0722) as most important, while in the Neural Network model, logMPVLR (0.1306) had the highest score. In the Naive Bayes model, the WBC variable scored slightly higher (0.0556), underscoring their potential utility in diagnostic algorithms (Table 5).
Following deployment, the Lasso Regression model exhibited substantial adeptness in differentiating between bacterial and viral infections. By simply inputting the selected laboratory parameters into the TI-84 calculator, healthcare professionals could expeditiously generate infection probability outcomes (Fig. 4). The model was stringently assessed. The validation cohort in our study, included data from 37 patients (19 bacterial, 18 viral infections). The consistent and effective performance emphasizes the model's robustness and reliability.
Discussion
This study has highlighted hematological disparities between bacterial and viral infections, shedding light on the pronounced inflammatory response elicited predominantly by bacterial infections. The hematological parameters, MLR, NLR, PLR, and MPVLR have been emphasized as notable biomarkers25,26. In line with established literature, viral infections are typically characterized by augmented RBC counts and HGB levels, while bacterial infections are more likely to display heightened WBC, NEU, and MONO counts27 (Table 1).
The distinct variations in key hematological parameters such as NEU, NEUp, LYMp, logMLR, and logNLR underscore the differential immunological responses between bacterial and viral infections (Table 2). It is well-established that neutrophils are the primary leukocytes engaged in immune responses during bacterial infections, while lymphocyte-mediated immune responses are predominantly observed during viral infections28. The substantial feature importance score of logMLR across various machine learning models accentuates its critical role as a distinguishing hematological parameter, potentially aiding in the enhanced diagnostic differentiation between bacterial and viral infections in our study. Nevertheless, the diagnostic quandaries stemming from the overlapping distributions of WBC and NEU, as illustrated in Fig. 2, emphasize the imperative for a broader diagnostic strategy, transcending the reliance on singular markers29.
From a computational viewpoint, the Random Forest emerged as the most proficient predictor for classifying infection types, albeit with Neural Networks showing close prowess30. Conversely, SVM and Naive Bayes showcased diverse performances, underscoring the imperative nature of meticulous model selection tailored to specific data characteristics31 (Tables 3 and 4). Both Lasso Regression and Random Forest were proficient in differentiating bacterial from viral infections (Fig. 3).
In this study, Lasso Regression was utilized to create a diagnostic model for classifying infection types. The choice of Lasso Regression was predicated on its unique characteristics, which encompass both variable selection and regularization functionalities32. This makes it particularly suitable for this type of problem. Although more complex machine learning methodologies are available, Lasso Regression establishes a balance between model intricacy and interpretability33. This equilibrium is essential in clinical environments where elucidating the relationship between predictors and outcomes is as vital as achieving prediction accuracy34,35.
A noteworthy innovation of this work is the successful amalgamation of the Lasso model with a widely accessible computational tool, the TI-84 calculator. Although both Random Forest and Lasso Regression exhibited commendable performance in our analysis, the computational parsimony of Lasso Regression rendered it a more pragmatic choice for the TI-84 calculator, known for its ease of use, cost-effectiveness, and ubiquitous availability. This integration facilitated the rapid and efficient input of pivotal laboratory parameters including NEU, MONO, NEUp, LYM, LYMp, PLT, and MPV, consequently generating the probability of infection type. By merely inputting the selected laboratory parameters into the calculator, healthcare professionals could promptly ascertain the likelihood of bacterial or viral infections (Fig. 4). No commercial reagents or specific equipment were required in this methodology, promoting its cost-effectiveness and widespread accessibility.
Integral to this discussion is the concept of antibiotic stewardship. Given the emerging global challenge of antibiotic resistance, it is imperative to differentiate bacterial from viral infections to ensure judicious antibiotic use36. These findings contribute significantly to antibiotic stewardship efforts by pinpointing potential biomarkers that might expedite accurate diagnosis, thereby minimizing unwarranted antibiotic prescriptions. Emphasizing the need for precise diagnosis and targeted therapies, this study underlines the importance of combining clinical, laboratory, and computational tools in the era of personalized medicine and antibiotic stewardship37.
The prospect of amplifying diagnostic precision through the amalgamation of optimization algorithms with machine learning methodologies is indeed exhilarating. Esteemed optimization algorithms such as the refined Grey Wolf Optimizer (LGWO)38, Hunger Games Search (HGS)39, Shrimp and Goby Association Search algorithm (SGA)40, Planet Optimization Algorithm (P.O.A.)41, and Runge Kutta optimizer (RUN)42 possess the potential to significantly enhance model efficacy. Although the current study did not delve into these optimization techniques, the future incorporation of such advanced optimization algorithms to refine the machine learning models utilized in this study is a significant direction we plan to pursue.
Limitations
This study acknowledges several limitations. The dataset, while comprehensive, encapsulates a specific patient population with unique characteristics that may influence the performance of the machine learning models. Potential confounding variables, including underlying health conditions and medication usage, were not rigorously controlled, possibly subtly impacting the outcomes. The generalizability of the findings may be contingent on the specific patient population from which the dataset was derived. Despite these considerations, the insights derived from this study are valuable, laying a groundwork for more exhaustive future investigations.
Conclusion
This study underscores the clinical necessity of accurately and swiftly distinguishing between bacterial and viral pharyngitis. By integrating traditional laboratory techniques with advanced machine learning, a new dimension to the diagnostic potential of hematological markers such as MLR was explored. The notable efficacy of the Random Forest and Lasso Regression in data prediction for this specific dataset suggests that exploring various machine learning techniques could hold promise for further diagnostic advancements.
The adaptation of a Lasso Regression model for use in a TI-84 calculator showcased a practical application of machine learning in clinical settings, enhancing accessibility and ease of use compared to traditional nomograms. These findings illuminate hematological distinctions between viral and bacterial infections in adult patients with pharyngitis, offering MLR as a potential addition to diagnostic methodologies. This not only has the potential to enhance diagnostic accuracy but also to refine therapeutic interventions.
It would be beneficial to extend the application of this model to other types of infections, and to integrate more variables and machine learning techniques, thereby enhancing its utility in infectious disease diagnosis. The results from this study mark a step towards more precise and timely diagnosis of pharyngitis, contributing to better management and treatment of this common condition.
Data availability
In adherence to privacy regulations and to ensure the confidentiality of sensitive patient information, the datasets utilized in this study are not publicly available. The research project and access to health databases are overseen by the Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine, which has set guidelines recommending against public dissemination of these datasets. Should researchers wish to request access to the data for academic purposes, they may contact the Corresponding Author who will facilitate the request in accordance with the guidelines set forth by the Ethical Committee of Hebei Provincial Hospital of Traditional Chinese Medicine.
Abbreviations
- GABHS:
-
Group A β-hemolytic streptococcus
- CBC:
-
Complete blood count
- RBC:
-
Red blood cell count
- HGB:
-
Hemoglobin concentration
- WBC:
-
White blood cell count
- NEU:
-
Neutrophil count
- NEUp:
-
Percentage of neutrophils
- MONO:
-
Monocyte count
- MONOp:
-
Percentage of monocytes
- LYM:
-
Lymphocyte count
- LYMp:
-
Percentage of lymphocytes
- PLT:
-
Platelet count
- MPV:
-
Mean platelet volume
- ML:
-
Machine learning
- MLR:
-
Monocyte to lymphocyte ratio
- NLR:
-
Neutrophil to lymphocyte ratio
- PLR:
-
Platelet to lymphocyte ratio
- MPVLR:
-
Mean platelet volume to lymphocyte ratio
- IQR:
-
Interquartile range
- SD:
-
Standard deviation
- AUC:
-
Area under the curve
- CI:
-
Confidence interval
- LR:
-
Lasso regression
- DT:
-
Decision trees
- SVM:
-
Support vector machine
- NN:
-
Neural networks
- NB:
-
Naive bayes
References
Berkley, J. Management of pharyngitis. Circulation 138, 1920–1922. https://doi.org/10.1161/circulationaha.118.035900 (2018).
Luo, R. et al. Diagnosis and management of group a streptococcal pharyngitis in the United States, 2011–2015. BMC Infect. Dis. 19, 193. https://doi.org/10.1186/s12879-019-3835-4 (2019).
Yildiz, I., Gonullu, E., Soysal, A., Oner, C. N. & Karabocuoglu, M. The epidemiology of influenza virus infection and group A streptococcal pharyngitis in children between 2011 and 2018 in an outpatient pediatric clinic. Cureus 15, e33492. https://doi.org/10.7759/cureus.33492 (2023).
Badr, A. F., Humedi, R. A., Alfarsi, N. A. & Alghamdi, H. A. Rapid antigen detection test (RADT) for pharyngitis diagnosis in children: Public and pharmacist perception. Saudi Pharm. J. 29, 677–681. https://doi.org/10.1016/j.jsps.2021.04.029 (2021).
Barbieri, E. et al. Antibiotic prescriptions in acute otitis media and pharyngitis in Italian pediatric outpatients. Ital. J. Pediatr. 45, 103. https://doi.org/10.1186/s13052-019-0696-9 (2019).
Wang, K., Chen, Y., Nie, Z. & Wang, J. Neutrophil-to-lymphocyte ratio to estimate colorectal cancer liver metastasis: A commentary. Int. J. Surg. https://doi.org/10.1097/js9.0000000000000535 (2023).
Heymann, W. R. The neutrophil-to-lymphocyte ratio in cutaneous oncology: Simply elegant. J. Am. Acad. Dermatol. 86, 533–534. https://doi.org/10.1016/j.jaad.2021.11.060 (2022).
Zhou, D. et al. The prognostic value of neutrophil-to-lymphocyte ratio and monocyte-to-lymphocyte ratio in metastatic gastric cancer treated with systemic chemotherapy. J. Cancer 11, 4205–4212. https://doi.org/10.7150/jca.39575 (2020).
Xia, L. J. et al. Significance of neutrophil-to-lymphocyte ratio, platelet-to-lymphocyte ratio, lymphocyte-to-monocyte ratio and prognostic nutritional index for predicting clinical outcomes in T1–2 rectal cancer. BMC Cancer 20, 208. https://doi.org/10.1186/s12885-020-6698-6 (2020).
Buonacera, A., Stancanelli, B., Colaci, M. & Malatino, L. Neutrophil to lymphocyte ratio: An emerging marker of the relationships between the immune system and diseases. Int. J. Mol. Sci. 23, 3636. https://doi.org/10.3390/ijms23073636 (2022).
Xu, L. et al. Role of lymphocyte-related immune-inflammatory biomarkers in detecting early progression of Guillain-Barré syndrome. J. Clin. Neurosci. 105, 31–36. https://doi.org/10.1016/j.jocn.2022.08.017 (2022).
Wang, N., He, S., Zheng, Y. & Wang, L. The value of NLR versus MLR in the short-term prognostic assessment of HBV-related acute-on-chronic liver failure. Int. Immunopharmacol. 121, 110489. https://doi.org/10.1016/j.intimp.2023.110489 (2023).
Regolo, M. et al. Neutrophil-to-lymphocyte ratio (NLR) is a promising predictor of mortality and admission to intensive care unit of COVID-19 patients. J. Clin. Med. 11, 2235. https://doi.org/10.3390/jcm11082235 (2022).
Amal, S. et al. Use of multi-modal data and machine learning to improve cardiovascular disease care. Front. Cardiovasc. Med. 9, 840262. https://doi.org/10.3389/fcvm.2022.840262 (2022).
Willem, T. et al. Risks and benefits of dermatological machine learning health care applications-an overview and ethical analysis. J. Eur. Acad. Dermatol. Venereol. 36, 1660–1668. https://doi.org/10.1111/jdv.18192 (2022).
Ozer, M. E., Sarica, P. O. & Arga, K. Y. New machine learning applications to accelerate personalized medicine in breast cancer: Rise of the support vector machines. Omics 24, 241–246. https://doi.org/10.1089/omi.2020.0001 (2020).
Takács, A. T., Bukva, M., Bereczki, C., Burián, K. & Terhes, G. Diagnosis of Epstein-Barr and cytomegalovirus infections using decision trees: An effective way to avoid antibiotic overuse in paediatric tonsillopharyngitis. BMC Pediatr. 23, 301. https://doi.org/10.1186/s12887-023-04103-0 (2023).
Corbin, C. K. et al. Personalized antibiograms for machine learning driven antibiotic selection. Commun. Med. (Lond.) 2, 38. https://doi.org/10.1038/s43856-022-00094-8 (2022).
Sayood, S. & Durkin, M. J. The challenge of outpatient antibiotic stewardship. JAMA Netw. Open 6, e2312996. https://doi.org/10.1001/jamanetworkopen.2023.12996 (2023).
Pacios, E. Antibiotic stewardship in the real world. Lancet Infect. Dis. 22, 448–449. https://doi.org/10.1016/s1473-3099(22)00147-5 (2022).
Mponponsuo, K. et al. Age and sex-specific incidence rates of group A streptococcal pharyngitis between 2010 and 2018: A population-based study. Future Microbiol. 16, 1053–1062. https://doi.org/10.2217/fmb-2021-0077 (2021).
Mustafa, Z. & Ghaffari, M. Diagnostic methods, clinical guidelines, and antibiotic treatment for group A streptococcal pharyngitis: A narrative review. Front. Cell Infect. Microbiol. 10, 563627. https://doi.org/10.3389/fcimb.2020.563627 (2020).
Largman-Chalamish, M. et al. Differentiating between bacterial and viral infections by estimated CRP velocity. PLoS One 17, e0277401. https://doi.org/10.1371/journal.pone.0277401 (2022).
Zhang, M. et al. Prediction of virus-host infectious association by supervised learning methods. BMC Bioinform. 18, 60. https://doi.org/10.1186/s12859-017-1473-7 (2017).
Bg, S. et al. Neutrophil-to-lymphocyte, lymphocyte-to-monocyte, and platelet-to-lymphocyte ratios: Prognostic significance in COVID-19. Cureus 13, e12622. https://doi.org/10.7759/cureus.12622 (2021).
Inanli, I., Aydin, M., Çaliskan, A. M. & Eren, I. Neutrophil/lymphocyte ratio, monocyte/lymphocyte ratio, and mean platelet volume as systemic inflammatory markers in different states of bipolar disorder. Nord. J. Psychiatry 73, 372–379. https://doi.org/10.1080/08039488.2019.1640789 (2019).
Livorsi, D. J. et al. Antibiotic stewardship implementation and antibiotic use at hospitals with and without on-site infectious disease specialists. Clin. Infect. Dis. 72, 1810–1817. https://doi.org/10.1093/cid/ciaa388 (2021).
Russell, C. D. et al. The utility of peripheral blood leucocyte ratios as biomarkers in infectious diseases: A systematic review and meta-analysis. J. Infect. 78, 339–348. https://doi.org/10.1016/j.jinf.2019.02.006 (2019).
Xu, H. et al. Potential blood biomarkers for diagnosing periprosthetic joint infection: A single-center, retrospective study. Antibiotics (Basel) 11, 505. https://doi.org/10.3390/antibiotics11040505 (2022).
Lewin-Epstein, O., Baruch, S., Hadany, L., Stein, G. Y. & Obolski, U. Predicting antibiotic resistance in hospitalized patients by applying machine learning to electronic medical records. Clin. Infect. Dis. 72, e848–e855. https://doi.org/10.1093/cid/ciaa1576 (2021).
Dhiman, P. et al. Overinterpretation of findings in machine learning prediction model studies in oncology: A systematic review. J. Clin. Epidemiol. 157, 120–133. https://doi.org/10.1016/j.jclinepi.2023.03.012 (2023).
Adamichou, C. et al. Lupus or not? SLE risk probability index (SLERPI): A simple, clinician-friendly machine learning-based model to assist the diagnosis of systemic lupus erythematosus. Ann. Rheum Dis. 80, 758–766. https://doi.org/10.1136/annrheumdis-2020-219069 (2021).
Li, Y., Lu, F. & Yin, Y. Applying logistic LASSO regression for the diagnosis of atypical Crohn’s disease. Sci. Rep. 12, 11340. https://doi.org/10.1038/s41598-022-15609-5 (2022).
Huang, J. C. et al. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Comput. Methods Programs Biomed. 195, 105536. https://doi.org/10.1016/j.cmpb.2020.105536 (2020).
Wu, L. et al. LASSO Regression-based diagnosis of acute ST-segment elevation myocardial infarction (STEMI) on electrocardiogram (ECG). J. Clin. Med. 11, 5408. https://doi.org/10.3390/jcm11185408 (2022).
Cunha, C. B. & Opal, S. M. Antibiotic stewardship: Strategies to minimize antibiotic resistance while maximizing antibiotic effectiveness. Med. Clin. North Am. 102, 831–843. https://doi.org/10.1016/j.mcna.2018.04.006 (2018).
Moore, M. Antibiotic stewardship: Where next?. Br. J. Gen. Pract. 73, 100–101. https://doi.org/10.3399/bjgp23X732033 (2023).
Sang-To, T., Le-Minh, H., Mirjalili, S., Abdel Wahab, M. & Cuong-Le, T. A new movement strategy of grey wolf optimizer for optimization problems and structural damage identification. Adv. Eng. Softw. 173, 103276. https://doi.org/10.1016/j.advengsoft.2022.103276 (2022).
Yang, Y., Chen, H., Heidari, A. A. & Gandomi, A. H. Hunger games search: Visions, conception, implementation, deep analysis, perspectives, and towards performance shifts. Expert Syst. Appl. 177, 114864. https://doi.org/10.1016/j.eswa.2021.114864 (2021).
Sang-To, T., Le-Minh, H., Abdel Wahab, M. & Thanh, C.-L. A new metaheuristic algorithm: Shrimp and Goby association search algorithm and its application for damage identification in large-scale and complex structures. Adv. Eng. Softw. 176, 103363. https://doi.org/10.1016/j.advengsoft.2022.103363 (2023).
Sang-To, T. et al. Forecasting of excavation problems for high-rise building in Vietnam using planet optimization algorithm. Sci. Rep. 11, 23809. https://doi.org/10.1038/s41598-021-03097-y (2021).
Ahmadianfar, I., Heidari, A. A., Gandomi, A. H., Chu, X. & Chen, H. RUN beyond the metaphor: An efficient optimization algorithm based on Runge Kutta method. Expert Syst. Appl. 181, 115079. https://doi.org/10.1016/j.eswa.2021.115079 (2021).
Author information
Authors and Affiliations
Contributions
Z.J.: Design of the work, Analysis and Draft-writing. F.M.: Data curation, Interpretation of data, Methodology. H.C.: the idea of creation of codes used in Ti 84 as in Fig. 4. S.G.: Supervision, substantively revised the work. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jin, Z., Ma, F., Chen, H. et al. Leveraging machine learning to distinguish between bacterial and viral induced pharyngitis using hematological markers: a retrospective cohort study. Sci Rep 13, 22899 (2023). https://doi.org/10.1038/s41598-023-49925-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-49925-1
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.