Host-response transcriptional biomarkers accurately discriminate bacterial and viral infections of global relevance

Ko, Emily R.; Reller, Megan E.; Tillekeratne, L. Gayani; Bodinayake, Champica K.; Miller, Cameron; Burke, Thomas W.; Henao, Ricardo; McClain, Micah T.; Suchindran, Sunil; Nicholson, Bradly; Blatt, Adam; Petzold, Elizabeth; Tsalik, Ephraim L.; Nagahawatte, Ajith; Devasiri, Vasantha; Rubach, Matthew P.; Maro, Venance P.; Lwezaula, Bingileki F.; Kodikara-Arachichi, Wasantha; Kurukulasooriya, Ruvini; De Silva, Aruna D.; Clark, Danielle V.; Schully, Kevin L.; Madut, Deng; Dumler, J. Stephen; Kato, Cecilia; Galloway, Renee; Crump, John A.; Ginsburg, Geoffrey S.; Minogue, Timothy D.; Woods, Christopher W.

doi:10.1038/s41598-023-49734-6

Download PDF

Article
Open access
Published: 18 December 2023

Host-response transcriptional biomarkers accurately discriminate bacterial and viral infections of global relevance

Emily R. Ko¹^na1,
Megan E. Reller^2,3,4^na1,
L. Gayani Tillekeratne^2,3,4,5,
Champica K. Bodinayake^4,5,
Cameron Miller⁶,
Thomas W. Burke²,
Ricardo Henao⁷,
Micah T. McClain^2,3,
Sunil Suchindran²,
Bradly Nicholson⁸,
Adam Blatt⁹,
Elizabeth Petzold²,
Ephraim L. Tsalik^2,10,
Ajith Nagahawatte¹¹,
Vasantha Devasiri⁵,
Matthew P. Rubach^3,4,12,13,
Venance P. Maro^13,14,
Bingileki F. Lwezaula^14,15,
Wasantha Kodikara-Arachichi⁵,
Ruvini Kurukulasooriya¹¹,
Aruna D. De Silva¹⁶,
Danielle V. Clark^17,18,
Kevin L. Schully¹⁸,
Deng Madut^3,4,
J. Stephen Dumler¹⁹,
Cecilia Kato²⁰,
Renee Galloway²⁰,
John A. Crump^4,5,13,14,21,
Geoffrey S. Ginsburg^22,23,
Timothy D. Minogue²⁴ &
…
Christopher W. Woods^2,3,4

Scientific Reports volume 13, Article number: 22554 (2023) Cite this article

1513 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Diagnostic limitations challenge management of clinically indistinguishable acute infectious illness globally. Gene expression classification models show great promise distinguishing causes of fever. We generated transcriptional data for a 294-participant (USA, Sri Lanka) discovery cohort with adjudicated viral or bacterial infections of diverse etiology or non-infectious disease mimics. We then derived and cross-validated gene expression classifiers including: 1) a single model to distinguish bacterial vs. viral (Global Fever-Bacterial/Viral [GF-B/V]) and 2) a two-model system to discriminate bacterial and viral in the context of noninfection (Global Fever-Bacterial/Viral/Non-infectious [GF-B/V/N]). We then translated to a multiplex RT-PCR assay and independent validation involved 101 participants (USA, Sri Lanka, Australia, Cambodia, Tanzania). The GF-B/V model discriminated bacterial from viral infection in the discovery cohort an area under the receiver operator curve (AUROC) of 0.93. Validation in an independent cohort demonstrated the GF-B/V model had an AUROC of 0.84 (95% CI 0.76–0.90) with overall accuracy of 81.6% (95% CI 72.7–88.5). Performance did not vary with age, demographics, or site. Host transcriptional response diagnostics distinguish bacterial and viral illness across global sites with diverse endemic pathogens.

A 6-mRNA host response classifier in whole blood predicts outcomes in COVID-19 and other acute viral infections

Article Open access 18 January 2022

Transcript host-RNA signatures to discriminate bacterial and viral infections in febrile children

Article 15 December 2021

Sepsis endotypes identified by host gene expression across global cohorts

Article Open access 18 June 2024

Introduction

Infectious diseases are leading causes of morbidity and mortality worldwide^1,2,3. The toll is greatest in low- and middle-income countries (LMIC), where infections are frequently caused by pathogens that cannot be identified when patients present with fever and resources for testing and treatment are limited. High rates of malnutrition and HIV exacerbate the problem by contributing to increased susceptibility to infection and diversity of pathogens^4,5,6,7,8. Without sensitive and specific point-of-care diagnostics to rapidly confirm or refute multiple etiologies of fever, bacterial infections remain untreated and viral infections are treated with antibiotics unnecessarily. The result has been unprecedented inappropriate antibiotic use and associated increasing antimicrobial resistance^{9,10,11,12,13,14,15,16,17}. The World Health Organization estimates that by 2050 antimicrobial resistance will lead to 10 million lives lost and cost 100 trillion USD per year, leading to an urgent called for new diagnostic assays and approaches to combat the problem¹⁸.

Host-response transcription patterns could fill this diagnostic gap by distinguishing between bacterial and viral etiologies early^{19,20,21,22,23,24,25,26,27}, including before symptoms, to limit spread and guide resource allocation^28,29,30. Gene expression classification models have shown great promise for the classification of causes of fever in high-income countries (HIC)^31,32 with progress extending to atypical pathogens present in LMIC^{20,26,33,34,35}. These multi-analyte gene expression models can be translated to rapid diagnotic platforms that inform clinical care^32,33,34,36. In this study, we generated host response biomarkers for the varied etiologies of suspected infection important worldwide, translated them to a quantitative RT-PCR multiplex platform, and validated them in a globally diverse independent cohort.

Methods

Global fever discovery and validation cohorts

Participants were prospectively enrolled within 48 h of presentation to academic hospitals in the USA^25,37,38,39, Sri Lanka^40,41,42,43, Tanzania^44,45, Cambodia^46,47,48, and Australia (Supplemental Table 1). Samples from participants were stored in a Duke University international biorepository and selected for analysis if they met inclusion critieria for suspected infection defined as: 1) a qualifying vital sign or lab abnormalities (fever ≥ 38.0 °C or ≤ 36 °C, heart rate ≥ 90, respiratory rate ≥ 20, and/or white blood cell count ≥ 12 (cells × 10⁹L), 2) clinical symptoms consistent with acute infection, and 3) adjudicated as meeting bacterial, viral, or noninfectious case definitions (Supplemental Table 2). A committee inclusive of clinical and statistical teams made final cohort selections, ensuring adequate balance among demographic and infectious phenotypes. The discovery cohort included 294 participants presenting to academic hospitals in the USA (n = 152) or Sri Lanka (n = 142). The validation cohort included 101 participants enrolled in the USA (n = 19), Sri Lanka (n = 53), Tanzania (n = 15), Cambodia (n = 10), and Australia (n = 4).

Table 1 Demographics and participant characteristics of discovery and validation cohort.

Full size table

Table 2 Differential expression of genes upregulated at least tenfold in bacterial versus viral illness.

Full size table

Samples and etiologic testing

Blood was collected at enrollment in PAXgene RNA tubes (QIAGEN) at all sites. Sera were collected at both enrollment (acute phase) and 2–6 week follow-up (convalescent phase) in Sri Lanka and Tanzania. Naso-pharyngeal swabs were collected at enrollment in the USA, Sri Lanka, and Australia. All samples were processed according by standardized protocols, stored at − 70 °C, and shipped on dry ice.

Etiologic testing was performed using reference standard methods to confirm or refute possible bacterial and viral causes of suspected infection endemic to the region. Blood culture and/or urine antigen tests performed as part of clinical care confirmed bacteremia for USA subjects. Bacterial isolates and urine collected in Cambodia confirmed Burkholderia pseudomallei by blood culture, sputum culture, and/or urine antigen testing^47,49. For participants enrolled in Sri Lanka and Tanzania, bacterial zoonoses were confirmed by a ≥ fourfold rise in titer of microscopic agglutination testing for Leptospira spp. and Brucella spp.^44,45, or indirect immunofluorescence assay for Rickettsia spp. (Spotted Fever Group, Typhus Group, and Orientia tsutsugamushi) and Coxiella burnetii, and/or by polymerase chain reaction (PCR) in a USA reference laboratory. For participants enrolled in the USA and Sri Lanka, respiratory viral infections were confirmed by PCR on nasopharyngeal samples (Luminex Integrated System NxTAG Respiratory Pathogen Panel; Luminex Corporation; Austin, TX)⁵⁰. For those enrolled in Sri Lanka, acute dengue was confirmed by fourfold rise in antibody titer, viral isolation, and/or PCR at a reference laboratory^41,51. The Tanzania study performed blood culture and/or blood smears for malarial pathogens.

Reference standard adjudication of etiology

Phenotypic adjudication of bacterial, viral, or noninfectious etiology independent of cohort selection (described above) was performed by a panel of ≥ 2 physicians who reviewed all available microbiologic data, de-identified clinical data extracted from case report forms (international), or the full medical records (USA) (Supplemental Table 2). Participants known to have malaria by blood smear were excluded due to insufficient frequency required to generate a parasitic classifier. Non-infectious cases had supportive clinical and radiographic data along with negative testing for infectious etiologies. Infectious cases were defined by positive etiology testing and supportive clinical data. Participants included from Tanzania had confirmed bacterial etiologic testing, but did not undergo testing for viral co-infection because dengue testing and respiratory viral swab were not available as part of this study (Supplemental Table 1).

Generation and normalization of transcriptomic data

Total RNA was extracted from whole blood collected and stored at − 70 °C in PAXgene Blood RNA tubes using the PAXgene miRNA Extraction Kit (QIAGEN) according to manufacturer’s instructions. RNA yield and integrity were assessed using NanoDrop ND-2000 spectrophotometer (ThermoFisher Scientific, Wilmington, DE) and 2100 Bioanalyzer with RNA 6000 Nano kit (Agilent Technologies, Santa Clara, CA), respectively. All RNA was purified under BSL3 conditions by approved protocols at Duke Regional Biocontainment Laboratory, except B. pseudomallei mRNA isolated under BSL4 conditions by standard procedures at the Navel Medical Research Laboratory.

RNA sequencing was performed at EAGenomics/Q2 Labs (Durham, NC) for 183 samples and the Duke Sequencing and Genomic Techologies Facility for 111 samples. Library preparation resulted in selected poly-A mRNA for sequencing using GlobinClear RNA Reduction (Invitrogen) and TruSeq Stranded mRNA Library Kit (Illumina) for the EA Genomics/Q2 Labs batch, and NuGEN Universal Plus mRNA-Seq Library Prep Kit with AnyDeplete Globin depletion (NuGEN/Tecan) for the Duke Sequencing Facility batch. Sequencing libraries were sequenced on Illumina HiSeq 2500 instrument (EA Genomics/Q2 Labs) or NovaSeq 6000 instrument (Duke Sequencing Facility) with 50 bp paired-end reads and target of > 40 million reads per sample, including crossover of 24 samples between the two batches to allow for quality control and batch corrections.

Nanostring multiplex transcript detection platform

Quantitative RT-PCR assays for genes in both the Global Fever Bacterial/Viral (GF-B/V) and Global Fever-Bacterial/Viral/Noninfectious (GF-B/V/N) models were developed using the NanoString platform. Total RNA (100 ng) from each participants was analyzed using a NanoString nCounter XT custom transcriptional response probe panel (NanoString Technologies, Seattle, WA). Nanostring assay processing was performed by the Duke Microbiome Core Facility according to manufacturer instructions.

Statistical analysis

We used Limma-voom modeling to obtain differential expression of transcripts ≥ tenfold in bacteria versus virus infected participants with an adjusted p-value < 0.01 in the discovery cohort. A cutoff of ≥ tenfold was used to identify the most highly differentially expressed genes. A significance threshold of 5% false discovery rate (FDR) was used. Pathway analysis used Database for Annotation, Visualization and Integrated Discovery (DAVID) and ENRICHr programs to create broad functional groups. Transcripts that did not fit into well-defined ontologic clusters were categorized by literature review.

To develop predictive models, the discovery cohort included Duke and Sri Lanka participants because these sites had similar extensive phenotypic analysis for both bacterial and viral pathogens and adequate populations of at least two of the phenotypes classes. We developed a simple binary GF-B/V model including only participants with bacterial or viral infection (Fig. 1A). Since fever or suspected infections may be neither bacterial nor viral, we incorporated participants with non-infectious illness as a control group in a second modeling approach (GF-B/V/N). The GF-B/V/N model used two binary predictive classifiers for discrimination: bacterial vs. non-bacterial (viral or non-infectious), and viral vs. non-viral (bacterial or non-infectious). The categorization of bacterial or viral illness by the GF-B/V/N test is made for each participant by comparing the probabilities of each binary classifier (Supplemental Fig. 1A). High-confidence noninfectious samples were only available from the USA, but there were no significant difference in expression of control house-keeping genes that would suggest a site specific or confounding affect.

Standard quality control and principal component analysis was performed and ensured there were no site dependent effects or inappropriate clustering of the data. We then conducted supervised regularized regression (Least Absolute Shrinkage and Selection Operator [LASSO]) analysis of the entire transcriptome. Nested, repeated (500 repeats) fivefold cross-validation was performed to estimate predicted probabilities. All model-building steps were performed on training data only to maintain unbiased estimates generated on the test fold. Predicted probabilities were utilized to estimate area under the receiver operating curve (AUROC) and ROC01 method was used to select a cutoff to estimate accuracy and characterize performance. Use of 500 sets of predictions for the discovery cohort limited calculation of predicted confidence intervals by the standard approach⁵², but was more representative of model development.

The validation cohort was designed to represent a more typical global population; thus, sites representative of a single class or with less extensive phenotyping were included. To generate NanoString nCounter assays, we expanded feature prediction to include correlated transcripts that can substitute for one another with respect to class prediction (bacterial, viral, or noninfectious). Feature selection was performed using elastic net regression and the selection frequency across resampling iterations measured variable importance. Characterizing performance in a targeted validation study required selecting 263 transcripts (Supplementary Table 3). Endogenous control transcripts (TRAP1, DECR1, TBP, and PPIB) were incorporated to normalize for differences in sample input and correct for technical variability. A model was trained on the NanoString data using 91 participants from the discovery cohort (47 bacterial, 34 viral, 10 noninfectious), accommodating known positive control normalization to reduce technical variability and allow background subtraction using negative controls. Discovery cohort participants selected for model training on NanoString prioritized three goals in the following order: 1) balance of infectious etiologies and phenotypes, 2) robust performance in the discovery models, and 3) representation from diverse geographic regions and pathogens. Noninfectious samples were not incorporated into the validation cohort due to availability of unique specimens and a desire to incorporate increased infectious etiologies. The NanoString GF-B/V and GF-B/V/N models were then fixed and applied to the independent validation cohort.

Confidence intervals were calculated using the epiR package in R. exact binomial for the sensitivity, specificity, and model accuracy⁵³. The approach of Simel et al. was used to calculate confidence intervals for the positive and negative likelihood ratios⁵⁴. Confidence intervals for the validation AUROCs were calculated using the method of DeLong⁵². A confidence interval for the overall accuracy of the GF-B/V/N model for the discovery cohort was estimated by taking 10,000 bootstrapped samples. We used the nonparametric Mood’s median test to calculate the p-value estimating the differences in median ages and to evaluate whether the proportion of women in bacterial samples was different than non-bacterial samples.

Ethical approval

Prospective collection of specimens and data after written informed consent by subjects or their legally authorized representatives, and assent was obtained for minors less than 18 years old. Studies were approved by Institutional Review Boards of Duke University Health System, Faculty of Medicine, University of Ruhuna, Johns Hopkins University, Naval Medical Research Center, Kilimanjaro Christian Medical Center Research Ethics Committee, Tanzania National Institute for Medical Research National Research Ethics Coordinating Committee, University of Otago Human Ethics Committee (Health), and the USA CDC. This study used deidentified specimens and clinical data, and was approved by Duke University Health System (Durham, NC) Institutional Revew Board (Duke IRB Pro00072857). All research was conducted in accordance with the Declaration of Helsinki.

Results

Participants and pathogens

The discovery cohort consisted of participants from the USA and Sri Lanka with median age 48 years (IQR 31–61; range 10–86 years), 48% female, 1.4% Hispanic, 28.5% White, 19% Black/African American, 49.5% Asian/South Asian (Table 1). The median age of the USA cohort was higher than the Sri Lankan cohort (54 years [(IQR 42–66] vs. 37 years [IQR 26–51], p = 0.51), although this was not statistically significant. Those with bacterial infections were more likely to be male (p = 0.001), but this was not site or pathogen specific. USA participants had more severe illness (intensive care, 16.4% [n = 25/152], mechanical ventilation, 8.5% [n = 13/152], and mortality 7.2% [n = 11/152]) than those in Sri Lanka (intensive care, 0.7% [n = 1/142], mechanical ventilation, 0.7% [n = 1/142], and mortality, 0.7% [n = 1/142]). However, determining severity of illness between internationally diverse clinical settings, types of infection, and standards of care may be misleading. Chronic HIV was low across the total cohort (3 USA in discovery cohort, 3 Tanzania in validation cohort), and although HIV status was not collected for Sri Lanka the incidence in the country is < 0.01%.

The discovery cohort included 102 participants with bacterial (42 with bloodstream infections and 60 bacterial zoonoses), 125 with viral (82 respiratory, 43 dengue), and 67 with non-infectious illness (e.g., pulmonary embolus, congestive heart failure, COPD/Asthma, cancer, autoimmune disorders). The validation cohort had 101 participants (52 bacterial, 49 viral) and represented a wider range of demographics, geographic locations (USA, Sri Lanka, Tanzania, Cambodia, and Australia), and pathogens (Table 1). Patients with non-infectious illness were not analyzed in the validation cohort.

Differential gene expression of global pathogens

To identify differentially expressioned genes, we employed a conservative approach, using a 5% FDR and a ≥ tenfold change in expression. We identified 38 unique genes increased at least tenfold in participants with bacterial illness, and these were divided into 18 primary clusters (Table 2). Transcripts corresponded to known pathways for acute phase reactants, antimicrobial killing, innate immunity, and immune response. Similarly, we identified 65 unique genes associated with increased expression by tenfold or greater in viral infection, and these were divided into 17 primary clusters (Table 2) primarily corresponding to interferon response and chemokine/cytokine pathways.

Bacterial versus viral classification: a simple binary model

We conducted predictive analysis to develop a binary model (Fig. 1A) using the entire transcriptome from the discovery cohort. The Global Fever-Bacterial/Viral (GF-B/V) model classified bacterial from viral disease with high accuracy when internally validated using fivefold cross-validation: AUROC of 0.93 (Fig. 1B), sensitivity of 84.2% (95% CI 75.6–90.7), specificity of 94.7% (95% CI 88.6–97.7), and overall accuracy of 89.7% (95% CI 85.0–93.4). Additional performance characteristics are shown in Table 3. The model demonstrated similar performance after stratifying for specific pathogen (Fig. 1D), site, age, sex, or race (Supplemental Fig. 2).

Table 3 Performance characteristics for Global Fever classifier models for acute bacterial and viral infection.

Full size table

To independently validate this model using a quantitative RT-PCR system that more closely approximates a clinical assay, we used the NanoString system to measure expression levels of 27 highly predictive genes (Supplemental Table 4A). After training a classification model on subjects from the discovery cohort, the model and its parameters were fixed and applied to the validation cohort. We incorporated both pathogen and geographic diverisity (Table 1). For the discrimination of bacterial and viral infection, the GF-B/V model an AUROC of 0.84 (95% CI 0.76–0.9) (Fig. 1C), sensitivity of 78.8% (95% CI 65.3–88.9), specificity of 84.3% (95% CI 71.4–93.0), and overall accuracy of 81.6% (95% CI 72.7–88.5) with additional performance characteristics reported (Table 3). Additionally, GF-B/V discriminated difficult-to-diagnose bacterial zoonotic pathogens not included in the discovery cohort, such as spotted fever group rickettsiae, B. pseudomallei, and Brucella spp. (Fig. 1E).

Classification of bacterial and viral infections in the setting of other illness: a complex model

The Global Fever-Bacterial/Viral/Noninfectious (GF-B/V/N) classifier provides two probabilities, a measure of bacterial infection or viral infection in the context of nonbacterial/nonviral illness as a control (Supplemental Fig. 1A). Theoretically, this model has the potential for identifying a co-infection if both the probability of bacterial and viral infection were high (Supplemental Fig. 1A). For classification of bacterial infection (bacterial vs. nonbacterial model) the AUROC was 0.92 (Supplemental Fig. 1B), with sensitivity 87.7% (95% CI 79.0–89.8), specificity 84.2% (95% CI 78.2–89.1), and accuracy 85.2% (95% CI 80.6–89.1) (Table 3). For the classification of viral infection (viral vs. nonviral model), AUROC was 0.91 (Supplemental Fig. 1C), with sensitivity 83.7% (95% CI 76.0–89.8), specificity 81.5% (95% CI 74.8–87.1), and accuracy 82.5% (95% CI 77.6–86.7) (Table 3). Similar to the binary model, the GF-B/V/N test demonstrated good performance for a broad range of bacterial and viral pathogens (Supplemental Fig. 1D,E).

Translation of the 2-model GF-B/V/N system to NanoString was exploratory in nature because it only validated the GF-B/V/N test for bacterial and viral illness, evaluating how often bacterial or viral disease was misclassified in the context of nonbacterial/nonviral illness. We measured expression of 33 genes for the bacterial model and 19 for the viral model (Supplemental Table 4B,C). In the validation cohort, the bacterial model had an AUROC of 0.84 (95% CI 0.76–0.93) (Supplemental Fig. 1F), sensitivity of 82.7% (95% CI 69.7–91.8), specificity of 80.4% (95% CI 66.9–90.2), and accuracy of 81.6% (95% CI 72.7–88.5) (Table 3). The viral model had an AUROC of 0.85 (95% CI 0.77–0.93) (Supplemental Fig. 1G), sensitivity of 76.5% (95% CI 62.5–87.2), specificity of 80.8% (95% CI 67.5–90.4), and accuracy of 78.6% (95% CI 69.5–86.1) for viral infection (Table 3). Performance was similar across pathogens (Supplemental Fig. 1H,I), except for a single Viridans group streptococcus case.

Discordant classifications

Discordant cases in the validation cohort were similar between the two classifiers (19 GF-B/V, 19 GF-B/V/N; with overlap of 15 for both models) (Supplemental Table 5). A review of these discordant cases did not identify any pattern with respect to site or pathogen. The relative increased number of Sri Lanka patients was nearly proportional to the total number in the whole cohort. Interestingly, when predictive genes were fixed and the model weights were allowed to vary among the validation cohort, performance improved.

Discussion

We utilized a 294-participant multinational prospectively enrolled cohort to develop a bacterial versus viral host-response classifier that incorporates LMIC with representation of zoonotic bacteria and arboviruses. While others have utilized publically available data to apply host-response transcriptional classifiers to atypical global infections³³, this cohort is the largest prospectively enrolled with robust clinical, phenotypic, and adjudication data. Translation of the GF-B/V test to a multiplex gene expression detection platform demonstrated good performance (overall accuracy of 81.6% [95% CI 72.7–88.5]) in independent validation despite different genetic backgrounds, geographies (five countries), and pathogens. For example, a person with a positive GF-B/V NanoString test in the validation cohort was 5-times more likely to have a bacterial infection and 3-times less likely with a negative test. Such a test could provide timely diagnostic reassurance to inform antibiotic use and guide clinical care.

Decreasing morbidity, mortality, and misuse of antimicrobials from infections requires improved diagnosis at the time a patient presents to care. LMIC have decreased laboratory infrastructure, so performing multiple pathogen-based tests is unrealistic. Accurate acute-phase pathogen-based diagnostics do not exist for many bacterial zoonotic infections, such as ricktettsial infections, that require different treatment from antibiotics empirically used for routinely cultivatable organisms. Point-of-care biomarkers commonly utilized in high-resource settings, like C-reactive protein and procalcitonin, have yielded mixed performance in LMIC (e.g., low specificity, poorer performance for bacterial zoonotic pathogens)^{27,50,55,56,57}, and are potentially affected by higher rates of malnutrition, parasitic disease, HIV, and co-infection⁵⁸. Host-response gene expression assays are poised to fill this void^{25,26,27,31,32,33,59,60}.

Tremendous progress has been made developing host-response diagnostics in HIC in multiple disciplines, including infectious diseases^31,59,60,61. Recently, an algorithmic approach utilizing publically available data extended this method to intracellular and atypical pathogens prevalent globally³³. Rao et al., utilize a co-normalization technique to diminish study variability and batch effects. While the signal for the bacterial versus viral classifier was preserved, the co-normalization technique could potentially reduce biological variability and artificially improve overall performance in a population with increased variability of pathogens and genetic ancestry. Additionally, use of publically available data does not align enrollment criteria or apply an even reference standard. Prospective validation of this promising work will be critical to determine performance in a real world population of global infections.

Taking a different approach, our study utilized existing biorepository specimens of prospectively enrolled patients that meet reliable eligibility criteria and apply a consistent diagnostic reference. This approach preserves biological variability while avoiding potential bias and confounding. Access to participant-level clinical, biologic, and etiologic data allows refinement of the cohort not possible for publically available data. Additionally, the GF-B/V incorporates a significant number of zoonotic bacterial pathogens that are both extracellular (e.g. Leptospirosis spp.) and intracellular (e.g. Ricketsial spp.) at the model development and validation phase, while other studies have a low percentage of Leptospirosis or other extracellular pathogens representated in LMIC settings³³.

A binary bacterial versus viral classifier provides a simple approach to identifying bacterial infections, but does not account for other treatable etiologies of suspected infection. Layered diagnostic tests using multiple binary classifiers, like GF-B/V/N, are more generalizable for a global population, and are attractive given the breadth of pathogen diversity and febrile illness globally. Precedent exists for layered transcriptional expression classifiers that incorporate other classes of illness^25,32. We demonstrate a more complex model can discriminate bacterial from viral infection in an independent validation cohort, but the absence of noninfectious samples in the validation cohort limits full evaluation in a real world population. Thus, we cannot comment on noninfectious illness, but simply on nonbacterial or nonviral disease. However, we demonstrate that misclassification by GF-B/V or GF-B/V/N is largely overlapping, reassuringly demonstrating that incorporating more complexity does not reduce performance in a limited population of bacterial and viral illness. Incorporating multiple models for this and other work has previously been shown and will need to be addressed going forward⁶². While exploratory, a model with this complexity is not available in other published work on global pathogens, such as leptospirosis or rickettsial infection^31,33,63,64. The composite model could provide a path forward in the complex milleu of global illness.

Host response biomarkers could change clinical practice, but expansion of these diagnostics to LMIC must be inexpensive, easy to operate, and clinically interpretable. Host gene expression diagnostics for non-infectious applications are considered high complexity tests, often run in referral laboratories. However, technical advances have enabled highly multiplexed quantitative, real-time PCR systems that operate in a sample-in, answer-out format with results available in < 1-h^32,36,60. As simpler host gene expression tests continue to be developed, cost-of-goods and simplicity will be key parameters for their implementation in LMIC settings⁶⁵. Host response-based biomarker panels have also extended to proteomics and metabolomics^64,66,67, which may be less expensive and amenable to field deployable diagnostics. Progress refining host-response biomarkers in international cohorts must occur alongside technological advances in platform development to allow more rapid translation to LMIC. The results presented here suggest easy translatability of this approach to LMIC.

GF-B/V and GF-B/V/N multi-analyte biomarkers have attractive features, but there are limitations to this study. Translation to a PCR-based detection system revealed lower accuracy in the validation cohort compared to the RNA-seq based classification in the discovery cohort. This could be due to technical differences (e.g., going from RNAseq to NanoString) but is also an expected difference between discovery and validation, the latter of which includes a wider array of infections and variability of illness. Analysis of discordant classifications suggests that genes used in the models have strong predictive power, but that individuals have variability in the amount, or weight, each gene contributes to the model. Consistent with this is the observation that both classifiers had a reduction of performance on pathogens not hightly represented in the discovery cohort (Viridians group Streptococcus, non-influenza respiratory viruses, Coxiella burnetii). The GF-B/V/N model is constrained by reliance on non-infectious illness as a control rather than being representative of febrile illness globally. Additional limited availability of high confidence noninfectious samples prevented incorporation into the validation cohort, prohibiting validation of the performance of the GF-B/V/N test for nonbacterial/nonviral illness or co-infection. It will be critical for future studies to perform iterations and optimization on expanded cohorts with increased pathogen (e.g. atypical viruses, tuberculosis, malaria, cryptococcus) and host diversity (e.g., a larger cohort of children and immunocompromised hosts) that would be expected to improve model weights, overall performance, and be more representative of febrile illnesses⁶².

We found that novel host transcriptional biomarkers could accurately discriminate diverse bacterial and viral infections, including those endemic in not only high-income temperate regions but also LMIC in the tropics. Translation of these tests to a custom multiplex gene expression platform, such as the NanoString, shows promise for identification of infections in increasingly diverse populations with the future possibility of point-of-care application. Host-response biomarkers to distinguish bacterial from viral infection could improve clinical care and antibiotic stewardship across the globe.

Data availability

All data in this article were generated as part of this work. All RNA sequencing data has been submitted to GEO under accession number GSE211567. NanoString transcripts are included in supplemental information. Token to access GSE211567: obqzkkoarjwpfct.

References

Rudd, K. E. et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: Analysis for the Global Burden of Disease Study. Lancet 395, 200–211 (2020).
Article PubMed PubMed Central Google Scholar
Farrar, J. J. J. T., Kang, G., Lalloo, D., & White, N. J. Manson's Tropical Diseases (Saunders, 2009).
Naghavi, M. et al. Global burden of antimicrobial resistance: Essential pieces of a global puzzle—Authors’ reply. Lancet 399, 2349–2350 (2022).
Article PubMed Google Scholar
Barr, D. A. et al. Mycobacterium tuberculosis bloodstream infection prevalence, diagnosis, and mortality risk in seriously ill adults with HIV: A systematic review and meta-analysis of individual patient data. Lancet Infect. Dis. 20, 742–752 (2020).
Article PubMed PubMed Central CAS Google Scholar
Crump, J. A. et al. Controlled comparison of BacT/Alert MB system, manual Myco/F lytic procedure, and isolator 10 system for diagnosis of Mycobacterium tuberculosis Bacteremia. J. Clin. Microbiol. 49, 3054–3057 (2011).
Article PubMed PubMed Central Google Scholar
Crump, J. A. et al. Invasive bacterial and fungal infections among hospitalized HIV-infected and HIV-uninfected children and infants in northern Tanzania. Trop. Med. Int. Health 16, 830–837 (2011).
Article PubMed PubMed Central Google Scholar
Crump, J. A. et al. Invasive bacterial and fungal infections among hospitalized HIV-infected and HIV-uninfected adults and adolescents in northern Tanzania. Clin. Infect. Dis. 52, 341–348 (2011).
Article PubMed PubMed Central Google Scholar
Gray, K. D. et al. Prevalence of mycobacteremia among HIV-infected infants and children in northern Tanzania. Pediatr. Infect. Dis. J. 32, 754–756 (2013).
Article PubMed PubMed Central Google Scholar
Havers, F. P. et al. Outpatient antibiotic prescribing for acute respiratory infections during influenza seasons. JAMA Netw. Open 1, e180243 (2018).
Article PubMed PubMed Central Google Scholar
Lee, G. C. et al. Outpatient antibiotic prescribing in the United States: 2000 to 2010. BMC Med. 12, 96 (2014).
Article PubMed PubMed Central Google Scholar
Shapiro, D. J., Hicks, L. A., Pavia, A. T. & Hersh, A. L. Antibiotic prescribing for adults in ambulatory care in the USA, 2007–09. J. Antimicrob. Chemother. 69, 234–240 (2014).
Article PubMed CAS Google Scholar
Tillekeratne, L. G. et al. Antibiotic overuse for acute respiratory tract infections in Sri Lanka: A qualitative study of outpatients and their physicians. BMC Fam. Pract. 18, 37 (2017).
Article PubMed PubMed Central Google Scholar
Wang, J., Wang, P., Wang, X., Zheng, Y. & Xiao, Y. Use and prescription of antibiotics in primary health care settings in China. JAMA Intern. Med. 174, 1914–1920 (2014).
Article PubMed Google Scholar
Al-Hadidi, S. H. et al. The spectrum of antibiotic prescribing during COVID-19 pandemic: A systematic literature review. Microb. Drug Resist. 27, 1705–1725 (2021).
Article PubMed PubMed Central CAS Google Scholar
Dhimal, M. et al. An outbreak investigation of scrub typhus in Nepal: Confirmation of local transmission. BMC Infect. Dis. 21, 193 (2021).
Article PubMed PubMed Central Google Scholar
Dittrich, S. et al. Orientia, rickettsia, and leptospira pathogens as causes of CNS infections in Laos: A prospective study. Lancet Glob. Health 3, e104-112 (2015).
Article PubMed Google Scholar
Steinbrink, J. M. et al. The host transcriptional response to Candidemia is dominated by neutrophil activation and heme biosynthesis and supports novel diagnostic approaches. Genome Med. 13, 108 (2021).
Article PubMed PubMed Central CAS Google Scholar
WHO. Antimicrobial Resistance—Global Report on Surveillance (World Health Organization, 2014).
Zaas, A. K. et al. Gene expression signatures diagnose influenza and other symptomatic respiratory viral infections in humans. Cell Host. Microbe 6, 207–217 (2009).
Article PubMed PubMed Central CAS Google Scholar
Bloom, C. I. et al. Transcriptional blood signatures distinguish pulmonary tuberculosis, pulmonary sarcoidosis, pneumonias and lung cancers. PLoS ONE 8, e70630 (2013).
Article PubMed PubMed Central ADS CAS Google Scholar
Mihret, A. et al. Combination of gene expression patterns in whole blood discriminate between tuberculosis infection states. BMC Infect. Dis. 14, 257 (2014).
Article PubMed PubMed Central Google Scholar
Singhania, A. et al. A modular transcriptional signature identifies phenotypic heterogeneity of human tuberculosis infection. Nat. Commun. 9, 2308 (2018).
Article PubMed PubMed Central ADS Google Scholar
Valim, C. et al. Responses to bacteria, virus, and malaria distinguish the etiology of pediatric clinical pneumonia. Am. J. Respir. Crit. Care Med. 193, 448–459 (2016).
Article PubMed PubMed Central CAS Google Scholar
Nikolayeva, I. et al. A blood RNA signature detecting severe disease in young dengue patients at hospital arrival. J. Infect. Dis. 217, 1690–1698 (2018).
Article PubMed PubMed Central CAS Google Scholar
Tsalik, E. L. et al. Host gene expression classifiers diagnose acute respiratory illness etiology. Sci. Transl. Med. 8, 322ra311 (2016).
Article Google Scholar
Robinson, M. et al. A 20-gene set predictive of progression to severe dengue. Cell Rep. 26, 1104-1111e1104 (2019).
Article PubMed PubMed Central CAS Google Scholar
Tillekeratne, L. G. et al. Previously derived host gene expression classifiers identify bacterial and viral etiologies of acute febrile respiratory illness in a south asian population. Open Forum Infect. Dis. 7, ofaa194 (2020).
Article PubMed PubMed Central Google Scholar
Ockenhouse, C. F. et al. Common and divergent immune response signaling pathways discovered in peripheral blood mononuclear cell gene expression patterns in presymptomatic and clinically apparent malaria. Infect. Immun. 74, 5561–5573 (2006).
Article PubMed PubMed Central CAS Google Scholar
Woods, C. W. et al. A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2. PLoS ONE 8, e52198 (2013).
Article PubMed PubMed Central ADS CAS Google Scholar
McClain, M. T. et al. A blood-based host gene expression assay for early detection of respiratory viral infection: An index-cluster prospective cohort study. Lancet Infect. Dis. 21, 396–404 (2021).
Article PubMed CAS Google Scholar
Miller, R. R. 3rd. et al. Validation of a host response assay, SeptiCyte LAB, for discriminating sepsis from systemic inflammatory response syndrome in the ICU. Am. J. Respir. Crit. Care Med. 198, 903–913 (2018).
Article PubMed PubMed Central CAS Google Scholar
Tsalik, E. L. et al. Discriminating bacterial and viral infection using a rapid host gene expression test. Crit. Care Med. 49, 1651–1663 (2021).
Article PubMed PubMed Central CAS Google Scholar
Rao, A. M. et al. A robust host-response-based signature distinguishes bacterial and viral infections across diverse global populations. Cell Rep. Med. 3, 100842 (2022).
Article PubMed PubMed Central Google Scholar
Sutherland, J. S. et al. Diagnostic accuracy of the Cepheid 3-gene host response fingerstick blood test in a prospective, multi-site study: interim results. Clin. Infect. Dis. 6, 66 (2021).
Google Scholar
Warsinske, H. C. et al. Assessment of validity of a blood-based 3-gene signature score for progression and diagnosis of tuberculosis, disease severity, and treatment response. JAMA Netw. Open 1, e183779 (2018).
Article PubMed PubMed Central Google Scholar
Tsalik, E. L. et al. Rapid, sample-to-answer host gene expression test to diagnose viral infection. Open Forum Infect. Dis. 6, ofz466 (2019).
Article PubMed PubMed Central Google Scholar
Tsalik, E. L. et al. An integrated transcriptome and expressed variant analysis of sepsis survival and death. Genome Med. 6, 111 (2014).
Article PubMed PubMed Central Google Scholar
Langley, R. J. et al. An integrated clinico-metabolomic model improves prediction of death in sepsis. Sci. Transl. Med. 5, 195ra195 (2013).
Article Google Scholar
Glickman, S. W. et al. Disease progression in hemodynamically stable patients presenting to the emergency department with sepsis. Acad. Emerg. Med. 17, 383–390 (2010).
Article PubMed PubMed Central Google Scholar
Tillekeratne, L. G. et al. An under-recognized influenza epidemic identified by rapid influenza testing, southern Sri Lanka, 2013. Am. J. Trop. Med. Hyg. 92, 1023–1029 (2015).
Article PubMed PubMed Central Google Scholar
Bodinayake, C. K. et al. Emergence of epidemic dengue-1 virus in the Southern Province of Sri Lanka. PLoS Negl. Trop. Dis. 10, e0004995 (2016).
Article PubMed PubMed Central Google Scholar
Uehara, A. et al. Analysis of dengue Serotype 4 in Sri Lanka during the 2012–2013 Dengue Epidemic. Am. J. Trop. Med. Hyg. 97, 130–136 (2017).
Article PubMed PubMed Central Google Scholar
Bodinayake, C. K. et al. Evaluation of the WHO 2009 classification for diagnosis of acute dengue in a large cohort of adults and children in Sri Lanka during a dengue-1 epidemic. PLoS Negl. Trop. Dis. 12, e0006258 (2018).
Article PubMed PubMed Central Google Scholar
Maze, M. J. et al. Risk factors for human acute leptospirosis in northern Tanzania. PLoS Negl. Trop. Dis. 12, e0006372 (2018).
Article PubMed PubMed Central Google Scholar
Pisharody, S. et al. Incidence estimates of Acute Q fever and spotted fever group rickettsioses, Kilimanjaro, Tanzania, from 2007 to 2008 and from 2012 to 2014. Am. J. Trop. Med. Hyg. 106, 494–503 (2021).
Article PubMed PubMed Central Google Scholar
Schully, K. L., & Clark, D. V. Aspiring to Precision Medicine for Infectious Diseases in Resource Limited Settings 105–115 (Elsevier, 2019).
Schully, K. L. et al. Melioidosis in lower provincial Cambodia: A case series from a prospective study of sepsis in Takeo Province. PLoS Negl. Trop. Dis. 11, e0005923 (2017).
Article PubMed PubMed Central Google Scholar
Rozo, M. et al. An observational study of sepsis in Takeo Province Cambodia: An in-depth examination of pathogens causing severe infections. PLoS Negl. Trop. Dis. 14, e0008381 (2020).
Article PubMed PubMed Central Google Scholar
Schully, K. L. et al. Next-generation diagnostics for melioidosis: Evaluation of a Prototype i-STAT Cartridge to detect Burkholderia pseudomallei biomarkers. Clin. Infect. Dis. 69, 421–427 (2019).
Article PubMed CAS Google Scholar
Do, N. T. et al. Point-of-care C-reactive protein testing to reduce inappropriate use of antibiotics for non-severe acute respiratory infections in Vietnamese primary health care: A randomised controlled trial. Lancet Glob. Health 4, e633-641 (2016).
Article PubMed PubMed Central Google Scholar
Sheng, T. et al. Point-prevalence study of antimicrobial use in public hospitals in southern Sri Lanka identifies opportunities for improving prescribing practices. Infect. Control Hosp. Epidemiol. 40, 224–227 (2019).
Article PubMed Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845 (1988).
Article PubMed CAS Google Scholar
Stevenson, M., E.S.w.c.f.T.N., Heuer, C., Marshall, J., Sanchez, J., Thornton, R., Reiczigel, J., Robison-Cox, J., Sebastiani, P., Solymos, P., Yoshida, K., Jones, G., Pirikahu, S., Firestone, S., Kyle, R., Popp, J., Jay, M., Reynard, C., Cheung, A., Singanallur, N., Szabo, A. & Rabiee, A. epiR: Tools for the Analysis of Epidemiological Data. R package version 2.0.50. https://CRAN.R-project.org/package=epiR (2022).
Simel, D. L., Samsa, G. P. & Matchar, D. B. Likelihood ratios with confidence: Sample size estimation for diagnostic test studies. J. Clin. Epidemiol. 44, 763–770 (1991).
Article PubMed CAS Google Scholar
Tan, T. L. et al. Comparison of sPLA2IIA performance with high-sensitive CRP neutrophil percentage PCT and lactate to identify bacterial infection. Sci. Rep. 11, 11369 (2021).
Article PubMed PubMed Central ADS CAS Google Scholar
Althaus, T. et al. Effect of point-of-care C-reactive protein testing on antibiotic prescription in febrile patients attending primary care in Thailand and Myanmar: An open-label, randomised, controlled trial. Lancet Glob. Health. 7, e119–e131 (2019).
Article PubMed Google Scholar
Lubell, Y. et al. Performance of C-reactive protein and procalcitonin to distinguish viral from bacterial and malarial causes of fever in Southeast Asia. BMC Infect. Dis. 15, 511 (2015).
Article PubMed PubMed Central Google Scholar
Van Hecke, O. et al. In-vitro diagnostic point-of-care tests in paediatric ambulatory care: A systematic review and meta-analysis. PLoS ONE 15, e0235605 (2020).
Article PubMed PubMed Central Google Scholar
Sweeney, T. E. et al. Validation of the sepsis metascore for diagnosis of neonatal sepsis. J. Pediatr. Infect. Dis. Soc. 7, 129–135 (2018).
Article Google Scholar
Ko, E. R. et al. Prospective validation of a rapid host gene expression test to discriminate bacterial from viral respiratory infection. JAMA Netw. Open 5, e227299 (2022).
Article PubMed PubMed Central Google Scholar
Martinez-Ledesma, E., Verhaak, R. G. & Trevino, V. Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm. Sci. Rep. 5, 11966 (2015).
Article PubMed PubMed Central ADS CAS Google Scholar
Bodkin, N. et al. Systematic comparison of published host gene expression signatures for bacterial/viral discrimination. Genome Med. 14, 18 (2022).
Article PubMed PubMed Central CAS Google Scholar
Mor, M. et al. Bacterial vs viral etiology of fever: A prospective study of a host score for supporting etiologic accuracy of emergency department physicians. PLoS ONE 18, e0281018 (2023).
Article PubMed PubMed Central CAS Google Scholar
Papan, C. et al. A host signature based on TRAIL, IP-10, and CRP for reducing antibiotic overuse in children by differentiating bacterial from viral infections: a prospective, multicentre cohort study. Clin. Microbiol. Infect. 6, 66 (2021).
Google Scholar
Manabe, Y. C. et al. Clinical evaluation of the BioFire Global Fever Panel for the identification of malaria, leptospirosis, chikungunya, and dengue from whole blood: A prospective, multicentre, cross-sectional diagnostic accuracy study. Lancet Infect. Dis. 22, 1356–1364 (2022).
Article PubMed PubMed Central CAS Google Scholar
Eden, E. et al. Diagnostic accuracy of a TRAIL, IP-10 and CRP combination for discriminating bacterial and viral etiologies at the Emergency Department. J. Infect. 73, 177–180 (2016).
Article PubMed Google Scholar
Langley, R. J. et al. Integrative “omic” analysis of experimental bacteremia identifies a metabolic signature that distinguishes human sepsis from systemic inflammatory response syndromes. Am. J. Respir. Crit. Care Med. 190, 445–455 (2014).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

We thank the 395 participants worldwide for the provision of samples and clinical data and the study teams that recruited them.

Disclaimer

The opinions expressed herein are those of the author(s) and are not necessarily representative of those of the Uniformed Services University of the Health Sciences (USUHS), the Department of Defense (DOD); or, the United States Army, Navy, or Air Force. The findings and conclusions in this report are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention. The views, opinions and/or findings contained in this report are those of the author(s) and should not be construed as an official Department of the Army position, policy or decision unless so designated by other documentation. In the conduct of research where humans are the subjects, the investigator(s) adhered to the policies regarding the protection of human subjects as prescribed by Code of Federal Regulations (CFR) Title 45, Volume 1, Part 46; Title 32, Chapter 1, Part 219; and Title 21, Chapter 1, Part 50 (Protection of Human Subjects).

Funding

This work is supported by the US Army Medical Research and Materiel Command under Contract No. W81XWH-16-C-0147. Tanzania studies were supported by US NIH NIAID R01TW009237.

Author information

These authors contributed equally: Emily R. Ko and Megan E. Reller.

Authors and Affiliations

Division of General Internal Medicine, Department of Medicine, Duke Regional Hospital, Duke University Health System, Duke University School of Medicine, 3643 N. Roxboro St., Durham, NC, 27704, USA
Emily R. Ko
Division of Infectious Diseases, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
Megan E. Reller, L. Gayani Tillekeratne, Thomas W. Burke, Micah T. McClain, Sunil Suchindran, Elizabeth Petzold, Ephraim L. Tsalik & Christopher W. Woods
Durham Veterans Affairs Health Care System, Durham, NC, USA
Megan E. Reller, L. Gayani Tillekeratne, Micah T. McClain, Matthew P. Rubach, Deng Madut & Christopher W. Woods
Duke Global Health Institute, Duke University, Durham, NC, USA
Megan E. Reller, L. Gayani Tillekeratne, Champica K. Bodinayake, Matthew P. Rubach, Deng Madut, John A. Crump & Christopher W. Woods
Department of Medicine, Faculty of Medicine, University of Ruhuna, Galle, Sri Lanka
L. Gayani Tillekeratne, Champica K. Bodinayake, Vasantha Devasiri, Wasantha Kodikara-Arachichi & John A. Crump
Clinical Research Unit, Department of Medicine, Duke University School of Medicine, Durham, NC, USA
Cameron Miller
Department of Biostatistics and Informatics, Duke University, Durham, NC, USA
Ricardo Henao
Institute for Medical Research, Durham, NC, USA
Bradly Nicholson
Division of Pediatric Infectious Diseases, Department of Pediatrics, Duke University School of Medicine, Durham, NC, USA
Adam Blatt
Danaher Diagnostics, Washington, DC, USA
Ephraim L. Tsalik
Department of Microbiology, Faculty of Medicine, University of Ruhuna, Galle, Sri Lanka
Ajith Nagahawatte & Ruvini Kurukulasooriya
Programme in Emerging Infectious Diseases, Duke-National University of Singapore, Singapore, Singapore
Matthew P. Rubach
Kilimanjaro Christian Medical Center, Moshi, Tanzania
Matthew P. Rubach, Venance P. Maro & John A. Crump
Kilimanjaro Christian Medical University College, Moshi, Tanzania
Venance P. Maro, Bingileki F. Lwezaula & John A. Crump
Maswenzi Regional Referral Hospital, Moshi, Tanzania
Bingileki F. Lwezaula
General Sir John Kotelawala Defence University, Colombo, Sri Lanka
Aruna D. De Silva
The Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, USA
Danielle V. Clark
Austere Environments Consortium for Enhanced Sepsis Outcomes (ACESO), Biological Defense Research Directorate, Naval Medical Research Center-Frederick, Ft. Detrick, MD, USA
Danielle V. Clark & Kevin L. Schully
Joint Departments of Pathology, School of Medicine, Uniformed Services University, Bethesda, MD, USA
J. Stephen Dumler
Centers for Disease Control and Prevention, National Center for Emerging Zoonotic Infectious Diseases, Atlanta, USA
Cecilia Kato & Renee Galloway
Centre for International Health, University of Otago, Dunedin, New Zealand
John A. Crump
Department of Medicine, Duke University School of Medicine, Durham, NC, USA
Geoffrey S. Ginsburg
National Institute of Health, Bethesda, MD, USA
Geoffrey S. Ginsburg
Diagnostic Systems Division, USAMRIID, Fort Detrick, Frederick, MD, USA
Timothy D. Minogue

Authors

Emily R. Ko
View author publications
You can also search for this author in PubMed Google Scholar
Megan E. Reller
View author publications
You can also search for this author in PubMed Google Scholar
L. Gayani Tillekeratne
View author publications
You can also search for this author in PubMed Google Scholar
Champica K. Bodinayake
View author publications
You can also search for this author in PubMed Google Scholar
Cameron Miller
View author publications
You can also search for this author in PubMed Google Scholar
Thomas W. Burke
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Henao
View author publications
You can also search for this author in PubMed Google Scholar
Micah T. McClain
View author publications
You can also search for this author in PubMed Google Scholar
Sunil Suchindran
View author publications
You can also search for this author in PubMed Google Scholar
Bradly Nicholson
View author publications
You can also search for this author in PubMed Google Scholar
Adam Blatt
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Petzold
View author publications
You can also search for this author in PubMed Google Scholar
Ephraim L. Tsalik
View author publications
You can also search for this author in PubMed Google Scholar
Ajith Nagahawatte
View author publications
You can also search for this author in PubMed Google Scholar
Vasantha Devasiri
View author publications
You can also search for this author in PubMed Google Scholar
Matthew P. Rubach
View author publications
You can also search for this author in PubMed Google Scholar
Venance P. Maro
View author publications
You can also search for this author in PubMed Google Scholar
Bingileki F. Lwezaula
View author publications
You can also search for this author in PubMed Google Scholar
Wasantha Kodikara-Arachichi
View author publications
You can also search for this author in PubMed Google Scholar
Ruvini Kurukulasooriya
View author publications
You can also search for this author in PubMed Google Scholar
Aruna D. De Silva
View author publications
You can also search for this author in PubMed Google Scholar
Danielle V. Clark
View author publications
You can also search for this author in PubMed Google Scholar
Kevin L. Schully
View author publications
You can also search for this author in PubMed Google Scholar
Deng Madut
View author publications
You can also search for this author in PubMed Google Scholar
J. Stephen Dumler
View author publications
You can also search for this author in PubMed Google Scholar
Cecilia Kato
View author publications
You can also search for this author in PubMed Google Scholar
Renee Galloway
View author publications
You can also search for this author in PubMed Google Scholar
John A. Crump
View author publications
You can also search for this author in PubMed Google Scholar
Geoffrey S. Ginsburg
View author publications
You can also search for this author in PubMed Google Scholar
Timothy D. Minogue
View author publications
You can also search for this author in PubMed Google Scholar
Christopher W. Woods
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.R.K. and M.E.R. wrote the manuscript. Project conception, design, planning and data interpretation were carried out by E.R.K., M.E.R., L.G.T., T.W.B., R.H., M.T.M., S.S., B.N., E.P., E.L.T., G.S.G., T.D.M., and C.W.W. Analysis and figures prep were performed by E.R.K., C.M., A.B., R.H., and S.S. Sample acquisition at global sites conducted by M.E.R., L.G.T., C.K.B., A.N., V.D., M.P.R., V.P.M., B.F.L., D.M., W.K.-A., R.K., A.D.D., D.V.C., K.L.S., and J.A.C. Laboratory work and etiology testing performed by T.W.B., C.K., R.D., J.S.D. and B.N.

Corresponding author

Correspondence to Emily R. Ko.

Ethics declarations

Competing interests

CWW is a consultant for Biomeme, Arena Pharmaceuticals, Biofire, FHI Clinical and sits on the advisory board for Biomeme, FHI Clinical, and Regeneron. CWW is also the acting Chief Medical Officer for Biomeme. CWW is a member of the board of directors for Global Health Innovation Alliance Accelerator. TWB hold equity in and is consultant for Biomeme. ELT consults for Biomeme and is employed by Danaher Diagnostics. CWW, ELT, MTM, TWB, and RH hold patents on genomic methods to diagnose and treat acute infections. All other authors reported no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Supplementary Table S5.

Supplementary Figure S1.

Supplementary Figure S2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ko, E.R., Reller, M.E., Tillekeratne, L.G. et al. Host-response transcriptional biomarkers accurately discriminate bacterial and viral infections of global relevance. Sci Rep 13, 22554 (2023). https://doi.org/10.1038/s41598-023-49734-6

Download citation

Received: 27 December 2022
Accepted: 11 December 2023
Published: 18 December 2023
DOI: https://doi.org/10.1038/s41598-023-49734-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Global fever discovery and validation cohorts

Samples and etiologic testing

Reference standard adjudication of etiology

Generation and normalization of transcriptomic data

Nanostring multiplex transcript detection platform

Statistical analysis

Ethical approval

Results

Participants and pathogens

Differential gene expression of global pathogens

Bacterial versus viral classification: a simple binary model

Classification of bacterial and viral infections in the setting of other illness: a complex model

Discordant classifications

Discussion

Data availability

References

Acknowledgements

Disclaimer

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links