Severe influenza infection has no effective treatment available. One of the key barriers to developing host-directed therapy is a lack of reliable prognostic factors needed to guide such therapy. Here, we use a network analysis approach to identify host factors associated with severe influenza and fatal outcome. In influenza patients with moderate-to-severe diseases, we uncover a complex landscape of immunological pathways, with the main changes occurring in pathways related to circulating neutrophils. Patients with severe disease display excessive neutrophil extracellular traps formation, neutrophil-inflammation and delayed apoptosis, all of which have been associated with fatal outcome in animal models. Excessive neutrophil activation correlates with worsening oxygenation impairment and predicted fatal outcome (AUROC 0.817–0.898). These findings provide new evidence that neutrophil-dominated host response is associated with poor outcomes. Measuring neutrophil-related changes may improve risk stratification and patient selection, a critical first step in developing host-directed immune therapy.
Influenza is one of the most prevalent respiratory virus infections in the world, with one out of five people suffering from it annually1. Severe influenza, the most serious form of influenza infection, is characterized by lung inflammation, refractory hypoxaemia, and respiratory failure. The change from a mild/moderate, self-limited “flu” illness toward severe influenza represents a critical transition point in the clinical course of influenza infection. The initial mild “flu” begins in the upper airway, where tissue damage is minimal and the infection is well controlled by an effective, local immune response. The spread of infection to the lower airway is accompanied by a dysregulated local and systematic immune response (both innate and adaptive immunity)2,3,4,5,6,7,8,9,10. The spread occurs rapidly (within days) resulting in airway inflammation and extensive alveolar damage. When fully developed, severe influenza leads to significant respiratory complications, such as acute respiratory distress syndrome or secondary bacterial pneumonia. Mortality of severe influenza is high, ranging from to 10% (seasonal influenza) to 65% (in pandemic H1N1 patients needing extracorporeal membrane oxygenation support)11,12. Early antiviral treatment may improve the outcome13. However, mortality remains high in severe cases because the influenza-related lung inflammation is driven primarily by immunopathology2,3,4,5,6,7,8,9,10.
Host-directed therapy offers a promising new approach to target immunopathology and may reduce the mortality of severe influenza14,15. However, a myriad of immunological abnormalities have been identified in severe influenza and it is unclear which of these should be targeted by host-directed therapy9,16. To identify host factors associated with severe disease, a unifying framework of influenza host response is needed. A rational approach to developing such framework is to use a network analysis approach to identify all disease-relevant host factors and rank these factors in the order of their relative contribution to immunopathology/outcomes17. The top-ranking host factors may then be targeted by using host-directed therapy—as has been successfully demonstrated in HIV medicine18.
Gene-expression analysis provides a unifying framework in understanding the host factors associated with influenza pathogenesis. A common approach is to measure gene expression of circulating leukocytes in infected patients19,20,21. Several recent studies have adopted this approach, but these studies were limited by including only asymptomatic individuals21, patients with mild illness19 or those presenting in early stage of the disease20. In this study, we use a network analysis approach to analyze the host response of circulating leukocytes in a large cohort of moderate-to-severe influenza patients. Using weighted gene co-expression network analysis, we identify host factors that were statistically significantly associated with severe influenza infection. We then quantify each factor’s contribution to infection by ranking them based on their strength of association with disease severity. Here, we report that this approach produces a global overview of the immunological landscape of the influenza infection and enable us to identify several key biological processes that underpin the development of severe influenza and fatality.
In this prospective study, we screened 720 patients who met the World Health Organization’s criteria for influenza-like illness. After initial evaluation, 154 patients had clinical and virological evidence of influenza infection. Of the 154 collected blood samples, 107 samples were randomly selected for microarray analysis; the remaining 47 samples were reserved for a subsequent validation analysis (Fig. 1).
Of the 107 samples assigned to microarray analysis, 63 were from patients with moderate infection who did not receive mechanical ventilation (referred as the “moderate group”) and 44 from patients with severe infection who developed respiratory failure requiring mechanical ventilation (referred as the “severe group”). Samples from healthy donors were also included (n = 52).
Initial analyses showed that patients in the severe group were more likely to have received antiviral treatment (p < 0.0001) and to have developed respiratory failure (p < 0.0001) (Table 1). The severe group also had a much higher morbidity/mortality (e.g., organ failure, length of stay, and mortality) compared to the moderate group. However, other than infection severity, the moderate and severe groups did not differ significantly in factors that might affect the gene-expression of host response, including age, gender, pre-existing illnesses, duration of symptoms, or the types of virus strains. (Table 1).
Host response in moderate and severe influenza infection
We applied principal component analysis (PCA) to the blood transcriptome data to visualize the overall trend in gene-expression values across all samples. This preliminary examination suggested that gene-expression differed between phenotypes (healthy controls, moderate, and severe) (Fig. 2). We then assessed whether this difference could be due to cell count variations between these groups, since gene-expression difference may be attributed to altered cell counts, altered transcripts per cell or both. This analysis showed that the cell count did not account for the observed difference in gene-expression since leukocytes in both groups were within normal ranges: moderate (8.1 × 1000/mm3) and severe (9.6 × 1000/mm3). Furthermore, leukocyte counts between these two groups were not significantly different (p = 0.76) (Supplementary Fig. 1).
Since peripheral blood consists of many cell subsets (e.g., dendritic cells, B cells, monocytes, and CD4/CD8 lymphocytes), the different cell compositions between moderate and severe groups might also account for the observed gene-expression difference. The cell subsets were not routinely measured in clinical practice and thus were not available for analysis in this study. To circumvent this problem, we decomposed the peripheral blood transcriptome into relative abundances of cell subsets by measuring gene-expression markers of 33 cell subsets (using digital cell quantification). This analysis showed that there were no differences in the relative abundance of cells subsets between the moderate and severe groups, with the exception of pro-erythroblasts (Supplementary Fig. 2). This finding, together with no significant difference in baseline characteristics or premorbid comorbidities (between the moderate and severe groups) as noted earlier, indicate that the observed gene-expression differences in PCA were mainly due to altered cellular states associated with infection severity, rather than with cell numbers, cellular subset composition, or patient-related factors.
Network analysis revealed five different biological themes
We next performed weighted gene co-expression network analysis (WGCNA) to identify disease modules associated with infection severity (Fig. 1). The WGCNA approach takes into consideration the inter-relationship between/within gene pathways, thus allowing us to quantify the complex interplay between cellular networks22. This analysis revealed 20 gene-expression modules (Fig. 3a). Of these 20 modules, 13 were statistically significant when correlated with infection status. Of these 13 modules, 6 modules contained more than 300 genes, which represented an adequate number of genes suitable for further analysis.
We noted that these six modules were well separated in gene-expression space (Fig. 3b), which suggested that the influenza host response is driven by multiple host factors, rather than a single dominant process. Further examination of the genes within these modules indicated that the modules corresponded broadly to five different biological themes (antiviral response, immune response, cell cycle, neutrophils, and cell metabolism) (Fig. 4). One module, the indeterminate module, did not display a discernible biological theme. Within each module, multiple cell types were represented (e.g., lymphocytes, monocytes, and natural killer cells). However, the neutrophils module was a notable exception because most genes were neutrophil-related (only a few genes were related to non-neutrophils leukocytes).
We asked whether the biological themes observed in these modules could be related to time-dependent changes in blood composition during infection. Influenza infection is known to be associated with a relative lymphopenia/neutrophilia, caused by a leukocyte redistribution between blood and peripheral compartments (e.g., bone marrow). This redistribution is usually transient and appears only during the first 2 days of influenza infection 23. For most patients in this study, collection of blood samples occurred between day 4 and day 6 of infection (Table 1), thus making it unlikely that the observed biological themes were related to leukocyte redistribution. Indeed, leukocyte differentials in these patients did not reveal any evidence of lymphopenia or neutrophilia. Moreover, an analysis of variance of the module expression levels did not detect any changes related to cell count variability (see below), further supporting our hypothesis that the observed biological themes reflected functional changes in cell states, rather than leukocyte redistribution or changes in blood composition.
Host response correlated with infection severity
Next, to identify the main determinants of module’s gene expression, we performed an analysis of variance (ANOVA) using the overall expression level of each module (“eigengene”) as dependent variable and host factors (infection severity, age, and cell counts) as independent variables. This analysis showed that infection severity, but not age or cell counts, was the main determinant of modular expression in neutrophils, cell cycle, and immune response modules (Fig. 5). As infection severity increased from moderate to severe, these three modules showed statistically significant changes in their gene-expression levels. In contrast, cell metabolism, antiviral response, and indeterminate modules did not show statistically significant changes in modular expression (cell metabolism p = 0.12, antiviral response p = 0.11, indeterminate p = 1.00), indicating that these modules were not associated with the severe infection. Put together, these findings suggest that all six modules played a role in the onset of influenza infection (from healthy state transitioning to moderate infection), but only neutrophils, cell cycle, and immune response modules were associated with severe, life-threatening illness.
To further quantify the extent of each module’s association with severe disease, we calculated the differences in the module’s expression levels (“eigengene differentials”) between healthy state and moderate influenza, or between moderate influenza and severe influenza (Fig. 3c). This calculation revealed that, as infection severity increased from moderate to severe, the neutrophils module displayed the highest increase in modular expression. Since moderate and severe influenza patients did not differ in their absolute neutrophil counts (Supplementary Fig. 1), this increase likely reflected differences in neutrophil activation, rather than changes in neutrophil cell numbers.
Neutrophil module showed neutrophil activation and migration
Further examination of the neutrophils module confirmed the presence of neutrophil activation, as evidenced by the detection of a neutrophil transcriptomic signature that is typically found in activated neutrophils homing toward infected lung tissue in acute lung injury24. This “activation” signature included an increased expression in genes involved in delayed apoptosis (BCL2A1 and SMPD3), activated inflammatory pathways (CLEC4D, GPR84, S100A12, and S100A8) and increased neutrophil extracellular traps (GADD45A, HIST1H4H, and HIST2H2AA4) (Fig. 4a).
Activated neutrophils usually migrate toward the infected lung after influenza infection25. Consistent with this, we detected genes within the neutrophils module that are involved in chemotaxis, neutrophil–endothelium interaction and transendothelial migration. These migration genes, include CD177 (neutrophil adhesion to endothelium and transmigration), PVRL2 (transendothelial migration), VNN1 (hematopoietic cell trafficking), GPR84 (neutrophil chemotaxis), MMP9 (neutrophil activation and migration), S100A8 and S100A12 (neutrophil recruitment, chemotaxis and migration) (Fig. 4a).
Since an increased neutrophil activation is sometimes associated with a cytokine-mediated inflammation in influenza infection26, we searched for transcriptomic evidence of “hypercytokinaemia”, a phenomenon frequently reported in severe influenza patients14. We did not find strong evidence of hypercytokinaemia on a gene-expression level (Fig. 4d). The absence of pro-inflammatory gene upregulation might be related to the sampling method used in this study, in which only peripheral blood was measured (whereas the infected lung is thought to be the main source of inflammatory cytokines).
Immune response module revealed broad downregulation
In contrast to the neutrophils module, the immune response module showed a broad downregulation in gene expression (Figs. 4b and 5). This downregulation included key genes involved in innate and adaptive immunity, such as GZMH, KLRB1, SH2D1B (NK cells function), CCR7 (T cells activation/homing), TCF7 (T cell differentiation), IL2RB (T cell-mediated immunity), CX3CR1 (lymphocyte migration and monocytes survival), GNLY (T cell cytotoxicity), GZMB (cytotoxic lymphocyte-induced apoptosis of infected cells), KLRG1 (NK cells-mediated activation of virus-specific CD8 lymphocytes), and FGFBP2 (cytotoxic lymphocyte-mediated immunity).
Downregulation of gene expression was also observed in genes associated with CD4/CD8 T cells, which are critical cells required for viral clearance. This downregulation was more marked in the severe group than the moderate group (Supplementary Fig. 3). In addition, a number of genes related to T cell receptor signaling were also downregulated (Supplementary Fig. 4).
The above findings indicated reduced expression across both innate and adaptive immunity, a phenomenon typically observed in patients with severe infection27. HLADR downregulation, a well-established marker of immune downregulation28, showed that this downregulation was more marked in the severe group than the moderate group (Supplementary Fig. 5). This finding was also consistent with the ANOVA analysis above (Fig. 5) and was confirmed by pathway analysis (see below). It was also in keeping with our observation that severe influenza patients had a significantly higher incidence of secondary complications known to be associated with immune suppression, including bacterial co-infection and multiple organ failure 29 (Table 1).
Cell cycle pathway suggested host–virus interaction
The cell cycle module displayed the second highest increase in modular expression from moderate-to-severe influenza. Examination of the genes within this module revealed changes in several critical transition points in cell cycle pathways (Fig. 6), which were previously identified in influenza infection30–32. These changes, included delayed transition of host cell from G0 to G1 phase and downregulation of Cycling D, a key regulator of early cell cycle G1 phase31,32. These changes are in keeping with findings of several previous studies, which showed that cell cycle transition points could be manipulated by influenza virus to facilitate viral survival and to escape host inhibition30,31,32.
Pathway analysis confirmed module biological themes
Pathway analysis was performed in all 6 modules, including the neutrophils (783 genes), cell cycle (408 genes), immune response (2759 genes), cell metabolism (645 genes), antiviral response (522 genes), and indeterminate (325 genes) modules. This analysis confirmed our earlier findings in that the neutrophils module showed neutrophil migration toward injury (e.g., chemotaxis, neutrophil–endothelium interaction, and transendothelial migration), the immune response module showed downregulated immune pathways, the cell cycle module, and the antiviral module showed upregulated pathways (Fig. 6, Supplementary Fig. 6).
The pathway analysis revealed a marked concordance between our observations and influenza-related mechanisms previously reported, including changes in a large number of interferon-derived genes (Figs. 4c and 5). In addition to the interferon pathways, the pathway analysis identified other pathways that have been previously reported, including increased IL-10 signaling33, reduced innate immunity7, impaired CD4 and CD8 response4,5,6 and reduced natural killer cell function8. The pathway analysis also identified several mechanisms that have not been previously reported, including downregulation of several key T cell functions (e.g., cell activation/differentiation and Th1/Th2 differentiation) and reduced antigen presentation (MHC class II), both of which were consistent with our earlier findings of downregulated immune function.
Internal and external validation
We performed leave-one-out cross-validation of three main modules (neutrophils, cell cycle, and immune response) (Supplementary Method). To be consistent with the original analysis, all samples included in the WGCNA (63 moderate influenza and 44 severe influenza samples) were also used in the cross-validation. The correct classification rate (i.e., predicting whether an unidentified sample belongs to a moderate or severe patient) was computed during the cross-validation. To ensure the findings were not affected by the choice of the prediction algorithm, we deployed seven machine-learning algorithms during the cross-validation process (Supplementary Table 1). The results of this cross-validation showed that all three modules performed well in predicting class membership (moderate vs. severe). Notably, most algorithms consistently identified the neutrophils module as the best performing module in distinguishing between moderate influenza and severe influenza samples (Supplementary Table 1).
We performed RT-PCR validation of the microarray findings on a small number of selected genes (see Supplementary Method). The RT-PCR measurements showed that, on these selected genes, microarray data correlated well with PCR data (Supplementary Fig. 8). We also assessed the protein expression of the CD177 gene, the most upregulated differentially expressed gene in patients with severe influenza (Fig. 7a). Blood samples from influenza infected patients in a separate study (n = 26) were used for this purpose. Using flow cytometry analysis (Supplementary Method), we found that CD177 protein expression in blood also distinguished between moderate and severe influenza (p = 0.038) (Supplementary Fig. 9).
We performed external validation by downloading the full microarray dataset of a similar study (GSE 111368). This external dataset was comparable to our study in terms of sample size (107 vs. 109) and infection severity (severe, moderate, and healthy controls). Because the two datasets used different microarray platforms (Agilent vs. Illumina), we restricted our analysis to the probes represented in both microarray platforms. This analysis revealed three major findings. Firstly, among the 20 most highly upregulated, differentially expressed genes (moderate vs. severe), there was 80% concordance between the two datasets (Supplementary Table 2). Notably, many of these concordant genes encode proteins relating to neutrophil functions, indicating there was biological similarity between the two datasets among the top-ranking genes. Secondly, among all the differentially expressed genes common to both datasets (a total of 356 upregulated and downregulated genes), pathway analysis revealed two major themes similar to our original finding—for the upregulated genes, the main biological theme in both datasets was excessive neutrophil activation, and for the downregulated genes, the main biological theme in both datasets was a reduced immune response (Supplementary Table 3). Thirdly, representative genes related to these two themes (neutrophils and the immune response) discriminated between moderate and severe influenza in both datasets (Supplementary Fig. 10).
Neutrophil activity predicted patient outcome
The findings presented heretofore indicate that, among the six main modules, the neutrophils module showed the strongest association with disease severity (Figs. 3c and 5). Recent animal studies suggest several possible mechanisms to explain this strong association, including the possible role of neutrophils extracellular traps (NETs) in causing lung injury in influenza25,34. In keeping with this, we found that the top 30 differentially expressed genes (Supplementary Fig. 7) contained many NETs-related genes, including defensin, myeloperoxidase, elastase, neutrophil gelatinase and cathepsin, all of which are well-known markers of NETs 35.
Given the prominent roles displayed by neutrophil-related genes in our data, and the previously reported etiological link between neutrophil activity and influenza severity25,34, we hypothesized that changes in neutrophil state could predict patient outcome. To test this hypothesis, we performed a separate study to evaluate whether changes in neutrophil gene expression could accurately predict influenza mortality in our patients. CD177 was selected as a representative gene of neutrophil activation state because CD177 is a neutrophil-specific marker 36 and importantly, it was the most highly differentially expressed gene in severe infection (Fig. 7a). The association between CD177 expression levels (measured by real-time quantitative PCR) and patient outcomes (e.g., mortality) was measured by area-under-the receiver–operator curve, using the established method37. Two independent datasets were used in this analysis; a “discovery set” consisted of 41 patients from the original microarray analysis (the other remaining samples did not contain enough RNA), and a “validation set” consisted of 47 patients for whom blood samples were not used in the original analysis (Fig. 1, Supplementary Table 4).
This analysis showed that the CD177 gene had a significantly higher expression in nonsurvivors than survivors (Fig. 7b). Furthermore, nonsurvivors had a significantly higher number of neutrophils in their peripheral blood (Fig. 7c), and this was associated with a greater degree of oxygenation failure in their lung (Fig. 7d). Importantly, area under the curve (AUC) analysis of the receiver operating characteristic curve (Fig. 7e, f) showed that CD177 expression predicted influenza-related deaths in both the discovery set (AUC 0.817, 95% confidence interval: 0.602–0.921) and the validation set (AUC 0.898, 95% confidence interval: 0.769–0.957). Based on Youden index, CD177 displayed sensitivity 0.80 (95% confidence interval 0.28–0.99), specificity 0.75 (95% confidence interval 0.58–0.88), positive predictive value 0.31 (95% confidence interval: 0.18–0.48), and negative predictive value 0.96 (95% confidence interval 0.82–0.99). In addition, CD177 expression better discriminated survivors vs. nonsurvivors than the traditional clinical score APACHE (AUC 0.625, 95% confidence interval: 0.230–0.843) (Fig. 7g, h). Collectively, these findings support our hypothesis that an excessive neutrophil activation state was linked to a poor outcome in influenza infection.
Numerous host factors contribute to the pathogenesis of influenza infection. However, the relative contribution of individual host factor remains poorly defined9. This uncertainty represents a major roadblock in the design of host-based drug therapy, which has long been recognized as a priority in the global effort to reduce influenza-related mortality. Here, we assembled a large cohort of influenza-patients with well characterized phenotypes in whom we identified six modules of the systemic host response associated with influenza infection. Three of the six modules (neutrophils, cell cycle and immune response) showed an association with disease severity. Of these three modules, the neutrophils module displayed the highest correlation with disease activity. Within the neutrophils module, we identified several key pathogenic pathways in circulating neutrophils, including neutrophil extracellular traps formation and neutrophil-inflammation, both of which have been previously implicated in the pulmonary pathogenesis of severe influenza25,34. Furthermore, we found that excessive neutrophil activity was present in patients with severe respiratory failure and that increased expression of neutrophil activation marker could predict influenza-related fatality. Collectively, these findings provide a clinically relevant framework to better understand influenza pathogenesis and revealed a previously under-appreciated link between neutrophils and severe influenza infection.
An important caveat is warranted in the interpretation of our findings—that causality cannot be inferred from our findings. This is due to two main reasons. First, this is a single time point study; no longitudinal data is provided. Second, the patients were recruited relatively late in the course of their illness; patients from an early phase of the infection were under-represented in the cohort (since symptoms were not severe enough or the patient had not sought medical attention). Due to these reasons, it remains impossible to conclude whether the observed findings (e.g., neutrophil activation) were a cause or a consequence of severe disease. A longitudinal analysis could address these issues since samples can be collected at symptom onset and continuously until respiratory failure occurs (from subclinical state, to symptom development and finally to respiratory failure). This approach captures the time-dependent dynamics of the host response, which may be more informative.
The study findings highlight the dichotomous roles of host response and add nuance to our current understanding of influenza infection. Immunologists have long agreed that the same host factors responsible for lung injury are also critical for efficient control of influenza virus replication2. For example, excessive neutrophil response is associated with severe influenza, as observed in our study and in previous reports25. However, a robust neutrophil response is also a necessary part of the host defense35. Indeed, this study shows that patients with moderate influenza had evidence of neutrophil activation, albeit only modestly. This finding consolidates the current view that the host response operates on a continuum (from benefit to harm), and that preserving some degree of a beneficial host response along this continuum should be an important consideration in the design of host-directed therapy.
The main strengths of this study are its large sample size, its relevant clinical context and the global view of the host response generated from within this context. The large sample size enabled us to examine the full spectrum of infection severity, ranging from moderate influenza to severe influenza-induced lung injury. The use of whole blood sampling helped us generate an integrated view of the leukocyte-mediated host response by preserving the interactions between different leukocyte subsets, which provides a realistic measure of how immune cells interact during infection. The transcriptome analysis provided a broad coverage of gene functions, thereby allowing a systematic interrogation of many pathways (although we are cognizant of the limitations of transcriptomic analysis, including the need for further validation on a protein level). Collectively, these study design features allowed us measure influenza host response in a clinically relevant context across a broad array of immune cells.
The findings of this study are in line with previous animal studies, which suggested that neutrophils play a prominent role in causing severe disease in influenza infection25,34. However, neutrophils are unlikely to be the only host factors that influence disease severity; a more plausible scenario is that severe disease results from the interplay between neutrophils and other host factors38. In this study, we attempted to capture this interplay by using an “unsupervised” network analysis approach (WGCNA). Typically, an “unsupervised” approach does not use a priori phenotype information (e.g., infection severity) or prior assumption about the underlying biology39. Thus, it allowed the generation of an unbiased view of all pathways expressed in circulating leukocytes. Another advantage of the WGCNA approach is that it treats the host response as a “whole system”, rather than individual components with few interconnectivities. With these methodological considerations, an integrated, global view of host factors was generated from our analysis allowing us to identify the main host factors associated with disease severity.
This global analysis allowed us to identify neutrophils as the dominant host factors associated with patient outcome. Similar findings have been provided by studies conducted in animal models25,34. It remains an intriguing question to ask whether modulation of these host factors, such as the neutrophil response, could modify disease outcome. This question has recently been investigated in an animal model40; but a randomized controlled clinical trial in humans would provide the most definitive answer.
As with all observational studies, this study could be biased due to additional confounders. Patient level confounders were likely; however, in our analysis, both the moderate and severe groups were well balanced in baseline characteristics. Cell level confounders were also possible; however, our analyses had carefully considered and allowed for the possible bias introduced by cell number variability. As a further step to minimize bias, we included total leukocyte counts and neutrophil counts in the ANOVA model, which showed that cell counts had no effect on gene-expression levels. Another potential limitation of this study was that pulmonary leukocytes were not measured in our patients; this measurement would have expanded the insights gained. However, it is worthwhile pointing out that performing bronchoscopy in these patients (in order to retrieve pulmonary leukocytes in infected lung) was difficult to justify on ethical grounds because these patients were critically ill and were at a high risk of rapid decompensation during bronchoscopy.
In conclusion, this study helps identify a previously under-appreciated link between neutrophils and severe disease in influenza infection. Further mechanistic study is needed to unravel the mechanism underlying the neutrophil-related host response and thus help identify novel therapeutic target.
Study design and participants
We recruited patients from 20 hospitals (Australia, Canada and Germany) during the period from 2009 to 2016 (Fig. 1). The eligibility criteria, included (1) age greater than 18 years and (2) World Health Organization definition of influenza-like illness (fever of 38 °C or higher, cough and illness onset within the last 10 days). All eligible patients were assessed by an admitting physician for likelihood of influenza infection. Patients with a high likelihood of infection, based on history and clinical features, were enrolled into the study. Airway samples (nasopharyngeal swab, throat sample or sputum) and a 2.5 ml peripheral blood sample (into PAXgene tubes) were obtained from each study participant. Routine investigations were performed, as determined by the admitting physician. Airway samples were tested for bacterial pathogens and common respiratory viruses. Blood samples in PAXgene tubes were later processed for microarray analysis (Agilent 8× 60k Human V3). The study was approved by the institutional review board of each participating institution. Informed consent was obtained from all participants.
Microbiological and virology testing
Nasopharyngeal, sputum, urine, and blood samples were obtained at admission. In patients admitted to intensive care unit, additional respiratory samples were obtained from bronchoalveolar lavage or tracheal aspirates. Standard microbiological testing was performed in these samples, including sputum Gram stain and culture. Testing for atypical respiratory pathogens (Chlamydophila pneumoniae, Mycoplasma pneumoniae, and Legionella pneumophila) was performed in selected patients at the discretion of treating physicians. All patients were tested for respiratory viruses using nucleic acid PCR. The PCR panel included primers for influenza A, influenza B, respiratory syncytial virus, rhinovirus, parainfluenza virus, and human metapneumovirus. Only patients who were positive for influenza virus were included in the main analysis.
Influenza infection was defined as the identification of either influenza A (H1N1 or H3N2) or influenza B using nucleic acid amplification (real-time PCR) in a participant with clinical features consistent with influenza infection. Moderate influenza infection was defined as disease with significant symptoms resulting in presentation to an emergency department, but without the need for invasive respiratory support. Severe influenza infection was defined as influenza infection with significant respiratory failure requiring endotracheal intubation and mechanical ventilation. Case assignment (moderate or severe group) was done retrospectively, based on a full review of clinical features, virology testing and laboratory investigations, and was performed by two consultant physicians independent of those physicians admitting the participants. Discrepancies in phenotype assignments were resolved by a third physician.
RNA extraction, normalization, and microarray analysis
In each sample, whole-blood RNA was extracted from PAXgene tubes as per manufacturer’s protocol (QIAGEN PreAnalytiX—Blood RNA version 2; 2015). After checking RNA integrity on Bioanalyzer (Agilent Technologies; Waldbronn, Germany), 100 ng of total RNA were applied for Cy3-labeling reaction using the one color Quick Amp Labeling protocol (Agilent Technologies; Waldbronn, Germany). Labeled cRNA was hybridized to Agilent 8 × 60k Human V3 (Design ID: 072363) microarrays for 16 h at 68 °C and scanned using the Agilent DNA Microarray Scanner. Microarray data were then analyzed using the R software package (version 3.1.3). Preprocessing steps included background correction, adding an offset of 50, quantile normalization and annotation using the limma package and Agi4x44PreProcess packages. Multigroup comparisons and identification of differentially expressed probe sets were performed using limma with Benjamini and Hochberg correction for multiple testing. Differentially expressed genes were identified based on an adjusted p value of <0.05, and exhibiting more than a twofold difference in expression levels ([log2] > 1). Full dataset of the gene-expression data is available at the National Centre for Biotechnology Information Gene Expression Omnibus (GEO accession number GSE101702).
Weighted gene co-expression network analysis
Overview of weighted gene co-expression network analysis (WCGNA)—to identify groups of co-regulated genes (“modules”) associated with infection severity, we applied unsupervised hierarchical clustering across microarray data of all samples (Fig. 1). WGCNA was used as a clustering method. The main advantage of WGCNA is that it takes into account the inter-relationship between and within networks, thereby allowing a more biologically meaningful interpretation41. WGCNA organizes the transcriptome into tightly correlated sets of transcripts. It first generates a pairwise similarity matrix based on expression correlation, which is transformed into an “adjancy matrix”. This results in a highly modular network topology. Hierarchical clustering (using average linkage) is then used to define modules within this network. The overall expression of a given module is summarized by the “eigengene” (the first principle component of the gene-expression matrix). We used the eigengene values to calculate the “eigengene differentials” (the difference in eigengene values between illness severity), which quantifies each module’s association with severe disease. ANOVA was used to determine the association between the eigengene (dependent variable) and infection severity, age, gender, and cell counts (independent variables). Modules that were significantly associated with disease severity were further investigated by pathway analysis (see below).
Procedure of WCGNA—we used the R package WGCNA (version 1.5.1) and we selected the top 10,000 most variant probesets as input and applied a soft threshold of 10 for network construction. This allows the construction of Pearson’s correlation matrices for all pairs of genes. The correlation matrices were then transformed into an adjacency matrix using a power function f(x) = xβ. The parameter β of the power function was determined such that the resulting adjacency matrix was scale free. The adjacency matrix was transformed into a topology overlap, which measures the connectivity among all the genes in the network. Based on this topology overlap, hierarchical average linkage clustering was then used to construct the dendrogram and identify gene co-expression modules that contain the maximal sets of inter-connected genes. More detailed information regarding the WCGNA procedure can be found in WCGNA website: (https://labs.genetics.ucla.edu/horvath/CoexpressionNetwork/). In this study, the following parameters were used for WCGNA analysis: TOMType: unsigned network, minModuleSize: 30, reassignThreshold: 0.25, power: 10 and reassignThreshold: 0.
We identified the biological theme in each module by performing a pathway analysis using the MetaCore (https://portal.genego.com/). MetaCore is a web-based, integrated software suite for functional analysis of omics data. It uses a topology approach (similar to WCGNA), which takes into account the interaction between and within pathways, the position of the genes within each pathway and the interaction between genes. We uploaded genes from each module (identified earlier in WCGNA) into MetaCore and used the pathway enrichment analysis option within the software. The enrichment analysis identifies coordinated changes in gene-expression in a predefined group of functionally related pathway genes, based on a curated database of known protein–protein interaction, signaling and metabolic pathways. To ensure our results were biologically meaningful, pathway analysis was restricted to modules which contained at least 300 genes.
Digital cell quantification
We inferred the immune cell quantities in each blood sample using the decomposition-based digital cell quantification (DCQ) method, which has been previously validated in influenza infection4. DCQ uses a large reference library (“immune cell compendium”) consisting of the transcriptional profiles of a predefined set of genes that encode cell surface markers (i.e., that identify immune cell types by fluorescence-activated cell sorting). Here, we used the DMAP expression data for the immune cell compendium and the R package ComICS for DCQ analysis (https://cran.r-project.org/web/packages/ComICS/index.html). The DCQ output was visualized as a heatmap (Supplementary Fig. 2), which provides a quantitative measure of the “relative” abundance of each immune cell types, namely the change in cell quantity between a sample of a given phenotype (e.g., severe influenza) and a sample in steady state (e.g., healthy control).
Sample size calculation
The sample size was chosen to have 90% power in detecting differentially expressed genes between severity groups (moderate vs. severe), assuming a false-discovery rate of 0.05. Based on gene-expression variance and effect size estimated from our previous studies42,43, we calculated that a sample size of 46 in each severity group would be required. In total, 750 individuals with flu-like illness were needed to obtain 46 influenza-positive subjects in each severity group (moderate or severe), based on the estimation that (1) only 25% of flu-like illness individuals would be test-positive for influenza virus (estimated from our pilot data), and (2) among those positive for influenza, 13% would develop moderate or severe influenza infection (based on publicly available data of 70,000 laboratory confirmed hospitalized patients with influenza infection44).
Comparisons between two severity groups (moderate vs. severe) were calculated using the unpaired two-tailed Student’s t test or the nonparametric Mann–Whitney U test where appropriate. To assess outcome prediction in the independent validation study, area-under-curve (and its confidence interval) was calculated using the receiver–operator-characteristic curve analysis, as implemented in NCSS software (2018, Utah, USA, ncss.com/software/ncss). General statistical analyses were performed using PRISM and R software package, and bioinformatic analyses were performed using R software package.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Microarray data underlying Figs. 2, 3, 4, 5, 6, 7a and Supplementary Figs. 2, 3, 4, 5, 6, 7, 10, 11 are available at Gene Expression Omnibus (Accession numbers: GSE 101702 and GSE 111368); [https://www.ncbi.nlm.nih.gov/geo/]. Source Data underlying Fig. 7b–h and Supplementary Figs. 1, 8 are provided as Source Data file. All other data are available from the corresponding author upon reasonable requests.
Codes and derived data used for the analysis of the microarray expression (normalized data matrix, derived data, PCA, WGCNA, heatmaps, volcano plots, DCQ analysis, boxplots for gene expression, scatter plots, boxplots for module eigengenes, VENN diagrams for modules) is publicly available at [https://doi.org/10.22000/152].
Hayward, A. C. et al. Comparative community burden and severity of seasonal and pandemic influenza: results of the Flu Watch cohort study. Lancet Respir. Med. 2, 445–454 (2014).
La Gruta, N. L., Kedzierska, K., Stambas, J. & Doherty, P. C. A question of self-preservation: immunopathology in influenza virus infection. Immunol. Cell Biol. 85, 85–92 (2007).
Short, K. R., Kroeze, E. J. B. V., Fouchier, R. A. M. & Kuiken, T. Pathogenesis of influenza-induced acute respiratory distress syndrome. Lancet Infect. Dis. 14, 57–69 (2014).
Li, C. K. F. et al. Preexisting influenza-specific CD4+ T cells correlate with disease protection against influenza challenge in humans. Nat. Med. 18, 276–282 (2012).
Wang, Z. et al. Recovery from severe H7N9 disease is associated with diverse response mechanisms dominated by CD8+ T cells. Nat. Commun. 6, 1–12 (2015). 1AD.
Sridhar, S. et al. Cellular immune correlates of protection against symptomatic pandemic influenza. Nat. Med. 19, 1305–1312 (2013).
Kobasa, D., Jones, S. M., Shinya, K., Kash, J. C. & Copps, J. Aberrant innate immune response in lethal infection of macaques with the 1918 influenza virus. Nature 445, 319–323 (2007).
Guo, H. et al. The functional impairment of natural killer cells during influenza virus infection. Immunol. Cell Biol. 87, 579–589 (2009).
Herold, S., Becker, C., Ridge, K. M. & Budinger, G. R. S. Influenza virus-induced lung injury: pathogenesis and implications for treatment. Eur. Respir. J. 45, 1463–1478 (2015).
de Jong, M. D. et al. Fatal outcome of human influenza A (H5N1) is associated with high viral load and hypercytokinemia. Nat. Med. 12, 1203–1207 (2006).
PhD, A. D. I. et al. Estimates of global seasonal influenza-associated respiratory mortality: a modelling study. Lancet 391, 1285–1300 (2018).
Zangrillo, A. et al. Extracorporeal membrane oxygenation (ECMO) in patients with H1N1 influenza infection: a systematic review and meta-analysis including 8 studies and 266 patients receiving ECMO. Crit. Care 17, R30 (2013).
Dobson, J., Whitley, R. J., Pocock, S. & Monto, A. S. Oseltamivir treatment for influenza in adults: a meta-analysis of randomised controlled trials. Lancet 385, 1729–1737 (2015).
Liu, Q., Zhou, Y.-H. & Yang, Z.-Q. The cytokine storm of severe influenza and development of immunomodulatory therapy. Cell Mol. Immunol. 13, 3–10 (2015).
Fedson, D. S. Confronting the next influenza pandemic with anti-inflammatory and immunomodulatory agents: why they are needed and how they might work. Influenza Other Respir. Viruses 3, 129–142 (2009).
Gregory, D. J. & Kobzik, L. Influenza lung injury: mechanisms and therapeutic opportunities. Am. J. Physiol. Lung Cell Mol. Physiol. 309, L1041–L1046 (2015).
Dimitrakopoulou, K. et al. Influenza A immunomics and public health omics: the dynamic pathway interplay in host response to H1N1 infection. OMICS 18, 167–183 (2014).
Pirrone, V., Thakkar, N., Jacobson, J. M., Wigdahl, B. & Krebs, F. C. Combinatorial approaches to the prevention and treatment of HIV-1 infection. Antimicrob. Agents Chemother. 55, 1831–1842 (2011).
Zaas, A. K., Chen, M., Varkey, J., Veldman, T. & Hero, A. O. III Gene expression signatures diagnose influenza and other symptomatic respiratory viral infections in humans. Cell 6, 207–217 (2009).
Zhai, Y. et al. Host transcriptional response to influenza and other acute respiratory viral infections—a prospective cohort study. PLoS Pathog. 11, e1004869–29 (2015).
Woods, C. W. et al. A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2. PLoS ONE 8, e52198 (2013).
Zhao, W. et al. Weighted gene coexpression network analysis: state of the art. J. Biopharm. Stat. 20, 281–300 (2010).
Nedzad Music, A. J. R. J. H. K. I. A. Y. Peripheral Leukocyte Migration in Ferrets in Response to Infection with Seasonal Influenza Virus. 1–17. https://doi.org/10.1371/journal.pone.0157903 (2016).
Juss, J. K. et al. Acute respiratory distress syndrome neutrophils have a distinct phenotype and are resistant to phosphoinositide 3-kinase inhibition. Am. J. Respir. Crit. Care Med. 194, 961–973 (2016).
Brandes, M., Klauschen, F., Kuchen, S. & Germain, R. N. A systems analysis identifies a feedforward inflammatory circuit leading to lethal influenza infection. Cell 154, 197–212 (2013).
Teijaro, J. R. The role of cytokine responses during influenza virus pathogenesis and potential therapeutic options. Curr. Top. Microbiol. Immunol. 386, 3–22 (2015).
Grimaldi, D., Llitjos, J. F. & Pene, F. Post-infectious immune suppression: a new paradigm of severe infections. Med Mal. Infect. 44, 455–463 (2014).
Venet, F., Lepape, A. & Monneret, G. Clinical review: flow cytometry perspectives in the ICU—from diagnosis of infection to monitoring of injury-induced immune dysfunctions. Crit. Care 15, 231 (2011).
van Vught, L. A. et al. Incidence, risk factors, and attributable mortality of secondary infections in the intensive care unit after admission for sepsis. J. Am. Med. Assoc. 315, 1469–1479 (2016).
Jiang, W. et al. Influenza A virus NS1 induces G0/G1 cell cycle arrest by inhibiting the expression and activity of RhoA protein. J. Virol. 87, 3039–3052 (2013).
He, Y. et al. Influenza A virus replication induces cell cycle arrest in G0/G1 phase. J. Virol. 84, 12832–12840 (2010).
Fan, Y. et al. Cell cycle-independent role of cyclin D3 in host restriction of influenza virus infection. J. Biol. Chem. 292, 5070–5088 (2017).
Sun, J., Madan, R., Karp, C. L. & Braciale, T. J. Effector T cells control lung inflammation during acute influenza virus infection by producing IL-10. Nat. Med. 15, 277–284 (2009).
Narasaraju, T. et al. Excessive neutrophils and neutrophil extracellular traps contribute to acute lung injury of influenza pneumonitis. Am. J. Pathol. 179, 199–210 (2011).
Vorobjeva, N. V. & Pinegin, B. V. Neutrophil extracellular traps: mechanisms of formation and role in health and disease. Biochemistry 79, 1286–1296 (2014).
Stroncek, D. F. Neutrophil-specific antigen HNA-2a, NB1 glycoprotein, and CD177. Curr. Opin. Hematol. 14, 688–693 (2007).
Metz, C. E. Basic principles of ROC analysis. Semin. Nucl. Med. 8, 283–298 (1978).
Hufford, M. M. et al. Influenza-infected neutrophils within the infected lungs act as antigen presenting cells for anti-viral CD8(+) T cells. PLoS ONE 7, e46581 (2012).
Maetschke, S. R., Madhamshettiwar, P. B., Davis, M. J. & Ragan, M. A. Supervised, semi-supervised and unsupervised inference of gene regulatory networks. Brief Bioinforma. 15, 195–211 (2014).
Teluguakula, N. & Harshini, A. Neutrophils as possible therapeutic targets in severe influenza pneumonia. J. Infect. Pulm. Dis. 2, 1 (2016).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinforma. 9, 559–13 (2008).
Parnell, G. P. et al. A distinct influenza infection signature in the blood transcriptome of patients with severe community-acquired pneumonia. Crit. Care 16, R157 (2012).
Parnell, G. et al. Aberrant cell cycle and apoptotic changes characterise severe influenza A infection—a meta-analysis of genomic signatures in circulating leukocytes. PLoS ONE 6, e17186 (2011).
Van Kerkhove, M. D. et al. Risk factors for severe outcomes following 2009 influenza A (H1N1) infection: a global pooled analysis. PLoS Med. 8, e1001053–12 (2011).
BA, K. D. et al. Oxygenation saturation index predicts clinical outcomes in ARDS. Chest 152, 1151–1158 (2017).
We would like to acknowledge the support from the following institutions/individuals: (1) Nepean Institute of Critical Care Education and Research. (2) The Public Health Agency of Canada (PHAC). (3) Roche Pharmaceuticals. (4) The Canadian Severe Influenza Study (CSIS) and the Randomized, Double-Blinded Controlled Trial Comparing High vs Standard Dose Oseltamivir in Severe, Influenza Infection in ICU (ROSII) Study Groups with special thanks to the following individuals for their support; (5) Leonie Herson for her graphic design work. (6) Joey Lai and Westmead Scientific Platforms supported by Westmead Research Hub. Gordon Wood, MD, Royal Jubilee Hospital and Victoria General Hospital, Victoria, BC, Canada. Steve Reynolds, MD, Vinay Dhingra, MD, Vancouver General Hospital, University of British Columbia, Vancouver, BC, Canada. Brent Winston, MD, Foothills Medical Centre and Peter Lougheed Hospital, University of Calgary, Calgary, AB, Canada. Sean Bagshaw, MD, University of Alberta Hospital, University of Alberta, Edmonton, Canada. Jim Kutsogiannis, MD, Royal Alexandra Hospital, University of Alberta, Edmonton, Canada. William Anderson, MD, Thunder Bay Regional Health Sciences Centre, Northern Ontario School of Medicine, Thunder Bay Ontario, Canada. Michael Silverman, MD, Rouge Valley Hospital and Lakeridge Hospital, Ajax and Oshawa, ON, Canada. Margaret Herridge, MD, Toronto General Hospital and Toronto Western Hospital, University of Toronto, Toronto, Canada. Rob Fowler, MD, Sunnybrook Hospital, University of Toronto, Toronto ON, Canada. Alison McGeer, MD, Mount Sinai Hospital, University of Toronto, Toronto ON, Canada. Mary-Anne Aarts, MD, John Marshall, MD, St. Michael’s Hospital, University of Toronto, Toronto ON, Canada. Deborah Cook, MD, St. Joseph’s Healthcare, McMaster University, Hamilton ON, Canada. Lauralyn McIntyre, MD, Ottawa Hospital General Campus and, Civic Campus, University of Ottawa. Ottawa ON, Canada. Stephane Ahern, MD, Hopital Maissonneuve-Rosemont, Universite de Montreal, QC, Canada. Kosar Khwaja, MD, Montreal General Hospital and Royal Victoria Hospital, McGill University, Montreal, QC, Canada. Natalie Bandrauk, MD, Health Sciences Centre and St. Claire’s Mercy Hospital, Memorial University, St. John’s NL, Canada. Richard Hall, MD, Queen Elizabeth Health Sciences Centre—Halifax Infirmary and Victoria General Site, Dalhousie University NS, Canada. Jordi Rello, MD, Hospital Universitali Joan XXIII—CIBER Enfermedades Respiratorias, Universitat Rovira i Virgili, Tarragona, Spain. This work was supported by Nepean Institute of Critical Care Education and Research. K.S. was supported by intramural grants from the Helmholtz-Association (Program Infection and Immunity).
The authors declare no competing interests.
Peer review information: Nature Communications thanks Jens Lundgren and other anonymous reviewer(s) for their contribution to the peer review of this work.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Tang, B.M., Shojaei, M., Teoh, S. et al. Neutrophils-related host factors associated with severe disease and fatality in patients with influenza infection. Nat Commun 10, 3422 (2019). https://doi.org/10.1038/s41467-019-11249-y
This article is cited by
Co-expression network analysis identifies potential candidate hub genes in severe influenza patients needing invasive mechanical ventilation
BMC Genomics (2022)
Forsythiaside A improves Influenza A virus infection through TLR7 signaling pathway in the lungs of mice
BMC Complementary Medicine and Therapies (2022)
Genome Medicine (2022)
Integrated host-microbe plasma metagenomics for sepsis diagnosis in a prospective cohort of critically ill adults
Nature Microbiology (2022)
Comparison of clinical features on admission between coronavirus disease 2019 and influenza a among children: a retrospective study in China
BMC Infectious Diseases (2021)