Unlike other respiratory viruses, SARS-CoV-2 disproportionately causes severe disease in older adults whereas disease burden in children is lower. To investigate whether differences in the upper airway immune response may contribute to this disparity, we compare nasopharyngeal gene expression in 83 children (<19-years-old; 38 with SARS-CoV-2, 11 with other respiratory viruses, 34 with no virus) and 154 older adults (>40-years-old; 45 with SARS-CoV-2, 28 with other respiratory viruses, 81 with no virus). Expression of interferon-stimulated genes is robustly activated in both children and adults with SARS-CoV-2 infection compared to the respective non-viral groups, with only subtle distinctions. Children, however, demonstrate markedly greater upregulation of pathways related to B cell and T cell activation and proinflammatory cytokine signaling, including response to TNF and production of IFNγ, IL-2 and IL-4. Cell type deconvolution confirms greater recruitment of B cells, and to a lesser degree macrophages, to the upper airway of children. Only children exhibit a decrease in proportions of ciliated cells, among the primary targets of SARS-CoV-2, upon infection. These findings demonstrate that children elicit a more robust innate and especially adaptive immune response to SARS-CoV-2 in the upper airway that likely contributes to their protection from severe disease in the lower airway.
One of the defining features of the COVID-19 pandemic has been the striking relationship between disease severity and age1,2,3. While infection with other respiratory viruses, such as influenza or respiratory syncytial virus, causes significant morbidity and mortality in both young children and older adults4,5,6,7,8,9, severe COVID-19 occurs disproportionately in older adults1,2,3,10,11,12,13,14. A comprehensive modeling study, conducted prior to the advent of vaccines, estimated that the infection fatality rate was lowest for children ages 5–9 (~0.001%) and that even adults in their 40 s were already at 100-fold greater risk of death from COVID-191. The age-dependent effect on disease severity and mortality has been shown even when accounting for age-associated comorbidities15.
A few studies have examined differences in systemic immunological profiles of children and adults with COVID-19. Among patients in the early/mild phase of infection, children and adults displayed a comparable magnitude of systemic innate immune responses, including interferon-stimulated and pro-inflammatory gene expression16. These responses tended to resolve more quickly in children, however, while adults maintained a prolonged inflammatory and cytotoxic response in the circulation16,17,18. Among hospitalized patients, adults displayed greater breadth and neutralizing activity of SARS-CoV-2 specific antibodies as well as stronger CD4+ T cell responses to spike protein19,20, suggesting that poorer outcomes in adults are not due to failure to engage a robust systemic adaptive response and may even be driven by it.
Recent work has also begun to shed light on age-related differences in the immune response at the site of initial infection, the upper airway. It has been proposed that the upper airway of children is primed for viral sensing, exhibits a pre-activated anti-viral state, and/or engages a more robust innate immune response upon SARS-CoV-2 infection17,21,22. However, numerous studies have found little to no evidence of a systematic difference between infected children and adults in the distribution of SARS-CoV-2 viral load in the nasopharynx or in the kinetics of viral clearance16,23,24,25,26, and a few studies have even shown infants exhibit the highest viral load27,28. This suggests children are not significantly better able to achieve early control of viral replication in the upper airway. Nevertheless, differences in the upper airway microenvironment and immune response could contribute to protection from severe disease in children in additional ways, for example, by limiting migration of the virus into the lower airway.
Several studies have placed particular focus on potential differences in interferon-stimulated gene (ISG) expression in the upper airway of children and adults, given its well-established importance as a front-line of anti-viral innate immunity17,21,22,29. However, these studies reported some contradictory results, and none directly controlled for SARS-CoV-2 viral load on a gene-by-gene basis, highlighting the need for further investigation.
Here, we assess age-related differences in upper airway gene expression in response to SARS-CoV-2 infection by comparing previously published RNA-sequencing data of nasopharyngeal (NP) swabs from an adult cohort30 with new sequencing data from a pediatric cohort. Our results suggest that differences in the overall magnitude of ISG expression in the upper airway of children and adults with COVID-19 are subtle and appear unlikely to explain their distinct clinical outcomes. However, we also find evidence of more robust innate and especially adaptive immune responses in the upper airway of children, as well as increased clearance of ciliated cells, which may contribute to their protection from severe disease.
To compare the upper airway gene expression response to SARS-CoV-2 infection in children and adults, we utilized a previously published dataset of NP swab RNA-sequencing from an adult cohort30 alongside newly sequenced swabs from a pediatric cohort. All samples were obtained in the course of clinical testing for SARS-CoV-2 infection at the University of California San Francisco or Children’s Hospital Colorado between March and September 2020, prior to the availability of COVID-19 vaccines. We included patients up to 19 years of age in the pediatric cohort and restricted the adult cohort to those at least 40 years of age to impose clearer separation.
We divided each age cohort into three viral status groups: 1) patients with PCR-confirmed SARS-CoV-2 infection (“SARS-CoV-2” group), 2) patients negative for SARS-CoV-2 by PCR with no other pathogenic respiratory virus detected by metagenomic RNA sequencing (“No Virus” group), and 3) patients negative for SARS-CoV-2 who had another respiratory virus detected by sequencing (“Other Virus” group). Finally, we limited the samples in the SARS-CoV-2 group to those with at least 10 viral reads-per-million (rpM), comparable to PCR Ct values below 3030. Viral load above this threshold is characteristic of acute infection, from just before symptom onset up to ~6 days later31, and has been associated with recovery of actively replicating virus32,33,34.
The final dataset included 83 children (38 SARS-CoV-2, 34 No Virus, 11 Other Virus; median age 4 years, IQR 2-12) and 154 adults (45 SARS-CoV-2, 81 No Virus, 28 Other Virus; median age 62 years, IQR 47-71) (Fig. 1a, b; Table 1; Supplementary Data 1). Most of the patients in the SARS-CoV-2 group in both age cohorts were tested as outpatients, indicative of an early/mild stage of disease (Table 1). Samples in the SARS-CoV-2 group in both age cohorts spanned several orders of magnitude of viral load, and while viral load trended higher in the children, this did not reach statistical significance (Fig. 1c). Patients in the No Virus group in both age cohorts were more likely to be hospitalized, with a higher proportion in the pediatric cohort (Table 1; Supplementary Data 1). Rhinovirus was the most prevalent among the other respiratory viruses in both age cohorts (Fig. 1d).
We began by performing differential expression (DE) analyses between the SARS-CoV-2 and No Virus groups within each age cohort separately. This approach minimizes confounding by age differences unrelated to SARS-CoV-2 infection and by any potential batch effects, though it could be influenced by differences between each cohort’s No Virus group. The analyses yielded 1,961 and 1,216 differentially expressed genes at a p-value < 0.1 (based on a moderated t-statistic and Benjamini-Hochberg adjusted) for the pediatric and adult cohorts, respectively (Supplementary Data 2). As expected, interferon-stimulated genes (ISGs) were prominent among the genes highly significant in both age cohorts (Supplementary Fig. 1a). Overall, children exhibited a considerably larger number of unique DE genes, including immune-related genes, despite a smaller sample size that would have been expected to provide less statistical power (Supplementary Fig. 1a).
We next performed gene set enrichment analyses35 (GSEA) using Gene Ontology (GO) biological process annotations36 on the DE results from each cohort and compared the enriched pathways. As expected, a range of immune-related pathways were upregulated in both adults and children with SARS-CoV-2 infection compared to those with no virus (Fig. 2a; Supplementary Fig. 1b; Supplementary Data 3). Pathways related to the interferon response appeared as a whole to be strongly induced in both children and adults. Children, however, demonstrated stronger upregulation of the pathways for B cell activation, T cell activation, response to TNF, macrophage activation and phagocytosis. Children also exhibited stronger activation of several cytokine production pathways typically associated with T cell activation, such as IL-2, IL-4, and IFNγ production (Supplementary Fig. 2a–c). A few pathways showed expression changes in opposite directions compared to the respective No Virus group, such as neutrophil mediated immunity and mast cell mediated immunity. Both children and adults infected with SARS-CoV-2 exhibited downregulation of olfactory receptor gene expression (‘sensory perception of chemical stimulus’), consistent with the loss of sense of smell that has been clinically observed across the age spectrum37,38.
We complemented the analyses comparing SARS-CoV-2 and No Virus patients in each cohort separately by directly comparing gene expression between SARS-CoV-2 infected children and adults, controlling for viral load. We identified 5,352 differentially expressed genes at a p-value < 0.1 (based on a moderated t-statistic and Benjamini-Hochberg adjusted; Supplementary Data 4). Age differences unrelated to viral status likely contributed to the larger number of DE genes in the direct comparison. Nevertheless, GSEA of the DE results yielded overall similar patterns to those described above with regard to immune pathways (Fig. 2b; Supplementary Data 5). B cell related pathways (B cell activation, humoral immune response), T cell related pathways (T cell activation, IL-2, IL-4 and IFNγ production) and chemokine/cytokine signaling were more highly expressed in children with SARS-CoV-2 infection.
While some immune pathways did not reach statistical significance in the direct comparison between children and adults with SARS-CoV-2 infection, they typically trended in the same direction observed in the comparison to the No Virus groups (Fig. 2a, b). On the other hand, the stark disparity in neutrophil activation observed in the comparison to the No Virus groups was only weakly supported in the direct comparison, likely reflecting differences among the No Virus patients themselves. The direct comparison clearly revealed lower expression of cilia-associated genes in children with SARS-CoV-2 infection and suggested a trend toward lower expression of interferon-stimulated genes (Fig. 2b; Supplementary Fig. 2d), though the pathway just missed the statistical significance cutoff (p-value = 0.06, based on an adaptive multilevel splitting Monte Carlo approach and Benjamini-Hochberg adjusted). As expected, many developmental processes unrelated to infection also differed in the direct comparison between children and adults (Supplementary Data 5).
Importantly, we observed similar DE and GSEA results in a secondary analysis restricted only to outpatient children (n = 30) and adults (n = 24) with SARS-CoV-2 infection, suggesting that differences in the proportion of hospitalized patients and outpatients in each cohort did not bias the direct comparison (Supplementary Fig. 3).
Many of the pathways identified in the GSEA results as differentially expressed between children and adults with SARS-CoV-2 infection were tightly related to specific cell types. We therefore applied in silico estimation of cell type proportions39 based on marker genes derived from an airway single-cell study40 as an additional approach to contextualize our findings (Fig. 3; Supplementary Fig. 4; Supplementary Data 6). Consistent with the GSEA results, we found that SARS-CoV-2 infection triggered significantly greater recruitment of B cells to the upper airway in children compared to adults, which was also evident in the comparison between children and adults with other respiratory viruses (Fig. 3a). In contrast, differences in estimated T cell proportions were much subtler (Supplementary Fig. 4a), suggesting the GSEA results may reflect distinctions in T cell identity and regulation and not only cell number.
In the previous analysis of the adult cohort, infection with SARS-CoV-2 was associated with blunted recruitment of macrophages and neutrophils to the upper airway as compared to other respiratory viral infections30. The pediatric Other Virus group was too small to definitively conclude whether this observation recapitulates in children, especially given the mix of different viruses represented. Nevertheless, it is notable that a substantial fraction of the samples in the pediatric Other Virus group indeed exhibited markedly higher macrophage, neutrophil and dendritic cell proportions than most SARS-CoV-2 samples (Fig. 3b–d). Macrophage proportions did trend higher in children with SARS-CoV-2 infection compared with adults (Fig. 3b).
Intriguingly, while proportions of ciliated cells did not differ between children and adults in the No Virus groups, children with SARS-CoV-2 infection exhibited a marked decrease in ciliated cell proportions that was absent in the adults (Fig. 3e), consistent with the GSEA findings. This was accompanied by greater proportions of basal cells in children compared to adults with SARS-CoV-2 infection (Fig. 3f), possibly reflecting compensatory regeneration of the airway epithelium.
Finally, we wished to examine the effect of SARS-CoV-2 viral load on gene expression in pathways of interest within each age cohort. Expression of interferon-stimulated genes (ISGs) frequently correlates with viral load, as was previously observed in the adult cohort30. We performed robust regression to relate the expression of n = 100 ISGs to viral load in children or adults with SARS-CoV-2 infection, and compared the resulting slopes and coefficients of determination (Fig. 4a, b). The ISGs that most strongly correlated with viral load in adults, such as CXCL11 and OASL, exhibited overall similar patterns in children with slightly greater slopes (Fig. 4a–c). However, a subset of ISGs was considerably better correlated with viral load in children, most strikingly exemplified by genes such as IFI6 and IFI27 (Fig. 4d). While even adults with low viral load displayed elevated expression of these genes, the response in children was more gradual and only caught up to the adults at higher viral loads. ISGs that shifted from an almost stepwise response to the virus in adults to a more proportional one in children were among the leading-edge genes that contributed to the apparent trend toward lower interferon-response pathway expression in children in the GSEA results (Fig. 2b; Supplementary Fig. 2d; Supplementary Data 5). These findings suggest relatively subtle differences in ISG-specific regulation and/or cellular origins between children and adults that defy simple generalization. Such regulatory differences might also be reflected in the differential expression of certain interferon-regulatory factors (e.g., IRF8; Supplementary Fig. 2c)
In stark contrast to ISGs, the expression of B cell marker genes, such as CD22 and CD79A, was entirely uncorrelated with viral load in children (Fig. 4d). These genes exhibited significant heterogeneity between patients, likely reflecting the timing of activation of the B cell response, but the fraction of children who were engaging the response at the time of sampling was substantially greater.
We compared upper airway gene expression in children and adults to identify commonalities and distinctions in the response to SARS-CoV-2 at the site of initial infection, which may ultimately contribute to their disparate clinical outcomes.
Our analysis supports the conclusion that, when controlling for viral load, children and adults with SARS-CoV-2 infection both engage a pronounced interferon-stimulated gene (ISG) response in the upper airway that, as a whole, is of comparable magnitude. Our data further demonstrate that children exhibit a more gradual and proportional ‘dose response’ to viral load for a subset of prominent ISGs. These results are broadly in line with the findings of Koch et al., who also performed bulk RNA-sequencing on upper airway samples from children and adults and assessed a composite measure of ISG expression in patients with the highest viral load29. Yoshida et al. who performed a single-cell RNA-sequencing study, also observed only subtle distinctions, with slightly stronger upregulation of composite ISG expression in epithelial cells of infected adults but slightly stronger upregulation in immune cells of infected children17. In contrast, the single-cell analysis by Loske et al. showed somewhat elevated ISG expression in children, though neither single-cell study directly controlled for viral load21.
Importantly, both single-cell studies suggested a pre-activated anti-viral state in healthy children, characterized by elevated expression of upstream viral pattern recognition receptors and/or ISGs themselves17,21. Our study was not well suited to examine this question since the patients in the No Virus groups were generally not healthy controls. It may indeed be the case that a pre-activated anti-viral state decreases the chance that a productive infection is established in children. However, numerous large-scale studies have shown infected children do not systematically exhibit lower SARS-CoV-2 viral load in the upper airway compared with adults16,23,24,25,26, and ISG expression following infection does not appear significantly stronger in children when controlling for viral load. Thus, it remains unclear to what extent a pre-activated anti-viral state ultimately contributes to disparate clinical outcomes between children and adults who become infected.
This aside, our results suggest important elements of the adaptive immune response may be engaged more robustly in the upper airway of children. Specifically, we observed elevated gene expression markers of B cell and T cell activation, as well as cytokine production typically associated with T cell activation (such as IFNγ), in the upper airway of children. Intriguingly, Loske et al. also observed increased prevalence and activation of T cells as well as IFNγ expression in the upper airway of children in their single-cell study21. Vono et al. recently reported that children exhibited an elevated gene expression signature of B cell activation in the circulation compared to adults in the early days following symptom onset16, and Yoshida et al. observed a striking increase in naïve lymphocytes in the circulation of children with COVID-1917, which they speculated could reflect increased migration of B cells and T cells to the site of infection. Both our data and that of Loske et al. provide compelling evidence in support of this hypothesis. An early adaptive immune response to a novel pathogen in the upper airway of children, perhaps due to a more naïve immunological state, may therefore represent a critical factor in preventing progression to severe disease in children.
Finally, we observed evidence consistent with increased clearance of ciliated cells in children with SARS-CoV-2 infection, and a proportional increase in basal cells, which could be differentiating to restore homeostasis to the airway epithelium. Recent studies have found that ciliated cells are a major target for SARS-CoV-2 at the onset of infection41,42. It is thus conceivable that more effective turnover of infected ciliated cells and epithelial regeneration in the upper airway of children may limit the ability of the virus to migrate into the lower airway, where it can cause more severe disease.
Our study has several limitations that should be kept in mind: 1) a larger sample size would have increased the generalizability of the findings; 2) precise information on the timing of sample collection with respect to symptom onset was unavailable, although we limited our analysis to samples with viral load characteristic of the timeframe from just before symptom onset and up to ~6 days later31; 3) we did not have access to sequential data to investigate immune response dynamics over time; 4) we did not directly assess cell types present in the mucosa; and lastly, 5) the majority of subjects with COVID-19 had mild disease at the time of sampling and did not require hospitalization. Results may have differed if specimens from a greater proportion of severely ill individuals had been available. However, this likely resulted in a more relevant comparison since relatively few children develop severe disease and the upper airway is no longer the principal site of pathology in severe disease.
Our study provides added perspective on several leading hypotheses regarding the molecular underpinnings of clinical outcome disparities between children and adults with COVID-19. Nevertheless, further study is warranted to understand why children are protected against severe disease from SARS-CoV-2, or β-coronaviruses more generally43, as compared to several other respiratory viral pathogens.
Study design and clinical cohort
The previously published adult cohort consisted of patients with acute respiratory illnesses tested for COVID-19 by RT-PCR at the University of California San Francisco (UCSF), leveraging leftover RNA extracted from clinical NP swab specimens30. The UCSF Institutional Review Board granted a waiver of consent under protocol #17-24056. For the novel analyses presented here, we leveraged this published adult dataset while supplementing it with pediatric samples similarly obtained at UCSF under the same protocol. Additionally, pediatric samples were obtained from patients tested for COVID-19 by RT-PCR from NP swabs at Children’s Hospital Colorado (CHCO). CHCO specimens and data were obtained under Colorado Multiple Institutional Review Board protocols #20-0865, #20-1617 and #20-0972, which also granted a waiver of consent. All samples were collected between March and September 2020, prior to the availability of COVID-19 vaccines. Demographic and clinical data for all patients were obtained from a combination of electronic and manual abstraction of medical records at the respective institutions. Comprehensive sample metadata, including known clinical diagnoses, is available in Supplementary Data 1.
Children up to 19 years of age and adults at least 40 years of age were eligible for inclusion in the present analysis. Some samples were ultimately excluded based on sequencing metrics, as described in the following sections.
Excess clinical swab specimens were stored in viral transport media at −80 °C in the respective Clinical Microbiology Laboratories. Specimens were thawed and 200 uL aliquots of specimen were added to 200 uL of DNA/RNA Shield (Zymo Research, Irvine, CA) in sterile 1.5 mL microtubes with appropriate biohazard precautions.
Metagenomic RNA sequencing
Specimens underwent RNA extraction and metagenomic sequencing, as previously described30. Briefly, RNA was extracted from 200 μL of specimen in DNA/RNA shield using bead-based lysis and the Zymo Pathogen Magbead kit (Zymo). We also processed negative control samples (water and HeLa cell RNA) to account for background contamination. All samples were spiked with RNA standards from the External RNA Controls Consortium (ERCC)44. Samples were DNase treated, depleted of cytosolic and mitochondrial rRNA using FastSelect (Qiagen, Germantown, MD), and reverse transcribed to generate cDNA. Sequencing libraries were constructed using the NEBNext Ultra II Library Prep Kit (New England Biolabs, Ipswich, MA). Libraries underwent 146 nucleotide paired-end sequencing on an Illumina Novaseq 6000 instrument.
Metagenomic analysis of respiratory viruses
Samples were processed through the CZ-ID pipeline (formerly called IDSeq)45,46, which performs reference based alignment at both the nucleotide and amino acid level against sequences in the National Center for Biotechnology Information (NCBI) nucleotide (NT) and non-redundant (NR) databases, respectively, followed by assembly of the reads matching each taxon. We further processed the results for viruses with established pathogenicity in the respiratory tract47. We evaluated whether one of these viruses was present in a patient sample if it met the following three initial criteria: (i) at least 10 counts mapped to NT sequences, (ii) at least 1 count mapped to NR sequences, (iii) average assembly nucleotide alignment length of at least 70 bp.
Negative control (water and HeLa cell RNA) samples enabled estimation of the number of background reads expected for each virus, which were normalized by input mass as determined by the ratio of sample reads to spike-in ERCC RNA standards. Viruses meeting the initial criteria outlined above were then additionally tested for whether the number of sequencing reads aligned to them in the NT database was significantly greater than background. This was done by modeling the number of background reads as a negative binomial distribution, with mean and dispersion fitted on the negative controls. We estimated the mean parameter of the negative binomial for each taxon (virus) by averaging the read counts across all negative controls after normalizing by ERCC counts. We estimated a single dispersion parameter across all taxa using the functions glm.nb() and theta.md() from the R package MASS. We considered a sample to have a pathogenic respiratory virus detected by sequencing if the virus achieved an adjusted p-value < 0.05 after Holm’s correction for all tests performed in the same sample.
We used the CZ-ID-calculated viral reads-per-million (rpM), based on the NT alignment, as a uniform measure of SARS-CoV-2 abundance across all samples. A value of 0.1 rpM was added to all samples with rpM < 0.1.
Assignment of samples to comparator groups
We divided the samples in each age cohort into three viral status groups: 1) samples with a positive clinical PCR test for SARS-CoV-2 were assigned to the “SARS-CoV-2” group; 2) samples with a negative PCR test for SARS-CoV-2 and no evidence of another pathogenic respiratory virus in the metagenomic sequencing were assigned to the “No Virus” group; and 3) samples with a negative PCR test for SARS-CoV-2 but another respiratory virus detected by sequencing were assigned to the “Other Virus” group.
We retained for analysis only samples in the SARS-CoV-2 groups with at least 10 rpM, roughly corresponding to PCR Ct values below 3030, to focus on cases with likely active viral replication, where a clear transcriptional response to the virus is expected to be found. This approach was based on the well-established correlation between viral load and recovery of actively replicating virus from respiratory specimens, specifically the finding that specimens with a PCR Ct < 30 are associated with the ability to culture SARS-CoV-232,33,34. Viral load above this threshold is typically characteristic of acute infection, within ~6 days of symptom onset31.
Human gene expression quantification
Following demultiplexing, sequencing reads were pseudo-aligned with kallisto48 (v. 0.46.1; including bias correction) to an index consisting of all transcripts associated with human protein coding genes (ENSEMBL v.99), cytosolic and mitochondrial ribosomal RNA sequences, and the sequences of ERCC RNA standards. Samples were retained for analysis if they had at least 400,000 estimated counts associated with transcripts of protein coding genes. Gene-level counts were generated from the transcript-level abundance estimates using the R package tximport49, with the scaledTPM method.
Differential expression (DE) analyses
Genes were retained for each DE analysis if they had at least 10 counts in at least 20% of the samples included in the analysis. All analyses were performed with the R package limma50, using quantile normalization and the voom method. The design formula for the comparisons within each age cohort was ~viral status, where viral status was either “SARS-CoV-2” or “No Virus”. The design formula for the direct comparison between children and adults with SARS-CoV-2 infection was ~log10(rpM) + age cohort, where age cohort was either “children” or “adults”. DE p-values were adjusted using the Benjamini-Hochberg method within each comparison. Full DE results are available in Supplementary Data 2 and Supplementary Data 4.
Gene set enrichment analyses (GSEA)
Gene set enrichment analysis was based on Gene Ontology (GO) biological process pathway annotations36, using the non-redundant version available through WebGestalt51. Only pathways with a minimum size of 10 genes and a maximum size of 1500 genes were retained for analysis. The analysis was performed using the fgseaMultilevel function in the R package fgsea52, which calculates p-values based on an adaptive, multilevel splitting Monte Carlo scheme. The input consisted of all genes in the respective DE analysis, except for histone genes, pre-ranked by fold-change. The gene sets in Fig. 2 were manually selected to reduce redundancy and highlight diverse immune-related pathways and other relevant biological functions from among those with a Benjamini-Hochberg adjusted p-value < 0.05 in at least one of the three comparisons. Full results are provided in Supplementary Data 3 and Supplementary Data 5.
In silico estimation of cell type proportions
Cell-type proportions were estimated using the CIBERSORT X algorithm39 based on single cell signatures derived from the human lung cell atlas40. Differences in estimated proportions between comparator groups were evaluated for statistical significance using a Mann-Whitney test with Holm’s correction for multiple testing. Full results are provided in Supplementary Data 6.
Regression of gene counts against viral load
We performed robust regression of the limma-generated quantile normalized gene counts (log2 scale) against log10(rpM) of SARS-CoV-2 for n = 100 ISGs based on the “Hallmark interferon-alpha response” gene set in MSigDB (https://www.gsea-msigdb.org/gsea/msigdb), as well as for selected B cell activation marker genes. The analysis was performed within each age cohort separately using the R package robustbase53, which implements MM-type estimators for linear regression54,55, the KS2014 setting, and the model: quantile normalized counts (log2 scale) ~ log10(rpM). Model predictions were generated using the R package ggeffects and used for display in the individual gene plots. Error bands represent normal distribution 95% confidence intervals around each prediction. Reported p-values for significance of the difference of the regression coefficient from 0 are based on a t-statistic. Reported R2 values represent the adjusted robust coefficient of determination56.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The raw sequencing data are protected due to patient privacy restrictions in the IRB protocols governing enrollment in this study under a waiver of consent. Researchers who wish to obtain the FASTQ files can contact the corresponding author in order to be added to the IRB protocols and sign a materials transfer agreement with UCSF ensuring secure storage of the data and its exclusive use for de-identified transcriptomic analyses. Processed gene counts have been deposited under NCBI GEO accession GSE179277. The published human lung single-cell datasets used for cell-type proportions analysis can be obtained through Synapse under accessions syn21560510 and syn21560511. The “Hallmark Interferon Alpha Response” gene set is available from MsigDB under accession M5911. Source data are provided with this paper.
Code for the cell-type proportions analysis and the robust regression analysis is available in the repository of the previously published study on the adult cohort: https://github.com/czbiohub/covid19-transcriptomics-pathogenesis-diagnostics-results.
O’Driscoll, M. et al. Age-specific mortality and immunity patterns of SARS-CoV-2. Nature 590, 140–145 (2021).
Levin, A. T. et al. Assessing the age specificity of infection fatality rates for COVID-19: Systematic review, meta-analysis, and public policy implications. Eur. J. Epidemiol. 35, 1123–1138 (2020).
Woolf, S. H., Chapman, D. A. & Lee, J. H. COVID-19 as the leading cause of death in the United States. JAMA https://doi.org/10.1001/jama.2020.24865 (2020).
Shi, T. et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: A systematic review and modelling study. Lancet 390, 946–958 (2017).
Nair, H. et al. Global burden of respiratory infections due to seasonal influenza in young children: A systematic review and meta-analysis. Lancet 378, 1917–1930 (2011).
Principi, N. & Esposito, S. Severe influenza in children: Incidence and risk factors. Expert Rev. Anti-infective Ther. 14, 961–968 (2016).
Matias, G. et al. Estimates of hospitalization attributable to influenza and RSV in the US during 1997–2009, by age and risk status. BMC Public Health 17, 271 (2017).
Tin Tin Htar, M., Yerramalla, M. S., Moïsi, J. C. & Swerdlow, D. L. The burden of respiratory syncytial virus in adults: a systematic review and meta-analysis. Epidemiol. Infect. 148, e48 (2020).
Writing Committee of the WHO Consultation on Clinical Aspects of Pandemic (H1N1) 2009 Influenza. et al. Clinical aspects of pandemic 2009 influenza A (H1N1) virus infection. N. Engl. J. Med. 362, 1708–1719 (2010).
Ludvigsson, J. F. Systematic review of COVID-19 in children shows milder cases and a better prognosis than adults. Acta. Paediatrica 109, 1088–1095 (2020).
Bhopal, S. S., Bagaria, J., Olabi, B. & Bhopal, R. Children and young people remain at low risk of COVID-19 mortality. Lancet Child Adolesc. Health 5, e12–e13 (2021).
Bailey, L. C. et al. Assessment of 135 794 pediatric patients tested for severe acute respiratory syndrome coronavirus 2 across the United States. JAMA Pediatr. 175, 176 (2021).
Xu, Y. et al. Characteristics of pediatric SARS-CoV-2 infection and potential evidence for persistent fecal viral shedding. Nat. Med. 26, 502–505 (2020).
Dong, Y. et al. Epidemiology of COVID-19 among children in China. Pediatrics 145, https://doi.org/10.1542/peds.2020-0702 (2020).
Williamson, E. J. et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature 584, 430–436 (2020).
Vono, M. et al. Robust innate responses to SARS-CoV-2 in children resolve faster than in adults without compromising adaptive immunity. Cell Rep. 37, 109773 (2021).
Yoshida, M. et al. Local and systemic responses to SARS-CoV-2 infection in children and adults. Nature 1–10 https://doi.org/10.1038/s41586-021-04345-x (2021).
Cohen, C. A. et al. SARS-CoV-2 specific T cell responses are lower in children and increase with age and time after infection. Nat. Commun. 12, 4678 (2021).
Weisberg, S. P. et al. Distinct antibody responses to SARS-CoV-2 in children and adults across the COVID-19 clinical spectrum. Nat. Immunol. 22, 25–31 (2021).
Pierce, C. A. et al. Immune responses to SARS-CoV-2 infection in hospitalized pediatric and adult patients. Sci. Transl. Med. 12, https://doi.org/10.1126/scitranslmed.abd5487 (2020).
Loske, J. et al. Pre-activated antiviral innate immunity in the upper airways controls early SARS-CoV-2 infection in children. Nat. Biotechnol. 1–6 https://doi.org/10.1038/s41587-021-01037-9 (2021).
Pierce, C. A. et al. Natural mucosal barriers and COVID-19 in children. JCI. Insight 6, https://doi.org/10.1172/jci.insight.148694 (2021).
Madera, S. et al. Nasopharyngeal SARS-CoV-2 viral loads in young children do not differ significantly from those in older children and adults. Sci. Rep. 11, 3044 (2021).
Chung, E. et al. Comparison of symptoms and RNA levels in children and adults with SARS-CoV-2 infection in the community setting. JAMA Pediatrics https://doi.org/10.1001/jamapediatrics.2021.2025 (2021).
Bellon, M. et al. Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) viral load kinetics in symptomatic children, adolescents, and adults. Clin. Infect. Dis. 73, e1384–e1386 (2021).
Costa, R. et al. Upper respiratory tract SARS-CoV-2 RNA loads in symptomatic and asymptomatic children and adults. Clin. Microbiol. Infect. 27, 1858.e1–1858.e7 (2021).
Zachariah, P. et al. Symptomatic infants have higher nasopharyngeal SARS-CoV-2 viral loads but less severe disease than older children. Clin. Infect. Dis. 71, 2305–2306 (2020).
Ochoa, V. et al. Infants younger than 6 months infected with SARS-CoV-2 show the highest respiratory viral loads. J. Infect. Dis. 225, 392–395 (2022).
Koch, C. M. et al. Age-related differences in the nasal mucosal immune response to SARS-CoV-2. Am. J. Respir. Cell Mol. Biol. https://doi.org/10.1165/rcmb.2021-0292OC (2021).
Mick, E. et al. Upper airway gene expression reveals suppressed immune responses to SARS-CoV-2 compared with other respiratory viruses. Nat. Commun. 11, 5854 (2020).
Kissler, S. M. et al. Viral dynamics of SARS-CoV-2 variants in vaccinated and unvaccinated persons. N. Engl. J. Med. 385, 2489–2491 (2021).
Singanayagam, A. et al. Duration of infectiousness and correlation with RT-PCR cycle threshold values in cases of COVID-19, England, January to May 2020. Eurosurveillance 25, 2001483 (2020).
La Scola, B. et al. Viral RNA load as determined by cell culture as a management tool for discharge of SARS-CoV-2 patients from infectious disease wards. Eur. J. Clin. Microbiol Infect. Dis. 39, 1059–1061 (2020).
Yamada, S. et al. Assessment of SARS-CoV-2 infectivity of upper respiratory specimens from COVID-19 patients by virus isolation using VeroE6/TMPRSS2 cells. BMJ Open Respir. Res. 8, e000830 (2021).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA. 102, 15545–15550 (2005).
Ashburner, M. et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).
Giacomelli, A. et al. Self-reported olfactory and taste disorders in patients with severe acute respiratory coronavirus 2 infection: A cross-sectional study. Clin. Infect. Dis. 71, 889–890 (2020).
Kumar, L. et al. Loss of smell and taste in COVID-19 infection in adolescents. Int. J. Pediatr. Otorhinolaryngol. 142, 110626 (2021).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
Travaglini, K. J. et al. A molecular cell atlas of the human lung from single-cell RNA sequencing. Nature 587, 619–625 (2020).
Ravindra, N. G. et al. Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium identifies target cells, alterations in gene expression, and cell state changes. PLOS Biol. 19, e3001143 (2021).
Fiege, J. K. et al. Single cell resolution of SARS-CoV-2 tropism, antiviral responses, and susceptibility to therapies in primary human airway epithelium. PLOS Pathog. 17, e1009292 (2021).
Stockman, L. J. et al. Severe acute respiratory syndrome in children. Pediatr. Infect. Dis. J. 26, 68–74 (2007).
Pine, P. S. et al. Evaluation of the External RNA Controls Consortium (ERCC) reference material using a modified Latin square design. BMC Biotechnol. 16, 54 (2016).
Ramesh, A. et al. Metagenomic next-generation sequencing of samples from pediatric febrile illness in Tororo, Uganda. PLoS One 14, e0218318 (2019).
Kalantar, K. L. et al. IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring. Gigascience 9, https://doi.org/10.1093/gigascience/giaa111 (2020).
Langelier, C. et al. Integrating host response and unbiased microbe detection for lower respiratory tract infection diagnosis in critically ill adults. Proc. Natl. Acad. Sci. USA. 115, E12353–E12362 (2018).
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 4, 1521 (2015).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z. & Zhang, B. WebGestalt 2019: Gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 47, W199–W205 (2019).
Korotkevich, G. et al. Fast gene set enrichment analysis. bioRxiv 060012 https://doi.org/10.1101/060012 (2021).
Maechler, M., Rousseeuw, P., Croux, C., Todorov, V. & Ruckstuhl, A. robustbase: Basic Robust Statistics. R package version 0.95-0, http://robustbase.r-forge.r-project.org (2022).
Koller, M. & Stahel, W. A. Sharpening Wald-type inference in robust regression for small samples. Computational Stat. Data Anal. 55, 2504–2515 (2011).
Yohai, V. J. High breakdown-point and high efficiency robust estimates for regression. Ann. Stat. 15, 642–656 (1987).
Renaud, O. & Victoria-Feser, M.-P. A robust coefficient of determination for regression. J. Stat. Plan. Inference 140, 1852–1862 (2010).
We would like to thank Sam Dominguez, MD, Ph.D. Kirk Harris, Ph.D. Aline Maddux, MD, Christina Osborne, MD, and Matthew Leroue, MD, for their input on study design and review of the data. We are grateful for funding support from the following sources: NHLBI K23HL138461-01A1 (C.R.L.); Chan Zuckerberg Biohub (C.R.L.); COVID-Child Health Research Award, Research Institute at Children’s Hospital Colorado (P.M.M.); and philanthropic contributions from Mark and Carrie Casey, Julia and Kevin Hartz, Carl Kawaja and Wendy Holcombe, Eric Keisman and Linda Nevin, Martin and Leesa Romo, Diana Wagner, Jerry Yang and Akiko Yamazaki, and Three Sisters Foundation (C.R.L.).
The authors declare no competing interests.
Peer review information
Nature Communications thanks Matthew Altman and the other anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mick, E., Tsitsiklis, A., Spottiswoode, N. et al. Upper airway gene expression shows a more robust adaptive immune response to SARS-CoV-2 in children. Nat Commun 13, 3937 (2022). https://doi.org/10.1038/s41467-022-31600-0
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.