Respiratory syncytial virus (RSV) infection is a leading cause of hospitalization in infants. Underlying risk factors for RSV infection in the general population are not well understood, as previous work has focused on severe outcomes of infection in a clinical setting. Here we use RSV-specific IgG and IgA antibody measurements from two population-based cross-sectional serosurveys carried out in the Netherlands (n = 682) to classify children up to 5 years as seronegative or seropositive. We employ a generalized additive model to estimate the probability of prior RSV infection as function of age, date of birth within the year, and other risk factors. The analyses show that the majority of children have experienced a RSV infection before the age of 2 years. Age and birthdate are strong predictors of RSV infection in the first years of life, and children born in summer have higher estimated probability of infection than those born in winter [e.g., 0.56 (95% CI 0.45–0.66) vs. 0.32 (0.21–0.45) at age 1 year]. Our analyses reveal that the mean age at infection depends on date of birth, which has implications for the design of vaccination programmes and prioritisation schemes for the prophylactic use of monoclonal antibodies.
Respiratory syncytial virus (RSV) is a main cause of acute respiratory infection (ARI) in infants and young children1, leading to an estimated 60,000 in-hospital deaths and 3.2 million hospital admissions per year in children younger than 5 years2. In addition, RSV is also increasingly recognized as a main cause of the burden of respiratory disease in older adults3. In young children the overall disease burden depends critically on the age of primary infection, where prematurity and young age (< 6 months) are associated with severe disease4.
Despite the progress being made in vaccine and monoclonal antibody development5, there still is an incomplete quantitative understanding of the RSV infection dynamics across all age groups. Most research on RSV infections is done with a focus on severe outcome of infection in a clinical setting6. Building on such information, attempts have been undertaken to gauge the incidence of infection4,7 and infection attack rates8, showing that these are highest in young children9. Nevertheless, the majority of infections lead to relatively mild illness for which no medical care is sought. For a proper understanding of the transmission dynamics of RSV and optimal planning of preventative measures, direct information is needed on the incidence of infection in different age groups, irrespective of clinical signs. Such information on the cumulative incidence of infection can be obtained from serological surveys.
Here we study the infection dynamics of RSV in children under 5 years in the Netherlands by fitting a generalized additive model (GAM) to serological antibody data from two large cross-sectional population based studies10,11. Our analyses uncover risk factors for infection with RSV in the first years of life, providing quantitative estimates of the infection probabilities as function of relevant covariates. Hence, our analyses complement and extend on earlier studies that focussed on risk factors for severe outcome and mortality12.
We use serological data from two large Dutch population-based cross-sectional seroprevalence studies carried out in 2006/2007 and 2016/201710,11. We focus on infants and young children up to 60 months of age. The youngest participant in our study is 1 month old (36 days). In total, 450 individuals originated from the 2006/2007 cohort and 741 from the 2016/2017 study. Both studies have been described in detail and approved by a relevant Medical Ethical Committee10,11. RSV-specific IgG was measured in 1191 individuals, and in 497 also RSV-specific IgA was determined. Children under 1 year of age are oversampled, and the age distribution of the participant population is shown in Supplement Figure 1.
A multiplex immunoassay (MIA) was performed for IgA and IgG antibody concentrations directed against five RSV antigens13,14. Antibody (Ab) concentrations against prefusion F protein, postfusion F protein, nucleoprotein (N), glycoprotein of RSV type A and B (Ga and Gb) were measured simultaneously. Details of the MIA are described by Schepp et al.13. Additional information from the participants is available from a questionnaire filled in by parents of the participants. The questionnaire includes information on the number and ages of household members, the number of contacts made by the participants, whether the child visits a day-care, individual characteristics (age, gestational age, weight, length), and various socio-economic factors.
For determining the timing of the RSV seasons, we use weekly data from a laboratory-based surveillance system in the Netherlands. This system surveys a selection of viral infections, including RSV15.
Classification of children as previously infected (i.e. seropositive) or as yet uninfected (seronegative) is usually based on the IgG antibody concentration in a blood sample; children with Ab concentration higher than a predetermined cut-off are classified as previously infected and those with a concentration below the cut-off are classified as uninfected. Here, however, this method is complicated by the interference of maternal IgG antibodies16. Therefore, we have based the classification in young infants on both IgG and IgA concentrations, as IgA antibodies are not transferred across the placenta from mother to child. Specifically, in children younger than 500 days we use a previously determined IgA cut-off of 0.2 AU/mL to discriminate between infected and non-infected children14. For children older than 500 days an IgG cut-off of 1.0 AU/mL is used. Here, to increase specificity in both cases (i.e. under and over 500 days) we require that at least two out of three antibody concentrations (prefusion F, postfusion F and N) must exceed the cut-off for a sample to be classified as seropositive. Subgroup-specific proteins Ga and Gb are not used in the sero-classification. Using our classification method we are able to determine the infection status of 682 individuals (408 using IgA and 274 using IgG). The age distribution of the participants is shown in Fig. 2.
We analyse the RSV serology using a logistic regression in a generalized additive model (GAM) using the R package mgcv17. The regression models express the serostatus of the participants with complete information (n = 616) as a function of age, birth day of year (DOY), having siblings in various age ranges, attending day-care, and sex of the individual. Continuous variables age and birth DOY are modelled using penalized cubic splines (P-splines), using second order penalties. Birth DOY is modelled with a cyclic P-spline, with the boundary knots at day 365 (December 31) attached to day 1 (January 1). Based on preliminary analyses, we supply the splines for age with 25 knots and those for birth DOY with 11 knots. In addition, having a young sibling (0–4 years) in the household and visiting day-care are modelled as categorical variables (present/absent). Parameter estimation is performed by restricted maximum likelihood. The output of the logistic regression (log-odds of infection) is transformed into a probability of prior infection using the inverse logit function. The model can be written as
where s()and cs() represent the P-spline and cyclic P-spline, respectively.
A number of checks and supplementary analyses are performed to ensure that the results are not affected by differences between the two cohorts (2006/2007 and 2016/2017) or by specific RSV epidemics (2005/2006, 2006/2007, 2015/2016 and 2016/2017). We further exclude the potentially important covariate gestational age as only a very small number of premature infants is available (n = 10).
For selection of variables we perform an univariate analysis, retaining only variables that are significant at the 0.05 level (age, birth DOY, siblings0–4 and day-care) and discarding the non-significant variables (siblings5–9, siblings total, sex of the individual). Using this selection, starting from age and birth DOY, all possible model combinations are produced and checked for significance (p < 0.05) of the variables. In these procedures, we exclude day-care visits of household members which is marginally significant in the univariate analysis but not significant in the full model. We also evaluate all first-order interactions among variables that are significant in the univariate analyses.
Antibody dynamics and infection status
Figure 1 shows IgG and IgA antibody concentrations as a function of age. The figure show that maternal IgG decreases at an exponential rate, such that at age 1 year the majority of samples would be classified as uninfected using the IgG cut-off. Varying the thresholds for infection appears has a minor impact on the classification (Supplement Table 1).
Figure 2 shows that the majority of participants have been infected at least once before the age of 2 years, and that all children have been infected at least once by the age of 32 months. As expected, all infected individuals have experienced one or more RSV epidemics. Supplement Figure 3 gives an overview of the results using Lexis diagrams.
Age and birth day of year
Using the logistic regression model, we find a significant effect of age and birth DOY on infection status (p < 2e−16 and p = 0.00209), while the interaction (tensor product) is not significant (p = 0.364). As expected, age is a major determinant for RSV infection, with probability of prior infection increasing from almost 0 at 1 month of age to virtually 1 at three years of age (Fig. 3). During the first 3 years of life, we also find a strong effect of birth DOY on the probability of prior infection. Specifically, the probability of prior infection is highest for children born in summer months (June–August) and lowest for those born in winter (December–February). For instance, at the age of 6 months, the estimated probability of prior infection for children born in July is 0.16 [95% CI 0.10, 0.24], while it is 0.07 [95% CI 0.04, 0.12] for children born in January. At the age of 1 year, the estimated probabilities of infection are 0.55 [95% CI 0.44, 0.65] for children born in July and 0.33 [95% CI 0.22, 0.47] for children born in January. Hence, at 6 months and one year of age the infection probability is 2.2- and 1.6-fold higher for children born in July as compared to those born in January. In the following, we refer to the model with age and birth DOY as the baseline model.
Siblings and day-care attendance
Extending the baseline model with a binary variable for having a household member in the age range 0–4 years (siblings0–4) improves the model fit significantly (p = 0.0364, Supplement Figure 5) and is also significant in the full model (p = 0.0175, Fig. 4). For example, when included in the baseline model, the estimated risk of infection at age 6 months and birth in July is 0.20 [95% CI 0.12, 0.30] for children having a young sibling (0–4 years), and 0.13 [95% CI 0.08, 0.21] for children without a young sibling. In contrast, having one or more older siblings (5–9 years) in the household does not significantly improve the model fit (p = 0.420; Supplement Figure 6).
Inclusion of day-care into the model strongly improves the model fit (p = 2.84e−06 when added to the baseline model, Supplement Figure 7; p = 1.58e−06 in the full model, Fig. 4). For instance, at the age of 6 months the estimated probability of infection is 0.26 [95% CI 0.16, 0.40] for children born in July that visit day-care, and 0.10 [95% CI 0.06, 0.17] for children that do not visit day-care. Noticeably, the difference between two extreme subsets (children with young siblings who visit a day-care vs children without siblings not visiting day-care) the estimated probabilities of infection at 6 months of age are 0.34 [95% CI 0.21, 0.51] and 0.08 [95% CI 0.04, 0.14], respectively, a more than fourfold difference. At 12 months of age these estimates are 0.78 [95% CI 0.65, 0.87] and 0.36 [95% CI 0.24, 0.49], respectively, still a more than twofold difference (Table 1). With increasing age, the probability of infection becomes less dependent on the day care visit and having young siblings in the household and it ceases to be a risk factor.
Using data from two large population-based serological studies in the Netherlands, our analyses have provided risk factors and quantitative estimates for the probability of prior RSV infection. Our results show that the probability of prior RSV infection increases strongly with age but is also highly dependent on birth DOY, with highest probabilities of infection for children born in summer (Figs. 3, 4). In addition, our results show that having a young siblings in the household (0–4 years) and attending day-care increases the probability of prior RSV infection significantly. Differences between estimated infection probabilities can be substantial. For instance, at 6 months of age children that attend day-care and have a young sibling have a more than fourfold higher estimated probability of infection than children that do not visit a day-care and do not have a young sibling (0.34 vs. 0.08).
The high incidence rate in the first 2–3 years of life is in agreement with previous studies4,6,8,9,14, and our infection estimates at 1 and 2 years (44.1% and 84.6%) are also comparable to estimates from Finland and Kenya18,19,20. While our results show that in the first 2 years of life children born in summer are infected at a younger age, hospitalization data from the Netherlands show that the burden in the hospital is focused in children 2–3 months old that are born between August and December21. Hence, this implies that while the probability of prior infection is low for children born in late summer to early winter, presumably due to the presence of maternal antibodies, the severity of disease is highest in these children.
In our analyses, age is the leading predictor for prior infection, followed by birth DOY, attending day-care, and living with children aged 0–4 years. Interestingly, having a somewhat older sibling in the household (siblings 5–9 years) is not associated with an increased probability of infection. Perhaps, the difference could be explained by the increased duration of shedding22 and the higher incidence of infection7 in younger children. With respect to day-care, it should be noted that in the Netherlands many children start going to day-care already at 3 months of age, at the end of maternity leave. An increased risk for RSV hospitalization in case of day-care attendance and preschool age sibling(s) has also been reported in previous studies23,24. In this study, we have shown that these risk factors also apply to infection in the Dutch general population.
We discuss a number of limitations. First, our analyses are based on 616 samples with full information on covariates. This has resulted in fairly broad confidence bands for estimates, especially in the full model with all significant factors included (Fig. 4). By extension, this also implies that we may not have uncovered risk factors with relatively small impact. And due to lack of information or small stratum size we could not include potentially relevant factors such gestational age, ethnicity, and socio-economic status into the analyses. This is unfortunate, especially with regard to prematurity, which is a known risk factor for severe infection1. Thus, in future studies pooling of data from the current study with seroprevalence data from other countries may increase the power to detect additional risk factors. An alternative would be a longitudinal design with multiple follow-ups and comprehensive background information on relevant factors.
Second, we have combined data from two studies, a cohort from 2006/2007 and another cohort from 2016/2017. Even though we did not find statistical evidence of differences in infection status (p = 0.0832) or antibody concentrations14 between the two studies, there are subtle differences in study setup and study characteristics. For instance the response rate was substantially higher in the 2006/2007 cohort as compared to the 2016/2017 cohort and the serum collection of young children (< 1 year) in the 2016/2017 cohort are all taken in the beginning of the sampling period, while for the 2006/2007 cohort sampling is performed scattered over the whole sampling period (see Supplement Figure 3). Nevertheless, we believe that our study has provided valuable quantitative estimates of infection rates in the first years of life that will help with the design of future vaccination strategies.
In summary, we have provided precise quantitative estimates of the probability of primary infection with RSV in the first years of life, and showed that next to age, birth DOY, day-care attendance, and having a young sibling in the household are risk factors for infection. Our results provide support for the design and prioritization of intervention and prevention measures aimed at protecting infants in the first years of life. For instance, based on our analyses one could argue that the impact of using monoclonal antibodies in selected groups would be highest when supplied to children born in summer. Of course, in practical applications, other factors such as logistical constraints and ethical considerations will have to be factored in as well.
Data and scripts for the data processing, statistical analysis, figures, and tables are publicly available on GitHub (github.com/Stijn-A/RSV_serology).
Scheltema, N. M. et al. Global respiratory syncytial virus-associated mortality in young children (RSV GOLD): a retrospective case series. Lancet Glob. Health 5, e984–e991 (2017).
Shi, T. et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: a systematic review and modelling study. Lancet 390, 946–958 (2017).
Shi, T. et al. Global disease burden estimates of respiratory syncytial virus-associated acute respiratory infection in older adults in 2015: a systematic review and meta-analysis. J. Infect. Dis. https://doi.org/10.1093/infdis/jiz059 (2019).
Ohuma, E. O. et al. The natural history of respiratory syncytial virus in a birth cohort: the influence of age and previous infection on reinfection and disease. Am. J. Epidemiol. 176, 794–802 (2012).
RSV Vaccine and mAb Snapshot—PATH Vaccine Resource Library. Accessed 16 Apr 2021. http://vaccineresources.org/details.php?i=1562.
Bont, L. et al. Defining the epidemiology and burden of severe respiratory syncytial virus infection among infants and children in western countries. Infect. Dis. Ther. 5, 271–298 (2016).
Taylor, S. et al. Modelling estimates of the burden of respiratory syncytial virus infection in children in the UK. BMJ Open 6, e009337 (2016).
van Boven, M. et al. Estimating transmission parameters for respiratory syncytial virus and predicting the impact of maternal and pediatric vaccination. J. Infect. Dis. 222, S688–S694 (2020).
Goldstein, E. et al. On the relative role of different age groups during epidemics associated with respiratory syncytial virus. J. Infect. Dis. 217, 238–244 (2018).
Mollema, L. et al. PIENTER 2-project: second research project on the protection against infectious diseases offered by the national immunization programme in the Netherlands. Accessed 16 Apr 2021. https://www.rivm.nl/publicaties/pienter-2-project-second-research-project-on-protection-against-infectious-diseases (2010).
Verberk, J. D. M. et al. Third national biobank for population-based seroprevalence studies in the Netherlands, including the Caribbean Netherlands. BMC Infect. Dis. 19, 470 (2019).
Shi, T., Vennard, S., Mahdy, S. & Nair, H. Risk factors for RSV associated acute lower respiratory infection poor outcome and mortality in young children: a systematic review and meta-analysis. J. Infect. Dis. https://doi.org/10.1093/infdis/jiaa751 (2021).
Schepp, R. M. et al. Development and standardization of a high-throughput multiplex immunoassay for the simultaneous quantification of specific antibodies to five respiratory syncytial virus proteins. mSphere 4, e00236-19 (2019).
Berbers, G., Mollema, L., van der Klis, F., den Hartog, G. & Schepp, R. Antibody responses to respiratory syncytial virus: a cross-sectional serosurveillance study in the Dutch population focusing on infants younger than 2 years. J. Infect. Dis. https://doi.org/10.1093/infdis/jiaa483 (2020).
Van den Brandhof, W. E., Kroes, A. C. M., Bosman, A., Peeters, M. F., & Heijnen, M. L. A. Rapportage van virologische diagnostiek in Nederland. Representativiteit van de gegevens uit de virologische weekstaten [Reporting virological diagnostics in The Netherlands. Representativity of data from the weekly viral reports]. Infectieziekten Bulletin.. 4, 137–143 (2002).
Simister, N. E. Placental transport of immunoglobulin G. in Vaccine vol. 21 3365–3369 (Elsevier BV, 2003).
Wood, S. N. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J. R. Stat. Soc. Ser. B. Stat. Methodol. 73, 3–36 (2011).
Nyiro, J. U. et al. Defining the vaccination window for Respiratory syncytial virus (RSV) using ageseroprevalence data for children in Kilifi, Kenya. PLoS One 12, e0177803 (2017).
Kazakova, A. et al. Serological array-in-well multiplex assay reveals a high rate of respiratory virus infections and reinfections in young children. mSphere 4, e00447-19 (2019).
Kutsaya, A. et al. Prospective clinical and serological follow-up in early childhood reveals a high rate of subclinical RSV infection and a relatively high reinfection rate within the first 3 years of life. Epidemiol. Infect. 144, 1622–1633 (2016).
Rachel M. R., Maarten van Wijhe, S. T. et al. Respiratory syncytial virus-associated hospital admissions in children younger than 5 years in 7 European countries using routinely collected datasets. J. Infect. Dis. https://doi.org/10.1093/infdis/jiaa360 (2020).
Munywoki, P. K. et al. Influence of age, severity of infection, and co-infection on the duration of respiratory syncytial virus (RSV) shedding. Epidemiol. Infect. 143, 804–812 (2015).
Law, B. J. et al. The pediatric investigators collaborative network on infections in Canada study of predictors of hospitalization for respiratory syncytial virus infection for infants born at 33 through 35 completed weeks of gestation. Pediatr. Infect. Dis. J. 23, 806–814 (2004).
Nielsen, H. et al. Respiratory syncytial virus infection-risk factors for hospital admission: a case–control study. Acta Paediatr. 92, 1314–1321 (2003).
This work was supported in part by the Respiratory Syncytial Virus Consortium in Europe (RESCEU). RESCEU has received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No. 116019. This Joint Undertaking receives support from the EU's Horizon 2020 research and innovation programme and European Federation of Pharmaceutical Industries and Associations. The funding source had no involvement in this study.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Andeweg, S.P., Schepp, R.M., van de Kassteele, J. et al. Population-based serology reveals risk factors for RSV infection in children younger than 5 years. Sci Rep 11, 8953 (2021). https://doi.org/10.1038/s41598-021-88524-w
This article is cited by
Burden of respiratory syncytial virus-associated lower respiratory infections in children in Spain from 2012 to 2018
BMC Infectious Diseases (2022)