Introduction

Respiratory tract infection constitutes a major load in pediatric outpatient services and hospitalizations, especially in developing countries,1 and thereof recurrent respiratory tract infection is commonly seen among young children.2 Both upper and lower respiratory tracts may get infected, and recurrent lower respiratory tract infection, such as pneumonia, remains the leading infectious cause of pediatric fatality under the age of 5 years. According to the Global Burden of Disease Study 2016, an estimated 0.65 million children died from lower respiratory tract infection,3 in sharp contrast to 0.94 million in 20134 and 1.3 million in 2011.1 Despite a declining trend in the recent decade, the actual fatality figure is still high. In addition, epidemiologic statistics in 2010 revealed that the incidence of community-acquired childhood pneumonia was ~0.22 (interquartile range: 0.11–0.51) episodes per child-year in low- and middle-income countries, and it ranged from 0.015 to 0.060 in high-income countries.5,6 As there are currently no reliable tools to distinguish which children are at enhanced risk for recurrent respiratory tract infection, higher priority should be given to research that identify potential risk predictors to enable risk stratification, optimize prevention strategies, and ensure timely therapeutics.

Clinically, deficiencies in the defense system or in the pulmonary pathways often confer susceptibility to recurrent respiratory tract infection.7 Several epidemiologic risk factors have been acknowledged for the development of recurrent respiratory tract infection in children, such as parental smoking and asthma,8,9 yet no consensus has been reached. For example, Jopje and colleagues10 found that exposure to environmental tobacco smoke had a strongly increased risk for recurrent respiratory tract infection, yet no effect of maternal smoking during pregnancy was reported by Liesbeth and colleagues.11 Because the underlying pathogenesis of recurrent respiratory tract infection is proposed to be multifactorial, it is unlikely that any one single factor would exert a dominant impact on recurrent respiratory tract infection. Given its complexity, it is of interest to interrogate the possible interaction of major risk factors, and if allowed, construct a multivariate risk prediction model. In addition, given that the incidence of recurrent respiratory tract infection varies considerably in different countries and across ethnic groups, it is of public health importance to construct a database of risk profiles related to this disease in each ethnic group.

To fill this gap in our knowledge and generate more information for future studies, we conducted a large-scale cross-sectional investigation among preschool-aged children randomly selected from 20 kindergartens in Beijing, aiming to identity potential risk factors responsible for recurrent respiratory tract infection, both in isolation and in combination, and further to construct a risk nomogram prediction model on the basis of significant risk factors identified.

Methods

Study design

This is a cross-sectional survey conducted in Beijing from September to November 2019 following the guidelines of the Declaration of Helsinki. The conduct of this study got approval from the Ethics Committee of China-Japan Friendship Hospital, and written informed consent was obtained from the parents or guardians of study children prior to participation.

Study children

A stratified cluster random sampling strategy was utilized to select four (including two urban and two suburban) districts from 16 districts in Beijing, and further select five kindergartens, either public or private, from each designated district. Study subjects were restricted to preschool-aged children currently attending junior to senior classes in kindergartens. Children with unhealthy conditions, such as airway malformations, chronic respiratory disease, or congenital heart disease, were excluded from this present study.

In this survey, a self-designed structured electronic questionnaire was administered to the parents or guardians of 7524 children, and 7469 questionnaires were answered (response rate: 99.3%). After excluding 247 questionnaires with incomplete or missing information of interest, data from 7222 valid questionnaires were left in the final analysis.

Data collection and quality control

Data on possible risk factors of both children and their parents for pediatric recurrent respiratory tract infection were exported from qualified electronic questionnaires to a Microsoft Office ExcelTM spreadsheet. From children, data were collected on date of birth, sex, region, birth weight, history of respiratory diseases, sleep duration, and time of adding complementary foods, as well as the episodes of upper and lower respiratory tract infection over the past year, diagnosis of asthma by physicians, frequency of snoring during sleeping, allergy (including food, drug, dust, flower, or chemicals), and the initial use of antibiotics. For children, body height (to the nearest 0.1 cm) and weight (to the nearest 0.1 kg) were measured by trained health physicians. From parents, data were collected on date of birth, sex, body height and weight, maternal pre-pregnancy weight, gestational weight gain (GWG), delivery mode, education, family income, breastfeeding duration, cigarette smoking, gestational diabetes mellitus, and gestational hypertension.

To ensure quality control, one health physician was invited from each designated kindergarten, and received training on the methodology of this survey. Questionnaires were handed out by kindergarten teachers, and were completed online. Data from questionnaires were cross-checked by health physicians and investigators, and missing or extreme data were returned to kindergarten teachers in charge for further confirmation, and if necessary contacting parents or guardians.

Diagnosis of recurrent respiratory tract infection

There is currently no consensus about the accepted definition of recurrent respiratory tract infection in children, and in most cases, it is defined according to a certain number of annual infection episodes or specific diagnoses.12 Clinically, recurrent lower respiratory tract infection includes pneumonia and acute bronchitis, and recurrent upper respiratory tract infection includes common cold, tonsillitis, sinusitis, and otitis media. In this study, we employed the definition of recurrent respiratory tract infection recommended by Martino and Ballotti.13 In detail, a child is diagnosed to have recurrent respiratory tract infection if any pathological underlying condition justifying the recurrence of infections is absent, and at least one of the following conditions is met, that is, (i) six or more upper or lower respiratory tract infection episodes per year, (ii) one or more respiratory tract infection episodes per month from October to February, and (iii) two or more lower respiratory tract infection episodes per year. Recurrent upper respiratory tract infection is defined as one of the following two conditions: (i) six or more upper respiratory tract infection episodes per year, and (ii) one or more upper respiratory infection episodes per month from October to February. Recurrent lower respiratory tract infection is defined as two or more lower respiratory tract infection episodes per year.

Other definitions

Parental and maternal body mass index (BMI) (kg/m2) were calculated from self-reported height and weight (including the present and prior to pregnancy) and used to classify mothers as underweight (BMI < 18.5 kg/m2), normal weight (18.5 kg/m2 ≤ BMI ≤ 24 kg/m2), and overweight/obesity (BMI ≥ 24 kg/m2).14 GWG was calculated by subtracting maternal pre-pregnancy weight from maternal weight at delivery.

Education level was categorized as graduate degree or above, college (or equivalent) degree, high school (or equivalent) degree, or middle school degree or below. Family income level (RMB per year) was categorized as ≥500,000, 200,000–500,000, 100,000–200,000, or <100,000. Gestational diabetes mellitus and gestational hypertension, diagnosed by doctors from second-class or above hospitals, were recorded. Maternal pregnancy smoking was defined as smoking and non-smoking. Maternal pre-pregnancy smoking and paternal smoking were grouped into never smoking and ever (current or former) smoking. Delivery mode included natural birth and cesarean section.

For children, breastfeeding duration and time of adding complementary foods were self-reported and recorded in months. Breastfeeding duration was classified as <6 months and ≥6 months. Sleep duration was calculated as the sum of both sleep time on work days × 5 and sleep time on weekends × 2 divided by 7. Birth weight was classified as “low weight” (≤2.5 kg), “normal weight” (2.6–3.9 kg), or “high weight” (≥4.0 kg).15 Asthma was self-reported to be diagnosed by a doctor or health professional. Allergy was self-reported, including food, drug, dust, flower, or chemicals allergies. Snoring during sleeping was self-reported yes or no. Initial use of antibiotics was self-reported and classified as <6 months and ≥6 months.

Statistical analysis

Based on the diagnosis of recurrent respiratory tract infection and by pathogenic sites, all study children were classified into two groups. The distributions of continuous variables were assessed for normality by use of the skewness and kurtosis test. Skewed continuous variables are expressed as median (interquartile range), and normally distributed variables as mean (SD). Categorical variables are expressed as number (percentage). Between-group comparisons were implemented by t test or rank-sum test or χ2 test, where appropriate. Potential risk factors for recurrent respiratory tract infection were selected by both forward and backward logistic regression analyses at a statistical significance level of 5%, before and after adjusting for confounding factors, including age of children, sex, and region. The magnitude of risk association is quantified by odds ratio (OR) and 95% confidence interval (95% CI). The difference between two effect-size estimates was assessed using the Z test as recommended by Altman and Bland.16

Prediction accuracy of significant risk factors was appraised from calibration and discrimination aspects. Calibration statistics includes Akaike information criterion (AIC) and Bayesian information criterion (BIC), as well as the −2 log-likelihood ratio test. Discrimination statistics includes the area under the receiver-operating characteristic (ROC) to justify the improvement in prediction performance. In addition, the net benefit for adding significant risk factors was justified by decision curve analysis.17

Finally, on the basis of significant risk factors, a risk prediction nomogram model was constructed to enhance clinical and public health application, and this model was generated by the R language version 3.5.2 for Windows.

Unless otherwise reported, statistical analyses were completed using the STATA software version 14.0 (Stata Corp, TX) for Windows. Two-sided P value < 5% was reported to be statistically significant. Statistical power was estimated using the PS Power and Sample Size Calculations software version 3.0.

Results

Baseline characteristics

In total, data from 7222 qualified questionnaires from 7222 preschool-aged children were extracted for analysis. The mean age of all children was 4.63 years old, and there are 3601 boys and 3621 girls. The percentage of recurrent respiratory tract infection was 23.91%, and that of recurrent upper and lower respiratory tract infection was 11.73% and 17.64%, respectively.

Shown in Table 1 are the baseline characteristics of study children upon stratification by recurrent respiratory tract infection overall and by pathogenic sites. The distributions of region, maternal BMI, paternal BMI, maternal pre-pregnancy BMI, GWG, gestational diabetes mellitus, breastfeeding duration, asthma, snore during sleeping, allergy, and initial use of antibiotics differed significantly between children with and without recurrent respiratory tract infection (all P < 0.05).

Table 1 The baseline characteristics of study children according to recurrent respiratory tract infection overall and by sites.

Identification of potential risk factors

Both forward and backward logistic regression analyses were used to potential risk factors in association with the risk for recurrent respiratory tract infection overall and by pathogenic sites among preschool-aged children (Table 2). Of note, six factors, including asthma, allergy, initial use of antibiotics, breastfeeding duration, maternal BMI, and GWG, were found to be associated with the significant risk of recurrent respiratory tract infection overall and at the lower site before adjustment (all P < 0.05), and at the upper site, significance remained for asthma, allergy, initial use of antibiotics, and maternal BMI only (P < 0.001, <0.001, <0.001, and 0.014, respectively).

Table 2 Risk prediction of six significant factors for recurrent respiratory infection in preschool-aged children under three disease classifications.

After adjusting for age, sex, and region, all six, except GWG, factors were significantly associated with recurrent respiratory tract infection overall and at the lower site (all P < 0.05), and for recurrent upper respiratory tract infection, breastfeeding duration was additionally insignificant (P = 0.159). For example, the significant odds of having recurrent respiratory tract infection was 8.31 (95% CI: 5.69–12.12, P < 0.001), 2.31 (95% CI: 2.06–2.58, P < 0.001), 1.72 (95% CI: 1.48–1.99, P < 0.001), 1.24 (95% CI: 1.08–1.43, P = 0.002), and 1.19 (95% CI: 1.09–1.31, P < 0.001), respectively, for asthma, allergy, initial use of antibiotics <6 months, breastfeeding duration <6 months, and maternal BMI, which were retained for further explorations. The power to detect significance was >90% for above comparisons.

The difference between the effect-size estimates in children with recurrent upper and lower respiratory tract infection was nonsignificant (P > 0.05) as reflected by the Z test (Table 2).

Prediction performance assessment

Prediction performance for five aforementioned significant risk factors associated with recurrent respiratory tract infection overall and by pathogenic sites was assessed by comparing two models—basic model (without five risk factors) and full model (all variables with five risk factors), from calibration and discrimination aspects (Table 3). Significant improvement was seen after adding the five significant factors under three disease classifications. For example, the full model was significantly superior to the basic model as reflected by the goodness-of-fit test (all P < 0.001), which was further reinforced by decision curve analysis (Fig. 1).

Table 3 Prediction accuracy gained by adding five significant factors associated with recurrent respiratory tract infection overall and by sites in preschool children.
Fig. 1: Decision curve analysis for recurrent respiratory tract infection overall and by pathogenic sites.
figure 1

a Recurrent respiratory tract infection overall, b recurrent upper respiratory tract infection, and c recurrent lower respiratory tract infection.

Interaction exploration

Considering the factor that asthma is an established risk factor for recurrent respiratory tract infection and the number of children with asthma was very limited, pairwise interaction analysis was undertaken for the rest four significant risk factors (allergy, initial use of antibiotics, breastfeeding duration, and maternal BMI). To maximize the power to detect significance, the four significant risk factors were divided into two or three subgroups. Specifically, both maternal BMI and maternal pre-pregnancy BMI were divided into the low (BMI < 18.5 kg/m2), normal (BMI: 18.5–24 kg/m2), and high (BMI ≥ 24 kg/m2) groups.18 Breastfeeding duration was binarized on the basis of 6 months according to the recommendation of the World Health Organization, that is, solids were introduced at 6 months.19 Initial use of antibiotics was binarized on the basis of 6 months according to the study, which demonstrated that antibiotic use in the first 6 months preceded the manifestation of wheeze.20

As shown in Table 4, pairwise interaction of four significant risk factors was explored and presented. Of note, there was a graded increase in the odds of having recurrent respiratory tract infection overall and by pathogenic sites. For example, when compared with children without allergy and with initial use of antibiotics ≥6 months, children without allergy and initial use of antibiotics <6 months, with allergy and initial use of antibiotics ≥6 months, and with allergy and initial use of antibiotics <6 months were 1.81 (95% CI: 1.50–2.20; P < 0.001), 2.24 (95% CI: 1.97–2.54; P < 0.001), and 3.25 (95% CI: 2.56–4.13; P < 0.001) times more likely to suffer recurrent respiratory tract infection, respectively. The case was the same for the interaction between allergy and breastfeeding, as well as initial use of antibiotics and breastfeeding, indicating that allergy, initial use of antibiotics, and breastfeeding contributed to the development of recurrent respiratory tract infection in a dose-dependent manner. Relative to normal maternal BMI, low maternal BMI seemed to be a protective, albeit nonsignificant factor in the context of without allergy, initial use of antibiotics ≥6 months, or breastfeeding ≥6 months, and by contrast, high maternal BMI was a significant risk-conferring factor, especially in the present of with allergy, initial use of antibiotics <6 months, or breastfeeding <6 months for recurrent respiratory tract infection overall and by pathogenic sites.

Table 4 The interaction of allergy, first use of antibiotic, breast feeding, and maternal BMI in predicting the risk of recurrent respiratory infection in preschool-aged children under three disease classifications.

Risk prediction nomogram model

For practical reasons, a nomogram was constructed and displayed in Fig. 2 to help visually predict the risk of having recurrent respiratory tract infection overall and by pathogenic sites among preschool-aged children by modeling promising and significant risk factors under investigation (Fig. 2). Overall accuracy for prediction was good, with the C-index over 80% for three nomogram models (all P < 0.001) and calibration curves (Supplementary Fig. 1).

Fig. 2: Prediction nomograms for recurrent respiratory tract infection overall and by pathogenic sites.
figure 2

a Recurrent respiratory tract infection overall, b recurrent upper respiratory tract infection, and c recurrent lower respiratory tract infection.

Taking the risk prediction nomogram model for recurrent respiratory tract infection as an example, assuming a child in suburban region (27 points), aged 60 months (7.5 points), with BMI at 15 kg/m2 (55 points), maternal BMI of 30 kg/m2 (36 points), paternal BMI of 30 kg/m2 (12.5 points), asthma (100 points), without allergy (0 point), birth weight of 3 kg (12.5 points), breastfeeding duration of 5 months (18 points), initial use of antibiotics ≥6 months (0 point), and GWG of 20 kg (5 points), the probability of having recurrent respiratory tract infection was estimated to be 70%.

Discussion

To identify potential risk factors responsible for the development of recurrent respiratory tract infection, we, in a large-scale cross-sectional survey of 7222 Chinese preschool-aged children, explored the individual and interactive effects of these factors. Importantly, five independent factors, including asthma, allergy, initial use of antibiotics, maternal obesity, and breastfeeding duration, were significantly associated with the risk of recurrent respiratory tract infection overall and by pathogenic sites. Moreover, besides the leading role played by asthma, allergy, initial use of antibiotics, and breastfeeding might exert a graded, dose-dependent effect on recurrent respiratory tract infection susceptibility. To our knowledge, this is the first report of examining the joint contribution of multiple potential risk factors to recurrent respiratory tract infection among Chinese preschool-aged children.

Currently, the burden of recurrent respiratory tract infection is high in China, infected children frequently use supportive care services or antibiotics, or undergo surgical procedures for severe cases of infection in clinical practice.21 In such situation, it is imperative to take more practical and effective actions to prevent or control pediatric recurrent respiratory tract infection around the globe.

The implication of asthma in the development of recurrent respiratory tract infection is subject to much debate. Some studies have shown that asthma is the most common underlying illness for recurrent pneumonia,22,23 an acute and common lower respiratory tract infection in pediatrics. By contrast, Hoving and Brand24 deemed that asthma was more likely a differential diagnostic consideration than an underlying cause of recurrent pneumonia in children. Another study by Heffelfinger et al.25 reported that asthma was infrequently present with recurrent pneumonia, and whether their coexistence represents a causality or merely coincides by chance, was not always clear. The findings of this study supported the leading predominant role of asthma with the risk of having recurrent respiratory tract infection overall and by pathogenic sites, consistent with that of previous studies.22,26 However, the cross-sectional nature of our study prevents us from evaluating the direction of the association, such as whether asthma contributes to recurrent respiratory tract infection or vice versa. Considering the limited number of children with asthma and the association paradox, further exploration is restricted to the rest four significant factors in this study.

Besides asthma, the significant contribution of other four significant factors identified to the risk of recurrent respiratory tract infection in this study merits special discussion, and they are epidemiologically and biologically plausible. Population studies showed that allergic children had more and longer-lasting respiratory tract infection, in the upper and lower respiratory airways, than children free of allergies.26,27,28 Allergy is featured by the immune mechanism inducing the T-helper type 2 (Th2) lymphocyte pathway, which reduces Th1 response. Some cytokines such as interferon-γ, typical of the Th1 response, are essential to control infection, and they are lowered in allergic patients,28,29 which is posed as a possible reason for the involvement of allergy in recurrent respiratory tract infection. In addition, allergic mucosal inflammation can trigger the expression of adhesion molecules on epithelial cells such as intercellular adhesion molecule-1,30 the important receptor for rhinovirus,31 which may predispose to upper airway infection as a result. As a crucial cytokine in allergic inflammation, interleukin-13 is able to reduce mucociliary clearance and facilitate viral adhesion to airway epithelial cells.32

As for breastfeeding, growing evidence indicated its protective role in respiratory tract infection, and this role is dominant with the increase of breastfeeding duration.33,34 Breastmilk de facto contains many protective factors such as immunoglobulins, lactoferrin, and lymphocytes, as well as other factors that may contribute to reduced infant mortality.35 In addition, breastfeeding is reported to be influenced by maternal obesity.36 Maternal obesity is a significant risk factor for recurrent refractory tract infection in this study. There is evidence that maternal pre-pregnancy obesity and GWG may impact the fetus on the intrauterine environment with the ‘fuel-mediated teratogenesis’ and ‘developmental origin of health and disease’ hypotheses.37,38,39

With regard to the use of antibiotics, approximately one in five children aged over 1 month to 18 years have received antibiotics due to bacterial lower respiratory tract infection,40 with around four in five children using antibiotics in the first years of life.41 Interestingly, our findings indicate that initial use of antibiotics <6 months was associated with an increased risk of recurrent respiratory tract infection. Several possible reasons are behind this association. First, antibiotics can cause immunomodulatory changes in intestinal bacterial diversity during the first months of life, and influence immune system, Th, and regulatory T cell balance accordingly.42,43,44 As evidenced, gut microbiomes plays a key role in the maturation of the neonatal immune system, and further precipitates the development of respiratory tract infection.42,43,44 Second, children in need of antibiotics early in life have Th1/Th2 imbalance caused by infection, which makes them more susceptible to respiratory tract infection later in life.45 Third, the antibiotic-related suppression of inflammatory responses in the course of treatment may lead to enhanced immune response among Th2-shifted children or impaired Th1 immune responses in early childhood.46 Based on above evidence, it is reasonable to speculate that the five factors identified in this study might be responsible for the development of recurrent respiratory tract infection among Chinese preschool-aged children.

Another key finding of our study is that allergy, initial use of antibiotics, and breastfeeding predispose to recurrent respiratory tract infection in a graded, dose-dependent manner as revealed by our pairwise interactive analyses, highlighting the importance of joint analysis of multiple risk factors. To take one step forward and increase the application of our findings, we employed the nomogram technique, and fortunately the accuracy of prediction nomogram models constructed for recurrent respiratory tract infection overall and by pathogenic sites was very good. Despite the significant association, our findings presented here should be regarded as preliminary, and we agree that validation in other independent, well-designed, longitudinal studies are necessary to confirm or refute our conclusions.

Limitations

Finally, some limitations should be acknowledged for this present study. First, the cross-sectional design of this study restricted further comments on the causality of potential risk factors with recurrent respiratory tract infection. Second, our data were obtained through questionnaires filled in by the parents or guardians of study children, and hence a recall bias cannot be excluded. Third, all study children are living in Beijing and of Chinese origin, and extrapolation of our findings to other racial or ethnical groups should be done with caution.

Conclusions

To sum up, we have identified five potential risk factors responsible for the risk of having recurrent respiratory tract infection via an analysis of data from 7222 preschool-aged Chinese children. Notably, asthma plays a leading role, and allergy, initial use of antibiotics, and breastfeeding might exert a graded, dose-dependent effect on recurrent respiratory tract infection susceptibility. For practical reasons, we hope the present study will not remain just another endpoint of research instead of a beginning to establish background data to further explore potential risk factors of pediatric recurrent respiratory tract infection, as well as the possible underlying molecular mechanisms.