Introduction

Wheezing often occurs in infants with respiratory diseases. It is usually related to viral respiratory infections, although some children also wheeze in response to other triggers and have symptoms between episodes.1 The age of onset and the recurrence of wheezing have been used as the basis for imprecise nomenclatures of these episodes, which get many diagnostic labels with therapeutic consequences.2 Wheeze is also a manifestation of asthma, and there is concern about the relationship between wheezing in young children and later asthma.

The natural history of wheezing in young children has been studied, and searches have been made for phenotypes that associate different prognoses and risk factors. A pioneer study in this field was the Tucson Children's Respiratory Study which classified infant wheezing in the first three years as either transient or persistent, depending on the absence or presence of symptoms at six years of age.3 Although the same procedure has been used in other studies,4,5 this is not the best way to classify infant wheezing as these categories are a priori created by the researcher and their existence depends only on the occurrence of asthma symptoms in the school years. A high diversity of infant wheezing could be masked in this classification.

To avoid these problems, probabilistic classification methods such as latent class analysis6 (LCA) have been used in recent studies.710 LCA has highlighted a more complex reality in infant wheezing and has shown that a number of wheezing phenotypes can be identified before three or four years of age. These studies have improved our knowledge of the natural history of wheezing in young infants, but they have limitations. They analysed parents' answers to questionnaires completed at scheduled ages, asking for events that happened during a previous period. This information is retrospective and subjective, it does not fit clinical criteria, and it incorrectly estimates the true incidence of wheezing.11 A validation of the results of these studies against prospective doctor-diagnosed wheezing is lacking. Moreover, a description of phenotypes in terms of incidence rates of wheezing episodes has never been done. Such a description is essential for understanding the meaning of the phenotype classification, as it would show the disease burden for each phenotype and the age at which the incidence peaks or subsides.

The present study aimed to identify time patterns and phenotypes of doctor-diagnosed wheezing in the first 36 months of life, to describe the changes in incidence rates of wheezing episodes in those phenotypes, and to analyse their relationship with active asthma at six years of age.

Methods

Population and setting

The SLAM (Sibilancias de Lactante y Asma de Mayor) study used data from a historical cohort composed of all six-year-old children born between 2002 and 2004 in a population covered by 29 primary healthcare practices in Asturias, a northern Spanish region with a high prevalence of asthma. Children were assigned at birth to a primary healthcare centre where a paediatrician acted as the primary care physician for children under 14 years of age. All children attended a scheduled visit at six years, when a review of health problems, vaccination, and other preventive care was provided by the attending paediatrician.

Children with specific diseases related to chronic or recurrent respiratory problems were not included (see Appendix 1 available online at www.thepcrj.org). Incomplete follow-up was noted if a child did not have a complete clinical record because he or she was not assigned to the study centre at birth. These were children who had either changed residency or been born abroad.

Information about wheeze and asthma

Clinical records were reviewed for wheezing episodes in the first 36 months of life. The review included both paper and electronic clinical records (since 2001 all primary healthcare practices in Asturias have used the same electronic clinical recording system), as well as reports of both hospitalisations and emergency room visits, which are routinely documented in clinical records.

A wheezing episode was defined as any respiratory disease in which a physician recorded that wheeze was auscultated on physical examination. One month was required as a minimum elapsed time between distinct episodes. Clinical records were also used to identify active asthma at six years, defined as the occurrence in the previous 12 months of both symptoms attributed to asthma by a paediatrician and the receipt of at least one anti-asthma drug prescription.

Results of allergy tests for airborne allergens (either skin prick tests or serum-specific immunoglobulin E determinations) were identified in the clinical records of children with asthma. The type and number of tested allergens could vary according to clinical needs, but dust mite sensitisation was always included for testing because it is the allergen most commonly associated with asthma in Asturias. When results of allergy tests were available, asthma was classified as either allergic or non-allergic on the basis of a positive or negative test to any airborne allergen.

Analysis of data

Data analysis was conducted in three steps. First, LCA was applied to identify phenotypes of wheezing. Latent-class cluster models were built with 12 nominal variables that represented the occurrence or absence of any wheezing episode in each three-month period between 0 and 36 months of age. Models with between one and seven classes were investigated, and the best model was selected according to three criteria: lowest Bayesian information criterion (BIC), good fit to the data distribution (p>0.05 in a bootstrap L2 test), and significant fit improvement over a model with one less number of classes (p<0.05 in a conditional bootstrap 2LL test). Children were classified in the class they most probably belonged to (this is termed modal classification). This modal classification entails an error rate that can be estimated by comparison with the probabilistic distribution of each class. Between-class differences among clusters in a number of characteristics were analysed by χ2 or Kruskal-Wallis statistics as needed, and time to the first wheezing episode was analysed by the Kaplan–Meier survival method.

Second, mean monthly incidence rates of wheezing episodes per 1,000 person-months through the first 36 months were computed for each phenotype identified by LCA, and changes in incidence rates were analysed through joinpoint regression models. This is a least square linear regression method that allows the identification of both trends in incidence rates and points (joinpoints) at which incidence changes.12 Linear joinpoint regression models were built with the monthly incidence rate of wheezing as the dependent variable and the midpoint of each monthly period as the independent variable. Homoscedasticity and lack of autocorrelation were assumed and later tested. For each phenotype, models with between 0 and 5 joinpoints were built and the best model was chosen by a permutation test with a significance level of p<0.05 after a Bonferroni correction for repeated tests.12

Third, the modal classification was tested for its ability to predict active asthma at six years of age by computing sensitivity, specificity, and the Brierscaled, a measure of overall predictive performance standardised to give a result between 0 (minimum) and 100 (maximum predictive ability).13

LCA was conducted by means of the Latent Gold 4.5 software (Statistical Innovations, Boston, USA) and joinpoint regression models were built using the Joinpoint Regression Program (US National Cancer Institute). Other analyses were conducted by means of Excel (Microsoft Corporation).

Ethics approval

The study protocol was approved by the Regional Clinical Research Ethics Committee of the Principado de Asturias (number 03/2011).

Results

Sample

Out of a cohort of 4,765 children, 36 (0.8%) had one of the exclusion criteria (Appendix 1 and 4 available online at www.thepcrj.org). There were 990 children (20.8%) with incomplete follow-up, but no differences were found in sex, year of birth, or birth weight between children with and without complete follow-up. The analysis used the complete data from 3,739 children (51.2% male).

There were 4,676 recorded wheezing episodes and 1,704 children (45.6%) had at least one of them. Active asthma at six years of age was identified in 573 children (15.3%). Wheezing in the first 36 months was more common in children with asthma than in those without asthma (73.6% versus 40.5%, p<0.001).

Phenotypes and incidence rates

A model with four classes was chosen after LCA as it fulfilled the three abovementioned selection criteria (see Appendix 2 available online at www.thepcrj.org). These classes were defined by the probability of having any wheezing episode in each three-month period (Figure 1). Further description of these classes through estimates of monthly incidence rates of wheezing for each class was obtained by means of joinpoint regression models, as shown in Table 1 and Figure 2 (additional data in Appendix 3 available online at www.thepcrj.org). The following four phenotypes could be described:

  1. 1

    “Never/infrequent wheeze” (NIW) is a class with a very low incidence of wheezing. All children who never wheezed belonged to this class; only 25.6% of the class members had any wheezing episode and none had more than three episodes in the first 36 months. They had a little peak of incidence at 6.5 months and then a fall until 14.5 months with later stabilisation (no more significant slopes).

  2. 2

    “Transient wheeze” (TW) is a class with a fast rise in the incidence of wheezing from birth to a peak of 218 episodes per 1,000 person-months at 7.5 months, followed by a fast drop in incidence until 16.5 months and subsequently a slower decrease in the incidence rate. The first episode happened at a median (interquartile range, IQR) age of 6 (4–9) months; 90% of children in this class had their first episode before the 13th month.

  3. 3

    “Persistent wheeze” (PW) is a class with a very fast rise in the incidence of wheezing to a peak of 296 episodes per 1,000 person-months at 6.5 months, followed by a slow descent. The first episode occurred at an age of 6 (4–8) months; 90% of children in this class had their first episode before the 12th month.

  4. 4

    “Late wheeze” (LW) is a class with an initial very low incidence, only perceptible from the fourth month, but a slow constant rise through the first 36 months. The age of the first wheezing episode was 19 (12–25) months.

Figure 1
figure 1

Probability of wheeze in each three-month period from birth to 36 months in the never/infrequent (square marker, bold continuous line), late (circle marker, bold broken line), transient (circle marker, light continuous line), and persistent (square marker, light broken line) wheezing classes

Table 1 Incidence joinpoint regression models for the classes identified in latent-class analysis
Figure 2
figure 2

Joinpoint regression lines: incidence rates per 1,000 person-months for each wheezing class: never/ infrequent wheezing (bold continuous line), late wheezing (bold broken line), transient wheezing (light continuous line), and persistent wheezing (light broken line.

The probability distribution of the four classes was 65.4% NIW, 18.3% TW, 6.6% PW, and 9.7% LW, whereas the modal classification allocated 73.2% of the children to the NIW class, 12.7% to the TW class, 6.0% to the PW class, and 8.2% to the LW class. Therefore, the modal classification incorporates a 15.5% error rate due to both overestimation in the NIW class and underestimation in the TW class.

The four classes were different regarding some covariates (Table 2). Figure 3 demonstrates the Kaplan–Meier curves for time to the first wheezing episode. The curves for TW and PW classes almost superimpose. The NIW and LW classes initially have close survival curves but, from the ninth month, they progressively diverge.

Table 2 Covariate differences between classes
Figure 3
figure 3

Kaplan–Meier curves for time to the first wheezing episode in the never/infrequent (bold continuous line), late wheezing (bold broken line), transient wheezing (light continuous line), and persistent wheezing (light broken line) classes

Prediction of asthma

Active asthma at six years was more frequent in the TW, PW, and LW classes than in the NIW class. In the NIW and LW classes, asthma at six years was allergic in a higher proportion than in the other two classes. As a whole, 267 of the 573 children with asthma (46.6%) were classified in the NIW class, 103 (18.0%) in the TEW class, 83 (14.5%) in the PEW class, and 120 (20.9%) in the LW class. Prediction of asthma on the basis of the modal classification had a very low sensitivity and a weak predictive performance (Table 3).

Table 3 Performance of the modal classification in the prediction of active asthma at six years compared with the never/infrequent class

Discussion

Main findings

Wheezing in the first 36 months can be classified into four discrete phenotypes according to temporal patterns: a never/infrequent phenotype, two early-onset phenotypes that differ in persistence of symptoms but not in the age of onset, and a late-onset phenotype that emerges around the fourth month and slowly becomes the predominant type. The risk of active asthma at six years of age is different among these phenotypes, but the classification has a weak predictive ability for asthma.

Limitations and strengths of this study

The limitations of the study are due to its historical nature. A first limitation is the risk of unrecorded episodes that can lead to an underestimation of the incidence rate and can also cause imprecision in the definition of the classes. Part of the error in the modal classification could be due to unrecorded episodes — for example, most errors are apparently attributable to a misclassification of children into the NIW class instead of the TW class. However, the number of unrecorded episodes was probably low as the cumulative incidence (4,676 episodes in 3,739 children, i.e. 1.25 per child) is even higher than the reported incidence of wheezing in the Copenhagen Prospective Study on Asthma in Childhood,14 a high-risk prospective cohort study that identified every wheezing episode in the first three years (290 episodes in 313 children, i.e. 0.93 per child).

A second limitation is the absence of data that were not gathered because they were absent or incomplete in most of the clinical records, and their use might pose a high risk of bias. For instance, allergy tests were seldom performed in non-asthmatic children, so this information is lacking. There is also a lack of information about family history, environmental exposures, and clinical features.

This study also has significant strengths. Data were prospectively recorded and wheeze was confirmed by a physician, thus avoiding the bias posed by data retrospectively obtained from parents' responses to questionnaires.11 We also established a criterion to separate each episode from the next, avoiding a false multiplication of episodes. The date of each episode was exactly established, allowing for a high-resolution analysis of incidence that has only been done previously in a small selected cohort.14 Furthermore, the sample size is relatively large and the rate of losses is small compared with those in other cohort studies.

Interpretation of findings in relation to previously published work

The four depicted classes are phenotypes characterised by frequency and temporal trajectories. More information is needed before they can be seen as endotypes defined by a distinct pathophysiological mechanism.15 A correspondence can be drawn between these classes and those in the ALSPAC study.8 The “never/infrequent wheeze”, “transient early wheeze”, and “persistent wheeze” phenotypes in ALSPAC have their counterparts in the NIW, TW, and PW classes in the SLAM. However, an “intermediate-onset wheeze”, with wheezing episodes starting at 18 months, had a much lower frequency in the ALSPAC study (2.7%) than the LW class in the SLAM study (9.7%), perhaps because part of the “late-onset wheeze” phenotype of the ALSPAC study was also included in our LW class. Equally, the “prolonged early wheeze” phenotype could be shared between the PW and LW classes. The main difference between the ALSPAC and SLAM studies is that the SLAM study, by using incidence data, has been able to back-track the onset of the “intermediate phenotype” or LW class to the first year.

Regarding very early wheeze, there are two classes both starting at a median age of six months when bronchiolitis is probably the most common diagnosis these episodes get. One of them is transient while the other exhibits a heavy recurrence and a higher risk of asthma symptoms at six years. The differences between nonrecurrent bronchiolitis and bronchiolitis with heavy recurrent post-bronchiolitis wheezing are unclear. The type of virus involved in the first infection,16 bacterial colonisation of the airways,17 and characteristics of the immune response18 have all been proposed as causes whereas atopy, effect of pharmacological treatments, and epidemiological characteristics seem not to play a significant role in this setting.10,1922 We did not find any clue that could help in differentiating these two phenotypes.

Prediction of asthma at an early age remains difficult and our results do not improve on previous efforts. Although most of the children with active asthma at six years had wheeze in the first 36 months, 70.2% of them had less than four episodes in that period and 46.6% were classified in the NIW class — that is, many children with asthma had a limited clinical expression in the first three years. In some studies up to two-thirds of the children with asthma do not have asthma symptoms in their early years,23 and therefore they cannot receive an early diagnosis as any predictive index would have a low sensitivity.

Although the wheezing classes depicted in this study entail an increased risk of asthma compared with a never/infrequent wheeze class, the predictive performance of that classification is poor because of low sensitivity. The most credited procedure to recognise infants at risk of asthma is a modified asthma predictive index (mAPI) which identifies young children who respond to antiinflammatory asthma treatment. This has been proven in studies in which children with a positive mAPI had a mean age of three years.24,25 The first criterion for mAPI is the occurrence of four or more wheezing episodes, and in the SLAM cohort the fourth wheezing episode took place at a median (IQR) age of 21 (15–28) months. Therefore, it seems difficult to identify very young children at risk for asthma on the basis of a positive mAPI.

Reduced lung function and increased airway responsiveness since birth have been found in both virus-related wheezing in the first year and asthma in school years,26,27 so genetic or prenatally acquired airflow limitation could explain the association often detected between these diseases. However, the LW class does not exhibit an incidence peak in the first year, suggesting that congenital airflow limitation is not an important feature of this phenotype which seems to follow the previously described development of allergic sensitisation.28

Implications for future research, policy and practice

This work suggests that the main issues for future research should be the causes underlying the different prognoses for transient and persistent early-onset wheezing and whether atopic predisposition or early allergen exposures drive the surge of late wheezing.

Conclusions

This work clarifies the spectrum of wheezing-related diseases found in young children and, for the first time, describes the incidence patterns of each phenotype. In clinical practice, two conclusions can be drawn. First, transient and persistent wheezing in the young infant have an identical early onset, so it is difficult to give a prognosis in children with wheezing starting in the first year. Second, as a child grows, a late-onset phenotype becomes the most common type of wheezing, and this phenotype has an increased risk of persistent allergic asthma.