Introduction

Meconium ileus (MI) is the earliest disease complication in cystic fibrosis (CF), a multiorgan disease that is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene.1 Affected newborns often present with severe bowel obstruction caused by inspissated mucous and meconium occluding the mid or distal small bowel. Resolution of the ileal obstruction requires extensive treatment, including rectal infusion of gastrografin and/or enema under fluoroscopic control, as well as additional saline/acetylcysteine enemas. In about half of the newborns with CF presenting with MI, surgical intervention is necessary to alleviate the blockage and/or manage secondary complications such as intestinal atresia or intestinal perforation. Thanks to advancements in surgical and medical management, the outcome of infants with CF and MI has greatly improved, and their survival is now similar to children with CF without MI.2,3

Only a subset of patients with CF develop MI, and previous studies have reported an MI prevalence (MIP) of 14% in Canada4 14% in Italy5 and 20% in the United States.6 Further, MI demonstrates notable heritability.7,8 Although studies have shown that non-CFTR genes contribute to susceptibility,9 the CFTR genotype itself affects the occurrence of this complication; only patients with the more severe CFTR variants are at risk for MI.

More than 1,900 CFTR variants have been identified (http://www.genet.sickkids.on.ca) in patients with CF. Attempts to predict the functional severity of these mutations or their predictive value on the overall clinical outcome of individual patients have been challenged by a general lack of genotype and phenotype correlation.4,10,11 Lung disease, the primary determinant of a patient’s CF disease course, shows great heterogeneity, even among patients carrying the same CFTR mutations, because of the impact of environmental factors and multiple modifier genes.4 Exocrine pancreatic disease, however, correlates well with the underlying genotype,12,13 and CF has traditionally been dichotomized into a pancreatic insufficient (PI) and thus “severe” CF phenotype or a pancreatic sufficient (PS) and thus “mild” CF phenotype. In general, this distinction corresponds well to the mutation class system, first proposed in 1993 (ref. 14), that considered the underlying molecular consequences for the CFTR protein. According to this system, class I–III mutations, which include most disease alleles, are associated with minimal or no CFTR function and often are associated with a PI phenotype. Class IV and V mutations confer residual CFTR function and are associated with a PS phenotype.12,13,15

Recently established PI prevalence (PIP) scores are being used to distinguish more accurately between the severity of different CFTR mutations within the milder range.16,17 By contrast, MI is commonly reported in patients with CF carrying “severe” class I–III CFTR mutations.4 Given the early presentation with minimal opportunity for environmental influence, we sought to determine whether MI is associated with the CFTR genotype using large patient cohorts. We hypothesized that, similar to the pancreatic phenotype,16,18 the susceptibility to MI is influenced by specific CFTR genotypes, and that the prevalence of MI can be used to discriminate among severe CFTR mutations. We estimated and compared the prevalence of MI for different CFTR mutations using data sets from four countries and investigated association with other clinical markers of disease severity, including lung function (forced expiratory volume in 1 second (FEV1)) and body mass index (BMI).

Materials and Methods

Study cohorts

The Canadian Gene Modifier Consortium (GMC) was established to carry out large-scale genetic studies with recruitment of 60% of Canadian patients with CF between 2004 and 2008 (n = 2,492) and has been described earlier.19 In addition, we obtained longitudinal lung function and nutritional data from a similarly aged patient population from the Canadian patient data registry including only patients born after 1980 (n = 3,153). Further, we worked with the US patient registry (n = 43,432), the German CF patient registry (n = 3,596) and the CF patient data set from Veneto/Trentino/Alto Adige (northern Italy; n = 1,788) including patients with information about MI status.

Data collection

The following data were collected from all groups: occurrence of MI, age at CF diagnosis, CFTR genotype, and presence of siblings with CF. Additional data were collected from the Canadian patient data registry, including longitudinal lung function measures (percentage of predicted FEV1 (FEV1%pred)), weight and height for patients born in or after 1980 (reporting years 1986–2013). Lung function and growth data were obtained from the first stable clinic visit per year. All available data were included in our analysis.

The study was approved by the Research Ethics Board of The Hospital for Sick Children (Toronto, Canada), Cystic Fibrosis Canada, CF Foundation (Bethesda, MD), and the German and Italian CF patient registry committees.

Data validation and establishing scores

MIP score. The GMC cohort, which provided the most detailed phenotype data, was used to investigate the accuracy of the MI phenotype. We reviewed the charts of a total of 227 patients with CF with a diagnosis of MI across Canadian CF centers in the database and for whom charts were available. Further, we reviewed the charts of all 42 patients from The Hospital for Sick Children who were classified as not having MI and who received a diagnosis of CF at ≤2 months of age. The MI phenotype was verified based on a physician’s note and/or imaging results documenting the MI. It was considered false in the absence of any chart documentation for MI and/or when chart documentation suggested that other disease symptoms had led to a diagnosis of CF. We discovered that the age at CF diagnosis was helpful in confirming the presence or absence of MI. False assignment of MI when a patient with CF in fact did not have MI occurred in 15.8% of the cases. It was lower when the CF diagnosis was established before 1 (1.65%) or 2 months of age (11.1%) and higher when the CF diagnosis was established after 3 months of age (41.2%). The opposite was true for falsely assigning the non-MI variable to a patient with CF, which occurred in 19% of cases. It was higher among patients with a CF diagnosis before 1 month of age (32%) in the absence of any siblings with a CF diagnosis, but lower with a CF diagnosis between the ages of 1 and 2 months (4%).

In view of these findings, we used logistic regression analysis to model the probability of MI using the validated status obtained from the chart review as the outcome and the reported diagnosis of MI, age at diagnosis, and the presence/absence of a sibling with CF obtained from the database as predictors. The coefficients from the model then were applied to estimate the probability of MI in the data sets obtained from the GMC as well as the Canadian, US, German, and North Italian patient registries. The probability term is provided in the Supplementary Methods online.

A MIP score was estimated for each CF-causing variant as the average probability of MI for individuals with the CF-causing variant on one allele and F508del on the second allele. MIP scores were calculated for CF-causing variant groups populated by at least 10 patients with CF within each country group. While the absence or presence of siblings with CF was a component of the MI probability term, the siblings themselves were not identifiable to each other, risking confounding the MIP scores for CFTR mutations with a small number of patients. However, we controlled all small groups of patients carrying the same CFTR mutation on the second allele within the GMC cohort and confirmed a similar MI frequency between groups of patients with and without siblings.

PIP score. The GMC cohort was used to develop PIP scores, as previously described.18 Exocrine pancreatic function was evaluated using one or more of the following tests, and PI was diagnosed when there was a low serum cationic trypsinogen concentration (<16.5 ng/ml), low fecal elastase concentration in stool (<200 μg/g stool), low isoamylase concentration (<13 IU/l), high 72-h fecal fat (>7%) and/or abnormal pancreas imaging at ≥10 years of age.

The PIP score was calculated only for the GMC cohort as the proportion of PI among all patients (PI and non-PI) carrying the same CF-causing variants (where n ≥ 10).

Longitudinal analysis of lung function and BMI

FEV1%pred was determined using the equation presented by Hankinson et al.20 and Wang et al.21 to adjust for age, sex, and height. BMI z scores (zBMI) were derived from height, weight, age, and sex using the SAS macros obtained from the Centers for Disease Control and Prevention growth percentiles ( http://www.cdc.gov/nccdphp/dnpao/growthcharts/resources/sas.htm; accessed 26 March 2014). Both Canadian patient databases—the GMC and the Canadian patient data registry—were used to model the effect of MIP score and individual MI status on FEV1%pred and zBMI across ages 6–30 years using hierarchical linear regression.22 Models controlled for pancreatic status, sex, and age. We also examined interactions with age to assess the effect of each variable on the rate of change of FEV1 %pred and zBMI over age. SAS version 9.3 was used for statistical analysis (SAS Institute, Cary, NC).

Results

MI prevalence

Given the frequencies of CF-associated variants and the prominence of F508del as the most common CF-associated allele, we focused on patients with CF with heterozygous genotypes including at least one F508del allele. The MI prevalence among patients carrying F508del on one allele ranged from 13% in the North Italian and 15% in the Canadian to 20% in the German and 21% in the US CF patient registry cohorts; the MIP calculated using the predicted probability of MI led to modestly higher results ( Figure 1 ).

Figure 1
figure 1

Overview of the different databases and cystic fibrosis (CF) patient data registries. Only patients with F508del and a known second mutation, as well as with known age at CF diagnosis and meconium ileus variable, were included in the analysis.

MIP scores in the GMC cohort

CF-causing variants were assigned individual MIP scores ( Table 1 ). MI occurred only in patients carrying “severe” CFTR mutations, which were associated with a predominantly PI phenotype (high PIP scores). Within this severe end of the spectrum (PIP > 0.25) MIP scores ranged from 0.0 to 0.31, indicating that individual “severe” CFTR mutations confer different levels of risk for this early occurring CF disease complication. By contrast, the risk for MI, and thus the MIP score, was low or zero for CF-causing variants, associated with a predominantly PS phenotype (PIP < 0.25).

Table 1 Meconium ileus prevalence scores for the most common cystic fibrosis–causing variants

International comparisons of MIP scores

To evaluate whether CF-causing variants are associated with MIP scores and do not merely reflect population or environmental conditions, MIP scores were estimated for the most frequent CFTR mutations in the US, North Italian, and German CF patient registries ( Table 1 ). The MIP scores derived from the GMC cohort correlated very well with the MIP scores estimated from the US (r = 0.8, P < 0.0001), German (r = 0.7, P = 0.01), and North Italian data (r = 0.96, P = 0.0008) (Supplementary Figure S1 online).

MIP scores distinguish and stratify between “severe” CFTR mutations

Beyond the categorical separation of severe and mild CF-causing variants, which is paralleled by the PIP score, MIP scores differed among the severe variants. The highest MIP scores were calculated for mutations with premature termination, such as G542X (class I mutation, MIP 0.31), followed by F508del (class II mutation, MIP 0.22); the lowest MIP score was for G551D (class III mutation, MIP 0.08) (P = 0.0009). We were curious to determine whether the low MIP score noted for G551D was obtained for other class III CFTR mutations: those with gating deficiencies. Using the US and Italian data, we calculated equally low MIP scores for G178R (0.09, n = 22), S549N (0.12, n = 39), S1251N (0.07, n = 14), and G1244E (0.0, n = 3), but not for S549R (0.21, n = 14) (Supplementary Table S1 online). The majority of class III CF-causing variants is less severe, per MIP score ranking, when compared with class I or II variants. The US database had the largest number of different CF-causing variants, allowing the estimation of MIP scores for an additional 28 CF-causing variants. The highest MIP score (0.38) corresponded to the CF-causing variant 394delTT (c.262_263delTT) ( Table 1 ). This mutation is found more frequently in the Swedish CF population, where it is associated with lower lung function.23 Of interest is that the MIP score ranked the clinically unclassified missense mutation V520F into a similar severe range of CFTR mutations as other class I CF-causing nonsense variants.

Association of MIP scores and clinical phenotypes

In a next step we investigated whether the severity of CFTR variants, per MIP scores, corresponds to the severity of disease measured by classical clinical markers such as lung function (FEV1%pred) and nutrition (zBMI).

Lung function (FEV1%pred). There was a significant effect of sex (P < 0.0001) and pancreatic status (P < 0.0001) on the rate of FEV1%pred decline, with an estimated 10-year rate of decline of −11.3% predicted (95% confidence limits (CLs): −12.7, −9.9) among patients with PS compared with a 10-year rate of decline of −19.9% predicted (95% CLs: −20.3, −19.9) among patients with PI. After adjusting for sex, pancreatic status, and their interaction with age in the model, the MIP score showed an overall association with FEV1%pred, with higher MIP scores resulting in lower FEV1 (P = 0.01) ( Figure 2a ). For example, a patient with a MIP score of 0.3 has a FEV1%pred value that is 6.6% (95% CLs: 1.5%, 11.6%) lower than the FEV1%pred value of a patient with a MIP score of 0.01. When the patient’s individual MI status was added to the model, it did not show a statistically significant effect (P = 0.7), whereas the MIP score remained statistically significant (P = 0.01).

Figure 2
figure 2

Predicted clinical phenotypes for selected MIP scores. (a) The coefficients obtained from the hierarchical linear regression of percentage of predicted forced expiratory volume in 1 second (FEV1%pred) across all patients in the Canadian patient data registry (CPDR) were used to predict pulmonary function values for three scenarios representing a range of meconium ileus prevalence (MIP) scores and pancreatic function status. After adjusting for sex and individual pancreatic status in the model, patients with a MIP score of 0.01 had an FEV1%pred 6.6% higher (95% confidence limits (CLs): 1.5, 11.6; P = 0.01) than those with a MIP score of 0.3. (b) The coefficients obtained from the hierarchical linear regression of body mass index z scores (zBMI) across all patients in the CPDR were used to predict nutritional status for three scenarios representing a range of MIP scores and pancreatic function status. After adjusting for sex and the individual pancreatic status in the model, patients with a MIP score of 0.3 had a 0.24 lower zBMI than patients with a MIP score of 0.01 (95% CLs: −0.47, −0.01; P = 0.04).

Nutrition (zBMI). Similarly, sex (P < 0.0001) and pancreatic status (P < 0.0001) were also significantly associated with the rate of zBMI decline, with an estimated 10-year rate of decline of −0.13 (95% CLs: −0.20, −0.01) among patients with PS compared with a 10-year rate of decline of −0.39 (95% CLs: −0.45, −0.33) among patients with PI ( Figure 2b ). After adjusting for sex, the individual pancreatic status, and their interaction with age in the model, the MIP score showed an overall association with zBMI (P = 0.04), with a difference in zBMI of 0.23 (95% CLs: 0.01, 0.47) between a MIP score of 0.01 and 0.3 ( Figure 2b ). When added to the model, the individual MI status demonstrated a significant but clinically minimal association with the rate of zBMI decline (P = 0.007), with a 10-year difference in zBMI change of 0.04 (95% CLs: 0.01, 0.08) between those with and without MI. Including individual MI and its interaction with age in the model did not affect the association between zBMI and MIP score (P = 0.05).

Discussion

A patient’s individual risk for MI as a first clinical manifestation of CF disease is dependent on the individual CFTR genotype and varies between patients carrying different CF-causing variants. Making use of this genotype–phenotype relationship, we assigned MIP scores to individual CF-causing variants, allowing them to be ranked according to their MI risk. The MIP score also correlated with clinical disease severity measured in other CF-affected organs, suggesting that, on average, variants with high MIP scores also are associated with worse lung disease and BMI. Because MI occurred only in patients carrying “severe” CFTR mutations that are associated with PI, MIP scores seem to be particularly helpful in sorting CF-causing variants within the severe spectrum of CF disease.

The calculated MIP score is well in agreement with previously published information about the functional expression of different CFTR mutations. VanGoor et al.24 recently compared the magnitude of mRNA expression levels, the levels of mature CFTR protein, as well as the chloride channel activity of a large number of missense variants using a Fisher rat thyroid cell expression system. Comparing findings with ours, we observe that generally, the MIP score correlates with the CFTR functional levels (P = 0.009; Table 2 ). The mutations M1101K and R347P, however, showed no CFTR function, with a low MIP score and intermediate PIP score, suggesting that the functional consequences of these mutations may be very organ-specific and/or are greatly influenced by non-CFTR-modifying factors. This finding does seem to reflect previous reports of various outcomes of patients with M1101K or R347P, ranging from PI and an early decline in lung function to PS and only mild lung disease.25,26

Table 2 Meconium ileus prevalence scores and CFTR function

MIP scores distinguished between the “molecular” classification of CFTR mutations, especially regarding the distinctive class III or gating mutations. The highest MIP scores were calculated for class I mutations, including nonsense variants, the missense variant V520F, and the Scandinavian variant 394delTT. Nonsense variants and c.262_263delTT have been associated with a more severe CF phenotype when compared with F508del.23,27 Little clinical information is available for V520F, but the location of the amino acid substitution may indicate impairment of channel function.

MIP scores were significantly different between the mutations F508del and G551D, which is in agreement with early studies of smaller numbers of patients with CF that reported a lower MI incidence in patients carrying F508del/G551D when compared with patients with CF with F508del/ F508del.28,29 We demonstrated equally low MIP scores for other class III CFTR mutations (G178R, S549N, G1244E, S1251N), further supporting the idea that class III CFTR mutations are not as severe as F508del, at least with respect to gastrointestinal development. This difference in the clinical phenotype is somewhat remarkable given that G551D is classically viewed as a “nongating” and “nonfunctional” CFTR mutation. Defects in responsiveness to adenosine triphosphate regulation is thought to explain the low open probability of G551D channels.30,31 However, following expression in different heterologous cells, measurable G551D-mediated chloride currents have been reported.32 In Fisher rat thyroid cells G551D showed 1% of wild-type function, and human bronchial epithelial cells generated from G551D/F508del lung explants expressed 5% wild-type CFTR function.24 Our results suggest that, in contrast to classes I and II, class III CFTR mutations confer sufficient chloride channel function to maintain early intestinal fluid homeostasis and thus eliminate the risk for MI in patients with CF.

The MIP score is indicative of the clinical severity of specific CFTR mutations; it is not intended to predict outcome on an individual patient level, but it can be used to evaluate larger patient groups. While non–CFTR modifier genes as well as environmental factors largely influence the development and progression of lung disease and nutritional decline,33,34,35,36 we demonstrate that the severity of the underlying CFTR genotype itself plays a role in predisposing patients to an overall better or worse clinical outcome measured as lung function and nutritional status.

The high agreement of MIP scores among different CF patient registries indicates that the risk for MI can be used as a phenotypic marker for the severity of CFTR mutations. This reflects the heritability of susceptibility8 and a presumption that early occurrence equates to minimal environmental interference. However, that prenatal toxic drug exposure can lead to measurable drug levels in the meconium of infants37,38 may indicate that some environmental contribution does occur. A CF pig model further supports the MIP score assignments as a consistent linear relation between intestinal CFTR expression and MI susceptibility39 (with a containment of genetic background). Population studies with large patient cohorts have identified that non-CFTR genes such as SLC6A14, SLC9A3, and SLC26A9 are associated with, and do modify, MI susceptibility. Other studies, including family-based investigations, have also highlighted that loci on chromosomes 19 (ref. 40) and 8 (specifically the MSRA gene)41 influence MI susceptibility. Collectively, it is apparent that the individual risk for MI in a patient with CF depends on the severity of the CFTR gene defect plus the influence of modifier genes. A parallel scenario has been proposed for the intestinal obstruction observed in the CF mouse and ferret genetic models.40,42

For most CF-causing variants, information about the phenotype or the severity of the disease course is unknown. This has prompted the establishment of CFTR2 as a public resource with summarized clinical data, including FEV1%pred, frequencies of Pseudomonas aeruginosa infection, sweat chloride results as well as information about the pancreas status.11 While knowledge of these descriptions provides averages of expected clinical phenotypes for a specific CFTR variant, it does not allow discrimination of disease severity by organ. In addition, the prediction of the clinical consequences of specific CFTR variants based on nucleotide sequence or in vitro assays remains immensely challenging. In silico tools are often insufficient to accurately predict the liability of many CFTR variants,18 leaving the interpretation of the likelihood of specific clinical consequences to highly experienced CF specialists. Notably, as shown in the schematic in ( Figure 3 ), MIP and PIP scores offer additional tools to estimate the clinical severity of CFTR variants based on phenotype presentation. Thus, MIP and PIP scores can be used to benchmark already existing in silico tools and molecular measures, and help to explain complex genotype–phenotype correlations. Of immediate interest is understanding how gating mutations limit susceptibility to MI.

Figure 3
figure 3

Schematic ranking of CFTR mutation using meconium ileus prevalence (MIP) and pancreas insufficiency prevalence (PIP) scores. PIP scores have been published previously.18

A major strength of this study was the use of large cohorts, enabling determination of phenotypic scores in rare conditions with a range of disease-causing variants, such as CF. This study is strengthened by the use of the very rigorously controlled patient sets (GMC and Canadian patient data registry) to develop a model to estimate the probability of MI and to develop the MIP score. We were not able to apply the same rigor to control the meconium and pancreas phenotypes for the other CF patient data registries; rather, we applied the estimated MI probability model. This may have led to subtle over- or underestimations of MI for some CFTR variant groups in the other databases. Nevertheless, the strong correlation between the MIP scores from the GMC and the other databases argues against a large error in our MIP calculations. A noted limitation remains the absence of complete genotypes for a large portion of registered patients with CF (noted in all databases), which may have led to some incongruence between derived MIP scores.

In summary, we developed the MIP score as a phenotype-based scoring system to help sort CFTR variants according to their clinical disease severity. MIP and PIP scores advance our knowledge regarding the genotype–phenotype relation in CF and help to stratify patients according to their presumed disease severity for future clinical or genetic studies.

Disclosure

The authors declare no conflict of interest.