Introduction

One-quarter of smokers develop chronic obstructive pulmonary disease, which is the fourth most common cause of death worldwide and is largely caused by smoking.1,2 Fukuchi et al.3 reported that the prevalence of chronic obstructive pulmonary disease in Japanese patients increased steeply in individuals aged 60 years and older, from 5.8% in those aged 50–59 years to 15.7 and 24.7% in those aged 60–69 years and 70 years and older, respectively. The number of patients with insidious chronic obstructive pulmonary disease is estimated to exceed five million, but more than 95% of cases are currently undiagnosed, and many of them are not treated.3,4

Spirometry can detect obstructive lung damage in susceptible individuals.5,6 Early diagnosis of chronic obstructive pulmonary disease could encourage smoking cessation and improve quit rates in individuals most vulnerable to lung damage.7 One approach used to increase the smoking cessation rate is the sharing of spirometric results with smokers.8 The concept of ‘lung age’ was developed in 1985 as a way of making spirometric data easier to understand for patients and also as a psychological tool to show smokers the apparent premature ageing of their lungs.7 In the original formula developed by Morris and Temple7, the regression equation constructed for predicting the reference value of forced expiratory volume in 1s (FEV1) from age and height was rearranged to solve for ‘age.’ To date, several groups have reported revised equations for predicting lung age in populations worldwide.813

In Japan we have often used the regression equations reported by the Japanese Respiratory Society (JRS)14 to calculate patient lung age. However, we have found that the JRS equations show a discrepancy between lung age and lung damage, especially in Japanese males.15 Occasionally, healthy males had a large lung age deficit (lung age minus chronological age), despite their status as never/non-smokers.2

The aim of the present study was to establish novel regression equations using data from a large number of healthy never-smokers with normal spirometric measurements and normal body mass indices (BMIs). The newly developed equations were validated using data obtained from another cohort of subjects.

Materials and methods

Study design

Two independent retrospective cohorts were used.

Study population

The participants in the discovery cohort were selected from 104,563 individuals undergoing a 1-day standardised medical check-up at the Center for Preventive Medicine of St Luke's International Hospital between January 2004 and December 2012. A questionnaire was administered to subjects to obtain information about smoking status, past and present disease status and current medications. Subjects who indicated that they had never smoked before the health screening examination were identified as ‘never-smokers’ and comprised 63,224 patients (20,184 males and 43,040 females). The never-smokers who reported no past or present diseases, such as cardiovascular disease, cancer, hypertension, dyslipidaemia, diabetes mellitus, gout and lung disease, were identified as healthy never-smokers (n=21,832). Healthy never-smokers who were taking medications at the time of the medical check-up were excluded. Among healthy never-smokers with no current medications, we included those with normal spirometric measurements: %FEV1 (a proportion of FEV1 divided by the forced vital capacity (FVC))>70% and %FVC (a proportion of FVC divided by the predicted FVC value)>80% and normal BMIs (18.5<BMI<25.0). We defined them as ‘1-day check-up healthy never-smokers with normal spirometric measurements and normal BMIs’ (Supplementary Figure 1) and enroled them as Group 1 (n=15,238, including 5,499 males and 9,739 females; Table 1).

Table 1 Descriptive data of the subjects in Group 1 (1-day check-up of healthy never-smokers with normal spirometric measurements and normal BMIs) and Group 2 (comprehensive overnight check-up of healthy subjects)

The participants in the validation cohort were selected from an initial population of 5,595 individuals receiving a comprehensive overnight check-up at the Center for Preventive Medicine of St Luke's International Hospital between January 2004 and December 2012. Similar to the process for Group 1, we selected Japanese subjects who reported no past or present diseases as candidates for enrolment in Group 2, which constituted the study group of ‘comprehensive overnight check-up healthy subjects.’ A total of 2,079 participants (n=1,294 males and 785 females) met the criteria and were included in the study (Table 1).

Variables

Baseline data included age at check-up, anthropometric variables (height, weight, BMI, waist circumference and blood pressure), smoking history (never/non-smoker, ex-smoker or current smoker), medical history, current medications and co-morbidities, including chronic bronchitis or emphysema, asthma or other lung diseases. All participants underwent standard measurements of lung function (FEV1, %FEV1, FVC and %FVC) with a spirometer (DISCOM-21; CHEST MI Inc., Tokyo, Japan).16

Estimation of lung age

The lung age of participants was calculated by working back from the regression equations predicting reference values of FEV1 reported by the JRS14 and our novel regression equation. The following JRS equations14 were used:

FEV 1 ( l ) = 0.022 × height ( cm ) 0.022 × age ( years ) 0.005 in females
FEV 1 ( l ) = 0.036 × height ( cm ) 0.028 × age ( years ) 1 . 178 in males

We calculated the spirometric-derived lung age (SDL age) using the following equations:

SDL age = ( 0.022 × height ( cm ) 0.005 FEV 1 ( l ) ) / 0.022 in females
SDL age = ( 0.036 × height ( cm ) 1 . 178 FEV 1 ( l ) ) / 0.028 in males

The lung age deficit was defined as the difference between SDL age and chronological age.

Statistical analysis

We used the original method of Morris and Temple in which the assumption is that the lung age can be expressed as a linear function of FEV1 and height (cm). Data were analysed separately for female and male participants using multiple linear regressions and the ‘enter’ method. Pearson’s correlation coefficients between two variables are shown in Supplementary Table 1. The difference in lung age deficits between more than three categories was analysed using the Kruskal–Wallis test, followed by multiple comparisons. Data were analysed using SPSS statistical software version 22.0 (IBM Japan, Tokyo, Japan). Two-tailed probability values <0.05 were considered statistically significant.

Ethical considerations

We excluded the records of individuals who had declined to participate in the clinical study. This study was approved by the Institutional Research Board of St Luke's International Hospital.

Results

The demographic data of Group 1 and Group 2 participants are shown in Table 1. The mean age of Group 1 was about 20 years less than that of Group 2. In group 1, 75 and 78% of subjects were between ages 30 and 50 among males and females, respectively, whereas in Group 2 this was 14 and 15%. All subjects in Group 1 were never-smokers, whereas Group 2 included never-smokers, ex-smokers and current smokers. Height and weight were similar in the two groups, but the subjects in Group 2 had slightly higher BMIs and waist circumference compared with Group 1. The spirometric measurement values in Group 1 were higher than those in Group 2.

The scatter plots of the relationship between chronological age and FEV1 in females and males are shown in Figure 1. FEV1 declined continuously and almost linearly over chronological ages in both females and males. FVC also declined continuously and almost linearly over chronological ages; however, there was a wider spread of data (Supplementary Figure 2).

Figure 1
figure 1

The scatter plots between chronological age (years) and FEV1 (l) in females and males in Group 1. Scatter plots showing the relationship between chronological age and FEV1 in (a) females (n=9,739) and (b) males (n=5,499) in Group 1. The line of best fit for FEV1 (y) against chronological age (x) is expressed by a solid line.

We performed linear regression analysis for FEV1. Table 2 shows the results of the representative multiple linear regression analysis of models 1–3. Model 1 included two predictive variables (height and weight), similar to the JRS report. Model 2 was created by adding one additional variable (%FVC) and model 3 was created with spirometric variables only (FVC (l) and %FVC) excluding a height variable. Supplementary Table 1 shows a comparison of the other model equations; models 4, 5, 6 and 7 were created by adding one additional variable (weight (W) BMI, waist circumference, FVC, %FVC and %FEV1 and FVC (l), respectively). The fitness of the model was not improved by adding the other anthropometric variables. However, the addition of FVC data significantly improved the performance of the model (the adjusted R2 values >0.8) and multicollinearity was suspected between some variables (Supplementary Table 2).

Table 2 Model equations by multiple linear regression analysis

We developed the SDL age estimation equations according to the models described above and shown in Table 2. The representative scatter plots between chronological age and SDL age are shown in Figure 2. We compared models 2 and 3 with the original JRS model. On the basis of the R2 values, the slope (nearly equal to 1) and its simplicity (only two predictive variables), model 3 was considered the best model in both females (R2=0.66) and males (R2=0.55).

Figure 2
figure 2

Scatter plots between chronological age (years) and SDL age (years) in Group 1. Scatter plots showing the relationship between chronological age and SDL age in Group 1. SDL age was determined in females and males according to the JRS equations (a and b, respectively), model 2 equations (c and d, respectively) and model 3 equations (e and f, respectively). The lines of best fit for SDL age (y) against chronological age (x) are expressed by solid lines. Dotted lines indicate identity lines (yx).

The box and whisker plots of lung age deficit (lung age minus chronological age) sorted by age group are shown in Figure 3. Compared with model 3, the lung age deficit values from the JRS equations were more widely distributed, indicating a larger discrepancy between SDL age and chronological age when using the JRS method. In addition, the lung age deficit values in males tended to be positive in the JRS method, suggesting an overestimation of SDL age in all age groups, whereas the lung age deficit values using model 3 were distributed around zero in all age groups, suggesting appropriate prediction of SDL age in all age groups.

Figure 3
figure 3

The box plots of lung age deficit (SDL ages minus chronological age) sorted by age in Group 1. Box and whisker plots of SDL age deficit (SDL ages minus chronological age) according to age in Group 1. SDL age was determined in females and males according to the JRS equations (a and b, respectively) and model 2 equations (c and d, respectively). The bottom and top of the box represent the first and third quartiles, and the thick band inside the box represents the median. The ends of the whiskers represent the first and third quartiles, and the thick band inside the box represents the median. The ends of the whiskers represent the minimum and maximum of all data within 1.5 times the interquartile range from the bottom or the top of the box. Circles represent outliers.

We conducted the validation study using an independent cohort (Group 2). The scatter plots of the relationships between chronological age and SDL age in never-smokers with normal spirometric measurements and normal BMIs in Group 2 are shown in Figure 4. We compared model 2 and model 3 with the original JRS model. Similar to Group 1, model 3 was considered the best model in both females and males because of the values of R2. However, the slopes of model 2 and model 3 were from 0.73 to 0.80 (there were some discrepancies to 1). The box and whisker plots of the lung age deficit according to smoking status in Group 2 are shown in Supplementary Figure 3. Similar to Group 1, the lung age deficit values distributed more widely in the JRS method compared with model 3.

Figure 4
figure 4

The scatter plots between chronological age (years) and SDL age (years) in non-smokers with normal spirometric measurements and normal BMI in Group 2. Scatter plots between chronological age and SDL age in never/non-smokers with normal spirometric measurements and BMIs in Group 2. SDL age was determined in females and males according to the JRS equations (a and b, respectively), model 2 equations (c and d, respectively) and model 3 equations (e and f, respectively). The lines of best fit for SDL age (y) against chronological age (x) are expressed by solid lines. Dotted lines indicate identity lines (y=x).

Discussion

Main findings

This study produced the following novel SDL age equations using data from a large number of healthy never-smokers with normal spirometric measurements and normal BMIs: SDL age (females)=0.84×%FVC+50.2–40×FEV1 (l), and SDL age (males)=1.00×%FVC+50.7–33.3×FEV1 (l).

Interpretation of findings in relation to previously published work

The JRS14 has developed standardised equations for determining the lung age in the Japanese population based on the original method of Morris and Temple.7 However, several problems have been noted for the JRS equations. First, these equations were not validated with a large number of Japanese subjects.9,17 Second, the JRS equations produced discrepant results between lung age and lung damage, especially in Japanese males.15 Occasionally, healthy males who were healthy never-smokers showed a large lung age deficit. In addition, it is not self-evident whether lung age can be reliably predicted from the single spirometric parameter of FEV1.17

To our knowledge, several groups have reported revised equations predicting lung age in worldwide populations.813 The American Thoracic Society guidelines for spirometry recommend that predictive equations be derived from a ‘relevant’ population18 and should be updated at least every 10 years.19 The lung age equations currently used in Japan were developed by the JRS in 2001;14 thus, adequately updated equations using data from the Japanese population will be indispensable for appropriate evaluation of lung age.

We developed candidate regression models using the original methods of Morris and Temple.7 Thereafter, we compared alternative regression models by adding one more variable. The fitness of the model was not improved by adding the other anthropometric variables, but the addition of FVC information improved the adjusted R2 values significantly. Models 2 and 3 had a comparable ability for predicting SDL age; however, we selected model 3 as the best model because, first, the ideal equations should be as simple as possible and, second, because model 3 in both females and males revealed not only higher values of R2 but also nearer to 1 as the slopes in the validation cohort (Group 2).

Strengths and limitations of this study

One strength of the current study is that our model was developed using a large sample size (n=15,238) of healthy never-smokers with normal spirometric measurements and normal BMIs. In addition, these SDL age equations used two different spirometric parameters, FEV1 and %FVC.20 It is well known that these two values reflect different aspects of lung damage. We also validated this model in an independent cohort group.

The results of this study should be interpreted in the context of the acknowledged limitations. First, this study was conducted in only one hospital (St Luke's International Hospital), and the socioeconomic status of our participants does not seem to be representative of the general population of Japanese patients. Our participants might consist of higher-income upper-class individuals, with white collar occupations. However, data on occupational and other environmental exposure were not available in this study. Second, Yomaguchi et al.9,17 suggested that the simple backward calculation of age from the regression equation for the reference value of FEV1 may not be appropriate from statistical points of view.

Implications for future research, policy and practice

We used contemporary data to produce novel SDL age equations for Japanese adults aged 18–80 years. International guidelines suggest that lung function equations should be derived from the same population to which they will be applied and should be updated regularly. Appropriate SDL age equations can be used to give a more powerful message to motivate people to quit smoking. Further research is needed to validate these equations using other populations with large sample sizes.

Conclusions

We developed novel SDL age equations using FEV1 and %FVC from 15,238 Japanese healthy never-smokers with normal spirometric measurements and BMI.