Introduction

Respiratory diseases are among the most common causes of mortality and morbidity in the occupational setting. The National Institute for Occupational Safety and Health (NIOSH) estimates that deaths from work-related respiratory diseases account for about 70% of all deaths from occupational disease.1 Approximately 10–15% of cases of adult asthma may be attributed to occupation, and cohort studies have shown that 20% or more of working people with asthma can have exacerbations of their asthma due to work-related exposures and conditions.2 Also, 15% of cases of chronic obstructive pulmonary disease (COPD) may be due to occupational exposures,3 and less prevalent lung diseases such as bronchiolitis obliterans have been reported in cases with occupational exposures.4 Additionally, it has been shown that pulmonary function testing is an important predictor of mortality.5,6

In occupations believed to be associated with the risk of pulmonary diseases, periodic spirometry is usually conducted in workers to monitor their lung function and prevent work-related respiratory conditions. Workplace monitoring through conduction of spirometry tests of forced expiratory volume in one second (FEV1) and forced vital capacity (FVC) is widely performed for early (preclinical) recognition of both obstructive and restrictive lung diseases.7,8 Other indications for periodic spirometry in occupational settings include assessment of fitness to wear respirators and maintaining workers' general health.911

Quality improvement, standardisation of spirometry testing, and interpretation of results are critically important in the occupational setting. More than 150 years have passed since the first spirometry test, yet it is still performed with suboptimal quality leading to false positive and false negative results.12 Almost all currently available spirometers meet the American Thoracic Society (ATS) criteria for accuracy and repeatability12 but, in clinical settings, technicians and general practitioners (GPs) do not usually use these devices properly. Spirometry tests are technically demanding, and selection of the device and its calibration prior to testing are especially important in performing a proper test. Although spirometry is often described as a simple screening test, it has a more important role in determining pulmonary performance. In generally healthy worker populations, making a diagnosis based on longitudinal data (periodic spirometry) may be more beneficial than a single abnormal spirometry test.1013 High quality spirometry depends on both participants and technicians because effort plays a central role in the flow-volume curve.14,15 Previous researchers have recommended criteria for evaluating spirometry quality over time.16 Nevertheless, data are limited on the accuracy of spirometric measurements and acceptable precision of longitudinal data for spirometry monitoring.17 In a previous study, only 33.1% of spirometry tests performed by trained primary care providers and 12.5% of those performed by control group participants achieved the requirement of a minimum of two acceptable spirometry manoeuvres.18 Poor quality of spirometry in the primary care setting is an international concern and can lead to secondary care referrals.19 It has been shown that an educational intervention in the primary care setting to minimise the risk of poor quality spirometry tests (due to poor technique) can reduce unnecessary secondary care referrals.20

Every day we encounter several workers who are referred to our clinic for further evaluation because of spirometric abnormalities. Most of these spirometric measurements have technical errors. There is no approved training programme for technicians and GPs who want to perform spirometry or interpret the results. Apart from occupational medicine specialists, GPs who have participated in occupational medicine courses are legally eligible to supervise spirometry tests and to interpret the results in the occupational setting. Performance of spirometry tests and interpretation of the results are part of these training courses. However, based on our experience, complete and proper training of GPs is not possible in these training courses. Although spirometry tests are commonly performed in occupational settings in our country, they have not been sufficiently evaluated.

This study sought to determine the quality of spirometry tests and changes in periodic pulmonary function testing in two consecutive years among workers at an industrial company in Iran.

Methods

Study design and subjects

The study was performed in an oil refinery in Iran in 2011. Data on 1,004 male workers were evaluated. The first phase of the study was started in June 2010. All spirometry tests and interpretations done by technicians and GPs at the refinery were checked by two independent occupational medicine specialists who were experienced in spirometry. In this oil refinery, two technicians perform the spirometry tests and three GPs interpret the results. These staff were not changed or replaced during these two consecutive years. All understudy subjects (technicians and GPs) had a minimum of seven years of work experience.

The basic inclusion criteria for the study were at least one year of work experience in the oil refinery and possession of a complete data profile for the year before the intervention. All co-workers participated voluntarily in the study and provided written informed consent. The study was approved by the Ethics Committee of the National Research Institute of Tuberculosis and Lung Diseases (NRITLD).

Pulmonary function test and training course

American Thoracic Society/European Respiratory Society (ATS/ERS) guidelines were used for assessment of acceptability and repeatability criteria. These guidelines include exhaling with maximum effort, aiming to achieve at least three acceptable and two reproducible curves.15

Spirometry tests were considered ‘acceptable’ if they were free from artifacts (cough during the first second of exhalation, glottis closure that affected the measurement, early termination or cut-off, less than maximal effort during the test, leak, obstructed mouthpiece), had good starts (extrapolated volume <5% of FVC or 0.15L, whichever was greater), and had satisfactory exhalation (duration of ≥6 seconds or a 1 second plateau in the volume-time curve or if the subject could not or should not continue to exhale).

After three acceptable spirometry tests, repeatability criteria were considered as the two largest FVC values within 150ml of each other and the two largest FEV1 values within 150ml of each other.14

The intervention, which consisted of technician training according to the NIOSH-approved spirometry training course21 and monitoring of the quality and precision of spirometry tests and their interpretations, was then initiated. A total of 1,004 pre- and post-intervention spirometry tests were evaluated. The training course consisted of eight hours of formal lectures, eight hours of practical education for the two technicians and four hours of practical education for the three GPs, followed by two hours of evaluation of the participants' spirometry skills with both written and practical examinations. The course content included the following:

  • Basic physiology of the FVC manoeuvre and the determinants of airflow limitation with emphasis on its relation to repeatability of results.

  • Knowledge about the device and its requirements, calibration check procedures, how to find sources of errors, and their correction with the results.

  • Performance testing, subject coaching, recognition of improperly performed manoeuvres, and corrective actions.

  • Quality of results.

  • Actual use of the device under supervised conditions.

  • Measurement of tracings and calculation of results.

Additionally, a pamphlet on spirometry errors with figures and an instruction form containing indications and contraindications of spirometry were administered to course participants. A refresher training course was held for the same participants six months later. This course was conducted according to the NIOSH-approved refresher training course guidelines.21 The technicians' refresher training course focused primarily on enhancing their skills and coaching them to improve test quality. After the refresher course, another examination similar to the previous one was performed.

These training courses were scheduled between two periodic examinations of the refinery personnel. After completion of these training courses, the participating GPs and technicians fulfilled the criteria for pulmonary function testing and interpretation of results. All understudy subjects underwent closed-circuit spirometry by a calibrated portable spirometer (Spirolab III, MIR Co., Italy). The Spirolab III spirometer displays quality grades and offers an automated interpretation; however, this device does not display a specific error message for each error. In this study, the quality of device messages was not considered as an index for rejecting or accepting the spirometry test. All spirometry tests were reviewed by two occupational medicine specialists based on ATS/ERS criteria, and data from the first and second years were compared. FEV1/FVC <0.70 (obstruction) and FVC <80% predicted (restriction) were used to define abnormal spirometry results.

Before the initiation of the main study, a checklist of errors that could make the test unacceptable or unrepeatable was prepared based on ATS/ERS criteria. One hundred spirometry tests were then reviewed by both reviewers and interpersonal variability was assessed (kappa coefficient 95%). These tests were excluded from the main study. The checklist was completed for all spirometry tests.

Two technicians, three GPs, and 1,004 workers entered the study. In order for the data to be complete and interpretable, participants had to submit three spirometry curves per testing session. The two reviewers were present in the setting two days a week. Thus, post-training continuous monitoring by the technicians was not available. However, the technicians and GPs had access to the reviewers in their off days.

Statistical analysis

Data were analysed using descriptive and analytical statistical methods and SPSS version 11.0 software. To compare quantitative variables, the paired t-test was applied and the McNemar test was used for qualitative variables; p<0.05 was considered statistically significant.

Results

A total of 1,004 spirometry tests were evaluated. At the start of the study the mean (SD) age of the workers was 37.6 (6.3) years (range 20–62) and all were male. The mean (SD) body mass index (BMI) was 25.5 (3.38) kg/m2 (range 16.9–35.8) and mean (SD) work experience was 11.44 (4.31) years (range 1–25). The mean (SD) weight and height of participants was 76.13 (11.58) kg (range 45–107) and 172.6 (6.85) cm (range 152–197), respectively. A total of 264 participants were smokers (26.3%) and mean (SD) cigarette consumption in smokers was 8.36 (5.25) pack/years. Mean cigarette consumption and shift work did not change significantly in the two consecutive years (p>0.1), and the mean weight of participants did not change significantly during the two-year follow-up period (p>0.05). The following outcomes were evaluated for assessing the impact of the interventions: percentage of spirometry tests fulfilling the ATS/ERS recommendations; decline in most frequent spirometry test and interpretation errors; and the percentage of improvement in spirometric values (FEV1, FVC, FEV1%).

Table 1 shows descriptive statistics of the spirometry tests before and after the course. The most common error in year 1 was FVC >6 seconds or a 1 second plateau. Table 2 shows the changes in mean spirometric values. The mean of all spirometric values in year 2 increased significantly compared with the same values in year 1 (p<0.001).

Table 1 Descriptive statistics of spirometry tests that had acceptability and repeatability criteria before and after the course (N=1,004 tests)
Table 2 Spirometry test values of participants in study years 1 and 2

In this study, GPs had errors in terms of repeatability and acceptability of the tests and also wrong interpretation. A total of 149 tests (14.8%) met all the acceptability criteria in year 1, and this rate reached 706 tests (70.3%) in year 2 (p<0.05). The rate of correct interpretations by physicians also increased from 531 (52.9%) in year 1 to 910 (90.6%) in year 2 (p<0.05). However, no significant differences were detected in repeatability of the tests between year 1 (958 tests, 98.1%) and year 2 (999 tests, 99.5%). Table 3 shows the details of incorrect interpretations. The only significant reduction occurred in restrictive interpretation (p<0.05, Table 3). We also evaluated spirometry tests that were conducted and interpreted accurately. The rate of correct spirometry tests increased significantly in the year after the workshop (p<0.05). In order to evaluate other factors that may affect the educational programme results, data from the first year were compared with those of the previous year. No significant difference was found in the main observed outcomes (acceptability and repeatability percentage, FEV1, FVC, and FEV1/FVC).

Table 3 General practitioners' interpretations and rate of correct interpretation before and after the course (N=1,004 tests)

Finally, logistic regression analysis was used for more precise evaluation of the relationship between good quality spirometry testing (i.e. spirometry tests with acceptability and repeatability criteria in addition to correct interpretation) and the training course. The analysis showed that, even after adjusting confounding factors for technicians, GPs, and workers, there was a significant relationship between a good quality spirometry test and the training course (p<0.001, Table 4).

Table 4 Association between good quality of spirometry tests and study variables using logistic regression analysis (2008 tests)

Discussion

Main findings

In this study we evaluated the quality of spirometry tests and their interpretation in two years in an industrial company. In high-risk occupations, it is recommended that precise spirometric test results — especially longitudinal data — are obtained periodically in order to prevent respiratory conditions among personnel.8,22 To achieve the maximum potential of spirometry for detecting respiratory conditions among the workforce, it is necessary to reach an acceptable test quality. This study is the first to evaluate the quality of periodic spirometry tests and its improvement in the occupational setting in Iran. No systematic training or retraining programmes regarding the conduction and interpretation of periodic spirometry tests are available in Iran. In this study only 14.8% of spirometry tests met the ATS acceptability criteria. Insufficient education and training, or not paying enough attention by technicians and GPs, are among the possible reasons. The present study results demonstrated that, after educational workshops, the rate of acceptable spirometry tests increased from 14.8% to 70.3%. These findings confirm that proper training and supervision can improve test results. We cannot define the exact role of each of these factors, but training may play a more prominent role than improved supervision.

Interpretation of findings in relation to previously published work

The Lung Health Study showed that only 2.1% of participants were unable to produce three acceptable FVC manoeuvres.23 Our results showed the necessity of training and retraining workshops in the occupational setting in Iran. Pre- and post-workshop written assessments also showed significant improvement in participants' performance. Eaton et al.18 demonstrated that, due to the possibility of forgetting some information, continuous training is necessary for preserving the quality of spirometry tests. We therefore provided participants with a written instruction manual and attached a poster to the wall to reduce the risk of forgetting the criteria. Additionally, a retraining workshop based on NIOSH standards was scheduled to be held six months after the first workshop.

In this study, 52.9% of interpretations were correct before the workshop. This finding was similar to that in the study by Eaton et al.18 After the training workshop the rate of correct interpretations increased up to 90.6% (Table 3). This finding clearly shows the need for training in both conduction of spirometry testing and interpretation of the results. High rates of misclassification (false positives and false negatives) in the occupational setting can lead to unacceptable results. False positives can result in unnecessary secondary care referrals, wasting the time of the workforce and imposing unnecessary costs. On the other hand, false negatives can cause failure in the early diagnosis of occupational lung diseases.

This study showed that, after the course, mean FEV1 values improved more than mean FVC values, suggesting that the improved technique yielded deeper inhalations. FEV1 increases by taking deeper breaths prior to forced exhalations while FVC increases by taking deeper breaths and longer exhalation times (reaching a volume-time plateau).

Our results showed that the prevalence of restriction was low (1.6% even after the course) despite the workplace exposures and the high prevalence of obesity among the workers.

An inappropriate reference equation for the lower limit of FVC may explain this finding. Another possibility is that smoking and occupational exposures may have resulted in manifestation of an obstructive pattern. The healthy worker effect may also explain a portion of this finding. Additionally, since the 5th percentile lower limit of normal (LLN) was not available for our population, we used FEV1/FVC <0.70 and FVC <80% predicted to define abnormal spirometry results. Miller et al. showed that using fixed thresholds (instead of 5th percentile LLN) can result in >20% misdiagnosis in COPD patients.24

Strengths and limitations of the study

This study has some limitations. First, a small group of technicians and GPs participated in the study and we cannot claim that they are truly representative of this group of healthcare workers in society. Second, other studies have shown that, in relatively healthy working populations, the level of data precision can reach 4% or less.9 Also, in the BOLD study performed in 14 countries, 10% of participants failed to perform a proper spirometry test that met the ATS/ERS criteria.25 In our study, data precision after the training courses was lower. Other factors such as the aptitude of the spirometry tester may affect our results and can be the subject of further investigations.

One strength of this study is that it is the first study in this subject group in Iran. The study evaluated the critical role of education in improving the quality of spirometry tests. Other approaches, such as paying only for good quality spirometry tests, may be more effective. Further research in this respect is required to obtain insight on this issue.

Conclusions

The results of this study show that the validity and quality of spirometric tests conducted in an industrial company in Iran were unacceptable, but they improved significantly following the NOSH-approved training intervention. The study demonstrated the lack of a systematic guideline for conducting spirometry tests and interpreting the results in an occupational setting in Iran, and emphasised the need for a nationwide programme to improve the quality of spirometry tests in the occupational setting.