Introduction

Chronic obstructive pulmonary disease (COPD) is the leading cause of morbidity and mortality worldwide. COPD accounts for more than half of health expenditures spent for respiratory disease (Forum of International Respiratory Societies. The Global Impact of Respiratory Disease. 2nd ed. Sheffield: European Respiratory Society, Forum of International Respiratory Societies; 2017). COPD is the third leading cause of death in the United States1, and WHO predicts that COPD will account for the third largest cause of death in 20302. The prevalence of COPD continues to increase in the twenty-first century; it will be particularly severe in Asia owing to air pollution3. Prevalence of COPD is affected by methods, definition, and survey populations. Estimated COPD prevalence in the Asia Pacific region was 6.3%, but the WHO reported it to be 3.9%4. In the systematic review, COPD prevalence was estimated to be 7.8–14.1%, and as per the Western Pacific Regional Office of WHO, it was 10%. To accurately determine COPD prevalence, clinically proven tools such as pulmonary function tests (PFT) are needed. In a few countries, in approximately 10% patients, COPD was measured by actual airflow obstruction5.

Underdiagnosis of COPD is a public health concern6. COPD prevalence in South Korea in 2015 ranged from 13.1 to 14.6%7. Although Korea achieved universal health coverage via the mandatory National Health Insurance, among those with COPD as determined by spirometry, only 2.8% were diagnosed as COPD by physicians and the treatment rate was 1.6%8. Perhaps in countries without universal health insurance or in developing countries, the problem of underdiagnosis is higher9. COPD will be underestimated, but there is insufficient research on how underestimated it is. Despite being COPD patients, many individuals are unaware of their illness, and this group is a major public health threat. Therefore, these groups should be considered as potential risk groups. Although there are numerous studies related to COPD, the majority of them are based on small sample sizes or are limited to national studies that only measure prevalence through spirometry10; therefore, larger-scale studies are still needed.

Identifying the socioeconomic group with high COPD prevalence and determining those who are unaware of COPD are important. To decrease the rate of underdiagnosed COPD and to plan strategies for efficient screening for disease management, we investigated the national-level prevalence of COPD by airflow obstruction. First, we confirmed the trend of COPD prevalence by year by PFT. Second, we determined the proportion of patients diagnosed by doctors and those who were unaware of their COPD. Lastly, we analyzed the factors associated with COPD prevalence and potential high-risk group characteristics.

Methods

Study design and population

We used the Korea National Health and Nutrition Health Examination Survey (KNHANES) from 2010 to 2017. KNHANES surveys about 10,000 people each year by stratified multistage sampling design to show the representativeness of national indicators. The survey is conducted in three areas: health surveys like smoking and drinking; nutrition surveys like diet and nutrition; and physical examinations like measurement of height, weight, blood, and urine and PFT. We performed PFT in adults aged > 40 years, based on 8 years of data. Of the 35,745 individuals aged > 40 years, 24,454 were selected as the study subjects, excluding 8239 who did not receive a PFT and 3052 with screening or run-in failures despite receiving PFT. The definition of ‘COPD diagnosis’ is based on spirometric screening for the all participants. Via consensus and validation for the decades, the Global Initiative for Chronic Obstructive Lung Disease (GOLD) defined COPD by a fixed forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC) of 0.70, in 200111. We included (1) those with FEV1/FVC of < 0.70 by GOLD standard defined as the COPD prevalence group and (2) group of patients diagnosed with COPD by doctors. (3) We defined potential high-risk among those with a FEV1/FVC ratio of < 0.70 who have not been diagnosed with COPD and other respiratory-related diseases such as TB, asthma, lung cancer. This is because the subjects who are not diagnosed with other respiratory diseases are likely to be unmanaged despite having COPD. Of the 24,454 participants, 3469 were classified into the COPD prevalence groups; of these, 168 were diagnosed with COPD by doctors and 335 were diagnosed with other respiratory-related diseases. The 2946 were classified as the potential high-risk group (Fig. 1).

Figure 1
figure 1

Flow diagram for study subject. COPD chronic obstructive pulmonary disease, KNHANES Korea National Health and Nutrition Health Examination Survey, FEV1 forced expiratory volume in one second, FVC forced vital capacity.

Measurement

The survey data and biological samples were collected by trained staff in two mobile clinic vehicles. The first mobile clinic was designed for physical examinations like measuring lung function. PFT was performed using the Dry rolling seal spiro, and then changed to the Vyntus spiro from June 2016. The test was performed by a trained professional, and after daily inspection, the analysis was conducted by the researcher in charge of the quality assessment on the day of the examination according to the refined ABCD assessment tool of GOLD guidelines12. PFT (Vmax Model 2130; SensorMedics, Yorba Linda, CA, USA) was performed and assessed on the basis of the criteria of the American Thoracic Society and European Respiratory Society. Participants were then moved to the second mobile clinic to receive a health questionnaire. The survey was conducted face-to-face, and the responses are stored real time on tablet PC by the researcher. About 1 week after the two-step mobile clinic survey was completed, a nutritionist visited their households to conduct a nutrition survey.

Variables

PFT measured FEV1 and FVC. FEV1/FVC ratio of < 0.70 was defined as COPD. COPD is closely related to age, sex10,13, and socioeconomic status such as income, education, job status, and residential area3,14. We classified income into quartiles. Regarding job status, white collar comprised individuals working in customer service area, finance, medical professionals, and office workers; blue collar comprised those in agriculture, forestry, fishery, factory, and construction workers; and non-jobs comprised those currently without jobs, housewives, and students.

The regional area was classified into metro city, other urban city, and rural area. Regarding health behavior, we considered major variables that could affect respiratory health like current smoking status, pack year (PY) of cigarettes, high-risk drinking, and walking practice (Person who walked for 30 min at a time over the past week for 5 days)3,15,16. Smoking is a known cause of COPD3,17,18. Smokers who had smoked more than five packs (100 cigarettes) in their lifetime and who currently smoked were defined as current smokers, those smoking ≥ 20 cigarettes daily as heavy smokers, and those who had smoked in the past but did not smoke currently as ever smokers. If you smoke one pack (20 cigarettes) daily for 10 years, you get 10 PY. Ever smoker will get 20 PY if a smoker has smoked a pack daily for the past 20 years. We divided PY into quartiles as follows: 1Q < 8 PY, 2Q < 20 PY or less, 3Q < 32 PY or less, and 4Q ≥ 32 PY. Next, the heavy drinkers in men were defined as those who drank more than seven drinks twice a week and in women as those who drank more than five drinks twice a week. Walking practice was defined as a person who walked for 30 min at a time over the past week for 5 days. COPD is associated with various health outcomes19. We defined patients with hypertension, diabetes, and dyslipidemia as those with non-communicable diseases (NCD) who had been diagnosed by doctors or were currently taking medication for these diseases20, and experience of cancer (stomach, liver, colon, breast, cervix, etc.) patient21.

Statistical analysis

We showed the prevalence by year according to the COPD classification and proportion of patients with COPD, those with other respiratory diseases (TB, asthma, and lung cancer), and the potential high-risk group. Demographic analysis was conducted by dividing the patients into groups diagnosed by doctor and potential high-risk group. We performed multivariable logistic regression analysis by adjusting dependent variable to determine if each factor was related to COPD. All statistical analyses were weighted to reflect national representativeness according to the stratified multistage sampling design. All analyses were conducted using SAS for Windows 9.4.

Ethics approval and consent to participate

The Institutional Review Board of the Korea Disease Control and Prevention Agency (KDCA) reviewed and approved the KNHANES (IRB Nos. 2010-02CON-21-C, 2011-02CON-06-C, 2012-01EXP-01-2C, 2013-07CON-03-4C, 2013-12EXP-03-5, 2018-01-03-P-A). All participants were informed about the purpose and provided written consent at the beginning of this investigation. This study was conducted in accordance with the Declaration of Helsinki, and all of the materials used in the article were only publicly available data.

Results

COPD prevalence and proportion of potential high-risk group

The annual COPD prevalence increased from 13.1% in 2010 to 14.6% in 2012 followed by 13.3% in 2017. The prevalence of COPD diagnosed patients was 0.7% in 2010 and remained at 0.5–1.0% until 2017. The potential high-risk group for COPD was 11.7% in 2010 followed by 11.7% in 2017 after a slight increase and decrease (Table 1). In the COPD prevalence group, the number of patients diagnosed with COPD by doctors was 5.1% in 2010 followed by 3.9% in 2017 after a slight increase/decrease. The other respiratory-related disease diagnosed group that was not diagnosed with COPD comprised 10.0%–20.0%. The proportion of potential high-risk group that was not diagnosed with COPD and other respiratory diseases was 80.8% in 2010 and 78.1% in 2017 ranging between 73.0 and 86.2% (Fig. 2).

Table 1 Prevalence of COPD, diagnosed patient, and potential high-risk group in 2010–2017.
Figure 2
figure 2

Proportion of patients diagnosed with chronic obstructive pulmonary disease (COPD) and diagnosed with other respiratory-related diseases (tuberculosis, asthma, and lung cancer) diagnosed among COPD prevalence.

Demographic analysis

The subjects of this study were 24,454 individuals who received lung function tests. Among them, 41.3% were aged 50–59 years, 34.0% were aged 40–49 years, 14.2% were aged 70–79 years, 9.2% were aged 60–69 years, and 1.3% were aged ≥ 80 years. By gender, 51.9% of men and income were in the fourth quarter, which was similar. In terms of education level, the percentage of individuals with high school education was the highest (33.5%), followed by the above college. Occupation was slightly higher in non-job (35.8%) than white collar (33.2%) and blue (31.0%), and in residential area, city, metro city, and rural areas were in order. Regarding smoking status, never smoker was the most common with 56.0%, followed by ever smoker (23.8%), current smoker (11.0), and heavy smoker (9.2%). Among those who had smoking experience, 2Q (more than 8PY and less than 20PY) was the most frequent in smoking pack year, and 11.4% of all participants were heavy drinkers. The walking practice rate was 37.8%. Patients with non-communicable disease who were diagnosed with at least hypertension, diabetes mellitus, or dyslipidemia accounted for 37.7% and those diagnosed with cancer accounted for 7.7%.

The potential high-risk group comprised those aged > 80 years, higher prevalence in the group 4Q, male, with low income, elementary, non-job, ever smoker, and PY of cigarettes, the group with NCDs, and with high prevalence of cancer diagnosis. There was statistical significance. However, there was no statistical significance between heavy drinkers and walking practices. Among COPD diagnosed patients, those in their 1970s had the highest rate of diagnosis. The diagnosis rates were high in the group with male, non-jobs, ever smoker, and high smoking PY, and the group with NCD, which showed a similar tendency except for age in comparison to the potential high-risk group (Supplement Table S1).

Factors affecting COPD prevalence

Multivariable logistic regression showed that ever smokers in the COPD diagnosed patient group had a 2.38 times higher rate of diagnosis than never smokers. Furthermore, the group with PY of 4Q had a 2.81 times higher diagnosis rate than that of 1Q. The potential high-risk group had a higher prevalence of COPD in the 1950s (3.12), 1960s (9.04), 1970s (14.84), and those > 80 years (15.44) than those in the 1940s. The prevalence in males was 3.22 times higher than that in females in middle school graduates (1.28) and it was lower in elementary school graduates (1.42) than that in college graduates. The ever smokers had 1.32 times higher COPD prevalence, the current smokers had 2.22 times higher prevalence, and the heavy smokers had 2.31 times higher prevalence than the never smokers. Regarding PY, 3Q was 1.28 times higher and 4Q was 1.64 times higher than 1Q (Table 2).

Table 2 Multivariable logistic regression to determine the factors identifying subjects with COPD.

Discussion

In this study, patients diagnosed with COPD ranged between 0.5 and 1.0% in the last 8 years. However, those with COPD as per PFTs ranged from 13.1 to 14.6%, which was 10–20 times more than the actual number of patients diagnosed, which means that only 5% of all COPD patients were diagnosed with COPD by doctors. Especially the older age group, women, low-educated group, and current smokers who have been smoking for a long time are more likely to be in the high-risk group having a higher possibility to develop COPD but are not diagnosed with COPD appropriately. Although, only the diagnosis rate of COPD in ever smokers was 2.38 times higher than never smokers, indicating that a system is needed to screen and intervention for these groups.

As it is difficult to calculate COPD prevalence using PFT, various studies have estimated COPD prevalence. We showed a higher level than the existing predicted prevalence4,10,22. The study conducted by John23 estimated COPD prevalence in 12 countries using a mathematical model, and the prevalence in Korea was 6.2% at that time. As the year was different, it was difficult to directly compare it with the present study. The actual prevalence rate in 2010, which is the nearest year, was 13.1% that was about twice as high than expected, i.e., the number of actual COPD patients is likely to be higher than currently predicted. In other words, the burden of diseases caused by COPD worldwide is like to be higher than current forecasts. In our study, patients diagnosed with COPD ranged between 0.5 and 1.0% in the last 8 years. However, those with COPD as per PFTs ranged from 13.1 to 14.6%, which was 10–20 times more than the actual number of patients diagnosed, i.e., only 1/20 of all COPD patients were diagnosed with COPD by doctors. Excluding those diagnosed with respiratory diseases like TB, asthma, and lung cancer, eight in ten patients with COPD were likely to be unaware of COPD and other respiratory-related disease and to remain as the potential high-risk group without management or intervention. COPD is a chronic respiratory disease that needs to be managed. Airflow obstruction increases coronary events and mortality17,24 and contributes to deaths caused by respiratory diseases such as pneumonia25. Early COPD means early age onset (< 50 years) of COPD, whereas mild COPD represents mild airflow limitation (FEV1 ≥ 80% predicted)26. Early COPD accounts for 15% of COPD. However, its prognosis was poor with hazard ratio of 6.42 (95% CI 3.39–12.2) for hospitalization and 1.79 (1.28–2.52) for all-cause mortality27. Mild COPD causes accelerated FEV1 decline and increases mortality risk compared with that in those without COPD26. Although, early and mild COPD patients have substantial disease burden, their treatment strategy was limited because they are likely to be undiagnosed, thus not included in clinical research. Therefore, active COPD finding is recommended in patients with respiratory symptoms and/or risk factors28.

To prevent severe conditions and complications and to lower respiratory mortality, COPD should be detected at an early stage, managed29, and prioritized groups should be selected. Age, gender, and education are the most important factors related to COPD10,13. In our study, COPD prevalence was higher in older age groups, in men than in women, and at lower education levels. Particularly, those in their 1970s and those aged ≥ 80 years were about 15 times more likely to develop COPD than in their 1940s, men were three times more likely to develop COPD than women, and elementary school graduates were 1.5 times more likely to develop COPD than college graduates. However, patients with COPD were not statistically significant in terms of age; they have a high prevalence and are unlikely to be detected at an early stage and to be managed.

Although COPD prevalence was high in the ever, current, and heavy smokers, only the diagnosis rate of COPD in ever smokers was higher than never smokers, indicating that smoking is the most important cause of COPD30. This means that current and heavy smokers with COPD will continue to smoke unless any clinical or health issues is detected. COPD diagnosis provides the motivation to quit smoking31. Therefore, only ever smokers have a high diagnosis rate because smokers may have quit smoking after being diagnosed with COPD. However, in our study, most smokers were unaware that they had COPD and continued smoking. PY is the most powerful predictor for COPD32. In our study, the higher the PY, the higher the prevalence, and the 4Q group was diagnosed with COPD 2.8 times more than the 1Q group. As the lungs are irreversible once their functions are impaired, medical practitioners should encourage COPD patients who smoke to quit smoking33. However, this intervention is not easy because most patients with COPD are not detected.

In summary, social backgrounds may vary depending on countries and cultures, but previous studies3,17,34,35, and our studies have confirmed that the low socioeconomic group has a high prevalence of COPD. Therefore, when it is difficult to identify the COPD prevalence group, it would be universally valid to prioritize such low socioeconomic groups and smokers group. It has been proven that the older age group, women, low-educated group, and current smokers who have smoked for a long time are more likely to develop COPD. However, we found that these high-risk groups are not diagnosed with COPD in spite of being more likely to develop COPD. Early detection and management of COPD are necessary for personal health and for lowering the social burden of medical expenses36. The burden of medical expenses for COPD is a key challenge globally3,37, and the cost of medical treatment for COPD is rapidly increasing in Korea38,39. Therefore, there must be intervention from the public health viewpoint. The study by Kylie6 argued that screening in primary care needs to be further expanded since COPD is underdiagnosed. Accordingly, countries with health insurance as social insurance should actively consider including PFT in the national benefit services. Moreover, trainings according to clinical guidelines are necessary because primary care practitioners do not have sufficient knowledge and skills to diagnose and treat COPD40, and there is a need to strengthen the competencies of the primary medical-oriented COPD management system35.

COPD, a public health threat since 2000s, has been claimed to be intervened and managed4. The management of respiratory diseases like COPD will become more important in the future41. Nevertheless, there is still no important consensus. The results of our study further consolidated the previous evidence that COPD could be a significant threat to health problems and highlighted the need for further studies.

Strength and limitations

According to the systematic review, while there are numerous studies related to COPD, only 0.3% of them have used spirometry to measure COPD, and population-based studies may have even fewer10. Therefore, our study has the advantage of using large-scale clinical data from 25,000 individuals assessed by a reliable diagnostic method, which can be considered as actual clinical evidence. While COPD should be diagnosed and managed, it is difficult to find studies that focus on identifying hidden high-risk groups that remain unmanaged. Under these circumstances, our study is among the first to identify the size and characteristics of the potential high-risk population. This study also has several limitations. First, as a cross-sectional study, it only shows the association with each variable and does not explain the causal relationship. When smokers are diagnosed with COPD, it is not possible to examine whether they quit smoking. Second, fine dust and indoor air pollution recently increased the incidence of COPD42,43, but factors related to environmental pollution were not controlled in our study. Therefore, studies that consider environmental factors are needed in the future.

Conclusions

It has been proven that the older age group, women, low-educated group, and current smokers who have smoked for a long time are more likely to develop COPD. However, we found that these high-risk groups are not diagnosed with COPD despite being more likely to develop COPD.