Association of cancer screening and residing in a coal-polluted East Asian region with overall survival of lung cancer patients: a retrospective cohort study

Yang, Runxiang; He, Ming; Wang, Dongmei; Ye, Rongrong; Li, Lu; Deng, Rouyu; Shah, Mohsin; Yeung, Sai-Ching Jim

doi:10.1038/s41598-020-74082-0

Download PDF

Article
Open access
Published: 15 October 2020

Association of cancer screening and residing in a coal-polluted East Asian region with overall survival of lung cancer patients: a retrospective cohort study

Runxiang Yang¹,
Ming He¹,
Dongmei Wang¹,
Rongrong Ye¹,
Lu Li²,
Rouyu Deng¹,
Mohsin Shah³ &
…
Sai-Ching Jim Yeung⁴

Scientific Reports volume 10, Article number: 17432 (2020) Cite this article

1204 Accesses
1 Citations
Metrics details

Subjects

Abstract

Lung cancer is the leading cause of cancer death worldwide. The Xuanwei-Fuyuan (XF) region of Yunnan, China has a high incidence of lung cancer from coal-related pollution. Effort to raise public awareness screening for lung cancer has been ongoing. We retrospectively analyzed overall survival (OS) of lung cancer patients of a tertiary cancer center in Yunnan to investigate screening and regional residential status as predictive factors. Consecutive cases of newly diagnosed lung cancer were reviewed. The lung cancer cases diagnosed by screening were more likely to be early-staged and treated by surgery than those diagnosed not by screening. In patients diagnosed not by screening, XF residential status was a significant predictor of improved OS. Frailty model detected significant heterogeneity associated with region of residence in unscreened patients. Potential biases associated with screening were examined by Monte Carlo simulations and sensitivity analyses. Focused effort in cancer screening and increased public awareness of pollution-related lung cancer in XF might have led to early diagnosis and improved OS, and increased investment in health care resources in high risk areas may have produced additional unobserved factors that underlay the association of XF residential status with improved OS in patients diagnosed not by screening.

Incidence trends and spatial distributions of lung adenocarcinoma and squamous cell carcinoma in Taiwan

Article Open access 30 January 2023

Occupation as a risk factor of small cell lung cancer

Article Open access 23 March 2023

Trends in lung cancer incidence by age, sex and histology from 2012 to 2025 in Catalonia (Spain)

Article Open access 02 December 2021

Introduction

Lung cancer remains the leading cause of cancer mortality worldwide with an estimated 1.4 million deaths annually and a 5-year survival of 17%^1,2. It accounts for 20% of cancer-related deaths in America and ranks first in incidence and mortality rates for both males and females in China³. Tobacco smoking is the primary risk factor for developing lung cancer; other factors include occupational exposures (e.g. asbestos, silica and chromium), environmental tobacco smoke, indoor coal/wood emissions, radon, family history, and pulmonary fibrosis^4,5,6. Particularly, there is an increased prevalence of adenocarcinoma in Asian non-smokers, especially females, compared with western countries⁶. Geographic patterns in cancer occurrence provide clues to the role of environmental or lifestyle factors affecting cancer risk⁷. Moreover, population-based cancer survival is an important index that assists in evaluating the overall efficiency of cancer health services in a region⁷.

Yunnan, a southwestern province of China, has a lung cancer incidence rate which is twice that of the whole China². In Yunnan, Xuanwei and Fuyuan (XF) are two neighboring counties where coal burning is ubiquitous (infamily cooking, heating and industrial production, etc.). Polycyclic aromatic hydrocarbons and other suspended particles (e.g., nano-quartz) produced by bituminous coal combustion lead to serious air pollution^3,8. Environmental pollution is perhaps the most important factor in lung carcinogenesis in XF residents⁹. Since the 1970s, various governmental efforts tried to combat pollution-related lung cancer in XF residents. The change from unvented fire pits to stoves with chimneys reduced indoor air pollution by more than 65%, and was associated with reduction in lung cancer incidence 10 years after the change¹⁰. In addition to environmental measures, governmental efforts to strengthen the lung cancer screening, early diagnosis and early treatment were undertaken^11,12. Although low-dose computed tomography (CT) lung screening may decrease lung cancer mortality by diagnosing in early stages¹³, the efficiency of lung cancer screening may depend on the selection of patients for screening¹⁴ and the criteria used in the interpretation of radiological images^15,16. In spite of screening efforts, the mortality rates in Xuanwei based on the data from 2004 to 2005¹⁷ and those from 2011 to 2015¹⁸ were similar. Improvement in early diagnosis and lung cancer mortality was expected to lag behind sustained public health efforts of lung cancer screening. We hypothesized that lung cancer cases diagnosed by screening would be diagnosed at earlier stages and have better prognosis than cases diagnosed not by screening.

Retrospective analysis of cohorts in which screening have taken place is complicated by surveillance (detection) bias, lead time bias and length bias. Surveillance bias primarily affect the analysis of cancer risk, and is difficult to mitigate in retrospective observational studies¹⁹. The estimated mean lead time for overall survival of lung cancer is 3.4 months for stages I and II and ≤ 1 month for stages III and IV²⁰. We investigated whether cancer screening is still a significant factor for improved overall survival after correction for length bias or lead time bias.

Material and methods

Participants, study and design

The study was approved by the Yunnan Cancer Hospital Institutional Review Board, and was carried out according to the research protocol in compliance with the Declaration of Helsinki. Yunnan Cancer Hospital is the provincial cancer center and the tertiary referral center for cancer patients in Yunnan province²¹, and the patient population comes from and is representative of all over Yunnan. The cohort for the current study consisted of consecutive new cases of lung cancer patients at Yunnan Cancer Hospital from January 1, 2012 to July 10, 2015. Participants were excluded from the study: (i) if they were not residents of Yunnan, (ii) carcinoma in situ (stage 0), or (iii) had received cancer treatments (radiation, chemotherapy or surgery) at other hospitals. A total of 3859 cases were then reviewed and proceeded to telephone follow up. The informed consent for telephone follow up was obtained verbally from all subjects with successful follow up because obtaining a written consent was not feasible during telephone follow up, and this was approved by the Institutional Review Board. All participants were over the age of 19 years.

Data variables

Each address was confirmed, and the classification whether it belonged to XF was based on the postal code, and the geographic location of residence at the time of diagnosis was confirmed by plotting the latitude and longitude of each address (obtained using “Geocode”) on a map. Clinical chart review and data abstraction were performed by experienced oncologists. Demographic (ethnicity (Han vs. non-Han), gender, age at diagnosis), Clinicopathological characteristics of lung cancer, cancer stage based on the American Joint Committee on Cancer (AJCC) TNM staging system, cancer treatments, body mass index (BMI), Karnofsky performance score (KPS), comorbidities and clinic follow up information were abstracted from clinical records. BMI was categorized for analysis according to Asian-Pacific recommendations²². Comorbidity information was used for calculating the age-unadjusted Charlson Comorbidity Index (CCI). Data were also collected about the presenting symptoms (i.e., fever, cough, sputum, blood-tinged sputum, hemoptysis, chest pain, chest pressure, dyspnea on exertion, dyspnea at rest, skeletal symptoms, gastrointestinal symptoms, and neurological symptoms), and symptoms deemed relevant to lung cancer by the chart reviewers were recorded. The group of patients diagnosed by screening was defined as the asymptomatic patients who were discovered to have pulmonary lesions in chest radiographs or CT scan of the chest during routine checkup visits or cancer screening. Clinical outcome was obtained by telephone follow up to obtain the date of death from surviving family members, and for cases who were still alive, they were censored at the date of telephone follow up. There were 454 (11.7%) patients that did not have a successful telephone follow up.

Statistical analysis

Descriptive statistics were presented as frequencies and percentages for the categorical variables. Student’s t test or nonparametric rank sum test was used to compare two groups where appropriate. Ratios and proportions were compared by Pearson Chi-square test. Kaplan–Meier method was used to analyze overall survival (OS) rates with log-rank test to determine statistical differences between groups. Random survival forest (R package “ranger”) was performed to evaluate the relative importance of various factors in predicting OS. Multivariate Cox proportional hazard models were chosen a priori with consideration of results from random survival forest. Multicollinearity was assessed by the variance inflation factor (VIF) of each covariate. Hazard ratios (HRs) were reported with their 95% confidence intervals (CIs). The Cox models were validated by examining scaled Schoenfeld residuals and Martingale residuals (R package “ggcoxdiagnostics”). Latent factors affecting OS were assessed using shared frailty models (R package “survival”). Sensitivity analysis of Cox regression model was performed using the R package “obsSens”.

Correction of relative risk of death for length time bias and correction of survival time for lead time were based on the method by Duffy et al.²³ that assumed an exponentially distributed sojourn time²⁴. For each scenario of a specified mean rate of transition to symptomatic disease λ and its standard deviation, the observed overall survival time of each screen-detected lung cancer case was corrected by subtracting a lead time [E(s)] calculated using the published formula [E(s) = (1−e^-λt)/λ]²³, and randomized λ for the screen-detected cases (using the rnorm() function in R) for the specified mean and standard deviation (SD) of λ, Cox regression analysis of each data set after correction of lead time for screened cases was performed. The probability of obtaining P > 0.05 for the association of screening and XF residential status with OS out of 1000 randomizations were calculated for each combination of mean and SD of λ. The estimated true relative hazard after correcting for length time bias (Ψ) was calculated using formulae by Duffy et al.²³, with the following assumptions: (1) the survival function is exponential²⁴; (2) there exists a non-aggressive cancer subtype that is more likely to be screen-detected than the rest of the cancers. In the calculation of estimated true relative risk of death independent of length bias for screen-diagnosed cases compared to symptomatic cases (φ), p1 was assumed to be the observed 5-year case fatality for symptomatic cancers, p2 was assumed to be the observed 5-year case fatality for screen-detected lung cancers, and p3 was the observed probability of screen detection in the entire cohort. Using the above assumed values for p1, p2 and p3, and Ψ was calculated for random combinations of values (between 0 and 1) of the ratio of probability of death from aggressive cancer subtype to that from the non-aggressive cancer subtype (θ) and the proportion of aggressive cancer subtype in the whole cohort (q). The R package “MonteCarlo” was used to perform the calculations.

Statistical analysis was performed using Statistical Package for the Social Sciences (SPSS) version 20.0 (IBM Analytics, USA) and R statistical software (version 3.6.2, R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/). Except when Bonferroni correction for multiple testing was applied, P < 0.05 was considered statistically significant.

Ethical approval

Institutional IRB of the Yunnan Cancer Hospital.

Results

Demographic and Clinicopathological Characteristics

Between 1/1/2012 and 7/10/2015, 7740 consecutive admissions of newly discovered lung masses or newly diagnosed lung cancer were identified from the hospital database. After applying the aforementioned exclusion criteria, 3859 cases were reviewed and followed up by telephone (Supplemental Fig. 1). The number of patients from XF or non-XF regions and their ratios did not vary a lot over the years of the study (Supplemental Table 1). Among these, 3405 reviewed cases of newly diagnosed lung cancer with successful telephone follow up were analyzed. There were 13.9% of patients in the XF group, who were unable to be contacted by telephone or refused to participate in the study follow up, and 12.6% of patients such patients in the non-XF group. The rates of unsuccessful telephone follow up were very similar for both groups.

Table 1 Demographic and clinicopathological characteristics.

Full size table

There were 266 (7.8%) cases diagnosed by lung cancer screening (123 in XF residents and 143 in non-XF residents). There were 698 cases (20.4%) from XF. The longitude and latitude of the addresses of XF patients were verified by plotting the address of residence of each case on the map (Supplemental Fig. 2). The demographic and clinicopathological characteristics were summarized in Table 1.

Patients diagnosed by screening accounted for 27.4% of all stage I patients, and 46.6% of stage I patients diagnosed by screening resided in XF. Almost all (98.4%) of stage I patients underwent surgical resection of the primary cancer. In contrast, the majority of the patients not diagnosed through screening had stage IV cancer.

The male:female ratio in non-XF patients diagnosed not by screening was 2.62:1 while that ratio in XF patients diagnosed by screening was 1.37:1. The percentage of XF patients diagnosed by screening was 17.6%, which was higher than non-XF patients (5.3%; P < 0.001). The distributions of histopathological types were different (P < 0.001) between the two regions with adenocarcinoma accounting for 63.9% of XF cases. A higher percentage of patients diagnosed by screening were diagnosed in stage I and II (XF: 65%, non-XF: 54.6%) than patients diagnosed not through screening (XF: 32.7%, non-XF: 14.8%) (P < 0.001). Concordantly, there were higher percentages of patients treated with surgery in patients diagnosed by screening and XF patients than patients diagnosed not by screening and non-XF patients (P < 0.001).

In summary, patients diagnosed by screening were more likely to be diagnosed in early stages (I and II). There were more female patients with adenocarcinoma of the lung in XF than other parts of Yunnan. XF lung cancer patients were more likely to be diagnosed by screening and were diagnosed at earlier stages than non-XF patients. The patients diagnosed by screening were diagnosed at younger ages, had less comorbidity and were more likely to undergo surgical treatment than patients diagnosed not by screening.

Survival analyses

Examination of screening and XF residential status as predictive factors of OS.

We first examined whether there were differences in OS associated with lung cancer screening and XF residency status. Kaplan–Meier analysis showed that the OSs of screened patients (median not reached for both XF and non-XF patients) were higher than those of patients not diagnosed by screening (unscreened non-XF patients: median OS = 463 days, unscreened XF patients: median OS = 1218 days) (Fig. 1, log rank test: P < 0.001). The 5-year survival rates were 72.7% for screened XF patients, 66.3% for screened non-XF patients, 46.2% for unscreened XF patients, 22.5% for unscreened non-XF patients. A random forest machine learning strategy (“ranger” R package) was used to identify important covariates among a total of 49 (including demographic and social characteristics, clinicopathological characteristics, cancer treatment modalities, screening, presenting symptoms and comorbidities)²⁵. The random survival forest method obviates imposition of semi-parametric or parametric constraints and automatically addresses interactions among variables to predict survival accurately²⁶. The Kaplan–Meier, Cox proportional hazard and random survival forest methods produce very similar models of OS (Fig. 2A), and XF residential status was among the top twelve predictive factors based on the “relative importance” values calculated in the random forest method (Fig. 2B).

Cox proportional hazard regression analysis

Given the above findings about the differences between diagnosis by screening and geographic regions of residence (XF vs. non-XF) patients in terms of lung cancer histology, age at diagnosis, CCI and stage of cancer, it was clear that XF residents were more likely to have lung cancer screening and were diagnosed at younger ages, had less comorbidity and were more likely to undergo surgical treatment than non-XF patients. In addition to residency status and screening, we used univariate Cox proportional hazard analysis for the following variables: diagnosis by screening (yes vs. no), XF residency (yes vs. no), demographic factors (age > 65 years, sex, ethnicity, smoking), clinicopathological factors (age-unadjusted CCI > 3, KPS > 70, BMI categories for the Asian and Pacific population, surgery, radiotherapy, chemotherapy, targeted therapy, cancer histology, and TNM stage) (Table 2). To test the hypothesis that XF residency status and screening were independent predictors of OS, we examined a multivariate Cox regression model constructed a priori with these variables. Age at diagnosis > 65 years, male sex, non-Han ethnicity, higher cancer stages were associated with poor OS and KPS > 70, surgery, radiotherapy, chemotherapy, targeted therapy, diagnosis by screening and XF residency were associated with improved OS (Table 2). Proportional hazard assumption was checked as described in Methods and was not violated (data not shown). Therefore, XF residency status and screening both appeared to be independent predictors of improved OS.

Table 2 Univariate and multivariate Cox proportional hazard analysis for predictors of overall survival.

Full size table

Evaluation of potential bias introduced by lung cancer screening

Sensitivity analysis of the Cox model was performed by excluding patients diagnosed by screening or vice versa. XF residency status was not a predictor of OS in the patients diagnosed by screening, but in the group diagnosed by screening, residing in XF was still associated with improved OS (HR = 0.792, 95%CI: 0.688–0.913, P = 0.001) (Supplemental Table 2). Simple bias analysis for unmeasured residual confounders was performed to see how the estimates of HR for XR residency status and screening in the Cox model would change by adding on an unmeasured confounder with HR of 0.7, 0.8 or 0.9 (Supplemental Tables 3 and 4). In most situations, patients who were diagnosed by screening or were residing in XF had improved OS in spite of the presence of a favorable unmeasured confounder.

To explore the potential impact of lead time bias introduced by lung cancer screening on OS of lung cancer patients, we attempted to correct for lead time bias in two ways. First, we tried to subtract an estimated mean lead time from the survival time of patients diagnosed by screening. The mean lead time was estimated to be 3.4 months for OS of stages one and two lung cancer and ≤ 1 month for stages three and four²⁰. By subtracting 100 days if stage 1 or 2 and subtracting 30 days if stage 3 or 4 from the survival time of patients diagnosed by screening, the same multivariate Cox regression model in Table 2 using the corrected dataset still showed that both XF residency status (HR = 0.789, 95% CI: 0.687–0.906, P < 0.001) and screening (HR = 0.677, 95% CI: 0.511–0.898, P = 0.007) were associated with improved OS, and this method of correcting for lead time did not nullify the survival advantage. Second, we tried to estimate lead time using the formula of Duffy et al.: E(s) = (1−e^{(-λ t)})/ λ, where λ is the a rate of transition to symptomatic disease (i.e., mean sojourn time = 1/λ), assuming an exponential distribution of the sojourn time (8). We corrected for lead time by subtracting E(s) from the observed survival time or time to last follow-up of patients diagnosed by screening. E(s) for each screened patient in the cohort was randomly generated using the rnorm function in the R package Compositions with specified mean and standard deviation for λ. For each pair mean and standard deviation for λ, Monte Carlo simulation of the same multivariate Cox model in Table 2 was performed 1000 times to assess the probability of obtaining a P > 0.05 for screening or XF residency status. The simulation covered values ranging from 0.05 to 1 for both mean and standard deviation for λ. Lead-time-corrected survival for the patients diagnosed by screening was expected cancers as compared with symptomatic cancers. While the survival of unscreened patient remained the same, lead time correction would lead to a lower survival estimate in the patients diagnosed by screening. By Monte Carlo simulation, we assessed the probability of obtaining a non-significant (P ≥ 0.05) HR for screening in the Cox regression model for combinations of mean and standard deviation of λ. The highest probability was < 0.006% (Fig. 3A). The HR for screening ranged from 0.642 to 0.654 (Fig. 3B). Similar results were obtained for the probability of obtaining a non-significant HR for XF residency status (Fig. 3C). The HR for XF residency status ranged from 0.792 to 0.800 (Fig. 3D). Therefore, it was very unlikely or not plausible for the benefits of screening or XF residency status on OS to be completely nullified by lead time bias.

To assess the potential impact of length bias, we used the formula of Duffy et al. to estimate the length bias-corrected relative hazard²³. We calculated the corrected results for q (the complement of the size of the group causing length bias) ranging from 0 to 1 and θ (the relative rate of screen detection and fatality in the length-bias group) also ranging from 0 to 1. The probability of 5-year case fatality for symptomatic tumors (p1) was 1829/3148 = 0.58, that probability for screen-detected tumors (p2) was 57/270 = 0.21, and the observed probability of diagnosis by screening (p3) was 270/3415 = 0.079. The corrected relative hazard for combinations of q and θ that had real solutions from the formulae were plotted (Fig. 4), and analyses showed that in order for the length bias to account completely for the survival benefit (i.e., corrected relative hazard = 1), the length-bias group would need to be 8 times more likely to be diagnosed by screening and 8 times less likely to cause death.Since the mean sojourn time was 2.24 years for lung cancer (Mayo Lung Project)²⁷, it was highly unlikely that length time bias could account for the entire difference in survival associated with screening.

Table 3 Shared frailty models for overall survival.

Full size table

Evaluation of a latent common group effect by shared frailty modeling

A shared frailty random effects model may account for the regional heterogeneity in survival data²⁸. After separating the cohort into subcohorts of patients diagnosed by screening or not, we used shared frailty models to examine for the presence of significant frailty term for covariates. The frailty term for XF residency modified the hazard multiplicatively and assigns each patient in XF or non-XF region the same level of frailty. The result was compared with the Cox models without a frailty term to determine the impact of clustering by regions of residence. Models using gamma, gaussian or t distributions all yielded similar results, and only results using gamma distribution were shown in Table 3. There was no frailty associated with residence in XF in patients diagnosed by screening, but there was significant (P = 0.0021) frailty associated with XF residency status in patients diagnosed not by screening. In the shared frailty models for patients diagnosed by screening or those diagnosed by screening, patients in a cluster (residing in XF or residing elsewhere in Yunnan) were assumed to share the same unmeasured/unobserved risk factor (frailty). Therefore, while there was no significant unmeasured heterogeneity between the clusters (XF vs non-XF), there was significant unmeasured heterogeneity between XF vs. non-XF residents that cannot be explained by observed covariates alone.

Discussion

The burden of lung cancer worldwide is increasing from smoking and air pollution²⁹. In countries like India, China and regions that are still using biomass fuels, females get lung cancer from indoor air pollution due to cooking fumes and poor ventilation^30,31. In contrast to decreasing lung cancer incidence rates in Western countries, lung cancer incidence rates are still rising in China, other Asian countries in and Africa^30,32^. The 5-year lung cancer survival rates was 13% in 1975–1977 and 1984–1986 but did not improve by much to only 16% for 1999–2005³³. In 2015, the cancer data statistics from China showed that the lung cancer incidence and mortality rate were both the highest among cancers for both sexes⁹. The XF region in Yunnan Province (a province in southwestern China) is a high-incidence area for lung cancer due to air pollution from burning of bituminous coal and geographic factors. Efforts for lung cancer screening, raising public awareness and reduction of environmental exposure (e.g., vented stoves) have been ongoing to try to improve lung cancer mortality.

The current study used institutional data from a regional cancer center to examine lung cancer survival in Yunnan. This retrospective cohort consisted of newly diagnosed cases of lung cancer in Yunnan Cancer Hospital from January 1st, 2012 to July 10th, 2015, a cohort more recent when compared to other studies³³. The relatively low male: female ratio and the high percentage of adenocarcinoma among cancer histopathological types perhaps reflected the dominance of coal-related air pollution in lung carcinogenesis in XF. We found that the XF lung cancer patients were more likely to be diagnosed by screening than non-XF patients. Both the XF and non-XF patients had 5-year survival rates that were markedly higher than reported about a decade ago³³. They were younger, in better general health conditions, in earlier malignancy stages, and more likely to undergo surgery for treatment than non-XF patients. These observations were likely due to prior years of effort in lung cancer screening and raising public awareness about lung cancer.

XF lung cancer patients appeared to have better OS than non-XF patients. Kaplan–Meier analysis of OS showed that XF patients survive longer (difference in median survival > 3.5 years) than non-XF patients. This finding was confirmed by univariate proportional hazard analysis, i.e., 42% reduction in hazard for XF patients. Multivariate analysis of OS using Cox proportional hazard modeling showed that XF patients were less likely to die than non-XF patients after adjusting for demographic, clinicopathological, and treatment factors. The covariates in the Cox model included the top 12 most important factors identified by random forest survival analysis. In addition to residence in XF being a significant factor, the results from the Cox model for the entire cohort also showed that increased age, male sex, non-Han ethnic minority and increased stage were adverse risk factors and that being overweight, increased Karnofsky performance score, surgery, chemotherapy, radiotherapy and tyrosine kinase inhibitor targeted therapy were beneficial risk factors.

Lung cancer screening has been supported by the findings of the Dutch–Belgian NELSON (Nederlands-Leuven Longkanker Screenings Network) Randomized Lung Cancer Screening Trial³⁴ and the National Lung Screening Trial (NLST)³⁵. NELSON was the largest European lung cancer CT screening trial, and it confirmed the findings of the NLST. These trials and real world implementation in Taiwan^13,36 showed that screening can increase the proportion of lung cancer diagnosed in early stages with curative possibility. Lung cancer screening policies vary from country to country and region to region; outside the setting of a formal clinical trial, there is little data to show that real-world screening effort (governmental or civilian) do make a difference in survival in general. Moreover, the patient’s decision to undergo lung cancer screening is likely to be influenced by availability, out-of-pocket/financial cost and self-awareness of lung cancer risk³⁷, which will vary region-to-region due to local prevalence of lung cancer, presence of risk factors, public awareness, and promotion of lung cancer screening (both governmental, health care industry, or philanthropic).

Cancer screening introduces biases that complicate interpretations of survival analysis. Ongoing lung cancer screening effort has focused on high risk regions that included XF, including grants funded by the government^38,39, provincial⁴⁰ and local⁴¹ hospital efforts, and collaboration among governmental agencies, private foundations and philanthropic groups⁴². The Chinese National Lung Cancer Screening Guidelines specifically mentioned the high risk group in XF^43,44. It was quite obvious that lung cancer screening effort had focused on the XF region since 16.8% of lung cancer cases in XF patients were diagnosed through screening compared with 5.0% in non-XF patients. Length bias is associated with slow growing tumors with a long presymptomatic screen-detectable duration, but lung cancers generally do not fit this profile of clinical progression, and our analysis using the method by Duffy et al.²³ showed that it was not plausible for length bias to account for all the survival benefits of screening in our data set. Lead time bias, however, is an important issue for in analysis of OS, which is defined as the time duration from diagnosis to death, because the lead time would artificially lengthen OS in screen-diagnosed cases. The mean lead time has been estimated to be 3.4 months for OS of early stage (I or II) lung cancer patients, and only ≤ 1 month for advanced stage (III or IV) lung cancer²⁰. Given the large difference in median survival between XF and non-XF patients (> 3.5 years) and our simulation of lead time bias, lead time bias could not nullify the survival advantage of screening in our data set. To examine the impact of lead time bias in survival analysis, sensitivity analysis by dividing the cohort into those that were diagnosed by screening or not showed that residence in XF was an independent factor associated with improved survival in XF residents but no longer a significant factor for OS in the subcohort diagnosed by screening.

In a frailty model, the random component accounts for association unknown factors and unobserved heterogeneity^45,46. A frailty is an unobserved random factor that modifies multiplicatively the risk of event occurrence of a cluster of individuals. In our analysis (Table 3), the lung cancer patients were clustered within their region of residence and possible correlation between patients within XF or non-XF were modeled with a shared frailty model, and we found that a significant unobserved random factor related to the region of residence was present in patients diagnosed not by screening but not in those diagnosed by screening. Therefore, other than screening, there remained important variations in lung cancer outcomes linked to region of residence. These unobserved factors potentially include socio-economic status, genetic prognostic factors or biological markers of prognosis, and variations in relation to local patterns of oncology follow up and care delivery^47,48,49, which may be influenced by increased governmental investment in health care resources⁵⁰.

A limitation of our study was that the data came from a single institution. Even though Yunnan Cancer Hospital is the only tertiary referral center for cancer care in Yunnan province, not all lung cancer patients in XF or non-XF regions would have sought care in this hospital. Although unsuccessful telephone follow up may possibly be associated with worse prognosis, the rates of unsuccessful follow up were essentially the same for XF and non-XF groups. Therefore, its impact would equally affect both the XF and non-XF groups and not bias one group against another. The sample size of lung cancer patients diagnosed by screening was relatively small. The patients who were diagnosed by screening in our study cohort were a mixture of patients diagnosed in real life clinical practice and participants in various lung cancer screening programs which might have participation rates of the target population to be screened that varied from 31.91% to 84.5%^51,52. Given the retrospective nature of our study, errors in data collection might have occurred although care was taken to independently confirm the data wherever possible (e.g., verification of region of residence using geocode and map plotting). Efforts are in progress to discourage smoking, decrease air pollution and continue cancer screening, and these will affect the generalizability and applicability of our results over time.

In conclusion, the improved survival for lung cancer patients in the XF region is the combined effect of higher percentage of patients diagnosed by screening and unobserved factors associated with the region of residence. The magnitude of the survival advantage could not be fully accounted for by lead time bias introduced by screening. In patients diagnosed by screening, the lung cancer was likely to be diagnosed at an early stage, and surgery was the most important and influential factor for survival. This is perhaps the first report that have shown that lung cancer screening effort has produced a beneficial effect on survival of lung cancer patients in Yunnan. Long term prospective studies are needed to confirm our findings. Our results would justify further promotion of cancer screening, early diagnosis and treatment in an efficient, effective manner for this province and other regions, particularly those that are underserved.

References

1Cancer Facts and Statistics|American Cancer Society. (2018).
Gaga, M., Sculier, J. P. & Rabe, K. F. Pulmonologists and lung cancer: pivotal role in multidisciplinary approach. Eur Respir J 42, 1183–1185. https://doi.org/10.1183/09031936.00145813 (2013).
Article PubMed Google Scholar
Chen, W. et al. Cancer statistics in China, 2015. CA Cancer J Clin 66, 115–132. https://doi.org/10.3322/caac.21338 (2016).
Article PubMed Google Scholar
Smith, C. J., Perfetti, T. A., Mullens, M. A., Rodgman, A. & Doolittle, D. J. “IARC group 2B Carcinogens” reported in cigarette mainstream smoke. Food Chem. Toxicol. 38, 825–848 (2000).
Article CAS PubMed Google Scholar
Smith, C. J., Perfetti, T. A., Rumple, M. A., Rodgman, A. & Doolittle, D. J. “IARC group 2A Carcinogens” reported in cigarette mainstream smoke. Food Chem. Toxicol. 38, 371–383 (2000).
Article CAS PubMed Google Scholar
Lin, K. F. et al. Propensity score analysis of lung cancer risk in a population with high prevalence of non-smoking related lung cancer. BMC Pulm. Med. 17, 120. https://doi.org/10.1186/s12890-017-0465-8 (2017).
Article PubMed PubMed Central Google Scholar
Chen, J.-G. et al. Cancer survival in Qidong between 1972 and 2011: a population-based analysis. Mol. Clin. Oncol. 6, 944–954. https://doi.org/10.3892/mco.2017.1234 (2017).
Article PubMed PubMed Central Google Scholar
Hu, W. et al. Personal and indoor PM2.5 exposure from burning solid fuels in vented and unvented stoves in a rural region of China with a high incidence of lung cancer. Environ. Sci. Technol. 48, 8456–8464 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Cao, Y. & Gao, H. Prevalence and causes of air pollution and lung cancer in Xuanwei City and Fuyuan County, Yunnan Province, China. Front. Med. 6, 217–220. https://doi.org/10.1007/s11684-012-0192-8 (2012).
Article PubMed Google Scholar
Lan, Q., Chapman, R. S., Schreinemachers, D. M., Tian, L. & He, X. Household stove improvement and risk of lung cancer in Xuanwei, China. J. Natl. Cancer Inst. 94, 826–835 (2002).
Article PubMed Google Scholar
Zhou, Q. et al. Demonstration program of population-based lung cancer screening in China: rationale and study design. Thorac. Cancer 5, 197–203. https://doi.org/10.1111/1759-7714.12078 (2014).
Article PubMed PubMed Central Google Scholar
Zhou, ,. QMS1602 NELCIN B3 screening program in China. J. Thorac. Oncol. 13, S272–S273. https://doi.org/10.1016/j.jtho.2018.08.154 (2018).
Article Google Scholar
Wu, F. Z. et al. Prognostic effect of implementation of the mass low-dose computed tomography lung cancer screening program: a hospital-based cohort study. Eur. J. Cancer Prev. 29, 445–451. https://doi.org/10.1097/CEJ.0000000000000569 (2020).
Article CAS PubMed Google Scholar
Wu, F. Z. et al. Assessment of selection criteria for low-dose lung screening CT among Asian ethnic groups in Taiwan: from mass screening to specific risk-based screening for non-smoker lung cancer. Clin. Lung Cancer 17, e45–e56. https://doi.org/10.1016/j.cllc.2016.03.004 (2016).
Article PubMed Google Scholar
Hsu, H. T. et al. Modified lung-RADS improves performance of screening LDCT in a population with high prevalence of non-smoking-related lung cancer. Acad. Radiol. 25, 1240–1251. https://doi.org/10.1016/j.acra.2018.01.012 (2018).
Article PubMed Google Scholar
Tang, E. K. et al. Natural history of persistent pulmonary subsolid nodules: long-term observation of different interval growth. Heart Lung Circ. 28, 1747–1754. https://doi.org/10.1016/j.hlc.2018.08.015 (2019).
Article PubMed Google Scholar
Lin, H. et al. Temporal trend of mortality from major cancers in Xuanwei, China. . Front Med. 9, 487–495. https://doi.org/10.1007/s11684-015-0413-z (2015).
Article PubMed Google Scholar
Li, J. et al. Five-year lung cancer mortality risk analysis and topography in Xuan Wei: a spatiotemporal correlation analysis. BMC Public Health 19, 173. https://doi.org/10.1186/s12889-019-6490-1 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hemminki, K. et al. Surveillance bias in cancer risk after unrelated medical conditions: example urolithiasis. Sci. Rep. 7, 8073. https://doi.org/10.1038/s41598-017-08839-5 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Ge, Z., Heitjan, D. F., Gerber, D. E., Xuan, L. & Pruitt, S. L. Estimating lead-time bias in lung cancer diagnosis of patients with previous cancers. Stat. Med. 37, 2516–2529. https://doi.org/10.1002/sim.7691 (2018).
Article MathSciNet PubMed PubMed Central Google Scholar
Yunnan Cancer Hospital-About Us. https://www.ynszlyy.com/ZLYYEN/Subject/AboutUs/Article/39a19180-05ab-432d-ab9e-04da9e929530.htm (2018).
Pan, W. H. & Yeh, W. T. How to define obesity? Evidence-based multiple action points for public awareness, screening, and treatment: an extension of Asian-Pacific recommendations. Asia Pac. J. Clin. Nutr. 17, 370–374 (2008).
PubMed Google Scholar
Duffy, S. W. et al. Correcting for lead time and length bias in estimating the effect of screen detection on cancer survival. Am. J. Epidemiol. 168, 98–104. https://doi.org/10.1093/aje/kwn120 (2008).
Article PubMed Google Scholar
Day, N. E. & Walter, S. D. Simplified models of screening for chronic disease: estimation procedures from mass screening programmes. Biometrics 40, 1–14 (1984).
Article CAS PubMed Google Scholar
Mogensen, U. B., Ishwaran, H. & Gerds, T. A. Evaluating random forests for survival analysis using prediction error curves. J. Stat. Softw. 50, 1–23 (2012).
Article PubMed PubMed Central Google Scholar
Ishwaran, H., Kogalur, U. B., Blackstone, E. H. & Lauer, M. S. Random survival forests. . Ann. Appl. Stat. 2008, 841–860 (2008).
Article MathSciNet Google Scholar
Wu, D., Erwin, D. & Rosner, G. L. Sojourn time and lead time projection in lung cancer screening. Lung Cancer 72, 322–326. https://doi.org/10.1016/j.lungcan.2010.10.010 (2011).
Article PubMed Google Scholar
VoPham, T. et al. Environmental radon exposure and breast cancer risk in the Nurses’ Health Study II. Environ. Health 16, 97. https://doi.org/10.1186/s12940-017-0305-6 (2017).
Article CAS PubMed PubMed Central Google Scholar
Raaschou-Nielsen, O. et al. Air pollution and lung cancer incidence in 17 European cohorts: prospective analyses from the European Study of Cohorts for Air Pollution Effects (ESCAPE). Lancet Oncol. 14, 813–822. https://doi.org/10.1016/s1470-2045(13)70279-1 (2013).
Article PubMed Google Scholar
Jemal, A. et al. Global cancer statistics. CA Cancer J. Clin. 61, 69–90. https://doi.org/10.3322/caac.20107 (2011).
Article PubMed Google Scholar
Thun, M. J. et al. Lung cancer occurrence in never-smokers: an analysis of 13 cohorts and 22 cancer registry studies. PLoS Med. 5, e185. https://doi.org/10.1371/journal.pmed.0050185 (2008).
Article PubMed PubMed Central Google Scholar
Siegel, R., Ma, J., Zou, Z. & Jemal, A. Cancer statistics 2014. CA Cancer J. Clin. 64, 9–29 (2014).
Article PubMed Google Scholar
Kris, M. G. et al. Clinical cancer advances 2010: annual report on progress against cancer from the American Society of Clinical Oncology. J. Clin. Oncol. 28, 5327–5347 (2010).
Article PubMed Google Scholar
National Lung Screening Trial Research, T. et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N. Engl. J. Med. 365, 395–409 (2011).
Article Google Scholar
Aberle, D. R. et al. Results of the two incidence screenings in the National Lung Screening Trial. N. Engl. J. Med. 369, 920–931. https://doi.org/10.1056/NEJMoa1208962 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, F. Z. et al. Differences in lung cancer characteristics and mortality rate between screened and non-screened cohorts. Sci. Rep. 9, 19386. https://doi.org/10.1038/s41598-019-56025-6 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, F. Z., Kuo, P. L., Wu, C. C. & Wu, M. T. The impact of patients’ preferences on the decision of low-dose computed tomography lung cancer screening. Transl Lung Cancer Res. 7, S236–S238. https://doi.org/10.21037/tlcr.2018.08.17 (2018).
Article PubMed PubMed Central Google Scholar
Wang, Y. & Zhao, G. (Lung cancer disease cohort study - high incidence population screening. Grant # 2017YFC0907902 to Yunnan Cancer Hospital by Ministry of Science and Technology of the People's Republic of China, National Key Research and Development Program for Precision Medicine Research. , Xuanwei, Qujing, and Guizhou., 2017).
Chen, W. (Central Governmental Subsidy Project for Early Diagnosis and Early Treatment of Lung Cancer. Grant # 20100409. Yunnan Center of Disease Control, Xuanwei., 2010).
40Bai, X. & Zhao, Y. Cancer Prevention and Cancer Treatment. https://www.ynszlyy.com/Subject/XWZX_KSDT/Article/e08eee45-b7f3-45fe-9038-e7f4109cdf91.htm (2017).
41QujingTraditionalMedicineHospital. National Cancer Prevention Week. https://m.sohu.com/a/389902447_100196158/?pvid=000115_3w_a (2020).
QujingTraditionalMedicineHospital. Qujing City's 2019 Healthy China Tour into Xuanwei theme publicity activities and public welfare project activities for early screening, diagnosis and treatment of lung cancer-Qujing Lung Cancer Center of Traditional Chinese and Western Medicine "Lung Cancer Early Screening and Early Diagnosis" project. https://www.qjszyy.com/plus/view.php?aid=3838 (2019).
Zhou, Q. H. et al. China national lung cancer screening guideline with low-dose computed tomography (2015 version). Thorac Cancer 6, 812–818. https://doi.org/10.1111/1759-7714.12287 (2015).
Article PubMed PubMed Central Google Scholar
Zhou, Q. et al. China national lung cancer screening guideline with low-dose computed tomography (2018 version). Zhongguo Fei Ai Za Zhi 21, 67–75. https://doi.org/10.3779/j.issn.1009-3419.2018.02.01 (2018).
Article PubMed Google Scholar
Klein, J. P. & Moeschberger, M. L. Survival analysis: techniques for censored and truncated data (Springer, Berlin, 2006).
MATH Google Scholar
Gohari, M. R., Mahmoudi, M., Mohammed, K., Pasha, E. & Khodabakhshi, R. Recurrence in breast cancer. Analysis with frailty model. Saudi Med. J. 27, 1187–1193 (2006).
PubMed Google Scholar
Jack, R. H., Davies, E. A. & Moller, H. Lung cancer incidence and survival in different ethnic groups in South East England. Br. J. Cancer 105, 1049–1053. https://doi.org/10.1038/bjc.2011.282 (2011).
Article CAS PubMed PubMed Central Google Scholar
Khakwani, A. et al. The impact of the “hub and spoke” model of care for lung cancer and equitable access to surgery. Thorax 70, 146–151. https://doi.org/10.1136/thoraxjnl-2014-205841 (2015).
Article PubMed Google Scholar
Luchtenborg, M. et al. High procedure volume is strongly associated with improved survival after lung cancer surgery. J. Clin. Oncol. 31, 3141–3146. https://doi.org/10.1200/JCO.2013.49.0219 (2013).
Article PubMed Google Scholar
50QujingHealth. [Disease Prevention] Xuanwei City implements comprehensive prevention and control measures to curb the high incidence of lung cancer. https://m.sohu.com/a/290955328_100196175/?pvid=000115_3w_a (2019).
Lin, Y., Ma, J., Feng, J., Zhang, Q. & Huang, Y. Results of lung cancer screening among urban residents in Kunming. Zhongguo Fei Ai Za Zhi 22, 413–418. https://doi.org/10.3779/j.issn.1009-3419.2019.07.02 (2019).
Article PubMed Google Scholar
Wei, M. N. et al. Performance of lung cancer screening with low-dose CT in Gejiu, Yunnan: a population-based, screening cohort study. Thorac Cancer 11, 1224–1232. https://doi.org/10.1111/1759-7714.13379 (2020).
Article PubMed PubMed Central Google Scholar

Download references

Funding

There was no specific funding for this work. Dr. Yang was supported by the National Natural Science Foundation of China [81860488 and 81560432 to Dr. Runxiang Yang].

Author information

Authors and Affiliations

The Second Department of Medical Oncology, Yunnan Cancer Hospital, Kunming, Yunnan, The People’s Republic of China
Runxiang Yang, Ming He, Dongmei Wang, Rongrong Ye & Rouyu Deng
Department of Oncology, Sun Yat-Sen University Cancer Center, Guangzhou, Guangdong, The People’s Republic of China
Lu Li
Center for Clinical Epidemiology and Biostatistics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Mohsin Shah
Department of Emergency Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Sai-Ching Jim Yeung

Authors

Runxiang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ming He
View author publications
You can also search for this author in PubMed Google Scholar
Dongmei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rongrong Ye
View author publications
You can also search for this author in PubMed Google Scholar
Lu Li
View author publications
You can also search for this author in PubMed Google Scholar
Rouyu Deng
View author publications
You can also search for this author in PubMed Google Scholar
Mohsin Shah
View author publications
You can also search for this author in PubMed Google Scholar
Sai-Ching Jim Yeung
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RY, and SJY were involved in the study concept and design. RY, MH, DW, RY, RD participated in the acquisition of data. RY, MH, DW, RY, LL, RD participated in the analysis and interpretation of data. RY, MH, DW, RY, RD, LL, MS and SJY participated in the drafting of the manuscript. RY, LL, SJY and MS participated in a critical revision of the manuscript for important intellectual content. RY and SJY participated in statistical expertise. SJY prepared all the figures. SJY and RY participated in study supervision. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to Runxiang Yang.

Ethics declarations

Competing interests

No potential conflicts of interest directly relevant to this work to declare. Dr. Yeung had received funding from Bristol-Myer Squibb and DepMed, Inc. for investigator-initiated studies. Dr. Yeung was a member of an expert panel for Celgene Corp.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file1

Supplementary file2

Supplementary file3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, R., He, M., Wang, D. et al. Association of cancer screening and residing in a coal-polluted East Asian region with overall survival of lung cancer patients: a retrospective cohort study. Sci Rep 10, 17432 (2020). https://doi.org/10.1038/s41598-020-74082-0

Download citation

Received: 12 April 2020
Accepted: 23 September 2020
Published: 15 October 2020
DOI: https://doi.org/10.1038/s41598-020-74082-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Incidence trends and spatial distributions of lung adenocarcinoma and squamous cell carcinoma in Taiwan

Occupation as a risk factor of small cell lung cancer

Trends in lung cancer incidence by age, sex and histology from 2012 to 2025 in Catalonia (Spain)

Introduction

Material and methods

Participants, study and design

Data variables

Statistical analysis

Ethical approval

Results

Demographic and Clinicopathological Characteristics

Survival analyses

Examination of screening and XF residential status as predictive factors of OS.

Cox proportional hazard regression analysis

Evaluation of potential bias introduced by lung cancer screening

Evaluation of a latent common group effect by shared frailty modeling

Discussion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary file1

Supplementary file2

Supplementary file3

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links