Introduction

Lung cancer is a significant global health burden with an estimated 2.2 million new cases and 1.8 million deaths worldwide in 2020 [1]. While active smoking is the most prominent risk factor for lung cancer, nearly 25% of lung cancer cases diagnosed worldwide are among people who never smoke, with the proportion varying geographically [2]. It is estimated that 10% of lung cancer cases in the United States (US) and as high as 50% in Asia are diagnosed among patients who never smoke, with higher proportions among female populations [3]. Among lung cancer patients who never smoke, lung adenocarcinoma (LUAD), which originates in the mucous-producing epithelial lining of the lung, is the most commonly diagnosed histological subtype [2, 4], with a higher proportion among females compared to males [4, 5].

Lung cancer is a highly fatal disease, with an overall 5-year survival rate of 22.9% in the US from 2012 to 2018, which ranges from 61.2% for early (localized) stage to as low as 7% for distant stage (metastasized) [6]. Age at diagnosis is an important prognostic factor that influences clinical treatment decisions and survival of lung cancer patients [7,8,9,10,11], as those with younger age at diagnosis often present with earlier tumor stages and benefit from early targeted treatment [12, 13]. According to the Surveillance, Epidemiology, and End Results Program (SEER), the median age at lung cancer diagnosis in the US is 71 years. However, evidence from cancer registries, patient medical records, and case-series analyses suggest differences in age at lung cancer diagnosis between subpopulations defined by smoking status, race and ethnicity, and geographic location [14,15,16,17,18,19,20,21,22,23,24]. Notably, some studies have reported differences with females more often diagnosed [25, 26] at younger ages [27,28,29], with LUAD [28, 30, 31], and with earlier tumor stage at diagnosis, compared to males [25, 26]. However, the existence of sex differences in age at lung cancer diagnosis among individuals who never smoke, which may contribute to lung cancer-related disparities between females and males [25, 26, 28,29,30,31], are unclear.

Given the importance of age at diagnosis for disease prognosis, clinical treatment decisions, and survival of lung cancer patients [6,7,8,9,10], we conducted an international study to characterize age at diagnosis for lung cancer patients who never smoke, as well as examined sex differences by histological subtypes (i.e., LUAD and squamous cell carcinoma [SCC]) and race and ethnicity.

Methods

Study Design

The lung cancer case ascertainment procedures and criteria, as well as histological subtype confirmation for each participating study have been described in detail elsewhere [24, 32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51]. We received data on age at lung cancer diagnosis among patients who never smoke by self-reported sex, race and ethnicity, as well as histology and SEER tumor stage (when available) from 8 cohort studies and two population-based cancer registries. Individual-level data were provided for 1,571 lung cancer patients who never smoke from the Shanghai Women’s Health Study (SWHS; n = 760) [52], the Shanghai Men’s Health Study (SMHS; n = 289) [40], the Prostate, Lung, Colorectal, and Ovarian Cancer (PLCO) Screening Trial (n = 39) [41], and the UK Biobank (n = 483) [42, 43]. Summary statistics (i.e., mean, standard deviation (SD), min, max, median, 25th and 75th percentile) were provided for 32,222 lung cancer cases who never smoke from the Taiwan Cancer Registry (TCR) [32, 33] population study (n = 29,832), the Multiethnic Cohort Study (MEC; n = 578) [34], the Cancer Prevention Study-II Nutrition Cohort and Cancer Prevention Study-3 (CPSII, n = 365; CPS-3, n = 64) [35, 36], Sutter Health of Northern California and Kaiser Permanente Hawaiʽi (STKP; n = 515) [24, 37], the Southern Community Cohort Study (SCCS; n = 138) [38, 39], and the National Cancer Center Japan (NCC; n = 730) [53]. In total, we received data for 33,793 lung cancer cases.

Each participating study obtained its own institutional review board approval. Sutter Health of Northern California and Kaiser Permanente Hawaiʽi received waiver of consent, and consent for participants in MEC was received with receipt of baseline questionnaire. All remaining participants provided written or electronic informed consent.

Statistical analyses

In each of the participating studies, we characterized the distribution of age at diagnosis of lung cancer and its histological subtypes (i.e., LUAD and SCC) among individuals who never smoke by self-reported biological sex (i.e., male, female), race and ethnicity (i.e., Chinese and Japanese individuals living in Asia, Chinese and Japanese individuals living in the US, non-Hispanic White and African American individuals living in the US, and White individuals living in the UK), and SEER tumor stage (i.e., local, regional, distant) for studies with available data. Within each participating study, we estimated differences in average age at diagnosis of overall lung cancer, LUAD, and SCC between females and males who never smoke and tested the differences using Student’s t-tests. The differences in average age at diagnosis (Δ years) between females and males were then meta-analyzed across populations of similar racial and ethnic subgroup using random effects inverse variance method. All analyses were performed using the R statistical software (version 4.2.2).

Results

Information on each participating study, including study design, years of enrollment, geographical region, type of data provided, sample size and criteria for age at enrollment is shown in Table 1. Study specific and combined estimates for the difference in age at diagnosis for lung cancer comparing East Asian females and males who never smoke living in Taiwan, China, and US, overall and by histological subtype are presented in Table 2. We observed that among individuals who never smoke in TCR, a population-based registry and the largest study in our analyses, and in the SWHS and SMHS, large population-based prospective cohort studies of Chinese individuals, females were diagnosed younger than males by 2.2 years (Δ years = −2.2 (95% confidence interval (CI): −2.5, −1.88; Pdifference = 2.19 × 1040) and 1.6 years (Δ years = −1.6 (95% CI: −2.9, −0.3; Pdifference = 0.02), respectively. We observed similar patterns in the Chinese populations living in East Asia for LUAD and SCC. Further, we found that Chinese females who never smoke living in the US from STKP were also diagnosed with overall lung cancer younger than males (Δ years = −11.3 (95% CI: −17.7, −4.9; Pdifference 5.38 x 10−4). Conversely, we observed that among Japanese individuals who never smoke living in Japan from the NCC, the age at LUAD diagnosis was older for females compared to males (Δ years = 3.3 (95% CI: 1.0, 5.6; Pdifference = 5.00 × 10-3). This pattern was further supported in Japanese individuals who never smoke living in the US from MEC, where females had a suggestive older age at LUAD diagnosis compared to males (Δ years = 2.9 (95% CI: −0.2, 6.0; Pdifference = 0.07) (Supplementary Table 1).

Table 1 Characteristics of participating studies
Table 2 Differences in age at lung cancer diagnosis among Chinese females and males who never smoke

Study specific and combined estimates for the difference in age at diagnosis for lung cancer comparing non-Hispanic White females and males living in the US and White females and males living in the UK overall and by histological subtype are presented in Table 3. Similar to the Chinese population, we observed that non-Hispanic White females living in the US from CPSII were diagnosed with overall lung cancer younger than males (Δ years = −2.0 (95% CI: −3.7, −0.3; Pdifference = 0.02). While less pronounced, the meta-analysis across the 7 studies with non-Hispanic White and White individuals who never smoke living in the US and UK suggested that females had younger age at diagnosis compared to males for overall lung cancer (Δ yearscombined = −0.8 (95% CI: −1.9, 0.4; Pmeta-analysis = 0.18) and LUAD (Δ yearscombined = −1.1 (95% CI: −2.4, 0.1; Pmeta-analysis = 0.08); however, the findings were not consistent across studies and were not statistically significant. We further found that among African American individuals who never smoke in the two studies with available data (SCCS and MEC), females had younger age at lung cancer diagnosis compared to males (Δ yearscombined = −1.0 (95% CI: −4.0, 1.9; Pmeta-analysis = 0.49); however, the sample size was limited and the findings were not statistically significant (Supplementary Table 2).

Table 3 Differences in age at lung cancer diagnosis among non-Hispanic White and White European females and males who never smoke

In additional analyses examining differences in age at lung cancer diagnosis across tumor stages (local, regional, distant) for subjects in TCR, we found among individuals who never smoke, females were diagnosed with lung cancer younger than males for local (Δ years = −1.8 (95% CI: −2.4, −1.2; Pdifference = 1.68 × 108), regional (Δ years = −3.1 (95% CI: −4.1, −2.2; Pdifference = 2.18 × 1010), and distant tumor stage (Δ years = −1.9 (95% CI: −2.31, −1.49; Pdifference = 1.14 × 10-20). Similar patterns were found for LUAD and SCC in analyses stratified by tumor stage (Supplementary Table 3).

Discussion

We conducted a large-scale study to characterize age at diagnosis for lung cancer and its histological subtypes, comparing females versus males by race and ethnicity. We analyzed summary and individual-level data from 33,793 lung cancer cases who never smoke in 10 studies from Taiwan, China, Japan, US, and UK. Overall, we found that among Chinese individuals living in East Asia and in the US, females were diagnosed with lung cancer younger than males. While there was a suggestion of similar patterns in African American and non-Hispanic White individuals living in the US and White individuals living in the UK, the estimated differences were not consistent across studies and were not statistically significant. These findings were consistent across histological subtypes, as well as tumor stages. An exception to this pattern was found among Japanese individuals who never smoke living in Japan and in the US, where females were diagnosed with lung cancer at a later age compared to males.

The demographic makeup of lung cancer has substantially shifted over the last few decades, with the proportion of never-smoking lung cancer increasing due to the decline in smoking prevalence, and sex differences in the epidemiology, pathogenesis and outcomes of the disease have become apparent [29, 54]. Factors that may contribute to sex differences in the characteristics of lung cancer, such as age at diagnosis, may include screening, as well as endogenous and exogenous exposures. A study in the US on gender differences in cancer screening behaviors have showed that women are more likely than men to have more frequent contact with health care providers and seek cancer screening [55], which may on average lead to an earlier diagnosis. However, given that our study focused on individuals who never smoke, a population for which there are no established lung cancer screening guidelines, we suspect that screening behavior had limited impact on the observed differences. Further, we found that females had consistent younger age at lung cancer diagnoses across different tumor stages, suggesting that additional factors may impact the observed sex differences.

Endogenous factors that potentially contribute to sex differences in age at diagnosis among lung cancer in patients who never smoke include the distribution of lung cancer histological subtypes, tumor mutations and genetic susceptibility [44, 45, 56,57,58,59]. For example, females are at an increased risk compared to males of developing lung cancer with a driver mutation, such as the epidermal growth factor receptor (EGFR) [60,61,62]. However, studies have found that lung cancer patients with an EGFR mutation are more likely to have an older age at diagnosis [63], and therefore may not be the reason for our findings. Additionally, estrogen may also lead to sex differences in lung cancer. While research on this topic is ongoing, estrogen receptors have been shown to be overexpressed in many lung cancers [61, 64] and studies have linked the use of hormone replacement therapy [65] and reproductive factors to the risk of lung cancer in females [66,67,68].

Exogenous factors, such as environmental and occupational exposure profiles may also impact sex differences in lung cancer diagnosis. For example, lung cancer risk factors among East Asian individuals who never smoke include indoor solid fuel combustion, fumes from cooking oil, outdoor air pollution, diesel exhaust, workplace exposures, and secondhand smoke [2, 4, 69,70,71,72,73,74]. Notably, East Asian women are more likely to be exposed to indoor air pollution from fuel and cooking oil early in life [75], as well as to spousal secondhand smoke. Conversely, male individuals in East Asia are more likely to be exposed to occupational sources of air pollution, such as secondhand smoke in the work environment. Despite potential differences between East Asian individuals living in Asia and the US, we found similar patterns of age at diagnoses, which could suggest similar exposures or genetic predisposition.

Our study had notable strengths. We compiled a large pool of ethnically and geographically diverse data of lung cancer cases who never smoke, which allowed for comparison of age at lung cancer diagnosis by sex, within subpopulations defined by race and ethnicity, histological subtype, and tumor stage. Importantly, we obtained data from the TCR, one of the largest sources of lung cancer data for patients who never smoke in the world, which included information on tumor stage. Further, we obtained data from the CPSII, MEC and Sutter Health / Kaiser Permanente- Hawaiʽi, which greatly improved the representation in our study of Chinese and Japanese individuals living in the US.

Our study had some limitations and interpretation of our results should be done with caution. This study was intended primarily as a descriptive analysis as we could not account or adjust for potential sources of confounding or heterogeneity that potentially drive differences in age at diagnosis across the participating studies and their subgroups. Given the heterogeneity among the participating studies, we opted not to calculate a combined weighted average age at diagnosis across studies, as this would not be meaningful. Further, our analyses were primarily conducted in cohort studies, which do not represent the general population. Despite these caveats, a consistently younger age at diagnosis among females versus males was observed across multiple studies, including a large population-based registry, as well as across histological subtypes and tumor stage. Additionally, although some data on African American individuals were obtained from SCCS, MEC, and CPSII, we did not have sufficient case numbers to comprehensively analyze this population and its subgroups. Future studies that include larger African American, as well as Hispanic populations will be needed to accurately characterize differences in age at diagnosis.

In summary, we characterized the average age at diagnosis for lung cancer and its subtypes among patients who never smoke in a comprehensive analysis of 33,793 lung cancer patients from the US, China, Taiwan, Japan and the UK. We found that, among individuals who never smoke, females had a younger age at diagnosis compared to males, which was especially apparent among Chinese individuals living in Taiwan and Mainland China, as well as among Chinese individuals living in the US. Future studies are needed to elucidate the potential reasons for these observed differences.