Abstract
The genetic architecture of non-small cell lung cancer (NSCLC) is relevant to smoking status. However, the genetic contribution of long-term smoking cessation to the prognosis of NSCLC patients remains largely unknown. We conducted a genome-wide association study primarily on the prognosis of 1299 NSCLC patients of long-term former smokers from independent discovery (n = 566) and validation (n = 733) sets, and used in-silico function prediction and multi-omics analysis to identify single nucleotide polymorphisms (SNPs) on prognostics with NSCLC. We further detected SNPs with at least moderate association strength on survival within each group of never, short-term former, long-term former, and current smokers, and compared their genetic similarity at the SNP, gene, expression quantitative trait loci (eQTL), enhancer, and pathway levels. We identified two SNPs, rs34211819TNS3 at 7p12.3 (P = 3.90 × 10−9) and rs1143149SEPT7 at 7p14.2 (P = 9.75 × 10−9), were significantly associated with survival of NSCLC patients who were long-term former smokers. Both SNPs had significant interaction effects with years of smoking cessation (rs34211819TNS3: Pinteraction = 8.0 × 10−4; rs1143149SEPT7: Pinteraction = 0.003). In addition, in silico function prediction and multi-omics analysis provided evidence that these QTLs were associated with survival. Moreover, comparison analysis found higher genetic similarity between long-term former smokers and never-smokers, compared to short-term former smokers or current smokers. Pathway enrichment analysis indicated a unique pattern among long-term former smokers that was related to immune pathways. This study provides important insights into the genetic architecture associated with long-term former smoking NSCLC.
Similar content being viewed by others
Introduction
Non-small cell lung cancer (NSCLC) accounts for more than 85% of lung cancer cases and is the most commonly diagnosed malignant disease1. Tobacco smoking is a well-known environmental exposure leading to lung cancer2,3 and has been found to be linked to the majority of NSCLC deaths4. Emerging evidence shows that NSCLC in smokers and never smokers are different and separate entities5,6,7.
It is, however, less well known that former smokers who quitted smoking long time before (e.g., >10 years), termed long-term former smokers, may also develop lung cancer8, and NSCLC patients who are long-term former smokers still harbor a high mortality risk9. So far, few studies have focused on the impact of genetic architecture on the prognosis of such a special group of patients, whose molecular mechanism for cancer death still remains unclear.
Leveraging the well-established International Lung Cancer Consortium (ILCCO) and Harvard Lung Cancer Study (HLCS), we conducted a GWAS by focusing on NSCLC patients who were long-term former smokers and identified germline genetic variants associated with overall survival time. We detected a set of SNPs with at least moderate association signals with survival from this subgroup as well as from other smoking subgroups, namely, never smokers, short-term former smokers, and current smokers. Comparing these SNPs, we investigated the genetic similarity across these smoking subgroups at the SNP, gene, expression quantitative trait loci (eQTL), enhancer, and pathway levels.
Results
Genetic variants are associated with survival
The genome control inflation factor (λ) for the overall population under the additive model was estimated to be 1.079 (Supplementary Fig. 1), which was comparable to a previous genome-wide survival study10, indicating that the confounding effect caused by population stratification was well controlled. Two SNPs (rs34211819 at chromosome 7p12.3, and rs1143149 at 7p14.2) reached genome-wide significance among NSCLC patients who were long-term former smokers (Fig. 1a). rs34211819 was in an intron region of tensin-3 (TNS3), with an MAF of 0.35; rs1143149 was an intron variant of septin 7 (SEPT7), with an MAF of 0.34. The regional plots (Fig. 1b, c) present a cluster of significant prognostic SNPs that were moderately or highly correlated with rs34211819 and rs1143149 (Supplementary Table 1). Although the chromosome locations of the two SNPs were close, they had a very low correlation (D’ = 0.017, r2 = 2.27 × 10−5). The C allele of rs34211819 was significantly associated with better survival (HRdiscovery = 0.66, 95% CI: 0.56-0.78, P = 2.01 × 10−7; HRvalidation = 0.84, 95% CI: 0.73–0.96, P = 1.36 × 10−2; HRcombined = 0.73, 95% CI: 0.66–0.81, P = 3.90 × 10−9) (Fig. 1d). In contrast, the C allele of rs1143149 was associated with worse survival (HRdiscovery = 1.42, 95% CI: 1.16–1.60, P = 2.04 × 10−6; HRvalidation = 1.33, 95% CI: 1.15–1.53, P = 7.88 × 10−5; HRcombined = 1.36, 95% CI: 1.22–1.51, P = 9.75 × 10−9) (Fig. 1e). The associations remained significant if SNPs were coded in co-dominant, dominant, and recessive models (Supplementary Table 2), indicating the robustness of our results toward coding. The same conclusion held with analyses of subgroups, defined by age, gender, histology, and clinical stage (Fig. 2a), as both SNPs were significantly associated with survival in almost all subgroups, except for rs1143149 among the early-stage patients.
Genetic variants interaction with years of smoking cessation
We performed a stratified analysis by years of smoking cessation for both long-term and short-term former smokers to evaluate the modifying effect of years of smoking cessation. As years of smoking cessation increased, the protective effect of rs34211819 and the detrimental effect of rs1143149 on survival were both elevated (Fig. 2b, c). These effects of two SNPs were only significant in long-term former smokers, and not in short-term former smokers, indicating significant heterogeneity between these two groups (rs34211819: Pheterogeneity = 0.042; rs1143149: Pheterogeneity = 0.034). The trend test detected significant trends for both SNPs across different subgroups (rs34211819: Ptrend = 0.003; rs1143149: Ptrend = 0.045). Further, we detected significant interaction effects between SNPs and years of smoking cessation, a type of gene–environment interactions (rs34211819 × years: Pinteraction = 8.0 × 10−4; rs1143149 × years: Pinteraction = 0.003) (Fig. 2d, e).
Evidence of association with lung cancer survival from multi-omics studies
In meta-analysis of eQTL effects from GTEx and TCGA, the two SNPs had a significant cis-eQTL relationship with gene expression (TNS3: β = −0.09, 95% CI: −0.16 to −0.02, P = 0.009; SEPT7: β = 0.04, 95% CI: 0.01 to 0.08, P = 0.033) (Fig. 3b). In gene expression survival analysis of long-term former smokers, higher expression of TNS3 (HR = 1.84, 95% CI: 1.01–3.34, P = 0.045) and SEPT7 (HR = 1.67, 95% CI: 1.02–3.21, P = 0.023) were significantly associated with worse survival in the TCGA database (Fig. 3c).
In DNA methylation analysis, we extracted 94 CpG probes that were located within TNS3 and SEPT7. Two CpG sites, cg22455271 and cg07462932, had significant methylation QTL (meQTL) effects with rs1143149 (β = -0.08, 95% CI: −0.14 to −0.01, P = 0.009, q-FDR = 0.034) and rs34211819 (β = −0.17, 95% CI: −0.32 to −0.02, P = 0.002, q-FDR = 0.039), respectively (Fig. 3d). They were significantly associated with survival in long-term former smokers (HRcg22455271 = 2.40, 95% CI: 1.47–3.93, P = 2.5 × 10−4, q-FDR = 0.001; HRcg07462932 = 0.47, 95% CI: 0.30–0.74, P = 7.4 × 10−4, q-FDR = 0.018) (Fig. 3e and Supplementary Table 3).
We predicted the functional relevance by SNPinfo, RegulomeDB, and HaploReg v4.1 (Supplementary Table 4). rs34211819 in TNS3 had a high score of protein binding and could bind two proteins: B cell lymphoma 3 (BCL3) and octamer transcription factor 2 (OCT2). In the CPTAC proteomics project, the two bound proteins BCL3 (fold change = 1.69, P = 2.16 × 10−5) and OCT2 (fold change = 2.21, P = 3.13 × 10−13) were significantly upregulated in the lung cancer tumor tissues compared to the adjacent normal tissues (Fig. 3f).
Genetic similarity across patients with different smoking statuses
We first performed genetic similarity comparisons at the SNP level by extracting moderate-to-high signals from the results of survival analysis within different smoking subgroups. A total of 7789 independent SNPs was observed in the long-term former smokers, which were comparable to that of never-smokers (n = 7358) but 31.4% and 85.5% more than that of short-term former smokers (n = 5343) and current smokers (n = 4198), respectively. No SNPs were shared across the four smoking subgroups. Long-term former smokers only had 23, 20, and 11 overlapping SNPs with never smokers, short-term former smokers, and current smokers, respectively (Fig. 4a), indicating that these significant prognostic SNPs for NSCLC patients who were long-term former smokers seemed to differ from those for the other smoking groups.
For gene-level comparisons, these identified SNPs were assembled to genes within each subgroup (Fig. 4b). We identified 3,837 genes in long-term former smokers, and 1006 genes of them were shared with never-smokers, 22.1% more than those shared with short-term former smokers (824 genes) and 45.8% more than with current smokers (690 genes). A total of 187 genes were commonly shared by all the subgroups (Supplementary Table 5). Protein-coding genes showed these same patterns of similarity (Fig. 4c).
We further investigated the eQTL-related genes based on the GTEx lung tissue database (Fig. 4d) and enhancers from the FANTOM5 database (Fig. 4e). For long-term former smokers, the same trend was observed with germline-regulated genes, which may indicate the higher similarity with never-smoking subgroup than others. Only two eQTL-related genes were shared among all subgroups: ARHGAP15 and TSPAN9. We also performed sensitivity analysis under different thresholds (P < 10−4 and P < 10−5) and obtained the similar results (Supplementary Table 6).
Unique and shared pathways in long-term former smokers
We explored Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways in the gene set enrichment analysis of germline-regulated genes. A total of 53 pathways were significant in long-term former smokers (Fig. 4f). A total of 18 pathways were shared across all the subgroups, including the well-known signaling pathways such as mitogen-activated protein kinase, oxytocin, Ras, Rap1, and calcium pathways (Fig. 4g). However, 11 pathways were only significant in long-term former smokers, most of which were linked to immune function such as the B cell receptor signaling pathway, human T cell leukemia virus 1 infection, human immunodeficiency virus 1 infection, and T helper type 1 and T helper type 2 cell differentiation (Fig. 4h).
Discussion
Understanding genetic risk factors in cancer is important for uncovering its underlying biological mechanisms11,12. Our genome-wide investigation among long-term former smokers with NSCLC detected rs34211819 in TNS3 and rs1143149 in SEPT7 which were associated with survival. Significant interactions revealed that the effects of SNPs could be modified by years of smoking cessation, perhaps indicating opportunities for clinical adjuvant therapy with immunotherapies in patients with risk alleles, while an immune function relationship was found for cis-regulated genes or SNP-binding proteins. Multi-omics analyses provided evidence of their eQTL/meQTL relationships, survival-related methylation, and expression patterns and upregulated proteins in NSCLC. Furthermore, we observed higher similarities between long-term former smokers and never-smokers, compared to short-term former smokers or current smokers. The distinct SNP patterns in long-term former smokers were linked to immune signals.
Tobacco smoking is associated with worse outcomes in lung cancer, as it leads to downregulation of proinflammatory cytokines, immunosuppression, and anti-inflammatory effects mediated by oxidants, carbon monoxide, nicotine, and transcriptional modifying compounds, especially in lung tissues13,14. Chronic inhalation of cigarette smoke affects a wide range of immunological functions, including innate and adaptive immune responses15. As a result, the immune system may be suppressed as long as individuals are exposed to smoking regardless of years of quitting or smoking, as evidenced by the same signaling pathways shared by all the smoking subgroups in our study. Tobacco smoking can affect the immune system by chemically modifying signaling pathways as well as the extracellular matrix through acetylation, nitrosylation, carbonylation, and oxidation, thereby affecting cell survival, activation, and differentiation16.
The identified genes were associated with NSCLC survival at the genomic, epigenomic, and transcriptomic levels and were related to immunological functions. SEPT7 is a member of the septin family of GTP-binding proteins, which form higher-order filamentous structures and function primarily in spatial organization and compartmentalization of many cellular processes. SEPT7 is structurally related to RAS oncogenes, which promote tumorigenesis17. SEPT7 is also implicated in several types of cancer18,19,20. While the septin family plays a critical role in cytokinesis21, SEPT7 is also involved in the cytoskeleton and participates in regulation of cytokinesis22,23. Septin-deficient T cells fail to complete cytokinesis when prompted by pharmacological activation or cytokines. Meanwhile, SEPT7-deficient fibroblasts display incomplete cytokinesis and constitutive multinucleation by affecting the mitotic spindle and midbody rather than the contractile ring24. As a result, SEPT7 plays a crucial role in immune functions including cytokinesis and mitosis, which are closely related to molecular changes from smoking exposure.
Another locus at 7p12 is marked by an intronic SNP in TNS3, which encodes tensin-3, a member of a family of focal adhesion-associated proteins that regulate cell adhesion and migration25. This gene may be an activator of cell migration and a promoter of invasion in tumor metastasis26,27. Here, we found that TNS3 acts as an oncogene, affecting regulation of methylation in lung cancer. A TNS3 methylation pattern in the promoter region can silence expression in renal cell carcinoma28. Further, the binding proteins BCL3 on 19q13.32 and OCT2 on 19q13.2 of rs34211819 are strongly linked to immune function. BCL3 translocates to the immunoglobulin alpha-locus in B cell chronic lymphocytic leukemia29. As an oncogene, it is an atypical member of the inhibitor of nuclear factor kappa B (NF-κB) family of proteins that can activate the NF-κB signaling cascade by directly binding to the transcription factors NFKB1 and NFKB230. It is unregulated by cytokines such as tumor necrosis factor alpha, interleukin 4 (IL-4), IL-1, and IL-631. OCT2 acts as a DNA-binding transcriptional activator of immunoglobulin in B-lineage cells32. It enables B cells to respond normally to antigen receptor signals and mediate the physical interaction with T cells or to produce and respond to cytokines that are critical drivers of B cell and T cell differentiation during the immune response31,33.
Genetic similarity analysis of subgroups showed lower overlap at the SNP level but relatively higher overlap of germline-regulated genes. It is possible that each SNP is unique to each subgroup and acts as an eQTL to regulate cancer-specific genes. Additionally, the effects of somatic mutations driving lung cancer should be investigated. We also found higher similarities between long-term former smokers and never-smokers, which mainly included inflammation and immune mechanisms, as reported in the previous studies10,34.
This study had some limitations. First, although we used the largest lung cancer consortium to date, further external cohort validation with follow-up information and smoking cessation details is warranted. Second, we selected SNPs with moderate association strengths (from P < 10−5 to P < 10−3); although it was a reasonable approach35,36, some false-positive SNPs may have been included. More well-designed functional experiments are necessary to validate the biological functions.
This study also had several strengths. To the best of our knowledge, this is the first GWAS to investigate the effects of genetic variants on NSCLC patients among long-term former smokers. We included a large and relatively homogeneous study population with relatively complete and accurate follow-up, demographic, and clinical covariate information from ILCCO and HLCS. In addition, we investigated the association of candidate genes and lung cancer at the multi-omics levels, including genomics, transcriptomics, epigenomics, and proteomics. Similarity comparisons among different smoking subgroups revealed the shared genetics status at multiple levels.
In summary, our study demonstrated that TNS3 at 7p12.3 and SEPT7 at 7p14.2 are genetic regions associated with survival among long-term former smokers, and the findings with subgroup were related to immune function. Our results may shed light on the important roles of genetic architecture on cancer outcomes among long-term former smokers with NSCLC, a subpopulation that has been less studied.
Methods
Study population
In accordance with the previous studies37,38,39, long-term former smokers were defined as patients who quitted smoking at least 10 years before diagnosis, whereas short-term former smokers quitted <10 years before diagnosis, and never smokers were those who smoked <100 cigarettes during their lifetime. To identify prognostic SNPs, we focused on long-term former smokers, whose characteristics are presented in Table 1. Patients in the discovery set were recruited from ILCCO, including the Cancer de Pulmon en Asturias study, Carotene and Retinol Efficacy Trial (CARET), Liverpool Lung Cancer Project, MD Anderson Cancer Center Study, and Mount-Sinai Hospital-Princess Margaret Study. While, patients in the independent validation set were recruited from HLCS (Supplementary Information). Approval for ILCCO studies was obtained from each of the participating institutional research ethics review boards. For HLCS, the Institutional Review Board of MGH and the Human Subjects Committee of the Massachusetts General Hospital and Harvard School of Public Health approved the study. All the participants were provided written informed consent to take part in the study.
Of the 6129 NSCLC patients with follow-up information totally, 4351 eligible cases were with available smoking information, including 504 never smokers, 1299 long-term former smokers, 687 short-term former smokers, and 1861 current smokers (Supplementary Table 7).
OncoArray genotype quality control and imputation
The ILCCO study and HLCS were originally designed and genotyped as case–control studies of lung cancer risk. In this study, we extracted all NSCLC patients from these two studies with survival information. Patient genotypes were generated using the Infinium OncoArray-500k BeadChip (Illumina, San Diego, CA, USA), with standard quality control procedures performed on all eligible individuals. Briefly, excluded were samples with <95% completion and SNP assays with call rates <95% or deviating from Hardy–Weinberg equilibrium (P < 10−6). Only SNPs with minor allele frequencies (MAFs) ≥ 0.05 mapping to autosomal chromosomes were included in the analysis. A total of 416,861 SNPs passed quality control40.
Genome-wide imputation following the Michigan Imputation Server pipeline41 was performed to estimate missing genotype information. We phased haplotypes with Eagle v2.3 using 1000 Genomes Project data (phase 3) as a reference panel42 and then performed imputations using the Minimac (version 3) software. SNPs with an imputation quality score R2 < 0.8, MAF < 0.05, or P < 10−6 for the Hardy–Weinberg equilibrium test were excluded from analyses.
Two-stage GWAS survival analysis
We used a two-stage strategy to identify significant prognostic SNPs for NSCLC patients who were long-term former smokers (Fig. 5). SNPs were encoded with an additive model (0: wild type; 1: heterozygosity; 2: homozygosity), unless otherwise stated. To quantify the association of each SNP with survival, we used the Cox proportional hazards regression model to evaluate its effect on survival, after adjusting for age, gender, clinical stage (I–IV), histology, pack-years of smoking, years of smoking cessation, and study center. Hazard ratio (HR) and 95% confidence interval (CI) were described for mortality risk for patients per minor allele carried. To control for the confounding effects of population stratification, we also performed principal component analysis (PCA) and included the first three principal components in the model, although no population stratification was observed in the PCA plot (Supplementary Fig. 2). Following the same selection criteria commonly used in the previous GWAS survival studies of cancers43,44, we defined significant SNPs as those that met the following criteria: (i) P ≤ 10−5 in the discovery set; (ii) P ≤ 0.05 in the validation set; and (iii) P ≤ 5×10−8 in the combined set, reaching genome-wide significance. Kaplan-Meier curves were generated to illustrate survival differences between groups with different SNP genotypes, DNA methylation levels or gene expression levels.
Gene–environment interaction analysis
Since these SNPs significantly associated with prognosis of long-term former smokers with NSCLC, we further conducted gene–environment interaction analysis by estimating how their effects varied with increased years of smoking cessation. Specifically, we included the interaction terms between these SNPs and years of smoking cessation in the Cox proportional hazards regression model adjusted for the same covariates aforementioned.
Multi-omics analysis
We evaluated gene expression, DNA methylation of these significant SNPs, and further performed in silico function prediction and protein analysis.
To examine the eQTL relationships between SNPs and the expression levels of the corresponding genes, we evaluated their associations by using both the summary-level data from 383 lung tissue samples through the GTEx portal (V7 release)45 and the individual-level data from 208 NSCLC Caucasian patients who were long-term former smokers in The Cancer Genome Atlas (TCGA) database (Supplementary Table 8). The cis-eQTL information in GTEx including normalized effect size, standard error, and nominal p value for each SNP-expression pair was collected. Genotype data from TCGA were collected from the GDC Legacy Data Portal (Level 2, birdseed data), which were generated from Affymetrix Genome-Wide Human SNP 6.0 Array. We performed the same quality control and imputation procedures as aforementioned. Gene expression values were normalized using the RNA-seq by expectation–maximization method46, and dichotomized, when needed, into low- and high-expression subgroups by the median values. To summarize the eQTL effects from TCGA and GTEx, we used meta-analysis with the fixed-effects model. The Cox proportional hazards regression model adjusted for the same covariates as aforementioned was utilized to evaluate the prognostic effects of gene expressions in tumor tissues in TCGA.
The association between SNP and DNA methylation was tested among 155 Caucasian patients who were long-term former smokers in TCGA. DNA methylation data were profiled using Illumina HumanMethylation450 BeadChips. The details of quality control were described in47. We used the linear regression model to assess meQTL effects and the Cox proportional hazards regression model to evaluate the association between methylation CpG probes (dichotomized by the median values) and survival. These models were adjusted for the same covariates. False discovery rate adjusted P value (q value) was used to correct for multiple comparisons.
We used an in silico approach through SNPinfo48, RegulomeDB49, and HaploReg v4.150 to predict potential functions of the identified SNPs. We also compared protein levels among the 101 tumors paired with normal adjacent lung cancer tissue samples using Student’s paired t test in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) project51. The data were normalized following the CPTAC Common Data Analysis Pipeline.
Genetic similarity comparative analysis
To assess genetic similarity of long-term former smoking patients with other smoking subgroups (e.g., never, short-term former, and current smokers), we performed GWAS survival analysis within each smoking subgroup using the Cox proportional hazards regression model adjusted for the same covariates aforementioned. We selected SNPs with moderate association strengths (P < 1 × 10−3)36. Genetic similarity comparisons were made at the SNP, gene, eQTL, enhancer, and pathway levels, respectively.
For gene-level analyses, germline-regulated genes were defined as the nearest genes using the Ensembl definitions on genome build GRCh37 (hg19), which annotated the genomic locations for each subgroup using gene start and stop coordinates52. All gene features were defined by GENCODE V2553. We kept SNPs with no strong linkage disequilibrium (LD) using the PLINK indep-pairwise function; the r2 used for all LD trimming was 0.5. eQTLs were collected from GTEx lung tissues as described above.
For enhancer-level analysis, the FANTOM5 human enhancer database was used to identify enhancer activities across most cell types and tissues54. SNPs located within a permissive enhancer region ±1 kb were defined as the enhancer-related variants.
For pathway-level analysis, we performed gene enrichment pathway analysis based on the KEGG database. All enrichment analyses were performed using the R package clusterProfiler55.
All statistical analyses were performed using R (v3.5.2) or PLINK (v1.9).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The data generated and analyzed during this study are described in the following data record: https://doi.org/10.6084/m9.figshare.1422934756.
This study utilized subsets of the Oncoarray Consortium—Lung Cancer Studies data available from the dpGap repository: https://identifiers.org/dbgap:phs001273.v3.p257. Specifically data from CAPUA study (CARET), the Roy Castle Lung Study (Liverpool Lung Cancer Project), the M.D. Anderson Cancer Center Study (MDACC study), the Mount-Sinai Hospital-Princess Margaret Study (MSH-PMH), and the Harvard Lung Cancer Study (HLCS) were accessed for this study. Prospective users of these data must apply for access, and details of how to apply can be found on the dataset landing page.
Functional prediction analyses of rs34211819 and rs1143149 (supporting Supplementary Table 4) are available from the HaploReg website https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php. De-identified participant demographic and phenotype data for lung adenocarcinoma patients are available from the National Cancer Institute GDC Legacy Archive https://portal.gdc.cancer.gov/legacy-archive/files/0f56f656-18f5-4648-97d9-bdbb0f2184a2. The CPTAC proteomics data file (CPTAC_Pro.xlsx) can be openly accessed from the NCI Cancer Research Data Commons repository https://proteomic.datacommons.cancer.gov/pdc/study/PDC000153. The eQTL data (GTEx_Analysis_v8_eQTL.tar) can be accessed directly from https://storage.googleapis.com/gtex_analysis_v8/single_tissue_qtl_data/GTEx_Analysis_v8_eQTL.tar.
Code availability
The code that supports the findings of this study is available from the corresponding author on reasonable request.
References
L., S. R., D., M. K. & Ahmedin, J. Cancer statistics, 2017. CA: Cancer J. Clinicians 67, 7–30 (2017).
Boyle, P. & Maisonneuve, P. Lung cancer and tobacco smoking. Lung Cancer 12, 167–181 (1995).
Murphy, S. E. et al. Tobacco biomarkers and genetic/epigenetic analysis to investigate ethnic/racial differences in lung cancer risk among smokers. NPJ Precis Oncol. 2, 17 (2018).
Moolgavkar, S. H. et al. Impact of reduced tobacco smoking on lung cancer mortality in the United States during 1975–2000. J. Natl Cancer Inst. 104, 541–548 (2012).
Sun, S., Schiller, J. H. & Gazdar, A. F. Lung cancer in never smokers—a different disease. Nat. Rev. Cancer 7, 778 (2007).
Yano, T. et al. Never-smoking nonsmall cell lung cancer as a separate entity: clinicopathologic features and survival. Cancer 113, 1012–1018 (2008).
Couraud, S., Zalcman, G., Milleron, B., Morin, F. & Souquet, P. J. Lung cancer in never smokers—a review. Eur. J. Cancer 48, 1299–1311, (2012).
Slatore, C. G., Au, D. H., Littman, A. J., Satia, J. A. & White, E. Association of nonsteroidal anti-inflammatory drugs with lung cancer: results from a large cohort study. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive. Oncology 18, 1203–1207 (2009).
Huxley, R. et al. Impact of smoking and smoking cessation on lung cancer mortality in the Asia-Pacific region. Am. J. Epidemiol. 165, 1280–1286 (2007).
Wu, X. et al. Genome-wide association study of genetic predictors of overall survival for non-small cell lung cancer in never smokers. Cancer Res. 73, 4028–4038 (2013).
Shen, H., Zhu, M. & Wang, C. Precision oncology of lung cancer: genetic and genomic differences in Chinese population. NPJ Precis Oncol. 3, 14 (2019).
Huang, X. et al. Genomic investigation of co-targeting tumor immune microenvironment and immune checkpoints in pan-cancer immunotherapy. NPJ Precis Oncol. 4, 29 (2020).
Chen, H., Cowan, M. J., Hasday, J. D., Vogel, S. N. & Medvedev, A. E. Tobacco smoking inhibits expression of proinflammatory cytokines and activation of IL-1R-associated kinase, p38, and NF-kappaB in alveolar macrophages stimulated with TLR2 and TLR4 agonists. J. Immunol. 179, 6097–6106 (2007).
Smith, C. J. & Hansch, C. The relative toxicity of compounds in mainstream cigarette smoke condensate. Food Chem. Toxicol. 38, 637–646 (2000).
Sopori, M. Effects of cigarette smoke on the immune system. Nat. Rev. Immunol. 2, 372–377 (2002).
Stampfli, M. R. & Anderson, G. P. How cigarette smoke skews immune responses to promote infection, lung disease and cancer. Nat. Rev. Immunol. 9, 377–384 (2009).
Angelis, D. & Spiliotis, E. T. Septin mutations in human cancers. Front Cell Dev. Biol. 4, 122 (2016).
Connolly, D., Abdesselam, I., Verdier-Pinard, P. & Montagna, C. Septin roles in tumorigenesis. Biol. Chem. 392, 725–738 (2011).
Russell, S. & Hall, P. Do septins have a role in cancer? Br. J. Cancer 93, 499 (2005).
Liu, M., Shen, S., Chen, F., Yu, W. & Yu, L. Linking the septin expression with carcinogenesis. Mol. Biol. Rep. 37, 3601–3608 (2010).
Wang, X. et al. The role of septin 7 in physiology and pathological disease: a systematic review of current status. J. Cell Mol. Med 22, 3298–3307 (2018).
Hou, M., Liu, X., Cao, J. & Chen, B. SEPT7 overexpression inhibits glioma cell migration by targeting the actin cytoskeleton pathway. Oncol. Rep. 35, 2003–2010 (2016).
Abbey, M. et al. GTPase domain-driven dimerization of SEPT7 is dispensable for the critical role of septins in fibroblast cytokinesis. Sci. Rep. 6, 20007 (2016).
Menon, M. B. et al. Genetic deletion of SEPT7 reveals a cell type-specific role of septins in microtubule destabilization for the completion of cytokinesis. PLoS Genet. 10, e1004558 (2014).
Touaitahuata, H. et al. Tensin 3 is a new partner of Dock5 that controls osteoclast podosome organization and activity. J. Cell Sci. 129, 3449–3461 (2016).
Qian, X. et al. The Tensin-3 protein, including its SH2 domain, is phosphorylated by Src and contributes to tumorigenesis and metastasis. Cancer Cell 16, 246–258 (2009).
Cao, X. et al. A phosphorylation switch controls the spatiotemporal activation of Rho GTPases in directional cell migration. Nat. Commun. 6, 7721 (2015).
Carter, J. A., Gorecki, D. C., Mein, C. A., Ljungberg, B. & Hafizi, S. CpG dinucleotide-specific hypermethylation of the TNS3 gene promoter in human renal cell carcinoma. Epigenetics 8, 739–747 (2013).
Ohno, H., Takimoto, G. & McKeithan, T. W. The candidate proto-oncogene bcl-3 is related to genes implicated in cell lineage determination and cell cycle control. Cell 60, 991–997 (1990).
Wulczyn, F. G., Naumann, M. & Scheidereit, C. Candidate proto-oncogene bcl-3 encodes a subunit-specific inhibitor of transcription factor NF-κB. Nature 358, 597 (1992).
Maldonado, V. & Melendez-Zajgla, J. Role of Bcl-3 in solid tumors. Mol. Cancer 10, 152 (2011).
Staudt, L. M. et al. Cloning of a lymphoid-specific cDNA encoding a protein binding the regulatory octamer DNA motif. Science 241, 577–580 (1988).
Corcoran, L. et al. Oct2 and Obf1 as facilitators of B:T cell collaboration during a humoral immune response. Front. Immunol. 5, 108 (2014).
Pu, X. et al. Predictors of survival in never-smokers with non-small cell lung cancer: a large-scale, two-phase genetic study. Clin. Cancer Res. 18, 5983–5991 (2012).
Jia, P., Wang, L., Meltzer, H. Y. & Zhao, Z. Pathway-based analysis of GWAS datasets: effective but caution required. Int. J. Neuropsychopharmacol. 14, 567–572 (2011).
O’Brien, T. D., Jia, P., Caporaso, N. E., Landi, M. T. & Zhao, Z. Weak sharing of genetic association signals in three lung cancer subtypes: evidence at the SNP, gene, regulation, and pathway levels. Genome Med. 10, 16 (2018).
Ellickson, P. L., Mcguigan, K. A. & Klein, D. J. Predictors of late-onset smoking and cessation over 10 years. J. Adolesc. Health 29, 101–108 (2001).
Edwards, R. The problem of tobacco smoking. BMJ 328, 217–219 (2004).
Cornuz, J., Feskanich, D., Willett, W. C. & Colditz, G. A. Smoking, smoking cessation, and risk of hip fracture in women. Am. J. Med. 106, 311–314 (1999).
McKay, J. D. et al. Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes. Nat. Genet. 49, 1126 (2017).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Loh, P.-R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443 (2016).
Wu, C. et al. Genome-wide association study of survival in patients with pancreatic adenocarcinoma. Gut 63, 152–160 (2014).
Tang, H. et al. Genetic polymorphisms associated with pancreatic cancer survival: a genome-wide association study. Int. J. Cancer 141, 678–686 (2017).
Lonsdale, J. et al. The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580 (2013).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011).
Shen, S. et al. A multi-omic study reveals BTG2 as a reliable prognostic marker for early-stage non-small cell lung cancer. Mol. Oncol. 12, 913–924 (2018).
Xu, Z. & Taylor, J. A. SNPinfo: integrating GWAS and candidate gene information into functional SNP selection for genetic association studies. Nucleic Acids Res. 37, W600–W605 (2009).
Boyle, A. P. et al. Annotation of functional variation in personal genomes using Regulome DB. Genome Res. 22, 1790–1797 (2012).
Ward, L. D. & Kellis, M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 40, D930–D934 (2011).
Gillette, M. A. et al. Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma. Cell 182, 200–225 e235 (2020).
Zerbino, D. R., Wilder, S. P., Johnson, N., Juettemann, T. & Flicek, P. R. The ensemble regulatory build. Genome Biol. 16, 56 (2015).
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–d773 (2019).
Andersson, R. et al. An atlas of active enhancers across human cell types and tissues. Nature 507, 455 (2014).
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: J. Integr. Biol. 16, 284–287 (2012).
Shen, S. et al. Metadata record for the article: a multi-omics study links TNS3 and SEPT7 to long-term former smoking NSCLC survival. figshare https://doi.org/10.6084/m6089.figshare.14229347 (2021).
Oncoarray Consortium—Lung Cancer Studies. dbGaP https://identifiers.org/dbgap:phs001273.v3.p2.
Acknowledgements
This study was funded by the National Key Research and Development Program of China (2016YFE0204900 to F.C.), National Natural Science Foundation of China (81530088 to F.C. and 81973142 to Y.W.), Jiangsu Planned Projects for Postdoctoral Research Funds (2020Z019 to S.S.), Natural Science Foundation of Jiangsu Province (BK20191354 to R.Z.), Natural Science Foundation of the Jiangsu Higher Education Institutions of China (18KJB310011 to R.Z.), China Postdoctoral Science Foundation (2020M681671 to S.S. and 2018M633767 to R.Z.), US National Institutes of Health (CA209414 to D.C.C. and CA203654 to C.I.A. and R.J.H.), and Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD). CARET is funded by the National Cancer Institute, National Institutes of Health through grants U01-CA063673, UM1-CA167462, and U01-CA167462. R.Z. was partially supported by the Outstanding Young Teachers Training Program of Nanjing Medical University.
Author information
Authors and Affiliations
Contributions
S.S., Y.W., D.C.C., F.C., and R.Z. contributed to the study design. A.T., C.C., J.F., R.H., G.L., D.Z., C.A., T.F., L.S., and D.C.C. contributed to the data collection. S.S., R.Z., Y.W., W.D., X.D., L.L., D.Y., and Y.Z. performed the statistical analysis and interpretation. S.S. and R.Z. drafted the manuscript. S.S., R.Z., Y.W., R.H., Z.H., H.S., and Y.L. revised the manuscript. All authors contributed to critical revision of the final manuscript and approved the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Shen, S., Wei, Y., Li, Y. et al. A multi-omics study links TNS3 and SEPT7 to long-term former smoking NSCLC survival. npj Precis. Onc. 5, 39 (2021). https://doi.org/10.1038/s41698-021-00182-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41698-021-00182-3
This article is cited by
-
ATHENA: an independently validated autophagy-related epigenetic prognostic prediction model of head and neck squamous cell carcinoma
Clinical Epigenetics (2023)
-
Tensin-3 is involved in osteogenic versus adipogenic fate of human bone marrow stromal cells
Cellular and Molecular Life Sciences (2023)