Introduction

Lung cancer (LC) is one of the cancers with a high mortality rate in the world, accounting for approximately one quarter of all cancer deaths1. And smoking is currently considered to be a major risk factor for it2. In addition, exposure to environmental factors such as radon, secondhand smoke and dust, asbestos, cooking fumes and air pollution are also the main causes of LC in non-smokers3,4,5,6. However, it’s not only the environmental factors but also genetic differences that contribute to LC susceptibility. Over the past two decades, multi-population Genome-wide association studies(GWAS) have identified dozens of risk loci for LC7,8, and most of these loci are concentrated in 5p15.33 (Telomerase reverse transcriptase—Cleft lip and cleft palate transmembrane protein 1)TERT-CLPTM1L region9,10,11,12. Several precise localization studies in the following years have also identified some new LC risk loci in this region13,14,15. Telomeres are consisted of repeated "TTAGGG" at the ends of chromosomes that gradually shorten in length with each round of cell division until cell cycle arrest is triggered, of which process is known as replicative senescence16,17,18,19. Telomeres can normally be elongated by the ribonucleoprotein telomerase to maintain the replication potential20,21. In human cancer cells, however, telomerase has been activated to escape the initial growth arrest and continue to divide22. Unlimited cell growth and proliferation following the activation of telomerase is one of the clinical cancer phenotypes23,24,25. It has been proved that long telomeres can promote the survival of cells with acquired oncogenic DNA alterations, thereby promoting tumorigenesis26,27,28. Telomerase is consisted of a catalytic protein component encoded by the TERT gene and an RNA template encoded by the Telomerase RNA component(TERC). Among them, TERT is located at the short arm 15.33 of chromosome 5 (5p15.33), which is responsible for encoding the catalytic subunit of telomerase29, regulating the expression level of telomerase, and maintaining telomere length, chromosomal stability and cell proliferation by adding "TTAGG" repeats at the end of the chromosomes30,31.

Variations of the TERT promoter are an important prerequisite for high telomerase expression to stabilize telomere length32, and this process has been observed in cancer cells23. Polymorphic genes in TERT and TERC have been reported to be associated with telomere length33,34,35, and longer telomere length contributes to increasing the risk of LC, especially for Lung adenocarcinoma (LUAD)36,37,38. In addition, the TERT gene is significantly overexpressed in LC tissues, which may also confirm the underlying mechanism of LC risk39. However, the association between LC risk and telomere length is inconclusive as telomere length varies with the histological type of LC40,41. Several single nucleotide polymorphisms (SNPs) in the TERT locus have been reported to be associated with cancer risk, and these SNPs are located in the exons or introns of TERT or its promoter42. The rs2736100 (A > C) polymorphism located in the second intron of TERT is the most common SNP in the TERT gene, and its association with cancer susceptibility, including LC, has been reported in various malignant tumors29. In TERT rs2736100, the C allele upregulates TERT expression in normal and LC tissues19 and is associated with longer telomere length35,43. Studies also have found that the increased telomere length of the C allele is associated with cancer44. Some studies have also shown an increased frequency of the C allele of TERT rs2736100 in LC patients9,45,46,47,48. These evidences imply that the C allele upregulates TERT expression, maintains and prolongs telomere length, and thus increases the risk of LC. In addition, some studies have conducted racial stratification analysis for different types of LC and proved that the influence of TERT variants in Asians is stronger than that in Caucasians45,49. These results in turn imply that the frequency of TERT rs2736100 variants varies across ethnic populations. However, there are some studies have not found the association between the C allele and LC50,51. The reasons for these different results may also be related to different ethnicities, countries, research methods, sample sizes, LC types, and linkage disequilibrium patterns. Therefore, there’s inconsistency in the results of the association of TERT rs2736100 with LC. While meta-analysis is an effective way to combine data to increase the sample size, obtain sufficient power to clarify inconsistent results in genetic association studies52.

Several meta-analyses have reported the association of the TERT rs2736100 polymorphism with LC, but these meta-analyses have some shortcomings: some meta-analyses have shown an increased frequency of the C allele of TERT rs2736100 in LC patients but they ignored the effect of different ethnic groups53,54; there are some meta-analyses of ethnic stratification of rs2736100, but most of them focused on different types of cancer, and they were not subjected to a stratified analysis of LC48; some studies have done racially stratified meta-analyses for different types of LC, however, they are outdated55. Therefore, there is still a lack of a unified conclusion on the association of TERT rs2736100 polymorphism with LC, especially the variability of this association in different ethnic populations and in different LC subtypes. This study included data from GWAS and case–control studies reporting the association of TERT rs2736100 (A > C) polymorphisms with LC up to date with the aim of clarifying its association with LC and the differences in this association between different ethnicities and different types of LC.

Data and methods

Inclusion and exclusion criteria

Inclusion criteria

They must be GWAS or case–control studies on TERT rs2736100 A/C gene polymorphism and LC susceptibility, the language should be English, and the detection methods and means should be accurately described; The gene frequency data can be used to calculate the Odds ratio(OR) and 95% Confidence interval(95% CI); The distribution of genotype frequency of all controls conforms to Hardy–Weinberg(HWE)56; The score of Newcastle Ottawa scale(NOS)57 should be no less than 7 (≥ 7).

Exclusion criteria

Studies without allele-related data; Studies of the types of reviews, meta-analyses, conference reports and case reports; Studies with pedigree as the reporting object; same studies have published for multiple times, only the one with the most complete data will be included, and the others will be excluded.

Outcomes

The pre-specified primary outcomes were to investigate whether TERT rs2736100 A/C polymorphism increased the risk of LC in the overall population. The secondary outcomes were to determine whether there were differences in the intensity of the association between the TERT rs2736100 A/C polymorphism and LC (including various subtypes) between different ethnic groups.

Retrieval strategy

Relevant literatures on TERT rs2736100 polymorphism and LC susceptibility in PubMed, EMbase, CENTRAL, MEDLINE databases published before May 7, 2022 were searched by theme words and keywords. The language was limited to English.

Search terms in PubMed(Table 1/Table S1 in supplemental content): "Lung cancer" OR "LC"AND "rs2736100" OR "TERT" AND "polymorphism". Manual retrieval and literature tracing methods were used at the same time to expand the search scope.

Table 1 PubMed search strategy.

Literature screening and data extraction

Two relatively independent researchers (X–ZW and WL) completed literature searching and screening according to the inclusion criteria. They cross checked and discussed them afterwards. For the literatures with different opinions, the third party (Y–ZC) made the decision. For some literatures with incomplete data, they tried to contact the author by e-mail to obtain the complete data. Finally, data extraction was carried out for the literatures being chosen after the final decision. These data include: author, year of publication, country, ethnicity, smoking status of subjects, type of LC, number of cases in case and control groups, frequency of each genotype in case and control groups, and the OR and 95% CI of each genotype.

Literature quality evaluation

The quality of the included literature was evaluated in the NOS57 (X–ZW and WL), and those with a score of no less than 7 were considered as literatures with high-quality.

Statistical methods

The HWE of the genotypes of the controls was detected by Pearson's chi-square test in SPSS 22.0 software. All results were statistically counted and analyzed in Revman 5.3 software, including drawing forest plots and funnel plots. When there was no heterogeneity among all studies or among all subgroups (P > 0.1 or I2< 50%), the fixed-effects model was used for statistical analysis; otherwise, the random-effects model was used for statistical analysis. The effect size and effect value of the statistical results were presented by OR value and 95% CI. Begg's Test and Egger's Test were performed in Stata 14.0 software to assess publication bias among studies, and sensitivity analysis was performed to assess the results of statistical analysis with greater heterogeneity. TSA 0.9.5.10 software was performed for the Trial sequential analysis(TSA) tests to evaluate the stability of the conclusion ((Type I error) probability = 5%, statistical test power = 80%, relative risk reduction = 20%).

Ethics and dissemination

This review does not require ethical approval because the included studies are published data and do not involve the patients’ privacy. The results of this review will be reported in accordance with the PRISMA extension statement and disseminated to a peer-reviewed journal.

Results

Characteristic of eligible studies

A total of 398 literatures were initially retrieved from the 4 databases and 43 studies in 40 literatures were finally included after the screening10,12,14,39,45,46,47,50,51,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88 , of which there were 25 GWAS in 22 literatures10,12,14,45,47,50,58,59,60,61,63,64,65,68,69,74,76,77,79,80,87,88. And a flow chart was made according to the PRISMA statement (Fig. 1). Among these studies, 12 in Caucasians and 31 in Asians were included. There were 99,941 LC patients (including 36,943 Caucasian patients and 62,998 Asian patients) and 131,856 controls (Tables 2, 3). All 43 studies had high NOS57 assessment scores (≥ 7), indicating that they are all at low risk of bias (Table 4).

Figure 1
figure 1

PRISMA literature screening flow chart.

Table 2 Basic features of the included study (1).
Table 3 Basic features of the included study (2).
Table 4 Newcastle Ottawa scale (NOS).

Quantitative analysis

LC

The allelic model (C vs. A) was used to evaluate the association of TERT 2736100 with LC susceptibility. The random effects model was used for analysis as the test results showed that there was heterogeneity after the heterogeneity test (Overall population: P < 0.00001, I2= 83%; Caucasians: P < 0.0001, I2 = 73%; Asians: P < 0.00001, I2 = 74%) (Fig. 2a, Table 5). It was found that the C allele was associated with the risk of LC (Overall population: [OR] = 1.21, 95%CI [1.17, 1.25]; Caucasians: [OR] = 1.11, 95%CI [1.06, 1.17]; Asians: [OR] = 1.26, 95%CI [1.21, 1.30]), and Asians had a higher risk of LC than Caucasians (C vs. A: Caucasians: [OR] = 1.11 /Asians: [OR] = 1.26) (Fig. 2a, Table 5). The additive, heterozygous, dominant and recessive genetic models (CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA) were further used to evaluate the correlation between TERT 2736100 and LC since 29 of the 43 studies reported complete genotype frequencies. And the fixed-effects model (P > 0.1 or I2 < 50%) and random-effects model (P < 0.1 or I2 > 50%) were used to analyze each subgroup due to the different heterogeneity of each subgroup. Meta-analysis showed that people with "C" genotype had higher risks of LC than those with "A" genotype (P < 0.00001), and Asians had higher risks of LC than Caucasians (CC vs. AA: Caucasians: [OR] = 1.33/Asians: [OR] = 1.60; CA vs. AA: Caucasians: [OR] = 1.17/Asians: [OR] = 1.26; CA + CC vs. AA: Caucasians: [OR] = 1.22/Asians: [OR] = 1.34; CC vs. AA + CA: Caucasians: [OR] = 1.19/Asians: [OR] = 1.41) (Fig. 2b–e, Table 5). It’s also found that carriers of the CC genotype ([OR] = 1.56) were more likely to develop LC than carriers of the CA genotype ([OR] = 1.25) (Table 5).

Figure 2
figure 2figure 2

Forest plots of LC. (a) Forest plot of the allele genetic model (C vs. A) (Random). (b) Forest plot of the additive genetic model (CC vs. AA) (Random). (c) Forest plot of the heterozygous genetic model (CA vs. AA) (Fixed). (d) Forest plot of the dominant genetic model (CA + CC vs. AA) (Random). (e) Forest plot of the recessive genetic model (CC vs. AA + CA) (Fixed).

Table 5 The results of Meta-analysis and publication bias (LC).

LC subtypes

A further stratified analysis of these LC studies was performed since there were four different disease types in LC studies: Non-small-cell lung carcinoma(NSCLC, N = 21), Small cell lung carcinoma (SCLC, N = 7), Lung adenocarcinoma(LUAD, N = 17) and Lung squamous cell carcinoma(LUSC, N = 13). Meta-analysis of the allele model (C vs. A) found that the C allele was associated with the risk of NSCLC (Overall population: [OR] = 1.27, 95%CI [1.22, 1.33]; Caucasians: [OR] = 1.19, 95%CI [1.09, 1.31]; Asians: [OR] = 1.28, 95%CI [1.22, 1.34]), and Asians had a higher risk of NSCLC than Caucasians (C vs. A: Caucasians: [OR] = 1.19/Asians: [OR] = 1.28) (Fig. S1 in supplemental content, Table 6). In SCLC patients, the C allele was associated with the risk of SCLC only in Asians (Overall population: [OR] = 1.03, 95%CI [0.98, 1.09]; Caucasians: [OR] = 1.00, 95%CI [0.94, 1.06]; Asians: [OR] = 1.11, 95%CI [1.01, 1.22]) (Fig. S1 in supplemental content, Table 6). In LUAD patients, the C allele was associated with the risk of developing LUAD (Overall population: [OR] = 1.32, 95%CI [1.26, 1.38]; Caucasians: [OR] = 1.22, 95%CI [1.16, 1.28]; Asians: [OR] = 1.34, 95%CI [1.27, 1.41]), and Asians had a higher risk of developing LUAD than Caucasians (C vs. A: Caucasians: [OR] = 1.22/Asians: [OR] = 1.34) (Fig. S2 in supplemental content, Table 6). In LUSC patients, the C allele was associated with LUSC risk in Asians but not in Caucasians (Overall population: [OR] = 1.09, 95%CI [1.06, 1.13]; Caucasians: [OR] = 1.04, 95%CI [0.99, 1.10]; Asians: [OR] = 1.13, 95%CI [1.08, 1.18]) (Fig. S2 in supplemental content, Table 6). It’s also found that NSCLC patients ( [OR] = 1.27) had a stronger disease association than SCLC patients ( [OR] = 1.03) when the OR values were compared (Fig. S1 in supplemental content, Table 6), and LUAD patients ([OR] = 1.32) had a stronger disease association than LUSC patients ([OR] = 1.09) (Fig. S2 in supplemental content, Table 6).

Table 6 The results of Meta-analysis and publication bias (Allele genetic model, C vs.A).

Analysis of smoking status in LC patients

Among the included studies, 25 reported smoking or non-smoking in LC patients, of which 9 reported smoking history in LC patients and 16 reported no smoking history in LC patients. Therefore, a stratified analysis of smoking in LC patients in these 25 studies was conducted to clarify whether smoking caused variation in TERT rs2736100 and increased the risk of LC. Meta-analysis of the allele model (C vs. A) found that the C allele was associated with the risk of LC in both the smoking group and the non-smoking group (Smoking: [OR] = 1.16, 95%CI [1.09, 1.23]; Non-smoking: [OR] = 1.34, 95%CI [1.26, 1.41]), and the risk of LC in the non-smoking group was higher than that in the smoking group (C vs. A: Smoking: [OR] = 1.16/Non-smoking: [OR] = 1.34), and it was also found that non-smokers had the highest risk of LC in Asians ([OR] = 1.36, 95%CI [1.27, 1.46]) (Fig. S3 in supplemental content, Table 7).

Table 7 Meta-analysis results of smoking status (Allele genetic model, C vs. A).

A further stratified analysis of the smoking status of patients with different types of LC was performed due to the presence of different types of LC in the included studies. For NSCLC, the TERT polymorphism (C vs. A) was associated with the risk of NSCLC in both smoking group and non-smoking group (Smoking: [OR] = 1.20, 95%CI [1.05, 1.36]; Non-smoking: [OR] = 1.33, 95%CI [1.18, 1.50]), and the non-smoking group had a higher risk of NSCLC than the smoking group (C vs. A: Smoking: [OR] = 1.20/Non-smoking: [OR] = 1.33), and it’s also found that non-smokers had the highest risk of NSCLC in Asians ( [OR] = 1.35, 95%CI [1.17, 1.55]) (Fig. S4 in supplemental content, Table 7). For LUAD, the TERT polymorphism (C vs. A) was associated with the risk of LUAD in both the smoking group and the non-smoking group (Smoking: [OR] = 1.26, 95%CI [1.16, 1.37]; Non-smoking: [OR] = 1.37, 95%CI [1.20, 1.56]), and the risk of developing LUAD in the non-smoking group was higher than that in the smoking group (C vs. A: Smoking: [OR] = 1.26/Non-smoking: [OR] = 1.37). In addition, the risk of LUAD was found to be the highest among non-smokers in Caucasians ( [OR] = 1.40, 95%CI [1.17, 1.68]) (Fig. S5 in supplemental content, Table 7). For LUSC and SCLC, TERT polymorphisms (C vs. A) were not associated with the risk of LUSC and SCLC in both smoking group and non-smoking group in all populations (P > 0.05) (Table 7).

Sensitivity analysis

For LC, the sensitivity analysis results of the allele, additive, heterozygous, dominant and recessive genetic models (C vs. A, CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA) showed that none of the studies had significant sensitivity, indicating that there’s no significant difference in the result of the meta-analysis after removing any study (Fig. S6, Tables S2S6 in supplemental content). For NSCLC, SCLC, LUAD and LUSC, the sensitivity analysis of the allele model (C vs. A) also showed no significant sensitivity (Fig. S7, Tables S7, S8 in supplemental content).

Heterogeneity analysis

For LC, there was some heterogeneity in the overall population analysis results for the allele, additive, heterozygous, dominant and recessive genetic models (C vs. A, CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA) (P < 0.1 or I2> 50%), and this heterogeneity mainly exists in Asians (Table 5). In the stratified analysis, the allele model (C vs. A) of NSCLC and LUAD analysis results in the overall population also showed a certain degree of heterogeneity (P < 0.1 or I2> 50%), and this heterogeneity mainly existed in Asians (Table 6).

Publication bias

For LC, the funnel plots of the allele, additive, heterozygous, dominant and recessive genetic models (C vs. A, CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA) were all roughly symmetrical, suggesting there’s no apparent bias (Fig. S8 in supplemental content). In terms of NSCLC, SCLC, LUAD and LUSC, the funnel plots of the allele model (C vs. A) were all roughly symmetrical (Fig. S9 in supplemental content). Additionally, the results of publication bias for all genetic models suggested that there were no obvious biases (PBegg > 0.05, PEgger > 0.05) (Tables 5, 6/Figs. S10S12 in supplemental content).

Trial sequential analysis (TSA)

For LC, TSA analysis of the allele, additive, heterozygous, dominant and recessive genetic models (C vs. A, CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA) showed Z-curve (blue line) crossed both the traditional boundary (green dashed line) and the TSA boundary (red line) (Figs. S13S17 in supplemental content). In terms of NSCLC, SCLC, LUAD and LUSC, TSA analysis of the allele model (C vs. A) in the overall and Asian populations also showed the same results (Figs. S18S21 in supplemental content). Similar results were found in TSA analysis of the allele model (C vs. A) for patients with LC, NSCLC, and LUAD in terms of smoking status (Figs. S22S24 in supplemental content). These results showed the overall stability and credibility of the results of this meta-analysis. The TSA results of NSCLC, SCLC, LUAD and LUSC in Caucasians cannot be comprehensively analyzed due to the reasons such as small sample size or the absence of complete gene frequencies in some of the original data reported in the literature. In addition, TSA results for smoking status in SCLC and LUSC couldn’t be comprehensively analyzed because of these reasons as well.

Summary of all the results

Due to the large amount of data in this study, a summative forest plot of all the results was created to show the statistical results more visually and more clearly, see Fig. 3.

Figure 3
figure 3

Summary forest plot of all results.

Discussion

Current studies have reported that gene polymorphisms in TERT and TERC are associated with telomere length33,34,35, and longer telomeres length contributes to an increased risk of LC36,37,38. The increased telomere length of the C allele of the rs2736100 (A > C) polymorphism in the second intron of TERT is related to cancer44. A number of research reports have also reported that the frequency of the C allele of TERT rs2736100 increases in patients with LC9,45,46,47,48. It’s showed that the C allele can upregulate the expression of TERT, maintain and prolong telomere length, thereby increasing the risk of LC. However, due to the existence of factors such as ethnic differences, different types of LC, environmental pollution and smoking, the association between TERT rs2736100 polymorphism and LC still lacks a unified conclusion. This study included the data of GWAS and case–control studies on the association of rs2736100 polymorphism with LC that have been reported so far to clarify the association between this polymorphism and LC and the differences in the association between different ethnic groups and different types of LC.

43 studies (including 99,941 LC patients and 131,856 healthy controls) were included in this meta-analysis. The association of TERT polymorphisms with LC susceptibility was first evaluated by using the allele, additive, heterozygous, dominant and recessive genetic models (C vs. A, CC vs. AA, CA vs. AA, CA + CC vs. AA and CC vs. AA + CA). And the results showed that the C allele and "C" genotype were associated with the risk of LC comparing with the A allele and "A" genotype in the overall population. These results are consistent with those of previous GWAS studies10,12,14,45,47,50,59,61,64,65,68,69,74,76,77,79,80,87,88. It indicates that people with C allele are more likely to suffer from LC, and C allele and "C" genotype are the risk factors for LC, and the C allele increases the risk of LC by extending telomere length. However, there are some GWAS that haven’t found the association between the C allele and LC58,59,60,63. The reasons for these different results may also be related to different ethnicities, countries, research methods, sample sizes, LC types, and linkage disequilibrium patterns. Previous studies also reported that the impact of TERT variation in Asians was stronger than that in Caucasians45,55. Another study showed that rs2735947 was the most significant SNP in the Caucasians rather than rs273610049. Our findings also confirmed that the C allele and "C" genotype frequencies were indeed higher in Asians than in Caucasians, suggesting that Asians may have longer telomeres that leads to an increased risk of LC.

Since telomere length can vary with the histological type of LC40,41, different types of LC may have different degrees of association with TERT gene polymorphism due to their different pathological types. Therefore, a stratified analysis of the included LC studies was performed. Previous studies have found that longer telomere length contributes to increase the risk of LC, especially for NSCLC and LUAD36,37,38, and the C allele can increase the risk of NSCLC65,83. The results of our study also suggested that the C allele was associated with the risk of NSCLC. It indicates that the population carrying the C allele are more susceptible to NSCLC due to telomere lengthening. And it’s also found in our study that Asians had a higher risk of NSCLC than Caucasians, proving that Asians may have longer telomeres, which contribute to an increased risk of NSCLC. Some studies47,69 found that TERT rs2736100 wasn’t associated with the risk of SCLC in Caucasians, but Hu et al10 found that TERT rs2736100 could increase the risk of developing SCLC in Asians. Our results showed that the C allele was only associated with the risk of SCLC in Asians. It suggests that the C allele is a risk factor for SCLC in Asians but not in Caucasians, and the reason may be strongly related to the fact that Asian populations may have longer telomeres. When the OR values of NSCLC and SCLC were compared, it was found that NSCLC patients had a stronger disease association than SCLC patients. A previous study92 identified a locus on chromosome 5p15.33 that was significantly associated with the risk of LUAD in NSCLC, but not with other major histological types. Another study found that TERT s2736098 was significantly associated with an increased risk of SCLC in the Chinese population instead of rs273610089. These findings, combined with our results, suggested that Asian populations and NSCLC patients may have longer telomeres, which triggered the risk of cancer, and TERT rs2736100 is of a higher value as a genetic marker for diagnosing the pathogenesis of NSCLC than SCLC.

NSCLC is the most common type of LC, and LUAD is the most prevalent subtype of NSCLC73. Previous studies90 have found rs2736100 to be a risk factor associating with increased susceptibility to LC, especially for LUAD. results of this study also showed that the C allele was associated with the risk of LUAD, confirming that the risk of developing LUAD is also strongly associated with telomere lengthening36,37,38. The results of this study also showed that Asians had a higher risk of LUAD than Caucasians, suggesting that Asians may possess longer telomeres, which contribute to an increased risk of LUAD. Some studies have found that there’s no such a risk association among LUSC patients49. Several other studies47,69 also showed that TERT rs2736100 wasn’t associated with the risk of developing LUSC in Caucasians. However, in some studies on Asians10, the C allele of TERT rs2736100 was found to increase the risk of developing LUSC. Results of this study also showed that the C allele was associated with LUSC risk in Asians but not Caucasians. It proves that the C allele is a risk factor for LUSC in Asians but not in Caucasians and the reason has a lot to do with the fact that Asian populations may has longer telomeres. It’s found that patients with LUAD had a stronger disease association than patients with LUSC. Previous studies have confirmed that rs2736100 was more associated with LUAD than with LUSC69,91, which is consistent with our findings. Similarly, there are studies92 have identified a locus on chromosome 5p15.33 that is clearly associated with the risk of LUAD but not with other major histological types. These evidences demonstrate that Asian populations and patients with LUAD may have longer telomeres, thereby triggering the risk of cancer, and TERT rs2736100 has a higher value as a genetic marker for diagnosing the pathogenesis of LUAD than LUSC.

Epidemiological surveys showed that although smoking was identified as a major environmental risk factor for LC worldwide, only a small proportion of smokers develop LC during their lifetime. In contrast, a large proportion of LC cases have no history of smoking93,94. LC in never-smokers differs from LC in smokers, and a large proportion of LC patients in never-smokers carry genetic variants in oncogenes95. Recent studies have shown that the genetic susceptibility of never-smokers to LC is associated with genetic variants with pan-cancer risk effects, and that gene-environment interactions are important in LC etiology96. Tumor suppressor genes are normally expressed in healthy cells due to key regulators of cell division, such as cyclin and cyclin-dependent kinases, as well as other cell cycle checkpoints that limit this process97. However, when oncogenes triggered by environmental factors are activated and tumor suppressor genes are turned off, the control of cell division is altered, and cancer starts from a single cell98,99. Studies have shown that multiple environmental risk factors such as smoking, heavy alcohol consumption, high intake of red meat and fat, low fiber intake , indoor and outdoor air pollution, and exposure to chemicals and radiation can contribute to genomic instability100,101,102,103,104. Genomic instability leads to nucleotide dysfunction, such as base substitution, base loss, nucleotide deletion, insertion or amplification of base pairs, which further induce DNA breaks, chromosomal remodeling or translocation. And if the damage is not fixed, it can lead to irreversible cell mutation and continuous growth105,106. In LC studies, CT and TT genotype carriers of miR-26a-1 rs7372209 and miR-16-1 rs1022960 who have been exposed to cooking fumes have a higher risk of LC than those who have not been exposed107. Another study evaluating the association between gene-radon interactions among uranium miners and LC indicated that the OR interaction effect of SNP rs6891344 and rs11747272 with chromosomes 5q23.2 was estimated to be 3.9 and 3.4, suggesting that uranium miners exposed to the radioactive gas radon are more susceptible to LC108. These evidences suggest that a variety of environmental factors other than smoking can also cause genetic variants that lead to LC. Therefore, a stratified analysis on the smoking status of LC patients included in the study was conducted to clarify whether smoking or non-smoking caused variation in TERT rs2736100 and increased the risk of LC. The results showed that the C allele was associated with the risk of LC in both smokers and non-smokers, and the risk of LC in non-smokers was higher than that in smokers. It’s been reported that rs2736100 is the most significant variation among non-smokers, while rs2736100 is less significant than rs3601944649 among smokers, which confirms that TERT variation has a stronger impact on non-smokers than on smokers45,109.A study also showed that TERT SNP was a risk factor for LC in never smokers110. Similarly, a case–control study also showed that the C allele increased the risk of LC in never smokers111. Therefore, smoking is not the most critical factor to cause variation in TERT rs2736100 and increase the risk of LC.

To further clarify this genetic difference between smokers and non-smokers, we performed a stratified analysis of different types of LC in different ethnic groups as the telomere length and the frequency of TERT gene variants were different in different ethnic groups and different histological types of LC40,41. The results of this study also showed that TERT polymorphism (C vs. A) was associated with the risk of NSCLC in both smokers and non-smokers, and the risk of NSCLC in non-smokers was higher than that in smokers. For LUAD, the same result existed: TERT polymorphism (C vs. A) was associated with the risk of LUAD in both smokers and non-smokers, and the risk of LUAD in non-smokers was higher than that in smokers. Previous studies have also found that there are non-tobacco related risk factors in the pathogenesis of NSCLC. These possible risk factors include: the exposure to cooking fume, hormones and viral infection112. Subramanian113 mentioned before that LUAD was the most common type among never smokers. Therefore, non-smokers are more likely to be at the risk of NSCLC and LUAD due to variation in TERT rs2736100 leading to telomere lengthening. It’s confirmed that smoking does cause variation in TERT rs2736100, which increases the risk of most LC (NSCLC, LUAD), however, it’s not the most critical factor. Evidence shows that82 education level, BMI, prior diagnosis of COPD, occupational exposure to pesticides, duration of smoking, exposure to a large number of cooking emissions, dietary factors (including less fish and shrimp, vegetables, soy products and nuts) and the excessive intake of meat in LC patients are all related to the development of LC. When combined with many environmental and lifestyle factors, TERT rs2736100 is still significantly associated with LC82. Therefore, LC (NSCLC, LUAD) is a multi-etiological disease caused by a combination of genetic and lifestyle factors. Comparing with different ethnic groups, it’s found that the risk of LC and NSCLC in the non-smokers was the highest in Asians. Combined with the results above, it’s proved that the Asian non-smoking populations may be more likely at the risk of LC and NSCLC due to the elevated frequency of TERT rs2736100 C allele combined with environmental factors that cause telomere lengthening. But for LUAD, non-smokers were found to have the highest risk of developing LUAD in Caucasians rather than Asians. The reason for this is still related to the small sample size of non-smokers in Caucasians, and the fact that there’s not only one pathological type of LUAD in NSCLC but also many other types such as LUSC and large cell lung cancer (LCLC), which can lead to inconsistent results in the analysis of NSCLC and LUAD. In addition, the majority of non-smoking LUAD patients included in this study are Asian females (Asian females: N = 9618/Overall: N = 12,327), indicating that non-smoking females in the Asians are more likely to have the risk of LUAD. Previous studies have also confirmed that LUAD is more common in females114,115. Patel et al. showed that among the never-smoking LC patients, the number of females exceeded that of males116. There was evidence confirmed that the common genetic variation of TERT-CLPTM1L was associated with the risk of LUAD in non-smoking Asian females45. This can be explained by the following assumptions: females are more likely to be exposed to second-hand smoking, and exposed to coal for cooking at home and hormone replacement therapy. All these reasons can lengthen telomere to avoid apoptosis and ultimately lead to cancer117.

For LUSC and SCLC, TERT polymorphisms (C vs. A) were not associated with the risk of them in all populations, both in smokers and in non-smokers. Therefore, smoking may not cause variation in TERT rs2736100 that increase the risk of LUSC and SCLC. The cause of variation in TERT rs2736100 leading to LUSC and SCLC remains to be further clarified.

Limitations of this study: This meta-analysis is based on the research reports of different ethnic groups and different types of LC, which will inevitably produce some heterogeneity; The methods of gene detection and genotyping used in all studies were different, and there will be some differences in data results; In terms of sample size, this study is sufficient in general. However, after subgroup analysis according to different LC types and ethnicity, the results signify that the sample size of SCLC and LUSC is still small. This will inevitably produce some false negative results for SCLC and LUSC; Although this study discussed the effects of smoking, environment, lifestyle and other factors on LC in details, from the perspective of smoking status, the sample size of smoking patients reported in these studies is still relatively small, especially those of SCLC and LUSC studies. Therefore, to some extent, the reliability of the results of the correlation between smoking and the risk of SCLC and LUSC will be affected; All the literatures included in this study are in English, not in the other languages.

Conclusion

In conclusion, the C allele of TERT rs2736100 is a risk factor for LC, NSCLC, and LUAD in different ethnic groups, and the risk is more common in Asians. Moreover, the C allele is a risk factor for LUSC and SCLC in Asians but not in Caucasians. Among the different types of LC, NSCLC patients have stronger risk correlation than SCLC patients, and LUAD patients have a stronger disease risk correlation than LUSC patients. Asians have a more common risk of various types of LC because they may have longer telomeres than Caucasians. The C allele is correlated with the risk of LC, NSCLC and LUAD in smokers and non-smokers, and the risk of LC in non-smokers of different ethnic groups is more common than that in smokers. In the Asians, non-smoking females are more at the risk of developing LUAD. Therefore, smoking does cause variation in TERT rs2736100 and increases the risk of most LC (NSCLC, LUAD), but it’s not the most critical factor.

LC (NSCLC, LUAD) is a multi-etiological disease caused by a combination of genetic, environmental and lifestyle factors. Of course, it’s necessary to integrate and analyze the data of studies with a larger sample size to draw more reliable conclusions in the future.