Introduction

Cyclin-dependent kinase 5 (CDK5), located in 7q36 chromosome, is a proline-directed serine/threonine kinase that was originally identified as a member of the CDK family, on the basis of sequence homology.1, 2 It is activated by p35Nck5a, which is autophosphorylated into a complex with CDK5.3 Although it is classified as a CDK, CDK5 does not regulate cell cycle.4 CDK5 is recognized as an essential molecule in the brain, where it regulates several neuronal activities, including cytoskeletal remodeling and synaptic transmission. There is presently a substantial amount of data to indicate that CDK5 plays important roles in extraneuronal cells during the process of myogenesis, lens differentiation, spermatogenesis, insulin secretion, apoptosis of ovarian cells and hematopoietic cell differentiation.3, 5, 6, 7, 8, 9, 10, 11

Furthermore, CDK5 has been suggested to play roles in the regulation of motility of prostate cancer cells,12 control of glioblastoma cell invasion,13 modulation of proliferation of medullary thyroid carcinoma cells14 and apoptotic control of leukemia cells.15 These potential roles of CDK5 activity in the carcinogenesis of each cell type seem to involve distinct target substrates.16

Frequent amplification and overexpression of the CDK5 gene along with multiple epidermal growth factor receptor (EGFR)-family-signaling pathway components, including SHC1, AKT1 and MYC, was recently discovered in non-small cell lung cancer (NSCLC).17 As it has been known earlier that CDK5/p35 activation by neuregulin subsequently activates PI3K and Akt through ErbB3 and ErbB2 receptors in cultured cortical neurons,18 the activation of CDK5, along with the other genes, may play a role in NSCLC tumorigenesis.

It has been reported that an individual's susceptibility to a disease such as lung cancer can vary depending on the presence of single-nucleotide polymorphisms (SNPs), an important class of genetic variation, in genes affecting cell proliferation or cell death.19, 20, 21 Moreover, SNPs in the coding regions or the regulatory regions of genes are more likely to cause functional differences than SNPs located elsewhere.19

Therefore, this study was undertaken to examine those possible relationship between three novel SNPs found in CDK5 promoter region and the risk of lung cancer occurrence in Korean lung cancer patients.

Materials and methods

Study patients

From August 2001 to February 2007, blood samples were collected from successive series of 809 participants, including 407 lung cancer cases and 402 controls. The lung cancer cases were recruited from patients at the Korea Genomic Research Center for Lung and Breast/Ovarian Cancer and Inha University Medical Center, and age- and gender- matched control individuals were selected from a pool of healthy volunteers at the Cardiovascular Genome Center, Genomic Research Center for Allergy and Respiratory Diseases and Keimyung University Dongsan Medical Center. All participants were given a structured questionnaire that elicited information on diet, smoking status, alcohol consumption, lifestyle and medical history. All study participants were ethnic Koreans, and their written consent and institutional review board approval for the study protocol were obtained.

Screening and genotyping of CDK5 promoter polymorphism

The genotype analysis was carried out by investigators blinded to all patient identifiers and information. Genomic DNA of all participants was isolated from blood samples using a Puregene blood DNA kit (Gentra, Minneapolis, MN, USA), according to the manufacturer’s instructions. To identify SNPs in the promoter of the CDK5 gene, the genomic DNA of 24 randomly selected samples from the lung cancer patients was amplified for the 5′ regulatory region (3 kb) from the translation initiation site of the CDK5 gene. PCR amplification was carried out in a PTC-225 Peltier Thermal cycler (MJ research Inc., Waltham, MA, USA) using AmpliTaqGold (Roche, Branchburg, NJ, USA). An Applied Biosystems 3700 DNA analyzer (Applied Biosystems, Foster City, CA, USA) was used for direct sequencing. Next, we focused on the three SNPs (−904, −270 and −238 locus) of the CDK5 gene using the PCR-based restriction fragment length polymorphism (PCR-RFLP) method for further analyses of the three CDK5 SNPs in 407 lung cancer cases and 402 controls. The designs of primers for RFLP were based on the GenBank sequence (NT_007914). The genotypes of the sample were amplified by PCR using 5′-CTAGGCAGAGCCATCACTTA-3′ (forward) and 5′-GATTTTAGTGGAGGGACACA-3′ (reverse) primer pairs for −904 locus, and 5′-CTGGGAGCACATAAAAGGAA-3′ (forward) and 5′-CTTCTGGGGCTTGTAGTCCT-3′ (reverse) primer pairs for −270 locus, and 5′-ACTTGAAGTCCAAGGTTTGC-3′ (forward) and 5′-CTTCTGGGGCTTGTAGTCCT-3′ (reverse) primer pairs for −238 locus. The PCR was carried out using 50 ng of genomic DNA, 1 × Taq buffer, 10 μM each of primers, 200 μM 2′-deoxynucleoside 3′-triphosphate and 0.5 μl (2.5 U μl–1) of Taq polymerase (Han Taq, Genenmed, Seoul, Korea) in a final volume of 20 μl. PCR amplification was carried out for 35 or 40 cycles with heat denaturing at a temperature of 94 °C for 1 min, annealing at 55 °C (−904 locus), 60 °C (−270 locus), 58 °C (−238 locus) for 30 s, primer extending at 72 °C for 30 s and a final step at 72 °C for 7 min. PCR products were verified by agarose gel electrophoresis. The PCR products were digested with restriction endonuclease to determine the genotype of each participant. DNA fragments were separated using 2–3% agarose gel and were detected by ethidium bromide staining. In the −904 locus, the A to G transition created an SfaNI (New England Biolabs, Beverly, MA, USA) restriction site. The PCR product containing the G allele was broken into two fragments, whereas the PCR product containing the A allele could not be cut by SfaNI (Figure 1b). In the same manner, the G to C transition in the −270 locus created a BsmFI (New England Biolabs) restriction site (Figure 1c), whereas the A to C transition in the −238 locus created a HpaII (New England Biolabs) restriction site (Figure 1d). More than 10% of the DNA samples were randomly selected for genotype confirmation using the SNaPshot (PE Applied Biosystems, Warrington, UK) ddNTP primer extension method as described earlier,22 and the results were in complete concordance.

Figure 1
figure 1

The cyclin-dependent kinase 5 (CDK5) gene promoter region and genotyping. In this schematic, the translation initiation site for the CDK5 protein is indicated as +1 (a). Polymorphism analysis of –904 (b), –270 (c) and –238 (d) loci in the CDK5 gene by PCR, followed by specific restriction enzyme digestion. The left panel of each figure represents schematic presentation of the enzyme restriction pattern generated by polymorphisms at each locus. As indicated on the top of each lane on the right panels, representative samples with polymorphisms at –904, –270 and –238 are shown. Lane M was loaded with appropriate DNA markers.

Statistical analysis

Allele frequencies, genotype frequencies and departures of the genotype distribution from Hardy–Weinberg equilibrium for each SNP were analyzed using the χ2-test or Fisher's exact test. Linkage disequilibrium (LD) was tested on pairwise combinations of SNPs using the absolute value of the standardized measure of LD, D, calculated using the Haploview program version 3.2. A P-value of <0.05 was considered statistically significant. Genotype-specific risks were estimated as odd ratios (ORs) with associated 95% confidence intervals (CIs) by unconditional logistic regression and adjusted for age, gender and smoking status. In the codominant model, the common homozygous genotype in the controls was defined as the reference group; the rare homozygous and heterozygous genotypes were variant genotypes, and their effects were individually estimated by comparison with the reference. Alternatively, the rare homozygotes and the heterozygotes were combined to form the reference group and compared with the common homozygote in the dominant model. In the recessive model, only the rare homozygotes were defined as the variant genotype, whereas the other two genotypes were combined as the reference. Statistical analyses were carried out using the SAS system, version 9.1 (SAS Institute, Cary, NC, USA).

Results

The study population consisted of 407 lung cancer patients and 402 controls (Table 1). All the clinical characteristics, except for smoking history, were not significantly different between the cases and controls. The initial screening of CDK5 promoter region from the genomic DNA of 24 Korean lung cancer patients identified three polymorphisms: −904 G>A, −270 C>G and −238 A>C, and the minor allele frequencies of the study population were 0.260, 0.313 and 0.298, respectively. Further genotyping on the lung cancer and control populations showed that the genotype distributions of the three polymorphisms among all study populations were in Hardy–Weinberg equilibrium. Furthermore, the three polymorphisms showed strong LD in both the controls and the cases (D 0.92, P<0.0001) (data not shown).

Table 1 Baseline characteristics of study participants

The distribution of the selected CDK5 genotypes and the estimated associations between the CDK5 functional SNPs and lung cancer risk are shown in Table 2. Among the distributions of the genotypes between the overall lung cancer cases and the controls, the −904 G>A genotype was shown to affect lung cancer risk. Patients with the A allele at the site had a significantly higher risk of lung cancer than those with only the G allele (dominant model; adjusted OR=1.53, 95% CI=1.02–2.32). Genotypes −270 C>G and −238 A>C were also shown to be somewhat related to the increase in risk of lung cancer; however, their CIs were too wide to be statistically significant (dominant model; −270 C>G OR=1.18, 95% CI=0.86–1.61; −237 A>C OR=1.20, 95% CI=0.88–1.65). In the analysis stratified by gender, smoking and drinking history, the effect of A genotype of −904 G>A on the lung cancer risk was found in females (OR=2.02, 95% CI=1.12–3.64) and non-smokers (OR=1.78, 95% CI=1.06−3.00) in the dominant model.

Table 2 Distribution of SNPs and their association with lung cancer risk in overall population and subpopulation stratified by gender, smoking and drinking status

Subsequent haplotype analysis of the three CDK5 SNP sites (−904 G>A, −270 C>G, −238 A>C) showed that the A–G–C haplotype was associated with a higher overall risk of lung cancer (OR=1.59, 95% CI=1.16–2.18) in the dominant model (Table 3). Further analysis stratified by gender, smoking and drinking history showed that the individuals carrying the A–G–C haplotype had significantly higher risks of lung cancer in subgroups, including females, non-smokers and non-drinkers (female: OR=2.09, 95% CI 1.16–3.77; non-smoker: OR=1.94, 95% CI=1.15–3.26; non-drinker: OR=2.48, 95% CI=1.29–4.78).

Table 3 Distribution of haplotypes of selected SNPs of CDK5 and their association with lung cancer risk in overall population and subpopulation stratified by gender, smoking and drinking status

Discussion

It is quite possible that DNA sequence variations in the CDK5, a gene that has recently been studied for its association with non-neuronal cell differentiation and apoptosis in many areas, may lead to alteration in the activity of CDK5, which can cause individual differences in lung cancer susceptibility. Indeed, a recent study on CDK5-mediated regulation of the PIKE-A-Akt pathway suggests an important clue to the association of CDK5 with tumorigenesis, as the accumulation of phosphorylated PIKE-A (PI3-kinase enhancer), a recently identified GTP-binding protein shown to regulate the PI3K–Akt pathway and activation of Akt in the nucleus of glioblastoma cell, is a CDK5-dependent activity,23 suggesting possible association of CDK5 with non-neuronal cell survival. This was further supported by another study that showed the overexpression of CDK5, along with the other downstream components of the EGFR-family signaling pathway, such as AKT1 and SHC1, as a direct result of gene amplification, specifically in lung cancer cells,17 signifying that the alteration of the CDK5 gene may play a causal role in the development of lung cancer.

In this study, we evaluated the associations between three newly found SNPs (−904 G>A, −270 C>G and −238 A>C) in the promoter region of the CDK5 gene and the risk for lung cancer. Of the three SNPs, −904 G>A showed a possibility to play an important role in carcinogenesis (overall OR=1.53, 95% CI=1.11–2.09), whereas the other two SNPs (−270 C>G and −238 A>C) showed some increase in the risk of lung cancer, although not adequately proven to be statistically significant.

Many studies showed that analysis of haplotypes has a greater power to detect the association with disease than an individual polymorphism. Moreover, haplotype analysis offers the advantages of not assuming that any of the genotyped polymorphisms is functional, and it allows for the possibility of an ungenotyped functional variant to be in LD with the genotyped polymorphisms.24, 25, 26, 27 Therefore, we conducted a subsequent analysis on CDK5 haplotypes consisting of −904 G>A, −270 C>G and −238 A>C polymorphisms, which are in strong LD. In the dominant haplotype analysis, the A–G–C haplotype of the three polymorphic sites was shown to be related to the higher risk of lung cancer (overall OR=1.59, 95% CI=1.16–2.18). Notably, our stratified analyses of CDK5 showed that, when having the A–G–C haplotype, the risk of lung cancer increased significantly in females, non-smokers and non-drinkers (female: OR=2.09, 95% CI=1.16−3.77; non-smoker: OR=1.94, 95% CI=1.15−3.26; non-drinker: OR=2.48, 95% CI=1.29–4.78). To examine whether the A–G–C haplotype influences promoter activity of the CKD5 gene, the CDK5 promoter sequence connected to the luciferase reporter gene was transfected into NSCLC cell lines. However, we found no significant correlation (data not shown).

Tobacco smoking has well been established to play a critical role in the development of lung cancer; however, only a fraction of the smokers actually develops cancer. Moreover, the cause of increasing incidence of lung cancer in never-smokers has never been clearly explained.

This observation suggests that individual genetic variations may determine susceptibility to lung cancer. The present result on our study that shows a close association between lung cancer in Korean population and the polymorphisms in the promoter region of CDK5 further validates this claim. The GWA database on various ethnicities has been reported to have no frequency data on −270 C>G polymorphism, whereas data on the −904 G>A and −238 A>C polymorphisms have been reported (HmapEU, 0.25 and 0.21; HmapCN, 0.26 and 0.26; HmapJP, 0.33 and 0.30; HmapNG, 0.43 and 0.20, respectively).

To the best of our knowledge, this is the first study to delineate the association between CDK5 promoter polymorphisms and the risk of lung cancer, and to show that one of the CDK5 polymorphisms, −904 G>A, and the A–G–C haplotype (−904 A/−270 G/−238 C) cause overall increase in the susceptibility of lung cancer. Moreover, subsequent stratified analyses showed statistically significant increases in the risk of lung cancer in females, non-smokers and non-drinkers who have the A–G–C haplotype of the three polymorphic sites. Therefore, the present result suggests an important association between the genetic polymorphisms of CDK5 and lung cancer development, and the necessity for further functional studies on the relationship between the CDK5 gene and lung cancer.