Identification of a Potential Regulatory Variant for Colorectal Cancer Risk Mapping to 3p21.31 in Chinese Population

Ke, Juntao; Lou, Jiao; Zhong, Rong; Chen, Xueqin; Li, Jiaoyuan; Liu, Cheng; Gong, Yajie; Yang, Yang; Zhu, Ying; Zhang, Yi; Chang, Jiang; Gong, Jing

doi:10.1038/srep25194

Download PDF

Article
Open access
Published: 28 April 2016

Identification of a Potential Regulatory Variant for Colorectal Cancer Risk Mapping to 3p21.31 in Chinese Population

Juntao Ke¹,
Jiao Lou¹,
Rong Zhong¹,
Xueqin Chen¹,
Jiaoyuan Li¹,
Cheng Liu¹,
Yajie Gong¹,
Yang Yang¹,
Ying Zhu¹,
Yi Zhang¹,
Jiang Chang¹ &
…
Jing Gong¹

Scientific Reports volume 6, Article number: 25194 (2016) Cite this article

1260 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Genome-wide association studies (GWAS) have established chromosome 3p21.31 as a susceptibility locus for colorectal cancer (CRC) that lacks replication and exploration in the Chinese population. We searched potentially functional single nucleotide polymorphisms (SNPs) in the linkage disequilibrium (LD) block of 3p21.31 with chromatin immunoprecipitation-sequencing (ChIP-seq) data of histone modification and tested their association with CRC via a case-control study involving 767 cases and 1397 controls in stage 1 and 528 cases and 678 controls in stage 2. In addition to the tag SNP rs8180040 (odds ratio (OR) = 0.875, 95% confidence interval (95% CI) = 0.793−0.966, P = 0.008, P-FDR (false discovery rate) = 0.040), rs1076394 presented consistently significant associations with CRC risk at both stages with OR = 0.850 (95% CI = 0.771−0.938, P = 0.001, P-FDR = 0.005) under the additive model in combined analyses. Supported by the analyses of data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO), it was suggested that rs1076394 served as an expression Quantitative Trait Loci (eQTL) for gene CCDC12 and NME6, while NME6’s expression was obviously higher in CRC tissues. Using biofeature information such as ChIP-seq and RNA sequencing (RNA-seq) data might help researchers to interpret GWAS results and locate functional variants for diseases in the post-GWAS era.

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Saori Sakaue, Kathryn Weinand, … Soumya Raychaudhuri

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Ting-Hsuan Sun, Chia-Chun Wang, … Kai-Cheng Hsu

Introduction

Colorectal cancer (CRC) was the third most commonly diagnosed cancer in males and the second in females worldwide in 2012¹, while it was the fifth in males and the third in females, with an estimated 310,244 new cases and 149,722 deaths in China in 2011². Several environmental factors, including diet, physical inactivity, obesity, cigarette smoking and alcohol consumption, were indicated as being involved in the occurrence and development of CRC^3,4,5. On the other hand, it has been well established that genetic factors play an important role in CRC etiology^6,7,8. Revolutionary genome-wide association studies (GWAS) and subsequent fine mapping researches have positioned over 30 susceptibility loci of CRC in Europeans^{9,10,11,12,13,14,15,16,17,18,19} and Asians^20,21,22,23, however most variants have been found to be only tag single nucleotide polymorphism (SNPs) residing in intergenic and intronic regions without a clear function.

A major challenge in the post-GWAS era is to identify the specific genetic variants that accounts for phenotype based on their functional biology²⁴. Recent reports showed that regulatory genome elements can greatly help to identify these causal SNPs, which could exert an effect on gene expression by modulating the activity of promoters, enhancers, insulators and silencers^25,26,27. Today, regulatory genomic regions are usually characterized by various histone modifications in the flanking nucleosomes^28,29. For example, enhancers are typically marked by H3K4me1 (histone H3 monomethylated at lysine 4) and promoters by H3K4me3 (histone H3 trimethylated at lysine 4) and both are regarded as active when additionally marked by H3K27ac (histone H3 acetylated at lysine 27)^30,31,32,33. And chromatin immunoprecipitation-sequencing (ChIP-seq) of histone modifications has been widely used to map genome-wide enhancers and promoters^34,35,36,37.

Fernandez-Rozadilla et al. mapped 3p21.31 as a CRC-relevant genomic locus in a Spanish population with 2362 cases and 2517 controls³⁸, with a pooled P = 2.163E-06 (odds ratio (OR) = 0.784, 95% confidence interval (95% CI) = 0.709–0.867) for the tag SNP rs8180040. Although it didn’t reach the common GWAS significance threshold of 10⁻⁸ and wasn’t included in the larger-scale genetic study by Zhang et al. in East Asians²³, we considered this locus containing abundant genes as an attractive region to be researched in the Chinese population. Moreover, the strongest risk polymorphism rs8180040 was not in any known transcribed or regulatory sequences and not likely the causal SNP, which meant the real functional SNPs remained mined in 3p21.31.

Using epigenomic data obtained from relevant cell types represents a powerful approach to identifying functional SNPs in post-GWAS genetic researches^39,40,41,42. In this study, we analyzed ChIP-seq data of histone modifications from CRC cell lines, searched common variants within the regulatory elements of the risk-associated locus 3p21.31, investigated their associations with CRC risk via a two-stage case-control study in the Chinese population and tried to explain the underlying function by analyzing the data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). To the best of our knowledge, this is the first replication and exploration study on 3p21.31 in East Asians.

Results

Selection of Candidate SNPs

The LD block of GWAS susceptibility loci 3p21.31 was chromosome 3: 47035735-47452118. After a bioinformatics analysis (details in Methods), four common polymorphisms, rs2276854, rs807936, rs1076394 and rs807937, situated within the peaks of histone modification ChIP-Seq data generated from HCT116 or Caco2, were found in the above loci. These four SNPs were in high LD with each other (r² > 0.9) according to the 1000 Genomes Project Phase 3 data of the CHB population. Among them, we saw rs1076394 as the most credibly functional variant due to its location in the overlapping region of H3k4me1 and H3k27ac ChIP-seq peaks (Table 1). The coexistence of these two histone modifications is broadly considered as a mark of active enhancer, while the appearance of either one of them is not. For the replication and exploration of 3p21.31, we genotyped the tag SNP rs8180040 and the potential regulatory SNP rs1076394 in Stage 1. We further validated the positive SNPs in another independent sample of Stage 2.

Table 1 Candidate regulatory SNPs in chromosome locus 3p21.31.

Full size table

Population Characteristics

Descriptive characteristics of the subjects in this study are detailed in Table 2. In both stages, no significant differences were found between patients and controls in the distribution of sex and age. As expected, significantly more smokers were presented in the cases than in the controls, given that cigarette smoking was a well-established risk factor for CRC³. And we did not see the same distribution in drinking status.

Table 2 The characteristics of the study population.

Full size table

Association Analysis

Both investigated polymorphisms, the presumably regulatory SNP rs1076394 and tag SNP rs8180040, were significantly associated with CRC risk in both stages and the combined analysis (Table 3).

Table 3 Association between individual SNP and colorectal cancer risk.

Full size table

In Stage 1, under a multivariable logistic regression model adjusted for gender, age, smoking and drinking status, individuals with AA genotype of rs1076394 had a significantly reduced risk of CRC (OR = 0.723, 95% CI = 0.559–0.936, P = 0.014, P-FDR (false discovery rate) = 0.035) compared to those with GG homozygotes. A dominant model was used to improve statistical power by combining the GA with AA into an A-carrier group (GA plus AA) and it showed that the allele A carriers had an obviously protective effect on CRC susceptibility (OR = 0.802, 95% CI = 0.669–0.963, P = 0.018, P-FDR = 0.03). Likewise, a positive outcome was found in the additive models, with a per-A-allele OR of 0.847 (95% CI = 0.748–0.960, P = 0.009, P-FDR = 0.045). As for rs8180040, we successfully replicated this GWAS tagSNP under the dominant model (OR = 0.811, 95% CI = 0.677–0.970, P = 0.022, P-FDR = 0.825) and additive model (OR = 0.861, 95% CI = 0.758–0.979, P = 0.022, P-FDR = 0.825) with nominal significance, but failed after FDR corrections.

Two promising variants were both further genotyped in the validation Stage 2. In agreement with the Stage 1, nominal significant associations were still exhibited between CRC risk and rs1076394 (dominant model: OR = 0.745, 95% CI = 0.584–0.951, P = 0.018, P-FDR = 0.09; additive model: OR = 0.833, 95% CI = 0.708–0.980, P = 0.027, P-FDR = 0.068), or rs8180040 (dominant model: OR = 0.747, 95% CI = 0.586–0.952, P = 0.018, P-FDR = 0.09; additive model: OR = 0.837, 95% CI = 0.713–0.983, P = 0.030, P-FDR = 0.075). When we combined two stages, positive results were still observed after FDR corrections (Table 3). And two polymorphisms were in high LD (r² = 0.895) with each other in our total samples, similar to the data of 1000 Genomes Phase 3 of CHB (r² = 1.000).

The results of interaction analysis between the promising SNP rs1076394 and smoking were detailed in Table S2, where no significant interactions were observed under either the multiplicative or the additive model in the two stages and the combined study.

TCGA and GEO Data Analyses

We downloaded the data of gene expression, germline genotypes, CpG methylation and somatic copy number for COAD (colon adenocarcinoma) and READ (rectum adenocarcinoma) from the TCGA portal (http://cancergenome.nih.gov/) up to October 2014. Then, we performed a modified eQTL (expression Quantitative Trait Loci) analysis of the correlation between rs8180040 (rs1076394 wasn’t included in the Affymetrix GenomeWide SNP 6.0 Array of genotype profiles) and expression of genes within 1 Mb flanking regions, with the effects of somatic copy number and methylation being adjusted⁴³. As shown in Table 4 and Fig. 1, rs8180040 was identified as an eQTL for genes NME6 (nucleoside diphosphate kinase 6, R² = 0.019, P = 0.029, P-FDR = 0.404) and CCDC12 (coiled-coil domain-containing protein 12, R² = 0.031, P = 0.005, P-FDR = 0.139), respectively. As the presumed regulatory variant rs1076394 was in complete LD with rs8180040 in the 1000 Genomes database, rs1076394 was also indicated as being closely related to the expression of CCDC12 and NME6. In addition, we compared the two genes’ expression between cancer and normal tissue and found a significant difference for NME6 (P = 0.029; CRC tissues: 229.5 ± 3.2 RPKM (reads per kilobases per million reads), peritumoral tissue: 210.5 ± 5.4 RPKM), but not for CCDC12 (P = 0.258; CRC tissues: 707.7 ± 15.3 RPKM, peritumoral tissue: 661.6 ± 16.49 RPKM).

Table 4 Expression correlation between rs8180040 and flanking 1 Mb genes.

Full size table

In GEO database, the datasets of expression profiles in Asian samples (Dataset Records: GDS2609 and GDS4718) were provided, while TCGA data was mostly from Caucasian samples. We compared the NME6 expression between cancerous and normal colon tissues and found the same higher expression in cancer tissues from the Chinese population (P = 0.017; CRC tissues: 585.8 ± 29.3 RPKM, normal colon tissue: 482.2 ± 25.6 RPKM) and the Japanese population (P = 0.013; CRC tissues: 973.6 ± 79.9 RPKM, normal colon tissue: 771.4 ± 31.0 RPKM). As for CCDC12, we could not observe significant differences between cancer and normal tissues in two datasets (Chinese samples: P = 0.583, CRC tissues: 2219.9 ± 147.9 RPKM, normal colon tissue: 2118.0 ± 91.1 RPKM; Japanese samples: P = 0.062, CRC tissues: 22137.9 ± 971.7 RPKM, normal colon tissue: 24648.9 ± 870.9 RPKM).

Discussion

The identification of tag SNPs through GWAS is the important first step in understanding the relationship between genomic variation and CRC risk. But now, the foremost goal in the post-GWAS era is to shed light on the causal SNPs and their functional consequences, progressing from indirectly statistical to directly biological associations between genetic variation and disease. Accumulating evidence showed that the most likely mechanistic basis that links those noncoding genetic variants to phenotype and disease is being regulatory³⁴.

In our study, by overlapping the LD boundaries of the locus 3p21.31 and regulatory regions predicted by CRC-specific histone modifications, we screened out the most promisingly functional rs1076394 among the four original polymorphisms in high LD. By conducting association studies in two independent Chinese populations containing 1327 cases and 2075 controls in total, we replicated the significances of tag SNP rs8180040 and found a significant protective effect for the potentially regulatory variant rs1076394 that might serve as an eQTL for the genes CCDC12 and NME6, while NME6 presented significantly higher expression in cancer tissues.

The findings led us to assume that rs1076394 might influence CRC risk by altering the activity of an enhancer that controlled NME6 expression. The potentially functional variant rs1076394 lay within a region of the genome exhibiting chromatin modifications H3k4me1 and H3k27ac in a CRC cell line HCT116, consistent with characteristics of an active enhancer across diverse tissues^44,45. It was situated in the first intron of gene KIF9 (kinesin family member 9), which was involved in mitotic progression by maintaining correct spindle length⁴⁶ and the degradation of the matrix by regulating macrophage podosomes⁴⁷. However, according to our calculation of TCGA data, rs1076394 was related to the expression of CCDC12 and NME6, but not related to KIF9. We saw the SNP as a potential eQTL for CCDC12 and NME6, while we could not rule out the possibility that it was actually in LD with these two genes. The SNP rs1076394 was approximately 300kb upstream of CCDC12, which participated in promoting early erythroid differentiation⁴⁸ and over 1000 kb upstream of NME6, which might play a role in the regulation of cell growth and cell cycle progression^49,50,51. Either CCDC12 or NME6 was located in 3p21.3. But, NME6 was more suggested to be the actual contributing gene in this locus due to its significantly higher expression in cancer samples of both Caucasian and Asian populations. At the beginning, NME6 was discovered as a gene encoding a nucleoside diphosphate kinase that suppressed p53-induced apoptosis⁵⁰. In a shRNA functional screen, Nme6 was reported to be crucial for the renewal of embryonic stem cells (ESCs). And ESCs are characterized by immortalization ability, pluripotency and oncogenicity⁵². More recently, enriched somatic mutations in NME6 was suggested to be associated with the deregulation of pyrimidine metabolism and the promotion of malignant progression in human melanoma⁵³. Accordingly, NME6 may act as an oncogene that still needs more investigations.

The application of biofeature information such as ChIP-seq and RNA-seq data has represented an effective approach to identifying functionally regulatory SNPs and different databases including UCSC, Encode, GEO and TCGA have provided easy access to massive amounts of relevant data. Incorporating epigenetic and expression analyses into traditional molecular epidemiology could assist in the interpretation of GWAS results and the discovery of functional variants for diseases in post-GWAS studies. Using a similar strategy to other CRC-associated loci should deepen our understanding of CRC risk.

Nevertheless, several limitations should be noted here. First of all, due to the lack of relevant functional experiments, biological reality beneath the statistically significant association is uncertain. In the analysis of TCGA data, we have not restricted it to Chinese samples, when its composition is mostly Caucasian samples. It may not reflect the exact outcome of the Chinese population that we researched on. Second, the strategy of retrieving candidate polymorphisms depended on the prediction from ChIP-seq data of relevant cell lines, which was not rigorous enough to define exact regulatory elements and all the functional variants inside. Focusing on common SNPs, we could not rule out the possibility that sets of rare variants or haplotypes in LD with the tag SNP are actually causal in this locus. Third, insufficient epidemiological and clinical information prevented us from further investigating the interactions between gene and environment.

In summary, we discovered a probably regulatory SNP that was associated with CRC risk in the Chinese population. Researches on 3p21.31 and other CRC susceptibility loci with greater sample sizes and follow-up functional analyses are warranted to elaborate the biological mechanism of genetic etiology.

Methods

Study Participants

A two-stage case-control study was applied to evaluate the association between candidate variations and the risk of CRC. The discovery stage (Stage 1) consisted of 767 cases and 1397 controls, which were recruited from Tongji Hospital of Huazhong University of Science and Technology (HUST) between 2008 and 2012. The validation stage (Stage 2) involved 528 cases and 678 controls enrolled from 2013 to 2015 at the same hospital. All subjects were unrelated ethnic Han Chinese in both stages. Patients with histopathologically confirmed CRC and without previous chemotherapy or radiotherapy, were included without restriction to gender and age,. In the same time period, cancer-free controls were recruited form participants in physical examination programs of the same hospital and were adequately matched to cases by gender and age (±5 years). Definitions of smoking and drinking status were the same as in a previous study by our group^54,55,56. At recruitment of each subject, a written informed consent was obtained and 2 millimeters peripheral venous blood was collected. This study was approved by the ethnics committee of Tongji Medical College of Huazhong University of Science and Technology and the methods were carried out in accordance with the approved guidelines.

SNP Selection and Genotyping

Candidate SNPs were common genetic variants (minor allele frequency, MAF > 0.05) that located in the putative regulatory elements of the 3p21.31 locus. First, we applied the software HaploView to calculate the linkage disequilibrium (LD) block of 3p21.31 with the criterion of r² > 0.8, by inputting the Chinese Han Beijing (CHB) genotype information of 500 kb flanking the tagSNP rs8180040. This LD block was defined as the CRC susceptibility locus. Second, we downloaded ChIP-seq data regarding histone modification from two CRC cell lines, HCT116 and Caco2, form the UCSC database integrated with Encode data (S1 Table). And the extents of their signal peaks were considered as putative regulatory elements. Third, based on the CHB MAF data in dbSNP database, we only selected the common polymorphisms (MAF > 0.05) that situated within the overlapping parts between the LD block and peaks. Finally, four polymorphisms in high LD with each other (r² > 0.9) survived after this step-wise analysis. Among them, rs1076394 was chosen as the most potentially functional variant for the genotyping assays, because of its more indicative location in an active enhancer than others’. At the same time, we also tried to replicate the tag SNP rs8180040 in our sample. In Stage 2, the nominal significant SNPs of Stage 1 were further validated. SNPs of both stages were genotyped by a TaqMan real-time polymerase chain reaction (PCR) assay (Applied Biosystems, Foster city, CA). Quality control was preformed by including 5% duplicate samples in blinded fashion, with a concordance rate of 100%.

Statistical Analysis

The differences in the distributions of gender, age, smoking, drinking status and genotypes between cases and controls were estimated by a χ² test or t-test, where appropriate. The Hardy-Weinberg equilibrium (HWE) in controls was evaluated with a goodness-of-fit χ² test. The odds ratio (ORs) and corresponding 95% confidence intervals (95% CIs) were used to measure the associations between SNPs and CRC susceptibility And they were calculated after adjusting for gender, age, smoking and drinking status under a multivariate logistic regression model. For multiple comparison corrections, a simple procedure (Benjamini and Hochberg) was performed in two stages and combined study to control the false discovery rate⁵⁷. The LD of the candidate SNPs was analyzed using HaploView v4.2⁵⁸. With regard to TCGA and GEO data, the expression differences among three genotypes (TT, TA and AA) of rs8180040 were measured under a linear regression model adjusting the effects of somatic copy number and CpG methylation⁴³ and the differences between cancer and normal samples were measured by t-test. The gene-environment interactions were evaluated by a pair-wise analysis under multiplicative⁵⁹ and additive interaction models⁶⁰. All the above statistical analyses were conducted using SPSS Software v20.0 (SPSS, Chicago, Illinois, USA), with the exception that the P values of additive interaction were assessed using Stata v11.0 (Stata Corporation, College Station, TX). P values in this study were two-sided with a significance criterion of P < 0.05.

Additional Information

How to cite this article: Ke, J. et al. Identification of a Potential Regulatory Variant for Colorectal Cancer Risk Mapping to 3p21.31 in Chinese Population. Sci. Rep. 6, 25194; doi: 10.1038/srep25194 (2016).

References

Torre, L. A. et al. Global cancer statistics, 2012. CA: A Cancer Journal for Clinicians 65, 87–108, doi: 10.3322/caac.21262 (2015).
Article Google Scholar
Chen, W., Zheng, R., Zeng, H., Zhang, S. & He, J. Annual report on status of cancer in China, 2011. Chin J Cancer Res 27, 2–12, doi: 10.3978/j.issn.1000-9604.2015.01.06 (2015).
Article PubMed PubMed Central Google Scholar
Haggar, F. & Boushey, R. Colorectal Cancer Epidemiology: Incidence, Mortality, Survival and Risk Factors. Clinics in Colon and Rectal Surgery 22, 191–197, doi: 10.1055/s-0029-1242458 (2009).
Article PubMed PubMed Central Google Scholar
Botteri, E. et al. Smoking and colorectal cancer: a meta-analysis. JAMA 300, 2765–2778, doi: 10.1001/jama.2008.839 (2008).
Article CAS PubMed Google Scholar
Zhu, B., Zou, L., Qi, L., Zhong, R. & Miao, X. Allium Vegetables and Garlic Supplements Do Not Reduce Risk of Colorectal Cancer, Based on Meta-analysis of Prospective Studies. Clinical Gastroenterology and Hepatology 12, 1991–2001. e1994, doi: 10.1016/j.cgh.2014.03.019 (2014).
Article CAS Google Scholar
Lichtenstein, P. et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark and Finland. N Engl J Med 343, 78–85, doi: 10.1056/NEJM200007133430201 (2000).
Article CAS PubMed Google Scholar
de la Chapelle, A. Genetic predisposition to colorectal cancer. Nat Rev Cancer 4, 769–780, doi: 10.1038/nrc1453 (2004).
Article CAS PubMed Google Scholar
Gong, J. et al. A functional polymorphism in lnc-LAMC2-1:1 confers risk of colorectal cancer by affecting miRNA binding. Carcinogenesis, doi: 10.1093/carcin/bgw024 (2016).
Zanke, B. W. et al. Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24. Nat Genet 39, 989–994, doi: 10.1038/ng2089 (2007).
Article CAS PubMed Google Scholar
Tomlinson, I. et al. A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21. Nat Genet 39, 984–988, doi: 10.1038/ng2085 (2007).
Article CAS PubMed Google Scholar
Broderick, P. et al. A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk. Nat Genet 39, 1315–1317, doi: 10.1038/ng.2007.18 (2007).
Article CAS PubMed Google Scholar
Jaeger, E. et al. Common genetic variants at the CRAC1 (HMPS) locus on chromosome 15q13.3 influence colorectal cancer risk. Nat Genet 40, 26–28, doi: 10.1038/ng.2007.41 (2008).
Article CAS PubMed Google Scholar
Tenesa, A. et al. Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21. Nat Genet 40, 631–637, doi: 10.1038/ng.133 (2008).
Article CAS PubMed PubMed Central Google Scholar
Tomlinson, I. P. et al. A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3. Nat Genet 40, 623–630, doi: 10.1038/ng.111 (2008).
Article CAS PubMed Google Scholar
Houlston, R. S. et al. Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer. Nat Genet 40, 1426–1435, doi: 10.1038/ng.262 (2008).
Article CAS PubMed Google Scholar
Houlston, R. S. et al. Meta-analysis of three genome-wide association studies identifies susceptibility loci for colorectal cancer at 1q41, 3q26.2, 12q13.13 and 20q13.33. Nat Genet 42, 973–977, doi: 10.1038/ng.670 (2010).
Article CAS PubMed PubMed Central Google Scholar
Tomlinson, I. P. et al. Multiple common susceptibility variants near BMP pathway loci GREM1, BMP4 and BMP2 explain part of the missing heritability of colorectal cancer. PLoS Genet 7, e1002105, doi: 10.1371/journal.pgen.1002105 (2011).
Article CAS PubMed PubMed Central Google Scholar
Dunlop, M. G. et al. Common variation near CDKN1A, POLD3 and SHROOM2 influences colorectal cancer risk. Nat Genet 44, 770–776, doi: 10.1038/ng.2293 (2012).
Article CAS PubMed PubMed Central Google Scholar
Peters, U. et al. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-Wide Meta-analysis. Gastroenterology 144, 799–807 (2013).
Article CAS PubMed Google Scholar
Jia, W. H. et al. Genome-wide association analyses in East Asians identify new susceptibility loci for colorectal cancer. Nat Genet 45, 191–196, doi: 10.1038/ng.2505 (2013).
Article CAS PubMed Google Scholar
Zhang, B. et al. Genome-wide association study identifies a new SMAD7 risk variant associated with colorectal cancer risk in East Asians. Int J Cancer 135, 948–955, doi: 10.1002/ijc.28733 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cui, R. et al. Common variant in 6q26-q27 is associated with distal colon cancer in an Asian population. Gut 60, 799–805, doi: 10.1136/gut.2010.215947 (2011).
Article CAS PubMed Google Scholar
Zhang, B. et al. Large-scale genetic study in East Asians identifies six new loci associated with colorectal cancer risk. Nat Genet 46, 533–542, doi: 10.1038/ng.2985 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhang, X., Bailey, S. D. & Lupien, M. Laying a solid foundation for Manhattan–‘setting the functional basis for the post-GWAS era’. Trends Genet 30, 140–149, doi: 10.1016/j.tig.2014.02.006 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schaub, M. A., Boyle, A. P., Kundaje, A., Batzoglou, S. & Snyder, M. Linking disease associations with regulatory information in the human genome. Genome Research 22, 1748–1759, doi: 10.1101/gr.136127.111 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hardison, R. C. Genome-wide Epigenetic Data Facilitate Understanding of Disease Susceptibility Association Studies. Journal of Biological Chemistry 287, 30932–30940, doi: 10.1074/jbc.R112.352427 (2012).
Article CAS PubMed PubMed Central Google Scholar
Farnham, P. J. Thematic minireview series on results from the ENCODE Project: Integrative global analyses of regulatory regions in the human genome. J Biol Chem 287, 30885–30887, doi: 10.1074/jbc.R112.365940 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kouzarides, T. Chromatin Modifications and Their Function. Cell 128, 693–705, doi: 10.1016/j.cell.2007.02.005 (2007).
Article CAS PubMed Google Scholar
Bell, O., Tiwari, V. K., Thomä, N. H. & Schübeler, D. Determinants and dynamics of genome accessibility. Nature Reviews Genetics 12, 554–564, doi: 10.1038/nrg3017 (2011).
Article CAS PubMed Google Scholar
Bonn, S. et al. Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development. Nat Genet 44, 148–156, doi: 10.1038/ng.1064 (2012).
Article CAS MathSciNet PubMed Google Scholar
Arnold, C. D. et al. Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science 339, 1074–1077, doi: 10.1126/science.1232542 (2013).
Article CAS ADS PubMed Google Scholar
Rada-Iglesias, A. et al. A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279–283, doi: 10.1038/nature09692 (2011).
Article CAS ADS PubMed Google Scholar
Shlyueva, D., Stampfel, G. & Stark, A. Transcriptional enhancers: from properties to genome-wide predictions. Nat Rev Genet 15, 272–286, doi: 10.1038/nrg3682 (2014).
Article CAS PubMed Google Scholar
An integrated encyclopedia of DNA elements in the human genome. Nature489, 57–74, doi: 10.1038/nature11247 (2012).
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49, doi: 10.1038/nature09906 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Kharchenko, P. V. et al. Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature 471, 480–485, doi: 10.1038/nature09725 (2011).
Article CAS ADS PubMed Google Scholar
Wamstad, J. A. et al. Dynamic and coordinated epigenetic regulation of developmental transitions in the cardiac lineage. Cell 151, 206–220, doi: 10.1016/j.cell.2012.07.035 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fernandez-Rozadilla, C. et al. A colorectal cancer genome-wide association study in a Spanish cohort identifies two variants associated with colorectal cancer risk at 1p33 and 8p12. BMC Genomics 14, 55, doi: 10.1186/1471-2164-14-55 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yao, L., Tak, Y. G., Berman, B. P. & Farnham, P. J. Functional annotation of colon cancer risk SNPs. Nat Commun 5, 5114, doi: 10.1038/ncomms6114 (2014).
Article CAS ADS PubMed Google Scholar
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195, doi: 10.1126/science.1222794 (2012).
Article CAS ADS PubMed PubMed Central Google Scholar
Biancolella, M. et al. Identification and characterization of functional risk variants for colorectal cancer mapping to chromosome 11q23.1. Human Molecular Genetics 23, 2198–2209, doi: 10.1093/hmg/ddt584 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ke, J. et al. Identification of a Potential Regulatory Variant for Colorectal Cancer Risk Mapping to Chromosome 5q31.1: A Post-GWAS Study. Plos ONE 10, e0138478, doi: 10.1371/journal.pone.0138478 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, Q. et al. Integrative eQTL-based analyses reveal the biology of breast cancer risk loci. Cell 152, 633–641, doi: 10.1016/j.cell.2012.12.034 (2013).
Article CAS PubMed PubMed Central Google Scholar
Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proceedings of the National Academy of Sciences 107, 21931–21936, doi: 10.1073/pnas.1016071107 (2010).
Article ADS Google Scholar
Pasquali, L. et al. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat Genet 46, 136–143, doi: 10.1038/ng.2870 (2014).
Article CAS PubMed PubMed Central Google Scholar
Andrieu, G., Quaranta, M., Leprince, C. & Hatzoglou, A. The GTPase Gem and its partner Kif9 are required for chromosome alignment, spindle length control and mitotic progression. FASEB J 26, 5025–5034, doi: 10.1096/fj.12-209460 (2012).
Article CAS PubMed Google Scholar
Cornfine, S. et al. The kinesin KIF9 and reggie/flotillin proteins regulate matrix degradation by macrophage podosomes. Mol Biol Cell 22, 202–215, doi: 10.1091/mbc.E10-05-0394 (2011).
Article CAS PubMed PubMed Central Google Scholar
Fan, C. et al. Isolation of siRNA target by biotinylated siRNA reveals that human CCDC12 promotes early erythroid differentiation. Leuk Res 36, 779–783, doi: 10.1016/j.leukres.2011.12.017 (2012).
Article CAS PubMed Google Scholar
Mehus, J. G. & Deloukas, P. & Lambeth, D. O. NME6: a new member of the nm23/nucleoside diphosphate kinase gene family located on human chromosome 3p21.3. Hum Genet 104, 454–459 (1999).
Article CAS PubMed Google Scholar
Tsuiki, H. et al. A novel human nucleoside diphosphate (NDP) kinase, Nm23-H6, localizes in mitochondria and affects cytokinesis. J Cell Biochem 76, 254–269 (1999).
Article CAS PubMed Google Scholar
Desvignes, T., Pontarotti, P., Fauvel, C. & Bobe, J. Nme protein family evolutionary history, a vertebrate perspective. BMC Evol Biol 9, 256, doi: 10.1186/1471-2148-9-256 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wang, C. H. et al. A shRNA functional screen reveals Nme6 and Nme7 are crucial for embryonic stem cell renewal. Stem Cells 30, 2199–2211, doi: 10.1002/stem.1203 (2012).
Article CAS PubMed Google Scholar
Edwards, L., Gupta, R. & Filipp, F. V. Hypermutation of DPYD Deregulates Pyrimidine Metabolism and Promotes Malignant Progression. Mol Cancer Res 14, 196–206, doi: 10.1158/1541-7786.mcr-15-0403 (2016).
Article CAS PubMed Google Scholar
Zhong, R. et al. Genetic variations in the TGFbeta signaling pathway, smoking and risk of colorectal cancer in a Chinese population. Carcinogenesis 34, 936–942, doi: 10.1093/carcin/bgs395 (2013).
Article CAS PubMed Google Scholar
Ke, J. et al. Replication study in Chinese population and meta-analysis supports association of the 5p15.33 locus with lung cancer. Plos ONE 8, e62485, doi: 10.1371/journal.pone.0062485 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Zhu, B. et al. Genetic variants in the SWI/SNF complex and smoking collaborate to modify the risk of pancreatic cancer in a Chinese population. Mol Carcinog 54, 761–768, doi: 10.1002/mc.22140 (2015).
Article CAS PubMed Google Scholar
Benjamini, Y., Drai, D., Elmer, G., Kafkafi, N. & Golani, I. Controlling the false discovery rate in behavior genetics research. Behav Brain Res 125, 279–284 (2001).
Article CAS PubMed Google Scholar
Barrett, J. C., Fry, B., Maller, J. & Daly, M. J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265, doi: 10.1093/bioinformatics/bth457 (2005).
Article CAS PubMed Google Scholar
Andersson, T., Alfredsson, L., Kallberg, H., Zdravkovic, S. & Ahlbom, A. Calculating measures of biological interaction. Eur J Epidemiol 20, 575–579 (2005).
Article PubMed Google Scholar
Knol, M. J., van der Tweel, I., Grobbee, D. E., Numans, M. E. & Geerlings, M. I. Estimating interaction on an additive scale between continuous determinants in a logistic regression model. Int J Epidemiol 36, 1111–1118, doi: 10.1093/ije/dym157 (2007).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors wish to thank all the study participants, research staff and students who participated in this work.

Author information

Authors and Affiliations

State Key Laboratory of Environment Health (Incubation), MOE (Ministry of Education) Key Laboratory of Environment & Health, Ministry of Environmental Protection Key Laboratory of Environment and Health (Wuhan), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Juntao Ke, Jiao Lou, Rong Zhong, Xueqin Chen, Jiaoyuan Li, Cheng Liu, Yajie Gong, Yang Yang, Ying Zhu, Yi Zhang, Jiang Chang & Jing Gong

Authors

Juntao Ke
View author publications
You can also search for this author in PubMed Google Scholar
Jiao Lou
View author publications
You can also search for this author in PubMed Google Scholar
Rong Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Xueqin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jiaoyuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yajie Gong
View author publications
You can also search for this author in PubMed Google Scholar
Yang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ying Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jiang Chang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Gong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design of the experiments: J.G. Conduct of the experiments: J.K. and J.L. DNA extraction: R.Z., X.C., J.L., C.L., Y.G., Y.Y., Ying, Z. and Yi, Z. Analysis and interpretation the data: J.K. Writing of the paper: J.K. Administrative, technical and material support: J.C. and J.G. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Ke, J., Lou, J., Zhong, R. et al. Identification of a Potential Regulatory Variant for Colorectal Cancer Risk Mapping to 3p21.31 in Chinese Population. Sci Rep 6, 25194 (2016). https://doi.org/10.1038/srep25194

Download citation

Received: 11 January 2016
Accepted: 12 April 2016
Published: 28 April 2016
DOI: https://doi.org/10.1038/srep25194

This article is cited by

CCDC12 promotes tumor development and invasion through the Snail pathway in colon adenocarcinoma
- Fengying Du
- Lipan Peng
- Jizhun Zhang
Cell Death & Disease (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.