Introduction

Gallbladder cancer (GC) is the most common malignancy of the biliary tract and the fifth most common cancer of the gastrointestinal tract.1 With an overall 5-year survival rate of <5%, GC is a highly lethal malignancy with very poor prognosis.2 Most of GCs were diagnosed at a very late stage because of the lack of symptoms and non-specific symptoms of early-stage tumors.1, 3 Although the incidence of GC is relatively rare compared with other cancers, higher incidences of GC have been reported in certain geographical regions including India, Pakistan, Ecuador, Korea and Japan.4 In addition, the prevalence of GC is known to be three times higher in women than in men.5 Although several clinical risk factors of GC such as gallstones, cholecystitis, porcelain gallbladder, gallbladder polyps, anomalous panreatobiliary duct junction and obesity have been indicated,1, 6 etiology of GC is largely unknown. Only few somatic genetic changes including mutations in K-ras, TP53 and p16Ink4/CDKN2 as well as loss of heterozygosity at several chromosomal regions harboring known or putative tumor-suppressor genes have been reported in GCs.5, 7 Hence, identification of novel genetic factors associated with susceptibility to GC should provide new insights into pathogenesis and novel therapeutic interventions of GC. Here, we report a genome-wide association study (GWAS) that aims to identify genetic factors associated with GC susceptibility.

Materials and methods

Subjects

In the GWAS, 41 patients who were diagnosed to have GC and 866 control subjects were examined. All cases were registered into Biobank Japan supported by the Ministry of Education, Culture, Sports, Science and Technology, Japan (http://www.biobankjp.org/). Controls consisted of healthy volunteers from Osaka-Midosuji Rotary Club. In the replication study, 30 GC patients and 898 controls were investigated. All cases and controls in the replication study were obtained from Biobank Japan. All control subjects do not have medical history of GC or gallstones, cholecystitis and other known confounding diseases for GC. Among the 71 cases, 45 were adenocarcimas and one was a squamous cell carcinoma, and the histological information for the remaining ones was not obtained. All subjects had given written informed consent to participate in the study in accordance with the process approved by Ethical Committee at the Institute of Medical Science of the University of Tokyo and Center for Genomic Medicine of RIKEN. Demographical information of subjects was summarized in Supplementary Table 1.

Genotyping and quality control

All cases and controls in the GWAS were genotyped by using the Illumina HumanHap550 Genotyping BeadChip (San Diego, CA, USA). QC of genotyping data was performed whereby subjects with a call rate of <98%, single-nucleotide polymorphisms (SNPs) with a call rate of <99% or minor allele frequency of <0.01, as well as SNPs with a Hardy–Weinberg equilibrium test's P-value of <1 × 10−6 were excluded from the subsequent statistical analysis. In the replication study, cases and controls were genotyped by using the Illumina HumanHap610 Genotyping BeadChip (San Diego).

Statistical analysis

Association of each SNP with susceptibility to GC was evaluated by logistic regression analysis conditioned on age and gender of subjects. The significance of association was evaluated based on the minimum P-value among the additive, dominant and recessive model of inheritance. SNPs that showed a minimum P-value of <1 × 10−4 in the GWAS were considered as showing suggestive association with GC and were examined in additional subjects. QC and statistical analysis were performed by using the PLINK statistical software (v1.06) (http://pngu.mgh.harvard.edu/~purcell/plink/).8

Imputation

Imputation analysis was performed based on genotype of Japanese (JPT) individuals in the Phase II HapMap database (release 24) by using software MACH v1.0 (http://www.sph.umich.edu/csg/yli/mach/index.html).9 SNPs located in the genomic region within 1500 kb upstream and downstream of the marker SNP, which showed the strongest association with GC, was imputed by implementing 50 Markov Chain iterations. As a QC measure, imputed SNPs with an imputation quality score of <0.3 were excluded from the subsequent association analysis. Pair-wise linkage disequilibrium (r2) between each SNP with the marker SNP was estimated by using the PLINK statistical software (v1.06); whereas regional association plot was generated by the R program v2.10.0 (http://www.r-project.org/). Possible functional consequences of SNPs were predicted in silico by using the SNPinfo web server (http://manticore.niehs.nih.gov/cgi-bin/snpinfo/snpfunc.cgi). eQTL analysis was performed based on data available from the Sanger Institute GENEVAR project10 for lymphoblastoid cell lines from the four HapMap populations.

Results

In this GWAS, 425 706 SNPs with a total genotyping rate of >99% in 41 GC patients and 866 controls were analyzed after QC of the genotyping data. A genomic inflation factor (λ) in the quantile–quantile plot (Supplementary Figure 1a) was calculated to be 0.9903, implying low possibility of false-positive associations resulting from the population stratification or cryptic relatedness. The Manhattan plot (Supplementary Figure 1b) indicated that none of the genotyped SNPs achieved genome-wide significant association (P<1 × 10−7) with GC. However, 130 SNPs showed suggestive association with a minimum P-value of <1 × 10−4. These SNPs were examined in the additional 30 GC patients and 898 controls, and the genotyping results are shown in Supplementary Table 2. Among these SNPs, SNP rs7504990 that is located in the DCC (deleted in colorectal cancer) achieved a minimum P-value of 9.67 × 10−5 under the recessive model of inheritance in the replication study. As illustrated in Table 1, this SNP achieved a P-value of 7.46 × 10−8 (OR=6.95; 95% CI=3.43–14.08) in the combined study. Genotype AA the risk genotype, was found to be enriched in cases than in the controls, with a genotype frequency of 0.19 versus 0.04. Interestingly, the SNP that showed the second lowest P-value, SNP rs4078288, is also located in the DCC (Table 1). To further characterize the association of the DCC region with the GC risk, genotypes of SNPs located in the genomic region within 1500 kb upstream and downstream of the marker SNP rs7504990 were imputed. Association analysis for the imputed and genotyped SNPs with GC susceptibility was then conducted. Figure 1 illustrates the regional plot for association results of the investigated SNPs. We observed that multiple SNPs in the genomic region adjacent to the marker SNP also showed associations, as strong as or stronger than the marker SNP, with GC susceptibility. SNPs that showed P-values of <1 × 10−6 with GC susceptibility were listed in Supplementary Table 3. All these SNPs are located in intronic regions of the DCC. Online prediction by using the SNPinfo web server (http://manticore.niehs.nih.gov/cgi-bin/snpinfo/snpfunc.cgi) revealed that none of these SNPs was likely to alter transcription factor binding site. In addition, none of them was linked to exonic SNPs of the DCC with an r2 value of 0.8 or over. We examined eQTL data for all SNPs in Supplementary Table 3 that showed strong association with GC (P<1 × 10−6) and found that none of them potentially altered expression of DCC in the four investigated HapMap populations.

Table 1 SNPs in DCC associated with GC
Figure 1
figure 1

Regional plot for associations of SNPs located within 1500 kb upstream and downstream of the marker SNP, rs7504990, with gallbladder cancer (GC). The −log10(P) values of SNPs were plotted against relative chromosomal locations. Diamond and circle signs represent genotyped and imputed SNPs, respectively. All SNPs are color-coded as red (0.8–1.0), orange (0.5–0.8), yellow (0.2–0.5) and white (<0.2), according to their pairwise r2 to the marker SNP. The marker SNP is indicated by an arrow and the combined P-value of the marker SNP is represented by a blue diamond sign. SNP positions followed NCBI build 36 coordinates. Estimated recombination rates (cM/Mb) were plotted in dark blue line.

Discussion

Through the current GWAS and the subsequent replication study, we report here that SNP rs7504990 in the DCC showed genome-wide significant association with GC susceptibility (PCombined=7.46 × 10−8; OR=6.95; 95% CI=3.43–14.08) in the Japanese population. Subsequent imputation and association analysis revealed additional SNPs located adjacent to the marker SNP rs7504990 in the DCC to be associated with GC susceptibility. Regional association plot indicated that SNPs showing strong association with GC are located in one linkage disequilibrium block that was not disrupted by recombination event. Results from subsequent in silico analysis and eQTL analysis are concordant, whereby none of these SNPs potentially alter transcription factor binding site nor possibly alter the expression of DCC.

DCC encodes a netrin 1 receptor, which is a transmembrane protein that is a member of the immunoglobulin superfamily of cell adhesion molecules.11 Loss of heterozygosity and microsatellite instability in the chromosome 18q21 region that contains the DCC have been observed in multiple cancer types, particularly cancers in the digestive organ including colon,12 stomach,13 esophagus,14 pancreas,15 and gallbladder.16 In addition, reduced or loss of the DCC expression has been associated with the poorly differentiated histological type, increased proliferation and metastasis through the loss of adhesiveness.17 Although the pathophysiological role of DCC in gallbladder carcinogenesis has not been clarified, DCC was demonstrated to induce apoptosis in the absence of its ligand netrin-1.18 Furthermore, enforced expression of netrin 1 in mouse gastrointestinal tract has been found to induce spontaneous formation of hyperplastic and neoplastic lesions, highlighting the potential role of DCC as a tumor-suppressor gene.19

Our finding that SNPs in DCC were associated with GC susceptibility supports findings of several recent studies, which also pointed out a tumor-suppressing role of the gene in gallbladder carcinogenesis. For instances, previous reports have revealed the incidences of loss of heterozygosity at DCC in GC to be as high as 30–45%.16, 19, 20 Chromosomal loss in human genome often implies that the affected region may harbor a tumor-suppressor gene, where the loss of which could lead to carcinogenesis. Nevertheless, both in silico and eQTL analysis indicated that SNPs in DCC showed no association with expression of the gene, implying other unknown mechanisms might be in action. Subsequent fine-mapping and resequencing for this region and functional analysis would be necessary to clarify association between DCC and GC.

In the current study, because of the limited number of subjects examined, we might not have enough statistical power to detect other genetic variants with modest or weak effects on susceptibility to GC. Hence, some SNPs of clinical importance might be missed in this study. Further replication study involving a larger number of samples and functional analysis of the DCC is urgently needed for validation of the association of DCC with the risk of GC.