12 new susceptibility loci for prostate cancer identified by genome-wide association study in Japanese population

Takata, Ryo; Takahashi, Atsushi; Fujita, Masashi; Momozawa, Yukihide; Saunders, Edward J.; Yamada, Hiroki; Maejima, Kazuhiro; Nakano, Kaoru; Nishida, Yuichiro; Hishida, Asahi; Matsuo, Keitaro; Wakai, Kenji; Yamaji, Taiki; Sawada, Norie; Iwasaki, Motoki; Tsugane, Shoichiro; Sasaki, Makoto; Shimizu, Atsushi; Tanno, Kozo; Minegishi, Naoko; Suzuki, Kichiya; Matsuda, Koichi; Kubo, Michiaki; Inazawa, Johji; Egawa, Shin; Haiman, Christopher A.; Ogawa, Osamu; Obara, Wataru; Kamatani, Yoichiro; Akamatsu, Shusuke; Nakagawa, Hidewaki

doi:10.1038/s41467-019-12267-6

Article
Open access
Published: 27 September 2019

Nature Communications volume 10, Article number: 4422 (2019)

Abstract

Genome-wide association studies (GWAS) have identified ~170 genetic loci associated with prostate cancer (PCa) risk, but most of them were identified in European populations. Here we performed a GWAS and replication study using a large Japanese cohort (9,906 cases and 83,943 male controls) to identify novel susceptibility loci associated with PCa risk. We found 12 novel loci for PCa, including rs11125927 (TMEM17, P = 3.95 × 10⁻¹⁶), rs73862213 (GATA2, P = 5.87 × 10⁻²³), rs77911174 (ZMIZ1, P = 5.28 × 10⁻²⁰), and rs138708 (SUN2, P = 1.13 × 10⁻¹⁵), seven of which had a markedly lower minor allele frequency in the European population. Furthermore, we stratified the polygenic risk for Japanese PCa patients by using 82 SNPs, which were significantly associated with Japanese PCa risk in our study, and found that early onset cases and cases with a family history of PCa were enriched in the genetically high-risk population. Our study provides important insight into the genetic mechanisms of PCa and facilitates PCa risk stratification in the Japanese population.

Introduction

The incidence of prostate cancer (PCa) among Asian males has been rising dramatically, even though it is still lower than that in western countries¹. In 2015, PCa became the most common cancer type among Japanese males². The best-known risk factor for PCa is family history³. A patient with a family history of PCa in a first-degree relative has twice the risk of developing PCa during his lifetime compared to a patient without a family history, indicating a strong influence of inherited genetics on PCa susceptibility³. Early onset PCa is also recognized as a marker of genetic susceptibility for hereditary PCa, and PCa cases with rare variants of BRCA2 and HOXB13 constitute 2.0 and 3.1% of early onset cases^4,5. To identify genetic polymorphisms associated with PCa, a number of genome-wide association studies (GWASs) have been conducted, which identified ~170 risk loci associated with PCa^6,7,8,9,10. However, most of these studies were carried out in populations of European ancestry or cohorts of mixed ethnicity. The considerable diversity of genetic background between different populations, as shown by the 1000 Genomes Project, together with the large difference in the incidence of PCa among ethnic groups, suggests a role of genetic risk factors in PCa disparities^11,12. Even though a recent study suggests that the number of genetic loci associated with PCa has almost saturated after studying more than a hundred thousand patients¹⁰, it is possible that even more PCa associated risk loci will be identified by studying populations of non-European ancestry.

We initially identified five PCa associated risk loci by GWAS in the Japanese population¹³, and a further meta-analysis in the Japanese population revealed three additional loci for PCa susceptibility¹⁴. We have also developed a highly reproducible polygenic risk estimation model for PCa detection, confirming polygenic risk of PCa in the Japanese population¹⁵. However, it is possible that increasing the sample size will further improve power and may lead to the identification of as-yet-unidentified risk loci in this population. Moreover, it remains unknown which of the ~140 risk loci associated with PCa in the European ancestry population are also reproducible in the Asian population.

In this study, we performed an even larger GWAS using an independent Japanese cohort to identify novel susceptibility loci associated with PCa and to validate the association of the previously reported risk loci with PCa risk in the Japanese population. Furthermore, we stratified the polygenic risk for Japanese PCa patients by using 82 significant single nucleotide polymorphisms (SNPs) and examined the clinical features of those at genetically high risk.

Results

GWAS in the Japanese population

The overall design of the GWAS is depicted in Fig. 1. A total of 5088 PCa cases from Biobank Japan (BBJ) were included^16,17. The controls consisted of 10,682 Japanese male subjects from four large cohort studies (the Japan Multi-Institutional Collaborative Cohort (J-MICC) study, the Japan Public Health Center-based Prospective (JPHC) study, Iwate Medical Megabank Organization (IMM), and Tohoku Medical Megabank Organization (ToMMo)) who had never been diagnosed with PCa^18,19,20. Detailed clinical characteristics of the subjects are shown in Supplementary Table 1. After quality control, the association study was performed for 523,051 SNPs. All case and control samples belonged to the same cluster in the principal component analysis (Supplementary Fig. 1). A quantile–quantile (Q-Q) plot revealed modest inflation of the test statistics (λ_GC = 1.189); however, the value adjusted for sample size, λ_GC 1000, was 1.027 (Supplementary Fig. 2). We further conducted imputation analysis using the data of 275 Asians in the 1000 Genomes Project Phase 1 as a reference (JPT: 89, CHB: 97, CHS: 89). As a result, 2997 SNPs showed P < 1 × 10⁻⁵ in the GWAS (Fig. 1). Many of these SNPs were located in independent genetic regions, including multiple loci at 8q24 (Fig. 2).

Replication study and novel PCa-susceptibility loci

Next, we conducted an independent replication study using 4818 cases from the BBJ and JIKEI cohorts, and 73,261 male controls from the BBJ cohort. Of the 2997 SNPs, we removed SNPs that showed low imputation quality (R square < 0.3) and SNPs that have previously been reported to be associated with PCa. We then selected, for each genetic region, the representative SNP that showed the strongest association with PCa in its linkage disequilibrium (LD) block. We also selected 20 SNPs which showed exceptionally strong association with PCa compared to the other SNPs in the same LD block. In total, we selected 101 SNPs for the replication study (Supplementary Table 2). In the replication study, genotyping was conducted using multi-index sequencing and the multiplex invader method (see Methods). Among the candidate SNPs, five SNPs could not be genotyped by either method and were excluded. When we combined both stages using the inverse-variance method, 12 SNPs which have not been reported previously were identified to be significantly associated with PCa at P < 5 × 10⁻⁸ (Table 1).

Table 1 12 novel PCa-susceptibility loci identified by GWAS and replication study in Japanese population

Locus explore plots of each locus are shown in Fig. 3 and Supplementary Fig. 3²¹. All novel SNPs identified in this study except rs138708 were located in non-exonic regions. Of the 12 new loci, five (rs7542260 at chr.1, rs75777376 at chr.8, rs16901814 at chr.8, rs4554825 at chr.10, and rs8023793 at chr.15) contained no protein-coding genes in their LD blocks or in their vicinities (within 100 kb). On the other hand, rs11125927 at chr.2 was located in an intron of the TMEM17 gene. rs73862213 at chr.3 was located near GATA2. The rs77911174 region at chr.10 included the ZMIZ1 gene. rs11055034 at chr.12 was in an intron of APOLD1, and the region also contained the 3’ end of CDKN1B. rs6117562 at chr.20 was in the region containing SLC52A3. rs138708 at chr.22 formed a very large LD block with many genes; however, rs138708 is a non-synonymous SNP in an exon of the SUN2 gene. The region spanning rs4826594 at chr.X included three genes, FGD1, GNL3L, and TSR2.

Expression analysis of the new loci

It is possible that these SNPs reside in enhancer or suppressor regions of nearby genes and influence prostate carcinogenesis by altering the expression of these genes. To check the association between the genotypes of the newly identified SNPs and the expression of genes within 1 Mb, we conducted eQTL analysis using the GTEx database²². As a result, four SNPs showed weak association with expression of nearby genes (P < 0.05, Supplementary Table 3). Among them, rs16901814 showed association with expression of the FAM84B gene, which is 311 kb away from the index SNP (P = 0.0204) (Supplementary Fig. 3b). Even though the function of FAM84B is not well known, its expression is elevated in various cancers, and its overexpression and copy number gain in PCa are reported to be associated with poor prognosis²³. rs4554825 was associated with expression of ZMIZ1-AS (P = 0.0361). As previously mentioned, ZMIZ1 is a gene that was present in the LD block containing rs77911174 (Fig. 3c). It is possible that both rs4554825 and rs77911174 are independently associated with PCa by altering expression or function of ZMIZ1. rs6117562 was associated with expression of FAM110A (P = 0.0104). FAM110A is a cell-cycle regulated gene whose function is not well known. FAM110 family proteins localize to centrosomes and are associated with microtubule aberrations²⁴, and FAM110A may affect prostate carcinogenesis through dysregulation of cell-cycle progression. rs6117562 was also associated with expression of CSNK2A1 (P = 0.0276), which is known as the gene responsible for Okur-Chung neurodevelopmental syndrome²⁵.

Association with the reported loci

Recent PCa GWAS have analyzed more than one hundred thousand patients and identified many genetic regions associated with PCa^6,7,8,9,10. However, these previous studies, as well as meta-analyses, did not identify the 12 susceptibility loci identified in this study (Supplementary Fig. 4). Among the novel loci identified, the minor allele frequency (MAF) in Caucasians was significantly lower than that in the Japanese at seven loci: rs7542260, rs73862213, rs75777376, rs4554825, rs77911174, rs138708, and rs4826594 (Supplementary Table 4). The difference in MAF likely accounts for the lack of identification of these loci in previous studies. On the other hand, the MAF in the European population did not significantly differ from that in the Japanese for the other SNPs. In these cases, the LD structure spanning the marker SNPs might differ between populations.

Since several of the SNPs newly identified in this study were located in regions close to previously reported PCa susceptibility loci, we checked the independence of association by conditional analysis with the GWAS data (Supplementary Table 5). rs721048, a known PCa susceptibility locus, is located near a newly identified SNP, rs11125927. However, rs11125927 was independently associated with PCa (P = 1.02 × 10⁻⁹) in the conditional analysis. We also checked for independence between rs73862213 and the EEFSEC region (rs10934853), and between rs4826594 and the NUDT10/11 region (rs5945619); both rs73862213 and rs4826594 were statistically independent of the previously reported loci. Both rs75777376 and rs16901814 in the 8q24 region, which are located near rs12543663, were also independently associated with PCa by conditional analysis. In addition, the two SNPs newly identified on chromosome 10 were independent of each other in the conditional analysis.

In this study, rs114780236 at chr.2 and rs4842687 at chr.12 showed significant association with Japanese PCa (Supplementary Table 6). However, these SNPs were located in regions close to the reported loci rs58235267 and rs5799921²⁶. Since our GWAS data did not contain the genotypes of rs58235267 and rs5799921, we could not conduct conditional analysis. Although these loci were not in complete LD with the reported loci in Japanese (the R square was 0.4701 for rs114780236 and rs58235267, and 0.6962 for rs4842687 and rs5799921), the relatively high correlation suggests that rs114780236 and rs4842687 may be in the same susceptibility regions as these reported loci in Japanese.

We examined whether the 167 previously reported PCa related loci are associated with PCa susceptibility in Japanese using our GWAS data. We excluded 13 SNPs for which data were not available and 18 SNPs which showed MAF < 0.01 or were mono-allelic, leaving 136 SNPs for analysis. We found that 68 SNPs showed weak association (P < 0.05), and 28 SNPs were strongly associated with PCa in Japanese after Bonferroni correction (P < 0.00037) (Supplementary Data 1). Using 85 of these SNPs, we then checked whether the rate of validation differed by the ethnic group studied in the original report. The validation rate was highest for the SNPs discovered using Asian cohorts (Supplementary Table 7). Of the 75 SNPs found in whites, 34 (45%) were nominally significant at P < 0.05 in Japanese, with 18 at P < 0.0005. Of the 10 SNPs found previously in Asian men, 8 (80%) were nominally significant (P < 0.05), all of them at P < 0.0005. The result highlights the large heterogeneity of PCa associated genetic factors between different ethnic groups.
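The SNP counts and the Bonferroni threshold quoted above follow from simple arithmetic. The short Python sketch below reproduces them; the variable names are illustrative and do not come from the study's own analysis scripts.

```python
# Counts taken from the text above; variable names are illustrative only.
n_reported = 167        # previously reported PCa loci
n_no_data = 13          # SNPs without data in this GWAS
n_rare_or_mono = 18     # SNPs with MAF < 0.01 or mono-allelic

n_tested = n_reported - n_no_data - n_rare_or_mono

# Bonferroni correction: nominal alpha divided by the number of tests.
bonferroni_threshold = 0.05 / n_tested

print(n_tested)                          # 136
print(round(bonferroni_threshold, 5))    # 0.00037
```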

Polygenic risk score in Japanese population

Finally, we selected the 12 SNPs which were newly discovered in this study and the 68 previously reported SNPs which showed nominal association with Japanese PCa risk (Supplementary Table 8). Since our GWAS data did not contain two reported loci, rs58235267 and rs5799921, we also included rs114780236 and rs4842687 to represent the two regions. We calculated a polygenic risk score (PRS) by counting the number of risk alleles and their effects in each individual of the GWAS samples. The distribution of the PRS in the PCa cases (n = 4893) and the male controls (n = 10,682) is shown in Fig. 4a. We defined the upper 5% of the cases (n = 245) as the genetic high-risk group and the lower 5% as the low-risk group (n = 245), and examined clinical features of these genetically risk-stratified groups. Notably, we found that the mean age at diagnosis of the high-risk group was 2.7 years younger than that of the non-high-risk group (mean age 68.7 vs 71.4 years, P = 6.54 × 10⁻⁸, by t-test), while we observed little difference in mean age at diagnosis between the low-risk and non-low-risk groups (72.3 vs 71.1, P = 0.020). We observed enrichment of high PRS in early onset PCa cases (P = 0.00221 for cases aged < 60 years, by t-test, and P = 4.30 × 10⁻⁹ for cases aged < 65 years, Fig. 4b). The high-risk group was also enriched with patients who have a positive family history of PCa (P = 0.00339, by Fisher test, Fig. 4c). On the other hand, when we recalculated the PRS using 150 SNPs after adding the 68 reported SNPs which showed no association with Japanese PCa, the statistical association between PRS and early onset PCa in the high-risk group became weaker (P = 0.02395 for cases aged < 60 years, by t-test, and P = 3.24 × 10⁻⁷ for cases aged < 65 years, Supplementary Fig. 5a). The association between PRS and positive family history of PCa also declined (P = 0.02395, by Fisher test, Supplementary Fig. 5b).
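The 5% risk groups described above can be illustrated with a minimal Python sketch. It assumes the group size is obtained by rounding 5% of the number of cases to the nearest integer, which reproduces n = 245 for 4893 cases; the function name and the rounding rule are assumptions for illustration, not details stated in the text.

```python
def stratify_by_score(scores, fraction=0.05):
    """Return (high, low) sets of indices for the top and bottom `fraction` of scores.

    Assumption: the group size is round(n * fraction); the text does not
    state how the 5% cut-off was rounded.
    """
    n_group = round(len(scores) * fraction)
    # Indices ordered from the lowest to the highest score.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    low = set(order[:n_group])
    high = set(order[len(order) - n_group:])
    return high, low


# Group size implied for the 4893 GWAS cases.
print(round(4893 * 0.05))  # 245
```

Clinical features such as age at diagnosis or family history would then be compared between each index set and the remaining cases.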

Furthermore, we calculated the PRS in the PCa cases of the two validation cohorts (BBJ n = 2386, and JIKEI n = 2218) by counting the risk alleles of the 63 SNPs for which genotypes were available in the replication study (Supplementary Table 8). The distribution of the PRS in both cohorts is shown in Supplementary Fig. 5c. We confirmed that in both cohorts, the age at diagnosis for the 5% high-risk group was approximately 2 years younger than that of the non-high-risk group (P = 0.023 in the BBJ cohort and P = 0.0061 in the JIKEI cohort). We observed that early onset PCa was enriched in the high PRS group of the BBJ cohort (P = 0.0111 for cases aged < 60 years, by t-test, and P = 0.0451 for cases aged < 65 years, Supplementary Fig. 5d) and in the JIKEI cohort (P = 0.00378 for cases aged < 55 years, P = 0.00119 for cases aged < 60 years, and P = 8.90 × 10⁻⁵ for cases aged < 65 years, by t-test, Fig. 4d). We also confirmed that the high-risk group was enriched with patients who have a positive family history of PCa in the BBJ replication cohort (P = 0.0281, by Fisher test, Supplementary Fig. 5e).

Discussion

In this study, we identified 12 new PCa-susceptibility loci in the Japanese population, but their functional or biological significance in PCa development is still unclear, as is the case with many previously reported loci. Because most of them are located in non-coding regions and our expression analysis found only four loci to be weakly related to expression of nearby genes, these loci are likely associated with gene regulatory functions.

Among the 12 loci, rs11125927 at chr.2 contained the TMEM17 gene in its LD block. TMEM17 is a cilium-associated gene reported to suppress invasion and migration of non-small cell lung cancer by restoring Occludin and ZO-1 expression through inactivation of the ERK-P90RSK-Snail pathway²⁷. rs73862213 at chr.3 contained GATA2 in the same LD block, which plays an important role in promoting high grade PCa²⁸. In addition to its role as a transcription factor that promotes androgen receptor (AR) binding and activation, GATA2 regulates a subset of clinically relevant PCa associated genes in an AR independent manner. Functionally, GATA2 overexpression promotes cell motility, migration, growth, tumorigenesis, and therapy resistance in PCa. The rs77911174 region at chr.10 included the ZMIZ1 gene, whose product binds to AR and enhances its transcriptional activity in PCa cells²⁹. In addition, it co-localizes with AR and SUMO1 and promotes sumoylation of AR in vivo. The same genetic region has previously been reported to be associated with susceptibility to colon cancer and breast cancer^30,31. Our eQTL analysis suggested an association of ZMIZ1-AS (antisense) expression with another independent locus, rs4554825, suggesting that regulation of ZMIZ1 expression may be important in prostate carcinogenesis. The LD block spanning rs11055034 at chr.12 contained the 3’ end of CDKN1B, which is strongly expressed in non-proliferating cells and plays important roles in the regulation of both quiescence and G1 progression³². It is known to act as a tumor suppressor in PCa, and suppression of CDKN1B leads to growth promotion of PCa cells³². rs6117562 at chr.20 was in the region containing SLC52A3, which is known as a transporter for riboflavin; however, its association with PCa has not been reported³³. rs138708 at chr.22 is a non-synonymous SNP of SUN2, which is a member of the LINC complex, plays an important role in nuclear-cytoplasmic connection, and suppresses the Warburg effect in cancer³⁴. Overexpression of SUN2 inhibits PCa cell growth and SUN2 knockdown promotes PCa growth³⁵. The LD block spanning rs4826594 at chr.X included three genes, FGD1, GNL3L, and TSR2. Fgd1 is transiently associated with invadopodia, is required for their formation and function in extracellular matrix degradation, and acts by direct modulation of Cdc42 activation³⁶. FGD1 is overexpressed in human prostate and breast cancer and is associated with tumor aggressiveness³⁶. GNL3L has been reported to directly bind to and stabilize MDM2 protein; however, its role in PCa has not been studied³⁷. TSR2 is known to enhance apoptosis by suppressing NF-kB signaling in laryngeal cancer³⁸. The genes listed above potentially influence PCa development, and further functional analysis is warranted.

Polygenic risk estimation using common variants has been attempted for PCa. For breast cancer, a PRS consisting of 77 SNPs tested in 33,000 cases demonstrated significant interaction between PRS and age and family history³⁹. Genetic risk prediction of PCa was first reported using five common susceptibility variants⁴⁰, and was established by simply counting the number of risk alleles. Subsequently, models incorporated an increasing number of significant variants¹⁵, and some models use many more variants, including variants that are not significant at the genome-wide level⁴¹. It is still controversial which and how many variants should be used for PRS⁴². Ethnic-specific or genomic structure-specific information is also an important issue for PRS. In this study, we used 82 SNPs which were statistically associated with PCa in the Japanese population for the PRS and found that early onset and familial PCa were enriched in the high-risk group in the Japanese population. Early onset and familial PCa, in which genetic factors should be more involved, were reported to be enriched in cases with rare variants of BRCA2 and HOXB13 in the Caucasian population^4,5. However, this is the first report to show that risk assessment using more than 50 common variants may explain the hereditary phenotype of PCa. In addition, the PRS using 82 SNPs which are associated with Japanese PCa showed stronger association with early onset PCa and positive family history of PCa than the PRS with 150 SNPs, which also included the SNPs that failed to show association with Japanese PCa. The result suggests that more precise selection of patients at high risk of developing PCa may be possible with an ethnic population-specific PRS. Early onset PCa is often more aggressive and may have a different etiology than later-onset PCa⁴³. Among men diagnosed with high grade and advanced stage PCa, men with early onset PCa are more likely to die of their cancer, with higher cause-specific mortality than later-onset disease⁴³. In addition, familial PCa accounts for a greater proportion of PCa in early onset cases than it does in men diagnosed at older ages. These PCas have been shown to have a more significant genetic component, indicating that this group may benefit the most from evaluation of genetic risk⁴⁴. Since the PRS model may be useful for the early detection of early onset PCa and familial PCa, PRS might have a considerable impact on the clinical examination and treatment of PCa. However, further replication of this risk stratification in other, larger cohorts is required before the model is applied to clinical use.

As with other GWAS for PCa, contamination of controls with undiagnosed PCa is a limitation of our study. In Japan, as in western countries, an increasing number of PCa cases are detected by PSA screening. However, it is estimated that still less than 50% of males over 50 actually undergo PSA screening in Japan, and 10% of newly diagnosed PCa patients present with metastatic disease. Therefore, it is likely that the controls in this study include undiagnosed PCa cases to a certain extent. This also implies that the SNPs identified in this study are potentially associated with factors other than PCa carcinogenesis, such as the health consciousness that leads men to receive screening. Continuing efforts should be made to reveal the biological significance of each SNP reported in GWAS, including this study, which will hopefully delineate the complex interaction between genetic susceptibility and environmental exposure.

In summary, we have conducted a large-scale GWAS in the Japanese population and identified 12 PCa susceptibility loci that have not been reported previously. We stratified the polygenic risk for Japanese PCa patients by using 82 associated SNPs and showed that early onset and familial PCa cases were enriched in the genetically high-risk population.

Methods

Study population

The GWAS included 5088 cases from BioBank Japan, which was established in the Institute of Medical Science at the University of Tokyo^16,17. Among the 5088 cases for the GWAS, 272 (5.3%) subjects had a family history of PCa, 1219 (23.9%) subjects had PSA ≥ 10, and 1989 (39.1%) subjects were diagnosed with cancer of Gleason score 7 or higher (Supplementary Table 1). From the BBJ, pathologically proven PCa cases were selected. Non-cancer controls were from four population-based cohorts: the J-MICC study¹⁸, the JPHC Study¹⁹, IMM, and ToMMo²⁰. Genomic DNA samples were extracted from peripheral blood leukocytes and normal tissues using a standard method. All participating studies obtained informed consent from all participants by following the protocols approved by their institutional ethical committees before enrollment, and the ethical committees at each institute approved the project (BBJ: https://biobankjp.org/english/index.html, J-MICC: http://www.jmicc.com/en/, JPHC: https://epi.ncc.go.jp/en/jphc/index.html, IMM: http://iwate-megabank.org/en/, ToMMo: https://www.megabank.tohoku.ac.jp/english/).

Genotyping and quality control

The GWAS was conducted using the Illumina OmniExpress Exome or the OmniExpress + HumanExome BeadChip (Illumina Inc., San Diego, California, U.S.). Of the 947,830 SNPs genotyped, 195,588 were mono-allelic and were excluded from further analysis. Cluster plots of the top 100 SNPs showing the smallest P-values were checked by visual inspection, and 604,992 SNPs met the criterion of call rate ≥ 0.99 in both case and control samples. Finally, SNPs with P ≥ 1.00 × 10⁻⁶ in a Hardy-Weinberg equilibrium test were selected. The association study was performed for a total of 523,051 SNPs.

Imputation of the un-genotyped SNPs was conducted by MaCH⁴⁵ and minimac⁴⁶ using the data from the JPT/CHB/CHS subjects of the 1000 Genomes Project Phase 1 (release 16 March 2012) as a reference. We excluded SNPs with a large allele frequency difference between the reference panel and the GWAS (>0.16)⁴⁷. We also excluded SNPs with a low imputation quality score (R square < 0.3) and insertion/deletion polymorphisms.

Samples and genotyping for the replication studies

We conducted a replication study using 4818 independent PCa cases and 73,261 controls. Case samples were obtained from BioBank Japan (2236 cases) and from The Jikei University School of Medicine (JIKEI, 2582 cases). The JIKEI sample has been described previously, with new samples being added for this study⁴⁸. All cases were histologically diagnosed by local pathologists, and clinical data were collected by local urologists. Controls in the replication study were the 73,261 male samples from BBJ that were subjected to GWAS for diseases other than PCa.

A multi-index PCR-based target sequencing method was used to sequence the target region of case samples⁴⁹. We used a two-step PCR method to construct DNA libraries. The 1st PCR (25 cycles) was performed with 202 primer pairs and 2X Platinum Multiplex PCR Master Mix (Thermo Fisher Scientific) to amplify the target region, followed by the 2nd PCR (4 cycles) with 8-bp barcode and adapter sequences added using primers targeting shared 5’ overhangs introduced during the 1st PCR and KAPA HiFi HotStart DNA Polymerase (KAPA). After purification and quantification of pooled libraries, we sequenced them by 2 × 150-bp paired-end reads on a HiSeq 2500 (Illumina) instrument. Sequence reads allocated to each individual were aligned to the human reference sequence (hg19) using Burrows-Wheeler Aligner (ver. 0.7.12) and processed using Genome Analysis Toolkit (GATK, ver. 3.4–46)^50,51. For quality control, we selected individuals in which more than 98% of the target region was covered with 20 or more sequencing reads. We called variants of each individual separately using UnifiedGenotyper and HaplotypeCaller of GATK, and VCMM (ver. 1.0.2)⁵². Genotypes for all individuals were jointly determined for each variant based on the sequencing read ratio of reference and alternative alleles. When the alternative allele frequency was between 0 and 0.15, between 0.25 and 0.75, and between 0.85 and 1, we assigned homozygote of the reference allele, heterozygote, and homozygote of the alternative allele, respectively. The SNPs that could not be analyzed by multi-index sequencing were genotyped by multiplex PCR-based invader assay⁵³. The five SNPs that could not be genotyped by either assay were excluded from the replication study.
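The read-ratio genotype assignment described above can be sketched as a small Python function. Two points are assumptions made for illustration and are not stated in the text: the interval boundaries are treated as inclusive, and fractions falling in the gaps between intervals (0.15 to 0.25 and 0.75 to 0.85) are returned as no-calls. The function name and genotype labels are hypothetical.

```python
from typing import Optional


def call_genotype(alt_fraction: float) -> Optional[str]:
    """Assign a genotype from the fraction of reads carrying the alternative allele.

    Assumptions (not from the source): inclusive boundaries, and None
    (no-call) for fractions that fall between the stated intervals.
    """
    if not 0.0 <= alt_fraction <= 1.0:
        raise ValueError("alt_fraction must lie between 0 and 1")
    if alt_fraction <= 0.15:
        return "ref/ref"   # homozygote of the reference allele
    if 0.25 <= alt_fraction <= 0.75:
        return "ref/alt"   # heterozygote
    if alt_fraction >= 0.85:
        return "alt/alt"   # homozygote of the alternative allele
    return None            # ambiguous read ratio


for fraction in (0.05, 0.50, 0.95, 0.20):
    print(fraction, call_genotype(fraction))
```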

Statistical analysis

In all stages, association of each SNP was assessed under an additive model. In the GWAS, the genetic inflation factor λ_GC was derived from P-values obtained by the Cochran-Armitage trend test for all the tested SNPs. The quantile–quantile plot was drawn using the R program. λ_GC 1000 was calculated using the following formula:⁵⁴

$$\lambda _{{\mathrm{GC}}\;1000} = 1 + \left( {\lambda _{{\mathrm{GC}}\;{\mathrm{obs}}} - 1} \right) \times (1/n_{{\mathrm{cases}}} + 1/n_{{\mathrm{controls}}})/(1/1000_{{\mathrm{cases}}} + 1/1000_{{\mathrm{controls}}}).$$
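As a numerical check, the sample-size rescaling can be applied to the values reported in Results (λ_GC = 1.189 with 5088 cases and 10,682 controls). The Python sketch below assumes the usual form of the rescaling, in which the excess inflation (λ_GC − 1) is scaled by the ratio of the study's sample-size term to that of 1000 cases and 1000 controls; the function name is illustrative.

```python
def lambda_gc_1000(lambda_obs: float, n_cases: int, n_controls: int) -> float:
    """Rescale an observed genomic inflation factor to 1000 cases and 1000 controls."""
    # Ratio of the study's sample-size term to that of a 1000/1000 study.
    scale = (1 / n_cases + 1 / n_controls) / (1 / 1000 + 1 / 1000)
    # Only the excess inflation (lambda - 1) is rescaled.
    return 1 + (lambda_obs - 1) * scale


adjusted = lambda_gc_1000(1.189, 5088, 10682)
print(round(adjusted, 3))  # 1.027
```

The result agrees with the adjusted value of 1.027 reported for the GWAS stage.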

Odds ratios were calculated using the non-effect alleles as references, unless stated otherwise. The results of the combined analyses of the GWAS and the replication study were verified by the Mantel–Haenszel method. Heterogeneity across the two stages was examined using PLINK⁵⁵. We considered P = 5 × 10⁻⁸ (GWAS and meta-analysis) as the significance threshold after Bonferroni correction for multiple testing.
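For orientation, a two-stage combination of per-SNP effect estimates can be sketched as a fixed-effect inverse-variance meta-analysis. This is a generic illustration that assumes the combined analysis weights each stage's log odds ratio by the inverse of its squared standard error; it is not the authors' code, and the input values are hypothetical.

```python
import math


def inverse_variance_meta(betas, standard_errors):
    """Fixed-effect inverse-variance combination of per-stage log odds ratios.

    Returns the pooled log odds ratio, its standard error, and a
    two-sided P-value from the normal approximation.
    """
    weights = [1 / se ** 2 for se in standard_errors]
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    pooled_se = math.sqrt(1 / sum(weights))
    z = pooled_beta / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail
    return pooled_beta, pooled_se, p_value


# Hypothetical log odds ratios and standard errors for one SNP in two stages.
beta, se, p = inverse_variance_meta([0.20, 0.10], [0.05, 0.05])
print(beta, se, p)
```

With equal standard errors the pooled estimate is the simple mean of the two stages, and the pooled standard error shrinks by a factor of √2.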

Polygenic risk score

Polygenic score was computed as the weighted sum of the number of risk alleles. Log odds ratios computed in the GWAS part of this study were used as the weights. The number of incorporated SNPs was 82 and 63 in the GWAS and the validation cohorts, respectively (Supplementary Table 8). For imputed alleles in GWAS, dosage values were used as the number of risk alleles. Statistical association between polygenic score and clinical information was analyzed using the statistical software R⁵⁶.
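A minimal Python sketch of the weighted sum described above is given here. Each SNP contributes its risk-allele count (or imputed dosage) multiplied by the log odds ratio from the GWAS stage. The function name and the three example SNPs are hypothetical and are not taken from Supplementary Table 8.

```python
import math


def polygenic_score(dosages, odds_ratios):
    """Weighted sum of risk-allele dosages, using log odds ratios as weights."""
    if len(dosages) != len(odds_ratios):
        raise ValueError("dosages and odds_ratios must have the same length")
    return sum(d * math.log(o) for d, o in zip(dosages, odds_ratios))


# Hypothetical individual: two genotyped SNPs (counts 2 and 1) and one
# imputed SNP with a fractional dosage of 0.4 risk alleles.
example_dosages = [2, 1, 0.4]
example_odds_ratios = [1.20, 1.10, 1.35]

score = polygenic_score(example_dosages, example_odds_ratios)
print(score)
```

Using dosages rather than hard genotype calls for imputed SNPs keeps imputation uncertainty in the score, which matches the treatment of imputed alleles described above.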

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

GWAS summary statistics of prostate cancer will be publicly available at our website (JENGER, http://jenger.riken.jp/en/) and the National Bioscience Database Center (NBDC, https://humandbs.biosciencedbc.jp/en/) Human Database. Genotype data of case samples are available at NBDC under research ID hum0014.

References

Bray, F. et al. Global Cancer Statistics 2018: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J. Clin. 68, 394–424 (2018).
Article Google Scholar
Center for Cancer Control and Information Services, National Cancer Center. Projected Cancer Statistics. http://ganjoho.jp/reg_stat/statistics/stat/short_pred.html (2015).
Lichtenstein, P. et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland. N. Engl. J. Med. 343, 78–85 (2000).
Article CAS Google Scholar
Edwards, S. M. et al. Two percent of men with early-onset prostate cancer harbor germline mutations in the BRCA2 gene. Am. J. Hum. Genet. 72, 1–12 (2003).
Article CAS Google Scholar
Ewing, C. M. Germline mutations in HOXB13 and prostate-cancer risk. N. Engl. J. Med. 366, 141–149 (2012).
Article CAS Google Scholar
Eeles, R. A. et al. Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat. Genet. 45, 385–391 (2013).
Article CAS Google Scholar
Al Olama, A. A. et al. A meta-analysis of 87,040 individuals identifies 23 new susceptibility loci for prostate cancer. Nat. Genet. 46, 1103–1109 (2014).
Kote-Jarai, Z. et al. Seven prostate cancer susceptibility loci identified by a multi-stage genome-wide association study. Nat. Genet. 43, 785–791 (2011).
Thomas, G. et al. Multiple loci identified in a genome-wide association study of prostate cancer. Nat. Genet. 40, 310–315 (2008).
Schumacher, F. R. et al. Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci. Nat. Genet. 50, 928–936 (2018).
1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Takata, R. et al. Genome-wide association study identifies five new susceptibility loci for prostate cancer in the Japanese population. Nat. Genet. 42, 751–754 (2010).
Akamatsu, S. et al. Common variants at 11q12, 10q26 and 3p11.2 are associated with prostate cancer susceptibility in Japanese. Nat. Genet. 44, 426–429 (2012).
Akamatsu, S. et al. Reproducibility, performance, and clinical utility of a genetic risk prediction model for prostate cancer in Japanese. PLoS ONE 7, e46454 (2012).
Nagai, A. et al. Overview of the BioBank Japan Project: study design and profile. J. Epidemiol. 27, S2–S8 (2017).
Hirata, M. et al. Overview of BioBank Japan follow-up data in 32 diseases. J. Epidemiol. 27, S22–S28 (2017).
J-MICC Study Group. The Japan Multi-Institutional Collaborative Cohort Study (J-MICC Study) to detect gene-environment interactions for cancer. Asian Pac. J. Cancer Prev. 8, 317–323 (2007).
Tsugane, S. et al. The JPHC study: design and some findings on the typical Japanese diet. Jpn J. Clin. Oncol. 44, 777–782 (2014).
Kuriyama, S. et al. The Tohoku Medical Megabank Project: design and mission. J. Epidemiol. 26, 493–511 (2016).
Dadaev, T. et al. LocusExplorer: a user-friendly tool for integrated visualization of human genetic association data and biological annotations. Bioinformatics 32, 949–951 (2016).
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Wong, N. et al. Upregulation of FAM84B during prostate cancer progression. Oncotarget 8, 19218–19235 (2017).
Hauge, H. et al. Characterization of the FAM110 gene family. Genomics 90, 14–27 (2007).
Okur, V. et al. De novo mutations in CSNK2A1 are associated with neurodevelopmental abnormalities and dysmorphic features. Hum. Genet. 135, 699–705 (2016).
Al Olama, A. A. et al. Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans. Hum. Mol. Genet. 24, 5589–5602 (2015).
Zhang, X. et al. TMEM17 depresses invasion and metastasis in lung cancer cells via ERK signaling pathway. Oncotarget 8, 70685–70694 (2017).
Galsky, M. D. et al. The role of GATA2 in lethal prostate cancer aggressiveness. Nat. Rev. Urol. 14, 38–48 (2017).
Sharma, M. et al. hZimp10 is an androgen receptor co-activator and forms a complex with SUMO-1 at replication foci. EMBO J. 22, 6101–6114 (2003).
Song, N. et al. Common risk variants for colorectal cancer: an evaluation of associations with age at cancer onset. Sci. Rep. 7, 40644 (2017).
Turnbull, C. et al. Genome-wide association study identifies five new breast cancer susceptibility loci. Nat. Genet. 42, 504–507 (2010).
Tsihlias, J. et al. The prognostic significance of altered cyclin-dependent kinase inhibitors in human cancer. Annu. Rev. Med. 50, 401–423 (1999).
Yao, Y. et al. Identification and comparative functional characterization of a new human riboflavin transporter hRFT3 expressed in the brain. J. Nutr. 140, 1220–1226 (2010).
Lv, X. B. et al. SUN2 exerts tumor suppressor functions by suppressing the Warburg effect in lung cancer. Sci. Rep. 5, 17940 (2015).
Yajun, C. et al. Loss of Sun2 promotes the progression of prostate cancer by regulating fatty acid oxidation. Oncotarget 8, 89620–89630 (2017).
Ayala, I. et al. Faciogenital dysplasia protein Fgd1 regulates invadopodia biogenesis and extracellular matrix degradation and is up-regulated in prostate and breast cancer. Cancer Res. 69, 747–752 (2009).
Meng, L. et al. GNL3L depletion destabilizes MDM2 and induces p53-dependent G2/M arrest. Oncogene 30, 1716–1726 (2011).
He, H. J. et al. TSR2 Induces laryngeal cancer cell apoptosis through inhibiting NF-κB signaling pathway. Laryngoscope 128, E130–E134 (2018).
Mavaddat, N. et al. Prediction of breast cancer risk based on profiling with common genetic variants. J. Natl Cancer Inst. 107, djv036 (2015).
Zheng, S. L. et al. Cumulative association of five genetic variants with prostate cancer. N. Engl. J. Med. 358, 910–919 (2008).
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
Chatterjee, N. et al. Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies. Nat. Genet. 45, 400–405 (2013).
Barry, K. H. et al. Risk of early-onset prostate cancer associated with occupation in the Nordic countries. Eur. J. Cancer 87, 92–100 (2017).
Salinas, C. A. et al. Prostate cancer in young men: an important clinical entity. Nat. Rev. Urol. 11, 317–323 (2014).
Scott, L. J. et al. A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316, 1341–1345 (2007).
Howie, B. et al. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat. Genet. 44, 955–959 (2012).
Low, S. K. et al. Identification of six new genetic loci associated with atrial fibrillation in the Japanese population. Nat. Genet. 49, 953–958 (2017).
Yamada, H. et al. Replication of prostate cancer risk loci in a Japanese case-control association study. J. Natl Cancer Inst. 101, 1330–1336 (2009).
Momozawa, Y. et al. Low-frequency coding variants in CETP and CFB are associated with susceptibility of exudative age-related macular degeneration in the Japanese population. Hum. Mol. Genet. 25, 5027–5034 (2016).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Shigemizu, D. et al. A practical method to detect SNVs and indels from whole genome and exome sequencing data. Sci. Rep. 3, 2161 (2013).
Ohnishi, Y. et al. A high-throughput SNP typing system for genome-wide association studies. J. Hum. Genet. 46, 471–477 (2001).
Freedman, M. L. et al. Assessing the impact of population stratification on genetic association studies. Nat. Genet. 36, 388–393 (2004).
Breslow, N. E. et al. Statistical methods in cancer research. Volume II—the design and analysis of cohort studies. IARC Sci. Publ. 82, 1–406 (1987).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.


Acknowledgements

The authors acknowledge the staff of the BBJ Project, the JPHC Study, the J-MICC Study, IMM and ToMMo, for their outstanding assistance in collecting samples and clinical information. The BioBank Japan (BBJ) and Japanese GWAS were supported by the Ministry of Education, Culture, Sports, Science and Technology of the Japanese government. The J-MICC Study was supported by Grants-in-Aid for Scientific Research for Priority Areas of Cancer (No. 17015018) and Innovative Areas (No. 221S0001) and by JSPS KAKENHI Grants (No. 16H06277) from the Japanese Ministry of Education, Culture, Sports, Science and Technology. The JPHC Study has been supported by the National Cancer Center Research and Development Fund since 2011 and was supported by a Grant-in-Aid for Cancer Research from the Ministry of Health, Labour and Welfare of Japan from 1989 to 2010.

Author information

Authors and Affiliations

Department of Urology, Iwate Medical University, Morioka, 020-8505, Japan
Ryo Takata & Wataru Obara
Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045, Japan
Ryo Takata, Masashi Fujita, Kazuhiro Maejima, Kaoru Nakano, Shusuke Akamatsu & Hidewaki Nakagawa
Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045, Japan
Atsushi Takahashi & Yoichiro Kamatani
Department of Genomic Medicine, National Cerebral and Cardiovascular Center Research Institute, Suita, 564-8565, Japan
Atsushi Takahashi
Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045, Japan
Yukihide Momozawa
The Institute of Cancer Research, London, SW7 3RP, UK
Edward J. Saunders
Department of Urology, Jikei University School of Medicine, 105-8461, Tokyo, Japan
Hiroki Yamada & Shin Egawa
Department of Preventive Medicine, Faculty of Medicine, Saga University, 840-8502, Saga, Japan
Yuichiro Nishida
Department of Preventive Medicine, Nagoya University Graduate School of Medicine, Nagoya, 466-8550, Japan
Asahi Hishida & Kenji Wakai
Division of Cancer Epidemiology and Prevention, Aichi Cancer Center Research Institute, 464-8681, Nagoya, Japan
Keitaro Matsuo
Department of Cancer Epidemiology, Nagoya University Graduate School of Medicine, Nagoya, 466-8550, Japan
Keitaro Matsuo
Division of Epidemiology, Center for Public Health Sciences, National Cancer Center, Tokyo, 104-0045, Japan
Taiki Yamaji, Norie Sawada & Motoki Iwasaki
Center for Public Health Sciences, National Cancer Center, 104-0045, Tokyo, Japan
Shoichiro Tsugane
Iwate Tohoku Medical Megabank Organization, Iwate Medical University, Yahaba, 028-3694, Japan
Makoto Sasaki, Atsushi Shimizu & Kozo Tanno
Tohoku Medical Megabank Organization, Tohoku University, Sendai, 980-8573, Japan
Naoko Minegishi & Kichiya Suzuki
Laboratory of Clinical Genome Sequencing, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 108-8639, Tokyo, Japan
Koichi Matsuda
RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045, Japan
Michiaki Kubo
Department of Molecular Cytogenetics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, 113-8510, Japan
Johji Inazawa
Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, 90033, USA
Christopher A. Haiman
Department of Urology, Kyoto University Graduate School of Medicine, Kyoto, 606-8501, Japan
Osamu Ogawa & Shusuke Akamatsu

Authors

Ryo Takata
Atsushi Takahashi
Masashi Fujita
Yukihide Momozawa
Edward J. Saunders
Hiroki Yamada
Kazuhiro Maejima
Kaoru Nakano
Yuichiro Nishida
Asahi Hishida
Keitaro Matsuo
Kenji Wakai
Taiki Yamaji
Norie Sawada
Motoki Iwasaki
Shoichiro Tsugane
Makoto Sasaki
Atsushi Shimizu
Kozo Tanno
Naoko Minegishi
Kichiya Suzuki
Koichi Matsuda
Michiaki Kubo
Johji Inazawa
Shin Egawa
Christopher A. Haiman
Osamu Ogawa
Wataru Obara
Yoichiro Kamatani
Shusuke Akamatsu
Hidewaki Nakagawa

Contributions

R.T., S.A., and H.N. directed and designed the study and wrote the manuscript. R.T., A.T., M.F., Y.K., and S.A. performed statistical analysis. Y.M., K.A., K.N., and H.N. performed genotyping in replication study. E.S. and C.H. analyzed the new loci. H.Y., Y.N., A.H., K.M., K.W., T.Y., N.S., M.I., S.T., M.S., A.S., K.T., N.M., K.S., K.M., M.K., S.E., O.O., W.O., and H.N. contributed to sample and data acquisition. J.I., O.O., and W.O. acquired the funding.

Corresponding authors

Correspondence to Ryo Takata, Shusuke Akamatsu or Hidewaki Nakagawa.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Stephen Chanock and William Nelson for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Takata, R., Takahashi, A., Fujita, M. et al. 12 new susceptibility loci for prostate cancer identified by genome-wide association study in Japanese population. Nat Commun 10, 4422 (2019). https://doi.org/10.1038/s41467-019-12267-6


Received: 05 April 2019
Accepted: 02 September 2019
Published: 27 September 2019
DOI: https://doi.org/10.1038/s41467-019-12267-6

