Article | Open | Published:

Identification of multiple risk loci and regulatory mechanisms influencing susceptibility to multiple myeloma

Nature Communicationsvolume 9, Article number: 3707 (2018) | Download Citation

Abstract

Genome-wide association studies (GWAS) have transformed our understanding of susceptibility to multiple myeloma (MM), but much of the heritability remains unexplained. We report a new GWAS, a meta-analysis with previous GWAS and a replication series, totalling 9974 MM cases and 247,556 controls of European ancestry. Collectively, these data provide evidence for six new MM risk loci, bringing the total number to 23. Integration of information from gene expression, epigenetic profiling and in situ Hi-C data for the 23 risk loci implicate disruption of developmental transcriptional regulators as a basis of MM susceptibility, compatible with altered B-cell differentiation as a key mechanism. Dysregulation of autophagy/apoptosis and cell cycle signalling feature as recurrently perturbed pathways. Our findings provide further insight into the biological basis of MM.

Introduction

Multiple myeloma (MM) is a malignancy of plasma cells primarily located within the bone marrow. Although no lifestyle or environmental exposures have been consistently linked to an increased risk of MM, the two- to four-fold increased risk observed in relatives of MM patients provides support for inherited genetic predisposition1. Our understanding of MM susceptibility has recently been informed by genome-wide association studies (GWAS), which have so far identified 17 independent risk loci for MM2,3,4,5, with an additional locus being subtype-specific for t(11;14) translocation MM6. Much of the heritable risk of MM, however, remains unexplained and statistical modelling indicates that further common risk variants remain to be discovered7.

To gain a more comprehensive insight into MM aetiology, we performed a new GWAS followed by a meta-analysis with existing GWAS and replication genotyping (totalling 9974 cases and 247,556 controls). Here, we report the identification of six new MM susceptibility loci as well as refined risk estimates for the previously reported loci. In addition, we have investigated the possible gene regulatory mechanisms underlying the associations seen at all 23 GWAS risk loci by analysing in situ promoter Capture Hi-C (CHi-C) in MM cells to characterise chromatin interactions between predisposition single-nucleotide polymorphism (SNPs) and target genes, integrating these data with chromatin immunoprecipitation-sequencing (ChIP-seq) data generated in house and a range of publicly available genomics data. Finally, we have quantified the contribution of both new and previously discovered loci to the heritable risk of MM and implemented a likelihood-based approach to estimate sample sizes required to explain 80% of the heritability.

Results

Association analysis

We conducted a new GWAS using the OncoArray platform8 (878 MM cases and 7083 controls from the UK), followed by a meta-analysis with six published MM GWAS data sets (totalling 7319 cases and 234,385 controls) (Fig. 1, Supplementary Tables 13)2,3,4,5. To increase genomic resolution, we imputed data to >10 million SNPs. Quantile–quantile (Q–Q) plots for SNPs with minor allele frequency (MAF) >1% after imputation did not show evidence of substantive over-dispersion for the OncoArray GWAS (λ = 1.03, λ1000 = 1.02, Supplementary Fig. 1). We derived joint odds ratios (ORs) under a fixed-effects model for each SNP with MAF >1%. Finally, we sought validation of nine SNPs associated at P < 1 × 10−6 in the meta-analysis, which did not map to known MM risk loci and displayed a consistent OR across all GWAS data sets, by genotyping an additional 1777 cases and 6088 controls from three independent series (Germany, Denmark and Sweden). After meta-analysis of the new and pre-existing GWAS data sets and replication series, we identified genome-wide significant associations (i.e. P < 5 × 10−8)9 for six new loci at 2q31.1, 5q23.2, 7q22.3, 7q31.33, 16p11.2 and 19p13.11 (Table 1, Supplementary Table 4 and 5, Fig. 2). Additionally, borderline associations were identified at two loci with P values of 5.93 × 10−8 (6p25.3) and 9.90 × 10−8 (7q21.11), which have corresponding Bayesian false-discovery probabilities (BFDP)10 of 4% and 6%, respectively (Supplementary Table 4 and 5). We found no evidence for significant interactions between any of the 23 risk loci. Finally, we found no evidence to support the existence of the putative risk locus at 2p12.3 (rs1214346), previously proposed by Erickson et al.11 (GWAS meta-analysis P value = 0.32).

Fig. 1
Fig. 1

GWAS study design. Details of the new and existing GWAS samples, including recruitment centres or trials and quality control, are provided in Supplementary Tables 1 and 2. Trials or centres from which replication samples were recruited are detailed in Supplementary Table 3. Ca.: cases, Co.: controls, eQTL: expression quantitative trait loci, SNP: single-nucleotide polymorphism, LD: linkage disequilibrium

Table 1 Summary of genotyping results for all 23 risk SNPs
Fig. 2
Fig. 2

Regional plots of the six new risk loci. Regional plots of loci a 2q31.1, b 5q23.2, c 7q22.3, d 7q31.33, e 16p11.2 and f 19p13.11. Plots show results of the meta-analysis for both genotyped (triangles) and imputed (circles) single-nucleotide polymorphisms (SNPs) and recombination rates. −log10(P) (y axes) of the SNPs are shown according to their chromosomal positions (x axes). The sentinel SNP in each combined analysis is shown as a large circle or triangle and is labelled by its rsID. The colour intensity of each symbol reflects the extent of LD with the top SNP, white (r2 = 0) through to dark red (r2 = 1.0). Genetic recombination rates, estimated using 1000 Genomes Project samples, are shown with a light blue line. Physical positions are based on NCBI build 37 of the human genome. Also shown are the relative positions of genes and transcripts mapping to the region of association. Genes have been redrawn to show their relative positions; therefore maps are not to physical scale. The middle track represents the chromatin-state segmentation track (ChromHMM) for KMS11

Risk SNPs and myeloma phenotype

We did not find any association between sex or age at diagnosis and the 23 MM risk SNPs using case-only analysis (Supplementary Table 6 and 7). Aside from previously reported relationships between the risk loci at 11q13.3 and 5q15 with t(11;14) MM6 and hyperdiploid MM12, respectively, we found no evidence for subtype-specific associations (Supplementary Table 8-11) or an impact on MM-specific survival (Supplementary Table 12). A failure to demonstrate additional relationships may, however, be reflective of limited study power. Collectively, these data suggest that the risk variants are likely to have generic effects on MM development.

Contribution of risk SNPs to heritability

Using linkage disequilibrium adjusted kinships (LDAK)13, the heritability of MM ascribable to all common variation was 15.6% (±4.7); collectively the previously identified and new risk loci account for 15.7% of the GWAS heritability (13.6% and 2.1%, respectively). To assess the collective impact of all identified risk SNPs, we constructed polygenic risk scores (PRS) considering the combined effect of all risk SNPs modelled under a log-normal relative risk distribution14. Using this approach, an individual in the top 1% of genetic risk has a threefold increased risk of MM when compared to an individual with median genetic risk (Supplementary Fig. 2). We observed an enrichment of risk variants among familial MM compared with both sporadic MM cases and population-based controls comparable to that expected in the absence of a strong monogenic predisposition (respective P values 0.027 and 1.60 × 10−5; Supplementary Fig. 3). Undoubtedly, the identification of further risk loci through the analysis of larger GWAS are likely to improve the performance of any PRS model. To estimate the sample size required to explain a greater proportion of the GWAS heritability, we implemented a likelihood-based approach using association statistics in combination with LD information to model the effect-size distribution15,16. The effect-size distributions for susceptibility SNPs were best modelled using the three-component model (mixture of two normal distributions) (Supplementary Fig. 4). Under this model, to identify SNPs explaining 80% of the GWAS heritability is likely to require sample sizes in excess of 50,000 (Supplementary Fig. 5).

Functional annotation and biological inference of risk loci

To the extent that they have been studied, many GWAS risk SNPs localise to non-coding regions and influence gene regulation17. To investigate the functional role of previously reported and new MM risk SNPs, we performed a global analysis of SNP associations using ChIP-seq data generated on the MM cell line KMS11 and publicly accessible naive B-cell Blueprint Epigenome Project data18. We found enrichment of MM SNPs in regions of active chromatin, as indicated by the presence of H3K27ac, H3K4Me3 and H3K4Me1 marks (Supplementary Fig. 6). We also observed an enrichment of relevant B-cell transcription factor-(TF) binding sites using ENCODE GM12878 lymphoblastoid cell line data (Supplementary Fig. 7). Collectively these data support the tenet that the MM predisposition loci influence risk through effects on cis-regulatory networks involved in transcriptional initiation and enhancement.

Since genomic spatial proximity and chromatin looping interactions are key to the regulation of gene expression, we interrogated physical interactions at respective genomic regions in KMS11 and naive B-cells using CHi-C data19. We also sought to gain insight into the possible biological mechanisms for associations by performing an expression quantitative trait locus (eQTL) analysis using mRNA expression data on CD138-purified MM plasma cells; specifically, we used Summary data-based Mendelian Randomisation (SMR) analysis20 to test for pleiotropy between GWAS signal and cis-eQTL for genes within 1 Mb of the sentinel SNP to identify a causal relationship. We additionally annotated risk loci with variants mapping to binding motifs of B-cell-specific TFs. Finally, we catalogued direct promoter variants and non-synonymous coding mutations for genes within risk loci (Supplementary Data 1 and Fig. 1).

Although preliminary, and requiring functional validation, our analysis delineates four potential candidate disease mechanisms across the 23 MM risk loci (Supplementary Data 1). Firstly, four of the risk loci contain candidate genes linked to regulation of cell cycle and genomic instability, as evidenced by Hi-C looping interactions in KMS11 cells to MTAP (at 9p21.3) and eQTL effects for CEP120 (at 5q23.2). CEP120 is required for microtubule assembly and elongation with overexpression of CEP120 leading to uncontrolled centriole elongation21. rs58618031 (7q31.33) maps 5′ of POT1, the protection of telomeres 1 gene. POT1 is part of the shelterin complex that functions to protect telomeres and maintain chromosomal stability22,23. While mutated POT1 is not a feature of MM, it is commonly observed in B-cell chronic lymphocytic leukaemia24,25,26. The looping interaction from the rs58618031 annotated enhancer element implicates ASB15. Members of the ASB family feature as protein components of the ubiquitin–proteasome system, intriguingly a therapeutic target in MM27,28,29.

Second, candidate genes encoding proteins involved in chromatin remodelling were implicated at three of the MM risk loci, supported by promoter variants at 2q31.1, 7q36.1 and 22q13.1. The new locus at 2q31.1 implicates SP3, encoding a TF, which through promoter interaction, has a well-established role in B-cell development influencing the expression of germinal centre genes, including activation-induced cytidine deaminase AID30,31.

Third, the central role IRF4-MYC-mediated apoptosis/autophagy in MM oncogenesis is supported by variation at five loci, including eQTL effects WAC (at 10p12.1) and Hi-C looping interactions (at 8q24.21 and 16q23.1). The 7p15.3 association ascribable to rs4487645 has been documented to influence expression of c-MYC-interacting CDCA7L through differential IRF4 binding32. Similarly, the long-range interaction between CCAT1 (colon cancer-associated transcript 1) and MYC provides an attractive biological basis for the 8q24.21 association, given the notable role of MYC in MM33,34. It is noteworthy that the promising risk locus at 6p25.3 contains IRF4. At the new locus 19p13.11, the missense variant (NP_057354.1:p.Leu104Pro) and the correlated promoter SNP rs11086029 implicates KLF2 in MM biology. Demethylation by KDM3A histone demethylase sustains KLF2 expression and influences IRF4-dependent MM cell survival35. The new 16p11.2 risk locus contains a number of genes including Proline-Rich Protein 14 (PRR14), which is implicated in PI3-kinase/Akt/mTOR signalling, a therapeutic target in myelomatous plasma cells36.

Fourth, loci related to B cell and plasma cell differentiation and function are supported by variation at three loci, including eQTL effects (ELL2 at 5q15) and Hi-C looping interactions (at 6q21). As previously inferred from GM12878 cell line data, the region at 6q21 (rs9372120, ATG5) participates in intra-chromosome looping with the B-cell transcriptional repressor PRDM1 (alias BLIMP1)4. Additionally, SNP rs34562254 at 17p11.2 is responsible for the amino acid substitution (NP_036584.1:p.Pro251Leu) in TNFRSF13B, a key regulator of normal B-cell homoeostasis, which has an established role in MM biology37,38,39,40,41,42.

Discussion

Our meta-analysis of a new GWAS series in conjunction with previously published MM data sets has identified six novel risk loci. Together, the new and previously reported loci explain an estimated 16% of the SNP heritability for MM in European populations. Ancestral differences in the risk of developing MM are well recognised, with a greater prevalence of MM in African Americans as compared with those with European ancestry43. It is plausible that the effects of MM risk SNPs may differ between Europeans and non-Europeans and hence contribute to differences in prevalence rates. Thus far, there has only been limited evaluation of this possibility with no evidence for significant differences44.

Integration of Hi-C data with ChIP-seq chromatin profiling from MM and lymphoblastoid cell lines and naive B cells and eQTL analysis, using patient expression data, has allowed us to gain preliminary insight into the biological basis of MM susceptibility. This analysis suggests a model of MM susceptibility based on transcriptional dysregulation consistent with altered B-cell differentiation, where dysregulation of autophagy/apoptosis and cell cycle signalling feature as recurrently modulated pathways. Specifically, our findings implicate mTOR-related genes ULK4, ATG5 and WAC, and by virtue of the role of IRF4-MYC related autophagy, CDCA7L, DNMT3A, CBX7 and KLF2 in MM development (Supplementary Data 1). Further investigations are necessary to decipher the functional basis of risk SNPs, nevertheless we highlight mTOR signalling and the ubiquitin–proteasome pathway, targets of approved drugs in MM. As a corollary of this, genes elucidated via the functional annotation of GWAS that discovered MM risk loci may represent promising therapeutic targets for myeloma drug discovery. Finally, our estimation of sample sizes required to identify a larger proportion of the heritable risk of MM attributable to common variation underscore the need for further international collaborative analyses.

Methods

Ethics

Collection of patient samples and associated clinico-pathological information was undertaken with written informed consent and relevant ethical review board approval at respective study centres in accordance with the tenets of the Declaration of Helsinki. Specifically for the Myeloma-IX trial by the Medical Research Council (MRC) Leukaemia Data Monitoring and Ethics committee (MREC 02/8/95, ISRCTN68454111), the Myeloma-XI trial by the Oxfordshire Research Ethics Committee (MREC 17/09/09, ISRCTN49407852), HOVON65/GMMG-HD4 (ISRCTN 644552890; METC 13/01/2015), HOVON87/NMSG18 (EudraCTnr 2007-004007-34, METC 20/11/2008), HOVON95/EMN02 (EudraCTnr 2009-017903-28, METC 04/11/10), University of Heidelberg Ethical Commission (229/2003, S-337/2009, AFmu-119/2010), University of Arkansas for Medical Sciences Institutional Review Board (IRB 202077), Lund University Ethical Review Board (2013/54), the Norwegian REK 2014/97, the Danish Ethical Review Board (no. H-16032570) and Icelandic Data Protection Authority (2,001,010,157 and National Bioethics Committee 01/015).

The diagnosis of MM (ICD-10 C90.0) in all cases was established in accordance with World Health Organization guidelines. All samples from patients for genotyping were obtained before treatment or at presentation.

Primary GWAS

We analysed constitutional DNA (EDTA-venous blood derived) from 931 cases ascertained through the UK Myeloma XI trial; detailed in Supplementary Table 1. Cases were genotyped using the Illumina OncoArray (Illumina Inc. San Diego, CA 92122, USA). Controls were also genotyped using the OncoArray and comprised: (1) 2976 cancer-free men recruited by the PRACTICAL Consortium—the UK Genetic Prostate Cancer Study (UKGPCS) (age <65 years), a study conducted through the Royal Marsden NHS Foundation Trust and SEARCH (Study of Epidemiology & Risk Factors in Cancer), recruited via GP practices in East Anglia (2003–2009) and (2) 4446 cancer-free women across the UK, recruited via the Breast Cancer Association Consortium (BCAC).

Standard quality-control measures were applied to the GWAS45. Specifically, individuals with low SNP call rate (<95%) as well as individuals evaluated to be of non-European ancestry (using the HapMap version 2 CEU, JPT/CHB and YRI populations as a reference) were excluded (Supplementary Fig. 8). For apparent first-degree relative pairs, we excluded the control from a case–control pair; otherwise, we excluded the individual with the lower call rate. SNPs with a call rate <95% were excluded as were those with a MAF <0.01 or displaying significant deviation from Hardy–Weinberg equilibrium (P < 10−5). GWAS data were imputed to >10 million SNPs using IMPUTE2 v2.346 software in conjunction with a merged reference panel consisting of data from 1000 Genomes Project47 (phase 1 integrated release 3 March 2012) and UK10K48. Genotypes were aligned to the positive strand in both imputation and genotyping. We imposed predefined thresholds for imputation quality to retain potential risk variants with MAF >0.01 for validation. Poorly imputed SNPs with an information measure <0.80 were excluded. Tests of association between imputed SNPs and MM was performed under an additive model in SNPTESTv2.549. The adequacy of the case–control matching and possibility of differential genotyping of cases and controls was evaluated using a Q–Q plot of test statistics (Supplementary Fig. 1). The inflation λ was based on the 90% least-significant SNPs50 and assessment of λ1000. Details of SNP QC are provided in in Supplementary Table 2.

Published GWAS

The data from six previously reported GWAS2,3,4,5 are summarised in Supplementary Table 1. All these studies were based on individuals with European ancestry and comprised: UK-GWAS (2282 cases, 5197 controls), Swedish-GWAS (1714 cases, 10,391 controls), German-GWAS (1508 cases, 2107 controls), Netherlands-GWAS (555 cases, 2669 controls), US-GWAS (780 cases, 1857 controls) and Iceland (480 cases, 212,164 controls).

Replication studies and technical validation

To validate promising associations, we analysed three case–control series from Germany, Sweden and Denmark, summarised in Supplementary Table 3. The German replication series comprised 911 cases collected by the German Myeloma Study Group (Deutsche Studiengruppe Multiples Myeloma (DSMM)), GMMG, University Clinic, Heidelberg and University Clinic, Ulm. Controls comprised 1477 healthy German blood donors recruited between 2004 and 2007 by the Institute of Transfusion Medicine and Immunology, University of Mannheim, Germany. The Swedish replication series comprised 534 MM cases from the Swedish National Myeloma Biobank and the Danish replication series comprised 332 MM cases from the University Hospital of Copenhagen. As controls, we analysed 2382 Swedish blood donors and 2229 individuals from Denmark and Skåne County, Sweden (the southernmost part of Sweden adjacent to Denmark). Replication genotyping of German and Scandinavian samples was performed using competitive allele-specific PCR KASPar chemistry (LGC, Hertfordshire, UK). Call rates for SNP genotypes were >95% in each of the replication series. To ensure the quality of genotyping in all assays, at least two negative controls and duplicate samples (showing a concordance of >99%) were genotyped at each centre. The fidelity of imputation was assessed by directly sequencing a set of 147 randomly selected samples from the UK OncoArray case series. Imputation was found to be robust; concordance was >90% (Supplementary Table 13). Genotyping and sequencing primers are detailed in Supplementary Table 14 and 15, respectively.

Meta-analysis

Meta-analyses were performed using the fixed-effects inverse-variance method using META v1.651. Cochran’s Q-statistic to test for heterogeneity and the I2 statistic to quantify the proportion of the total variation due to heterogeneity was calculated. Using the meta-analysis summary statistics and LD correlations from a reference panel of the 1000 Genomes Project combined with UK10K, we implemented Genome-wide Complex Trait Analysis52 to perform conditional association analysis. Association statistics were calculated for all SNPs conditioning on the top SNP in each loci showing genome-wide significance. This was carried out step-wise.

For borderline associations, the BFDP10 was calculated based on a plausible OR of 1.2 and a prior probability of association of 0.0001. For both promising associations, the BFDP was <10%.

Fluorescence in situ hybridisation

Fluorescence in situ hybridisation (FISH) and ploidy classification of UK and German samples were performed as previously described53,54. Logistic regression in case-only analyses was used to assess the relationship between SNP genotype and IgH translocations or tumour ploidy.

eQTL analysis

eQTL analyses were performed using CD138-purified plasma cells from 183 UK MyIX trial patients and 658 German GMMG patients32. Briefly, German and UK data were pre-processed separately, followed by analysis using a Bayesian approach to probabilistic estimation of expression residuals to infer broad variance components, accounting for hidden determinants influencing global expression. The association between genotype of SNPs and expression of genes within 1 Mb either side of each MM risk locus was evaluated based on the significance of linear regression coefficients. We pooled data from each study under a fixed-effects model.

The relationship between SNP genotype and gene expression was carried out using SMR analysis as per Zhu et al.2 Briefly, if bxy is the effect size of x (gene expression) on y (slope of y regressed on the genetic value of x), bzx is the effect of z on x and bzy be the effect of z on y, bxy (bzy/bzx) is the effect of x on y. To distinguish pleiotropy from linkage where the top associated cis-eQTL is in LD with two causal variants, one affecting gene expression and the other affecting a trait, we tested for heterogeneity in dependent instruments (HEIDI), using multiple SNPs in each cis-eQTL region. Under the hypothesis of pleiotropy, bxy values for SNPs in LD with the causal variant should be identical. For each probe that passed significance threshold for the SMR test, we tested the heterogeneity in the bxy values estimated for multiple SNPs in the cis-eQTL region using HEIDI.

GWAS summary statistics files were generated from the meta-analysis. We set a threshold for the SMR test of PSMR < 1 × 10−3 corresponding to a Bonferroni correction for 45 tests, i.e. 45 probes which demonstrated an association in the SMR test. For all genes passing this threshold, we generated plots of the eQTL and GWAS associations at the locus, as well as plots of GWAS and eQTL effect sizes (i.e. input for the HEIDI heterogeneity test). HEIDI test P values < 0.05 were considered as reflective of heterogeneity. This threshold is, however, conservative for gene discovery because it retains fewer genes than when correcting for multiple testing. SMR plots for significant eQTLs are shown in Supplementary Fig. 9 and 10, and a summary of results are shown in Supplementary Table 16.

Promoter CHi-C

To map risk SNPs to interactions involving promoter contacts and identify genes involved in MM susceptibility, we analysed publicly accessible promoter CHi-C data on the naive B cells downloaded from Blueprint Epigenome Project. Additionally, we also analysed promoter CHi-C data that we have previously generated for the MM cell line KMS1112. Interactions were called using the CHiCAGO pipeline to obtain a unique list of reproducible contacts55 and those with a −log(weighted P) ≥5 were considered significant.

Chromatin state annotation

Variant sets (i.e. sentinel risk SNP and correlated SNPs, r2>0.8) were annotated for putative functional effect based upon histone mark ChIP-seq data for H3K27ac, H3K4Me1, H3K27Me3, H3K9Me3, H3K36Me3 and H3K27Me3 from KMS11 cell lines, generated in-house and naive B cells from Blueprint Epigenome Project56. We used ChromHMM to infer chromatin states by integrating information on these histone modifications, training the model on three MM cell lines; KMS11, MM1S and JJN3. Genome-wide signal tracks were binarized (including input controls for ChIP-seq data), and a set of learned models were generated using ChromHMM software57. A 12-state model was suitable for interpretation, and biological meaning was assigned to the states based on chromatin marks that use putative rules as previously described (Supplementary Fig. 11).

TF and histone mark enrichment analysis

To examine enrichment in specific TF binding across risk loci, we adapted the method of Cowper-Sal lari et al.58. Briefly, for each risk locus, a region of strong LD (defined as r2>0.8 and D′>0.8) was determined, and these SNPs were considered the associated variant set (AVS). Publically available data on TF ChIP-seq uniform peak data were obtained from ENCODE for the GM12878 cell line, including data for 82 TF and 11 histone marks59. In addition, ChIP-seq peak data for six histone marks from KMS11 cell line were generated in-house, and naive B-cell ChIP-seq data were downloaded from Blueprint Epigenome Project56. For each mark, the overlap of the SNPs in the AVS and the binding sites was assessed to generate a mapping tally. A null distribution was produced by randomly selecting SNPs with the same characteristics as the risk-associated SNPs, and the null mapping tally calculated. This process was repeated 10,000 times, and P values were calculated as the proportion of permutations, where null mapping tally was greater or equal to the AVS mapping tally. An enrichment score was calculated by normalising the tallies to the median of the null distribution. Thus, the enrichment score is the number of standard deviations of the AVS mapping tally from the median of the null distribution tallies. Enrichment plots are shown in Supplementary Fig. 6 and 7.

Functional annotation

For the integrated functional annotation of risk loci, variant sets (i.e. all SNPs in LD r2 > 0.8 with the sentinel SNP) were annotated with: (i) presence of a Hi-C contact linking to a gene promoter, (ii) presence of an association from SMR analysis, (iii) presence of a regulatory ChromHMM state, (iv) evidence of transcription factor binding and (v) presence of a nonsynonymous coding change. Candidate causal genes were then assigned to MM risk loci using the target genes implicated in annotation tracks (i), (ii), (iiii) and (iv). If the data supported multiple gene candidates, the gene with the highest number of individual functional data points was considered as the candidate. Where multiple genes have the same number of data points, all genes are listed. Direct non-synonymous coding variants were allocated additional weighting. Competing mechanisms for the same gene (e.g. both coding and promoter variants) were permitted.

Heritability analysis

We used LDAK to estimate the polygenic variance (i.e. heritability) ascribable to all genotyped and imputed GWAS SNPs from summary statistic data. SNP-specific expected heritability, adjusted for LD, MAF and genotype certainty was calculated from the UK10K and 1000 Genomes data. Samples were excluded with a call rate <0.99 or if individuals were closely related or of divergent ancestry from CEU. Individual SNPs were excluded if they showed deviation from HWE with P < 1 × 10−5, an individual SNP genotype yield <95%, MAF <1%, SNP imputation score <0.99 and the absence of the SNP in the GWAS summary statistic data. This resulted in a total 1,254,459 SNPs which were used to estimate the heritability of MM.

To estimate the sample size required for a given proportion of the GWAS heritability, we implemented a likelihood-based approach to model the effect-size distribution15 using association statistics, from the MM meta-analysis, and LD information, obtained from individuals of European ancestry in the 1000 Genomes Project Phase 3. LD values were based on an r2 threshold of 0.1 and a window size of 1 MB. The goodness of fit of the observed distribution of P values against the expected from a two-component model (single normal distribution) and a three-component model (mixture of two normal distributions) were assessed15, and a better fit was observed for the latter model (Supplementary Figure 4). The percentage of GWAS heritability explained for a projected sample size was determined using this model and is based on power calculations for the discovery of genome-wide significant SNPs. The genetic variance explained was calculated as the proportion of total GWAS heritability explained by SNPs reaching genome-wide significance at a given sample size. The 95% confidence intervals were determined using 10,000 simulations.

PRS for familial MM (n = 38) from 25 families were compared with sporadic MM (n = 1530) and population-based controls (n = 10,171); first as a simple sum of risk alleles and secondly as sum of risk alleles weighted by their log-transformed ORs. Family member scores were averaged. A one-sided Student’s t-test was used to assess difference between groups. The genetic data have been previously described5,60 with the familial MM cases having been identified by linkages of Swedish registry information.

Data availability

SNP genotyping data that support the findings of this study have been deposited in Gene Expression Omnibus with accession codes GSE21349, GSE19784, GSE24080, GSE2658 and GSE15695; in the European Genome-phenome Archive (EGA) with accession code EGAS00000000001; in the European Bioinformatics Institute (Part of the European Molecular Biology Laboratory) (EMBL-EBI) with accession code E-MTAB-362 and E-TABM-1138; and in the database of Genotypes and Phenotypes (dbGaP) with accession code phs000207.v1.p1. Expression data that support the findings of this study have been deposited in GEO with accession codes GSE21349, GSE2658, GSE31161 and EMBL-EBI with accession code E-MTAB-2299. The remaining data are contained within the paper and Supplementary Files or are available from the author upon request. KMS11 Hi-C data used in this manuscript are deposited in EGA under accession number EGAS00001002614. The accession number for the KMS11 ChIP-seq data reported in this paper is EGA: S00001002414. Naive B-cell HiC data used in this work is publically available from Blueprint Blueprint Epigenome Project [https://osf.io/u8tzp/]. ChIP-seq data for H3K27ac, H3K4Me1, H3K27Me3, H3K9Me3, H3K36Me3 and H3K27Me3 from naive B cells are publically available and were obtained from Blueprint Epigenome Project [http://www.blueprint-epigenome.eu/].

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. 1.

    Altieri, A., Chen, B., Bermejo, J. L., Castro, F. & Hemminki, K. Familial risks and temporal incidence trends of multiple myeloma. Eur. J. Cancer 42, 1661–1670 (2006).

  2. 2.

    Broderick, P. et al. Common variation at 3p22.1 and 7p15.3 influences multiple myeloma risk. Nat. Genet. 44, 58–61 (2011).

  3. 3.

    Chubb, D. et al. Common variation at 3q26.2, 6p21.33, 17p11.2 and 22q13.1 influences multiple myeloma risk. Nat. Genet. 45, 1221–1225 (2013).

  4. 4.

    Mitchell, J. S. et al. Genome-wide association study identifies multiple susceptibility loci for multiple myeloma. Nat. Commun. 7, 12050 (2016).

  5. 5.

    Swaminathan, B. et al. Variants in ELL2 influencing immunoglobulin levels associate with multiple myeloma. Nat. Commun. 6, 7213 (2015).

  6. 6.

    Weinhold, N. et al. The CCND1 c.870G>A polymorphism is a risk factor for t(11;14)(q13;q32) multiple myeloma. Nat. Genet. 45, 522–525 (2013).

  7. 7.

    Mitchell, J. S. et al. Implementation of genome-wide complex trait analysis to quantify the heritability in multiple myeloma. Sci. Rep. 5, 12473 (2015).

  8. 8.

    Amos, C. I. et al. The OncoArray Consortium: a network for understanding the genetic architecture of common cancers. Cancer Epidemiol. Biomark. Prev. 26, 126–135 (2017).

  9. 9.

    Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. Science 273, 1516–1517 (1996).

  10. 10.

    Wakefield, J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet. 81, 208–227 (2007).

  11. 11.

    Erickson, S. W. et al. Genome-wide scan identifies variant in 2q12.3 associated with risk for multiple myeloma. Blood 124, 2001–2003 (2014).

  12. 12.

    Li, N. et al. Genetic predisposition to multiple myeloma at 5q15 is mediated by an ELL2 enhancer polymorphism. Cell Rep. 20, 2556–2564 (2017).

  13. 13.

    Speed, D. et al. Reevaluation of SNP heritability in complex human traits. Nat. Genet. 49, 986–992 (2017).

  14. 14.

    Frampton, M. J. et al. Implications of polygenic risk for personalised colorectal cancer screening. Ann. Oncol. 27, 429–434 (2016).

  15. 15.

    Chatterjee, N. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future. Preprint at https://www.biorxiv.org/content/early/2017/08/11/175406 (2017).

  16. 16.

    Chatterjee, N. et al. Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies. Nat. Genet. 45, 400–405 (2013). 405e1-3.

  17. 17.

    Freedman, M. L. et al. Principles for the post-GWAS functional characterization of cancer risk loci. Nat. Genet. 43, 513–518 (2011).

  18. 18.

    Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384.e19 (2016).

  19. 19.

    Risca, V. I. & Greenleaf, W. J. Unraveling the 3D genome: genomics tools for multiscale exploration. Trends Genet. 31, 357–372 (2015).

  20. 20.

    Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016).

  21. 21.

    Comartin, D. et al. CEP120 and SPICE1 cooperate with CPAP in centriole elongation. Curr. Biol. 23, 1360–1366 (2013).

  22. 22.

    Rice, C. et al. Structural and functional analysis of the human POT1-TPP1 telomeric complex. Nat. Commun. 8, 14928 (2017).

  23. 23.

    Pinzaru, A. M. et al. Telomere replication stress induced by POT1 inactivation accelerates tumorigenesis. Cell Rep. 15, 2170–2184 (2016).

  24. 24.

    Chang, S. Cancer chromosomes going to POT1. Nat. Genet. 45, 473–475 (2013).

  25. 25.

    Speedy, H. E. et al. A genome-wide association study identifies multiple susceptibility loci for chronic lymphocytic leukemia. Nat. Genet. 46, 56–60 (2014).

  26. 26.

    Speedy, H. E. et al. Germ line mutations in shelterin complex genes are associated with familial chronic lymphocytic leukemia. Blood 128, 2319–2326 (2016).

  27. 27.

    McDaneld, T. G., Hannon, K. & Moody, D. E. Ankyrin repeat and SOCS box protein 15 regulates protein synthesis in skeletal muscle. Am. J. Physiol. Regul. Integr. Comp. Physiol. 290, R1672–R1682 (2006).

  28. 28.

    McDaneld, T. G. & Spurlock, D. M. Ankyrin repeat and suppressor of cytokine signaling (SOCS) box-containing protein (ASB) 15 alters differentiation of mouse C2C12 myoblasts and phosphorylation of mitogen-activated protein kinase and Akt. J. Anim. Sci. 86, 2897–2902 (2008).

  29. 29.

    Kile, B. T. et al. The SOCS box: a tale of destruction and degradation. Trends Biochem. Sci. 27, 235–241 (2002).

  30. 30.

    Park, S. R. et al. HoxC4 binds to the promoter of the cytidine deaminase AID gene to induce AID expression, class-switch DNA recombination and somatic hypermutation. Nat. Immunol. 10, 540–550 (2009).

  31. 31.

    Steinke, J. W. et al. Identification of an Sp factor-dependent promoter in GCET, a gene expressed at high levels in germinal center B cells. Mol. Immunol. 41, 1145–1153 (2004).

  32. 32.

    Li, N. et al. Multiple myeloma risk variant at 7p15.3 creates an IRF4-binding site and interferes with CDCA7L expression. Nat. Commun. 7, 13656 (2016).

  33. 33.

    Holien, T., Vatsveen, T. K., Hella, H., Waage, A. & Sundan, A. Addiction to c-MYC in multiple myeloma. Blood 120, 2450–2453 (2012).

  34. 34.

    Kuehl, W. M. & Bergsagel, P. L. MYC addiction: a potential therapeutic target in MM. Blood 120, 2351–2352 (2012).

  35. 35.

    Ohguchi, H. et al. The KDM3A-KLF2-IRF4 axis maintains myeloma cell survival. Nat. Commun. 7, 10258 (2016).

  36. 36.

    Yang, M., Lewinska, M., Fan, X., Zhu, J. & Yuan, Z. M. PRR14 is a novel activator of the PI3K pathway promoting lung carcinogenesis. Oncogene 35, 5527–5538 (2016).

  37. 37.

    Moreaux, J. et al. APRIL and TACI interact with syndecan-1 on the surface of multiple myeloma cells to form an essential survival loop. Eur. J. Haematol. 83, 119–129 (2009).

  38. 38.

    Moreaux, J. et al. The level of TACI gene expression in myeloma cells is associated with a signature of microenvironment dependence versus a plasmablastic signature. Blood 106, 1021–1030 (2005).

  39. 39.

    Ju, S. et al. Correlation of expression levels of BLyS and its receptors with multiple myeloma. Clin. Biochem. 42, 387–399 (2009).

  40. 40.

    Novak, A. J. et al. Expression of BCMA, TACI, and BAFF-R in multiple myeloma: a mechanism for growth and survival. Blood 103, 689–694 (2004).

  41. 41.

    Mackay, F. & Schneider, P. TACI, an enigmatic BAFF/APRIL receptor, with new unappreciated biochemical and biological properties. Cytokine Growth Factor Rev. 19, 263–276 (2008).

  42. 42.

    Moreaux, J. et al. TACI expression is associated with a mature bone marrow plasma cell signature and C-MAF overexpression in human myeloma cell lines. Haematologica 92, 803–811 (2007).

  43. 43.

    Landgren, O. & Weiss, B. M. Patterns of monoclonal gammopathy of undetermined significance and multiple myeloma in various ethnic/racial groups: support for genetic factors in pathogenesis. Leukemia 23, 1691–1697 (2009).

  44. 44.

    Rand, K. A. et al. A meta-analysis of multiple myeloma risk regions in African and European ancestry populations identifies putatively functional loci. Cancer Epidemiol. Biomark. Prev. 25, 1609–1618 (2016).

  45. 45.

    Anderson, C.A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564-1573 (2010).

  46. 46.

    Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS. Genet. 5, e1000529 (2009).

  47. 47.

    Abecasis, G. R. et al. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).

  48. 48.

    Huang, J. et al. Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel. Nat. Commun. 6, 8111 (2015).

  49. 49.

    Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906–913 (2007).

  50. 50.

    Clayton, D. G. et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat. Genet. 37, 1243–1246 (2005).

  51. 51.

    Liu, J. Z. et al. Meta-analysis and imputation refines the association of 15q25 with smoking quantity. Nat. Genet. 42, 436–440 (2010).

  52. 52.

    Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).

  53. 53.

    Neben, K. et al. Combining information regarding chromosomal aberrations t(4;14) and del(17p13) with the International Staging System classification allows stratification of myeloma patients undergoing autologous stem cell transplantation. Haematologica 95, 1150–1157 (2010).

  54. 54.

    Walker, B. A. et al. A compendium of myeloma-associated chromosomal copy number abnormalities and their prognostic value. Blood 116, e56–e65 (2010).

  55. 55.

    Cairns, J. et al. CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data. Genome Biol. 17, 127 (2016).

  56. 56.

    Fernandez, J. M. et al. The BLUEPRINT data analysis portal. Cell Syst. 3, 491–495 (2016).

  57. 57.

    Ernst, J. & Kellis, M. ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods 9, 215–216 (2012).

  58. 58.

    Cowper-Sal lari, R. et al. Breast cancer risk-associated SNPs modulate the affinity of chromatin for FOXA1 and alter gene expression. Nat. Genet. 44, 1191–1198 (2012).

  59. 59.

    de Souza, N. The ENCODE project. Nat. Methods 9, 1046 (2012).

  60. 60.

    Halvarsson, B. M. et al. Direct evidence for a polygenic etiology in familial multiple myeloma. Blood Adv. 1, 619–623 (2017).

  61. 61.

    Via, M., Gignoux, C. & Burchard, E. G. The 1000 genomes project: new opportunities for research and social challenges. Genome Med. 2, 3 (2010).

Download references

Acknowledgements

In the United Kingdom, Myeloma UK and Bloodwise provided principal funding. Additional funding was provided by Cancer Research UK (C1298/A8362 supported by the Bobby Moore Fund) and The Rosetrees Trust. M.W. is supported by funding from Mr Ralph Stockwell. A.S. is supported by a clinical fellowship from Cancer Research UK and charitable funds from the Royal Marsden Hospital. N.W. was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number P20GM125503. This study made use of genotyping data on the 1958 Birth Cohort generated by the Wellcome Trust Sanger Institute (http://www.wtccc.org.uk). We thank the High-Throughput Genomics Group at the Wellcome Trust Centre for Human Genetics (funded by Wellcome Trust grant reference 090532/Z/09/Z) for the generation of UK myeloma OncoArray data. The BCAC study would not have been possible without the contributions of the following: Manjeet K. Bolla, Qin Wang, Kyriaki Michailidou and Joe Dennis. BCAC is funded by Cancer Research UK (C1287/A10118, C1287/A16563). For the BBCS study, we thank Eileen Williams, Elaine Ryder-Mills and Kara Sargus. The BBCS is funded by Cancer Research UK and Breast Cancer Now and acknowledges NHS funding to the National Institute of Health Research (NIHR) Biomedical Research Centre (BRC), and the National Cancer Research Network (NCRN). We thank the participants and the investigators of EPIC (European Prospective Investigation into Cancer and Nutrition). The coordination of EPIC is financially supported by the European Commission (DG-SANCO) and the International Agency for Research on Cancer. The national cohorts are supported by: Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, Institut National de la Santé et de la Recherche Médicale (INSERM) (France); German Cancer Aid, German Cancer Research Center (DKFZ), Federal Ministry of Education and Research (BMBF) (Germany); the Hellenic Health Foundation, the Stavros Niarchos Foundation (Greece); Associazione Italiana per la Ricerca sul Cancro-AIRC-Italy and National Research Council (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands); Health Research Fund (FIS), PI13/00061 to Granada, PI13/01162 to EPIC-Murcia, Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, ISCIII RETIC (RD06/0020) (Spain); Cancer Research UK (14136 to EPIC-Norfolk; C570/A16491 and C8221/A19170 to EPIC-Oxford), Medical Research Council (1000143 to EPIC-Norfolk, MR/M012190/1 to EPIC-Oxford) (UK). We thank the SEARCH and EPIC teams which were funded by a programme grant from Cancer Research UK [A16561] and supported by the UK NIHR BRC at the University of Cambridge. We thank Breast Cancer Now and The Institute of Cancer Research (ICR) for support and funding of the UKBGS, and the study participants, study staff, and the doctors, nurses and other health care providers and health information sources who have contributed to the study. We acknowledge NHS funding to the Royal Marsden/ICR NIHR BRC. UKGPCS would like to thank The Institute of Cancer Research and The Everyman Campaign for funding support. The UKGPCS acknowledges The Prostate Cancer Research Foundation, Prostate Action, The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI), the NIHR funding to the NIHR Biomedical Research data managers and consultants for their work in the UKGPCS study and urologists and other persons involved in the planning, and data collection of the CAPS study. Genotyping of the OncoArray was funded by the US National Institutes of Health (NIH) [U19 CA 148537 for ELucidating Loci Involved in Prostate cancer SuscEptibility (ELLIPSE) project and X01HG007492 to the Center for Inherited Disease Research (CIDR) under contract number HHSN268201200008I]. Additional analytic support was provided by NIH NCI U01 CA188392 (PI: Schumacher). The PRACTICAL consortium was supported by Cancer Research UK Grants C5047/A7357, C1287/A10118, C1287/A16563, C5047/A3354, C5047/A10692, C16913/A6135, European Commission’s Seventh Framework Programme grant agreement no. 223175 (HEALTH-F2-2009-223175) and The National Institute of Health (NIH) Cancer Post-Cancer GWAS initiative grant no. 1 U19 CA 148537-01 (the GAME-ON initiative). We would also like to thank the following for funding support: The Institute of Cancer Research and The Everyman Campaign, The Prostate Cancer Research Foundation, Prostate Research Campaign UK (now Prostate Action), The Orchid Cancer Appeal, The National Cancer Research Network UK, The National Cancer Research Institute (NCRI) UK. We are grateful for support of NIHR funding to the NIHR Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. The APBC BioResource, which forms part of the PRACTICAL consortium, consists of the following members: Wayne Tilley, Gail Risbridger, Renea Taylor, Judith A Clements, Lisa Horvath, Vanessa Hayes, Lisa Butler, Trina Yeadon, Allison Eckert, Pamela Saunders, Anne-Maree Haynes, Melissa Papargiris. We thank the staff of the CTRU University of Leeds and the NCRI haematology Clinical Studies Group. The US GWAS was supported by a grant from the National Institutes of Health (P01CA055819). The German study was supported by the Dietmar-Hopp-Stiftung, Germany, the German Cancer Aid (110,131), the German Ministry of Education and Science (CLIOMMICS 01ZX1309), The German Research Council (DFG; Project SI 236/81, SI 236/)-1, ER 155/6-1 and the DFG CRI 216), the Harald Huppert Foundation and the Multiple Myeloma Research Foundation. The patients were collected by the GMMG and DSMM studies. The German GWAS made use of genotyping data from the population-based HNR study, which is supported by the Heinz Nixdorf Foundation (Germany). The genotyping of the Illumina HumanOmni-1 Quad BeadChips of the HNR subjects was financed by the German Center for Neurodegenerative Disorders (DZNE), Bonn. We are grateful to all investigators who contributed to the generation of this data set. The German replication controls were collected by Peter Bugert, Institute of Transfusion Medicine and Immunology, Medical Faculty Mannheim, Heidelberg University, German Red Cross Blood Service of Baden-Württemberg-Hessen, Mannheim, Germany. This work was supported by research grants from the Swedish Foundation for Strategic Research (KF10-0009), the Marianne and Marcus Wallenberg Foundation (2010.0112), the Knut and Alice Wallenberg Foundation (2012.0193), the Swedish Research Council (2012–1753), the Royal Swedish Academy of Science, ALF grants to the University and Regional Laboratories (Labmedicin Skåne), the Siv-Inger and Per-Erik Andersson Foundation, the Medical Faculty at Lund University, the Borås foundation for cancer research, and the Swedish Society of Medicine. We thank Jörgen Adolfsson, Tomas Axelsson, Anna Collin, Ildikó Frigyesi, Patrik Magnusson, Bertil Johansson, Jan Westin and Helga Ögmundsdóttir for their assistance. This work was supported by Center for Translational Molecular Medicine (BioCHIP), a clinical research grant from the European Hematology Association, an EMCR Translational Research Grant, a BMBF grant from CLIOMMICS (01ZX1309A) and FP7 grant MSCNET (LSHC-Ct-2006-037602). We thank the staff of the HOVON, as well as patients and physicians at participating sites. In addition, we also thank Jasper Koenders, Michael Vermeulen, André Uitterlinden and Nathalie van der Velde for their assistance. We are indebted to the clinicians who contributed samples to Swedish, Icelandic, Norwegian and Danish biobanks. We are indebted to the patients and other individuals who participated in the project.

Author information

Author notes

  1. These authors contributed equally: Molly Went, Amit Sud, Asta Försti, Britt-Marie Halvarsson, Niels Weinhold.

  2. These authors jointly supervised the work: Hartmut Goldschmidt, Kari Stefansson, Gareth J. Morgan, Björn Nilsson, Kari Hemminki, Richard S. Houlston.

  3. Deceased: Brian E. Henderson

Affiliations

  1. Division of Genetics and Epidemiology, The Institute of Cancer Research, London, SW7 3RP, UK

    • Molly Went
    • , Amit Sud
    • , Amy Holroyd
    • , Ni Li
    • , Giulia Orlando
    • , Philip J. Law
    • , Jonathan S. Mitchell
    • , Peter Broderick
    • , Anthony Swerdlow
    • , Rosalind A. Eeles
    • , ZSofia Kote-Jarai
    • , Richard S. Houlston
    • , Sara Benlloch
    •  & Sue Ann Ingles
  2. German Cancer Research Center, 69120, Heidelberg, Germany

    • Asta Försti
    • , Bowang Chen
    • , Uta Bertsch
    • , Chiara Campo
    • , Obul R Bandapalli
    • , Miguel Inacio da Silva Filho
    • , Hauke Thomsen
    •  & Kari Hemminki
  3. Center for Primary Health Care Research, Lund University, SE-205 02, Malmo, Sweden

    • Asta Försti
    •  & Kari Hemminki
  4. Hematology and Transfusion Medicine, Department of Laboratory Medicine, BMC B13, Lund University, SE-221 84, Lund, Sweden

    • Britt-Marie Halvarsson
    • , Mina Ali
    • , Urban Gullberg
    • , Ellinor Johnsson
    • , Anna-Karin Wihlborg
    • , Markus Hansson
    •  & Björn Nilsson
  5. Myeloma Institute for Research and Therapy, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA

    • Niels Weinhold
    • , Owen W. Stephens
    • , Brian A. Walker
    • , Faith E. Davies
    •  & Gareth J. Morgan
  6. Department of Internal Medicine V, University of Heidelberg, 69117, Heidelberg, Germany

    • Niels Weinhold
    • , Jens Hillengass
    • , Jolanta Nickel
    •  & Hartmut Goldschmidt
  7. Division of Molecular Pathology, The Institute of Cancer Research, London, SW7 3RP, UK

    • Scott Kimber
    • , David C. Johnson
    • , Martin Kaiser
    •  & Richard S. Houlston
  8. Department of Hematology, Erasmus MC Cancer Institute, 3075 EA, Rotterdam, The Netherlands

    • Mark van Duin
    • , Rowan Kuiper
    • , Annemiek Broyl
    •  & Pieter Sonneveld
  9. deCODE Genetics, Sturlugata 8, IS-101, Reykjavik, Iceland

    • Gudmar Thorleifsson
    • , Daniel F. Gudbjartsson
    • , Thorunn Rafnar
    • , Unnur Thorsteinsdottir
    •  & Kari Stefansson
  10. School of Engineering and Natural Sciences, University of Iceland, IS-101, Reykjavik, Iceland

    • Daniel F. Gudbjartsson
  11. National Centre of Tumor Diseases, 69120, Heidelberg, Germany

    • Uta Bertsch
    •  & Hartmut Goldschmidt
  12. University Clinic of Würzburg, 97080, Würzburg, Germany

    • Hermann Einsele
  13. Clinical Trials Research Unit, University of Leeds, Leeds, LS2 9PH, UK

    • Walter A. Gregory
  14. Institute of Human Genetics, University of Bonn, D-53127, Bonn, Germany

    • Per Hoffmann
    •  & Markus M. Nöthen
  15. Division of Medical Genetics, Department of Biomedicine, University of Basel, 4003, Basel, Switzerland

    • Per Hoffmann
  16. Royal Victoria Infirmary, Newcastle upon Tyne, NE1 4LP, UK

    • Graham H. Jackson
  17. Institute for Medical Informatics, Biometry and Epidemiology, University Hospital Essen, University of Duisburg–Essen, Essen, D-45147, Germany

    • Karl-Heinz Jöckel
  18. Department of Hematology, Landspitali, National University Hospital of Iceland, IS-101, Reykjavik, Iceland

    • Sigurður Y. Kristinsson
  19. Section of Hematology, Sahlgrenska University Hospital, Gothenburg, 413 45, Sweden

    • Ulf-Henrik Mellqvist
  20. Center for Hematology and Regenerative Medicine, SE-171 77, Stockholm, Sweden

    • Hareth Nahi
  21. Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, CB1 8RN, UK

    • Douglas Easton
    • , Paul Pharoah
    • , Alison Dunning
    •  & Nora Pashayan
  22. Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, UK

    • Douglas Easton
    • , Paul Pharoah
    •  & Ali Amin Al Olama
  23. Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK

    • Julian Peto
  24. Genomic Epidemiology Group, German Cancer Research Center (DKFZ), Heidelberg, 69120, Germany

    • Federico Canzian
  25. Division of Breast Cancer Research, The Institute of Cancer Research, London, SW7 3RP, UK

    • Anthony Swerdlow
  26. Royal Marsden NHS Foundation Trust, Fulham Road, London, SW3 6JJ, UK

    • Rosalind A. Eeles
  27. Institute of Population Health, University of Manchester, Manchester, M13 9PL, UK

    • Kenneth Muir
  28. Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK

    • Kenneth Muir
  29. Department of Applied Health Research, University College London, London, WC1E 7HB, UK

    • Nora Pashayan
  30. Department of Genomics, Life & Brain Center, University of Bonn, D-53127, Bonn, Germany

    • Markus M. Nöthen
  31. Wessex Regional Genetics Laboratory, University of Southampton, Salisbury, SP2 8BJ, UK

    • Fiona M. Ross
  32. Hematology Clinic, Skåne University Hospital, SE-221 85, Lund, Sweden

    • Ingemar Turesson
    •  & Markus Hansson
  33. Department of Haematology, University Hospital of Copenhagen at Rigshospitalet, Blegdamsvej 9, DK-2100, Copenhagen, Denmark

    • Annette Vangsted
  34. Department of Haematology, Aarhus University Hospital, Tage-Hansens Gade 2, DK-8000, Aarhus C, Denmark

    • Niels Frost Andersen
  35. Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, Box 8905, N-7491, Trondheim, Norway

    • Anders Waage
  36. Faculty of Medicine, University of Iceland, IS-101, Reykjavik, Iceland

    • Unnur Thorsteinsdottir
  37. Department of Internal Medicine III, University of Ulm, D-89081, Ulm, Germany

    • Christian Langer
  38. Broad Institute, 7 Cambridge Center, Cambridge, MA, 02142, USA

    • Björn Nilsson
  39. Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, 90033, CA, USA

    • Brian E. Henderson
    • , Christopher A. Haiman
    •  & David V. Conti
  40. Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, 44106, OH, USA

    • Fredrick R. Schumacher
  41. Seidman Cancer Center, University Hospitals, Cleveland, 44106, OH, USA

    • Fredrick R. Schumacher
  42. Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 1TN, UK

    • Ali Amin Al Olama
  43. Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, 20892, MD, USA

    • Sonja I. Berndt
    • , Stephen Chanock
    • , Demetrius Albanes
    • , Stephanie Weinstein
    •  & Stella Koutros
  44. Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Stockholm, SE-177 77, Sweden

    • Fredrik Wiklund
    •  & Henrik Gronberg
  45. Epidemiology Research Program, American Cancer Society, 250 Williams Street, Atlanta, 30303, GA, USA

    • Victoria L. Stevens
    •  & Catherine M. Tangen
  46. Australian Prostate Cancer Research Centre-Qld, Institute of Health and Biomedical Innovation and School of Biomedical Science, Queensland University of Technology, Brisbane, 4059, QLD, Australia

    • Jyotsna Batra
    •  & Judith Clements
  47. Translational Research Institute, Brisbane, 4102, QLD, Australia

    • Jyotsna Batra
    •  & Judith Clements
  48. Department of Medical Biochemistry and Genetics, Institute of Biomedicine, University of Turku, Turku, FI-20520, Finland

    • Johanna Schleutker
  49. Tyks Microbiology and Genetics, Department of Medical Genetics, Turku University Hospital, Turku, FI-20520, Finland

    • Johanna Schleutker
  50. BioMediTech, University of Tampere, Tampere, 33100, Finland

    • Johanna Schleutker
  51. Division of Nutritional Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Solna, SE-171 77, Sweden

    • Alicja Wolk
  52. Institute of Cancer Sciences, University of Manchester, Manchester Academic Health Science Centre, Radiotherapy Related Research, The Christie Hospital NHS Foundation Trust, Manchester, M13 9NT, UK

    • Catharine West
  53. Department of Epidemiology, Harvard School of Public Health, Boston, 02115, MA, USA

    • Lorelei Mucci
  54. CeRePP, Pitie-Salpetriere Hospital, Paris, 75013, France

    • Géraldine Cancel-Tassin
  55. UPMC Univ Paris 06, GRC No. 5 ONCOTYPE-URO, CeRePP, Tenon Hospital, Paris, 75020, France

    • Géraldine Cancel-Tassin
  56. Department of Molecular Medicine, Aarhus University Hospital, Aarhus, 8000, Denmark

    • Karina Dalsgaard Sorensen
  57. Department of Clinical Medicine, Aarhus University, Aarhus, 8000, Denmark

    • Karina Dalsgaard Sorensen
  58. Department of Medical Genetics, Oslo University Hospital, Oslo, N-0424, Norway

    • Eli Marie Grindedal
  59. University of Cambridge, Department of Oncology, Addenbrooke’s Hospital, Cambridge, CB2 0QQ, UK

    • David E. Neal
  60. Cancer Research UK Cambridge Research Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK

    • David E. Neal
  61. Nuffield Department of Surgical Sciences, University of Oxford, Oxford, OX3 9DU, UK

    • Freddie C. Hamdy
  62. Faculty of Medical Science, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DU, UK

    • Freddie C. Hamdy
  63. School of Social and Community Medicine, University of Bristol, Bristol, BS8 2PS, UK

    • Jenny L. Donovan
  64. Cancer Epidemiology, Nuffield Department of Population Health, University of Oxford, Oxford, OX3 7LF, UK

    • Ruth C. Travis
  65. Department of Surgical Oncology, Princess Margaret Cancer Centre, Toronto, M5G 2M9, Canada

    • Robert J. Hamilton
  66. Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA

    • Barry Rosenstein
  67. Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA

    • Barry Rosenstein
  68. Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK

    • Yong-Jie Lu
  69. Cancer Epidemiology & Intelligence Division, The Cancer Council Victoria, Melbourne, 3004, VIC, Australia

    • Graham G. Giles
  70. Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, 3053, VIC, Australia

    • Graham G. Giles
  71. Division of Urologic Surgery, Brigham and Womens Hospital, Boston, 02115, MA, USA

    • Adam S. Kibel
  72. Fundación Pública Galega de Medicina Xenómica-SERGAS, Grupo de Medicina Xenómica, CIBERER, IDIS, Santiago de Compostela, 15782, Spain

    • Ana Vega
  73. Centre for Research in Environmental Epidemiology (CREAL), Barcelona Institute for Global Health (ISGlobal), Barcelona, 60803, Spain

    • Manolis Kogevinas
  74. CIBER Epidemiología y Salud Pública (CIBERESP), Madrid, 28029, Spain

    • Manolis Kogevinas
  75. IMIM (Hospital del Mar Research Institute), Barcelona, 08003, Spain

    • Manolis Kogevinas
  76. Universitat Pompeu Fabra (UPF), Barcelona, 08002, Spain

    • Manolis Kogevinas
  77. Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital/Harvard Medical School, Boston, 02115, MA, USA

    • Kathryn L. Penney
  78. Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, 33612, FL, USA

    • Jong Y. Park
  79. Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, 98109, WA, USA

    • Janet L. Stanford
    •  & Lisa F. Newcomb
  80. Department of Epidemiology, School of Public Health, University of Washington, Seattle, 98195, WA, USA

    • Janet L. Stanford
  81. International Hereditary Cancer Center, Department of Genetics and Pathology, Pomeranian Medical University, Szczecin, 70-001, Poland

    • Cezary Cybulski
  82. Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, 1165, Denmark

    • Børge G. Nordestgaard
  83. Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2900, Denmark

    • Børge G. Nordestgaard
  84. Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, 69120, Germany

    • Hermann Brenner
  85. German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, 69120, Germany

    • Hermann Brenner
  86. Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, 69120, Germany

    • Hermann Brenner
  87. Institute for Human Genetics, University Hospital Ulm, Ulm, 89081, Germany

    • Christiane Maier
  88. Department of Genitourinary Medical Oncology, The University of Texas M. D. Anderson Cancer Center, Houston, 77030, TX, USA

    • Jeri Kim
  89. Cancer Prevention Institute of California, Fremont, 94538, CA, USA

    • Esther M. John
  90. Department of Health Research & Policy (Epidemiology) and Stanford Cancer Institute, Stanford University School of Medicine, Stanford, 94305, CA, USA

    • Esther M. John
  91. Department of Genetics, Portuguese Oncology Institute of Porto, Porto, 4200-072, Portugal

    • Manuel R. Teixeira
  92. Biomedical Sciences Institute (ICBAS), University of Porto, Porto, 4200-072, Portugal

    • Manuel R. Teixeira
  93. Department of Population Sciences, Beckman Research Institute of the City of Hope, Duarte, 91016, CA, USA

    • Susan L. Neuhausen
  94. Ghent University, Faculty of Medicine and Health Sciences, Basic Medical Sciences, Ghent, 9000, Belgium

    • Kim De Ruyck
  95. Department of Surgery, Faculty of Medicine, University of Malaya, Kuala Lumpur, 50603, Malaysia

    • Azad Razack
  96. Department of Urology, University of Washington, Seattle, 98105, WA, USA

    • Lisa F. Newcomb
  97. Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Hamburg, 20246, Germany

    • Davor Lessel
  98. Molecular Medicine Center, Department of Medical Chemistry and Biochemistry, Medical University, Sofia, 1431, Bulgaria

    • Radka Kaneva
  99. Department of Oncology, Cross Cancer Institute, University of Alberta, Edmonton, T6G 2R3, Alberta, Canada

    • Nawaid Usmani
  100. Division of Radiation Oncology, Cross Cancer Institute, Edmonton, T6G 1Z2, AB, Canada

    • Nawaid Usmani
  101. Molecular Endocrinology Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, Leuven, 3000, Belgium

    • Frank Claessens
  102. Institute of Cancer Sciences, Manchester Cancer Research Centre, University of Manchester, Manchester Academic Health Science Centre, St. Mary’s Hospital, Manchester, M13 9WL, UK

    • Paul A. Townsend
  103. Genomic Medicine Group, Galician Foundation of Genomic Medicine, Instituto de Investigacion Sanitaria de Santiago de Compostela (IDIS), Complejo Hospitalario Universitario de Santiago, Servicio Galego de Saúde, SERGAS, Santiago De Compostela, 15706, Spain

    • Manuela Gago Dominguez
  104. University of California San Diego, Moores Cancer Center, La Jolla, 92093, CA, USA

    • Manuela Gago Dominguez
  105. Department of Urology, Erasmus University Medical Center, Rotterdam, 3015, The Netherlands

    • Monique J. Roobol
  106. Cancer & Environment Group, Center for Research in Epidemiology and Population Health (CESP), INSERM, University Paris-Sud, University Paris-Saclay, Villejuif, F-94805, France

    • Florence Menegaux
  107. Clinical Gerontology Unit, University of Cambridge, Cambridge, CB2 2QQ, UK

    • Kay-Tee Khaw
  108. Division of Genetic Epidemiology, Department of Medicine, University of Utah School of Medicine, Salt Lake City, 84132, UT, USA

    • Lisa Cannon-Albright
  109. George E. Wahlen Department of Veterans Affairs Medical Center, Salt Lake City, 84148, UT, USA

    • Lisa Cannon-Albright
  110. The University of Surrey, Guildford, Surrey, GU2 7XH, UK

    • Hardev Pandha
    •  & Stephen N. Thibodeau

Authors

  1. Search for Molly Went in:

  2. Search for Amit Sud in:

  3. Search for Asta Försti in:

  4. Search for Britt-Marie Halvarsson in:

  5. Search for Niels Weinhold in:

  6. Search for Scott Kimber in:

  7. Search for Mark van Duin in:

  8. Search for Gudmar Thorleifsson in:

  9. Search for Amy Holroyd in:

  10. Search for David C. Johnson in:

  11. Search for Ni Li in:

  12. Search for Giulia Orlando in:

  13. Search for Philip J. Law in:

  14. Search for Mina Ali in:

  15. Search for Bowang Chen in:

  16. Search for Jonathan S. Mitchell in:

  17. Search for Daniel F. Gudbjartsson in:

  18. Search for Rowan Kuiper in:

  19. Search for Owen W. Stephens in:

  20. Search for Uta Bertsch in:

  21. Search for Peter Broderick in:

  22. Search for Chiara Campo in:

  23. Search for Obul R Bandapalli in:

  24. Search for Hermann Einsele in:

  25. Search for Walter A. Gregory in:

  26. Search for Urban Gullberg in:

  27. Search for Jens Hillengass in:

  28. Search for Per Hoffmann in:

  29. Search for Graham H. Jackson in:

  30. Search for Karl-Heinz Jöckel in:

  31. Search for Ellinor Johnsson in:

  32. Search for Sigurður Y. Kristinsson in:

  33. Search for Ulf-Henrik Mellqvist in:

  34. Search for Hareth Nahi in:

  35. Search for Douglas Easton in:

  36. Search for Paul Pharoah in:

  37. Search for Alison Dunning in:

  38. Search for Julian Peto in:

  39. Search for Federico Canzian in:

  40. Search for Anthony Swerdlow in:

  41. Search for Rosalind A. Eeles in:

  42. Search for ZSofia Kote-Jarai in:

  43. Search for Kenneth Muir in:

  44. Search for Nora Pashayan in:

  45. Search for Jolanta Nickel in:

  46. Search for Markus M. Nöthen in:

  47. Search for Thorunn Rafnar in:

  48. Search for Fiona M. Ross in:

  49. Search for Miguel Inacio da Silva Filho in:

  50. Search for Hauke Thomsen in:

  51. Search for Ingemar Turesson in:

  52. Search for Annette Vangsted in:

  53. Search for Niels Frost Andersen in:

  54. Search for Anders Waage in:

  55. Search for Brian A. Walker in:

  56. Search for Anna-Karin Wihlborg in:

  57. Search for Annemiek Broyl in:

  58. Search for Faith E. Davies in:

  59. Search for Unnur Thorsteinsdottir in:

  60. Search for Christian Langer in:

  61. Search for Markus Hansson in:

  62. Search for Hartmut Goldschmidt in:

  63. Search for Martin Kaiser in:

  64. Search for Pieter Sonneveld in:

  65. Search for Kari Stefansson in:

  66. Search for Gareth J. Morgan in:

  67. Search for Kari Hemminki in:

  68. Search for Björn Nilsson in:

  69. Search for Richard S. Houlston in:

Consortia

  1. The PRACTICAL consortium

Contributions

M.W. and R.S.H. designed the study. M.W. and R.S.H. drafted the manuscript with contributions from K.H., B.N., A.S. and N.W. M.W. performed principal statistical and bioinformatics analyses. A.S., N.L., P.L., D.C.J., J.S.M. and G.O. performed additional bioinformatics analyses. S.K. performed in situ CHi-C. P.B. coordinated UK laboratory analyses. M.W. and A.H. performed sequencing of UK samples. D.C.J. managed and prepared Myeloma IX and Myeloma XI Case Study DNA samples. M.K., G.J.M., F.E.D., W.A.G. and G.H.J. performed ascertainment and collection of Case Study samples. B.A.W. performed UK expression analyses. F.M.R. performed UK fluorescence in situ hybridisation analyses. The PRACTICAL consortium, D.E., P.P., A.D., J.P., F.C., R.E., Z.K.-J., K.M. and N.P. provided control samples for the UK OncoArray GWAS. H.G., U.B., J.H., J.N. and N.W. coordinated and managed Heidelberg samples. C.L. and H.E. coordinated and managed Ulm/Wurzburg samples. A.F. coordinated German genotyping. C.C. and O.R.B. performed German genotyping. P.H. and M.M.N. performed GWAS of German cases and controls. H.T., B.C. and M.I.d.S.F. carried out statistical analysis. K.H. coordinated the German part of the project. M.M.N. generated genotype data from the Heinz-Nixdorf recall study. K.-H.J. contributed towards the Heinz-Nixdorf control data set. N.N. from Bonn and K.-H.J. provided samples for the German GWAS. M.H. and B.N. coordinated the Swedish/Norwegian part of the project. M.A. and B.-M.H. performed data analysis. E.J., A.-K.W., U.-H.M., H.N., A.V., N.F.A., A.W., I.T. and U.G. performed sample acquisition, sample preparation, clinical data acquisition and additional data analyses of Sweden/Norway samples. In Iceland, G.T. and D.F.G. performed statistical analysis. S.Y.K. provided clinical information. T.R. performed additional analyses. U.T. and K.S. performed project oversight. M.v.D., P.S., A.B. and R.K. coordinated and prepared HOVON65/GMMG-HD4, HOVON87/NMSG18 and HOVON95/EMN02 studies for participating in this study, and coordinated genotyping and pre-processing. At the Myeloma Institute, University of Arkansas for Medical Sciences, N.W. coordinated the US part of the project and performed statistical and eQTL analyses. O.W.S. and N.W. managed Case Study samples. G.J.M. and F.E.D. performed ascertainment and collection of Case Study samples.

Competing interests

G.T., D.F.G., T.R., K.S. and U.T. are employed by deCode Genetics/Amgen Inc. The remaining authors declare no competing interests.

Corresponding authors

Correspondence to Kari Hemminki or Björn Nilsson or Richard S. Houlston.

Electronic supplementary material

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/s41467-018-04989-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.