Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia

Law, Philip J.; Berndt, Sonja I.; Speedy, Helen E.; Camp, Nicola J.; Sava, Georgina P.; Skibola, Christine F.; Holroyd, Amy; Joseph, Vijai; Sunter, Nicola J.; Nieters, Alexandra; Bea, Silvia; Monnereau, Alain; Martin-Garcia, David; Goldin, Lynn R.; Clot, Guillem; Teras, Lauren R.; Quintela, Inés; Birmann, Brenda M.; Jayne, Sandrine; Cozen, Wendy; Majid, Aneela; Smedby, Karin E.; Lan, Qing; Dearden, Claire; Brooks-Wilson, Angela R.; Hall, Andrew G.; Purdue, Mark P.; Mainou-Fowler, Tryfonia; Vajdic, Claire M.; Jackson, Graham H.; Cocco, Pierluigi; Marr, Helen; Zhang, Yawei; Zheng, Tongzhang; Giles, Graham G.; Lawrence, Charles; Call, Timothy G.; Liebow, Mark; Melbye, Mads; Glimelius, Bengt; Mansouri, Larry; Glenn, Martha; Curtin, Karen; Diver, W Ryan; Link, Brian K.; Conde, Lucia; Bracci, Paige M.; Holly, Elizabeth A.; Jackson, Rebecca D.; Tinker, Lesley F.; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Maynadie, Marc; McKay, James; Albanes, Demetrius; Weinstein, Stephanie; Wang, Zhaoming; Caporaso, Neil E.; Morton, Lindsay M.; Severson, Richard K.; Riboli, Elio; Vineis, Paolo; Vermeulen, Roel C. H.; Southey, Melissa C.; Milne, Roger L.; Clavel, Jacqueline; Topka, Sabine; Spinelli, John J.; Kraft, Peter; Ennas, Maria Grazia; Summerfield, Geoffrey; Ferri, Giovanni M.; Harris, Robert J.; Miligi, Lucia; Pettitt, Andrew R.; North, Kari E.; Allsup, David J.; Fraumeni, Joseph F.; Bailey, James R.; Offit, Kenneth; Pratt, Guy; Hjalgrim, Henrik; Pepper, Chris; Chanock, Stephen J.; Fegan, Chris; Rosenquist, Richard; de Sanjose, Silvia; Carracedo, Angel; Dyer, Martin J. S.; Catovsky, Daniel; Campo, Elias; Cerhan, James R.; Allan, James M.; Rothman, Nathanial; Houlston, Richard; Slager, Susan

doi:10.1038/ncomms14175

Download PDF

Article
Open access
Published: 06 February 2017

Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia

Philip J. Law ORCID: orcid.org/0000-0001-9663-4611¹^na1,
Sonja I. Berndt²^na1,
Helen E. Speedy¹^na1,
Nicola J. Camp³^na1,
Georgina P. Sava¹^na1,
Christine F. Skibola⁴^na1,
Amy Holroyd¹,
Vijai Joseph ORCID: orcid.org/0000-0002-7933-151X⁵,
Nicola J. Sunter⁶,
Alexandra Nieters⁷,
Silvia Bea⁸,
Alain Monnereau^9,10,11,
David Martin-Garcia⁸,
Lynn R. Goldin²,
Guillem Clot⁸,
Lauren R. Teras¹²,
Inés Quintela¹³,
Brenda M. Birmann¹⁴,
Sandrine Jayne¹⁵,
Wendy Cozen^16,17,
Aneela Majid¹⁵,
Karin E. Smedby¹⁸,
Qing Lan²,
Claire Dearden¹⁹,
Angela R. Brooks-Wilson^20,21,
Andrew G. Hall⁶,
Mark P. Purdue²,
Tryfonia Mainou-Fowler²²,
Claire M. Vajdic ORCID: orcid.org/0000-0002-3612-8298²³,
Graham H. Jackson²⁴,
Pierluigi Cocco²⁵,
Helen Marr⁶,
Yawei Zhang²⁶,
Tongzhang Zheng²⁶,
Graham G. Giles^27,28,
Charles Lawrence²⁹,
Timothy G. Call³⁰,
Mark Liebow³¹,
Mads Melbye^32,33,
Bengt Glimelius³⁴,
Larry Mansouri³⁴,
Martha Glenn³,
Karen Curtin³,
W Ryan Diver³⁵,
Brian K. Link³⁶,
Lucia Conde⁴,
Paige M. Bracci³⁷,
Elizabeth A. Holly³⁷,
Rebecca D. Jackson³⁸,
Lesley F. Tinker³⁹,
Yolanda Benavente^40,41,
Paolo Boffetta⁴²,
Paul Brennan⁴³,
Marc Maynadie⁴⁴,
James McKay⁴³,
Demetrius Albanes²,
Stephanie Weinstein²,
Zhaoming Wang⁴⁵,
Neil E. Caporaso²,
Lindsay M. Morton²,
Richard K. Severson⁴⁶,
Elio Riboli⁴⁷,
Paolo Vineis^48,49,
Roel C. H. Vermeulen^50,51,
Melissa C. Southey⁵²,
Roger L. Milne ORCID: orcid.org/0000-0001-5764-7268^27,28,
Jacqueline Clavel^53,54,
Sabine Topka⁵,
John J. Spinelli^55,56,
Peter Kraft^57,58,
Maria Grazia Ennas⁵⁹,
Geoffrey Summerfield⁶⁰,
Giovanni M. Ferri⁶¹,
Robert J. Harris⁶²,
Lucia Miligi⁶³,
Andrew R. Pettitt⁶²,
Kari E. North^64,65,
David J. Allsup ORCID: orcid.org/0000-0001-6159-6109⁶⁶,
Joseph F. Fraumeni²,
James R. Bailey⁶⁶,
Kenneth Offit⁵,
Guy Pratt⁶⁷,
Henrik Hjalgrim³²,
Chris Pepper⁶⁸,
Stephen J. Chanock²,
Chris Fegan⁶⁹,
Richard Rosenquist³⁴,
Silvia de Sanjose^42,43,
Angel Carracedo^13,70,
Martin J. S. Dyer¹⁵,
Daniel Catovsky⁷¹,
Elias Campo ORCID: orcid.org/0000-0001-9850-9793^8,72,
James R. Cerhan⁷³,
James M. Allan⁶,
Nathanial Rothman²,
Richard Houlston¹^na2 &
…
Susan Slager⁷³^na2

Nature Communications volume 8, Article number: 14175 (2017) Cite this article

8286 Accesses
59 Citations
45 Altmetric
Metrics details

Subjects

Abstract

Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10⁻¹³), 1q42.13 (rs41271473, P=1.06 × 10⁻¹⁰), 4q24 (rs71597109, P=1.37 × 10⁻¹⁰), 4q35.1 (rs57214277, P=3.69 × 10⁻⁸), 6p21.31 (rs3800461, P=1.97 × 10⁻⁸), 11q23.2 (rs61904987, P=2.64 × 10⁻¹¹), 18q21.1 (rs1036935, P=3.27 × 10⁻⁸), 19p13.3 (rs7254272, P=4.67 × 10⁻⁸) and 22q13.33 (rs140522, P=2.70 × 10⁻⁹). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response.

Genome-wide association study identifies risk loci for progressive chronic lymphocytic leukemia

Article Open access 28 January 2021

Insight into genetic predisposition to chronic lymphocytic leukemia from integrative epigenomics

Article Open access 09 August 2019

Genome-wide association study identifies susceptibility loci for acute myeloid leukemia

Article Open access 29 October 2021

Introduction

Chronic lymphocytic leukaemia (CLL) is an indolent B-cell malignancy that has a strong genetic component, as evidenced by the eightfold increased risk seen in relatives of CLL patients¹. Our understanding of CLL genetics has been transformed by genome-wide association studies (GWAS) that have identified risk alleles for CLL^{2,3,4,5,6,7,8,9}. So far, common genetic variation at 33 loci has been shown to influence CLL risk. Although projections indicate that additional risk variants for CLL can be discovered by GWAS, the statistical power of the individual existing studies is limited.

To gain a more comprehensive insight into CLL predisposition, we analysed genome-wide association data from populations of European ancestry from Europe, North America and Australia, identifying nine new risk loci. Our findings provide additional insights into the genetic and biological basis of CLL risk.

Results

Association analysis

After quality control, the six GWAS provided single-nucleotide polymorphism (SNP) genotypes on 4,478 cases and 13,213 controls (Supplementary Tables 1 and 2). To increase genomic resolution, we imputed >10 million SNPs using the 1000 Genomes Project¹⁰ combined with UK10K¹¹ as reference. Quantile–Quantile (Q–Q) plots for SNPs with minor allele frequency (MAF) >0.5% post imputation did not show evidence of substantive overdispersion (λ between 1.00 and 1.10 across the studies; Supplementary Fig. 1). Meta-analysing the association test results from the six series, we derived joint odds ratios per-allele and 95% confidence intervals under a fixed-effects model for each SNP and associated P values. In this analysis, associations for the established risk loci were consistent in direction and magnitude of effect with previously reported studies (Fig. 1 and Supplementary Table 3).

**Figure 1: Manhattan plot of association P values.**

We identified 16 loci where at least one SNP showed evidence of association with CLL (defined as P<1.0 × 10⁻⁷ in fixed-effects meta-analysis of the six series) and which were not previously implicated with CLL risk at genome-wide significance (that is, P<5.0 × 10⁻⁸; Table 1 and Supplementary Tables 4 and 5). Where the signal was provided by an imputed SNP, we confirmed the fidelity of imputation by genotyping (Supplementary Table 6). We substantiated the 16 SNPs using de novo genotyping in two studies and in silico replication in two additional studies, totalling 1,722 cases and 4,385 controls. Meta-analysis of the discovery and replication studies revealed genome-wide significant associations for eight novel loci (Table 1) at 1p36.11 (rs34676223, P=5.04 × 10⁻¹³), 1q42.13 (rs41271473, P=1.06 × 10⁻¹⁰), 4q35.1 (rs57214277, P=3.69 × 10⁻⁸), 6p21.31 (rs3800461, P=1.97 × 10⁻⁸), 11q23.2 (rs61904987, P=2.64 × 10⁻¹¹), 18q21.1 (rs1036935, P=3.27 × 10⁻⁸), 19p13.3 (rs7254272, P=4.67 × 10⁻⁸) and 22q13.33 (rs140522, P=2.70 × 10⁻⁹). We also confirmed 4q24 (rs71597109, P=1.37 × 10⁻¹⁰), which has previously been identified as a suggestive risk locus⁹. Conditional analysis of GWAS data showed no evidence for additional independent signals at these nine loci. In the remaining seven loci that did not replicate with genome-wide significance, the 9q22.33 locus (rs7026022, P=7.00 × 10⁻⁸) remains suggestive (Supplementary Table 5). In analyses limited to the exomes of 141 CLL cases from 66 families, we found no evidence to suggest that any of the association signals might be a consequence of linkage disequilibrium (LD) with a rare disruptive coding variant.

Table 1 Summary results for SNPs associated with CLL risk.

Full size table

Several of the newly identified risk SNPs map in or near to genes with established roles in B-cell biology, hence representing credible candidates for susceptibility to CLL. The 4q24 association marked by rs71597109 (Fig. 2) maps to intron 1 of the gene encoding BANK1 (B-cell scaffold protein with ankyrin repeats 1), a B-cell-specific scaffold protein. SNPs at this locus have been associated with systemic lupus erythematosus risk¹². BANK1 expression is only seen in functional B-cell antigen receptor (BCR)-expressing B cells, mediating effects through LYN-mediated tyrosine phosphorylation of inositol triphosphate receptors. BANK1-deficient mice display higher levels of mature B cells and spontaneous germinal centre B cells¹³, while studies in humans found lower BANK1 transcript levels in CLL versus normal B cells¹⁴. The 19p13.3 association marked by rs7254272 (Fig. 2) maps 2.5 kb 5′ to ZBTB7A (zinc finger and BTB domain-containing protein 7a, alias LRF, leukaemia/lymphoma-related factor, pokemon). ZBTB7A is a master regulator of B versus T lymphoid fate. Loss of ZBTB7A results in aberrant activation of the NOTCH pathway in lymphoid progenitors. NOTCH is constitutively activated in CLL and is a determinant of resistance to apoptosis in CLL cells. rs34676223 at 1p36.11 maps ∼10 kb upstream of MDS2 (Fig. 2), which is the fusion partner of ETV6 in t(1;12)(p36;p13) myelodysplasia. Based on RNA sequencing (RNA-seq) data from patients, MDS2 is overexpressed in CLL versus normal cells and also differentially expressed between two experimentally determined CLL subgroups¹⁴. The SNP rs57214277 maps to 4q35.1 and resides ∼140 kb centromeric to IRF2 (interferon regulatory factor 2, Fig. 2). Interferon (IFN)-αβ, a family of antiviral immune genes, induces IRF2 that inhibits the reactivation of murine gamma herpesvirus¹⁵. Furthermore, SNPs in strong LD with rs57214277 are associated with increased expression of IRF2 as well as trans-regulation of a network of genes in lipopolysaccharide and IFNγ-treated monocytes¹⁶. rs140522 maps to 22q13.33 (Fig. 2), which has previously been associated with multiple sclerosis risk¹⁷. This region of LD contains four genes, of which only NCAPH2 (non-SMC condensin II complex subunit H2) shows differential expression between CLL and normal B cells¹⁴ (∼2.5-fold lower levels in CLL), and plays an essential role in mitotic chromosome assembly and segregation. rs41271473, rs3800461, rs61904987 and rs1036935 mark genes that have roles in WNT signalling (RHOU), autophagy (C6orf106), transcriptional activation (CXXC1), kinetochore association (SKA1, ZW10) and protein degradation (USP28, TMPRSS5; Fig. 3).

**Figure 2: Regional plots of association results and recombination rates for new risk loci for chronic lymphocytic leukaemia.**

**Figure 3: Regional plots of association results and recombination rates for new risk loci for chronic lymphocytic leukaemia.**

New CLL risk SNPs and clinical phenotype

We tested for differences in the associations by sex or age at diagnosis for each of the nine risk SNPs using case-only analysis, and observed no relationships (Supplementary Data 1). In addition, case-only analysis in a subset of studies provided no evidence for associations between risk SNP genotypes and IGVH (immunoglobulin variable region heavy chain) mutation subtype (Supplementary Data 1) or overall patient survival (Supplementary Table 7). Collectively, these data suggest that these nine risk variants have generic effects on CLL development rather than tumour progression per se.

Functional annotation of new risk loci

To gain insight into the biological basis underlying the novel association signals, we first evaluated profiles for three histone marks (H3K4me1, H3K27ac marking active chromatin and the repressive mark H3K27me3) at each locus, in GM12878 lymphoblastoid cell line (LCL; ref. 18) as well as primary CLL samples¹⁹ (Supplementary Fig. 2). We also examined ATAC-seq profiles from CLL samples and primary B cells as a marker of chromatin accessibility^19,20. Since the strongest associated GWAS SNP may not represent the causal variant, we examined signals across an interval spanning all variants in LD r²>0.2 with the sentinel SNP (based on the 1000 Genomes EUR reference panel). These data revealed regions of active chromatin state at all nine risk loci, in at least one of the cell types. Furthermore, based on the analyses of Hnisz et al.²¹, five of the loci fall within regions designated as ‘super enhancers’ in either LCLs or CD19 B cells (Supplementary Fig. 2). Overall, these findings suggest that the risk loci annotate regulatory regions and may, therefore, have an impact on CLL risk through modulation of enhancer or promoter activity.

Given the possibility that SNPs might influence enhancer or promoter activity by causing changes in transcription factor (TF) binding, we next evaluated the SNPs at each GWAS locus based on their overlap with TF-binding sites. In the absence of comprehensive TF chromatin immunoprecipitation sequencing (ChIP-Seq) data from CLL samples, we used regions of chromatin accessibility defined by ATAC-seq data¹⁹ as a surrogate marker for TF binding, identifying 47 SNPs in LD r²>0.2 with the sentinel SNPs that also overlapped ATAC-seq peaks. Using motifbreakR²² to predict whether these SNPs might disrupt TF-binding motifs, we found 478 potentially disrupted motifs, corresponding to 349 TF-binding sites (Supplementary Table 7). Moreover, at 10 of the SNPs, the altered motif matched the TFs bound in ChIP-seq data from the ENCODE project (Supplementary Table 8 and Supplementary Fig. 3). In particular, we noted that rs13149699 at 4q35 (r²=0.83 with lead SNP rs57214277) was predicted to disrupt SPI1 binding. In addition, rs13149699 showed evidence of evolutionary constraint, and in LCL ChIP-seq data, the SNP was bound by SPI1 as well as other TFs with roles in B-cell function including IRF4, PAX5, POU2F2 (alias OCT2) and RELA (Supplementary Table 8).

We explored whether there was any association between the genotypes of the nine new risk SNPs and the transcript levels of genes within 1 Mb of each respective variant by performing expression quantitative trait loci (eQTL) analysis using gene expression profiles of 468 CLL cases. In addition, we interrogated publicly accessible expression data on whole blood and LCLs (Supplementary Data 2). There were significant (false discovery rate (FDR)<0.05) and consistent eQTLs between rs3800461 and C6orf106, rs1036935 and SKA1, rs140522 and ODF3B, and rs140522 and TYMP.

Biological inference of all CLL risk loci

Given our observation that the nine novel risk loci annotate putative regulatory regions, we sought to examine the epigenetic landscape of CLL risk loci on a broader scale, evaluating the enrichment of both histone modifications (N=11) and TF binding (N=82) in GM12878 LCLs, across the new and previously published CLL GWAS risk SNPs. Using the variant set enrichment method of Cowper-Sal lari et al.²³, we identified regions of strong LD (defined as r²>0.8 and D′>0.8) and determined the overlap between these variants and ENCODE ChIP-seq data. Imposing a P value threshold of 5.37 × 10⁻⁴ (that is, 0.05/93, based on permutation), we identified a significant enrichment of histone marks associated with active enhancer and promoter elements (HK4Me1, H3K27ac and H3K9ac) as well as actively transcribed regions (H3K36me3). We also identified an over-representation of TF binding for POLR2A, IRF4, RUNX3, NFATC1, STAT5A, PML and WRNIP1 (Fig. 4). In addition, although not statistically significant, POU2F2 showed evidence for enriched binding (P=7.78 × 10⁻⁴). Several of these TFs have established roles in B-cell function. OCT2, IRF4 and RUNX3 have been shown to be targeted for hypomethylation in B cells²⁴. MYC is a direct target of IRF4 in activated B cells, with IRF4 being itself a direct target of MYC transactivation. It is noteworthy that variations at IRF4 and 8q24-MYC are recognized risk factors for CLL^2,3. Collectively, these findings are consistent with CLL GWAS SNPs mapping within regions of active chromatin state that exert effects on B-cell cis-regulatory networks.

**Figure 4: Enrichment of transcription factors and histone marks.**

We investigated the genetic pathways between the gene products in proximity to the GWAS SNPs using the LENS pathway tool²⁵. These gene products were primarily involved in immune response, BCR-mediated signalling, apoptosis and maintenance of chromosome integrity, as well as interconnectivity between the gene products (Fig. 5). Pathways that were enriched included those related to interferon signalling and apoptosis (Supplementary Data 3).

**Figure 5: Hive Plot of common protein–protein interactions in CLL.**

Impact of risk SNPs on heritability of CLL

By fitting all SNPs from GWAS simultaneously using Genome-wide Complex Trait Analysis, the estimated heritability of CLL attributable to all common variation is 34% (±5%), thus having potential to explain 57% of the overall familial risk. This estimate represents the additive variance and, therefore, does not include the potential impact of interactions or dominance effects or gene–environment interactions, having an impact on CLL risk. The currently identified risk SNPs (newly discovered and previously identified) only account for 25% of the additive heritable risk.

Discussion

Besides providing additional evidence for genetic susceptibility to CLL, the new and established risk loci identified further insights into the biological basis of CLL development. These loci annotate genes that participate in interconnecting cellular pathways, which are central to B-cell development. In particular, we note the involvement of BCR-mediated signalling with immune responses and apoptosis. Importantly, gene discovery initiatives can have an impact on the successful development of new therapeutic agents²⁶. In this respect it is notable that Ibrutinib²⁷ (a BTK inhibitor) and Idelalisib²⁸ (a PI3KCD inhibitor) mediate their effects through interference of BCR signalling, and Venetoclax²⁹ targets the anti-apoptotic behaviour of BCL-2. The power of our GWAS to identify common alleles conferring relative risks of 1.2 or greater (such as the rs35923643 variant) is high (∼80%). Hence, there are unlikely to be many additional SNPs with similar effects for alleles with frequencies greater than 0.2 in populations of European ancestry. In contrast, our analysis had limited power to detect alleles with smaller effects and/or MAF<0.1. Hence, further GWAS studies in concert with functional analyses should lead to additional insights into CLL biology and afford the prospect of development of novel therapies.

Methods

Ethics

Collection of patient samples and associated clinicopathological information was undertaken with written informed consent and relevant ethical review board approval at respective study centres in accordance with the tenets of the Declaration of Helsinki. Specifically, these centres are UK-CLL1 and UK-CLL2: UK Multi-Research Ethics Committee (MREC 99/1/082); GEC: Mayo Clinic Institutional Review Board, Duke University Institutional Review Board, University of Utah, University of Texas MD Anderson Cancer Center Institutional Review Board, National Cancer Institute, ATBC: NCI Special Studies Institutional Review Board, BCCA: UBC BC Cancer Agency Research Ethics Board, CPS-II: American Cancer Society, ENGELA: IRB00003888—Comite d’ Evaluation Ethique de l’Inserm IRB #1, EPIC: Imperial College London, EpiLymph: International Agency for Research on Cancer, HPFS: Harvard School of Public Health (HSPH) Institutional Review Board, Iowa-Mayo SPORE: University of Iowa Institutional Review Board, Italian GxE: Comitato Etico Azienda Ospedaliero Universitaria di Cagliari, Mayo Clinic Case–Control: Mayo Clinic Institutional Review Board, MCCS: Cancer Council Victoria’s Human Research Ethics Committee, MSKCC: Memorial Sloan-Kettering Cancer Center Institutional Review Board, NCI-SEER (NCI Special Studies Institutional Review Board), NHS: Partners Human Research Committee, Brigham and Women’s Hospital, NSW: NSW Cancer Council Ethics Committee, NYU-WHS: New York University School of Medicine Institutional Review Board, PLCO: (NCI Special Studies Institutional Review Board), SCALE: Scientific Ethics Committee for the Capital Region of Denmark, SCALE: Regional Ethical Review Board in Stockholm (Section 4) IRB#5, Utah: University of Utah Institutional Review Board, UCSF and UCSF2: University of California San Francisco Committee on Human Research, Women’s Health Initiative (WHI): Fred Hutchinson Cancer Research Center and Yale: Human Investigation Committee, Yale University School of Medicine. Informed consent was obtained from all participants. The diagnosis of CLL (ICD-10-CM C91.10, ICD-O M9823/3 and 9670/3) was established in accordance with the International Workshop on Chronic Lymphocytic Leukemia guidelines³⁰.

Genome-wide association studies

The meta-analysis was based on six GWAS^2,6,7,9 (Supplementary Tables 1 and 2). Briefly, the six GWAS comprised—UK-CLL1: 517 CLL cases and 2,698 controls; UK-CLL2: 1,403 CLL cases, 2,501 controls; Genetic Epidemiology of CLL (GEC) Consortium: 396 CLL cases and 296 controls; NHL GWAS Consortium: 1,851 CLL cases and 6,649 controls; UCSF: 214 CLL cases, 751 controls; Utah: 331 CLL cases, 420 controls.

Quality control of GWAS

Standard quality-control measures were applied to the GWAS³¹. Specifically, individuals with low call rate (<95%) as well as all individuals evaluated to be of non-European ancestry (using the HapMap version 2 CEU, JPT/CHB and YRI populations as a reference) were excluded. For apparent first-degree relative pairs, we removed the control from a case–control pair; otherwise, we excluded the individual with the lower call rate. SNPs with a call rate <95% were excluded as were those with a MAF <0.01 or displaying significant deviation from Hardy–Weinberg equilibrium (that is, P<10⁻⁶). GWAS data were imputed to >10 million SNPs with the IMPUTE2 v2.3 software³² using a merged reference panel consisting of data from 1000 Genomes Project (phase 1 integrated release 3 March 2012)¹⁰ and UK10K (ref. 11). Genotypes were aligned to the positive strand in both imputation and genotyping. Imputation was conducted separately for each study, and in each the data were pruned to a common set of SNPs between cases and controls before imputation. We set thresholds for imputation quality to retain potential risk variants with MAF>0.005 for validation. Poorly imputed SNPs defined by an information measure <0.80 were excluded. Tests of association between imputed SNPs and CLL was performed using logistic regression under an additive genetic model in SNPTESTv2.5 (ref. 33). The adequacy of the case–control matching and possibility of differential genotyping of cases and controls were formally evaluated using Q–Q plots of test statistics (Supplementary Fig. 1). The inflation factor λ was based on the 90% least-significant SNPs³⁴. Where appropriate, principal components, generated using common SNPs, were included in the analysis to limit the effects of cryptic population stratification that otherwise might cause inflation of test statistics. Eigenvectors for the GWAS data sets were inferred using smartpca (part of EIGENSOFT³⁵) by merging cases and controls with Phase II HapMap samples.

Replication studies and technical validation

The 16 SNPs in the most promising loci were taken forward for de novo replication (Supplementary Table 2). The UK replication series comprised 645 cases collected through the NCLLC and Leicester Haematology Tissue Bank and 2,341 controls comprised 2,780 healthy individuals ascertained through the National Study of Colorectal Cancer (1999–2006; ref. 36). These controls were the spouses or unrelated friends of individuals with malignancies. None had a personal history of malignancy at the time of ascertainment. Both cases and controls were British residents and had self-reported European ancestry. The Mayo replication series comprised 407 newly diagnosed cases and 1,207 clinic-based controls from the Mayo Clinic CLL case–control study³⁷. The eligibility criteria of the cases were age 20 years and older, consented within 9 months of their initial diagnosis at presentation to Mayo Clinic and no history of HIV. The eligibility criteria for the controls were age 20 years and older, a resident of Minnesota, Iowa or Wisconsin at the time of appointment at Mayo Clinic, no history of lymphoma or leukaemia and no history of HIV infection. Controls were frequency matched to the regional case distribution on 5-year age group, sex and geographic area. In silico replication was performed in 444 cases and 609 controls from International Cancer Genome Consortium (ICGC), and 226 cases and 228 controls from the WHI study^38,39.

The fidelity of imputation as assessed by the concordance between imputed and directly genotyped SNPs was examined in a subset of samples (Supplementary Table 5). Replication genotyping of UK samples was performed using competitive allele-specific PCR KASPar chemistry (LGC, Hertfordshire, UK); replication genotyping of Mayo samples was performed using Sequenom MassARRAY (Sequenom Inc., San Diego, CA, USA). Primers are listed in Supplementary Table 9. Call rates for SNP genotypes were >95% in each of the replication series. To ensure the quality of genotyping in all assays, at least two negative controls and duplicate samples (showing a concordance of >99%) were genotyped at each centre. To exclude technical artefacts in genotyping, we performed cross-platform validation of 96 samples and sequenced a set of 96 randomly selected samples from each case and control series to confirm genotyping accuracy. Assays were found to be performing robustly; concordance was >99%.

Meta-analysis

Meta-analyses were performed using the fixed-effects inverse-variance method based on the β estimates and s.e.’s from each study using META v1.6 (ref. 40). Cochran’s Q-statistic to test for heterogeneity and the I² statistic to quantify the proportion of the total variation due to heterogeneity were calculated⁴¹. Using the meta-analysis summary statistics and LD correlations from a reference panel of the 1000 Genomes Project combined with UK10K we used Genome-wide Complex Trait Analysis to perform conditional association analysis⁴². Association statistics were calculated for all SNPs conditioning on the top SNP in each loci showing genome-wide significance. This is carried out in a step-wise manner.

Analysis of exome-sequencing data

Previously published exome-sequencing data from 141 cases from 66 CLL families⁴³ were interrogated to search for deleterious (missense, nonsense, frameshift or splice site) variants within a genomic interval spanning all SNPs with LD r²>0.2 with each index SNP. Positions resulting in protein-altering changes were identified using the Ensembl Variant Effect Predictor (version 78).

Mutational status

IGVH mutation status was determined according to the BIOMED-2 protocols as described previously⁴⁴. Sequence analysis was conducted using the Chromas software version 2.23 (Applied Biosystems) and the international immunogenetics information system database. In accordance with published criteria, we classified sequences with a germline identity of ≥98% as unmutated and those with an identity of <98% as mutated.

Association between genotype and patient outcome

To examine the relationship between SNP genotype and patient outcome, we analysed two patient series: (1) 356 patients from the UK Leukaemia Research Fund (LRF) CLL-4 trial⁴⁵, which compared the efficacy of fludarabine, chlorambucil and the combination of fludarabine plus cyclophosphamide; (2) 377 newly diagnosed patients from Mayo Clinic who were prospectively followed. Cox-regression analysis was used to estimate genotype-specific hazard ratios and 95% CIs with overall survival. Statistical analyses were undertaken using R version 2.5.0.

eQTL analysis

eQTL analyses were performed by examining the gene expression profiles of 452 CLL cases (Affymetrix Human Genome U219 Array)⁴⁶. Additional data were obtained by querying publicly available eQTL mRNA expression data using MuTHER⁴⁷, the Blood eQTL browser⁴⁸ and data from the GTEx consortium⁴⁹. MuTHER contains expression data on LCLs, skin and adipose tissue from 856 healthy twins. The Blood eQTL browser contains expression data from 5,311 non-transformed peripheral blood samples. We used the whole-blood RNA-seq data from GTEx, which consisted of data from 338 individuals.

Functional annotation

Novel risk SNPs and their proxies (that is, r²>0.2 in the 1000 Genomes EUR reference panel) were annotated for putative functional effect based upon histone mark ChIP-seq/ChIPmentation data for H3K27ac, H3K4Me1 and H3K27Me3 from GM12878 (LCL)¹⁸ and primary CLL cells¹⁹. We searched for overlap with ‘super-enhancer’ regions as defined by Hnisz et al.²¹, restricting the analysis to the GM12878 cell line and CD19⁺ B cells. We also interrogated ATAC-seq data from CLL cells¹⁹ and primary B cells²⁰. The novel risk SNPs and their proxies (r²>0.2 as above) were intersected with regions of accessible chromatin in CLL cells, as defined by Rendeiro et al.¹⁹, which were used as a surrogate for likely sites of TF binding. SNPs falling within accessible sites (n=47) were taken forward to TF-binding motif analysis and were also annotated for genomic evolutionary rate profiling score⁵⁰ as well as bound TFs based on ENCODE project¹⁸ ChIP-seq data.

TF-binding disruption analysis

To determine whether the risk variants or their proxies were disrupting motif-binding sites, we used the motifbreakR package²². This tool predicts the effects of variants on TF-binding motifs, using position probability matrices to determine the likelihood of observing a particular nucleotide at a specific position within a TF-binding site. We tested the SNPs by estimating their effects on over 2,800 binding motifs as characterized by ENCODE⁵¹, FactorBook⁵², HOCOMOCO⁵³ and HOMER⁵⁴. Scores were calculated using the relative entropy algorithm.

TF and histone mark enrichment analysis

To examine enrichment in specific TF binding across risk loci, we adapted the variant set enrichment method of Cowper-Sal lari et al.²³. Briefly, for each risk locus, a region of strong LD (defined as r²>0.8 and D′>0.8) was determined, and these SNPs were termed the associated variant set (AVS). TF ChIP-seq uniform peak data were obtained from ENCODE for the GM12878 cell line, which included data for 82 TF and 11 histone marks. For each of these marks, the overlap of the SNPs in the AVS and the binding sites was determined to produce a mapping tally. A null distribution was produced by randomly selecting SNPs with the same characteristics as the risk-associated SNPs, and the null mapping tally calculated. This process was repeated 10,000 times, and approximate P-values were calculated as the proportion of permutations where null mapping tally was greater or equal to the AVS mapping tally. An enrichment score was calculated by normalizing the tallies to the median of the null distribution. Thus, the enrichment score is the number of s.d.’s of the AVS mapping tally from the mean of the null distribution tallies.

Heritability analysis

We used genome-wide complex trait analysis⁴² to estimate the polygenic variance (that is, heritability) ascribable to all genotyped and imputed GWAS SNPs. SNPs were excluded based on low MAF (MAF<0.01), poor imputation (info score <0.4) and evidence of departure from Hardy Weinberg Equilibrium (HWE) (P<0.05). Individuals were excluded for poor imputation and where two individuals were closely related. A genetic relationship matrix of pairs of samples was used as input for the restricted maximum likelihood analysis to estimate the heritability explained by the selected set of SNPs. To transform the estimated heritability to the liability scale, we used the lifetime risk^55,56 for CLL, which is estimated to be 0.006 by SEER (http://seer.cancer.gov/statfacts/html/clyl.html). The variance of the risk distribution due to the identified risk loci was calculated as described by Pharoah et al.⁵⁷, assuming that the relative risk when a first-degree relative has CLL is 8.5 (ref. 1).

Pathway analysis

To investigate the interaction between the gene products of the GWAS hits, we performed a pathway analysis. We selected the closest coding genes for the lead-associated SNPs and then performed pathway analysis using the LENS tool²⁵, which identifies gene product and protein–protein interactions from HPRD⁵⁸ and BioGRID⁵⁹. Enrichment of pathways was assessed using Fisher’s exact test, comparing the overlap of the genes in the network with the genes in the pathway. Pathway data were obtained from REACTOME⁶⁰. Cytoscape was used to perform network analyses⁶¹, and the Hive Plot was drawn using HiveR (academic.depauw.edu/~hanson/HiveR/HiveR.html).

Data availability

Genotype data that support the findings of this study have been deposited in the database of Genotypes and Phenotypes (dbGAP) under accession code phs000802.v2.p1 and in the European Genome-phenome Archive (EGA) under accession codesEGAS00001000090, EGAD00001000195, EGAS00001000108, EGAD00000000022 and EGAD00000000024.

Transcriptional profiling data from the MuTHER consortium that support the findings of this work have been deposited in the European Bioinformatics Institute (Part of the European Molecular Biology Laboratory, EMBL-EBI) under accession code E-TABM-1140. Data from Blood eQTL have been deposited in the EBI-EMBL under accession codes E-TABM-1036, E-MTAB-945 and E-MTAB-1708. GTEx data are deposited in dbGaP under accession code phs000424.v6.p1. The remaining data are contained within the paper and its Supplementary files or are available from the authors upon reasonable request.

Additional information

How to cite this article: Law, P. J. et al. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia. Nat. Commun. 8, 14175 doi: 10.1038/ncomms14175 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Goldin, L. R., Pfeiffer, R. M., Li, X. & Hemminki, K. Familial risk of lymphoproliferative tumors in families of patients with chronic lymphocytic leukemia: results from the Swedish Family-Cancer Database. Blood 104, 1850–1854 (2004).
Article CAS Google Scholar
Di Bernardo, M. C. et al. A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia. Nat. Genet. 40, 1204–1210 (2008).
Article CAS Google Scholar
Crowther-Swanepoel, D. et al. Common variants at 2q37.3, 8q24.21, 15q21.3 and 16q24.1 influence chronic lymphocytic leukemia risk. Nat. Genet. 42, 132–136 (2010).
Article CAS Google Scholar
Slager, S. L. et al. Genome-wide association study identifies a novel susceptibility locus at 6p21.3 among familial CLL. Blood 117, 1911–1916 (2011).
Article CAS Google Scholar
Slager, S. L. et al. Common variation at 6p21.31 (BAK1) influences the risk of chronic lymphocytic leukemia. Blood 120, 843–846 (2012).
Article CAS Google Scholar
Berndt, S. I. et al. Genome-wide association study identifies multiple risk loci for chronic lymphocytic leukemia. Nat. Genet. 45, 868–876 (2013).
Article CAS Google Scholar
Speedy, H. E. et al. A genome-wide association study identifies multiple susceptibility loci for chronic lymphocytic leukemia. Nat. Genet. 46, 56–60 (2014).
Article CAS Google Scholar
Sava, G. P. et al. Common variation at 12q24.13 (OAS3) influences chronic lymphocytic leukemia risk. Leukemia 29, 748–751 (2015).
Article CAS Google Scholar
Berndt, S. I. et al. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia. Nat. Commun. 7, 10933 (2016).
Article ADS CAS Google Scholar
The 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010).
Huang, J. et al. Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel. Nat. Commun. 6, 8111 (2015).
Article CAS Google Scholar
Kozyrev, S. V. et al. Functional variants in the B-cell gene BANK1 are associated with systemic lupus erythematosus. Nat. Genet. 40, 211–216 (2008).
Article CAS Google Scholar
Aiba, Y. et al. BANK negatively regulates Akt activation and subsequent B cell responses. Immunity 24, 259–268 (2006).
Article CAS Google Scholar
Ferreira, P. G. et al. Transcriptome characterization by RNA sequencing identifies a major molecular and clinical subdivision in chronic lymphocytic leukemia. Genome Res. 24, 212–226 (2014).
Article CAS Google Scholar
Mandal, P. et al. A gammaherpesvirus cooperates with interferon-alpha/beta-induced IRF2 to halt viral replication, control reactivation, and minimize host lethality. PLoS Pathog. 7, e1002371 (2011).
Article CAS Google Scholar
Fairfax, B. P. et al. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science 343, 1246949 (2014).
Article Google Scholar
The International Multiple Sclerosis Genetics Consortium & The Wellcome Trust Case Control Consortium 2. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 476, 214–219 (2011).
de Souza, N. The ENCODE project. Nat. Methods 9, 1046 (2012).
Article Google Scholar
Rendeiro, A. F. et al. Chromatin accessibility maps of chronic lymphocytic leukaemia identify subtype-specific epigenome signatures and transcription regulatory networks. Nat. Commun. 7, 11938 (2016).
Article ADS CAS Google Scholar
Corces, M. R. et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat. Genet. 10, 1193–1203 (2016).
Article CAS Google Scholar
Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cell 155, 934–947 (2013).
Article CAS Google Scholar
Coetzee, S. G., Coetzee, G. A. & Hazelett, D. J. motifbreakR: an R/Bioconductor package for predicting variant effects at transcription factor binding sites. Bioinformatics 31, 3847–3849 (2015).
CAS PubMed PubMed Central Google Scholar
Cowper-Sal lari, R. et al. Breast cancer risk-associated SNPs modulate the affinity of chromatin for FOXA1 and alter gene expression. Nat. Genet. 44, 1191–1198 (2012).
Article CAS Google Scholar
Oakes, C. C. et al. DNA methylation dynamics during B cell maturation underlie a continuum of disease phenotypes in chronic lymphocytic leukemia. Nat. Genet. 48, 253–264 (2016).
Article CAS Google Scholar
Handen, A. & Ganapathiraju, M. K. LENS: web-based lens for enrichment and network studies of human proteins. BMC Med. Genomics 8, S2 (2015).
Article Google Scholar
Nelson, M. R. et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 47, 856–860 (2015).
Article CAS Google Scholar
Byrd, J. C. et al. Targeting BTK with Ibrutinib in relapsed chronic lymphocytic leukemia. N. Engl. J. Med. 369, 32–42 (2013).
Article CAS Google Scholar
Furman, R. R. et al. Idelalisib and rituximab in relapsed chronic lymphocytic leukemia. N. Engl. J. Med. 370, 997–1007 (2014).
Article CAS Google Scholar
Roberts, A. W. et al. Targeting BCL2 with venetoclax in relapsed chronic lymphocytic leukemia. N. Engl. J. Med. 374, 311–322 (2016).
Article CAS Google Scholar
Hallek, M. et al. Guidelines for the diagnosis and treatment of chronic lymphocytic leukemia: a report from the International Workshop on Chronic Lymphocytic Leukemia updating the National Cancer Institute-Working Group 1996 guidelines. Blood 111, 5446–5456 (2008).
Article CAS Google Scholar
Anderson, C. A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573 (2010).
Article CAS Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article Google Scholar
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906–913 (2007).
Article CAS Google Scholar
Clayton, D. G. et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat. Genet. 37, 1243–1246 (2005).
Article CAS Google Scholar
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
Article CAS Google Scholar
Penegar, S. et al. National study of colorectal cancer genetics. Br. J. Cancer 97, 1305–1309 (2007).
Article CAS Google Scholar
Cerhan, J. R. et al. Design and validity of a clinic-based case-control study on the molecular epidemiology of lymphoma. Int. J. Mol. Epidemiol. Genet. 2, 95–113 (2011).
PubMed PubMed Central Google Scholar
Anderson, G. L. et al. Implementation of the Women’s Health Initiative study design. Ann. Epidemiol. 13, S5–S17 (2003).
Article Google Scholar
Berndt, S. I. et al. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia. Nat. Commun. 7, 10933 (2016).
Article ADS CAS Google Scholar
Liu, J. Z. et al. Meta-analysis and imputation refines the association of 15q25 with smoking quantity. Nat. Genet. 42, 436–440 (2010).
Article CAS Google Scholar
Higgins, J. P. & Thompson, S. G. Quantifying heterogeneity in a meta-analysis. Stat. Med. 21, 1539–1558 (2002).
Article Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS Google Scholar
Speedy, H. E. et al. Germline mutations in shelterin complex genes are associated with familial chronic lymphocytic leukemia. Blood 128, 2319–2326 (2016).
Article CAS Google Scholar
van Krieken, J. H. et al. Improved reliability of lymphoma diagnostics via PCR-based clonality testing: report of the BIOMED-2 concerted action BHM4-CT98-3936. Leukemia 21, 201–206 (2007).
Article CAS Google Scholar
Catovsky, D. et al. Assessment of fludarabine plus cyclophosphamide for patients with chronic lymphocytic leukaemia (the LRF CLL4 Trial): a randomised controlled trial. Lancet 370, 230–239 (2007).
Article CAS Google Scholar
Puente, X. S. et al. Non-coding recurrent mutations in chronic lymphocytic leukaemia. Nature 526, 519–524 (2015).
Article ADS CAS Google Scholar
Grundberg, E. et al. Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat. Genet. 44, 1084–1089 (2012).
Article CAS Google Scholar
Westra, H. J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238–1243 (2013).
Article CAS Google Scholar
Ardlie, K. G. et al. The genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article ADS Google Scholar
Davydov, E. V. et al. Identifying a high fraction of the human genome to be under selective constraint using GERP⁺⁺. PLoS Comput. Biol. 6, e1001025 (2010).
Article Google Scholar
Kheradpour, P. & Kellis, M. Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments. Nucleic Acids Res. 42, 2976–2987 (2014).
Article CAS Google Scholar
Wang, J. et al. Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors. Genome Res. 22, 1798–1812 (2012).
Article CAS Google Scholar
Kulakovskiy, I. V. et al. HOCOMOCO: a comprehensive collection of human transcription factor binding sites models. Nucleic Acids Res. 41, D195–D202 (2013).
Article CAS Google Scholar
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Article CAS Google Scholar
Lu, Y. et al. Most common ‘sporadic’ cancers have a significant germline genetic component. Hum. Mol. Genet. 23, 6112–6118 (2014).
Article CAS Google Scholar
Lee, S. H. et al. Estimation and partitioning of polygenic variation captured by common SNPs for Alzheimer’s disease, multiple sclerosis and endometriosis. Hum. Mol. Genet. 22, 832–841 (2013).
Article CAS Google Scholar
Pharoah, P. D., Antoniou, A. C., Easton, D. F. & Ponder, B. A. Polygenes, risk prediction, and targeted prevention of breast cancer. N. Engl. J. Med. 358, 2796–2803 (2008).
Article CAS Google Scholar
Keshava Prasad, T. S. et al. Human Protein Reference Database--2009 update. Nucleic Acids Res. 37, D767–D772 (2009).
Article CAS Google Scholar
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2013 update. Nucleic Acids Res. 41, D816–D823 (2013).
Article CAS Google Scholar
Croft, D. et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 42, D472–D477 (2014).
Article ADS CAS Google Scholar
Smoot, M. E., Ono, K., Ruscheinski, J., Wang, P. L. & Ideker, T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27, 431–432 (2011).
Article CAS Google Scholar
Scales, M., Jager, R., Migliorini, G., Houlston, R. S. & Henrion, M. Y. visPIG--a web tool for producing multi-region, multi-track, multi-scale plots of genetic data. PLoS ONE 9, e107497 (2014).
Article ADS Google Scholar

Download references

Acknowledgements

In the United Kingdom, Bloodwise provided funding for the study (LRF05001, LRF06002 and LRF13044) with additional support from Cancer Research UK (C1298/A8362 supported by the Bobby Moore Fund) and the Arbib Fund. G.P.S. is in receipt of a PhD studentship from The Institute of Cancer Research. The NCI/InterLymph NHL GWAS initiative was supported by the intramural programme of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health. ATBC—This research was supported in part by the Intramural Research Program of the NIH and the National Cancer Institute. In addition, this research was supported by U.S. Public Health Service contracts N01-CN-45165, N01-RC-45035, N01-RC-37004 and HHSN261201000006C from the National Cancer Institute, Department of Health and Human Services. BC—Canadian Institutes for Health Research (CIHR); Canadian Cancer Society; Michael Smith Foundation for Health Research. CPS-II—The Cancer Prevention Study-II (CPS-II) Nutrition Cohort is supported by the American Cancer Society. Genotyping for all CPS-II samples was supported by the Intramural Research Program of the National Institutes of Health, NCI, Division of Cancer Epidemiology and Genetics. We also acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention National Program of Cancer Registries, and cancer registries supported by the National Cancer Institute Surveillance Epidemiology and End Results program. ELCCS—Leukemia and Lymphoma Research. ENGELA—Association pour la Recherche contre le Cancer (ARC), Institut National du Cancer (INCa), Fondation de France, Fondation contre la Leucémie, Agence nationale de sécurité sanitaire de l’alimentation, de l’environnement et du travail (ANSES). EPIC—Coordinated Action (Contract #006438, SP23-CT-2005-006438); HuGeF (Human Genetics Foundation), Torino, Italy; Cancer Research UK. EpiLymph—European Commission (grant references QLK4-CT-2000-00422 and FOOD-CT-2006-023103); the Spanish Ministry of Health (grant references CIBERESP, PI11/01810, PI14/01219, RCESP C03/09, RTICESP C03/10 and RTIC RD06/0020/0095), the Marató de TV3 Foundation (grant reference 051210), the Agència de Gestiód’AjutsUniversitarisi de Recerca—Generalitat de Catalunya (grant reference 2014SRG756), who had no role in the data collection, analysis or interpretation of the results; the NIH (contract NO1-CO-12400); the Compagnia di San Paolo—Programma Oncologia; the Federal Office for Radiation Protection grants StSch4261 and StSch4420, the José Carreras Leukemia Foundation grant DJCLS-R12/23, the German Federal Ministry for Education and Research (BMBF-01-EO-1303); the Health Research Board, Ireland, and Cancer Research Ireland; Czech Republic supported by MH CZ—DRO (MMCI, 00209805) and RECAMO, CZ.1.05/2.1.00/03.0101; Fondation de France and Association de Recherche Contre le Cancer. GEC/Mayo GWAS—National Institutes of Health (CA118444, CA148690, CA92153). Intramural Research Program of the NIH, National Cancer Institute. Veterans Affairs Research Service. Data collection for Duke University was supported by a Leukemia and Lymphoma Society Career Development Award, the Bernstein Family Fund for Leukemia and Lymphoma Research and the National Institutes of Health (K08CA134919), National Center for Advancing Translational Science (UL1 TR000135). HPFS—The HPFS was supported in part by National Institutes of Health grants CA167552, CA149445 and CA098122. We would like to thank the participants and staff of the Health Professionals Follow-up Study for their valuable contributions as well as the following state cancer registries for their help: AL, AZ, AR, CA, CO, CT, DE, FL, GA, ID, IL, IN, IA, KY, LA, ME, MD, MA, MI, NE, NH, NJ, NY, NC, ND, OH, OK, OR, PA, RI, SC, TN, TX, VA, WA, WY. We assume full responsibility for analyses and interpretation of these data. Iowa-Mayo SPORE—NCI Specialized Programs of Research Excellence (SPORE) in Human Cancer (P50 CA97274); National Cancer Institute (P30 CA086862, P30 CA15083); Henry J. Predolin Foundation. Italian GxE—Italian Association for Cancer Research (AIRC, Investigator Grant 11855; PC); Fondazione Banco di Sardegna 2010–2012 and Regione Autonoma della Sardegna (LR7 CRP-59812/2012; MGE). Mayo Clinic Case–Control—National Institutes of Health (R01 CA92153); National Cancer Institute (P30 CA015083). MCCS—The Melbourne Collaborative Cohort Study recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553 and 504711 and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR). MD Anderson—Institutional support to the Center for Translational and Public Health Genomics. MSKCC—Geoffrey Beene Cancer Research Grant, Lymphoma Foundation (LF5541); Barbara K. Lipman Lymphoma Research Fund (74419); Robert and Kate Niehaus Clinical Cancer Genetics Research Initiative (57470); U01 HG007033; ENCODE; U01 HG007033. R21 CA178800. NCI-SEER—Intramural Research Program of the National Cancer Institute, National Institutes of Health and Public Health Service (N01-PC-65064, N01-PC-67008, N01-PC-67009, N01-PC-67010, N02-PC-71105). NHS—The NHS was supported in part by National Institutes of Health grants CA186107, CA87969, CA49449, CA149445 and CA098122. We would like to thank the participants and staff of the Nurses’ Health Study for their valuable contributions as well as the following state cancer registries for their help: A.L., A.Z., A.R., C.A., C.O., C.T., D.E., F.L., G.A., I.D., I.L., I.N., I.A., K.Y., L.A., M.E., M.D., M.A., M.I., N.E., N.H., N.J., N.Y., N.C., N.D., O.H., O.K., O.R., P.A., R.I., S.C., T.N., T.X., V.A., W.A. and W.Y. The authors assume full responsibility for analyses and interpretation of these data. NSW—NSW was supported by grants from the Australian National Health and Medical Research Council (ID990920), the Cancer Council NSW and the University of Sydney Faculty of Medicine. NYU-WHS—National Cancer Institute (R01 CA098661, P30 CA016087); National Institute of Environmental Health Sciences (ES000260). PLCO—This research was supported by the Intramural Research Program of the National Cancer Institute and by contracts from the Division of Cancer Prevention, National Cancer Institute, NIH, DHHS. SCALE—Swedish Cancer Society (2009/659). Stockholm County Council (20110209) and the Strategic Research Program in Epidemiology at Karolinska Institute. Swedish Cancer Society grant (02 6661). National Institutes of Health (5R01 CA69669-02); Plan Denmark. UCSF2—The UCSF studies were supported by the NCI, National Institutes of Health, CA1046282, CA154643, CA45614, CA89745, CA87014. The collection of cancer incidence data used in this study was supported by the California Department of Health Services as part of the statewide cancer reporting programme mandated by California Health and Safety Code Section 103885; the National Cancer Institute’s Surveillance, Epidemiology and End Results Program under contract HHSN261201000140C awarded to the Cancer Prevention Institute of California, contract HHSN261201000035C awarded to the University of Southern California, and contract HHSN261201000034C awarded to the Public Health Institute; and the Centers for Disease Control and Prevention’s National Program of Cancer Registries, under agreement #1U58 DP000807-01 awarded to the Public Health Institute. The ideas and opinions expressed herein are those of the authors, and endorsement by the State of California, the California Department of Health Services, the National Cancer Institute or the Centers for Disease Control and Prevention or their contractors and subcontractors is not intended nor should be inferred. UTAH—National Institutes of Health CA134674. Partial support for data collection at the Utah site was made possible by the Utah Population Database (UPDB) and the Utah Cancer Registry (UCR). Partial support for all data sets within the UPDB is provided by the Huntsman Cancer Institute (HCI) and the HCI Comprehensive Cancer Center Support grant, P30 CA42014. The UCR is supported in part by NIH contract HHSN261201000026C from the National Cancer Institute SEER Program with additional support from the Utah State Department of Health and the University of Utah. WHI—WHI investigators are: Program Office (National Heart, Lung, and Blood Institute, Bethesda, Maryland)—Jacques Rossouw, Shari Ludlam, Dale Burwen, Joan McGowan, Leslie Ford and Nancy Geller; Clinical Coordinating Center (Fred Hutchinson Cancer Research Center, Seattle, WA)—Garnet Anderson, Ross Prentice, Andrea LaCroix and Charles Kooperberg; Investigators and Academic Centers (Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA)—JoAnn E. Manson; (MedStar Health Research Institute/Howard University, Washington, DC, USA) Barbara V. Howard; (Stanford Prevention Research Center, Stanford, CA, USA) Marcia L. Stefanick (The Ohio State University, Columbus, OH, USA); Rebecca Jackson (University of Arizona, Tucson/Phoenix, AZ, USA); Cynthia A. Thomson; (University at Buffalo, Buffalo, NY, USA); Jean Wactawski-Wende (University of Florida, Gainesville/Jacksonville, FL, USA); Marian Limacher (University of Iowa, Iowa City/Davenport, IA, USA); Robert Wallace (University of Pittsburgh, Pittsburgh, PA, USA); Lewis Kuller (Wake Forest University School of Medicine, Winston-Salem, NC, USA); Sally Shumaker WHI Memory Study (Wake Forest University School of Medicine, Winston-Salem, NC, USA) Sally Shumaker. The WHI programme is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts HHSN268201100046C, HHSN268201100001C, HHSN268201100002C, HHSN268201100003C, HHSN268201100004C and HHSN271201100004C. YALE—National Cancer Institute (CA62006); National Cancer Institute (CA165923). The Spanish replication study was supported by the Spanish Ministry of Economy and Competitiveness through the Instituto de Salud Carlos III (FIS PI13/01136; International Cancer Genome Consortium-Chronic Lymphocytic Leukemia Genome Project). We thank L. Padyukov (Karolinska Institutet) and the Epidemiological Investigation of Rheumatoid Arthritis (EIRA) group for providing control samples from the Swedish population for the Swedish replication study. MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553 and 504711, and by infrastructure provided by Cancer Council Victoria. Cases and their vital status were ascertained through the Victorian Cancer Registry (VCR) and the Australian Institute of Health and Welfare (AIHW), including the National Death Index and the Australian Cancer Database. This study makes use of data generated by the Wellcome Trust Case Control Consortium. A full list of the investigators who contributed to the generation of the data is available in www.wtccc.org.uk. Funding for the project was provided by the Wellcome Trust under award 076113. We are grateful to all investigators and all the patients and individuals for their participation. We also thank the clinicians, other hospital staff and study staff that contributed to the blood sample and data collection for this study.

Author information

Philip J. Law, Sonja I. Berndt, Helen E. Speedy, Nicola J. Camp, Georgina P. Sava and Christine F. Skibola: These authors contributed equally to this work.
Richard Houlston and Susan Slager: These authors jointly supervised the work.

Authors and Affiliations

Division of Genetics and Epidemiology, The Institute of Cancer Research, London, SW7 3RP, UK
Philip J. Law, Helen E. Speedy, Georgina P. Sava, Amy Holroyd & Richard Houlston
Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, 20892, Maryland, USA
Sonja I. Berndt, Lynn R. Goldin, Qing Lan, Mark P. Purdue, Demetrius Albanes, Stephanie Weinstein, Neil E. Caporaso, Lindsay M. Morton, Joseph F. Fraumeni, Stephen J. Chanock & Nathanial Rothman
Department of Internal Medicine, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, 84112, Utah, USA
Nicola J. Camp, Martha Glenn & Karen Curtin
Department of Epidemiology, School of Public Health and Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, 35233, Alabama, USA
Christine F. Skibola & Lucia Conde
Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, 10065, New York, USA
Vijai Joseph, Sabine Topka & Kenneth Offit
Northern Institute for Cancer Research, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
Nicola J. Sunter, Andrew G. Hall, Helen Marr & James M. Allan
Center for Chronic Immunodeficiency, University Medical Center Freiburg, Freiburg, 79108, Baden-Württemberg, Germany
Alexandra Nieters
Institut d’Investigacions Biomèdiques August Pi iSunyer (IDIBAPS), Hospital Clínic, Barcelona, 08036, Spain
Silvia Bea, David Martin-Garcia, Guillem Clot & Elias Campo
Registre des hémopathies malignes de la Gironde, Institut Bergonié, Inserm U1219 EPICENE, Bordeaux, 33076, France
Alain Monnereau
Epidemiology of Childhood and Adolescent Cancers Group, Inserm, Center of Research in Epidemiology and Statistics Sorbonne Paris Cité, Paris, F-94807, France
Alain Monnereau
Université Paris Descartes, Paris, 75270, France
Alain Monnereau
Epidemiology Research Program, American Cancer Society, Atlanta, 30303, Georgia, USA
Lauren R. Teras
Grupo de Medicina Xenomica, Universidade de Santiago de Compostela, Centro Nacional de Genotipado (CeGen-PRB2-ISCIII), CIBERER, Santiago de Compostela, 15782, Spain
Inés Quintela & Angel Carracedo
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, 02115, Massachusetts, USA
Brenda M. Birmann
Ernest and Helen Scott Haematological Research Institute, University of Leicester, Leicester, LE2 7LX, UK
Sandrine Jayne, Aneela Majid & Martin J. S. Dyer
Department of Preventive Medicine, USC Keck School of Medicine, University of Southern California, Los Angeles, 90033, California, USA
Wendy Cozen
Norris Comprehensive Cancer Center, USC Keck School of Medicine, University of Southern California, Los Angeles, 90033, California, USA
Wendy Cozen
Department of Medicine Solna, Unit of Clinical Epidemiology, Karolinska Institutet, Hematology Center, Karolinsak University Hospital, Stockholm, 17176, Sweden
Karin E. Smedby
The Royal Marsden NHS Foundation Trust, London, SM2 5PT, UK
Claire Dearden
Genome Sciences Centre, BC Cancer Agency, Vancouver, V5Z1L3, British Columbia, Canada
Angela R. Brooks-Wilson
Department of Biomedical Physiology and Kinesiology, Simon Fraser University, Burnaby, V5A1S6, British Columbia, Canada
Angela R. Brooks-Wilson
Haematological Sciences, Medical School, Newcastle University, Newcastle-upon-Tyne, NE2 4HH, UK
Tryfonia Mainou-Fowler
Centre for Big Data Research in Health, University of New South Wales, Sydney, 2052, New South Wales, Australia
Claire M. Vajdic
Department of Haematology, Royal Victoria Infirmary, Newcastle upon Tyne, NE1 4LP, UK
Graham H. Jackson
Department of Public Health, Clinical and Molecular Medicine, University of Cagliari, Monserrato, 09042, Cagliari, Italy
Pierluigi Cocco
Department of Environmental Health Sciences, Yale School of Public Health, New Haven, 06520, Connecticut, USA
Yawei Zhang & Tongzhang Zheng
Cancer Epidemiology Centre, Cancer Council Victoria, Melbourne, 3004, Victoria, Australia
Graham G. Giles & Roger L. Milne
Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, University of Melbourne, Melbourne, 3010, Victoria, Australia
Graham G. Giles & Roger L. Milne
Westat, Rockville, 20850, Maryland, USA
Charles Lawrence
Division of Hematology, Mayo Clinic, Rochester, 55905, Minnesota, USA
Timothy G. Call
Department of Medicine, Mayo Clinic, Rochester, 55905, Minnesota, USA
Mark Liebow
Department of Epidemiology Research, Division of Health Surveillance and Research, Statens Serum Institut, Copenhagen, 2300, Denmark
Mads Melbye & Henrik Hjalgrim
Department of Medicine, Stanford University School of Medicine, Stanford, 94305, California, USA
Mads Melbye
Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, 75105, Sweden
Bengt Glimelius, Larry Mansouri & Richard Rosenquist
Epidemiology Research Program, American Cancer Society, Atlanta, 30303, Georgia, USA
W Ryan Diver
Department of Internal Medicine, Carver College of Medicine, The University of Iowa, Iowa City, 52242, Iowa, USA
Brian K. Link
Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, 94118, California, USA
Paige M. Bracci & Elizabeth A. Holly
Division of Endocrinology, Diabetes and Metabolism, Ohio State University, Columbus, 43210, Ohio, USA
Rebecca D. Jackson
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, 98117, Washington, USA
Lesley F. Tinker
Cancer Epidemiology Research Programme, Catalan Institute of Oncology-IDIBELL, L’Hospitalet de Llobregat, Barcelona, 08908, Spain
Yolanda Benavente
CIBER de Epidemiología y Salud Pública (CIBERESP), Barcelona, 08036, Spain
Yolanda Benavente
The Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, 10029, New York, USA
Paolo Boffetta & Silvia de Sanjose
International Agency for Research on Cancer, Lyon, 69372, France
Paul Brennan, James McKay & Silvia de Sanjose
Registre des Hémopathies Malignes de Côte d’Or, University of Burgundy and Dijon University Hospital, Dijon, 21070, France
Marc Maynadie
Department of Computational Biology, St Jude Children’s Research Hospital, Memphis, 38105, Tennessee, USA
Zhaoming Wang
Department of Family Medicine and Public Health Sciences, Wayne State University, Detroit, 48201, Michigan, USA
Richard K. Severson
School of Public Health, Imperial College London, London, W2 1PG, UK
Elio Riboli
MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, W2 1PG, UK
Paolo Vineis
Human Genetics Foundation, Turin, 10126, Italy
Paolo Vineis
Institute for Risk Assessment Sciences, Utrecht University, Utrecht, 3508 TD, The Netherlands
Roel C. H. Vermeulen
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, 3584 CX, The Netherlands
Roel C. H. Vermeulen
Department of Pathology, Genetic Epidemiology Laboratory, University of Melbourne, Melbourne, 3010, Victoria, Australia
Melissa C. Southey
Epidemiology of Childhood and Adolescent Cancers Group, Inserm, Center of Research in Epidemiology and Statistics Sorbonne Paris Cité (CRESS), Paris, F-94807, France
Jacqueline Clavel
Université Paris Descartes, Paris, 75270, France
Jacqueline Clavel
Cancer Control Research, BC Cancer Agency, Vancouver, V5Z1L3, British Columbia, Canada
John J. Spinelli
School of Population and Public Health, University of British Columbia, Vancouver, V6T1Z3, British Columbia, Canada
John J. Spinelli
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, 02115, Massachusetts, USA
Peter Kraft
Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, 02115, Massachusetts, USA
Peter Kraft
Department of Biomedical Science, University of Cagliari, Monserrato, Cagliari, 09042, Italy
Maria Grazia Ennas
Department of Haematology, Queen Elizabeth Hospital, Gateshead, NE9 6SX, UK
Geoffrey Summerfield
Interdisciplinary Department of Medicine, University of Bari, Bari, 70124, Italy
Giovanni M. Ferri
Department of Molecular and Clinical Cancer Medicine, University of Liverpool, Liverpool, L69 3BX, UK
Robert J. Harris & Andrew R. Pettitt
Environmental and Occupational Epidemiology Unit, Cancer Prevention and Research Institute (ISPO), Florence, 50139, Italy
Lucia Miligi
Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, 27599, North Carolina, USA
Kari E. North
Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, 27599, North Carolina, USA
Kari E. North
Queens Centre for Haematology and Oncology, Castle Hill Hospital, Hull and East Yorkshire NHS Trust, Cottingham, HU16 5JQ, UK
David J. Allsup & James R. Bailey
Department of Haematology, Birmingham Heartlands Hospital, Birmingham, B9 5SS, UK
Guy Pratt
Division of Cancer and Genetics, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK
Chris Pepper
Cardiff and Vale National Health Service Trust, Heath Park, Cardiff, CF14 4XW, UK
Chris Fegan
Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah, 21589, KSA
Angel Carracedo
Division of Molecular Pathology, The Institute of Cancer Research, London, SW7 3RP, UK
Daniel Catovsky
Unitat de Hematología, Hospital Clínic, IDIBAPS, Universitat de Barcelona, Barcelona, 08036, Spain
Elias Campo
Department of Health Sciences Research, Mayo Clinic, Rochester, 55905, Minnesota, USA
James R. Cerhan & Susan Slager

Authors

Philip J. Law
View author publications
You can also search for this author in PubMed Google Scholar
Sonja I. Berndt
View author publications
You can also search for this author in PubMed Google Scholar
Helen E. Speedy
View author publications
You can also search for this author in PubMed Google Scholar
Nicola J. Camp
View author publications
You can also search for this author in PubMed Google Scholar
Georgina P. Sava
View author publications
You can also search for this author in PubMed Google Scholar
Christine F. Skibola
View author publications
You can also search for this author in PubMed Google Scholar
Amy Holroyd
View author publications
You can also search for this author in PubMed Google Scholar
Vijai Joseph
View author publications
You can also search for this author in PubMed Google Scholar
Nicola J. Sunter
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Nieters
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Bea
View author publications
You can also search for this author in PubMed Google Scholar
Alain Monnereau
View author publications
You can also search for this author in PubMed Google Scholar
David Martin-Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Lynn R. Goldin
View author publications
You can also search for this author in PubMed Google Scholar
Guillem Clot
View author publications
You can also search for this author in PubMed Google Scholar
Lauren R. Teras
View author publications
You can also search for this author in PubMed Google Scholar
Inés Quintela
View author publications
You can also search for this author in PubMed Google Scholar
Brenda M. Birmann
View author publications
You can also search for this author in PubMed Google Scholar
Sandrine Jayne
View author publications
You can also search for this author in PubMed Google Scholar
Wendy Cozen
View author publications
You can also search for this author in PubMed Google Scholar
Aneela Majid
View author publications
You can also search for this author in PubMed Google Scholar
Karin E. Smedby
View author publications
You can also search for this author in PubMed Google Scholar
Qing Lan
View author publications
You can also search for this author in PubMed Google Scholar
Claire Dearden
View author publications
You can also search for this author in PubMed Google Scholar
Angela R. Brooks-Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Andrew G. Hall
View author publications
You can also search for this author in PubMed Google Scholar
Mark P. Purdue
View author publications
You can also search for this author in PubMed Google Scholar
Tryfonia Mainou-Fowler
View author publications
You can also search for this author in PubMed Google Scholar
Claire M. Vajdic
View author publications
You can also search for this author in PubMed Google Scholar
Graham H. Jackson
View author publications
You can also search for this author in PubMed Google Scholar
Pierluigi Cocco
View author publications
You can also search for this author in PubMed Google Scholar
Helen Marr
View author publications
You can also search for this author in PubMed Google Scholar
Yawei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tongzhang Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Graham G. Giles
View author publications
You can also search for this author in PubMed Google Scholar
Charles Lawrence
View author publications
You can also search for this author in PubMed Google Scholar
Timothy G. Call
View author publications
You can also search for this author in PubMed Google Scholar
Mark Liebow
View author publications
You can also search for this author in PubMed Google Scholar
Mads Melbye
View author publications
You can also search for this author in PubMed Google Scholar
Bengt Glimelius
View author publications
You can also search for this author in PubMed Google Scholar
Larry Mansouri
View author publications
You can also search for this author in PubMed Google Scholar
Martha Glenn
View author publications
You can also search for this author in PubMed Google Scholar
Karen Curtin
View author publications
You can also search for this author in PubMed Google Scholar
W Ryan Diver
View author publications
You can also search for this author in PubMed Google Scholar
Brian K. Link
View author publications
You can also search for this author in PubMed Google Scholar
Lucia Conde
View author publications
You can also search for this author in PubMed Google Scholar
Paige M. Bracci
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth A. Holly
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca D. Jackson
View author publications
You can also search for this author in PubMed Google Scholar
Lesley F. Tinker
View author publications
You can also search for this author in PubMed Google Scholar
Yolanda Benavente
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Boffetta
View author publications
You can also search for this author in PubMed Google Scholar
Paul Brennan
View author publications
You can also search for this author in PubMed Google Scholar
Marc Maynadie
View author publications
You can also search for this author in PubMed Google Scholar
James McKay
View author publications
You can also search for this author in PubMed Google Scholar
Demetrius Albanes
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Weinstein
View author publications
You can also search for this author in PubMed Google Scholar
Zhaoming Wang
View author publications
You can also search for this author in PubMed Google Scholar
Neil E. Caporaso
View author publications
You can also search for this author in PubMed Google Scholar
Lindsay M. Morton
View author publications
You can also search for this author in PubMed Google Scholar
Richard K. Severson
View author publications
You can also search for this author in PubMed Google Scholar
Elio Riboli
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Vineis
View author publications
You can also search for this author in PubMed Google Scholar
Roel C. H. Vermeulen
View author publications
You can also search for this author in PubMed Google Scholar
Melissa C. Southey
View author publications
You can also search for this author in PubMed Google Scholar
Roger L. Milne
View author publications
You can also search for this author in PubMed Google Scholar
Jacqueline Clavel
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Topka
View author publications
You can also search for this author in PubMed Google Scholar
John J. Spinelli
View author publications
You can also search for this author in PubMed Google Scholar
Peter Kraft
View author publications
You can also search for this author in PubMed Google Scholar
Maria Grazia Ennas
View author publications
You can also search for this author in PubMed Google Scholar
Geoffrey Summerfield
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni M. Ferri
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. Harris
View author publications
You can also search for this author in PubMed Google Scholar
Lucia Miligi
View author publications
You can also search for this author in PubMed Google Scholar
Andrew R. Pettitt
View author publications
You can also search for this author in PubMed Google Scholar
Kari E. North
View author publications
You can also search for this author in PubMed Google Scholar
David J. Allsup
View author publications
You can also search for this author in PubMed Google Scholar
Joseph F. Fraumeni
View author publications
You can also search for this author in PubMed Google Scholar
James R. Bailey
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth Offit
View author publications
You can also search for this author in PubMed Google Scholar
Guy Pratt
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Hjalgrim
View author publications
You can also search for this author in PubMed Google Scholar
Chris Pepper
View author publications
You can also search for this author in PubMed Google Scholar
Stephen J. Chanock
View author publications
You can also search for this author in PubMed Google Scholar
Chris Fegan
View author publications
You can also search for this author in PubMed Google Scholar
Richard Rosenquist
View author publications
You can also search for this author in PubMed Google Scholar
Silvia de Sanjose
View author publications
You can also search for this author in PubMed Google Scholar
Angel Carracedo
View author publications
You can also search for this author in PubMed Google Scholar
Martin J. S. Dyer
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Catovsky
View author publications
You can also search for this author in PubMed Google Scholar
Elias Campo
View author publications
You can also search for this author in PubMed Google Scholar
James R. Cerhan
View author publications
You can also search for this author in PubMed Google Scholar
James M. Allan
View author publications
You can also search for this author in PubMed Google Scholar
Nathanial Rothman
View author publications
You can also search for this author in PubMed Google Scholar
Richard Houlston
View author publications
You can also search for this author in PubMed Google Scholar
Susan Slager
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.H. and S.L.S. developed the project and provided overall project management; R.H., S.L.S., P.J.L., H.E.S. and G.P.S. drafted the manuscript. At the ICR: P.J.L., G.P.S. and H.E.S. performed bioinformatic and statistical analyses; H.E.S. performed project management and supervised genotyping; G.P.S. and A.H. performed sequencing and genotyping. In Newcastle, J.M.A. and D.J.A. conceived of the NCLLC; J.M.A. obtained financial support, supervised laboratory management and oversaw genotyping of cases with NCLLC; N.J.S. and H.M. performed sample management of cases; A.G.H. developed the Newcastle Haematology Biobank, incorporating NCLLC; and T.M.-F., G.H.J., G.S., R.J.H., A.R.P., D.J.A., J.R.B., G.P., C.P. and C.F. developed protocols for recruitment of individuals with CLL and sample acquisition and performed sample collection of cases. In Leicester, M.J.S.D. performed overall management, collection and processing of samples; S.J. and A.M. performed DNA extractions and IGVH mutation assays. In Spain, S.B., G.C., D.M.-G., I.Q., A.C. and E.C. performed sample collection, genotyping and expression analysis in CLL cells. In Sweden, L.M. and R.R. performed collection of cases, and H.H. and K.E.S. performed sample collection in the Scandinavian Lymphoma Etiology (SCALE) study. At NCI GWAS/GEC GWAS, S.S., S.I.B., N.R. and S.J.C. conducted and supervised the genotyping of samples. S.I.B., N.J.C., C.F.S., J.V., A.N., A.M., L.R.G., L.R.T., B.M.B., S.J., W.C., K.E.S., Q.L., A.R.B.-W., M.P.P., C.M.V., P.C., Y.Z., T.Z., G.G.G., C.L., T.G.C., M.L., M. Melbye, B.G., M.G., K.C., W.R.D., B.K.L., L.C., P.M.B., E.A.H., R.D.J., L.F.T., Y.B., P. Boffetta, P. Brennan, M. Maynadie, J.M., D.A., S.W., Z.W., N.E.C., L.M.M., R.K.S., E.R., P.V., R.C.H.V., M.C.S., R.L.M., J.C., S.T., J.J.S., P.K., M.G.E., G.S., G.F., R.J.H., L.M., A.R.P., K.E.N., J.F.F., K.O., H.H., S.J.C., R.R., S.d.S., J.R.C., N.R. and S.L.S. conducted the epidemiological studies and contributed samples to the GWAS. Utah GWAS: N.J.C. designed and directed all aspects of the study; M.G. provided clinical oversight; K.C. provided statistical expertise. UCSF GWAS: C.S. supervised all aspects of the overall study; P.M.B. provided project management; L.C. performed bioinformatic and statistical analyses.

Corresponding authors

Correspondence to Richard Houlston or Susan Slager.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures, Supplementary Tables, Supplementary References. (PDF 5432 kb)

Supplementary Data 1

Association between SNP genotype and a) sex; b) age at diagnosis; and c) IGHV mutational status in CLL cases (XLSX 34 kb)

Supplementary Data 2

Table of eQTL results for the new risk loci in CLL primary cells, as well as data from publicly available databases. Shown are all genes within 1MB of the risk SNP. (XLSX 34 kb)

Supplementary Data 3

Significant pathways as determined by LENS. (XLSX 46 kb)

Peer Review File (PDF 55 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Law, P., Berndt, S., Speedy, H. et al. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia. Nat Commun 8, 14175 (2017). https://doi.org/10.1038/ncomms14175

Download citation

Received: 04 July 2016
Accepted: 06 December 2016
Published: 06 February 2017
DOI: https://doi.org/10.1038/ncomms14175

This article is cited by

Implementation of individualised polygenic risk score analysis: a test case of a family of four
- Manuel Corpas
- Karyn Megy
- Edmund Lehmann
BMC Medical Genomics (2022)
Meiotic drive in chronic lymphocytic leukemia compared with other malignant blood disorders
- Viggo Jønsson
- Haneef Awan
- Geir Erland Tjønnfjord
Scientific Reports (2022)
Polygenic risk score and risk of monoclonal B-cell lymphocytosis in caucasians and risk of chronic lymphocytic leukemia (CLL) in African Americans
- Geffen Kleinstern
- J. Brice Weinberg
- Susan L. Slager
Leukemia (2022)
Distinct germline genetic susceptibility profiles identified for common non-Hodgkin lymphoma subtypes
- Sonja I. Berndt
- Joseph Vijai
- Nathaniel Rothman
Leukemia (2022)
The clinical utility of polygenic risk scores for chronic lymphocytic leukemia
- Amit Sud
- Philip J. Law
- Richard S. Houlston
Leukemia (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.