Host immune genetic variations influence the risk of developing acute myeloid leukaemia: results from the NuCLEAR consortium

The purpose of this study was to conduct a two-stage case control association study including 654 acute myeloid leukaemia (AML) patients and 3477 controls ascertained through the NuCLEAR consortium to evaluate the effect of 27 immune-related single nucleotide polymorphisms (SNPs) on AML risk. In a pooled analysis of cohort studies, we found that carriers of the IL13rs1295686A/A genotype had an increased risk of AML (PCorr = 0.0144) whereas carriers of the VEGFArs25648T allele had a decreased risk of developing the disease (PCorr = 0.00086). In addition, we found an association of the IL8rs2227307 SNP with a decreased risk of developing AML that remained marginally significant after multiple testing (PCorr = 0.072). Functional experiments suggested that the effect of the IL13rs1295686 SNP on AML risk might be explained by its role in regulating IL1Ra secretion that modulates AML blast proliferation. Likewise, the protective effect of the IL8rs2227307 SNP might be mediated by TLR2-mediated immune responses that affect AML blast viability, proliferation and chemorresistance. Despite the potential interest of these results, additional functional studies are still warranted to unravel the mechanisms by which these variants modulate the risk of AML. These findings suggested that IL13, VEGFA and IL8 SNPs play a role in modulating AML risk.


Introduction
Acute Myeloid Leukaemia (AML) is a common haematological malignancy characterised by the clonal transformation of haematopoietic precursors that alter normal hematopoietic cell growth and differentiation 1 . Epidemiological studies suggested that AML onset can be triggered by multiple factors including age, sex, lifestyle, exposure to chemicals and a number of blood and congenital disorders 2 . However, the biological mechanisms underlying AML aetiology remain largely elusive. Even though cytogenetic analysis have allowed the stratification of AML patients into favourable, intermediate and unfavourable classes and has improved our ability to predict clonal evolution and disease progression 3 , many AML patients (~45%) have a normal karyotype, which suggests that additional genetic alterations are needed to develop the disease. Sequencing studies identified genes frequently mutated in AML, some of which predict poor prognosis (NPM1 wt /FLT3-ITD high , RUNX1, ASXL1 and TP53) 4 . Furthermore, it is increasingly evident that host immunity might also be implicated in AML risk and survival 5 . AML blasts activate immunosuppressive mechanisms to evade the immune system whereas immune response changes induced by the gut microbiota can also influence the antileukaemic effects of immune cells 6 . In addition, the efficacy of allogeneic stem cell transplantation (SCT) in eradicating AML is linked to the appearance of the graftversus-leukaemia effect, mediated by the recognition of major histocompatibility antigens present in malignant blasts by T cells 7 . Likewise, the disappearance of these circulating T cells recognising AML or the loss of costimulatory (CD28/CD80, ICAM-1/CD11a) or inhibitory interactions (PD-1/PDL-1) eventually leads to relapse 8 and the infusion of donor-derived CD8+ memory T cells induces remission in patients who relapsed following allogeneic SCT 9 . Considering that around two-thirds of AML patients relapse within the first 18 months after first-line therapy, clinical trials are trying to assess the efficacy of immunotherapies in AML and to unravel the interplay between the immune system and AML blasts.
Considering the aspects detailed above, the purpose of this study was to conduct a two-stage case control association study including 654 AML patients and 3477 controls ascertained through the NuCLEAR consortium to evaluate whether 27 single nucleotide polymorphisms (SNPs) within the IL4, IL8, IL8RB (CXCR2), IL12A, IL12B, IL13, IFNG, IFNGR2, CCR5, MIF and VEGFA loci influence the risk of developing AML. We also decided to investigate the correlation of selected SNPs with serum steroid hormone levels and their role in modulating immune responses after stimulation of whole blood, peripheral mononuclear cells (PBMCs) and macrophages with lipopolysaccharide (LPS), phytohemagglutinin (PHA), Pam3Cys and CpG.

Study design and study populations
We conducted a two-stage genetic association study to assess whether 27 functional single nucleotide polymorphisms (SNPs) within host immunity-related genes could influence AML risk. The discovery population consisted of 2027 European subjects (338 AML patients and 1689 healthy controls). AML patients were diagnosed by experienced clinicians and ascertained through the iNternational Consortium for LEukaemiA Research (NuCLEAR; Table 1). A set of AML patients were recruited from 2 Spanish medical institutions (Virgen de las Nieves University Hospital, Granada and Hospital of Salamanca, Salamanca), the University of Würzburg (Würzburg, Germany) and the University of Innsbruck (Innsbruck, Austria) 10 . Healthy controls included 667 Spanish blood donors from the REPAIR consortium 11 , 1000 German controls came from the Heinz-Nixdorf Recall (HNR) study 12 and 22 donors of allogeneic stem cell transplantation from the Medical University of Innsbruck (Innsbruck, Austria). In accordance with the Declaration of Helsinki, all study participants provided their written informed consent to participate in the study and the ethical committees of all participating centres and hospitals approved the study.

DNA extraction, SNP selection criteria and genotyping
Genomic DNA from all individuals was extracted from saliva or blood samples using the Oragen®-DNA Self-Collection kit (Oragene) or the Maxwell® 16 Blood DNA Purification kit (Promega) according to manufacturer's instructions. SNP selection criteria were based on previous associations with haematological malignancies (AML, ALL, CML, CLL and non-Hodgkin lymphomas) or solid tumours and clinical related parameters (graft versus host disease, whole blood leucocyte counts, anthropometric measures, etc.) but also according to their functionality in Haploreg (https://pubs.broadinstitute.org/mammals/haploreg/ haploreg.php), Regulome (https://www.regulomedb.org/ Data are means ± standard deviation or percentage (%). A set of 99 patients (39 and 61 from the discovery and replication cohorts, respectively) could not be classified according to the FAB classification. AML acute myeloid leukaemia *Age was not available in a set of German controls included in the discovery (n = 1000) and replication cohorts (n = 1068).  Table 2 continued   (Table  2 and Supplementary Fig. 1). Genotyping was performed using KASP® probes (LGC Genomics, Hoddesdon, UK) according to previously reported protocols 13 . For quality control, ∼5% of DNA samples were randomly included as duplicates and concordance between duplicate samples was ≥99.0%. AML cases and controls were randomly distributed in 384-well plates and the person doing genotyping experiments did not know how AML cases and controls were distributed.

Statistical analysis
Deviation from Hardy-Weinberg Equilibrium (HWE) was tested in the controls by chi-square (χ 2 ). Logistic regression adjusted for sex and country of origin was used to assess the associations of the SNPs with AML risk assuming log-additive, dominant and recessive models. According the M eff method 14 , 24 of 27 SNPs were independent and, consequently, the study-wide significant threshold was set to 0.0007 (0.05/24SNPs/3models). Statistical power was calculated using Quanto (v.12.4) assuming a log-additive model of inheritance.

Replication cohort
For replication purposes, the most relevant findings (P < 0.05) were replicated in a cohort of 2104 subjects (316 AML cases and 1788 healthy controls). AML cases were recruited from an independent Spanish medical institution (Hospital General of Valencia, Valencia, Spain), from the University Hospital of Würzburg (Germany) and from two Italian medical institutions (Università Cattolica del Sacro Cuore, Rome and University of Modena and Reggio Emilia, AOU Policlinico, Modena) between 2015 and 2017. Five hundred and seven Spanish controls were blood donors recruited from the Blood Transfusion Centre (CRTS, Granada-Almería), 194 Italian controls from the REPAIR consortium, 1068 German controls from a second and independent set of the Heinz-Nixdorf Recall (HNR) study (University Hospital of Essen) and 19 donors of allogeneic stem cell transplantation from the University of Würzburg (Germany). The ethical committees of these centres approved the study.

Functional analysis of the host immune-related variants
In order to determine the biological function of the most relevant SNPs, cytokine production in response to stimulation was measured in the 500 Functional Genomics cohort from the Human Functional Genomics Project (HFGP; http://www.humanfunctionalgenomics.org/). The Arnhem-Nijmegen Ethical Committee approved the study (42561.091.12) and biological specimens were collected after informed consent was obtained. We investigated whether any SNP was correlated with cytokine levels (IFNγ, IL1Ra, IL1β, IL6, IL8, IL10, TNFα, IL17, and IL22) after stimulation of peripheral blood mononuclear cells (PBMCs), whole blood or monocyte-derived macrophages from 408 healthy subjects with LPS (1 or 100 ng/ml), PHA (10 μg/ml), Pam3Cys (10 μg/ml), and CpG (100 ng/ml). After log transformation, linear regression analyses adjusted for age and sex were used to determine the correlation of selected SNPs with cytokine expression quantitative trait loci (cQTLs). All analyses were performed using R software (www.r-project.org/). In order to account for multiple comparisons, we used a significant threshold of 0.00006, i.e., the quotient of 0.05/ (24 independent SNPs × 9 cytokines × 4 cell stimulants).
Detailed protocols for PBMCs isolation, macrophage differentiation and stimulation assays have been reported elsewhere [15][16][17] . Briefly, PBMCs were washed twice in saline and suspended in medium (RPMI 1640) supplemented with gentamicin (10 mg/ml), L-glutamine (10 mM) and pyruvate (10 mM). PBMC stimulations were performed with 5×10 5 cells/well in round-bottom 96wells plates (Greiner) for 24 h in the presence of 10% human pool serum at 37°C and 5% CO 2 . Supernatants were collected and stored in −20°C until used for ELISA. LPS (100 ng/ml), PHA (10μg/ml) and Pam3Cys (10 μg/ ml) and CpG (100 ng/ml) were used as stimulators for 24 or 48 h. Whole blood stimulation experiments were conducted using 100 μl of heparin blood that was added to a 48 well plate and subsequently stimulated with 400 μl of LPS and PHA (final volume 500ul) for 48 h at 37°C and 5% CO 2 . Supernatants were collected and stored in −20°C until used for ELISA. Concentrations of human TNFα, IFNγ, IL1β, IL1RA, IL6, IL8, IL10, IL17, and IL22 were determined using specific commercial ELISA kits (PeliKine Compact, Amsterdam, or R&D Systems), in accordance with the manufacturer's instructions.

Correlation between steroid hormone levels and immunoregulatory SNPs
Given the impact of steroid hormones in modulating immune responses, we also evaluated the correlation of SNPs with serum levels of 7 steroid hormones (androstenedione, cortisol, 11-deoxy-cortisol, 17-hydroxy progesterone, progesterone, testosterone and 25 hydroxy vitamin D3) in a subset of subjects without hormonal replacement therapy or oral contraceptives (n = 280). Complete protocol details have been reported elsewhere 17 . Steroid hormones were analysed by liquid chromatography tandem-mass spectrometry (LC-MS) after protein precipitation and solid-phase extraction as described in Ter Horst et al. 17 (see also Supplementary Material). Hormone levels and genotyping data were available for a total of 406 subjects. After log transformation, correlation between SNPs and serum steroid hormone levels was evaluated using linear regression adjusted for age and sex in R (http://www.r-project.org/). Significance thresholds were set to 0.0003 (0.05/24 independent SNPs/7 hormones).

Results
This study was conducted in a discovery population comprised of 338 AML patients and 1689 healthy controls. AML patients had a similar age than controls (55.19 ±15.12 vs. 56.91±17.25) and showed a slightly increased male/female ratio compared to healthy controls (1.13 [179/159] vs. 1.07 [871/818]. Ninety five percent of the patients had de novo AML whereas the remaining 5% presented secondary disease evolving from a preceding dysplasia ( Table 1).
The association analysis of the discovery population revealed that 11 immunoregulatory SNPs were associated with AML risk (P < 0.05; Table 3). We found that carriers of the IFNGR2 rs1059293T allele or the IL4 rs2243248G/G , IL13 rs20541T/T , IL13 rs1295686A/A and VEGFA rs998584T/T genotypes showed an increased risk of developing the disease (OR Dom = 1.51, P = 0.0074; OR Rec = 4.33, P = 0.012; OR Rec = 1.98, P = 0.028; OR Rec = 2.16, P = 0.012; and OR Rec = 1.40, P = 0.034). In addition, we observed that each copy of the IL4 rs2243268C allele was associated with a 1.31-fold increased risk of AML (OR Add = 1.31, P = 0.042). On the other hand, we found that carriers of the IL8 rs2227307G and VEGFA rs25648T alleles had a significantly decreased risk of AML (OR Dom = 0.70, P = 0.012 and OR Dom = 0.42, P = 0.00002) whereas each copy of the IL8 rs4073A , CCR5 rs1799987G , CCR5 rs2734648T alleles was associated with~20-25% decreased risk of AML (OR Add = 0.81, P = 0.020; OR Add = 0.82, P = 0.043 and OR Add = 0.75, P = 0.0044). Even though only the association of the VEGFA rs25648 SNP with a decreased risk of developing AML remained significant after correction for multiple testing in the discovery cohort (P Corr = 0.0014), we found that the association of IL8 rs2227307 and IL13 rs1295686 with AML risk was confirmed in the replication population (OR Dom = 0.74, P = 0.040 and OR Dom = 2.24, P = 0.0051, respectively; Table 3). The pooled analysis including 4131 subjects (654 AML cases and 3477 controls) confirmed that carriers of the IL13 rs1295686 genotype had a significantly increased risk of AML (OR Rec = 2.18, P = 0.0002, P Corr = 0.0144) whereas carriers of the IL8 rs2227307G allele had a decreased risk of developing the disease that remained marginally significant after correction for multiple testing (OR Dom = 0.72, P = 0.0010, P Corr = 0.072). Interestingly, although it was not statistically significant in the replication population likely due to the relatively limited power, the pooled analysis also revealed a strong association of the VEGFA rs25648T allele with a decreased risk of AML that largely surpassed the stringent study-wide significant threshold (OR Dom = 0.60, P = 0.0000012, P Corr = 0.00086; Table 3).
In an effort to determine the functional relevance of these polymorphisms, we performed in vitro stimulation experiments in a large cohort of healthy donors to investigate whether IL8, IL13 and VEGFA SNPs could correlate with levels of IFNγ, IL1Ra, IL1β, IL6, IL8, IL10, TNFα, IL17, and IL22 after stimulation of PBMCs, whole blood or monocyte-derived macrophages with LPS, PHA, Pam3Cys, and CpG. These experimental studies revealed that carriers of the IL8 rs2227307T allele had increased levels of IL1β after the stimulation of PBMCs with Pam3Cys (P = 0.00058; Fig. 1a). Although this association did not survive multiple testing correction, these results suggested that this variant might have an impact on AML risk through the modulation of TLR2-immune responses. In support of a functional role of the IL8 rs2227307 SNP in AML, it has been also reported that this SNP represents an eQTL for PF4V (Fig. 1b), a locus involved in chemokine-mediated immune responses. Interestingly, although it neither reached statistical significance after multiple testing correction, we also found a negative correlation between the IL13 rs1295686A allele and levels of IL1Ra after stimulation of PBMCs with LPS (P = 0.002; Fig. 1c), which suggested that the IL13 locus might play a role in the pathogenesis of AML likely through the modulation of IL1Ra-mediated immune responses. No correlation between selected SNPs and serum steroid hormone levels was found suggesting that the functional effect of these markers on the immune responses was not mediated by steroid hormones.

Discussion
AML has been the object of investigations that have demonstrated that host immunity contributes to disease susceptibility. This study reports for the first time an association of the IL13 rs1295686 , IL8 rs2227307, and VEG-FA rs25648 polymorphisms with AML risk. The association of the IL13 and VEGFA SNPs with AML risk remained significant after multiple testing correction, whereas the association of IL8 rs2227307 was not significant but close to the multiple testing significance threshold. These results suggested that the IL13, VEGFA and IL8 loci might be susceptibility markers for AML.
The IL13 gene is located on chromosome 5q31 and encodes for IL13, an immunoregulatory cytokine with pleiotropic functions. Several SNPs (rs20541, rs18000925 and rs1295686) within this gene have been consistently associated, at GWAS level, with immune-related diseases 18,19 and haematological malignancies 20 . In this two-stage case control association study we found a consistent and statistically significant association of the IL13 rs1295686A/A genotype with an increased risk of developing AML that suggested a role of this locus in the pathogenesis of the disease. Mechanistically, we observed a negative correlation between the IL13 rs1295686A allele and IL1Ra levels after stimulation of PBMCs with LPS   (P = 0.002; Fig. 1c). Although this association did not remain significant after correction for multiple testing, this finding supported our genetic results suggesting a role of the IL13 rs1295686 SNP in the pathogenesis of AML. Considering our results but also those from an early report that demonstrated that IL1Ra levels are decreased in AML patients compared to controls 21 , we hypothesise that the effect of the IL13 rs1295686A allele on AML risk might be explained by its role in inhibiting IL1Ra secretion, likely through the inhibition of IL1Ra secretion from either AML blasts or healthy cells. In line with this argument, it has been consistently reported that IL1Ra inhibits AML blast proliferation 22 and that it is associated with the immunosuppressive effect of the mesenchymal stem cells (MSCs) in the bone marrow that accounts for macrophage polarisation (toward the M2 phenotype) and B cell differentiation and survival 23 . Although at this point it is tempting to speculate that the IL13 rs1295686A allele, which correlates with lower levels of IL1Ra secretion, might represent a biomarker with a potential benefit in AML by antagonising IL1 effects on blast proliferation and blocking inflammation, we believe that additional functional experiments are still warranted to explain the exact mechanism by which the IL13 rs1295686 variant influence the risk of AML.
Another interesting finding of this study was the consistent association of the IL8 rs2227307T allele with a decreased risk of developing AML. Although the association of the IL8 rs2227307 SNP with AML risk remained only marginally significant after multiple testing correction, this finding suggested that the IL8 locus might play a role in the pathogenesis of AML. The IL8 gene is located on chromosome 4q12-q21 and encodes for IL8, a chemokine mainly produced by macrophages and epithelial cells. Previous studies have suggested that the blocking IL8-CXCR2 pathway might have a therapeutic potential in a variety of tumours [24][25][26][27] including AML and myelodysplastic syndromes (MDS) 28 . However, the role of IL8 in AML is still scarce. A recent study has demonstrated that IL8 and its receptor are significantly overexpressed in Fig. 1 Functional impact of the IL8 rs2227307 SNP on immune responses. Correlation between the IL8 rs2227307 SNP and IL1β levels after stimulation of PBMCs (n=408) with Pam3Cys (10μg/ml) (a) or PF4V expression in peripheral blood (b) and correlation between the IL13 rs1295686 SNP with IL1Ra levels after stimulation of PBMCs with LPS (100ng/ml) (c). Gene expression plot from the GTEx portal; https://gtexportal.org/home/index.html).
AML and MDS patients 28 and that the expression of these molecules also correlates with poor outcomes. In addition, it has been reported that the IL8-CXCR2 axis is highly expressed in hematopoietic stem cells and progenitor compartments in comparison with healthy controls 28 and that this pathway plays a key role in the regulation of cancer stem cell function [29][30][31] and mesenchymal stem cell-induced T cell proliferation. In addition, Schinke et al. (2015) have experimentally demonstrated that the inhibition of CXCR2 leads to decreased viability and clonogenic capacity of primary cells from AML patients, which pointed towards the use of IL8-CXCR2 pathway as novel therapeutic target 28 . In line with our genetic data and the notion of a role of the IL8 locus in the pathogenesis of AML, we found that carriers of the IL8 rs2227307T allele had increased levels of IL1β after the stimulation of PBMCs with Pam3Cys (P = 0.00058; Fig. 1a). These results suggested that the protective effect of the IL8 rs2227307 SNP on AML risk might be mediated by TLR2induced immune responses that are initially regulating IL1β secretion and, subsequently, IL8 production in a wide range of pathological conditions [32][33][34][35] . Given that the correlation of the IL8 rs2227307 SNP with increased levels of IL1β did not reach the significance threshold after correction for multiple testing, we need to interpret these results with caution. Nonetheless, it worth mentioning that they were in agreement with previous studies showing that TLRs are expressed in multiple AML cell lines and primary AML samples 36 and that stimulation of TLR2 in normal hematopoietic cells led to differentiation and proliferation of hematopoietic stem cells and myeloid progenitor cells. Furthermore, another study proposed a TLR2-binding cellpenetrating peptide as a promising candidate for targeted drug development in AML 37 . In addition to these findings, IL8 rs2227307 has been also reported to be an eQTL for PF4V (Fig. 1b), a locus involved in chemokine-mediated immune responses. These results suggest that the IL8 rs2227307 polymorphism might also influence the risk of AML through chemotaxis stimulation in the microenvironment of the bone marrow (BM). In line with this notion, it has been demonstrated that IL8 is a hypoxia-regulated cytokine that promotes migration in mesenchymal stromal cells in the BM 38 and that both endogenous and hypoxia-induced production of IL8 was higher in AML cases compared to controls and was prognostically unfavourable 38 . A more recent study has also suggested that IL8 blockade might be used as new therapeutic strategy for AML, as it prevents activated endothelial cell mediated proliferation and chemoresistance 39 .
Finally, even though we did not find any functional effect of the VEGFA rs25648 SNP to modulate immune responses, our genetic findings are in line with previous studies reporting an increased vascularity and VEGFA levels in AML patients, and a specific VEGFA-dependent vascular morphology in the leukemic BM 40 . In addition, it has been reported that VEGFA levels are an independent prognostic factor 41 and that they modulate the appearance of graft versus host disease after SCT 42 . Based on the current evidence, we hypothesize that the VEGFA rs25648 SNP might influence the risk of developing AML through changes in BM vascularity and morphology and migration of human leukemia cells.
One of the major strengths of our study is the inclusion of two large populations. In the combined analysis, we had 80% power to detect an odds ratio of 1.33 (α = 0.0007) for a SNP with a frequency of 0.25, which underlined the feasibility of the study design. Another important strength of this study is the development of cytokine stimulation experiments and the measurement of seven serum steroid hormones in a large cohort of healthy subjects, which allowed us to investigate the functional role of the most relevant markers in modulating immune responses but also in determining serological steroid hormone levels. A drawback is the multicentric nature of this study that placed inevitable limitations such as the impossibility of uniformly collect cytogenetic and mutation profiles for a significant set of patients. Another limitation was that age was unknown for a subset of German controls. However, given that selected SNPs have not been linked to survival in AML, we think that age is not a modifying factor that could significantly influence the results.
In conclusion, we identified for the first time IL8, IL13, and VEGFA SNPs as susceptibility biomarkers for AML and provided new insights about the possible role of these loci in modulating innate and adaptive immune responses, and thereby becoming potentially clinical targets for enhancement of the antileukemic effects of immune cells.
Functional data used in this project have been meticulously catalogued and archived in the BBMRI-NL data infrastructure (https://hfgp. bbmri.nl/) using the MOL-GENIS open source platform for scientific data 43 . This allows flexible data querying and download, including sufficiently rich metadata and interfaces for machine processing (R statistics, REST API) and using FAIR principles to optimise Findability, Accessibility, Interoperability and Reusability 44 .