Dear Editor,
Monoclonal gammopathy of undetermined significance (MGUS) is a benign plasma cell disorder, common in the Western population (3–5% ≥50 years) and characterized by an asymptomatic clonal plasma cell expansion [1]. MGUS progresses to multiple myeloma (MM) at a rate of 1% per year [1], but can also progress to light chain amyloidosis (AL amyloidosis), Waldenström macroglobulinemia, and lymphoma. Familial clustering of MGUS or MM support the role for genetic susceptibility [2]. MM and MGUS have shared heritability, with a genetic correlation of 55% and SNP-based heritability estimates of 17% and 15%, respectively (3,4). This suggests a large portion of missing heritability to be identified. Previous genome-wide association studies (GWAS) have successfully identified 24 common loci associated with MM risk [3, 4]; of these, 12 are also associated with MGUS [5]. Twenty additional loci have been identified for risk of MGUS but the impact of these loci on progression is unknown [5, 6]. Identifying additional common variants contributing to MGUS may elucidate the unaccounted missing heritability for both MGUS and MM. Further, understanding genetic determinants of MGUS are important regardless of MM, given the associations of MGUS with multiple conditions, not just MM. In this study, we performed the largest MGUS GWAS meta-analysis to date and validated associations of known MM/MGUS risk variants.
We included four independent GWAS of MGUS patients and controls (European/European–American) from the United States, Sweden, Germany, and the Czech Republic, described elsewhere [5, 6]. Informed consent was obtained through each study. MGUS cases were primarily identified clinically and defined by the internationally accepted criteria of monoclonal protein concentration <30 g/L, <10% monoclonal plasma cells in the bone marrow, normal plasma calcium, and otherwise asymptomatic features like kidney function, no bone destruction, and no anemia (4,6). However, 33% of MGUS cases from Mayo Clinic were identified via screening alone; sensitivity analysis excluding MGUS identified by screening were performed. Genotyping, imputation, population stratification, principal components analysis, and related quality control steps were performed according to established standards (genotype call rate >90% and imputation quality >0.6) [4, 6]. Common single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) > 0.01 within each GWAS were analyzed using SNPTEST or PLINK v1.9. Odds ratios (OR) and 95% confidence intervals (CI) were estimated. Summary statistics from all cohorts were meta-analyzed with META using inverse variance weighted random effects linear regression adjusted for imputation call stability. The choice of the statistical design was inspired by observed deviation in MAFs between different studies post quality control. Statistical significance for the GWAS was defined by the conventional threshold, P < 5.0 × 10−8; SNPs above this value were interrogated further using bionformatic approaches. In addition to GWAS, we performed two analyses including replication of published MM (N = 24) and MGUS (N = 20) risk loci and the association of identified MGUS variants (at P < 1 × 10−6) with progressive MGUS, that is progressed to MM. For the latter, we used a case-control design to compare MGUS that was known to progress to MM to MGUS not known to have progressed. Statistical significance was defined as P < 0.05 for additional analyses. We meta-analyzed the summary statistics generated from these analyses with METAL. Functional annotation of genetic associations was performed using FUMA (https://fuma.ctglab.nl/), Ensembl Release 104 (May 2021) and HaploReg V4.1 (Broadinstitute.org). In FUMA, expression quantitative trait locus (eQTL) analysis was performed based on cis-eQTLs in blood from 31,684 individuals through the eQTLGen Consortium (https://www.eqtlgen.org/cis-eqtls.html). Further, SNP regions were annotated with the chromatin-state segmentation track (ChromHMM) from Roadmap Epigenome data for all blood, T-cell, hematopoietic stem cells, and B cells. Code for the analyses described herein may be requested from authors.
There was a total of 1738 MGUS cases and 3,755 controls of (European/European–American) included in the meta-analysis (Table S1). After standard GWAS quality control, available SNPs for all platforms ranged from 5.3–6.6 million and were included in the meta-analysis (Table S1).
SNP rs195314 mapping intronic to CSNK1E on 22q13.1 was significantly associated with MGUS (OR: 1.35, 95% CI: 1.22–1.49, P = 3.66 × 10−11) and was consistent in both the European and US cohorts (Figs. 1 and 2, Figs. S1A, B, and Table S2). Sensitivity analysis excluding MGUS identified by screening were performed and revealed a similar association with SNP rs195314 (Meta-analysis: OR = 1.37, 95% CI = 1.16–1.62, P = 3.0 × 10−4). A limitation of our study is that family history status is unknown for the majority of MGUS cases (EU and MD Anderson). Only 15% of Mayo Clinic MGUS cases had a positive family history of MM/MGUS, and an exploratory analysis of rs195314 restricted to the familial cases and controls, while under-powered, yielded a similar effect and direction of the association (OR = 1.33, 95% CI = 0.90–1.96, P = 0.15). rs195314 is in strong linkage disequilibrium (LD) (r2 > 0.9) with a cluster of SNPs which are cis-eQTLs identified by eQTLgen (Fig. 2). Indicative of function, SNP rs195314 and SNPs in strong LD are located in chromatin states noted as transcriptional sites across several hematological cell types, including primary mononuclear cells, T cells, B cells, and hematopoietic stem cells from peripheral blood (Fig. 2).
According to Haploreg, rs195314 changes the motif for IRF-6. The core binding site for IRF-6 was defined in the ENCODE analyses as AA(G or C)(T or A)CAA which matches our risk allele (the SNP position is bolded) ATGTCAA in all but the second nucleotide [7]. Binding strengths of protein-DNA interactions can be measured by position weight matrix (PWM) scores which are calculated for probabilities of any of 4 nucleotides residing in the defined positions [8]. Haploreg cites the ENCODE study (and the related transcription factor database) and gives a PWM score of 11.2 for G allele and of −0.7 for the C allele, implying strong binding of IRF-6 to the reference allele only.
Of the 20 previously identified MGUS risk loci identified by Thomsen et al. [6], we replicated (P < 0.05) all but 4 (rs3118053, rs28381958, rs974120, rs10744861) (Table S2). The replication is largely expected given the overlap of cohorts in Thomsen et al. [6] and our study. We also saw associations of MGUS (P < 0.05) with 4 of the 24 known MM risk loci rs4487645 (DNAH11), rs6599192 (ULK4), rs7193541 (RFWD3), rs34562254 (TNFRSF13B) (Table S2). rs195314 is not in linkage disequilibrium (r2 = 0) with the previously described SNP rs877529 on chr22q13.1 in MM [9]. Given this overlap of genetic variation MM and MGUS, these genes may play an important role in MGUS and MM shared etiology.
In the combined cohorts, there were 165 MGUS cases who progressed to MM and 1,079 known not to have progressed (status unknown for remaining). The average median follow-up time for progression from MGUS to MM was 6.5 years (Mayo Clinic = 5.8 years/Germany = 7.2 years). Unfortunately, we did not have follow-up time for Sweden. The average median age of diagnosis for MGUS patients who eventually progressed was 62, and the median age of progression to MM diagnosis was 67. In our exploratory analyses of 18 SNPs significantly associated with MGUS at P < 1.0 × 10−6, we found only one MGUS SNP associated with progression to MM (P < 0.05). SNP rs12401480 allele C was inversely associated with MGUS progression (OR = 0.90, 95% CI: 0.83–0.98). rs12401480 on chromosome 1 maps to the gene TDRD5 and impacts motif changes in Mef2 and ZBTB33 (HaploReg), important in histone modification and methylation. According to the Gene Cards database (GeneCards), TDRD5 is required during spermiogenesis to participate in the repression of transposable elements and prevent their mobilization, which is essential for germline integrity.
None of the other MGUS risk loci identified were associated with progression, including the top MGUS SNP, rs195314 (OR = 1.01, 95% CI: 0.90–1.13, p = 0.86). Further, when rs195314 was examined in a GWAS of MM including 4403 MM cases and 7265 controls [10], the OR was 1.02 (95% CI: 0.82–1.24, p = 0.5). In the same MM GWAS, the OR for rs12401480 was also null (0.99; 0.97–1.01, p = 0.84). Considering the possibility of progression of MGUS to AL amyloidosis, we also interrogated the associations of the MGUS SNPs in a GWAS of 1230 AL amyloidosis patients and 7589 controls, [11] finding no evidence of an association for either rs195314 (OR = 1.01 (0.99–1.02, p = 0.87)) or rs12401480 (OR = 0.96 (0.87–1.06, p = 0.41)). These data suggest that the two SNPs are not associated with progression of MGUS to MM or AL amyloidosis.
IRF-6 mutations have been associated with orofacial clefting disorders, including Van der Woude and popliteal pterygium syndromes and genital anomalies, which may be related to disturbances in epidermal differentiation and barrier functions [12]. IRF-6 is expressed in white blood cells and bone marrow but data on possible hematological functions of IRF-6 are limited. However, the IRF family of transcription factors share structural homology and there is a correlation with DNA binding affinity between IRF-6 and IRF-4; the coefficient is 0.6 [7]. IRF-4 has important regulatory function in MM, including allele-specific regulation of the MYC-interacting gene CDCA7L [13, 14]. Rs4487645, one of the MM SNPs also found associated with MGUS in our meta-analysis, creates a binding site for IRF-4 at an enhancer site to CDCA7L and negatively influences survival in MM patients [13]. IRF-4 has also been shown to be associated with MGUS-associated diseases of Waldenstrom macroglobulinemia/ lymphoplasmacytic lymphoma LPL at 6p25.3 (rs116446171, near IRF-4) [15] and of amyloidosis (AL) at 7p15.3 (rs4487645) [11].
In summary, our MGUS meta-analysis identified a novel locus (rs195314) which may cause a motif change in the binding site of IRF-6, a member of the interferon regulatory transcription factor family. Limited data on MGUS progression suggest that this SNP is not associated with progression to MM or AL amyloidosis. Well-powered studies are needed for validation of these results and to identify further genetic variation contributing to MGUS risk in populations of European and non-European ancestry.
Data availability
Summary statistics are available from the authors.
References
Kyle RA, Therneau TM, Rajkumar SV, Larson DR, Plevak MF, Offord JR, et al. Prevalence of monoclonal gammopathy of undetermined significance. N Engl J Med. 2006;354:1362–9.
Greenberg AJ, Rajkumar SV, Vachon CM. Familial monoclonal gammopathy of undetermined significance and multiple myeloma: epidemiology, risk factors, and biological characteristics. Blood. 2012;119:5359–66.
Went M, Sud A, Forsti A, Halvarsson BM, Weinhold N, Kimber S, et al. Identification of multiple risk loci and regulatory mechanisms influencing susceptibility to multiple myeloma. Nat Commun. 2018;9:3707.
Duran-Lozano L, Thorleifsson G, Lopez de Lapuente Portilla A, Niroula A, Went M, Thodberg M, et al. Germline variants at SOHLH2 influence multiple myeloma risk. Blood Cancer J. 2021;11:76.
Clay-Gilmour AI, Hildebrandt MAT, Brown EE, Hofmann JN, Spinelli JJ, Giles GG, et al. Coinherited genetics of multiple myeloma and its precursor, monoclonal gammopathy of undetermined significance. Blood Adv. 2020;4:2789–97.
Thomsen H, Chattopadhyay S, Weinhold N, Vodicka P, Vodickova L, Hoffmann P, et al. Genome-wide association study of monoclonal gammopathy of unknown significance (MGUS): comparison with multiple myeloma. Leukemia. 2019;33:1817–21.
Kheradpour P, Kellis M. Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments. Nucleic Acids Res. 2014;42:2976–87.
Ambrosini G, Vorontsov I, Penzar D, Groux R, Fornes O, Nikolaeva DD, et al. Insights gained from a comprehensive all-against-all transcription factor binding motif benchmarking study. Genome Biol. 2020;21:114.
Chubb D, Weinhold N, Broderick P, Chen B, Johnson DC, Forsti A, et al. Common variation at 3q26.2, 6p21.33, 17p11.2 and 22q13.1 influences multiple myeloma risk. Nat Genet. 2013;45:1221–5.
Chattopadhyay S, Thomsen H, Weinhold N, Meziane I, Huhn S, da Silva Filho MI, et al. Eight novel loci implicate shared genetic etiology in multiple myeloma, AL amyloidosis, and monoclonal gammopathy of unknown significance. Leukemia. 2020;34:1187–91.
da Silva Filho MI, Försti A, Weinhold N, Meziane I, Campo C, Huhn S, et al. Genome-wide association study of immunoglobulin light chain amyloidosis in three patient cohorts: comparison to myeloma. Leukemia. 2017;31:1735–42.
Oberbeck N, Pham VC, Webster JD, Reja R, Huang CS, Zhang Y, et al. The RIPK4-IRF6 signalling axis safeguards epidermal differentiation and barrier function. Nature. 2019;574:249–53.
Li N, Johnson DC, Weinhold N, Studd JB, Orlando G, Mirabella F, et al. Multiple myeloma risk variant at 7p15.3 creates an IRF4-binding site and interferes with CDCA7L expression. Nat Commun. 2016;7:13656.
Weinhold N, Meissner T, Johnson DC, Seckinger A, Moreaux J, Forsti A, et al. The 7p15.3 (rs4487645) association for multiple myeloma shows strong allele-specific regulation of the MYC-interacting gene CDCA7L in malignant plasma cells. Haematologica. 2015;100:e110–3.
McMaster ML, Berndt SI, Zhang J, Slager SL, Li SA, Vajdic CM, et al. Two high-risk susceptibility loci at 6p25.3 and 14q32.13 for Waldenstrom macroglobulinemia. Nat Commun. 2018;9:4182.
Acknowledgements
We acknowledge the patients who provided samples for this study. KH was supported by the European Union’s Horizon 2020 research and innovation program, grant No 856620. AF was supported by the German Jose Carreras Leukemia Foundation. The work of the Houlston laboratory is supported by grants from Myeloma UK and Cancer Research UK (C1298/A8362). CV, SVR and SK are supported by Mayo Clinic Cancer Center, P50 CA186781, R01 CA168762. ACG was supported by U01 CA257679 and R25 CA092049.
Author information
Authors and Affiliations
Contributions
Conception and responsible for overall content: CV, KH Data collection and study design: MH, HT, NW, PV, LV, PH, MN, KHJ, BS, CL, RH, GH, UPK, CO, FS, RH, HG, EM, AN, SK, SVR, AF, CV, KH. Statistical analysis: ACG, SC, SS Manuscript drafting: ACG, SC, CV, KH. Manuscript editing and approval: All.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Clay-Gilmour, A., Chattopadhyay, S., Hildebrandt, M.A.T. et al. Genome-wide meta-analysis of monoclonal gammopathy of undetermined significance (MGUS) identifies risk loci impacting IRF-6. Blood Cancer J. 12, 60 (2022). https://doi.org/10.1038/s41408-022-00658-w
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41408-022-00658-w