Germline variants at SOHLH2 influence multiple myeloma risk

Duran-Lozano, Laura; Thorleifsson, Gudmar; Lopez de Lapuente Portilla, Aitzkoa; Niroula, Abhishek; Went, Molly; Thodberg, Malte; Pertesi, Maroulio; Ajore, Ram; Cafaro, Caterina; Olason, Pall I.; Stefansdottir, Lilja; Bragi Walters, G.; Halldorsson, Gisli H.; Turesson, Ingemar; Kaiser, Martin F.; Weinhold, Niels; Abildgaard, Niels; Andersen, Niels Frost; Mellqvist, Ulf-Henrik; Waage, Anders; Juul-Vangsted, Annette; Thorsteinsdottir, Unnur; Hansson, Markus; Houlston, Richard; Rafnar, Thorunn; Stefansson, Kari; Nilsson, Björn

doi:10.1038/s41408-021-00468-6

Download PDF

Article
Open access
Published: 19 April 2021

Germline variants at SOHLH2 influence multiple myeloma risk

Laura Duran-Lozano ORCID: orcid.org/0000-0003-3557-5018¹,
Gudmar Thorleifsson²,
Aitzkoa Lopez de Lapuente Portilla¹,
Abhishek Niroula^1,3,
Molly Went⁴,
Malte Thodberg¹,
Maroulio Pertesi¹,
Ram Ajore¹,
Caterina Cafaro¹,
Pall I. Olason²,
Lilja Stefansdottir²,
G. Bragi Walters ORCID: orcid.org/0000-0002-5415-6487²,
Gisli H. Halldorsson ORCID: orcid.org/0000-0001-7067-9862²,
Ingemar Turesson⁵,
Martin F. Kaiser ORCID: orcid.org/0000-0002-3677-4804⁴,
Niels Weinhold⁶,
Niels Abildgaard⁷,
Niels Frost Andersen⁸,
Ulf-Henrik Mellqvist⁹,
Anders Waage¹⁰,
Annette Juul-Vangsted ORCID: orcid.org/0000-0002-2131-731X¹¹,
Unnur Thorsteinsdottir^2,12,
Markus Hansson ORCID: orcid.org/0000-0002-7715-4548^1,5,
Richard Houlston ORCID: orcid.org/0000-0002-5268-0242⁴,
Thorunn Rafnar ORCID: orcid.org/0000-0003-0491-7046²,
Kari Stefansson^2,12 &
…
Björn Nilsson ORCID: orcid.org/0000-0001-5542-0254^1,3

Blood Cancer Journal volume 11, Article number: 76 (2021) Cite this article

2099 Accesses
6 Citations
17 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 16 November 2021

This article has been updated

Abstract

Multiple myeloma (MM) is caused by the uncontrolled, clonal expansion of plasma cells. While there is epidemiological evidence for inherited susceptibility, the molecular basis remains incompletely understood. We report a genome-wide association study totalling 5,320 cases and 422,289 controls from four Nordic populations, and find a novel MM risk variant at SOHLH2 at 13q13.3 (risk allele frequency = 3.5%; odds ratio = 1.38; P = 2.2 × 10⁻¹⁴). This gene encodes a transcription factor involved in gametogenesis that is normally only weakly expressed in plasma cells. The association is represented by 14 variants in linkage disequilibrium. Among these, rs75712673 maps to a genomic region with open chromatin in plasma cells, and upregulates SOHLH2 in this cell type. Moreover, rs75712673 influences transcriptional activity in luciferase assays, and shows a chromatin looping interaction with the SOHLH2 promoter. Our work provides novel insight into MM susceptibility.

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Saori Sakaue, Kathryn Weinand, … Soumya Raychaudhuri

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Austin D. Reed, Sara Pensa, … Walid T. Khaled

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

Introduction

Multiple myeloma (MM) is characterized by an uncontrolled, clonal expansion of plasma cells in the bone marrow, producing a monoclonal immunoglobulin (“M protein”). Clinically, MM is complicated by bone marrow and kidney failure, hypercalcemia, and lytic bone lesions¹. MM is preceded by monoclonal gammopathy of undetermined significance (MGUS)^2,3, detectable in 3% of individuals older than 50 years⁴, which progresses to MM at an annual rate of about 1%⁵.

Epidemiological studies have shown that first-degree relatives of patients with MM have 2–4 times higher risk of MM and MGUS^6,7,8, as well as an increased risk of chronic lymphocytic leukemia, lymphomas, and certain solid tumours^9,10,11. Genome-wide association studies (GWAS) have identified inherited sequence variants at 24 loci that influence MM risk and sequencing studies of familial cases have implicated candidate genes where rare MM-predisposing variants might reside¹². Still, identified loci explain less than 20% of the estimated heritability¹³.

To advance our understanding of MM genetics, we carried out a GWAS based on cases and controls from four Nordic populations, with follow-up in additional data sets of European ancestry. We detect a novel MM risk locus at 13q13.3, spanning the SOHLH2 gene. Dissecting the functional architecture, we identify a putative causal variant within the linkage disequilibrium (LD) block that likely acts by upregulating SOHLH2 in plasma cells.

Methods

Sample sets

We carried out a GWAS totalling 5,320 MM and non-IgM MGUS cases and 422,293 controls recruited from Denmark, Iceland, Norway and Sweden (Tables 1 and 2); Swedish series 2,338 MM cases from the Swedish National MM Biobank (Skåne University Hospital, Lund) and 11,971 Swedish controls (random blood donors and primary care patients) genotyped within an ongoing GWAS on blood cell traits (Lopez de Lapuente Portilla et al., ongoing). Danish series 940 MM cases from Rigshospitalet in Copenhagen, Odense University Hospital, and Aarhus University Hospital and 91,744 controls from the Danish Blood Donor Study¹⁴; Norwegian series 500 MM cases from the Norwegian MM Biobank (St. Olavs Hospital-Trondheim University Hospital) and 4,696 blood donor controls (Oslo University Hospital, Ullevål Hospital); Icelandic series 598 MM cases, 944 non-IgM MGUS cases and 313,882 controls from the deCODE Genetics database¹⁵. Information on cases with MM and non-IgM MGUS was obtained from Landspitali – The University Hospital of Iceland and the Icelandic Cancer Registry¹⁶. The MGUS cases are individuals with MGUS who have not yet progressed to MM according to the Icelandic Cancer Registry.

Table 1 Association data for the 13q13 association signal.

Full size table

Table 2 Characteristics of study populations.

Full size table

For follow-up, we analysed: (1) an additional 473 MM cases from the Swedish National MM Biobank and 3,430 Swedish controls (a second cohort of random blood donors and primary care patients) from an ongoing GWAS on blood cell traits (Lopez de Lapuente Portilla et al., ongoing); (2) pre-existing GWAS data from the United Kingdom, Germany, USA and the Netherlands¹³ (Table 1).

All samples were collected with informed consent and ethical approval (Lund University Ethical Review Board, 2013/54; Rigshospitalet Ethical Committee no. 69466; Icelandic National Bioethics Committee ref. 17–143; Regional Committee for Medical and Health Research Ethics, Trondheim 2014/97; Regional Committee for Medical and Health Research Ethics, Oslo), and in accordance with the principles of the Declaration of Helsinki.

Genotyping and imputation

Swedish, Danish and Norwegian samples were genotyped with Illumina single-nucleotide polymorphism microarrays and phased together with 442,737 samples from North-Western Europe using Eagle2¹⁷. Samples and variants with <98% yield were excluded. We used the same methods as used for the Icelandic data^15,18 to create a haplotype reference panel by phasing the whole-genome sequence (WGS) genotypes for 15,575 individuals from North-Western Europe, including 3,012 Swedish, 8,429 Danish and 2,550 Norwegian samples, together with the phased microarray data, and to impute the genotypes from the haplotype reference panel into the phased microarray data.

Sample preparation and WGS of 49,962 Icelanders are previously reported^15,19. Briefly, 37.6 million sequence variants were identified by WGS in 49,962 Icelanders using Illumina technology to a mean depth of at least 18×. SNPs and indels were identified and their genotypes were called jointly using Graphtyper²⁰. In addition, over 165,000 Icelanders, including all those with WGS data, have been microarray-genotyped and long-range phased¹⁸, improving genotype calls using information based on haplotype sharing. The genotypes of the high-quality sequence variants were imputed into the microarray-typed Icelanders²¹. To increase the sample size and power to detect associations, the sequence variants were also imputed into relatives of the microarray-typed using genealogic information. All tested variants had imputation information > 0.8. Variants were mapped to hg38 and matched on position and alleles to harmonize the four data sets. rs145374408, rs78351393 and rs17202418 were genotyped in the Swedish follow-up sample set.

Ancestry analysis

Genetic ancestry analysis was done in two stages for the Danish, Swedish and Norwegian sample sets separately. Firstly, ADMIXTURE v1.23²² was run in supervised mode with 1000 Genomes populations CEU, CHB, and YRI²³ as training samples and Danish, Swedish or Norwegian individuals as test samples. Input data for ADMIXTURE had long-range LD regions removed²⁴ and was then LD-pruned with PLINK v.190b3a²⁵ using the–indep-pairwise 200 25 0.3 option. Samples with <0.9 CEU ancestry were excluded. Secondly, remaining samples were projected onto a principal component analysis (PCA), calculated with an in-house European reference panel to calculate the 20 first principal components for each population. UMAP²⁶ was used to reduce the coordinates of test samples on 20 principal components to two dimensions. Additional European samples not in the original reference set were also projected onto the PCA and UMAP components to identify ancestries represented in the clusters, and samples with Swedish, Danish and Norwegian ancestries were identified.

Association testing

We performed logistic regression in the Icelandic, Swedish, Danish and Norwegian data set separately to test for association between MM and genotypes using deCODE software¹⁵. In the Danish, Swedish and Norwegian association analysis, we adjusted for gender, whether the individual had been microarray-typed and/or sequenced, and the first 20 principal components. In the Icelandic association analysis, we adjusted for gender, county-of-origin, current age or age at death, blood sample availability for the individual, and an indicator function for the overlap of the lifetime of the individual with the time span of phenotype collection. We used LD score regression to account for distribution inflation due to cryptic relatedness and population stratification²⁷.

For the meta-analysis, we used a fixed-effects inverse variance method²⁸. Of note, using a large number of controls, primarily in the Icelandic and Danish data sets, will not bias the results for individual data sets as it only provides more accurate estimate of the allelic frequency in the control group and hence increases power. The inverse variance method used to combine effect size estimates, in essence, weights effects by sample size through the use of corresponding standard errors. This meta-analysis method is well recognized and will not bias results when the ratio of cases to controls is unequal²⁹.

Genome-wide significance was determined using class-based Bonferroni significance thresholds for about 33 million variants. Sequence variants were split into five classes based on their genome annotation, and the significance threshold for each class was based on the number of variants in that class³⁰. The adjusted significance thresholds used are 2.49 × 10⁻⁷ for variants with high impact (including stop-gained and stop-loss, frameshift, splice acceptor or donor and initiator codon variants), 4.97 × 10⁻⁸ for variants with moderate impacts (including missense, splice-region variants, inframe deletions and insertions), 4.52 × 10⁻⁹ for low-impact variants (including synonymous, 3′ and 5′ UTR, and upstream and downstream variants), 2.26 × 10⁻⁹ for deep intronic and intergenic variants in DNase I hypersensitivity sites and 7.53 × 10⁻¹⁰ for all other variants, including those in intergenic regions.

eQTL analysis

To identify expression quantitative locus (eQTL) effects in plasma cells, we used previously published RNA-seq data for CD138⁺ immunomagnetic bead-enriched MM plasma cells³¹. To test for association, we used linear regression with and without 10 principal components as covariates. To identify eQTLs in blood, we used whole-blood RNA-seq from 13,127 Icelanders³². Gene expression was quantified based on transcript abundances estimates using Kallisto³³. Association between sequence variants and gene expression was calculated using generalized linear regression³⁴. The additive genetic effect was assumed and quantile-normalized gene expression estimates were calculated while adjusting for sequencing artefacts, demography variables, and hidden factors³⁵. Finally, as a complement to the Icelandic data, we used data from the eQTLGen database (www.eqtlgen.org)³⁶.

ATAC-seq data for blood cell populations

Sequencing reads for published ATAC-seq libraries from sorted hematopoietic cell types were downloaded from https://atac-blood-hg38.s3.amazonaws.com/hg38/ using the rtracklayer R package³⁷, and processed using hg38 as a reference genome.

Chromatin immunoprecipitation sequencing

As a complementary approach to identify variants located in genomic regions with regulatory activity in plasma cells, we analyzed previously published³¹ chromatin immunoprecipitation sequencing (ChIP-seq) data for the H3K4me3 histone modification³⁸. Briefly, L363 cells (DSMZ) were cross-linked with 1% paraformaldehyde (ThermoFisher, #28908). DNA was sonicated into 200–400 bp fragments (Bioruptor Pico Sonication System, Diagenode, Belgium). For pull-down, we used 1–10 μg of H3K4me3 antibody (Millipore, #04-745). Fragments were de-cross-linked and purified (Zymogen, #D5205). ChIP-seq libraries were prepared using the ThruPLEX DNA-seq Kit (Rubicon Genomics, #R400406) and sequenced on Illumina HiSeq 2500 sequencer (paired-end; 2 × 125 cycles). De-multiplexing and generation of FASTQ files was performed using bcl2fastq v.1.8. FastQC (v0.11.5)³⁹ was used to assess read quality low-quality bases were removed using Trimmomatic (v.0.36)^40,41 prior to alignment. using Bowtie2 (v.2.3.0)⁴¹. Coverage in 50 bp over the SOHL2 region was calculated with the GenomicAlignments and GenomicRanges R-packages⁴² (coverage and binnedAverage functions) and scaled to Counts-per-million (CPM) relative to the total number of reads per library.

Luciferase assays

Luciferase constructs representing the reference and alternative allele of rs75712673 were made by cloning 120-bp genomic sequences (Integrated DNA Technologies) centered on the variant into the pGL3-basic vector. Using electroporation (Neon Transfection system; Life technologies, USA) constructs were co-transfected with renilla plasmid into L363 and OPM2 cells (DSMZ). Twenty hours after electroporation, luciferase and renilla activity was measured using DualGlo Luciferase (Promega no. E1960) on a GLOMAX 20/20 Luminometer (Promega, USA). Based on luciferase/renilla readings, we calculated log₂ scores for each variant.

Transcription factor motif analysis

To identify differentially binding transcription factors, we used the PERFECTOS-APE tool (http://opera.autosome.ru/perfectosape) with the HOCOMOCO-10, JASPAR, HT-SELEX, SwissRegulon and HOMER motif databases.

Figure generation

Region plots were generated using tidyGenomeBrowser (https://github.com/MalteThodberg/tidyGenomeBrowser). Transcript models were obtained via TxDb.Hsapiens.UCSC.hg38.knownGene and org.Hs.eg.db packages⁴³ and collapsed to meta gene models with the exonsBy-function.

Results

Identification of a novel MM risk locus at 13q13

To find new MM risk loci, we carried out a GWAS based on four case-control data sets from Iceland, Denmark, Norway and Sweden totalling 5,320 MM patients and 422,289 controls (Table 1). We performed association testing in the four data sets separately, and combined the resulting statistics for 33 million variants that passed quality filtering. Two versions of the Icelandic case-control data were used for meta-analysis: one with MM patients only, and one that was expanded with non-IgM MGUS patients to increase power (Table 1). The latter is motivated because MM evolves from MGUS, relatives of MGUS patients have increased MM risk, and several studies support pleiotropy between MM and MGUS⁴⁴.

Our analysis identified genome-wide significant association signals at 10 loci, and all previously reported MM lead variants were nominally significant with effects in the same direction as in the discovery studies (Supplementary Table 1). Nine of the genome-wide significant signals correspond to signals correspond to known MM risk loci. In addition, a previously unreported low-frequency variant at 13q13.3 (lead variant rs200203825; RAF ̴ 3,5%; combined P = 2.65 × 10⁻¹⁰ with Icelandic MGUS cases; Table 1) showed significant association. This signal is represented by a haplotype of 14 non-coding variants in high LD (r² > 0.8) spanning the SOHLH2 gene (spermatogenesis and oogenesis specific basic helix-loop-helix 2; Supplementary Table 2). The detected variants showed comparable effects in the same direction in all four discovery sets (Table 1). The conditional analysis did not reveal any additional independent signals. For follow-up, we genotyped an additional 473 MM cases and 3,430 controls from Sweden, and also looked for association in published MM association data sets from the United Kingdom, Germany, Holland and USA (Table 1). The association with MM was nominally significant in three of the six follow-up data sets, including in the two largest series from the UK and Germany. For all the series the effects were in the same direction as in our discovery GWAS.

Functional annotation of the 13q13.3 association

Because non-coding variants generally act by altering the regulation of gene expression, we sought to identify candidate causal variants responsible for the 13q13.3 association based on epigenetic features associated with regulatory activity.

Firstly, ATAC-seq data from 17 blood cell subpopulations showed that rs75712673 (r² = 0.985 with rs200203825 in Swedes) maps to a genomic region selectively open in plasma cells (Fig. 1d). Secondly, consistent with this, chromatin immunoprecipitation and sequencing (ChIP-seq) data for the MM plasma cell line L363, showed that rs75712673 maps to a H3K4me3 histone mark (Fig. 1e). Collectively, these data are consistent with the 13q13.3 association affecting regulatory activity at rs75712673 in plasma cells.

**Fig. 1: Region plots of the 13q13.3 association.**

To identify a putative target gene, we carried out eQTL analysis using RNA-seq data for CD138⁺ plasma cell isolated from the bone marrow of 188 MM patients³¹ (Fig. 2a). We identified an association between rs75712673 and SOHLH2 expression (Wilcoxon rank sum test P = 0.0081 for heterozygotes versus major allele homozygotes without principal components; P = 0.043 with 10 expression and 10 genotype principal component covariates). By contrast, we did not detect a plasma cell-specific eQTL for any of the nearby genes SPG20, DLCK1, CCDC169, or CCNA1, nor a SOHLH2 eQTL in peripheral blood. SOHLH2 is normally expressed in hematopoietic stem and progenitor cells, with its expression in B-cells and plasma cells being low (Supplementary Fig. 1).

**Fig. 2: Candidate variant rs75712673 eQTL effect on *SOHLH2*.**

Further, consistent with the eQTL, luciferase analysis of rs75712673 in MM plasma cell lines L363 and OPM2 showed higher luciferase activity for the rs75712673-G risk allele relative to the reference allele (Fig. 2b). Computational motif analysis predicted that rs75712673-G alters multiple transcription factor motifs, including multiple members of the FOX and SOX families, IKZF2, and EN1 (Supplementary Table 3). Finally, using GeneHancer⁴⁵ promoter-capture HiC data, we detected a chromatin looping interaction between rs75712673 and the SOHLH2 promoter (Fig. 1c).

Discussion

In conclusion, we have identified a new genetic association for MM at SOHLH2, increasing the number of risk loci to 25. This locus was not significant in our recent six-center meta-analysis totalling 9,974 cases. The likely reason was that the Went et al. study combined data from different geographic populations, and, moreover, that the imputation was done using non-population-matched reference genomes. By contrast, the present study was done in homogenous populations of Nordic ancestry and the imputation was done using population-matched reference genomes, increasing the power to detect low-frequency variants such as the one at SOHLH2.

Interestingly, SOHLH2 encodes a transcription factor with a basic helix-loop-helix domain that has previously been implicated in spermatogenesis⁴⁶ and development of breast and ovarian cancer⁴⁷, but is normally expressed only at a low lever in plasma cells. To explore the mechanism underlying the SOHLH2 association, we functionally fine-mapped the 13q13.3 signal, and identified rs75712673 as a likely causal variant that upregulates SOHLH2 in plasma cells. Our findings provide novel insight into the molecular basis of inherited MM susceptibility.

Change history

16 November 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41408-021-00575-4

References

Rajkumar, S. V. et al. International Myeloma Working Group updated criteria for the diagnosis of multiple myeloma. Lancet Oncol. 15, e538–e548 (2014).
PubMed Google Scholar
Greenberg, A. J., Rajkumar, S. V. & Vachon, C. M. Familial monoclonal gammopathy of undetermined significance and multiple myeloma: epidemiology, risk factors, and biological characteristics. Blood 119, 5359–5366 (2012).
CAS PubMed PubMed Central Google Scholar
González-Calle, V. & Mateos, M. V. Monoclonal gammopathies of unknown significance and smoldering myeloma: assessment and management of the elderly patients. Eur. J. Intern. Med. 58, 57–63 (2018).
PubMed Google Scholar
Kyle, R. A. et al. Prevalence of monoclonal gammopathy of undetermined significance. N. Engl. J. Med. 354, 1362–1369 (2006).
CAS PubMed Google Scholar
Mouhieddine, T. H., Weeks, L. D. & Ghobrial, I. M. Monoclonal gammopathy of undetermined significance. Blood 133, 2484–2494 (2019).
CAS PubMed Google Scholar
Altieri, A., Chen, B., Bermejo, J. L., Castro, F. & Hemminki, K. Familial risks and temporal incidence trends of multiple myeloma. Eur. J. Cancer 42, 1661–1670 (2006).
PubMed Google Scholar
Vachon, C. M. et al. Increased risk of monoclonal gammopathy in first-degree relatives of patients with multiple myeloma or monoclonal gammopathy of undetermined significance. Blood 114, 785–790 (2009).
CAS PubMed PubMed Central Google Scholar
Clay-Gilmour, A. I. et al. Coinherited genetics of multiple myeloma and its precursor, monoclonal gammopathy of undetermined significance. Blood Adv. 4, 2789–2797 (2020).
CAS PubMed PubMed Central Google Scholar
Chubb, D. et al. Common variation at 3q26.2, 6p21.33, 17p11.2 and 22q13.1 influences multiple myeloma risk. Nat. Genet. 45, 1221–1225 (2013).
CAS PubMed PubMed Central Google Scholar
Frank, C. et al. Search for familial clustering of multiple myeloma with any cancer. Leukemia 30, 627–632 (2016).
CAS PubMed Google Scholar
Read, J. et al. Increased incidence of bladder cancer, lymphoid leukaemia, and myeloma in a cohort of Queensland melanoma families. Fam. Cancer 15, 651–663 (2016).
CAS PubMed Google Scholar
Pertesi, M. et al. Genetic predisposition for multiple myeloma. Leukemia 34, 697–708 (2020).
PubMed Google Scholar
Went, M. et al. Identification of multiple risk loci and regulatory mechanisms influencing susceptibility to multiple myeloma. Nat. Commun. 9, 1–10 (2018).
CAS Google Scholar
Pedersen, O. B. et al. The Danish Blood Donor Study: a large, prospective cohort and biobank for medical research. Vox Sang. 102, 271 (2012).
CAS PubMed Google Scholar
Gudbjartsson, D. F. et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 47, 435–444 (2015).
CAS PubMed Google Scholar
Sigurdardottir, L. G. et al. Data quality at the Icelandic cancer registry: comparability, validity, timeliness and completeness. Acta Oncol. 51, 880–889 (2012).
PubMed Google Scholar
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
CAS PubMed PubMed Central Google Scholar
Kong, A. et al. Detection of sharing by descent, long-range phasing and haplotype imputation. Nat. Genet. 40, 1068–1075 (2008).
CAS PubMed PubMed Central Google Scholar
Jónsson, H. et al. Whole genome characterization of sequence diversity of 15,220 Icelanders. Sci. Da 4, 170115 (2017).
Google Scholar
Eggertsson, H. P. et al. Graphtyper enables population-scale genotyping using pangenome graphs. Nat. Genet. 49, 1654–1660 (2017).
CAS PubMed Google Scholar
Gudbjartsson, D. F. et al. Sequence variants from whole genome sequencing a large group of Icelanders. Sci. Data 2, 150011 (2015).
PubMed PubMed Central Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
CAS PubMed PubMed Central Google Scholar
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
PubMed Google Scholar
Price, A. L. et al. Long-range LD Can Confound Genome Scans in Admixed Populations. Am. J. Hum. Genet. 83, 132–135 (2008).
CAS PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
CAS PubMed PubMed Central Google Scholar
Diaz-Papkovich, A., Anderson-Trocmé, L. & Gravel, S. A review of UMAP in population genetics. J. Hum. Genet. https://doi.org/10.1038/s10038-020-00851-4 (2020).
Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
CAS PubMed PubMed Central Google Scholar
Mantel, N. & Haenszel, W. Statistical aspects of the analysis of data from retrospective studies of disease. J. Natl Cancer Inst. 22, 719–748 (1959).
CAS PubMed Google Scholar
Begum, F., Ghosh, D., Tseng, G. C. & Feingold, E. Comprehensive literature review and statistical considerations for GWAS meta-analysis. Nucleic Acids Res. 40, 3777–3784 (2012).
CAS PubMed PubMed Central Google Scholar
Sveinbjornsson, G. et al. Weighting sequence variants based on their annotation increases power of whole-genome association studies. Nat. Genet. 48, 314–317 (2016).
CAS PubMed Google Scholar
Ali, M. et al. The multiple myeloma risk allele at 5q15 lowers ELL2 expression and increases ribosomal gene expression. Nat. Commun. 9, 1649 (2018).
PubMed PubMed Central Google Scholar
Saevarsdottir, S. et al. FLT3 stop mutation increases FLT3 ligand level and risk of autoimmune thyroid disease. Nature 584, 619–623 (2020).
CAS PubMed Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
CAS PubMed Google Scholar
Oskarsson, G. R. et al. A truncating mutation in EPOR leads to hypo-responsiveness to erythropoietin with normal haemoglobin. Commun. Biol. 1, 1–7 (2018).
Google Scholar
Stegle, O., Parts, L., Durbin, R. & Winn, J. A bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput. Biol. 6, 1–11 (2010).
Google Scholar
van der Wijst, M. G. P. et al. The single-cell eQTLGen consortium. Elife 9, 1–21 (2020).
Google Scholar
Lawrence, M., Gentleman, R. & Carey, V. rtracklayer: An R package for interfacing with genome browsers. Bioinformatics 25, 1841–1842 (2009).
CAS PubMed PubMed Central Google Scholar
Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).
CAS PubMed Google Scholar
Wingett, S. W. & Andrews, S. Fastq screen: a tool for multi-genome mapping and quality control [version 1; referees: 3 approved, 1 approved with reservations]. F1000Research 7, 1338 (2018).
PubMed PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
CAS PubMed PubMed Central Google Scholar
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, 1–10 (2013).
Google Scholar
Huber, W. et al. Orchestrating high-throughput genomic analysis with bioconductor. Nat. Methods 12, 115–121 (2015).
CAS PubMed PubMed Central Google Scholar
Landgren, O. et al. Risk of plasma cell and lymphoproliferative disorders among 14 621 first-degree relatives of 4458 patients with monoclonal gammopathy of undetermined significance in Sweden. Blood 114, 791–795 (2009).
CAS PubMed PubMed Central Google Scholar
Fishilevich, S. et al. GeneHancer: genome-wide integration of enhancers and target genes in GeneCards. Database 2017, 1–17 (2017).
Google Scholar
Hao, J. et al. Sohlh2 knockout mice are male-sterile because of degeneration of differentiating Type A spermatogonia. Stem Cells 26, 1587–1597 (2008).
CAS PubMed Google Scholar
Zhang, H. et al. Sohlh2 inhibits ovarian cancer cell proliferation by upregulation of p21 and downregulation of cyclin D1. Carcinogenesis 35, 1863–1871 (2014).
CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by grants from the Knut and Alice Wallenberg Foundation (2012.0193 and 2017.0436), the Swedish Research Council (2017-02023), the Swedish Cancer Society (2017/265), Stiftelsen Borås Forsknings- och Utvecklingsfond mot Cancer, the Nordic Cancer Union (R217-A13329-18-S65), EU-MSCA-COFUND 754299 CanFaster, the Myeloma UK and Cancer Research UK (C1298/A8362), a Jacquelin Forbes-Nixon Fellowship, and Mr. Ralph Stockwell. We thank Ellinor Johnsson and Anna Collin for their assistance. We are indebted to the clinicians and patients who contributed samples. Open access funding provided by Lund University.

Author information

Authors and Affiliations

Hematology and Transfusion Medicine, Department of Laboratory Medicine, 221 84, Lund, Sweden
Laura Duran-Lozano, Aitzkoa Lopez de Lapuente Portilla, Abhishek Niroula, Malte Thodberg, Maroulio Pertesi, Ram Ajore, Caterina Cafaro, Markus Hansson & Björn Nilsson
deCODE genetics, Sturlugata 8, IS-101, Reykjavik, Iceland
Gudmar Thorleifsson, Pall I. Olason, Lilja Stefansdottir, G. Bragi Walters, Gisli H. Halldorsson, Unnur Thorsteinsdottir, Thorunn Rafnar & Kari Stefansson
Broad Institute, 415 Main Street, Cambridge, MA, 02124, USA
Abhishek Niroula & Björn Nilsson
Division of Genetics and Epidemiology, The Institute of Cancer Research, 123 Old Brompton Road, London, SW7 3RP, UK
Molly Went, Martin F. Kaiser & Richard Houlston
Hematology Clinic, Lund University Hospital, 221 85, Lund, Sweden
Ingemar Turesson & Markus Hansson
Department of Internal Medicine V, University Hospital of Heidelberg, 69120, Heidelberg, Germany
Niels Weinhold
Hematology Research Unit, Department of Clinical Research, University of Southern Denmark and Department of Hematology, Odense University Hospital, Odense, Denmark
Niels Abildgaard
Department of Haematology, Aarhus University Hospital, 8200, Aarhus N, Denmark
Niels Frost Andersen
Södra Älvsborgs Sjukhus Borås, Borås, Sweden
Ulf-Henrik Mellqvist
Institute of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Department of Hematology, and Biobank1, St Olavs hospital, Trondheim, Norway
Anders Waage
Department of Haematology, University Hospital of Copenhagen at Rigshospitalet, Blegdamsvej 9, DK-2100, Copenhagen, Denmark
Annette Juul-Vangsted
Faculty of Medicine, University of Iceland, Reykjavik, Iceland
Unnur Thorsteinsdottir & Kari Stefansson

Authors

Laura Duran-Lozano
View author publications
You can also search for this author in PubMed Google Scholar
Gudmar Thorleifsson
View author publications
You can also search for this author in PubMed Google Scholar
Aitzkoa Lopez de Lapuente Portilla
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Niroula
View author publications
You can also search for this author in PubMed Google Scholar
Molly Went
View author publications
You can also search for this author in PubMed Google Scholar
Malte Thodberg
View author publications
You can also search for this author in PubMed Google Scholar
Maroulio Pertesi
View author publications
You can also search for this author in PubMed Google Scholar
Ram Ajore
View author publications
You can also search for this author in PubMed Google Scholar
Caterina Cafaro
View author publications
You can also search for this author in PubMed Google Scholar
Pall I. Olason
View author publications
You can also search for this author in PubMed Google Scholar
Lilja Stefansdottir
View author publications
You can also search for this author in PubMed Google Scholar
G. Bragi Walters
View author publications
You can also search for this author in PubMed Google Scholar
Gisli H. Halldorsson
View author publications
You can also search for this author in PubMed Google Scholar
Ingemar Turesson
View author publications
You can also search for this author in PubMed Google Scholar
Martin F. Kaiser
View author publications
You can also search for this author in PubMed Google Scholar
Niels Weinhold
View author publications
You can also search for this author in PubMed Google Scholar
Niels Abildgaard
View author publications
You can also search for this author in PubMed Google Scholar
Niels Frost Andersen
View author publications
You can also search for this author in PubMed Google Scholar
Ulf-Henrik Mellqvist
View author publications
You can also search for this author in PubMed Google Scholar
Anders Waage
View author publications
You can also search for this author in PubMed Google Scholar
Annette Juul-Vangsted
View author publications
You can also search for this author in PubMed Google Scholar
Unnur Thorsteinsdottir
View author publications
You can also search for this author in PubMed Google Scholar
Markus Hansson
View author publications
You can also search for this author in PubMed Google Scholar
Richard Houlston
View author publications
You can also search for this author in PubMed Google Scholar
Thorunn Rafnar
View author publications
You can also search for this author in PubMed Google Scholar
Kari Stefansson
View author publications
You can also search for this author in PubMed Google Scholar
Björn Nilsson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.D.L., G.T., A.L.d.L.P., U.T., T.R., M.H., K.S., and B.N. conceived the project. L.D.L., M.P., R.A., and C.C. performed experiments. L.D.L., G.T., A.L.d.L.P., A.N., M.W., M.T., P.O., L.S., G.H., G.B.W., N.W., T.R., and B.N. analyzed data. I.T., N.A., N.F.A., U.-H.M., A.W., A.J.-V., U.T., R.H., T.R., K.S., and M.H. contributed samples and/or data. L.D.L., G.T., A.L.d.L.P., A.N., T.R., and B.N. drafted the manuscripts. All authors contributed to the final manuscript.

Corresponding author

Correspondence to Björn Nilsson.

Ethics declarations

Conflict of interest

The authors declare no competing interests. Authors G.T., P.U., L.S., G.B.W., G.H., U.T., T.R., and K.S. are employed by deCODE Genetics/Amgen.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figure 1

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Duran-Lozano, L., Thorleifsson, G., Lopez de Lapuente Portilla, A. et al. Germline variants at SOHLH2 influence multiple myeloma risk. Blood Cancer J. 11, 76 (2021). https://doi.org/10.1038/s41408-021-00468-6

Download citation

Received: 21 January 2021
Revised: 17 March 2021
Accepted: 31 March 2021
Published: 19 April 2021
DOI: https://doi.org/10.1038/s41408-021-00468-6

This article is cited by

Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality
- Asmundur Oddsson
- Patrick Sulem
- Daniel F. Gudbjartsson
Nature Communications (2023)
Genome-wide meta-analysis of monoclonal gammopathy of undetermined significance (MGUS) identifies risk loci impacting IRF-6
- Alyssa Clay-Gilmour
- Subhayan Chattopadhyay
- Kari Hemminki
Blood Cancer Journal (2022)