Mutational mechanisms shaping the coding and noncoding genome of germinal center derived B-cell lymphomas

Hübschmann, Daniel; Kleinheinz, Kortine; Wagener, Rabea; Bernhart, Stephan H.; López, Cristina; Toprak, Umut H.; Sungalee, Stephanie; Ishaque, Naveed; Kretzmer, Helene; Kreuz, Markus; Waszak, Sebastian M.; Paramasivam, Nagarajan; Ammerpohl, Ole; Aukema, Sietse M.; Beekman, Renée; Bergmann, Anke K.; Bieg, Matthias; Binder, Hans; Borkhardt, Arndt; Borst, Christoph; Brors, Benedikt; Bruns, Philipp; Carrillo de Santa Pau, Enrique; Claviez, Alexander; Doose, Gero; Haake, Andrea; Karsch, Dennis; Haas, Siegfried; Hansmann, Martin-Leo; Hoell, Jessica I.; Hovestadt, Volker; Huang, Bingding; Hummel, Michael; Jäger-Schmidt, Christina; Kerssemakers, Jules N. A.; Korbel, Jan O.; Kube, Dieter; Lawerenz, Chris; Lenze, Dido; Martens, Joost H. A.; Ott, German; Radlwimmer, Bernhard; Reisinger, Eva; Richter, Julia; Rico, Daniel; Rosenstiel, Philip; Rosenwald, Andreas; Schillhabel, Markus; Stilgenbauer, Stephan; Stadler, Peter F.; Martín-Subero, José I.; Szczepanowski, Monika; Warsow, Gregor; Weniger, Marc A.; Zapatka, Marc; Valencia, Alfonso; Stunnenberg, Hendrik G.; Lichter, Peter; Möller, Peter; Loeffler, Markus; Eils, Roland; Klapper, Wolfram; Hoffmann, Steve; Trümper, Lorenz; Küppers, Ralf; Schlesner, Matthias; Siebert, Reiner

doi:10.1038/s41375-021-01251-z

Download PDF

Article
Open access
Published: 05 May 2021

Lymphoma

Mutational mechanisms shaping the coding and noncoding genome of germinal center derived B-cell lymphomas

Daniel Hübschmann ORCID: orcid.org/0000-0002-6041-7049^1,2,3,4^na2,
Kortine Kleinheinz^1,2^na2,
Rabea Wagener^5,6^na2^nAff39,
Stephan H. Bernhart^7,8,9^na2,
Cristina López^5,6^na2,
Umut H. Toprak^1,10,11,
Stephanie Sungalee¹²,
Naveed Ishaque^1,13,
Helene Kretzmer^7,8,9,14,
Markus Kreuz¹⁵,
Sebastian M. Waszak ORCID: orcid.org/0000-0003-3042-9521¹²,
Nagarajan Paramasivam^1,16,
Ole Ammerpohl^5,6,
Sietse M. Aukema ORCID: orcid.org/0000-0001-5824-0320^6,17,
Renée Beekman¹⁸,
Anke K. Bergmann^6,19,
Matthias Bieg^1,13,
Hans Binder^7,8,
Arndt Borkhardt ORCID: orcid.org/0000-0002-6121-4737²⁰,
Christoph Borst²¹,
Benedikt Brors ORCID: orcid.org/0000-0001-5940-3101²²,
Philipp Bruns¹,
Enrique Carrillo de Santa Pau ORCID: orcid.org/0000-0002-2310-2267²³^nAff40,
Alexander Claviez¹⁹,
Gero Doose^7,8,9,
Andrea Haake⁶,
Dennis Karsch²⁴,
Siegfried Haas²¹,
Martin-Leo Hansmann²⁵,
Jessica I. Hoell²⁰,
Volker Hovestadt²⁶,
Bingding Huang ORCID: orcid.org/0000-0002-4748-2882¹^nAff41,
Michael Hummel²⁷,
Christina Jäger-Schmidt¹,
Jules N. A. Kerssemakers ORCID: orcid.org/0000-0002-0460-7032¹,
Jan O. Korbel ORCID: orcid.org/0000-0002-2798-3794¹²,
Dieter Kube²⁸,
Chris Lawerenz¹,
Dido Lenze²⁷,
Joost H. A. Martens²⁹,
German Ott³⁰,
Bernhard Radlwimmer²⁶,
Eva Reisinger¹,
Julia Richter ORCID: orcid.org/0000-0002-9543-4084^6,17,
Daniel Rico ORCID: orcid.org/0000-0002-6561-2310²³^nAff42,
Philip Rosenstiel³¹,
Andreas Rosenwald³²,
Markus Schillhabel³¹,
Stephan Stilgenbauer³³,
Peter F. Stadler ORCID: orcid.org/0000-0002-5016-5191⁸,
José I. Martín-Subero ORCID: orcid.org/0000-0001-8809-5195¹⁸,
Monika Szczepanowski ORCID: orcid.org/0000-0001-5482-8880¹⁷,
Gregor Warsow¹,
Marc A. Weniger^34,35,
Marc Zapatka ORCID: orcid.org/0000-0001-8287-5967²⁶,
Alfonso Valencia^36,37,
Hendrik G. Stunnenberg²⁹,
Peter Lichter ORCID: orcid.org/0000-0002-2960-5279²⁶,
Peter Möller³⁸,
Markus Loeffler¹⁵,
Roland Eils^1,2,
Wolfram Klapper ORCID: orcid.org/0000-0001-7208-4117¹⁷,
Steve Hoffmann^7,8,9,
Lorenz Trümper²⁸,
ICGC MMML-Seq consortium,
ICGC DE-Mining consortium,
BLUEPRINT consortium,
Ralf Küppers ORCID: orcid.org/0000-0002-6691-7191^34,35^na3,
Matthias Schlesner ORCID: orcid.org/0000-0002-5896-4086^1,11^na3^nAff43 &
…
Reiner Siebert^5,6^na3

Leukemia volume 35, pages 2002–2016 (2021)Cite this article

5926 Accesses
34 Citations
9 Altmetric
Metrics details

Subjects

Abstract

B cells have the unique property to somatically alter their immunoglobulin (IG) genes by V(D)J recombination, somatic hypermutation (SHM) and class-switch recombination (CSR). Aberrant targeting of these mechanisms is implicated in lymphomagenesis, but the mutational processes are poorly understood. By performing whole genome and transcriptome sequencing of 181 germinal center derived B-cell lymphomas (gcBCL) we identified distinct mutational signatures linked to SHM and CSR. We show that not only SHM, but presumably also CSR causes off-target mutations in non-IG genes. Kataegis clusters with high mutational density mainly affected early replicating regions and were enriched for SHM- and CSR-mediated off-target mutations. Moreover, they often co-occurred in loci physically interacting in the nucleus, suggesting that mutation hotspots promote increased mutation targeting of spatially co-localized loci (termed hypermutation by proxy). Only around 1% of somatic small variants were in protein coding sequences, but in about half of the driver genes, a contribution of B-cell specific mutational processes to their mutations was found. The B-cell-specific mutational processes contribute to both lymphoma initiation and intratumoral heterogeneity. Overall, we demonstrate that mutational processes involved in the development of gcBCL are more complex than previously appreciated, and that B cell-specific mutational processes contribute via diverse mechanisms to lymphomagenesis.

Frequent mutations in the amino-terminal domain of BCL7A impair its tumor suppressor role in DLBCL

Article 24 June 2020

Distinct genetic changes reveal evolutionary history and heterogeneous molecular grade of DLBCL with MYC/BCL2 double-hit

Article Open access 16 December 2019

TET deficiency perturbs mature B cell homeostasis and promotes oncogenesis associated with accumulation of G-quadruplex and R-loop structures

Article 22 December 2021

Introduction

B-cell neoplasms encompass more than 80% of lymphoid malignancies worldwide [1]. The most common types of mature B-cell neoplasms are diffuse large B-cell lymphoma (DLBCL) and follicular lymphoma (FL), accounting for more than 50% of adult B-cell lymphomas. Both are germinal center (GC)-derived B-cell lymphomas (gcBCL). While DLBCL is a heterogeneous group of aggressive lymphomas, FL is indolent but can progress to DLBCL. DLBCL comprises two subgroups, defined by gene expression as germinal center B-cell like (GCB) and activated B-cell like (ABC), with some cases left unclassified [2, 3]. More recently, new subdivisions of DLBCL based on the patterns of mutated genes were proposed [4,5,6,7].

Lymphocytes are the only somatic cells in humans which actively alter their genomes in their physiological maturation program. Early in B-cell development, V(D)J recombination rearranges immunoglobulin (IG) genes to generate initial antigen receptor diversity. In response to T cell-dependent antigens, B cells undergo rapid proliferation in the GC [8]. Concurrently, mutations are introduced in the IG variable region genes which encode the antigen binding sites in a process called somatic hypermutation (SHM) to further diversify the IG repertoire [8]. Moreover, activated B cells can change the antibody isotype via class-switch recombination (CSR), which involves excision of a DNA fragment [9].

Both SHM and CSR are initiated by activation-induced cytidine deaminase (AID), which deaminates cytosine (C) to uracil (U) [10]. SHM introduces single nucleotide variants (SNVs) in the IG variable regions due to diverse error-prone DNA repair processes activated in response to AID activity. CSR, in contrast, is focused on the generation of DNA strand breaks into switch regions located 5’ of the IG heavy chain constant region genes (IG-switch), involving distinct factors [9].

Physiologic activity of AID is restricted to the IG loci and at much lower frequency also to a few non-IG off-targets (e.g., BCL6) [11]. However, AID activity also causes chromosomal translocations, and in particular in DLBCL, numerous additional genes are aberrantly targeted by SHM [12,13,14]. AID-mediated mutations have hence been implicated as key events in B-cell lymphomagenesis [14, 15]. Indeed, most gcBCLs exhibit oncogene translocations and recurrent targeting of B cell-specific genes by mutations ascribed to aberrant SHM [13, 14, 16]. However, a comprehensive understanding of the mutational mechanisms and genome-wide patterns in gcBCL is missing. We analyzed whole genome and transcriptome sequencing data of 181 and 176 gcBCL, respectively, in order to understand the origin and implications of somatic mutations in gcBCL. We dissect the mutational mechanisms shaping their genomes and use a comprehensive approach to elucidate how these mutate the driver genes.

Material and methods

Sample selection, genomic and transcriptomic sequencing and bioinformatic evaluations followed the guildelines of the International Cancer Genome Consortium (ICGC) [17,18,19,20]. For details see Supplementary Methods.

Results

Mutational landscape

We performed whole genome sequencing of 181 pre-treatment lymphoma samples from adult patients, and 179 matching nontumor tissues using inclusion criteria described in the “Methods” section (Supplementary Table S1A). The cohort encompasses 86 FL, 17 FL/DLBCL (As FL/DLBCL cases were classified which either were composite of two compartments or in which histopathologic reviews did not yield an unambgious differentiation between both), 76 DLBCL, 1 unspecified B-cell lymphoma, and 1 lymphoma with features intermediate between DLBCL and Burkitt lymphoma (BL) (Supplementary Table S1B). Transcriptomes were obtained from 176 of the cases and used to molecularly classify them, adapting published indices [2]. We assigned 171 cases to the nonmolecular BL group, two to the molecular BL group, and three showed an intermediate profile. To increase statistical power for detecting common mutational mechanisms of B cells, these gcBCL subgroups were analyzed together in a subset of the analyses.

Whole genome sequencing data were obtained with a median coverage of 36.4 (range 24.1–56.4) and 37.0 (range 26.4–77.5) in tumors and controls, respectively, and interrogated for somatic mutations including SNVs, insertions and deletions (indels), structural variants (SVs), and copy number aberrations (CNAs). We identified a median of 8186 (range 1,236–138,620; subgroup specific median DLBCL: 12,943, FL: 5,933, FL/DLBCL: 13,381) somatic small variants (SNVs and indels) per tumor (Supplementary Fig. S1A).

A median of 55 SVs (range 2–1317; inversions: 9, deletions: 7, duplications: 26, translocations: 6) was detected per case. Most SVs were detected in FL/DLBCL (median: 100) and DLBCL (77). The number in FLs was considerably lower (35), indicating higher genomic instability in DLBCL and FL/DLBCL than in FL. The number of SVs correlated with the number of small mutations (Supplementary Fig. S1B). Regarding CNAs (deleted or gained genomic segments >1 Mb) DLBCLs (median 9 gains/5 losses) and FL/DLBCL (8/5) showed more CNAs than FLs (2/3), matching previous studies (Supplementary Fig. S1D, E) [21,22,23].

SNVs exhibited a highly uneven distribution across the genome (Fig. 1A, Supplementary Fig. S2A). Cohort-wide analysis of SNV density in 1 Mb windows revealed a correlation between SNV density and replication timing [24], with higher SNV density in late replicating regions (Fig. 1B), as described [25]. However, some early replicating regions showed a very high mutation density. An increased fraction of SNVs in those windows affected the DGYW sequence motif, a preferred SHM target [26]. Many targets of physiological and aberrant SHM are located in these windows [13], e.g., BCL2 and PAX5 (Fig. 1B).

**Fig. 1: Mutation density and replication timing.**

Since the cohort-wide analysis masks inter-individual differences, we analyzed fluctuations in SNV density in individual genomes, excluding two cases without matched normal tissue. We identified 4538 clusters of very high mutation density (termed kataegis clusters) [27, 28] in 219 lymphoma genomes (consisting of 179 genomes from this study plus 39 pediatric BLs [17] and one adult BL, Supplementary Table S2), using a definition of a maximal intermutation distance of 1000 bp and a minimum of five mutations per cluster. Almost half of these (2,145, 47.3%) were recurrent in at least three patients and affected 166 genomic regions, which we term kataegis regions (Fig. 2 and Supplementary Fig. S3; upon omission of four hypermutated DLBCL cases, defined by more than two standard deviations above mean SNV mutational load, cf. Supplementary Information, 157 kataegis regions were identified—the difference of nine recurrent kataegis clusters contains exclusively known and established targets of SHM). 91 kataegis regions were located outside of IG loci. DLBCLs and FL/DLBCLs displayed higher median numbers of kataegis clusters, affected kataegis regions both inside the IG loci, outside the IG loci and overall as well as higher counts of SNVs in kataegis clusters (Supplementary Fig. S2B and Supplementary Table S3A). Among DLBCLs, GCB-DLBCL had higher mutational load (medians for ABC-DLBCL: 8,978 and GCB-DLBCL: 12,478), higher median numbers of kataegis clusters and affected kataegis regions, higher counts of SNVs in kataegis clusters and regions than ABC-DLBCL (Supplementary Fig. S2D and Supplementary Table S3B, C).

**Fig. 2: Analysis of mutation density dissects aberrant targeting of SHM and CSR.**

Beyond kataegis clusters with high mutation density, we also found regions with an intermediate mutation density, which we term psichales (ψιχάλεϛ, ancient greek for “drizzling rain”; Supplementary Figs. S2B and S4). Kataegis and psichales exhibit a remarkably different distribution over the genome, with kataegis clusters being bound to early replicating regions [24], whereas psichales is characteristic of late replicating regions (Fig. 1C, Supplementary Fig. S4). This suggests that psichales corresponds to the known increased mutation rate in late replicating heterochromatic regions [25, 29] (Supplementary Fig. S5, Supplementary Table S4) caused by differential DNA mismatch repair [30], while kataegis in gcBCL is caused by focal hypermutation of active genomic regions. Replication timing profiles differ between cell lines originating from different tissues [24]. Interestingly, kataegis clusters were enriched in genomic regions where lymphoblastoid cell lines show earlier replication than nonlymphoid cell lines (Fig. 1D). Similar to gcBCL, lymphoblastoid cell lines represent immortalized mature B cells and share with DLBCL a mature B-cell phenotype, strong proliferation, and expression of numerous B cell-typical genes. The enrichment of kataegis clusters in genomic regions where lymphoblastoid cell lines show earlier replication than the other cell lines shows that genomic regions early replicating specifically in B cells are particularly prone to become hypermutated.

Aberrant SHM and aberrant CSR cause clusters of hypermutation

To understand the mutational mechanisms introducing the high number of kataegis clusters in gcBCL genomes we analyzed the SNV profiles at the IG loci, the physiological targets of B cell-specific mutagenesis. We derived consensus coordinates of the IG switch regions (Supplementary Figs. S6 and S7, Supplementary Information), and then extracted SNVs in the switch regions and IG V regions (IG-VDJ). Profiles of nucleotide exchanges in their triplet context (corresponding to the concept of a mutational catalog [27]) differed strongly between SNVs located in IG-switch, IG-VDJ and the overall mutational catalog (Fig. 2A). We defined the SNV profile derived from IG-switch as CSR profile and from IG-VDJ as SHM profile. The CSR profile consists almost exclusively of four triplets corresponding to a DGC/GCH motif (mutation hotspot underlined), and is therefore much more focused than the previously described RGYW/WRCY or DGYW/WRCH motifs. In contrast, the SHM profile shows a much more diverse nucleotide exchange pattern. These patterns are consistent with SNVs introduced by CSR being mainly the result of focused repair of AID-mediated C to U deamination, while SHM includes strong modulation by error-prone DNA repair pathways.

We hypothesized that kataegis outside the IG loci may be due to aberrant targeting of either SHM or CSR. Assessment of the contributions of these two mechanisms to all kataegis clusters revealed three classes of kataegis clusters (Fig. 2B): one with predominant contributions of SHM (n = 2,323, 51.2%), one with predominant contributions of CSR (n = 428, 9.4%) and one with low contributions of SHM and CSR (n = 1,787, 39.4%). This classification persisted when clustering only the kataegis clusters located outside IG loci with 97.6% identical assignments (Supplementary Fig. S2F). Some kataegis-regions showed strong enrichment of SHM-like kataegis clusters like those in proximity of RHOH and DTX1, whereas others, like CIITA, PAX5 or CD74, had mainly contributions from CSR-like clusters (Fig. 2C). This suggests that beyond aberrant SHM, the mutational landscape of gcBCLs is also shaped by aberrant targeting of the CSR machinery.

Due to the fact that FL showed less kataegis clusters in total than DLBCL and FL/DLBCL, absolute numbers of CSR-like, SHM-like and “other” kataegis clusters (Supplementary Fig. S8 A, E, I) and SNVs (Supplementary Fig. S2C, items 1, 2) were lower in FL. However, when assessing the relative fraction of the respective classes of kataegis clusters among all kataegis clusters, remarkable differences were observed: while these fractions showed a trend towards lower values in FL than in DLBCL and FL-DLBCL for CSR-like (Supplementary Fig. S8 B, J) kataegis clusters and SNVs (Supplementary Fig. S2C, items 3–4), they were higher for SHM-like kataegis clusters (Supplementary Fig. S8F) and SNVs (Supplementary Fig. S2C, item 5).

Hypermutation by proxy

SHM typically introduces mutations within a window of roughly 2.5 kb 3’ of the transcription start site (TSS). 2581/4538 (56.9%) of all kataegis clusters and 2142/2460 (87.1%) of the recurrent kataegis clusters fulfilled these criteria. However, 1056 (23.3%) of all and 39 (1.6%) of the recurrent kataegis clusters were more than 20 kb away from the next TSS (Supplementary Information). The SHM-like and CSR-like profiles were depleted among these so called “TSS-distant” kataegis clusters (Supplementary Table S5A, H and J). We annotated chromatin states computed from ChIP-Seq of three GC B-cell samples [31, 32] to the kataegis clusters (Fig. 2C). As expected, both SHM-like (1236/2321, 53%) and CSR-like clusters (317/427, 74%) were primarily located in promoters (Supplementary Table S5B). In contrast, most kataegis clusters of type “non-CSR/non-SHM-like” mapped to heterochromatin (917/1784 = 51%).

As there is indication that AID off-target activity is linked to topologically associated chromatin domains in the interphase nuclei of B cells [33], we hypothesized that TSS-distant kataegis hypermutation is caused by secondary targeting of the hypermutation machinery while primarily affecting aberrant hypermutation of target regions in spatial proximity. Hence, we systematically analyzed co-occurrence of kataegis regions (Fig. 3A). Per sample, hypermutation in certain kataegis-regions (termed object regions) occurred only if another kataegis-region (subject region) is affected (Fig. 3B, Supplementary Figs. S9, S10A, and S10F, Supplementary Table S6). Counting subject and object regions together, 77 kataegis regions outside and 16 inside the IG loci were involved in such relationships. Restricting the analysis to the 192 identified conditional co-occurrence relationships outside IG loci, 167 were inter-chromosomal, 10 were long-range intra-chromosomal (defined by a distance > 1 Mbp), and 15 were short-range intra-chromosomal effects. This suggests that the subject regions are primary targets of hypermutation, while the object regions may be exposed to the hypermutation machinery due to spatial co-localization. Indeed, the fraction of TSS-far kataegis regions was higher among the objects than among the subjects, regardless of whether IG loci are taken into consideration or not (Supplementary Table S7A–C). We introduced the term hypermutation by proxy (HbP) to describe such a relationship. Examples for subject regions include BCL6 (Supplementary Fig. S10A–E) and PAX5 (Supplementary Fig. S10F–I). Both BCL6 and PAX5 are located in gene clusters, and the HbP effect leads to secondary targeting of one or several object regions in genes within these clusters. Several objects of PAX5 overlap with the PAX5 enhancer described as recurrently mutated in chronic lymphocytic leukemia [34], suggesting that HbP may cause enhancer hypermutation. Another example affects S1PR2 as subject and DNMT1 as object (Fig. 3B–D; Supplementary Information).

**Fig. 3: *Hypermutation by proxy* (HbP).**

In order to relate the concept of HbP to actual spatial colocalization in the nucleus, we investigated the concordance between the HbP relationships and published chromatin conformation data [35, 36] (Supplementary Table S8). Indeed, many intrachromosomal HbP relationships were reflected by strong interaction signals in the chromatin conformation data, such as gene clusters around PAX5 and BCL6. However, inter-chromosomal HbP relationships could not be confirmed by the conformation data, probably because very long-range intrachromosomal and interchromosmal interactions are typically less reliably identified than short- to medium-range intrachromosomal interactions [37]. Our analysis suggests that the machinery for hypermutation has an outreach to other regions if these regions are in spatial proximity in the interphase nucleus of lymphoma cells.

Although the total number of HbP instances per sample is higher in DLBCL and FL/DLBCL than in FL (Supplementary Fig. S8M), when normalizing the number of HbP instances to the square of the number of kataegis clusters per sample (quadratic relationship between number of kataegis foci and number of HbP instances, Supplementary Fig. S8Q), FL showed higher values of this ratio (Supplementary Fig. S8N).

New mutational signatures reflect mutagenic mechanisms active in GC B cells

We investigated mutational signatures as traces of mutational mechanisms active in tumors [27]. We used 2,133,341 somatic SNVs from 219 lymphomas from the extended cohort defined above to perform a combination of unsupervised and supervised analyses of mutational signatures and found 14 different signatures (Fig. 4, Supplementary Figs. S11, S12, Supplementary Table S9). Of those, 11 (labeled “AC”) have been described before [27], including four of six signatures previously identified in gcBCL (Supplementary Table S9). Three new signatures were discovered, termed L1, L2, and L3 (Fig. 4A, B). Two of six mutational signatures from the original analysis of gcBCL were not identified in this analysis: AC13 (linked to the action of APOBEC enzymes) and AC5 (related to the age of the patients at diagnosis, mechanism unknown). Signature AC5 has high cosine similarity (see Methods) to L1, L2, and AC9. Because of this high similarity, most mutations we assign to L1 and L2 would have been assigned to AC5, if AC5 was included and L1 and L2 were not included in the analysis. Among the previously not extracted signatures is AC3, which we detected in 21 lymphomas. Signature AC3 has been linked to defects in homologous recombination repair (BRCAness) [38] which potentially confer synthetic lethality to poly(ADP-ribose) polymerase inhibitors [39].

**Fig. 4: New mutational signatures are partially linked to B-cell-specifc mutagenic effects and exhibit characteristic enrichment and depletion patterns.**

To relate B cell-specific mutational processes to the new mutational signatures we compared all 14 signatures to the AID target motif DGYW [40] and to our CSR and SHM profiles by cosine similarity. L1 showed the highest similarity to DGYW and to the CSR profile, while L2 showed the highest similarity to the SHM profile. Signature L3 may have some link to APOBEC enzyme activity. With increasing factorization ranks in NMF, L3 splits apart from AC2 (an APOBEC signature) at rank 7 (Supplementary Fig. S11). Hence, the mutational mechanism causative for L3 remains presently unclear. In a complementary approach, we compared the extracted mutational signatures to a synthetic mutational signature based on data by Yaari et al. [41], who extracted synonymous mutations from V and J genes of the IGH locus from normal B cells in their 5-mer sequence context to obtain the fingerprint of physiologic SHM. We aggregated these into 3-mer context and used the resulting triplet frequencies to derive a synthetic SHM signature. Again, the newly identified signature L2 had highest similarity to the synthetic SHM signature, providing further evidence for SHM being the mechanism behind L2.

Several mutational processes show varying activity in distinct genomic regions [42], and in particular for the B cell-specific mechanisms a strong preference of certain target regions is known [43]. We stratified SNVs according to different genomic features and performed supervised analysis of mutational signatures (Fig. 4C–E, Supplementary Table S10). First, to relate mutational signatures to the physiological sites of B-cell mutagenesis, we checked for enrichment and depletion patterns in the IG-VDJ genes and the switch regions (Fig. 4C). L1 was enriched in the switch regions while L2 was enriched in the VDJ regions, corroborating our previous assignment. Second, we related the mutational signatures to chromatin states from normal GC B cells (Fig. 4D). L1 was enriched in promoters, while L2 was enriched in transcribed regions and enhancers as compared to heterochromatic regions, consistent with previous observations that B cell-specific mutagenesis primarily affects active regions of the genome [44]. Third, we assessed the influence of replication timing on exposure to the mutational signatures (Fig. 4E, Supplementary Fig. S10C). L1 showed a strong and L2 a moderate enrichment in early replicating regions. Strikingly, AC9, which has exclusively been found in B-cell malignancies and described as being linked to SHM [27] shows an enrichment in the heterochromatic, late replicating regions. In the IG loci, AC9 is enriched in the constant, non-switch regions, further corroborating that AC9 is not the fingerprint of SHM or CSR. As expected, L1 and L2 were enriched in kataegis clusters, with L1 being enriched in CSR-like and L2 in SHM-like kataegis clusters as compared to the nonclustered SNV stratum (Supplementary Fig. S12G). FLs had higher contributions of L1, L2, and AC1 but lower contributions of AC17, AC10, AC6, and AC2 as compared to DLBCLs (Fig. 4F).

We propose that L1 and L2 are the mutational footprints of CSR and SHM, respectively. The etiology of the B cell-specific signature AC9 and the new signature L3 remain enigmatic, though L3 may have some link to APOBEC activity.

Mutational mechanisms during lymphoma evolution

To dissect the activity of the different mutational processes during B-cell lymphoma evolution, we stratified SNVs according to their cancer cell fractions (CCFs), i.e., the fraction of tumor cells harboring the respective variant. A high CCF identifies mutations which arose in the precursor cell or early in tumor evolution, while a low CCF is characteristic for mutations which arose late in tumor evolution (see Methods). Stratified analysis of mutational signatures showed an enrichment of AC1 (spontaneous deamination) and AC2 (APOBEC) in early clonal evolution (Supplementary Fig. S12D). Among the mutational signatures related to B cell-specific mutational processes, L1 showed a trend towards enrichment in early and AC9 in late clonal evolution. No enrichment was observed for L2. Following the hypothesis that the absence of enrichment patterns for L2 might indicate ongoing SHM activity in gcBCL, we investigated the distribution of CCFs in the IG loci. SNVs in the constant part of IGH were significantly earlier and SNVs in the variable parts of the IG loci were significantly later in clonal evolution than SNVs outside of the IG loci (Supplementary Fig. S12B). Hence, SHM in the variable parts of the IG loci is ongoing in gcBCL, while CSR appears to happen mostly before clonal expansion, in agreement with the genome-wide enrichment patterns for L1 (CSR) and L2 (SHM).

Drivers of gcBCL

Only roughly 1% of somatic mutations were in protein coding sequences, with a median of 88 coding variants per sample (range 11–974, subgroup specific median DLBCL: 114, FL: 59.5, FL/DLBCL: 128; Supplementary Fig. S1, Supplementary Table S3). After integrating all types of variants with coding potential, we observed high mutational recurrence in known gcBCL drivers like KMT2D, CREBBP, BCL2, TNFRSF14, PIM1, SOCS1, and CDKN2A (Fig. 5, Supplementary Figs. S13, S14; see Supplementary Information for recurrently mutated noncoding genes). To differentiate between passenger and driver mutations and to identify subgroup-specific low recurrence drivers we applied IntOGen [45] to the whole cohort and to FL and DLBCL separately. We identified 118 driver genes in the 179 gcBCL with matched normal control (Supplementary Table S11), of which 9 and 8 were not significant in FLs (ADAMTS1, ANKRD12, DHX16, DNM2, LRP12, SIAH2, SIN3A, ZNF217, ZNF292) and not significant in DLBCLs (BCL2, CDC42BPB, CXCR4, DHX15, JUP, MGEA5, MYCBP2, PDS5B), respectively.

**Fig. 5: B cell-specific mutagenesis alone is not sufficient to drive lymphomagenesis.**

Encouraged by recent studies proposing genomic classifications of DLBCL based on data from whole exome sequencing [4,5,6] we applied NMF as a soft clustering technique on binarized data of driver gene alterations, both to the subset of DLBCL in our cohort (initially 76 cases, but 72 after excluding four hypermutated cases, defined by mutational load more than two standard deviations above mean SNV mutational load), and to the whole cohort. As described in the Supplementary Information and shown in Supplementary Fig. S15 this yielded consensus clusters comparable to the prior studies, which supports the validity of our approach for driver gene identification from the whole genome sequences. Notably, when we extended the approach from DLBCL to the full cohort (again excluding the four hypermutated DLBCLs), the optimal number of consensus clusters was nine, thereby revealing a more detailed substructure of gcBCL entities than in the published studies (Supplementary Fig. S16). We furthermore investigated congruence and cross-over of the DLBCL cases between the consensus clusters extracted only among the DLBCLs (Supplementary Fig. S15) and those consensus clusters extracted among all gcBCL cases (Supplementary Fig. S16), showing that the majority of cases in the MYD88-like and TP53-like DLBCL-only consensus clusters also mapped to the respective gcBCL consensus clusters, whereas cases from the BCL2-like DLBCL-only consensus cluster also populated the CSMD1-like gcBCL consensus cluster and cases from the BCL6-like DLBCL-only consensus cluster also populated the PIM1-like gcBCL consensus cluster. Numbers are displayed in Supplementary Table S12C. We then took these consensus clusters and investigated enrichment and depletion patterns of the mutational signatures identified in our analysis (Fig. 4G, H). L3 was enriched in the SOCS1-like, B2M-like and TP53-like consensus clusters, AC1 was enriched in the CSMD1-like and BCL2-like consensus clusters, AC2 was depleted in the BCL2-like cluster, L1 showed a trend and was higher in the PIM1-like, BCL2-like and MYD88-like consensus clusters compared to background, and L2 showed a trend and was higher in the B2M-like, BCL2-like, and CSMD1-like consensus clusters compared to background (Fig. 4G). Stratification by consensus clustering of only the DLBCL subgroup (Fig. 4H) revealed only trends for L3 (high in TP53-like) and AC1 (high in BCL2-like).

We sought to identify the mechanisms mutating the driver genes as well as other recurrently mutated genes. To assess the contribution of B cell-specific hypermutation, we mapped kataegis clusters to driver genes and found that 57.1% of the driver or recurrently mutated genes showed indications for kataegis in at least one case (36.4% when restricting the analysis to coding mutations, Supplementary Table S13). Complementarily, 42.9% of the driver and recurrently mutated genes were depleted in mutations affecting the DGC/CGH motif, indicating non-AID-mediated mutagenesis (Fig. 5, Supplementary Fig. S14). While genes that were recurrently affected by kataegis generally showed an enrichment of SNVs in the DGC motif, the reverse relation was often not fulfilled, suggesting that several genes are recurrently targeted by AID-mediated, but nonclustered mutations.

Next we compared cohort-wide mutational profiles for each driver and recurrently mutated gene with the previously identified mutational signatures using cosine similarity. 37.7% of the driver genes exhibited a profile most similar to signature L1 and 18.2% to L2, while 15.6% were most similar to AC9 (Fig. 5). Several drivers showed no evidence for B cell-specific mutagenesis, i.e., no enrichment for the AID target motif, no kataegis, and no predominant mutagenesis by a B cell-specific signature. Examples are TP53 and CARD11 with a pattern of SNVs dominated by signatures AC1 and AC6 (associated with defects in DNA mismatch repair).

Finally, we investigated the timing of coding driver mutations in the course of lymphoma evolution. We determined the median CCF per driver gene and ranked the genes accordingly to classify driver genes as early or late (Supplementary Fig. S17). In agreement with our previous analyses, early drivers were predominantly mutated by L1, whereas for intermediate and late drivers L2 and AC9 were the dominating signatures. Genes affecting NFκB signaling (PPP4C, NFKBIE, NFKBIA) [46,47,48] were mutated early during clonal evolution, suggesting that activation of NFκB signaling is essential for initiation of B-cell lymphomagenesis.

Discussion

Most B-cell lymphomas derive from GC B cells [15]. Considering that most newly generated B cells will never participate in a GC reaction during their lifetime, and that those which do will be GC B cells only for a short time of about three weeks [49], and then continue to live as memory B cells or plasma cells for years or decades in humans, it becomes evident that the GC is a highly dangerous place for B cells. Key factors that contribute to the risky life of GC B cells are (i) the very high proliferation rate of GC B cells [50], which increases the risk for DNA replication-associated genetic lesions and may prepare the cells for continuous proliferation as transformed cells, (ii) the generation of chromosomal translocations as mistakes of SHM and CSR [16], (iii) off-target mutation activity of SHM [13, 14], and iv) a dampened DNA repair activity needed to tolerate the genotoxic stress imposed on GC B cells by their fast proliferation and SHM activity [51]. Moreover, B cells can repeatedly undergo GC reactions, and this repeated exposure to the mutagenic GC microenvironment may indeed play a role in FL pathogenesis [52]. However, a comprehensive understanding of the mutagenic mechanisms causing the malignant transformation of GC B cells is still missing. By analyzing a large number of prototypical gcBCL for mutations not only in the coding but also the noncoding genome we were in a position to study mutational mechanisms in gcBCL at unprecedented depth.

One of the major findings from our study is that besides kataegis regions of very high mutational density, the lymphomas also show recurrent regions of psichales with an intermediate mutation density. The observation that kataegis regions mostly affect early replicating genomic regions, while psichales focusses on late replicating regions, points to the involvement of distinct mutational mechanisms. Indeed, the distinct mutation patterns in kataegis and psichales clusters suggest a major role of off-target AID activity for kataegis, and of diminished DNA repair activity in late replicating regions of psichales clusters. The GC-dependency of kateagis regions is supported by a recent study published during the review process of this paper which reported that IGHV gene unmutated chronic lymphocytic leukemias lack kataegis regions outside the IGH switch regions [53]. The increased mutation density in late replicating regions is not B cell-specific and known from other types of cancer [25, 29]. A second novel mutation feature that we uncovered is hypermutation by proxy. This describes the surprising observation that some kataegis clusters generated by strong hypermutation activity can apparently promote hypermutation in other loci if co-localized in the nucleus. Hence, accumulation of hypermutation complexes on particular genomic regions apparently poses the risk to also mutagenize spatially closely localized chromosomal regions in trans. This concept is supported by a recent lymphoma cell line study showing hot spots for SHM in topologically associated chromatin domains, although that study lacked the aspect of directionality that we revealed [54]. Third, while prior studies on off-target AID activity only considered off-target SHM [13], we revealed that also the mutation machinery involved in CSR apparently has off-target mutation activity beyond inducing translocations and contributes to the SNV burden of gcBCL. Please note that the two AID-associated signatures we describe here are distinct from the canonical and noncanonical AID signatures reported previously [27, 55, 56]. Whereas the canonical AID signature is a general AID signature based on the AID hotspot motif, not distinguishing SHM and CSR machinery associated mutagenesis, the noncanoncial AID signature (signature 9 in ref. [27]) is indeed primarily linked to polymerase eta mutagenesis, and not AID directly [27, 55, 56]. Fourth, overall, about half of the gcBCL driver genes show signs of targeting by B cell-specific mutational processes, and the resulting mutations likely play a major pathogenetic role both in the initiation of lymphomagenesis and in the generation of intratumoral heterogeneity. Fifth, using NMF consensus clustering on data integrating various mutation types across the different gcBCLs, we identified nine consensus clusters corresponding to genomic subtypes. In conclusion, the development of gcBCL is much more complex than previously appreciated and gcBCL are unique among human cancers in the extent and diversity of how cell-type-specific processes contribute to mutations, localized hypermutation and malignant transformation.

References

Swerdlow SH, et al. The 2016 revision of the World Health Organization classification of lymphoid neoplasms. Blood. 2016;127:2375–91.
Article CAS PubMed PubMed Central Google Scholar
Hummel M, et al. A biologic definition of Burkitt’s lymphoma from transcriptional and genomic profiling. N Engl J Med. 2006;354:2419–30.
Article CAS PubMed Google Scholar
Rosenwald A, et al. The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. N Engl J Med. 2002;346:1937–47.
Article PubMed Google Scholar
Chapuy B, et al. Molecular subtypes of diffuse large B cell lymphoma are associated with distinct pathogenic mechanisms and outcomes. Nat Med. 2018;24:679–90.
Article CAS PubMed PubMed Central Google Scholar
Schmitz R, et al. Genetics and pathogenesis of diffuse large B-cell lymphoma. N Engl J Med. 2018;378:1396–407.
Article CAS PubMed PubMed Central Google Scholar
Wright GW, et al. A probabilistic classification tool for genetic subtypes of diffuse large B cell lymphoma with therapeutic implications. Cancer Cell. 2020;37:551–68.e514.
Article CAS PubMed Google Scholar
Lacy SE, et al. Targeted sequencing in DLBCL, molecular subtypes, and outcomes: a Haematological Malignancy Research Network report. Blood. 2020;135:1759–71.
Article CAS PubMed PubMed Central Google Scholar
Victora GD, Nussenzweig MC. Germinal centers. Annu Rev Immunol. 2012;30:429–57.
Article CAS PubMed Google Scholar
Methot SP, Di Noia JM. Molecular mechanisms of somatic hypermutation and class switch recombination. Adv Immunol. 2017;133:37–87.
Article CAS PubMed Google Scholar
Nagaoka H, Muramatsu M, Yamamura N, Kinoshita K, Honjo T. Activation-induced deaminase (AID)-directed hypermutation in the immunoglobulin Smu region: implication of AID involvement in a common step of class switch recombination and somatic hypermutation. J Exp Med. 2002;195:529–34.
Article CAS PubMed PubMed Central Google Scholar
Pasqualucci L, et al. BCL-6 mutations in normal germinal center B cells: evidence of somatic hypermutation acting outside Ig loci. Proc Natl Acad Sci USA. 1998;95:11816–21.
Article CAS PubMed PubMed Central Google Scholar
Goossens T, Klein U, Küppers R. Frequent occurrence of deletions and duplications during somatic hypermutation: implications for oncogene translocations and heavy chain disease. Proc Natl Acad Sci USA. 1998;95:2463–8.
Article CAS PubMed PubMed Central Google Scholar
Khodabakhshi AH, et al. Recurrent targets of aberrant somatic hypermutation in lymphoma. Oncotaeget. 2012;3:1308–19.
Article Google Scholar
Pasqualucci L, et al. Hypermutation of multiple proto-oncogenes in B-cell diffuse large-cell lymphomas. Nature. 2001;412:341–6.
Article CAS PubMed Google Scholar
Küppers R. Mechanisms of B-cell lymphoma pathogenesis. Nat Rev Cancer. 2005;5:251–62.
Article PubMed Google Scholar
Küppers R, Dalla-Favera R. Mechanisms of chromosomal translocations in B cell lymphomas. Oncogene. 2001;20:5580–94.
Article PubMed Google Scholar
López C, et al. Genomic and transcriptomic changes complement each other in the pathogenesis of sporadic Burkitt lymphoma. Nat Commun. 2019;10:1459–9.
Article PubMed PubMed Central Google Scholar
Richter J, et al. Recurrent mutation of the ID3 gene in Burkitt lymphoma identified by integrated genome, exome and transcriptome sequencing. Nat Genet. 2012;44:1316–20.
Article CAS PubMed Google Scholar
ICGC TCGA Pan-Cancer Analysis of Whole Genomes Consortium. Pan-cancer analysis of whole genomes. Nature. 2020;578:82–93.
Article Google Scholar
Hübschmann D, Schlesner M. Evaluation of whole genome sequencing data. Methods Mol Biol. 2019;1956:321–36.
Article PubMed Google Scholar
Cheung KJ, et al. High resolution analysis of follicular lymphoma genomes reveals somatic recurrent sites of copy-neutral loss of heterozygosity and copy number alterations that target single genes. Genes Chromosomes Cancer. 2010;49:669–81.
Article CAS PubMed Google Scholar
Loeffler M, et al. Genomic and epigenomic co-evolution in follicular lymphomas. Leukemia. 2015;29:456–63.
Article CAS PubMed Google Scholar
Scholtysik R, et al. Characterization of genomic imbalances in diffuse large B-cell lymphoma by detailed SNP-chip analysis. Int J Cancer. 2015;136:1033–42.
Article CAS PubMed Google Scholar
Hansen RS, et al. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci USA. 2010;107:139–44.
Article CAS PubMed Google Scholar
Liu L, De S, Michor F. DNA replication timing and higher-order nuclear organization determine single-nucleotide substitution patterns in cancer genomes. Nat Commun. 2013;4:1502–2.
Article PubMed Google Scholar
Dörner T, et al. Analysis of the frequency and pattern of somatic mutations within nonproductively rearranged human variable heavy chain genes. J Immunol. 1997;158:2779–89.
Article PubMed Google Scholar
Alexandrov LB, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–21.
Article CAS PubMed PubMed Central Google Scholar
Nik-Zainal S, et al. Mutational processes molding the genomes of 21 breast cancers. Cell. 2012;149:979–93.
Article CAS PubMed PubMed Central Google Scholar
Schuster-Böckler B, Lehner B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature. 2012;488:504–7.
Article PubMed Google Scholar
Supek F, Lehner B. Differential DNA mismatch repair underlies mutation rate variation across the human genome. Nature. 2015;521:81–4.
Article CAS PubMed PubMed Central Google Scholar
Carrillo-de-Santa-Pau E, et al. Automatic identification of informative regions with epigenomic changes associated to hematopoiesis. Nucl Acids Res. 2017;45:9244–59.
Article CAS PubMed PubMed Central Google Scholar
Stunnenberg HG, The International Human Epigenome Consortium Hirst P. The International Human Epigenome Consortium: a blueprint for scientific collaboration and discovery. Cell. 2016;167:1145–9.
Article CAS PubMed Google Scholar
Qian J, et al. B cell super-enhancers and regulatory clusters recruit AID tumorigenic activity. Cell. 2014;159:1524–37.
Article CAS PubMed PubMed Central Google Scholar
Puente XS, et al. Non-coding recurrent mutations in chronic lymphocytic leukaemia. Nature. 2015;526:519–24.
Article CAS PubMed Google Scholar
Beekman R, et al. The reference epigenome and regulatory chromatin landscape of chronic lymphocytic leukemia. Nat Med. 2018;24:868–80.
Article CAS PubMed PubMed Central Google Scholar
Javierre BM, et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell. 2016;167:1369–84.e1319.
Article CAS PubMed PubMed Central Google Scholar
Tjong H, et al. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization. Proc Natl Acad Sci USA. 2016;113:E1663–72.
Article CAS PubMed PubMed Central Google Scholar
Alexandrov LB, Nik-Zainal S, Siu HC, Leung SY, Stratton MR. A mutational signature in gastric cancer suggests therapeutic strategies. Nat Commun. 2015;6:8683–3.
Article CAS PubMed PubMed Central Google Scholar
Lord CJ, Ashworth A. BRCAness revisited. Nat Rev Cancer. 2016;16:110–20.
Article CAS PubMed Google Scholar
Rogozin IB, Diaz M. DGYW/WRCH is a better predictor of mutability at G:C bases in Ig hypermutation than the widely accepted RGYW/WRCY motif and probably reflects a two-step activation-induced cytidine deaminase-triggered process. J Immunol. 2004;172:3382–4.
Article CAS PubMed Google Scholar
Yaari G, et al. Models of somatic hypermutation targeting and substitution based on synonymous mutations from high-throughput immunoglobulin sequencing data. Front Immunol. 2013;4:1–6.
Article CAS Google Scholar
Lim B, Mun J, Kim S-Y. Intrinsic molecular processes: impact on mutagenesis. Trends Cancer. 2017;3:357–71.
Article CAS PubMed Google Scholar
Meng F-L, et al. Convergent transcription at intragenic super-enhancers targets aid-initiated genomic instability. Cell. 2014;159:1538–48.
Article CAS PubMed PubMed Central Google Scholar
Liu M, et al. Two levels of protection for the B cell genome during somatic hypermutation. Nature. 2008;451:841–5.
Article CAS PubMed Google Scholar
Gundem G, et al. IntOGen: integration and data mining of multidimensional oncogenomic data. Nat Meth. 2010;7:92–3.
Article CAS Google Scholar
Hu MCT, et al. Protein phosphatase X interacts with c-Rel and stimulates c-Rel/nuclear factor KB activity. J Biol Chem. 1998;273:33561–5.
Article CAS PubMed Google Scholar
Verma I. M., Stevenson J. K., Schwarz E. M., Antwerp D.V. Rel/NF-KB /IKB family: intimate tales of association and dissociation. Genes Dev. 1995;9:2723–35.
Whiteside ST, Epinat J-C, Rice NR, Israël A. I kappa B epsilon, a novel member of the I kappa B family, controls RelA and cRel NF-kappa B activity. EMBO J. 1997;16:1413–26.
Article CAS PubMed PubMed Central Google Scholar
MacLennan ICM. Germinal centers. Annu Rev Immunol. 1994;12:117–39.
Article CAS PubMed Google Scholar
MacLennan IC, Liu YJ, Johnson GD. Maturation and dispersal of B-cell clones during T cell-dependent antibody responses. Immunol Rev. 1992;126:143–61.
Article CAS PubMed Google Scholar
Phan RT, Dalla-Favera R. The BCL6 proto-oncogene suppresses p53 expression in germinal-centre B cells. Nature. 2004;432:635–9.
Article CAS PubMed Google Scholar
Sungalee S, et al. Germinal center reentries of BCL2-overexpressing B cells drive follicular lymphoma progression. J Clin Invest. 2014;124:5337–51.
Article PubMed PubMed Central Google Scholar
Ye X., et al. Genome-wide mutational signatures revealed distinct developmental paths for human B cell lymphomas. J Exp Med. 2021;218:e20200573.
Senigl F, et al. Topologically associated domains delineate susceptibility to somatic hypermutation. Cell Rep. 2019;29:3902–15. e3908
Article CAS PubMed PubMed Central Google Scholar
Kasar S, et al. Whole-genome sequencing reveals activation-induced cytidine deaminase signatures during indolent chronic lymphocytic leukaemia evolution. Nat Commun. 2015;6:8866–6.
Article CAS PubMed Google Scholar
Maura F, et al. A practical guide for mutational signature analysis in hematological malignancies. Nat Commun. 2019;10:2969.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study has been supported by the German Ministry of Science and Education (BMBF) in the framework of the ICGC MMML-Seq project (01KU1002A-J) the MMML-MYC-SYS project (036166B) and the project ICGC DE-MINING (01KU1505E), the European Union in the framework of the BLUEPRINT Project (HEALTH-F5-2011-282510) and the KinderKrebsInitiative Buchholz/Holm-Seppensen. This work was supported by the BMBF-funded Heidelberg Center for Human Bioinformatics (HD-HuB) within the German Network for Bioinformatics Infrastructure (de.NBI) (#031A537A, #031A537C). Former grant support of MMML by the Deutsche Krebshilfe (2003–2011) is gratefully acknowledged. We acknowledge COSMIC and use of Cancer Gene Census. Part of the work was performed in association with SFB1074 (particularly subproject B1) funded by DFG. We wish to thank Barbara Hutter, Ivo Buchhalter, Zuguang Gu, and Natalie Jäger for skillful technical assistance. We thank the High-Throughput Sequencing Unit of the Genome and Proteome Core Facility and the Omics IT and Data Management Core Facility of the German Cancer Research Center (DKFZ, Heidelberg) as well as the Institute of Clinical Molecular Biology (IKMB, Christian-Albrechts-University Kiel) for excellent technical support and expertise. DH is a member of the Hartmut Hoffmann-Berling International Graduate School of Molecular and Cellular Biology (HBIGS) and of the MD/PhD-program of the University of Heidelberg. KK and UHT are funded by the Helmholtz International Graduate School for Cancer Research at the German Cancer Research Center. SHB, HK, and SH acknowledge support by LIFE (Leipzig Research Center for Civilization Diseases), Leipzig University. LIFE is funded by the European Union, the European Regional Development Fund (ERDF), the European Social Fund (ESF), and the Free State of Saxony. This work has been carried out with the help of the Interdisciplinary Bank of Biomaterials and Data of the University Hospital of Würzburg and the Julius Maximilian University of Würzburg (idbw).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Rabea Wagener
Present address: University of Duesseldorf, Medical Faculty, Department of Pediatric Oncology, Hematology and Clinical Immunology, Center for Child and Adolescent Health, Düsseldorf, Germany
Enrique Carrillo de Santa Pau
Present address: Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
Bingding Huang
Present address: College of Big Data and Internet, Shenzhen Technology University, Shenzhen, China
Daniel Rico
Present address: Biosciences Institute, Newcastle University, Newcastle upon Tyne, UK
Matthias Schlesner
Present address: Institute for Informatics, Faculty of Computer Science and Medical Faculty, University of Augsburg, Augsburg, Germany
Lists of members and their affiliations appear in the Supplementary Information.
These authors contributed equally: Daniel Hübschmann, Kortine Kleinheinz, Rabea Wagener, Stephan H. Bernhart, Cristina López.
These authors jointly supervised this work: Ralf Küppers, Matthias Schlesner, Reiner Siebert.

Authors and Affiliations

Division of Theoretical Bioinformatics (B080), German Cancer Research Center (DKFZ), Heidelberg, Germany
Daniel Hübschmann, Kortine Kleinheinz, Umut H. Toprak, Naveed Ishaque, Nagarajan Paramasivam, Matthias Bieg, Philipp Bruns, Bingding Huang, Christina Jäger-Schmidt, Jules N. A. Kerssemakers, Chris Lawerenz, Eva Reisinger, Gregor Warsow, Roland Eils & Matthias Schlesner
Department for Bioinformatics and Functional Genomics, Institute of Pharmacy and Molecular Biotechnology and Bioquant, University of Heidelberg, Heidelberg, Germany
Daniel Hübschmann, Kortine Kleinheinz & Roland Eils
Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM), Heidelberg, Germany
Daniel Hübschmann
Computational Oncology, Molecular Diagnostics Program, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Heidelberg, Germany
Daniel Hübschmann
Institute of Human Genetics, Ulm University and Ulm University Medical Center, Ulm, Germany
Rabea Wagener, Cristina López, Ole Ammerpohl & Reiner Siebert
Intitute of Human Genetics, Christian-Albrechts-University, Kiel, Germany
Rabea Wagener, Cristina López, Ole Ammerpohl, Sietse M. Aukema, Anke K. Bergmann, Andrea Haake, Julia Richter & Reiner Siebert
Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Helene Kretzmer, Hans Binder, Gero Doose & Steve Hoffmann
Bioinformatics Group, Department of Computer, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Helene Kretzmer, Hans Binder, Gero Doose, Peter F. Stadler & Steve Hoffmann
Transcriptome Bioinformatics, LIFE Research Center for Civilization Diseases, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Helene Kretzmer, Gero Doose & Steve Hoffmann
Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
Umut H. Toprak
Bioinformatics and Omics Data Analytics (B240), German Cancer Research Center (DKFZ), Heidelberg, Germany
Umut H. Toprak & Matthias Schlesner
EMBL Heidelberg, Genome Biology, Heidelberg, Germany
Stephanie Sungalee, Sebastian M. Waszak & Jan O. Korbel
DKFZ-HIPO, German Cancer Research Center (DKFZ), Heidelberg, Germany
Naveed Ishaque & Matthias Bieg
Department of Genome Regulation, Max Planck Institute for Molecular Genetics, Berlin, Germany
Helene Kretzmer
Institute for Medical Informatics Statistics and Epidemiology, Leipzig, Germany
Markus Kreuz & Markus Loeffler
Medical Faculty Heidelberg, Heidelberg University, Heidelberg, Germany
Nagarajan Paramasivam
Hematopathology Section, Christian-Albrechts-University, Kiel, Germany
Sietse M. Aukema, Julia Richter, Monika Szczepanowski & Wolfram Klapper
Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Renée Beekman & José I. Martín-Subero
Department of Pediatrics, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Anke K. Bergmann & Alexander Claviez
University of Duesseldorf, Medical Faculty, Department of Pediatric Oncology, Hematology and Clinical Immunology, Center for Child and Adolescent Health, Düsseldorf, Germany
Arndt Borkhardt & Jessica I. Hoell
Department of Internal Medicine/Hematology, Friedrich-Ebert-Hospital, Neumünster, Neumünster, Germany
Christoph Borst & Siegfried Haas
Division of Applied Bioinformatics (G200), German Cancer Research Center (DKFZ), Heidelberg, Germany
Benedikt Brors
Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Enrique Carrillo de Santa Pau & Daniel Rico
Department for Internal Medicine II, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Dennis Karsch
Senckenberg Institute of Pathology, University of Frankfurt Medical School, Frankfurt am Main, Germany
Martin-Leo Hansmann
Division of Molecular Genetics, German Cancer Consortium (DKFK), German Cancer Research Center (DKFZ), Heidelberg, Germany
Volker Hovestadt, Bernhard Radlwimmer, Marc Zapatka & Peter Lichter
Institute of Pathology, Charité – University Medicine Berlin, Berlin, Germany
Michael Hummel & Dido Lenze
Department of Hematology and Oncology, Georg-Augusts-University of Göttingen, Göttingen, Germany
Dieter Kube & Lorenz Trümper
Department of Molecular Biology, Radboud University, Faculty of Science, Nijmegen, The Netherlands
Joost H. A. Martens & Hendrik G. Stunnenberg
Department of Clinical Pathology, Robert-Bosch-Hospital and Dr. Margarete Fischer-Bosch Institute for Clinical Pharmacology, Stuttgart, Germany
German Ott
Institute of Clinical Molecular Biology, Christian-Albrechts-University, Kiel, Germany
Philip Rosenstiel & Markus Schillhabel
Institute of Pathology, University of Wuerzburg and Comprehensive Cancer Center Mainfranken, Wuerzburg, Germany
Andreas Rosenwald
Department for Internal Medicine III, Ulm University, Ulm, Germany
Stephan Stilgenbauer
Institute of Cell Biology (Cancer Research), University of Duisburg-Essen, Medical School, Essen, Germany
Marc A. Weniger & Ralf Küppers
German Cancer Consortium (DKTK), Essen, Germany
Marc A. Weniger & Ralf Küppers
Barcelona Supercomputing Centre (BSC), Barcelona, Spain
Alfonso Valencia
ICREA, Barcelona, Spain
Alfonso Valencia
Institute of Pathology, Medical Faculty of the Ulm University, Ulm, Germany
Peter Möller

Authors

Daniel Hübschmann
View author publications
You can also search for this author in PubMed Google Scholar
Kortine Kleinheinz
View author publications
You can also search for this author in PubMed Google Scholar
Rabea Wagener
View author publications
You can also search for this author in PubMed Google Scholar
Stephan H. Bernhart
View author publications
You can also search for this author in PubMed Google Scholar
Cristina López
View author publications
You can also search for this author in PubMed Google Scholar
Umut H. Toprak
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Sungalee
View author publications
You can also search for this author in PubMed Google Scholar
Naveed Ishaque
View author publications
You can also search for this author in PubMed Google Scholar
Helene Kretzmer
View author publications
You can also search for this author in PubMed Google Scholar
Markus Kreuz
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian M. Waszak
View author publications
You can also search for this author in PubMed Google Scholar
Nagarajan Paramasivam
View author publications
You can also search for this author in PubMed Google Scholar
Ole Ammerpohl
View author publications
You can also search for this author in PubMed Google Scholar
Sietse M. Aukema
View author publications
You can also search for this author in PubMed Google Scholar
Renée Beekman
View author publications
You can also search for this author in PubMed Google Scholar
Anke K. Bergmann
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Bieg
View author publications
You can also search for this author in PubMed Google Scholar
Hans Binder
View author publications
You can also search for this author in PubMed Google Scholar
Arndt Borkhardt
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Borst
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Brors
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Bruns
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Carrillo de Santa Pau
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Claviez
View author publications
You can also search for this author in PubMed Google Scholar
Gero Doose
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Haake
View author publications
You can also search for this author in PubMed Google Scholar
Dennis Karsch
View author publications
You can also search for this author in PubMed Google Scholar
Siegfried Haas
View author publications
You can also search for this author in PubMed Google Scholar
Martin-Leo Hansmann
View author publications
You can also search for this author in PubMed Google Scholar
Jessica I. Hoell
View author publications
You can also search for this author in PubMed Google Scholar
Volker Hovestadt
View author publications
You can also search for this author in PubMed Google Scholar
Bingding Huang
View author publications
You can also search for this author in PubMed Google Scholar
Michael Hummel
View author publications
You can also search for this author in PubMed Google Scholar
Christina Jäger-Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Jules N. A. Kerssemakers
View author publications
You can also search for this author in PubMed Google Scholar
Jan O. Korbel
View author publications
You can also search for this author in PubMed Google Scholar
Dieter Kube
View author publications
You can also search for this author in PubMed Google Scholar
Chris Lawerenz
View author publications
You can also search for this author in PubMed Google Scholar
Dido Lenze
View author publications
You can also search for this author in PubMed Google Scholar
Joost H. A. Martens
View author publications
You can also search for this author in PubMed Google Scholar
German Ott
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard Radlwimmer
View author publications
You can also search for this author in PubMed Google Scholar
Eva Reisinger
View author publications
You can also search for this author in PubMed Google Scholar
Julia Richter
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Rico
View author publications
You can also search for this author in PubMed Google Scholar
Philip Rosenstiel
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Rosenwald
View author publications
You can also search for this author in PubMed Google Scholar
Markus Schillhabel
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Stilgenbauer
View author publications
You can also search for this author in PubMed Google Scholar
Peter F. Stadler
View author publications
You can also search for this author in PubMed Google Scholar
José I. Martín-Subero
View author publications
You can also search for this author in PubMed Google Scholar
Monika Szczepanowski
View author publications
You can also search for this author in PubMed Google Scholar
Gregor Warsow
View author publications
You can also search for this author in PubMed Google Scholar
Marc A. Weniger
View author publications
You can also search for this author in PubMed Google Scholar
Marc Zapatka
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Valencia
View author publications
You can also search for this author in PubMed Google Scholar
Hendrik G. Stunnenberg
View author publications
You can also search for this author in PubMed Google Scholar
Peter Lichter
View author publications
You can also search for this author in PubMed Google Scholar
Peter Möller
View author publications
You can also search for this author in PubMed Google Scholar
Markus Loeffler
View author publications
You can also search for this author in PubMed Google Scholar
Roland Eils
View author publications
You can also search for this author in PubMed Google Scholar
Wolfram Klapper
View author publications
You can also search for this author in PubMed Google Scholar
Steve Hoffmann
View author publications
You can also search for this author in PubMed Google Scholar
Lorenz Trümper
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Küppers
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Schlesner
View author publications
You can also search for this author in PubMed Google Scholar
Reiner Siebert
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

ICGC MMML-Seq consortium

ICGC DE-Mining consortium

BLUEPRINT consortium

Contributions

AB, AC, AR, AKB, CB, DK, GO, LT, PM, SH, SS, and WK provided tumor samples and clinical data. LT made the clinical coordination of the project. DL, MH, MS, and WK stained and reviewed cryomaterial, prepared and performed quality control. WK and MH coordinated extraction of analytes. AR, GO, MH, PM, and WK performed pathology review. PM coordinated pathology review. MAW and RK provided normal B cell samples. BR and M Schillhabel generated next-generation sequencing data. BB, J Korbel, ML, M Schlesner, MZ, PL, PR, RE, and SH supervised next-generation sequencing analysis. BB, CL, DH, J Korbel, KK, ML, M Schlesner, MZ, PL, PR, RE, RK, RS, RW, SH, SHB interpreted the data. CJS, C Lawerenz, ER, and JNAK performed and coordinated data transfer and data mangment of NGS data. BH, BR, DH, GW, JIH, KK, MB, MK, M Schlesner, NI, NP, PB, S Sungalee, SMW, UHT, and VH performed analysis of next-generation sequencing data. GD, HK PFS, SHB performed analysis of RNA-Seq data. AV, DR, ECdSP, HGS, JHAM, JIMS, and RB contributed data and analysis from the BLUEPRINT consortium. MK, ML, and RS provided and analyzed data from the MMML cohort. CL performed FISH and validation analyses. AH, JR, OA, RW, and SMA performed validation analysis. AH, AKB, CL, JR, SMA, and RW supported coordination of the project. D Kube, HB, and JOK contributed to the study design and data interpretation. CL, DH, KK, RW, M Schlesner, RK, RS and SHB wrote the manuscript. PFS, RE, OA, PL, SH, RK, M Schlesner, and RS designed the study and coordinated the project. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ralf Küppers, Matthias Schlesner or Reiner Siebert.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Methods

Supplementary Information

Supplemental Figures

Supplementary Tables 1–5, 7, 9–13

Supplemental Table 6

Supplemental Table 8

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hübschmann, D., Kleinheinz, K., Wagener, R. et al. Mutational mechanisms shaping the coding and noncoding genome of germinal center derived B-cell lymphomas. Leukemia 35, 2002–2016 (2021). https://doi.org/10.1038/s41375-021-01251-z

Download citation

Received: 18 September 2020
Revised: 08 March 2021
Accepted: 29 March 2021
Published: 05 May 2021
Issue Date: July 2021
DOI: https://doi.org/10.1038/s41375-021-01251-z

This article is cited by

Activation-induced cytidine deaminase causes recurrent splicing mutations in diffuse large B-cell lymphoma
- Maria S. Benitez-Cantos
- Carlos Cano
- Pedro P. Medina
Molecular Cancer (2024)
MEF2B C-terminal mutations enhance transcriptional activity and stability to drive B cell lymphomagenesis
- Chuanjiang Yu
- Qiong Shen
- Katia Basso
Nature Communications (2024)
Non-IG::MYC in diffuse large B-cell lymphoma confers variable genomic configurations and MYC transactivation potential
- Chunye Zhang
- Ellen Stelloo
- Ming-Qing Du
Leukemia (2024)
Large B-cell lymphomas with CCND1 rearrangement have different immunoglobulin gene breakpoints and genomic profile than mantle cell lymphoma
- Ece Özoğul
- Anna Montaner
- Elias Campo
Blood Cancer Journal (2024)
Lack of SMARCB1 expression characterizes a subset of human and murine peripheral T-cell lymphomas
- Anja Fischer
- Thomas K. Albert
- Kornelius Kerl
Nature Communications (2024)