The genomic and transcriptional landscape of primary central nervous system lymphoma

Radke, Josefine; Ishaque, Naveed; Koll, Randi; Gu, Zuguang; Schumann, Elisa; Sieverling, Lina; Uhrig, Sebastian; Hübschmann, Daniel; Toprak, Umut H.; López, Cristina; Hostench, Xavier Pastor; Borgoni, Simone; Juraeva, Dilafruz; Pritsch, Fabienne; Paramasivam, Nagarajan; Balasubramanian, Gnana Prakash; Schlesner, Matthias; Sahay, Shashwat; Weniger, Marc; Pehl, Debora; Radbruch, Helena; Osterloh, Anja; Korfel, Agnieszka; Misch, Martin; Onken, Julia; Faust, Katharina; Vajkoczy, Peter; Moskopp, Dag; Wang, Yawen; Jödicke, Andreas; Trümper, Lorenz; Anagnostopoulos, Ioannis; Lenze, Dido; Küppers, Ralf; Hummel, Michael; Schmitt, Clemens A.; Wiestler, Otmar D.; Wolf, Stephan; Unterberg, Andreas; Eils, Roland; Herold-Mende, Christel; Brors, Benedikt; Siebert, Reiner; Wiemann, Stefan; Heppner, Frank L.

doi:10.1038/s41467-022-30050-y

Download PDF

Article
Open access
Published: 10 May 2022

The genomic and transcriptional landscape of primary central nervous system lymphoma

Josefine Radke ORCID: orcid.org/0000-0001-9860-2007^1,2,3^na1,
Naveed Ishaque ORCID: orcid.org/0000-0002-8426-901X⁴^na1,
Randi Koll^1,2,
Zuguang Gu⁵,
Elisa Schumann^1,2,
Lina Sieverling ORCID: orcid.org/0000-0002-0595-4976^6,7,
Sebastian Uhrig^5,8,
Daniel Hübschmann ORCID: orcid.org/0000-0002-6041-7049^5,8,9,10,11,
Umut H. Toprak¹²,
Cristina López^13,14,
Xavier Pastor Hostench ORCID: orcid.org/0000-0003-2089-3777⁵,
Simone Borgoni¹⁵,
Dilafruz Juraeva ORCID: orcid.org/0000-0003-0035-3503⁵,
Fabienne Pritsch^1,2,
Nagarajan Paramasivam⁵,
Gnana Prakash Balasubramanian^16,17,
Matthias Schlesner ORCID: orcid.org/0000-0002-5896-4086^18,19,
Shashwat Sahay^3,4,
Marc Weniger^20,21,
Debora Pehl¹,
Helena Radbruch ORCID: orcid.org/0000-0001-6941-3397¹,
Anja Osterloh¹,
Agnieszka Korfel²²,
Martin Misch²³,
Julia Onken²³,
Katharina Faust²³,
Peter Vajkoczy²³,
Dag Moskopp ORCID: orcid.org/0000-0002-4173-7653²⁴,
Yawen Wang²⁴,
Andreas Jödicke²⁵,
Lorenz Trümper²⁶,
Ioannis Anagnostopoulos²⁷,
Dido Lenze²⁷,
Ralf Küppers ORCID: orcid.org/0000-0002-6691-7191^20,21,
Michael Hummel²⁷,
Clemens A. Schmitt ORCID: orcid.org/0000-0002-4731-2226^28,29,
Otmar D. Wiestler³⁰,
Stephan Wolf³¹,
Andreas Unterberg³²,
Roland Eils ORCID: orcid.org/0000-0002-0034-4036⁴,
Christel Herold-Mende³²,
Benedikt Brors ORCID: orcid.org/0000-0001-5940-3101⁵,
ICGC MMML-Seq Consortium,
Reiner Siebert¹³^na2,
Stefan Wiemann ORCID: orcid.org/0000-0003-4683-3174^11,15^na2 &
…
Frank L. Heppner ORCID: orcid.org/0000-0001-9816-8917^1,2,33,34^na2

Nature Communications volume 13, Article number: 2558 (2022) Cite this article

11k Accesses
44 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Primary lymphomas of the central nervous system (PCNSL) are mainly diffuse large B-cell lymphomas (DLBCLs) confined to the central nervous system (CNS). Molecular drivers of PCNSL have not been fully elucidated. Here, we profile and compare the whole-genome and transcriptome landscape of 51 CNS lymphomas (CNSL) to 39 follicular lymphoma and 36 DLBCL cases outside the CNS. We find recurrent mutations in JAK-STAT, NFkB, and B-cell receptor signaling pathways, including hallmark mutations in MYD88 L265P (67%) and CD79B (63%), and CDKN2A deletions (83%). PCNSLs exhibit significantly more focal deletions of HLA-D (6p21) locus as a potential mechanism of immune evasion. Mutational signatures correlating with DNA replication and mitosis are significantly enriched in PCNSL. TERT gene expression is significantly higher in PCNSL compared to activated B-cell (ABC)-DLBCL. Transcriptome analysis clearly distinguishes PCNSL and systemic DLBCL into distinct molecular subtypes. Epstein-Barr virus (EBV)+ CNSL cases lack recurrent mutational hotspots apart from IG and HLA-DRB loci. We show that PCNSL can be clearly distinguished from DLBCL, having distinct expression profiles, IG expression and translocation patterns, as well as specific combinations of genetic alterations.

Single-cell landscape of primary central nervous system diffuse large B-cell lymphoma

Article Open access 12 June 2023

Nianping Liu, Chen Jiang, … Jun Lin

Distinct genetic changes reveal evolutionary history and heterogeneous molecular grade of DLBCL with MYC/BCL2 double-hit

Article Open access 16 December 2019

Francesco Cucco, Sharon Barrans, … Ming-Qing Du

Molecular heterogeneity in peripheral T-cell lymphoma, not otherwise specified revealed by comprehensive genetic profiling

Article Open access 15 May 2019

Yosaku Watatani, Yasuharu Sato, … Keisuke Kataoka

Introduction

Central nervous system (CNS) lymphomas are predominantly aggressive neoplasms involving brain, meninges, spinal cord, and eyes^1,2. Two clinical subtypes of CNSL can be distinguished: primary central nervous system lymphoma (PCNSL), which is confined to the CNS; and secondary central nervous system lymphoma (SCNSL) presenting initially with systemic, non-CNS or synchronous systemic and CNS involvement. The term SCNSL comprises all systemic lymphomas that spread to the CNS and its presentation, tropism, outcome and therapeutic options differ from PCNSL^3,4. Typically, SCNSL are classified as diffuse large B-cell lymphoma (DLBCL), while other types such as follicular lymphoma (FL), T-cell lymphoma or Hodgkin lymphoma are extremely rare^5,6.

PCNSL incidence is increased in immunocompromised patients, in which the tumor cells are typically Epstein-Barr virus (EBV)-positive^1,7,8. In contrast, PCNSL in immunocompetent patients is typically EBV-negative. The mechanisms leading to the observed exclusive topographical restriction of PCNSL to the CNS are not fully elucidated⁹. PCNSL is classified as DLBCL in the vast majority of cases (approx. 90%) which immunohistochemically most often show a non-germinal center B-cell-like (non-GCB) immunophenotype^1,10,11 according to the Hans classification¹². The tumor cells express pan B-cell markers (CD19, CD20, and CD79a), the germinal center (GC)-associated molecule BCL6¹³, and the post-GC-associated marker MUM1/IRF4¹⁴. By gene expression profiling, the tumor cells are most closely related to late germinal center (exit) B-cells¹⁵. Pathomechanistic genomic alterations involving Toll-like- and B-cell receptor (TLR, BCR) signaling pathways are identified in previous studies revealing a very high frequency of somatic nonsynonymous mutations in genes such as MYD88, CARD11, and CD79B^{16,17,18,19,20}. Additionally, often homozygous HLA class II^21,22 and CDKN2A loss, recurrent BCL6 translocations^23,24 and structural variants at chromosome band 9p24.1 (affecting CD274/PD-L1 and PDCD1LG2/PD-L2)²⁵ as well as TBL1XR1 variants²⁶ are repeatedly described in PCNSL^27,28. These mutational patterns suggest PCNSL to be genetically similar to recently described “MCD”, “C5” or “MYD88-like” subtypes for which a derivation from long-lived memory B-cells is proposed^{29,30,31,32,33,34,35}.

The outcome of PCNSL, even in immunocompetent hosts, is poor compared to most primary systemic DLBCL³⁶, though probably not worse than that of DLBCL of the MCD/C5 group in general³⁰. High-dose methotrexate (MTX) remains the commonly administered therapy but the use of rituximab (monoclonal anti-CD20 antibody) is shown to be effective^37,38. However, reports on rituximab efficiency in PCNSL are conflicting^39,40,41,42. Genomic studies suggest that lymphoma cell proliferation and survival are driven at least in part, by deregulated TLR, BCR, JAK-STAT, and NFκB signaling pathways inducing constitutive NFκB activation^43,44,45. Therefore, inhibitors up- and downstream of NFκB such as ibrutinib, known to inhibit Bruton’s tyrosine kinase (BTK) as critical mediator of BCR signaling, and lenalidomide which is shown to have indirect effects on tumor immunity are applied and seem to be effective therapeutic alternatives in PCNSL^{46,47,48,49,50,51}. PD-L1/2 blockade is discussed as another therapeutic option⁵².

Despite all progress in the molecular characterization of PCNSL in the last decades, our understanding of the genetic and transcriptional alterations of PCNSL is by far not comprehensive. The few previous next generation sequencing (NGS) studies of PCNSL are limited to target enrichment only of exons, or whole-genome analysis of very few samples^{44,53,54,55,56,57,58}. Therefore, pathogenic mechanisms other than coding variants, such as non-coding and regulatory changes, structural variants or mutational mechanisms related to the genome-wide distribution of somatic hypermutation (SHM) are not fully elucidated in PCNSL. Unbiased omics profiling, such as whole-genome sequencing (WGS) studies integrated with transcriptome sequencing, are currently the methods of choice to illuminate the role of non-coding mutations^59,60. In addition, these approaches can unravel various molecular mechanisms deregulating driver genes in PCNSL, which are necessary for diagnosis, risk stratification, and treatment in the era of precision and targeted therapies.

In this work, we perform whole-genome and transcriptome sequencing in 51 B-cell lymphomas presenting in the CNS, including 42 PCNSL samples from immunocompetent patients, to comprehensively describe the mutational and transcriptional landscape of PCNSL.

Results

Study cohort

We enrolled CNSL samples from 51 adults diagnosed with PCNSL or SCNSL. According to the site of manifestation, the following subgroups were defined: PCNSL within the brain parenchyma (PCNSL; n = 39), PCNSL with meningeal manifestation (PCNSL-M; n = 3), SCNSL within the brain parenchyma (SCNSL; n = 3), SCNSL with meningeal manifestation (SCNSL-M; n = 3) and EBV-positive lymphomas (EBV+; n = 3). Median age was 69, mean age was 66.5 years at diagnosis (range 40–82 years). The female:male ratio was 1.3:1. Follow up data were available for 44 patients. The follow up time ranged from 1 to 104 months with a median survival of 15.0 months (Supplementary Fig. 1a). The detailed study cohort information and subgroup-specific demographics are given in Fig. 1a, b and Supplementary Data 1. Patient samples were histologically and immunohistochemically classified according to the WHO criteria^2,11,61,62, and further stratified according to the Hans classification¹² into non-GCB (n = 37) and GCB subgroup (n = 5, Fig. 1c, Supplementary Data 1). For nine samples, the tissue was not sufficient for non-GCB/GCB characterization. Furthermore, we integrated data from the ICGC MMML-Seq cohort (www.icgc.org) for comparison of WGS and transcriptome data from systemic DLBCL, FL, naïve B-cells, and GC B-cells^34,59,60

**Fig. 1: Study design and multi-omic analysis of the CNSL cohort.**

Mutational landscape of central nervous system lymphoma (CNSL)

WGS data of 38 CNSL (30 PCNSL, 1 PCNSL-M, 2 SCNSL, 3 SCNSL-M, and 2 EBV+ samples, Fig. 1b) was obtained with a median coverage of 77 (range 31–100) for tumors and 45 (range 27–85) for matched germline controls. We identified a median of 18584 (range: 1987–48280; median of the 30 PCNSL: 20263 (range: 9185–48280)) total SNVs, of which a median of 5759 (range: 686–16731; PCNSL: 6274 (range: 2850–16731)) were intronic, a median of 10218 (range: 983–24033; PCNSL: 10790 (range: 5063–24033) were intergenic, and a median of 194 (range: 47–436), PCNSL: 200 (range: 100–436) were nonsynonymous exonic variants (1% of all SNVs). Furthermore, we identified a median of 2406 (range: 711–9430; PCNSL: 2485 (range: 941–9430)) indels per CNSL sample, of which the majority was intergenic (1333 (range: 403–5218), PCNSL: 1375 (range: 517–5218). The median number of variants (SNVs and indels) in non-coding RNA genes was 2744 (range: 551–6913), PCNSL: 2901 (range: 1220–6913). Selected variants were verified using Sanger sequencing (see “Methods” section).

The CNSL cohort presented a median of 152 (PCNSL: 147) SVs (range: 24–517 (PCNSL: 47–517), inversions: 21 (PCNSL: 20), deletions: 76 (PCNSL: 81), duplications: 20 (PCNSL: 19), translocations: 14 (PCNSL: 16)). We also investigated chromosome level CNVs (based on 30% or more of a chromosome being amplified or deleted) and found a median of 8 CNVs (median 1 cnLOH (PCNSL: 2), median 4 gains (PCNSL: 4), and median 2 losses (PCNSL: 3)). The detailed mutational statistics (CNVs, indels, SNVs, and SVs) of the CNSL, DLBCL, and FL samples are displayed in Supplementary Data 2.

PCNSL represents MCD genetic subtype of DLBCLs

Recent exome studies described the existence of different genetic subtypes of DLBCL, which show activation of distinct signaling pathways and different clinical outcomes^25,30,31. We used the LymphGen algorithm described by Wright et al.³¹ to classify our samples according to these genetic subtypes based on the obtained WGS data. The results of the CNSL cohort are displayed in Fig. 1d. In line with previous results^17,20,30,31, the majority of PCNSL samples were classified as MCD (based on the co-occurrence of MYD88 L265P and CD79B mutations, 67%, 20/30). One sample was each assigned to BN2 (BCL6 fusions and NOTCH2 mutations, 3%, 1/30) and ST2 (SGK1 and TET2 mutated, 3%, 1/30), seven samples were non-subtyped cases (“Other”, 23%, 7/30), and one sample was equally assigned to both groups BN2/MCD (3%, 1/30; Supplementary Data 1).

PCNSL samples classified as “Other” exhibited different CNV profiles affecting chromosome arms 1q, 2p, 2q, 3q, 4p, and 11p, as well as significantly more deletions of CREBBP compared to PCNSL samples classified as MCD by the LymphGen algorithm (Supplementary Fig. 2a). CREBBP gene inactivation is considered an early event in FLs and a subset of systemic DLBCL, mostly of GCB origin^{63,64,65,66,67}. CREBBP inactivation is also described as a hallmark of the EZB class, but LymphGen’s classification model is restricted to CREBBP point mutations and not focal deletions. The finding of a significantly increase number of CREBBP alterations (p = 0.046, Mann–Whitney U test) in PCNSLs classified as “Other” compared to MCD might, thus, imply a small subset of PCNSL to more resemble GCB-like DLBCL or, alternatively, the existence of a group of occult systemic GCB-lymphomas with first clinical presentation in the CNS. Additionally, PCNSL-Other demonstrated significantly fewer mutations in GRHPR, ETV6, and PIM1 (Supplementary Fig. 2b, c).

Driver mutations in CNSL

We first identified the genes recurrently mutated in CNSL (Fig. 2a) and used Metascape⁶⁸ for further pathway and process enrichment analysis. The top three level enriched terms were ‘Regulation of hemopoiesis’, ‘Chromatin organization involved in negative regulation of transcription’, and ‘Cytokine signaling in immune system’ ((hypergeometric test, FDR 8.91 × 10⁻⁹, 1.04 × 10⁻⁴, 1.17 × 10⁻⁴, respectively; Fig. 2b). The enrichment analysis in TRRUST revealed ‘Regulated by: STAT3’ as the most significant term (hypergeometric test, FDR 3.98 × 10⁻⁷; Fig. 2c). STAT3 has been associated with intracranial spreading and poor survival in PCNSL^69,70, and reports of STAT3 inhibition via small molecules achieve complete tumor regression in vivo for lymphoma cell lines⁷¹. As STAT3 is not highly mutated or hit by SVs or CNVs, its activation seems—in line with previous reports—induced by extrinsic factors such as infiltrating macrophages/microglial cells⁷², or intrinsic factors such as activation downstream of MYD88⁷³.

**Fig. 2: Recurrent coding mutations and genomic drivers in PCNSL.**

Next, we used IntOGen and MutSigCV to discover putative driver mutations in the PCNSL WGS sub-cohort (Fig. 2d and Supplementary Data 3). We identified a total of 50 mutated driver genes, of which only 21 were previously known drivers. Many of the predicted drivers were associated with MCD enriched genes, including MYD88 (67%), CD79B (63%), OSBPL10 (83%), HLA-A/B/C (40%/63%/53%), PRDM1 (40%), TOX (50%), TBL1XR1 (40%), CD58 (37%), PIM1 (70%), ETV6 (50%), BCL11A (30%), CDKN2A (83%), GRHPR (60%), FOXC1 (20%), and DAZAP1 (20%). These driver genes were significantly enriched for genes containing the BCL6 binding motif (TRANSFAC and JASPAR PWMs, Enrichr enrichment test, adjusted p = 0.03193).

OSBPL10 was previously reported as a target of aSHM in PCNSL⁵³. Consistent with observations in DLBCL⁷⁴, most of the identified mutations in PCNSL were confined to the exon 1 coding region (Supplementary Fig. 3).

Concerning MYD88, we only detected the classical pathogenic hotspot L265P mutation, which was validated by Sanger sequencing in all PCNSL samples investigated (100%, n = 26, Supplementary Data 4 and 5). Notably, MYD88 mutation rates in the extension FFPE cohort were 78.9% in PCNSL and 55.6% in SCNSL. None of the five EBV-positive cases investigated harbored oncogenic MYD88 L265P mutations, which is in line with previous findings^25,75. Mutations in TBL1XR1 also modulating TLR/MYD88 signaling²⁵ were identified in 40% of PCNSL (Fig. 2a, b). We investigated mutual exclusivity and co-occurrence patterns for MYD88 among the driver genes that affect at least five patients using Fisher and CoMET test. We observed mutual exclusivity between alterations in MYD88 and the NOTCH signaling inhibitor SPEN³⁰ (Fisher test, p = 0.0009, FDR = 0.033). In line with previous reports on ABC-DLBCLs^76,77, we found coexisting alterations in MYD88 and CD79B. Nevertheless, this co-occurrence was not significant (Fisher test, p = 0.16, FDR = 1.0). However, MYD88 was most significantly co-occurring with TBL1XR1 (Fisher test, p = 0.04), both activating the NFκB signaling pathway²⁶. Although this was not significant after correction for multiple testing (FDR = 1, Supplementary Data 6).

Compared to the MCD driver genes identified in the series presented by Wright et al.³¹, our PCNSL series exhibited a higher proportion of samples with mutations in PABPC1 (10% vs 0%), P2RY8 (13% vs 1.2%), ITPKB (23% vs 2.5%), GNA13 (20% vs 5.1%), and B2M (13.3% vs 2.8%). Furthermore, predicted driver genes in our PCNSL series included genes enriched in all other LymphGen classes: BN2 (CCND3, BCL6, HIST1H1D, SPEN, PABPC1, and UBE2A), EZB (GNA13, IRF8, BCL7A, KM2TD, and EP300), ST2 (P2RY8, TET2, ZFP36L1, and ITPKB) and A53 (B2M and TP53).

While the majority of identified drivers were reported by Wright et al., a number were not, including FBXW7, ATM, TMSB4X, THRAP3, ID2, GRB2, ZEB2, GLI3, UBA1, MAPKAPK2, AXIN2, TAP2, ROCK1, CEP290, and HLA-DQB1. These were previously recognized as general DLBCL drivers by Reddy et al.⁷⁸ and/or Chapuy et al.²⁹. ZEB2 was additionally identified as a genetic alteration associated either with the ABC subgroup⁷⁸ or the DLBCL C1 cluster²⁹ (Supplementary Data 3).

A remarkable finding was the identification of MYC mutations in 17% of PCNSL in the absence of MYC translocations. MYC alteration does not belong to the defining feature of the LymphGen algorithm nor has it been described as a driver in DLBCL by Chapuy et al.²⁹, though its functional relevance as oncogene in DLBCL has been shown by Reddy et al.⁷⁸. Mutation of MYC in lymphomas is frequently linked to IGH translocations, which nevertheless are rare in the PCNSL as shown in the present as well as previous studies^23,79. Whereas previous studies showing a high frequency of MYC mutations in PCNSL focused on the region underlying SHM in PCNSL⁸⁰, we here show that these mutations scatter across the gene (Supplementary Fig. 4). The function of the changes remains elusive but it is intriguing to speculate that at least part of them might contribute to the “double expression” of BCL2 and MYC in the absence of MYC translocation in PCNSL which has been associated with unfavorable outcome in systemic DLBCL⁸¹.

Recurrent somatic alterations in non-protein-coding genes

The landscape of mutations affecting ncRNA in PCNSL was comparable to ABC-DLBCL, apart from significantly more mutations in AL122127.1 and AL122127.4 (Fig. 3a), situated in the IGH locus, and in RP11-211G3.2, situated in the first intron of BCL6. While the implications of these mutations are unclear, it is possible that these mutations are accumulated as part of the SHM/aSHM process affecting IGH and BCL6. Additionally, we identified recurrent aberrations in the aSHM target MIR142 (80%; Fig. 3a, b) as well as MALAT1 (70%) and NEAT1 (60%), both located 53 kb apart on 11q13.1. The mechanistic roles of many ncRNAs are poorly understood because their exact function is difficult to assess. However, the lncRNAs NEAT1 (nuclear enriched abundant transcript 1) and MALAT1 (metastasis-associated lung adenocarcinoma transcript 1) are well known to play essential roles in the development and progression of various cancers by influencing gene expression by alternative splicing and epigenetic modification of regulatory elements^82,83,84. Both, MALAT1 and NEAT1, which have not been linked to PCNSL before are known to be mutated and highly expressed in DLBCL³⁴ and predict poor prognosis^85,86. Further aberrations in lncRNAs affected KCNQ1OT1 (33%) and SNHG3 (23%), both reported to have oncogenic functions in multiple cancers^87,88 as well as SNHG14 (37%), promoting immune evasion in DLBCL⁸⁹.

**Fig. 3: Recurrent non-coding RNA mutations and Kataegis events in PCNSL.**

Kataegis shapes the mutational repertoire of PCNSL

Kataegis is a pattern of mutational hotspots that has been associated with a number of cancers⁹⁰, and is a frequent consequence of AID activity in lymphomas⁹¹. Many of the recurrently mutated genes in PCNSL were dominated by alterations that are located in these highly mutated hotspots¹¹, of which several have previously been described as targets of aSHM, such as OSBPL10, PIM1, BTG2, and PAX5 (Fig. 3b)^25,53,80. Of the 50 identified protein-coding driver genes and the top 50 mutated ncRNA in PCNSL, 15 and 21 were targeted by kataegis, respectively (Figs. 2a, b, 3a, b, additional supplement [https://doi.org/10.5281/zenodo.6054242]⁹². Consistent with previous reports⁹³, expression of miRNA, lncRNA, antisense RNA, and protein coding genes with kataegis loci were expressed significantly higher than those without (Wilcoxon rank sum test, p < 0.05; Fig. 3c). This implicates that either aSHM preferentially targets highly expressed genes, or that aSHM may cause hyperactivation of these genes. Interestingly, the largest difference in RNA expression was observed in miRNA genes, again highlighting the importance of the non-coding alterations in PCNSL. This observation was consistent for subgroups, including systemic DLBCL (Supplementary Fig. 5).

Physiologically, SHM is the process of introducing mutations in the antibody genes to alter the antigen-binding site, increasing the immunoglobulin (IG) diversity⁹⁴. Kataegis events were at IGH (100%), IGL (100%) and IGK (70%) loci but were also found outside IG loci, targeting BTG2 (63%), GRHPR (50%), PIM1 (43%), DTX1 (40%), OSBPL10 (37%), ZNF860 (37%), BCL6 (33%), RHOH (33%), CXCR4 (30%), BACH2 (27%), and PAX5 (27%; Fig. 3b). The recurrently targeted genes in PCNSL mostly overlapped with those targeted in ABC-DLBCLs. However, samples with mutational hotspots in BTG2, GRHPR, OSBPL10 and ZNF860 were significantly more frequent in PCNSL (in 18, 15, 11 and 11 of 30 samples, respectively) compared to ABC-like DLBCL (in 2, 0, 0 and 0 of 13 samples, p = 0.009, 0.001, 0.019 and 0.019, Fisher’s exact test, respectively). Taking all non-IG genes that overlapped a mutational hotspot in at least one PCNSL sample (242 genes, Supplementary Data 7), we found the BCR signaling pathway to be most significantly enriched (Enrichr enrichment test, adjusted p = 0.0046). Taken together, kataegis and aSHM play a decisive role in shaping the mutational repertoire of PCNSL and are associated with functional pathways in PCNSL pathogenesis.

While patterns of aSHM and kataegis were similar between CNSL and systemic DLBCL subtypes, we identified that EBV+ CNSL cases did not share many of the recurrent mutational hotspots apart from IGH and the HLA-DRB locus. (Fig. 3b, Supplementary Fig. 6a, b, Supplementary Data 5).

Recurrent copy number alterations (CNAs)

Compared to systemic ABC-DLBCL and GCB-DLBCL of which the copy number profiles reflected previously published results^34,60,95, PCNSL demonstrated significantly more CN losses in 6p21 (HLA-D locus, Fig. 4a–c, Supplementary Data 8) as well as recurrent losses in 9p21 (MTAP, CDKN2A/B) and 19p13 (CDKN2D). The loss of the HLA-D locus that encode for MHC class II molecules lead to reduced immune surveillance and poor survival in DLBCL⁹⁶. CDKN2A is an established tumor suppressor gene with roles in angiogenesis, cell death, invasiveness, and growth suppression^97,98,99. Additionally, we found deletions on chromosomes 1p13 and 3q13, affecting genes such as CD58 and CD80, both candidates reported to lead to immune evasion¹⁰⁰. Further CN losses were detected on chromosomes 8q12 (TOX), 12p13 (ETV6), and 15q21 (B2M) as well as 3p14, affecting the fragile site tumor suppressor gene, fragile histidine triad (FHIT). TOX deletions have been previously described by array-based imbalance profiling¹⁰¹. TOX is required for the development of various T-cell subsets and was described as putative tumor suppressor in MCD DLBCL³⁰. TOX downregulation has been associated with poor prognosis in different cancers¹⁰² and is a predictor for anti-PD1 response¹⁰³. Significant CN gains in PCNSL mapped to 2q37 and 18q21 affecting DIS3L2 and MALT. DIS3L2 encodes for an exoribonuclease that is responsible for Perlman syndrome¹⁰⁴ and was recently described to promote HCC tumor progression by upregulating production of the oncogenic isoform of RAC1, RAC1B¹⁰⁵. MALT is a regulator of NFκB signaling and potential therapeutic target in B-cell lymphoma¹⁰⁶.

**Fig. 4: Genomic structural variation in PCNSL.**

Recurrent structural variations (SVs)

We defined SVs as genomic breakpoints, which can correspond the borders of amplifications and deletion, but also balanced translocations and inversions. PCNSL showed a median of 147 SVs (range: 24–517, Supplementary Data 2). IG gene rearrangements were found in all PCNSL, ABC-DLBCL, and GCB-DBCL cases and affected the IGH (100, 100, and 100%), IGL (73, 46, and 31%) and IGK (87, 54, and 63%) loci. Furthermore, direct SVs affected FHIT (73, 23, and 38%), CDKN2A (67, 38, and 25%), BCL6 (37, 21, and 19%), OSBPL10 (33, 8, and 13%), ETV6 (33%, 15%, 6%), PAX5 (27, 0, and 13%), PIM1 (23, 0, and 6%), TOX (23, 8, and 19%), BTG2 (23, 8, and 0%), WWOX (23, 8, and 25%), as well as CD58 (20, 8, and 19%; Fig. 4d, Supplementary Data 9). WWOX and FHIT represent common fragile site (CFS) and have been classified as tumor suppressor genes in DLBCL^107,108.

Recent studies have shown that translocation can act as enhancer hijacking even when the events is several hundred thousand base-pairs away from target genes¹⁰⁹. To investigate this, we also annotated SV breakpoints to genes within 100 kbp and also to the closest genes. We found a number of genes involved in G protein-coupled receptor signaling (ARAP2, LPHN2, LPHN3, EPHA4, ADGRL2, and GPC5) consistent with observations in pan-cancer studies¹¹⁰. A number of other genes exhibited at least three times as many distal translocations (while still being the closest gene) than directly on the gene, including PIK3C3, EPHA4, SI, ALCAM, NCAM2, CADM2, CDH9, PABPC4L, GRIK2, POM121L12, ACO1, KLHL1, SLITRK1, and SLITRK6. Hyperactivation of PI3K signaling is one of the most common events in human cancers, and PIK3C3 has been shown to promote cell proliferation¹¹¹ and autophagy¹¹², and its inhibition has shown therapeutic benefit in bladder, hepatocellular (HCC), and colon cancer^113,114,115. EPHA4 has been described to promote cell proliferation and migration^116,117 and was associated with tumour aggressiveness and poor patient survival in human breast and rectal cancer^118,119. Inhibition of EphA4 has been shown to overcome intrinsic resistance to chemotherapy¹²⁰. Many of these other potential enhancer-hijacking targets do not have well-established roles in cancer pathogenesis, however, we did notice a number of genes involved in cell adhesion (ALCAM, NCAM2, CADM2, and CDH9) and 2 SLIT and NTRK like family members (SLITRK1, SLITRK6).

Immunoglobulin translocations implicate distinct CNSL subtypes

IG translocations are established oncogenic drivers of many lymphatic neoplasms^121,122,123. IGH-BCL6 fusions are recurrent in PCNSL²⁴, which mirrors observations of ABC-DLBCL¹²⁴. IGH-BCL2 fusions are more prominent in GCB-DLBCL¹²⁵. We investigated the recurrent translocations (≥2 patients) in our cohort and identified five CNSL samples with IGH-BCL6 translocations (Fig. 5a and Supplementary Fig. 7a–d). We also identified three cases with IGH-BCL2 translocations (Fig. 5b and Supplementary Fig. 7e, f) one in each of SCNSL, SCNSL-M, and PCNSL-M, further implicating that meningeal and secondary CNSL are distinct from intraparenchymal PCNSLs. Two PCNSL cases showed IGL and IGH translocations with breakpoints close to CD274 (PD-L1; Fig. 5c, Supplementary Fig. 7g), which resulted in strong PD-L1 protein expression (Supplementary Fig. 7h) and therefore implicates a potential target for immunotherapy. All other, non-recurrent translocations are listed in Supplementary Data 10.

**Fig. 5: Immunoglobulin translocations in PCNSL.**

The analyses of the IG breakpoints provided in all informative junctions evidence that these occurred due to illegitimate CSR or aberrant SHM, with the notable exception of the IGH-BCL2 junctions, which were the consequence of an aberrant VDJ rearrangement (Supplementary Data 11). Thus, all IG translocations in PCNSL are supposed to occur in the GC process rather than in a pre B-cell.

Mutational signatures in PCNSL

Mutational signatures were analyzed with regard to SNVs (single base substitutions, SBS) and indels (ID) of all tumor samples as defined by Alexandrov et al.¹²⁶ (Fig. 6). For single base substitution signatures (SBS) we found mutational patterns that have been associated with spontaneous deamination of 5-methylcytosine (SBS1), defective activity of the AID/APOBEC family (SBS2), failure of double-strand DNA break repair by homologous recombination (SBS3), SHM (SBS9), and damage by reactive oxygen species (SBS18). Additionally, the samples frequently revealed mutations caused by mutational signatures SBS5, SBS17b, and SBS40, which are of unknown etiology (Fig. 6a). The presence of SBS3, hallmark of defective DNA break repair by homologous recombination, and SBS40 may be therapeutically relevant as these indicate potential effectiveness of combination therapy with PARP inhibitors (e.g., Olaparib) alongside cytotoxic chemotherapy^127,128. The three most prominent signatures in DLBCL, FL, and CNSL were SBS9, SBS5, and SBS40 (Fig. 6b). Direct comparison of PCNSL and DLBCL revealed that signature SBS1, which correlates with DNA replication at mitosis (mitotic clock)¹²⁶, was significantly enriched in PCNSL (p = 0.0027; Fig. 6c, Supplementary Fig. 8a–g).

Analysis of small insertion and deletion signatures (ID) revealed mutational patterns associated with slippage during DNA replication of the replicated DNA strand (ID1) and template DNA strand (ID2); both of these signatures appeared significantly (p < 1 × 10⁻⁴, Wilcoxon) more prominent in PCNSL compared to DLBCL and FL (Fig. 6d), though different read-depths may have influence this analysis.

Interestingly, only CNSL samples but not DLBCL or FL revealed mutations caused by mutational signature ID12 that is of unknown etiology and has been observed in prostate adenocarcinoma and soft tissue liposarcoma¹²⁶.

PCNSL RNA expression signatures are distinct from systemic DLBCL

The relative rarity of PCNSL and limited availability of fresh frozen tissue have thus far complicated the implementation of larger molecular studies needed for patient stratification. To unravel the molecular signature of PCNSL, we employed an unsupervised consensus clustering approach (using the cola tool¹²⁹) to identify expression groupings between PCNSL samples and samples from the ICGC MMML-seq project (mainly consisting of non-GCB and GCB type DLBCLs, and FLs). This yielded the following major clusters: FL, PCNSL, GCB-type DLBCL, ABC-type DLBCL, non-tumorous GC B-cells, and naïve B-cells (Fig. 7a). For each cluster, we identified signature gene sets that significantly correlated with the groupings. Interestingly, all meningeal PCNSL (PCNSL-M) and SCNSL-M grouped together with either GCB- or ABC-DLBCL, clearly indicating that these subtypes are molecularly and pathomechanistically distinct from intraparenchymal CSNL, which formed one separate cluster suggesting a distinct signature of CNS tropism. The ABC-type DLBCL cluster was enriched for MYD88 mutant samples, which were still distinct from MYD88 mutant PCNSL at the gene expression level (Fig. 7a).

**Fig. 7: Transcriptomic signatures distinguish PCNSL from DLBCL.**

To further exclude an impact of potentially contaminating surrounding CNS tissue on gene expression signatures, we analyzed total RNA from normal brain controls (n = 2) and compared this to PCNSL. To investigate the gradient of various tumor cell contents of samples, we spiked increasing concentrations of RNA from non-diseased brain tissue into a PCNSL sample with very high tumor cell content (0, 20, 40, 60, and 80%). Then, we further stratified the PCNSL group by another round of consensus clustering using two different classification methods, which both revealed two groups (Fig. 7b and Supplementary Fig. 9a–c). The first PCNSL expression group (PCNSL subcluster 1) consisted of samples with high tumor cell content (determined by WGS and histopathological analysis). Expression of its signature gene set did not show similarity to normal brain tissue expression. However, the second PCNSL expression group (PCNSL subcluster 2) contained mainly samples with lower tumor cell content, and expression of its signature gene set was indeed similar to normal brain tissue expression (Fig. 7b). We identified the PCNSL signature gene sets relative to ABC and GCB type DLBCLs and FLs, and removed potential background signatures from contaminating brain tissue. The marker genes in each group were identified based on differential gene expression analysis (Supplementary Data 12). Among the marker genes for PCNSL were e.g., LAPTM5, a CD40-related gene expressed in malignant B-cell lymphoma¹³⁰ and ITGAE, mediating cell adhesion, migration, and lymphocyte homing through interaction with E-cadherin¹³¹. We used Metascape⁶⁸ for functional analysis of marker genes. Further pathway and process enrichment analysis revealed that pathways such as ‘ribonucleoprotein complex biogenesis_, ‘mRNA processing’, ‘cell cycle’, ‘RNA modification’, ‘DNA conformation change’, and ‘DNA-templated transcription, initiation’ were enriched (Supplementary Fig. 9d). The top three level Gene Ontology biological processes included ‘cellular component organization or biogenesis’, ‘metabolic process’, and ‘localization’ (Supplementary Fig. 9e).

Expression of IGHM is characteristic for PCNSL

Additionally, we analyzed the expression of IG constant genes, which again revealed the same clusters as the unsupervised consensus clustering approach, demonstrating that PCNSL can be differentiated from DLBCL based on only the expression of IG constant genes. In contrast to DLBCL and FL, PCNSL show generally low expression of IG constant genes, but higher expression of IGHM (Fig. 7c).

TERT expression but not telomere content upregulated in PCNSL

Telomerase activity and telomerase reverse transcriptase (TERT) gene expression have been reported as prognostic factors in PCNSL patients¹³². We used TelomereHunter, a software for detailed characterization of telomere maintenance mechanisms¹³³ to estimate the telomere content in a representative cohort of PCNSL, SCNSL, peripheral lymphoma, as well as non-tumorous naïve and GC B-cells as control⁵⁹. In approximately 1/3 of the samples, the TCC-corrected telomere content was higher in the tumor than in the matched control (whole blood) (Fig. 8a and Supplementary Fig. 10a). Nevertheless, telomere content was not significantly different between the different histological, clinical and molecular subgroups irrespective of whether the results were corrected for TCC (Fig. 8b and Supplementary Fig. 10b (tumor/control log2 ratio)) or not (Supplementary Fig. 10c (uncorrected for the control sample)). As expected, we found a negative correlation between age and telomere content in the control (Supplementary Fig. 10d). However, expression of the TERT gene, the main activity of the encoded protein is the elongation of telomeres, was significantly higher in GC B-cells⁵⁹ and in PCNSL compared to ABC-DLBCL (Fig. 8c and Supplementary Fig. 10e). This was consistent with observations when stratifying samples by RNA subgroups, where TERT expression was significantly higher in PCNSL compared to ABC-DLBCL, GCB-DLBCL, and FL (Fig. 8d).

**Fig. 8: *TERT* expression correlates with telomere content in PCNSL.**

Interestingly, the higher TERT expression in PCNSL significantly correlated with normalized telomere content (uncorrected for the control sample: Pearson’s R = 0.67, p = 0.003; (Fig. 8e) and telomere content T/C log2 ratio (Supplementary Fig. 10f)). However, SCNSL-M, DLBCL and FL did not show such trends (R = −1, p = 0.3, R = −0.06, p = 0.7, and R = 0.07, p = 0.7, respectively (Fig. 8e)). This suggests that TERT has an active role in combatting telomere degradation in PCNSL. Two well-known promoter hotspot mutations (−124C>T (C228T) and −146C>T (C250T)) have been described to increase TERT expression and cell-cycle progression^134,135. These mutations have been found in several solid and hematological malignancies including different brain tumors and PCNSL^136,137,138. Therefore, we next investigated the TERT promoter mutation status in our WGS (n = 38) and FFPE extension cohort (n = 31). Sanger sequencing of the TERT promoter region was performed (i) for all WGS samples having only low coverage in the promoter sequence (below 40×, n = 6, Supplementary Fig. 10g), (ii) the FFPE extension cohort, and (iii) three oligodendrogliomas, known to carry high frequency TERT promoter mutation¹³⁹. We detected no TERT promoter mutations in 67 samples of PCNSL and SCNSL (Supplementary Data 5, Supplementary Fig. 10h), while the well-known TERT rs2853669 polymorphism, which has been associated with increased cancer risk¹⁴⁰, was identified in 40% (14/35 (8 PCNSL, 6 SCNSL)) of the patients in the extension FFPE cohort. The Sanger sequencing results of two samples were not conclusive.

Discussion

Here we have performed a comprehensive analysis of recurrent protein coding and non-coding mutations, CNVs, SVs, and driver mutations in a large cohort of PCNSL and compared the genetic features to systemic DLBCL and FL. The vast majority of PCNSLs are of non-GCB-DLBCL subtype¹⁴¹ and share many genetic alterations with non-CNS ABC-DLBCL in the same signaling pathways. Previous studies made use of whole-exome sequencing^25,53 which (i) limits the investigation to protein-coding regions and (ii) may not be ideal for understanding the patterns of mutational hotspots—e.g., attributed to AID induced SHM in B-cell non-Hodgkin lymphomas¹⁴²—as well as the structural variation in genomes¹⁴³. PCNSL showed significantly more SNVs and indels compared to systemic DLBCL, even in intronic and intergenic regions, also underlining the importance of non-protein coding aberrations in PCNSL pathogenesis. Many of the recurrent mutations in non-protein coding genes affected non-coding RNAs (ncRNAs), which are among other functions involved in epigenetic regulation of gene expression, cell differentiation, and development^82,144. The molecular profile of SCNSL, on the other hand, corresponded to that of systemic DLBCL.

In line with previous results^25,43,145, we here demonstrate that PCNSL are defined by recurrent and often biallelic CDKN2A deletions, MYD88 L265P mutations, and mutations that activate BCR signaling, genetic hallmarks of the DLBCL subtype MCD/C5^30,31. Furthermore, we found high frequencies of SVs affecting the IGH, IGL, and IGK loci as well as losses of chromosome 6p affecting the HLA gene complex as a mechanism to escape recognition by cytotoxic T-cells¹⁴⁶. MYD88 L265P mutation and CDKN2A loss have been described as early mutational events in PCNSL⁴⁵ and we confirmed both to be major drivers in PCNSL. While TP53 alterations seem to play a minor role in PCNSL, the CDKN2A/B genes encode several proteins that regulate either the p53 (p19 ARF) or the RB1 (p16 INK4a) pathway^147,148, underlining the relevance of the TP53 pathway in the context of PCNSL and cell cycle control.

The frequencies of MYD88 mutations had varied between 38 and 94% in previous PCNSL studies^{26,31,53,54,149}, which might reflect a selection bias among small study populations, given the rarity of PCNSL. This huge range could alternatively result also from an imprecise definition of PCNSL, which includes all malignant NHL within the brain, eyes, spinal cord, or leptomeninges without systemic involvement. In contrast, we here defined PCNSL as only intraparenchymal CNS-DLBCL and found a high prevalence of the MYD88 L265P variant in this cohort (WGS cohort, extension FFPE cohort; mean: 73%). This is further supported by our robust classification of PCNSL by the RNA sequencing results, which demonstrated that the expression profiles of PCNSLs were distinct from PCNSL-M, SCNSL-M, and peripheral DLBCL without CNS manifestation, the latter three entities sharing similar profiles.

Moreover, SHM has previously been described as having a pathogenic role in PCNSL development and that its extent was greater there than in systemic DLBCL⁸⁰. In agreement with previous reports, we identified several aSHM targets including the proto-oncogenesPIM1, PAX5, BTG2, and OSBPL10^25,53,80. Exploiting a WGS approach, we observe additional mutational hotspots indicative of aSHM also in other genes including MIR142, FHIT, ETV6, BTG1, GRHPR, and CD79B. Our data suggest that katagis loci are reasonable indications of aSHM. We observed significantly higher RNA expression of genes with putative aSHM loci compared to those without. In addition, these putative aSHM loci were significantly enriched in genes involved in BCR signaling. Together this implicates that BCR signaling genes are both upregulated and targeted by putative aSHM, raising the question of cause and effect—is aSHM upregulating these genes, or is the high expression levels of these genes priming them for aSHM? This becomes even more complex when considering that highly expressed genes should have lower mutational rates due to transcriptional coupled repair¹⁵⁰.

The landscape of CNAs and SVs revealed potentially clinically exploitable deletion of TOX as a predictor for anti-PD1 response¹⁰³, amplification of MALT1, whose inhibition has been shown to be selectively toxic for ABC-DLBCL¹⁵¹, and potential enhancer-hijacking events involving PIK3C3 and EPHA4, whose inhibition has shown therapeutic advantage in a number of cancer models^{113,114,115,120}.

While the genetic landscape of PCNSL was described in some detail before^{16,17,18,20,24,25,45,53,55}, studies investigating the global gene expression profile of PCNSL have been scarce so far. Therefore, we performed RNA sequencing of 37 CNSL samples and 2 normal brain controls. Global gene expression profiles demonstrates that PCNSL are indeed distinct and can be distinguished from systemic ABC-DLBCL. This was perfectly mirrored based on the expression repertoire of IG constant genes, implicating the role of B-cell maturation in classification of PCNSL and other lymphomas, as employed in leukemia and multiple myeloma^152,153.

PCNSL are highly proliferative¹⁵⁴. TERT activation confers unlimited proliferation, and activating TERT promoter mutations are frequent in different types of human cancers¹⁵⁵. Mutations at two hotspots positions (−124G>A and −146G>A) are causal for enhanced TERT promoter activity. Bruno et al. have previously reported these TERT promoter mutations to be present in PCNSL located in the splenium¹³⁶. Therefore, we investigated 69 CNSL (including 49 PCNSL), but could not identify any TERT promoter mutations suggesting that this mechanisms of TERT activation is likely not relevant in PCNSL. Yet, we observed significantly more TERT expression in PCNSL compared to non-CNS ABC-DLBCL and this was consistent when stratifying the cohort based on our RNAseq subgrouping. However, we were not able to identify increased telomere content in PCNSL (or MCD) compared to the other groups, suggesting a role of telomere maintenance to overcome telomere shortening, which is imposed by the high levels of proliferation. This concept was supported by a within-group correlation of TERT expression and normalized telomere content (Pearson’s R = 0.67, p = 0.0023), implicating a role of TERT in overcoming telomere degradation in PCNSL. Supporting the high proliferation in these tumors, PCNSL showed significantly elevated presence of the mutational signatures SBS1, which correlates with DNA replication, as well as of ID1 and ID2 which are associated with slippage during DNA replication.

With our study, we have substantiated the genomic and transcriptomic alterations characterizing PCNSL. We show that PCNSL can be clearly distinguished from systemic DLBCL, having distinct expression profiles, IG expression, and translocation patterns, as well as specific combinations of genetic alterations that are characterized by genomic instability, BCR activation, and most importantly, oncogenic TLR and NFκB signaling, which should be in the focus of future drug development.

Methods

CNS lymphoma (CNSL) study cohort

All procedures performed in this study were in accordance with the ethical standards of the respective institutional research committees and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. The ethics committee of the Medical Faculty Heidelberg and the Charité ethics committee (Charitéplatz 1, 10117 Berlin, Germany, approval number: EA1/245/13) approved the study. Informed consent was obtained from all participants in the study. Fresh frozen and paraffin-embedded PCNSL and SCNSL tumor tissue and matching blood samples (germline control) were acquired from the Department of Neuropathology, Charité, Berlin (Germany), and the Department of Neurosurgery, Heidelberg (Germany) from chemotherapy-naïve patients. Age at diagnosis, tumor localization, peripheral manifestation (bone marrow biopsy result, CT/MRI scan), first line therapy, as well as overall survival (OS) in months were evaluated. Additional control samples from age-matched, postmortem, and non-neoplastic brain (n = 2) were analyzed. The diagnosis was confirmed by at least two experienced (neuro)pathologists. The morphologic characteristics were assessed by using the fresh-frozen (FF) as well as the formalin-fixed and paraffin-embedded (FFPE) tissue sections of the respective tumor specimen. The tumor cell content in the cryopreserved sample material was estimated to be at least 60% based on histomorphological evaluation. Immunophenotypic characterization was performed on FFPE tissue sections (Supplementary Fig. 1b) of each tumor biopsy using an immunohistochemical panel including antibodies directed against CD20, CD10, BCL6, CD3, Ki67, MUM1/IRF4, and EBV (LMP1). To further exclude an EBV association, all cases with unclear EBV immunohistochemistry (n = 9) were investigated by an EBV-specific PCR as previously described¹⁵⁶ (Supplementary Fig. 1c, see “Methods”). For classifying GCB or non-GCB types, all samples were stratified according to the Hans classification¹² (CD10, BCL6, MUM1). We enrolled CNSL from a total of 51 patients for whole-genome (WGS, n = 38) and RNA sequencing (RNAseq, n = 37), including n = 24 samples subjected to both workflows. The study cohort and sample size as well as the experimental design, analysis workflow, diagnosis, and quality metrics of WGS and RNAseq are displayed in Fig. 1 and Supplementary Data 1. We included DLBCL confined to the CNS as PCNSL according to the recent WHO classification of tumors of hematopoietic and lymphoid organs and one of the tumors of the central nervous system. DLBCL, which presented initially with systemic, non-CNS, or synchronous systemic and CNS involvement were included as SCNSL^1,2,11,61,62. In our SCNSL cohort, three patients presented with initial lymph node manifestation, one patient with testicular involvement, and three patients with involvement of parotid gland, liver, or urinary tract, respectively.

ICGC MMML-Seq Consortium samples

For comparison, we used and reanalyzed an early release of meanwhile published whole-genome and RNA sequencing data obtained by the ICGC MMML-Seq Consortium from systemic diffuse large B-cell lymphoma (DLBCL, total: n = 36, WGS: n = 29, RNAseq: n = 36, both workflows: n = 29), follicular lymphoma (FL, total: n = 39, WGS: n = 39, RNAseq: n = 38, both workflows: n = 38), and one “double hit” (DH)-lymphoma with a molecular BL signature³⁴. In addition, we included WGS and RNAseq data from a single EBV-PCNSL case as well as RNAseq data from two nodal marginal zone lymphomas (nMZL) as well as naïve (n = 5) and GC B-cells (n = 5) as normal controls¹⁵⁷. These data were obtained by the ICGC MMML-Seq consortium in accordance to protocols previously published^59,60.

DNA and RNA isolation

DNA and RNA were obtained from fresh frozen CNSL tumor samples. RNA and genomic DNA were isolated from 15 to 30 10 μm cryosection slices (depending on the tissue size). DNA from tumor samples and their matched blood controls was isolated according to standard procedures. Total RNA from tumor samples was extracted using the RNeasy^® Plus Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The RNA integrity number (RIN) was determined using an Agilent 4200 TapeStation system (Agilent Technologies, Santa Clara, CA).

Whole-genome sequencing and data processing

Whole-genome sequencing

The DNA libraries of the tumor and matched control samples were prepared according to the Illumina TruSeq Nano DNA Library protocol using the TruSeq Nano DNA library Preparation Kit (Illumina, Hayward, CA; estimated insert size of 350 bp). Paired-end sequencing was performed on Illumina HiSeq X (2 × 150 bp) instruments using the TruSeq SBS kit, Version 3.

Alignment of sequencing reads

Sequencing reads were aligned using the DKFZ alignment workflow from ICGC Pan-Cancer Analysis of Whole Genome projects (DKFZ AlignmentAndQCWorkflows v1.2.73, https://github.com/DKFZ-ODCF/AlignmentAndQCWorkflows). Briefly, read pairs were mapped to the human reference genome (build 37, version hs37d5) using bwa mem (version 0.7.8) with minimum base quality threshold set to zero [-T 0] and remaining settings left at default values¹⁵⁸, followed by coordinate sorting with biobambam bamsort (version 0.0.148) with compression option set to fast (1) and marking duplicate read pairs with biobambam bammarkduplicates with compression option set to best (9)¹⁵⁹. To allow the required and meaningful comparability to previous whole genome sequencing studies in lymphomas^{34,95,160,161}, the human reference genome version GRCh37/hg19 was used.

Small mutation calling and annotation

Somatic small variants (SNVs and indels) in matched tumor normal pairs were called using the DKFZ in-house pipelines (SNVCallingWorkflow v1.2.166-1, https://github.com/DKFZ-ODCF/SNVCallingWorkflow; IndelCallingWorkflow v1.2.177, https://github.com/DKFZ-ODCF/IndelCallingWorkflow) as previously described¹⁶². Briefly, the SNVs were identified using samtools and bcftools version 0.1.1957¹⁶³ and then classified as somatic or germline by comparing the tumor sample to the control, and later assigned a confidence which is initially set to 10, and subsequently reduced based on overlaps with repeats, DAC blacklisted regions, DUKE excluded regions, self-chain regions, segmental duplication records as introduced by the ENCODE project¹⁶⁴ and additionally if the SNV exhibited PCR or sequencing strand bias. Only SNVs with confidence 8 or above were considered for further analysis. Tumor and matched blood samples were analyzed by Platypus¹⁶⁵ to identify indels. Indel calls were filtered based on Platypus internal confidence calls, and only indels with confidence 8 or greater were used for subsequent analysis. In order to remove recurrent artifacts and misclassified germline events, somatic indels that were identified as germline in at least two patients in the CNS lymphoma cohort were excluded.

The protein coding effect of somatic SNVs and indels from all samples were annotated using ANNOVAR¹⁶⁶ according to GENCODE gene annotation (version 19) and overlapped with variants from dbSNP10 (build 141) and the 1000 Genomes Project database. Mutations of interest were defined as somatic SNV and indels that were predicted to cause protein coding changes (non-synonymous SNVs, gain or loss of stop codons, splice site mutations, and both frameshift and non-frameshift indels), and also synonymous exonic mutations on non-coding genes.

Tumor in normal contamination detection

We applied the TiNDA (tumor in normal detection algorithm) workflow to account for potential tumor in normal contamination leading to false negative calls as previously described¹⁶². The TiNDA algorithm is implemented in the DKFZ indel calling workflow (v1.2.177, https://github.com/DKFZ-ODCF/IndelCallingWorkflow). Briefly, the B-allele frequency (BAF) was calculated from the tumor and control samples. Positions overlapping with common variants were filtered out. Then, the clustering algorithm from Canopy¹⁶⁷ was applied to the BAF values for the positions in tumor vs control using a single pass run, assuming 9 clusters. The clusters that were determined to be tumor-in-normal had to have 75% of positions above the identity line (where the VAF in the tumor sample is the same as the VAF in the control sample). These identified mutations were then reclassified as somatic instead of the original germline annotation. All but 4 CNSL WGS samples exhibited evidence for tumor in normal. On average 31 SNVs (range 0–136) were “rescued” in PCNSL, 6 in PCNSL-M (6-6, single sample), 27 (19–34) in SCNSL, 22 (9–43) in SCNSL-M, and 0 in the EBV-positive sub-cohorts (0-0, 2 samples). In total, only 6 SNVs with protein coding effects were rescued, including the MYD88 p.L265P mutation in sample LS-0102, which had 3 of 47 read support in the control, and 86 of 170 reads supporting the variant in the tumor sample (Supplementary Data 13). In our series, we only found a very low level of tumor in normal. The rescued mutations followed a genomic distribution similar to the overall mutational landscape of PCNSL. We observed that rescued mutations were 1% exonic (compared to 1% in the mutational landscape), 32% intronic (c.f. 33%), 53% intergenic (c.f. 55%), and 13% on ncRNA (c.f. 12%).

Genomic structural rearrangements

Genomic structural rearrangements (SVs) were detected using SOPHIA v.34.0¹⁶⁸ as implemented in the DKFZ structural variation calling workflow (SophiaWorkflow:1.2.16, https://github.com/DKFZ-ODCF/SophiaWorkflow). Briefly, SOPHIA uses supplementary alignments as produced by bwa-mem as indicators of a possible underlying SV. SV candidates are filtered by comparing them to a background control set of sequencing data obtained using normal blood samples from a background population database of 3261 WGS samples from patients from published ICGC-PedBrain, ICGC-MMMLseq and ICGC-Prostate studies and DKFZ-HIPO studies, sequenced using Illumina HiSeq 2000, 2500 (100 bp) and HiSeq X (151 bp) platforms and aligned uniformly using the same workflow as in this study. All studies have been approved by appropriate ethics committees. Gencode V19 was used for the gene annotations. We used the script draw_fusions.R from the Arriba package¹⁶⁹ to visualize SVs generated by SOPHIA.

Copy number alterations and allelic imbalances

Allele-specific copy-number aberrations were detected using ACEseq (allele-specific copy-number estimation from WGS)¹⁷⁰ as implemented in the DKFZ CNV calling workflow (ACEseqWorkflow:1.2.8-4, https://github.com/DKFZ-ODCF/ACEseqWorkflow). ACEseq determines absolute allele-specific copy numbers as well as tumor ploidy and tumor cell content (TCC) based on coverage ratios of tumor and control as well as the B-allele frequency (BAF) of heterozygous SNPs. SVs called by SOPHIA were incorporated to improve genome segmentation.

Final copy number segments were further smoothed to calculate the total number of gains and losses. Neighboring segments were merged if they rounded to the same copy number and deviated by less than 0.5 copies in case of segments <20 kb or deviated by less than 0.3 copies otherwise. Remaining segments <500 kb were merged with their closer neighbor based on allele-specific and total copy number and once again segments smaller than 2 Mb deviating by less than 0.4 copies were merged. Based on the resulting segments the number of gains and losses was estimated.

Furthermore, the fraction of aberrant genome was calculated as the fraction of the genome that is classified either as duplication or deletion (>0.7 deviation from the ploidy) or was identified as a loss of heterozygosity.

Classification of mutational hotspots (kataegis events)

Mutational hotspots indicating putative kataegis events (likely due to SHM or aberrant SHM (aSHM)) were defined as regions with at least 6 somatic SNVs within an average intermutational distance of 1000 bp or less, as previously used by Alexandrov and colleagues⁹⁰. A gene was described to be targeted by kataegis if its definition (from Gencode version 19 gene models) overlapped with at least 1 kataegis region in at least 1 sample. While many of these kataegis loci are indeed SHM/aSHM targets, located 2.5 kb from the transcription start site (TSS), we cannot completely control for all PCNSL-specific TSSs due to the normal brain background tissue.

Mutational signatures

Supervised mutational signature analysis was performed using YAPSA development version 3.13¹⁷¹ using R 4.0.0. Briefly, the linear combination decomposition (LCD) of the mutational catalog with known and predefined PCAWG COSMIC signatures¹²⁶ was computed by non-negative least squares (NNLS). The mutational signature analysis was applied to the mutational catalogs for SNVs (or single base substitutions, SBS) and indels of all tumor samples. Signature-specific cutoffs were applied and cohort level analysis was used for detecting signatures as recommended by Huebschmann et al.³⁴. The cutoff used corresponds to “cost factors” of 10 for SNVs and 3 for indels in the modified ROC analysis.

Integration of different variant types

SNVs, indels, SVs, and CNAs were integrated in order to account for all variant types in the recurrence analysis. All genes with SNVs or indels in coding regions (nonsynonymous, stop gain, stop loss, splicing, frameshift, and non-frameshift events) and ncRNA (exonic) were included. Any SV with breakpoints directly lying on a gene (SV direct) were considered for oncoprints, however, SVs were also annotated to a gene when they were either within 100 kb of a gene (SV near), or to the closest gene (SV close) for SV recurrence analysis to account for regulatory mutations such as enhancer hijacking events. Genes were annotated with CNAs if they were completely or partially affected. Chromosome level CNVs events were determined once >30% of a chromosome arm was altered. Only focal CNA events were taken into account for variant integration, as these are more likely to target specific genes within the affected region than large events such as whole chromosome arm events. To capture the precise target of CNVs, we employed results from GISTIC. Finally, genes affected by SNVs, indels, directly hit by SVs, or genes with focal CNAs were considered for the recurrence analysis and added to the oncoprints. The mutations were integrated and plotted as oncoprint plots using using R v3.4.0 (library yapsa v3.13), perl v5.26.2 (libraries perl-getopt-long v2.50) and bedtools v2.16.2. SV cohort plots were generated using perl v5.20.0, bedtools v2.24.0, R v3.3.1 (libraries circlize v0.4.5 and dplyr v0.7.8), using the gencode v19 gene models for annotation.

Mutual exclusivity and inclusivity analysis

Mutual exclusivity analysis was performed to investigate the relationship between MYD88 mutations with other implicated drivers from the IntOGen analysis including SNVs, indels, SVs, CNAs. The minimal recurrence threshold was set to 5. We applied the commonly used Fisher’s exact test and the CoMET test¹⁷² (v0.1.5) for both co-occurrence and mutual exclusivity using R (v3.4.0). Fisher’s right tailed test was used to support co-occurrence when the number of samples with alterations in both genes is significantly higher than expected by chance. Additionally, Fisher’s left tailed test was used to suggest mutual exclusivity when the number of samples with alterations in both genes is significantly lower than expected. Resultant p-values were corrected for multiple testing by FDR.

Mutational significance analysis

The IntOGen pipeline¹⁷³ algorithm was applied to identify significant cancer drivers in the core set of PCNSL samples (n = 30) based on the hg19 genome assembly. IntOGen v 3.0.8 was installed via conda from the bbglab anaconda channel. The relevant conda environment setup included explicit definitions of python v3.5.0 (with libraries scipy v0.16.0, pycurl v7.43.0.3, numpy v1.10.0, pandas 0.17.0). In addition, a local installation of perl v5.20.3.1 (with libraries perl-digest-perl-md5 v1.9, perl-threaded v5.26.0) was used, with installation of perl libraries Digest-MD5 v2.52 via cpan and perl-DBI v1.626 via yum package managers. The background intogen database (bgdata) was automatically downloaded using the command ‘intogen -setup‘ which downloaded the 20150729 background databases. The IntOGen run specific parameters included running on 4 cores, Matlab Compiler Runtime v8.1 (2013a) and MutSigCV v1.4. Significance thresholds of 10% FDR were used for oncodrivefm, oncodriveclust and mutsig. Sample thresholds of 2 and 5 were used for oncodrivefm and oncodriveclust respectively. IntOGen reported 50 genes to be significant drivers.

Significant CNV were identified using GISTIC v2.0.23 using MCR v83, using the following parameters: “-broad 1 –genegistic 1 -savegene=1 -brlen=0.8 -conf=0.2 -maxseg=2500”.

Telomere content estimation

The telomere content was determined from WGS data using the software tool TelomereHunter⁵⁹ (v1.1.0) which uses python v3.5.6 (using libraries pyyaml v3.13, pysam v0.9.1, pynacl v1.2.1), samtools v1.3.1, bcftools v1.3.1 and htslib v1.3.2. Telomere hunter was run using default settings (filtering of telomere reads: at least 6 telomere repeats per 100 bp read length)¹³³. Briefly, unmapped reads or reads with a very low alignment confidence (mapping quality lower than 8) containing six non-consecutive instances of the four most common telomeric repeat types (TTAGGG, TCAGGG, TGAGGG, and TTGGGG) were extracted. The telomere content was determined by normalizing the telomere read count to all reads in the sample with a GC-content of 48–52%. In the case of tumor samples, the telomere content was further corrected for the tumor cell content (TCC, as estimated by ACEseq) using the following formula as previously described⁵⁹, which corrects for inter-patient differences in telomere content assuming that the non-malignant cells in the tumor sample have a similar telomere content as in the control sample, as shown in Eq. 1:

$${{T}}_{{{{{{\rm{TCCcorrected}}}}}}}=({{{{{\rm{T}}}}}}-{{{{{\rm{C}}}}}}(1-{{{{{\rm{TCC}}}}}}))/{{{{{\rm{TCC}}}}}}$$

(1)

Here, T and C are the telomere contents of the tumor and control sample, and T_TCCcorrected is the TCC-corrected telomere content of the tumor sample.

LymphGen classification

All WGS samples were classified according to the LymphGen v2.0 algorithm described by Wright et al.³¹ which categorizes DLBCL samples into the different genetic subtypes MCD, N1, A53, BN2 ST2, EZB (MYC+ and MYC−), based on genetic aberrations in subtype predictor genes. The algorithm requires information on mutations, copy-number alterations, and fusions. The results of the DKFZ SNV and indels calling workflows were used to define the small mutations. BCL2 and BCL6 translocations were determined by Sophia calls, and copy number changes were derived from the DKFZ CNV workflow (ACEseq) results. The outputs from all workflows were filtered for somatic regions with all different variations occurring in exons and 5′UTR region of the gene. The files were created using Python and Perl scripts based on the description provided on the LymphGen website [https://llmpp.nih.gov/lymphgen/LymphGenInstructions.pdf?v=1600863825]. The individual sample inputs are further merged together to form the input dataset for the LymphGen algorithm and uploaded to the website [https://llmpp.nih.gov/lymphgen/lymphgendataportal.php] for classification the samples.

According to Wright et al.³¹, the results are displayed in Supplementary Data 1. Samples, where only RNA was available, were listed as “NA” in Supplementary Data 1.

RNA sequencing and data processing

RNA library preparation and sequencing

RNA libraries of the tumor samples and normal brain samples were prepared using the TruSeq RNA library preparation Kit Set A and B, following the manufacturer’s instructions at an insert size of ~300 bp. Two barcoded libraries were pooled per lane and sequenced on Illumina HiSeq2000 or HiSeq4000 platforms.

RNAseq alignment and expression quantification

RNAseq reads were aligned and gene expression quantified using the DKFZ RNAseq workflow (v1.2.22-6, https://github.com/DKFZ-ODCF/RNAseqWorkflow) as previously described¹⁷⁴. Briefly, the RNAseq read pairs were aligned to the STAR index generated reference genome (build 37, version hs37d5) using STAR in 2 pass mode (version 2.5.2b)^174,175. Duplicate reads were marked using sambamba (version 0.6.5) and BAM files were coordinate sorted using SAMtools (version 0.1.19). featureCounts (version 1.5.1)¹⁷⁶ was used to perform non-strand-specific read counting for genes over exon features based on the Gencode V19 gene model (without excluding read duplicates). When both read pairs aligned uniquely (indicated by a STAR alignment quality score of 255) they were used towards gene reads counts. For total library abundance calculations, during TPM and FPKM expression values estimation, genes on chromosomes X, Y, MT, and rRNA and tRNA were omitted as they can introduce library size estimation biases.

Hierarchical consensus clustering was applied using the cola package (version 1.5.6) with “MAD” as top-value method and “kmeans” as partitioning method. Classification on CNS samples was applied using cola with “ATC” as top-value method and “skmeans” as partitioning method. All other parameters took default values¹²⁹.

RNA dilution experiment

To further investigate the impact of brain tissue contamination in unsupervised clustering analysis of gene expression data on PCNSL, we performed a serial dilution experiment with total RNA from a PCNSL sample considered “pure” (LS-027, estimated tumor cell content >80%) and a normal brain tissue control (CTRL). Total RNA from LS-027 was mixed with CTRL RNA with increasing concentrations (0, 20, 40, 60, and 80%) and sequenced. The z-score transformed TPM expression levels for PCNSL subcluster 1 and subcluster 2 signature genes for the serially diluted H050-0027 sample was compared against the cohort and individually using clustering analysis.

Differential expression (DE) analysis to identify signature genes

DE of genes was analyzed using DESeq2 (version 1.14.1) with default settings using raw read counts from featureCounts. Genes without any count in all samples were excluded from the analysis.

Validation of whole-genome and RNA sequencing results

Sanger sequencing

Bidirectional Sanger sequencing (bSS) was performed (i) to validate WGS results in the PCNSL/SCNSL study cohort if sufficient DNA quantity was available (total: n = 35; PCNSL: n = 26; PCNSL-M: n = 1; SCNSL: n = 2; SCNSL: n = 3; EBV+ : n = 2), and (ii) to identify mutations of recurrently mutated candidates in a larger set of additional PCNSL/SCNSL FFPE samples (FFPE extension cohort). The following genes were analyzed: MYD88, KMT2D (MLL2), HLA-B, SETD1B, HIST1H1E, CD79B, BTG1, MYC, TP53, TERT, GRHPR, TBL1XR1, DST, PRDM15, OBSCN, FAT4, GRP98, and OSBPL10. Briefly, the PCR conditions were: 94 °C for 4 min (1 cycle), followed by 3 cycles of 94 °C for 30 s, 61 °C for 45 s, 72 °C for 60 s, 3 cycles of 94 °C for 30 s, 59 °C for 45 s, 72 °C for 60 s, 3 cycles of 94 °C for 30 s, 57 °C for 45 s, 72 °C for 60 s, 31 cycles of 94 °C for 30 s, 55 °C for 45 s, 72 °C for 60 s, and finally extension at 72 °C for 10 min with AmpliTaq™ 360 DNA Polymerase (Applied Biosystems, Waltham, USA). The PCR primers for the genomic regions of interest are displayed in Supplementary Data 14. Sequencing was performed at Eurofins Genomics, Ebersberg, Germany. 180/189 (95%) of the selected variants (allele frequency above 10%) identified by WGS were confirmed. The results are displayed in Supplementary Data 4 and 5.

Formaldehyde-fixed paraffin-embedded (FFPE) CNSL extension cohort

Candidate genes were validated in 31 additional FFPE specimens of PCNSL (n = 19), SCNSL (n = 9), and n = 3 EBV positive cases. The DNA was extracted using the QIAamp DNA FFPE Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The following, recurrently mutated genes (exons) were investigated: PIM1 (exons 1–4), MYD88 (exon 5), GPHPR (exons 2, 4, and 5), TBL1XR1 (exons 7, 8, 10, 12, and 14), KMT2D (exons 32, 34, 38, 48, and 50), KLHL14 (exon 2), HLA-B (exons 2, 3), PRDM15 (exons 9–12), GPR98 (exons 65, 70, and 81), DST (exons 13, 14, 23, 24, and 36), OBSCN (exons 32, 63, 64, and 85), FAT4 (exons 1, 9, and 17), HIST1H3D (exon 2), HIST1H1E (exon 1), TERT (promoter region). The results are displayed in Supplementary Data 5.

Fluorescence in situ hybridization (FISH)

FISH analysis was performed as previously described¹⁷⁷. Briefly, 4 μm FFPE sections were deparaffinized, dehydrated, and incubated in pre-treatment solution (Dako, Denmark) for 10 min at 95–99 °C. Samples were treated with pepsin solution for 3–6 min at 37 °C, washed, dehydrated, air dried, and incubated with the respective DNA probe: CDKN2A (9p21.3): Orange, Biocare Medical, USA; Vysis CEP 9 SpectrumGreen Probe (Abbott), The Netherlands). The sections were sealed, denatured in humidified atmosphere at 82 °C for 5 min, and then incubated overnight at 45 °C to achieve hybridization. After post-hybridization washing, slides were counterstained with 4′6-diamidino-2-phenylindole (DAPI) and analyzed using an automated scanning system (Duet, BioView Ltd. Rehovot, Israel; Supplementary Fig. 1d).

Real-time quantitative PCR (RT-qPCR)

We performed SYBR Green quantitative real-time PCR (qPCR) measuring six amplicons covering the CDKN2A/B gene (Supplementary Fig. 1e) as well as five amplicons covering the FHIT, NOTCH4, SPIB, and MIR650 gene. The primer sequences are annotated in Supplementary Data 14. qPCR analysis was performed on ABI Prism 7900HT Sequence Detection System (Applied Biosystems, Foster City, CA, USA).

Immunohistochemical (IHC) procedures

Immunohistochemical stainings were performed on a Benchmark XT autostainer (Ventana Medical Systems, Tuscon, AZ, USA) with standard antigen retrieval methods (CC1 buffer, pH8.0, Ventana Medical Systems, Tuscon, AZ, USA) using 7 μm-thick frozen or 4 μm-thick FFPE tissue sections. The following primary antibodies were used: monoclonal mouse anti-BCL6 (DAKO, M7211, 1:10), monoclonal mouse anti-CD10 (Novocastra, NCL-CD10-270, 1:10), monoclonal mouse anti-CD20 (DAKO, M0755, 1:400), polyclonal rabbit anti-CD3 (DAKO, A0452, 1:100), monoclonal mouse anti-CD45 (DAKO, M0701, 1:400), monoclonal mouse anti-CD79a (DAKO, M7051, 1:100), monoclonal mouse anti-EBV-LMP1 (DAKO, M0897, 1:1000), monoclonal mouse anti-Ki-67 (DAKO, M7240, 1:100), monoclonal mouse anti-MUM1 (DAKO, M7259, 1:50), monoclonal mouse anti-PD-L1 (Cell Signaling, 13684, 1:200). The iVIEW DAB Detection Kit (Ventana Medical Systems, Tuscon, AZ, USA) was used according to the manufacturer’s instructions. Sections were counterstained with hematoxylin, dehydrated in graded alcohol and xylene, mounted, and coverslipped. IHC stained sections were evaluated by two skilled neuropathologists with concurrence. The DLBCL subtypes of GCB and non-GCB were categorized using CD10, BCL6, and MUM1 according to the Hans classification¹².

Epstein-Barr virus PCR

EBV-specific PCR was performed as previously described¹⁵⁶. Briefly, a highly conserved region of the EBNA-1 (BKRF1) gene specific for EBV was amplified by endpoint PCR using the following primers: 5′-GAG GGT GGT TTG GAA AGC-3′ and 5′-AAC AGA CAA TGG ACT CCC TTA G-3′, 0.1 µM each. The PCR conditions were: 95 °C for 5 min (1 cycle), followed by 40 cycles of 94 °C for 1 min, 55 °C for 2 min, 72 °C for 3 min, and finally extension at 72 °C for 7 min with ThermoPrime™ Taq DNA Polymerase (Thermo Fisher Scientific, Waltham, USA). Subsequently, amplification products were analyzed by ELISA (Roche, Basel, Switzerland).

Statistics and reproducibility

No statistical methods were used to predetermine sample sizes. We included all individuals with DLBCL of the CNS where sufficient material was available as specified in the description of study design. No data were excluded from the analyses. Statistical details for each analysis are mentioned in each figure legend or in the respective part of the text. WGS, RNA-sequencing, Sanger sequencing, quantitative real-time PCR, immunohistochemical stainings, and FISH were performed in a blinded fashion. Evaluation of histological and immunohistochemical stainings, as well as FISH images, was performed separately by at least two independent (neuro-)pathologists in Berlin and Heidelberg. Histological staining, immunohistochemistry, and FISH analyses were replicated at least once. CDKN2A/B FISH was performed exemplarily for n = 4 CNSL patients. The representative images shown were adjusted in brightness and contrast to different degrees (depending on the need resulting from the range of brightness and contrast of the raw images) in Adobe Photoshop, and for these cases, raw image files are publicly available [https://doi.org/10.5281/zenodo.6054242]⁹². The experiments were not randomized.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The whole genome sequencing (WGS) and RNA sequencing data of the 51 CNSL samples generated in this study as well as the raw data of Sanger sequencing and quantitative real-time PCR data have been deposited in the European Genome-Phenome Archive under the accession number EGAS00001005339. The data are available under controlled access due to the sensitive nature of genome sequencing data, and access can be obtained by contacting the appropriate Data Access Committee listed for each dataset in the study. Access will be granted to commercial and non-commercial parties according to patient consent forms and data transfer agreements. We have an institutional process in place to deal with requests for data transfer. A response to requests for data access can be expected within 14 days. After access has been granted, the data is available for two years. Access to the ICGC MMML-Seq raw sequencing data is available via the EGA under the accession number EGAS00001002199 and EGAS00001001692. Access to the ICGC MMML-Seq data is available via the data access committee of the ICGC (www.ICGC.org). Raw image files of histological stainings, immunohistochemistry, and FISH images generated in this study, as well as all somatic mutation calls, integrated mutations tables, and RNAseq counts on which the analysis was performed have been deposited publicly at Zenodo [https://doi.org/10.5281/zenodo.6054242]⁹². The uncropped PCR gel images as well as the processed real-time PCR data and Kaplan-Meier survival data shown in Supplementary Fig. 1 are provided in the Source Data file with this paper. The remaining data are available within the article, Supplementary Information or Source Data file.

References

Louis, D. N., Ohgaki, H., Wiestler, O. D. & Cavenee, W. K. WHO Classification of Tumours of the Central Nervous System 4 edn, 1 (International Agency for Research on Cancer Lyon, 2016).
Swerdlow, S. H. et al. WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues 2 (International Agency for Research on Cancer Lyon, 2008).
Maciocia, P. et al. Treatment of diffuse large B-cell lymphoma with secondary central nervous system involvement: encouraging efficacy using CNS-penetrating R-IDARAM chemotherapy. Br. J. Haematol. 172, 545–553 (2016).
Article CAS PubMed Google Scholar
Ferreri, A. J. M. Secondary CNS lymphoma: The poisoned needle in the haystack. Ann. Oncol. 28, 2335–2337 (2017).
Article CAS PubMed Google Scholar
Ferreri, A. J. Risk of CNS dissemination in extranodal lymphomas. Lancet Oncol. 15, e159–e169 (2014).
Article PubMed Google Scholar
Malikova, H. et al. Secondary central nervous system lymphoma: Spectrum of morphological MRI appearances. Neuropsychiatr. Dis. Treat. 14, 733–740 (2018).
Article PubMed PubMed Central Google Scholar
Bashir, R., McManus, B., Cunningham, C., Weisenburger, D. & Hochberg, F. Detection of Eber-1 RNA in primary brain lymphomas in immunocompetent and immunocompromised patients. J. Neurooncol. 20, 47–53 (1994).
Article CAS PubMed Google Scholar
Kleinschmidt-DeMasters, B. K., Damek, D. M., Lillehei, K. O., Dogan, A. & Giannini, C. Epstein Barr virus-associated primary CNS lymphomas in elderly patients on immunosuppressive medications. J. Neuropathol. Exp. Neurol. 67, 1103–1111 (2008).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M., Siebert, R. & Deckert, M. Primary lymphoma of the central nervous system: Just DLBCL or not? Blood 113, 7–10 (2009).
Article CAS PubMed Google Scholar
Grommes, C. & DeAngelis, L. M. Primary CNS lymphoma. J. Clin. Oncol.: Off. J. Am. Soc. Clin. Oncol. 35, 2410–2418 (2017).
Article CAS Google Scholar
Louis, D. N. et al. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: A summary. Acta Neuropathol. 131, 803–820 (2016).
Article PubMed Google Scholar
Hans, C. P. et al. Confirmation of the molecular classification of diffuse large B-cell lymphoma by immunohistochemistry using a tissue microarray. Blood 103, 275–282 (2004).
Article CAS PubMed Google Scholar
Klein, U. et al. Transcriptional analysis of the B cell germinal center reaction. Proc. Natl Acad. Sci. USA 100, 2639–2644 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Klein, U. et al. Transcription factor IRF4 controls plasma cell differentiation and class-switch recombination. Nat. Immunol. 7, 773–782 (2006).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M. et al. Gene expression profiling suggests primary central nervous system lymphomas to be derived from a late germinal center B cell. Leukemia 22, 400–405 (2008).
Article CAS PubMed Google Scholar
Deckert, M., Montesinos-Rongen, M., Brunn, A. & Siebert, R. Systems biology of primary CNS lymphoma: from genetic aberrations to modeling in mice. Acta Neuropathol. 127, 175–188 (2014).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M. et al. Activating L265P mutations of the MYD88 gene are common in primary central nervous system lymphoma. Acta Neuropathol. 122, 791–792 (2011).
Article PubMed Google Scholar
Montesinos-Rongen, M. et al. Mutations of CARD11 but not TNFAIP3 may activate the NF-kappaB pathway in primary CNS lymphoma. Acta Neuropathol. 120, 529–535 (2010).
Article CAS PubMed Google Scholar
Bodor, C. et al. Molecular subtypes and genomic profile of primary central nervous system lymphoma. J. Neuropathol. Exp. Neurol. 79, 176–183 (2020).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M., Schafer, E., Siebert, R. & Deckert, M. Genes regulating the B cell receptor pathway are recurrently mutated in primary central nervous system lymphoma. Acta Neuropathol. 124, 905–906 (2012).
Article PubMed Google Scholar
Jordanova, E. S. et al. Hemizygous deletions in the HLA region account for loss of heterozygosity in the majority of diffuse large B-cell lymphomas of the testis and the central nervous system. Genes Chromosomes Cancer 35, 38–48 (2002).
Article CAS PubMed Google Scholar
Riemersma, S. A. et al. Extensive genetic alterations of the HLA region, including homozygous deletions of HLA class II genes in B-cell lymphomas arising in immune-privileged sites. Blood 96, 3569–3577 (2000).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M. et al. Interphase cytogenetic analysis of lymphoma-associated chromosomal breakpoints in primary diffuse large B-cell lymphomas of the central nervous system. J. Neuropathol. Exp. Neurol. 61, 926–933 (2002).
Article CAS PubMed Google Scholar
Schwindt, H. et al. Chromosomal translocations fusing the BCL6 gene to different partner loci are recurrent in primary central nervous system lymphoma and may be associated with aberrant somatic hypermutation or defective class switch recombination. J. Neuropathol. Exp. Neurol. 65, 776–782 (2006).
Article CAS PubMed Google Scholar
Chapuy, B. et al. Targetable genetic features of primary testicular and primary central nervous system lymphomas. Blood 127, 869–881 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gonzalez-Aguilar, A. et al. Recurrent mutations of MYD88 and TBL1XR1 in primary central nervous system lymphomas. Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 18, 5203–5211 (2012).
Article CAS Google Scholar
Kreher, S. et al. Prognostic impact of B-cell lymphoma 6 in primary CNS lymphoma. Neuro-Oncol. 17, 1016–1021 (2015).
Article CAS PubMed PubMed Central Google Scholar
Riemersma, S. A. et al. High numbers of tumour-infiltrating activated cytotoxic T lymphocytes, and frequent loss of HLA class I and II expression, are features of aggressive B cell lymphomas of the brain and testis. J. Pathol. 206, 328–336 (2005).
Article CAS PubMed Google Scholar
Chapuy, B. et al. Molecular subtypes of diffuse large B cell lymphoma are associated with distinct pathogenic mechanisms and outcomes. Nat. Med. 24, 679–690 (2018).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Schmitz, R. et al. Genetics and pathogenesis of diffuse large B-cell lymphoma. N. Engl. J. Med. 378, 1396–1407 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wright, G. W. et al. A probabilistic classification tool for genetic subtypes of diffuse large B cell lymphoma with therapeutic implications. Cancer Cell 37, 551–568.e514 (2020).
Article CAS PubMed PubMed Central Google Scholar
Venturutti, L. & Melnick, A. The dangers of deja vu: Memory B-cells as the cell-of-origin of ABC-DLBCLs. Blood https://doi.org/10.1182/blood.2020005857 (2020).
Article PubMed PubMed Central Google Scholar
Venturutti, L. et al. TBL1XR1 mutations drive extranodal lymphoma by inducing a pro-tumorigenic memory fate. Cell 182, 297–316.e227 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hubschmann, D. et al. Mutational mechanisms shaping the coding and noncoding genome of germinal center derived B-cell lymphomas. Leukemia https://doi.org/10.1038/s41375-021-01251-z (2021).
Article PubMed PubMed Central Google Scholar
Lacy, S. E. et al. Targeted sequencing in DLBCL, molecular subtypes, and outcomes: A Haematological Malignancy Research Network report. Blood 135, 1759–1771 (2020).
Article CAS PubMed PubMed Central Google Scholar
Niparuck, P. et al. Treatment outcome and prognostic factors in PCNSL. Diagn. Pathol. 14, 56 (2019).
Article PubMed PubMed Central CAS Google Scholar
Coiffier, B. et al. CHOP chemotherapy plus rituximab compared with CHOP alone in elderly patients with diffuse large-B-cell lymphoma. N. Engl. J. Med. 346, 235–242 (2002).
Article CAS PubMed Google Scholar
Schmitt, A. M. et al. Rituximab in primary central nervous system lymphoma—A systematic review and meta-analysis. Hematol. Oncol. 37, 548–557 (2019).
Article CAS PubMed Google Scholar
Seidel, S. & Schlegel, U. Have treatment protocols for primary CNS lymphoma advanced in the past 10 years. Expert Rev. Anticancer Ther. 19, 909–915 (2019).
Article CAS PubMed Google Scholar
van der Meulen, M. et al. Primary therapy and survival in patients aged over 70-years-old with primary central nervous system lymphoma: A contemporary, nationwide, population-based study in the Netherlands. Haematologica 106, 597–600 (2021).
Article PubMed Google Scholar
Van Dijck, R., Doorduijn, J. K. & Bromberg, J. E. C. The role of rituximab in the treatment of primary central nervous system lymphoma. Cancers https://doi.org/10.3390/cancers13081920 (2021).
Bromberg, J. E. C. et al. Rituximab in patients with primary CNS lymphoma (HOVON 105/ALLG NHL 24): A randomised, open-label, phase 3 intergroup study. Lancet Oncol. 20, 216–228 (2019).
Article CAS PubMed Google Scholar
Deckert, M. et al. Modern concepts in the biology, diagnosis, differential diagnosis and treatment of primary central nervous system lymphoma. Leukemia 25, 1797–1807 (2011).
Article CAS PubMed Google Scholar
Braggio, E. et al. Genome-wide analysis uncovers novel recurrent alterations in primary central nervous system lymphomas. Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 21, 3986–3994 (2015).
Article CAS Google Scholar
Nayyar, N. et al. MYD88 L265P mutation and CDKN2A loss are early mutational events in primary central nervous system diffuse large B-cell lymphomas. Blood Adv. 3, 375–383 (2019).
Article CAS PubMed PubMed Central Google Scholar
Grommes, C., Nayak, L., Tun, H. W. & Batchelor, T. T. Introduction of novel agents in the treatment of primary CNS lymphoma. Neuro-Oncol. 21, 306–313, https://doi.org/10.1093/neuonc/noy193 (2019).
Article PubMed Google Scholar
Ghesquieres, H. et al. Lenalidomide in combination with intravenous rituximab (REVRI) in relapsed/refractory primary CNS lymphoma or primary intraocular lymphoma: a multicenter prospective ‘proof of concept’ phase II study of the French Oculo-Cerebral lymphoma (LOC) Network and the Lymphoma Study Association (LYSA)dagger. Ann. Oncol. 30, 621–628 (2019).
Article CAS PubMed Google Scholar
Soussain, C. et al. Ibrutinib monotherapy for relapse or refractory primary CNS lymphoma and primary vitreoretinal lymphoma: Final analysis of the phase II ‘proof-of-concept’ iLOC study by the Lymphoma study association (LYSA) and the French oculo-cerebral lymphoma (LOC) network. Eur. J. Cancer 117, 121–130 (2019).
Article CAS PubMed Google Scholar
Kotla, V. et al. Mechanism of action of lenalidomide in hematological malignancies. J. Hematol. Oncol. 2, 36 (2009).
Article PubMed PubMed Central CAS Google Scholar
Lionakis, M. S. et al. Inhibition of B cell receptor signaling by Ibrutinib in primary CNS lymphoma. Cancer Cell 31, 833–843 e835 (2017).
Article CAS PubMed PubMed Central Google Scholar
Grommes, C. et al. Ibrutinib unmasks critical role of Bruton tyrosine kinase in primary CNS lymphoma. Cancer Discov. 7, 1018–1029 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nayak, L. et al. PD-1 blockade with nivolumab in relapsed/refractory primary central nervous system and testicular lymphoma. Blood 129, 3071–3073 (2017).
Article CAS PubMed PubMed Central Google Scholar
Vater, I. et al. The mutational pattern of primary lymphoma of the central nervous system determined by whole-exome sequencing. Leukemia 29, 677–685 (2015).
Article CAS PubMed Google Scholar
Bruno, A. et al. Mutational analysis of primary central nervous system lymphoma. Oncotarget 5, 5065–5075 (2014).
Article PubMed PubMed Central Google Scholar
Zhou, Y. et al. Analysis of genomic alteration in primary central nervous system lymphoma and the expression of some related genes. Neoplasia 20, 1059–1069 (2018).
Article CAS PubMed PubMed Central Google Scholar
Takashima, Y. et al. Target amplicon exome-sequencing identifies promising diagnosis and prognostic markers involved in RTK-RAS and PI3K-AKT signaling as central oncopathways in primary central nervous system lymphoma. Oncotarget 9, 27471–27486 (2018).
Article PubMed PubMed Central Google Scholar
Kaulen, L. D. et al. Whole exome sequencing identifies novel SLIT2 mutations in primary CNS lymphoma (3962). Neurology 94, 3962 (2020).
Google Scholar
Gandhi, M. K. et al. EBV-tissue positive primary CNS lymphoma occurring after immunosuppression is a distinct immunobiological entity. Blood https://doi.org/10.1182/blood.2020008520 (2020).
Article PubMed PubMed Central Google Scholar
Lopez, C. et al. Genomic and transcriptomic changes complement each other in the pathogenesis of sporadic Burkitt lymphoma. Nat. Commun. 10, 1459 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Consortium, I. T. P.-C. Ao. W. G. Pan-cancer analysis of whole genomes. Nature 578, 82–93 (2020).
Article ADS CAS Google Scholar
Arber, D. A. et al. The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia. Blood 127, 2391–2405 (2016).
Article CAS PubMed Google Scholar
Leonard, J. P., Martin, P. & Roboz, G. J. Practical implications of the 2016 revision of the World Health Organization classification of lymphoid and myeloid neoplasms and acute leukemia. J. Clin. Oncol.: Off. J. Am. Soc. Clin. Oncol. 35, 2708–2715 (2017).
Article CAS Google Scholar
Zhang, J. et al. The CREBBP acetyltransferase is a haploinsufficient tumor suppressor in B-cell lymphoma. Cancer Discov. 7, 322–337 (2017).
Article PubMed PubMed Central CAS Google Scholar
Pasqualucci, L. et al. Inactivating mutations of acetyltransferase genes in B-cell lymphoma. Nature 471, 189–195 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Meyer, S. N. et al. Unique and shared epigenetic programs of the CREBBP and EP300 acetyltransferases in germinal center B cells reveal targetable dependencies in lymphoma. Immunity 51, 535–547.e539 (2019).
Article CAS PubMed PubMed Central Google Scholar
Schmidt, J. et al. CREBBP gene mutations are frequently detected in in situ follicular neoplasia. Blood 132, 2687–2690 (2018).
Article CAS PubMed PubMed Central Google Scholar
Loeffler, M. et al. Genomic and epigenomic co-evolution in follicular lymphomas. Leukemia 29, 456–463 (2015).
Article CAS PubMed Google Scholar
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Yang, X. et al. STAT3 activation is associated with interleukin-10 expression and survival in primary central nervous system lymphoma. World Neurosurg. 134, e1077–e1084 (2020).
Article PubMed Google Scholar
Tang, D. et al. Clinicopathologic significance of MYD88 L265P mutation and expression of TLR4 and P-STAT3 in primary central nervous system diffuse large B-cell lymphomas. Brain Tumor Pathol. 38, 50–58 (2021).
Article CAS PubMed Google Scholar
Bai, L. et al. A potent and selective small-molecule degrader of STAT3 achieves complete tumor regression in vivo. Cancer Cell 36, 498–511 e417 (2019).
Article CAS PubMed PubMed Central Google Scholar
Komohara, Y. et al. M2 macrophage/microglial cells induce activation of Stat3 in primary central nervous system lymphoma. J. Clin. Exp. Hematop. 51, 93–99 (2011).
Article PubMed Google Scholar
Ngo, V. N. et al. Oncogenically active MYD88 mutations in human lymphoma. Nature 470, 115–119 (2011).
Article ADS CAS PubMed Google Scholar
Dobashi, A. et al. TP53 and OSBPL10 alterations in diffuse large B-cell lymphoma: Prognostic markers identified via exome analysis of cases with extreme prognosis. Oncotarget 9, 19555–19568 (2018).
Article PubMed PubMed Central Google Scholar
Gandhi, M. K. et al. EBV-associated primary CNS lymphoma occurring after immunosuppression is a distinct immunobiological entity. Blood 137, 1468–1477 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, J. Q. et al. Synergistic cooperation and crosstalk between MYD88(L265P) and mutations that dysregulate CD79B and surface IgM. J. Exp. Med. 214, 2759–2776 (2017).
Article CAS PubMed PubMed Central Google Scholar
Visco, C. et al. Oncogenic mutations of MYD88 and CD79B in diffuse large B-cell lymphoma and implications for clinical practice. Cancers https://doi.org/10.3390/cancers12102913 (2020).
Reddy, A. et al. Genetic and functional drivers of diffuse large B. Cell Lymphoma Cell 171, 481–494.e415 (2017).
CAS PubMed Google Scholar
Nosrati, A. et al. MYC, BCL2, and BCL6 rearrangements in primary central nervous system lymphoma of large B cell type. Ann. Hematol. 98, 169–173 (2019).
Article CAS PubMed Google Scholar
Montesinos-Rongen, M., Van Roost, D., Schaller, C., Wiestler, O. D. & Deckert, M. Primary diffuse large B-cell lymphomas of the central nervous system are targeted by aberrant somatic hypermutation. Blood 103, 1869–1875 (2004).
Article CAS PubMed Google Scholar
Brunn, A. et al. Frequent triple-hit expression of MYC, BCL2, and BCL6 in primary lymphoma of the central nervous system and absence of a favorable MYC(low)BCL2 (low) subgroup may underlie the inferior prognosis as compared to systemic diffuse large B cell lymphomas. Acta Neuropathol. 126, 603–605 (2013).
Article PubMed Google Scholar
Amodio, N. et al. MALAT1: A druggable long non-coding RNA for targeted anti-cancer approaches. J. Hematol. Oncol. 11, 63 (2018).
Article PubMed PubMed Central CAS Google Scholar
Li, S., Li, J., Chen, C., Zhang, R. & Wang, K. Pan-cancer analysis of long non-coding RNA NEAT1 in various cancers. Genes Dis. 5, 27–35 (2018).
Article CAS PubMed Google Scholar
Fujimoto, A. et al. Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer. Nat. Genet. 48, 500–509 (2016).
Article CAS PubMed Google Scholar
Deng, L. et al. Aberrant NEAT1_1 expression may be a predictive marker of poor prognosis in diffuse large B cell lymphoma. Cancer Biomark. 23, 157–164 (2018).
Article CAS PubMed Google Scholar
Wang, Q. M., Lian, G. Y., Song, Y., Huang, Y. F. & Gong, Y. LncRNA MALAT1 promotes tumorigenesis and immune escape of diffuse large B cell lymphoma by sponging miR-195. Life Sci. 231, 116335 (2019).
Article CAS PubMed Google Scholar
Zheng, Z. H., You, H. Y., Feng, Y. J. & Zhang, Z. T. LncRNA KCNQ1OT1 is a key factor in the reversal effect of curcumin on cisplatin resistance in the colorectal cancer cells. Mol. Cell Biochem. https://doi.org/10.1007/s11010-020-03856-x (2020).
Xu, B. et al. LncRNA SNHG3, a potential oncogene in human cancers. Cancer Cell Int. 20, 536 (2020).
Article CAS PubMed PubMed Central Google Scholar
Tian, Y. et al. lncRNA SNHG14 promotes oncogenesis and immune evasion in diffuse large-B-cell lymphoma by sequestering miR-152-3p. Leuk Lymphoma https://doi.org/10.1080/10428194.2021.1876866 (2021).
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
Article CAS PubMed PubMed Central Google Scholar
Casellas, R. et al. Mutations, kataegis and translocations in B cells: Understanding AID promiscuous activity. Nat. Rev. Immunol. 16, 164–176 (2016).
Article CAS PubMed PubMed Central Google Scholar
Radke, J. et al. The genomic and transcriptional landscape of primary central nervous system lymphoma (1.1.0) [Data set]. Zenodo https://doi.org/10.5281/zenodo.6054242 (2021).
Khodabakhshi, A. H. et al. Recurrent targets of aberrant somatic hypermutation in lymphoma. Oncotarget 3, 1308–1319 (2012).
Article PubMed PubMed Central Google Scholar
Papavasiliou, F. N. & Schatz, D. G. Somatic hypermutation of immunoglobulin genes: Merging mechanisms for genetic diversity. Cell 109, S35–S44 (2002).
Article CAS PubMed Google Scholar
Morin, R. D. et al. Mutational and structural analysis of diffuse large B-cell lymphoma using whole-genome sequencing. Blood 122, 1256–1265 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rimsza, L. M. et al. Loss of MHC class II gene and protein expression in diffuse large B-cell lymphoma is related to decreased tumor immunosurveillance and poor patient survival regardless of other prognostic factors: A follow-up study from the Leukemia and Lymphoma Molecular Profiling Project. Blood 103, 4251–4258 (2004).
Article CAS PubMed Google Scholar
Baruah, P. et al. Impact of p16 status on pro- and anti-angiogenesis factors in head and neck cancers. Br. J. Cancer 113, 653–659 (2015).
Article CAS PubMed PubMed Central Google Scholar
Eymin, B., Leduc, C., Coll, J. L., Brambilla, E. & Gazzeri, S. p14ARF induces G2 arrest and apoptosis independently of p53 leading to regression of tumours established in nude mice. Oncogene 22, 1822–1835 (2003).
Article CAS PubMed Google Scholar
England, N. L. et al. Identification of human tumour suppressor genes by monochromosome transfer: rapid growth-arrest response mapped to 9p21 is mediated solely by the cyclin-D-dependent kinase inhibitor gene, CDKN2A (p16INK4A). Carcinogenesis 17, 1567–1575 (1996).
Article CAS PubMed Google Scholar
Challa-Malladi, M. et al. Combined genetic inactivation of beta2-Microglobulin and CD58 reveals frequent escape from immune recognition in diffuse large B cell lymphoma. Cancer Cell 20, 728–740 (2011).
Article CAS PubMed PubMed Central Google Scholar
Schwindt, H. et al. Chromosomal imbalances and partial uniparental disomies in primary central nervous system lymphoma. Leukemia 23, 1875–1884 (2009).
Article CAS PubMed Google Scholar
Yu, X. & Li, Z. TOX gene: a novel target for human cancer gene therapy. Am. J. Cancer Res. 5, 3516–3524 (2015).
CAS PubMed PubMed Central Google Scholar
Kim, K. et al. Single-cell transcriptome analysis reveals TOX as a promoting factor for T cell exhaustion and a predictor for anti-PD-1 responses in human cancer. Genome Med 12, 22 (2020). https://doi.org/10.1186/s13073-020-00722-9.
Article CAS PubMed PubMed Central Google Scholar
Astuti, D. et al. Germline mutations in DIS3L2 cause the Perlman syndrome of overgrowth and Wilms tumor susceptibility. Nat. Genet. 44, 277–284, https://doi.org/10.1038/ng.1071 (2012).
Article CAS PubMed Google Scholar
Xing, S. et al. DIS3L2 promotes progression of hepatocellular carcinoma via hnRNP U-mediated alternative splicing. Cancer Res. 79, 4923–4936 (2019).
Article CAS PubMed Google Scholar
McAllister-Lucas, L. M., Baens, M. & Lucas, P. C. MALT1 protease: A new therapeutic target in B lymphoma and beyond? Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 17, 6623–6631 (2011).
Article CAS Google Scholar
Kameoka, Y. et al. Contig array CGH at 3p14.2 points to the FRA3B/FHIT common fragile region as the target gene in diffuse large B-cell lymphoma. Oncogene 23, 9148–9154 (2004).
Article CAS PubMed Google Scholar
Roy, D., Sin, S. H., Damania, B. & Dittmer, D. P. Tumor suppressor genes FHIT and WWOX are deleted in primary effusion lymphoma (PEL) cell lines. Blood 118, e32–e39 (2011).
Article PubMed PubMed Central Google Scholar
Haller, F. et al. Enhancer hijacking activates oncogenic transcription factor NR4A3 in acinic cell carcinomas of the salivary glands. Nat. Commun. 10, 368 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. High-coverage whole-genome analysis of 1220 cancers reveals hundreds of genes deregulated by rearrangement-mediated cis-regulatory alterations. Nat. Commun. 11, 736 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhou, X., Takatoh, J. & Wang, F. The mammalian class 3 PI3K (PIK3C3) is required for early embryogenesis and cell proliferation. PLoS One 6, e16358 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Munson, M. J. & Ganley, I. G. MTOR, PIK3C3, and autophagy: Signaling the beginning from the end. Autophagy 11, 2375–2376 (2015).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. H. et al. Dual inhibition of PIK3C3 and FGFR as a new therapeutic approach to treat bladder cancer. Clin. Cancer Res.: Off. J. Am. Assoc. Cancer Res. 24, 1176–1189 (2018).
Article CAS Google Scholar
Kumar, B. et al. PIK3C3 inhibition promotes sensitivity to colon cancer therapy by inhibiting cancer stem cells. Cancers https://doi.org/10.3390/cancers13092168 (2021).
Liu, F. et al. PIK3C3 regulates the expansion of liver CSCs and PIK3C3 inhibition counteracts liver cancer stem cell activity induced by PI3K inhibitor. Cell Death Dis. 11, 427 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fukai, J. et al. EphA4 promotes cell proliferation and migration through a novel EphA4-FGFR1 signaling pathway in the human glioma U251 cell line. Mol. Cancer Ther. 7, 2768–2778 (2008).
Article CAS PubMed Google Scholar
Iiizumi, M. et al. EphA4 receptor, overexpressed in pancreatic ductal adenocarcinoma, promotes cancer cell growth. Cancer Sci. 97, 1211–1216 (2006).
Article CAS PubMed Google Scholar
Hachim, I. Y. et al. Transforming growth factor-beta regulation of ephrin type-A receptor 4 signaling in breast cancer cellular migration. Sci. Rep. 7, 14976 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Lin, C. Y. et al. High expression of EphA4 predicted lesser degree of tumor regression after neoadjuvant chemoradiotherapy in rectal cancer. J. Cancer 8, 1089–1096 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kina, S. et al. Targeting EphA4 abrogates intrinsic resistance to chemotherapy in well-differentiated cervical cancer cell line. Eur. J. Pharm. 840, 70–78 (2018).
Article CAS Google Scholar
Schmitz, R., Ceribelli, M., Pittaluga, S., Wright, G. & Staudt, L. M. Oncogenic mechanisms in Burkitt lymphoma. Cold Spring Harb. Perspect. Med. https://doi.org/10.1101/cshperspect.a014282 (2014).
Kuppers, R. & Dalla-Favera, R. Mechanisms of chromosomal translocations in B cell lymphomas. Oncogene 20, 5580–5594 (2001).
Article CAS PubMed Google Scholar
Seifert, M., Scholtysik, R. & Kuppers, R. Origin and pathogenesis of B cell lymphomas. Methods Mol. Biol. 1956, 1–33 (2019).
Article CAS PubMed Google Scholar
Krull, J. E. et al. Somatic copy number gains in MYC, BCL2, and BCL6 identifies a subset of aggressive alternative-DH/TH DLBCL patients. Blood Cancer J. 10, 117 (2020).
Article PubMed PubMed Central Google Scholar
Scott, D. W. et al. High-grade B-cell lymphoma with MYC and BCL2 and/or BCL6 rearrangements with diffuse large B-cell lymphoma morphology. Blood 131, 2060–2064 (2018).
Article CAS PubMed PubMed Central Google Scholar
Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Fong, P. C. et al. Inhibition of poly(ADP-ribose) polymerase in tumors from BRCA mutation carriers. N. Engl. J. Med. 361, 123–134 (2009).
Article CAS PubMed Google Scholar
Poti, A. et al. Correlation of homologous recombination deficiency induced mutational signatures with sensitivity to PARP inhibitors and cytotoxic agents. Genome Biol. 20, 240 (2019).
Article PubMed PubMed Central CAS Google Scholar
Gu, Z., Schlesner, M. & Hubschmann, D. cola: An R/Bioconductor package for consensus partitioning through a general framework. Nucleic Acids Res. 49, e15 (2021).
Article CAS PubMed Google Scholar
Seimiya, M. et al. Stage-specific expression of Clast6/E3/LAPTM5 during B cell differentiation: elevated expression in human B lymphomas. Int. J. Oncol. 22, 301–304 (2003).
CAS PubMed Google Scholar
Kim, Y., Shin, Y. & Kang, G. H. Prognostic significance of CD103+ immune cells in solid tumor: A systemic review and meta-analysis. Sci. Rep. 9, 3808 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Harada, K. et al. Telomerase activity in central nervous system malignant lymphoma. Cancer 86, 1050–1055 (1999).
Article CAS PubMed Google Scholar
Feuerbach, L. et al. TelomereHunter—in silico estimation of telomere content and composition from cancer genomes. BMC Bioinform. 20, 272 (2019).
Article Google Scholar
Panebianco, F., Nikitski, A. V., Nikiforova, M. N. & Nikiforov, Y. E. Spectrum of TERT promoter mutations and mechanisms of activation in thyroid cancer. Cancer Med 8, 5831–5839 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bell, R. J. et al. Understanding TERT promoter mutations: A common path to immortality. Mol. Cancer Res. 14, 315–323 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Bruno, A. et al. TERT promoter mutations in primary central nervous system lymphoma are associated with spatial distribution in the splenium. Acta Neuropathol. 130, 439–440 (2015).
Article PubMed Google Scholar
Stogbauer, L., Stummer, W., Senner, V. & Brokinkel, B. Telomerase activity, TERT expression, hTERT promoter alterations, and alternative lengthening of the telomeres (ALT) in meningiomas—a systematic review. Neurosurg. Rev. https://doi.org/10.1007/s10143-019-01087-3 (2019).
Ichimura, K. TERT promoter mutation as a diagnostic marker for diffuse gliomas. Neuro-Oncol. 21, 417–418 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lee, Y. et al. The frequency and prognostic effect of TERT promoter mutation in diffuse gliomas. Acta Neuropathol. Commun. 5, 62 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Liu, Z. et al. Association between TERT rs2853669 polymorphism and cancer risk: A meta-analysis of 9,157 cases and 11,073 controls. PLoS One 13, e0191560 (2018).
Article PubMed PubMed Central CAS Google Scholar
Grommes, C. & Younes, A. Ibrutinib in PCNSL: The curious cases of clinical responses and aspergillosis. Cancer Cell 31, 731–733 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2658 cancer whole genomes. Nature 578, 102–111, https://doi.org/10.1038/s41586-020-1965-x (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Biesecker, L. G., Shianna, K. V. & Mullikin, J. C. Exome sequencing: The expert view. Genome Biol. 12, 128 (2011).
Article PubMed PubMed Central Google Scholar
Fatica, A. & Bozzoni, I. Long non-coding RNAs: New players in cell differentiation and development. Nat. Rev. Genet. 15, 7–21 (2014).
Article CAS PubMed Google Scholar
Jardin, F. et al. Diffuse large B-cell lymphomas with CDKN2A deletion have a distinct gene expression signature and a poor prognosis under R-CHOP treatment: A GELA study. Blood 116, 1092–1104 (2010).
Article CAS PubMed Google Scholar
Fangazio, M. et al. Genetic mechanisms of HLA-I loss and immune escape in diffuse large B cell lymphoma. Proc. Natl Acad. Sci. USA https://doi.org/10.1073/pnas.2104504118 (2021).
Monti, S. et al. Integrative analysis reveals an outcome-associated and targetable pattern of p53 and cell cycle deregulation in diffuse large B cell lymphoma. Cancer Cell 22, 359–372 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tao, W. & Levine, A. J. P19(ARF) stabilizes p53 by blocking nucleo-cytoplasmic shuttling of Mdm2. Proc. Natl Acad. Sci. USA 96, 6937–6941 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Yamada, S., Ishida, Y., Matsuno, A. & Yamazaki, K. Primary diffuse large B-cell lymphomas of central nervous system exhibit remarkably high prevalence of oncogenic MYD88 and CD79B mutations. Leuk. Lymphoma 56, 2141–2145 (2015).
Article CAS PubMed Google Scholar
Georgakopoulos-Soares, I. et al. Transcription-coupled repair and mismatch repair contribute towards preserving genome integrity at mononucleotide repeat tracts. Nat. Commun. 11, 1980 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ferch, U. et al. Inhibition of MALT1 protease activity is selectively toxic for activated B cell-like diffuse large B cell lymphoma cells. J. Exp. Med. 206, 2313–2320 (2009).
Article CAS PubMed PubMed Central Google Scholar
Oakes, C. C. et al. DNA methylation dynamics during B cell maturation underlie a continuum of disease phenotypes in chronic lymphocytic leukemia. Nat. Genet. 48, 253–264 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bodker, J. S. et al. A multiple myeloma classification system that associates normal B-cell subset phenotypes with prognosis. Blood Adv. 2, 2400–2411 (2018).
Article PubMed PubMed Central CAS Google Scholar
Sugita, Y. et al. Primary central nervous system lymphomas and related diseases: Pathological characteristics and discussion of the differential diagnosis. Neuropathology 36, 313–324 (2016).
Article PubMed Google Scholar
Vinagre, J. et al. Frequency of TERT promoter mutations in human cancers. Nat. Commun. 4, 2185 (2013).
Article ADS PubMed CAS Google Scholar
Stocher, M. et al. Parallel detection of five human herpes virus DNAs by a set of real-time polymerase chain reactions in a single run. J. Clin. Virol. 26, 85–93 (2003).
Article CAS PubMed Google Scholar
Kretzmer, H. et al. DNA methylome analysis in Burkitt and follicular lymphomas identifies differentially methylated regions linked to somatic mutation and transcriptional control. Nat. Genet. 47, 1316–1325 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tischler, G. & Leonard, S. biobambam: tools for read pair collation based algorithms on BAM files. Source Code Biol. Med. 9, 13 (2014).
Article PubMed Central Google Scholar
Arthur, S. E. et al. Genome-wide discovery of somatic regulatory variants in diffuse large B-cell lymphoma. Nat. Commun. 9, 4001 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhang, J. et al. Genetic heterogeneity of diffuse large B-cell lymphoma. Proc. Natl Acad. Sci. USA 110, 1398–1403 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ishaque, N. et al. Whole genome sequencing puts forward hypotheses on metastasis evolution and therapy in colorectal cancer. Nat. Commun. 9, 4782 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Rimmer, A. et al. Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications. Nat. Genet. 46, 912–918 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Article PubMed PubMed Central CAS Google Scholar
Jiang, Y., Qiu, Y., Minn, A. J. & Zhang, N. R. Assessing intratumor heterogeneity and tracking longitudinal and spatial clonal evolutionary history by next-generation sequencing. Proc. Natl Acad. Sci. USA 113, E5528–E5537 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Sahm, F. et al. Meningiomas induced by low-dose radiation carry structural variants of NF2 and a distinct mutational signature. Acta Neuropathol. 134, 155–158 (2017).
Article CAS PubMed Google Scholar
Uhrig, S. et al. Accurate and efficient detection of gene fusions from RNA sequencing data. Genome Res. 31, 448–460 (2021).
Article PubMed PubMed Central Google Scholar
Kleinheinz, K. et al. ACEseq—allele specific copy number estimation from whole genome sequencing. Preprint at bioRxiv https://doi.org/10.1101/210807 (2017).
Hubschmann, D. et al. Analysis of mutational signatures with yet another package for signature analysis. Genes Chromosomes Cancer https://doi.org/10.1002/gcc.22918 (2020).
Leiserson, M. D., Wu, H. T., Vandin, F. & Raphael, B. J. CoMEt: A statistical approach to identify combinations of mutually exclusive alterations in cancer. Genome Biol. 16, 160 (2015).
Article PubMed PubMed Central CAS Google Scholar
Gonzalez-Perez, A. et al. IntOGen-mutations identifies cancer drivers across tumor types. Nat. Methods 10, 1081–1082 (2013).
Article CAS PubMed PubMed Central Google Scholar
Paramasivam, N. et al. Mutational patterns and regulatory networks in epigenetic subgroups of meningioma. Acta Neuropathol. 138, 295–308 (2019).
Article CAS PubMed Google Scholar
Dobin, A. et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS PubMed Google Scholar
Jurmeister, P. et al. Parallel screening for ALK, MET and ROS1 alterations in non-small cell lung cancer with implications for daily routine testing. Lung Cancer 87, 122–129 (2015).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank the German Cancer Research Center (Deutsches Krebsforschungszentrum, DKFZ) Omics IT, Data Management Core Facility (ODCF), and the sequencing unit of the Genomics and Proteomics Core Facility (GPCF) for providing excellent technical support. We thank the NCT Molecular Precision Oncology Program for technical support and funding through project numbers HIPO H050 and A050 (SWi). We thank the German Cancer Consortium (DKTK), Partner site Berlin for technical support and funding through XD013 sequencing (JR, FH). Cartoon images in Fig. 1a, b were partially created with BioRender.com. We are indebted to Stefanie Mende, Petra Matylewski, Kathrein Permien, Vera Wolf, Sandra Meier, and Silvia Stefaniak for excellent technical assistance. We thank Werner Stenzel, Arend Koch, David Capper, Christine Sers and Ingeborg Tinhofer-Keilholz for valuable experimental advice. JR is a participant in the BIH-Charité Clinical Scientist Program funded by the Charité – Universitätsmedizin Berlin and the Berlin Institute of Health. CL is supported by postdoctoral Beatriu de Pinós from Secretaria d’Universitats I Recerca del Departament d’Empresa i Coneixement de la Generalitat de Catalunya and by Marie Sklodowska-Curie COFUND program from H2020 (2018-BP-00055). The ICGC MMML-seq Project has been supported by the German Ministry of Science and Education (BMBF) in the framework of the ICGC MMML-Seq (01KU1002A-J) and the ICGC DE-Mining (01KU1505G and 01KU1505E). Research of RS on PCNSL is supported by Deutsche Krebshilfe (grant: 70113053).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Josefine Radke, Naveed Ishaque.
These authors jointly supervised this work: Reiner Siebert, Stefan Wiemann, Frank L. Heppner.

Authors and Affiliations

Department of Neuropathology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin Institute of Health, Berlin, Germany
Josefine Radke, Randi Koll, Elisa Schumann, Fabienne Pritsch, Debora Pehl, Helena Radbruch, Anja Osterloh & Frank L. Heppner
German Cancer Consortium (DKTK), Partner Site Charité Berlin, Berlin, Germany
Josefine Radke, Randi Koll, Elisa Schumann, Fabienne Pritsch & Frank L. Heppner
Institute of Pathology, Universitätsmedizin Greifswald, Greifswald, Germany
Josefine Radke & Shashwat Sahay
Digital Health Center, Berlin Institute of Health (BIH) at Charité - Universitätsmedizin Berlin, Berlin, Germany
Naveed Ishaque, Shashwat Sahay, Roland Eils, Roland Eils & Roland Eils
Molecular Precision Oncology Program, NCT Heidelberg, Heidelberg, Germany
Zuguang Gu, Sebastian Uhrig, Daniel Hübschmann, Xavier Pastor Hostench, Dilafruz Juraeva, Nagarajan Paramasivam & Benedikt Brors
Division of Applied Bioinformatics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Lina Sieverling
Division of Translational Medical Oncology, National Center for Tumor Diseases (NCT), Heidelberg, Germany
Lina Sieverling
Computational Oncology, Molecular Diagnostics Program, NCT Heidelberg, Heidelberg, Germany
Sebastian Uhrig & Daniel Hübschmann
Heidelberg Institute for Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), Heidelberg, Germany
Daniel Hübschmann
Department of Pediatric Oncology, Hematology and Immunology, University Hospital, Heidelberg, Germany
Daniel Hübschmann
German Cancer Consortium (DKTK), Heidelberg, Germany
Daniel Hübschmann & Stefan Wiemann
Division Neuroblastoma Genomics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Umut H. Toprak
Institute of Human Genetics, Ulm University & Ulm University Medical Center, Ulm, Germany
Cristina López, Reiner Siebert, Ole Ammerpohl, Cristina López, Rabea Wagener, Reiner Siebert & Reiner Siebert
Institut d’Investigacions Biomèdiques August Pi Sunyer (IDIBAPS), CIBER de Cáncer, Barcelona, Spain
Cristina López
Division of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
Simone Borgoni & Stefan Wiemann
Hopp Children’s Cancer Center, NCT Heidelberg (KiTZ), Heidelberg, Germany
Gnana Prakash Balasubramanian
Division of Pediatric Neurooncology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
Gnana Prakash Balasubramanian
Bioinformatics and Omics Data Analytics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Matthias Schlesner
Biomedical Informatics, Data Mining and Data Analytics, Augsburg University, Augsburg, Germany
Matthias Schlesner
Institute of Cell Biology (Cancer Research), Medical Faculty, University of Duisburg-Essen, Essen, Germany
Marc Weniger & Ralf Küppers
German Cancer Consortium (DKTK), Partner Site Essen, Essen, Germany
Marc Weniger & Ralf Küppers
Department of Hematology, Oncology and Cancer Immunology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin Institute of Health, Berlin, Germany
Agnieszka Korfel
Department of Neurosurgery, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin Institute of Health, Berlin, Germany
Martin Misch, Julia Onken, Katharina Faust & Peter Vajkoczy
Department for Neurosurgery, Vivantes Klinikum im Friedrichshain, Berlin, Germany
Dag Moskopp & Yawen Wang
Department for Neurosurgery, Vivantes Klinikum Neukölln, Berlin, Germany
Andreas Jödicke
Department of Hematology and Oncology, Georg-Augusts-University of Göttingen, Göttingen, 37075, Germany
Lorenz Trümper, Friederike Braulke, Nicole Klepl, Christina Stadler & Dieter Kube
Department of Pathology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin Institute of Health, Berlin, Germany
Ioannis Anagnostopoulos, Dido Lenze & Michael Hummel
Medical Department of Hematology and Oncology, Johannes Kepler University, Kepler Universitätsklinikum, Linz, Austria
Clemens A. Schmitt
Department of Hematology, Oncology and Tumor Immunology and Molekulares Krebsforschungszentrum, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Berlin Institute of Health, Berlin, Germany
Clemens A. Schmitt
German Cancer Research Center (DKFZ), Heidelberg, Germany
Otmar D. Wiestler
Genomics and Proteomics Core Facility, German Cancer Research Center (DKFZ), Heidelberg, Germany
Stephan Wolf
Department of Neurosurgery, University Hospital Heidelberg, Heidelberg, Germany
Andreas Unterberg & Christel Herold-Mende
Cluster of Excellence, NeuroCure, Charitéplatz 1, 10117, Berlin, Germany
Frank L. Heppner
German Center for Neurodegenerative Diseases (DZNE) Berlin, 10117, Berlin, Germany
Frank L. Heppner
Institute of Human Genetics, Christian-Albrechts-University, Kiel, Germany
Reiner Siebert, Susanne Wagner, Andrea Haake, Julia Richter, Gesine Richter, Anke K. Bergmann, Ole Ammerpohl, Sietse M. Aukema, Andrea Haake, Cristina López, Inga Nagel, Julia Richter, Inga Vater, Rabea Wagener & Reiner Siebert
Institute of Pathology, Hematopathology Section, Christian-Albrechts-University, Kiel, Germany
Julia Richter, Wolfram Klapper, Monika Szczepanowski, Sietse M. Aukema & Julia Richter
Division of Theoretical Bioinformatics (B080), German Cancer Research Center (DKFZ), Heidelberg, Germany
Roland Eils, Chris Lawerenz, Jürgen Eils, Jules Kerssemakers, Christina Jaeger-Schmidt, Ingrid Scholz, Roland Eils & Kortine Kleinheinz
Department for Bioinformatics and Functional Genomics, Institute of Pharmacy and Molecular Biotechnology and Bioquant, University of Heidelberg, Heidelberg, Germany
Roland Eils, Roland Eils & Kortine Kleinheinz
Department of Pediatrics, University Hospital Schleswig-Holstein, Campus Kiel, Kiel, Germany
Anke K. Bergmann & Alexander Claviez
Department of Internal Medicine/Hematology, Friedrich-Ebert-Hospital, Neumünster, Germany
Christoph Borst, Siegfried Haas & Siegfried Haas
Pediatric Hematology and Oncology, University Hospital Muenster, Muenster, Germany
Birgit Burkhardt, Jasmin Lisfeld & Marius Rohde
Pediatric Hematology and Oncology, University Hospital Giessen, Giessen, Germany
Birgit Burkhardt
Department of Medicine III - Campus Grosshadern, University Hospital Munich, Munich, Germany
Martin Dreyling & Sonja Eberth
Department of Medicine and Poliklinik II, University Hospital Würzburg, Würzburg, Germany
Hermann Einsele
Department of Medicine III, Hematology and Oncology, Dr. Horst-Schmidt-Kliniken of Wiesbaden, Wiesbaden, Germany
Norbert Frickhofen
Senckenberg Institute of Pathology, University of Frankfurt Medical School, Frankfurt am Main, Germany
Martin-Leo Hansmann & Martin-Leo Hansmann
Department of Internal Medicine II: Hematology and Oncology, University Medical Center Schleswig-Holstein, Campus Kiel, Kiel, Germany
Dennis Karsch & Michael Kneba
Hematology and Oncology, Hospital of Internal Medicine II, St-Georg Hospital Leipzig, Leipzig, Germany
Luisa Mantovani-Löffler
Department of Pathology, Robert-Bosch-Hospital, Stuttgart, Germany
German Ott & German Ott
Clinic for Hematology and Oncology, St.-Antonius-Hospital, Eschweiler, Germany
Peter Staib
Department for Internal Medicine III, University of Ulm and University Hospital of Ulm, Ulm, Germany
Stephan Stilgenbauer
National Centre for Tumor Disease, Heidelberg, Germany
Thorsten Zenz
Comprehensive Cancer Center Ulm (CCCU), University Hospital Ulm, Ulm, Germany
Ulrike Kostezka
Institute of Pathology, University of Ulm and University Hospital of Ulm, Ulm, Germany
Peter Möller
Institute of Pathology, Comprehensive Cancer Center Mainfranken, University of Würzburg, Würzburg, Germany
Andreas Rosenwald, Ellen Leich, Jordan Pischimariov & Andreas Rosenwald
Department of Pediatric Oncology, Hematology and Clinical Immunology, Heinrich-Heine-University, Düsseldorf, Germany
Vera Binder, Arndt Borkhardt & Jessica I. Hoell
Division of Molecular Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
Peter Lichter & Bernhard Radlwimmer
Institute of Clinical Molecular Biology, Christian-Albrechts-University, Kiel, Germany
Philip Rosenstiel & Markus Schilhabel
Department of General Internal Medicine, University Kiel, Kiel, Germany
Stefan Schreiber
Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Hans Binder, Gero Doose, Steve Hoffmann, Lydia Hopp, Helene Kretzmer, David Langenberger & Peter F. Stadler
Department of Computer science, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Hans Binder, Gero Doose, Steve Hoffmann, Helene Kretzmer, David Langenberger & Peter F. Stadler
Transcriptome Bioinformatics, LIFE Research Center for Civilization Diseases, University of Leipzig, Leipzig, Germany
Stephan H. Bernhart, Gero Doose, Steve Hoffmann, Helene Kretzmer, David Langenberger & Peter F. Stadler
Institute for Medical Informatics Statistics and Epidemiology, University of Leipzig, Leipzig, Germany
Markus Kreuz
EMBL Heidelberg, Genome Biology, Heidelberg, Germany
Jan Korbel, Markus Loeffler, Maciej Rosolowski & Stephanie Sungalee
RNomics Group, Fraunhofer Institute for Cell Therapy and Immunology IZI, Leipzig, Germany
Peter F. Stadler
Santa Fe Institute, Santa Fe, NM, USA
Peter F. Stadler
Max-Planck-Institute for Mathematics in Sciences, Leipzig, Germany
Peter F. Stadler

Authors

Josefine Radke
View author publications
You can also search for this author in PubMed Google Scholar
Naveed Ishaque
View author publications
You can also search for this author in PubMed Google Scholar
Randi Koll
View author publications
You can also search for this author in PubMed Google Scholar
Zuguang Gu
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Schumann
View author publications
You can also search for this author in PubMed Google Scholar
Lina Sieverling
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Uhrig
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Hübschmann
View author publications
You can also search for this author in PubMed Google Scholar
Umut H. Toprak
View author publications
You can also search for this author in PubMed Google Scholar
Cristina López
View author publications
You can also search for this author in PubMed Google Scholar
Xavier Pastor Hostench
View author publications
You can also search for this author in PubMed Google Scholar
Simone Borgoni
View author publications
You can also search for this author in PubMed Google Scholar
Dilafruz Juraeva
View author publications
You can also search for this author in PubMed Google Scholar
Fabienne Pritsch
View author publications
You can also search for this author in PubMed Google Scholar
Nagarajan Paramasivam
View author publications
You can also search for this author in PubMed Google Scholar
Gnana Prakash Balasubramanian
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Schlesner
View author publications
You can also search for this author in PubMed Google Scholar
Shashwat Sahay
View author publications
You can also search for this author in PubMed Google Scholar
Marc Weniger
View author publications
You can also search for this author in PubMed Google Scholar
Debora Pehl
View author publications
You can also search for this author in PubMed Google Scholar
Helena Radbruch
View author publications
You can also search for this author in PubMed Google Scholar
Anja Osterloh
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Korfel
View author publications
You can also search for this author in PubMed Google Scholar
Martin Misch
View author publications
You can also search for this author in PubMed Google Scholar
Julia Onken
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Faust
View author publications
You can also search for this author in PubMed Google Scholar
Peter Vajkoczy
View author publications
You can also search for this author in PubMed Google Scholar
Dag Moskopp
View author publications
You can also search for this author in PubMed Google Scholar
Yawen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Jödicke
View author publications
You can also search for this author in PubMed Google Scholar
Lorenz Trümper
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Anagnostopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Dido Lenze
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Küppers
View author publications
You can also search for this author in PubMed Google Scholar
Michael Hummel
View author publications
You can also search for this author in PubMed Google Scholar
Clemens A. Schmitt
View author publications
You can also search for this author in PubMed Google Scholar
Otmar D. Wiestler
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Unterberg
View author publications
You can also search for this author in PubMed Google Scholar
Roland Eils
View author publications
You can also search for this author in PubMed Google Scholar
Christel Herold-Mende
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Brors
View author publications
You can also search for this author in PubMed Google Scholar
Reiner Siebert
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Wiemann
View author publications
You can also search for this author in PubMed Google Scholar
Frank L. Heppner
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

ICGC MMML-Seq Consortium

Coordination (C1)
- Reiner Siebert
- , Susanne Wagner
- , Andrea Haake
- , Julia Richter
- & Gesine Richter
Data Center (C2)
- Roland Eils
- , Chris Lawerenz
- , Jürgen Eils
- , Jules Kerssemakers
- , Christina Jaeger-Schmidt
- & Ingrid Scholz
Clinical Centers (WP1)
- Anke K. Bergmann
- , Christoph Borst
- , Friederike Braulke
- , Birgit Burkhardt
- , Alexander Claviez
- , Martin Dreyling
- , Sonja Eberth
- , Hermann Einsele
- , Norbert Frickhofen
- , Siegfried Haas
- , Martin-Leo Hansmann
- , Dennis Karsch
- , Nicole Klepl
- , Michael Kneba
- , Jasmin Lisfeld
- , Luisa Mantovani-Löffler
- , Marius Rohde
- , German Ott
- , Christina Stadler
- , Peter Staib
- , Stephan Stilgenbauer
- , Lorenz Trümper
- & Thorsten Zenz
Normal Cells (WPN)
- Martin-Leo Hansmann
- , Dieter Kube
- , Ralf Küppers
- & Marc Weniger
Pathology and Analyte Preparation (WP2-3)
- Siegfried Haas
- , Michael Hummel
- , Wolfram Klapper
- , Ulrike Kostezka
- , Dido Lenze
- , Peter Möller
- , Andreas Rosenwald
- , German Ott
- & Monika Szczepanowski
Sequencing and genomics (WP4-7)
- Ole Ammerpohl
- , Sietse M. Aukema
- , Vera Binder
- , Arndt Borkhardt
- , Andrea Haake
- , Jessica I. Hoell
- , Ellen Leich
- , Peter Lichter
- , Cristina López
- , Inga Nagel
- , Jordan Pischimariov
- , Bernhard Radlwimmer
- , Julia Richter
- , Philip Rosenstiel
- , Andreas Rosenwald
- , Markus Schilhabel
- , Stefan Schreiber
- , Inga Vater
- , Rabea Wagener
- & Reiner Siebert
Bioinformatics (WP8-9)
- Stephan H. Bernhart
- , Hans Binder
- , Benedikt Brors
- , Gero Doose
- , Roland Eils
- , Steve Hoffmann
- , Lydia Hopp
- , Daniel Hübschmann
- , Kortine Kleinheinz
- , Helene Kretzmer
- , Markus Kreuz
- , Jan Korbel
- , David Langenberger
- , Markus Loeffler
- , Maciej Rosolowski
- , Matthias Schlesner
- , Peter F. Stadler
- & Stephanie Sungalee

Contributions

J.R., R.K., E.S., F.P., S.B., D.L., and M.W. performed experiments and analyzed data. Z.G., L.S., D.H., U.H.T., X.P.H., N.P., M.S., C.L., D.J., G.P.B., S.S., S.U., and N.I. carried out the bioinformatics analyses and processed data. D.P., H.R., and A.O. performed sample acquisition and histological analyses. I.A. performed histological analyses. D.L. performed FISH analyses. A.K., M.M., J.O., K.F., P.V., D.M., Y.W., A.J., and A.U. clinically assessed or operated patients and contributed to sample acquisition. C.H.M., S.W.i., and J.R. coordinated sample acquisition. S.W.i. and S.W.o. oversaw sequencing of the samples. R.S., M.S., L.T., and R.K.ü. provided data from the MMML cohort. Experimental design and execution were overseen by F.L.H., S.W.i., O.W., C.H.M., B.B., C.A.S., R.S., M.H., R.K.ü., and R.E. J.R., N.I., S.W.i., F.L.H., and R.S. designed the study. RS coordinated the ICGC MMML-Seq network. J.R. and N.I. wrote the manuscript. J.R. prepared the final figures. F.L.H., S.W.i., O.W., C.H.M., B.B., C.A.S., R.S, and M.H. critically revised the manuscript. All authors critically read, reviewed, and approved the final manuscript.

Corresponding authors

Correspondence to Josefine Radke or Naveed Ishaque.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Mark Roschewski and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Supplementary Data 10

Supplementary Data 11

Supplementary Data 12

Supplementary Data 13

Supplementary Data 14

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Radke, J., Ishaque, N., Koll, R. et al. The genomic and transcriptional landscape of primary central nervous system lymphoma. Nat Commun 13, 2558 (2022). https://doi.org/10.1038/s41467-022-30050-y

Download citation

Received: 19 August 2021
Accepted: 12 April 2022
Published: 10 May 2022
DOI: https://doi.org/10.1038/s41467-022-30050-y

This article is cited by

Machine learning-based pathomics signature of histology slides as a novel prognostic indicator in primary central nervous system lymphoma
- Ling Duan
- Yongqi He
- Feng Chen
Journal of Neuro-Oncology (2024)
Effectiveness and safety of orelabrutinib with rituximab and lenalidomide in low KPS score and elderly patients with newly diagnosed primary central nervous system lymphoma—a single-center retrospective study
- Xiao-Ling Guo
- Zi-Yu Zhao
- Jin-Hai Ren
Neurological Sciences (2024)
Neue Therapiestrategien beim primären Lymphom des Zentralnervensystems
- Sabine Seidel
- Leon Kaulen
- Louisa von Baumgarten
Der Nervenarzt (2024)
Liquid biopsy of cerebrospinal fluid enabling the detection and therapeutic targeting of the BRAFV600E mutation in a patient with overlapping Erdheim-Chester/Rosai-Dorfman disease
- Valentina Pieri
- Giulia Berzero
- Massimo Filippi
Journal of Neurology (2024)
Extranodal lymphoma: pathogenesis, diagnosis and treatment
- Hua Yang
- Yang Xun
- Hua You
Molecular Biomedicine (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.