# Integrative discovery of treatments for high-risk neuroblastoma

## Abstract

Despite advances in the molecular exploration of paediatric cancers, approximately 50% of children with high-risk neuroblastoma lack effective treatment. To identify therapeutic options for this group of high-risk patients, we combine predictive data mining with experimental evaluation in patient-derived xenograft cells. Our proposed algorithm, TargetTranslator, integrates data from tumour biobanks, pharmacological databases, and cellular networks to predict how targeted interventions affect mRNA signatures associated with high patient risk or disease processes. We find more than 80 targets to be associated with neuroblastoma risk and differentiation signatures. Selected targets are evaluated in cell lines derived from high-risk patients to demonstrate reversal of risk signatures and malignant phenotypes. Using neuroblastoma xenograft models, we establish CNR2 and MAPK8 as promising candidates for the treatment of high-risk neuroblastoma. We expect that our method, available as a public tool (targettranslator.org), will enhance and expedite the discovery of risk-associated targets for paediatric and adult cancers.

## Introduction

Neuroblastoma is a cancer of the sympathetic nervous system, which accounts for 7% of all childhood cancers and 15% of childhood cancer-related deaths1. Mostly diagnosed before 2 years of age, it can lead to a broad range of clinical outcomes, from spontaneous regression of the tumour to metastatic disease with poor prognosis2,3. Clinical manifestations and genetic markers help identify the children with high-risk disease and guide the treatment3. Sequencing of neuroblastoma genomes has uncovered actionable mutations, particularly ALK, present in 8% of neuroblastoma cases4,5. Still, the clinical management of high-risk cases remains difficult, with survival rates <50%6.

The goal of this study is to identify additional therapeutic targets against high-risk neuroblastoma. To achieve this, we combine integrative data analysis with experimental evaluation in cell lines from patient-derived xenografts. Specifically, we explore the hypothesis that the clinical risk of a neuroblastoma patient, as defined by established clinical and genomic markers, can be well approximated by an mRNA signature. By integrating such signatures with recent pharmacogenomic data from cancer and nonmalignant cell lines7 and drug-to-target networks8, we then propose that such high-risk signatures can be associated to drugs that share common and statistically enriched therapeutic targets. Supporting this idea, recent work on neuroblastoma has identified risk signatures defining differentiation, patient outcome, and 1p36.3 deletion9,10,11,12. Such signatures may reflect important druggable processes in tumour cells that are not readily implied by standard DNA sequencing (c.f. ALK). Integrative modelling across data sets enables the detection of genetic and epigenetic regulators of risk signatures13,14,15. Predicting drug targets for high-risk neuroblastoma with broad-scope integrative algorithms thus seems a promising strategy that has not yet been explored. We expect such large scale integration to be particularly productive when based on RNA data, partly because RNA signatures are well supported as indicators of neuroblastoma disease processes, but also because it is available in higher quantities than, for instance, protein profiling data.

When the mRNA signature is known, search tools16,17,18,19 can be used to propose drugs that match the gene markers. By comparison, performing an analysis that links clinical risk factors and disease signatures to protein targets is an analytical task of a different dimension, and requires additional integration and analysis steps. Therefore, a second goal of this work is to provide a generally applicable tool (not specific to neuroblastoma) that will facilitate the association between cancer risk factors, signatures and therapeutic targets.

We propose an algorithm, TargetTranslator, to systematically identify targets for high-risk neuroblastoma. Using data from US and EU patient cohorts, our algorithm finds 10 mRNA signatures of neuroblastoma risk and differentiation, which are mapped to 19,763 unique compounds in 14 cell line models, revealing 88 statistically significant protein targets against high-risk neuroblastoma. We then characterise selected targets by more than 700 RNA profiling experiments in drug-treated neuroblastoma cells and show that interfering with two drug targets, the mitogen-activated protein kinase 8 (MAPK8) and the cannabinoid receptor 2 (CNR2) suppress tumour growth in both zebrafish and mouse xenograft models. Together, these results deepen our understanding of neuroblastoma vulnerabilities and provide a tool for data-guided cancer target discovery.

## Results

### Discovery approach and data sources

Previous work provides a strong rationale for exploring neuroblastoma by RNA signatures, which can serve as proxy indicators of oncogene activation, patient risk, and tumour cell differentiation status9,10,11,12. The overriding goal of our strategy is, therefore, to identify targets with potential to modulate such RNA signatures and thereby suppress malignant phenotypes of neuroblastoma cells, potentially resulting in a reduction of tumour growth. Exploring this idea, our discovery approach has two key steps. First, we estimate mRNA signatures of neuroblastoma from patient omics data, which are optimised for integration with public data sources to predict therapeutic targets (Fig. 1a). Second, we evaluate the predicted targets in patient-derived neuroblastoma cells (Fig. 1b). Specifically, the experiments evaluate (i) if mRNA disease signatures change as predicted after drug treatment, (ii) if treatment affects malignant phenotypes in cells, and (iii) if treatment inhibits in vivo tumour growth. To facilitate the evaluation of multiple targets at a reasonable cost, we propose two methods: 384-well SMART-Seq2-based RNA sequencing (RNA-Seq), and automated imaging of GFP-tagged cells in zebrafish embryos (Fig. 1b).

We integrated three different levels of large scale data (Fig. 1a). The first level of data comprised neuroblastoma omics data from the R220, NIH-TARGET21 and SEQC22 biobanks, with a total of 833 cases. From the clinical, genetic and transcriptional parts of these data, we established 16 disease-associated risk factors, oncogenes, and signatures (Table 1, Supplementary Data 1). Among these, stage, age, MYCN and 11q deletion are routinely used for clinical management3,23, and ALK mutation for targeted therapy24. We also added gene signatures of patient risk11, oncogene activation25 and differentiation level9,12. (Because they were not genotyped in all three data sets, mutations of TERT and ATRX were not part of the analysis.) The two other levels of data were pharmaco-transcriptomic data from the LINCS/L1000 database of drug-induced mRNA changes in human cells7 and drug-to-protein target information from the STITCH5 database8. To gain predictive power, we used a version of the LINCS/L1000 data, in which the transcriptional effect of a drug is estimated from multiple replicates (Supplementary Fig. 1). The full data set thus comprised data for 833 cases, annotated with 16 risk factors, oncogenes and disease signatures, mRNA drug response data for 19,763 unique chemical compounds (we will use the term ‘drug’ below, for a more concise presentation) and 452,782 links between drugs and protein targets, involving 3421 unique LINCS/L1000 drugs and 17,086 unique targets.

### Association between risk factors, signatures and targets

Our algorithm, TargetTranslator, estimates mRNA signatures by solving a linear least squares problem, in which each risk factor (e.g. MYCN amplification) or genetic aberration is fitted by linear weights (i.e. the signature) to match the expression levels of the 978 genes in the LINCS/L1000 data (Eqs. (1)–(3) in Methods, and Supplementary Figs. 1 and 2). Applying this method to the neuroblastoma data, we confirmed the quality of the fitted signatures by cross-validation, whereby we checked the consistency (correlation) of signatures between the three different cohorts. For example, signatures of MYCN amplification estimated from each of the R2, TARGET and SEQC cohorts were all highly correlated, with an average Pearson correlation ($$r$$) of 0.86. Out of our 16 mRNA signatures, 10 were correlated (average $$r\ > \ 0.4$$; Fig. 2a), and were thus considered robust and used for further analyses. The consistency across cohorts was higher with our method than using the previously reported Characteristic Direction26 algorithm (Fig. 2a). We conclude that many, but not all, clinical, genetic or transcriptional markers associated with neuroblastoma are well approximated by TargetTranslator signatures fitted to the L1000 genes. A principal component analysis (PCA) of our RNA signatures also indicate that they separate largely into differentiation (PC2) and risk-related (PC1) factors (Fig. 2b).

Mathematically, target discovery is done in two steps. TargetTranslator first computes a match score (between 0 and 1) between each neuroblastoma signature and the drug-induced expression profiles in LINCS/L1000 (Eq. (4) in Methods). The scores (one score per drug) are summarized to an empirical cumulative distribution function (ecdf) plot and their significances are assessed via permutation tests (Fig. 2c). For each protein target in STITCH5, we then separate the drugs in LINCS/L1000 into two groups: those that have a link to the target and those that do not (by link, we mean a STITCH score above a stringent threshold, see Methods). A two-sample, one-tailed, Kolmogorov–Smirnov test is subsequently used to compare scores for the two groups of target-associated vs non-associated compounds (Fig. 2d). Proteins for which the null hypothesis is rejected are predicted to be targets. We correct for multiple testing across targets and report FDR adjusted q-values. This procedure is repeated twice, with alternating sign (−/+) of the signature, to identify targets likely to be associated with suppression (−) and enhancement (+) of each signature. Both cases can be therapeutically relevant since we both seek to suppress signatures associated with risk (like MYCN) and enhance signatures likely to be associated with slower tumour growth (like differentiation).

### Identification of 88 targets for high-risk neuroblastoma

Following this strategy, TargetTranslator gave a rich set of predictions of druggable targets for high-risk neuroblastoma. Applied to the three neuroblastoma cohorts, we obtained a total of 88 enriched drug targets that have a q-value of <0.0001 for at least one risk factor (Fig. 3). First, we noted that already-established targets, including MTOR and PIK3 isoforms27, EGFR28 and CDK229, were associated with several of the molecular and clinical risk signatures, as indicated by a shift in the ecdf curve (Fig. 3, example of MTOR in Fig. 2c). Second, there was a spectrum of targets that have limited evidence of neuroblastoma effect in the literature which still had a strong enrichment. Notable examples include MAPK8 and the cannabinoid receptor CNR2 (Fig. 2c), as well as MCL1 (Fig. 3). As further validation of our method, we found that MYC was detected as a target selectively associated with the MYCN amplification signature and that the RARB receptor of retinoic acid (which induces a differentiation phenotype in neuroblastoma30), was significantly associated to differentiation signatures (Fig. 2c). Inspecting the results further, we also found a number of interesting drugs, which had a high ranking match score for at least one risk factor, but where LINCS/L1000 contained too few similar drugs (fewer than 4 with the same STITCH5 target) to motivate target enrichment with the Kolmogorov–Smirnov test. Notable examples were drugs targeting glycosylceramide synthase UGCG (DL-PDMP), the benzodiazepine receptor TSPO (PK11195) and ROCK (fasudil).

Taken together, TargetTranslator detected targets and small molecules that merit further consideration for neuroblastoma therapy. The detected targets show differential association with molecular risk and differentiation signatures (Fig. 3). Minimal spanning tree (MST) analysis of highly scoring drug targets visualized possible relationships between predicted targets (Fig. 2d), suggesting possibilities for single-agent or multiple agent interventions.

### RNA profiling confirmed TargetTranslator predictions

To explore the therapeutic potential of our predictions, we selected a set of representative inhibitors or agonists against 11 targets for experimental evaluation (Table 2a). Of these, four were known targets and under clinical development: PI3K (trial NCT03458728), MTOR (NCT03213678), CDK4/6 (NCT01747876, NCT02644460) and HMGCR (NCT02390843). Three, in turn, show some promise in preclinical animal studies (PPARG31, CDK229, ROCK32 and four were comparatively less studied (MAPK8, CNR2, UGCG, TSPO).

In a first experiment, we tested if neuroblastoma cells treated with these compounds would change their gene expression pattern as predicted (i.e. by suppression or induction of the risk signatures, c.f. Fig. 3). As a representative model, we selected two patient-derived xenograft (PDX) cultures from two high-risk patients with MYCN amplified neuroblastoma, termed NB-PDX2 and NB-PDX3. Both cell lines were treated with 13 drugs (the 11 targeted drugs above, plus the differentiation agent retinoic acid and the BET bromodomain inhibitor JQ1, which downregulates MYCN33), at three concentrations equivalent to IC10, IC20 and IC50, for 6 and 24 h, followed by RNA profiling using smart-Seq2 (Fig. 4a). The drug-induced transcriptional changes in NB-PDX2 and NB-PDX3 cells were observable as distinct directions in the principal component space (Fig. 4b). The magnitude of transcriptional change along the principal components was proportional to drug dose and drug exposure time (Supplementary Table 1); drug-induced vectors were also relatively conserved between the NB-PDX2 and NB-PDX3 cultures (Supplementary Fig. 3). In both cell cultures, MTOR/PI3K and CDK4/6 inhibitors (omipalisib, torin-2 and palbociclib) induced a change along PC1, shifting the transcriptome away from the risk signatures. Lipid metabolism targeted agents (lovastatin, DL-PDMP and rosiglitazone) had high magnitudes along PC2, and PC3 was led by MAPK8 inhibitor AS601245. PC 4, finally, contained the differentiation-inducing agents RA and JQ1, shifting the transcriptome towards the differentiation signatures.

To formally assess the accuracy of our predictions, we performed a receiver operating characteristic (ROC) analysis. First, we identified all (n = 24) cases in which one of the 11 drugs either induced (n = 8) or suppressed (n = 16) a risk signature in the neuroblastoma patient-derived cells. Next, we computed the sensitivity and specificity of TargetTranslator with respect to detecting these associations. ROC analysis showed an excellent sensitivity vs specificity curve for both negative associations (vs no association) and positive associations (vs no association), with an average area under the curve (AUC) of 0.916 (Fig. 4c). The reproducibility of the scores was on a comparable level when contrasting the two PDX cultures (AUC 0.848, Supplementary Fig. 3E, F).

Pathways affected by the tested compounds were consistent with drug mechanism of action. Specifically, Gene Set Enrichment Analysis (GSEA) revealed that cholesterol homoeostasis was altered upon lovastatin and rosiglitazone treatment, the suppression of MYC targets by JQ1 and the inhibition of MTOR pathway members following torin-2 treatment (Fig. 4d). Most of our predictions, including the CNR2 inhibitor GW405833, the UGCG inhibitor DL-PDMP, the TSPO antagonist PK11195, and the MAPK8/JNK inhibitor AS601245, downregulated the expression of genes induced during the cell cycle, implicating effects on neuroblastoma cell growth (Fig. 4d).

Together, these data show that the effects in neuroblastoma cells were dose-dependent, correlated well with our predictions, consistent with known targets and affected therapeutically relevant pathways.

### Predicted compounds suppress malignant phenotypes

The observed transcriptional changes suggested that treatment with our predicted compounds should also produce phenotypic effects. Indeed, all 11 tested compounds suppressed viability of neuroblastoma cultures, as determined by dose–response curves after 72 h of drug exposure. For 10 compounds, the reduction in viability was more marked in neuroblastoma cells (NB-PDX2, NB-PDX3, SK-N-BE(2) and SK-N-SH) when compared with glioblastoma reference cells (U3013MG), the difference being largest for the CNR2 modulator GW405833 (Fig. 5a). Using the BET inhibitor JQ1, which suppresses MYCN transcription33, and the differentiation agent retinoic acid as positive controls, we found that reduced viability coincided with an induction of apoptosis markers for seven compounds, as observed by live-cell monitoring (Fig. 5b, c).

The predicted compounds also produced significant effects on N-Myc protein levels and cell differentiation, both of which are prognostic in neuroblastoma patients3 and are mechanistically linked34,35,36. Specifically, N-Myc protein was suppressed by all compounds except torin-2, omipalisib and rosiglitazone (Fig. 5d, e). Neurite outgrowth—a proxy for differentiation—was observed following treatment by four compounds (palbociclib, DL-PDMP, lovastatin and GW05833) as well as the positive control retinoic acid (Fig. 5f, g).

Jointly, these results support the conclusion that compounds against predicted targets were active against in vitro growth of neuroblastoma cell lines and patient-derived cultures, with (for some compounds) concurrent effects on N-Myc protein levels and differentiation.

### Cellular action of CNR2 agonists and MAPK8 antagonists

The finding that a CNR2 agonist GW405833 and a MAPK8 inhibitor AS601245 were active in neuroblastoma cells, motivated experiments to evaluate their on-target selectivity. For cannabinoid receptors, a number of agents are available to target both CNR237 and its pharmacologically relevant paralog CNR138,39 (Table 2b). In experiments on NB-PDX3 cells, two additional specific CNR2 agonists, JWH133 and HU308, reduced neuroblastoma cell viability, whereas the CNR1 agonist ACEA did not (Supplementary Fig. 4A), nor did antagonists of either CNR1 or CNR2 suppress neuroblastoma cell growth (Supplementary Fig. 4B). We further found that neuroblastoma cells that were pre-treated with a CNR2 antagonist (SR144528) were protected against GW405833 (p = 0.014, Student’s t-test), while pre-treatment with a CNR1 antagonist (otenabant) had no protective effect (Supplementary Fig. 4C, D). Together, these findings were consistent with CNR2 activation as the key molecular pharmacological mechanism.

We similarly investigated the effect of multiple agents targeting MAPK8 and its pharmacologically relevant homologues MAPK9 and MAPK10, against which AS601245 has reported activity. Neuroblastoma cells were sensitive to JNK-IN-840 and SP60012541, but not the MAPK9/10 selective compound CC-93042, which lacked effect even at high doses (Supplementary Fig. 4E, F). The three compounds with anti-neuroblastoma effect (AS601245, JNK-IN-8 and SP600125) have higher affinity for MAPK8 than at least one of the other isoforms, while CC-930 has a 12-fold lower affinity for MAPK8 than the other isoforms (Supplementary Fig. 4F). As a complementary analysis based on genetic methods, we considered a public data set of RNA interference data from the Broad institute DepMap43, in which the tested neuroblastoma cell lines (n = 9) are more vulnerable to MAPK8 knockdown than non-neuroblastoma cell lines (n = 704), but not MAPK9 or MAPK10 (Supplementary Fig. 4G). Jointly, these results indicated that MAPK8 is an important protein in the response to AS601245.

### Targeting MAPK8 and CNR2 reduces tumour growth in zebrafish

Following the promising in vitro results, we evaluated the in vivo potential of the compounds predicted by TargetTranslator. First, we implemented a high-throughput zebrafish xenograft assay of neuroblastoma. As the interrenal gland (the fish homologue of the human adrenal gland) is too small for high-throughput and consistent tumour injections, we chose the developing brain as the site to inject PDX-derived neuroblastoma cells. This injection site was also motivated by the fact that 8% of all neuroblastomas and 20% of progressive neuroblastomas spread to the brain44,45. NB-PDX3 cells were GFP-tagged using a lentiviral construct, FACS sorted to enrich for highly fluorescent cells and ~150 cells per fish were injected into the midbrain of 1-day post fertilization (dpf) Casper zebrafish embryos (Fig. 6a). This resulted in consistent engraftments with 90% of the tumours localized to the midbrain/hindbrain (Fig. 6b). Neuroblastoma proliferation was not affected by either decrease in temperature or GFP-tagging and xenografts contained MKI67-positive human cells at 5 dpf (Supplementary Fig. 5A–D). GFP signal highly correlated with confluence over time (n = 15, Supplementary Fig. 6E) and was used as a proxy for tumour growth.

Next, we determined the tolerance to each compound by exposing 2 dpf embryos to increasing (IC20, IC50 and IC80) doses of the predicted drugs (Fig. 6c). Of the 12 tested compounds, 8 were well tolerated at neuroblastoma-inhibiting doses (AZD5438, omipalisib, palbociclib, fasudil, GW405833, torin-2, AS601245 and positive control doxorubicin) whereas 4 showed signs of toxicity (lovastatin, DL-PDMP, PK11195 and rosiglitazone). The latter group corresponds to drugs implicated as modifiers of lipid metabolism (c.f. PC2 in Fig. 4b), and their toxicity likely reflects essentiality of these processes in this particular model system, as both statins and rosiglitazone are clinically approved drugs and PK11195 has been used as a positron emission tomography (PET) radiotracer in humans46.

Focusing on the CNR2 and MAPK8 targeted compounds, zebrafish xenografts were treated for 72 h by adding drug (GW405833 or AS601245) to the surrounding water. Before (2 dpf) and after (5 dpf) treatment, zebrafish were imaged both dorsally and laterally using a Vertebrate Automated Screening Technology (VAST) system (Fig. 6d), and each fish was scored for the increase in GFP signal. MAPK8 inhibitor AS601245 inhibited tumour size to 21% of DMSO treated control after 3 days ($$p\ <\ 0.0001$$, Dunnett’s multiple comparison test), while GW405833, which was the most neuroblastoma-selective inhibitor in vitro, reduced the tumour size to 75% of control (p = 0.0335, Dunnett’s multiple comparison test (Fig. 6e, f). The lower temperature in the assay did not potentiate NB cells to either compound (Supplementary Fig. 5F). Together, these first in vivo results supported TargetTranslator as a viable tool for predicting neuroblastoma targets with significant observed effects on tumour size in a zebrafish model.

### CNR2 agonist GW405833 reduces tumour growth in mice

To evaluate the in vivo effect of AS601245 and GW405833 in a mammalian model, we performed a treatment study on the MYCN amplified SK-N-BE(2) flank-injected mouse xenografts. Mice were monitored during 8 days of treatment, with assessment of tumour growth rate, tumour weight after 8 days and immunohistochemistry as endpoints (Fig. 7). After 8 days, GW405833 significantly reduced tumour size to 75% of vehicle-treated animals (p = 0.034, linear mixed model), with no indications of toxicity in any of the 10 treated mice (Fig. 7a–e). Consistent with the in vitro results (c.f. Fig. 5b–c), GW405833 caused elevated apoptosis in treated tumours, measured by immunohistochemistry of cleaved PARP (Fig. 7f–h). AS601245, and reduced growth rate (p = 0.048, linear mixed model), but showed signs of toxicity in 3 of 8 mice, after which the treatment group was discontinued and remaining mice were sacrificed (Supplementary Fig. 6). Together, these data provided an independent set of data to support the potential of a CNR2 agonist against neuroblastoma, and encourage investigation to define MAPK8 (JNK1) inhibitors with improved toxicity profile as a future direction.

### The TargetTranslator tool

So far, we have used our method to study neuroblastoma; to facilitate the application of TargetTranslator to other cancer diagnoses as well, we have implemented a web version of TargetTranslator, which comprises a simple workflow, in which users (i) select data sets of interest, (ii) identifies risk factors and signatures of interest and (iii) run the pipeline to find risk-associated drugs and therapeutic targets. The current implementation enables users to upload their own data but also lets users analyse pre-uploaded data sets of R2, TARGET, and SEQC neuroblastoma patient cohorts and an additional 33 clinically annotated cancers from the cancer genome atlas (TCGA) consortium47. TargetTranslator is also available as a standalone R package, intended for expert users.

## Discussion

This study aimed to identify treatment options with in vivo relevance for high-risk neuroblastoma. A tailored algorithm, TargetTranslator, estimated robust mRNA signatures of neuroblastoma, which were associated with drug targets. These, in turn, were characterized by multiplexed RNA-Seq, tested in cells and xenograft models, and resulted in leads for neuroblastoma treatment. Possible applications include target discovery for tumours other than neuroblastoma, particularly to discover druggable cancer targets not found by standard methods such as DNA sequencing.

Taken together, our computational and experimental results strongly support neuroblastoma therapy development through RNA risk signature modulation (Fig. 4). The observed transcriptional changes were concurrent with the suppression of malignant phenotypes associated with elevated risk in patients (N-Myc protein levels, proliferation), and induction of favourable features (cellular differentiation, the onset of apoptosis). Conceptually, we remark that using mRNA signatures to approximate risk factors rests on specific assumptions that must be met. Above all, we assume a causative link between gene expression in tumours and observable disease outcome. One example where this is true is MYCN amplification, which is causatively linked to neuroblastoma outcome, in a manner that depends on mRNA expression48. The Differentiation 1 signature of adrenergic (ADRN) cells, here linked to several targets, also has a well understood mechanistic basis9. A second key assumption is that modulation of mRNA levels is possible, either by suppressing a signature driven by an oncogene, such as MYCN, or shifting a plastic cell state towards a more favourable one, such as forcing differentiation. When these conditions are met, TargetTranslator makes logical sense as a tool for target discovery. By contrast, in the minority of cases where there is no reproducible RNA signature (i.e. limited consistency between cohorts, Fig. 2a), further exploration will be needed. For instance, while localized oncogene amplification of MYCN yields consistent signatures, our analysis speaks against the use of broad chromosomal regions, like 11q and 17q. TargetTranslator also depends on a sufficient number of patient cases to produce robust signatures, as seen for ALK mutation status in this study. In these particular cases, better definition of functionally important markers, and more extensive patient cohorts would be needed to gain analytical power.

Our in vivo results highlight a possible therapeutic potential of two targets: MAPK8/JNK1 and the cannabinoid receptor 2 (CNR2). Signalling pathways upstream of MAPK (receptor tyrosine kinases and RAS) are commonly altered in relapse neuroblastoma49 and activated pathways are correlated with poor prognosis in primary tumours50. The role of MAPK8/JNK1 is less well understood, although it has been linked to neuroblastoma progression51. Based on our in vivo results, we propose that MAPK8 partly regulates neuroblastoma growth. The cannabinoid receptor CNR2 (encoded by a gene in the commonly deleted 1p36 region) is a relatively unexplored target in cancer. Our computational analysis identified CNR2-targeting compounds linked to multiple neuroblastoma risk signatures (Fig. 2c, Supplementary Fig. 7). Extending results in serum-cultured cells52, we found that activation of CNR2 by three different agonists reduces viability in patient-derived cell cultures in vitro (Fig. 5a, Supplementary Fig. 4A) and induces apoptosis in vivo (Fig. 7g). Interestingly, we noted that the CNR2 ligand GW405833 activated genes regulated by the rhodopsin-like G-protein coupled receptors (GPCRs), which is a family of GPCRs that includes the cannabinoid receptors (Supplementary Fig. 4H), and the effect on neuroblastoma cells could be blocked by CNR2 antagonist (Supplementary Fig. 4C, D). We speculate that GW405833 acts by downregulation of N-Myc protein leading to apoptotic onset. A more exact mechanism of action of cannabinoid receptor 2 in high-risk neuroblastoma will be reserved for future work. However, judging from the absence of specific mutations or clear molecular patterns connected to the CNR2 biology in neuroblastoma, atypical targets like CNR2 would most likely be missed by common data mining strategies. This supports the idea that integrated data analysis spanning both multi-patient cohorts and robust statistical methods that detect latent factors in the data will play a crucial role in the future exploration of patient molecular data.

The rapidly growing use of RNA data as a readout in molecular pharmacological experiments motivates further analysis to define how drug action is manifested transcriptionally. Our analysis of selected drugs in neuroblastoma cells revealed consistent drug-specific patterns between cell lines, the magnitude of which was dose-dependent and time-dependent (Fig. 4, Supplementary Table 1). As in previous pharmaco-transcriptomic experiments (e.g. ref. 7), targeted drugs generally induce changes in their primary targeted pathway, as well as secondary pathways (Fig. 4, yellow circles). At this point, it is not known whether drugs exist that induce transcriptional changes entirely outside of the L1000 gene space or which would only be detected in other layers of data (e.g. protein profiles). We regard this as a possibility and expect that future databases that span the entire transcriptome or substantial parts of the proteome will have increased power to disease signatures in neuroblastoma and other cancers.

The zebrafish xenograft system based on GFP-tagged NB-PDX3 cells developed herein provides a complementary tool to assess candidate therapeutics in neuroblastoma. Implemented in an automated 96-well format, it provides toxicological information, as well as a first assessment of tumour growth and invasion, as determined by the relative increase of observed GFP signal, with an assay time of 5 days. The assay should be used with awareness of the possible impact of reduced temperature conditions, and the intrinsic limitations of measuring tagged tumour cells by imaging methods. Since zebrafish embryos are small and carry their food supply up to 5 dpf, these short term studies require little handling. The assay thereby presents a complementary tool to prioritize candidate treatments for further studies in genetically predisposed animal models of neuroblastoma such as the TH-MYCN mice48, the LIN28B transgenic mice53 or the D$$\beta$$h-EGFP-MYCN zebrafish harbouring coexpression of activated ALK and MYCN54.

Our computational method extends and complements current tools for pharmacological data mining. Our specific goal has been to build a pipeline that takes risk factors as input and provides targets to the user. We solve this problem by a 3-step algorithm: (i) estimation of mRNA signatures directly into the 978 gene space of L1000, (ii) match scoring of mRNA signatures and drug-specific L1000 expression profiles and (iii) deconvolution of targets using STITCH5 data. Some of the individual steps have counterparts in existing methods, others do not. Two interesting search engines have been proposed that carry out a task similar to the second step of our pipeline: L1000CDS2 and CREEDS18,19. Both web tools match pre-defined gene expression signatures to pharmaco-transcriptomic databases. L1000CDS2 makes use of the L1000 compendium and CREEDS uses expression profiles manually extracted and curated from public repositories. Acknowledging that an increasing number of studies are exploring possible relevance of LINCS/L1000 or related data to disease7, our methods will thus complement existing tools and adds an evaluated algorithm that helps the user through the multi-step process of linking risk phenotypes to therapeutic targets. Users of TargetTranslator can also opt to prioritize target discovery for specific cell lines by weighing their contribution to the pipeline. An example of this is given in (c.f. Supplementary Fig. 8). Looking ahead, emerging data from, e.g. the St Jude Cloud or recent paediatric cancer genome atlases5 will likely strengthen the analysis further.

In summary, TargetTranslator improves our understanding of neuroblastoma interventions and provides a practical tool for drug discovery. Users can upload their own data, which makes TargetTranslator easy to extend to other research areas where phenotype-drug-target relations are of interest, e.g. in other types of cancer or in the reprogramming of cell states (compare ADRN/MES in this paper).

## Methods

### Data sources

We obtained 835 neuroblastoma patient molecular and clinical data from the R2, TARGET and SEQC data sets (download links in Supplementary Methods). All mRNA data were log-transformed (log Affymetrix gene signal for R2 and TARGET, log(FPKM+1) for SEQC) and row centred. Clinical data, genetic aberrations were obtained for each data set and gene signatures of differentiation used markers (listed in Supplementary Data 1). Level 3 LINCS/L1000 data were processed by the Remove Unwanted Variation (RUV) algorithm to remove batch effects and pooling of replicates to yield drug-specific log2 fold change profiles for each drug, relative to vehicle (DMSO) control (Supplementary Methods and 55). We used RUV to pool information across doses and time points, to yield integrated fold change profiles for each drug that were used for analysis (Supplementary Methods and benchmarking in Supplementary Fig. 1B). We obtained protein–protein and drug-protein network data from string-db.org and stitch.embl.de defining a combined score above 900 of 1000 as a link (Supplementary Methods). The processed data can be accessed at targettranslator.org/downloads.

### TargetTranslator method

Data from profiled and clinically annotated tumour samples are arranged into two types of matrices. The first, which we term the disease data, contains uni- or multivariate data, available for all patients, which is relevant for risk or outcome, i.e. all items in Table 2. The second contains the corresponding RNA data for these tumour samples, restricted to the gene set in LINCS/L1000. We denote disease data matrices by $${\bf{Y}}(i)$$ and RNA-matrices by $${\bf{Z}}(i)$$, for cohorts $$i=1,2,\ldots$$. The disease data matrices thus have dimensionality $${\mathrm{dim}}({\bf{Y}}(i))={p}_{{\mathrm{disease}}}\times {n}_{i}$$, where the same set of $${p}_{{\mathrm{disease}}}$$ risk-associated factors are considered across all cohorts. The RNA data are of dimension $${\mathrm{dim}}({\bf{Z}}(i))={p}_{{\mathrm{LINCS}}}\times {n}_{i}$$. We assume that the risk data in $${\bf{Y}}(i)$$ can be summarized with a low-rank feature such that

$${\bf{Y}}(i)={\bf{H}}(i){\bf{F}}(i)+{\epsilon }_{Y}(i),$$
(1)

where $${\mathrm{dim}}({\bf{H}}(i))={p}_{{\mathrm{disease}}}\times k,{\mathrm{dim}}({\bf{F}}(i))=k\ \times n$$. $${\bf{F}}(i)$$ represents a $$k$$-dimensional feature across patients in cohort $$i$$ that summarizes the patient variability across patients with respect to the $${p}_{{\mathrm{disease}}}$$ outcome associated factors. TargetTranslator extracts these features through a low-rank matrix decomposition (SVD). Next, we project data $${\bf{Z}}(i)$$ onto the extracted features $${\bf{F}}(i)$$;

$${\bf{Z}}(i)={\bf{B}}(i){\bf{F}}(i)+{\epsilon }_{{\bf{Z}}}(i)$$
(2)

where $${\mathrm{di}}({\bf{B}}(i))={p}_{{\mathrm{LINCS}}}\times k,{\mathrm{dim}}({\bf{F}}(i))=k\ \times {n}_{i}$$. The matrix $${\bf{B}}(i)$$ are the k-dimensional signatures for the L1000 landmark genes. These are obtained through the regression

$$\widehat{{\bf{B}}}(i)={\bf{Z}}(i){\bf{F}}{(i)}^{T}{({\bf{F}}(i){\bf{F}}{(i)}^{T})}^{-1}.$$
(3)

TargetTranslator can also be run in a supervised setting; here, the user provides the latent variables $${\bf{H}}(i)$$, typically as gene signatures (c.f. Supplementary Data 1), whereby $${\bf{F}}(i)$$ is found by least squares. In another special case, the method can accommodate left-censored survival data. For this, the feature $${\bf{F}}$$ is defined to be the log-proportional hazard of each patient, as obtained by a Cox regression model (Supplementary Methods).

Next, we are concerned with the matching between the signatures $$\widehat{{\bf{B}}}(i),i=1,2,...$$ and the full LINCS/L1000 compendium, which we have organised into a 3-dimensional table, in which the $${p}_{\mathrm{LINCS}}$$ rows are genes, columns $$C=19763$$ are drugs (or shRNAs) and $$S=14$$ layers are cell lines (we use the $$N=14$$ cell lines in LINCS/L1000 with at least 1000 unique drugs). We use $${{\bf{g}}}_{r,s}$$ to denote the (z-transformed) gene expression vector for drug $$r$$ in cell line $$s$$ and the function $$\sigma ({\bf{g}},{\bf{h}})=\sigma ({{\bf{g}}}^{T}{\bf{h}})$$ to denote the similarity between any two profiles, where $${{\bf{g}}}^{T}{\bf{h}}$$ is the scalar product between $${\bf{g}}$$ and $${\bf{h}}$$. The explicit form for $$\sigma (\cdot ,\cdot )$$ is given in the Supplementary Methods. Finally, we compute the match score of a perturbation $$r$$, is defined as:

$$S(r)=\bar{\sigma }(r)\frac{1}{N}\sum _{s=1}^{N}\sigma ({{\bf{g}}}_{r,s},\pm {\bf{B}}(1))\ \times \sigma ({{\bf{g}}}_{r,s},\pm \ {\bf{B}}(2))\times \ldots ,$$
(4)

where $${\pm}{\bf{B}}$$ denotes that the score is computed either with respect to $$-1\times {\bf{B}}$$, to detect drugs that suppress the signature, or $$+1\times {\bf{B}}$$, to detect drugs that enhance the signature. $$\bar{\sigma }(r)$$ denotes the drug-specific propensity for signatures to match across cell lines, estimated from LINCS/L1000 data (precise definition in the Supplement). This is weighed together with the average (expected) similarities between LINCS profiles and the TargetTranslator signatures $${\bf{B}}(i)$$ across all cohorts. The resulting match score has three key properties. First, since each of the $$\sigma$$ terms has a value between 0 (no match) and 1 (match at a level expected for the same target), the match score will be on the interval between 0 and 1. Second, the $$\bar{\sigma }$$ term specifically serves the purpose of giving a stronger weight to any perturbation that gives a consistent response across model cell lines. Third, since a product is formed between the $$\sigma$$ values across all cohorts, this score will emphasize matches that are consistently observed across cohorts. Permutation simulations are described in the Supplementary Methods, benchmarking in (Supplementary Fig. 1B–D).

For target deconvolution, we consider the relationship between the aggregate scores $$S(r),r=1,2,\ldots$$ of the different perturbations; and the pathway context or target of each perturbation. For a given protein target, we use the data in STITCH to divide all perturbations into two sets: R+ and R−. The set R+ contains all perturbations with a STITCH score >900 (this is considered a stringent hit, whereas 700 is significant8). The set R- contains all perturbations below that threshold. We subsequently apply a two-sample, one-tailed, Kolmogorov–Smirnov test to test the sample $$\{S(r),r\in R+\}$$ vs the sample $$\{S(r),r\in R-\}$$. P-values are corrected by mafdr to obtain FDR q-values.

### Compounds and cell culture

High scoring compounds chosen for further analysis were obtained from Selleckchem (omipalisib, torin-2, palbociclib, AZD5438, rosiglitazone, fasudil, JQ1, GW842166X, otenabant), Santa Cruz Biotechnology (DL-PDMP), Sigma-Aldrich (GW405833, PK11195, AS601245, retinoic acid, SR144528), Tocris (ACEA, HU308, JWH133), MedChemExpress (CC-930, SP600125, D-JNKI-1, JNK-IN-8) and Enzo (lovastatin). All compounds were dissolved in DMSO or MilliQ water (palbociclib) according to the vendor’s instruction to a stock of 10–20 mM. Patient-derived xenograft (PDX) cell lines NB-PDX2 and NB-PDX3 (termed LU-NB-2 and LU-NB-3 in previous publications) were cultured as described on LN-521 (Biolamina) coated primaria plates (VWR) in defined, bFGF/EGF supplemented medium56,57. NB-PDX2 derives from a cerebral metastasis of a stage 4 neuroblastoma patient and NB-PDX3 derives from a primary tumour in the adrenal gland of a stage 3 patient; both are MYCN amplified with 1p loss and 17q gain. SK-N-BE(2) (ATCC®CRL-2271$${}^{{}^{\text{TM}}}$$) and SK-N-SH (ATCC®HTB-11$${}^{{}^{\text{TM}}}$$) neuroblastoma lines were grown in the same medium as the PDX lines but on uncoated primaria plates. U3013MG glioblastoma cells from the HGCC biobank58 were grown in bFGF/EGF supplemented medium58,59. NB-PDX2 and NB-PDX3 cells (Figs. 47) were authenticated by SNP profiling (Multiplexion, Germany). SK-N-BE(2) cell lines (Fig. 7) and U3013MG were confirmed by STR profiling. SK-N-SH (used in Fig. 5a) were not authenticated. Cells were tested for mycoplasma contamination on a regular basis. All cells were cultured under 5% CO2 pressure at 37 °C to maximum confluency of 80% and detached using StemPro Accutase (ThermoFisher Scientific) during passaging.

### Pharmaco-transcriptomics of predicted drugs

The full protocol is provided in the Supplementary Methods. In outline, cells were seeded 1 day prior to treatment in LN-521-coated 384-well microplates (BD Falcon Optilux #353962), at a density of 2000 cells per well. Drugs and vehicle controls were added by an automated workstation to implement a design that consisted of 16 treatments (13 compounds, 2 DMSO concentrations, 1 untreated cell medium) × 3 doses (equivalent to IC50, IC20 and IC10 (Supplementary Table 2)) × 4 replicates × 2 cell lines (total 384). The RNA-Seq libraries were prepared in a 384-well plate format using the SMART-Seq2 method60 and sequenced on an Illumina HiSeq 3000 instrument for 51 cycles, targeting around 0.8 million reads per well as an output, which is sufficient to probe the transcriptome at the level of principal components, gene signatures and pathways61,62. High-quality reads were mapped against the Ensembl Homo sapiens (GRCh38, ftps://ensembl.org/pub/release-90/fasta/homo_sapiens/dna/). The uniquely-mapped reads aligned to exons were counted by HTSeq v.0.6.163, then normalized by the DESeq2 R package v.1.14.164.

The principal component analysis (PCA) plot in (Fig. 4) was computed from drug-induced log fold change values, pooling replicates for each dose. Receiver operating characteristic (ROC) curves based on comparison of (i) the correlation between neuroblastoma risk signatures and each drug’s average fold change signature in LINCS/L1000 (here seen as the prediction) and (ii) the correlation between neuroblastoma risk signatures and each drug’s corresponding average fold change signature in our RNA-Seq experiment in PDX cells (here seen as the ground truth). ROC curves (x = 1-specificity of prediction in relation to ground truth; y = sensitivity) and area under the curve (AUC) integrals of ROC curves were computed using Matlab’s roc.m and trapz.m functions. Gene Set Enrichment Analysis (GSEA) was performed using the GSEAPreranked option in the GSEA software (version 2.0.9)65. For each compound, a ranked list was prepared by running differential expression analysis on transcriptomic data from treated and control samples using the DESeq2 R package v.1.14.164. GeneIDs and log2 fold change values were extracted from the analysis output and used to create the ranked list. These were then submitted to GSEA in the pre-ranked mode using default parameters.

### In vitro evaluation of predicted drugs

Cells were seeded on the day before treatment, followed by 72 h of drug exposure. Viability was detected using the metabolic activity assay AlamarBlue (Invitrogen), according to the manufacturer’s protocol, and standardized by vehicle controls. Doses were chosen to include the TargetTranslator predicted dose. Dose–response sigmoidal curves were fitted using GraphPad Prism and used to select doses corresponding to the IC10, IC20, IC50 and IC80 concentrations for subsequent experiments. One-way ANOVA and Dunnett’s multiple comparisons test were used to compare all neuroblastoma cell lines against the non-neuroblastoma line U3013MG.

For western blotting, 500,000 cells were seeded in LN-521 (Biolamina) coated 6-well Primaria plates (VWR), treated for 48 h with IC50 equivalent concentrations of positive controls (JQ1 and retinoic acid), as well as explored compounds. After lysis in RIPA buffer (ThermoScientific), 25 μg of protein was separated on a NuPAGE 4–12% bis-tris gel (Invitrogen) and transferred using the iBlot Gel transfer Stacks nitrocellulose mini kit (Invitrogen) on an iblot®gel transfer device (Invitrogen). Membranes were blocked with StartingBlocking T20 TBS (ThermoScientific) and stained with antibodies against the N-Myc (#ab16898, Abcam, 1:250) and with cyclophilin B (#ab16045, Abcam, 1:500) as a loading control. N-Myc protein expression was normalized against cyclophilin loading control and analysed as a ratio to non-treated control using a one-sample t-test with Benjamini–Hochberg false discovery rate (FDR) correction.

For apoptosis analysis, 10,000 NB-PDX2/3 cells were seeded on LN-521 coated 96-well primaria plates and treated with 0.1% IncuCyte Caspase-3/7 Green reagent for apoptosis (EssenBioscience #4440). Cells were treated with palbociclib 0.1 μM, lovastatin 3 μM, fasudil 50 μM, GW405833 6 μM, omipalisib 0.02 μM, AZD5438 7 μM, JQ1 3 μM, DL-PDMP 50 μM, retinoic acid 120 μM, PK11195 90 μM, torin-2 0.04 μM, DMSO 0.5%. Phase-contrast and fluorescent images were acquired over 96 h with the IncuCyte Zoom instrumentation (EssenBioscience). Time-lapse confluence and apoptosis were automatically masked using the built-in software and apoptotic response was quantified as apoptotic area (μm2 of image) per confluence (% of image) and shown as a ratio to vehicle control.

To analyse confluence and differentiation, we collected and analysed phase-contrast images of NB-PDX2 or NB-PDX3 cells (2000 cells per well) plated in LN-521-coated 384-well microplates (BD Falcon Optilux #353962). Images were taken every 2 h for a total of 3 days, using a design with two replicate wells for each 2 cell lines, 7 doses and 11 compounds. The images were analysed by an in-house pipeline (Supplementary Methods, Supplementary Fig. 9) based on the U-net66 convolutional neural network (CNN) for semantic segmentation. The CNN was trained to find cell protrusions as well as cell confluency. The number of protrusions of length > d in an image is roughly distributed as an exponential distribution. Accordingly, we fit the function $$A\cdot {e}^{-k\cdot d}$$, where A is the number of protrusions, d is the length and k is the decay, for each image. We define:

$${Y}_{np}=\frac{A}{{N}_{bp}}$$
(5)
$${Y}_{md}=-1000\cdot k$$
(6)

where $${N}_{bp}$$ is the total length of the cell-background interface, $${Y}_{np}$$ is the normalized amount of protrusions and $${Y}_{md}$$ is the morphological differentiation score and describes the relationship between long and short protrusions in the image. Cells with long protrusions will hence have a larger morphological differentiation score. The confluence and protrusion data were quantified and assessed statistically using linear mixed effects models and bootstrapping, as described in the Supplementary Methods.

### Neuroblastoma zebrafish xenografts

All zebrafish experiments have been approved by the regional animal ethics board (C68/15, 5.8.1-08213/2017, EP 161/14) and the conducted zebrafish experiments comply with all relevant ethical regulations for animal testing. Drug tolerance was measured by exposure of 2-day post fertilization (dpf) zebrafish embryos to water containing drugs with IC20, 50 and 80 levels of drug based on the viability data above. The level of toxicity was noted on a numerical scale, where 0 = no toxicity, 1 = toxicity noted only in one or two fish, 2 = weak signs of toxicity in several fish but no deaths, 3 = strong signs of toxicity but no deaths, 4–5 = lethal doses of drugs, with 5 suggesting that the effect might have been instant.

NB-PDX3 cells were transfected with a lentiviral construct containing GFP, luciferase and puromycin resistance genes (pBMN(CMV-copGFP-Luc2-Puro), Addgene plasmid # 80389, a kind gift from Prof. Magnus Essand, Uppsala University) and selected under puromycin treatment, following FACS sorting to obtain GFP expressing cells which were cultured as described above. Prior to embryo injection, cells were detached with Accutase, passed over a 40 μm sterile filter, pelleted, and resuspended in minimal media. Using a procedure described in detail in the Supplementary Methods, anesthetized 1 dpf fish were injected with 100-200 GFP-tagged neuroblastoma cells into the midbrain. Xenografted zebrafish were kept in E3 water at 33 °C. At 2 dpf, zebrafish were screened for tumour engraftment, manually sorted in 200 μl of E3 water into 96-well plates (1 per well), anesthetized, and imaged dorsally and laterally (×10 objective) for GFP baseline signal with a Vertebrate Automated Screening Technology (VAST) BioImager (Union Biometrica) using a LEICA DM6 B microscope and a Leica DFC9000 camera. Each drug was diluted in an additional 100 μl of E3 water and added to yield the desired end concentration. On day 5, zebrafish were sacrificed with 2.4 mM tricane and an endpoint image was acquired.

Zebrafish embryos were fixed in formaldehyde (PFA) and embedded in paraffin. Stainings were done on 6-μm sections, either with hematoxylin and eosin using standard procedures or with immunofluorescent co-staining of MKI67 (#M7240, DAKO), NuMA (#ab97585, Abcam) and DAPI (#00-4959-52, ThermoFisher Scientific). Stained zebrafish were imaged on an AxioImager (Zeiss). The detailed protocol is in Supplementary Methods.

### Neuroblastoma mouse xenografts

All mouse experiments were approved by the regional ethics committee for animal research (N 231/14) and the conducted mouse experiments comply with all relevant ethical regulations for animal testing. For the xenograft study, SK-N-BE(2) cells ($$15\times 1{0}^{6}$$) were injected subcutaneously into the flanks of immunodeficient nude mice (female 5–6-weeks-old NMRI-nu/nu, Taconic). Mice with palpable tumours (0.20 cm$${}^{3}$$ or greater) were randomized to receive either GW405833 (45 mg per kg, n = 10), AS601245 (90 mg per kg, n = 8) or vehicle (n = 12) intraperitoneally for 8 days, or for 2–8 days. At the endpoint, tumours were dissected in smaller parts and either frozen or fixed in 4% PFA, for further analyses. Additional details and statistics are provided in Supplementary Methods.

Paraffin-embedded mouse samples from all treatment groups (n = 3–5) were cut into 6 μm sections and stained with hematoxylin and eosin (HE) or immunostained against proliferation marker MKI67 (#M7240, DAKO) and apoptosis marker cleaved PARP (#5625, Cell Signaling Technology) using a standard immunohistochemistry protocol. Antigens were retrieved in citric acid-based buffer (#H-3300, Vector Laboratories) containing 0.05% Tween-20 in 2100 Antigen Retriever (Aptum Biologics Ltd.), sections were blocked with Animal-free blocking solution (#15019, Cell Signaling Technology), primary antibodies were diluted in normal antibody diluent (Immunologic), BrightVision Goat Anti-Rabbit or Anti-Mouse HRP Polymer (Immunologic) was applied and signal visualized using BrightDAB (Immunologic). Sections were counterstained with Mayer’s hematoxylin (Histolab) and mounted with Pertex (Histolab). Slides were scanned using Nanozoomer S60 (Hamamatsu) and representative images were taken with ×40 magnification. The number of positive nuclei was assessed using ImmunoRatio67 plugin for ImageJ (NIH). From each section 10–15 images were taken, covering at least 50% of the total section area. Results were presented as a percent of positive nuclei.

### Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

## Data availability

The sequencing data that support the findings of this study have been deposited in the Gene Expression Omnibus (GEO) with the accession code GSE120920. The source data underlying Figs. 4g, 5, 6b, e, 7 and Supplementary Figs. 4, 5, 6, 9, are provided in the Source Data file. Signatures used in the manuscript are defined in Supplementary Data 1. Additional data files, including the processed LINCS/L1000 and STITCH data can be accessed at targettranslator.org/downloads. For information on materials, contact SN (sven.nelander[at]igp.uu.se).

## Code availability

An R version of the Targettranslator server is available on GitHub https://github.com/Targettranslator/targettranslator. For question regarding code, contact (sven.nelander[at]igp.uu.se).

## References

1. 1.

Maris, J. M., Hogarty, M. D., Bagatell, R. & Cohn, S. L. Neuroblastoma. Lancet 369, 2106–2120 (2007).

2. 2.

Brodeur, G. M. Spontaneous regression of neuroblastoma. Cell Tissue Res. 372, 277–286 (2018).

3. 3.

Cohn, S. L. et al. The International Neuroblastoma Risk Group (INRG) classification system: an INRG Task Force report. J. Clin. Oncol. 27, 289–297 (2009).

4. 4.

Bresler, S. C. et al. ALK mutations confer differential oncogenic activation and sensitivity to ALK inhibition therapy in neuroblastoma. Cancer Cell 26, 682–694 (2014).

5. 5.

Grobner, S. N. et al. The landscape of genomic alterations across childhood cancers. Nature 555, 321–327 (2018).

6. 6.

Pinto, N. R. et al. Advances in risk classification and treatment strategies for neuroblastoma. J. Clin. Oncol. 33, 3008–3017 (2015).

7. 7.

Subramanian, A. et al. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 171, 1437–1452 (2017).

8. 8.

Szklarczyk, D. et al. STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res. 44, D380–D384 (2016).

9. 9.

van Groningen, T. et al. Neuroblastoma is composed of two super-enhancer-associated differentiation states. Nat. Genet. 49, 1261–1266 (2017).

10. 10.

White, P. S. et al. Definition and characterization of a region of 1p36.3 consistently deleted in neuroblastoma. Oncogene 24, 2684–2694 (2005).

11. 11.

De Preter, K. et al. Accurate outcome prediction in neuroblastoma across independent data sets using a multigene signature. Clin. Cancer Res. 16, 1532–1541 (2010).

12. 12.

Ribeiro, D. et al. Regulation of nuclear hormone receptors by MYCN-driven miRNAs impacts neural differentiation and survival in neuroblastoma patients. Cell Rep. 16, 979–993 (2016).

13. 13.

Kling, T. et al. Integrative modeling reveals annexin A2-mediated epigenetic control of mesenchymal glioblastoma. EBioMedicine 12, 72–85 (2016).

14. 14.

Jornsten, R. et al. Network modeling of the transcriptional effects of copy number aberrations in glioblastoma. Mol. Syst. Biol. 7, 486 (2011).

15. 15.

Kling, T. et al. Efficient exploration of pan-cancer networks by generalized covariance selection and interactive web content. Nucleic Acids Res. 43, e98 (2015).

16. 16.

Gundersen, G. W. et al. GEO2Enrichr: browser extension and server app to extract gene sets from GEO and analyze them for biological functions. Bioinformatics 31, 3060–3062 (2015).

17. 17.

Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90–W97 (2016).

18. 18.

Duan, Q. et al. L1000CDS2: LINCS L1000 characteristic direction signatures search engine. NPJ Syst. Biol. Appl. 2, 16015 (2016).

19. 19.

Wang, Z. et al. Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd. Nat. Commun. 7, 12846 (2016).

20. 20.

Molenaar, J. J. et al. Sequencing of neuroblastoma identifies chromothripsis and defects in neuritogenesis genes. Nature 483, 589–593 (2012).

21. 21.

Pugh, T. J. et al. The genetic landscape of high-risk neuroblastoma. Nat. Genet. 45, 279–284 (2013).

22. 22.

Wang, C. et al. The concordance between RNA-seq and microarray data depends on chemical treatment and transcript abundance. Nat. Biotechnol. 32, 926–932 (2014).

23. 23.

Brodeur, G. M. et al. Revisions of the international criteria for neuroblastoma diagnosis, staging, and response to treatment. J. Clin. Oncol. 11, 1466–1477 (1993).

24. 24.

Guan, J. et al. Clinical response of the novel activating ALK-I1171T mutation in neuroblastoma to the ALK inhibitor ceritinib. Cold Spring Harb. Mol. Case Stud. 4, a002550 (2018).

25. 25.

Lambertz, I. et al. Upregulation of MAPK negative feedback regulators and RET in mutant ALK neuroblastoma: implications for targeted treatment. Clin. Cancer Res. 21, 3327–3339 (2015).

26. 26.

Clark, N. R. et al. The characteristic direction: a geometrical approach to identify differentially expressed genes. BMC Bioinformatics 15, 79 (2014).

27. 27.

Vaughan, L. et al. Inhibition of mTOR-kinase destabilizes MYCN and is a potential therapy for MYCN-dependent tumors. Oncotarget 7, 57525–57544 (2016).

28. 28.

Richards, K. N. et al. Signaling of ERBB receptor tyrosine kinases promotes neuroblastoma growth in vitro and in vivo. Cancer 116, 3233–3243 (2010).

29. 29.

Chen, Z. et al. Multiple CDK inhibitor dinaciclib suppresses neuroblastoma growth via inhibiting CDK2 and CDK9 activity. Sci. Rep. 6, 29090 (2016).

30. 30.

Sidell, N., Altman, A., Haussler, M. R. & Seeger, R. C. Effects of retinoic acid (RA) on the growth and phenotypic expression of several human neuroblastoma cell lines. Exp. Cell Res. 148, 21–30 (1983).

31. 31.

Cellai, I. et al. In vivo effects of rosiglitazone in a human neuroblastoma xenograft. Br. J. Cancer 102, 685–692 (2010).

32. 32.

Dyberg, C. et al. Rho-associated kinase is a therapeutic target in neuroblastoma. Proc. Natl Acad. Sci. USA 114, E6603–E6612 (2017).

33. 33.

Puissant, A. et al. Targeting MYCN in neuroblastoma by BET bromodomain inhibition. Cancer Discov. 3, 308–323 (2013).

34. 34.

Kang, J. H. et al. MYCN silencing induces differentiation and apoptosis in human neuroblastoma cells. Biochem. Biophys. Res. Commun. 351, 192–197 (2006).

35. 35.

Westermark, U. K., Wilhelm, M., Frenzel, A. & Henriksson, M. A. The MYCN oncogene and differentiation in neuroblastoma. Semin. Cancer Biol. 21, 256–266 (2011).

36. 36.

Guglielmi, L. et al. MYCN gene expression is required for the onset of the differentiation programme in neuroblastoma cells. Cell Death Dis. 5, e1081 (2014).

37. 37.

Soethoudt, M. et al. Cannabinoid CB2 receptor ligand profiling reveals biased signalling and off-target activity. Nat. Commun. 8, 13958 (2017).

38. 38.

Hadcock, J. R. et al. In vitro and in vivo pharmacology of CP-945,598, a potent and selective cannabinoid CB(1) receptor antagonist for the management of obesity. Biochem. Biophys. Res. Commun. 394, 366–371 (2010).

39. 39.

Hillard, C. J. et al. Synthesis and characterization of potent and selective agonists of the neuronal cannabinoid receptor (CB1). J. Pharmacol. Exp. Ther. 289, 1427–1433 (1999).

40. 40.

Zhang, T. et al. Discovery of potent and selective covalent inhibitors of JNK. Chem. Biol. 19, 140–154 (2012).

41. 41.

Bennett, B. L. et al. SP600125, an anthrapyrazolone inhibitor of Jun N-terminal kinase. Proc. Natl Acad. Sci. USA 98, 13681–13686 (2001).

42. 42.

Plantevin Krenitsky, V. et al. Discovery of CC-930, an orally active anti-fibrotic JNK inhibitor. Bioorg. Med. Chem. Lett. 22, 1433–1438 (2012).

43. 43.

Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564–576 (2017).

44. 44.

Matthay, K. K. et al. Central nervous system metastases in neuroblastoma: radiologic, clinical, and biologic features in 23 patients. Cancer 98, 155–165 (2003).

45. 45.

Zhu, J. et al. Brain metastasis in children with stage 4 neuroblastoma after multidisciplinary treatment. Chin. J. Cancer 34, 531–537 (2015).

46. 46.

Hirvonen, J. et al. Human biodistribution and radiation dosimetry of 11C-(R)-PK11195, the prototypic PET ligand to image inflammation. Eur. J. Nucl. Med. Mol. Imaging 37, 606–612 (2010).

47. 47.

Liu, J. et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 173, 400–416 (2018).

48. 48.

Weiss, W. A., Aldape, K., Mohapatra, G., Feuerstein, B. G. & Bishop, J. M. Targeted expression of MYCN causes neuroblastoma in transgenic mice. EMBO J. 16, 2985–2995 (1997).

49. 49.

Eleveld, T. F. et al. Relapsed neuroblastomas show frequent RAS-MAPK pathway mutations. Nat. Genet. 47, 864–871 (2015).

50. 50.

Eleveld, T. F. et al. RAS-MAPK pathway-driven tumor progression is associated with loss of CIC and other genomic aberrations in neuroblastoma. Cancer Res. 78, 6297–6307 (2018).

51. 51.

Fey, D. et al. Signaling pathway models as biomarkers: Patient-specific simulations of JNK activity predict the survival of neuroblastoma patients. Sci. Signal 8, ra130 (2015).

52. 52.

Wojcieszak, J., Krzemień, W. & Zawilska, J. B. JWH-133, a selective cannabinoid $$C{B}_{2}$$ receptor agonist, exerts toxic effects on neuroblastoma SH-SY5Y cells. J. Mol. Neurosci. 58, 441–445 (2016).

53. 53.

Molenaar, J. J. et al. LIN28B induces neuroblastoma and enhances MYCN levels via let-7 suppression. Nat. Genet. 44, 1199–1206 (2012).

54. 54.

Zhu, S. et al. Activated ALK collaborates with MYCN in neuroblastoma pathogenesis. Cancer Cell 21, 362–373 (2012).

55. 55.

Lonnstedt, I. M. & Nelander, S. FC1000: normalized gene expression changes of systematically perturbed human cells. Stat. Appl. Genet. Mol. Biol. 16, 217–242 (2017).

56. 56.

Braekeveldt, N. et al. Neuroblastoma patient-derived orthotopic xenografts retain metastatic patterns and geno- and phenotypes of patient tumours. Int. J. Cancer 136, E252–E261 (2015).

57. 57.

Persson, C. U. et al. Neuroblastoma patient-derived xenograft cells cultured in stem-cell promoting medium retain tumorigenic and metastatic capacities but differentiate in serum. Sci Rep 7, 10274 (2017).

58. 58.

Xie, Y. et al. The human glioblastoma cell culture resource: validated cell models representing all molecular subtypes. EBioMedicine 2, 1351–1363 (2015).

59. 59.

Pollard, S. M. et al. Glioma stem cell lines expanded in adherent culture have tumor-specific phenotypes and are suitable for chemical and genetic screens. Cell Stem Cell 4, 568–580 (2009).

60. 60.

Picelli, S. et al. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protoc. 9, 171–181 (2014).

61. 61.

Bush, E. C. et al. PLATE-Seq for genome-wide regulatory network analysis of high-throughput screens. Nat. Commun. 8, 105 (2017).

62. 62.

Ye, C. et al. DRUG-seq for miniaturized high-throughput transcriptome profiling in drug discovery. Nat. Commun. 9, 4307 (2018).

63. 63.

Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).

64. 64.

Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).

65. 65.

Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).

66. 66.

Ronneberger, O., Fischer, P. & Brox, T. in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015 (eds Navab, N., Hornegger, J., Wells, W. M. & Frangi, A. F.) 234–241 (Springer International Publishing, Cham, 2015).

67. 67.

Tuominen, V. J., Ruotoistenmaki, S., Viitanen, A., Jumppanen, M. & Isola, J. ImmunoRatio: a publicly available web application for quantitative image analysis of estrogen receptor (ER), progesterone receptor (PR), and Ki-67. Breast Cancer Res. 12, R56 (2010).

## Acknowledgements

We thank Dr. Mauro Dacasto and Dr. Mery Giantin for providing access to their computational cluster. We thank Prof Björn Nilsson for valuable comments on the manuscript. We thank Wasifa Kabir for help with cell experiments, Dr María Victoria Ruiz Peréz for valuable help with marker gene lists and Dr Marek Skupiński for assistance with illustrations. We thank the Swedish Childhood Cancer Foundation, Swedish Cancer Society, Swedish Research Council and the Swedish Strategic Research Foundation for financial support. Open access funding provided by Uppsala University.

## Author information

Authors

### Contributions

S.N. conceived the project. E.A. and S.N. designed the study with support from C.K. and R.E. S.N., C.W., A.S. and R.J. developed the computational method. E.A., N.H., R.E., designed and performed the cell experiments. R.E. and M.V. performed the RNA profiling. MAH developed marker gene lists for Diff2 and Diff3. E.A., N.H. and S.N. developed the zebrafish assay. E.R. performed the image analysis and statistical analyses. T.K.O., C.D., P.K., M.D. performed the mouse study. S.P. and D.B. developed the NB-PDX cell cultures. E.A., R.E. and S.N. drafted the figures.  R.E. and I.L. performed the GSEA. E.A. and S.N. wrote the first version of the paper.

### Corresponding author

Correspondence to Sven Nelander.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Peer review information Nature Communications thanks Bin Chen, Rodney Stewart and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Almstedt, E., Elgendy, R., Hekmati, N. et al. Integrative discovery of treatments for high-risk neuroblastoma. Nat Commun 11, 71 (2020). https://doi.org/10.1038/s41467-019-13817-8