Perturbation-based gene regulatory network inference to unravel oncogenic mechanisms

Morgan, Daniel; Studham, Matthew; Tjärnberg, Andreas; Weishaupt, Holger; Swartling, Fredrik J.; Nordling, Torbjörn E. M.; Sonnhammer, Erik L. L.

doi:10.1038/s41598-020-70941-y

Article
Open access
Published: 25 August 2020

Daniel Morgan¹,
Matthew Studham¹,
Andreas Tjärnberg^1,2,
Holger Weishaupt³,
Fredrik J. Swartling³,
Torbjörn E. M. Nordling⁴ &
Erik L. L. Sonnhammer¹

Scientific Reports volume 10, Article number: 14149 (2020)

Abstract

The gene regulatory network (GRN) of human cells encodes mechanisms to ensure proper functioning. However, if this GRN is dysregulated, the cell may enter into a disease state such as cancer. Understanding the GRN as a system can therefore help identify novel mechanisms underlying disease, which can lead to new therapies. To deduce regulatory interactions relevant to cancer, we applied a recent computational inference framework to data from perturbation experiments in squamous carcinoma cell line A431. GRNs were inferred using several methods, and the false discovery rate was controlled by the NestBoot framework. We developed a novel approach to assess the predictiveness of inferred GRNs against validation data, despite the lack of a gold standard. The best GRN was significantly more predictive than the null model, both in cross-validated benchmarks and for an independent dataset of the same genes under a different perturbation design. The inferred GRN captures many known regulatory interactions central to cancer-relevant processes in addition to predicting many novel interactions, some of which were experimentally validated, thus providing mechanistic insights that are useful for future cancer research.

Introduction

Cancer can be seen as an altered state of the regulatory systems that control cell proliferation and cell death. Such systems are generally not sensitive to individual gene malfunctions, but an aggregation of aberrations can lead to sufficient dysregulation to cause cancer. Reliable models of these regulatory interactions would offer insight into key mechanistic alterations for therapeutic targeting. Cancer subtype-specific gene regulatory networks (GRNs) encode intracellular dynamics¹, and offer insight into the functional changes driving disease development. Inference of such models generally exploits certain aspects of the experimental setup, such as pooling among replicates to amplify signal, or makes use of prior knowledge^2,3. The experimental techniques, setup, and data collection quality determine the quality of an inferred GRN model. However, practical limitations of experimentation, such as high noise levels and few experiments relative to the vast combinatorial landscape of possible regulatory interactions, often prevent any GRN inference method from inferring a correct GRN⁴. Methods using data from known and systematic perturbations have shown greater accuracy among inference techniques since more information is available to determine regulatory causal mechanisms in the system⁵.

GRN inference has proven its value in unraveling novel regulatory links of biological significance. For instance, ARACNe was applied to gene expression profiles to predict a glioma-specific GRN, revealing that C/EBPbeta and STAT3 are master regulators of mesenchymal transformation, which was validated experimentally⁶. In another study, eight key genes were knocked down by siRNA, and the gene expression together with prior knowledge was used to infer a GRN of the RAS pathway with good validation performance⁷.

A large number of GRN inference algorithms exist. In a survey by the DREAM5 project⁸ it was shown on simulated and E. coli data that some methods performed better than random predictions. However, many methods did not outperform random prediction, and on yeast data no method performed much better than random selection. Since this study, the community has developed methods for integrating various priors: literature/database, ATAC-seq, DNase I hypersensitive sites, ChIP-Seq, or proteomics data to increase information about the system^9,10,11. A few trends can be seen from the three GRN benchmarks DREAM5⁸, GeneNetWeaver¹² and NetBenchmark¹³. In these studies, which benchmarked 35, 6, and 10 inference methods, respectively, the methods were found to produce AUPR values ranging from 0 to 0.3, with high values being rare. This variability is also seen for individual methods across different benchmark studies. For instance, while Genie3 is the best performing method in DREAM5, it performed relatively poorly in the other two. Such disparities may be caused by differences in the specific conditions under which the benchmarks were run or in parameters of the synthetic data creation such as size, noise, and network properties. While some methods employ relatively simple computational techniques and therefore can scale to thousands of genes, they tend to produce low accuracy in benchmarks. DREAM5 grouped inference methods into the categories Regression, Mutual Information, Correlation, Bayesian networks, Other approaches, and Meta predictors. Overall, no category clearly outperformed all the other ones, and within each category there is a mix of well and poorly performing methods. Even methods such as neural networks, which in other research settings have performed exceedingly well¹⁴, performed poorly here.

These benchmarks and surveys show that network inference from expression data alone remains a very challenging task. Integrative approaches can improve performance but depend on the availability of different types of omics data, and face challenges such as varying experimental setups, heterogeneity, and quality of input data. In many cases, however, only expression data are available, and then the quality of the data is paramount for accurate GRN inference^15,16.

In this study, we deployed perturbations through siRNA gene knockdown of each gene in our literature-curated set of cancer-related regulator genes, each followed by transcriptomics measurements of all genes, in order to measure the global influence of each individual gene. Knockdown experiments are more informative about the system than irreversible and complete knockout which may cause drastic rewiring of the underlying network into another system entirely. Assuming a linear time invariant (LTI) system¹⁷, once the system has reached a steady-state, a GRN can be inferred by solving a set of first order ordinary differential equations (ODEs)¹⁸ in the form of our linear model (Eq. 1). Importantly, our linear model is reliant on a known perturbation design, which adds valuable information to the inference. In this way a selected set of 40 genes relevant to cancer were perturbed, and the transcriptomic response data were used to construct a model of underlying regulatory interactions. We inferred these interactions by relating the effect of the gene perturbations to the expression of the readout genes, using three GRN inference algorithms well suited for employing our linear model and perturbation design: LASSO, LSCO, and TLSCO. The regulatory interactions inferred by these methods are not limited to direct physical interactions, but should be seen as regulatory influences, which may be indirect via genes that are not modeled because they were not measured.

A drawback of all GRN inference algorithms is that they generally produce erroneous GRNs if the noise level is high^15,16. To ensure inference of reliable GRNs, we employed NestBoot¹⁹, a recent framework implementing nested bootstrapping, wrapped around any individual GRN inference method to better account for sample variation and noise. Contained within the GeneSPIDER package¹⁶, NestBoot generates bootstrap support distributions for links inferred from measured as well as shuffled data²⁰, and minimizes false links by comparing them. In this way NestBoot is able to discard links with high bootstrap support if they have similarly high support in the null distribution. NestBoot has been shown to give substantially increased inference accuracy across both synthetic and experimental datasets when compared to the methods in their native implementation.

In order to measure the accuracy of an inferred GRN, a true GRN is required. Because this is generally not available in the case of real data, we here introduce a framework to assess the predictiveness of an inferred GRN in the absence of a true GRN. Note that we are not presenting a GRN inference method on its own but rather a way to assess the quality of a given GRN. We first use it to measure a GRN’s ability to predict the data compared to a distribution of GRNs with the same topology as the inferred one but whose links have been shuffled. We complemented this performance evaluation by measuring the GRN’s ability to predict the data compared to a distribution of shuffled data. Finally, we present the best performing GRN in detail, although the other inferred GRNs are largely subsets of each other and mostly perform well too. Two of the novel links of the best GRN were experimentally validated. The presented GRN captures regulatory interactions central to cancer-relevant processes and we foresee that it can provide mechanistic insights that can help to guide future cancer research. For instance, many cancers are caused by dysregulation of the MYC oncogene, hence our finding of a new regulator of MYC may potentially lead to new therapies.

Methods

Knockdown data collection

A set of genes was assembled from different pathways and complexes, each interacting to some degree with the oncogene MYC²¹ (Tables S2 and S6). Each readout gene was perturbed in the human squamous carcinoma cell line A431 via transfection with short interfering RNAs (siRNAs). We then harvested, purified and prepared libraries using the Ambion Library Construction Kit²². A precise record of perturbations is key to modeling (next section). In order to minimize siRNA off-target effects, two to three siRNAs were used per target (Table S4), which were then averaged to purify the effects of the targeted siRNA perturbation. Cells were collected 72 h after siRNA knockdown, washed with phosphate-buffered saline (PBS), and lysed using CelluLyser²³. Cell counts were calculated using the resazurin fluorescence assay. Since no endogenous gene can be assumed to be free of MYC regulation, which is thought to be a universal transcriptional amplifier²⁴, a spike-in RNA transcript was added to each sample to act as a reference gene for the quantitative polymerase chain reaction (qPCR) analysis, added in proportion to the cell count before RNA isolation. It consisted of a 1,000-base sequence with a 5′ cap and a polyA tail. This was only used for normalization of mRNA level across samples²⁵. Negative controls comprised siRNA not mapping to any human gene, as well as an untreated control without any siRNA. The cDNA was prepared from the RNA and preamplified in preparation for the high-throughput qPCR screening. Finally, the transcript profiles with respect to the 40 genes were determined with TaqMan qPCR assays (Table S5) using Fluidigm Biomark 96 × 96 Dynamic Array integrated fluidic circuits. Raw qPCR output was processed with the ddct R package²⁶ into log transformed fold changes relative to the experimental controls. 
Three experimental replicates were made per targeted perturbation and five outlying replicates were discarded due to clear machine read error, thus the dataset is composed of 40 genes (N) and 115 samples. Including all controls, a total of 18,432 qPCRs were performed on 192 samples. Two technical replicates were performed to ensure minimal machine error. They generated very similar values up to 25 qPCR cycles (Fig. S1).
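
The conversion from raw quantification cycles to log fold changes can be illustrated with the standard ΔΔCt calculation. The sketch below is not the ddct package's implementation; the function name, the Ct values, and the assumption of perfect amplification efficiency (one doubling per cycle) are illustrative only.

```python
def log2_fold_change(ct_target, ct_ref, ct_target_ctrl, ct_ref_ctrl):
    """Log2 fold change of a target gene versus control (delta-delta-Ct).

    Assumes 100% amplification efficiency, so one cycle equals one doubling.
    """
    delta_sample = ct_target - ct_ref              # normalize to reference (e.g. spike-in)
    delta_control = ct_target_ctrl - ct_ref_ctrl   # same normalization in the control
    ddct = delta_sample - delta_control
    return -ddct                                   # log2(2 ** -ddct)


# Hypothetical Ct values: the knocked-down sample needs 2 more cycles than control.
lfc = log2_fold_change(ct_target=24.0, ct_ref=18.0,
                       ct_target_ctrl=22.0, ct_ref_ctrl=18.0)
print(lfc)  # -2.0
```

A later Ct means less starting template, so a positive ΔΔCt corresponds to a negative log fold change, as expected for a knocked-down target.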

Experimental validation of individual interactions was performed on GTML2 brain tumor cells, which were cultured in serum-free stem cell medium as previously described²⁷ and treated for 2 h with DMSO or JQ1 (500 nM). RNA was purified using the RNeasy Kit (Qiagen). RNA sequencing was performed using the Ion Proton System for Next-Generation Sequencing at NGI, SciLifeLab, Uppsala Biomedical Center (BMC), Sweden. All treatment conditions were performed in triplicates. All RNA sequence reads were processed and the differentially expressed genes were analyzed as previously described²⁷. An additional validation was based on a gene expression data set comprising DMSO- and JQ1-treated glioma cell lines, which we had previously published²⁸. Specifically, in this study we were able to distinguish between JQ1-resistant and JQ1-sensitive human adult high-grade glioma cell lines. From the four cell lines with available AmpliSeq expression data, only one (U3056) was characterized as JQ1-sensitive and expressing high MYC levels, and was accordingly selected to investigate the interaction between BRD4 and CCNB1. Expression data for the U3056 cell line was downloaded from the Gene Expression Omnibus (GSE138942) and comprised 6 h DMSO and JQ1 treatments in triplicates each.

GRN inference

The fold change is calculated in comparison to the spike-in for all knockdown experiments. It is used in combination with the collective experimental design matrix (describing the location of perturbed and readout genes) to determine the GRN, i.e. the interaction matrix A, of regulatory effects from gene j to i in element a_ij. We use a linear ODE model, similar to^17,29, which simplifies to

$$Y=-{A}^{\dag}(P-F)+E \tag{1}$$

where Y is an expression matrix of calculated fold changes, with N genes (rows) and M experiments (columns). In Eq. 1, P is the design matrix if we solve for ${A}^{\dag}$, where the Moore–Penrose generalized inverse, denoted †, is used throughout in place of the inverse because the ordinary inverse is intractable when sparse GRNs are rank deficient. However, as we want to solve for $A$ and not ${A}^{\dag}$, we reformulate Eq. 1 as a traditional regression problem in errors-in-variables form, $-(P-F) =A(Y-E)$. The errors in Y and P are represented as measurement error E and process error F, respectively, as defined in Table S1. F is used as an estimate of the variation in the perturbation, e.g. siRNA efficiency or environment, while E is used as an estimate of the variation inherent to the cells’ expression as well as error in plate reading²⁶.
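
The step from Eq. 1 to the regression form can be spelled out as follows. This is a sketch under the added assumption, not stated in the text, that $A$ has full rank so that $A{A}^{\dag}=I$; for a rank-deficient $A$ the last line holds only on the column space of $A$.

$$\begin{aligned}
Y &= -{A}^{\dag}(P-F)+E \\
Y-E &= -{A}^{\dag}(P-F) \\
A(Y-E) &= -A{A}^{\dag}(P-F) \\
A(Y-E) &= -(P-F) \quad \text{if } A{A}^{\dag}=I
\end{aligned}$$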

Three methods are employed to perform model selection and parameter estimation simultaneously. LSCO (least squares with a cut off to produce variably sparse networks)³⁰ was chosen for its resemblance to the standard ordinary least squares method, LASSO (least absolute shrinkage and selection operator)³¹ was chosen for its proven ability to find the sparse solution with minimum errors, and TLSCO (total least squares³² with the same sparsity inducing cut off as LSCO) for its ability to model error in both the dependent and independent variables¹⁵. Each method is encapsulated within the nested bootstrapping framework to estimate the linear model in an accurate and reproducible manner by limiting false discovery rate (FDR), in their native configuration.
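
Of the three methods, LSCO is the simplest to illustrate. The following is a minimal sketch, not the GeneSPIDER implementation: it assumes the noise-free form $-P=AY$ of the model above, solves it with a pseudoinverse, and applies the cutoff to absolute link weights. The function name `lsco`, the toy matrices, and the cutoff value are hypothetical.

```python
import numpy as np


def lsco(Y, P, cutoff):
    """Least squares estimate of A from -P = A Y, then zero out weak links."""
    A_ls = -P @ np.linalg.pinv(Y)                    # minimum-norm least squares solution
    return np.where(np.abs(A_ls) >= cutoff, A_ls, 0.0)


# Toy example: 3 genes, one single-gene perturbation per gene, no noise.
A_true = np.array([[-1.0, 0.0, 0.5],
                   [0.8, -1.0, 0.0],
                   [0.0, 0.0, -1.0]])
P = np.eye(3)                                        # hypothetical design matrix
Y = -np.linalg.pinv(A_true) @ P                      # noise-free response from Eq. 1
A_hat = lsco(Y, P, cutoff=0.1)
print(np.round(A_hat, 3))
```

Raising the cutoff removes more links, which is how a single least squares fit can be turned into networks spanning a range of sparsities.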

GRN validation without gold standard

To evaluate the goodness-of-fit of the inferred network in a prediction error framework one needs to balance the measurement and process errors. This optimization occurs during the leave one out procedure (Algorithm 1), using the CVX convex optimization package (v1.22)³³ for MATLAB, where the left out gene (g) is expressed as a linear combination of the other experiments (cross-validation). The aim of this procedure is to equally balance the measurement and process errors when predicting the left out gene under cross-validation.

In the BalanceFitError algorithm, A contains the inferred GRN structure (topology), with each non-zero value representing a regulatory interaction and each zero a lack of interaction, i.e. pseudo-direct influence. The algorithm estimates each gene’s perturbation and response based on the balanced measurement and process errors of all other genes and compares it to the intended perturbation and observed response. Since error is a function of the degrees of freedom of the given matrix, relative error (E_rel, F_rel) is used to more equally balance these errors. From step (i) to (iii) all matrices have the perturbation experiments of the left out gene g removed and are thus denoted !g. However, A maintains all genes, remaining square throughout, and later the left out experiments can be predicted from the remaining data. This method to evaluate the goodness-of-fit is used on all inferred networks. All inference methods used here have a regularization parameter that determines the number of nonzero parameters in the models, which is varied to span the complete range from empty to full network.

Because our leave-one-out procedure assesses individual gene prediction errors, we assembled null GRN performance distributions by shuffling GRN links and fitting these new networks to the data with a constrained least squares (CLS) algorithm^30,34, which yields standardized and fairly conservative link weights. To this end we implemented a Monte Carlo sampling method that samples links while maintaining each node's in-degree and preserving hubs, thereby approximating a link null distribution based on the inferred GRN, which we judged to generate conservative and fair null GRNs. For a fair comparison, both the inferred and shuffled GRNs are fit to the original data; during fitting, the topology and sign of each GRN are preserved. To obtain a measure of the goodness of fit of both inferred and shuffled GRNs, cross-validation was used to calculate the weighted residual sum of squares (wRSS) of the original training data while balancing the measurement and process errors as described in Algorithm 1. We are able to predict a left out gene in steps c and d by expressing it as a linear combination of the other genes. The same goodness-of-fit measure was also used to compare the inferred GRN’s ability to predict the original data under cross-validation with the distribution of prediction errors obtained using shuffled data. The relative error metric comparing measured and shuffled wRSS (Figs. 2, 3, S3, S4) is complemented by R² values (Fig. S5). Before calculating these, the parameters of each GRN were modified to ensure that the predicted response remained within similar bounds as the observed gene expression. This was done by performing singular value decomposition of the GRN, setting singular values below a cutoff to zero, and then reconstructing the GRN without the smallest singular values. This GRN was then fit to the training data under cross-validation to generate predicted expression responses in the same way as described above. 
The cutoff on the minimum singular value was set independently for each GRN to ensure that the predicted expression values were within the range of the measured values. The small singular values generally represent noise if the data is ill-conditioned, and removing them reduces the effect of noise.
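
The singular value truncation described above can be sketched as follows. The function name, the toy matrix, and the cutoff are illustrative assumptions; how the cutoff was tuned per GRN is not reproduced here.

```python
import numpy as np


def truncate_singular_values(A, cutoff):
    """Rebuild A after setting singular values below `cutoff` to zero."""
    U, s, Vt = np.linalg.svd(A)
    s_kept = np.where(s >= cutoff, s, 0.0)
    return U @ np.diag(s_kept) @ Vt


# Toy 2-gene GRN with one very small singular value.
A = np.array([[3.0, 0.0],
              [0.0, 0.01]])
A_trunc = truncate_singular_values(A, cutoff=0.1)
print(np.round(A_trunc, 3))
```

Because predictions from Eq. 1 involve the pseudoinverse of the GRN, a singular value of 0.01 would scale part of the response by a factor of 100, whereas after truncation that direction contributes nothing.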

To further verify both predictiveness and generalizability, these GRNs are also applied to a second, independent validation dataset based on the same genes knocked down in pairs, in single replicates. While this data is not used to infer GRNs, we apply the same cross-validation strategy as for the original data to validate the GRNs. This is necessary for parameter fitting, since the process error is different from the single knockdown data. Furthermore, by running the same pipeline we obtain a comparable measure of how well the independent data fits our inferred GRNs, and build null distributions of expected error from shuffled GRNs to examine an inferred GRN’s ability to predict the data.

Results

A four-step procedure was implemented to generate a cancer-centric GRN oriented towards the MYC oncogene (Fig. 1). First, a list of 303 cancer-associated genes, gathered from the NCI Pathway Interaction Database³⁵, the myccancergene.org website³⁶, FunCoup output³⁷, and 29 other sources (supplemental Table S6)^38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66, was ranked heuristically by what was known of each, giving preference to genes with known associations to both cancer and MYC. The criteria were as follows, in decreasing order of rank: (i) members of a complex with MYC affecting or not affecting transcripts, (ii) genes directly affecting MYC or MYC transcripts (activating/repressing), (iii) genes affecting MYC targeted transcripts, and (iv) genes indirectly affecting MYC transcripts. Only genes expressed in the used cell line were further considered. The 40 top ranked genes were perturbed by siRNA in the well characterized human A431 squamous carcinoma cell line (Table S2). Of the selected genes, 31 are transcriptional regulators, 10 are oncogenes, and 7 are tumor suppressors.

RNA silencing experiments were carried out to knock down the expression of each individual gene, whereafter the expression of all genes in response to the perturbation was measured. The experiments were carried out with three biological replicates per gene. At steady-state, gene expression was measured using high-throughput RT-qPCR, totalling 18,432 quantifications. Most targeted genes were seen to have dramatic reduction in expression, generally a stronger effect than for the other genes (Fig. S2). In fact, 31 targets were significantly downregulated (p < 0.1 and log₂ fold change < −2).

The perturbation response gene expression data were used for network inference with three methods, LASSO, LSCO, and TLSCO, each run in conjunction with nested bootstrapping. Each GRN inference method was run with varying parameters to produce GRNs in a range of different sparsities. Nested bootstrapping was then used to select the significantly supported links in each GRN, resulting in final sparsities that tend to never exceed 3–5 links/gene even if the natively inferred GRN had 40 links/gene, for example.

In order to select the GRN that has the links most likely to exist in reality, we compared how well each inferred GRN’s topology fits the data compared to shuffled topologies of the same GRN. The reason for validating the topology rather than the complete GRN including the actual parameter values is that those parameters are optimized to fit the data and therefore no suitable null model exists. Note that by topology we mean the structure of the GRN, i.e. the inferred links and their sign. For each topology, the parameters were fit to the observed data using cross-validation, and the error was measured as the difference between predicted and observed values of the hold-out samples, after assembling the individual gene predictions into the full predicted matrix. This was done with the novel Balanced Fitting of Errors with cross validation (BEFCV) algorithm (Algorithm 1), which ensures that the error is balanced between sources, i.e. that the error is not merely pushed from the measurement to the process estimation or vice versa. Note that this algorithm is not a GRN inference method on its own but is rather a way to assess the quality of an inferred GRN.

Each inferred GRN was shuffled a hundred times, and the data was fit the same way to these topologies to estimate a null distribution of expected inference error. Note that since the parameters of the shuffled topologies are fit to minimize the error, this is a very stringent test of the inferred topology, and for a suboptimal GRN one expects some of its shuffled topologies to have lower error by chance. Yet, several of the inferred GRNs greatly outperformed their null model, both when using the original training data (Fig. 2) and the independent validation data with double knockdown design in the same cell line (Fig. S3). We also calculated R² values to show the proportion of the variation that our GRNs explain (Fig. S5). All GRNs are available at https://dcolin.shinyapps.io/CancerGRN/. Five of the inferred topologies had 1,000 times lower error than the median of the shuffled null model. The most accurate GRNs were inferred by LASSO, in terms of outperforming their null distributions. All but one GRN outperformed the median of their null distributions, and eight of the nineteen GRNs across all three methods outperformed all shuffled GRNs in their respective null distribution.
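
The shuffled-topology null can be illustrated with a simplified sketch. It assumes that row i of the matrix holds the incoming links of gene i, preserves only the in-degree (hub preservation, self-link handling, and the refitting of link weights are omitted), and uses hypothetical function names and a toy signed topology.

```python
import random


def shuffle_links_keep_in_degree(A, rng):
    """Permute each row of A so every target gene keeps its number of incoming links."""
    shuffled = []
    for row in A:
        new_row = list(row)
        rng.shuffle(new_row)          # moves links (with their sign) to new regulators
        shuffled.append(new_row)
    return shuffled


def fraction_of_null_better(observed_error, null_errors):
    """Share of shuffled GRNs whose error is at or below the inferred GRN's error."""
    return sum(e <= observed_error for e in null_errors) / len(null_errors)


rng = random.Random(0)                # fixed seed for reproducibility
A = [[-1, 0, 1, 0],
     [1, -1, 0, 0],
     [0, 0, -1, 0],
     [0, 1, 1, -1]]                   # toy signed topology, not from the study
null_topologies = [shuffle_links_keep_in_degree(A, rng) for _ in range(100)]
```

In the actual comparison each shuffled topology would be refit to the data and its cross-validated error collected; `fraction_of_null_better` then summarizes how often the null matches or beats the inferred GRN.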

We also applied another null model to test how well the data fits the inferred GRN. Here we shuffled the original data one hundred times and fit these datasets to the inferred GRN in order to generate a null distribution. For many inferred GRNs the error was significantly lower than the median of the null distribution, both for the original training data (Fig. 3) and for independent validation data (Figs. 3 and S4).

The GRN that outperformed its null distribution by the largest margin was Bolasso_network_L1145_M115_support97.5_1.52e-03, which we will refer to as the best GRN, with 125 links, including 39 self-links, between 39 genes (Fig. 4) and a sparsity of 3.2 links/gene. The full name indicates certain properties, namely that 1,145 links were natively inferred before NestBoot, 115 experiments were used, 97.5% bootstrap support was attained at FDR = 0.05, and 1.52e-03 is the sparsity penalty parameter used. In this GRN’s overlap plot (Fig. 5) one can see the distributions of bootstrap values for both measured and shuffled data. The frequency of bootstrap support for measured data increases sharply at the right end above 98%, suggesting that this part of the distribution represents real and therefore highly reproducible links. In contrast, the shuffled data decreases towards support = 1. The fact that some links inferred from shuffled data can attain such high bootstrap values can be attributed to the fact that the inference is done at a sparsity that yields very dense GRNs which may result in spurious links with high bootstrap support. However, the nested bootstrap framework monitors the distribution of spurious links and takes them into account when calculating FDR. The plot shows how FDR varies for different bootstrap support cutoffs.
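
How an FDR estimate can vary with the bootstrap support cutoff is illustrated below. This is a deliberately simplified stand-in, not the NestBoot procedure: it estimates FDR as the ratio of shuffled-data links to measured-data links at or above a cutoff. All function names and support values are hypothetical.

```python
def estimated_fdr(measured_support, shuffled_support, cutoff):
    """Ratio of null links to measured links with bootstrap support >= cutoff."""
    n_measured = sum(s >= cutoff for s in measured_support)
    n_null = sum(s >= cutoff for s in shuffled_support)
    return n_null / n_measured if n_measured else 0.0


def lowest_cutoff_at_fdr(measured_support, shuffled_support, target_fdr, cutoffs):
    """First cutoff (scanning upward) whose estimated FDR is within the target."""
    for c in sorted(cutoffs):
        if estimated_fdr(measured_support, shuffled_support, c) <= target_fdr:
            return c
    return None


# Hypothetical support values (fraction of bootstrap runs containing each link).
measured = [1.0, 1.0, 0.99, 0.98, 0.9, 0.6, 0.4]
shuffled = [0.95, 0.7, 0.5, 0.3, 0.2, 0.1, 0.1]
cutoff = lowest_cutoff_at_fdr(measured, shuffled, 0.05, [0.5, 0.9, 0.975])
print(cutoff)  # 0.975
```

In this toy example one shuffled-data link still has 95% support, so a cutoff of 0.9 gives an estimated FDR of 0.2 and only the stricter cutoff meets the 0.05 target.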

One can also see the level of overlap between one hundred nested bootstrap runs in Fig. 5. Each run yields a bootstrap support for every link, which can be converted to a GRN for a given cutoff. For the measured data, the overlap (Jaccard) between runs stays relatively high (0.6) all the way to links with 100% bootstrap support, indicating that the reproducibility is high. In contrast, for the shuffled data not a single link with bootstrap support above 70% overlaps with another nested run, indicating poor reproducibility despite relatively high bootstrap support.
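
The overlap between two nested bootstrap runs at a given cutoff can be computed as a Jaccard index over their link sets. The sketch below uses placeholder gene labels and made-up support values; the function names are hypothetical.

```python
def links_above(support, cutoff):
    """Set of (regulator, target) links with bootstrap support >= cutoff."""
    return {link for link, s in support.items() if s >= cutoff}


def jaccard(a, b):
    """Jaccard index of two link sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Made-up support values for three links in two runs.
run1 = {("g1", "g2"): 1.0, ("g2", "g3"): 0.99, ("g4", "g5"): 0.4}
run2 = {("g1", "g2"): 0.98, ("g2", "g3"): 0.6, ("g4", "g5"): 0.97}
overlap = jaccard(links_above(run1, 0.95), links_above(run2, 0.95))
```

Here each run keeps two links at the 0.95 cutoff but only one is shared, giving an overlap of one third; a value near 0.6 at 100% support, as reported for the measured data, indicates far better agreement between runs.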

Validation of the best GRN

Of the 125 links inferred in the top performing GRN, two novel MYC-related links were experimentally validated. The novel regulatory relationships BRD4 → CCNB1 and CCNB1 → MYC (Fig. 3) were examined in an independent study²⁷ in which JQ1 was used to inhibit BRD4 in the GTML2 cell line, a mouse brain tumor cell line that overexpresses human MYCN. The inferred activation of CCNB1 by BRD4 was supported by a significant reduction of CCNB1 expression when BRD4 was inhibited, from 7.12 to 7.04 average log(CPM) after 6 h (Fig. S6). However, in order to study immediate effects of BRD4 inhibition we here performed a new analysis in which GTML2 cells were treated with JQ1 for just 2 h. Again, CCNB1 expression was significantly reduced, from 7.48 to 7.04 average log(CPM). Furthermore, in the MYC-expressing human adult high-grade glioma cell line U3056²⁸ we again observed a significant downregulation of CCNB1 after 6 h of BRD4 inhibition via JQ1 (Fig. S7). Longer JQ1 treatment (24 h) further decreases CCNB1 in high-grade glioma cells⁶⁷. The same was observed 24 h after JQ1 inhibition in ovarian cancer cells⁶⁸. Additional support for this link is provided by co-expression between BRD4 and CCNB1 in the GEO dataset GSE7307 (Spearman correlation 0.473, p = 4 × 10^–39).

Support for the inferred activation of MYC by CCNB1 comes from the fact that CCNB1 expression changed from normal newborn mouse brain to adult mouse brain (from FPKM 20.3 to 0.2), which agrees with the change of MYC (from FPKM 9.2 to 0.8)²⁷. Additional support for this link is provided by co-expression between CCNB1 and MYC in the GEO datasets GSE2503 (Spearman correlation 0.99, p = 1.4 × 10^–24 for squamous cell carcinomas), GSE69925 (Spearman correlation 0.25, p = 1.4 × 10^–5 for esophageal squamous cell carcinomas), and GSE7307 (Spearman correlation 0.456, p = 1.2 × 10^–37). Furthermore, this link is found in the STRING⁶⁹, GeneMania⁷⁰, and FunCoup⁷¹ databases.

These validations support a novel mechanism for MYC regulation inferred in the best GRN. While it is well known that BRD4 can activate MYC in some cancer types⁷², the best GRN presents a regulatory route that goes via CCNB1 (Cyclin B1). Bound with cyclin-dependent protein kinases, CCNB1 is involved in controlling the cell cycle at mitosis. The findings here suggest that CCNB1′s role in regulating biological processes such as proliferation and oncogenesis can proceed via the activation of MYC.

Another type of validation is comparison to known links in public network resources. The links in the best GRN were searched for in the databases TRRUST⁷³, FunCoup⁷¹, HumanNet⁷⁴, and STRING⁶⁹ as well as in our prior network from data mining. Where these reference networks contained undirected links, we compared them to an undirected version of our GRN. Many known interactions were found in the best GRN (21 recovered from STRING), indicating its ability to accurately infer what is known. The overlap with the other GRNs was significant (p < 0.1) in a hypergeometric test in all cases but one (Table S3).
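
The hypergeometric overlap test mentioned above can be sketched with the standard library alone. The counts below are hypothetical and are not taken from Table S3; the function name and the choice of 1,600 possible links (40 × 40) are illustrative assumptions.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) when n links are drawn from N possible, K of which are in the reference."""
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(K, n) + 1))
    return hits / total


# Hypothetical counts: 1,600 possible links, 200 in a reference network,
# 125 inferred links, 30 of them shared with the reference.
p_value = hypergeom_upper_tail(k=30, N=1600, K=200, n=125)
print(p_value)
```

With these counts about 16 shared links would be expected by chance, so observing 30 yields a small upper-tail probability.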

Discussion

This study carries out a complete workflow for inferring reliable GRNs, from the selection of genes, experimental perturbation, data collection, and GRN inference, to validation of the inferred GRNs. It was applied to 40 cancer-related genes whose GRN was studied in a human squamous carcinoma cell line. The collected dataset was used to infer GRNs of varying sparsity using three inference techniques within the NestBoot framework. The predictiveness of the inferred GRNs was estimated using the novel BalanceFitError algorithm under cross-validation. This is not a GRN inference method on its own but can be applied to GRNs inferred with any method. Almost all inferred GRNs were more predictive than expected by chance, and some were vastly more predictive. These top performing GRNs were also able to predict an independent pairwise-gene perturbation validation dataset significantly better than expected by chance. The best GRN contains many known links and proposes many novel links, two of which were verified experimentally.

The performed gene perturbations caused a range of fold changes in both targeted and readout genes (Fig. S2). Knockdown is advantageous for GRN inference compared to complete inhibition through knockout, as that could alter the gene functioning within the cells to such an extent as to potentially drive the cell to any number of non-native states by activating alternative pathways to cope with the loss of the knocked out one. This would result in measurement of an altogether different cellular GRN which lacks the knocked out gene. With knockdown, a gene's effect is lowered in the hope of measuring an otherwise wild-type GRN from the perspective of the single gene perturbation, across the gene repertoire.

The knockdown efficiency of each siRNA is unknown, and varies between genes. It may seem desirable to know the siRNA efficiency since this is a parameter in the perturbation design matrix that is used in the mathematical modelling. However, its value does not affect the inferred GRN’s topology, and since the topology is the main outcome of the inference, and what we compare to null, this lack of knowledge is inconsequential. Prior information, whether literature-curated or derived from ChIP-seq or ATAC-seq, has been shown to be of value in modern GRN investigations, and may also be helpful here. Such integration could be built into the model as a method of constraining spurious link additions, much the same as NestBoot restricts links based on shuffled link distributions. The NestBoot algorithm produces substantial accuracy improvement and we would anticipate further accuracy improvements from the addition of priors. However, such experimental information is not available for this study and we therefore pursued a strictly data-driven approach.
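The claim that unknown knockdown efficiency leaves the topology unchanged can be illustrated with a short derivation. It is a sketch under assumptions not stated in this section: a noise-free linear steady-state model, one single-gene perturbation per experiment (a diagonal nominal design P_0), a square invertible response matrix Y, and positive efficiencies d_j collected in D = diag(d_1, ..., d_N). Regularised inference on noisy, replicated data will only approximately follow this idealised case.

```latex
% Assumed model: A = network matrix, Y = steady-state response, P = true perturbation
\begin{align}
  Y &= -A^{-1} P, \qquad P = P_0 D
      && \text{(true perturbation = nominal design scaled by efficiency)} \\
  Y^{-1} &= -D^{-1} P_0^{-1} A \\
  \hat{A} &= -P_0 Y^{-1} = P_0 D^{-1} P_0^{-1} A = D^{-1} A
      && \text{(diagonal matrices commute)} \\
  \hat{a}_{ij} &= a_{ij} / d_i
      && \Rightarrow\ \hat{a}_{ij} = 0 \iff a_{ij} = 0
\end{align}
```

Under these assumptions, using the nominal design instead of the true one rescales each row of the estimated network by a positive constant, so the zero pattern and the signs of the links are preserved and only the link weights change.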

Despite our efforts to measure absolute mRNA levels using spiked-in RNA as qPCR reference, MYC was not found to be a universal amplifier as previously claimed^24,75,76. Our observation agrees with the results of Nishiyama et al.⁷⁷. In both their study and ours, measurements were done after 72 h. It is possible that MYC knockdown activates a response leading to rapid restoration of MYC expression, so that cells return to their original state within that time span, instead of reaching a new steady state⁶⁸. In our study, the targeted MYC transcript was not significantly repressed by the MYC siRNA. This may be caused by its unusually high turnover rate⁷⁸, which can make it difficult to knock down with siRNA⁷⁹. Another possibility is that the introduced siRNAs outcompete miRNAs for available RISC and thereby relieve repression of endogenous miRNA targets⁸⁰. The same lack of observed knockdown of the target was noted for three other transcription factors: SP1, LMYC, and JUN. This suggests a need to optimize the experimental protocol to obtain perturbed steady-state conditions when knocking down certain transcription factors.

During the NestBoot procedure the sparsity of the native GRNs is varied from almost a full to almost an empty network. However, as NestBoot selects the strongest supported links only, the sparsity of the GRNs output by NestBoot does not vary much for the denser GRNs, and even less in gene makeup. We observe consistency across different sparsities, i.e. the smaller GRNs are mostly a subset of the larger ones. This consistency among sparsities adds further confidence beyond the GRNs’ predictiveness relative to a null distribution of shuffled topologies. Selecting the GRN with the optimal sparsity can be done in several ways. Here we followed the strategy of selecting the GRN with the best combination of coverage and predictiveness. Another criterion to select GRNs is the biological rationale that natural systems usually contain 3–5 links per gene^8,29.
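Both checks mentioned above, nestedness across sparsities and the links-per-gene rationale, are easy to compute from link lists. The following is a minimal sketch with hypothetical function names and toy networks; it is not part of NestBoot or GeneSPIDER.

```python
def links_per_gene(links, n_genes):
    """Average number of links per gene in a GRN given as a collection of (regulator, target) pairs."""
    return len(set(links)) / n_genes


def nested_fraction(sparser, denser):
    """Fraction of links in the sparser GRN that also occur in the denser GRN (1.0 = perfect subset)."""
    sparser, denser = set(sparser), set(denser)
    if not sparser:
        return 1.0
    return len(sparser & denser) / len(sparser)


def within_expected_density(links, n_genes, low=3.0, high=5.0):
    """Check the rule of thumb of roughly 3 to 5 links per gene."""
    return low <= links_per_gene(links, n_genes) <= high


# Toy example (hypothetical links over four genes).
dense = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "A")]
sparse = [("A", "B"), ("C", "D"), ("B", "D")]

print(nested_fraction(sparse, dense))      # share of sparse links retained in the dense GRN
print(links_per_gene(dense, n_genes=4))    # average links per gene
print(within_expected_density(dense, n_genes=4))
```

A nested fraction close to 1 across consecutive sparsity levels corresponds to the consistency described in the text; the density check is only a heuristic filter and does not replace the predictiveness comparison.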

In this study we face the problem of how to measure accuracy in the absence of a true network. Lacking such a gold standard, it is impossible to determine if an inferred link is true or false. Instead, we compared each inferred network to a null distribution of GRNs with the same sparsity and indegree distribution. Since the prediction error depends on the weights of the links, it is crucial to fit each shuffled-link GRN to the data to give it reasonable weight estimates. To make the comparison fair, both the inferred GRN and the shuffled-link GRNs are refit to the data. By showing that the inferred GRNs outperform their shuffled counterparts in terms of ability to explain the data, measured both by the wRSS and R², we know that they have a topology closer to the unknown real GRN. The exact same procedure can then be applied to other data, such as the independent validation dataset. With enough repeated shuffled-link GRNs to produce a sufficient null distribution, this results in an unbiased estimate of how predictive a given GRN is compared to what is expected, despite lacking a known gold standard network. Benchmarking on data with a known gold standard shows that increased predictiveness measured this way generally agrees with higher accuracy.
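The null comparison described above can be sketched as follows. This is not the BalanceFitError implementation: it assumes a linear model of the form A·Y = −P, uses an unweighted residual sum of squares instead of wRSS, omits cross-validation, and all function names are hypothetical. It only illustrates the three steps: shuffle links while preserving each gene's indegree, refit the weights on the fixed topology, and compare the fit to the null distribution.

```python
import random


def solve_linear(G, b):
    """Solve G x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(G)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        if abs(M[piv][c]) < 1e-12:
            raise ValueError("singular system: regulators are collinear")
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for c in reversed(range(n)):
        s = M[c][n] - sum(M[c][k] * x[k] for k in range(c + 1, n))
        x[c] = s / M[c][c]
    return x


def refit_rss(support, Y, P):
    """Refit link weights by least squares on a fixed topology; return the total RSS.
    support[i] lists the regulators of gene i; Y and P are genes x experiments."""
    rss = 0.0
    for i, regs in enumerate(support):
        target = [-p for p in P[i]]
        pred = [0.0] * len(target)
        if regs:
            X = [Y[j] for j in regs]
            G = [[sum(a * b for a, b in zip(xr, xs)) for xs in X] for xr in X]
            rhs = [sum(a * t for a, t in zip(xr, target)) for xr in X]
            w = solve_linear(G, rhs)
            pred = [sum(wr * xr[m] for wr, xr in zip(w, X)) for m in range(len(target))]
        rss += sum((t - q) ** 2 for t, q in zip(target, pred))
    return rss


def shuffle_support(support, n_genes, rng):
    """Draw new regulators for every gene while keeping its indegree (and hence the sparsity)."""
    return [sorted(rng.sample(range(n_genes), len(regs))) for regs in support]


def empirical_p(support, Y, P, n_shuffles=200, seed=0):
    """Empirical p-value: share of shuffled topologies that fit at least as well as the inferred one."""
    rng = random.Random(seed)
    observed = refit_rss(support, Y, P)
    null = [refit_rss(shuffle_support(support, len(Y), rng), Y, P) for _ in range(n_shuffles)]
    p = (1 + sum(r <= observed for r in null)) / (1 + n_shuffles)
    return observed, null, p
```

Refitting both the inferred and the shuffled topologies with the same routine keeps the comparison fair, as the text requires. The smallest attainable p-value is 1 / (n_shuffles + 1), so the number of shuffles bounds the resolution of the estimate.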

Data availability

Data are available from the Gene Expression Omnibus under accession GSE125958. Inferred GRNs and inference statistics are available at https://dcolin.shinyapps.io/CancerGRN/. Software is available at https://bitbucket.org/sonnhammergrni/genespider/src/BFECV/.

References

Shulman, L. P. Analysis of microarray experiments of gene expression profiling. Yearbook Obstet. Gynecol. Women Health 2007, 58–59 (2007).
Haury, A.-C., Mordelet, F., Vera-Licona, P. & Vert, J.-P. TIGRESS: trustful inference of gene regulation using stability selection. BMC Syst. Biol. 6, 145 (2012).
Mordelet, F. & Vert, J.-P. SIRENE: supervised inference of regulatory networks. Bioinformatics 24, i76-82 (2008).
Guo, S., Jiang, Q., Chen, L. & Guo, D. Gene regulatory network inference using PLS-based methods. BMC Bioinform. 17, 545 (2016).
Tegnér, J. & Björkegren, J. Perturbations to uncover gene networks. Trends Genet. 23, 34–41 (2007).
Carro, M. S. et al. The transcriptional network for mesenchymal transformation of brain tumours. Nature 463, 318–325 (2010).
Olsen, C. et al. Inference and validation of predictive gene networks from biomedical literature and gene expression data. Genomics 103, 329–336 (2014).
Marbach, D. et al. Wisdom of crowds for robust gene network inference. Nat. Methods. 9, 796–804 (2012).
Castro, D. M., de Veaux, N. R., Miraldi, E. R. & Bonneau, R. Multi-study inference of regulatory networks for more accurate models of gene regulation. PLoS Comput. Biol. 15, e1006591 (2019).
Banf, M., Zhao, K. & Rhee, S. Y. METACLUSTER-an R package for context-specific expression analysis of metabolic gene clusters. Bioinformatics 35, 3178–3180 (2019).
Wani, N. & Raza, K. Integrative approaches to reconstruct regulatory networks from multi-omics data: a review of state-of-the-art methods. Comput. Biol. Chem. 83, 107120 (2019).
Schaffter, T., Marbach, D. & Floreano, D. GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics 27, 2263–2270 (2011).
Bellot, P., Olsen, C., Salembier, P., Oliveras-Vergés, A. & Meyer, P. E. NetBenchmark: a bioconductor package for reproducible benchmarks of gene regulatory network inference. BMC Bioinform. 16, 312 (2015).
McKinney, S. M. et al. International evaluation of an AI system for breast cancer screening. Nature 577, 89–94 (2020).
Tjärnberg, A., Nordling, T. E. M., Studham, M., Nelander, S. & Sonnhammer, E. L. L. Avoiding pitfalls in L1-regularised inference of gene networks. Mol. Biosyst. 11, 287–296 (2015).
Tjärnberg, A., Morgan, D. C., Studham, M., Nordling, T. E. M. & Sonnhammer, E. L. L. GeneSPIDER: gene regulatory network inference benchmarking with controlled network and data properties. Mol. Biosyst. 13, 1304–1312 (2017).
Gardner, T. S., di Bernardo, D., Lorenz, D. & Collins, J. J. Inferring genetic networks and identifying compound mode of action via expression profiling. Science 301, 102–105 (2003).
Bonneau, R. et al. The inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo. Genome Biol. 7, R36 (2006).
Morgan, D., Tjärnberg, A., Nordling, T. E. M. & Sonnhammer, E. L. L. A generalized framework for controlling FDR in gene regulatory network inference. Bioinformatics https://doi.org/10.1093/bioinformatics/bty764 (2018).
Gobbi, A. et al. Fast randomization of large genomic datasets while preserving alteration counts. Bioinformatics 30, i617–i623 (2014).
Hsieh, A. L., Walton, Z. E., Altman, B. J., Stine, Z. E. & Dang, C. V. MYC and metabolism on the path to cancer. Semin. Cell Dev. Biol. 43, 11–21 (2015).
Ambion RNA-Seq Library Construction Kit. ThermoFisher; 01/2017. https://tools.thermofisher.com/content/sfs/manuals/4452440C.pdf.
CelluLyser Lysis and cDNA Synthesis Kit. TATAA. https://www.tataa.com/wp-content/uploads/2012/10/prodblad_v03_tataa-CelluLyser.pdf (2012).
Lovén, J. et al. Revisiting global gene expression analysis. Cell 151, 476–482 (2012).
Biocenter T. TATAA Universal RNA Spike I. https://webshop.tataa.com/dokument/Manual_TATAA%20Universal%20RNA%20Spike%20I%20SYBR-%20Probe_v1.3.pdf (2017).
Zhang, J. D., Biczok, R., & Ruschhaupt, M. The ddCt algorithm for the analysis of quantitative real-time PCR (qRT-PCR). https://www.bioconductor.org/packages/release/bioc/html/ddCt.html (2017).
Bolin, S. et al. Combined BET bromodomain and CDK2 inhibition in MYC-driven medulloblastoma. Oncogene 37, 2850–2862 (2018).
Čančer, M. et al. BET and aurora kinase A inhibitors synergize against MYCN-positive human glioblastoma cells. Cell Death Dis. 10, 881 (2019).
Nordling, T. Robust inference of gene regulatory networks. Jacobsen, E. (eds). PhD, KTH Royal Institute of Technology (2013).
Tjärnberg, A., Nordling, T. E. M., Studham, M. & Sonnhammer, E. L. L. Optimal sparsity criteria for network inference. J. Comput. Biol. 20, 398–408 (2013).
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B. 1, 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x (1996).
de Groen, P. P. N. An introduction to total least squares (1998).
Grant, M. & Boyd, S. CVX: Matlab Software for Disciplined Convex Programming, version 2.1. https://cvxr.com/cvx (2014).
Chang, L. Y. & Pollard, N. S. Constrained least-squares optimization for robust estimation of center of rotation. J. Biomech. 40, 1392–1400 (2007).
Schaefer, C. F. et al. PID: the pathway interaction database. Nucleic Acids Res. 37, D674–D679 (2009).
Zeller, K. I., Jegga, A. G., Aronow, B. J., O’Donnell, K. A. & Dang, C. V. An integrated database of genes responsive to the Myc oncogenic transcription factor: identification of direct genomic targets. Genome Biol. 4, R69 (2003).
Schmitt, T., Ogris, C. & Sonnhammer, E. L. L. FunCoup 3.0: database of genome-wide functional coupling networks. Nucleic Acids Res. 42, D380–D388 (2014).
Amente, S., Lavadera, M. L., Palo, G. D. & Majello, B. SUMO-activating SAE1 transcription is positively regulated by Myc. Am J Cancer Res. 2, 330–334 (2012).
Benassi, B. et al. MYC is activated by USP2a-mediated modulation of microRNAs in prostate cancer. Cancer Discov. 2, 236–247 (2012).
Bommert, K. S. et al. The feed-forward loop between YB-1 and MYC is essential for multiple myeloma cell survival. Leukemia 27, 441–450 (2013).
Chung, E. Y. et al. CD19 is a major B cell receptor-independent activator of MYC-driven B-lymphomagenesis. J. Clin. Invest. 122, 2257–2266 (2012).
Das, S., Anczuków, O., Akerman, M. & Krainer, A. R. Oncogenic splicing factor SRSF1 is a critical transcriptional target of MYC. Cell Rep. 1, 110–117 (2012).
Delmore, J. E. et al. BET bromodomain inhibition as a therapeutic strategy to target c-Myc. Cell 146, 904–917 (2011).
Davis, C. A. et al. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Res. 46, D794–D801 (2018).
Zwolinska, A. K., Heagle Whiting, A., Beekman, C., Sedivy, J. M. & Marine, J.-C. Suppression of Myc oncogenic activity by nucleostemin haploinsufficiency. Oncogene 31, 3311–3321 (2012).
Kessler, J. D. et al. A SUMOylation-dependent transcriptional subprogram is required for Myc-driven tumorigenesis. Science 335, 348–353 (2012).
Kimura, Y. et al. MM-1 facilitates degradation of c-Myc by recruiting proteasome and a novel ubiquitin E3 ligase. Int. J. Oncol. 31, 829–836 (2007).
Li, L. et al. Oncogenic activation of glypican-3 by c-Myc in human hepatocellular carcinoma. Hepatology 56, 1380–1390. https://doi.org/10.1002/hep.25891 (2012).
Hakem, A. et al. Role of Pirh2 in mediating the regulation of p53 and c-Myc. PLoS Genet. 7, e1002360 (2011).
Menssen, A. et al. The c-MYC oncoprotein, the NAMPT enzyme, the SIRT1-inhibitor DBC1, and the SIRT1 deacetylase form a positive feedback loop. Proc. Natl. Acad. Sci. USA 109, E187–E196 (2012).
Neri, F. et al. Myc regulates the transcription of the PRC2 gene to control the expression of developmental genes in embryonic stem cells. Mol. Cell Biol. 32, 840–851 (2012).
Piccinni, E. et al. Direct interaction of Gas41 and Myc encoded by amplified genes in nervous system tumours. Acta Biochim. Pol. 58, 529–534 (2011).
Magudia, K., Lahoz, A. & Hall, A. K-Ras and B-Raf oncogenes inhibit colon epithelial polarity establishment through up-regulation of c-myc. J. Cell Biol. 198, 185–194 (2012).
Narita, R. et al. Rabring7 degrades c-Myc through complex formation with MM-1. PLoS ONE 7, e41891 (2012).
Paul, I., Ahmed, S. F., Bhowmik, A., Deb, S. & Ghosh, M. K. The ubiquitin ligase CHIP regulates c-Myc stability and transcriptional activity. Oncogene 32, 1284–1295 (2013).
Peck, B., Ferber, E. C. & Schulze, A. Antagonism between FOXO and MYC regulates cellular powerhouse. Front. Oncol. 3, 96 (2013).
Romero, O. A. et al. The tumour suppressor and chromatin-remodelling factor BRG1 antagonizes Myc activity and promotes cell differentiation in human cancer. EMBO Mol. Med. 4, 603–616 (2012).
Qi, H. & Pei, D. The magic of four: induction of pluripotent stem cells from somatic cells by Oct4, Sox2, Myc and Klf4. Cell Res. 17, 578–580. https://doi.org/10.1038/cr.2007.59 (2007).
Zimonjic, D. B. & Popescu, N. C. Role of DLC1 tumor suppressor gene and MYC oncogene in pathogenesis of human hepatocellular carcinoma: potential prospects for combined targeted therapeutics (review). Int. J. Oncol. 41, 393–406 (2012).
Goga, A., Yang, D., Tward, A. D., Morgan, D. O. & Bishop, J. M. Inhibition of CDK1 as a potential therapy for tumors over-expressing MYC. Nat. Med. 13, 820–827 (2007).
Campaner, S. et al. Cdk2 suppresses cellular senescence induced by the c-myc oncogene. Nat. Cell Biol. 12, 54–59 (2010).
García-Gutiérrez, L. et al. Myc stimulates cell cycle progression through the activation of Cdk1 and phosphorylation of p27. Sci. Rep. 9, 18693 (2019).
Zhang, J. et al. BAG2 is a target of the c-Myc gene and is involved in cellular senescence via the p21(CIP1) pathway. Cancer Lett. 318, 34–41 (2012).
Hayashi, K. & Anzai, N. Novel therapeutic approaches targeting L-type amino acid transporters for cancer treatment. World J. Gastrointest. Oncol. 9, 21–29 (2017).
Wu, N. & Gidrol, X. The wind rose of human keratinocyte cell fate. Cell Mol. Life Sci. 71, 4697–4702 (2014).
Doe, M. R., Ascano, J. M., Kaur, M. & Cole, M. D. Myc posttranscriptionally induces HIF1 protein and target gene expression in normal and cancer cells. Cancer Res. 72, 949–957 (2012).
Wen, N. et al. Bromodomain inhibitor jq1 induces cell cycle arrest and apoptosis of glioma stem cells through the VEGF/PI3K/AKT signaling pathway. Int. J. Oncol. 55, 879–895 (2019).
Zhang, Z. et al. BET bromodomain inhibition as a therapeutic strategy in ovarian cancer by downregulating FoxM1. Theranostics 6, 219–230 (2016).
Szklarczyk, D. et al. The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 45, D362–D368 (2017).
Montojo, J., Zuberi, K., Rodriguez, H., Bader, G. D. & Morris, Q. GeneMANIA: fast gene network construction and function prediction for Cytoscape. F1000 Res. https://doi.org/10.12688/f1000research.4572.1 (2014).
Ogris, C., Guala, D. & Sonnhammer, E. L. L. FunCoup 4: new species, data, and visualization. Nucleic Acids Res. 46, D601–D607 (2018).
Shi, J. et al. Disrupting the interaction of BRD4 with diacetylated Twist suppresses tumorigenesis in basal-like breast cancer. Cancer Cell 25, 210–225 (2014).
Han, H. et al. TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions. Nucleic Acids Res. 46, D380–D386 (2018).
Zeng, X., Lin, J., Lin, C., Liu, X. & Rodriguez-Paton, A. Structural hole spanner in humannet identifies disease gene and drug targets. IEEE Access. 6, 35392–35401 (2018).
Lin, C. Y. et al. Transcriptional amplification in tumor cells with elevated c-Myc. Cell 151, 56–67 (2012).
Nie, Z. et al. c-Myc is a universal amplifier of expressed genes in lymphocytes and embryonic stem cells. Cell 151, 68–79 (2012).
Nishiyama, A. et al. Systematic repression of transcription factors reveals limited patterns of gene expression changes in ES cells. Sci. Rep. 3, 1390 (2013).
Jones, T. R. & Cole, M. D. Rapid cytoplasmic turnover of c-myc mRNA: requirement of the 3’ untranslated sequences. Mol. Cell Biol. 7, 4513–4521 (1987).
Larsson, E., Sander, C. & Marks, D. mRNA turnover rate limits siRNA and microRNA efficacy. Mol Syst Biol. 6, 433 (2010).
Khan, A. A. et al. Transfection of small RNAs globally perturbs gene regulation by endogenous microRNAs. Nat. Biotechnol. 27, 549–555 (2009).

Acknowledgements

We thank S. Nelander and L.G. Larsson for helpful discussions and M. Kaduk for useful comments. The work of TEMN was in part supported by Swedish strategic research program eSSENCE, Sweden, 2-year post-doctoral fellowship, the National Cheng Kung University, Taiwan, and Ministry of Science and Technology of Taiwan [MOST 105-2218-E-006-016-MY2, 107-2634-F-006-009, 108-2634-F-006-009, 108-2218-E-006-046].

Funding

Open Access funding provided by Stockholm University.

Author information

Authors and Affiliations

Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121, Solna, Sweden
Daniel Morgan, Matthew Studham, Andreas Tjärnberg & Erik L. L. Sonnhammer
Lionel Lab, Center for Developmental Genetics, New York University, New York, USA
Andreas Tjärnberg
Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
Holger Weishaupt & Fredrik J. Swartling
Department of Mechanical Engineering, National Cheng Kung University, Tainan, Taiwan
Torbjörn E. M. Nordling

Contributions

E.S., M.S., T.N. conceived, D.M., A.T., M.S., T.N. performed, M.S., H.W., F.S. provided materials, D.M., A.T., T.N. analyzed, D.M., E.S., T.N. wrote.

Corresponding authors

Correspondence to Torbjörn E. M. Nordling or Erik L. L. Sonnhammer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Morgan, D., Studham, M., Tjärnberg, A. et al. Perturbation-based gene regulatory network inference to unravel oncogenic mechanisms. Sci Rep 10, 14149 (2020). https://doi.org/10.1038/s41598-020-70941-y

Received: 26 October 2019
Accepted: 22 July 2020
Published: 25 August 2020
DOI: https://doi.org/10.1038/s41598-020-70941-y
