Abstract
Modular Response Analysis (MRA) is a suite of methods that under certain assumptions permits the precise reconstruction of both the directions and strengths of connections between network modules from network responses to perturbations. Standard MRA assumes that modules are insulated, thereby neglecting the existence of intermodular protein complexes. Such complexes sequester proteins from different modules and propagate perturbations to the protein abundance of a downstream module retroactively to an upstream module. MRAbased network reconstruction detects retroactive, sequestrationinduced connections when an enzyme from one module is substantially sequestered by its substrate that belongs to a different module. Moreover, inferred networks may surprisingly depend on the choice of protein abundances that are experimentally perturbed, and also some inferred connections might be false. Here, we extend MRA by introducing a combined computational and experimental approach, which allows for a computational restoration of modular insulation, unmistakable network reconstruction and discrimination between solely regulatory and sequestrationinduced connections for a range of signaling pathways. Although not universal, our approach extends MRA methods to signaling networks with retroactive interactions between modules arising from enzyme sequestration effects.
Introduction
The reconstruction of connections in signaling networks from experimental data is a key problem in systems biology. An intrinsic challenge in capturing direct network connections is that a signal originating from a component first causes changes in its immediate targets, but then rapidly propagates through the entire network, producing widespread (global) changes that mask direct (local) connections between nodes. Many groups around the world have suggested approaches to reconstruct direct, causative connections between molecules.^{1,2,3,4,5,6} For signaling and gene networks, Modular Response Analysis (MRA) was developed to infer immediate connections (termed local responses) from the steady state global responses to perturbations.^{7,8,9,10,11} To reduce the vast complexity of signaling networks, MRA divides these networks into modules connected through socalled communicating species, which affect the species dynamics in other modules.^{7,12} Therefore, in a modular framework each network node can be a single species or a module containing internal species interacting within this module. To infer connections between modules, each module is perturbed either alone or together with other modules, and the steady state responses of all communicating species are measured.
MRA neglects mass transfer between network modules, assuming that signaling involves only information transfer.^{7,13} However, activation or inhibition of signaling proteins commonly occurs through posttranslational modifications (PTMs) carried out by enzymatic reactions, such as reactions catalyzed by kinases and phosphatases. These reactions can create mass transfer between modules, if a communicating species (e.g., a kinase) from one module binds to a species from another module, forming a protein–protein complex. When a considerable fraction of a communicating protein is sequestered in a complex that contains species from two different modules, this complex cannot be neglected. The word retroactivity was coined to describe this effect.^{14,15,16,17} This has also been referred to as protein sequestration, for instance, the sequestration of an active kinase by its substrate.^{18} Interactions between modules that solely occur as a result of protein sequestration differ from regulatory interactions that activate or inhibit proteins through PTMs. We will term the former as sequestrationinduced connections.
When protein moiety conservation includes species from two modules, a perturbation to only one of the species that form a complex bridging two modules will affect both modules. As a result, the modular structure is lost, leading to a breakdown of intermodular insulation, which is required by MRA. In this case, network reconstruction becomes an illposed problem,^{19} and the inferred network topology might depend on particular perturbations that are used for its reconstruction as was recently shown.^{20} Because MRA and its statistical derivatives (e.g. the maximum likelihood and Bayesian MRA) have become broadly applied network reconstruction methods,^{4,8,9,10,11,21,22,23,24,25,26,27,28,29,30,31,32} it is necessary to find out if causative regulatory connections can be precisely inferred in case of extensive retroactivity interactions between modules.
Here we explore how intermodular protein sequestration affects MRAbased network inference. We show that additional measurements of perturbationinduced changes in intermodular protein complexes can be exploited to accurately infer network circuitries. An approach is proposed that restores modular insulation by defining communicating species as weighted sums of free species and intermodular complexes. Similarly to the original work where MRA was developed,^{7} we use mathematical models of signaling pathways to simulate network responses to perturbations, but our approach is solely based on experimental data of responses to perturbations (such as changes in the conserved protein abundances) and it is modelindependent. When applicable, this approach allows us to discriminate between solely regulatory network connections (e.g., mediated by protein phosphorylation) and enzyme sequestrationinduced connections. We analyze limitations of the proposed MRA extension and cases when it can be efficiently applied.
Results
MRA requires the condition of insulation for different modules
A signaling network can often be described by ordinary differential equations,
where z_{l} are the concentrations of components, such as genes or different protein forms, the function g_{l} includes the z_{l} production and consumption rates, and p is a vector of parameters, such as stoichiometric coefficients and rate constants. It is assumed that only linearly independent concentrations are considered in Eq. (1), and, therefore, the Jacobian matrix has full rank L. Consequently, the parameter vector p can also contain the total abundances of different protein forms that are constrained by moiety conserved cycles.^{12} We consider steadystate conditions and steadystate responses to parameter p perturbations.
MRA conceptually partitions the network into N ≤ L modules. A module contains a group of genes or signaling components, which together perform one or more identifiable tasks.^{7} Each module i can harbor m_{i} (m_{i} ≥ 0) internal species (y_{ik}) and contains a communicating species (x_{i}), which represents the module output. At a steady state \((\dot y_{ik} = 0)\), internal species of each module (i) can be expressed as functions (h_{ik}) of the communicating species and parameters,
Equation (2) allows us to use a smaller set of N algebraic equations, which governs the steady state behavior of module outputs (x_{i}), which become nodes of a modular network,
We quantify a direct connection from module j to module i by a relative change (Δx_{i}/x_{i}) in the activity of communicating species x_{i} of module i brought about by a change (Δx_{j}/x_{j}) in the output activity x_{j} of module j, provided that these two modules are conceptually isolated from the network. This condition implies that all other modules except these two remain unperturbed (∂x_{k} = 0, k ≠ i, j), whereas the affected module i is allowed to relax to its steady state.^{7,13} Under this condition, the ratio \(r_{ij} = \partial {\mathrm{ln}}\,x_i/\partial {\mathrm{ln}}\,x_j\) can be found via implicit differentiation of the function f_{i} in Eq. (3).
The coefficients r_{ij} are called the connection coefficients or the local responses and form the connection matrix that determines the direction and strengths of direct network connections.^{7,9} These connection coefficients cannot be immediately measured, because a perturbation to a single module propagates through the network, and the experimentally observed changes in other modules might be indirect.
MRA calculates connection coefficients (r_{ij}) from steadystate responses of an entire network to parameter (p_{j}) perturbations. Experimentally, perturbations use siRNA (affecting protein abundances), inhibitors, drugs and genetic alterations.^{23,24,25,29,30,33} Resulting steadystate responses are termed the global response coefficients (R_{ij}),^{7,13}
MRA has developed an experimental design that determines network connections (r_{ij}) by measuring global responses (R_{ij}).^{8,9} A specific feature of this design is selecting a set of experimental interventions that do not directly influence the output x_{i} of module i in order to find network connections (r_{ij}) leading to this module i. Each of these perturbations may directly affect one or many nodes x_{k} different from x_{i}. Formally, for each x_{i} (i = 1, …, N), we choose and then perturb N − 1 parameters p_{j} known to have the property that the function f_{i} in Eq. (3) does not depend upon p_{j},
These N − 1 parameters selected for perturbation will be termed perturbation parameters. The condition (Eq. 6) that parameter p_{j} does not directly affect module i, whereas p_{j} can affect other modules j (j ≠ i) is called the module insulation condition. Usually biological information to select such a parameter p_{j} is available, for instance, it can be known that an inhibitor of a membrane kinase has no direct influence on a cytoplasmic phosphatase, or the abundance of a certain protein has no direct influence on unrelated biochemical interactions in a different module. Differentiating the function f_{i} in Eq. (3) with respect to p_{j} and using the module insulation condition (6) and Eqs. (4) and (5), we arrive at MRA equations (Eq. 7),
For every module i, Eq. (7) determines the connection coefficients r_{ij} using the global network responses (R_{kj}) of each module (k = 1, …, N) to perturbations of N − 1 parameters p_{j} (statistical MRA formulations can use less or more than N − 1 perturbations^{4,23,24,25}). Each of the selected perturbations (parameters p_{j} in Eq. 6) cannot directly influence module i, but together these N − 1 independent perturbations should affect all the other (N − 1) modules of the network except module i.^{7,9} Importantly, the connection coefficients determined by Eq. (7) do not depend on a particular choice of N − 1 parameters p_{j}, provided that the module insulation condition (6) is satisfied for each parameter p_{j}, j = 1, …, N − 1.^{7} Indeed, connection coefficients are uniquely determined by a system steady state that does not depend on the choice of perturbation parameters, see Eq. (4).
Violation of insulation condition by complexes of proteins that belong to different modules
Module outputs are often represented by signaling enzymes, such as kinases.^{4,23,25} Suppose a communicating species of module i, e.g., a kinase, forms a complex with its substrate that belongs to another module j. If the concentration of this complex is comparable with the free concentration of the kinase or its substrate, the complex concentration cannot be neglected. Because protein synthesis and degradation usually occur at much longer timescale than (de)phosphorylation reactions, the total concentrations of different protein forms are conserved and, thus, the protein abundances are parameters of the system. Consequently, at a network steady state, the concentration of a complex containing proteins from two different modules (i and j) will depend on the total abundances of both these proteins, which will be parameters denoted as, p_{i} and p_{j}. If we assign the kinasesubstrate complex to module i that includes the kinase as a communicating species, then a perturbation to parameter p_{j} (the total concentration of the substrate) will affect not only module j but also the free kinase and the complex concentrations, i.e., module i (see Supplementary material section 1). Alternatively, if we assign the complex to module j that includes the kinase substrate, then a perturbation to parameter p_{i} (the total kinase concentration) will affect not only module i but also the free substrate and the complex concentrations, i.e., module j. Consequently, the choice of perturbation parameters as the total protein abundances will violate the module insulation condition (Eq. 6) for one or both of these modules. At the same time, perturbations of the other parameters, such as rate constants of enzymatic reactions, might not violate the module insulation condition.
Sequestration of a kinase (or a phosphatase) from module i by a substrate from module j means that module j retroactively affects module i, although module j is only a recipient of a signal from module i. Proper parameter perturbations that are consistent with Eq. (6) can reveal both regulatory influences and sequestrationinduced feedbacks. However, the violation of the module insulation condition might lead to contradictory results of inferring different network circuitries by using different perturbations, as illustrated below and in Section 1 of Supplementary material, using a simple example.
Using MRA to map network connections when protein complexes bridge modules
We first illustrate the challenges arising from the protein sequestration using paradoxical, at first glance, results of finding distinct network circuitries while perturbing different parameter sets. Prabakaran and colleagues^{20} showed network inference challenges both experimentally, using an in vitro reconstituted system of purified recombinant kinases (RAF, MEK, and ERK) and phosphatases (the serine/threonine phosphatase PP2A and the tyrosine specific phosphatase PTP), and also theoretically using a simplified model of the MEK/ERK cascade (Fig. 1), described as follows. A constant external signal (mimicked by a mutated constitutively active RAF kinase) phosphorylated MEK on two serines in the activation loop, yielding active ppMEK (MEK phosphorylation was considered as one step in the model^{20}). MEK phosphatase PP2A was not explicitly considered in the model, and ppMEK dephosphorylation was described by a first order process. Active ppMEK phosphorylated ERK on the tyrosine in the activation loop. The other activating site on ERK, threonine, was mutated to a nonphosphorylatable residue, thus rendering only phosphorylated ERK (pERK) susceptible to dephosphorylation by PTP, yielding ERK. The abundances of MEK, ERK, and PTP were considered constant.^{20}
Accordingly, out of 7 network species, only four of the species concentrations were linearly independent (Fig. 1). We can select these species as [ppMEK], [pERK], \(\left[ {ppMEK \cdot \cdot ERK} \right]\) and \(\left[ {pERK \cdot \cdot PTP} \right]\) (indicated by the shaded quadrilaterals in Fig. 1), whose dynamics is governed by Eq. (8). The remaining linearly dependent concentrations, [MEK], [ERK], and [PTP], are expressed through moietyconservation laws using the total protein abundances, ERK^{tot}, MEK^{tot}, and PTP^{tot},
This model MEK/ERK cascade was divided into two modules (highlighted in pink and blue colors in Fig. 1). The total concentration of phosphorylated MEK and ERK (including protein–protein complexes) were chosen as module outputs  communicating species x_{1} and x_{2},^{20}
To infer network interactions, the MEK module was perturbed by varying the total MEK abundance (MEK^{tot}), and the ERK module was perturbed by varying either the total ERK abundance (ERK^{tot}) or the total PTP abundance (PTP^{tot}). Surprisingly, the circuitries of the reconstructed networks were found different for these two different sets of perturbations.^{20} Perturbation of ERK^{tot} revealed an activating influence of ERK on MEK (manifested by positive connection coefficient r_{12}), while perturbation of PTP^{tot} revealed an inhibiting influence of ERK on MEK (negative connection coefficient r_{12}).^{20}
Whereas the inhibiting influence of ERK on MEK can be interpreted as the sequestration of active MEK by inactive ERK within the \(ppMEK \cdot \cdot ERK\) complex, the inferred activating influence of ERK on MEK is clearly a false positive result for this in vitro reconstituted network. Moreover, this MEK/ERK network has only one nonzero regulatory connection, a connection from MEK to ERK (positive connection coefficient r_{21}). These findings by Prabakaran and colleagues^{20} become less surprising, if we recall that assigning the \(ppMEK \cdot \cdot ERK\) complex to either MEK module or ERK module violates the module insulation condition (6). Next, we demonstrate how the unique circuitry of this network can unequivocally be inferred, using the same perturbations that led to the inconsistent topologies determined by Prabakaran and colleagues.^{20}
A system of equations governing steady state behavior of the communicating species x_{1} and x_{2} (Eq. 9) is derived using Eq. (8). For purposes of readability, only arguments of the governing functions (f_{1} and f_{2}) are presented below. Full expressions for these equations can be found in Supplementary material (section 2.1).
Equation (10) shows that both functions f_{1} and f_{2} depend on the ERK abundance. Therefore when the communicating species are selected according Eq. (9), a perturbation to ERK^{tot} not only directly affects the ERK module (f_{2}) but also immediately perturbs the MEK module, because ∂f_{1}/∂ERK^{tot} ≠ 0, violating Eq. (6). Likewise, perturbations to the rate constants of the ppMEKERK complex formation/dissociation \((k_3^{on},k_3^{off},k_4)\) also violate Eq. (6), because the governing functions for both modules depend on these parameters. At the same time, perturbations of parameters, which are the rate constants of other reactions (intrinsic to single modules) \((k_1,k_2,k_5^{on},k_5^{off},k_6)\) and the PTP abundance (PTP^{tot}), do not violate the insulation condition (6) for both communicating species (module outputs) x_{1} and x_{2}. Perturbations of any two parameters from this set will allow the inference of the unique, true network circuitry with an activating connection from MEK to ERK and sequestration feedback from ERK to MEK (see Supplementary material, section 2).
We can envision the situation when a selected parameter (p_{j}) might directly affect internal species but does not influence the module output, i.e. a communicating species (x_{i}). Using Eqs. (2) and (3), the insulation condition (6) for module i can be reformulated in terms of the derivatives of its internal species, y_{ik} = h_{ik}(x_{1}, …, x_{N}, p), with respect to p_{j},
Formally, Eq. (11) allows the dependence of some internal species (y_{ik} = h_{ik}) on p_{j} provided that the sum of the partial derivatives in Eq. (11) equals 0. Thus, within the MRA framework a perturbation to parameter p_{j} can be applicable for inferring connections (r_{ij}) leading to this module i even if internal species of module i are directly perturbed, but the governing equation f_{i} and, thus, the communicating species x_{i} are not directly perturbed. Clearly, the given choice of a communication intermediate (that determines its governing function) also informs if the selected parameter violates or does not violate the module insulation condition (Eqs. 6 and 11).
Because perturbations to the protein abundances, using siRNA or irreversible covalentlybound inhibitors are commonly used, we might ask whether alternative choices of the communicating species (i.e., the module outputs) can ensure that the module insulation condition holds for both ERK and PTP abundance perturbations. Assuming that the absolute or relative (see Supplementary material, sections 2 and 3) changes in the concentrations of both free ppMEK and the ppMEKERK complex can be individually measured, we introduce a new communicating species of the MEK module, \(x_1^a\) that depends on a free nonnegative parameter a, while keeping the same ERK module output, x_{2}, as follows (cf. Eq. 9),
This new MEK module output, \(x_1^a\), is the weighted sum of the concentrations of the free active enzyme and the enzymesubstrate complex, which is multiplied by a free weight parameter a. If a = 0, \(x_1^a\) is the free form of active MEK. If a = 1, \(x_1^a\) is the total concentration of the phosphorylated MEK forms, the communicating intermediate selected by Prabakaran and colleagues.^{20} If a → ∞, then only the ppMEKERK complex acts as a communicating species.
The rationale behind selecting the new module MEK output \(x_1^a\) is the following. A perturbation, e.g., an increase in ERK^{tot} leads to an increase in the free ERK concentration and the ppMEKERK complex, but to a decrease in free ppMEK (sequestered by ERK). Because \(x_1^a\) is chosen as a linear combination of ppMEK and ppMEKERK, at some value of the weight parameter a, these opposite changes of the terms within the communicating species \(x_1^a\) will cancel each other out. At this a = a^{opt} value, perturbations to the ERK abundance will no longer directly affect the module MEK output \(x_1^{a^{opt}}\). The expression for a^{opt} can be obtained by solving the module insulation condition (Eq. 11), after substituting p_{j} = ERK^{tot} and the governing function for \(x_1^a\) given by Eq. (12) (see Supplementary material, section 2.1 for derivation),
Consequently, at a = a^{opt}, we obtain the following dependencies of the governing functions of communicating species \(x_1^{a^{opt}}\) and x_{2} on parameters (see Supplementary material, section 2 for details),
It follows from Eq. (14), that the insulation condition (Eq. 6) is satisfied for perturbations to any of the three conserved total abundances, MEK^{tot}, PTP^{tot}, ERK^{tot}, resulting in a unique MRAreconstructed network circuitry. However, perturbations to the rate constants (k_{1}, k_{2}) now violate Eq. (6) in contrast to the case of the initially selected communicating species of the MEK module (Eqs. 9 and 10). At the same time, perturbations to the rate constants \((k_5^{on},k_5^{off},k_6)\) internal to the ERK module do not contradict Eq. (6). This is supported by numerical calculations that solve Eq. (7),  the MRA equations, using simulated network perturbation responses, R_{ij}, which normally would be obtained experimentally. The results demonstrate that for the governing functions defined by Eq. (14), perturbations to any parameters, MEK^{tot}, ERK^{tot}, PTP^{tot}, \(k_3^{on}\), \(k_3^{off}\), k_{4}, \(k_5^{on}\), \(k_5^{off}\), and k_{6}, do not break up the module insulation condition, thus resulting in the reconstruction of the quantitatively identical matrices for the connections coefficients and the unmistakable network circuitry (see Supplementary material, Eqs. 18 and 19). We will term parameter perturbations, which do not violate the insulation condition, permissible perturbations (see Eqs. 6 and 11).
Interestingly, the choice of \(x_1^{a^{opt}}\) (when a = a^{opt}) results in the connection coefficient r_{12} (which determines the sequestration feedback and is calculated by solving Eq. 7) equal to zero. Consequently, by changing the communicating species (module outputs), regulatory connections in this network can be distinguished from solely retroactivity connections induced by protein sequestration. This is illustrated in Fig. 2a, showing how the regulatory activating connection from MEK to ERK (coefficient r_{21}, the red curve) and sequestration feedback connection from ERK to MEK (coefficient r_{12}, the black curve) vary with the change in the weight parameter a. Using Eq. (7), these connection coefficients (Fig. 2a) are obtained following small perturbations of the protein abundances (MEK^{tot} and ERK^{tot}) and numerical calculations of perturbation responses for the module outputs that are given by Eq. (12). The only requirement for the corresponding experimental setup is the separate determination of the ppMEKERK complex response and ppMEK response (here these responses are simulated using a model but they would be obtained experimentally for the normal MRA inference procedure^{7}). Importantly, the perturbation responses need to be measured experimentally or simulated computationally only once, whereas the connection coefficients are calculated for a range of different values of the weight parameter a, using the MRA equations (Eq. 7). At a = 0 (point 1 on Fig. 2a), the MEK module communicating species is free active MEK (ppMEK), r_{21} is positive because ppMEK activates ERK, whereas r_{12} is negative, reflecting the sequestration of ppMEK by ERK. Importantly, with an increase in the weight parameter a > 0, the negative sequestration connection coefficient r_{12} increases, assumes 0 at a = a^{opt} (point 2 in Fig. 2a) and then changes its sign, further increasing with increasing a. In fact, at a = 1, r_{12} is surprisingly positive, as found for MEK^{tot} and ERK^{tot} perturbations by Prabakaran and colleagues^{20} (Fig. 2a). Both coefficients r_{21} and r_{12} reach positive values when a tends to infinity (point 3 in Fig. 2a).
In Eq. (12), only one module (MEK) output was selected as a weighted sum of the free active kinase (ppMEK) and the kinasesubstrate complex (ppMEKERK). However, in the absence of prior information about which protein in the complex is an enzyme and which is a substrate, we can add the concentration of the ppMEKERK complex (scaled by a parameter a) to both module outputs,
This symmetrical choice of module outputs also allows us to uniquely infer the network circuitry, eliminating retroactivity connection coefficient at the same value of the weight parameter a = a^{opt} given by Eq. (13). Moreover, for the module outputs given by Eq. (15), the governing functions \(f_1^{a^{opt}}\) and \(f_2^{a^{opt}}\) will depend on the same parameter sets presented in Eq. (14) (although the expressions for these functions will change, see Supplementary material, section 2). Therefore similarly as above, perturbations to any two parameters in the following set, MEK^{tot}, ERK^{tot}, PTP^{tot}, \(k_3^{on}\), \(k_3^{off}\), k_{4}, \(k_5^{on}\), \(k_5^{off}\), and k_{6}, will reveal a unique network circuitry (see Supplementary material, section 2.1). For the choice of module outputs given in Eq. (15) and perturbations to MEK^{tot} and ERK^{tot}, Fig. 2b illustrates the dependencies of the connection coefficients on the parameter a. However, the asymptotic behavior (at large a values) of the connection coefficients is different. Now both r_{12} and r_{21} approach 1, when a → ∞, because the ppMEKERK complex becomes a main output for both MEK and ERK modules (r_{12} and r_{21} equal to 1 merely describes the influence of the ppMEKERK complex on itself).
Summarizing, by measuring the changes in the free active kinase (ppMEK) and the kinasesubstrate complex (ppMEKERK) concentrations separately upon perturbations to any two protein abundances (MEK^{tot} and ERK^{tot} or MEK^{tot} and PTP^{tot}), MRA can precisely reconstruct the signaling network analyzed by Prabakaran and colleagues,^{20} distinguishing between regulatory and retroactive connections and avoiding the inconsistent topologies.
A model of threetier cascade with no regulatory feedback connections
Next, using several examples of activating cascades where intermodular proteinprotein complexes are formed both upstream and downstream of a cascade tier, we show that regulatory connections can be unmistakably reconstructed while sequestration connections can be nullified using our method. First, we considered a cascade without regulatory feedback connections (Fig. 3a). As above, we used a mass action kinetic model to simulate steadystate responses of the cascade to perturbations. The kinetic equations of the model and parameter values are presented in Supplementary material (Tables 2 and 3).
The waterfall structure (Fig. 3a) of kinase activation cascades is common for many signal transduction cascades, such as the ERK, p38, JNK, and PI3K/AKT cascades.^{34} Proteins in each tier can be in inactive ([X_{1}], [X_{2}], and [X_{3}]) or active ([pX_{1}], [pX_{2}], and [pX_{3}]) states. The enzyme forms, pX_{1} and pX_{2}, activate inactive enzyme forms, X_{2} and X_{3}, respectively, in a twostep process which involves the formation of an enzymesubstrate complex followed by phosphorylation. The total protein abundances, \(X_1^{tot}\), \(X_2^{tot}\) and \(X_3^{tot}\), are conserved. We divided the cascade into three modules along its tiers, as shown by different colors in Fig. 3a. As above, we considered module outputs, in which the enzymesubstrate complex that bridges two modules was added to a single module (Eq. 18) or to both neighboring modules (Eq. 19),
Similarly to the model presented above (Eq. 8), if the concentrations of enzymesubstrate complexes cannot be neglected, the choice of the parameters to be perturbed can affect the inferred network circuitry. A standard selection of model outputs corresponds to weight parameters a_{i} equal to zero, if only free phosphorylated enzyme forms are considered, or to 1, if the total phosphorylated forms are measured and analyzed. Using this standard selection for all three networks and making perturbations to each of the total protein abundances, we calculated matrices of connection coefficients, r_{ij}, which showed nonzero feedback connections from each downstream module to its upstream module (see two reconstructed connection matrices for weight parameters a_{i} = 0 and a_{i} = 1 and the module outputs in Eq. (18) in the right panel of Fig. 3a). However, we do not know whether these connections correspond to regulatory or solely protein sequestrationinduced feedbacks. Moreover, reconstructed network circuitries might differ for differently selected sets of perturbation parameters.
To elucidate the nature of inferred connections, we calculated the dependences of the connection coefficients on positive values of weight parameters a_{i} for two sets of module outputs (Eqs. 18 and 19). We found that the connection coefficients, r_{12}, r_{23} and r_{13}, changed their signs with the a_{i} changes, that suggested sequestration connection, whereas connections r_{21} and r_{32} did not change the sign, indicating regulatory feedforward activation connections. For both choices of module outputs, the same \(\vec a = \vec a^{opt}\) values yielded zero values for suggested sequestration connections coefficients, r_{12}, r_{23}, and r_{13} (see Supplementary material, Eqs. 23 and 24). Also, for both choices of communicating species (Eqs. 18 and 19) we found that at \(\vec a = \vec a^{opt}\), the network was uniquely reconstructed using perturbations to wide range of perturbation parameters that included all three total protein abundances (\(X_1^{tot}\), \(X_2^{tot}\), and \(X_3^{tot}\)) and numerous kinetic constants (see Supplementary material, section 4). Recapping, at \(\vec a = \vec a^{opt}\) the connections coefficients that have changed their sign with changing \(\vec a\) become zero, and the connection matrix r_{ij} becomes invariant to a wide range of applied perturbations.
The results shown in Fig. 3 were obtained numerically, because the equations that govern the steady state behavior of communicating species do not allow for an analytical solution in this case. However, as in the previous example, in which the governing equations (Eq. 14) were solved analytically and numerically, our calculations suggested that nonzero connections, r_{12}, r_{23} and r_{13}, inferred for the standard selection of model outputs, were solely induced by protein sequestration (retroactivity). Importantly, at the \(\vec a^{opt}\) values of the weights a_{i}, the invariance of the connection matrix r_{ij} with respect to different perturbations was brought about by a restoration of the modular insulation condition (Eq. 6).
In summary, these result suggest that if the connection coefficients r_{ij} have different signs for different values of weight parameters a_{i}, and these r_{ij} are nullified at certain values, \(a_i^{opt}\), then these \(a_i^{opt}\) values restore the modular insulation condition (Eq. 6) for a wide range of perturbations. Consequently, the resulting connection matrix (r) becomes invariant with respect to the choice of permissible perturbations. The exact set of permissible perturbations at \(\vec a = \vec a^{opt}\), however, may differ, depending on the choice of communicating species.
A model of threetier cascade with positive and negative regulatory feedback connections
Signaling cascades considered above did not have regulatory feedback loops. To explore how the regulatory feedbacks can be distinguished from retroactivity, i.e., solely sequestration induced feedbacks, we next analyzed models of threetier enzymatic cascades with both types of feedback connections. First, we considered cascades with regulatory feedback loops connecting modules, which are not linked by immediate feedforward connections within a waterfall cascade structure. A reaction scheme in Fig. 3b presents a signaling cascade with tiers 1 and 2 that activate their immediate downstream tiers 2 and 3, respectively, and regulatory feedback from tier 3 to tier 1, which can be negative or positive.
For simplicity, we first used a standard MichaelisMenten description of the regulatory feedback.^{35} Assuming noncompetitive activation or inhibition of the reaction of X_{1} phosphorylation by an active form (pX_{3}) of tier 3 enzyme, the reaction rate was multiplied by the following multiplier, (1 + u_{1}[pX_{3}]/k_{9})/(1 + [pX_{3}]/k_{9}).^{36} We can readily see that regulatory feedback from module 3 to module 1 is positive, if u_{1} > 1, and it is negative, if 0 < u_{1} < 1.
We defined the communicating species using Eq. (18), in which an enzymesubstrate complex that bridges two modules is added to a single module output, or using Eq. (19) where these complexes are added to both neighboring modules. Similarly as above, for either selection of communicating species we found that the connection coefficients, r_{12} and r_{23}, changed their sign at certain values \(a_i = a_i^{opt}\), while the connection coefficients, r_{21}, r_{32}, and r_{13}, did not (Fig. 3b, right panel). The same \(a_i^{opt}\) values yielded zero values for the connections r_{12} and r_{23} for both choices of module outputs (see Supplementary material, section 5.1). We conclude that the inferred connections, r_{12} and r_{23}, are merely retroactive and are induced solely by protein sequestration, while the interactions described by connection coefficients, r_{21}, r_{32}, and r_{13}, are regulatory connections. Thus, in case of a regulatory feedback connection between modules, which are not linked by immediate feedforward connections, our approach correctly distinguishes between regulatory and sequestration connections. Sets of permissible parameters for unique network reconstruction include both the total protein abundances and numerous kinetic constants (see Supplementary material, section 5.1).
More complex regulatory feedback mechanisms are found for a threetiered RAF/MEK/ERK cascade, which is evolutionary conserved in eukaryotic cells. Active ERK (module 3) binds to active CRAF and BRAF monomers (module 1) and inhibits their kinase activities by phosphorylation of inhibitory sites.^{37,38} A complete mechanistic description considers homo and hetero CRAF and BRAF dimers and includes numerous reaction steps.^{39} Using a simplified mechanistic description of this regulatory feedback, Supplementary material (section 5) shows that selecting communicating species as weighted sums of the free phosphorylated proteins and intermodular protein complexes, precise discrimination between sequestration and regulatory feedbacks and unique network reconstruction can also be achieved.
We next considered a regulatory feedback loop between two cascade tiers that are connected through immediate feedforward activation (Fig. 3c). Here module 2 activates module 3, whereas module 3 output routes back into module 2, as a regulatory feedback loop. Because module 3 can also retroactively affect module 2 through sequestration, we can ask how effects of sequestration compete with regulatory feedback. To simplify the analysis, we again used a standard MichaelisMenten description of the regulatory feedback. Assuming that an active form (pX_{3}) of the tier 3 enzyme activates or inhibits the formation of the productive complex \(\left[ {pX_1 \cdot \cdot X_2} \right]\) in a noncompetitive manner, the reaction rate of X_{2} activation was multiplied by the following multiplier, (1 + u_{2}[pX_{3}]/k_{10})/(1 + [pX_{3}]/k_{10}).^{36} This regulatory feedback is positive, if u_{2} > 1, and it is negative, if 0 < u_{2} < 1.
As above, we defined communicating species by Eqs. (18) and (19) and monitored the signs of connection coefficients r_{ij} when changing the weight parameters a_{i}. The signs of connection coefficients depend on the signs of the global response coefficients (Eq. 7), which in turn depend on the changes in the concentrations of free active enzymes and enzymesubstrate complexes (components of communicating species) caused by parameter perturbations. Instructively, upon perturbations to the protein abundance \((X_3^{tot})\) of module 3, regulatory and sequestration connections affected the concentrations \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}] in different ways. When \(X_3^{tot}\) is perturbed, regulatory feedback loops decreased or increased both these concentrations together, whereas sequestration (i.e., retroactive) connections changed \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}] in opposite directions. For example, if \(X_3^{tot}\) decreases, negative regulatory feedback increases and positive regulatory feedback decreases both \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}], whereas sequestration effects decrease \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and increase [pX_{2}].
As a result, when both regulatory feedback loop and sequestration feedback connection are present, following perturbations to \(X_3^{tot}\), the concentrations \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}] change either in concert or in opposite ways, depending on the relative strengths of these two feedback interactions. If a regulatory feedback dominates (when u_{2} is greater than a certain threshold value), both \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}] move in the same direction. In this case, the coefficients r_{12} and r_{13} change their sign at certain values \(a_i = a_i^{opt}\), suggesting solely sequestration connections (Fig. 3c, right panel). However, the coefficients, r_{21}, r_{32}, and r_{23}, do not change their signs, indicating regulatory feedforward activation connections (r_{21} and r_{32}) and positive regulatory feedback from module 3 to module 2 (r_{23}). In other words, at \(a_i = a_i^{opt}\) the connection matrix r_{ij} displays zero values for r_{12} and r_{13} and nonzero values for r_{21}, r_{32}, and r_{23} (Fig. 3c, the connection matrix for a strong regulatory feedback, u_{2} = 50.5).
However, when sequestration effects dominate (when u_{2}is smaller than a threshold value), the concentrations \(\left[ {pX_2 \cdot \cdot X_3} \right]\) and [pX_{2}] change in opposite directions upon perturbations to \(X_3^{tot}\). Then, the connection coefficient r_{23} also changes the sign (together with the other sequestration connections, r_{12} and r_{13}) at the \(a_i = a_i^{opt}\) values, whereas the connection coefficients, r_{21} and r_{32}, remain nonzero (Fig. 3c, the connection matrix for a weak regulatory feedback, u_{2} = 1.75).
Supplementary Table 11 illustrates similar results for a negative regulatory feedback. When this feedback dominates, the coefficient r_{23} does not change the sign for different weight parameters a_{i}, suggesting a regulatory feedback, whereas the connections that appear as a result of solely enzyme sequestrations (r_{12} and r_{13}) change their signs. However, when a negative regulatory feedback is weak and enzyme sequestration dominates, the sequestration feedback forces the coefficient r_{23} to change its sign with changes in the weights a_{i}.
Discussion
A computational method, termed Modular Response Analysis (MRA), allows reconstructing direct causative connections in intracellular signaling networks from measured responses of an entire network to systematic perturbations.^{7,28} However, MRA, as any method for solving reverse engineering problems, suffers from several limitations. One weak point is the instability of solutions with respect to noise in the input data.^{9} Fortunately, numerous statistical reformulations of MRA, including Maximum likelihood (ML), Monte CarloML and Bayesian variants of MRA^{4,23,24,25,27,29,30} have successfully addressed this problem for practical applications of MRA to noisy and incomplete data (as a recent review see ref. ^{31}).
The other limitation of MRA is related to enzyme sequestrations in protein modification reactions (also known as retroactivity or intermodular mass transfer).^{7,15,16,40} This problem of mass transfer has been known for a long time and also discussed in the original MRA paper,^{7} yet it still challenges signaling network reconstruction.^{41,42,43} Recently, Prabakaran and colleagues have highlighted this challenge for MRA by inferring surprisingly different network circuitries, depending on which protein abundances in the network were perturbed.^{20}
In the present work, we conclude that findings of different network circuitries using distinct sets of perturbations are explained by the violation of the modular insulation condition (see Eqs. 6 and 11). This key MRA condition is commonly violated when the concentrations of intermodular complexes are of the same order of magnitude as the conserved abundance of a protein participating in an intermodular complex (which is formed, for instance, by an enzyme from one module and its substrate from the other module). Experimentally, the concentration of the intermodular complex and the conserved abundance of a protein in this complex can be compared using coimmunoprecipitation of an enzyme and its substrate and comparing the western blot intensity with the intensity for enzyme or substrate concentration in the leftover lysate, using the same blot. The modular insulation condition^{7} did not hold in the experiments of Prabakaran and colleagues due to considerable sequestration of active MEK (ppMEK) by its substrate ERK, which belongs to a module downstream of the MEK module.^{20} Because enzyme sequestration and resulting retroactivity is often observed in cell signaling networks, it is imperative to extend MRAbased reconstruction methods to networks with protein complexes bridging different modules.
Here we show that additional measurements allow us to computationally restore the modular insulation condition for a range of network topologies, including those used in the experiments of Prabakaran and colleagues.^{20} This permits a unique network reconstruction for different selections of applied perturbations, including all conserved protein abundances and a range of kinetic constants. A key to our approach is an alternative definition of communicating species (solely for computational network reconstruction purposes), as weighted combinations of free active enzymes and enzymesubstrate complexes that bridge different network modules. Provided that global responses to perturbations of free active enzymes and enzymesubstrate complexes can be separately measured, we have computationally reconstructed connection coefficients (r_{ij}) of direct, causative interactions between network modules for different values of weights (a_{i}). We considered kinase cascades (ubiquitous for cell signaling) and first reconstructed cascades with no regulatory feedback loops and cascades where regulatory feedback loops connect modules that are not linked by immediate feedforward connections. We demonstrated that sequestrationinduced connections (i.e. retroactivity) are distinguished from regulatory connections by computationally determining the weight parameter values (termed \(a_i^{opt}\)) that simultaneously nullify all sequestrationinduced connections. We also showed that at these \(a_i^{opt}\) values the modular insulation condition (Eq. 6) is restored. Thus, for these network circuitries, we were able to uniquely reconstruct networks and reveal the mechanistic nature of direct, causative connections.
Current biochemical techniques allow us to measure the concentrations of both free active enzymes and intermodular protein complexes. For instance, coimmunoprecipitation of an enzyme and its substrate from the other module will determine the concentration of an intermodular protein complex, whereas immunoprecipitation using an antibody against phosphorylated enzyme will determine the active enzyme concentration. Importantly, only the relative concentration changes are detected using Western blotting, but our approach performs equally well when the input data are relative changes in the concentration of proteins and protein complexes. This is explained by the fact that both global responses and local connection coefficients can be determined in terms of either absolute changes^{7,28} or relative changes, defined by the logarithmic derivatives in Eqs. (4) and (5).^{7} Therefore, the perturbationinduced global responses of module outputs can be readily analyzed in terms of the relative changes in the concentrations of proteins and protein complexes (see Supplementary material, section 3).
When cascade modules are connected by both regulatory and sequestration feedbacks, MRA infers a dominant feedback. In particular, a regulatory feedback will manifest itself if its strength exceeds a certain threshold, whereas for weaker feedback strengths only a sequestration feedback will be revealed. For different and more complex network topologies this approach may also have a limited applicability, requiring more prior information. For instance, for inhibitory cascades additional knowledge about which of two proteins in an intermodular complex is an enzyme and which is a substrate is required (see Supplementary material, section 6). Importantly, this knowledge can be obtained from consensus phosphorylation sequences for many kinases and enzymesubstrate databases.^{44,45,46,47,48} For signaling networks where a module operates as a hub activating several downstream modules (see Supplementary material, sections 7 and 8), our approach is capable of inferring retroactivity feedback loops. However, for these networks, not all sequestrationinduced connections can be nullified at a single set of \(a_i = a_i^{opt}\), because they change their signs at the alternative weight coefficient sets. As a result, the inferred connections can differ for alternative perturbations. Sections 7 and 8 in Supplementary material demonstrate that for different sets of perturbation parameters, sequestration connections exhibit the greatest variability. Therefore, minimization of the sum of squares of sequestrationinduced connections minimizes the discrepancy between the inferred connection matrices, improving the network inference quality. Importantly, all inferred regulatory connections are qualitatively similar, when the sum of squares of the sequestrationinduced connection coefficients is minimized (see Supplementary material, sections 7 and 8). Because a network reconstruction process can be concurrently impaired by both protein sequestration effects and noise, we have also checked that the use of the new communicating species (module outputs) suggested by our approach does not significantly decrease accuracy and precision of MRAbased network reconstruction^{49} (see Supplementary material, section 9).
Crosstalk between pathways often operates as feedforward and feedback regulatory loops mediated by protein (de)phosphorylation.^{50} Yet, these regulatory interactions are not the only biological mechanisms of pathway crosstalk. Protein sequestration in competing proteinprotein interactions is a key mechanism that regulates crosstalk between the Hippo and RAS/RAF/MEK/ERK pathways.^{51,52} In this and similar cases, our approach correctly identifies sequestration connections, which also play regulatory roles. Yet, similarly as for a hub network topology, sequestrationinduced connections cannot be nullified at a single set of weight parameters, \(a_i = a_i^{opt}\). Consequently, the selection of module outputs, which we have computationally explored, does not restore the modular insulation condition.
In summary, our approach significantly extends MRAbased methods to cover a range signaling networks with considerable reactivity interactions between modules. At the cost of additional measurements, this approach computationally restores the modular insulation condition and permits unmistakable network reconstruction for a range of signaling motifs and experimental perturbations.
Methods
All numerical simulations were carried out in R^{53} using the package dMod^{54} and its dependencies and custom functions. The magnitude of parameter perturbations used in calculations was 10% expect in the calculations that explored noise in the data (Section 9 in Supplementary material), where the perturbation magnitude was 50%. Plots were generated with the package ggplot2^{55} which is part of the collection of packages called tidyverse. The scripts (file “Code.tar.gz”) are available as supplementary information for numerical results in the main text. Analytical calculations were partly done using Mathematica^{56} and Sage^{57} software packages.
Data availability
All data generated or analysed during this study are included in this published article (and its supplementary information files).
Code availability
The comprehensive set of scripts written in R (file “Code.tar.gz”) is available for download as supplementary information.
References
de la Fuente, A., Bing, N., Hoeschele, I. & Mendes, P. Discovery of meaningful associations in genomic data using partial correlation coefficients. Bioinformatics 20, 3565–3574 (2004).
Feizi, S., Marbach, D., Medard, M. & Kellis, M. Network deconvolution as a general method to distinguish direct dependencies in networks. Nat. Biotechnol. 31, 726–733 (2013).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Halasz, M., Kholodenko, B. N., Kolch, W. & Santra, T. Integrating network reconstruction with mechanistic modeling to predict cancer therapies. Sci. Signal. 9, ra114 (2016).
Meinshausen, N. et al. Methods for causal inference from gene perturbation experiments and validation. Proc. Natl Acad. Sci. USA 113, 7361–7368 (2016).
Papoutsoglou, G. et al. SCENERY: a web application for (causal) network reconstruction from cytometry data. Nucleic Acids Res. 45, W270–W275 (2017).
Kholodenko, B. N. et al. Untangling the wires: a strategy to trace functional interactions in signaling and gene networks. Proc. Natl Acad. Sci. USA 99, 12841–12846 (2002).
Sontag, E., Kiyatkin, A. & Kholodenko, B. N. Inferring dynamic architecture of cellular networks using time series of gene expression, protein and metabolite data. Bioinformatics 20, 1877–1886 (2004).
Andrec, M., Kholodenko, B. N., Levy, R. M. & Sontag, E. Inference of signaling and gene regulatory networks by steadystate perturbation experiments: structure and accuracy. J. Theor. Biol. 232, 427–441 (2005).
Yalamanchili, N. et al. Quantifying gene network connectivity in silico: scalability and accuracy of a modular approach. Syst. Biol. (Stevenage) 153, 236–246 (2006).
Kholodenko, B. N. Untangling the signalling wires. Nat. Cell Biol. 9, 247–249 (2007).
Bruggeman, F. J., Westerhoff, H. V., Hoek, J. B. & Kholodenko, B. N. Modular response analysis of cellular regulatory networks. J. Theor. Biol. 218, 507–520 (2002).
Kholodenko, B. N., Hoek, J. B., Westerhoff, H. V. & Brown, G. C. Quantification of information transfer via cellular signal transduction pathways [published erratum appears in FEBS Lett 1997 Dec 8;419(1):150]. FEBS Lett. 414, 430–434 (1997).
SaezRodriguez, J., Kremling, A. & Gilles, E. D. Dissecting the puzzle of life: modularization of signal transduction networks. Comput. Chem. Eng. 29, 619–629 (2005).
Conradi, C., SaezRodriguez, J., Gilles, E. D. & Raisch, J. Using chemical reaction network theory to discard a kinetic mechanism hypothesis. Syst. Biol. (Stevenage) 152, 243–248 (2005).
Del Vecchio, D., Ninfa, A. J. & Sontag, E. D. Modular cell biology: retroactivity and insulation. Mol. Syst. Biol. 4, 161 (2008).
Sontag, E.D. Modularity, in Design and Analysis of Biomolecular Circuits: Engineering Approaches to Systems and Synthetic Biology (eds. Koeppl, H., Setti, G., di Bernardo, M. & Densmore, D.) 183–200 (Springer New York, New York, NY, 2011).
Bluthgen, N. et al. Effects of sequestration on signal transduction cascades. FEBS J. 273, 895–906 (2006).
Tikhonov, A.N. Solutions of illposed problems/Andrey N. Tikhonov and Vasiliy Y. Arsenin. Translation editor, Fritz John, (Winston; distributed solely by Halsted Press, Washington, New York, 1977).
Prabakaran, S., Gunawardena, J. & Sontag, E. Paradoxical results in perturbationbased signaling network reconstruction. Biophys. J. 106, 2720–2728 (2014).
de la Fuente, A., Brazhnik, P. & Mendes, P. Linking the genes: inferring quantitative gene networks from microarray data. Trends Genet. 18, 395–398 (2002).
Stark, J., Callard, R. & Hubank, M. From the top down: towards a predictive biology of signalling networks. Trends Biotechnol. 21, 290–293 (2003).
Santos, S. D., Verveer, P. J. & Bastiaens, P. I. Growth factorinduced MAPK network topology shapes Erk response determining PC12 cell fate. Nat. Cell Biol. 9, 324–330 (2007).
StelniecKlotz, I. et al. Reverse engineering a hierarchical regulatory network downstream of oncogenic KRAS. Mol. Syst. Biol. 8, 601 (2012).
Klinger, B. et al. Network quantification of EGFR signaling unveils potential for targeted combination therapy. Mol. Syst. Biol. 9, 673 (2013).
Kholodenko, B., Yaffe, M. B. & Kolch, W. Computational approaches for analyzing information flow in biological networks. Sci. Signal 5, re1 (2012).
Santra, T., Kolch, W. & Kholodenko, B. N. Integrating Bayesian variable selection with Modular Response Analysis to infer biochemical network topology. BMC Syst. Biol. 7, 57 (2013).
Bastiaens, P. et al. Silence on the relevant literature and errors in implementation. Nat. Biotechnol. 33, 336–339 (2015).
Gong, C., Zhang, Y., Shankaran, H. & Resat, H. Integrated analysis reveals that STAT3 is central to the crosstalk between HER/ErbB receptor signaling pathways in human mammary epithelial cells. Mol. Biosyst. 11, 146–158 (2015).
Speth, Z., Islam, T., Banerjee, K. & Resat, H. EGFR signaling pathways are wired differently in normal 184A1L5 human mammary epithelial and MDAMB231 breast cancer cells. J. Cell Commun. Signal. 11, 341–356 (2017).
Santra, T., Rukhlenko, O., Zhernovkov, V. & Kholodenko, B. N. Reconstructing static and dynamic models of signaling pathways using Modular Response Analysis. Curr. Opin. Syst. Biol. 9, 11–21 (2018).
Dorel, M. et al. Modelling signalling networks from perturbation data. Bioinformatics 34, 4079–4086 (2018).
Lun, X. K. et al. Influence of node abundance on signaling network state and dynamics analyzed by mass cytometry. Nat. Biotechnol. 35, 164–172 (2017).
Fey, D. et al. Signaling pathway models as biomarkers: patientspecific simulations of JNK activity predict the survival of neuroblastoma patients. Sci. Signal. 8, ra130 (2015).
Kholodenko, B. N. Cellsignalling dynamics in time and space. Nat. Rev. Mol. Cell Biol. 7, 165–176 (2006).
Tsyganov, M. A., Kolch, W. & Kholodenko, B. N. The topology design principles that determine the spatiotemporal dynamics of Gprotein cascades. Mol. Biosyst. 8, 730–743 (2012).
Ritt, D. A., Monson, D. M., Specht, S. I. & Morrison, D. K. Impact of feedback phosphorylation and Raf heterodimerization on normal and mutant BRaf signaling. Mol. Cell Biol. 30, 806–819 (2010).
Dougherty, M. K. et al. Regulation of Raf1 by direct feedback phosphorylation. Mol. Cell 17, 215–224 (2005).
Rukhlenko, O. S. et al. Dissecting RAF inhibitor resistance by structurebased modeling reveals ways to overcome oncogenic RAS signaling. Cell Syst. 7, 161–179.e14 (2018).
SaezRodriguez, J., Gayer, S., Ginkel, M. & Gilles, E. D. Automatic decomposition of kinetic models of signaling networks minimizing the retroactivity among modules. Bioinformatics 24, i213–i219 (2008).
SaezRodriguez, J., Conzelmann, H., Ederer, M. & Gilles, E.D. in Design and Analysis of Biomolecular Circuits: Engineering Approaches to Systems and Synthetic Biology (eds. Koeppl, H., Setti, G., di Bernardo, M. & Densmore, D.) 139–159 (Springer New York, New York, NY, 2011).
Sontag, E.D. Remarks on structural identification, modularity, and retroactivity. In Proc. 49th IEEE Conference on Decision and Control (CDC) 3373–3378 (IEEE, Atlanta, GA, USA, 2010).
PantojaHernández, L. & MartínezGarcía, J. C. Retroactivity in the context of modularly structured biomolecular systems. Front. Bioeng. Biotechnol. 3, 85 (2015).
Wiredja, D. D., Chance, M. R. & Koyutürk, M. The KSEA App: a webbased tool for kinase activity inference from quantitative phosphoproteomics. Bioinformatics 33, 3489–3491 (2017).
Lachmann, A. & Ma’ayan, A. KEA: kinase enrichment analysis. Bioinformatics 25, 684–686 (2009).
Murray, B. et al. PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined posttranslational modifications in man and mouse. Nucleic Acids Res. 40, D261–D270 (2011).
Duan, G., Li, X. & Köhn, M. The human DEPhOsphorylation database DEPOD: a 2015 update. Nucleic Acids Res. 43, D531–D535 (2014).
Pasculescu, A. et al. NetworKIN: a resource for exploring cellular phosphorylation networks. Nucleic Acids Res. 36, D695–D699 (2007).
Thomaseth, C. et al. Impact of measurement noise, experimental design, and estimation methods on Modular Response Analysis based network reconstruction. Sci. Rep. 8, 16217 (2018).
Volinsky, N. & Kholodenko, B.N. Complexity of Receptor Tyrosine Kinase Signal Processing. Cold Spring Harb. Perspect. Biol. 5, 009043 (2013).
Romano, D. et al. Protein interaction switches coordinate Raf1 and MST2/Hippo signalling. Nat. Cell Biol. 16, 673–684 (2014).
Nguyen, L. K., Matallanas, D. G., Romano, D., Kholodenko, B. N. & Kolch, W. Competing to coordinate cell fate decisions: the MST2Raf1 signaling device. Cell Cycle 14, 189–199 (2015).
Team, R.C. R: A language and environment for statistical computing. (2018).
Kaschek, D., Rosenblatt, M., Mader, W., FehlingKaschek, M. & Timmer, J. Dynamic modeling, parameter estimation and uncertainty analysis in R. J. Stat. Softw. 88, 10 (2019).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (SpringerVerlag, New York, 2016).
Wolfram Research, I. Mathematica, Version 11.3 (2018).
Stein, W.A. et al. Sage Mathematics Software Version 7.3, The Sage Development Team https://www.sagemath.org (2016).
Acknowledgements
We thank Walter Kolch for critical reading of the manuscript and suggested changes. This work was supported by German Federal Ministry of Education and Research grant LiSyM (BMBF 031L0048) and EU Horizon 2020 grants MSCAIF SAMNets (grant no. 750688), SmartNanoTox (grant no. 686098) and NanoCommons (grant no. 731032).
Author information
Authors and Affiliations
Contributions
B.N.K., O.S.R., and J.T. conceived the study. D.L., O.S.R., A.J.M.E., and E.K. developed algorithm and conducted numerical simulations. B.N.K., O.S.R., and D.L. wrote the manuscript. D.L. and O.S.R. contributed equally. All authors read, corrected, and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
41540_2019_96_MOESM4_ESM.zip
Sage worksheet that illustrates restoration of modular insulation condition for 2tier inhibitory cascade (Supplemental material, section 6)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lill, D., Rukhlenko, O.S., Mc Elwee, A.J. et al. Mapping connections in signaling networks with ambiguous modularity. npj Syst Biol Appl 5, 19 (2019). https://doi.org/10.1038/s4154001900961
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s4154001900961
This article is cited by

Network inference from perturbation time course data
npj Systems Biology and Applications (2022)

An R package for generic modular response analysis and its application to estrogen and retinoic acid receptor crosstalk
Scientific Reports (2021)