Introduction

A biochemical reaction network is a key concept for understanding how higher-order functions in the cell emerge from relatively simple individual elements, such as proteins and metabolites. Reaction network systems are often nonlinear and complex and can display various dynamic behaviors, such as ultrasensitivity, bistability, and oscillation1,2,3,4,5,6, that form the basis of diverse cellular phenotypes. Because of this complexity, in silico analysis based on mathematical modeling and numerical simulation is an essential strategy for quantitatively understanding a system of interest. Mathematical analysis can help abstract away the nonessential particulars of individual biological systems and identify the core principles that govern the behavior and function of the system in the cell. Using these approaches, various studies have revealed relationships between the behavior of a system and its underlying mechanisms, including feedback/feedforward loops, cross-talk, compartmentalization, and noise7,8,9,10,11,12.

There are at least two distinct stages of in silico network analysis. The first is construction of a mathematical model that describes the system, and the second is analysis of the model. Although the second stage depends strongly on the aim of the study, a mathematical model is required regardless of its details. Typically, modeling of a target system is performed in a patchwork manner, meaning that fragments of studies on specific reactions are integrated to construct a map of the reaction network13,14,15,16. Although this procedure is straightforward, selecting the sources for each reaction that constitutes the network is a non-trivial task that might raise concerns regarding the validity of the modeling. Alternatively, a data-driven approach that incorporates as few assumptions as possible when inferring a network model can compensate for the shortcomings of patchwork modeling.

Data-driven inference of biochemical network models has been studied extensively17,18,19,20, and both genome-wide networks and cell-specific gene regulatory and posttranslational modification networks have been systematically reconstructed21,22. Additionally, although the regulatory relationships in inferred networks often represent only linear or binary correlations among nodes, efforts are underway to identify nonlinear ordinary differential equation (ODE) systems23,24,25,26. However, the intersection between systematic model inference and network modeling with nonlinear ODEs has received less attention27,28. Therefore, a framework that enables data-driven modeling of a network while accounting for the nonlinearity of the system is needed. Furthermore, recent advances in experimental methods have made highly quantitative, time-resolved data available at the single-cell level29, making it desirable for such a framework to handle single-cell datasets.

To address these problems, we developed a method combining an expectation-maximization (EM) algorithm with a particle smoother and sparse regularization. Using this method, we showed that an oscillatory network model can be systematically inferred based only on single-cell time-course data. Briefly, our strategy is as follows (Fig. 1): (1) quantitatively measure components of the network and obtain a single-cell dataset, (2) prepare a “redundant” model where an excessive number of reaction paths and nodes are defined using nonlinear ODEs, and (3) perform model learning using the dataset while eliminating unnecessary paths in the redundant model to identify the most probable model. We evaluated the performance of the method using artificial time-course data and showed that the algorithm accurately inferred the true network model in a data-driven manner.

Figure 1

Schematic representation of the proposed method for data-driven inference of biochemical network models. Details of each step are described in the text.

Results

Maximum likelihood parameter estimation in a biochemical network model

We introduced the following nonlinear state space model:

$$\begin{array}{rcl}{{\boldsymbol{x}}}_{t} & = & {\boldsymbol{f}}({{\boldsymbol{x}}}_{t-1})+{{\boldsymbol{v}}}_{t}\\ {{\boldsymbol{y}}}_{t} & = & {\boldsymbol{h}}({{\boldsymbol{x}}}_{t})+{{\boldsymbol{w}}}_{t}\end{array}$$
(1)

where x and y denote state variables (e.g., amounts of mRNA and protein) and measurements (e.g., fluorescence intensity), respectively. Function f describes the evolution of the system and can be calculated as \({\boldsymbol{f}}({{\boldsymbol{x}}}_{t-1})={{\boldsymbol{x}}}_{t-1}+{\int }_{t-1}^{t}{\boldsymbol{g}}({{\boldsymbol{x}}}_{\tau },{{\boldsymbol{\theta }}}_{sys})d\tau \), where, in general, g represents the ODEs that describe the biochemical reaction network of interest, and \({{\boldsymbol{\theta }}}_{sys}\) denotes the model parameters. Function h represents the process of measuring x. Vectors \({{\boldsymbol{v}}}_{t}\) and \({{\boldsymbol{w}}}_{t}\) denote system noise and measurement noise, respectively, which we assumed to follow Gaussian distributions. Given dataset \(Y=\{{Y}^{(a)}\}(a=1,\ldots ,A)\), where a is an index of each cell and \({Y}^{(a)}\) represents the single-cell time-course data, estimation of θ, the set of parameters that characterize the state space model, can be accomplished by maximizing the log-likelihood of the model. Here, we employed an EM algorithm to find the maximum likelihood estimates of θ. Note that the expectation step of the algorithm is analytically intractable, because it requires the probability distribution of the state time course at all time points. Therefore, we numerically approximated this probability distribution using a particle smoother algorithm30 (Materials and Methods). We refer to this algorithm31,32 as the EM-PS (particle smoother) algorithm.
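To make the notation concrete, Eq. (1) can be simulated for a single cell as in the minimal sketch below, assuming the linear observation h(x) = x described in Materials and Methods: the transition function f integrates a user-supplied ODE right-hand side g over one sampling interval, and Gaussian system and measurement noise are added at each step. All names are illustrative; this is not the implementation used in this study.

```python
import numpy as np
from scipy.integrate import solve_ivp

def f(x_prev, g, theta_sys, dt=1.0):
    """One-step state transition: integrate dx/dt = g(x, theta_sys) over one sampling interval."""
    sol = solve_ivp(lambda t, x: g(x, theta_sys), (0.0, dt), x_prev, rtol=1e-6)
    return sol.y[:, -1]

def simulate_cell(x0, g, theta_sys, sigma, eta, T, rng):
    """Draw one trajectory (x_t, y_t) from the state space model of Eq. (1),
    with h(x) = x and independent Gaussian system/measurement noise."""
    k = len(x0)
    x = np.empty((T + 1, k))
    y = np.empty((T + 1, k))
    x[0] = np.asarray(x0, dtype=float)
    y[0] = x[0] + eta * rng.standard_normal(k)
    for t in range(1, T + 1):
        x[t] = f(x[t - 1], g, theta_sys) + sigma * rng.standard_normal(k)  # system noise v_t
        y[t] = x[t] + eta * rng.standard_normal(k)                          # measurement noise w_t
    return x, y
```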

Next, we tested whether the EM-PS algorithm could provide correct estimates in a given model using artificial time-course data. To generate artificial data, we constructed a gene regulatory network in silico that consisted of three genes (X, Y, and Z) and a negative feedback loop (Fig. 2a). The network produced an oscillatory expression pattern with appropriate parameters. We used the Hill function to express reactions involving either activator or repressor molecules, because the activity of such regulators is often nonlinear (Supplementary Information). For simplicity, we used first-order kinetics for the degradation process. To mimic a realistic biological experiment in which cell-to-cell variability and observation noise exist, we numerically solved the model as nonlinear stochastic Langevin equations and added Gaussian noise as observation error to each value to generate artificial single-cell time-course data (Fig. 2b).
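As an illustration of the kind of ODE right-hand side used here (the exact equations and parameter values are given in the Supplementary Information), a three-gene loop of the type shown in Fig. 2a could be written as follows. The parameterization and values below are placeholders, not those of this study.

```python
import numpy as np

def g_three_gene(x, theta):
    """dx/dt for a three-gene negative feedback loop (X activates Y, Y activates Z,
    Z represses X), with Hill-type production and first-order degradation.
    Illustrative form only; see the Supplementary Information for the actual model."""
    X, Y, Z = x
    a, K_xy, K_yz, K_zx, n, d = theta
    dX = a / (1.0 + (K_zx * Z) ** n) - d * X                      # production of X repressed by Z
    dY = a * (K_xy * X) ** n / (1.0 + (K_xy * X) ** n) - d * Y    # production of Y activated by X
    dZ = a * (K_yz * Y) ** n / (1.0 + (K_yz * Y) ** n) - d * Z    # production of Z activated by Y
    return np.array([dX, dY, dZ])

# Placeholder parameters: production scale, three association constants, Hill coefficient, degradation rate
theta_example = (2.0, 1.0, 1.0, 1.0, 3.0, 0.2)
```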

Figure 2

Maximum likelihood estimation of model parameters using the EM-PS algorithm. (a) Schematic of a three-component negative feedback oscillator model. (b) Artificial measurement data were generated by numerically solving the model as nonlinear stochastic Langevin equations, followed by addition of Gaussian noise to each value to simulate the measurement process. The dataset consists of 10 independent time-course data, with two examples (#1 and #2) shown. (c) Iterative estimation of the states using the EM-PS algorithm. Each dot represents the (artificial) measurement data, and lines denote the trajectories sampled by the particle smoother. (d) Log-likelihood values plotted as a function of the iteration number. Note that small fluctuations were observed even after convergence because of the stochastic nature of the EM-PS algorithm. (e) The difference between the estimated and correct parameter values, shown as a ratio. We tested two different sets of initial parameter values: one was 1/100 of the correct values (Initial #1), and the other was randomly generated in the range of 1/30× to 30× the correct values (Initial #2).

Using the artificial data and EM-PS algorithm, we conducted maximum likelihood estimation of the model parameters. We observed a monotonic increase in the log-likelihood during iterations of the algorithm, and eventually the estimated states were consistent with the data (Figs 2c and d and S1). Note that small fluctuations were observed, even after convergence due to the stochastic nature of the particle smoother algorithm implemented in the EM-PS algorithm. Differences between the correct and estimated values were <11%, even though the dataset contained significant cell-to-cell variability. Additionally, the algorithm was robust over a wide range of initial values (Fig. 2e; see Initial #1 and #2). Interestingly, the dynamics of parameter convergence varied among parameters and were not always monotonic (Supplementary Fig. S2), indicating that the likelihood function had a complicated landscape in the parameter space. These results revealed that the EM-PS algorithm represented a powerful approach for parameter estimation in nonlinear biochemical network models.

Inferring network topology using a sparse regularized EM-PS algorithm

Next, we extended the algorithm to infer not only parameter values but also network topology. We focused on the fact that biochemical networks are sparse33,34,35, which means that the number of regulatory paths is much smaller than the number of possible links between nodes. To utilize the sparsity of biochemical networks for inference35,36,37, we introduced a regularization term referred to as the least absolute shrinkage and selection operator (Lasso), which is a simple yet powerful technique that provides a sparse solution38. In our strategy, we first prepared a “redundant” model consisting of an excessive number of regulatory paths among genes, followed by elimination of less important paths by Lasso.

To construct the redundant model, we used the Hill function, because it can express both a linear and nonlinear reaction depending on parameters K and n, which denote the (apparent) association constant [reciprocal of the (apparent) dissociation constant] and Hill coefficient, respectively. For example, the activity of transcription activator, A, or repressor, R, was expressed as \({c}_{A}={(K[A])}^{n}/(1+{(K[A])}^{n}),\,{c}_{R}=1/(1+{(K[R])}^{n})\). Assuming a common situation in which regulators function independently39, overall gene expression that is regulated by virtually any gene in the system (redundant model) can be written as follows:

$${\rm{production}}\,{\rm{rate}}=(\sum _{i}\,{a}_{i}\frac{{({K}_{i}[{X}_{i}])}^{{n}_{i}}}{1+{({K}_{i}[{X}_{i}])}^{{n}_{i}}})\cdot \prod _{j}\frac{1}{1+{({K}_{-j}[{X}_{j}])}^{{n}_{j}}}$$
(2)

where i and j represent the indices of the activators and repressors, respectively. We exploited the fact that a path effectively does not exist [\({c}_{A}=0\) (no activator activity) or \({c}_{R}=1\) (no repressor activity)] when parameter K = 0; that is, a zero or nonzero association constant characterizes the absence or presence of the corresponding regulatory path in the model. With this notation, the condition that biochemical networks are sparse is equivalent to most association constants (K) in the redundant model being equal to zero. Therefore, the association constants were subjected to regularization, which virtually removes the less important paths from the network. Accordingly, we rewrote the equations for the EM steps as follows:

$$\begin{array}{rcl}Q^{\prime} ({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})}) & = & Q({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})})-\,\lambda \sum _{s}|{K}_{s}|\\ {{\boldsymbol{\theta }}}^{({\rm{new}})} & = & {\rm{\arg }}\,\mathop{{\rm{\max }}}\limits_{{\boldsymbol{\theta }}}Q^{\prime} ({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})})\end{array}$$
(3)

where s represents the index of the association constant, and λ denotes the strength of the regularization term (details are provided in Materials and Methods). We referred to this algorithm as the EM-PS-Lasso algorithm.
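The two ingredients above can be sketched as follows: a production-rate function in the redundant form of Eq. (2), where setting an association constant to zero silences the corresponding path, and a Lasso-penalized M step corresponding to Eq. (3). This is a sketch under assumptions (a generic, user-supplied Q function; illustrative names), not the implementation used in this study. Note that with the non-negativity constraint on parameters used here (Materials and Methods), |K_s| reduces to K_s, so the penalized objective remains amenable to a smooth bound-constrained optimizer.

```python
import numpy as np
from scipy.optimize import minimize

def production_rate(x, a, K_act, n_act, K_rep, n_rep):
    """Redundant production rate of one gene, Eq. (2): a sum of Hill activation terms
    multiplied by a product of Hill repression terms. A zero association constant
    removes the corresponding regulatory path."""
    act = sum(a_i * (K * xi) ** n / (1.0 + (K * xi) ** n)
              for a_i, K, xi, n in zip(a, K_act, x, n_act))
    rep = np.prod([1.0 / (1.0 + (K * xi) ** n) for K, xi, n in zip(K_rep, x, n_rep)])
    return act * rep

def m_step_lasso(Q, theta_old, theta_init, K_indices, lam, bounds):
    """Lasso-penalized M step, Eq. (3): maximize Q(theta, theta_old) - lam * sum_s |K_s|.
    Q is a user-supplied function of theta; K_indices selects the association constants."""
    def neg_objective(theta):
        penalty = lam * np.sum(np.abs(theta[K_indices]))
        return -(Q(theta, theta_old) - penalty)
    res = minimize(neg_objective, theta_init, method="L-BFGS-B", bounds=bounds)
    return res.x
```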

We then tested the performance of the EM-PS-Lasso algorithm using the artificial data (Fig. 2b). We assumed a situation in which we had time-course data for genes X, Y, and Z but no prior knowledge of their regulatory relationships. Therefore, we constructed a model incorporating all possible regulatory paths among the three genes (18 paths) (Fig. 3 and Supplementary Information). Using the redundant model and the artificial single-cell time-course data, we conducted network inference and parameter estimation with the EM-PS-Lasso algorithm. Because the algorithm requires the parameter λ, which controls the strength of the penalty term, we evaluated the log-likelihood of the model as a function of λ (Figs 4a and S3). We also examined the log-likelihood on unseen test data and confirmed that the estimation did not suffer from overfitting. As expected, too large a value of λ resulted in a failure to fit the data, because all parameters were estimated to be zero (Supplementary Fig. S4). Values of λ from 0.1 to 10 yielded high log-likelihood values, implying potentially good inference; however, too small a value of λ (λ = 0.1, 1) resulted in inference of overly redundant and biologically inconsistent models (e.g., gene Z simultaneously autoactivating and autorepressing itself) (Supplementary Fig. S4). We therefore rejected these models (Supplementary Fig. S4). Consequently, the results at λ = 3 and 10 were systematically selected as candidates for the inferred model.

Figure 3

Schematic of the redundant model. We assumed no prior knowledge regarding the regulatory relationships in the network. Therefore, the model consists of an excessive number of regulatory paths among genes. The numbers shown in the network scheme represent the index of each reaction path.

Figure 4

Inferring the network model via the EM-PS-Lasso algorithm. (a) The models were inferred using the EM-PS-Lasso algorithm, with different values for the regularization parameter, λ, and using the artificial data and redundant model. Log-likelihood values at iteration number 100 were plotted as a function of λ. (b) Relationship between the number of effective paths in the inferred models and λ.

At λ = 3, the estimated states based on the inferred model were consistent with the data (Fig. 5a). In the inferred model, three of the 18 association constants had nonzero values, indicating that only these three regulatory paths were crucial for reproducing the data (Fig. 5b). The paths consisted of activation of gene Y by gene X, activation of gene Z by gene Y, and repression of gene X by gene Z, which is equivalent to the true network (Fig. 2a). Removal of paths from the redundant model during iterations of the algorithm occurred over several steps rather than at a single step (Fig. 5c and d). We also confirmed that model parameters other than the association constants, such as degradation rate constants and Hill coefficients, were successfully estimated (Fig. 5e). The same network model was inferred at λ = 10 (Supplementary Fig. S5), although the dynamics of path removal from the redundant model differed markedly from those at λ = 3. Overall, we demonstrated that the EM-PS-Lasso algorithm enabled both estimation of model parameters and inference of network topology. Furthermore, our results indicated that rich information regarding network topology is embedded in single-cell time-course data, even when the data are highly dynamic, nonlinear, and heterogeneous.

Figure 5

Data-driven inference of a three-component oscillator model. (a) Estimated states after 100 iterations of the algorithm with λ = 3. Each dot represents artificial data (Fig. 2b), and lines indicate the estimated trajectories. (b) Values of the association constants after 100 iterations of the algorithm with λ = 3. Each parameter index corresponds to the reaction number in Fig. 3. (c) Values of the association constants in the model plotted as a function of the iteration number. (d) Schematic representation of the inferred network. Red arrows represent effective paths whose association constants have nonzero values, whereas light-gray arrows are paths with no regulatory activity, because their association constants are zero. (e) The difference between the estimated and correct parameter values, shown as a ratio.

In general, the number of effective paths with nonzero association constants decreased as λ increased (Fig. 4b). Note that there was an apparent increase in the number of estimated paths at λ = 30; however, most of these paths had nonzero but extremely small association constants and had practically no effect on system behavior. This issue could be overcome by defining a threshold for the parameter value and/or for the degree of response to parameter changes (i.e., sensitivity analysis), as sketched below.
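A minimal form of such post-processing is shown below; the threshold value is an arbitrary illustration rather than one used in this study.

```python
import numpy as np

def effective_paths(K, threshold=1e-3):
    """Return the indices of regulatory paths whose (non-negative) association
    constants exceed a user-chosen threshold; the default cutoff is illustrative."""
    return np.flatnonzero(np.asarray(K) > threshold)
```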

Inferring the number of components in the network

In the preceding analysis, we assumed that the number of genes constituting the network was known, whereas their regulatory relationships were unknown. However, it is more common that neither is known. Therefore, we examined whether the algorithm could infer both the number of components and the network topology. We again generated an artificial dataset, this time using a network consisting of two genes (Fig. 6a and Supplementary Information) that shows oscillatory dynamics with appropriate parameters. We also prepared a redundant model (equivalent to that in Fig. 3) consisting of three components rather than two, because we assumed no prior knowledge regarding the number of components in the network. Using the artificial data and the redundant model, we performed model inference of the gene regulatory network via the EM-PS-Lasso algorithm and evaluated the log-likelihood of the inferred models, finding that the model with λ = 15 showed the highest log-likelihood value (Fig. 6b) and was consistent with the data (Fig. 6c). Next, we evaluated the values of the association constants for all regulatory paths in the redundant model. The paths in the redundant model were removed in several steps during iterations of the algorithm (Fig. 6d and e), with three of the 18 association constants eventually found to have nonzero values (Fig. 6f). The paths remaining in the model described autoactivation of gene X, activation of gene Z by gene X, and repression of gene X by gene Z. All regulatory paths related to gene Y had no activity, indicating the absence of gene Y from the network model. Therefore, the inferred model practically consisted of two genes and three regulatory paths (Fig. 6e, right) and was completely equivalent to the true network (Fig. 6a). Overall, these results revealed that the EM-PS-Lasso algorithm was able to infer not only the regulatory paths but also the number of components in the model.

Figure 6

Inferring the number of components in the network. (a) Schematic of a two-component oscillator model. (b) Models were inferred using the EM-PS-Lasso algorithm with different values of λ. Log-likelihood values of the models at iteration number 100 are shown as a function of λ. (c) Consistency of the estimated states with the data. Each dot represents artificial data generated from the two-component oscillatory model. Lines denote trajectories sampled from the inferred model. (d) Values of the association constant after 100 iterations of the algorithm. Each parameter index corresponds to the reaction number (Fig. 3). (e) Values of association constants in the model plotted as a function of the iteration number of the EM-PS-Lasso algorithm. (f) Schematic representation of the inferred network. The red and light-gray arrows represent the effective paths and eliminated paths, respectively. Note that gene Y is not involved in system behavior, because all paths related to gene Y had no regulatory activity after iteration number 27.

Discussion

The concept of data-driven inference and analysis of biochemical networks has gained attention in computational systems biology and biophysics. However, this remains a difficult task because of the highly nonlinear nature of biological systems. Here, we proposed an EM algorithm-based method combining a particle smoother and sparse regularization to enable data-driven, systematic inference of nonlinear biochemical network models. Our method was successfully applied to construct mathematical models showing oscillations, one of the stereotypical nonlinear behaviors. Furthermore, because the elemental reaction in our modeling is described by a Hill function, which is commonly used to express various types of biochemical reactions, our method can be directly applied to a wide range of networks, including transcriptional control, signal transduction, and metabolic regulation.

In this study, we exploited the fact that a regulatory path becomes negligible when the association constant in the Hill function describing the path is equal to zero. Penalizing the association constants using Lasso eliminated unnecessary paths in the redundant model and enabled inference of the network topology. The proposed algorithm might also be useful when the model is described by other schemes, such as mass action kinetics, because the association rate constant plays an analogous role: a reaction is negligible when the association rate constant (k_on) in the mass action kinetics is estimated to be zero. Therefore, the algorithm should be applicable to mass action-based models with only a slight modification, in which the association rate constant, rather than the association constant, is subjected to regularization.
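For a generic bimolecular binding step (shown here only to illustrate the point, not as a model from this study), the mass action description reads

$$\frac{d[AB]}{dt}={k}_{{\rm{on}}}[A][B]-{k}_{{\rm{off}}}[AB]$$

so that an estimate of k_on = 0 removes the complex-formation path, in direct analogy to K = 0 removing a Hill-type path in Eq. (2).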

Although Lasso is a simple yet powerful technique for obtaining a sparse solution, other methods of sparse regularization exist. For example, automatic relevance determination and Bayesian masking are superior to Lasso in terms of the sparsity-shrinkage tradeoff40,41, although we did not use these techniques in the present study because of their slow convergence. Another promising approach to regularization is Group Lasso24,42,43, which provides a sparse solution at the level of grouped variables. Recently, the problem of insulating network activity has attracted interest, and conditions under which the activity of a sub-network is insulated from the overall network have been studied44. Because Group Lasso yields sparsity at the group level, it may be well suited to this problem.
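For reference, the standard Group Lasso penalty replaces the sum of absolute values in Eq. (3) with a sum of group-wise Euclidean norms over predefined groups g of association constants (group-size weights omitted for simplicity); this is the textbook form of the penalty, not a formulation used in this study:

$$Q^{\prime} ({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})})=Q({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})})-\lambda \sum _{g}\sqrt{\sum _{s\in g}{K}_{s}^{2}}$$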

Different system configurations often produce qualitatively similar behaviors45. For example, ~10 different types of synthetic circuits reportedly generate “oscillatory” dynamics46. Therefore, our finding that the network can be reconstructed based solely on time-course data might be surprising. Although it seems difficult to strictly define the conditions under which inference is most effective, our results suggest that time-course data, and possibly their associated noise47, contain rich information and can be sufficient to reconstruct the regulatory network. Methods for data-driven analysis will become increasingly important as experimental technologies, including super-multiplexed color live-cell imaging48, continue to progress rapidly. The present study provides a general framework at the intersection of nonlinear biochemical systems, model inference, and single-cell time-course data analysis, enabling such analyses in a data-driven manner.

Materials and Methods

Nonlinear state space model

We introduce a nonlinear state space model, which is given by

$$\begin{array}{rcl}{{\boldsymbol{x}}}_{t} & = & {\boldsymbol{f}}({{\boldsymbol{x}}}_{t-1})+{{\boldsymbol{v}}}_{t}\\ {{\boldsymbol{y}}}_{t} & = & {\boldsymbol{h}}({{\boldsymbol{x}}}_{t})+{{\boldsymbol{w}}}_{t}\end{array}$$
(4)

where x is a k-dimensional vector consisting of state variables, and y denotes an l-dimensional vector representing measurements of x. Functions f and h are nonlinear functions describing the evolution of the system and the measurement process, respectively. Vectors \({{\boldsymbol{v}}}_{t}=\{{v}_{t,i}\}\,(i=1,\ldots ,k)\) and \({{\boldsymbol{w}}}_{t}=\{{w}_{t,j}\}\,(j=1,\ldots ,l)\) denote system noise and measurement noise, respectively, where we assumed that they follow Gaussian distributions: \({v}_{t,i}\sim N(0,{({\sigma }_{i})}^{2})\), \({w}_{t,j}\sim N(0,{({\eta }_{j})}^{2})\). Initial values of the state are given by \({x}_{0,i}\sim N({\mu }_{i},{({\gamma }_{i})}^{2})\). We used standard ODEs to model the biochemical reaction network of interest as \(d{\boldsymbol{x}}/dt={\boldsymbol{g}}({\boldsymbol{x}},{{\boldsymbol{\theta }}}_{sys})\), where, in general, g is a nonlinear function consisting of arbitrary equations, such as the Hill equation, and \({{\boldsymbol{\theta }}}_{sys}\) indicates model parameters. Function f can be calculated by numerically integrating the equations as \({\boldsymbol{f}}({{\boldsymbol{x}}}_{t-1})={{\boldsymbol{x}}}_{t-1}+\,{\int }_{t-1}^{t}\,{\boldsymbol{g}}({{\boldsymbol{x}}}_{\tau },{{\boldsymbol{\theta }}}_{sys})d\tau \). Numerical integration of ODEs was performed using routines implemented in the scipy.integrate package (https://docs.scipy.org/doc/scipy/reference/integrate.html) as described previously49. In the present study, we used a linear function for h (h(x) = αx) for simplicity, where α = 1 unless otherwise explicitly indicated. Given dataset \(Y=\{{Y}^{(a)}\}=\{{Y}^{(1)},{Y}^{(2)},\ldots ,{Y}^{(A)}\}=\{{{\boldsymbol{y}}}_{1:T}^{(1)},{{\boldsymbol{y}}}_{1:T}^{(2)},\ldots {{\boldsymbol{y}}}_{1:T}^{(A)}\}\), where a is an index of each cell and \({Y}^{(a)}\) represents the single-cell time-course data, estimation of \({\boldsymbol{\theta }}=\{{{\boldsymbol{\theta }}}_{sys},{\boldsymbol{\sigma }},{\boldsymbol{\eta }},{\boldsymbol{\mu }},{\boldsymbol{\gamma }}\}\) \(({\boldsymbol{\sigma }}=\{{\sigma }_{i}\},{\boldsymbol{\eta }}=\{{\eta }_{j}\},{\boldsymbol{\mu }}=\{{\mu }_{i}^{(a)}\},{\boldsymbol{\gamma }}=\{{\gamma }_{i}\}\,(i=1,\ldots ,k,\,j=1,\ldots ,l,\,a=1,\ldots ,A))\) can be accomplished by maximizing the log-likelihood of the model. Note that only μ depends on the cell index a, to describe cell-to-cell variability of initial states.

EM-PS algorithm for parameter estimation

Maximum likelihood estimation of θ can be accomplished by maximizing log-likelihood \(\mathrm{ln}\,p(Y|{\boldsymbol{\theta }})=\,\mathrm{ln}\,\sum _{X}\,p(Y|X,{\boldsymbol{\theta }})p(X|{\boldsymbol{\theta }})\), which requires intractable integration with respect to state variables \(X=\{{X}^{(a)}\}=\{{X}^{(1)},{X}^{(2)},\ldots ,{X}^{(A)}\}=\{{{\boldsymbol{x}}}_{1:T}^{(1)},{{\boldsymbol{x}}}_{1:T}^{(2)},\ldots ,{{\boldsymbol{x}}}_{1:T}^{(A)}\}\). Therefore, we used an EM algorithm to find maximum likelihood estimates of θ. The EM algorithm was run by iterating steps E (expectation) and M (maximization), which are defined as

$$\begin{array}{rcl}Q({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})}) & = & {\langle \mathrm{ln}p(X,Y|{\boldsymbol{\theta }})\rangle }_{p(X|Y,{{\boldsymbol{\theta }}}^{({\rm{old}})})}\\ {{\boldsymbol{\theta }}}^{({\rm{new}})} & = & {\rm{\arg }}\,\mathop{{\rm{\max }}}\limits_{{\boldsymbol{\theta }}}Q\,({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{old}})})\end{array}$$
(5)

respectively. The E step is analytically intractable because it requires the probability distribution of the state time series at all time points; we therefore numerically approximated p(X|Y, θ) using a particle smoother, as previously reported31,32. Briefly, the particle smoother algorithm approximates the distribution as an ensemble of particles:

$$p({X}^{(a)}|{Y}^{(a)},{\boldsymbol{\theta }})=\sum _{p=1}^{P}\,{\beta }^{(a,p)}\delta ({X}^{(a)}-{X}^{(a,p)}),\sum _{p=1}^{P}{\beta }^{(a,p)}=1,{\beta }^{(a,p)}\ge 0$$
(6)

where P is the number of particles, \({X}^{(a,p)}\) indicates the trajectory of the pth particle sampled by the algorithm for data \({Y}^{(a)}\), \({\beta }^{(a,p)}\) represents the weight of the particle, and δ is Dirac's delta. The weight is given by \({\beta }^{(a,p)}={l}^{(a,p)}/\sum _{p}{l}^{(a,p)}\), where \({l}^{(a,p)}=p({Y}^{(a)}|{X}^{(a,p)})\) denotes the likelihood of the particle. The calculation was performed using the pyParticleEst package50. Finally, the log-likelihood estimate was obtained by averaging over the particles: \({\rm{l}}{\rm{n}}\,{L}^{(a)}({\boldsymbol{\theta }})={\rm{l}}{\rm{n}}\,(\frac{1}{P}\sum _{p}\,{l}^{(a,p)})\). Note that \(\mathrm{ln}\,p(X|Y,{\boldsymbol{\theta }})\) can be written as \(\mathrm{ln}\,p(X|Y,{\boldsymbol{\theta }})=\sum _{a}\,\mathrm{ln}\,p({X}^{(a)}|{Y}^{(a)},{\boldsymbol{\theta }})\), because different time-course data are independent. Thus, the E step can be completed using the following approximation:

$$\begin{array}{ccc}Q({\boldsymbol{\theta }},{{\boldsymbol{\theta }}}^{({\rm{o}}{\rm{l}}{\rm{d}})}) & = & \sum _{a=1}^{A}\,{\langle {\rm{l}}{\rm{n}}p({X}^{(a)},{Y}^{(a)}|{\boldsymbol{\theta }})\rangle }_{p({X}^{(a)}|{Y}^{(a)},{{\boldsymbol{\theta }}}^{({\rm{o}}{\rm{l}}{\rm{d}})})}\\ & = & \sum _{a=1}^{A}\sum _{p=1}^{P}{\beta }^{(a,p)}\,{\rm{l}}{\rm{n}}\,p({X}^{(a,p)},{Y}^{(a)}|{\boldsymbol{\theta }})\,\\ & = & \,\sum _{a=1}^{A}\sum _{p=1}^{P}\sum _{i=1}^{k}{\beta }^{(a,p)}(-\frac{1}{2}\,{\rm{l}}{\rm{n}}\,2\pi {({\gamma }_{i})}^{2}-\frac{{({x}_{0,i}^{(a,p)}-{\mu }_{i}^{(a)})}^{2}\,}{2{({\gamma }_{i})}^{2}})\\ & & +\,\sum _{a=1}^{A}\sum _{p=1}^{P}\sum _{t\in T}\sum _{i=1}^{k}{\beta }^{(a,p)}(-\frac{1}{2}\,{\rm{l}}{\rm{n}}\,2\pi {({\sigma }_{i})}^{2}-\frac{{({x}_{t,i}^{(a,p)}-{f}_{i}({{\boldsymbol{x}}}_{t-1}^{(a,p)},{{\boldsymbol{\theta }}}_{sys}))}^{2}}{2{({\sigma }_{i})}^{2}})\\ & & +\,\,\sum _{a=1}^{A}\sum _{p=1}^{P}\sum _{t\in T}\sum _{j=1}^{l}{\beta }^{(a,p)}(-\frac{1}{2}\,{\rm{l}}{\rm{n}}\,2\pi {({\eta }_{j})}^{2}-\frac{{({y}_{t,j}^{(a)}-{h}_{j}({{\boldsymbol{x}}}_{t}^{(a,p)}))}^{2}}{2{({\eta }_{j})}^{2}}).\end{array}$$
(7)
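In code, this E step approximation amounts to normalizing the per-particle likelihoods into the weights β^(a,p), averaging them for the per-cell log-likelihood estimate, and forming a weighted sum of the complete-data log-density over particles and cells. The sketch below assumes a user-supplied complete-data log-density and illustrative names; in practice one would work with log-likelihoods and a log-sum-exp for numerical stability.

```python
import numpy as np

def normalized_weights(particle_liks):
    """beta^(a,p) = l^(a,p) / sum_p l^(a,p), from the per-particle likelihoods of one cell."""
    particle_liks = np.asarray(particle_liks, dtype=float)
    return particle_liks / particle_liks.sum()

def cell_log_likelihood(particle_liks):
    """ln L^(a)(theta) = ln( (1/P) * sum_p l^(a,p) )."""
    return np.log(np.mean(np.asarray(particle_liks, dtype=float)))

def approx_Q(theta, smoothed, complete_data_logpdf):
    """Monte Carlo approximation of Q (Eq. 7): a weighted sum of the complete-data
    log-density over smoothed particle trajectories, summed over cells.
    `smoothed` is a list of (trajectories, weights, data) triples, one entry per cell."""
    total = 0.0
    for trajectories, weights, Y_a in smoothed:
        for X_p, beta in zip(trajectories, weights):
            total += beta * complete_data_logpdf(X_p, Y_a, theta)
    return total
```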

For the M step, we numerically maximized the Q function with respect to \({{\boldsymbol{\theta }}}_{sys}\) using a quasi-Newton method, because, in general, \(dQ/d{{\boldsymbol{\theta }}}_{sys}=0\) cannot be solved analytically. This optimization was performed using the L-BFGS-B routine implemented in the scipy.optimize package (http://docs.scipy.org/doc/scipy/reference/optimize.html), with a non-negativity constraint on the parameter values. The derivatives of Q with respect to σ, η, μ, and γ yield linear equations, so updated values for these parameters were obtained directly. Note that we defined minimum values for γ, because if γ is too small, sample impoverishment can occur51. We also defined maximum values for σ and η to avoid overestimation of the noise, which could lead to meaningless inference.
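Putting the two steps together, the overall iteration has the following shape. Both steps are passed in as callables (in this study, the E step relies on pyParticleEst and the M step on the constrained L-BFGS-B optimization described above); the skeleton below only illustrates the control flow and is not the actual implementation.

```python
def em_ps(Y, theta_init, e_step, m_step, n_iter=100):
    """Skeleton of the EM-PS (or EM-PS-Lasso) iteration.
    e_step(Y, theta) should return smoothed particle trajectories, their weights,
    and the current log-likelihood estimate; m_step maximizes the (penalized) Q."""
    theta = theta_init
    loglik_trace = []
    for _ in range(n_iter):
        smoothed, weights, loglik = e_step(Y, theta)   # particle-smoother E step
        theta = m_step(smoothed, weights, Y, theta)    # maximization of (penalized) Q
        loglik_trace.append(loglik)
    return theta, loglik_trace
```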

Artificial data generation

Artificial data were generated by numerically solving the model as nonlinear stochastic Langevin equations, \(d{\boldsymbol{x}}/dt={\boldsymbol{g}}({\boldsymbol{x}})+{\boldsymbol{\xi }}(t)\), where ξ(t) is Gaussian noise with \(\langle {\xi }_{i}(t)\rangle =0\) and \(\langle {\xi }_{i}(t){\xi }_{j}(t^{\prime} )\rangle =2D{\delta }_{i,j}\delta (t-t^{\prime} )\), with Kronecker's \({\delta }_{i,j}\) and Dirac's δ(t), where parameter D characterizes the amplitude of the noise. Computation was conducted using a stochastic Runge-Kutta algorithm52. The measurement process was simulated by adding Gaussian noise to each variable: \({y}_{i}={x}_{i}+\eta \varphi \), where ϕ is a random number sampled from a standard normal distribution, and η characterizes the amplitude of the noise. Stochastic simulation was performed over the simulation period T = 400, and data points from T = 351 to T = 400 were collected at a time resolution of 1. The simulation was repeated 10 times to generate 10 independent time-course data that served as training data. Similarly, an additional 10 independent time-course data were generated and used as test data for validation. Details of the model equations and parameter values are described in the Supplementary Information.
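To illustrate this data-generation scheme, the sketch below uses a simple Euler-Maruyama update in place of the stochastic Runge-Kutta scheme cited above, together with illustrative noise amplitudes; it reproduces the structure of the procedure (Langevin dynamics followed by Gaussian measurement noise at the sampled time points) rather than the exact simulation used in this study.

```python
import numpy as np

def simulate_langevin(g, x0, D, eta, t_end=400.0, dt=0.01, sample_times=range(351, 401), rng=None):
    """Generate one artificial single-cell time course: integrate dx/dt = g(x) + xi(t),
    with <xi_i(t) xi_j(t')> = 2 D delta_ij delta(t - t'), by Euler-Maruyama, then add
    Gaussian measurement noise of amplitude eta at the sampled time points."""
    rng = rng if rng is not None else np.random.default_rng()
    x = np.array(x0, dtype=float)
    samples = []
    targets = iter(sample_times)
    next_t = next(targets, None)
    t = 0.0
    for _ in range(int(round(t_end / dt))):
        x = x + g(x) * dt + np.sqrt(2.0 * D * dt) * rng.standard_normal(x.size)
        x = np.maximum(x, 0.0)   # optional guard (not specified in the text) to keep concentrations non-negative
        t += dt
        if next_t is not None and t >= next_t:
            samples.append(x + eta * rng.standard_normal(x.size))  # y_i = x_i + eta * phi
            next_t = next(targets, None)
    return np.array(samples)
```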