Not just a colourful metaphor: modelling the landscape of cellular development using Hopfield networks

Fard, Atefeh Taherian; Srihari, Sriganesh; Mar, Jessica C; Ragan, Mark A

doi:10.1038/npjsba.2016.1

Download PDF

Article
Open access
Published: 18 February 2016

Not just a colourful metaphor: modelling the landscape of cellular development using Hopfield networks

Atefeh Taherian Fard¹,
Sriganesh Srihari ORCID: orcid.org/0000-0002-0713-6723¹,
Jessica C Mar^2,3 &
…
Mark A Ragan¹

npj Systems Biology and Applications volume 2, Article number: 16001 (2016) Cite this article

5201 Accesses
17 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The epigenetic landscape was introduced by Conrad Waddington as a metaphor of cellular development. Like a ball rolling down a hillside is channelled through a succession of valleys until it reaches the bottom, cells follow specific trajectories from a pluripotent state to a committed state. Transcription factors (TFs) interacting as a network (the gene regulatory network (GRN)) orchestrate this developmental process within each cell. Here, we quantitatively model the epigenetic landscape using a kind of artificial neural network called the Hopfield network (HN). An HN is composed of nodes (genes/TFs) and weighted undirected edges, resulting in a weight matrix (W) that stores interactions among the nodes over the entire network. We used gene co-expression to compute the edge weights. Through W, we then associate an energy score (E) to each input pattern (pattern of co-expression for a specific developmental stage) such that each pattern has a specific E. We propose that, based on the co-expression values stored in W, HN associates lower E values to stable phenotypic states and higher E to transient states. We validate our model using time course gene-expression data sets representing stages of development across 12 biological processes including differentiation of human embryonic stem cells into specialized cells, differentiation of THP1 monocytes to macrophages during immune response and trans-differentiation of epithelial to mesenchymal cells in cancer. We observe that transient states have higher energy than the stable phenotypic states, yielding an arc-shaped trajectory. This relationship was confirmed by perturbation analysis. HNs offer an attractive framework for quantitative modelling of cell differentiation (as a landscape) from empirical data. Using HNs, we identify genes and TFs that drive cell-fate transitions, and gain insight into the global dynamics of GRNs.

MIRA: joint regulatory modeling of multimodal expression and chromatin accessibility in single cells

Article 06 September 2022

NETISCE: a network-based tool for cell fate reprogramming

Article Open access 20 June 2022

A neural network-based model framework for cell-fate decisions and development

Article Open access 14 March 2024

Introduction

In the course of development, cells take on a succession of distinct phenotypic states, from an initial totipotent or pluripotent state through to a final differentiated state in which the cell is committed to a particular location and function. This commitment is typically progressive, with cells passing through a hierarchy of increasingly specialized intermediate states along a developmental trajectory. The transition from one intermediate state to the next is driven by the concerted action of transcription factors (TFs) and other biomolecules as part of the gene regulatory network (GRN).

Conrad Waddington introduced the epigenetic landscape as a metaphor for cellular development.¹ Like a population of balls rolling down a rough hillside, cells follow specific trajectories (valleys) and encounter decision points (inflections) before eventually coming to rest in one or another potentially steady state, termed attractors.^2,3 Importantly, Waddington depicted the topography of the landscape as determined by a system of underpinning interconnected cables. Although this metaphor preceded our current understanding of the relationship among genes, transcripts and proteins, it is easily interpretable today as depicting the framework for control of cellular differentiation and phenotype by dynamics of the underlying GRN.

To model this landscape, Huang⁴ proposed a “quasi-potential” that connects the elevation of the landscape to the likelihood of the corresponding cell state. For Huang⁴, each point in the landscape represents a gene-expression configuration of a binary regulatory circuit. An alternative formulation⁵ emphasises the possibility of cell trajectories without necessarily “rolling downhill”, e.g., the landscape is modelled as a non-hierarchical “epigenetic disc” in which cell fates can be interconverted without necessarily traversing back up through a developmental hierarchy. Other formulations emphasise the underlying molecular mechanisms including the role of DNA methylation, histone modification and signalling pathways in driving cell-fate decisions.^6–8

Modern experimental evidence does in fact show hierarchies of cell fates⁹ with TFs driving cell development. For instance, Takahashi and Yamanaka¹⁰ demonstrated that small sets of TFs (the “Yamanaka cocktail”) are sufficient to induce pluripotency in a differentiated somatic cell, i.e., the cell is reprogrammed back to its original state at the top of Waddington’s landscape. Experimental protocols exist for generating specialized cells including neurons,¹¹ hepatocytes,¹² macrophages¹³ and cardiomyocytes from undifferentiated fibroblasts¹⁴ using small sets of TFs, demonstrating cell-fate conversion and reprogramming.

Nonetheless it remains controversial whether a Waddington (or similar) landscape can be computed solely from empirical data. The view of Waddington’s landscape as a “colourful metaphor”¹⁵ that cannot be quantified has been echoed by several authors.^15–17 The recent availability of single-cell transcriptomic (and other omic) time course data provides new opportunity to explore landscape models that could provide insight into the GRNs that underpin cellular development. However, there are issues in quantifying this landscape, including design of the algorithmic framework per se, the approach to data utilisation and mapping of actual developmental stages to features (e.g., attractors) of landscape models.

Dynamic systems of biomolecular interactions have been modelled using Bayesian networks,^18–20 generalised logical networks,²¹ constraint-based models,²² Petri nets,²³ stochastic master equations²⁴ or agent-based approaches.²⁵ Although these approaches can identify stable states, in general they do not generate a solution landscape of the sort introduced above.

In the context of GRNs and cell fate, attractor landscapes have been described using Boolean networks,^26–28 neural networks^29–31 or systems of ordinary differential equations (ODEs);³² however, these have seen application mostly in computational simulation rather than in the analysis of empirical data. For instance, Wang et al.³³ used ODEs and simulated data to construct a probabilistic “pseudo-potential” energy landscape using only two TFs, GATA1 and PU.1, to identify the developmental path of cells from undifferentiated to differentiated states. This system permits a binary cell-fate decision such as macrophage/monocyte or megakaryocyte/erythrocyte. Similar probabilistic potential landscapes have been used to model lysis-lysogen switching in bacteriophage λ³⁴ and the mitogen-activated protein kinase (MAPK) signal transduction network.³⁵ Srihari et al.³⁶ used binarized gene-expression data in an optimisation framework to derive attractors of cell states and identify TFs switched between these attractors. Ferrell³⁷ proposed an alternative landscape for the developmental processes of cell-fate induction and inhibition, in which cell-fate commitment corresponds to the disappearance of a valley rather than the creation of a new valley. Davila-Velderrain et al.³⁸ provide a comprehensive overview. Problems associated with these models include (lack of) computational scalability and, for ODEs, the need for rate constants.

Mathematical models and GRNs have their own descriptions of states, trajectories and attractors. Questions pertain to whether and how phenotypic states of a cell, which are maintained and regulated by a GRN, can be mapped to an attractor landscape model computed from gene-expression and/or other empirical data. Here we use Hopfield networks (HNs) to model an attractor landscape that can serve as a framework to understand the dynamics of the underpinning GRN.

HNs, introduced by John Hopfield in 1982, are auto-associative (recurrent) artificial neural networks. Input patterns to the HN are associated with distinct attractors of the network, and can later be recalled even from partial or noisy inputs. Koulakov and Lazebnik²⁹ used an HN to simulate fusions between different types of cells corresponding to distinct attractors of the HN, concluding that fusion helps the cells reach an attractor state that would otherwise be inaccessible. Lang et al.³¹ used a similar approach to model and explain partially reprogrammed cell fates, and identified driver TFs involved in this process. They employed binarized gene-expression values, based on a conditional probability distribution derived from global histone-modification data, which reflect the epigenetic state of TFs and developmental signals in the landscape. Maetschke and Ragan,³⁰ on the other hand, constructed an HN from static gene-expression data for different subtypes of cancer, and showed that cancer subtypes can be characterized as distinct attractors of an HN.

Here we construct HNs from large-scale gene-expression time course data, and map developmental trajectories to Hopfield energy profiles in this landscape. We find that after cells are induced to differentiate, parts of their GRNs become less tightly correlated, as expected for a cell in transition from one cell-fate to another. The topography of these landscapes reflects the correlated activity of key genes that have been experimentally shown to drive cell-fate transitions and decision-making.

Results

Here we describe the HN energy landscape generated from the first case study.

Differentiation of HES2 stem cells

This data set consisted of 12 samples in 4 groups, assayed with 48,687 expression probes of which 3,753 remained after feature selection. Figure 1 shows the energy landscape of fractionated HES2 stem cells. For cells of group P7, which are dormant but inducible to a pluripotent state, we observe a relatively low-energy score E_P7=−1320897, i.e., a low elevation on the Hopfield energy landscape. On induction, these cells enter intermediate state P6 followed by P5, where we see an increase in energy to E_P6=−755220 and E_P5=−599724 before the cells reach the differentiated state P4 with the lowest energy E_P4=−3307223 (Figure 2a). This progression traces a trajectory on the Hopfield energy landscape. Two views of this trajectory are shown in Figure 1: the left plot shows the three-dimensional view of the Hopfield energy landscape, while the right plot shows the (inferred) trajectory driven by changes in expression of the genes that contribute to principal component (PC) 1 and 2.

To ensure that the Hopfield energies indeed correspond to stages of differentiation as captured by the expression profiles (i.e., that low Hopfield energies represent stable phenotypic states and high energies transition states) and are not merely an artefact, we progressively perturbed the gene-expression values for each stage, and computed the energies of the resulting HNs (Figure 3a). If random perturbation does not alter the Hopfield energy significantly, that cellular phenotype is stable and the underlying network maps to an attractor in the Hopfield landscape. On the other hand, if the perturbed networks tend to have significantly different energies, or conversely if the energy of the network before perturbation is similar to that of the randomly perturbed states, then the cells are in a transient state that can be represented as a hill in the Hopfield landscape.

We observed that when 5% of the gene-expression values are perturbed, energies for the first (P7) and last (P4) groups show greater difference vis-a-vis the random network (Figure 3a), and maintain lower energy values. By contrast, energies of the transient groups P5 and P6 are significantly (each P value 0.0) closer to that of the random network at all investigated levels of perturbation up to 50% (Figure 3a). This demonstrates that attractors are robust to perturbation, and confirms that Hopfield energy profiles can describe profiles of cellular differentiation.

We identified TFs and genes that are potentially major drivers of cell-state transition by comparing their discretized expression values between successive stages, and checking to see if they have switched (from −1 to +1 or vice versa). The feature-selected genes that switch activity between groups at each transition are listed in Supplementary Table S2. Different numbers of genes are switched at each transition, reflecting the dynamic behaviour of the GRN during development.³⁹ Table 1 lists the Gene Ontology (GO) Biological Process terms⁴⁰ and KEGG pathways⁴¹ most over-represented among the top 100 genes switched between the first and last groups. Among the enriched GO terms are cell-fate determination and cellular differentiation.^42,43 The KEGG pathways⁴¹ most-enriched were Hedgehog signalling, which governs a wide range of processes during embryonic development and controls stem-cell proliferation⁴⁴ and Wnt signalling, a major player in stem-cell self-renewal and differentiation.⁴⁵ Among the switched genes we find FOXA1, GATA4, CD9 and OCT4 TFs involved in regulation of stem-cell differentiation (DAVID^46,47). Specifically, the differentiation markers FOXA1 and GATA4 are upregulated in P4, whereas the pluripotency markers CD9 and OCT4 are upregulated in P7 (Supplementary Figure S1).

Table 1 Functional analysis of the switched genes across stages of differentiation/development

Full size table

For the remaining case studies, we inferred different genes to have switched expression, and different pathways and biological processes to be enriched, but we always observed the same pattern of energy changes across time points (Supplementary Tables S2 and S3). Results of the three other main case studies are presented in Figures 2,3,4 and Supplementary Figure S1, and Table 1 and Supplementary Table S2; for all others please see the Supplementary Material (Supplementary Tables S1, S3 and S4; Supplementary Figures S2 and S3).

Discussion and Conclusion

We computed developmental landscapes from single-cell gene-expression data in 12 sets of time course data sets, using the mathematical formalism of the Hopfield recurrent neural network. Each point in each landscape represents the state of a cell; its elevation is determined by the energy E, which we compute from the pattern of co-variation of gene expression in that cell. For each landscape model, groups of cells at the same developmental stage, and thus sharing a common pattern of gene expression, are positioned close together. As we construct the landscapes based on the set of genes showing the greatest variation in expression across the time points in each data set, the models reflect the molecular biological processes underlying cell development.

In our HN models, the energy of a state space does not reflect its likelihood; instead, E measures the extent of co-variation over all pairs of feature-selected genes, capturing a distinct pattern of gene expression which represents a cellular phenotype.⁴⁸ Strong co-variation yields a relatively low energy, whereas looser co-variation gives a high energy. Along a time course as the GRN is differentially regulated or rewired, changes in the patterns of co-variation are reflected as a trajectory on our landscape. For these 12 data sets, identities of the switched genes provide insights into functional modules, many of which (e.g., Hedgehog, Wnt and MAPK signalling pathways) have previously been validated experimentally.

We selected a form of perturbation analysis to build confidence in our energy values. The perturbation results make it clear that the energy values we compute for the initial and final states are far from those of random networks; this allows us to make comparisons among states. The set of energy values arising from perturbation analysis should not be taken as an estimate of the stability of an attractor, or the time or energy a cell needs to move out of that attractor state; for such concepts, new methods remain to be developed.

Different algorithmic frameworks are available to describe systems of biomolecular interactions. HNs are computationally simple, and our results demonstrate that they can capture trajectories of cellular development using only gene-expression data.

Following Maetschke and Ragan³⁰ and Lang et al.³¹ we employed gene-expression data, typically at genome-wide scale, as input. Like these authors, we sought to reduce noise (and thereby avoid spurious attractors) and improve computability by first carrying out feature selection. Thus we based each analysis on the set of genes (probes) most-informative in its respective context. Lang et al.³¹ further used histone-modification data to establish a threshold for the binarization of gene-expression values. In principle, any type of data that provides patterns characteristic of cells at different developmental stages or time points could be input directly into the HN, including transcriptomic data for gene expression. Indeed it would be possible to use HNs to build a landscape model over mixed data types, yielding a unique approach to integrating data in developmental systems biology.

Models are abstractions of reality, and a mapping is required to link features of a model to events in a real process. Here, particular care is required to describe this mapping, as both the real physical GRN in cells, and the HN framework of our model, employ concepts of states, trajectories and attractors.³⁰ At a given time, a cell is described by a state that reflects the overall pattern of interactions within its GRN. In the Hopfield framework, state refers to the value of H^(t)(s), where t is the update cycle of the HN. Because our aim here is to position each cell on the landscape, we did not allow the HN to converge to its lowest energy, but instead use the observed expression pattern to compute E.

These data sets are not dynamic in the sense of measuring gene expression in a fixed set of cells through a succession of time points. Rather, we position snapshots of the system (Hopfield energy profiles) in a common landscape model, then draw on external information (e.g., sampling order from a population or expression of surface markers) to trace a temporal progression (trajectory) across the landscape.

In the HN formalism, attractors are local minima of the energy landscape, and in the present context correspond to phenotypic states maintained by the underlying GRN. Here, cells start out in a (perhaps artificially) stable state for which we calculate a low energy. After induction they progress through transient states described by higher energy values, and eventually reach another low-energy phenotype represented by an attractor. As in Waddington’s metaphor, in each data set we observed sets of cells positioned along a developmental trajectory, with each point on the landscape representing a state of the GRN at a specific time. While Waddington implied that the topography of his hillside might be dynamic (by depicting interconnected cables beneath it), here we employ a static weight matrix, so our cells (networks) map onto a static landscape. Moreover, unlike in the Waddington metaphor, the trajectories we infer do not run downhill into a globally low-energy attractor at one edge of a state-space dimension; rather, we find low-energy states at both ends of every trajectory.

In principle, the HN model can be applied to contexts other than normal cellular development. For example, following the study by Huang,⁴⁹ who considered cancer as a pre-existing (but usually unvisited) attractor in his quasi-potential landscape, we might construct an HN from cancer-progression data and track trajectories of cells as they progress from a normal to a cancerous state. We could extend the model by employing targeted perturbation to measure the contribution of subsets of genes or TFs to the GRN, and thus to trajectories of disease progression.

As envisioned by Waddington, Kauffman and others, it is thus possible to compute a robust quantitative landscape model, based on empirical data, which reflects the collective behaviour of genes and TFs in driving cellular differentiation. By providing the framework for such models, HNs show that developmental landscapes need not be just a colourful metaphor.

Materials and methods

HN model: general background

An HN consists of nodes and weighted undirected interactions (represented as an interaction matrix) between these nodes. So-called patterns, for instance gene-expression profiles, can be stored as the weights of the network. Stored patterns (even when distorted) can then be retrieved from the network by a recurrent recall procedure.⁵⁰ The similarity of an arbitrary input pattern to a stored pattern can be expressed by an energy function. Local minima of the energy function are the attractors of the system, to which input patterns converge during recall. A network can have multiple attractors with different minimum values. The HN associates similar stored patterns with the same attractor, whereas distinct stored patterns tend to be associated with different attractors.

Let S be a set of m samples (cells) under study, and let G={g₁, g₂,…,g_n} be the set of genes profiled from these samples. For any sample s∈S, each gene is assigned to a node, thereby giving n nodes H(s)={H₁, H₂,_…,H_n}(s) in the HN. Each node H_i carries the expression value for g_i normalised and discretized to the values {−1, 0, 1}. For any pair of nodes (H_i, H_j), H_i≠H_j, we assign the interaction weight w(H_i, H_j) ∈ [−1,1] as the co-expression (computed here as Pearson's correlation) between {g_i, g_j} across the m samples and w(H_i, H_i)=0, resulting in a zero-diagonal symmetric weight matrix W. For each node H_i, N(H_i) is its set of neighbours (connected nodes).

Each sample s∈S, consisting of gene-expression values for the genes in G, can be stored by iterating through the HN. At each iteration, the node H_i is updated to $H_{i} = \sum_{j \in N (H_{i})} [H_{j} w (H_{i}, H_{j})],$ the weighted sum across all connected nodes of H_i. If w(H_i, H_j)>0 then H_i is updated to a value of the same sign (positive or negative) as H_j, whereas if w(H_i, H_j)<0 then H_i is updated to a value of the opposite sign as H_j. Consequently, H_i is either “pulled towards” or “pushed away” from H_j depending on w(H_i, H_j). After each update, we can capture the extent of agreement or disagreement between H_i and H_j as an energy function E(H_i,H_j)=−H_iW_ijH_j, with the convention that lower values represent higher agreement or stability. Thus the energy for the entire network is given by $E [H (s)] = - 1 / 2 H W H^{T}$

Such an energy function belongs to the Lyapunov family of monotonically non-increasing functions,⁵¹ in which as the iterations progress and the values assigned to the nodes are repeatedly updated, E[H(s)] converges to a low-energy state. This converged energy value represents the attractor for the sample s, and s is said to have converged to its attractor. This energy value scales linearly and is unitless. Given this framework, if we have a collection of samples $S = {S_{1}, S_{2}, \dots, S_{k}}$ where each S_i represents a set of samples from a specific stage (or time point) of cellular differentiation, we expect all samples in S_i to converge to the same attractor in the HN.

Hopfield model for cellular differentiation

Here, we aim to link the values of the HN energy function to stages of cellular differentiation. We hypothesise that as a cell transits from an initial to a differentiated state, the interplay (co-variation) between genes changes (in its GRN), and the cell as a network of genes moves along the Hopfield energy profile. By computing the energy values based on gene-expression profiles of samples at different stages of cellular development, we capture this transition. In particular, we demonstrate that if pair-wise co-expression coefficients are used to construct the weight matrix, these energy values indeed reflect the developmental stages of the cell. Tight correlation corresponds to lower energy values, and looser correlation to less-favourable energies.

Using sets of cells from different stages of differentiation (e.g., pluripotent to differentiated states), here we demonstrate that the computed energy level reflects the stage of differentiation. Our approach differs in two main respects from that presented in the study by Maetschke and Ragan,³⁰ which was based on the standard Hopfield model: we build the weight matrix using Pearson's correlation rather than Hebbian learning, and omit the iteration step so as to compute energies that represent the actual biological states of the network rather than iterated values.

By visualising the transitions between these energy levels, we realise the landscape of cellular differentiation. The landscape is an n+1 dimensional space with n dimensions for the genes, and one dimension for the energy E(H). To visualise this landscape in three dimensions, we render the surface of the energy landscape by interpolating energy values over a two-dimensional surface: first the dimensionality of the gene-expression data is reduced to 2 using principal component analysis (PCA), a regular grid is constructed with the same dimensions as the reduced data, inverse PCA is performed to map the grid points to the high-dimensional space, then E is computed for the grid data.³⁰ In this landscape, the two major principal components of the n genes serve as our dimensions x and y, and the energy is represented as the third dimension z. Stages S_i are points in this space, represented here by unique colours (Figure 5).

Case studies and data sets

We constructed HNs for 12 time course data sets covering a broad range of case studies. Four of these (Table 2) are described here; for the others see Supplementary Material. The first case study⁵² analyses HES2 human embryonic stem (ES) cells. In this study, cells were fractionated by flow cytometry into different categories of pluripotency based on the expression of two stem-cell surface markers, GCTM-2 and CD9, yielding the four groups P7 (GCMT2^HIGH-CD9^HIGH), P6 (GCMT2^MID-CD9^MID), P5 (GCMT2^LOW-CD9^LOW) and P4 (GCMT2⁻-CD9⁻). Cells in group P7 are in a dormant state inducible to pluripotency, whereas P4 cells are committed to their lineage, and P6 and P5 cells are in intermediate states of differentiation. The data set includes expression profiles from 12 samples corresponding to 3 replicates from each category.

Table 2 Data sets for the four main case studies

Full size table

The three other case studies cover maturation of embryonic neural cells during mouse brain development, time course differentiation of THP1 monocyte cells to macrophages and trans-differentiation of epithelial to mesenchymal cells in cancer (Table 2). Please refer to the Supplementary Document for detailed description of these data sets. Eight additional case studies cover time course differentiation of mouse and human ES cells, induced pluripotent stem cells and organogenesis (Supplementary Table S1). For each data set, we performed z-score normalisation, followed by feature selection to extract the probes with the highest variation across groups and time points. To determine number of features we used the elbow of the variance plot over features, choosing the number such that including one more probe does not change the variance.

Robustness analysis

For each data set, we assessed robustness of the energy values by randomly changing the expression values of a randomly selected subset (5, 10, 20, 50 or 90%) of the feature-selected genes. A new expression value was randomly chosen from within the interval {−1, +1}. We then constructed the HN network as above, and computed E for the network for each round of perturbation. We also compared the resulting energy value with that of a random network of the same size (ΔE=E_random−E_perturbed). This process was repeated 100 times for each data set and each proportion of perturbed genes. Please refer to the Supplementary Document for details.

References

Waddington, C. H. Organisers and Genes (Cambridge Univ. Press: Cambridge, UK, 1940).
Google Scholar
Waddington, C. H. The Strategy of the Genes (Allens & Unwin: London, UK, 1957).
Google Scholar
Franceschelli, S. in Morphogenesis, Structural stability and epigenetic landscape (eds Lesne A. & Bourgine P.) (Springer, 2011).
Huang, S. Reprogramming cell fates: reconciling rarity with robustness. Bioessays 31, 546–560 (2009).
Article CAS Google Scholar
Ladewig, J., Koch, P. & Brustle, O. Leveling Waddington: the emergence of direct programming and the loss of cell fate hierarchies. Nat. Rev. Mol. Cell Biol. 14, 225–236 (2013).
Article CAS Google Scholar
Goldberg, A. D., Allis, C. D. & Bernstein, E. Epigenetics: a landscape takes shape. Cell 128, 636–638 (2007).
Article Google Scholar
Alvarez-Errico, D., Vento-Tormo, R., Sieweke, M. & Ballestar, E. Epigenetic control of myeloid cell differentiation, identity and function. Nat. Rev. Immunol. 15, 7–17 (2015).
Article CAS Google Scholar
Reddington, J. P., Sproul, D. & Meehan, R. R. DNA methylation reprogramming in cancer: does it act by re-configuring the binding landscape of Polycomb repressive complexes? Bioessays 36, 134–140 (2014).
Article CAS Google Scholar
Takahashi, K. Cellular reprogramming—lowering gravity on Waddington's epigenetic landscape. J. Cell Sci. 125, 2553–2560 (2012).
Article CAS Google Scholar
Takahashi, K. & Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126, 663–676 (2006).
Article CAS Google Scholar
Vierbuchen, T. et al. Direct conversion of fibroblasts to functional neurons by defined factors. Nature 463, 1035–1041 (2010).
Article CAS Google Scholar
Huang, P. et al. Induction of functional hepatocyte-like cells from mouse fibroblasts by defined factors. Nature 475, 386–389 (2011).
Article CAS Google Scholar
Feng R. et al. PU.1 and C/EBPα/β convert fibroblasts into macrophage-like cells. Proc. Natl Acad. Sci. USA 105, 6057–6062 (2008).
Article CAS Google Scholar
Ieda, M. et al. Direct reprogramming of fibroblasts into functional cardiomyocytes by defined factors. Cell 142, 375–386 (2010).
Article CAS Google Scholar
Thom, R. in Theorical Biology: Epigenetic and Evolutionary Order from Complex Systems, Vol. 6 (eds by Goodwin B. & Saunders P.) (Edinburgh Univ. Press Edinburgh, 1989).
Gilbert, S. Epigenetic landscaping: Waddington's use of cell fate bifurcation diagrams. Biol. Philos. 6, 135–154 (1991).
Article Google Scholar
Slack, J. M. W. Conrad Hal Waddington: the last Renaissance biologist? Nat. Rev. Genet. 3, 889–895 (2002).
Article CAS Google Scholar
Friedman, N., Linial, M., Nachman, I. & Pe'er, D. Using Bayesian networks to analyze expression data. J. Comp. Biol. 7, 601–620 (2000).
Article CAS Google Scholar
de Jong, H. Modeling and simulation of genetic regulatory systems: a literature review. J. Comp. Biol. 9, 36 (2002).
Article Google Scholar
Pearl, J. Probabilistic Reasoning in Intelligent Systems (Morgan Kaufmann: San Francisco, CA, USA, 1988).
Google Scholar
Thomas, R. Regulatory networks seen as asynchronous automata: a logical description. J. Theor. Biol. 153, 1–23 (1991).
Article Google Scholar
Becker, S. A. et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox. Nat. Protoc. 2, 727–738 (2007).
Article CAS Google Scholar
Chaouiya, C. Petri net modelling of biological networks. Brief Bioinform. 8, 210–219 (2007).
Article CAS Google Scholar
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 (1977).
Article CAS Google Scholar
An, G., Mi, Q., Dutta-Moscato, J. & Vodovotz, Y. Agent-based models in translational systems biology. Wiley Interdiscip. Rev. Syst. Biol. Med. 1, 159–171 (2009).
Article CAS Google Scholar
Qiu, Y., Tamura, T., Ching, W.-K. & Akutsu, T. On control of singleton attractors in multiple Boolean networks: integer programming-based method. BMC Syst. Biol. 8, S7 (2014).
Article Google Scholar
Melkman, A. A. & Akutsu, T. An improved satisfiability algorithm for nested canalyzing functions and its application to determining a singleton attractor of a Boolean network. J. Comp. Biol. 20, 958–969 (2013).
Article CAS Google Scholar
Peican, Z. & Jie, H. Asynchronous stochastic Boolean networks as gene network models. J. Comp. Biol. 21, 771–783 (2014).
Article Google Scholar
Koulakov, A. A. & Lazebnik, Y. The problem of colliding networks and its relation to cancer. Biophys. J. 103, 2011–2020 (2012).
Article CAS Google Scholar
Maetschke, S. R. & Ragan, M. A. Characterizing cancer subtypes as attractors of Hopfield networks. Bioinformatics 30, 1273–1279 (2014).
Article CAS Google Scholar
Lang, A., Li, H., Collins, J. & Mehta, P. epigenetic landscapes explain partially reprogrammed cells and identify key reprogramming genes. PLoS Comp. Biol. 10, e1003734 (2014).
Article Google Scholar
Bhattacharya, S., Zhang, Q. & Andersen, M. E. A deterministic map of Waddington's epigenetic landscape for cell fate specification. BMC Syst. Biol. 5, 85 (2011).
Article Google Scholar
Wang, J., Zhang, K., Xu, L. & Wang, E. Quantifying the Waddington landscape and biological paths for development and differentiation. Proc. Natl Acad. Sci. USA 108, 8257–8262 (2011).
Article CAS Google Scholar
Zhu, X. M., Yin, L., Hood, L. & Ao, P. Robustness, stability and efficiency of phage λ genetic switch: dynamical structure analysis. J. Bioinform. Comput. Biol. 2, 785–817 (2004).
Article CAS Google Scholar
Lapidus, S., Han, B. & Wang, J. Intrinsic noise, dissipation cost, and robustness of cellular networks: the underlying energy landscape of MAPK signal transduction. Proc. Natl Acad. Sci. USA 105, 6039–6044 (2008).
Article CAS Google Scholar
Srihari, S., Raman, V., Leong, H. W. & Ragan, M. A. evolution and controllability of cancer networks: a boolean perspective. IEEE/ACM Trans. Comput. Biol. Bioinf. 11, 83–94 (2013).
Article Google Scholar
Ferrell, J. E. Jr. Bistability, bifurcations, and Waddington's epigenetic landscape. Curr. Biol. 22, R458–R466 (2012).
Article CAS Google Scholar
Davila-Velderrain, J., Martínez-García, J. C. & Alvarez-Buylla, E. R. Modeling the epigenetic attractors landscape: towards a post-genomic mechanistic understanding of development. Front. Genet. 6, 160 (2015).
Article Google Scholar
Srihari, S. & Ragan, M. A. Systematic tracking of dysregulated modules identifies novel genes in cancer. Bioinformatics 29, 1553–1561 (2013).
Article CAS Google Scholar
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Article CAS Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS Google Scholar
Hough, S., Laslett, A., Grimmond, S., Kolle, G. & Pera, M. A continuum of cell states spans pluripotency and lineage commitment in human embryonic stem cells. PLoS ONE 4, e7708 (2009).
Article Google Scholar
Hartl, D. et al. Transcriptome and proteome analysis of early embryonic mouse brain development. Proteomics 8, 1257–1265 (2008).
Article CAS Google Scholar
Jiang, J. & Hui, C.-C. Hedgehog signaling in development and cancer. Dev. Cell 15, 801–812 (2008).
Article CAS Google Scholar
Nusse, R. Wnt signaling and stem cell control. Cell Res. 18, 523–527 (2008).
Article CAS Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2008).
Article Google Scholar
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1–13 (2009).
Article Google Scholar
Mar, J. C. & Quackenbush, J. Decomposition of gene expression state space trajectories. PLoS Comput. Biol. 5, e1000626 (2009).
Article Google Scholar
Huang, S. On the intrinsic inevitability of cancer: from foetal to fatal attraction. Semin. Cancer Biol. 21, 183–199 (2011).
Article Google Scholar
Hopfield, J. J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl Acad. Sci. USA 79, 2554–2558 (1982).
Article CAS Google Scholar
Lyapunov, A. M. The general problem of the stability of motion. Int. J. Control 55, 531–534 (1992).
Article Google Scholar
Kolle, G. et al. Identification of human embryonic stem cell surface markers by combined membrane-polysome translation state array analysis and immunotranscriptional profiling. Stem Cells 27, 2446–2456 (2009).
Article CAS Google Scholar
Kouno, T. et al. Temporal dynamics and transcriptional control using single-cell gene expression analysis. Genome Biology 10, R118 (2013).
Article Google Scholar
Sartor, M. A. et al. ConceptGen: a gene set enrichment and gene set relation mapping tool. Bioinformatics 26, 456–463 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Stefan Maetschke for valuable comments. A.T.F. was supported by a UQ Graduate School International Travel Award (GSITA). The project was supported in part by strategic funds from The University of Queensland.

Author information

Authors and Affiliations

Institute for Molecular Bioscience and ARC Centre of Excellence in Bioinformatics, The University of Queensland, St Lucia, Brisbane, QLD, Australia
Atefeh Taherian Fard, Sriganesh Srihari & Mark A Ragan
Department of Systems & Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA,
Jessica C Mar
Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
Jessica C Mar

Authors

Atefeh Taherian Fard
View author publications
You can also search for this author in PubMed Google Scholar
Sriganesh Srihari
View author publications
You can also search for this author in PubMed Google Scholar
Jessica C Mar
View author publications
You can also search for this author in PubMed Google Scholar
Mark A Ragan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark A Ragan.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Supplementary information

Supplementary Information (DOC 2155 kb)

Supplementary Table S1 (PDF 29 kb)

Supplementary Table S2 (XLS 1682 kb)

Supplementary Table S3 (PDF 40 kb)

Supplementary Table S4 (XLS 1934 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Fard, A., Srihari, S., Mar, J. et al. Not just a colourful metaphor: modelling the landscape of cellular development using Hopfield networks. npj Syst Biol Appl 2, 16001 (2016). https://doi.org/10.1038/npjsba.2016.1

Download citation

Received: 03 July 2015
Revised: 05 November 2015
Accepted: 15 December 2015
Published: 18 February 2016
DOI: https://doi.org/10.1038/npjsba.2016.1