Single-cell variability in multicellular organisms

Smith, Stephen; Grima, Ramon

doi:10.1038/s41467-017-02710-x

Download PDF

Article
Open access
Published: 24 January 2018

Single-cell variability in multicellular organisms

Stephen Smith¹ &
Ramon Grima¹

Nature Communications volume 9, Article number: 345 (2018) Cite this article

5838 Accesses
39 Citations
24 Altmetric
Metrics details

Subjects

Abstract

Noisy gene expression is of fundamental importance to single cells, and is therefore widely studied in single-celled organisms. Extending these studies to multicellular organisms is challenging since their cells are generally not isolated, but individuals in a tissue. Cell–cell coupling via signalling, active transport or pure diffusion, ensures that tissue-bound cells are neither fully independent of each other, nor an entirely homogeneous population. In this article, we show that increasing the strength of coupling between cells can either increase or decrease the single-cell variability (and, therefore, the heterogeneity of the tissue), depending on the statistical properties of the underlying genetic network. We confirm these predictions using spatial stochastic simulations of simple genetic networks, and experimental data from animal and plant tissues. The results suggest that cell–cell coupling may be one of several noise-control strategies employed by multicellular organisms, and highlight the need for a deeper understanding of multicellular behaviour.

A top-down measure of gene-to-gene coordination for analyzing cell-to-cell variability

Article Open access 26 May 2021

Independent control of mean and noise by convolution of gene expression distributions

Article Open access 29 November 2021

Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments

Article Open access 09 December 2022

Introduction

It is now well established that stochastic gene expression is the main driver of phenotypic variation in populations of genetically identical cells^1,2. In populations of single-celled organisms, individuals are known to switch between metabolic states³ or antibiotic resistant states⁴, and to randomly choose the timing of reproduction⁵, among other stochastic survival strategies. The availability of single-cell fluorescence data has precipitated a wealth of mathematical modelling approaches to understand single-cell noise based on the chemical master equation (CME)⁶, such as the stochastic simulation algorithm (SSA)⁷, the finite-state projection algorithm (FSP)⁸, and the linear noise approximation (LNA)^9,10.

In multicellular organisms, mouse olfactory development¹¹ and Drosophila vision¹² are well-known examples of stochastic gene expression in tissues, along with pattern formation^13,14 and phenotypic switching of cancer cells¹⁵. More recently, it has been observed that tissue-bound cells can take advantage of polyploidy to reduce noise¹⁶. Nevertheless, single-cell variability in tissues is considerably less well understood than in isolated cells, for two main reasons.

Firstly, acquiring fluorescence data for tissue-bound cells requires a combination of high-resolution imaging and cell segmentation software that has only recently become possible for mRNA localisation¹⁷ and still poses a significant challenge for proteins. The difficulty of accurate segmentation of tissue-bound cells means that the majority of segmented time course data still concerns populations of isolated cells¹⁸, while tissue-level data has historically been too low-resolution to distinguish individual cell outlines¹⁹, though improvements in microscopy are increasingly eliminating this problem¹⁶.

Secondly, the transfer of material between tissue-bound cells makes mathematical modelling of tissues significantly more complex than equivalent isolated cell models. In addition to the long-range endocrine networks which connect all cells in a tissue, neighbouring cells communicate via complex paracrine signalling networks²⁰, and also via small watertight passages such as gap junctions in animals, and plasmodesmata in plants. In plant cells, molecules up to and including proteins are known to move through plasmodesmata by pure diffusion^21,22, while those as large as mRNA are actively transported²³. In animal cells, peptides diffuse through gap junctions²⁴, while larger molecules have been shown to be transported across cytoplasmic bridges²⁵ or tunnelling nanotubes²⁶. A single cell in a tissue is therefore partially dependent on its neighbour cells, but also partially independent of them, and so mathematical models of cells within multicellular organisms must take account of this coupling.

In this article, we start from a general mathematical description of a tissue of cells, in which each cell contains an identical stochastic genetic network, with identical reaction rates. Our description permits molecules to move from a cell to a neighbouring cell with a given transport rate or coupling strength, representing signalling, active transport, or pure diffusion. We subsequently consider two special cases: when the coupling is very weak and very strong. In both of these cases, our complex mathematical description reduces to simple expressions for the single-cell variability. These equations are completely generic, and apply to any biochemical network including oscillatory and multimodal systems.

The implication of the equations is that single-cell variability is controlled by the strength of cell–cell coupling, in a manner that depends on the Fano factor (FF) of the underlying genetic network. If FF > 1, then cell–cell coupling will tend to reduce the single-cell variability (or equivalently, the heterogeneity of the tissue); whereas if FF < 1, then coupling will tend to increase the single-cell variability. To confirm our theory, we use spatial stochastic simulations of three biochemical networks, and experimental data from rat pituitary tissue, a leaf of Arabidopsis thaliana, and a population of mouse fibroblast cells.

Results

Illustratory examples

Modelling approaches to genetic networks such as the CME, SSA, or LNA assume that the cell is an isolated volume with no molecules entering or leaving the system from outside (Fig. 1a). Tissues of cells violate this assumption: each cell is connected to a number of neighbour cells (Fig. 1b), and molecules involved in the genetic network can be transported from cell to cell. The differences between a population of identical independent cells and a tissue of identical connected cells can be seen with stochastic simulations of a simple genetic network.

In Fig. 1c we plot three independent realisations of the SSA for the well-known two-stage gene expression network⁶:

$$\emptyset \begin{array}{*{20}{c}} {v_0} \\ \rightleftharpoons \\ {d_0} \end{array}M,M\mathop{\longrightarrow}\limits^{{v_1}}M + P, P\mathop{\longrightarrow}\limits^{{d_1}}\emptyset ,$$

(1)

in which a molecule of mRNA (M) is transcribed with rate v₀ and decays with rate d₀. The mRNA can translate a protein (P) with rate v₁ which in turn decays with rate d₁. The trajectories in Fig. 1c correspond to the number of protein molecules in three independent cells.

To model a tissue, we imagine a N × N grid of cells (Fig. 1b) numbered from 1 to N² with the genetic network (1) inside each cell. In addition, we couple neighbouring pairs of cells by allowing the protein P to be transported between them with a rate t. To model this, we think of protein transport from cell i to cell j as a simultaneous decay of protein in cell i and creation of protein in cell j. Specifically, we can write the system in cell i as:

$$\emptyset \begin{array}{*{20}{c}} {v_0} \\ \rightleftharpoons \\ {d_0} \end{array}M_i,M_i\mathop{\longrightarrow}\limits^{{v_1}}M_i + P_i,P_i\mathop{\longrightarrow}\limits^{{d_1}}\emptyset ,P_i\begin{array}{*{20}{c}} t \\ \rightleftharpoons \\ t \end{array}P_j,$$

(2)

where M_i and P_i denote the mRNA and protein respectively in cell i, and the reaction $P_i\begin{array}{*{20}{c}} t \\ \rightleftharpoons \\ t \end{array}P$ denotes the transport of protein from cell i to cell j, if i and j are neighbouring cells. Transport is therefore modelled as a kind of 'reaction' involving two species P_i and P_j, though biologically these are really the same species in different locations. We note that this model of transport implies exponentially distributed waiting times between successive transport events, an assumption that has previously been used when modelling active transport in tissues^27,28, and in modelling reaction-diffusion systems^29,30. The main results of this article do not depend on the exponential assumption since we only analyse the fast limit of transport, though it is convenient for simulations at finite transport rates.

This description of transport has a clear advantage for modelling: we have reframed a complex problem of cell–cell transport into a simpler problem of species and reactions on which, in principle, we can use the FSP, SSA, or even LNA. In reality, the FSP and LNA are impractical for such systems, owing to their large dimensions: for a tissue with 100 cells, the system (2) consists of many more species than system (1) (200 rather than 2), many more chemical reactions (400 rather than 4) and many additional transport “reactions” (roughly 400, rather than 0). The SSA, however, is still a useful technique for getting accurate data about tissue systems like 2, though it will obviously be substantially slower than for single-celled systems like (1).

We simulated system (2) with a version of the SSA³¹ with N² = 100 cells, giving 100 trajectories of protein number, one for each cell. Three typical trajectories are plotted in Fig. 1d. Notably, the tissue trajectories in Fig. 1d are considerably less variable (more homogeneous) than the isolated cell trajectories in Fig. 1c.

This apparent increase in homogeneity is perhaps unsurpising, and may be thought of as the obvious consequence of increasing coupling. However, remarkably, coupling can also reduce the homogeneity in a tissue. For example, a simple system representing the synthesis of a protein P, and its consequent dimerisation into a homodimer D, is defined by the reactions:

$$\emptyset \mathop{\longrightarrow}\limits^{{k_1}}P, P + P\mathop{\longrightarrow}\limits^{{k_2}}D.$$

(3)

In a tissue, this system involves the transport of P between neighbouring cells. We simulated system (3) both without and with transport using the same version of the SSA, and three typical single-cell trajectories of the protein P are plotted in Fig. 1e, f respectively. The independent cell trajectories are relatively homogeneous, while the tissue-bound cell trajectories are substantially more variable (more heterogeneous).

An intuitive explanation can be made for these initially surprising observations. The transport of molecules between cells has two distinct effects on the single-cell variability: (1) by moving molecules into and out of cells, it allows for greater cell–cell variation; (2) by smoothing out concentration gradients between neighbouring cells, it homogenises concentrations across the tissue.

The effect of transport on single-cell variability is determined by the trade-off between effects (1) and (2). In Fig. 1c, the cells have large fluctuations in molecule number which will be reduced by effect (2), but new fluctuations will be induced by effect (1). In Fig. 1d new fluctuations have been added, but these are not large enough to offset the reduction in the original fluctuations, and so, overall, homogeneity is increased by transport. Meanwhile, for system (3), effect (2) is much less significant because the single-cell variability in Fig. 1e is already small, so effect (1) dominates and there is an overall increase in heterogeneity in Fig. 1f.

Theory

To make this intuition mathematically precise (see Methods), we consider a system of any number of species and any number of reactions, a tissue with volume V_T, and a cell of volume V_C. Furthermore, we let n be the number of molecules of a species of interest in the tissue, while m is the number of molecules of that species in the cell. Note that because each cell contains the same genetic network, the variance of fluctuations in a single cell, $\left\langle {m^2} \right\rangle - \left\langle m \right\rangle ^2$, is a non-normalised measure of the single-cell variability. We define our measure of single-cell variability, L, to be:

$$L = \frac{{{V_{\mathrm{T}}}\left( {\left\langle {m^2} \right\rangle - \left\langle m \right\rangle ^2} \right)}}{{V_{\mathrm{C}}\left( {\left\langle {n^2} \right\rangle - \left\langle n \right\rangle ^2} \right)}},$$

(4)

i.e., the ratio of the variance of fluctuations in a single cell to the variance of fluctuations in the entire tissue, scaled by volume. The reason for this definition becomes clear when we consider the limiting case of weak cell–cell coupling (i.e., isolated cells). In this case, each cell is completely independent and hence it directly follows by the Bienaymé formula³² (Methods section) that:

$$\left( {\left\langle {m^2} \right\rangle - \left\langle m \right\rangle ^2} \right) = \frac{{V_{\mathrm{C}}}}{{V_{\mathrm{T}}}}\left( {\left\langle {n^2} \right\rangle - \left\langle n \right\rangle ^2} \right),$$

(5)

which immediately implies that L = 1. That is, if L = 1, then the cell-to-cell variability is at the level we would expect if the cells were completely independent of each other. This can be considered as a neutral state, neither particularly heterogeneous, nor especially homogeneous.

If L < 1, then the single-cell variability is lower than we would expect from the Bienaymé formula, given the tissue-level variance. It follows that the cells are more homogeneous than decoupled cells. On the other hand, if L >1, then the cell-to-cell variability is higher than we would expect from the Bienaymé formula, given the tissue-level variance. It follows that the cells are more heterogeneous than decoupled cells. L is therefore a non-dimensional statistical measure of single-cell variability (or equivalently, population heterogeneity).

With this in mind, we next consider what happens to a tissue of cells with cell–cell coupling. At zero coupling, we will naturally have L = 1. As the coupling strength increases, L will change, but the magnitude of the change will depend on a number of system-specific factors including the topology of the tissue (which cells are coupled to which), the structure of the genetic network, and the rates of the reactions involved. To bypass these issues, we consider the special case of infinitely fast cell–cell transport, and we reason that the behaviour at finite transport rates will lie between the zero coupling and infinite coupling cases.

Biologically we can think of infinite coupling as the extreme case where a protein will move from cell to cell many times during its lifetime. In this case, the probability of finding a given molecule in a given cell is simply $\frac{{V_{\mathrm{C}}}}{{V_{\mathrm{T}}}}$. This implies that the probability distribution governing the number of molecules in the cell is a convolution of the solution of the CME (which describes the whole tissue) and a binomial distribution (Methods section). While the convolution is generally impossible to solve, remarkably we can obtain a simple expression linking the variance of fluctuations in the cell, with the variance in the entire tissue:

$$\left\langle {m^2} \right\rangle - \left\langle m \right\rangle ^2 = \frac{{V_{\rm C}}}{{V_{\rm T}}}\left\langle n \right\rangle + \frac{{V_{\rm C}^2}}{{V_{\rm T}^2}}\left( {\left\langle {n^2} \right\rangle - \left\langle n \right\rangle - \left\langle n \right\rangle ^2} \right).$$

(6)

Combining Eqs. (4) and (6), and defining the Fano factor (FF) as the ratio of tissue-level variance to the mean, ${\rm FF} = \frac{{\left\langle {n^2} \right\rangle - \left\langle n \right\rangle ^2}}{{\left\langle n \right\rangle }}$, we find that the single-cell variability at infinite coupling is given by:

$$L = \frac{{V_{\rm C}}}{{V_{\rm T}}} + \frac{{1 - \frac{{V_{\rm C}}}{{V_{\rm T}}}}}{{{\rm FF}}}.$$

(7)

We note now that FF is a standard statistical measure of the size of fluctuations. Probability distributions with FF = 1 are said to have Poissonian fluctations, while FF < 1 corresponds to subpoissonian and FF > 1 to superpoissonian. Our earlier intuition suggested that systems with large fluctuations would tend to see a reduction in cell-to-cell variability as coupling strength increases. Now we see that this is indeed the case: combining FF > 1 with Eq. (7), we find that L < 1 at infinite coupling strength, suggesting that coupling tends to decrease single-cell variability for superpoissonian systems. Alternatively, choosing FF < 1 we find that L > 1 at infinite coupling strength, implying that coupling will increase single-cell variability for subpoissonian systems.

For system (1), the Fano factor can be computed exactly since the moment equations for the corresponding CME are closed³³. In particular, we have that ${\rm FF} = 1 + \frac{{v_1}}{{d_0 + d_1}} > 1$, implying that increasing the transport rate will reduce the single-cell variability, as shown in Fig. 1c, d.

For system (3), the presence of a bimolecular reaction prevents the moment equations from closing, and so the moments are instead obtained from the steady-state distribution of molecule numbers. The mean and variance are given in ref.³⁴, but we will not state them here since they are complicated expressions. Instead we note that FF < 1 for all parameter values, suggesting that the single-cell variability will increase as cell–cell transport increases. See the next section for more details of these calculations.

For these examples the qualitative changes in single-cell variability are independent of parameter values, though this would not be the case for systems with Fano factors which can vary from subpoissonian to superpoissonian. We note that these results are independent of the spatial structure of the tissue: they apply equally to neighbour-neighbour and long-range interactions, and indeed any kind of coupling provided no cells in the tissue are disconnected from the population. We also stress that these results apply equally to systems out of equilbrium, including oscillatory systems and systems far from steady-state, since no assumptions have been made on the type of biochemical network inside each cell.

Verification of theory using stochastic simulations

Our theory pedicts that the single-cell variability L should move from 1 to the value given in Eq. (7) as cell–cell transport increases. In this section we test the accuracy of this prediction on data from detailed stochastic simulations using a version of the SSA³¹ that is well-suited to simulating tissues.

First, we again consider the two-stage gene expression system (1) as shown in Fig. 1c, d and Fig. 2 inset. Since the moments of the CME are closed for this system³³ we can find exact expressions for the tissue-level mean, $\frac{{V_{\rm T}v_0v_1}}{{d_0d_1}}$, and the tissue-level variance, $\frac{{V_{\rm T}v_0v_1}}{{d_0d_1}}\left( {1 + \frac{{v_1}}{{d_0 + d_1}}} \right)$. It follows that ${\rm FF} = 1 + \frac{{v_1}}{{d_0 + d_1}}$, and so Eq. (7) implies that L will decrease from 1 to $\frac{{d_0 + d_1 + v_1\frac{{V_{\rm C}}}{{V_{\rm T}}}}}{{d_0 + d_1 + v_1}}$ as transport increases.

As a second example we consider the protein synthesis and dimerisation system (3) as shown in Fig. 1e, f and 3 inset. The mean and variance are given in ref.³⁴, and they imply that ${\rm FF} = \frac{3}{4} - \phi \frac{{I_1^\prime (4\phi )}}{{I_1(4\phi )}} + \frac{\phi }{{\left( {\frac{1}{{4\phi }} + \frac{{I_1^\prime (4\phi )}}{{I_1(4\phi )}}} \right)}}$, where $\phi = V_{\rm T}\sqrt {\frac{{k_1}}{{2k_2}}}$ and I₁(x) is the modified Bessel function of the first kind³⁵ and $I_1^\prime$ (x) is its derivative. Numerical analysis confirms that FF < 1 for all values of ϕ, suggesting that L will increase from 1 as cell–cell transport increases.

For our third example we consider the bimodal three-stage gene expression network studied in refs. 6,9 (Fig. 4 inset):

$$\begin{array}{l}G_{{\mathrm{on}}}\begin{array}{*{20}{c}} {k_{{\mathrm{off}}}} \\ \rightleftharpoons \\ {k_{{\mathrm{on}}}} \end{array}G_{{\mathrm{off}}},G_{{\mathrm{on}}}\mathop{\longrightarrow}\limits^{{v_0^{{\mathrm{on}}}}}G_{{\mathrm{on}}} + M,G_{{\mathrm{off}}}\mathop{\longrightarrow}\limits^{{v_0^{{\mathrm{off}}}}}G_{{\mathrm{off}}} + M,\\ M\mathop{\longrightarrow}\limits^{{d_0}}\emptyset ,M\mathop{\longrightarrow}\limits^{{v_1}}M + P,P\mathop{\longrightarrow}\limits^{{d_1}}\emptyset ,\end{array}$$

(8)

in which a gene can be in an active state (G_on) or an inactive state (G_off). The active gene transcribes mRNA (M) with a rate $v_0^{{\mathrm{on}}}$, while the inactive gene transcribes mRNA with a rate $v_0^{{\mathrm{off}}}$. The protein is translated as in the earlier system (1). We again calculate the mean and variance of fluctuations for the protein P from the moment equations, as for the previous examples, and we find that the Fano factor is larger than 1 so Eq. (7) again implies that L will decrease as cell transport increases.

In summary, in Figs. 2, 3 and 4 we compare the analytical expressions for fast transport, Eq. (7), with the simulation data for systems (1), (3) and (8), respectively. It is clear that in every case our theoretical predictions are correct. For each example the single-cell variability, L, from simulations (blue squares) moves from the slow limit, 1, (green line) to the predicted fast transport limit (red line). As predicted by the Fano factor criterion, for systems (1) and (8) L decreases with transport rate, while L increases for system (3).

Application to experimental data

Testing our predictions on simulations is useful, because by varying the rate of transport we can confirm that increasing it leads to the predicted change in single-cell variability, but with experimental data the transport rate is both fixed and completely unknown. However, we know that L lies between 1 and the value given by Eq. (7), and that the parameters of Eq. (7) are either tissue-level quantities (FF), or easily calculable (V_T and V_C). It follows that we can use tissue-level time course data to estimate L, without any knowledge of the underlying genetic network. In general, the corresponding single-cell data would not be available, however we specifically choose examples with both tissue-level and single-cell data so as to check that our estimates are correct.

We first apply our method to fluorescence data of GFP concentration in two distinct rat pituitary tissues²⁰ in which cells communicate both via paracrine signalling and active transport across gap junctions. The fluorescence data is available at the single-cell level, so the tissue-level data is obtained simply by summing up the single-cell fluorescence. We apply our method to this tissue-level data, and subsequently check its accuracy using single cells.

The first tissue is taken from a day 18.5 embryonic rat (E18.5), where cell–cell junctions are rare and their related proteins (E-, N-cadherin and β-catenin) and paracrine signalling proteins are expressed at a low level; while the second tissue is taken from a day 1.5 post-natal rat (P1.5) where junctions are considerably more common, and there is a high level of expression of related proteins²⁰. The authors of ref. ²⁰ note that the P1.5 tissue, while clearly more mature than the E18.5 tissue, has not yet reached the level of connectivity of the adult tissue for which the number of gap junctions is likely to be even higher. With this in mind, we expect L in the E18.5 cells to be noticeably closer to 1 than the P1.5 cells, but the P1.5 cells should not be too close to (7) since they still are not fully mature.

In Fig. 5a, b we plot the fast and slow transport limits (green and red lines), and L averaged over each cell (blue squares) and bars representing one standard deviation above the mean (blue bars) for the E18.5 and P1.5 tissues respectively. As expected, L remains between the two in both cases, but is noticeably closer to 1 in Fig. 5a than in Fig. 5b.

The above dataset is further confirmation of our theory, but both it and the simulated systems are either in equilibrium or approaching it. Since this is frequently not the case in reality, we now apply our method to two oscillating datasets, one which we expect to have fast transport and one with slow transport. We stress that our method should apply to oscillatory systems since we have made no assumptions about the underlying genetic networks—only that the same genetic networks should be present in each cell, with the same reaction rates.

The second dataset corresponds to luminescence data of an oscillating protein concentration in a single leaf of Arabidopsis thaliana¹⁹. The luminescence data is available in image form, in which each pixel is close to single-cell resolution, so we can apply our method to the whole-leaf protein trajectory and subsequently check its accuracy with the single-pixel data.

In Fig. 6a we plot the slow and fast transport limits (green and red lines respectively) over time, and also L averaged over each pixel of the leaf image (blue squares) and bars representing one standard deviation above the mean (blue bars). Since proteins are frequently transferred between cells in a plant tissue, we might expect L to remain between the two limits but close to the fast limit, and such proves to be the case.

The third dataset consists of luminescence data of an oscillating protein concentration in a small population of mouse fibroblast cells³⁶. The cells were imaged on the same plate for a period of over a month, and were sufficiently far apart that single-cell resolution is easily possible. The cells therefore do not exactly form a tissue (though we might expect some very low-level exchange of material) so we expect L to be close to 1.

In Fig. 6b we plot the limits (green and red lines), and L averaged over each cell (blue squares) and bars representing one standard deviation above the mean (blue bars). As expected, L remains between the two, but is significantly closer to 1 than to the fast limit.

Discussion

Single-cell variability in tissues is an immediate consequence of the stochastic nature of gene expression, but it can have significant phenotypic implications ranging from pattern formation to cancer. The cell–cell coupling characteristic of tissues ensures that the question of single-cell variability will be more complex than in the well-studied case of noise in single-celled organisms. As an attempt to address this question, we introduced a measure of single-cell variability, L, which is a non-dimensional statistical coefficient which determines whether the cells in a tissue are more or less heterogeneous than an equivalent population of independent cells. We found that L is sensitive to the strength of coupling between cells in the tissues, in a manner that depends on the statistics of the underlying genetic network.

In the case of biochemical systems which naturally have large stochastic fluctuations (superpoissonian systems), we showed that increasing the coupling strength will tend to decrease the single-cell variability. On the other hand, for systems with small stochastic fluctuations (subpoissonian systems), we found that increasing the coupling strength will tend to increase the single-cell variability. These predictions were confirmed with stochastic simulations of simple genetic networks, and experimental data from both animal and plant tissues. These results suggest that cell–cell coupling could be one of several techniques cells use to control noise, while also highlighting the need for much greater understanding of multicellular behaviour.

We note here that we have focussed on fluctuations caused by the stochastic nature of biochemical reactions (intrinsic noise) and not noise induced by fluctuations in the enviromental conditions (e.g., light level, temperature). This is because environmental fluctuations affect each cell in the population equally³⁷, and so will not affect the heterogeneity of the population.

We also note that, although we are interested in heterogeneity, we have concentrated here on homogeneous tissues, that is, tissues where each cell contains an identical genetic network. We have found that it may be possible to extend our results to tissues with heterogeneous populations of cells, where different cells could have different noise statistics, though the relevant analyses are more complex than those in this article. This extension of our results would have fascinating applications to tissue ageing³⁸ and tumour growth, and will be the subject of a future paper.

Methods

Derivation of fast and slow limits of L

We consider a tissue of N² cells with volume V_T, and a single cell with volume V_C = V_T/N², as well as identical systems of M chemical species X₁, …, X_M interacting in each cell. Let $P\left( {\vec n;V_{\rm T}} \right)$ be the probability that there are $\vec n = \left( {n_1, \ldots ,n_{ M}} \right)$ molecules of X₁, …, X_M respectively in the entire tissue. The tissue-level Fano factor of species X_j is defined as ${\rm FF}_j = \frac{{\left\langle {n_j^2} \right\rangle - \left\langle {n_j} \right\rangle ^2}}{{\left\langle {n_j} \right\rangle }}$. Now, let $\vec m = \left( {m_1, \ldots ,m_{M}} \right)$ be the number of molecules of X₁, …, X_M, respectively, in the single cell, and let the corresponding probability distribution be $Q\left( {\vec m;V_{\rm C}} \right)$.

If transport is slow, the system in each cell is independent of the rest of the population, and so, by the Bienaymé formula³², the sum of the variances in each cell is equal to the variance in the tissue. The variance in a single cell is then given by, $\left\langle {m_j^2} \right\rangle - \left\langle {m_j} \right\rangle ^2 = \left[ {\left\langle {n_j^2} \right\rangle - \left\langle {n_j} \right\rangle ^2} \right]{\mathrm{/}}N^2$ = $\left( {V_{\rm C}{\mathrm{/}}V_{\rm T}} \right)\left[ {\left\langle {n_j^2} \right\rangle - \left\langle {n_j} \right\rangle ^2} \right]$. Furthermore, since all cells are statistically identical the mean concentration in each cell is the same and equal to that of tissue, $\left\langle {m_j} \right\rangle {\mathrm{/}}V_{\rm C} = \left\langle {n_j} \right\rangle {\mathrm{/}}V_{\rm T}$. It follows immediately from these considerations that L = 1.

For the fast transport limit we can relate the local solution Q to the global solution P using the theorem of total probability, $Q\left( {\vec m;V_{\rm C}} \right)$ = $\mathop {\sum}\nolimits_{\vec n = 0}^\infty Pr\left( {\vec m|\vec n;V_{\rm C},V_{\rm T}} \right)P\left( {\vec n;V_{\rm T}} \right)$, where the notation $\mathop {\sum}\nolimits_{\vec n = 0}^\infty ( \cdot )_{}^{}$ is shorthand for $\mathop {\sum}\nolimits_{n_1 = 0}^\infty \ldots \mathop {\sum}\nolimits_{n_M = 0}^\infty ( \cdot )$, and $Pr\left( {\vec m|\vec n;V_{\rm C},V_{\rm T}} \right)$ is the probability of finding $\vec m$ molecules of X₁, …, X_M respectively in V_C given that there are $\vec n_{}^{}$ molecules respectively in V_T.

The limit of fast transport implies that molecules move into and out of the the cell much more frequently than they are involved in reactions. The molecules are uniformly distributed in V_T under these conditions, so that the probability that a randomly chosen molecule is in V_C is simply $\frac{{V_{\mathrm C}}}{{V_{\mathrm T}}}$. It follows from combinatorics that the probability of finding m_j molecules of species X_j in V_C given that there are n_j in V_T is $\left( {n_j!} \right.{\mathrm{/}}\left( {m_j!\left( {n_j - m_j} \right)!} \right)$ $\left( {V_{\rm C}{\mathrm{/}}V_{\rm T}} \right)^{m_j}\left( {1 - V_{\rm C}{\mathrm{/}}V_{\rm T}} \right)^{n_j - m_j}$, that is, a Binomial distribution. It follows that $Pr\left( {\vec m|\vec n;V_{\rm C},V_{\rm T}} \right)$ is the product of the mass functions of M Binomial $\left( {n_j,V_{\rm C}{\mathrm{/}}V_{\rm P}} \right)$ distributions for each species X_j. An expression for the single-cell distribution Q in terms of the global distribution P can then be written, $Q\left( {\vec m;V_{\rm C}} \right) = \mathop {\sum}\nolimits_{\vec n = 0}^\infty P\left( {\vec n;V_{\rm T}} \right)\mathop {\prod}\nolimits_{j = 1}^M$ $\left[ {\left( {\begin{array}{*{20}{c}} {n_j} \\ {m_j} \end{array}} \right)\left( {\frac{{V_{\rm C}}}{{V_{\rm T}}}} \right)^{m_j}\left( {1 - \frac{{V_{\rm C}}}{{V_{\rm T}}}} \right)^{n_j - m_j}{\bf{1}}_{\vec m \le \vec n}} \right].$

The indicator function ${\bf{1}}_{\vec m \le \vec n}$ prevents the expression from evaluating the impossible probabilities of finding more molecules in V_C than in V_T, and therefore permits us to sum from zero to infinity without worry. Since this equation gives the single-cell distribution Q, we can use it to evaluate the single-cell second moment which is given by $\left\langle {m_j^2} \right\rangle = \mathop {\sum}\nolimits_{\vec m = 0}^\infty m_j^2Q\left( {\vec m,V_{\mathrm{C}}} \right)$. Swapping the summations over $\vec n$ and $\vec m$, and absorbing the indicator function into the latter summations, gives:

$$\begin{array}{l}\left\langle {m_j^2} \right\rangle = \mathop {\sum}\limits_{\vec n = 0}^\infty P\left( {\vec n;V_{\rm T}} \right) \times \\ \left[ {\mathop {\sum}\limits_{m_1 = 0}^{n_1} \ldots \mathop {\sum}\limits_{m_M = 0}^{n_M} m_j^2\mathop {\prod}\limits_{k = 1}^M \left( {\begin{array}{*{20}{c}} {n_k} \\ {m_k} \end{array}} \right)\left( {\frac{{V_{\rm C}}}{{V_{\rm T}}}} \right)^{m_k}\left( {1 - \frac{{V_{\rm C}}}{{V_{\rm T}}}} \right)^{n_k - m_k}} \right].\end{array}$$

(9)

The local second moment is therefore the expected value of the quantity in square brackets under the global distribution P. The quantity in square brackets, however, is merely the expected value of $m_j^2$ under the M independent Binomial$\left( {n_j,V_{\rm C}{\mathrm{/}}V_{\rm T}} \right)$ distributions, and is therefore equal to $n_j\left( {\frac{{V_{\rm C}}}{{V_{\rm T}}}} \right)\left( {1 - \frac{{V_{\rm C}}}{{V_{\rm T}}}} \right) + n_j^2\frac{{V_{\rm C}^2}}{{V_{\rm T}}}$. It follows that the local second moment is simply given by $\left\langle {m_j^2} \right\rangle = \frac{{V_{\rm C}}}{{V_{\rm T}}}\left\langle {n_j} \right\rangle$ + $\frac{{V_{\rm C}^2}}{{V_{\rm T}^2}}\left\langle {n_j^2} \right\rangle - \frac{{V_{\rm C}^2}}{{V_{\rm T}^2}}\left\langle {n_j} \right\rangle$ It subsequently follows that the fast transport limit of L has the form of Eq. (7).

Data analysis

Fluorescence trajectories are passed through a moving average filter with a window size dependent on the time-resolution of the data. The smoothed trajectory is considered to be a time-dependent mean, $\left\langle n \right\rangle$. Subtracting the mean from the raw trajectory gives a stationary noise component. The variance of this component is used to obtain a time-dependent estimate for L or FF.

Data availability

All relevant data are available from the authors.

References

Maamar, H., Raj, A. & Dubnau, D. Noise in gene expression determines cell fate in Bacillus subtilis. Science 317, 526–529 (2007).
Article CAS PubMed ADS Google Scholar
Raj, A. & van Oudenaarden, A. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell 135, 216–226 (2008).
Article CAS PubMed PubMed Central Google Scholar
Ozbudak, E. M., Thattai, M., Lim, H. N., Shraiman, B. I. & Van Oudenaarden, A. Multistability in the lactose utilization network of Escherichia coli. Nature 427, 737–740 (2004).
Article CAS PubMed ADS Google Scholar
Balaban, N. Q., Merrin, J., Chait, R., Kowalik, L. & Leibler, S. Bacterial persistence as a phenotypic switch. Science 305, 1622–1625 (2004).
Article CAS PubMed ADS Google Scholar
Nachman, I., Regev, A. & Ramanathan, S. Dissecting timing variability in yeast meiosis. Cell 131, 544–556 (2007).
Article CAS PubMed Google Scholar
Shahrezaei, V. & Swain, P. S. Analytical distributions for stochastic gene expression. Proc. Natl Acad. Sci. USA 105, 17256–17261 (2008).
Article CAS PubMed PubMed Central ADS Google Scholar
McAdams, H. H. & Arkin, A. Stochastic mechanisms in gene expression. Proc. Natl Acad. Sci. USA 94, 814–819 (1997).
Article CAS PubMed PubMed Central ADS Google Scholar
Munsky, B. & Khammash, M. The finite state projection algorithm for the solution of the chemical master equation. J. Chem. Phys. 124, 044104 (2006).
Article PubMed MATH ADS Google Scholar
Thomas, P., Popovic, N. & Grima, R. Phenotypic switching in gene regulatory networks. Proc. Natl Acad. Sci. USA 111, 6994–6999 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Schnoerr, D., Sanguinetti, G. & Grima, R. Approximation and inference methods for stochastic biochemical kinetics—a tutorial review. J. Phys. A Math. Theor. 50, 093001 (2017).
Article MathSciNet MATH ADS Google Scholar
Vassar, R., Ngai, J. & Axel, R. Spatial segregation of odorant receptor expression in the mammalian olfactory epithelium. Cell 74, 309–318 (1993).
Article CAS PubMed Google Scholar
Wernet, M. F. et al. Stochastic spineless expression creates the retinal mosaic for colour vision. Nature 440, 174–180 (2006).
Article CAS PubMed ADS Google Scholar
Zhonghuai, H., Lingfa, Y., Zuo, X. & Houwen, X. Noise induced pattern transition and spatiotemporal stochastic resonance. Phys. Rev. Lett. 81, 2854 (1998).
Article Google Scholar
Sanz-Anchelergues, A., Zhabotinsky, A. M., Epstein, I. R. & Munuzuri, A. P. Turing pattern formation induced by spatially correlated noise. Phys. Rev. E 63, 056124 (2001).
Article CAS ADS Google Scholar
Gupta, P. B. et al. Stochastic state transitions give rise to phenotypic equilibrium in populations of cancer cells. Cell 146, 633–644 (2011).
Article CAS PubMed Google Scholar
Halpern, K. B. et al. Bursty gene expression in the intact mammalian liver. Mol. Cell. 58, 147–156 (2015).
Article PubMed Central Google Scholar
Lyubimova, A. et al. Single-molecule mRNA detection and counting in mammalian tissue. Nat. Protoc. 8, 1743–1758 (2013).
Article PubMed Google Scholar
Downey, M. J. et al. Extracting fluorescent reporter time courses of cell lineages from high-throughput microscopy at low temporal resolution. PLoS ONE 6, e27886 (2011).
Article CAS PubMed PubMed Central ADS Google Scholar
Wenden, B., Toner, D. L., Hodge, S. K., Grima, R. & Millar, A. J. Spontaneous spatiotemporal waves of gene expression from biological clocks in the leaf. Proc. Natl Acad. Sci. USA 109, 6757–6762 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Featherstone, K. et al. Spatially coordinated dynamic gene transcription in living pituitary tissue. Elife 5, e08494 (2016).
Article PubMed PubMed Central Google Scholar
Crawford, K. M. & Zambryski, P. C. Subcellular localization determines the availability of non-targeted proteins to plasmodesmatal transport. Curr. Biol. 10, 1032–1040 (2000).
Article CAS PubMed Google Scholar
Crawford, K. M. & Zambryski, P. C. Non-targeted and targeted protein movement through plasmodesmata in leaves in different developmental and physiological states. Plant Physiol. 125, 1802–1812 (2001).
Article CAS PubMed PubMed Central Google Scholar
Lucas, W. J. et al. Selective trafficking of KNOTTED1 homeodomain protein and its mRNA through plasmodesmata. Science 270, 1980–1983 (1995).
Article CAS PubMed ADS Google Scholar
Kumar, N. M. & Gilula, N. B. The gap junction communication channel. Cell 84, 381–388 (1996).
Article CAS PubMed Google Scholar
Ventela, S., Toppari, J. & Parvinen, M. Intercellular organelle traffic through cytoplasmic bridges in early spermatids of the rat: mechanisms of haploid gene product sharing. Mol. Biol. Cell 14, 2768–2780 (2003).
Article PubMed PubMed Central Google Scholar
Wang, X., Veruki, M. L., Bukoreshtliev, N. V., Hartveit, E. & Gerdes, H. H. Animal cells connected by nanotubes can be electrically coupled through interposed gap-junction channels. Proc. Natl Acad. Sci. USA 107, 17194–17199 (2010).
Article CAS PubMed PubMed Central ADS Google Scholar
Twycross, J., King, J. R., Band, L. R., Bennett, M. J. & Krasnogor, N. Stochastic and deterministic multiscale models for systems biology: an auxin-transport case study. BMC Syst. Biol. 4, 34 (2010).
Article PubMed PubMed Central Google Scholar
Smith, S., Cianci, C. & Grima, R. Analytical approximations for spatial stochastic gene expression in single cells and tissues. J. R. Soc. Interface 13, 20151051 (2016).
Article PubMed PubMed Central Google Scholar
Baras, F. & Mansour, M. M. Reaction-diffusion master equation: a comparison with microscopic simulations. Phys. Rev. E 54, 6139 (1996).
Article CAS ADS Google Scholar
Smith, S. & Grima, R. Breakdown of the reaction-diffusion master equation with nonelementary rates. Phys. Rev. E 93, 052135 (2016).
Article MathSciNet PubMed ADS Google Scholar
Stundzia, A. B. & Lumsden, C. J. Stochastic simulation of coupled reaction-diffusion processes. J. Comput. Phys. 127, 196–207 (1996).
Article CAS MATH ADS Google Scholar
Loeve, M. Elementary probability theory. in Probability Theory I, 1–52 (Springer, NY, 1977).
Grima, R. A study of the accuracy of moment-closure approximations for stochastic chemical kinetics. J. Chem. Phys. 136, 04B616 (2012).
Article Google Scholar
Erban, R. & Chapman, S. J. Stochastic modelling of reaction-diffusion processes: algorithms for bimolecular reactions. Phys. Biol. 6, 046001 (2009).
Article PubMed ADS Google Scholar
Weisstein, E. W. “Modified Bessel Function of the First Kind.” From MathWorld-A Wolfram Web Resource. mathworld.wolfram.com/ModifiedBesselFunctionoftheFirstKind.html, accessed: 2017-09-19.
Leise, T. L., Wang, C. W., Gitis, P. J. & Welsh, D. K. Persistent cell-autonomous circadian oscillations in fibroblasts revealed by six-week single-cell imaging of PER2:: LUC bioluminescence. PLoS ONE 7, e33334 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic gene expression in a single cell. Science 297, 1183–1186 (2002).
Article CAS PubMed ADS Google Scholar
Bahar, R. et al. Increased cell-to-cell variation in gene expression in ageing mouse heart. Nature 441, 1011–1014 (2006).
Article CAS PubMed ADS Google Scholar
Zielinski, T. & Millar, A. Biodare. www.biodare.ed.ac.uk, accessed: 2017-02-08.

Download references

Acknowledgements

This work was supported by a BBSRC EASTBIO PhD studentship to S.S. and by a Leverhulme grant award RPG-2013-171 to R.G. We thank Andrew Millar and Uriel Urquiza for useful discussions.

Author information

Authors and Affiliations

School of Biological Sciences, University of Edinburgh, Mayfield Road, Edinburgh, EH9 3JR, Scotland, UK
Stephen Smith & Ramon Grima

Authors

Stephen Smith
View author publications
You can also search for this author in PubMed Google Scholar
Ramon Grima
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.S. designed research, carried out research and wrote the manuscript. R.G. designed research and helped write the manuscript.

Corresponding author

Correspondence to Ramon Grima.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Smith, S., Grima, R. Single-cell variability in multicellular organisms. Nat Commun 9, 345 (2018). https://doi.org/10.1038/s41467-017-02710-x

Download citation

Received: 28 February 2017
Accepted: 20 December 2017
Published: 24 January 2018
DOI: https://doi.org/10.1038/s41467-017-02710-x

This article is cited by

Studying temporal dynamics of single cells: expression, lineage and regulatory networks
- Xinhai Pan
- Xiuwei Zhang
Biophysical Reviews (2024)
Voices carry
- Adam L. MacLean
Nature Chemical Biology (2023)
Evidence for close molecular proximity between reverting and undifferentiated cells
- Souad Zreika
- Camille Fourneaux
- Sandrine Gonin-Giraud
BMC Biology (2022)
Hematopoietic differentiation is characterized by a transient peak of entropy at a single-cell level
- Charles Dussiau
- Agathe Boussaroque
- Olivier Kosmider
BMC Biology (2022)
The macroscopic limit to synchronization of cellular clocks in single cells of Neurospora crassa
- Jia Hwei Cheong
- Xiao Qiu
- Jonathan Arnold
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.