Learning processes in hierarchical pairs regulate entire gene expression in cells

Yamaguchi, Tomoyuki

doi:10.1038/s41598-022-10998-z

Article
Open access
Published: 09 May 2022

Tomoyuki Yamaguchi¹

Scientific Reports volume 12, Article number: 7549 (2022)

Abstract

Expression of numerous genes is precisely controlled in a cell in various contexts. While genetic and epigenetic mechanisms contribute to this regulation, how each mechanism cooperates to ensure the proper expression patterns of the whole gene remains unclear. Here, I theoretically show that the repetition of simple biological processes makes cells functional with the appropriate expression patterns of all genes if the inappropriateness of current expression ratios is roughly fed back to the epigenetic states. A learning pair model is developed, in which two factors autonomously approach the target ratio by repeating two stochastic processes; competitive amplification with a small addition term and decay depending on the difference between the current and target ratios. Furthermore, thousands of factors are self-regulated in a hierarchical-pair architecture, in which the activation degrees competitively amplify, while transducing the activation signal, and decay at four different probabilities. Changes in whole-gene expression during human early embryogenesis and hematopoiesis are reproduced in simulation using this epigenetic learning process in a single genetically-determined hierarchical-pair architecture of gene regulatory cascades. On the background of this learning process, I propose the law of biological inertia, which means that a living cell basically maintains the expression pattern while renewing its contents.

Introduction

A living cell is a complex adaptive system. The expression of a gene is controlled by many mechanisms, including transcription factors, chromatin modifications, and non-coding RNAs. Fine regulation of multiple genes is required for a cell to function appropriately, depending on the cell type and environment. Big omics data, including whole gene expression data in a single cell, are accumulated by using single-cell RNA sequencing (scRNA-seq) and other technologies. Based on these data, systems biology proposes gene regulatory networks (GRNs) that generate outputs from inputs^1,2,3. However, the modeling of complicated GRNs for many genes requires tuning of numerous parameters. It remains unclear how the expression level of more than 10,000 genes is properly controlled in a cell¹.

While molecular biology investigates causal relationships in cells as if they were well-designed machines, superior machines have acquired learning ability. Deep reinforcement learning and the AlphaGo algorithm in computer science have made great advances in playing board games^4,5. Prior to these technologies, conventional software could not defeat professional board game players, even after collecting large volumes of data and tuning many parameters⁴. Deep reinforcement learning includes a deep neural network and a Monte Carlo tree search. The neural network is a multilayer architecture, through which input data with high dimensionality are processed using a weight matrix⁶. Error is calculated as the difference between the output and the correct answer to alter the weight matrix through backpropagation. Monte Carlo tree search stochastically selects a series of actions⁵. By repeating these trial-and-error processes, the algorithm determines an optimal weight matrix for selecting correct actions in any situation. Such learning processes may be required for cells, which face too many situations to prepare for in advance⁷.

The importance of stochastic and feedback processes has been proposed for complex adaptive systems^7,8,9. Waddington's epigenetic landscape schematically visualizes the processes through which a cell autonomously, not deterministically, reaches an appropriate gene-expression pattern^10,11. A cell may alter its gene expression depending on the inappropriateness of the current pattern. This is a kind of learning process. In this study, I theoretically show that biological processes in gene expression can regulate the expression of all genes at appropriate levels by acting as a learning process.

Results

Amplification and error-dependent decay for learning

I attempt to clarify the processes through which factors autonomously reach their target ratio without individual commands by using a simple simulation model with two factors (Fig. 1a; Table 1). Here, the non-negative integer values of two factors, x_A and x_B, start from an initial value of 1 and change through 10⁴–10⁵ repeats of stochastic increase and decrease processes. The target ratio, T_A:T_B, is set to 1:2. In the increase process, which proceeds at a probability of α_inc = 0.1, either A or B is selected, and the value of the selected factor increases by one. In the decrease process, each unit of x_A and x_B stochastically decays at a probability α_dec ε, where α_dec = 0.1 and ε is the model-specific decay probability defined below. Thus, after a decrease process, x_A decays to a value selected from a binomial distribution with the number of trials x_A and the success probability (1 − 0.1 ε). In this text, the assumptions and settings are written in the present tense, whereas the results of simulation are written in the past tense.

Table 1 Variables and parameters in the models.

For a concrete example, two factors A and B denote family genes A and B that have high similarity in the promoter regions. The value x_A indicates the amount of acetylation of histones, or the openness degree of chromatin, at the gene A locus. Then, the ratio of x_A:x_B is equivalent to the mRNA amount ratio. The delay time from the chromatin modification to the change of protein amounts is not implemented in this simulation. In other possible interpretations, x_A and x_B denote the number of type A cells and B cells in a tissue, the cross-sectional area of main and side branches of a living tree, or the degrees of synapse connectivity of a bifurcating neuron. This abstract model can be applied to various biological phenomena in which two related individual-things are autonomously controlled to an appropriate ratio.

In the first model, x_A and x_B are assumed to change at a fixed probability (Fig. 1b–d). Either x_A or x_B is selected at a 1:1 ratio for the increase, and the decay probability is fixed at ε = 0.1 or 0.01. Simulation results showed similar values in x_A and x_B (Fig. 1b). If the probability of increase in x_B is twice that in x_A (Fig. 1c) or if the decay probability in x_A is twice that in x_B (Fig. 1d), the x_A/x_B ratio approached the target ratio, 0.5. However, these conventional models with individualized probabilities require something that determines the appropriate parameter setting.

In the second model (Fig. 1e), the decay probability is the same for x_A and x_B but changes over time, taking a value that is the mean squared error (MSE) between the current and target ratios: \(\varepsilon(x) = \mathrm{MSE} = \left( \frac{x_A}{x_A + x_B} - \frac{T_A}{T_A + T_B} \right)^{2}\). Regarding the increase, either x_A or x_B is selected at a 1:1 ratio. The dynamics of x_A and x_B exhibited a pattern similar to predator–prey dynamics in ecology, in which the fluctuation in the number of prey x_B slightly preceded that of predator x_A.
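
As a worked example under the settings stated above (initial values x_A = x_B = 1, target T_A:T_B = 1:2, α_dec = 0.1), the error at the first step and the resulting per-unit survival probability can be evaluated directly. The numbers below are plain arithmetic on those settings, not simulation results.

\[
\varepsilon = \left( \frac{1}{1+1} - \frac{1}{1+2} \right)^{2} = \left( \frac{1}{6} \right)^{2} = \frac{1}{36} \approx 0.028,
\qquad
1 - \alpha_{dec}\,\varepsilon = 1 - \frac{0.1}{36} \approx 0.997 .
\]

When the current ratio equals the target (for example, x_A = 10 and x_B = 20), ε = 0 and no decay occurs.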

In the third model (Fig. 1f, g), x_A and x_B are assumed to increase by competitive amplification, in which either x_A or x_B is selected at a ratio of (x_A + β_A):(x_B + β_B) to increase by one, where bias β_A= β_B= 1. When the decay probability ε = 0.1 or 0.01, x_A and x_B fluctuated with a switching pattern in which either A or B dominated transiently (Fig. 1f). When ε is as low as 0.001, the x_A/x_B ratio persisted at a certain value that was stochastically determined at early time points (Fig. 1g).

In the fourth model (Fig. 1h, i), x_A and x_B are assumed to increase by competitive amplification as in the third model, and to decrease by decay with a probability of MSE between the current and target ratios as in the second model. The simulation results showed that the x_A/x_B ratio approached the target ratio of 0.5. Some deviations observed at 10⁴ repeats were reduced after 10⁵ repeats of stochastic processes (Fig. 1i). This model was applicable for other target ratios (Fig. 1h) without tuning parameters. Thus, repeating stochastic processes of competitive amplification and MSE-dependent decay is a system that autonomously learns the target ratio through trial-and-error. The epigenomic regulation of chromatin modification can be interpreted as a competitive amplification process (Table 2)¹². Importantly, actual cells do not know the final target ratio (the correct mRNA ratio) a priori while approaching the target with MSE-dependent decay. As an example of the decay, high stress due to improper expression reduces transcription rates via RNA-mediated epigenomic modification¹³.

Table 2 Assumptions in the learning hierarchical-pair model are supported by biological knowledge.

Amplification may induce a large difference in the value of each factor by exponential growth, making a factor all or nothing. In non-competitive amplification, in which either A or B is selected at a 1:1 ratio and the selected term increases by x_A + 1 or x_B + 1, x_B reached a much higher value than x_A (Fig. 1j). When bias β in competitive amplification is not 1 but rather 10⁻⁷ (which is almost equivalent to 0 and avoids the 0/0 error in processing), x_A decreased to 0 in six of the ten tests (Fig. 1k, l left). Interestingly, the x_A/x_B ratio approached the target ratio in the other four tests. Competition and the addition term of bias are required to avoid extinction in amplification.

In our previously reported immune response model, three processes were assumed to occur during changes in the interaction intensity or cell number: competitive amplification (proliferation), regulated reduction (dissociation), and additive increase (migration)¹⁴. Based on the model, a process of additive increase, in which either x_A or x_B is selected at a 1:1 ratio to increase by one, is chosen at a probability γ in an increase process (Fig. 1l). The condition γ = 0 is equivalent to that in Fig. 1k, whereas γ = 1 is equivalent to that in Fig. 1e. As γ is set to a lower value, the x_A/x_B ratio after 10⁵ repeats became skewed from 1 to 0.5 (target ratio). When γ is negligibly low, x_A sometimes disappeared. When the additive increase is chosen at low probabilities (γ = 0.01, 0.1), the x_A/x_B ratio approached the target ratio (Fig. 1l, m). Therefore, the role of the bias term (β = 1) is equivalent to this small additive increase or white noise.

The learning process can be explained as follows. When MSE and decay probability are high, the x_A/x_B ratio fluctuates in full range, like a rough adjustment, by avoiding the extinction using bias (additive increase) or noise (Fig. 1f). The x_A/x_B ratio is improved on average by the error-dependent decay, which is a random walk with smaller step-size as it gets closer to the target. When the x_A/x_B ratio approaches the target ratio, the ratio persists because the decay rate becomes low and because A or B becomes selected with the ideal ratio. The changes in this autonomously-reached stable state are equivalent to those in conventional models (Fig. 1c), in which the parameters need to be set accurately in advance in contrast to the learning model. In the main simulation hereafter, competitive amplification implies selecting A or B at a (x_A + 1):(x_B + 1) ratio to increase by one with β = 1, γ = 0. This is designated as a learning pair process.
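
The learning pair process summarized above can be sketched in a few lines of Python. This is an illustrative reading of the text, not the published Code File 1: the function names `sample_decayed` and `learning_pair` are hypothetical, decay is assumed to be applied at every repeat with per-unit probability α_dec ε, and the binomial draw is done by geometric skipping only because the standard library of Python 3.7 has no binomial sampler.

```python
import math
import random


def sample_decayed(n, q, rng):
    """Draw from Binomial(n, q): how many of n units decay.

    Geometric skipping keeps the cost proportional to the number of
    decays rather than to n.
    """
    if n <= 0 or q <= 0.0:
        return 0
    if q >= 1.0:
        return n
    log_keep = math.log1p(-q)
    decayed, position = 0, 0
    while True:
        u = 1.0 - rng.random()  # uniform on (0, 1]
        position += int(math.log(u) / log_keep) + 1  # index of next decaying unit
        if position > n:
            return decayed
        decayed += 1


def learning_pair(t_a, t_b, repeats=100_000, alpha_inc=0.1, alpha_dec=0.1,
                  beta=1.0, gamma=0.0, seed=0):
    """Return (x_A, x_B) after `repeats` cycles of increase and decay."""
    rng = random.Random(seed)
    target = t_a / (t_a + t_b)
    x_a, x_b = 1, 1  # initial values, as in the text
    for _ in range(repeats):
        # Increase process, entered with probability alpha_inc.
        if rng.random() < alpha_inc:
            if rng.random() < gamma:
                p_a = 0.5  # additive increase: A or B at a 1:1 ratio
            else:
                # competitive amplification at (x_A + beta):(x_B + beta)
                p_a = (x_a + beta) / (x_a + x_b + 2.0 * beta)
            if rng.random() < p_a:
                x_a += 1
            else:
                x_b += 1
        # Error-dependent decay (assumption: applied at every repeat).
        total = x_a + x_b
        if total > 0:
            eps = (x_a / total - target) ** 2
            q = alpha_dec * eps
            x_a -= sample_decayed(x_a, q, rng)
            x_b -= sample_decayed(x_b, q, rng)
    return x_a, x_b


if __name__ == "__main__":
    x_a, x_b = learning_pair(1, 2)
    print(x_a, x_b, x_a / max(x_a + x_b, 1))
```

Setting `gamma=1.0` corresponds to the second model (1:1 selection with MSE-dependent decay), and `beta=1e-7` corresponds to the near-zero bias condition; `beta=0` is not supported because the selection ratio becomes 0/0 when both values are zero.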

Hierarchical pairs and approximated MSE

To regulate gene expression, more than two factors must be controlled. When competitive amplification and MSE-dependent decay are applied to four or eight factors in a single list, the value ratios of the eight factors failed to approach the target ratios (Fig. 2a, b). The eight factors can be divided into seven pairs in three layers (Fig. 2c, d). The fraction of each factor in total is calculated as the product of the ratios in all pairs that include the factor. When the values in each pair independently change by the stochastic learning pair process, eight factors successfully approached the target ratio after 10⁵ repeats (Fig. 2c).
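
The bookkeeping of the hierarchical pairs can be illustrated with a short sketch. It assumes a complete binary tree over a power-of-two number of factors with positive targets; the helper names `pair_targets` and `leaf_fractions` are hypothetical and do not come from the published code. The example only shows how eight factors map to seven pairs in three layers and how each factor's fraction is recovered as a product of pair ratios.

```python
def pair_targets(leaf_targets):
    """Build the (T_A, T_B) target of every pair, top layer first."""
    layers = []
    level = list(leaf_targets)
    while len(level) > 1:
        pairs = [(level[i], level[i + 1]) for i in range(0, len(level), 2)]
        layers.append(pairs)
        # The target of a parent branch is the sum of its two children.
        level = [a + b for a, b in pairs]
    layers.reverse()
    return layers


def leaf_fractions(layers):
    """Fraction of each leaf = product of branch ratios from root to leaf."""
    fractions = [1.0]
    for pairs in layers:
        next_fractions = []
        for parent, (a, b) in zip(fractions, pairs):
            total = a + b
            next_fractions.append(parent * a / total)
            next_fractions.append(parent * b / total)
        fractions = next_fractions
    return fractions


layers = pair_targets([1, 2, 3, 4, 5, 6, 7, 8])
print([len(pairs) for pairs in layers])  # pairs per layer, top first
print(leaf_fractions(layers))            # each value equals target / 36
```

In a simulation, the same `leaf_fractions` function would be applied to the current values (x_A, x_B) of each pair rather than to the targets.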

Next, the required accuracy of MSE is tested because accurate detection of errors is difficult in vivo. When the MSE between the current and target ratios is accurately calculated, 64 factors approached the target ratio, which is set as a linear distribution in the range of 1–64 (correlation coefficient between the target and result ratios after 10⁵ repeats, r, was 0.99, Fig. 2e, f). As an approximation of MSE, the calculated MSE is rounded to 10⁻¹, 10⁻², 10⁻³, …, 10⁻ⁱ, where i is a natural number; this approximation is termed the stepwise error. When this approximated error is used, the correlation between the result and target ratios decreased but remained high (r = 0.95, Fig. 2e, f). By setting a maximum value for i that indicates the lower limit of the stepwise error, five additional types of approximated MSE are compared (6-, 5-, 4-, 3-, and 2-step error in Fig. 2e, g). The results indicated that 3-step error was required for learning (median r = 0.89) and that the stepwise error was almost equivalent to 5-step error (median r = 0.95). The approximation of MSE decreased the learning accuracy, but multiple factors in the model using stepwise detection of error approached the target ratio to an acceptable level.
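
A minimal sketch of the stepwise approximation is given below. The text does not state the rounding rule, so this sketch assumes rounding to the nearest power of ten on a logarithmic scale; the function name `stepwise_error`, the `max_step` parameter, and the handling of an exact zero error are likewise assumptions made for illustration.

```python
import math


def stepwise_error(mse, max_step=None):
    """Approximate mse by 10**-i with i = 1, 2, ..., optionally capped.

    max_step=3 gives the 3-step error (0.1, 0.01 or 0.001).
    Assumption: rounding is to the nearest power of ten in log scale.
    """
    if mse <= 0.0:
        # Assumption: zero error maps to the lower limit if one is set.
        return 0.0 if max_step is None else 10.0 ** -max_step
    step = math.floor(-math.log10(mse) + 0.5)
    step = max(step, 1)  # the largest allowed value is 10**-1
    if max_step is not None:
        step = min(step, max_step)  # lower limit of the stepwise error
    return 10.0 ** -step


for value in (0.5, 0.03, 2e-5, 1e-9):
    print(value, stepwise_error(value), stepwise_error(value, max_step=3))
```

With a cap, every small error collapses onto the lower limit, so pairs that are already close to their target keep decaying at that minimum rate.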

Next, the relationship between indexes for making pairs and targets is randomly shuffled. The ratios of each factor after 10⁵ repeats approached the target ratios to an almost equivalent level to that without shuffling (Fig. 2e, h). Furthermore, 4096 = 2¹² factors approached the target that is set to shuffled values ranging from 1 to 4096 (r = 0.97–0.98 with accurate MSE and r = 0.84–0.91 with stepwise error in five tests, Fig. 2i, j).

When the gene expression data from bacteria without antibiotics (GSM2538622 RNA-seq dataset)¹⁵ are used as the target ratio, the ratio of 4096 factors changed from the initial even-distribution to the expression pattern after 10⁵ repeats of stochastic processes with stepwise error (r = 0.98, Fig. 2k). Subsequently, when the target ratios are reset to the gene expression pattern observed in the presence of antibiotics (Fig. 2l)¹⁵, the ratio of 4096 factors changed from the pattern without antibiotics to the new target pattern (Fig. 2m). Thus, bacteria may autonomously produce proper gene expression patterns by reducing the error caused by antibiotics.

Hierarchical clustering of human genes

I next apply the learning hierarchical-pair process to human gene expression. In advance, it is necessary to set the genes that are paired. Six hierarchical clustering analysis methods (Fig. 3a–c), which are Ward, WCO, Single, and three newly-developed methods (AreaSum, CvSum and Cvarea), are applied to a total of 16,921 genes in 20 differently labeled cells from preimplantation human embryos, human embryonic stem cells, and downstream early mesoderm and endoderm progenitors (scRNA-seq datasets E-MTAB-3929, GSM2257302, and GSE75748)^16,17,18. The number of layers in hierarchical pairs generated by the AreaSum method was the smallest (27), whereas that by the Single method was largest (10,796) (Fig. 3c). In the AreaSum method, the area formed by two vectors from the origin is calculated as the distance between two genes, and the total gene expression level is used as the representative value of the cluster (Fig. 3a, b).
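
The AreaSum idea described above can be sketched as follows. The details are assumptions made for illustration: the area is taken as that of the parallelogram spanned by the two expression vectors (the triangle area differs only by a factor of one half), clusters are merged greedily by smallest area, and the function names and the three toy genes are hypothetical. The published Code File 2 may differ.

```python
import math


def area_distance(u, v):
    """Area of the parallelogram spanned by vectors u and v from the origin."""
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    uv = sum(a * b for a, b in zip(u, v))
    # Gram determinant; clamp tiny negative values from rounding.
    return math.sqrt(max(uu * vv - uv * uv, 0.0))


def areasum_tree(vectors):
    """Greedy pairing: merge the two clusters with the smallest area.

    The merged cluster is represented by the sum of its members' vectors.
    Returns a nested tuple of gene names (a binary tree of pairs).
    """
    clusters = [(name, list(vec)) for name, vec in vectors.items()]
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = area_distance(clusters[i][1], clusters[j][1])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        (tree_i, vec_i), (tree_j, vec_j) = clusters[i], clusters[j]
        merged = ((tree_i, tree_j), [a + b for a, b in zip(vec_i, vec_j)])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]


# Toy data: expression of three hypothetical genes in two cells.
toy = {"g1": [1.0, 0.0], "g2": [2.0, 0.0], "g3": [0.0, 1.0]}
tree = areasum_tree(toy)
print(tree)
```

The naive search is cubic in the number of genes, so it is suitable only for small examples and not for the 16,921 genes analyzed in the text.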

As another modification, the probability of entering a process of competitive amplification, α_inc, is set as a variable in the range of 0.001–0.101 depending on the coverage of the pair. This assumption is further modified in another model with an mRNA pool. For test data, another scRNA-seq dataset from human preimplantation embryos (GSE36552) is used¹⁹. The initial and target ratios in each pair are set with the data of a zygote and a cell at 4-cell stage, respectively. The correlation coefficient between the initial and target ratios is a median r = 0.78 (range 0.67–0.84) in 12 tests (Fig. 3d). For each pair, the stochastic processes of competitive amplification and decay using the stepwise-approximated MSE are repeated 10⁵ times.

The learning efficiency was compared among the six different hierarchical-pair architectures. The expression ratio most closely approached the target ratio when hierarchical pairs generated by the AreaSum method are used (r = 0.98, Fig. 3c, d), with an even closer correlation than that shown by another 4-cell scRNA-seq sample (Fig. 3e)¹⁹. In pairs generated by the Single method, the expression ratio did not approach the target ratio (Fig. 3c). These results indicate that the architecture of hierarchical pairs affects the ability to approach the target ratio. In contrast, even when the initial and target patterns are independently shuffled to test non-correlated artificial patterns, the expression ratio approached the target ratio (median r = 0.98, range 0.94–0.99 in six tests) in the hierarchical pairs generated by the AreaSum method (Fig. 3f). Owing to the high adaptability of this learning process, it was difficult to validate the accuracy of gene pairing.

A model with a signal transduction cascade and an mRNA pool

I assume that the hierarchical-pair architecture is a signal transduction cascade to select a gene for transcription in a model with an mRNA pool. Rather than using parameter α_inc, a pair is stochastically selected at each repetition among pairs in the top seven layers depending on the coverage of the pair. In the selected pair, the competitive amplification is performed; branch A or B is selected at a ratio (x_A + β):(x_B + β), where β = 1, and the value of the selected branch, x_A or x_B, increases by one. Additionally, the downstream pair of the selected branch enters the process of competitive amplification until the selected branch is a leaf indicating a single gene. In an mRNA pool, mRNA of the selected gene increases by one, randomly replacing one existing mRNA. Initially, 360,000 mRNAs in the mRNA pool are set based on the initial ratio (zygote). In addition to the mRNA, the expression probability is calculated as the product of the ratios in all pairs including the gene, which is equivalent to the expression ratio in the previous model without an mRNA pool. The ratios of mRNA and expression-probability approached the target ratio (4-cell) after 5 × 10⁵ repeats (r = 0.95–0.97 and r = 0.97–0.99, respectively, in six tests), although genes with 0 to several mRNAs were plotted discretely in mRNA ratios (Fig. 4a). Furthermore, even when decay probability or MSE is approximated to three different values, 0.1, 0.01, and 0.001 (3-step error as in Fig. 2e), similar changes approaching the target ratio were observed with setting 4-cell as the targets (r = 0.94–0.98 in 12 tests, Fig. 4b) and with shuffling the targets (r = 0.92–0.94 in six tests, Fig. 4c).
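
One cascade of the mRNA-pool model can be sketched as below. This is a simplified illustration rather than the published Code File 4: the cascade is assumed to start at the root instead of at a coverage-weighted pair in the top seven layers, decay is omitted, and the function name `transcribe_once`, the toy tree, and the pool size of 10 are hypothetical.

```python
import random


def transcribe_once(tree, pair_values, pool, rng, beta=1.0):
    """Descend from the root to a leaf, then add one mRNA to the pool.

    tree: nested 2-tuples with gene names as leaves.
    pair_values: dict mapping a path (tuple of 0/1) to [x_A, x_B].
    pool: list of gene names with fixed length.
    """
    node, path = tree, ()
    while isinstance(node, tuple):  # an internal node is one pair
        x = pair_values.setdefault(path, [1, 1])
        # competitive amplification at (x_A + beta):(x_B + beta)
        p_a = (x[0] + beta) / (x[0] + x[1] + 2.0 * beta)
        branch = 0 if rng.random() < p_a else 1
        x[branch] += 1
        node, path = node[branch], path + (branch,)
    # The new mRNA replaces a randomly chosen existing one.
    pool[rng.randrange(len(pool))] = node
    return node


# Toy architecture for illustration only.
toy_tree = (("GATA2", "GATA3"), "DAB2")
rng = random.Random(0)
pair_values = {}
pool = ["DAB2"] * 10
for _ in range(100):
    transcribe_once(toy_tree, pair_values, pool, rng)
print(pair_values, pool)
```

Because the pool has a fixed size, every transcription event also removes one mRNA, so the pool composition lags behind the expression probability encoded in `pair_values`.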

Next, to analyze the dynamics in the simulation, the initial and target ratios are set with zygote and blastocyst data. The mRNA ratio gradually approached the target blastocyst pattern over 10⁶ repeats, but not via the patterns of the 2-cell, 8-cell, or morula stages (Fig. 4d). The expression probability more quickly and directly reached near the target ratio within 10⁴ repeats (Fig. 4e), and then the similar correlation levels persisted during 10⁴–10⁶ repeats. The mRNA levels of each gene approached the target level with fluctuations (Fig. 4f). In a simulation, the dynamics of GATA3 were more similar to those of GATA2 than those of DAB2, although the initial and target values of GATA3 and DAB2 are closer than those of GATA2. The higher correlation during stochastic fluctuations is explained by the hierarchical-pair architecture, where GATA2 and GATA3 are paired in the 7th layer from the top, whereas they are separated from DAB2 in the 2nd layer.

For gene regulation during homeostatic state, the bias term β may not be required because MSE or decay rate can be kept low. When both the initial and target ratios are set with the same 4-cell data, the mRNA ratio deviated from the pattern during 5 × 10⁵ repeats in the model with β = 10⁻⁷ and 3-step error (r = 0.66–0.88 in six tests, Fig. 4g). In contrast, the mRNA ratio maintained the set pattern in the model with 4-step error at least for 5 × 10⁵ repeats (r = 0.98–0.99 in six tests), while the correlation between expression probability and the target ratio gradually decreased (r = 0.76–0.87, Fig. 4h). When the initial state is set with a zygote and the target ratio is set with scRNA-seq data of 2-cell stage, which has a highly-similar expression pattern to a zygote (r = 0.94–0.96)¹⁹, the mRNA ratios approached the target ratio, except for one case in six tests (median r = 0.98, range 0.51–0.98, Fig. 4i). However, the change from a zygote to the 4-cell stage was poorly reproducible in the model with β = 10⁻⁷ (median r = 0.88, range 0.85–0.94 in six tests). In the absence of the bias term or white noise, a homeostatic state with a similar expression pattern was maintained while allowing some limited changes in differentiation.

A common model for human gene expression

Based on these findings, I propose that a single model can control whole gene expression during any differentiation processes in human cells and evaluate this in early embryogenesis and hematopoiesis. To generate hierarchical pairs, I collect 13 scRNA-seq datasets from human tissues^16,17,18,20,21,22,23,24,25,26,27,28,29, in which 11,281 gene names were commonly labeled in 11,803 cells. Using the relative expression ratio of these 11,281 genes in each cell, a hierarchical-pair architecture was generated using the AreaSum clustering method. This architecture contained 11,280 pairs in 22 layers (Supplementary Table 1).

The model with an mRNA pool, 4-step approximated error, and this hierarchical-pair architecture is applied to the regulation of 11,281 genes, setting bias β = 10⁻⁷ or 1 depending on the situation. When the initial state of pairs and mRNA pool is set with a zygote scRNA-seq data, and the target ratio is changed in the order of zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages every 5 × 10⁵ repeats, the ratio in the mRNA pool dynamically approached the target ratios until the 4-cell stage in the model with β = 10⁻⁷ (Fig. 5a, b). When the model with β = 1 is applied after 1.5 × 10⁶ repeats, the gene expression patterns sequentially approached the 4-cell, 8-cell, morula, and blastocyst patterns with a correlation coefficient of more than 0.95 at the peaks (Fig. 5c, d).

In hematopoiesis, multi-lymphoid progenitors (MLPs) differentiate into B cells or T cells in peripheral blood mononuclear cells (PBMCs), whereas granulocyte–macrophage progenitors (GMPs) differentiate into myeloid cells³⁰. When the initial state and target ratio are set with a progenitor, the mRNA ratios were maintained during 5 × 10⁵ repeats in the model with β = 10⁻⁷ (r = 0.97–1.0 in six tests, Fig. 5e–g). The expression patterns in progenitors are largely different from those in PBMCs³¹ (median r = 0.29, range 0.15–0.57 in 9 tests, Fig. 5h). When the target ratio is changed to a PBMC pattern, and β is set to 1, the mRNA ratio approached the target ratio during the next 5 × 10⁵ repeats (r = 0.86–0.97), with more rapid adaptation in the expression probability (Fig. 5e, f, i). The mRNA ratio, but not the expression probability, further approached the target ratio during the following 5 × 10⁵ repeats in the model with β = 10⁻⁷ (r = 0.97–0.99, Fig. 5e, f, j). These results demonstrate that the learning hierarchical-pair model using one common architecture can reproduce various differentiations and not-immortal homeostasis by adding bias terms in the former.

Discussion

I propose a principle underlying whole gene regulation within cells, which includes learning ability and a common architecture of gene regulation. The learning ability is implemented as a repeat of two stochastic processes: competitive amplification in a pair and decay depending on MSE between the current and target ratios. The hierarchical structure of the pairs enables multiple factors to reach any target ratio.

In this model, the expression of each gene is regulated by itself, in contrast with conventional GRNs in which each gene is regulated by other genes (Fig. 6a, b). Conventional models require the control of all genes to appropriate expression levels in high-dimensional space. In contrast, in my model, the expression ratio of two functionally-related genes or gene clusters, such as GATA2 vs GATA3, actin-myosin vs microtubule, or mitochondria vs endoplasmic reticulum, is controlled by destabilizing the ratio when inappropriate. The simplicity of self-regulation in each pair is critical to increase the number of regulated genes in modeling, actual evolution, and organizing complex systems.

Importantly, the simple self-regulation system is not uncontrollable but rather efficient at generating appropriate diversity. When n genes can each be expressed at L levels, an astronomically large number of patterns, Lⁿ, may exist. Conventional models set m number of master regulators (m << n) that control cell types and generate L^m states. The number L^m is extremely large and exceeds the number of cell types we can understand, but it covers a negligibly small part of the Lⁿ space. In my model, by using the four decay rates in n − 1 pairs, only 4(n − 1) regulations are sufficient to generate any appropriate pattern. The increase probability of each factor in competitive amplification is autonomously tuned to the correct ratio, x_i/Σx_j. Thus, amplification and stochasticity, which are misunderstood as interfering with strict control at a specific level, are essential for complex systems. I propose that the homeostasis, in which a cell keeps the expression pattern while the contents are metabolized, is not a result of complicated GRNs but a basic operating system shared in living things. This homeostatic system, which I refer to as the law of biological inertia, contains the learning process (Fig. 6c).
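
To give a sense of scale for this counting argument, the following arithmetic uses n = 11,281 (the number of genes in the common architecture) and, purely as an illustrative assumption, only L = 2 expression levels per gene.

\[
L^{n} = 2^{11281} \approx 10^{3396},
\qquad
4(n-1) = 4 \times 11280 = 45120 .
\]

Even with two levels per gene, the pattern space is far too large to enumerate, whereas the number of decay-rate settings in the hierarchical-pair model grows only linearly with n.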

Biological knowledge of gene regulation is consistent with the assumptions in the learning hierarchical-pair model (Table 2). The first assumption, competition, is supported by the epigenomic regulation of transcription¹². An RNA polymerase or transcription factor chooses a binding locus among candidates, depending on the local openness ratio of the chromatin. Notably, the binding candidates are genetically determined, as discussed in the next paragraph. This process would be repeated more than 10⁵ times during one cell-division cycle, estimated from the number of newly-generated mRNAs. The second assumption, amplification, is supported by positive feedback in the epigenomic regulation. The binding of transcription factors opens the chromatin at the locus, using cofactors with histone acetyltransferase activity. The third assumption, additive increase using a bias term, is supported by the fact that all somatic cells have the whole genome. The fourth assumption, decay rates dependent on the error between the current and target ratios, is not clearly described, possibly because the contribution is low under good cell-conditions. Speculatively, when some proteins accumulate unused due to the inappropriate expression ratios, mRNAs for the excess proteins may be specifically cleaved, and the RNA fragments may close the chromatin. Histone deacetylase and DNA methyltransferase close the chromatin structure. Non-coding RNAs, which can be induced by biological stress, degrade a group of mRNAs with a particular sequence. RNA-mediated epigenomic regulations are described in several organisms¹³. Although many studies, especially for the regulation of decay in each pair, are required to demonstrate the molecular basis of the learning process, the assumptions are applicable to cells.

The hierarchical-pair architecture, the fifth assumption, is also supported by findings on topologically associating domains and signal transduction cascades for gene expression (Table 2). Topologically associating domains are conserved gene clusters with similar epigenomic states and high expression-correlations³². The learning pair process becomes more plausible in real cells by assuming that functionally related gene clusters are paired and regulated to the appropriate expression ratio. In the conventional view of signal transduction cascades, multimerization of specific receptors is assumed to deterministically trigger activation of a signal cascade to express a set of genes. However, the cascades and the induced genes vary depending on the cell type, which reflects the current expression and activation state. Active branches in the cascades may be preferentially used, just like the stochastic competitive amplification in the assumption. Further, many signal-transducing proteins are kinases that are activated by phosphorylation at multiple sites. Decay of activation is regulated by phosphatases and polyubiquitin ligases. The architecture of possible signal transduction, which is genetically determined by 3D molecular structure and promoter sequence, should be discriminated from the branch activity, which is regulated epigenetically or post-translationally (Table 2). In my model, the former genetic regulation is set as a common hierarchical-pair architecture conserved in all cells, whereas the latter epigenetic regulation is dynamically controlled following the basic law.

The learning hierarchical-pair model is consistent with the concept evoked by Waddington's epigenetic landscape^10,11 or the free-energy principle⁸, in which the gene expression pattern becomes appropriate as if a cell rolls down a landscape. Accordingly, the landscape itself or the gene-expression pattern under a new condition is not predictable in my model. Instead, the model predicts that cells are functionally robust to perturbations. A testable prediction is that the hierarchical architecture of regulated gene pairs is common in all human cells. Based on this single GRN, biologists will manipulate cells by predicting preferential differentiation or gene expression patterns.

My model differs from gene regulation in vivo in several aspects. First, the parameters are not based on experimental observations. Delay time for feedback regulation with MSE is ignored. Therefore, the time in the simulation does not linearly correlate with the actual time. Second, the bias β or additive noise should be controlled. Bias β_A and β_B may differ for each branch in each pair. Finely regulated bias is equivalent to an additive increase in gene or module activity, in which conventional deterministic regulations can be included. These additive bias terms are transiently and roughly required during differentiation, while the additive noise could be negligibly small or fixed to the correct values in terminally differentiated cells. Third, the calculation of the approximated error values from the target and current ratios is highly simplified. Pairs of crucial genes may be controlled more strictly, whereas many other pairs are controlled less strictly. Using scRNA-seq data as the target ratio, I show that gene expression reaches acceptable patterns for the cell. Fourth, the hierarchical pairs of genes generated by the AreaSum clustering method (Supplementary Table 1) should be revised to a true architecture. There is no evidence that my pairing is correct, because the shuffling of gene pairs did not significantly affect learning efficiency in the model with β = 1. Formally, forming a pair is equivalent to reducing the number of dimensions by one. The molecular biology of gene regulation, big data obtained by RNA-seq, and simulations and clustering using supercomputers would reveal the single correct gene-regulation architecture, which might be as useful as the periodic table of the elements in chemistry.

If the complexity of living organisms requires a template in order to increase, the increase of the template can be formulated as competitive amplification. Death is formulated as error-dependent decay. A tissue composed of numerous cell types regulates the cell ratio through proliferation (competitive amplification) and apoptosis (decay). In the immune system, we previously proposed that regulatory T cells, which are crucial for immune suppression, reduce the decay probability and can be redefined as an indicator of low error¹⁴. Thus, the law of biological inertia provides insights for understanding various complex systems, implying the importance of individual freedom.

Methods

Computation

The simulation is performed using Python 3.7. Four files, containing code for the learning hierarchical-pair model (Code File 1), clustering of genes (Code File 2), the model of human gene expression (Code File 3), and the model with an mRNA pool (Code File 4), are available at GitHub. I perform Monte Carlo simulations in which the stochastic processes of increase and decrease are repeated 10⁴–10⁶ times for each pair, as explained in Table 1 and Figs. 1a and 6. Note that time does not indicate actual time but the number of trials and errors (cycles of increase and decay). In the learning pair model, competitive amplification selects either A or B at a ratio A:B = (x_A + β_A):(x_B + β_B), and the value of the selected factor, x_A or x_B, increases by one; the bias β is 1 unless indicated to be 10⁻⁷. The values then decay at an MSE-dependent probability. In “shuffle”, the factor indexes for each target value and for the location in the hierarchical pairs are randomly shuffled to set randomized target values in the hierarchical pairs. The total expression ratio of each factor is calculated as the product of the ratios in all pairs containing the factor. In the text, assumptions and settings are written in the present tense, whereas simulation results are written in the past tense.
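The cycle of competitive amplification and decay in a single learning pair can be sketched as follows. This is a minimal illustration and not the author's Code File 1: the function name `simulate_pair`, the use of the squared difference between the current and target ratios as the error, and the rule that every unit of x_A and x_B is lost independently with a probability equal to that error are all assumptions made for the example.

```python
import random


def simulate_pair(target_a, cycles=2000, beta=1.0, seed=0):
    """Repeat competitive amplification and error-dependent decay for one pair.

    target_a is the target ratio x_A / (x_A + x_B). Returns (x_a, x_b).
    """
    rng = random.Random(seed)
    x_a, x_b = 0, 0
    for _ in range(cycles):
        # Competitive amplification: choose A or B at (x_A + beta):(x_B + beta)
        # and increase the chosen value by one.
        p_a = (x_a + beta) / (x_a + x_b + 2 * beta)
        if rng.random() < p_a:
            x_a += 1
        else:
            x_b += 1

        # Error between current and target ratios (assumed: squared difference).
        current = x_a / (x_a + x_b)
        error = (current - target_a) ** 2

        # Decay (assumed rule): each unit is lost independently with
        # probability `error`, so a well-matched pair keeps its values.
        x_a = sum(1 for _ in range(x_a) if rng.random() >= error)
        x_b = sum(1 for _ in range(x_b) if rng.random() >= error)
    return x_a, x_b
```

Calling `simulate_pair(0.7)` returns the two values after 2000 cycles; the per-unit decay loop keeps the sketch dependency-free but would be too slow for the 10⁴–10⁶ repetitions used in the paper.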

Approximation of error

The MSE is calculated for each pair as the difference between the current and target ratios. In the stepwise error, the value is expressed in exponential notation with a base of 10, the mantissa is rounded to 1, and only the exponent is used as the level of error and as the decay probability. Accordingly, the stepwise error takes a value \(\in \{10^{-1}, 10^{-2}, 10^{-3}, \ldots, 10^{-i}, \ldots\}\), where i is a natural number. In the 6-, 5-, 4-, 3-, and 2-step errors, the lower limit of the error value is set to 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, and 10⁻², respectively. Whereas the stepwise error can become arbitrarily small, approaching zero, the 6-step error takes one of six values, from 10⁻¹ to 10⁻⁶, and the 2-step error takes one of two values, 10⁻¹ or 10⁻². The code is available in Code File 1.
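The stepwise approximation can be illustrated with a short function. This is a sketch under stated assumptions rather than the code in Code File 1: the name `stepwise_error` and its `steps` parameter are hypothetical, "rounding the mantissa to 1" is interpreted as discarding the mantissa (rounding down to the power of ten), and errors of 10⁻¹ or larger are assumed to be capped at 10⁻¹.

```python
import math


def stepwise_error(error, steps=None):
    """Reduce an error value to a power of ten, 10**-i with i >= 1.

    steps=None gives the unlimited stepwise error; steps=n gives the
    n-step error, whose lower limit is 10**-n.
    """
    if error <= 0:
        # Zero error: the n-step error stays at its lower limit.
        return 0.0 if steps is None else 10.0 ** -steps
    exponent = math.floor(math.log10(error))  # the mantissa is discarded
    exponent = min(exponent, -1)              # assumed cap at 10**-1
    if steps is not None:
        exponent = max(exponent, -steps)      # lower limit of the n-step error
    return 10.0 ** exponent
```

For example, an error of 0.037 = 3.7 × 10⁻² is reduced to 10⁻², and an error of 3 × 10⁻⁹ is raised to 10⁻⁶ under the 6-step error.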

Hierarchical clustering analysis

The hierarchical pairs in the learning hierarchical-pair model indicate groups of genes with similar expression patterns that might be controlled by a particular regulation-module. To generate optimal hierarchical pairs for the model, six hierarchical clustering analysis methods are compared.

Hierarchical clustering analysis repeats the following two calculations until a pair containing all genes is created: (1) pairing the two genes or clusters with the smallest distance, and (2) calculating the distances to the new cluster of genes. The Ward method uses the Euclidean distance. The WCO method uses the cosine distance, which takes a high value when the correlation is low, together with the “weighted” linkage method, that is, the Weighted Pair Group Method with Arithmetic Mean (WPGMA). The Single method uses the Euclidean distance together with the “single” linkage method, which selects the nearest points in the clusters. These three clustering methods are available through scipy.cluster.hierarchy.linkage in the SciPy library for Python. The three new clustering methods, AreaSum, CvSum, and Cvarea, are available in Code File 2. In these three methods, the total expression of genes in a cluster is used as the representative value of the cluster. This is appropriate because pairing in the learning hierarchical-pair model is equivalent to separation into two subgroups. In the AreaSum method, the area between two vectors from the origin to the values of the clusters is used as the distance. A small angle indicates a constant expression ratio among different cells. A large vector size allows genes with high expression to skip many layers in the hierarchical pairs. The two clusters with the smallest distance are paired. In the CvSum method, the total expression level of genes in the pair, including both branches, is summed for every cell, and the variation (cv) of the summed value among cells is used as the distance between two clusters. Family genes with functional substitutability can be paired. In the Cvarea method, the product of the area and the cv is used as the distance between two clusters.
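The AreaSum pairing rule can be sketched as a naive agglomerative loop. The code below is an illustration written for this description, not the implementation in Code File 2: the function names are hypothetical, the "area between two vectors" is assumed to be the area of the parallelogram they span (a triangle area would give the same ordering), and the exhaustive search over all cluster pairs is only practical for small inputs.

```python
import math


def area(u, v):
    """Area of the parallelogram spanned by vectors u and v from the origin."""
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    uv = sum(a * b for a, b in zip(u, v))
    return math.sqrt(max(uu * vv - uv * uv, 0.0))


def area_sum_pairs(expression):
    """Pair genes bottom-up; expression maps gene name -> values per cell.

    Returns a nested tuple in which every tuple is one pair (two branches).
    """
    clusters = [(name, list(values)) for name, values in expression.items()]
    while len(clusters) > 1:
        # Find the two clusters with the smallest area between them.
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda ij: area(clusters[ij[0]][1], clusters[ij[1]][1]),
        )
        (name_i, vec_i), (name_j, vec_j) = clusters[i], clusters[j]
        # The new cluster is represented by the total expression of its genes.
        merged = ((name_i, name_j), [a + b for a, b in zip(vec_i, vec_j)])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]
```

With the made-up input `{"g1": [1, 2], "g2": [2, 4], "g3": [5, 0]}`, g1 and g2 have a constant expression ratio across the two cells (area 0) and are paired first; the resulting cluster is then paired with g3.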

In Fig. 3, hierarchical clustering analyses are applied to the expression of 16,921 genes in 20 cells. To generate the hierarchical-pair architecture used in Figs. 3d, f, and 4, the AreaSum method is applied to this dataset. For the hierarchical-pair architecture used in Fig. 5, the AreaSum method is applied to the expression of 11,281 genes in 11,803 human cells. The gene list and hierarchical cluster are available in Supplementary Table 1.

Learning hierarchical-pair model with an mRNA pool

In the model in Figs. 4 and 5, the hierarchical pairs are assumed to be signal transduction cascades that select a gene in an mRNA pool, similarly to Monte Carlo tree search. At each repetition, a pair is stochastically chosen among the pairs in the top seven layers depending on the coverage of the pair. In the selected pair, competitive amplification is performed: branch A or B is selected at a ratio (x_A + β):(x_B + β), where the bias β is 1 unless indicated to be 10⁻⁷, and the value of the selected branch, x_A or x_B, increases by one. The downstream pair of the selected branch then undergoes competitive amplification in turn, and this continues until the selected branch indicates a single gene. In the mRNA pool, the mRNA of the selected gene increases by one, randomly replacing one existing mRNA. Initially, 360,000 mRNAs in the mRNA pool are set based on the initial ratio, in addition to the ratios in each pair. The values in each pair decrease by MSE-dependent decay every 10 repetitions on average. The expression probability is calculated as the product of the ratios in all pairs that contain the gene. The code is available in Code File 4.
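The descent through the hierarchy and the update of the mRNA pool can be sketched as below. This is a simplified illustration, not Code File 4: the class and function names are hypothetical, every descent is assumed to start from the top pair (the stochastic choice among the top seven layers is omitted), the MSE-dependent decay step is left out, and the expression probability is computed from the β-smoothed selection ratios so that it is defined even when a pair has not yet been visited.

```python
import random


class Pair:
    """One pair in the hierarchy; each branch is another Pair or a gene name."""

    def __init__(self, branch_a, branch_b):
        self.branches = (branch_a, branch_b)
        self.x = [0, 0]  # activation values x_A and x_B


def select_gene(root, rng, beta=1.0):
    """Descend from root, amplifying the chosen branch in every pair visited."""
    node = root
    while isinstance(node, Pair):
        p_a = (node.x[0] + beta) / (node.x[0] + node.x[1] + 2 * beta)
        chosen = 0 if rng.random() < p_a else 1
        node.x[chosen] += 1
        node = node.branches[chosen]
    return node


def expression_probabilities(node, beta=1.0, prob=1.0, out=None):
    """Product of branch ratios along the path to each gene (beta-smoothed)."""
    out = {} if out is None else out
    if not isinstance(node, Pair):
        out[node] = prob
        return out
    total = node.x[0] + node.x[1] + 2 * beta
    for k in (0, 1):
        ratio = (node.x[k] + beta) / total
        expression_probabilities(node.branches[k], beta, prob * ratio, out)
    return out


def run(root, pool, repetitions, seed=0):
    """Each repetition selects one gene and overwrites one random mRNA in the pool."""
    rng = random.Random(seed)
    for _ in range(repetitions):
        gene = select_gene(root, rng)
        pool[rng.randrange(len(pool))] = gene
    return pool
```

Because the two branch ratios in every pair sum to one, the probabilities returned by `expression_probabilities` sum to one over all genes, and the pool size stays constant because each new mRNA replaces an existing one.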

Resource datasets

For bacterial genes (Fig. 2), RNA-seq data from Escherichia coli (BWk3) (GSE96706) are used¹⁵. Among the 4296 genes, the 4096 genes with expression values greater than 2^1.9 under either culture condition in the dataset are selected. The genes are arranged into a hierarchical-pair architecture according to their order within the genome. GSM2538622 (1A), GSM2538631 (10A), and GSM2538649 (27A) are used as data without antibiotics, with kanamycin, and with ciprofloxacin, respectively.
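Since 4096 = 2¹², pairing neighbouring genes repeatedly yields a hierarchy with 12 layers. The sketch below shows one way to do this; it assumes that "using the order within the genome" means pairing adjacent entries layer by layer, which the text does not state explicitly, and the function names and the two-condition filter signature are hypothetical.

```python
def filter_genes(expr_a, expr_b, threshold=2 ** 1.9):
    """Keep genes, in their given order, whose value exceeds the threshold
    under at least one of two culture conditions (dicts gene -> value)."""
    return [g for g in expr_a
            if expr_a[g] > threshold or expr_b.get(g, 0.0) > threshold]


def pair_in_order(genes):
    """Pair neighbours repeatedly until a single top pair remains.

    Returns the nested pairs and the number of layers.
    """
    layer = list(genes)
    depth = 0
    while len(layer) > 1:
        nxt = [(layer[k], layer[k + 1]) for k in range(0, len(layer) - 1, 2)]
        if len(layer) % 2:  # an unpaired last item moves up unchanged
            nxt.append(layer[-1])
        layer = nxt
        depth += 1
    return layer[0], depth
```

For a list of four genes in genome order, `pair_in_order(["a", "b", "c", "d"])` gives the pairs `(("a", "b"), ("c", "d"))` in two layers.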

For human early embryogenesis (Figs. 3 and 4), 20 differently-labeled cells are selected from three scRNA-seq datasets (E-MTAB-3929, GSM2257302, and GSE75748) for gene expression in preimplantation embryos, in vitro cultured embryonic stem cells, and the downstream early mesoderm and endoderm progenitors^16,17,18. 'E3.1.443', 'E4.1.1', 'E5.1.26', 'E6.1.72', and 'E7.2.138' are selected from E-MTAB-3929. 'APS.p1c1r2', 'D2_25somitomere.p9c1r1', 'DLL1PXM.p8c1r1', 'Earlysomite.p10c2r8', 'H7hESC.p7c1r4', 'LatM.p3c1r1', 'MPS3.p5c1r1', 'Sclerotome.p2c1r1', and 'cDM.p4c1r1' are selected from GSM2257302. 'H9.00hb4s_001', 'H9.12h_001', 'H9.24h_013', 'H9.36h_001', 'H9.72h_001', and 'H9.96h_001' are selected from GSE75748. Hierarchical clustering analyses are applied to the 16,921 genes expressed at more than 10 TPM in at least one of these cells. For the test data, another scRNA-seq dataset of human preimplantation embryos is used (GSE36552) after merging the 16,921 genes with the gene names and assigning 0 as the expression level of non-annotated genes¹⁹. GSM896806, GSM896809, GSM922146, GSM922158, GSM922178, and GSM922194 are used for scRNA-seq data of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages, respectively. In Fig. 3c, three zygote datasets and twelve 4-cell datasets are used.
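The two preprocessing steps described here, selecting genes above 10 TPM and aligning a test cell to the selected gene list with zeros for missing genes, can be sketched as follows. The function names and the dictionary-based data layout are assumptions made for the example; the values in the usage note are made up.

```python
def genes_above(cells, min_tpm=10.0):
    """cells: list of dicts gene -> TPM.

    Keep genes expressed above min_tpm in at least one cell.
    """
    kept = set()
    for cell in cells:
        kept.update(g for g, tpm in cell.items() if tpm > min_tpm)
    return sorted(kept)


def align_to_genes(cell, gene_list):
    """Expression vector in gene_list order; genes missing from the cell get 0."""
    return [cell.get(g, 0.0) for g in gene_list]
```

For two toy cells `{"A": 50.0, "B": 3.0}` and `{"B": 12.0, "C": 1.0}`, genes A and B are kept, and a test cell that only reports gene A is aligned to `[value_of_A, 0.0]`.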

To generate a common hierarchical-pair architecture in Fig. 5 and Supplementary Table 1, 13 scRNA-seq datasets of human tissues are used: 515 peripheral blood cells (GSE97531)²⁰, 836 hematopoietic stem and progenitor cells in the bone marrow, spleen, and peripheral blood (GSE143567)²¹, 1567 trophoblast and stromal cells from the placenta (GSE89497)²⁸, 559 cardiomyocytes (GSE95140_human)²⁴, 2148 endometrium cells from the uterus (GSE111976)²⁶, 766 renal cells from kidney biopsy (GSE160048_human)²², 91 fallopian tube epithelial cells (GSE132149_sc16)²⁷, 2036 retina cells from the eyes (GSE133707_P1)²³, 134 primordial germ cells from a female embryo at 10 weeks of gestation (GSM2295850) and from a male embryo at 25 weeks of gestation (GSM2306040)²⁹, 372 in vitro-cultured primary myoblasts (GSE52529)²⁵, 498 in vitro-cultured embryonic stem cells and early mesoderm progenitors (GSM2257302)¹⁷, 758 in vitro-cultured embryonic stem cells and endoderm progenitors (GSE75748_sc_time_course_ec)¹⁸, and 1529 cells from early preimplantation embryos (E-MTAB-3929)¹⁶. Gene names are used, if available, to integrate multiple datasets. If not available, the gene name is determined as ‘symbol’ using the MyGene.py Python package. The corresponding code is included as comments in Code File 2. For the 11,281 genes successfully annotated in all 13 datasets (11,803 cells), gene expression ratios are recalculated by normalizing the total expression of the 11,281 genes to 1,000,000. The gene list and hierarchical clustering are available in Supplementary Table 1.
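The integration step, keeping only the genes annotated in every dataset and rescaling each cell so that these genes sum to 1,000,000, can be sketched as below. The function names and the data layout (each dataset as a list of per-cell dictionaries) are assumptions for illustration, and the gene-name lookup with MyGene.py is not reproduced here.

```python
def shared_genes(datasets):
    """Genes annotated in every dataset.

    Each dataset is a list of cells, and each cell is a dict gene -> value.
    """
    common = None
    for cells in datasets:
        names = set().union(*(cell.keys() for cell in cells))
        common = names if common is None else common & names
    return sorted(common or [])


def normalize(cell, genes, total=1_000_000):
    """Rescale a cell so that the expression of `genes` sums to `total`."""
    s = sum(cell.get(g, 0.0) for g in genes)
    if s == 0:
        return {g: 0.0 for g in genes}
    return {g: cell.get(g, 0.0) * total / s for g in genes}
```

Genes outside the shared list are dropped before rescaling, so the normalized values are ratios within the shared gene set rather than within the full transcriptome.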

For the test data in Fig. 5, 39 single-cell datasets of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages in the preimplantation human embryo GSE36552¹⁹, 9 datasets of hematopoietic progenitors, including GMP, MLP, and lymphoid-primed multi-potential progenitors in the human cord blood of normal donors from GSE100618³⁰, and 13 PBMCs from normal donors from GSE161901³¹ are collected. Gene expression ratios are recalculated by normalizing the total expression of 11,281 genes to 1,000,000. GSM896806, GSM896809, GSM922146, GSM922158, GSM922178, and GSM922194 are used for scRNA-seq data of the zygote, 2-cell, 4-cell, 8-cell, morula, and blastocyst stages, respectively¹⁹. For PBMCs, cell types are determined based on the high expression of CD19 and IGHM (immunoglobulin heavy constant mu) for B lymphocytes, TRBC2 (T cell receptor beta constant 2) for T lymphocytes, and CD33 for myeloid cells. Among the test data, GSM2689351 (P5_E5_MLP) is used for MLP, GSM2689085 (P5_G11_GMP) for GMP, GSM4916527 (NormalDonor1_untreated_PBMC_027) for a B cell, GSM4916502 (NormalDonor1_untreated_PBMC_002) for a T cell, and GSM4916594 (NormalDonor1_untreated_PBMC_094) for a myeloid cell^30,31. In the text, GSM2689298 (P3_F5_MLP) and GSM2689390 (P6_G6_MLP) are used for MLPs, GSM2689057 (P4_G9_GMP) and GSM2689102 (P6_C9_GMP) for GMP, GSM4916525 (NormalDonor1_untreated_PBMC_025) and GSM4916536 (NormalDonor1_untreated_PBMC_036) for B cells, and GSM4916573 (NormalDonor1_untreated_PBMC_073) and GSM4916783 (NormalDonor4_untreated_PBMC_001) for T cells, and GSM4916557 (NormalDonor1_untreated_PBMC_057) and GSM4916559 (NormalDonor1_untreated_PBMC_059) for myeloid cells^30,31.

Statistical analysis

A paired t-test was applied to compare the results for the 12 target ratios in Fig. 3c.
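For reference, the paired t statistic is the mean of the pairwise differences divided by its standard error, with n − 1 degrees of freedom (11 for 12 paired target ratios). A minimal sketch, with a hypothetical function name and no claim about how the published analysis was coded:

```python
import math
import statistics


def paired_t(sample_1, sample_2):
    """Paired t statistic and degrees of freedom for two matched samples."""
    diffs = [b - a for a, b in zip(sample_1, sample_2)]
    n = len(diffs)
    standard_error = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.mean(diffs) / standard_error, n - 1
```

The p-value then follows from the t distribution with the returned degrees of freedom.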

Data availability

All relevant data supporting the key findings of this study are available within the article and its Supplementary information files. Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Tomoyuki YAMAGUCHI (t.yamaguchi@tokushukai.jp).

Code availability

The code generated in this study is provided as a supplementary file and is available at GitHub (https://github.com/tyamaguc-tky/Learning_pair).

References

1. Meyer, P. & Saez-Rodriguez, J. Advances in systems biology modeling: 10 years of crowdsourcing DREAM challenges. Cell Syst. 12, 636–653. https://doi.org/10.1016/j.cels.2021.05.015 (2021).
2. Karlebach, G. & Shamir, R. Modelling and analysis of gene regulatory networks. Nat. Rev. Mol. Cell Biol. 9, 770–780. https://doi.org/10.1038/nrm2503 (2008).
3. Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101–108. https://doi.org/10.1038/nature11233 (2012).
4. Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518, 529–533. https://doi.org/10.1038/nature14236 (2015).
5. Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484–489. https://doi.org/10.1038/nature16961 (2016).
6. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444. https://doi.org/10.1038/nature14539 (2015).
7. Pezzulo, G. & Levin, M. Top-down models in biology: explanation and control of complex living systems above the molecular level. J. R. Soc. Interface https://doi.org/10.1098/rsif.2016.0555 (2016).
8. Friston, K. The free-energy principle: A unified brain theory?. Nat. Rev. Neurosci. 11, 127–138. https://doi.org/10.1038/nrn2787 (2010).
9. Himeoka, Y. & Kaneko, K. Epigenetic Ratchet: Spontaneous adaptation via stochastic gene expression. Sci. Rep. 10, 459. https://doi.org/10.1038/s41598-019-57372-0 (2020).
10. Waddington, C. The Strategy of the Genes. (Allen & Unwin, 1957).
11. Wang, J., Zhang, K., Xu, L. & Wang, E. Quantifying the Waddington landscape and biological paths for development and differentiation. Proc. Natl Acad. Sci. USA 108, 8257–8262. https://doi.org/10.1073/pnas.1017017108 (2011).
12. Jaenisch, R. & Bird, A. Epigenetic regulation of gene expression: How the genome integrates intrinsic and environmental signals. Nat. Genet. 33(Suppl), 245–254. https://doi.org/10.1038/ng1089 (2003).
13. Holoch, D. & Moazed, D. RNA-mediated epigenetic regulation of gene expression. Nat. Rev. Genet. 16, 71–84. https://doi.org/10.1038/nrg3863 (2015).
14. Yamaguchi, T. et al. Theoretical modeling reveals that regulatory T cells increase T-cell interaction with antigen-presenting cells for stable immune tolerance. Int. Immunol. 31, 743–753. https://doi.org/10.1093/intimm/dxz043 (2019).
15. Lázár, V. et al. Antibiotic-resistant bacteria show widespread collateral sensitivity to antimicrobial peptides. Nat. Microbiol. 3, 718–731. https://doi.org/10.1038/s41564-018-0164-0 (2018).
16. Petropoulos, S. et al. Single-cell RNA-Seq reveals lineage and X chromosome dynamics in human preimplantation embryos. Cell 165, 1012–1026. https://doi.org/10.1016/j.cell.2016.03.023 (2016).
17. Loh, K. M. et al. Mapping the pairwise choices leading from pluripotency to human bone, heart, and other mesoderm cell types. Cell 166, 451–467. https://doi.org/10.1016/j.cell.2016.06.011 (2016).
18. Chu, L. F. et al. Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm. Genome Biol. 17, 173. https://doi.org/10.1186/s13059-016-1033-x (2016).
19. Yan, L. et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat. Struct. Mol. Biol. 20, 1131–1139. https://doi.org/10.1038/nsmb.2660 (2013).
20. Parker, M. M. et al. RNA sequencing identifies novel non-coding RNA and exon-specific effects associated with cigarette smoking. BMC Med. Genomics 10, 58. https://doi.org/10.1186/s12920-017-0295-9 (2017).
21. Mende, N. et al. Quantitative and molecular differences distinguish adult human medullary and extramedullary haematopoietic stem and progenitor cell landscapes. bioRxiv. https://doi.org/10.1101/2020.01.26.919753 (2020).
22. He, B. et al. Single-cell RNA sequencing reveals the mesangial identity and species diversity of glomerular cell transcriptomes. Nat. Commun. 12, 2141. https://doi.org/10.1038/s41467-019-12917-9 (2021).
23. Liang, Q. et al. Single-nuclei RNA-seq on human retinal tissue provides improved transcriptome profiling. Nat. Commun. 10, 5743. https://doi.org/10.1038/s41467-018-06639-7 (2019).
24. Nomura, S. et al. Cardiomyocyte gene programs encoding morphological and functional signatures in cardiac hypertrophy and failure. Nat. Commun. 9, 4435. https://doi.org/10.1038/s41467-018-06639-7 (2018).
25. Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386. https://doi.org/10.1038/nbt.2859 (2014).
26. Wang, W. et al. Single-cell transcriptomic atlas of the human endometrium during the menstrual cycle. Nat. Med. 26, 1644–1653. https://doi.org/10.1038/s41591-020-1040-z (2020).
27. Hu, Z. et al. The repertoire of serous ovarian cancer non-genetic heterogeneity revealed by single-cell sequencing of normal fallopian tube epithelial cells. Cancer Cell 37, 226–242.e7. https://doi.org/10.1016/j.ccell.2020.01.003 (2020).
28. Liu, Y. et al. Single-cell RNA-seq reveals the diversity of trophoblast subtypes and patterns of differentiation in the human placenta. Cell Res. 28, 819–832. https://doi.org/10.1038/s41422-018-0066-y (2018).
29. Li, L. et al. Single-cell RNA-Seq analysis maps development of human germline cells and gonadal niche interactions. Cell Stem Cell 20, 858–873.e4. https://doi.org/10.1016/j.stem.2017.03.007 (2017).
30. Karamitros, D. et al. Single-cell analysis reveals the continuum of human lympho-myeloid progenitor cells. Nat. Immunol. 19, 85–97. https://doi.org/10.1038/s41590-017-0001-2 (2018).
31. Anand, P. et al. Single cell RNA-seq reveals developmental plasticity with coexisting oncogenic and immune evasion programs in ETP-ALL. Blood 137, 2463–2480. https://doi.org/10.1182/blood.2019004547 (2021).
32. Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385. https://doi.org/10.1038/nature11049 (2012).

Acknowledgements

I wish to thank T. Shinagawa, H. Nakamoto, and Y. Yamaguchi for critically reading the manuscript, and Editage (www.editage.com) for English language editing. This work was financially supported by the Tokushukai Medical Group.

Author information

Authors and Affiliations

Research Institute, Nozaki Tokushukai Hospital, Daito City, Osaka, 574-0074, Japan
Tomoyuki Yamaguchi

Authors

Tomoyuki Yamaguchi

Contributions

Conceptualization, T.Y.; Investigation, T.Y.; Writing, T.Y.

Corresponding author

Correspondence to Tomoyuki Yamaguchi.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yamaguchi, T. Learning processes in hierarchical pairs regulate entire gene expression in cells. Sci Rep 12, 7549 (2022). https://doi.org/10.1038/s41598-022-10998-z

Received: 03 December 2021
Accepted: 18 April 2022
Published: 09 May 2022
DOI: https://doi.org/10.1038/s41598-022-10998-z
