Abstract
Although ubiquitous, interactions in groups of individuals are not yet thoroughly studied. Frequently, single groups are modeled as criticalmass dynamics, which is a widespread concept used not only by academics but also by politicians and the media. However, less explored questions are how a collection of groups will behave and how their intersection might change the dynamics. Here, we formulate this process as binarystate dynamics on hypergraphs. We showed that our model has a rich behavior beyond discontinuous transitions. Notably, we have multistability and intermittency. We demonstrated that this phenomenology could be associated with community structures, where we might have multistability or intermittency by controlling the number or size of bridges between communities. Furthermore, we provided evidence that the observed transitions are hybrid. Our findings open new paths for research, ranging from physics, on the formal calculation of quantities of interest, to social sciences, where new experiments can be designed.
Introduction
How individuals interact in groups has motivated research in many different areas ranging from sociology and political sciences^{1,2,3,4,5} to physics and mathematics^{6,7,8,9,10,11,12,13,14,15,16,17,18,19,20}. From a sociological viewpoint, the interest frequently lies in the role played by committed minorities. One of the main questions is when and how this committed group of individuals can overturn a given consensus. Implicitly, we are assuming that the interaction between groups of individuals follows a criticalmass dynamics. Despite the informal use of the term criticalmass by politicians, the media, and even academics, there is evidence that individuals might behave in this way when changing social conventions. This evidence ranges from theoretical models^{6,7,8,9} and observational studies^{1,3,4,21,22}, to real experimental approaches^{5}. Although these studies suggest that the criticalmass threshold might range between 10% and 40%, there is evidence that it can be low as 0.3% in linguistic norm changes in English and Spanish^{23,24} or even just a few of individuals that are not comparable with the size of the population under study^{24,25}. Despite this wide range of observed thresholds, the criticalmass paradigm provides a reasonable abstraction to analyze and understand real social systems. Thus, from an analytical approach, we begin with the premise that the criticalmass dynamics is a reasonable assumption about how a group of people acts. So, the natural questions that emerge are: (1) How will a collection of groups behave? (2) How might the intersection between these groups change the global dynamics? (3) Can smaller groups have a higher criticalmass threshold than the whole population? Note that, as we allow for a collection of criticalmass dynamics, their intersections might be able to generate a cascade of events. In other words, by inducing change at a small scale, it might be possible to reach the threshold of other groups, therefore triggering global changes.
Recently, some of us proposed a formal model able to provide insights about these questions^{12}. In this model, society is modeled as a hypergraph, where individuals are nodes, and the group interactions of arbitrary sizes are encoded as hyperedges. The model presents discontinuous transitions, bistability, and hysteresis, thus, suggesting that interactions in groups might be the driver for such phenomenology, hence, already providing some initial answers and insights about question (1). In practice, the model suggests that some intermediate levels of activation are not reachable as the activation of groups might be able to trigger a larger scale cascade. Regarding questions (2) and (3), the model provides a theoretical foundation for, and a phenomenological explanation to, the seemingly different experimental findings of expected criticalmass thresholds. More specifically, ethnographic studies show a critical mass around 25–30%^{1} and align with experimental results, which report a critical mass around 25%^{5}. On the other hand, considering linguistic norm changes, the observed threshold is as low as 0.3% in English and Spanish^{23}. The first studies consider a single group, while the linguistic norm changes consider a whole population, which can be understood as a collection of groups. Thus, in the latter, we might have groups with different sizes, each with a different threshold. In other words, from the perspective of the model in ref. ^{12}, it is possible to have individual groups with thresholds between 25% and 40%, and, at the same time, due to the group intersections, having a critical mass at the population level around a much lower value. A second possible explanation is bistability, which enables two possible solutions for the same set of parameters. For example, the system might be operating in a region where both solutions are larger than zero and stable.
Here, through a dynamical analysis of the social contagion model presented in ref. ^{12}, we show that the richness of this model is not constrained to discontinuous transitions and hysteresis. First, by evaluating a real hypergraph, we show that social contagion in hypergraphs can display a bimodal distribution of the number of active nodes, leading to multistability or intermittency in time. We also observe that at the transitions between branches, the susceptibility diverges. In the rest of the paper, we dedicate our efforts to give theoretical support to these findings and explain the mechanisms that might trigger them. We demonstrate that these features could be linked to the community structure in the hypergraph and we show that bridges between communities play a crucial role. Here, we define bridges as hyperedges that are composed by nodes belonging to different communities. Our second main result concerns the nature of the observed transitions. As we have multiple stable branches, due to the mentioned multistability, we might also have multiple transitions. Despite the expected discontinuities (see refs. ^{10,12,13,14}), we show that these transitions display features of hybrid transitions, that is, they display discontinuities and scaling behaviors for the order parameter and susceptibility.
The paper is organized as follows: in section “Model definition and theoretical analysis,” we discuss the theoretical basis of the model presented in ref. ^{12}, including its analytical and numerical aspects. In section “Example of realworld hypergraphs: the case of blues reviews,” we present the numerical simulations we performed on a real hypergraph, which show evidence of multistability, intermittent behavior, and hybrid phase transitions. In the following sections, we focus on explaining our findings. In section “Multistability and intermittent behavior,” we show that our firstorder approximation predicts multistability. Next, using an artificial model, we relate multiple stable branches and intermittency to community structures. We also show how bridge hyperedges modulate the transition from multistability to intermittency. In section “Analysis of the transition between stable branches,” we focus on a hypergraph with special symmetries, which allow us to derive exact equations for the dynamics and perform a finite size analysis, providing a strong argument for the presence of hybrid phase transitions in our model. Finally, in section “Discussion,” we discuss our findings in more general terms, provide the conclusion, and show some of the perspectives opened by our work.
Results
Model definition and theoretical analysis
A hypergraph, \({{{{{{{\mathcal{H}}}}}}}}\), is defined as a set of nodes, \({{{{{{{\mathcal{V}}}}}}}}=\{{v}_{i}\}\) and a set of hyperedges \({{{{{{{\mathcal{E}}}}}}}}=\{{e}_{j}\}\), where e_{j} is a subset of \({{{{{{{\mathcal{V}}}}}}}}\) with arbitrary cardinality ∣e_{j}∣. The number of nodes is defined as \(N={{{{{{{\mathcal{V}}}}}}}}\). It is also convenient to define \({{{{{{{{\mathcal{E}}}}}}}}}_{i}\) as the set of hyperedges that contain the node v_{i}. If \(\max ({e}_{j})=2\) we recover a graph. If for each hyperedge with ∣e_{j}∣ > 2 its subsets are also contained in \({{{{{{{\mathcal{E}}}}}}}}\), we recover a simplicial complex. Figure 1a shows an example of a hypergraph. Moreover, the adjacency matrix^{13,26} can be defined as
for i ≠ k, and A_{ii} = 0 for all i. Note that it can be interpreted as a weighted projected graph. Here we will adopt this matrix for visualization purposes, but it has previously been used to study the spectra of hypergraphs^{26} and linked to the stability of dynamical processes^{13}.
Our dynamics are defined through the activation and deactivation of nodes. In an arbitrary hypergraph, we associate a Bernoulli random variable Y_{i} to each individual v_{i} indicating whether the node v_{i} is active (Y_{i} = 1) or not (Y_{i} = 0). For each active node, we associate a deactivation mechanism, modeled as a Poisson process with parameter δ, \({N}_{i}^{\delta }\). For each hyperedge, e_{j}, we define a random variable \({T}_{j}={\sum }_{k:{v}_{k}\in {e}_{j}}{Y}_{k}\), which is the number of active nodes in the hyperedge. See the tables next to each hyperedge in Fig. 1a for a graphical representation of all the possible microstates and the T_{j} variables. If T_{j} is equal or above a given threshold, Θ_{j}, we associate a Poisson process with parameter λ_{j}, \({N}_{j}^{{\lambda }_{j}}\). We point out that the random variables defined above depend on t, for any t ≥ 0, but we remove t from our notation for the sake of simplicity. Formally, our dynamics can be written as a continuoustime Markov chain \({({Y}_{t})}_{t\ge 0}\), with state space \({\{0,1\}}^{{{{{{{{\mathcal{V}}}}}}}}}\). That is, for any t ≥ 0, Y_{t} is a random function from \({{{{{{{\mathcal{V}}}}}}}}\) into {0, 1}, which associates to each node v_{i} the Bernoulli random variable Y_{i}. Moreover, the states of nodes change according to the following transitions and rates:
where \({{\mathbb{1}}}_{\{{{{{{{{\rm{condition}}}}}}}}\}}\) is the indicator function, which is one if the “condition” is satisfied, and zero otherwise.
In other words, the group dynamics is given by a threshold process that becomes active only above a critical mass of activated nodes. When above the threshold, for a given hyperedge e_{j}, the Poisson process \({N}_{j}^{{\lambda }_{j}}\) induces that all the nodes inside this hyperedge become activate. So, given an hyperedge e_{j}, after the threshold is hit (T_{j} ≥ Θ_{j}) and a random time exponentially distributed with parameter λ_{j} passes (as a consequence of the Poisson process \({N}_{j}^{{\lambda }_{j}}\)), all the the inactive vertices become activate simultaneously. If enough nodes are deactivated before the time associated with the process passes, the process is removed. Moreover, if ∣e_{j}∣ = 2, we consider that the Poisson processes are directed. This definition allows for recovering traditional SIS contagion models. Figure 1b shows an example of the graphical representation for our process.
For simplicity, we assume that λ_{j} = f(∣e_{j}∣), where f is an arbitrary function of the cardinality of the hyperedge. It is also convenient to define Θ_{j} = ⌈Θ^{*}∣e_{j}∣⌉, where ⌈x⌉ is the ceiling function, which returns the least integer greater than or equal to x and Θ^{*} is a global parameter that is invariant to the cardinality of the hyperedges and lies in the range 0 ≤ Θ ^{*} ≤ 1. Note that, if we had defined Θ_{j} = ∣e_{j}∣ − 1 we would have recovered a model similar to the one proposed in ref. ^{10}. It would be the same if \({{{{{{{\mathcal{H}}}}}}}}\) is constrained to a simplicial complex. For more on this relationship, we refer to refs. ^{12}^{,}^{14}. The exact equation that describes our model can be formally written as
where the first summation is over all hyperedges containing v_{i} and the second summation is over all the possible dynamical microstates inside the hyperedge e_{j}, denoted by the set B_{j}. Furthermore, \({{\mathbb{1}}}_{\{{Y}_{i}=0,{T}_{j}\ge {{{\Theta }}}_{j}\}}\) is an indicator function that is 1 if Y_{i} = 0 and the criticalmass in the hyperedge is reached, and 0 otherwise. Moreover, we assumed that the spreading rate is composed by the product of a free parameter and a function of the cardinality, i.e., λ_{j} = λ × λ^{*}(∣e_{j}∣). In all of our numerical simulations we assumed \({\lambda }^{*}({e}_{j})={\log }_{2}({e}_{j})\). This definition is convenient as, in the pairwise case, λ^{*}(2) = 1, guaranteeing that our dynamics reduces to the standard SIS model in a graph. Also, we choose \({\log }_{2}({e}_{j})\) as it grows sublinearly. So, in the limit of a large hyperedge, the average spreading rate tends to zero, i.e., \(\mathop{\lim }\nolimits_{{e}_{j}\to \infty }\frac{{\log }_{2}({e}_{j})}{{e}_{j}}=0\).
Example of realworld hypergraphs: the case of blues reviews
In this section, we present evidence that the behavior of real hypergraphs goes beyond the already surprising discontinuous transitions and bistability found in hypergraph and simplicial contagion models^{10,12}. Indeed, we found that in many regimes our model presents multiple stable solutions and regions of intermittent behavior, where we have an alternating dynamics of high and low activity. We divide our study in two parts: we begin with a macrostate analysis, and then move to a microstate evaluation. This approach allows us to formulate some hypotheses regarding the mechanisms behind the observed phenomenology.
We first present evidence of multistability and intermittency in a real system. We do this by analyzing the dynamics of our model on the blues reviews hypergraph, where nodes are Amazon reviewers, and hyperedges are groups of reviewers who reviewed a certain type of blues music within a month^{27}. This dataset is available at ref. ^{28}. This hypergraph has N = 1106 nodes and 694 hyperedges, whose maximum cardinality is \(\max ({e}_{j})=83\). In this dataset, the pairwise interactions are sparse, which alone would form a giant component of only 24 nodes. However, by accounting all the hyperedges, the giant component of the hypergraph has N = 1106 nodes. We remark that repeated hyperedges were not allowed. Moreover, for a structural analysis of this hypergraph, we refer to the Supplementary Information (SI).
Figure 2 shows the QS Monte Carlo simulations (see the “Methods” section for more details about this method) for our social contagion model in the blues reviews hypergraph and in a randomly rewired version obtained from the exact version of the vertexlabeled hypergraph configuration model presented in ref. ^{29} (Algorithm 2 in ref. ^{29} and code from ref. ^{30}) after 10^{7} rewirings. Moreover, in the SI we present 30 additional Monte Carlo simulations for different randomizations of the blues reviews hypergraph, showing that they have a similar behavior. In a–c, we present the order parameter (average fraction of active individuals), peaks of the state distributions, and the susceptibility, respectively (see the “Methods” section for the formal definition of these quantities). In the remaining subplots, we present examples of state distributions for the points marked in (a). Observe that only one solution was found in the randomized version of the hypergraph, and it has a single discontinuous transition. This behavior contrasts with the real case, where multiple stable solutions, and multiple transitions between branches, were found. The comparison between the real case and the rewired version suggests that correlations play a significant role in the emergence of multistability and intermittency.
Considering the real scenario, we notice that our process recurrently presents a bimodal distribution of states, where the probability of the modes change as we increase or decrease λ. For Branch I, light red curves in Fig. 2, we notice that by increasing λ, we have a discontinuous transition. The distribution of states for this region is shown in Fig. 2d, where λ = 0.036 is used as an example. This distribution is bimodal, where the first mode is near the absorbing state (near n ≈ 1 as we are using the QS method), and the second mode has an average n ≈ 250. For lower values of λ, a similar picture is observed, but the probability of the first mode is higher than for the second. For higher values of λ, the opposite happens. This pattern is reproduced until we have a single mode with a bellshaped distribution, similar to Fig. 2i. Next, as we increase λ, depending on the initial conditions and stochastic fluctuations the order parameter can jump (Fig. 2e, f). In this case, we again observe a bimodal distribution. In Fig. 2e, f, we show the state distribution for λ = 0.0816 and λ = 0.0848, respectively. Note that the mechanism that causes the bimodality in (d) is different from (e) and (f). In the first case, the bimodality appears as a consequence of the absorbing state, and it is similar to what is observed in an SIS process in a network. Note that for an SIS process in a network the second mode would be closer to the absorbing state and would increase continuously, originating a secondorder phase transition. In the second case, the different modes are related to intermittent behavior, where the process oscillates between high and low activity regimes, as can be seen in the insets of these figures.
We also observed similar intermittent behavior in Branch III (Fig. 2a, g, h). Although branches I and III display intermittency, in the first branch, this implies a discontinuity in the susceptibility, while in the second, it generates a continuous peak of susceptibility, as can be observed in Fig. 2c. This peak of susceptibility is related to the time the system spends in the high or low activity regimes. In other words, the relative time the system spends in each mode changes the variance and, therefore, the susceptibility. Note that, in an SIS process in networks, similar susceptibility bumps are related to localization features of the network. For instance, in a network with communities, we could find a similar pattern. In this case, the bumps would suggest that the process manages to reach a community or a group of nodes^{31}. Here, we use the term localization to denote a state where most of the probability of activation may be found within a constrained region, i.e., a subset of nodes. Note that, in graphs, we are usually interested in the localization properties at the transition, which can be quantified by the inverse participation ratio^{32}. In our case, for simplicity, we are extending the word localization to characterize the supercritical regime.
We remark that the absorbing state is always accessible. For the initial condition ρ = 0.1, all the simulations fall in the absorbing state. This solution was not presented in Fig. 2 because it is trivial, and the susceptibility is noisy, possibly confusing the interpretation.
As we have intermittency and bimodal distributions, the order parameter alone might not be enough to fully describe our dynamical behavior. To better understand the behavior of our model, we also show the position of the peaks of the state distributions. These peaks represent the states in which the system is “locally more likely to be”. For the blues reviews hypergraph, these peaks are reported in Fig. 2b. Despite the importance of the peaks, we argue that ρ is still an essential global measurement for our dynamics. The order parameter, ρ, unambiguously defines the state of our system, while the same cannot be said about Peaks(P). Notice that, in the multistable regions, the dynamics is not able to stay indefinitely in a single curve in Fig. 2b, as, in this case, the state is jumping between different modes. So, we argue that ρ and Peaks(P) should always be presented together.
Figure 2b shows that the bimodal distribution is present for a wide range of parameters, presenting regions where they vary continuously and regions with jumps. Moreover, in some cases, the different branches in Fig. 2b might be close to each other but can only be obtained in different simulations (see Peaks(P_{II}) and Peaks(P_{III}) in Fig. 2b). This observation suggests that the dynamics might be localized in different sets of nodes in the hypergraph. In this way, we might have similar macrostates as a consequence of significantly different microstates.
To better understand the localization properties of our process, we focus on the probability that an individual is active, sampled from the simulations. In Fig. 3a, we present the (hyper)adjacency matrix, as in Eq. (1), while in Fig. 3b–d we show the individual probabilities extracted from branches I to III, respectively. The matrix is ordered according to the individual probabilities of Branch III (lower). This figure shows that Branch III (panel (d)) is constrained to a group of nodes (a community roughly defined as C_{1} = {v_{1}, v_{2}, ⋯ , v_{600}}), while Branch II (intermediate branch, panel (c)) is restricted to a different set of nodes together with some bridge hyperedges, and branch I accounts for the activation of all the nodes (see panel (b)). Here, we recall that bridges are defined as the hyperedges that are composed by nodes in different communities.
Figure 4 depicts the intermittent behavior observed in Branch III of Fig. 2 for the blues reviews hypergraph with λ = 0.1016. A similar behavior was observed for a range of parameters, but we choose this specific value of λ for visualization purposes and to be consistent with the other figures. In Fig. 4a, we show the activity of the nodes as a function of the number of events (or iterations). The nodes are sorted by their activity for better visualization. Complementarily, in (b), we show the order parameter as a function of time. We observe that we have a set of nodes that are always active and a second set of nodes that can be activated due to some fluctuations. Comparing Fig. 4a with Fig. 3a, we notice that the group of nodes in the upperleft corner of Fig. 3a are the most active ones. Note that they participate in a larger number of hyperedges (note the colors). On the other hand, the rest of this community (the remaining nodes of community C_{1} in Fig. 3a) are the ones that have periods of activity and periods of inactivity.
Thus, the analysis at the individual level supports the initial hypothesis that intermittent behavior is a consequence of the activation and deactivation of a subset of nodes, or, in other words, the localization of states. The periods of high activity correspond to the activation of a sparser connected set of nodes by a more densely connected core. This latter core sustains the dynamics, and seeds the more sparsely connected nodes, which can only maintain its dynamics active for a limited time, and thus are responsible for the intermittent behavior.
Multistability and intermittent behavior
The main results of the previous section were the existence of multistability and intermittency. Here, our primary goal is to provide further arguments to support these findings and to explain the mechanism behind these phenomena. Using the theoretical framework developed in section “Model definition and theoretical analysis“ and the firstorder approximation presented in the “Methods” section, we offer a strong argument in favor of our findings and against the possibility of them being simulation artifacts. Moreover, we propose a simple generative model for hypergraphs with community structure, which provides a possible mechanism for the observed phenomenology.
Because of the finite size of the system, one may suspect the simulation to be trapped in metastates, that would vanish in a longer simulation. To counter this argument, we provide more robust evidence of multistability by providing numerical solutions of the ODE system in Eq. (5). In addition to strengthening our multistability argument, we also show that our firstorder approximation is qualitatively correct in this specific scenario, which provides an additional argument that our approximation indeed captures the essence of our model.
We observed that our simulations have a strong dependency on initial conditions. So, to numerically solve the ODE system in Eq. (5) we used one microstate obtained from our simulation as an initial condition. From this condition, we integrated Eq. (5) until reaching the steadystate. Finally, we used this solution as an initial condition to adjacent values of λ (increasing and decreasing λ). With this algorithm, we were able to uncover the five branches shown in Fig. 5. We remark that, using an uniform initial condition, e.g., y_{i}(0) = 0.1 for all i = 1, 2, . . . , N, we were not able to find most of the branches in the ODE system. The exceptions are the absorbing state and the uppermost branch, which can be find using y_{i}(0) = 1.0 for all i = 1, 2, . . . , N.
Comparing Figs. 2b and 5, we can see a clear correspondence between the predicted (ODE) and observed peaks of the state distributions (MC). Because our approximation neglects correlations and fluctuations, we are not able to capture the behavior in Fig. 2a but only the peaks of the bimodal distributions. This comparison strengthens the argument that the observed multistability is not a simulation artifact but rather a genuine feature of the model. Note that our firstorder approximation follows the same principles as the quenched meanfield in the SIS model in networks. In the network case, we only have unimodal distributions. Thus, this limitation is not an issue. However, in our case, further analysis is necessary, as we are not directly able to determine if the ODE’s solution is a peak of a multimodal distribution or not. Finally, we also remark that typically the ODE overestimates the MC predictions slightly.
The analysis in section “Example of realworld hypergraphs: the case of blues reviews” suggests that the community structure in the blues hypergraph might be responsible for the multistability and the intermittent behavior. As noted in the previous section, Fig. 3b–d, different branches are related to different sets of nodes, thus suggesting localization. Complementary, for a visual argument, see, for instance, Fig. 3a, where we can see the block organization in the adjacency matrix. In this section, we explore this hypothesis by proposing an artificial model that captures the community structure without including other correlations (see the “Methods” section for a description of this algorithm). In this way, we can test the hypothesis that this type of structure is responsible for the observed dynamical behavior.
Figure 6 shows results for the QS Monte Carlo simulations in the artificial random model with communities, and changing the number of bridges, m_{out}, for values between m_{out} = 200 and m_{out} = 600. Although not shown, in all the cases, the absorbing state is stable and can be reached from a small initial condition (e.g., ρ(t = 0) = 0.1). From the first to the third column, we increase the number of bridge hyperedges, m_{out}, thus diluting the modular structure. For m_{out} = 200 (see Fig. 6a, d, g), we have multistability, as different initial conditions lead to different solutions. We also find a region in λ where both coexist. In this case, the communities are sufficiently separated, and we do not observe intermittency. For m_{out} = 400 (see Fig. 6b, e, h), and m_{out} = 600 (see Fig. 6c, f, i), the multistability is not observed as different initial conditions led to the same solution. Interestingly, we observed intermittent behavior in the region between dashed lines in Fig. 6. In this region, we have a bimodal distribution of states and a susceptibility peak. Notice that, as we increase m_{out} the susceptibility peak also moves, appearing for lower values of λ (see Fig. 6h, i). More importantly, we recall that a similar behavior was observed in Branch III for the blues reviews hypergraph (see Fig. 2c), where we find a susceptibility peak caused by the intermittency.
For m_{out} = 200, we do not have a bimodal distribution. In this case, after the transition, we have two possible scenarios: one in which just one community is active, and a second one in which both communities are active. For m_{out} = 400 and m_{out} = 600, there is instead a region where a bimodal distribution is present. This distribution of states generates intermittent behavior due to the activation and deactivation of the sparser community, whereas the denser community sustains the process. However, the sparser one is only able to stay active for a limited time. During the lower activity periods, a strong enough fluctuation activates the sparser community. Nevertheless, after some time, this community will deactivate on its own due to another fluctuation.
These results suggest that when bridges are scarce, the communities are dynamically disconnected. Hence, we might have multiple stable solutions for a range of λ due to localization. As we add bridging hyperedges, we allow the process to travel across communities. However, this can destroy the multiple stable solutions by merging them into a bimodal distribution of states and creating intermittency. We highlight that a similar effect was also observed by increasing/decreasing the hyperedge cardinalities and by changing the criticalmass threshold Θ^{*}. In the first scenario, we noticed that by increasing the average hyperedge cardinality, we could change our system’s behavior from multistability to intermittency. Particularly, by (i) considering the same artificial model with communities as in previous numerical simulations, (ii) fixing the number of hyperedges and bridges, but (iii) changing the average hyperedge cardinality, μ, we were able to observe a shift from a multistable region for low μ to an intermittent behavior for larger μ. Moreover, by changing the criticalmass threshold Θ^{*}, we observed that, for higher values of Θ^{*}, we tend to favor multistability, while for lower values of Θ^{*} we favor intermittency. The numerical simulations of changing μ and Θ^{*} are presented in the SI. It is worth highlighting that it might be possible to construct more complex hypergraphs that would display more branches and possibly even allow for multistability and intermittency at the same region of λ. Please see also the SI for an example with four communities. We remark that here we focused on the simplest structure that reproduces both phenomena. Furthermore, one can see a relation between our results and the previous findings^{33} relative to the identification of network structures and of individuals best suited for spreading complex contagions. The authors proposed a centrality measure that accounts for the number of “enough wide bridges” between two nodes. Although in ref. ^{33} they are still using graphs (but the contagion is complex), this concept resembles the ideas behind criticalmass processes associated with our hyperedges. Thus, the term “enough wide bridges” might be understood as an abstraction of the criticalmass threshold in our context. We remark that the term “enough wide bridges” summarizes our results as it incorporates both the number of bridges (as shown in Fig. 6) and “how easy” it is to activate these bridges (results reported in the SI).
Analysis of the transition between stable branches
As we increase or decrease λ, branches can become unstable, and the process might experience a transition from one branch to another. For disease spreading on networks, this transition is usually continuous. For example, consider an SIS process in an infinite, homogeneous network (thermodynamic limit). In this case, we have an absorbing state (diseasefree state, ρ^{SIS} = 0) which is stable until the critical point. For any spreading rate larger than the critical point, the disease spreads through a collective activation of the network. In this regime, we have another branch that constitutes the active solutions (ρ^{SIS} > 0). This active branch “touches” the absorbing state at the critical point, making the transition continuous. However, when analyzing higherorder models, these transitions can be discontinuous^{10,12,14}. Furthermore, here we observed that we might have multiple transitions for the same initial condition (see Fig. 2a). Despite this evidence, a complete characterization of these transitions is still lacking. In this section, we will focus our analysis on the nature of this transition, providing an argument supporting the hybrid nature of the transitions. In this class of transitions, we have discontinuity and scalings at the same time. We highlight that this proposition seems to be general as our finding explains all the observed behavior in the susceptibility curves not only in this paper but also the one reported in refs. ^{11}^{,}^{12}.
To understand the nature of the transitions we study the hyperblob^{12}, which is a random regular graph, where every node has the same degree together with a hyperedge that includes every node. In this case, we can evaluate the exact distribution of states in the steadystate and observe how relevant quantities vary with system size. The graphical representation of the Markov chain that represents our dynamics in the hyperblob is shown in Fig. 7, where we already imposed the QS constraint, avoiding the absorbing state. In Fig. 8, we show the temporal behavior of our model, highlighting the importance of using the QS constrain as, for any finite system the dynamics will always converge to the absorbing state. Moreover, from this chain, we obtained the stationary distribution (see the “Methods” section for its derivation), which allow us to fully characterize our system in terms of the probability of having n active nodes, π_{n}. From this quantity and, in addition to the order parameter and susceptibility, here, we are also interested in the probabilities that the number of active nodes is lower or higher than the threshold Θ. The state with Θ active nodes is particularly important as, for n ≥ Θ, the Poisson Process \({N}_{j}^{{\lambda }^{*}}\) is created, which significantly changes our system’s behavior. Formally, these probabilities are, respectively, expressed as
In Fig. 9, we show the order parameter, the susceptibility and the probability of each solution for λ^{*} = 10. For a complementary analysis of the hyperblob, please see Sec. III in the SI. As we found rapid changes in both the order parameter and susceptibility, its characterization in the thermodynamic limit can be achieved using these curves’ left and right limits. These quantities are respectively denoted as ρ^{−}, χ^{−}, and ρ^{+}, χ^{+}. In practice, for the order parameter, ρ^{−} (ρ^{+}) is defined as the first value that is larger (smaller) than \(\frac{1}{N}\) from the lower (upper) solution. Complementary, for the susceptibility, we can use peaks in the derivatives of χ to define χ^{−} and χ^{+}. We observed that, if λ^{*} is low enough, the dynamics presents a secondorder phase transition followed by a hybrid transition. Note that, by “low enough” we assume that λ^{*} is constant for all sizes, i.e., do not scale with N, and it is not of the same order of the smallest size evaluated. These results are summarized in Figs. 9 and 10. In Fig. 9, we show the order parameter, the susceptibility and the probability of each solution for λ^{*} = 10. We notice a region where both solutions are possible, but only one solution exists for most of the evaluated parameters. The lower solution does not present any significant change compared to an SIS process in a homogeneous network. It exhibits a secondorder phase transition, as shown in Fig. 9a and Fig. 10a, b, where we can see a diverging peak of susceptibility. As we increase λ, the system moves from the lower to the upper solution. A hybrid phase transition characterizes the transition between these two regimes. In this type of transition, we have discontinuities on the order parameter, a feature of a firstorder phase transition, and also scalings, which are a feature of a secondorder phase transition^{34,35}. We characterize this transition by showing that \(\lambda ({\rho }_{2})\lambda ({\rho }_{2}^{+})\) and \(\lambda ({\chi }_{2}^{})\lambda ({\chi }_{2}^{+})\) tend to zero as we increase the system size, which is shown in Fig. 10e–h. The observed behavior implies that in the thermodynamic limit, we have a discontinuous transition. Importantly, the estimated exponent, \(\lambda ({\rho }_{2})\lambda ({\rho }_{2}^{+})\sim {N}^{\mu }\), μ ≈ 0.437 < 1, satisfies the conditions for a hybrid phase transition. We also note that the susceptibility peak for the whole system, χ, shows a diverging peak.
Interestingly, hybrid phase transitions were also found in a similar model for scalefree uniform hypergraphs^{11}. Specifically, these results can be translated in our model by considering Θ_{j} = ∣e_{j}∣ − 1. In other words, the model in^{11} considers that the higherorder spreading processes will only be present if all the nodes but one are already active. Moreover, they are restricted to uniform hypergraphs. Nonetheless, these results are aligned with our findings, providing additional evidence that hybrid phase transitions might be common in higherorder systems.
Discussion
A precise understanding of the dynamical properties of a model is fundamental for the correct observation, inference, and—possibly—control of the system. The expected behavior of social contagion models in simplicial complexes and hypergraphs are the discontinuous transitions and the emergency of a hysteresis cycle^{10,12,13,14}, which are not expected for processes in simple graphs^{36,37}. Although these results were surprising on their own, here we showed that these models present an even richer phenomenology, including multistability, intermittency, and hybrid phase transitions. Our results also highlight the interplay between higherorder interactions and community structure. Although not universal, these are standard features in a wide variety of real systems and are particularly common in social contexts^{38,39,40}.
Community organization might lead to localization of states
As observed in the real case and validated through artificial models, community structure in hypergraphs imposes dynamical localization of states. After a transition, the spreading can: (i) reach the whole population, while remaining delocalized; (ii) activate just a subset of individuals that scales with the system size, or (iii) activate just a node or a subset of nodes that does not scale with the system size. An example of the first scenario is the hyperblob, where the transition happens through a collective process, and all the nodes will be active with the same probability (for more details, see section “Analysis of the transition between stable branches” and ref. ^{12}). An example of the second one is instead observed in the case of the blues review hypergraph (see Fig. 3c, d, where the activity is constrained to a subset of nodes. Localization in community structured populations is not unexpected. However, the dynamic impact that it generates is indeed different from the graph cases. In graphs or multilayers we observe multiple susceptibility peaks associated to continuous changes in the order parameter^{31,37,41}, which contrasts with the phenomenology observed in our model and discussed next.
Localization in higherorder models might generate intermittency, multistability, and/or multiple transitions
The localization in a subset of individuals, item (ii), might lead to multistability and multiple transitions between branches, as we observed both in real data and artificial models (see Figs. 2 and 6). In this case, the branches are well separated, and the same set of parameters can activate different regions of the hypergraph, depending on the initial condition (see Fig. 6). This type of localization might also imply multiple transitions between stable branches. As an example, we can mention the solutions obtained by considering the initial condition ρ(t = 0) = 1, either in the real hypergraph or in the artificial model with m_{out} = 200, Figs. 2 and 6 respectively. We observe two discontinuities, one separating the absorbing state and an active solution and another separating two activity levels. However, if we consider the artificial models with m_{out} ≥ 400, we notice that the transition from the lower activity state to the higher activity state can also be continuous. Note that the concept of localization is not necessarily linked to multistability, as we might have localization with a single solution. Didactic examples are graphs with communities or multilayer networks. Here, our model reduces to an SIS in the graph scenario. In this case, we have a single absorbing state and localized processes, but the dynamics has a single accessible active state. This observation suggests that depending on the hyperedge size distribution (i.e., cardinality distribution), we might have localization without multistability. Note that, although the observed phenomena share some similar features with its graph^{32,37} and multilayer^{37,41,42} counterparts, here the mechanisms that guide localization and its macroscopic response are entirely different. In the pairwise case, the susceptibility and order parameter change continuously and, once a community is activated, it does not present abrupt temporal macroscopic variations. On the other hand, in the hypergraph case, we often observe significant macroscopic changes, which might be related to hyperedges intersections generating a cascading of activations. Moreover, in comparison with similar models on graphs (e.g., SIS), the social contagion model on hypergraphs displays a strong dependence on initial conditions. In fact, for a given set of parameters, the steadystate solution will depend mainly on the microscopic properties (e.g., localization of initial seeds) of the initial condition rather than on its macroscopic ones (e.g., total prevalence). For instance, in Fig. 2 we can see that, for the same macroscopic initial condition, depending on which community the initial seeds are placed, we reach a different branch. Moreover, we can observe the case in which a higher macroscopic initial condition leads to the absorbing state, while another with a lower macroscopic initial condition leads an active branch due to its microstate configuration. Although not shown, we observed this behavior in most of our numerical simulations (see the “Methods” section for the algorithms employed to sample specific branches).
Necessary and sufficient conditions for the observed behaviors
We were able to link the observed behaviors to the community structure. However, this does not imply that modular structures are the only ingredient able to generate multistability and intermittency. Indeed, other forms of structural correlation might play a similar dynamical role.
The stability of the absorbing state
For an SIS in an infinity graph, the absorbing state will be unstable after the epidemic threshold, and we will have an active stable solution. In the hypergraph, the conditions are not as simple as in the graph case. If the intersections between hyperedges are smaller than the criticalmass threshold, activating one hyperedge is not enough to trigger a collective behavior, regardless of the spreading rate. Although we did not study the structural constraints related to this issue, they were verified during our simulations. This effect is also related to the role of the initial conditions in our process. For example, we can think of a uniform hypergraph as a line whose intersection between hyperedges is smaller than Θ. In this way, for a high spreading rate but a small initial seed, the process will fall into the absorbing state, implying that the absorbing state might be stable for a broader range of parameters. In ref. ^{18}, the stability conditions for the absorbing state and the critical point estimations were derived, already providing additional insights about this issue. However, further numerical simulations and the spectral analysis of hypergraphs might deepen our understanding about this process.
Limitations
We must also point out the limitations of the methods employed here. We can not perform the finitesize analysis in most real systems, as we only have a single structure with a fixed size. In practice, this implies that we cannot precisely determine the phase transition type in these cases. However, through the analysis of both the order parameter and susceptibility, we obtain some understanding of these real systems. We showed that using Monte Carlo simulations and solving our ODE’s (Eq. (5)) provides a more robust argument regarding the nature of a transition (continuous vs. discontinuous) and the existence and stability of multiple branches. We expect a peak in the susceptibility curve for hybrid transitions in real scenarios right after the discontinuity. This peak can be a sign of scaling behavior. As mentioned, this was observed for the hyperblob, the hyperstar, the exponential and powerlaw distributions of cardinalities in ref. ^{12}. From a theoretical viewpoint, measuring localization by only looking at the leading eigenvector of the adjacency matrix, as can be done in graphs, is not trivial, as we can not write the probabilities of activation as an eigenvector problem plus secondorder terms. Although we have a visual indication that this matrix might encode some of the localization properties, further research is necessary to formalize this concept. Another limitation we identified is that our model does not incorporate backlash or cultural opposition, which is important from a sociological point of view. Indeed, we assume that the activation of a group increases the probability of activation of other groups. However, this might only be the case in some real scenarios. Such extension is left as a future work.
Perspectives
We have shown that the social contagion model in hypergraphs presents a rich and unexpected behavior beyond its discontinuous transitions. In particular, we showed that, depending on the structure, we might have multistability and intermittency due to bimodal state distributions. Using artificial random models, we were able to show that this phenomenology can be associated with community structures in the hypergraph. Specifically, by controlling the number of bridges between two communities with different densities, we showed that fewer bridges create multistability, while the creation of bridges destroys multistability and induces intermittency. We highlight that although community structure is not a universal feature, it is still a widespread characteristic of real social systems. Moreover, it is possible that other structural ingredients could generate similar dynamical outcomes. As we have multiple branches, the importance of the transition between them also increases. Often we observe a discontinuity in the order parameter^{10,12,13,14}. However, associated with this, we also have a divergence in the susceptibility, which is compatible with hybrid phase transitions. We formulated the exact equations for a hypergraph with structural symmetries, showing that the resulting dynamics indeed displays a hybrid transition. Although our argument is restricted to this specific structure, similar patterns were verified in all the simulations reported here, as well as in refs. ^{11}^{,}^{12}, suggesting that hybrid transitions might be general.
We hope our results open new paths for the exploration of social contagion models in hypergraphs. Analytically, understanding the necessary and sufficient conditions for the observed phenomenology is one of the most challenging future problems. From a numerical perspective, the exploration and characterization of other real systems might also reveal so far unobserved behaviors as well as confirm our findings. Another view would be motivating further research about understanding the impact of our results on different processes. For instance, how can localization impact on synchronization of oscillators, diffusion, or opinion dynamics? Would we have multistability in such dynamics? Independently and concurrently to our study, multistability was also found in coupled oscillator systems with higherorder interactions and community structure^{43}. This findings reinforce our conjecture that such phenomenology might be common in higherorder interactions.
Our findings might also impact the design of real experiments similar to the ones in refs. ^{5}^{,}^{44,45}. One of the main difficulties with this type of experiment is that the number of people participating is often reduced, and the signals in the observables are usually noisy. In such small systems, while accurately measuring multistability might be challenging, intermittency might be easier to capture as we would be interested in finding periods of high activity followed by periods of low activity. Along similar lines, data coming from online social systems, while abundant in volume and number of potential subjects, is less controlled, imposing limitations on the modeling possibilities. Despite these limitations, there are still many available datasets that are higherorder in nature (i.e., the most natural representation would be a group and not a collection of pairwise interactions), for instance, WhatsApp message exchange in groups (see refs. ^{46}^{,}^{47}) or data from Reddit as the collaboration in the social experiment r/place^{48}. We remark that, in principle, studying these datasets from the viewpoint of higherorder interactions is possible. However, this task is not trivial, and we left them as future work. Another foreseeable future direction would be incorporating different mechanisms as variants of the original model. For instance, one might propose variations that solve some of the abovementioned limitations, e.g., including backlash or cultural opposition. Another possibility would be a variant that explicitly considers Alport’s contact hypothesis^{49}.
To conclude, the literature on threshold models suggests that many processes can be modeled as binary choice criticalmass processes. For example, in ref. ^{2}, the author proposes a catalog of processes that includes diffusion of innovation, rumors and diseases, strikes, voting, educational attainment, leaving social occasions, migration, and experimental psychology. We must highlight that in ref. ^{2}, the author associates the threshold processes to the individuals and not the groups. However, the threshold is reached or not due to individual social interactions. Our approach is slightly different as we focus on the group rather than the individuals. Despite these differences, the proposed catalog is still valid in our case. The main difference is that our model might provide different mechanistic explanations for similar phenomena. We should also complement the argument for the case of disease spreading following a similar reasoning as in ref. ^{17}. We presume that our model may provide new insights into a disease spreading in which there is a viral load threshold^{50}. Since, in this case, sharing an environment with a few infected people might impose an increased risk higher than linear, which would be the standard complex network prediction, our model could better explain this process. Finally, we could also mention examples from our daily lives that can be conjectured as a result of group interactions. For example, some of the phenomena described by Malcolm Gladwell in his book, Tipping point: How Little Things Can Make a Big Difference^{51}, can also be interpreted or reanalyzed from the group dynamics point of view. A notable example would be the famous saying that “fashion is cyclic” is an effect of group interactions as fashion can be understood as a norm, as in refs. ^{5}^{,}^{24}. In this scenario, we hypothesize that the observed cyclic behavior is associated with the structural organization of our societies.
Methods
The firstorder approximation (individualbased)
Equation (2) expresses the exact process, however, it cannot be numerically solved due to its computational cost. Notice that we need O(2^{N}) equations to exactly solve this system. Thus, assuming that the random variables are independent and denoting \({y}_{i}=\left\langle {Y}_{i}\right\rangle\), we obtain the firstorder approximation as
where we assumed that the spreading rate is composed by the product of a free parameter and a function of the cardinality, i.e., λ_{j} = λ × λ^{*}(∣e_{j}∣) and \({{\mathbb{P}}}_{{e}_{j}}\left(K=k\right)\) is the probability that the hyperedge e_{j} has k active nodes inside. Specifically, we estimated the expectation of the indicator function as a Poisson binomial distribution. Formally,
where F_{k} is the set of all subsets of e_{j}⧹{v_{i}} with cardinality k, A is one of those sets, and A^{c} is its complementary, i.e., the remaining nodes of e_{j} who are not v_{i}. Intuitively, A accounts for the active nodes of each possible single microstate and A^{c} for the inactive ones. Notice that combining A and A^{c} represents ∣e_{j}∣ − 1 nodes as v_{i} is excluded. Thus, the summation over F_{k} considers all possible micro configurations in a given hyperedge. In Fig. 1 (a) we show an example of the possible micro states for each hyperedge and their associated value of T_{j}. Unfortunately, Eq. (7) is not numerically stable if ∣e_{j}∣ is large. Note that, calculating \({{\mathbb{P}}}_{{e}_{j}}\left(K=\ell \right)\) using Eq. (7) involves the multiplication of ∣e_{j}∣ terms that are smaller than one. Thus, for a large ∣e_{j}∣ we might have underflow issues. It is however possible to stabilize its solutions by considering the discrete Fourier transform^{52}
where \(C=\exp \left(\frac{2\pi {{{{{{{\bf{i}}}}}}}}}{{e}_{j}}\right)\), where i is the imaginary unit. Note that node v_{i} is excluded here, and the normalization should also change accordingly. We also remark that this approach allows us to compute the solution for arbitrarily large hyperedges. Thus, we can numerically solve the firstorder approximation in Eq. (5) using the approximation in Eq. (8). Note also that, Eq. (6) is an approximation as we assume that the nodes’ state is independent. However, Eqs. (7) and (8) are exact for independent random variables and are also identical, giving the same results.
The ODE solutions were implemented using the Gnu Scientific library^{53}. More specifically, we used the explicit embedded Runge–Kutta–Fehlberg (4, 5) method, with an adaptive stepsize control, where we keep the local error on each step within an absolute error of ϵ_{abs} = 10^{−4} and relative error of ϵ_{rel} = 10^{−3} with respect to the solution y_{i}(t).
Continuoustime simulations
We want to both validate the expressions developed in the previous sections, and statistically describe our model in arbitrary hypergraphs. To achieve this, we use continuoustime Monte Carlo simulations, more specifically, we use the Gillespie algorithm^{54}, which can be described as follows. First, we create a vector containing the times associated with all possible Poisson processes. As they are Poisson processes, the interevent times are sampled from an exponential distribution with the appropriate parameters. For instance, if it is a deactivation process, the exponential distribution has parameter δ. If the process is associated with a spreading, the parameter will be λ × λ^{*}(∣e_{j}∣). If the process is not active, we set it as ∞ (effectively the largest double). Thus, given an initial condition, the dynamics run on top of this vector of times. On each iteration, we find the element with the shortest time and execute its associated rules, which can be deactivation or spreading. Note that new processes might be created or deleted accordingly. For example, if a hyperedge reached its critical mass, the Poisson process for that event will be created. However, if, before its execution, a sufficient number of nodes is deactivated (making the hyperedge stays below its critical mass), the process should be removed. Next, our time variable is increased according to the time associated with the executed Poisson process. The same rules are repeated until reaching the absorbing state or a t_{max}. This algorithm was initially proposed in ref. ^{12}, and it is an extension of the methods described in Section 10.3 of ref. ^{37}.
Quasistationary method (QS)
Our model has a single absorbing state, the state in which every node is inactive. So, for any finite system with finite rates, the dynamics will reach this state. Mathematically this can be avoided by restricting our process to active states (see section “Quasistationary steadystate solutions”). Computationally, we adopt a similar approach. We avoid the absorbing state by moving to a previously visited activate state every time the system falls in the absorbing state. The algorithm is defined as follows. We keep a list of M previously visited active states. This list is continuously updated. If we are in an active state, with a probability p_{r}Δt, the current state replaces a random position of this list. If the absorbing state is reached, then a random state in the list replaces the absorbing state. We let the dynamics relax for t_{r} and, after that, during a time t_{s}, we sample the distribution of states, \({\mathbb{P}}\left(n\right)\), where n is the number of active nodes. Note that, on each iteration of the described algorithm, we are computing Freq(n) ← Freq(n) + Δt. In other words, we are computing the time our dynamics spent in the state n. Hence, \({\mathbb{P}}\left(n\right)\propto {{{{{{{\rm{Freq}}}}}}}}(n)\). From that, we characterize our dynamics using the order parameter and the susceptibility, respectively defined as
This method was initially proposed in ref. ^{55} and had been extensively used in the analysis of epidemic spreading^{31,37,56,57}.
We remark that t_{r} and t_{s} vary according to the system size, and the algorithm is stable to the choices of list size M and probability p_{r}. To reduce the computational cost of this method, we also employed an adaptive version. In this version, we define a variable sampling time given as \({t}_{r}+c{t}_{s}^{*}\), where \({t}_{s}^{*}\) is a smaller timewindow and c is not set but defined by the convergence of χ. In practice, we calculate χ before and after each \({t}_{s}^{*}\) timewindow. If the absolute difference between the susceptibility is lower than ϵ (here set as ϵ = 0.001), the algorithm stops. Additionally, we also define a c_{max} (here set as c_{max} = 500), which is the stop condition. Thus, we expect to reduce the computational cost with this adaptive version while keeping statistically reliable measurements.
Moreover, as we have bimodal distributions, aside from the orderparameter, ρ, and the susceptibility χ, it is also necessary to keep track and store the state distributions, P(n). We will be interested in looking at the multiple peaks of these distributions, especially the value of ρ at which these peaks appear. Here, this quantity is denoted as Peaks(P). Notice that, in the singlemode case, the peak represents the most likely value.
Multistability and simulation methods
As shown in the main text, our model strongly depends on the initial microstate, which might generate bimodal distributions or multiple stable branches for the same parameters. In the bimodal distribution case, we have an intermittent temporal behavior, and the main challenge, in this case, is sampling for long enough. Additionally, when the probability of moving from one branch to the other is very low (and not found in our numerical simulations – it will be zero only in the thermodynamic limit), we have multiple stable branches for the same value of parameters. In this case, the difficulty is finding the initial condition that will lead to such a solution. Thus, to properly explore our parameter space, we employ a twostep process. First, we explore different random initial conditions for a series of parameters, revealing some branches. They will be visible as a concentration of points in some regions of the ρ × λ diagrams. Next, to properly sample the already found branch, we use similar initial conditions to obtain the complete branch. We cannot guarantee that a given simulation will reach the expected branch due to stochastic fluctuations and the initial condition dependency. Thus, to circumvent this problem, we need to run many simulations using different initial conditions and discard those that fall in branches we are not interested in. With this process, we can sample from different branches. We remark that this procedure might be costly as we have no guarantee that the chosen initial condition will arrive at the desired branch. Despite that, in practice, this method gives reasonable results as it allows us to explore the parameter space without introducing any bias in the found solutions.
Alternatively to the random initial conditions, we can also use our knowledge of our structure and set specific initial conditions. As observed in the main text, communities are reasonable candidates to sustain the activity and, macroscopically, generate a stable solution. So, the alternative algorithm is to use as an initial condition one or more communities as active and the remaining communities as inactive. This approach was used in the Supplementary Information, sections IV and V, while the exploration of different random initial conditions was used in the main text. Naturally, this method can be extended to any initial condition of interest. We highlight that the alternative approach reduces the computational cost when we have some knowledge about our system. However, we might also be less likely to sample all the branches due to the introduced bias on the initial condition. Note also that it is very difficult to guarantee that we found all the possible branches for a given system and set of parameters.
Artificial hypergraph model
Here, we propose a hypergraph extension of the community structure model presented in ref. ^{38}. The algorithm is described as follows. The number of nodes, N, and communities, n_{c}, is fixed. The hyperedge cardinalities will be sampled from a fixed distribution, P(∣e_{j}∣). For each community c, we have \({m}_{in}^{c}\) hyperedges that will be constructed using only nodes inside the community. Each community can have a different density. To link two different communities, we have m_{out} hyperedges that will constitute the bridges. In this case, we extract a uniform number from ℓ ∈ [1, ∣e_{j}∣), where ℓ is the number of nodes in one community and ℓ − ∣e_{j}∣ will be in the other community.
In our numerical simulations in the main text we used \(P({e}_{j})={{{{{{{\rm{Exp}}}}}}}}(\mu )\) with μ = 8 but imposing that \(\min ({e}_{j})=2\) and \(\max ({e}_{j})=\frac{N}{{n}_{c}}\). For simplicity, we build a hypergraph with N = 10^{3} nodes organized in n_{c} = 2 communities. The community parameters are \({m}_{in}^{1}=1000\) and \({m}_{in}^{2}=500\), creating different levels of activation for the different groups. Finally, we leave m_{out} as a free parameter to control the number of bridges, aiming to observe and control the dynamical behavior of our model.
Exact equations for the hyperblob
In general, our exact formulation in Eq. (2) cannot be analytically solved for an arbitrary hypergraph. Nonetheless, by considering a homogeneous hypergraph, we can reduce the complexity of the problem and still calculate exact quantities. Henceforth, we focus on the socalled hyperblob^{12}. This hypergraph is defined as a set of homogeneous pairwise relationships, forming a random regular graph, where every node has \(\left\langle k\right\rangle\) edges, together with a hyperedge containing all the nodes. As the nodes are indistinguishable by their degree we can describe the state of our system by the number of active nodes n. Thus, the transition rates can be expressed as
where U(n − Θ) is the Heaviside step function and the element Q_{i,j} is the transition rate from the state with i active nodes to a state with j active nodes. The elements that are not explicitly defined in Eq. (11) are zero. Note that, if λ^{*} = 0 we recover an SIS dynamics in an homogeneous population. Figure 7 is a graphical representation of these transitions but restricted to active states (see section “Analysis of the transition between stable branches” in the main text). Consequently, we can express the temporal evolution of our dynamics as
where \(P={\left[{P}_{0},{P}_{1},...,{P}_{N}\right]}^{T}\) is a vector whose elements P_{n} are the probabilities of having n active nodes. This equation can be solved as
Moreover, denoting the steadystate solution as \(\pi \in {{\mathbb{R}}}^{N}\), it can be obtained as
Quasistationary steadystate solutions
For any finite hypergraph, the only absorbing state in our dynamics is the state n = 0. Consequently, regardless of the parameters of our dynamics, we will always reach this state. However, for sufficiently large hypergraphs, the dynamics will arrive at a metastate and remain there for some time. After leaving this state, the system will reach the absorbing state. In general, we are interested in the metastate instead of the absorbing state. So, to obtain insights about this state, we use the quasistationary distribution, which is constrained to active states. Computationally, this is effectively implemented by the QS method, described in section “Multistability and simulation methods.” Mathematically, this is done by imposing that the transition rate to this state is zero. As the process is defined in continuous time and the probability of two events happening at the same time is zero, we can implement the QS constraints as
A graphical representation of the QSconstrained chain is shown in Fig. 7. Moreover, Eqs. (12) and (13) are also valid after applying the QS constraint, Eq. (15). Note that, without this constraint, the process depends on the initial condition, while the QSconstrained system does not.
Under the QS constraint, Eq. (12) is expressed as
where P_{n} is defined for the interval n ∈ [0, N] and the limits are explicitly shown. In the steadystate, i.e., \(\frac{d{P}_{n}}{dt}=0\) for all n ∈ [0, N], we can analytically obtain the stationary distribution as
where the normalization \(\mathop{\sum }\nolimits_{i=0}^{N}{\pi }_{i}=1\) must be respected. Although we can not obtain a closed expression for π for a fixed size and set of parameters, we can calculate its exact distribution of states. The computational cost of this calculation is O(N), which allows us to evaluate reasonably large systems.
In Fig. 8, we show two examples of temporal behaviors by solving Eq. (13) with the appropriate matrices. In Fig. 8a, the system is below the critical point, while in Fig. 8b the dynamics operates above it. In the nonQS case, below the critical point, ρ goes exponentially fast to the absorbing state (ρ = 0), while in the QSconstrained case, it goes to a state near \(\rho \approx \frac{1}{N}\). Above the critical point, Fig. 8b, we can observe that ρ stays at the metastate before converging to the absorbing state. Moreover, we can see the dependency on the initial condition, where, for the same set of parameters but different initial conditions, the dynamics has a different metastate. For an example, compare Fig. 8b, curves for set 4 and set 5. Also, note that the respective QSconstrained system converges to a state compatible with the nonQS set 5. Intuitively, the differences in solutions for sets 4 and 5 are related to the probability of getting to the absorbing state due to finitesize fluctuations. For set 4, this is evident as the process begins with a single active node.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Most of the data used in our manuscript are artificially generated by computational simulations whose methods are explained in the text. The blues reviews hypergraph is publicly available at https://www.cs.cornell.edu/~arb/data/. The data are available from the corresponding author upon request.
Code availability
The algorithms used in our numerical simulations are described in the “Methods” section. Custom code is implemented in C/C++ and can be found at https://gitlab.com/guifarruda/socialcontagion.
References
Kanter, R. M. Some effects of proportions on group life: skewed sex ratios and responses to token women. Am. J. Sociol. 82, 965 (1977).
Granovetter, M. Threshold models of collective behavior. Am. J. Sociol. 83, 1420 (1978).
Dahlerup, D. From a small to a large minority: women in scandinavian politics. Scand. Polit. Stud. 11, 275 (1988).
Grey, S. Numbers and beyond: the relevance of critical mass in gender research. Polit. Gender 2, 492 (2006).
Centola, D., Becker, J., Brackbill, D. & Baronchelli, A. Experimental evidence for tipping points in social convention. Science 360, 1116 (2018).
Xie, J. et al. Social consensus through the influence of committed minorities. Phys. Rev. E 84, 011130 (2011).
Mistry, D., Zhang, Q., Perra, N. & Baronchelli, A. Committed activists and the reshaping of statusquo social consensus. Phys. Rev. E 92, 042805 (2015).
Niu, X., Doyle, C., Korniss, G. & Szymanski, B. K. The impact of variable commitment in the naming game on consensus formation. Sci. Rep. 7, 41750 (2017).
Baronchelli, A. The emergence of consensus: a primer. R. Soc. Open Sci. 5, 172189 (2018).
Iacopini, I., Petri, G., Barrat, A. & Latora, V. Simplicial models of social contagion. Nat. Commun. 10, 1 (2019).
Jhun, B., Jo, M. & Kahng, B. Simplicial SIS model in scalefree uniform hypergraph. J. Stat. Mech. Theory Exp. 2019, 123207 (2019).
de Arruda, G. F., Petri, G. & Moreno, Y. Social contagion models on hypergraphs. Phys. Rev. Res. 2, 023032 (2020).
Ferraz de Arruda, G., Tizzani, M. & Moreno, Y. Phase transitions and stability of dynamical processes on hypergraphs. Commun. Phys. 4, 24 (2021).
Barrat, A., Ferraz de Arruda, G., Iacopini, I. & Moreno, Y. in HigherOrder Systems (eds Battiston, F. & Petri, G.) 329–346 (Springer International Publishing, 2022).
Battiston, F. et al. Networks beyond pairwise interactions: structure and dynamics. Phys. Rep. 874, 1 (2020).
AlvarezRodriguez, U. et al. Evolutionary dynamics of higherorder interactions in social networks. Nat. Hum. Behav. 5, 586 (2021).
Higham, D. J. & de Kergorlay, H.L. Epidemics on hypergraphs: spectral thresholds for extinction. Proc. R. Soc. A Math. Phys. Eng. Sci. 477, 20210232 (2021).
Higham, D. J. & de Kergorlay, H.L. Mean field analysis of hypergraph contagion models. SIAM J. Appl. Math. 82, 1987 (2022).
Battiston, F. et al. The physics of higherorder interactions in complex systems. Nat. Phys. 17, 1093 (2021).
Kim, J.H. & Goh, K. I. Higherorder components in hypergraphs. https://arxiv.org/abs/2208.05718 (2022).
Cencetti, G., Battiston, F., Lepri, B. & Karsai, M. Temporal properties of higherorder interactions in social networks. Sci. Rep. 11, 7028 (2021).
O. Szabo, R., Chowdhary, S., Deritei, D. & Battiston, F. The anatomy of social dynamics in escape rooms. Sci. Rep. 12, 10498 (2022).
Amato, R., Lacasa, L., DíazGuilera, A. & Baronchelli, A. The dynamics of norm change in the cultural evolution of language. Proc. Natl Acad. Sci. USA 115, 8260 (2018).
Iacopini, I., Petri, G., Baronchelli, A. & Barrat, A. Group interactions modulate critical mass dynamics in social convention. Commun. Phys. 5, 64 (2022).
Diani, M. The concept of social movement. Sociol. Rev. 40, 1 (1992).
Banerjee, A. On the spectrum of hypergraphs. Linear Algebra Appl. 614, 82–110 (2021).
Ni, J., Li, J. & McAuley, J., Justifying recommendations using distantlylabeled reviews and finegrained aspects. In Proc. 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLPIJCNLP) 188–197 (2019).
Cornell University. Cat edge music blues reviews. https://www.cs.cornell.edu/arb/data/catedgemusicbluesreviews/ (2021).
Chodrow, P. S. Configuration models of random hypergraphs. J. Complex Netw. 8, cnaa018 (2020).
Chodrow, P. Configuration models of random hypergraphs. https://github.com/PhilChodrow/hypergraph (2022).
Costa, G. S. & Ferreira, S. C. Simple quasistationary method for simulations of epidemic processes with localized states. Comput. Phys. Commun. 267, 108046 (2021).
Goltsev, A. V., Dorogovtsev, S. N., Oliveira, J. G. & Mendes, J. F. F. Localization and spreading of diseases in complex networks. Phys. Rev. Lett. 109, 128702 (2012).
Guilbeault, D. & Centola, D. Topological measures for identifying and predicting the spread of complex contagions. Nat. Commun. 12, 4430 (2021).
Cho, Y. S., Lee, J. S., Herrmann, H. J. & Kahng, B. Hybrid percolation transition in cluster merging processes: continuously varying exponents. Phys. Rev. Lett. 116, 025701 (2016).
D’Souza, R. M., GómezGarde nes, J., Nagler, J. & Arenas, A. Explosive phenomena in complex networks. Adv. Phys. 68, 123 (2019).
PastorSatorras, R., Castellano, C., Van Mieghem, P. & Vespignani, A. Epidemic processes in complex networks. Rev. Mod. Phys. 87, 925 (2015).
de Arruda, G. F., Rodrigues, F. A. & Moreno, Y. Fundamentals of spreading processes in single and multilayer complex networks. Phys. Rep. 756, 1 (2018).
Girvan, M. & Newman, M. E. J. Community structure in social and biological networks. Proc. Natl Acad. Sci. USA 99, 7821 (2002).
Fortunato, S. Community detection in graphs. Phys. Rep. 486, 75 (2010).
Fortunato, S. & Hric, D. Community detection in networks: a user guide. Phys. Rep. 659, 1 (2016).
de Arruda, G. F., Cozzo, E., Peixoto, T. P., Rodrigues, F. A. & Moreno, Y. Disease localization in multilayer networks. Phys. Rev. X 7, 011014 (2017).
de Arruda, G. F., MéndezBermúdez, J. A., Rodrigues, F. A. & Moreno, Y. Universality of eigenvector delocalization and the nature of the SIS phase transition in multiplex networks. J. Stat. Mech. Theory Exp. 2020, 103405 (2020).
Skardal, P. S., Adhikari, S. & Restrepo, J. G. Multistability in coupled oscillator systems with higherorder interactions and community structure. Chaos 33, 023140 (2023).
PoncelaCasasnovas, J. et al. Humans display a reduced set of consistent behavioral phenotypes in dyadic games. Sci. Adv. 2, e1600451 (2016).
GutiérrezRoig, M., GraciaLázaro, C., Perelló, J., Moreno, Y. & Sánchez, A. Transition from reciprocal cooperation to persistent behaviour in social dilemmas at the end of adolescence. Nat. Commun. 5, 4362 (2014).
O’Sullivan, D. M., O’Sullivan, E., O’Connor, M., Lyons, D. & McManus, J. Whatsapp doc? BMJ Innovations 3, 238 (2017).
Caetano, J. A. et al. Characterizing attention cascades in whatsapp groups. In Proc. 10th ACM Conference on Web Science, WebSci ’19 27–36 (Association for Computing Machinery, 2019).
Rappaz, J., Catasta, M., West, R. & Aberer, K. Latent structure in collaboration: the case of reddit r/place. In Proc. International AAAI Conference on Web and Social Media (AAAI, 2018).
Allport, G. The Nature of Prejudice (AddisonWesley Publishing Company, 1954).
de Di Kobinger, M.A. L. V. et al. Ebola viral load at diagnosis associates with patient outcome and outbreak evolution. J. Clin. Investig. 125, 4421 (2015).
Gladwell, M. The Tipping Point: How Little Things can Make a Big Difference (Little, Brown and Company, 2000).
Fernandez, M. & Williams, S. Closedform expression for the poissonbinomial probability density function. IEEE Trans. Aerospace Electron. Syst. 46, 803 (2010).
Galassi, M. Gnu scientific library reference manual. https://www.gnu.org/software/gsl/ (2018).
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340 (1977).
de Oliveira, M. M. & Dickman, R. How to simulate the quasistationary state. Phys. Rev. E 71, 016129 (2005).
Ferreira, S. C., Castellano, C. & PastorSatorras, R. Epidemic thresholds of the susceptibleinfectedsusceptible model on networks: a comparison of numerical and theoretical results. Phys. Rev. E 86, 041125 (2012).
Mata, A. S. & Ferreira, S. C. Pair quenched meanfield theory for the susceptibleinfectedsusceptible model on complex networks. Europhys. Lett. 103, 48003 (2013).
Acknowledgements
G.F.d.A. and Y.M. acknowledge the financial support of Soremartec S.A. and Soremartec Italia, Ferrero Group. G.P. acknowledges partial support from Intesa Sanpaolo Innovation Center. Y.M. was partially supported by the Government of Aragón, Spain and “ERDF A way of making Europe” through grant E36 20R (FENOL), and by Ministerio de Ciencia e Innovación, Agencia Española de Investigación (MCIN/AEI/10.13039/501100011033) Grant No. PID2020 115800GB I00. The authors acknowledge the use of the computational resources of COSNET Lab at Institute BIFI, funded by Banco Santander through grant Santander UZ 2020/0274 and by the Government of Aragón (FONDOC19UZ164255). The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
G.F.d.A., G.P., and Y.M. conceived and designed the study; G.F.d.A. performed the experiments; G.F.d.A., G.P., P.M.R., and Y.M. analyzed, discussed the results, wrote the paper, and contributed to the revision of the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ferraz de Arruda, G., Petri, G., Rodriguez, P.M. et al. Multistability, intermittency, and hybrid transitions in social contagion models on hypergraphs. Nat Commun 14, 1375 (2023). https://doi.org/10.1038/s41467023371183
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467023371183
This article is cited by

Growing hypergraphs with preferential linking
Journal of the Korean Physical Society (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.