Introduction

Real world processes are often characterized by the presence of a large number of interacting phenomena at multiple time- or length scales1, and thus, they are usually described by stochastic models that are strongly interacting or history-dependent2,3,4,5. A way to understand and classify these processes in terms of stationary and non-stationary probability densities is to generalize the concepts of statistical mechanics that already proved to be very powerful for describing weakly interacting, ergodic and Markovian systems6,7. One such concept is entropy, which assigns a likelihood to macrostates, that is, to stationary distributions over microstates. Maximizing this likelihood, possibly in the presence of external constraints, yields the most probable stationary distribution characterizing the system, called the Maximum Entropy (MaxEnt) distribution, which plays a key role in describing the stationary behaviour of stochastic systems. For example, the Boltzmann-Gibbs entropy form, \({S}_{BG}=-{\sum }_{i}\,{p}_{i}\,\mathrm{ln}\,{p}_{i}\), where i runs over the microstates, follows from the assumption that the system realizations are independent and distinguishable8.

In general, however, the realizations are not independent. Instead, their interaction can be macroscopically modelled by a corresponding entropy functional, which in principle can take infinitely many different forms. Similarly to the theory of renormalization group describing critical phenomena, an apprehensive characterization of these entropies can be made by observing what are the relevant and irrelevant parameters as we approach infinite system size9. Axiomatic considerations suggest that the asymptotic scaling of the generalized entropy forms with phase space volume provides a meaningful classification of the entropies. This classification is based on the fundamental result by Hanel and Thurner10 about the entropy functionals S[p] that can be written as a sum of a pointwise function over microstates

$$S[p]=\sum _{i}\,g({p}_{i}\mathrm{).}$$
(1)

As they showed, the first three Shannon-Khinchin (SK1–SK3) axioms11,12

  • SK1 S is continuous in p,

  • SK2 S is maximal for the uniform distribution, p i  ≡ 1/W,

  • SK3 S is invariant under adding a zero-probability state to the system, S(p1, …, p W ) = S(p1, …, p W , pW+1 = 0), permit only the following asymptotic scaling relation for any entropic forms:

$$\mathop{\mathrm{lim}}\limits_{W\to \infty }\frac{S[{p}_{i}\equiv {(\lambda W)}^{-1}]}{S[{p}_{i}\equiv {(W)}^{-1}]}={\lambda }^{1-c},$$
(2)

or, equivalently, in terms of g,

$$\mathop{\mathrm{lim}}\limits_{p\to 0}\frac{g(zp)}{g(p)}={z}^{c},$$
(3)

with 0 < z < 1 and 0 < c ≤ 1.

Hence, the scaling exponent c can be used to parametrize the equivalence classes of the generalized entropy forms. For example, the Boltzmann-Gibbs entropy, where \(g(p)=-p\,\mathrm{ln}\,p\), is corresponding to c = 1, whereas the Tsallis entropy6 \(S[p]=\frac{1-{\sum }_{i}\,{p}_{i}^{q}}{q-1}\), with 0 < q ≤ 1 is corresponding to c = q. Consequently, each such equivalence class can be represented by a Tsallis entropy. Note that the fourth Shannon-Khinchin axiom S(p AB ) = S(p A ) + 〈S(pB|A)〉 A is not considered in this analysis, therefore, the entropy of a joint distribution p AB is not always decomposable to the entropy of the marginal p A and the entropy of the conditional distribution pB|A, averaged over p A .

These considerations suggest that the asymptotic exponent c provides a measure of deviation from ergodic, uncorrelated and Markovian systems regarding its stationary behaviour. Our main motivation here is to understand how this exponent relates to similar macroscopic measures which are, however, characterizing the non-stationary behaviour of the system. One of the main approaches to model the non-stationary behaviour of stochastic processes macroscopically is through partial differential equations governing the time evolution of the probability density p(x, t), called Fokker-Planck equations (FPE)13,14,15. Once specified, the underlying microscopic rules completely determine the form of the FPE. For example, the assumption of memoryless, Gaussian noise and short range interaction between the units gives rise to linear FPEs in the form of

$${\partial }_{t}\,p(x,t)=-{\partial }_{x}(p(x,t)\,f(x,t))+{\partial }_{x}(D(x,t)\,{\partial }_{x}\,p(x,t)),$$
(4)

where f(x, t) and D(x, t) are called the drift and diffusion coefficients, respectively. If the diffusion coefficient is constant (D(x, t) ≡ D) and the drift is proportional to the spatial derivative of some time-independent external potential u(x), i.e., f(x, t) = − x u(x), (4) simplifies to

$${\partial }_{t}\,p(x,t)=D\beta {\partial }_{x}(p(x,t)\,{\partial }_{x}u(x))+D{\partial }_{x}^{2}\,p(x,t\mathrm{).}$$
(5)

Specifically, in the presence of no external potential, (5) becomes

$${\partial }_{t}\,p(x,t)=D{\partial }_{x}^{2}\,p(x,t\mathrm{).}$$
(6)

Such non-stationary processes can be classified phenomenologically by the scaling of the spread of p(x, t) over time. This can be phrased mathematically as the invariance of p(x, t) under appropriate rescaling of space and time16,17:

$$p(x,t)={\tau }^{-\gamma }p\,(\frac{x}{{\tau }^{\gamma }},\frac{t}{\tau }),$$
(7)

where the scaling factor τγ of the space coordinate keeps the probability density invariant when the timescale is changed as \(t\to \tfrac{t}{\tau }\). In general, (7) is satisfied only in the asymptotic limit, i.e., when p(x, t) → 0. Nevertheless, this scaling relation, parametrized by γ, classifies the governing dynamics described by (8). For example, (6) falls into the equivalence class γ = 1/2.

Non-stationary stochastic processes that are characterized by \(\gamma \ne \tfrac{1}{2}\) are termed as anomalous diffusion processes. There are two main types of microscopic rules that can lead to anomalous diffusion of the probability density. In one case, the trajectories of the individual units (e.g., particles) remain to be uncorrelated and Markovian, however, other stochastic properties of these trajectories deviate from those of standard Brownian motion17,18,19,20. Typically these deviations stem from the fact that either the waiting time distribution between successive jumps or the jump length distribution is characterized by having infinite variance or mean. The corresponding FPEs usually include fractional derivatives, hence, these processes are termed as fractional dynamics. We do not consider this type of processes in the rest of the paper. Instead, we focus exclusively on the other type of processes that can lead to anomalous diffusion: the case in which the dynamics of the units is correlated or non-Markovian3,6,15,21,22,23,24,25,26. One way of modelling macroscopically such systems is through FPEs in which the diffusive term, \({\partial }_{x}^{2}\,p\), is replaced by \({\partial }_{x}^{2}F[p]\), where F[p], called the effective density, is a given function of the probability density p, which is either derived from microscopic rules or simply defined based on other macroscopic arguments15,22,25,27. According to the above, in the following we consider non-linear FPEs that generalize (5) as

$${\partial }_{t}\,p=D\{\beta {\partial }_{x}(p{\partial }_{x}u)+{\partial }_{x}^{2}F[p]\}.$$
(8)

Nonlinear FPEs were used in modelling a variety of phenomena in physical, biological and social sciences, such as diffusion in porous media16,28, surface growth process28, stellar dynamics29, bacterial chemotaxis25 and financial transactions30.

As we show later, for a given effective density F[p], the asymptotic anomalous diffusion exponent γ can be determined. Therefore, similarly to the phase space volume scaling exponent c of the entropy, the anomalous diffusion exponent γ might also indicate the deviation of the underlying system from being uncorrelated and Markovian. This specifies the goal of this paper, which is to investigate the relation between these two exponents, c and γ, macroscopically characterizing the stationary and non-stationary regime of the process, respectively. In order to relate entropies to FPEs, in this paper we consider continuous entropy forms, which, analogously to (1), are assumed to be written as25,31,32

$$S[p(u)]=\int \,g(p(u))\,{\rm{d}}u,$$
(9)

where g is asymptotically characterized by (3), u = u(x) is a time-independent scalar function of the space coordinate x (e.g., a potential), and the integration is performed over the range of u(x). The definition given in (9) provides a very general form, and special cases of this entropy functional have already been applied in studies of statistical mechanics of special relativity33, chemotaxis of biological populations25, stellar dynamics and two dimensional turbulence29.

For the sake of consistency between the description of these two regimes, similarly to refs27,31,32,34,35, we consider cases where the stationary solution of the FPE, given by (8), equals to the maximum entropy distribution according to the generalized entropy. It is instructive to see how the consistency criterion specified above applies to the most well-known case, the Boltzmann-Gibbs entropy, \(S=-\int \,{\rm{d}}u\,p(u)\,\mathrm{ln}\,p(u)\). In this case, the MaxEnt distribution restricted by a constraint on the expected value of u takes the form of \(p(u)=\tfrac{1}{Z}{e}^{-\beta u}\). However, this is also equivalent to the stationary solution of the FPE describing ordinary diffusion in the presence of some external potential u(x), given in (5), which is a special case of (8) with F[p] = p. Setting zero net flux at the boundaries yields \(p(u)=\tfrac{1}{Z}{e}^{-\beta u(x)}\).

Results

Based on the above, the Boltzmann-Gibbs entropy, belonging to the entropy class c = 1, is corresponding to the Fokker-Planck equation describing simple diffusion, which in turn is a member of the anomalous diffusion scaling class γ = 1/2. A natural question arising based on this observation is the following: Does every entropy belonging to the c = 1 universality class correspond to a generalized Fokker-Planck equation from the anomalous diffusion class γ = 1/2? And does every generalized Fokker-Planck equation belonging to the class γ = 1/2 correspond to an entropy belonging to the c = 1 class? In other words, does c = 1 and γ = 1/2 give rise to the same equivalence class, therefore, bijectively determine each other? In this paper we show that this is true not only for c = 1 and γ = 1/2, but for every c (0, 1] and \(\gamma \in [\tfrac{1}{2},1)\), where the exponents c and γ are connected by a simple algebraic relation. This implies that the asymptotic scaling of generalized entropies with phase space volume and the asymptotic anomalous diffusion scaling of the corresponding generalized Fokker-Planck equation classify the processes in the same way, and consequently, they provide the same information about their asymptotic behaviour. In Fig. 1 we show a schematic illustration of the above concept. Our result also provides an asymptotic generalization of the relation derived by Tsallis and Bukman34 between c and γ for the class of Tsallis entropies.

Figure 1
figure 1

Summary of the results presented in this paper. We consider macroscopic descriptions of stochastic processes where the stationary and non-stationary regime are related by identifying the MaxEnt distribution with the stationary solution of the Fokker-Planck equation. We show that under these assumptions, two of the most frequently discussed asymptotic classifications of stochastic processes, one based on the scaling of the entropy with the phase space volume and the other based on the anomalous diffusion scaling described by the Fokker Planck equation, coincide. Thus, for any c (0, 1], the corresponding equivalence class of entropies (represented by an ellipse on the left) bijectively determines an equivalence class of Fokker-Planck equations (shown by an ellipse on the right), in which the anomalous diffusion scaling exponent \(\gamma \in [\tfrac{1}{2},1)\) is constant.

In order to derive a relationship between the asymptotic exponents c and γ, let us first consider the MaxEnt distribution corresponding to entropies given in the form of (9). By following a variational principle approach and taking into account the normalization and expected value constraints we can write

$$\frac{\delta }{\delta p}\,\int \,{\rm{d}}u(g(p(u))-{\lambda }_{0}p(u)-{\lambda }_{1}up(u))=\mathrm{0,}$$
(10)

where the constants are omitted for simplicity and the λ0 and λ1 Lagrange multipliers are introduced for fixing the zeroth and first moment, respectively. From (10) we obtain

$$u=\frac{g^{\prime} (p)-{\lambda }_{0}}{{\lambda }_{1}}={\rm{\Lambda }}(p),$$
(11)

where Λ(p) is the inverse of the MaxEnt distribution corresponding to the entropy defined by g(p). In case of the Boltzmann-Gibbs entropy the inverse of the MaxEnt distribution is given by the (appropriately shifted and rescaled) logarithm function, \({{\rm{\Lambda }}}_{{\rm{BG}}}(p)=-{\lambda }_{1}^{-1}(\mathrm{ln}\,p+{\lambda }_{0}+\mathrm{1)}=-{\beta }^{-1}(\mathrm{ln}\,p+\,\mathrm{ln}\,Z)\). Therefore, Λ(p) is usually referred to as the generalized logarithm for any entropy in general6,36.

Based on a given entropy S[g(p)] and the corresponding generalized logarithm Λ(p), our next step is to find the related generalized Fokker-Planck equations in the form of (8), where F[p] is chosen such that the stationary solution of the equation becomes equivalent to the MaxEnt distribution of the entropy. By replacing u with Λ(p) in (8) according to (11) we obtain that the stationarity condition ∂ t p = 0 is fulfilled if

$${\partial }_{x}^{2}\,F[p]=-\beta {\partial }_{x}(p{\partial }_{x}{\rm{\Lambda }}(p)).$$
(12)

The expression p x Λ(p) in the r.h.s. of (12) can be rewritten using the chain rule ∂ x  = (∂ x p)∂ p as

$$p{\partial }_{x}{\rm{\Lambda }}(p)=({\partial }_{x}\,p)\,p{\partial }_{p}{\rm{\Lambda }}(p),$$
(13)

which in turn is equivalent to

$$\begin{array}{rcl}({\partial }_{x}\,p)p{\partial }_{p}{\rm{\Lambda }}(p) & = & ({\partial }_{x}\,p){\partial }_{p}({\int }_{0}^{p}\,q{\partial }_{q}{\rm{\Lambda }}(q)\,{\rm{d}}q+C(x,t))\\ & = & {\partial }_{x}({\int }_{0}^{p}\,q{\partial }_{q}{\rm{\Lambda }}(q)\,{\rm{d}}q+C(x,t)),\end{array}$$
(14)

where C(x, t) is an arbitrary function which is independent of p. Substituting (14) into (12) yields

$${\partial }_{x}^{2}F[p]=-\beta {\partial }_{x}^{2}({\int }_{0}^{p}\,q{\partial }_{q}{\rm{\Lambda }}(q)\,{\rm{d}}q+C(x,t)),$$
(15)

which gives the general expression for the effective density F as

$$F[p]=-\beta \,{\int }_{0}^{p}\,q{\partial }_{q}{\rm{\Lambda }}(q)\,{\rm{d}}q+\tilde{C}=\frac{-\beta }{{\lambda }_{1}}\,{\int }_{0}^{p}\,q{\partial }_{q}^{2}\,g(q)\,{\rm{d}}q+\frac{\tilde{C}}{{\lambda }_{1}},$$
(16)

where \(\tilde{C}=-{\beta }^{-1}C(x,t)+a(t)+b(t)x\) is an arbitrary function that is constant in p. The obtained relation (16) between F and g has already been established in an implicit form in refs25,31. In the following, we assume that F has no explicit space- or time-dependence, consequently, it is defined by g up to an additive constant as \(\frac{-\beta }{{\lambda }_{1}}\,{\int }_{0}^{p}q{\partial }_{q}^{2}\,g(q){\rm{d}}q\). In particular, for the Boltzmann-Gibbs entropy \({g}_{{\rm{BG}}}(q)=-q\,\mathrm{ln}\,q\) and β = λ1, yielding \({\partial }_{q}^{2}\,{g}_{{\rm{BG}}}(q)=-{q}^{-1}\), which results in FBG[p] = p. Since F[p] is formulated based on g(p) in (16), we call the resulting equation

$${\partial }_{t}\,p=D\{\beta {\partial }_{x}(p{\partial }_{x}u)+{\partial }_{x}^{2}(\frac{-\beta }{{\lambda }_{1}}\,\int \,q{\partial }_{q}^{2}g(q)\,{\rm{d}}q)\}$$
(17)

as the g-Fokker-Planck equation in order to distinguish it from the many other possible generalizations of FPEs. Note that in general many possible dynamics can lead to the same stationary state. However, as our derivation shows, if the dynamics, given by a non-linear Fokker-Planck equation, is constrained to be in the form of (8), then F[p] is determined by the stationary state, or, equivalently, by g, up to an additive constant.

In the following, let us consider the g-Fokker-Planck equation with no external potential,

$${\partial }_{t}\,p(x,t)=D{\partial }_{x}^{2}F[p(x,t\mathrm{)].}$$
(18)

We assume that the solution of (18) exists, at least from an appropriate initial condition, and it reaches the asymptotic limit p(x, t) → 0 for all x. In this asymptotic limit, the scaling rule (7) applies to p(x, t). Thus, if we change to the rescaled variables x′ = x/τγ and t′ = t/τ, the derivatives according to the new variables can be written as

$${\partial }_{t}={\tau }^{-1}{\partial }_{t^{\prime} },\quad {\partial }_{x}^{2}={\tau }^{-2\gamma }{\partial }_{x^{\prime} }^{2}.$$
(19)

Using (7) and (19), the g-Fokker-Planck equation with no external potential given in (18) in the rescaled variables can be formulated as

$$\begin{array}{rcl}{\partial }_{t}\,p(x,t) & = & {\tau }^{-1}{\partial }_{t^{\prime} }({\tau }^{-\gamma }p(x^{\prime} ,t^{\prime} ))\\ & = & {\tau }^{-(\gamma +\mathrm{1)}}{\partial }_{t^{\prime} }p(x^{\prime} ,t^{\prime} )\\ & = & D{\partial }_{x}^{2}F[p(x,t)]\\ & = & {\tau }^{-2\gamma }D{\partial }_{x^{\prime} }^{2}(F[{\tau }^{-\gamma }p(x^{\prime} ,t^{\prime} )]).\end{array}$$
(20)

The F[τγp(x′, t′)] term on the right hand side can be further transformed based on the scaling of g(p) given in (3), where by a change of variable \(\tilde{q}=q{\tau }^{\gamma }\) we obtain \(g(\tilde{q}{\tau }^{-\gamma })=g(\tilde{q}){\tau }^{-\gamma c}\) (being valid for \(\tilde{q}\ll 1\)). By substituting this into (16) we obtain

$$\begin{array}{rcl}F[{\tau }^{-\gamma }p(x^{\prime} ,t^{\prime} )] & = & \frac{-\beta }{{\lambda }_{1}}\,{\int }_{0}^{{\tau }^{-\gamma }p(x^{\prime} ,t^{\prime} )}\,q{\partial }_{q}^{2}g(q)\,{\rm{d}}q\\ & = & \frac{-\beta }{{\lambda }_{1}}\,{\int }_{0}^{p(x^{\prime} ,t^{\prime} )}\,{\tau }^{-\gamma }\tilde{q}{\tau }^{2\gamma }{\partial }_{\tilde{q}}^{2}g(\tilde{q}){\tau }^{-\gamma c}{\tau }^{-\gamma }{\rm{d}}\tilde{q}\\ & = & {\tau }^{-\gamma c}F[p(x^{\prime} ,t^{\prime} )].\end{array}$$
(21)

According to that, the g-Fokker-Planck equation (20) in the rescaled variables yields

$${\tau }^{-(\gamma +\mathrm{1)}}{\partial }_{t^{\prime} }p(x^{\prime} ,t^{\prime} )={\tau }^{-2\gamma -\gamma c}D{\partial }_{x^{\prime} }^{2}F[p(x^{\prime} ,t^{\prime} )].$$
(22)

Consequently, the dynamics remains to be governed by the original g-Fokker-Planck equation (18) for any τ if and only if the prefactors of both sides are equal for any τ, that is, the exponents must coincide,

$$-(\gamma +\mathrm{1)}=-2\gamma -\gamma c.$$
(23)

By rearranging (23) we obtain the main result of the paper

$$\gamma =\frac{1}{1+c},$$
(24)

providing a general relation between the exponent γ related to the anomalous diffusion, characterizing the scaling of p(x, t) in the p(x, t) → 0 limit and the exponent c, describing the scaling of the generalized entropy with the phase space volume. Note that the derivation above only requires g(p) to obey the asymptotic scaling relation (3) and the existence of the solution to (18) with the asymptotic limit p(x, t) → 0 reached for all x.

In order to demonstrate this general result, in Table 1 we list a few different generalized entropy forms from the literature together with the corresponding g-Fokker-Planck equations and the related γ and c exponents. Although the actual algebraic form of the entropies along with their phase space volume scaling, their MaxEnt distributions and the corresponding generalized Fokker-Planck equations are different for any c, their asymptotic anomalous diffusion scaling is completely determined by c via (24). This exemplifies the fact that although the mapping between entropies and Fokker-Planck equations are defined at any (phase space volume or time) scale, any entropy, characterized by asymptotic exponent c, can only be mapped to a Fokker-Planck equation describing anomalous diffusion with asymptotic exponent given by (24). In close connection to Table 1, Fig. 2 shows the finite scale phase space volume scaling of some generalized entropies, illustrating the numerous possible ways of convergence to the asymptotic value c.

Table 1 Generalized entropies \(S=\int \,g(p(u))\,{\rm{d}}u\), the inverse of their MaxEnt distribution Λ(p), their corresponding g-Fokker-Planck equation (with no external potential), their phase space scaling exponent c (0, 1] and their anomalous diffusion scaling exponent γ.
Figure 2
figure 2

Finite scale phase space volume scaling exponent c f (p) of some generalized entropies belonging to asymptotic classes c = 0.1, c = 0.5 and c = 1. According to the rigorous formulation of the asymptotic phase space volume scaling, given by (3), g(p) ~ pc when p → 0. Based on this, we define c f (p) as the slope of the tangent of g(p) on a log-log plot, \({c}_{f}(p)=\frac{\partial \,\mathrm{ln}\,g(p)}{\partial \,\mathrm{ln}\,p}=p\frac{\partial \,\mathrm{ln}\,g(p)}{\partial p}\). The convergence of the curves in the low p regime indicates that the effects of the phase space volume scaling (which is illustrated in the left panel of Fig. 1) are apparent already at finite scales.

Discussion

In this paper, we considered a class of stochastic processes, describing systems possibly composed of strongly interacting units or governed by non-Markovian dynamics, which can be macroscopically modelled by non-linear Fokker-Planck equations in the form of (8). These equations generalize the linear Fokker-Planck equation by replacing the probability density p(x, t) in the diffusive term \({\partial }_{x}^{2}\,p(x,t)\) by an effective density F[p(x, t)]. The actual form of F[p] determines both the stationary and the non-stationary behaviour of the process. The non-stationary behaviour in the presence of no external potential, i.e., the solution of (18), can be classified according to the spread of the probability density by the anomalous diffusion scaling exponent γ, defined by eq. (7). This exponent γ provides a measure of deviation from ordinary diffusion, characterized by γ = 1/2.

Another macroscopic approach of modelling stochastic processes is through the construction of generalized entropy functionals S which are maximized by the stationary state of the processes. As it has been already shown, a meaningful classification of generalized entropies over a discrete phase space indexed by i, \(S[p]={\sum }_{i=1}^{W}\,g({p}_{i})\), can be given by their scaling with phase space volume W, characterized asymptotically by the exponent c. Here we consider analogous continuous entropy functionals in the form of \(S[p(u)]=\int \,g(p(u))\,{\rm{d}}u\). Similarly to the exponent γ regarding the non-stationary regime, the phase space volume scaling exponent c of the entropy quantifies the deviation from uncorrelated, Markovian systems (characterized by c = 1) at the stationary regime.

The two approaches, one based on Fokker-Planck equations and the other on entropies, are consistent at the stationary regime if the stationary solution of the Fokker-Planck equation equals to the maximum entropy distribution according to the generalized entropy. In this paper we show that this consistency criterion implies that asymptotically, i.e., at p → 0, the anomalous diffusion scaling exponent γ and the phase space volume scaling exponent of the entropy c bijectively determine each other via the relation \(\gamma =\tfrac{1}{1+c}\). Asymptotically, this result generalizes that of Tsallis and Bukman34, now being valid for any generalized entropy functional satisfying some general asymptotic conditions. In addition, since the explicit solution of the corresponding Fokker-Planck equation might either not be available, or possibly have infinite variance, our derivation do not rely on the computation of any of these. Our results suggests that either of the asymptotic exponents γ and c is indeed providing a useful characterization of the systems themselves, and not just describing their behaviour in the stationary or non-stationary regime. Furthermore, the surprising versatility of the theoretical framework behind Tsallis statistics to model various aspects of strongly interacting systems might be explained by the fact that the family of Tsallis entropies, characterized by their deformation index q, provides an algebraically simple representative of each such asymptotic equivalence class.