When optimization for governing human-environment tipping elements is neither sustainable nor safe

Barfuss, Wolfram; Donges, Jonathan F.; Lade, Steven J.; Kurths, Jürgen

doi:10.1038/s41467-018-04738-z

Download PDF

Article
Open access
Published: 15 June 2018

When optimization for governing human-environment tipping elements is neither sustainable nor safe

Wolfram Barfuss^1,2,
Jonathan F. Donges^1,3,
Steven J. Lade^3,4 &
…
Jürgen Kurths^1,2,5

Nature Communications volume 9, Article number: 2354 (2018) Cite this article

4486 Accesses
31 Citations
49 Altmetric
Metrics details

Subjects

Abstract

Optimizing economic welfare in environmental governance has been criticized for delivering short-term gains at the expense of long-term environmental degradation. Different from economic optimization, the concepts of sustainability and the more recent safe operating space have been used to derive policies in environmental governance. However, a formal comparison between these three policy paradigms is still missing, leaving policy makers uncertain which paradigm to apply. Here, we develop a better understanding of their interrelationships, using a stylized model of human-environment tipping elements. We find that no paradigm guarantees fulfilling requirements imposed by another paradigm and derive simple heuristics for the conditions under which these trade-offs occur. We show that the absence of such a master paradigm is of special relevance for governing real-world tipping systems such as climate, fisheries, and farming, which may reside in a parameter regime where economic optimization is neither sustainable nor safe.

Evaluating the efficacy and equity of environmental stopgap measures

Article 23 March 2020

Holly Jean Buck, Laura Jane Martin, … Shuchi Talati

Environmental sustainability is not worth pursuing unless it is achieved for ethical reasons

Article Open access 02 June 2020

Fabio Zagonari

Multi-objective optimization can balance trade-offs among boreal caribou, biodiversity, and climate change objectives when conservation hotspots do not overlap

Article Open access 13 July 2022

Amanda E. Martin, Erin Neave, … Cheryl A. Johnson

Introduction

The Sustainable Development Goals¹ and the Paris climate agreement set the target of prosperous development for people and our planet. Yet, it remains challenging to translate these aims into concrete policy implementations, accounting for non-linearities, such as tipping elements^2,3, regime shifts^4,5, and multi-stabilities⁶, as well as multiple kinds of uncertainties^7,8,9, and extreme events¹⁰.

To support the decision making processes in these contexts, we ask the question how the three prominent decision making paradigms of economic welfare optimization, sustainability and safe operating space compare with each other. Specifically, we investigate the parameter regimes for synergies and trade-offs when applying these paradigms to the management of tipping elements¹¹ and how these findings relate to the three real-world systems of climate, fisheries and farming.

Optimization approaches have emerged as the primary guiding principle to derive a policy strategy for environmental governance^12,13. Most often, the present value of macroeconomic social welfare, i.e., the sum of discounted future benefits minus costs, is the target to be optimized. Such optimization approaches have been criticized regarding the discount rates used, delivering short term gains at the expense of long-term environmental degradation^14,15. Further criticism targets the lack of a systems perspective required to understand the structural landscape of model dynamics, as well as the assumptions made due to imperfect information^6,9,10. This critique is partly dealt with in optimization variants, such as robust^7,16 or viable^17,18,19 control, which are dealing with multiple types of uncertainty²⁰. Naturally, other or multiple objectives²¹ and criteria^22,23 with possible constraints²⁴ can be optimized as well. In this work, we use the term solely in the narrow economic sense of maximizing the present value as defined in Eq. 1 below.

In recognition of increasing environmental and social threats²⁵ the policy paradigm of sustainability has emerged in the scientific and political discourse^26,27. The economics of sustainability has brought up many definitions of sustainability alone^28,29,30,31. In these analyses sustainability is usually imposed as a constraint within an economic welfare optimization paradigm. Trade-offs to economic welfare optimization are well known^28,32. However, these classic social welfare optimization approaches are challenged through the increasing recognition of non-linearities, such as tipping points, regime shifts, uncertainties and the risk of catastrophic outcomes^6,9. Taking up these challenges, e.g., non-convexities³³ and climate tipping elements^34,35 have been studied within an economic framework. Here, we derive our formal definition of sustainability from the Brundtland report²⁶. Its design is deliberately simple and targeted to the mathematical framework we use (see below). We do not intend our definition to be applicable to a general model of a welfare economy^12,27.

Recent advances in sustainability science have brought forth tolerable windows³⁶ or safe operating spaces^37,38 as a policy paradigm to derive concrete actions from³⁹. These concepts originate from resilience thinking⁴⁰ and a precautionary principle⁴¹ to deal with potential dangerous tipping elements in the environmental governance system. Trade-offs but also synergies with optimization thinking have been therefore discussed⁴². Also formal analyses studying relations between resilience as a system property and sustainability were conducted^43,44.

However, the reciprocal relationships between these three paradigms of economic optimization, sustainability and safe operating space is still insufficiently explored. Such an understanding is important in order to judge, for example, when economic optimization is, or is not, an appropriate policy goal. Also, guidance is required when a sustainability paradigm may conflict with a safe operating space paradigm and vice versa.

Here, we report progress towards a better understanding of the mutual relationships between these three paradigms of economic optimization, sustainability and safe operating space by applying them to a stylized model of a human-environment tipping element. We do so because of the increasing importance of tipping points and regime shifts in environmental governance. Our model is deliberately stylized, thereby applicable across multiple cases and scales, to gain a deeper understanding more complex models might miss. The formal definitions of the three paradigms are designed to fit our mathematical framework (see below). Since we do not focus on intragenerational justice in this article, one agent suffices as a decision making subject, in contrast to a multiagent setting. We find that there exists no master paradigm between the three examined, i.e., a policy can be any combination of optimal or not, sustainable or not and safe or not. This is of special relevance to the climate system which may reside at the edge in the parameter regime where economic welfare optimization becomes neither sustainable nor safe. This suggests the use of more advanced paradigms to support decision making in climate policy.

Results

Stylized model of a human-environment tipping element

We use the mathematical framework of Markov Decision Processes^45,46, in which an agent makes decisions about how to interact with its environment (Fig. 1a). Our particular environment can reside in either a prosperous state, which provides immediate rewards (also called payoffs) to the agent, or a degraded state, from which the agent receives no payoff. At each time step, the agent chooses between two actions a, exerting either a high or low pressure on the environment. Depending on the current state s, the current action a and the subsequent state s′, the agent receives an immediate reward r (Fig. 1b). At the prosperous state, taking the low pressure action the agent is guaranteed to receive reward r_l and remain at the prosperous state. However, taking the high pressure action, the agent may receive reward r_h (which is typically larger than r_l), but risks triggering a collapse of the environment to the degraded system state with non-zero probability δ and no immediate reward at all. From there, only the low pressure action opens the option to recover to the prosperous state with non-zero probability ρ.

For example, the high pressure action could correspond to emitting a business-as-usual amount of carbon to the atmosphere yielding a reward of high, short-term economic output as long as the system has not tipped. The low pressure action resembles emitting a reduced amount of carbon, assuming a lower short-term economic output for the guarantee to not trigger climate tipping elements into a disastrous state.

A policy π is a function that specifies what action a to apply at a system state s. The agent receives reward r_t at time step t. The value v_π(s) of a state s under a given policy π is given by the expected value of the normalized accumulated discounted rewards r with discount factor 0 ≤ γ ≤ 1 when starting in state S₀ = s and following policy π:

$$v_\pi \left( s \right) = {\Bbb E}_\pi \left[ {\mathop {{{\mathrm{lim}}}}\limits_{T \to \infty } \frac{{\mathop {\sum}\nolimits_{t = 0}^T {\gamma ^tr_t} }}{{\mathop {\sum}\nolimits_{t = 0}^T {\gamma ^t} }}\left| {S_0 = s} \right.} \right].$$

(1)

Note that the discount factor actually denotes the farsightedness of the agent. Thus, γ = 1 corresponds to no discounting (weighting all rewards equally regardless of when they are expected), whereas γ = 0 corresponds to completely myopic, fully discounting agents.

Paradigm definitions

We classify policies according to whether they are economic welfare optimal or not, sustainable or not, and safe or not. For the sake of simplicity we focus on two deterministic policies, distinguishing whether the agent should apply the low or the high pressure action at the prosperous state (Fig. 1c): the risky policy (π_r(p) = h, π_r(d) = l), applying the high pressure action at the prosperous state and the low pressure one at the degraded state and the cautious policy (π_c(p) = l, π_c(d) = l), applying the low pressure action at the prosperous, as well as the degraded state.

A policy π is defined as optimal (in the economic welfare sense) if its value v_π(s) (Eq. 1) for every state s is larger than or equal to the value of any other policy⁴⁶.

Based on the Brundtland Commission’s report on sustainable development²⁶ a sustainable policy should fulfill two requirements: First, meet the needs of the present. We translate this formally into the agent evaluating the present state s as acceptable (similar to viable¹⁷, tolerable³⁶ or desirable⁴⁷), if its value (Eq. 1) exceeds a normatively chosen minimum acceptable value r_min:

$$s\,{\mathrm{acceptable}}\,{\mathrm{under}}\,\pi \,{\mathrm{iff}}\;v_\pi \left( s \right) \ge r_{{\mathrm{min}}}$$

(2)

Note, that the division of state space into acceptable and unacceptable states is not identical for all polices, but depends on the rewards receivable through executing a policy. Second, a sustainable policy should sustain the ability to meet the needs of the future²⁶.

We define a policy π as sustainable if every state the agent eventually visits under policy π is acceptable (Eq. 2).

Note that this reduction of sustainability to the one-dimensional value v_π(s) has much similarity with the notion of weak sustainability⁴⁸.

The Safe Operating Space (SOS)³⁷ is typically defined as a subset of the whole state space ${\cal S}$, containing favorable system states bounded by thresholds^39,49. In practice, the position of these potential tipping thresholds is always uncertain and the boundaries are placed at the lower end of the uncertainty zone. In that way the definition of the safe operating space states constitutes a normative judgment about the risk the decision maker is willing to tolerate. In the subsequent analyses we take the extreme position of no risk tolerance and identify the SOS with only the (more favorable) prosperous state, independent of the collapse probability δ.

We define a policy π as safe if every state the agents eventually visits under policy π lies within the SOS.

In contrast to acceptable and unacceptable states, safe states are independent of the policy used.

In summary, our stylized model of a human-environment tipping element depends on the five parameters δ, ρ, γ, r_l/r_h, r_min/r_h: the probability of a collapse from the prosperous to the degraded state under the high pressure action δ, the probability of recovery from the degraded to the prosperous state under the low pressure action ρ, the agent’s discount factor γ, the high reward receivable from the high pressure action when staying at the prosperous state r_h, the low reward receivable by taking the low pressure action at the prosperous state r_l, and the normatively chosen minimum acceptable reward r_min a state value must have to be perceived as acceptable under a certain policy. Since all three rewards come in arbitrary units, the policy classification only depends on their ratios.

Classification of risky and safe policy

Based on Eqs. 1 and 2 we analytically compute whether the risky and the cautious policy are optimal or not, sustainable or not and safe or not depending on the model parameters (δ, ρ, γ, r_l/r_h, r_min/r_h) (see Methods and Fig. 2).

We observe that above a certain critical value of the collapse probability δ the cautious policy becomes optimal (Fig. 2a, pink), despite the smaller immediate reward r_l = 0.5r_h. This result confirms previous findings on optimal management with regime shifts⁵⁰.

Further, we find a decreasing critical collapse probability with increasing farsightedness γ. Hence, for more farsighted societies the risky policy is optimal only for small collapse probabilities δ (orange).

Provided the low pressure reward exceeds the normative minimum acceptable value threshold, r_l ≥ r_min, then the cautious policy is sustainable for all parameter combinations δ, ρ, γ, r_l/r_h (Fig. 2b, blue and purple). Only for small collapse probabilities δ and simultaneously high farsightedness γ the risky policy becomes sustainable as well (purple). This is because in this parameter region the risky policy is acceptable also at the degraded state (Methods).

The cautious policy is a safe policy independently from the parameter combinations δ, ρ, γ, r_l/r_h, r_min/r_h (Fig. 2c, green). It is important to emphasize that there is no combination of parameters at which the risky policy is safe.

Relationships between paradigms

We find that policies can be classified along all logical combinations of the three examined paradigms (optimization, sustainability, safe operating space). This yields a classification of policies into eight different categories (Fig. 3).

In particular, optimal policies are not necessarily sustainable (opt and not sus: Fig. 3, red and yellow). This is the case if the normative value threshold r_min is too large. The cautious policy does not return enough value to be sustainable (r_l < r_min, yellow) and the risky policy at the degraded state produces too little future reward to be sustainable, due to the low chance of recovery and lack of farsightedness.

Nor are optimal policies necessarily safe (opt and not safe: Fig. 3, red and purple). This occurs in parameter regions where the risky policy is optimal. The risky policy cannot be safe because of the risk of collapse to the degraded state.

A safe policy does not necessarily imply a sustainable policy either (safe and not sus: Fig. 3, green and yellow). When the normative threshold value for sustainability r_min exceeds the reward from a low pressure action r_l: r_min > r_l, then the cautious policy is safe but not sustainable. Following a similar line of argument, the SOS concept³⁷ has been extended to a Safe And Just Operating Space (SAJOS) which additionally accounts for social indicators⁵¹, such as the number of people living in extreme poverty. Thus, SAJOS policies can be interpreted as the overlap of safe with sustainable policies. Within our model, we can give a definite criterion for when this form of SAJOS exists: as long as the reward from a low pressure action r_l exceeds the normative threshold value r_min (r_l > r_min), the cautious policy is both safe and sustainable (Fig. 3, cyan and gray).

However, there exist also sustainable policies outside the SOS (sus and not safe: Fig. 3, blue and purple.) These are risky policies (hence, not safe) with simultaneously high farsightedness γ and low collapse probability δ. At those parameter regions the degraded state is still evaluated as acceptable due to sufficient anticipated future rewards and therefore the risky policy is sustainable. The circumstance that parameter regimes exist that are sustainable but not safe and vice versa clearly stems from our definition of sustainability which resembles a form of weak sustainability⁴⁸. By doing so we can conceptually separate issues of environmentally safe and socially just without compromising the target of a safe and just parameter space regime.

Note that this classification into the eight different policy paradigm combinations also applies to the case of absolute farsightedness (γ = 1; see the tops of Fig. 3b–e). Thus, the trade-offs between the examined paradigms do not vanish, as one might presume considering the debate about appropriate discount rates^14,52.

Volume of paradigm combinations

So far, we have visualized the parameter space of our stylized tipping element model in two dimensional sections and fixed the remaining parameters for illustrative purposes. By doing so, we showed the mutual dependence between parameters, foremost the discount factor γ and the collapse probability δ. However, in the light of considerable parameter uncertainty we ask how large the eight regimes of paradigm combinations are, given the whole parameter space (Fig. 4).

We observe the most likely option to be the regime that is neither optimal, neither sustainable nor safe followed by the parameter sweet spot regime in which all paradigms yield the cautious policy as optimal, sustainable and safe. Together they constitute a parameter space volume of approx. 45% in which the three paradigms of economic optimization, sustainability and safe operating space align with each other in yielding the same policy. Interestingly, the third likeliest option is the paradigm combination in which the risky policy is optimal but neither sustainable nor safe. This is the most likeliest parameter regime among those where the paradigms yield different policies. Thus, blindly applying economic optimization in a our stylized tipping element has a significant chance of leading to policies that are neither sustainable nor safe.

On the other hand, the volume of the safe and just operating space (gray and cyan bars in Fig. 4) is comparable to the most likeliest (black) regime. Thus, about one out of four random decision making agents interacting with a random tipping element will end up in the safe and just operating space.

Application to real-world human-environment tipping elements

The above policy classification offers valuable insights for the governance of real-world human-environment systems. We discuss how our analysis relates to the cases of the climate system, fisheries and farming. Our purpose is to gain a qualitative understanding how our model relates to important real-world challenges of environmental governance, not a detailed assessment of the latter. Therefore, we roughly estimate the respective collapse and recovery probabilities per time step δ and ρ of our model via the typical timescales on which these systems remain in one state or the other (see Methods). Additionally, we added a parameter sensitivity analysis by visualizing the likelihood of ending up in a certain parameter regime by color gradients between regimes (Fig. 5).

Regarding the climate system, we acknowledge that several interacting tipping elements contribute to the system’s behavior² and its representation as a single tipping element is a huge simplification on its own. Nevertheless, we assume that the current state of the climate system is still comparable to the prosperous one of our model and relevant timescales for triggering a collapse of 30 to 50 years under business-as-usual socio-economic development scenarios^2,53,54. Regarding the recovery timescale it has been shown that human perturbations of the climate system already changed its trajectory on a multi-millennial timescale^55,56. Therefore we assume a recovery probability per time step ρ close to zero (Fig. 5).

For sufficiently large collapse probabilities (collapse time scale near 20 years and smaller), the climate system is likely to reside in a parameter sweet spot (gray area), where applying an optimization, sustainability or SOS paradigm results in the cautious policy as the advisable way of governing the climate system. However, if the collapse probability per time step is smaller (collapse time scale near 50 years and larger) the situation is different. Here, an SOS and a sustainable paradigm would still yield the cautious policy (Fig. 5, cyan), but an optimization paradigm is likely to give the risky policy (Fig. 5, red), which at this point is neither sustainable nor safe. We conclude that in climate policy, economic welfare optimization alone may neither be sustainable nor safe.

For fishery systems, both transition probabilities certainly depend on a variety of factors, e.g., fisher’s technical and cultural traits or the dominant fish species in the system, as well as external factors such as climate change influencing habitat condition^57,58. The timescale of a fisheries collapse has been shown to lie within decades⁵⁹. Roughly consistent with observational and modeled data from the Baltic sea, where the stable regime of high cod biomass lasted approximately from 1970 to 1990^57,60, we assume a typical collapse timescale of around 20 years. Concerning the typical recovery time scale, successful attempts of fish stocks recovery lasted for decades⁶¹, but is estimated to generally exceed this duration⁶². We therefore assume a larger typical recovery timescale of around 50 years. The color gradient in Fig. 5 at the fisheries point does not clearly single out a paradigms regime, indicating the dependence on the other parameters at this point. A risky policy might be economically optimal (Fig. 5, red), but leads eventually to the collapse of fish stock (c.f⁵⁹.). At the collapsed and degraded state the conditions for the fishers are not acceptable. Therefore they have to leave the system and cannot wait for the fish’s recovery. But further investigation is needed to reduce the uncertainty with respect to the other parameters.

Last, we look at the case of land degradation by farming in our stylized model. Land degradation and restoration is a complex topic with many influencing factors⁶³. Nevertheless, land degradation by farming has been identified as a tipping element by Kinzig and others⁶⁴, where the authors discuss the case of the western Australian wheatbelt with a typical collapse timescale of about 100 years. Soil recovery is estimated to take place within 20 to 1000 years⁶⁵, which is roughly consistent to Kitzing et al., where the duration to reach equilibrium again is estimated with up to 300 years. We therefore assume a typical recovery timescale of about 300 years. In contrast to climate and fisheries, the transition probabilities we associated with the process of land degradation by farming suggest, that here an optimality paradigm is very likely to yield the risky policy which is neither sustainable nor safe despite considerate parameter uncertainty (red area in Fig. 5).

Taken together, it is interesting to see that in particular the climate system may reside at the edge of the parameter regime where economic welfare optimization becomes neither sustainable nor safe (Fig. 3). For land degradation by farming, our assessment suggests that an optimal policy is likely to yield a non-sustainable and non-safe policy whereas for fisheries the situation is less clear.

Discussion

Overall, our results show that there exists no master paradigm among the three examined in our model of environmental governance of a stylized tipping element. Policies can be classified by any combination of optimal, sustainable and safe. A master paradigm, in contrast, would guarantee fulfilling requirements imposed by other paradigms. Consequently, the selection of appropriate policy paradigms, especially in more complex settings and models, can be critical for effective environmental governance.

Specifically, our results show theoretically, as well as empirically that economic welfare optimization for managing tipping elements may be neither sustainable nor safe. For example, the volume of the corresponding paradigm combination in parameter space is the largest among those in which the three paradigms actually yield different policies. This suggests the conclusion that the mere structure of a tipping element causes a comparable high chance of obtaining a policy that is neither sustainable nor safe when blindly following an optimization paradigm. On the other hand, our model also indicates parameter regimes where economic optimization can safely and sustainably be used.

We derived simple heuristics to anticipate when a policy is economic welfare optimal, sustainable and safe. A risky policy may be optimal when the probability of collapse and/or the farsightedness are sufficiently small. It may be sustainable when the probability of a collapse is sufficiently small but the farsightedness is sufficiently large. However, it cannot be safe. A cautious policy may be optimal when the collapse probability and/or the farsightedness are sufficiently large. It is sustainable if its immediate reward exceeds the normatively chosen minimum acceptable reward and it is always safe. The absence of a master paradigm is of special relevance for governing the climate system, since the latter may reside at the edge between parameter regimes where economic welfare optimization becomes neither sustainable nor safe.

Extensions are possible in many directions. Constrained optimization²⁴ is a straight-forward way to combine the paradigms examined. Policy makers could aim for the maximum economic welfare delivering a policy that is safe and sustainable, or least-cost safe target strategies¹⁵. This is certainly a better approach than relying on economic welfare optimization alone for model-based policy advice. Examples of models for policy advice certainly include integrated assessment models or the use of the maximum sustainable yield in fisheries management. However, one might not desire to obtain the welfare optimal safe and sustainable policy but e.g., the most resilient one, which calls for an operationalization of modern social-ecological resilience concepts⁶⁶.

The application of our model to real-world systems in this article is of qualitative, illustrative nature. A more detailed analysis of real world tipping elements in which typical transition probabilities might be estimated from empirical time series could be a way forward to systematize and draw lessons from the multitude of human-environmental tipping elements⁶⁷.

Applying our analyses to larger, more complex Markov decision processes would be a way to extend the understanding of the relationships between the paradigms examined. Moreover, it may be desirable to include further policy paradigms into the analyses, e.g., aiming for a large option space of future decision makers^30,68. Based on such analyses, policy makers could make better informed decisions on how to translate the Sustainable Development Goals and the Paris climate agreement into concrete policy implementations.

Methods

Derivation of value functions

There are four deterministic policies in our Markov decision process model: (1) π_r(p) = h, π_r(d) = l, (2) π_c(p) = l, π_c(d) = l, (3) π₃(p) = h, π₃(d) = h, (4) π₄(p) = l, π₄(d) = h. We concentrate on deterministic policies only to simplify the calculation without loss of generality, because if an optimal policy exits there exits also a deterministic optimal policy⁴⁶. We further focus here only on the first two policies, named the risky and the cautious policy, since the remaining two apply a high pressure action at the degraded state. This will trap the agent at this position for eternity without receiving any reward. The math on these policies is left to the interested reader.

In the following we derive the analytical expressions of the state values of these policies as functions of the parameters (δ, ρ, γ, r_l, r_h). From Eq. 1 and for γ < 1 one can derive the recursive relationship between state values, known as the Bellman Equation⁶⁹:

$$v_{\pi} \left( s \right) = \mathop {\sum}\limits_{s{\prime}} p\left( {s{\prime}|s,\pi \left( s \right)} \right)\left[ {\left( {1 - {\gamma} } \right){r}\left( {s,\,{\pi} \left( s \right),{s}{\prime}} \right) + {\gamma} {v}_{\pi} \left( {s{\prime}} \right)} \right]$$

(3)

with p(s′|s, π(s)) being the probability to enter state s′ given the agent has started in state s and applied action π(s).

Applied to our model the value for the prosperous state reads

$${v}_{\pi} \left( p \right) = \left\{ {\begin{array}{*{20}{l}} {\delta \gamma {v}_{\pi} \left( d \right) + \left( {1 - {\delta} } \right)\left[ {\left( {1 - \gamma } \right){r}_{\mathrm{h}} + \gamma {v}_{\pi} \left( p \right)} \right]} \hfill & {{\mathrm{for}}\;a = h} \hfill \\ {\left( {1 - \gamma } \right){r}_{\mathrm{l}} + \gamma {v}_{\pi} \left( p \right)} \hfill & {{\mathrm{for}}\;a = l} \hfill \end{array}} \right..$$

(4)

The value for the degraded state is given by

$$v_\pi \left( d \right) = \left\{ {\begin{array}{*{20}{l}} {\gamma v_\pi \left( d \right)} \hfill & {{\mathrm{for}}\;a = h} \hfill \\ {\left( {1 - \rho } \right)\gamma v_\pi \left( d \right) + \rho \gamma v_\pi \left( p \right)} \hfill & {{\mathrm{for}}\;a = l} \hfill \end{array}} \right..$$

(5)

To obtain the explicit state values for the risky policy (π_r(p) = h, π_r(d) = l) we solve the system of equations

$${v}_{{\pi} _{r}}\left( p \right) = \delta \gamma {v}_{{\pi} _{r}}\left( d \right) + \left( {1 - \delta } \right)\left[ {\left( {1 - \gamma } \right){r}_{\mathrm{h}} + \gamma {v}_{{\pi} _{r}}\left( p \right)} \right]$$

(6)

$$v_{\pi _r}\left( d \right) = \left( {1 - \rho } \right)\gamma {v}_{\pi _{r}}\left( d \right) + \rho \gamma {v}_{\pi _{r}}\left( p \right),$$

(7)

which yields

$$v_{\pi _r}\left( p \right) = r_{\mathrm{h}}\frac{{\left( {1 - \delta } \right)\left( {1 - \left( {1 - \rho } \right)\gamma } \right)}}{{1 - \left( {1 - \delta - \rho } \right)\gamma }}$$

(8)

$$v_{\pi _r}\left( d \right) = r_{\mathrm{h}}\frac{{\left( {1 - \delta } \right)\rho \gamma }}{{1 - \left( {1 - \delta - \rho } \right)\gamma }}.$$

(9)

To obtain the explicit state values for the cautious policy (π_c(p) = l, π_c(d) = l) we solve the system of equations

$$v_{{\pi} _{c}}\left( p \right) = \left( {1 - \gamma } \right){r}_{\text{l}} + \gamma {v}_{{\pi} _{c}}\left( p \right)$$

(10)

$$v_{\pi _c}\left( d \right) = \left( {1 - \rho } \right)\gamma v_{\pi _c}\left( d \right) + \rho \gamma v_{\pi _{c}}\left( p \right),$$

(11)

which yields

$$v_{\pi _c}\left( p \right) = r_{\mathrm{l}}$$

(12)

$$v_{\pi _c}\left( d \right) = \frac{{\rho \gamma r_{\mathrm{l}}}}{{1 - \left( {1 - \rho } \right)\gamma }}.$$

(13)

For γ = 1 we compute the values v_π (which are independent from the initial state for γ = 1) by multiplying the stationary state of the effective Markov chain with the reward vector ${\mathbf{r}}^\pi \, \in \,{\Bbb R}^{|S|}$ whose components read

$$r_s^\pi = \mathop {\sum}\limits_{s{\prime}} p\left( {s{\prime}|s,\,\pi \left( s \right)} \right)r\left( {s,\pi \left( s \right),s{\prime}} \right).$$

(14)

The components of the transition matrix P^π of the effective Markov chain read

$$P_{s{\prime}s}^\pi = p\left( {s{\prime}|\pi \left( s \right),s} \right).$$

(15)

The stationary state σ_π is the normalized eigenvector of the transition matrix with eigenvalue one. Hence,

$$v_{\pi} = \sigma _{\pi} \cdot {\mathbf{r}}^{\pi} .$$

(16)

Performing this calculation for risky and cautious policy explicitly yields consistent results with the calculation for 0 ≤ γ < 1 from above. For γ = 1 the value v_π can be obtained by simply inserting γ = 1 into Eqs. 8 and 9 for the risky policy and Eqs. 12 and 13 for the cautious policy.

Analytical expressions for paradigm policy classification

To derive the analytical expression of the hypersurface in parameter space that separates the regions where either the risky or the cautious policy is optimal we set $v_{\pi _{r}}\left( p \right)\mathop { = }\limits^{{\mathrm{set}}} v_{\pi _{c}}\left( p \right)$ (or equivalently $v_{\pi _r}\left( d \right)\mathop { = }\limits^{{\mathrm{set}}} v_{{\pi} _c}\left( d \right)$, since the parameter combination where a policy is optimal is independent from the state) and implicitly obtain

$$\tilde {r}_{\mathrm{h}} \cdot \left( 1 - \tilde \delta \right)\left( 1 - \tilde \gamma \left( 1 - \tilde \rho \right) \right) = \tilde {r}_{\mathrm{l}} \cdot \left( 1 - \tilde \gamma \left( 1 - \tilde \delta - \tilde \rho \right) \right).$$

(17)

To obtain the hypersurface that separates state s being acceptable from being not acceptable under policy π we apply the definition from Eq. 2: $v_\pi \left( s \right)\mathop { = }\limits^{{\mathrm{set}}} r_{{\mathrm{min}}}$. Hence, for the risky policy at the prosperous state we set $v_{\pi _r}\left( p \right)\mathop { = }\limits^{{\mathrm{set}}} r_{{\mathrm{min}}}$ and obtain implicitly

$${\tilde {r}}_{\mathrm{h}} \cdot \left( {1 - {\tilde \delta}} \right) \left({1 - {\tilde \gamma} \left( {1 - {\tilde \rho}} \right)} \right) = {\tilde {r}}_{\mathrm{min}} \cdot \left( {1 - {\tilde \gamma} \left( {1 - {\tilde \delta} - {\tilde \rho}} \right)} \right).$$

(18)

For the risky policy at the degraded state we set $v_{{\pi} _{r}}\left( d \right)\mathop { = }\limits^{{\mathrm{set}}} {r}_{{\mathrm{min}}}$ and obtain implicitly

$$\tilde {r}_{\mathrm{h}} \cdot \left( {1 - \tilde \delta } \right){\tilde \rho} {\tilde \gamma} = {\tilde {r}}_{{\mathrm{min}}} \cdot \left( {1 - {\tilde \gamma} \left( {1 - {\tilde \delta} - {\tilde \rho} } \right)} \right).$$

(19)

For the cautious policy at the prosperous state we set $v_{\pi _c}\left( p \right)\mathop { = }\limits^{{\mathrm{set}}} r_{{\mathrm{min}}}$ and obtain implicitly

$$\tilde {r}_{\mathrm{l}} = \tilde {r}_{{\mathrm{min}}}.$$

(20)

For the cautious policy at the degraded state we set $v_{\pi _c}\left( d \right)\mathop { = }\limits^{{\mathrm{set}}} r_{{\mathrm{min}}}$ and obtain implicitly

$$\tilde {r}_{\mathrm{l}} \cdot \tilde \rho \tilde \gamma = \tilde {r}_{{\mathrm{min}}} \cdot \left( {1 - \tilde \gamma \left( {1 - \tilde \rho } \right)} \right)$$

(21)

To get from acceptability to sustainability for the risky policy one has to logically combine Eqs. 18 and 19. The risky policy is sustainable only if both the prosperous and the degraded state are acceptable since it will visit both states recurrently. The safe policy is sustainable exactly where the prosperous state is acceptable since it will eventually end up and remain at the prosperous state. Supplementary Fig. 1 shows an example of the acceptability division of state-parameter space and the resulting sustainability division.

The division of the parameter space according the safe operating space paradigm is obvious from its definition. Only the cautious policy is a safe policy since it will eventually end up and remain in the prosperous, safe operating space state. The risky policy switches recurrently between the prosperous and the degraded which makes it, by definition, not safe.

Conversion of timescales to transition probabilities

Let p be the probability per time step that a system state will transition into another state. The average number of time steps the system will be in that state is given by 〈N〉 = (1 − p)/p. Inverting yields p = 1/(〈N〉 + 1). We map a model time step to a year. Thus, a collapse time scale of e.g., 50 years corresponds to a collapse probability of δ ≈ 0.02. Supplementary Tab. 1 shows the assumed transition timescales and corresponding transition probabilities.

Code availability

Python code for the reproduction of the reported results plus interactive versions of the figures can be downloaded from https://github.com/wbarfuss/Paradigms.

Data availability

Data sharing not applicable to this article as no datasets were stored on disk during the production of the figures (see Code availability).

References

Griggs, D. et al. Policy: sustainable development goals for people and planet. Nature 495, 305–307 (2013).
Article ADS PubMed CAS Google Scholar
Lenton, T. M. et al. Tipping elements in the Earth’s climate system. Proc. Natl Acad. Sci. 105, 1786–1793 (2008).
Article ADS PubMed PubMed Central MATH Google Scholar
Schellnhuber, H. J. Tipping elements in the earth system. Proc. Natl Acad. Sci. 106, 20561–20563 (2009).
Article ADS PubMed PubMed Central Google Scholar
Scheffer, M., Carpenter, S., Foley, J. A., Folke, C., Walker, B. Catastrophic shifts in ecosystems. Nature 413, 591–596 (2001).
Lade, S. J., Tavoni, A., Levin, S. A. & Schlüter, M. Regime shifts in a social-ecological system. Theor. Ecol. 6, 359–372 (2013).
Article Google Scholar
Donges, J. F. et al. Closing the loop: reconnecting human dynamics to earth system science. Anthr. Rev. 4, 151–157 (2017).
Article Google Scholar
Anderies, J. M., Rodriguez, A. A., Janssen, M. A. & Cifdaloz, O. Panaceas, uncertainty, and the robust control framework in sustainability science. Proc. Natl Acad. Sci. 104, 15194–15199 (2007).
Article ADS PubMed PubMed Central Google Scholar
Polasky, S., Carpenter, S. R., Folke, C. & Keeler, B. Decision-making under great uncertainty: environmental management in an era of global change. Trends Ecol. Evol. 26, 398–404 (2011).
Article PubMed Google Scholar
Irwin, E. G., Gopalakrishnan, S. & Randall, A. Welfare, wealth, and sustainability. Annu. Rev. Resour. Econ. 8, 77–98 (2016).
Article Google Scholar
Farmer, J. D., Hepburn, C., Mealy, P. & Teytelboym, A. A third wave in the economics of climate change. Environ. Resour. Econ. 62, 329–357 (2015).
Article Google Scholar
Crépin, A.-S., Biggs, R., Polasky, S., Troell, M. & de Zeeuw, A. Regime shifts and management. Ecol. Econ. 84, 15–22 (2012).
Article Google Scholar
Perman, R., Ma, Y., McGilvray, J. & Common, M. Natural resource and environmental economics. (Pearson Education, Essex, 2003).
Google Scholar
Weyant, J. Integrated assessment of climate change: state of the literature. J. Benefit-Cost. Anal. 5, 377–409 (2014).
Article Google Scholar
Stern, N. The economics of climate change. Am. Econ. Rev. 98, 1–37 (2008).
Article Google Scholar
Ackerman, F., DeCanio, S. J., Howarth, R. B. & Sheeran, K. Limitations of integrated assessment models of climate change. Clim. Change 95, 297–315 (2009).
Article CAS Google Scholar
Woodward, R. T. & Tomberlin, D. Practical precautionary resource management using robust optimization. Environ. Manag. 54, 828–839 (2014).
Article ADS Google Scholar
Martinet, V. & Doyen, L. Sustainability of an economy with an exhaustible resource: a viable control approach. Resour. Energy Econ. 29, 17–39 (2007).
Article Google Scholar
De Lara, M. & Doyen, L. Sustainable Management of Natural Resources: Mathematical Models and Methods. (Springer Science & Business Media, 2008).
Rougé, C., Mathias, J. -D. & Deffuant, G. Extending the viability theory framework of resilience to uncertain dynamics, and application to lake eutrophication. Ecol. Indic. 29, 420–433 (2013).
Article CAS Google Scholar
Chadès, I., et al. Optimization methods to solve adaptive management problems. Theoretical Ecology, 1–20 (2017).
Branke, J., Deb, K., Miettinen, K., Słowinski, R. Multi-objective Optimization: Interactive and Evolutionary Approaches. (Springer-Verlag Berlin Heidelberg, 2008).
Greco, S., Ehrgott, M. & Figueira, J. R. Multiple Criteria Decision Analysis. (Springer Science+Business Media, New York, 2005).
MATH Google Scholar
Ehrgott, M. Multicriteria Optimization. (Springer Science & Business Media 2006).
Altman, E. Constrained Markov Decision Processes, Vol. 7 (CRC Press, 1999).
Meadows, D. H., Goldsmith, E. & Meadows, P. The Limits of Growth, Vol. 381 (Earth Island Limited, London, 1972).
MATH Google Scholar
World Commission on Environment and Development. Our Common Future. Technical report (1987).
Pezzey, J. Sustainable development concepts. World Bank Environ. Pap. 1, 45 (1992).
Google Scholar
Pezzey, J. C. V. Sustainability Constraints versus “Optimality” versus Intertemporal Concern, and Axioms versus Data. Land Econ. 73, 448–466 (1997).
Article Google Scholar
Arrow, K. J., Dasgupta, P., Goulder, L. H., Mumford, K. J. & Oleson, K. Sustainability and the measurement of wealth. Environ. Dev. Econ. 17, 317–353 (2012).
Article Google Scholar
Fleurbaey, M. On sustainability and social welfare. J. Environ. Econ. Manag. 71, 34–53 (2015).
Article Google Scholar
Gerlagh, R. Generous sustainability. Ecol. Econ. 136, 94–100 (2017).
Article Google Scholar
Pezzey, J. C. V. One-sided sustainability tests with amenities, and changes in technology, trade and population. J. Environ. Econ. Manag. 48, 613–631 (2004).
Article MATH Google Scholar
Dasgupta, P. & Karl-Göran, M. The Economics of Non-convex Ecosystems, Vol. 4. (Springer Science & Business Media 2006).
Lontzek, T. S., Cai, Y., Judd, K. L. & Lenton, T. M. Stochastic integrated assessment of climate tipping points indicates the need for strict climate policy. Nat. Clim. Change 5, 441 (2015).
Article ADS Google Scholar
Cai, Y., Lenton, T. M. & Lontzek, T. S. Risk of multiple interacting tipping points should encourage rapid co 2 emission reduction. Nat. Clim. Change 6, 520 (2016).
Article ADS Google Scholar
Petschel-Held, Gerhard, Schellnhuber, Hans-Joachim, Bruckner, Thomas, Toth, FerencL. & Hasselmann, Klaus The tolerable windows approach: theoretical and methodological foundations. Clim. Change 41, 303–331 (1999).
Article CAS Google Scholar
Rockström, J. et al. A safe operating space for humanity. Nature 461, 472–475 (2009).
Article ADS PubMed CAS Google Scholar
Dearing, J. A. et al. Safe and just operating spaces for regional social-ecological systems. Glob. Environ. Change 28, 227–238 (2014).
Article Google Scholar
Carpenter, S. R., Brock, W. A., Folke, C., van Nes, E. H. & Scheffer, M. Allowing variance may enlarge the safe operating space for exploited ecosystems. Proc. Natl Acad. Sci. 112, 14384–14389 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Folke, C. et al. Resilience thinking: integrating resilience, adaptability and transformability. Ecol. Soc. 15, 20 (2010).
Article Google Scholar
Raffensperger, C. & Tickner, J. A. Protecting Public Health and the Environment: Implementing the Precautionary Principle. (Island Press, Wahington, DC, 1999).
Google Scholar
Fischer, J. et al. Integrating resilience thinking and optimisation for conservation. Trends Ecol. Evol. 24, 549–554 (2009).
Article PubMed Google Scholar
Karl-Göran, M. & Li, C.-Z. Measuring sustainability under regime shift uncertainty: a resilience pricing approach. Environ. Dev. Econ. 15, 707–719 (2010).
Article Google Scholar
Derissen, S., Quaas, M. F. & Baumgärtner, S. The relationship between resilience and sustainability of ecological-economic systems. Ecol. Econ. 70, 1121–1128 (2011).
Article Google Scholar
Bellman, R. A Markovian decision process. Indiana Univ. Math. J. 6, 679–684 (1957).
Article MathSciNet MATH Google Scholar
Puterman, M. L. Markov Decision Processes: Discrete Stochastic Dynamic Programming. (John Wiley and Sons, Inc, Hoboken, New Jersey, 2005).
MATH Google Scholar
Heitzig, J., Kittel, T., Donges, J. F. & Molkenthin, N. Topology of sustainable management of dynamical systems with desirable states: from defining planetary boundaries to safe operating spaces in the Earth system. Earth Syst. Dyn. 7, 21–50 (2016).
Article ADS Google Scholar
Neumayer, E. Weak Versus Strong Sustainability: Exploring the Limits of Two Opposing Paradigms. (Edward Elgar Publishing, 2003).
Steffen, W. et al. Planetary boundaries: guiding human development on a changing planet. Science 347, 1259855 (2015).
Article PubMed CAS Google Scholar
Polasky, S., Zeeuw, A. D. & Wagener, F. Optimal management with potential regime shifts. J. Environ. Econ. Manag. 62, 229–240 (2011).
Article Google Scholar
Raworth, K. A doughnut for the anthropocene: humanity’s compass in the 21st century. Lancet Planet. Health 1, e48–e49 (2017).
Article PubMed Google Scholar
Nordhaus, W. D. A review of the Stern review on the economics of climate change. J. Econ. Lit. 45, 686–702 (2007).
Article Google Scholar
Schellnhuber, H. J., Rahmstorf, S. & Winkelmann, R. Why the right climate target was agreed in paris. Nat. Clim. Change 6, 649–653 (2016).
Article ADS Google Scholar
Rockström, J. et al. A roadmap for rapid decarbonization. Science 355, 1269–1271 (2017).
Article ADS PubMed Google Scholar
Clark, P. U. et al. Consequences of twenty-first-century policy for multi-millennial climate and sea-level change. Nat. Clim. Change 6, 360 (2016).
Article ADS Google Scholar
Ganopolski, A., Winkelmann, R. & Schellnhuber, H. J. Critical insolation–co2 relation for diagnosing past and future glacial inception. Nature 529, 200–203 (2016).
Article ADS PubMed CAS Google Scholar
Moellmann, C. et al. Reorganization of a large marine ecosystem due to atmospheric and anthropogenic pressure: a discontinuous regime shift in the central baltic sea. Glob. Change Biol. 15, 1377–1393 (2009).
Article ADS Google Scholar
Worm, B. et al. Rebuilding global fisheries. Science 325, 578–585 (2009).
Article ADS PubMed CAS Google Scholar
Costello, C., Gaines, S. D. & Lynham, J. Can catch shares prevent fisheries collapse? Science 321, 1678–1681 (2008).
Article ADS PubMed CAS Google Scholar
Österblom, H. et al. Human-induced trophic cascades and ecological regime shifts in the baltic sea. Ecosystems 10, 877–889 (2007).
Article CAS Google Scholar
Hutchings, J. A. & Reynolds, J. D. Marine fish population collapses: consequences for recovery and extinction risk. AIBS Bull. 54, 297–309 (2004).
Google Scholar
Caddy, J. F. & Agnew, D. J. An overview of recent global experience with recovery plans for depleted marine resources and suggested guidelines for recovery planning. Rev. Fish. Biol. Fish. 14, 43 (2004).
Article Google Scholar
Blaikie, P. & Brookfield, H. Land Degradation and Society. (Routledge, 2015).
Kinzig, A.P., et al. Resilience and regime shifts: assessing cascading effects. Ecol. Soc. 11, 20 (2006).
Horrigan, L., Lawrence, R. S. & Walker, P. How sustainable agriculture can address the environmental and human health harms of industrial agriculture. Environ. Health Perspect. 110, 445 (2002).
Article PubMed PubMed Central Google Scholar
Donges, J. F. & Barfuss, W. From math to metaphors and back again: social-ecological resilience from a multi-agent-environment perspective. GAIA-Ecol. Perspect. Sci. Soc. 26, 182–190 (2017).
Google Scholar
Rocha, J., Yletyinen, J., Biggs, R., Blenckner, T. & Peterson, G. Marine regime shifts: drivers and impacts on ecosystems services. Philos. Trans. R. Soc. B 370, 20130273 (2015).
Article Google Scholar
Schellnhuber, H. -J. Earth system analysis and the second Copernican revolution. Nature 402, C19–C23 (1999).
Article CAS Google Scholar
Bellman, R. Dynamic Programming. (Princeton University Press, 1957).

Download references

Acknowledgements

This work was developed in the context of the COPAN project on Coevolutionary Pathways in the Earth system at the Potsdam Institute for Climate Impact Research. The authors are grateful for financial support from the Heinrich-Böll-Foundation, the Stordalen Foundation (via the Planetary Boundaries Research Network PB.net), the Earth League’s EarthDoc program, the Leibniz Association (project DOMINOES) and the Swedish Research Council Formas (Project Grant 2014-589). We thank David Collste, Jobst Heitzig, Antoine Levesque, Finn Müller-Hansen and Maja Schlüter for discussions and comments on the manuscript.

Author information

Authors and Affiliations

Potsdam Institute for Climate Impact Research, 14473, Potsdam, Germany
Wolfram Barfuss, Jonathan F. Donges & Jürgen Kurths
Department of Physics, Humboldt University, 12489, Berlin, Germany
Wolfram Barfuss & Jürgen Kurths
Stockholm Resilience Centre, Stockholm University, 11419, Stockholm, Sweden
Jonathan F. Donges & Steven J. Lade
Fenner School of Environment and Society, The Australian National University, Canberra, ACT, 2601, Australia
Steven J. Lade
Saratov State University, Saratov, 410012, Russia
Jürgen Kurths

Authors

Wolfram Barfuss
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan F. Donges
View author publications
You can also search for this author in PubMed Google Scholar
Steven J. Lade
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Kurths
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.B. designed and analyzed the model with assistance from J.F.D. and S.L. J.F.D and J.K. supervised the project. All authors wrote the manuscript.

Corresponding author

Correspondence to Wolfram Barfuss.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Barfuss, W., Donges, J.F., Lade, S.J. et al. When optimization for governing human-environment tipping elements is neither sustainable nor safe. Nat Commun 9, 2354 (2018). https://doi.org/10.1038/s41467-018-04738-z

Download citation

Received: 02 February 2018
Accepted: 16 May 2018
Published: 15 June 2018
DOI: https://doi.org/10.1038/s41467-018-04738-z

This article is cited by

Intrinsic fluctuations of reinforcement learning promote cooperation
- Wolfram Barfuss
- Janusz M. Meylahn
Scientific Reports (2023)
Responsibility Under Uncertainty: Which Climate Decisions Matter Most?
- Nicola Botta
- Nuria Brede
- Tim Richter
Environmental Modeling & Assessment (2023)
Coordination and equilibrium selection in games: the role of local effects
- Tomasz Raducha
- Maxi San Miguel
Scientific Reports (2022)
Dynamical systems as a level of cognitive analysis of multi-agent learning
- Wolfram Barfuss
Neural Computing and Applications (2022)
Landscape Engineering Impacts the Long-Term Stability of Agricultural Populations
- Jacob Freeman
- John M. Anderies
- Claudio Latorre
Human Ecology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.