A tighter constraint on Earth-system sensitivity from long-term temperature and carbon-cycle observations

Wong, Tony E.; Cui, Ying; Royer, Dana L.; Keller, Klaus

doi:10.1038/s41467-021-23543-9

Download PDF

Article
Open access
Published: 26 May 2021

A tighter constraint on Earth-system sensitivity from long-term temperature and carbon-cycle observations

Tony E. Wong ORCID: orcid.org/0000-0002-7304-3883¹,
Ying Cui²,
Dana L. Royer³ &
…
Klaus Keller^4,5

Nature Communications volume 12, Article number: 3173 (2021) Cite this article

5844 Accesses
9 Citations
63 Altmetric
Metrics details

Subjects

Abstract

The long-term temperature response to a given change in CO₂ forcing, or Earth-system sensitivity (ESS), is a key parameter quantifying our understanding about the relationship between changes in Earth’s radiative forcing and the resulting long-term Earth-system response. Current ESS estimates are subject to sizable uncertainties. Long-term carbon cycle models can provide a useful avenue to constrain ESS, but previous efforts either use rather informal statistical approaches or focus on discrete paleoevents. Here, we improve on previous ESS estimates by using a Bayesian approach to fuse deep-time CO₂ and temperature data over the last 420 Myrs with a long-term carbon cycle model. Our median ESS estimate of 3.4 °C (2.6-4.7 °C; 5-95% range) shows a narrower range than previous assessments. We show that weaker chemical weathering relative to the a priori model configuration via reduced weatherable land area yields better agreement with temperature records during the Cretaceous. Research into improving the understanding about these weathering mechanisms hence provides potentially powerful avenues to further constrain this fundamental Earth-system property.

Consistent multidecadal variability in global temperature reconstructions and simulations over the Common Era

Article 24 July 2019

Proxy evidence for state-dependence of climate sensitivity in the Eocene greenhouse

Article Open access 07 September 2020

Globally resolved surface temperatures since the Last Glacial Maximum

Article 10 November 2021

Introduction

Understanding the relationship between changes in atmospheric carbon dioxide (CO₂) concentration and global surface temperatures has been a scientific quest for more than a century¹. The current uncertainty surrounding this relationship poses considerable challenges for the design of climate change policies². Of particular interest is the equilibrium response of global mean surface temperature to a doubling of CO₂ relative to preindustrial conditions, termed the “equilibrium climate sensitivity”³ (ECS). The ECS is critical for mapping changes in radiative forcing, including CO₂ and other greenhouse gases, to changes in global temperature. ECS is based on “fast” feedback responses to changes in radiative forcing, including changes in water vapor, lapse rate, cloud cover, snow/sea-ice albedo, and the Planck feedback⁴. Even with detailed constraints from the instrumental period, ECS estimates based on the historic record alone are still subject to large uncertainties^5,6,7,8. Based on the understanding of feedback processes, historical climate and paleoclimate records, a recent summary by Sherwood et al.⁹ concluded that the most likely range (66% confidence) for the effective sensitivity (defined in terms of the 150-year temperature response to a quadrupling of CO₂ forcing in the context of their general circulation model experiments) is 2.6–3.9 °C. Similar to the ECS, the effective sensitivity does not include long-term feedbacks, such as ice sheets, vegetation, and carbon cycle (ref. ⁹ and references therein).

In contrast to the shorter-term ECS that responds to relatively fast feedback processes, consideration of longer-term responses offers a glimpse into the deep-time paleoclimate evolution of the sensitivity of the Earth-system temperature response to both fast and slow feedbacks. In particular, a deep-time perspective offers insight into the “Earth-system sensitivity” (ESS)—the long-term equilibrium surface temperature response to a given CO₂ forcing, including all Earth-system feedbacks¹⁰. Sherwood et al. estimate the ESS as their effective sensitivity multiplied by an inflation factor, (1 + f_ESS), where f_ESS is sampled from a normal distribution with mean value of 0.5 and standard deviation of 0.25 (refs. ^10,11). A growing body of evidence suggests covariations in CO₂ and temperature during the last 420 million years (Myr; ref. ¹²). This long-term record enables improved quantification of ESS and insights into factors affecting the climate response across a wide range of climate states, including both icehouse and greenhouse conditions^{10,13,14,15,16,17,18}. This wide range of states and variations in temperature and CO₂ is also important to help distinguish the long-term climate signal from the noise.

Previous studies estimate ESS over geological timescales using varying combinations of global climate models, long-term carbon-cycle models, and proxy data for temperature and atmospheric CO₂. Royer et al.¹⁹ combines a geochemical model and CO₂ proxies from the past 420 Myr, and concludes that ESS falls between 1.6 and 5.5 °C (95% confidence). In addition, during glacial periods, a given CO₂ forcing will lead to a stronger temperature change due to the land ice-albedo feedback. Thus, estimates of ESS that do not explicitly account for land-ice feedbacks will necessarily be higher than those that do. Arguments in (for example) Park and Royer¹⁴ and Hansen et al.¹⁸ support such a “glacial amplification” in ESS, giving 6 °C or more warming per doubling of CO₂. The former study uses model time steps of 10 Myr, so mechanisms such as orbital forcings, which operate on timescales of 10–100s of thousands of years, are averaged out and are not explicitly represented. Many studies suggest that ESS >1.5 °C is a general feature of the Phanerozoic^10,13,16,20, although these studies generally vary in the types of external forcings they consider and the confidence levels for the ranges they report. By assuming different sets of external feedbacks, forcings and (sea) surface temperatures, these previous studies report different kinds of Earth-system sensitivities⁴. It is therefore necessary to distinguish between various flavors of ESS^4,15. For example, the geochemical model from Royer et al.¹⁹ uses a form of ESS that computes the overall global mean surface temperature response by explicitly accounting for forcings from changes in CO₂, solar luminosity, and paleogeography. In the notation of Rohling et al.⁴, this ESS is based on the specific paleoclimate sensitivity S_{[CO2, geog, solar]}. Krissansen-Totton and Catling²¹ also account explicitly in their model for CO₂, solar, and paleogeographic forcing over the past 100 Myr, and compute a median ESS of 5.6 °C (3.7–7.5 °C 90% credible interval). By contrast, Anagnostou et al.²² account explicitly for CO₂, solar luminosity, paleogeography, and land ice, and find ESS estimates varying from ~5–7 °C 53 Myr ago to about 2 °C 30 Myr ago. Following the argument above, we expect that the inclusion of land-ice feedbacks leads the ESS estimate of ref. ²² (based on S_{[CO2, geog, solar]}) to be lower than that of ref. ²¹ (based on S_{[CO2, geog, solar, land ice]}).

In long-term carbon-cycle models, many uncertainties stem from how CO₂ proxy data can best be used to improve estimates of carbon-cycle model parameters^14,19. Specifically, the errors in proxy CO₂ data are often asymmetric, where it is typical for the upper error bound to be farther from the mean than the lower error bound^15,23. In addition, there is a complex interactive relationship among the model parameters and their combined effect on modeled CO₂ concentrations. Previous assessments do not fully account for these model parameter interactions¹⁵ or neglect the asymmetric error structure²¹. This raises the related questions of how these assumptions affect estimates of ESS, and which research has the greatest promise to reduce biases and constrain ESS, given this common model framework.

Here, we expand on previous work^14,15,19 by improving the uncertainty characterization of both proxy CO₂, surface temperature, and associated parameters in a commonly used long-term geochemical model. First, we consider interactions among model parameters via a Monte Carlo precalibration approach to account for uncertainties in the model parameters, and the surface temperature and CO₂ proxy data. We generate model ensembles that agree with CO₂ proxy data only, temperature reconstructions only, and both, by imposing constraints on the goodness-of-fit of the model simulations. This experimental setup allows us to characterize the ability of each source of information to better constrain estimates of model parameters, and to examine the correlations among model parameters and periods of bias in the model output. We demonstrate that improved constraint on the ESS model parameter ∆T_2x will result from both (i) improved constraint on the CO₂ and/or temperature hindcast and from (ii) using both CO₂ and temperature data to constrain estimates of ∆T_2x.

Results

The GEOCARBSULFvolc model

GEOCARBSULFvolc is a long-term carbon and sulfur cycle model that simulates atmospheric concentrations of CO₂ and O₂ based on mass and isotopic balance over the past 570 Myr. The GEOCARBSULFvolc model (henceforth, “GEOCARB”) and its previous incarnations^24,25 have been widely used in previous studies (e.g., refs. ^{14,15,19,26,27}), and includes a version of ESS where the only independent radiative forcings are CO₂, solar evolution, and changing geography. In the notation of refs. ^4,10, this ESS would be computed from the specific paleoclimate sensitivity, S_{[CO2, geog, solar]}. However, within GEOCARB and other such models (e.g., ref. ²¹), an ESS model parameter links CO₂ radiative forcing to the associated temperature response, but also accounts internally for other forcings in computing the total temperature response to radiative forcing. In GEOCARB, the ESS model parameter ∆T_2x corresponds to the long-term temperature change resulting from doubling CO₂ relative to preindustrial levels, accounting for changes in solar evolution and continental geography. GEOCARB assumes a linear increase in solar luminosity over time, corresponding to the parameter W_s, and uses results from general circulation model output to simulate the land temperature change, resulting solely from changes in paleogeography (GEOG; see Supplementary Fig. 1)^28,29. Thus, appropriate choices for the ESS parameter within GEOCARB are influenced by the balance of forcing between CO₂, solar luminosity, and paleogeographic changes. For brevity, we will use ∆T_2x when referring to ESS within the GEOCARB model, and reserve the term “ESS” for discussion of Earth-system sensitivity more generally.

While other long-term carbon-cycle model choices are available^21,30,31, we focus on the GEOCARB model due to its extensive use as an inverse modeling tool for leveraging CO₂ proxy data to constrain ESS and other geophysical uncertainties^14,15,19,27. The inverse approach generates model simulations using many different plausible values for ∆T_2x to determine which values for ∆T_2x are likely, given the (mis)match between the proxy data for CO₂ and temperature and model simulation output for these quantities. The model structure assumes that the atmosphere and ocean is a single system, where the weathering of organic-rich sediments and volcanic degassing deliver carbon to the atmosphere–ocean system, while carbon is lost via the burial of organic-rich sediments and carbonates^24,32. The shape of the modeled CO₂ curve is well-characterized, with high values (>1000 p.p.m.) between 540 and 400 Myr and ~250 Myr¹⁵, consistent with the lower solar luminosity in the early Phanerozoic²⁶.

There are 68 GEOCARB model parameters, of which 56 are constants and 12 are time series parameters. The constant parameters have well-defined prior distributions from previous work, and the time series parameters have central estimates and independent uncertainties defined for each time point¹⁵. Previous efforts to constrain the uncertainty in the GEOCARB model parameters relied on several important, but limiting, assumptions¹⁵. The prior distribution centers are held fixed in their Monte Carlo resampling strategy and only the widths are adjusted; if a parameter sample leads to model failure (e.g., through unphysical carbon or sulfur fluxes or unphysical O₂ or CO₂ concentrations), then the input range is considered unlikely and rejected. This resampling approach risks missing key parameter interactions, and propagating biases in the centers of parameters’ distributions.

Model configuration

Our adopted GEOCARB model¹⁵ is structurally identical to the model as presented in Royer et al.¹⁵. GEOCARB assumes a single ESS parameter (called ∆T_2x within the model and in previous studies^14,15,21) for the past 420 Myrs of non-glacial periods, and includes a parameter (GLAC) to amplify the ESS during the late Paleozoic (330–260 Myr) and late Cenozoic (40–0 Myr) glacial periods. During glacial periods, the effective ESS within GEOCARB is then GLAC × ∆T_2x. The two stable states, glacial and non-glacial, for ESS within GEOCARB provide a simple representation of the type II state dependence described by von der Heydt et al.³³. However, temporal variation in ESS within each of those stable states is not represented in GEOCARB^22,34. Some previous modeling efforts have assumed a single value of ESS for multiple climate states (e.g., glacial and non-glacial)^19,21. This will generally increase the uncertainty in the resulting scalar parameter estimate due to using a single parameter to represent a quantity that is changing over time.

We briefly discuss the temperature and carbon mass balance calculations within GEOCARB below, but further details on the GEOCARB model structure and parameterizations may be found in ref. ³⁵. The temperature in GEOCARB is computed as

$$T(t)-T(0)={\Delta {{T}}}_{2{{x}}}\frac{{\mathrm{ln}}({{\rm{RCO}}}_{2}(t))}{{\mathrm{ln}}(2)}-{W}_{{\rm{s}}}\frac{t}{570}+{\rm{GEOG}}(t),$$

(1)

where T(t)–T(0) denotes the global mean surface temperature at time t (Myr ago) relative to present (t = 0) and RCO₂(t) is the mass of atmospheric CO₂ at time t relative to present. The time series parameter GEOG describes the change in land temperature attributable to changes in paleogeography and the parameter W_s accounts for the linear trend in solar forcing over time. We follow ref. ³⁵ and assume a present (past 5 Myr) mean global surface temperature of T(0) = 15 °C. In GEOCARB, a mass balance governs changes in carbon over time in the surficial system, as given in Eq. 1 of ref. ¹⁵:

$$\frac{{\rm{d}}{M}_{{\rm{c}}}}{{\rm{dt}}}={F}_{{\rm{wc}}}+{F}_{{\rm{wg}}}+{F}_{{\rm{mc}}}+{F}_{{\rm{mg}}}-{F}_{{\rm{bc}}}-{F}_{{\rm{bg}}},$$

(2)

where M_c is the total carbon mass in the surficial system, F_wc represents the flux due to weathering of calcium and magnesium carbonates, F_wg represents the weathering flux of sedimentary organic carbon, F_mc represents the degassing flux from carbonates, F_mg represents the degassing from organic carbon, F_bc represents the burial of carbonate, and F_bg represents the burial of organic carbon. An associated carbon isotopic mass balance accompanies this as an additional constraint. Thus, the weathering processes (F_wc and F_wg) are parameterized to capture the average balance among these carbon sources and sinks, assuming a steady state balance over the course of a 10 Myr time step. For modeling the long-term carbon cycle, no perturbations around the steady state can persist for >500,000 years³⁶, including for alkalinity.

Parameter precalibration

The essence of our precalibration method to fuse the GEOCARB model with data is to sample a large number of model parameter sets from their prior distributions—these are the a priori parameter values, taken before any data are fused with the model. Then, we rule out any combinations of parameters that yield simulations that do not agree well with the CO₂ proxy or temperature data, given their uncertainties. What remains are the a posteriori ensembles of parameters, including ∆T_2x. We use Latin hypercube sampling to draw samples of the constant parameters and inverse Wishart sampling to account for uncertainty and autocorrelation in the time series parameters (see “Methods” and Supplementary Fig. 2). This method improves on previous GEOCARB-based ESS estimates by updating the centers of all parameters’ distributions.

In this setting, precalibration is preferable to formal calibration methods (e.g., Markov chain Monte Carlo) to avoid potentially overconstraining the system with a large and diverse calibration data set. For example, data points with relatively lower uncertainty can dominate the goodness-of-fit measure, leading to poor agreement with the other data points. Here, the CO₂ data uncertainties scale roughly with CO₂ concentration, so we employ precalibration to avoid a low-CO₂ bias (see Supplementary Fig. 3). We establish a maximal +/−1σ window around all of the time series data for each of temperature and CO₂. For the CO₂ data, we use the proxy compilation of Foster et al.²⁶, and for the temperature data, we use the Phanerozoic temperature compilation of Mills et al.¹². As a goodness-of-fit measure, we use the percentage of time steps in which a model simulation is outside the range of the precalibration windows around the data, termed “%outbound” following Mills et al.¹². We create ensembles of model simulations that match CO₂ data, temperature data, or both simultaneously by imposing limits of at most 50, 45, 40, 35, and 30% of time steps to be out-of-bounds (for a total of 15 main experiments). Unless otherwise stated, we present results for the 30 %outbound experiment, using both CO₂ and temperature data. Figure 1 gives a schematic depicting the precalibration workflow.

**Fig. 1: Schematic of the precalibration workflow.**

Inference for Earth-system sensitivity

We find an a posteriori ensemble median ∆T_2x of 3.4 °C per doubling of CO₂ (mean is 3.5 °C and 5–95% credible range is 2.6–4.7 °C; Fig. 2). Our estimates further improve constraint on the upper tail of the distribution for ∆T_2x from previous GEOCARB work: 2.8 °C (1.6–5.5 °C 95% confidence range) from Royer et al.¹⁹ and 3.8 °C (1.6–7.6 °C 5–95% probability range) from Park and Royer¹⁴. We find 0.1% probability associated with non-glacial ∆T_2x >6 °C, in contrast to 16% in Park and Royer¹⁴ (“PR2011” in Fig. 2).

**Fig. 2: A posteriori probability density for Earth-system sensitivity parameter (ΔT_2x, solid blue line).**

The fact that the ∆T_2x estimate of Krissansen-Totton and Catling²¹ (3.7–7.5 °C 5–95% probability range) is centered higher and has a wider uncertainty range than our study can be attributed largely to their selection of a single constant sensitivity value (see Supplementary Fig. 4). Our a posteriori estimates for the glacial scaling factor, GLAC, are centered at 2.1 (ensemble median; 5–95% credible range: 1.4–2.9), which is consistent with the central value of 2 used in previous work¹⁴. This leads to our estimated distribution for the net glacial period ∆T_2x to be centered at 7.1 °C (mean is 7.3 °C and 5–95% credible range is 4.4–11.0 °C). This result is centered slightly higher than the estimate of 6–8 °C from a previous GEOCARB analysis¹⁴, although still within the uncertainty ranges for that and other glacial period ∆T_2x estimates¹³. Our results thus reconcile the distribution of ESS between estimates that place more probability weight <2.5 °C (refs. ^4,14,19) and the high-end estimates of Krissansen-Totton and Catling²¹, whose posterior ∆T_2x values represent a mix of the glacial and non-glacial estimates presented here.

As we consider increasingly tighter bounds on acceptable CO₂ hindcasts without the use of temperature data, the corresponding constraint on ∆T_2x does not noticeably improve (Fig. 3). As we progressively tighten constraint on temperature hindcasts, however, the associated estimates of ∆T_2x become better-constrained: the uncertain ranges for ∆T_2x become narrower. This improvement is most prominent when CO₂ and temperature are used as complementary constraints (Fig. 3, bottom). In addition, the ensemble median estimate of ∆T_2x increases as constraint on paleo global mean surface temperature improves (Fig. 3, middle). Thus, two important related conclusions emerge: (i) temperature provides an important constraint on ∆T_2x, in addition to CO₂, and (ii) improved estimates of paleotemperatures likely lead to tighter estimates of ∆T_2x. These results highlight the importance of temperature data, in order to improve estimates of ESS more generally.

**Fig. 3: Medians and 50 and 90% credible intervals of the ESS model parameter, ΔT_2x.**

Constraint of paleo CO₂ evolution

We find that the assimilation of the CO₂ and temperature proxy data provides a tight constraint on the evolution of modeled paleoclimate CO₂ and surface temperature (Fig. 4). As expected, there is notable improvement in the simulation of paleoclimate CO₂ concentration when both temperature and CO₂ data are used for precalibration, as compared to when only temperature data are used (Fig. 4a). When we use only temperature data to constrain the model simulations, 10% of the 10,000 ensemble members are in agreement with the proxy CO₂ compilation at a %outbound level of 25% or better. By including CO₂ data in addition to temperature data, the number of simulations that agree at the 25 %outbound level or better improves to 85%. We focus on the 25 %outbound error level here because that is roughly the lowest error magnitude reported by Mills et al.¹² (c.f. Fig. 11 in that work). We also observe dramatic improvement in the temperature simulation: without temperature data, only 0.14% of the 10,000 ensemble members have an error <25 %outbound; by including temperature data in addition to CO₂ data, 3.8% of the ensemble members attain error margins <25 %outbound in temperature. While 3.8% seems like a low proportion of success, we note that (i) 25 %outbound in temperature is comparable to the best error margins for the tuned simulations of Mills et al.¹², and (ii) this constitutes an order of magnitude improvement relative to the model simulations that do not employ temperature data.

**Fig. 4: Model hindcast, using both CO₂ and temperature data, for precalibration and a %outbound threshold of 30% (shaded regions).**

Controls on Cretaceous temperature biases

Despite this improvement in the match to paleotemperatures, it is still striking that it is so rare to attain error margins that are <25 %outbound for temperature. These results, taken together with the results from the work of Mills et al.¹², who also found it difficult to further improve on the temperature simulation, highlight the importance of examining the controls on paleotemperature within the GEOCARB model structure. Specifically, during the early Cretaceous (~100 Myr ago), both our results and those of Mills et al.¹² display a substantial cool bias in temperature relative to the proxies.

In light of these biases, we perform an additional sensitivity experiment to investigate the controls on early Cretaceous (140–90 Myr ago) temperature, using the GEOCARB model. First, the point of this exercise is to examine the relationship between Cretaceous temperatures and the model parameters (in particular, the ESS parameter ∆T_2x), so we relax the %outbound threshold from 30 to 50%. This change allows more variation in the model’s temperature simulations. Later, after making further changes to improve the goodness-of-fit in the Cretaceous temperature simulations, we tighten the error margin back to 30%, to show that the GEOCARB model is indeed quite capable of matching well the Cretaceous temperature record. Our initial sensitivity experiment is similar to the 50 %outbound experiment from our main set of simulations, where the ensemble for analysis consists only of simulations that match the CO₂ temperature data windows in at least 50% of the time steps. In our new experiment, however, we retain only those simulations that pass through the temperature data window at 90 Myr ago. This time step was chosen because it corresponds to the peak in the temperature time series (Fig. 4b, gray-shaded region).

The Cretaceous-matching calibration experiment leads to an increase in the estimated distribution for ΔT_2x by ~0.2 °C relative to the original results for the 50 %outbound experiment (median of 3.6 °C as compared to 3.4 °C in the original 50 %outbound experiment). We examine the distributions of model input parameters for the Cretaceous-matching experiment and find no substantial changes in any of the 56 constant parameters. However, several of the time series parameters’ distributions change substantially. Specifically, we find that changes were required in the time series for the land area relative to present (f_A), global river runoff relative to present (f_D), the response of temperature change on river runoff (RT), and the fraction of land area that undergoes chemical weathering relative to present (f_AW/f_A). Not surprisingly, the main changes to these time series parameters occur primarily in the 90 Myr time step (see Supplementary Fig. 5). In order to match the Cretaceous temperatures during that time, we observe slight decreases in f_A, f_D, and RT 90 Myr ago. However, we observe a sizable decrease in f_AW/f_A (the weatherable land surface area), which is not well-supported by paleoclimate modeling studies^28,29.

To remove the effect of arguably unphysical parameter choices, we generate a new set of 10,000 simulations that all match the Cretaceous temperature 90 Myr ago. We sample the time series parameters by changing the centers of their multivariate normal distributions to match the mean time series shown in Supplementary Fig. 5 (dashed lines). We revert to using the 30 %outbound threshold, in order to assess the degree to which our best ESS estimates (the 30 %outbound experiments) are influenced by biases in the Cretaceous temperatures, and to improve these estimates by accounting for both the Cretaceous temperature bias and the plausibility of forcing parameter values. By restricting our set of simulations to only those in which the f_AW/f_A time series does not stray too far from its original central value, our updated set of Cretaceous-matching simulations has a median ΔT_2x of 3.3 °C, as compared to 3.4 °C in the original set of experiments. The 5–95% probability range also shifts ~0.1–0.2 °C cooler at 2.5–4.5 °C, as compared to 2.6–4.7 °C in the original 30 %outbound experiments (Fig. 5). From the fact that the distribution of estimated ΔT_2x changes by <0.2 °C, we conclude that our estimates of ESS are not unduly influenced by biases in the temperature simulation. Further, we conclude that GEOCARB is indeed capable of matching the temperature data, although these results highlight that sampling via brute force Monte Carlo requires a very large number of samples and some statistical care is needed, in order to bring the modeled and proxy temperatures into better agreement.

**Fig. 5: A posteriori probability density for Earth-system sensitivity parameter (ΔT_2x).**

Discussion

We make a number of improvements relative to previous work using the GEOCARBSULFvolc model^14,19, which reduce the ESS uncertainty compared to these previous studies¹⁴. This change can be explained by our improved calibration approach and our use of temperature data in addition to CO₂. Specifically, we find that a constraint on paleotemperature is critical for tightening our estimates of the GEOCARB ESS parameter, ΔT_2x; reducing the uncertainty surrounding paleo CO₂ concentrations on its own is not sufficient. In addition, we include a larger CO₂ proxy data record²⁶ and conduct a set of sensitivity experiments to analyze the parametric controls on simulated CO₂ concentrations and global mean surface temperatures. Our results refine the characterization of the Earth-system surface temperature response to changes in atmospheric CO₂ concentrations and can provide guidance on where to focus future research to better understand and quantify this relationship.

We adopt a well-studied, state-of-the-art, yet still relatively simple model. This model simplicity provides the advantages of transparency and the ability to perform careful and exhaustive uncertainty and sensitivity analyses³⁷. These advantages come, however, with several caveats that point to fruitful research directions. One key caveat stems from the fact that GEOCARB is a coarse-resolution and highly parameterized model with a long (10 Myr) time step and many (68) parameters (including 12 time series). A second related caveat arises from the still highly stylized representation of feedbacks and processes that is characteristic of such models (e.g., refs. ^21,30). As previously discussed (e.g., refs. ^12,22,33), the current assumption in the model of using a constant ΔT_2x for each of the glacial and non-glacial stable climate states risks missing processes leading to gradual changes in ΔT_2x within one of the larger stable climate states. The work of ref. ¹² further points to the potential importance of capturing this type I state dependence in ΔT_2x, because their results indicate an increasing trend in ΔT_2x beginning ~130 Myr ago. In the GEOCARB model, however, the ΔT_2x ESS parameter is assumed to be constant at its non-glacial value from 260 to 40 Myr ago, then shifts immediately to its glacial value from 40 to 0 Myr ago. We evaluate the impacts of this type I state dependence in an experiment, where we linearly increase ΔT_2x from its non-glacial value 130 Myr ago to its glacial value 40 Myr ago; the parameter remains constant at its glacial value from 40 to 0 Myr ago. This linear change in ΔT_2x (as opposed to the step function transitions in the base-case version of the model) has little effect on the temperature hindcast (Supplementary Fig. 6). This simple experiment, of course, scratches only the surface of the challenge to represent type I state dependence for ESS. This result suggests, however, that a simple refinement of type I state dependency does not substantially impact our results.

The assumed time series of forcing parameters may also introduce biases. For example, uncertainty in paleogeographical changes, such as the opening of the Drake Passage, while not explicitly represented in the GEOCARB inputs or processes, indeed contributes to uncertainty in such parameters as GEOG (the temperature change resulting from changes in paleogeography, assuming fixed CO₂ and solar luminosity). In addition, GEOCARB does not explicitly account for non-CO₂ greenhouse gases or aerosols. This limitation of GEOCARB and other similar models (e.g., ref. ²¹) may risk overestimating ΔT_2x by assuming that all of the observed temperature change is attributable to the CO₂ forcing (along with paleogeography and solar luminosity in the case of GEOCARB). However, our experiment examining the Cretaceous cool temperature bias suggests that our estimates of ESS are robust to these variations associated with improving the Cretaceous temperatures.

Our use of a precalibration approach to avoid overfitting data points with low-CO₂ concentrations minimizes the low-CO₂ bias found throughout the Mesozoic Era characteristic of previous GEOCARB analyses^14,15. Indeed, when we fit a mixture model distribution to the CO₂ proxy data, this distribution reveals strong multimodality in the CO₂ proxy record (see Supplementary Fig. 7). This multimodality is a likely culprit for the low-CO₂ bias observed in previous work^14,15, as formal calibration procedures (e.g., Markov chain Monte Carlo²¹) may improve the model fit to the data by tuning the model to better represent modes in the data that have narrower uncertainty ranges at the expense of adequately representing data points with higher uncertainties (Supplementary Fig. 7a).

We find that the efficiency of chemical weathering, as modulated by weatherable land surface area and riverine discharge to oceans, offers an avenue to improve the representation of paleotemperature in GEOCARB. Given the important role of temperature in obtaining better-constrained estimates of ΔT_2x, this highlights the importance of these weathering mechanisms for constraining ESS, thereby improving our understanding of the relationship between atmospheric CO₂ concentrations and changes in Earth’s climate.

Methods

Parameter precalibration

We use the parameter means and uncertainty ranges given by Park and Royer¹⁴. There are 68 parameters in total: 56 constant parameters and 12 time series parameters. The time series parameters include isotopic ratios for strontium (⁸⁷Sr/⁸⁶Sr, to track the weathering fraction of volcanic rocks), carbon and sulfur isotope ratios (δ¹³C and δ³⁴S, to track burial, degassing, and weathering fluxes); paleogeographical factors (including continental relief, total land area, land area susceptible to weathering, land area covered by carbonates, river runoff, and the effect of paleogeographical changes on temperature); and degassing and seafloor spreading. The parameters are described along with their prior and posterior ranges in the Supplemental Material accompanying this work, and in much greater detail in Royer et al.¹⁵. The essence of any Bayesian calibration scheme is to update our a priori beliefs about probable parameter values in light of the available data. Our a priori beliefs about the parameters’ probable values and their uncertainties are characterized by assigning the parameters prior distributions. The constant parameters are assigned Gaussian prior distributions, with the exception of the Earth-system sensitivity parameter, ΔT_2x, which we assign a log-normal prior distribution¹⁴. Each of the time series parameters takes on distinct values at each of the 58 model time steps. Following previous work, we assume the model and forcing time series parameters are in steady state between model time steps¹⁴. Each time series parameter is sampled from a 58-dimensional (number of time steps) multivariate normal distribution, whose mean is taken to match the central estimates from previous work¹⁵. The covariance matrix for this multivariate normal distribution is sampled from an inverse Wishart distribution. We choose the degrees of freedom for the inverse Wishart distributions such that the widths of the prior distributions match those from Royer et al.¹⁵. We update the time series for seafloor spreading rate (f_SR) to match the more recent work of Domeier and Torsvik³⁸, and evaluate the sensitivity of our results to this improvement in a set of supplemental experiments (see Supplementary Fig. 1). In our adopted GEOCARB model, we have fixed an error that was noted in previous GEOCARB versions²⁷, wherein the forcing time series for the fraction of land area that undergoes chemical weathering relative to present (the parameter f_AW/f_A) was previously not normalized to 1 relative to the final model time step (which roughly represents present-day conditions).

Model–data fusion

Using the CO₂ proxy data set as in Foster et al.²⁶, containing 1215 proxy data points, we first discard two data points with unphysical negative CO₂ concentration values. For each model time step (10 Myr), we construct a precalibration window as follows (see Supplementary Fig. 8). We pool all data points within 5 Myr of the given time step’s center. We compute the upper and lower 1σ bounds on each of the data points within the given time step. From the set of upper 1σ bounds, we take their maximum as the upper limit of the precalibration window for this time step. Similarly, we use the minimum of the data points’ lower 1σ bounds for the lower bound for each of the windows. Any time steps that have no CO₂ proxy data points within them are assigned a window of 0–50,000 p.p.m.v. CO₂ (ref. ¹⁵). For paleoclimate global mean surface temperature reconstructions, we use the reconstruction of Mills et al.¹². The gray-shaded regions in Fig. 4 correspond to the time series of precalibration windows. We measure a model simulation’s goodness-of-fit to the proxy data using the percentage of time steps, in which the model hindcast time series is outside of the precalibration windows around the data, termed “%outbound” following Mills et al.¹². We use thresholds of %outbound varying from 30 to 50%, in order to evaluate the impacts of improved fit to the data. As examples, a %outbound threshold of 100% amounts to sampling from the prior distributions, and a 0 %outbound threshold requires the model simulations to go through all of the precalibration windows. For each of the %outbound thresholds between 30 and 50% (in increments of 5%), we generate model ensembles that agree with CO₂ proxy data only, with temperature reconstructions only, and both data sources.

Parameter sampling

We use a Latin hypercube approach to sample from the prior distributions of the model parameters, and use the precalibration windowing procedure described above to cull the prior samples down to only those that match the data (temperature, CO₂ or both) to within the desired %outbound threshold. We use an initial sample size of 2 × 10⁷ parameter sets, but cease sampling once we achieve at least 10,000 samples that are within the %outbound threshold for the given experiment. Experiments adjusting the final sample size confirmed that our a posteriori estimates of ΔT_2x are insensitive to changes in sample size beyond ~1000 samples (see Supplementary Fig. 9).

In our experiment examining the Cretaceous temperature bias, we sample the time series parameters by changing the centers of their multivariate normal distributions to the a posteriori means from a set of simulations that are forced to agree with the temperature data at the 90 Myr ago time step. We retain only the plausible simulations in our experiment by removing any simulations where the value for the f_AW/f_A time series at 90 Myr ago was more than one standard deviation away from its original central value. This leaves 2139 simulations out of the original 10,000.

Data availability

All input data sets are provided with the model codes and are freely available from https://doi.org/10.5281/zenodo.4562996. Model output results files used for analysis are freely available from https://doi.org/10.5281/zenodo.4563019. All are provided under the GNU general public license.

Code availability

All model codes and analysis codes used for analysis are freely available from https://doi.org/10.5281/zenodo.4562996, and are distributed under the GNU general public license. Large model output data sets are linked in the “Data availability” section.

References

Arrhenius, S. On the influence of carbonic acid in the air upon the temperature of the ground. Lond. Edinb. Dublin Philos. Mag. J. Sci. 41, 237–276 (1896).
Article CAS Google Scholar
de Coninck, H. et al. in Global Warming of 1.5 °C an IPCC Special Report on the Impacts of Global Warming of 1.5 °C Above Pre-industrial Levels and Related Global Greenhouse Gas Emission Pathways, in the Context of Strengthening the Global Response to the Threat Of Climate Change (ed. Masson-Delmotte, V.) (Intergovernmental Panel on Climate Change, 2018).
Charney, J. G. et al. Carbon Dioxide and Climate: a Scientific Assessment (The National Academies Press, 1979).
Rohling, E. J. et al. Making sense of palaeoclimate sensitivity. Nature 491, 683–691 (2012).
Article ADS CAS Google Scholar
Rohling, E. J. et al. Comparing climate sensitivity, past and present. Ann. Rev. Mar. Sci. 10, 261–288 (2018).
Article Google Scholar
Olson, R. et al. A climate sensitivity estimate using Bayesian fusion of instrumental observations and an Earth System model. J. Geophys. Res. Atmos. 117, 1–11 (2012).
Article Google Scholar
Forest, C. E., Stone, P. H., Sokolov, A. P., Allen, M. R. & Webster, M. D. Quantifying uncertainties in climate system properties with the use of recent climate observations. Science 295, 113–117 (2002).
Article ADS CAS Google Scholar
Knutti, R., Rugenstein, M. A. A. & Hegerl, G. C. Beyond equilibrium climate sensitivity. Nat. Geosci. 10, 727–736 (2017).
Sherwood, S. et al. An assessment of Earth’s climate sensitivity using multiple lines of evidence. Rev. Geophys. 58, e2019RG000678 (2020).
Lunt, D. J. et al. Earth system sensitivity inferred from Pliocene modelling and data. Nat. Geosci. 3, 60–64 (2010).
Article ADS CAS Google Scholar
Haywood, A. M. et al. Large-scale features of Pliocene climate: results from the Pliocene Model Intercomparison Project. Clim. Past 9, 191-209 (2013).
Mills, B. J. W. et al. Modelling the long-term carbon cycle, atmospheric CO₂, and Earth surface temperature from late Neoproterozoic to present day. Gondwana Res. 67, 172–186 (2019).
Article ADS CAS Google Scholar
Pagani, M., Liu, Z., Lariviere, J. & Ravelo, A. C. High Earth-system climate sensitivity determined from Pliocene carbon dioxide concentrations. Nat. Geosci. 3, 27–30 (2010).
Article ADS CAS Google Scholar
Park, J. & Royer, D. L. Geologic constraints on the glacial amplification of Phanerozoic climate sensitivity. Am. J. Sci. 311, 1–26 (2011).
Article ADS CAS Google Scholar
Royer, D. L., Donnadieu, Y., Park, J., Kowalczyk, J. & Godderis, Y. Error analysis of CO₂ and O₂ estimates from the long-term geochemical model GEOCARBSULF. Am. J. Sci. 314, 1259–1283 (2014).
Article ADS CAS Google Scholar
Martínez-Botí, M. A. et al. Plio-Pleistocene climate sensitivity evaluated using high-resolution CO₂ records. Nature 518, 49–54 (2015).
Article ADS Google Scholar
Royer, D. L. Climate sensitivity in the geologic past. Annu. Rev. Earth Planet. Sci. 44, 277–293 (2016).
Article ADS CAS Google Scholar
Hansen, J. et al. Target atmospheric CO₂: where should humanity aim? Open Atmos. Sci. J. 2, 217–231 (2008).
Article ADS CAS Google Scholar
Royer, D. L., Berner, R. A. & Park, J. Climate sensitivity constrained by CO₂ concentrations over the past 420 million years. Nature 446, 530–532 (2007).
Article ADS CAS Google Scholar
Jones, T. D. et al. A Palaeogene perspective on climate sensitivity and methane hydrate instability. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 368, 2395–2415 (2010).
Article ADS CAS Google Scholar
Krissansen-Totton, J. & Catling, D. C. Constraining climate sensitivity and continental versus seafloor weathering using an inverse geological carbon cycle model. Nat. Commun. 8, 15423 (2017).
Article ADS CAS Google Scholar
Anagnostou, E. et al. Proxy evidence for state-dependence of climate sensitivity in the Eocene greenhouse. Nat. Commun. 11, 4436 (2020).
Franks, S. J., Weber, J. J. & Aitken, S. N. Evolutionary and plastic responses to climate change in terrestrial plant populations. Evol. Appl. 7, 123–139 (2014).
Article Google Scholar
Berner, R. A. The phanerozoic carbon cycle: CO₂ and O₂. Am. J. Sci. 306, 774–776 (2006).
ADS Google Scholar
Berner, R. A. Addendum to ‘Inclusion of the weathering of volcanic rocks in the GEOCARBSULF model’: (R. A. Berner, 2006, V. 306, p. 295-302). Am. J. Sci. 308, 100–103 (2008).
Foster, G. L., Royer, D. L. & Lunt, D. J. Future climate forcing potentially without precedent in the last 420 million years. Nat. Commun. 8, 14845 (2017).
Article ADS CAS Google Scholar
Krause, A. J. et al. Stepwise oxygenation of the Paleozoic atmosphere. Nat. Commun. 9, 4081 (2018).
Article ADS Google Scholar
Goddéris, Y., Donnadieu, Y., Lefebvre, V., Le Hir, G. & Nardin, E. Tectonic control of continental weathering, atmospheric CO₂, and climate over Phanerozoic times. Comptes Rendus Geosci. 344, 652–662 (2012).
Goddéris, Y., Donnadieu, Y., Le Hir, G., Lefebvre, V. & Nardin, E. The role of palaeogeography in the Phanerozoic history of atmospheric CO₂ and climate. Earth-Sci. Rev. 128, 122–138 (2014).
Lenton, T. M., Daines, S. J. & Mills, B. J. W. COPSE reloaded: an improved model of biogeochemical cycling over Phanerozoic time. Earth-Sci. Rev. 178, 1–28 (2018).
Article ADS CAS Google Scholar
Arvidson, R. S., Mackenzie, F. T. & Guidry, M. W. Geologic history of seawater: A MAGic approach to carbon chemistry and ocean ventilation. Chem. Geol. 362, 287–304 (2013).
Article ADS CAS Google Scholar
Berner, R. A. GEOCARBSULF: a combined model for Phanerozoic atmospheric O₂ and CO₂. Geochim. Cosmochim. Acta 70, 5653–5664 (2006).
Article ADS CAS Google Scholar
von der Heydt, A. S. et al. Lessons on climate sensitivity from past climate changes. Curr. Clim. Chang. Rep. 2, 148–158 (2016).
Article Google Scholar
Tierney, J. E. et al. Past climates inform our future. Science 370, eaay3701 (2020).
Article CAS Google Scholar
Berner, R. A. The Phanerozoic Carbon Cycle: CO₂ and O₂ (Oxford University Press, 2004).
Sundquist, E. T. Steady- and non-steady-state carbonate-silicate controls on atmospheric CO₂. Quat. Sci. Rev. 10, 283–296 (1991).
Article ADS Google Scholar
Helgeson, C., Srikrishnan, V., Keller, K. & Tuana, N. Why simpler computer simulation models can be epistemically better for informing decisions. Philos. Sci. 88, 213–233 (2020).
Domeier, M. & Torsvik, T. H. Full-plate modelling in pre-Jurassic time. Geol. Mag. 156, 261–280 (2019).
Article ADS Google Scholar

Download references

Acknowledgements

This work was co-supported by the National Science Foundation through the Network for Sustainable Climate Risk Management (SCRiM) under NSF cooperative agreement GEO-1240507, the Penn State Center for Climate Risk Management, and the RIT College of Science Dean’s Research Initiation Grants program. Y.C. thanks support from NSF award #1603051, the National Science Foundation of China (Grant #41888101) and travel support from RCN NSF award #OCE-16-36005 to Bärbel Hönisch and Pratigya Polissar. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding entities. We thank Nathan Urban, Irene Schaperdoth, and Benjamin Mills for their contributions.

Author information

Authors and Affiliations

School of Mathematical Sciences, Rochester Institute of Technology, Rochester, NY, USA
Tony E. Wong
Department of Earth and Environmental Studies, Montclair State University, Montclair, NJ, USA
Ying Cui
Department of Earth and Environmental Sciences, Wesleyan University, Middletown, CT, USA
Dana L. Royer
Department of Geosciences, The Pennsylvania State University, University Park, PA, USA
Klaus Keller
Earth and Environmental Systems Institute, The Pennsylvania State University, University Park, PA, USA
Klaus Keller

Authors

Tony E. Wong
View author publications
You can also search for this author in PubMed Google Scholar
Ying Cui
View author publications
You can also search for this author in PubMed Google Scholar
Dana L. Royer
View author publications
You can also search for this author in PubMed Google Scholar
Klaus Keller
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.C., K.K., D.L.R., and T.E.W. contributed to the study design; T.E.W. carried out the experiments; T.E.W. and Y.C. wrote the first draft of the manuscript; and Y.C., K.K., D.L.R., and T.E.W. contributed to the final version of the manuscript.

Corresponding authors

Correspondence to Tony E. Wong or Ying Cui.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Eelco J Rohling and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wong, T.E., Cui, Y., Royer, D.L. et al. A tighter constraint on Earth-system sensitivity from long-term temperature and carbon-cycle observations. Nat Commun 12, 3173 (2021). https://doi.org/10.1038/s41467-021-23543-9

Download citation

Received: 19 November 2019
Accepted: 30 April 2021
Published: 26 May 2021
DOI: https://doi.org/10.1038/s41467-021-23543-9

This article is cited by

Investigating monthly geopotential height changes and mid-latitude Northern Hemisphere westerlies
- Hossein Asakereh
- Arman Jahedi
- Abdollah Faraji
Theoretical and Applied Climatology (2024)
Compression complexity with ordinal patterns for robust causal inference in irregularly sampled time series
- Aditi Kathpalia
- Pouya Manshour
- Milan Paluš
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

The GEOCARBSULFvolc model

Model configuration

Parameter precalibration

Inference for Earth-system sensitivity

Constraint of paleo CO2 evolution

Controls on Cretaceous temperature biases

Discussion

Methods

Parameter precalibration

Model–data fusion

Parameter sampling

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links

Constraint of paleo CO₂ evolution