Main

Soil carbon–climate feedbacks represent a major uncertainty in the response of the terrestrial biosphere to climate change1. This uncertainty stems, in part, from poorly constrained relationships between the temperature sensitivities of the decomposition and stabilization processes of soil organic carbon (C), which are key parameters in soil biogeochemical models2,3. Whereas many empirical and modelling studies have explored the effects of temperature on bulk soil C stocks and turnover rates at ecosystem to global scales4,5,6,7, they largely ignore heterogeneity in soil organic matter. This is a key oversight given that underlying soil organic matter fractions or ‘pools’ (for example, particulate or mineral-associated organic matter) can exhibit different responses to warming in laboratory- and field-based manipulations8,9,10,11. Understanding and quantifying the temperature sensitivity of these underlying pools at the global scale is thus critical for accurately predicting emergent feedbacks and soil organic C vulnerabilities to a changing climate.

Warmer soil temperatures accelerate the rate of organic matter decomposition, often leading to losses in bulk soil C stocks3,5,8,12. However, the potential decomposability of organic matter in a given soil can depend on its association with minerals11,13,14. Mineral-associated organic matter consists of small plant- or microbial-derived monomers and biopolymers that are chemically or physically bound to clay- and silt-sized minerals, whereas particulate organic matter is largely comprised of coarser, partially decomposed fragments of plant material15,16. Mineral associations can limit access of microbial constituents and their enzymes to organic matter, and as a result, mineral-associated C is often older than particulate C16,17,18. In soil biogeochemical models, these pools can be represented explicitly19,20,21,22 or implicitly22,23,24 (see ‘Model pool interpretability’ in the Methods), with implications for their transient dynamics and long-term response to novel conditions. For consistency across the data and models, we refer to mineral-associated C (and slowest-cycling pools) as ‘protected’ and particulate C (and all other model pools) as ‘unprotected’, while acknowledging that the bioavailability and persistence of each pool is more complex in practice25. Importantly, much uncertainty remains in the responses of protected and unprotected soil C to warming across broad spatial scales9, hindering model benchmarking efforts.

Evaluating the transient dynamics of soil biogeochemical models to warming remains a challenge, in part because observations of soil C at multi-decadal to centennial timescales are limited26,27. Laboratory incubations and field studies demonstrate that decomposition rates increase, and C stocks often decrease, with warming5,8,11. However, at multi-decadal timescales, acclimatory responses in microbial activity may limit the effect of warming on decomposition rates26,28. Furthermore, many experiments focus on the response of bulk soil C stocks, and neither measure nor report the relative responses of underlying soil C fractions. Given these data limitations, an alternative approach is to benchmark global models using spatial gradients4,29,30—although this approach is insufficient for constraining transient dynamics (that is, space-for-time assumptions)31,32, climate gradients do reflect long-term climatological temperature sensitivities that should, in principle, be captured by soil C models.

Here we assess the climatological temperature sensitivity of bulk soil C stocks, and the role of mineral-associated and particulate soil C pools in driving this emergent property in data and models. We first leverage a globally gridded data product of mineral-associated and particulate soil C (derived from an observational synthesis of soil fractions33) to quantify the distribution of soil organic C between these two pools and their respective climatological temperature sensitivities at the global scale. We then evaluate the distributions and temperature sensitivities of protected (mineral-associated-like) and unprotected (particulate-like) soil C stocks across an ensemble of 12 global models—namely, nine Earth system models (ESMs) from the Coupled Model Intercomparison Project Phase 6 (CMIP6)34 and three offline land models23, including two microbially explicit soil C models (Supplementary Tables 13). Although there are important considerations regarding the interpretability of modelled pools and measured fractions (see ‘Model pool interpretability’ in the Methods), pool-specific benchmarks offer a critical opportunity to refine model formulations and reduce uncertainties. We argue that resolving the distribution and temperature sensitivities of underlying soil C pools is imperative for developing a predictive understanding of the emergent temperature dependence of bulk soil C decomposition and constraining global biogeochemical models.

Climatological temperature sensitivity of soil carbon pools

Soil organic C stocks are known to broadly decrease with increasing temperature across climate gradients4,30,35. However, the magnitude of this climatological temperature sensitivity can vary across soils29. Indeed, this relationship appears to be modulated by clay and silt content, where fine-textured soils (that is, soils containing higher amounts of clay and silt minerals) have a lower climatological temperature sensitivity compared with coarse-textured sandy soils (inferred from the relative strengths of relationships in Fig. 1a). These contrasting temperature sensitivities can be attributed in part to the mineralogical capacity of each soil29,36, but more specifically to its effect on the distribution of soil C among mineral-associated and particulate C pools. Since mineral-associated C often has a lower temperature sensitivity than particulate C2,9,14,37, the proportion of C that is mineral-associated can play a dominant role in driving the emergent bulk soil temperature sensitivity (see conceptual schematic in Supplementary Fig. 1).

Fig. 1: Global distribution of observationally derived soil C stocks as a function of climatological temperature and clay and silt minerals.
figure 1

a,b, Total soil C stocks (a) and the proportion of total soil C that is protected (mineral-associated) (b) as a function of the mean annual temperature (MAT) globally. Each gridcell is coloured by the percentage of clay and silt minerals, and best-fit trends are depicted for fine- and coarse-textured soils; here, fine-textured soils were classified as those with >70% clay + silt content and coarse-textured soils with <20% clay + silt content.

Globally, our results indicate that the proportion of soil C that is mineral-associated (hereafter termed ‘protected’ for consistency with model pools22,23) increases with increasing mean temperature (Fig. 1b; a trend that is not captured in most global models (Supplementary Figs. 2 and 3)). Furthermore, fine-textured soils have greater proportions of protected C than do coarse-textured soils across all temperatures. We therefore expect fine-textured soils to exhibit a lower climatological temperature sensitivity compared with coarse-textured sandy soils where higher unprotected C stocks drive a higher temperature sensitivity. The climatological temperature sensitivity of bulk soil C emerges from the underlying pool distributions and their respective temperature dependencies.

To assess the climatological temperature sensitivity of each soil C pool, we calculated the proportional decline in C stocks for every 10 °C increase in mean temperature while controlling for potential confounding factors that include primary productivity, precipitation, and clay and silt content (Methods). Our approach builds upon previous studies29 and provides a benchmarking metric analogous to a climatological Q10 (that is, the increase in process rates for each 10 °C increase in temperature) but for C stocks instead of respiration or turnover rates4,6. We find that, at the global scale, unprotected C stocks have a 28% (95% confidence interval (CI): 26, 30%) greater climatological temperature sensitivity than protected C stocks (Fig. 2). This pattern is especially pronounced in cool regions (<15 °C) where unprotected C is 53% (46, 60%) more temperature sensitive than protected C, compared with warm regions (≥15 °C) where the relative difference is only 15% (11, 20%). Both soil C pools show weak climatological temperature sensitivities in warm regions (Fig. 2c). This latter result suggests that decomposition in cool regions is more temperature-limited, and that warm regions do not experience as stark a decline in soil C for a given increase in mean temperature. This difference is supported by gradient studies in tropical soils38,39, and is consistent with incubation studies showing that warm-adapted microbial communities typically show lower temperature sensitivities (namely, Q10) and higher microbial growth efficiencies40. Furthermore, warm regions contain higher proportions of protected C (Fig. 1b), such that the emergent bulk soil C temperature sensitivity more closely mirrors that of the protected C pool. Therefore, the proportion of protected C—in addition to its climatological temperature sensitivity—may serve as an important benchmark for evaluating global models.

Fig. 2: Global distribution of observationally derived protected and unprotected soil C stocks as a function of climatological temperature.
figure 2

a,b, Protected (mineral-associated) (a) and unprotected (particulate) (b) soil C stocks as a function of MAT in regions above 0 °C globally. Linear regressions are shown for cool (<15 °C) and warm (≥15 °C) regions (solid lines) and across all temperatures above 0 °C (dashed lines). c, Climatological temperature sensitivity calculated as the proportional decline of each soil C pool for a 10 °C increase in MAT (analogous to a climatological Q10, but for C stocks) while accounting for potential confounding variables (see ‘Data analysis’ in the Methods). Points depict the proportional decline of each soil C pool (derived from the regression slopes; n = 32,528 gridcells for non-permafrost mineral soils above 0 °C globally, with n = 13,410 in cool regions and n = 19,118 in warm regions), and error bars denote the 95% CIs (Supplementary Table 6). Higher values (>1) indicate greater decreases in C with increasing climatological temperature.

Global patterns in soil carbon pools across data and models

Soil C models vary in their underlying assumptions, mathematical representations and parameterizations. However, several studies have proposed that the slowest-cycling (‘passive’) pool of soil C in many ESMs corresponds broadly to mineral-associated (protected) C15,22,23,24. This reflects a common conceptualization of donor-pool sources, mineralogical controls, turnover times and stoichiometry of protected C that are used to approximate the formation and characteristics of mineral-associated organic matter (see ‘Model pool interpretability’ in the Methods). We thus adopt this convention here, while recognizing that there are still inherent uncertainties in the interpretability of modelled pools and operationally defined fractions. We posit that benchmarks based on relative pool distributions, in tandem with radiocarbon ages41,42, will be critical for refining future model parameterizations and constraining transient dynamics.

Global soil C models are typically benchmarked on the basis of bulk soil C metrics43,44. However, even models that predict similar bulk soil C stocks can vary widely in their underlying pool distributions (Supplementary Fig. 4). Whereas bulk soil C stocks in non-permafrost mineral soils vary 2.5-fold across the CMIP6 ESMs and offline land models in this study, protected soil C stocks vary nearly sevenfold. As a result, the proportion of protected C ranges from 16 to 85% across the global models (Fig. 3 and Supplementary Table 4). This large spread in model predictions presents a critical opportunity to apply data-driven constraints and reduce uncertainty. We find that about half of the analysed ESMs underestimate the proportion of protected C in soils globally compared with the observationally derived data product, and consequently, contain too much C in faster-cycling, unprotected C pools. This result is especially true in non-permafrost mineral soils, where several of the global models underestimate the proportion of protected C in both cool and warm regions (Supplementary Fig. 5 and Supplementary Table 5). In particular, six models (CESM2, CNRM-ESM2-1, E3SM-1-1-ECA, IPSL-CM6A-LR, NorESM2 and the microbially explicit model CORPSE) roughly capture the proportion of protected C, whereas the other six models (ACCESS-ESM1-5, BCC-CSM2-MR, MIROC-ES2L, MRI-ESM2-0, CASA-CNP and the microbially explicit model MIMICS) considerably underestimate this proportion (Fig. 3d). However, we note that all of the models in this study are amenable to matching pool-specific constraints in future work—indeed, both first-order and microbially explicit formulations with refined parameterizations have been able to better match observations in past studies42,45,46,47. Nevertheless, our results generally suggest that soil C is skewed towards faster-cycling C pools in several global models, consistent with radiocarbon-based benchmarking studies that have shown that current ESMs underestimate the mean age of soil C across latitudes41,42.

Fig. 3: Proportion of soil C in protected pools globally across data and models.
figure 3

ac, The percentage of total soil C that is protected in the observationally derived data product and in the multi-model mean of the ensemble of CMIP6 ESMs and offline land models, shown globally (a,c) and as latitudinal means (b; solid lines). Shaded uncertainty ranges in b correspond to 90% prediction intervals for the data product33, and 5th and 95th quantiles of the multi-model ensemble across each latitudinal band. d, The global percentage of total soil C that is protected across the individual ESMs and offline land models. Protected C was estimated as the mineral-associated fraction for the data product, physicochemically protected pools in MIMICS and CORPSE, and the slowest-cycling passive pool in the CMIP6 ESMs and CASA-CNP (see ‘Model pool interpretability’ in the Methods). Global means are given for non-permafrost mineral soils, excluding tundra, deserts and peatlands (n = 39,620 gridcells for 0.5° resolution), and the data product uncertainty range corresponds to the 90% prediction interval33 (summarized in Supplementary Table 5).

At higher latitudes, the two microbially explicit models depict a lower proportion of protected C (Supplementary Fig. 4), consistent with recent studies3,14,15,45. The observationally derived data product also supports this latitudinal trend (Fig. 3b). By contrast, several CMIP6 models show a larger proportion of protected C at higher latitudes (particularly above 0 °C; Supplementary Fig. 3), which suggests that the interpretability of the protected pool as mineral-associated may break down in these regions within certain CMIP6 models. For instance, in models with vertically resolved soil biogeochemistry (CESM2, E3SM-1-1-ECA and NorESM2) these larger proportions of protected C may be caused by the relative timescales of decomposition and vertical mixing processes. This is because high-latitude active-layer C is transported to permafrost layers via a slow constant diffusion rate representing cryoturbation48,49, which consequently transports a greater fraction of slower-cycling C relative to fast-cycling C pools. Whereas soil C is indeed old in these high latitudes41, its decomposition is limited more by low temperatures that reduce the availability of liquid water than by association with minerals4,50. Given that temperature and hydrology can change at decadal timescales, representing long-lived organic matter at high latitudes as an inherently slow-cycling pool (as opposed to unprotected, particulate forms that can be more sensitive to disturbance9,11,13) has important implications for the responsiveness of vast permafrost soil C pools to projected climate change (Supplementary Figs. 68).

Temperature sensitivity of soil carbon in global models

The climatological temperature sensitivity of bulk soil C is an emergent property that ultimately depends on the distribution of C within underlying pools and their respective temperature sensitivities. Whereas some models may appear to capture the observationally derived climatological temperature sensitivity of bulk soil C stocks, this agreement does not always occur for the right reasons and may break down when considering the contribution of underlying protected and unprotected soil C pools. For example, six of the CMIP6 ESMs (ACCESS-ESM1-5, CESM2, CNRM-ESM2-1, E3SM-1-1-ECA, IPSL-CM6A-LR and NorESM2) and two microbially explicit models (MIMICS and CORPSE) roughly capture the temperature sensitivity of bulk soil C stocks (Fig. 4 and Supplementary Table 6). However, these same models show divergent distributions and climatological temperature sensitivities of underlying protected and unprotected soil C pools (Fig. 4). Only a few models roughly capture the climatological temperature sensitivity of protected C (CESM2, CNRM-ESM2-1, IPSL-CM6A-LR, NorESM2 and CORPSE) and even fewer of unprotected C (ACCESS-ESM1-5, MIMICS and CORPSE).

Fig. 4: Climatological temperature sensitivity of total soil C and the contribution of protected and unprotected pools.
figure 4

The climatological temperature sensitivity of total soil C (denoted by a diamond) is an emergent property of the climatological temperature sensitivity of protected (brown circles) and unprotected (green circles) soil C pools, as well as the proportion of C within each of these pools (denoted by the marker area). The proportional decline in C for a 10 °C increase in MAT was calculated for each C pool, controlling for potential confounding variables (see ‘Data analysis’ in the Methods). Results are shown for the data product and an ensemble of ESMs and offline land models across gridcells with MAT > 0 °C globally; 95% CIs are given in Supplementary Table 6. Higher values (>1) indicate greater decreases in C with increasing climatological temperature, lower values (<1) indicate increases in C and values equal to 1 (grey dashed line) indicate thermal independence.

The observationally derived data product broadly suggests that protected C has a lower climatological temperature sensitivity than unprotected C (Fig. 2). The models generally agree with this trend, although the difference in climatological temperature sensitivities of pools is less pronounced in the CMIP6 models than in the microbially explicit models (Fig. 4). This difference may arise because many first-order soil models impose the same temperature-dependence parameters across all pools49,51, and consequently the emergent climatological temperature sensitivities are often similar between pools. There are a few notable exceptions among the CMIP6 models, in which unprotected C appears less temperature sensitive than protected C (namely, MRI-ESM2-0 and to a lesser degree E3SM-1-1-ECA (Fig. 4), as well as CESM2, CNRM-ESM2-1, MIROC-ES2L and NorESM in cool regions (Supplementary Fig. 9)). This could arise due to temperature-sensitivity parameterizations based on a presumed greater molecular complexity of protected C in select models2,10. Ultimately, the distinct climatological temperature sensitivities of underlying soil C pools observed in the data product and process-based models can have important implications for transient projections, especially with regard to C ages and ecosystem responsiveness, and constitute an important benchmark for evaluating model performance.

Since protected C makes up the majority of bulk soil C globally in the data product (Fig. 3d), it also dominates the climatological temperature sensitivity of bulk soil C stocks (Fig. 4). However, many models do not capture this pattern. In fact, unprotected C dominates the bulk soil C stocks and climatological temperature sensitivity in a few models (BCC-CSM2-MR, MRI-ESM2-0, CASA-CNP and MIMICS), whereas most other ESMs have similar proportions and temperature sensitivities of underlying C pools. Only a couple of global models have high enough proportions of protected C to drive bulk climatological temperature sensitivities that are dominated by the protected C pool (for example, E3SM-1-1-ECA and CORPSE), as observed in the data product (Fig. 4 and Supplementary Fig. 9).

Furthermore, climatological temperature sensitivities of bulk soil C stocks and turnover can vary across climates, where temperature dependencies are typically higher in cooler climates than in warmer climates4,29,52. However, the temperature sensitivities of underlying soil C pools within these climate regimes have not been explored in global-scale models or observations. We find that both protected and unprotected C pools show greater climatological temperature sensitivities in cool than warm climates, but interestingly, this contrast across climates is more pronounced for unprotected C than for protected C, and especially in the observationally derived data product compared with the global models (Figs. 2 and 5 and Supplementary Figs. 911).

Fig. 5: Climatological temperature sensitivity of protected and unprotected soil C across temperature regimes.
figure 5

a,b, The proportional decline in protected (a) and unprotected (b) soil C stocks for a 10 °C increase in MAT, controlling for potential confounding variables. Proportional declines are shown for cool climates (0 °C < MAT < 15 °C; blue circles) and warm climates (MAT ≥ 15 °C; red circles). Results are shown for the data product (black triangles), the multi-model ensemble mean (asterisks) and each ESM and offline land model (solid circles; labelled with abbreviations); 95% CIs are given in Supplementary Table 6. Kernel density estimates are shown for each panel, illustrating the distribution of climatological temperature sensitivities across the model ensemble for each pool and temperature regime. Higher values (>1) indicate greater decreases in C with increasing climatological temperature, lower values (<1) indicate increases in C and values equal to 1 (grey dashed line) indicate thermal independence.

Overall, our analysis shows that global models tend to overestimate the climatological temperature sensitivity of protected C stocks in both cool and warm climates, and of both C pools in warm climates (Fig. 5). This is especially true of MIROC-ES2L, CASA-CNP and MRI-ESM2-0, which overestimate the climatological sensitivity of protected C by more than a factor of two. However, the global models tend to underestimate the climatological temperature sensitivity of unprotected C in cool climates (Fig. 5). The inability of most ESMs to match these regional patterns suggests that the transient response to warming may also be biased, particularly in cool climates. Protected (slower-cycling) C in ESMs may be overly sensitive to warming in cool regions, whereas unprotected (faster-cycling) C may not be sensitive enough to warming, with implications on soil C dynamics and age distributions under future scenarios.

Implications and future perspectives

We propose that the distributions and climatological temperature sensitivities of underlying soil C pools are ecosystem properties that global models should be expected to reproduce. Our results highlight that even ESMs that predict similar bulk soil C stocks and climatological temperature sensitivities can vary widely in their predictions of underlying soil C pools, with implications for C ages and ecosystem responsiveness under changing conditions. In particular, owing to an overestimation of C in unprotected (faster-cycling) pools, several CMIP6 ESMs may be too responsive to C inputs and may overestimate the effects of increased primary production on potential soil C sequestration (for example, with large increases in unprotected C; Supplementary Fig. 6); a finding that agrees with independent radiocarbon-based studies41,42. This implies that model estimates of the amount of C sequestration that may result from climate change and CO2 fertilization of terrestrial ecosystems may be too strong. Furthermore, most ESMs tend to underestimate the climatological temperature sensitivity of unprotected C in cool climates, potentially leading to an underestimation of C losses from these unprotected pools with warming, an underestimation of the global carbon cycle–climate feedback and compounding projections of higher productivity-driven C accumulation in these regions. Whereas this bias may be partially offset in ESMs that overestimate the climatological temperature sensitivity of protected C across climates, the effect of this latter overestimate on projections of carbon cycle–climate feedbacks is probably weaker because these pools are intrinsically slow-cycling in first-order soil models and consequently change very little on timescales relevant to climate policy (Supplementary Figs. 68). Divergent pool-specific responses can ultimately lead to differences in soil C composition and functioning. It is therefore imperative that models benchmark not only bulk climatological sensitivities but also those of underlying soil C pools, to confidently project soil C dynamics into the future.

The CMIP protocol requests but does not require the reporting of soil C pool distributions53, and, thus, this information is currently available only for a subset of CMIP6 ESMs. Our results show that, with recent advances in global estimates of protected and unprotected soil C stocks33, redefining and benchmarking these underlying soil C pools can serve as an important constraint for ESMs. We therefore urge that all global models report the distribution of soil C among underlying pools in CMIP Phase 7 and beyond, so that future studies may leverage this benchmark for model evaluation across climate gradients. Although further study of transient responses is also imperative for constraining dynamic models31,54, the static relationships presented here serve as important benchmarks of long-term temperature effects on the formation and decomposition of soil C pools.

Methods

Data sources and processing

The overarching goal of this study was to use observed spatial gradients in soil C stocks—particularly in underlying unprotected (particulate) and protected (mineral-associated) C pools—and their relationships with temperature, to develop benchmarks for ESMs. Although such benchmarks derived from spatial gradients are insufficient for constraining transient responses or making projections (that is, space-for-time assumptions)31, they do reflect steady-state differences and long-term climatological temperature sensitivities of soil C pools. These observed differences across spatial climate gradients should, in principle, be predicted by dynamical soil C models, and, thus, these benchmarks serve as important tests4,29. However, meeting this constraint is not sufficient for ensuring the ability of models to produce accurate projections under global change, and so we urge experimental work that quantifies the transient responses of soil C pools across diverse climates and biomes8,9,11.

Observationally derived climate and edaphic datasets for all global analyses were used at 0.5 × 0.5° resolution. MAT was estimated from the CRU dataset (version 3.10)55 and mean annual precipitation from the GPCC dataset56, both as 30 year annual averages. Land cover was obtained from the MODIS MCD12C1 product57, and productivity was estimated from the MODIS net primary productivity product58. Following Shi et al.41, soil organic C stocks to 1 m depth were estimated as the mean of Harmonized World Soil Database59 and SoilGrids60 maps to reflect a best estimate and uncertainty across data products. Corresponding mineral-associated C stocks and 90% prediction intervals were estimated in Georgiou et al.33,61 using a machine-learning (random forest) algorithm, and multiple cross-validation approaches were used to provide rigorous assessments of predictability62,63. We also note that our conclusions were robust to the selected fractionation method (namely, size and density; Supplementary Fig. 5 and Supplementary Table 5) and reflect a global best estimate and uncertainty given existing observations. However, we encourage further measurements in cooler climates and sandy soils, where the relative contribution of fine particulate organic matter may be more substantial3,15. In the present study, we focused our analyses on non-permafrost mineral soils with MAT > 0 °C given data limitations at higher latitudes33,63. To exclude soils whose decomposition is limited by water saturation or aridity, we also removed gridcells that contain a coverage of more than 50% peat, defined as the Histosol soil order or Histel suborder, and those that are hyperarid (with an aridity index of <0.05) or receive less than 100 mm yr−1 of precipitation. These gridcells were masked in white in all maps (Fig. 3) and were excluded from subsequent analyses.

Global land model output

Model output was sourced from CMIP6 ESMs34 and three offline biogeochemical testbed models23 for present-day (‘historical’) and future projections (‘ssp585’, or RCP8.5). Only CMIP6 models that reported soil C pool distributions were used in this study. Model details are listed in Supplementary Tables 13. Carbon stocks and climate covariates from each model were taken over the historical period as ten year annual averages for C stocks and 30 year annual averages for climate (spanning 2005–2015 and 1985–2015, respectively) to match the observation-based quantities. Specifically, MAT (‘tas’ for CMIP6 models), mean annual precipitation (‘pr’) and net primary productivity (‘npp’) were used from each model. Since the models do not provide depth-resolved C stocks, we used total soil C stocks (‘cSoil’ for CMIP6 models) in our analyses. Although the depth of soil C in CMIP6 models is not clearly defined, previous studies have compared with 0–1 m integrated soil C stocks4,43,44. We believe that this represents the best benchmark on current models and follow this depth correspondence in comparison with the data, but note that any errors in the correspondence with depth could be reduced with future models that are depth-resolved and/or explicitly define a depth interval4,49.

For the offline models of the biogeochemical testbed (that is, CASA-CNP, MIMICS and CORPSE), mineral-associated (protected) C was estimated by the explicit physicochemically protected pools for MIMICS and CORPSE, and the slowest-cycling (passive) pool for the first-order CASA-CNP model (see Wieder et al.23 for details on physicochemical protection within the three biogeochemical testbed models). Analogously, for the first-order CMIP6 models, the slowest-cycling C pool (‘cSoilSlow’) was used to represent mineral-protected C (see ‘Model pool interpretability’ in the Methods). We note that the CMIP6 model output repository uses the terminology cSoil for total soil C stocks and ‘cSoilFast’, ‘cSoilMedium’ and cSoilSlow for the three soil C pools that correspond to the ‘active’, ‘slow’, and passive pools, respectively.

Of the ten CMIP6 ESMs that reported soil C pool distributions, only one model constituted a conceptualization that did not align with mineral protection; namely, in TaiESM1, the soil C dynamics were parameterized as a decomposition cascade based on a synthesis of microcosm studies using radio-labelled substrates64,65,66. We thus excluded TaiESM1 from our analyses, and note that the model can be redefined and parameterized for comparison with pool-specific benchmarks in future work. We also emphasize that not all first-order soil C models were formulated as a decomposition cascade based on recalcitrance. Rather, several first-order soil C models implicitly represent key processes by rerouting flows and imposing particular controls (for example, lignin, soil texture and climate) on the formation of select pools22,24 (see ‘Model pool interpretability’ in the Methods; Supplementary Fig. 12). To the best of our knowledge, the nine remaining CMIP6 ESMs (namely, ACCESS-ESM1-5, BCC-CSM2-MR, CESM2, CNRM-ESM2-1, E3SM-1-1-ECA, MIROC-ES2L, MRI-ESM2-0, IPSL-CM6A-LR, NorESM2) constitute formulations broadly based on the Century model51 and its derivatives, for which the slowest-cycling (passive) pool is conceptually aligned with physicochemical protection. Although parameterizations in implemented soil models may have diverged from published values, protected C appears to be defined consistently across the nine remaining CMIP6 ESMs (Supplementary Table 3). To facilitate future benchmarking studies and reduce potential uncertainties, we highly encourage more detailed documentation on parameterizations implemented in soil models within CMIP ESMs.

The protected C pool in each model was then compared with the total soil C for each gridcell globally to calculate the proportion of protected C (Supplementary Fig. 4). The global mean (with 90% CIs) was calculated for each model and in each indicated temperature regime (Fig. 3 and Supplementary Fig. 5). We then calculated the spatially explicit multi-model mean across the ensemble of soil C models (Fig. 3c), as well as the latitudinal means of the observationally derived data product and multi-model ensemble (Fig. 3b). We also quantified changes in soil C pools for each model under an RCP8.5 (or ssp585) scenario by comparing projections from the end of the twenty-first century (2090–2100) with the present day (2005–2015). Results are shown as absolute and relative changes in soil C stocks globally (Supplementary Figs. 6 and 7 and Supplementary Table 4) and across time and latitude (as Hovmöller diagrams; Supplementary Fig. 8).

Model pool interpretability

The comparison of protected C across different model formulations, following the above pool nomenclature, has been proposed in several past studies with first-order models15,22,23. We recognize that there are still inherent mismatches between operationally defined, measurable pools of soil organic C fractions and the modelled states that are simulated by different soil biogeochemical models24,67,68,69. Although progress is being made to help bridge this divide22,45,70,71, the global-scale models that are the focus of this work make assumptions that are broadly consistent with characteristics of mineral-associated and particulate soil C fractions22,23,24 (Supplementary Fig. 12).

Indeed, in non-permafrost mineral soils, the formation and composition of the passive pool in select first-order models (for example, in the Century model of Parton et al.51) was formulated to resemble the mineral-associated fraction, and mirrors the mathematical representations of physicochemically protected C in several microbially explicit models, including MIMICS and CORPSE22,23. Namely, the protected (passive in the first-order models) pool is formed primarily from the microbial (active) pool, where this transfer coefficient is modulated by soil clay and silt content, such that proportionally more C is stored within the protected pool in fine-textured soils22,23,24,51,72. More decomposable (metabolic) litter enters only the microbial (active) pool, whereas structural litter (controlled by lignin content) primarily enters the particulate (slow) pool23,51. A small amount of the protected pool forms from direct sorption of the particulate pool23,73. Consequently, the C:N ratio of the protected (passive) pool often resembles that of the microbial (active) pool (that is, typically ~11–12 for the protected pool in the Century model73), which is consistent with the stoichiometry of mineral-associated organic matter16,74,75. By contrast, the C:N ratio of the slow pool is more closely aligned with the stoichiometry of particulate organic matter (that is, ~12–20 in the Century model73).

Finally, losses from the protected pool are represented as a first-order process in most soil C models22,36,45, including the ones presented here, with typical intrinsic turnover times ranging between 75 and 1,000 years, which are broadly consistent with radiocarbon estimates8,16,18 (see Supplementary Table 3 and Supplementary Fig. 13). Whereas there is heterogeneity within all underlying soil fractions in practice, it is well accepted that mineral-associated organic C is generally slower-cycling (decades to millennia) than particulate organic C (days to decades)16,17,18. Indeed, the intrinsic turnover times for protected (passive) C ranged from 200 to 450 years in all analysed CMIP6 ESMs, except for MIROC-ES2L and MRI-ESM2-0 for which the turnover times were 1,000 years (Supplementary Table 3). By contrast, intrinsic turnover times of the particulate (slow) pool ranged from 5 to 10 years in all analysed CMIP6 ESMs, except for MIROC-ES2L for which the turnover time was 25 years. These intrinsic turnover times are baseline (minimum) pool-specific values, such that environmental modifiers (based on temperature and moisture) further increase the turnover time in most gridcells; for example, by a factor of ~2.5 in temperate forests up to a factor of ~10 in Arctic regions for the Century model73. These environmental controls are important when comparing pool-specific turnover times with global radiocarbon data (Supplementary Fig. 14), and future work could estimate the distribution of C ages for each modelled pool across gridcells globally. Leveraging Δ14C measurements from the International Soil Radiocarbon Database18,76, and following Shi et al.41 to convert between turnover time and Δ14C, we show that the protected (passive) pool turnover time within many first-order models is broadly aligned with mean C ages of the mineral-associated fraction (Supplementary Table 3 and Supplementary Figs. 13 and 14). However, future model versions may further improve the correspondence by reducing the intrinsic turnover time of protected C, especially in MIROC-ES2L and MRI-ESM2-0, and strengthening its environmental controls to better match the global distribution of Δ14C measurements (Supplementary Fig. 14).

By contrast, the turnover time of protected C is fixed at 75 years in CORPSE and ranges from 15 to 80 years in MIMICS based on a dependence on soil texture23. We note that the turnover times of MIMICS and CORPSE appear to be biased low compared with radiocarbon data of soil fractions (Supplementary Figs. 13 and 14), which may explain their strong responses under future conditions (Supplementary Figs. 68) and suggests that bulk soil C ages may also be too young in these models. In fact, subsequent versions of MIMICS that have yet to be integrated into the global biogeochemical testbed have implemented much longer turnover times of protected C—for instance, ranging from 180 to 1,000 years (that is, increased by a factor of 12.5) in MIMICS-CN77, and in the C-only MIMICS independently fitted to site-level data at the Reynolds Creek Experimental Watershed47. These modifications in subsequent versions of MIMICS have also resulted in much higher proportions of protected C (namely, 50–80%) that are better aligned with observational constraints. Thus, we emphasize that all models, including microbially explicit and first-order formulations, have the opportunity to be falsified or refined in future iterations, but this does not preclude the ability to benchmark them here.

The range of conceptual representations and parameterizations of physicochemical protection across ecosystem to global-scale models (for example, see the schematic in Sulman et al.22) presents an urgent need to develop and apply observationally derived constraints. We posit that both the microbially explicit and first-order soil C models in this study (Supplementary Tables 13) constitute conceptualizations that are compatible with benchmarking protected soil C to refine future model parameterizations. Although we recognize that there are still inherent uncertainties in the interpretability of modelled pools and operationally defined fractions, we believe that this study represents a valuable benchmark on model pools globally. Observational syntheses show that mineral-associated C is typically an old (Supplementary Fig. 13 and Supplementary Table 3) and large (Fig. 3 and Supplementary Fig. 5) component of total soil C. Thus, the slowest-cycling passive pool in first-order models—which broadly aligns with radiocarbon measurements of mineral-associated C (Supplementary Figs. 13 and 14)—should similarly represent a large fraction of the total soil C pool in mineral soils. Such a correspondence would also improve model performance metrics based on bulk soil C ages41,42. Since pool-specific turnover times appear to be broadly aligned with observations, future studies may seek to refine other parameters that strongly affect the proportion of protected C—for example, increasing the fraction of the active pool that flows to the passive pool (Supplementary Figs. 12 and 15) and its mineralogical controls78. Such parameters may also be informed by observational estimates of the relative flows of plant- or microbial-derived C to mineral-associated C15,79. Ultimately, we contend that benchmarking soil C pool distributions concurrently with radiocarbon ages in future work will present a powerful constraint on soil C dynamics across global models.

Data analysis

We investigated the relationship between soil C stocks (within unprotected and protected C pools) and MAT across the global data product and models. Soil C stocks for each pool were log-transformed and a linear regression was used to reflect a hypothesized exponential relationship with temperature (as in Hartley et al.29). To ensure that the observed global patterns were not driven by confounding environmental variables, we used a multiple linear regression to control for net primary productivity, mean annual precipitation and clay and silt content in our analyses. In all cases, the climate and productivity covariates were sourced from their respective model output or data product. Clay and silt content was obtained from the Harmonized World Soil Database59 as it was not reported for most CMIP6 models. The climatological temperature sensitivity was then calculated from the slope of the log-linear temperature relationship and, following Hartley et al.29, summarized as the proportional decline in C stocks for every 10 °C (analogous to a climatological Q10, but for C stocks) with corresponding parametric 95% CIs (Supplementary Table 6).

This calculation was then repeated for protected and unprotected soil C pools in the data and models, across all mean temperatures above 0 °C (Fig. 4) and across two underlying temperature regimes, namely cool (<15 °C) and warm (≥15 °C) (Fig. 5 and Supplementary Fig. 9). Soils with mean temperatures below 0 °C were not considered in our analyses due to distinct controls on soil C decomposition (for example, permafrost dynamics29,48) and consequent implications on the interpretability of protected pools in first-order models in these regions (Supplementary Fig. 3), as well as higher uncertainties in the data product within this temperature regime33. Furthermore, the same analyses were performed on an observational synthesis33 (Supplementary Figs. 10 and 11). The general agreement between the observations and data product provides further evidence for the robustness of the reported temperature sensitivity trends between protected and unprotected soil C across climate regimes (Fig. 2c). Differences in climatological temperature sensitivity between protected and unprotected soil C pools, and across the data and models, were deemed statistically significant when the 95% CIs did not overlap (Supplementary Table 6).