Tracking 21st century anthropogenic and natural carbon fluxes through model-data integration

Bultan, Selma; Nabel, Julia E. M. S.; Hartung, Kerstin; Ganzenmüller, Raphael; Xu, Liang; Saatchi, Sassan; Pongratz, Julia

doi:10.1038/s41467-022-32456-0

Download PDF

Article
Open access
Published: 26 September 2022

Tracking 21^st century anthropogenic and natural carbon fluxes through model-data integration

Nature Communications volume 13, Article number: 5516 (2022) Cite this article

4695 Accesses
9 Citations
83 Altmetric
Metrics details

Subjects

Abstract

Monitoring the implementation of emission commitments under the Paris agreement relies on accurate estimates of terrestrial carbon fluxes. Here, we assimilate a 21^st century observation-based time series of woody vegetation carbon densities into a bookkeeping model (BKM). This approach allows us to disentangle the observation-based carbon fluxes by terrestrial woody vegetation into anthropogenic and environmental contributions. Estimated emissions (from land-use and land cover changes) between 2000 and 2019 amount to 1.4 PgC yr⁻¹, reducing the difference to other carbon cycle model estimates by up to 88% compared to previous estimates with the BKM (without the data assimilation). Our estimates suggest that the global woody vegetation carbon sink due to environmental processes (1.5 PgC yr⁻¹) is weaker and more susceptible to interannual variations and extreme events than estimated by state-of-the-art process-based carbon cycle models. These findings highlight the need to advance model-data integration to improve estimates of the terrestrial carbon cycle under the Global Stocktake.

Surprising stability of recent global carbon cycling enables improved fossil fuel emission verification

Article Open access 17 August 2023

Benjamin Birner, Christian Rödenbeck, … Ralph F. Keeling

Vast CO2 release from Australian fires in 2019–2020 constrained by satellite

Article 15 September 2021

Ivar R. van der Velde, Guido R. van der Werf, … Ilse Aben

Synthesis of the land carbon fluxes of the Amazon region between 2010 and 2020

Article Open access 22 January 2024

Thais M. Rosan, Stephen Sitch, … Luiz E. O. C. Aragão

Introduction

Environmental change is altering the global balance between CO₂ emissions and uptakes by terrestrial ecosystems. The natural carbon sinks in terrestrial vegetation and soils provide an immense buffer for anthropogenic emissions, currently sequestering about one-third of fossil and land-use change CO₂ emissions¹. Opposing effects on the strength of the terrestrial carbon sinks, such as increased plant productivity through CO₂ fertilization² and enhanced wildfires triggered by pronounced droughts³, lead to large uncertainties when estimating present and future dynamics of the natural carbon sinks⁴. Reducing those uncertainties through analyzing the individual contributions of anthropogenic and environmental processes to the global carbon cycle is one of the main aims of the annually updated Global Carbon Budget (GCB), published by the Global Carbon Project¹. Within the scope of the GCB, the net land-atmosphere exchange of CO₂ is defined as the sum of (1) anthropogenic fluxes from land-use and (land-use induced) land cover change activities (LULCC), including management, and (2) fluxes due to environmental processes (e.g., effects of increased atmospheric CO₂ levels and N-deposition, pests, wildfires, altered precipitation patterns). The first term is called E_LUC and is estimated with semi-empirical bookkeeping models (BKMs), whereas the second term is referred to as the natural terrestrial carbon sink, S_LAND, which is estimated with process-based dynamic global vegetation models (DGVMs).

For both E_LUC and S_LAND, there is a large spread among model estimates. E_LUC is estimated to be 1.1 ± 0.7 PgC yr⁻¹ for 2011–2020, i.e., has an uncertainty of ±64% (for one standard deviation). The DGVM estimate for S_LAND for the same time frame has an uncertainty of ±19%¹. BKMs commonly simulate emissions due to LULCC in the absence of environmental influences by combining assumptions on the amount of carbon contained in vegetation and soils with empirical decay functions, describing their response to LULCC. DGVMs additionally account for environmental effects on the different carbon pools and simulate biogeochemical processes such as photosynthesis⁵.

The main sources of uncertainty depend on the model type. For BKMs, different assumptions regarding the amount of contained carbon per unit area (=carbon density) contribute substantially to the uncertainty in E_LUC⁶. Moreover, carbon densities of soils and vegetation in BKMs are typically based on contemporary carbon stocks, which inevitably introduces an error when simulating past (i.e., the period prior to the data record of the underlying carbon densities) E_LUC (“bookkeeping error”)⁷. For DGVMs, different parameterizations and whether and how vegetation and soil processes are captured lead to a large spread in the estimated soil and vegetation carbon stocks (vegetation carbon ±55% of the average for eight DGVMs from the TRENDY Model-Intercomparison Project (https://blogs.exeter.ac.uk/trendy/, last access: 11 April 2022); see Table 1)⁸.

Table 1 Comparison of global living biomass (above- plus belowground) carbon stocks and associated fluxes (positive for uptake and negative for release) from this study compared to a range of other recent studies

Full size table

Observational estimates of global vegetation carbon stock from existing datasets^9,10 and upcoming satellite missions (e.g., ESA BIOMASS mission¹¹) offer large potentials for reducing the mentioned model uncertainties by constraining models with observations^12,13. However, there are limitations to all satellite-based estimates of global vegetation carbon fluxes for terrestrial carbon budget analyses. Some of the major limitations are: the restriction to gross fluxes/sub-component fluxes of E_LUC (e.g., only carbon emissions from deforestation or carbon uptakes after the abandonment of agricultural lands) are captured; the difficulty to distinguish anthropogenic from environmental fluxes and the restriction to committed fluxes¹⁴. In committed fluxes, biomass loss is assumed equal to emissions to the atmosphere, unless additional assumptions are applied that track the fate of carbon on site and in products over time, as in legacy fluxes¹⁵.

Here, we propose an approach that overcomes the mentioned limitations of BKMs and satellite-based estimates by allowing us to decompose observationally constrained estimates of carbon stocks into anthropogenic (E_LUC) and environmental (S_LAND) contributions. We only count fluxes resulting from direct anthropogenic activities in the form of LULCC towards anthropogenic processes. These include carbon uptakes due to regrowth after wood harvesting and abandonment of agricultural lands and carbon emissions due to forest clearing, wood harvesting, etc. Indirect anthropogenic influences (e.g., effects of increasing CO₂ on plant productivity) are defined as environmental processes. Our analysis is based on the recently published time series of global woody vegetation carbon densities for 2000–2019 by ref. 16. The observation-based time series is assimilated into the BKM BLUE (“Bookkeeping of Land Use Emissions”)⁵, which is one of three BKMs used in the GCB. We apply our approach to analyse the implications of considering environmental processes on the estimated E_LUC. Furthermore, we assess uncertainties of the land cover and plant functional type distribution in BLUE. Lastly, we provide observation-based S_LAND estimates for woody vegetation, which are subsequently compared to DGVMs from the TRENDY project (v8)⁸.

Results

A model-data integration framework for separating anthropogenic and environmental carbon fluxes

Our methodological framework, as shown in Supplementary Fig. 1, is based on the BKM BLUE and the time series of woody vegetation carbon densities from ref. 16. In its default setup, BLUE simulates LULCC emissions in the absence of environmental influences. The carbon densities of vegetation and soils (see Methods) on different land cover types (primary land, secondary land, cropland and pasture) and plant functional types (PFTs) are based on fixed contemporary values¹⁷. Over time, the amount of carbon stored in the terrestrial biosphere is altered by LULCC prescribed from external data (here: the Land-Use Harmonization 2 (LUH2)¹⁸ dataset). Recovery and decay of soil and vegetation carbon follow fixed contemporary rates (see ref. 17 and Methods).

The dataset by ref. 16 provides annual estimates of carbon densities in living woody vegetation (i.e., trees and shrubs) for the period 2000–2019. The dataset was generated by integrating data from spaceborne LiDAR, RADAR, optical imagery, airborne laser scanning and ground inventory data in a spatio-temporal machine learning algorithm. The algorithm was trained by a large number of samples derived from LiDAR measurements of vegetation structure converted to aboveground and belowground woody vegetation carbon densities using allometric models¹⁶. We assimilate this dataset in BLUE in several steps to calculate S_LAND: We (1) distribute the grid cell-based (i.e., average per grid cell) biomass carbon densities of ref. 16 between sub-pixel fractions for the different land cover types and PFTs in BLUE, (2) define upper thresholds for the exclusion of unrealistic biomass carbon densities that arise from inconsistencies between the dataset by ref. 16 and the model assumptions, and (3) interpolate “missing values” (i.e., values that were excluded according to the chosen threshold approach) (see Methods). We employ two different simulation setups to isolate environmental carbon fluxes (S_LAND) by terrestrial woody vegetation (4). The first setup (4a) relies on transient biomass carbon densities, i.e., the biomass carbon densities from ref. 16 are assimilated into BLUE at each time step (i.e., each year). In this setup, carbon stocks of woody biomass between two time steps are affected by anthropogenic and environmental drivers. The second setup (4b) is based on fixed biomass carbon densities from the year 2000. In this setup, the biomass carbon densities from ref. 16 are assimilated into BLUE for the year 2000 and the carbon stocks are in the subsequent time steps only altered by LULCC, i.e., only anthropogenic processes are considered. We 5) define the difference in the annual change of woody biomass carbon between the transient and the fixed simulation setup as the woody terrestrial biomass carbon sink (=S_LAND,B). Our definition is consistent with the GCB in the sense that it defines S_LAND,B as the sum of all carbon sources and sinks due to environmental processes on any type of land (i.e., managed and unmanaged). However, there are substantial differences between our estimates and the GCB estimates, including differences in the assumed land cover distribution (see following explanation on the “loss of additional sink capacity”) and the restriction of our estimates to woody vegetation and to living biomass, excluding litter, dead wood and soil dynamics. Contrary to the GCB, our estimated carbon fluxes from woody vegetation due to LULCC (E_LUC,B) and S_LAND,B should not be used to create a balance to derive the net exchange of carbon between the land and the atmosphere, since E_LUC,B only captures fluxes to/from the atmosphere from/to woody vegetation, whereas S_LAND,B also includes fluxes that are in reality delayed (to the atmosphere) due to the deposition of carbon to litter and soil carbon pools. In our approach, carbon releases from woody vegetation are positive, whereas uptakes of carbon by woody vegetation are negative.

Our approach introduces various important novelties compared to DGVMs and to BKMs with fixed contemporary carbon densities. Compared to DGVMs, our approach has the advantage that estimates for S_LAND,B and E_LUC are estimated on transient, present-day land cover distribution, and do therefore not include the “loss of additional sink capacity” (LASC). The LASC implies that S_LAND from DGVM simulations without LULCC under transient environmental conditions and under pre-industrial land cover distribution is larger than under present-day land cover distribution. This is due to larger forested areas under pre-industrial land cover compared to present-day land cover, which allows for more carbon accumulation caused by favorable environmental conditions (e.g., increasing atmospheric CO₂)¹⁹. Similarly, a recent study by ref. 19 suggests that calculating E_LUC as the difference between DGVM simulations with and without LULCC (i.e., under pre-industrial land cover) under transient environmental conditions leads to a 40% larger E_LUC for 2009–2018 compared to the pre-industrial control simulation, which is attributable to the LASC. Furthermore, the bookkeeping error is resolved for E_LUC,B after 2000 due to the assimilation of observed woody biomass carbon densities. Lastly, our approach considers all impacts on carbon fluxes related to woody vegetation, including processes that are commonly not considered in model-based approaches (e.g., forest degradation caused by environmental processes).

Throughout our analysis, we use slightly different time frames for aggregating the data from our BLUE simulations. The data on E_LUC and on biomass carbon stocks is aggregated for the entire time series, i.e., 2000–2019. Fluxes that are calculated from annual changes in biomass carbon, including S_LAND,B, are available for 2001–2019. However, for any direct comparisons with only the TRENDY DGVMs, we restrict our S_LAND,B estimates to the time frame of the TRENDY data on S_LAND,B (2001–2018).

Anthropogenic effects (LULCC) on global woody vegetation carbon

To assess the general behavior of the BKM using updated woody biomass carbon stocks from observations, we compare the results of our fixed woody biomass carbon simulations to the default setup (of BLUE) and other models, which follow the classical bookkeeping approach (i.e., exclude environmental influences). The default setup of BLUE is based on carbon densities from ref. 17 and is referred to as ref. 5 in Table 1–3. On a global scale, E_LUC between 2000 and 2019 for the fixed woody biomass runs is on average 0.2 PgC yr⁻¹ (13%) lower than the estimate with the default setup. This brings our updated E_LUC estimate closer to the other BKMs used in the GCB^20,21 and to the multi-model average of the TRENDY simulations with fixed present-day carbon densities¹⁹ (Table 2). The spread between the BKMs is reduced by 43% (BLUE minus OSCAR²⁰) resp. 29% (BLUE minus H&N²¹), whereas the difference to the TRENDY multi-model average (1.4 PgC yr⁻¹) is reduced by 88% compared to the default BLUE setup.

Table 2 Comparison of estimated global carbon flux from land-use and (land-use induced) land cover change (positive into atmosphere) from this study compared to a range of other recent studies. Interannual variability (IAV) is calculated as the ratio of the standard deviation (SD) to the mean

Full size table

E_LUC estimated from the transient woody biomass carbon simulations amounts to 2.8 PgC yr⁻¹ for 2000–2019. The large difference in the fixed woody biomass carbon estimate for E_LUC is mainly related to higher biomass carbon stocks in the transient simulations (probably strongly driven by the effect of enhanced plant productivity under increasing CO₂)^2,22. In the fixed carbon density simulation, vegetation on cleared or harvested areas is recovering/slowly regrowing in the years after the respective land-use event. In the transient carbon density simulation, the effect of higher CO₂ on plant productivity, together with other environmental influences, leads to increased carbon uptake by woody vegetation. On managed lands, increasing atmospheric CO₂ can lead to a quicker recovery of vegetation after clearing/harvesting. Consequently, the higher biomass carbon in the transient simulation can lead to increased emissions upon wood harvesting and clearing but also to increased carbon uptake due to faster regrowth of vegetation on managed lands. To identify the drivers behind the higher net E_LUC in the transient woody biomass simulations, we analysed E_LUC for the major land-use transitions in BLUE (clearing, harvest, and abandonment) (Supplementary Table 1). Accordingly, emissions from clearing and wood harvesting are on average 1.7 PgC yr⁻¹ higher in the transient woody carbon density setup than in the fixed woody carbon density setup, which is not compensated by the increased carbon uptakes (i.e., negative flux from the atmosphere to the land) due to the quicker vegetation regrowth on abandoned agricultural land under higher CO₂ concentrations (transient minus fixed: −0.3 PgC yr⁻¹). Regional hotspots that make up ~60% of the cumulative increase in E_LUC in the transient simulation setup are found in Europe, South- and Southeast Asia. Furthermore, Europe and South Asia alone account for the majority (~76%) of the increase in cumulative harvest emissions. Building upon the uncertainty analysis in the Methods, these are all regions where uncertainties due to the LULCC forcing and its implementation in BLUE are high. On different spatial and temporal scales than the ones investigated in our analysis, the effect of larger E_LUC under transient woody biomass carbon could be reduced or compensated by an increased uptake of carbon due to faster vegetation regrowth under more favorable growing conditions (e.g., due to increasing atmospheric CO₂ concentrations).

Environmental effects on global woody vegetation carbon

We analyse similarities and differences between our estimates from BLUE simulations with transient and fixed woody biomass carbon to other model-based and observational estimates. Furthermore, we compare global and regional environmental carbon fluxes in the form of S_LAND,B from our approach to estimates of an ensemble of TRENDY models for the period 2001–2018.

Between 2000 and 2019, we estimate 399 ± 2 PgC contained in global living vegetation (woody and non-woody) in the transient woody biomass carbon simulations vs. 382 ± 2 PgC in the fixed woody biomass carbon simulations. The difference of 17 ± 1 PgC is due to environmental changes. The TRENDY estimates suggest that biomass carbon stocks under fixed climate (S5 setup, see Methods) are 18% higher than under transient climate (S3 setup, see Methods). Similar to our BLUE simulations, this is probably related to the fact that the TRENDY simulations under fixed climate rely on present-day CO₂ levels, leading to enhanced plant productivity compared to the simulations under a transient climate that also have transient CO₂ levels¹⁹. However, the assumption of constant, present-day CO₂ levels over the whole historical period in the TRENDY S5 simulations leads to a much stronger CO₂ fertilization effect on vegetation carbon stocks compared to our simulations. The comparison of our estimated vegetation carbon stocks to various other studies (Table 1) shows both BLUE estimates (transient and fixed) are more consistent with the multi-model average of eight TRENDY models (see Methods) and various observation-based datasets^23,24 than the default setup. Our updated estimates of global forest carbon stocks (Table 3) are also closer to other observation-based estimates²³ than the estimates from the default setup. The largest differences to the default setup and the biggest improvements concerning the reconciliation with other datasets are found for tropical and boreal forests. In terms of the interannual variability (IAV) of the net carbon fluxes from global woody vegetation (Table 1), we find that the IAV is on average around eight times larger when considering environmental effects on woody biomass carbon. In other words, ~88% (2.1 PgC yr⁻¹) of the IAV of the net carbon fluxes from woody biomass (2.4 PgC yr⁻¹) carbon is due to environmental effects and their synergies on E_LUC or conversely ~12% of the IAV (0.3 PgC yr⁻¹) is attributable to LULCC (Table 1). The same relation between biomass carbon simulated under fixed vs. transient climate is also shown for the TRENDY simulations, although our estimates suggest a stronger contribution of environmental processes to the IAV of carbon fluxes from vegetation. Between 2001 and 2018, S_LAND,B amounts to −1.6 PgC yr⁻¹ (−1.5 PgC yr⁻¹ for 2001–2019) based on our BLUE simulations, suggesting a ~13% smaller sink than the TRENDY multi-model average (Supplementary Table 2). There are some important differences between the TRENDY estimates and our BLUE results. First, the TRENDY results do not only include woody vegetation, but also herbaceous plants. Consequently, the IAV of the TRENDY estimates also includes dynamics of non-woody vegetation. However, we expect this effect to be small, since several studies^25,26 show that the IAV of the terrestrial carbon sink in semi-arid ecosystems is foremost attributable to soil dynamics (as opposed to vegetation dynamics). Second, as mentioned before, our estimates do not include the LASC, as they are based on the present-day land cover distribution. The overestimation of the terrestrial carbon sink strength due to the fixed pre-industrial land cover distribution in the TRENDY simulation is in line with the fact that our BLUE estimate for S_LAND,B shows a smaller sink than the TRENDY multi-model average. Third, the TRENDY models have a much coarser horizontal resolution (0.5^∘–2.8^∘) than BLUE (0.25^∘), which has important implications for the representation of sub-grid scale processes (discussed below).

Table 3 Comparison of forest living biomass carbon stocks (above- plus belowground) and associated fluxes (positive for uptake and negative for release) from this study compared to a range of other recent studies

Full size table

Figure 1 shows S_LAND,B from our BLUE simulations for 15 regions (Supplementary Fig. 2) against the TRENDY estimates of 13 DGVMs (see Methods for a description of TRENDY database). The 2001–2018 average based on our BLUE simulations are very similar to the TRENDY estimates for all regions (Fig. 1b). However, there are large differences between our estimates and the TRENDY estimates for the extreme values and the IAV of S_LAND,B. We calculate the IAV as the coefficient of variation, i.e., the standard deviation of S_LAND,B divided by the average¹⁶ S_LAND,B between 2001 and 2018. There are some regions where the TRENDY estimates show a higher IAV of S_LAND,B than our estimates. This mainly applies to Southern Africa and especially Oceania (mostly Australia, Supplementary Fig. 2) and is probably related to the dominating effects of grasslands there (Supplementary Fig. 3), which are not captured in our BLUE results. However, ref. 27 validate the Australian carbon cycle as simulated by the TRENDY (v8) DGVMs with various observational datasets and find that the uncertainty is large (Cumulative NBP 1901-2018: -4.7 to +9.5 PgC), mainly due to different model assumptions (e.g., land cover distribution, land-use implementation, atmospheric CO₂ concentration). Contrary to that, we estimate a much higher IAV in the boreal regions of Canada and Russia than the TRENDY models (Supplementary Fig. 4). Despite large differences between the individual models, we find that 12 out of 13 TRENDY models estimate that the IAV of the carbon sink in boreal (defined in Supplementary Fig. 5) vegetation is at least 67% (maximum: 96%) smaller than our BLUE estimates.

**Fig. 1: Regional estimates of the natural biomass land sink (S_LAND,B) between 2001 and 2018.**

The magnitude of IAV of S_LAND,B in our estimates is closely related to that from the underlying biomass changes from ref. 16. While IAV may seem high compared to regional estimates of disturbance impacts, it is not inconsistent with previous studies of land sink fluxes (see Section “Uncertainties” in the Methods Section). If we assume the IAV, as based on ref. 16, is correct, several other explanations as to why the TRENDY DGVMs estimate a lower IAV of S_LAND,B in boreal regions, emerge. For the North American (NAM) boreal forest, our analysis suggests that annual anomalies in air temperature have a large effect on the variability of S_LAND,B, as estimated by BLUE. We calculated the Spearman correlation coefficient between the anomalies in annual biomass carbon and the anomalies in (1) mean annual air temperature and (2) annual precipitation sums (both from ERA-5 reanalysis data²⁸). We find a strong positive correlation (mostly >0.7) between the annual mean air temperature anomaly and the annual anomaly in forest biomass carbon (Fig. 2), which mainly translates to a high IAV of S_LAND,B, as LULCC intensity is very low in this region (Fig. 3). Our findings are similar to ref. 29, who constrain a light-use efficiency model with satellite-derived vegetation dynamics and conclude that temperature, together with water availability, strongly affects the high IAV in plant productivity in northern high latitudes with an increasing influence of temperature under global warming. We further find a spatial gradient for the temperature-vegetation carbon correlation in the NAM boreal forest, which implies decreasing (i.e., positive to negative) correlations from east to west. This gradient represents the effect of temperature-/radiation- vs. water-limited ecosystems and is supported by refs. 30, 31, who find that air temperature and radiation are the dominant factors for plant growth in the eastern parts of the NAM boreal forest, whereas precipitation and soil moisture limit plant growth in the western parts of the NAM boreal forest. Since heterotrophic respiration is also primarily temperature-limited³², it should be noted that increased carbon uptakes by the NAM boreal forests related to warmer years are expected to be partly offset by increased heterotrophic respiration when the total natural land sink is investigated (i.e., when natural carbon fluxes from the soil, dead wood, and litter are accounted for). The average of the TRENDY models (calculated as average biomass carbon prior to the correlation analysis) show weaker (mostly <0.5) correlations between annual anomalies in (transient) biomass carbon and temperature anomalies and do not show a switch in sign of the correlation within the NAM boreal forest from east to west, which would be in line with the spatial biomass carbon gradient shown in our results (Supplementary Fig. 6b). Despite large differences between the individual models, we find that all DGVMs show much weaker correlations than the results based on BLUE and none of the models reproduces the spatial biomass gradient (not shown). Supplementary Fig. 7 shows that the analysed correlations between air temperature anomalies and biomass carbon anomalies based on BLUE are significant (p < 0.05) throughout the majority of the NAM boreal forest, whereas the correlations based on the TRENDY multi-model average are only significant in some parts.

**Fig. 2: Spatial correlations between annual anomalies of climate variables and biomass carbon between 2000 and 2019.**

**Fig. 3: Global maps of biomass assimilation bias and LULCC intensity according to LUH2.**

The lower IAV of S_LAND,B in the boreal regions based on the TRENDY results could also be related to the coarse horizontal resolution⁸, which might lead to incomplete representations of sub-grid scale processes (e.g., droughts or vegetation greening) that contribute substantially to the variability of biomass carbon. This is further supported by ref. 33, who suggest that the TRENDY (v6) models have deficits in capturing local to regional (<1000 km) spatial variability in (aboveground) biomass carbon. For reference, the NAM boreal forest area in Fig. 2 extends over a distance of around 5000 km (north-western corner to south-eastern corner of the blue frame).

Our BLUE estimates do not only suggest a larger IAV of S_LAND,B in some regions, but also a stronger reduction in sink capacity of the terrestrial woody vegetation in response to specific drought years in some regions. This is e.g., shown in Brazil, where our estimates indicate a 0.4–0.9 PgC yr⁻¹ weaker natural carbon sink than the TRENDY multi-model average for the documented strong El Niño years in 2005²² and 2015³⁴, and for the severe drought in the Amazon basin in 2010²². For 2005 and 2010, our estimates lie outside the TRENDY multi-model range. A similar dynamic is found for Europe, where our estimates suggest that the reduction in sink strength in 2015 was ten times (+0.4 PgC yr⁻¹) stronger (calculated as BLUE S_LAND,B,2015 minus TRENDY S_LAND,B,2015) than the TRENDY multi-model average (but still within the TRENDY range) as a response to the severe drought in the same year³⁵. An underestimation of the drought-mediated reduction in vegetation productivity by DGVMs is shown by other studies^36,37,38 and is suggested to be partly attributable to the fact that some models do not account for heat and water stress effects on plant productivity. Another suggested influential factor is the implementation of empirically-derived response curves for temperature and moisture in most models. The response curves capture plant productivity as a function of temperature and moisture and are based on historically observed climate, which might not be accurate for unprecedented extreme events³⁸. The stronger reduction in our S_LAND,B estimates as a response to drought events might also be related to the implicit consideration of forest degradation in our approach, which is not implemented in the DGVMs. A recent study by ref. 39 suggests that forest degradation leads to three times higher cumulative carbon losses of aboveground biomass than deforestation in the Brazilian Amazon between 2010 and 2019, with especially strong forest degradation due to el Niño events. This might be a further explanation for the differences between our results and the TRENDY results in Brazil.

Discussion

The separation of anthropogenic and natural carbon fluxes has been identified as one of the key challenges for reconciling and integrating models and observations¹⁴. The approach we developed tackles this challenge by disaggregating observation-based estimates of carbon stocks into the net LULCC flux (E_LUC) and the natural terrestrial sink (S_LAND). Our analysis highlights the importance of observational constraints for a more realistic representation of observed global vegetation dynamics in models and the attribution of anthropogenic vs. environmental impacts.

The comparison between our BLUE simulations with transient and fixed woody biomass carbon densities suggests that the biomass carbon sinks prior to clearing and wood harvesting were larger under transient environmental conditions due to increasing atmospheric CO₂ levels and other favorable environmental changes, which in turn led to larger carbon emissions upon clearing and wood harvesting compared to fixed environmental conditions.

That current carbon budgeting approaches exclude the effects of environmental changes on E_LUC may have important implications for our confidence in other budget terms. The budget imbalance (B_IM) is a measure of uncertainty in the estimated terms of the GCB, as it describes the difference between the emissions and sinks on the land, in the ocean, and in the atmosphere⁷. Following the GCB assessments, it is assumed that the atmospheric growth rate of CO₂ (G_atm) can be measured with high confidence, whereas the assessments of the natural carbon sinks on land and in the ocean are more uncertain⁷. If the S_LAND trends were depicted accurately, there should be an increase in S_LAND when considering environmental effects on carbon stocks (mainly due to more favourable growing conditions under elevated CO₂ levels), while the quantification of E_LUC through BKMs excludes all environmental effects, i.e., the budget imbalance would have to increase over time. Since the budget imbalance has been approximately constant with no trend since 1959 in the GCB assessments, we conclude that the global trend of increasing S_LAND is not captured accurately (Fig. 4) in the GCB.

**Fig. 4: Simplified scheme of the implications of not considering synergies between environmental effects on carbon stocks and E_LUC.**

Our analysis highlights the potential of using observational constraints for improving and reconciling model estimates, as multi-model uncertainties in E_LUC estimates are reduced to a substantial degree by assimilating observation-based woody biomass carbon densities in a BKM. Further, our results suggest that state-of-the-art DGVMs have deficits in capturing the IAV of S_LAND,B and the response of terrestrial vegetation to extreme events. These findings are in line with other recent studies, which find several explanations for the limitations of DGVMs to represent observed patterns in terrestrial carbon cycle dynamics. Our model-data integration also reveals hotspots where model assumptions or the underlying LULCC forcing are inconsistent with the observed carbon dynamics (see paragraph on “Uncertainties” in Methods).

A major advantage of our framework is that it can be extended flexibly to updated datasets and can constantly be improved with more observational datasets being made available.

There are various data additions to our approach that we would consider to be especially valuable for improving model-data assimilated estimates of global terrestrial carbon cycle dynamics. First, the inclusion of a time series of observational estimates of carbon contained in non-woody vegetation and soils would be a very valuable addition to our approach, as recent studies suggest a major contribution of soils to the IAV of the terrestrial carbon sink in transitional regimes^26,40. Second, extending the time series of carbon stocks to sub-annual time scales would enable a more detailed analysis of the intra-annual response of the terrestrial carbon cycle to extreme events, which has been shown to be highly variable in space and time³⁴. The need for observation-based estimates of terrestrial carbon stocks that are consistent in space and time is acknowledged by a wider community and steps towards this goal are currently being undertaken through various recently launched (Global Ecosystem Dynamics Investigation, ECOsystem Spaceborne Thermal Radiometer Experiment on Space Station)^41,42 and upcoming satellite missions (e.g., Geostationary carbon cycle observatory, BIOMASS)^11,43, which are dedicated to measuring the Earth’s vegetation properties. Combining those measurements with ground-based observations and models will be a major contribution towards more reliable estimates of environmental and anthropogenic CO₂ fluxes, which are independent of national GHG inventories. This is crucial for monitoring country-level emission commitments according to the 2015 Paris agreement and for future climate change mitigation and adaptation strategies.

Methods

External datasets

Woody biomass carbon data

The dataset by ref. 16 maps annual global woody biomass carbon densities for 2000–2019 at a spatial resolution of ~10 km. The annual estimates represent averages for the tropical regions and growing-season (April–October) averages for the extra-tropical regions. Ref. 16 analyse global trends of gains and losses in woody biomass carbon for 2000–2019. Overall, they find that grid cells with (significant) net gains of vegetation carbon are by a factor of 1.4 more abundant than grid cells with net losses of vegetation carbon, indicating that there is a global greening trend when only considering the areal extent of biomass gains and not the magnitude of carbon gains. Their regionally distinct analysis of trends shows that almost all regions, except for the tropical moist forests in South America and parts of Southeast Asia, experienced net gains in biomass carbon. On the country scale, the largest net increase in biomass carbon is shown in China, which is mainly attributed to the large-scale afforestation programs in the southern part of the country and increased carbon uptake of established forests. On the other hand, the largest vegetation carbon losses are shown for Brazil and Indonesia, which is partly attributed to deforestation, degradation, and drought events. All of the mentioned trends have been found to be significant¹⁶. The decreasing carbon sink in Brazil is in line with ref. 44, who, considering both natural and anthropogenic fluxes, show that the southeastern Amazon has even turned from a carbon sink to a carbon source, mainly owing to fire emissions from forest clearing. Isolating carbon fluxes in intact, old-growth Amazonian rainforests (i.e., S_LAND,B), ref. 45 also find evidence for a significantly decreasing carbon sink due to the negative effects of increasing temperatures and droughts on carbon uptake since the 1990s.

The dataset was remapped to the BLUE resolution of 0.25^∘ through conservative remapping (i.e., area-weighted averaging).

ERA-5 data

The ERA-5 variables were downloaded from the Copernicus Climate Data Store (https://cds.climate.copernicus.eu/cdsapp#!/home). Monthly air temperature (T_a) at 2 m height was averaged over each year, and annual precipitation was calculated by taking the sum of the monthly total precipitation (P). Both variables were regridded from the original resolution of ~0.1° to 0.25° resp. to the TRENDY resolution of 0.5° through conservative remapping.

TRENDY data

We used the TRENDY model ensemble version 8 (conducted for the 2019 GCB; ref. 8). We used net biome production (NBP) and annual vegetation carbon stocks (cVeg) for 2000–2018 from four different model setups (S2, S3, S5, and S6) and eight resp. 13 DGVMs (depending on the data available). The selection of DGVMs is done as in ref. 19 (Supplementary Tab. 3), but we included one additional model (ISAM) for the S2 simulations. The terrestrial biomass carbon sink (S_LAND,B) was calculated for 13 DGVMs following the GCB 2020 approach, i.e., from the S2 simulation, which is the simulation without LULCC (i.e., fixed pre-industrial land cover) under transient environmental conditions (climate, nitrogen deposition, CO₂ evolution). S_LAND,B is the annual difference of cVeg and makes no statements about the further fate of biomass if cVeg decreases. S_LAND,B, therefore, should not be interpreted as equivalent to the flux to/from the atmosphere, since parts of cVeg may be transferred to litter, dead wood, or soil. The same applies to our BLUE estimates of S_LAND,B, ensuring comparability between our BLUE estimates and the TRENDY estimates. Increases (decreases) of cVeg between two years are a net uptake (release) of carbon from the terrestrial biosphere. The global sums of biomass carbon stocks under transient climate and CO₂ were calculated from the S3 setup (LULCC under historical environmental conditions), whereas the S5 setup provides biomass carbon under constant present-day environmental forcing (closest to the classical bookkeeping approach). In line with the GCB, E_LUC was calculated under historical environmental conditions as the difference in NBP between the S2 and S3 simulations (E_LUC = NBP_S2 - NBP_S3). E_LUC under constant present-day environmental forcing was calculated as the difference in NBP between the S6 (fixed pre-industrial land cover under present-day environmental forcing) and S5 simulations (E_LUC = NBP_S6 - NBP_S5)¹⁹. All datasets were remapped to a common resolution of 0.5^∘ through conservative remapping (area-weighted average) for the data analysis.

Assimilation of observed woody biomass carbon in BLUE

The observed woody biomass carbon densities by ref. 16 are assimilated in BLUE in several steps.

Carbon transfer in the default setup of BLUE

The BLUE simulation is started in AD 850. Biomass and soil vegetation carbon densities are based on ref. 17 (see ref. 5 for details). These carbon densities are specific for eleven natural PFTs (Supplementary Fig. 3), which are assigned one of four land cover types (primary land, secondary land, cropland or pasture). The LULCC forcing is based on the LUH2 dataset¹⁸, defining the vegetated fractional area of each grid cell that is affected by a land-use transition. Each transition may lead to a change from one land cover type (=source land cover type: j) to another land cover type (=target land cover type: $j^{\prime}$). In the case of wood harvesting on secondary land, $j=j^{\prime}$, whereas all other transition types (e.g., clearing for agricultural expansion, abandonment of agricultural lands) induce a change in land cover. The fractional grid cell areas undergoing transitions are further distributed across PFTs proportionally to the temporally constant PFT area fractions (Supplementary Fig. 3). Upon each land-use transition, biomass carbon is transferred between the source land cover type and the target land cover type, whereby the amount of transferred carbon depends on the biomass carbon density of the source and target land cover types (in the respective PFT) and the area affected by the transition (in the respective PFT). The temporal evolution of the biomass carbon pool after any type of land-use transition is approximated by an exponential function with different time constants for decay and regrowth, depending on the type of land-use transition. The time constants are based on linear estimates by ref. 17, which are converted to exponential time constants. A detailed explanation of the exponential model can be found in ref. 5.

While in the default setup, changes are only due to LULCC, our assimilation approach now introduces environmental effects on woody vegetation carbon by assimilating the observed woody biomass carbon densities in BLUE from 2000 onward according to the methodological considerations explained below.

Calculation of woody biomass carbon densities for different land cover types and PFTs

Within each 0.25° cell of the global grid, the (remapped) woody biomass carbon density from ref. 16 must be the sum of woody biomass carbon stored in all woody PFTs of all woody land cover types. The distribution of the woody biomass carbon across PFTs and land cover types is achieved by distributing the observed (i.e., actual) woody biomass carbon densities (ρ_Ba) from ref. 16 across the two land cover types (j) and the eight PFTs (l) that can be woody vegetation (primary land, called virgin, “v” in BLUE and secondary, “s”, land) according to the fraction of total woody biomass carbon (f_B) contained in each land cover type and each PFT (f_B,j,l) as estimated by BLUE. f_B,j,l varies for different PFTs and land cover types, depending on their history of LULCC and their potential for carbon uptake (i.e., the potential carbon densities).

f_B,j,l is extracted from the default simulations for the first year of the time series (i.e., 2000) and calculated for subsequent years from the BLUE simulations using the assimilated woody vegetation carbon densities for that year:

$${f}_{B,j,l}(t)=\frac{{C}_{B,j,l}(t)}{{C}_{B}(t)}$$

(1)

where C_B is the woody biomass carbon stock.

Consequently, the assimilated woody biomass carbon stock per cover type and PFT (C_{B_as,j,l}) at each time step can be calculated as:

$${C}_{B\_as,j,l}(t)={\rho }_{Ba}(t)\;*\;A\;*\;{f}_{B,j,l}(t)$$

(2)

with j{v, s}; l{1. . 8}; t{2000. . 2019}. A is the area per grid cell.

Thresholds for excluding inconsistent woody biomass carbon densities

We eliminate unrealistically large values for woody biomass carbon densities that our assimilation framework produces. Woody biomass carbon densities in BLUE that exceed the highest value (~374 t ha⁻¹) of the original dataset indicate inconsistencies between the observed woody biomass carbon estimates and the fractional grid cell areas per PFT and land cover types that BLUE simulates. To account for uncertainties related to the criteria for exclusion of grid cells, multiple threshold approaches are applied and the results are compared. To maintain a temporally and spatially consistent time series of woody biomass carbon, grid cells that are excluded according to the chosen threshold approach are interpolated through linear barycentric interpolation. A first approach relies on a uniform upper threshold of <375 t ha⁻¹ for woody biomass carbon densities. This approach leads to the exclusion of ~3% of all grid cells, but is considered conservative in the sense that it may lead to an overestimation of woody biomass carbon densities of non-forested land, since it is expected that the maximum value of ~374 t ha⁻¹ occurs in heavily forested grid cells only. To account for this potential overestimation, additional threshold approaches are applied by cutting the distribution of grid cells with woody biomass carbon densities smaller than 375 t ha⁻¹ to a range of specific percentiles and choosing the values corresponding to each percentile as upper thresholds for the exclusion of further grid cells. In the first step, we choose the 97th, 98th, and 99th percentiles and evaluate the resulting dynamics of total vegetation carbon in terms of their agreement with the original dataset by ref. 16. The evaluation is done by analyzing the results from each percentile threshold approach in terms of the global dynamics of the biomass carbon stocks in comparison to the estimates from ref. 16 (see Supplementary material). This analysis reveals that the annual dynamics (i.e., increase/decrease) of the woody biomass carbon stocks start to diverge strongly from the original time series for thresholds smaller than the 99th percentile. This is related to an enhanced loss of spatial and temporal variability of the assimilated biomass carbon stocks due to an increased number of interpolated grid cells with smaller percentile thresholds. Consequently, we choose the two approaches with (1) <375 t ha⁻¹ and (2) <99th percentile of 375 t ha⁻¹ as upper limits for the exclusion of inconsistent biomass carbon densities and use their average, unless indicated otherwise. Both threshold approaches are applied to each woody PFT and the two woody land cover types separately over the whole time series (2000–2019). Consequently, it is ensured that differences in carbon storage potential between different PFTs and land cover types are considered within the percentile threshold approach.

Model initialization

In our transient woody biomass carbon approach, we need to initialize the woody biomass pools in BLUE at each time step (i.e., each year) to account for changes in biomass carbon densities due to environmental processes. As we do not assimilate soil carbon densities in our approach, the soil carbon pools are initialized once at the beginning of the BLUE simulations (described below) and subsequently only altered by LULCC. The re-initialization for the woody biomass pools at each time step is necessary, as BLUE only explicitly simulates annual changes in biomass carbon densities due to LULCC. In the default approach, total biomass carbon is partitioned between equilibrium pools (${\bar{C}}_{B,j,k,l}$) and excess pools (δ_B,j,k,l). The former mark the carbon stock that the biomass pools strive to reach (i.e., the carbon stock that the PFT and land cover type would reach after a sufficiently long time after a land-use disturbance), while the latter indicate whether the current biomass carbon stock is in equilibrium (δ_B,j,k,l = 0) or in excess of equilibrium (δ_B,j,k,l ≠ 0) (Note that k is the (land-use) history type, including clearing (“l”), harvest (“h”), abandonment (“a”), other (“g”)). An in-depth explanation of the different pool types in BLUE can be found in the original documentation by ref. 5. Biomass carbon is assumed to be in equilibrium upon model initialization, i.e., the equilibrium pools contain all biomass carbon and the excess pools are zero. Upon each land-use transition, the equilibrium and excess biomass carbon pools are altered, depending on the transition type.

In our approach, the model initialization is done by distributing the assimilated woody biomass carbon among the equilibrium biomass pools for all woody PFTs and all land cover types (see Supplementary materials for the handling of non-woody land cover types). This means that the equilibrium biomass pools and all excess biomass pools are then re-initialized at each time step (annually) of the simulation from 2000 onward. The excess carbon pools are changed upon each land-use transition, whereby the spatially explicit actual woody biomass carbon densities derived from ref. 16 replace the woody biomass carbon densities based on ref. 17 from 2000 onward. The actual woody biomass carbon densities from ref. 16 are assimilated in BLUE at the beginning of each year X and subsequently altered by the land-use transitions in year X. Consequently, the BLUE output of carbon stocks for year X represents the end of year X and changes in carbon stocks between year X+1 and year X are attributed to year X+1. Legacy fluxes (i.e., carbon fluxes from land-use that do not occur in the same time step as the corresponding land-use event) are tracked according to the approach explained below.

Handling of legacy carbon fluxes

Due to repeated initialization of the (equilibrium and excess) biomass carbon pools at each time step, legacy fluxes are not accounted for and need to be tracked separately. Such legacy fluxes from/to the atmosphere to/from the terrestrial woody biomass occur due to LULCC prior to the current time step, e.g., because the forest regrows slowly or because cleared biomass decomposes slowly on site or in products. We track these legacy fluxes separately for those from the LULCC prior to the assimilation period (2000–2019), and those occurring during the assimilation period, which are caused by the LULCC transitions and other biomass changes. To track the former, we introduce an additional set of excess pools δ_B,leg<2000 that include all excess woody biomass carbon from land-use transitions prior to 2000 upon initialization of the actual woody biomass carbon pools in 2000. Legacy carbon fluxes from land-use transitions prior to 2000 (θ_{B,leg<2000,j,k,l}) are calculated as in the default approach (Note: carbon fluxes from LULCC in time step (t) from/to the land to/from the atmosphere are realized at the beginning of time step (t+1) in BLUE):

$${\theta }_{B,leg\,{ < }\,2000,j,k,l}(t)={\delta }_{B,leg\,{ < }\,2000,j,k,l}(t-1)-{\delta }_{B,leg\,{ < }\,2000,j,k,l}(t-1)\;*\;{{{{{{{{\rm{e}}}}}}}}}^{\frac{-1}{{\tau }_{{{{{{{{\rm{B}}}}}}}},{{{{{{{\rm{j}}}}}}}},{{{{{{{\rm{k}}}}}}}},{{{{{{{\rm{l}}}}}}}}}}}$$

(3)

with j{v, s, p, c}; l{1. . 8}; t{2000. . 2019}; k{l,h,a,g}. τ is the time constant for relaxation processes, which varies for different pool types (biomass or soil), land cover types, and PFTs.

Excess woody biomass carbon from transitions from 2000 onward is tracked in another set of separate pools (δ_B,leg≥2000) to account for ≥2000 legacy fluxes. δ_B,leg≥2000 is adjusted at the beginning of each time step for all excess woody biomass carbon from the previous time step minus fluxes to/from the atmosphere (Eq. (4a)) (θ_{B,leg≥2000,j,k,l}) from relaxation processes in the respective time step (Eq. (4b)):

$${\delta }_{B,leg\ge 2000,j,k,l}(t)={\delta }_{B,leg\ge 2000,j,k,l}(t-1)+{\delta }_{B,j,k,l}(t-1)-{\theta }_{B,leg\ge 2000,j,k,l}(t)$$

(4a)

$${\theta }_{B,leg\ge 2000,j,k,l}(t)={\delta }_{B,leg\ge 2000,j,k,l}(t-1)-{\delta }_{B,leg\ge 2000,j,k,l}(t-1)\;*\;{{{{{{{\rm{{e}}}}}}}^{\frac{-1}{{\tau }_{B,j,k,l}}}}}$$

(4b)

Carbon fluxes between the terrestrial woody biomass pool and the atmosphere pool at each time step, including all legacy fluxes (θ_B,j,k,l), can then be calculated as the sum of instantaneous carbon fluxes at the current time step (resulting from LULCC in the previous time step), legacy carbon fluxes prior to 2000 and legacy carbon fluxes from 2000 onward, but prior to the current time step (resulting from LULCC prior to the previous time step).

$${\theta }_{B,j,k,l}(t)= {\delta }_{B,j,k,l}(t-1)-{\delta }_{B,j,k,l}(t-1)\;*\;{{{{{{{{\rm{e}}}}}}}}}^{\frac{-1}{{\tau }_{{{{{{{{\rm{B}}}}}}}},{{{{{{{\rm{j}}}}}}}},{{{{{{{\rm{k}}}}}}}},{{{{{{{\rm{l}}}}}}}}}}}\\ + {\theta }_{B,leg\,{ < }\,2000,j,k,l}(t)+{\theta }_{B,leg\ge 2000,j,k,l}(t)$$

(5)

Derivation of the terrestrial woody biomass carbon sink

To isolate anthropogenic from environmental (=S_LAND,B) carbon fluxes from woody vegetation, we performed two simulation setups based on different approaches for assimilating the observed woody biomass carbon densities. The biomass estimate by ref. 16 includes carbon stored in living woody vegetation (trees and shrubs), whereas carbon stored in dead plant material (litter, harvested wood products) is not included in the estimate. Consequently, the change in woody biomass carbon stocks within a certain time step results from carbon sources and sinks driven by LULCC in the respective time step, from carbon sinks due to regrowth of vegetation driven by past LULCC (i.e., prior to the respective time step) and from all environmental processes (on managed and unmanaged lands) in the respective time step:

$${{\Delta }}C(t)={{\Delta }}{C}_{{{{{{\mathrm{source}}}}}}}(t)+{{\Delta }}{C}_{{{{{{\mathrm{sink}}}}}}}(t)+{{\Delta }}{C}_{{{{{{\mathrm{reg}}}}}}\_{{{{{\mathrm{leg}}}}}}}(t)+{{\Delta }}{C}_{{{{{{\mathrm{source}}}}}},{{{{{\mathrm{env}}}}}}}(t)+{{\Delta }}{C}_{{{{{{\mathrm{sink}}}}}},{{{{{\mathrm{env}}}}}}}(t)$$

(6)

where ΔC_source resp. ΔC_sink are sources resp. sinks of biomass carbon due to LULCC in the current time step, ΔC_{reg_leg} are sinks of biomass carbon due to regrowth of vegetation from LULCC prior to the current time step and ΔC_source,env resp. ΔC_sink,env are sources resp. sinks of biomass carbon due to environmental processes in the current time step. We performed additional BLUE simulations with fixed (i.e., stationary in time) woody biomass carbon densities to split the carbon fluxes from woody vegetation into the anthropogenic and environmental terms of Eq. (6). The fixed woody biomass carbon setup is based on the 2000 estimates derived from ref. 16. As in the transient simulations, model initialization is done from the same state of woody biomass carbon in 2000 (i.e., anthropogenic and environmental effects on woody biomass carbon prior to 2000 are implicitly captured), but changes in woody biomass carbon in the subsequent years are only driven by LULCC in the fixed setup. In the fixed woody biomass carbon simulations, there is no need for separately tracking legacy fluxes from 2000 onward, since the biomass carbon pools are only initialized in 2000 and the excess pools are altered subsequently without re-initialization. Legacy carbon fluxes from land-use transitions prior to 2000 are considered following the same approach as in the transient woody biomass carbon density setup. The terms of Eq. (6) that capture environmental changes in biomass carbon stocks are only included in the BLUE simulations with transient biomass carbon, whereas biomass carbon changes driven by land-use change are captured in both the transient and fixed biomass carbon simulations. Consequently, our BLUE simulations allow us to isolate all environmental effects on woody biomass carbon by taking the difference in woody biomass carbon stocks between the two BLUE simulation setups:

$${S}_{{{{{{\mathrm{LAND}}}}}},B}(t)={{\Delta }}{C}_{{{{{{\mathrm{trans}}}}}},B}(t)-{{\Delta }}{C}_{{{{{{\mathrm{fix}}}}}},B}(t)$$

(7)

This term, the natural carbon sink in terrestrial woody vegetation, represents the net effect of environmental processes on managed and unmanaged lands on the terrestrial woody living vegetation.

Uncertainties

The main sources of uncertainty that affect the results from our biomass assimilation approach are (1) model assumptions regarding the global LULCC dynamics and the rates of vegetation regrowth, (2) potential misattributions of anthropogenic fluxes as natural fluxes owing to incomplete data on LULCC, and (3) uncertainties within the original time series of woody biomass carbon densities by ref. 16. We analyse the different sources of uncertainty as described in the following.

(1) The difference between the observed woody vegetation carbon stocks from ref. 16 and the woody vegetation carbon stocks at the beginning of each time step in the transient BLUE setup (=“assimilated woody biomass carbon”) can be used to evaluate the LULCC forcing and the PFT distribution in BLUE. Since the observed and the assimilated woody vegetation carbon time series are not independent of each other, the comparison solely aims at identifying potential model uncertainties. The observed woody vegetation carbon densities are assimilated into BLUE at each time step according to the spatial distribution of the land cover types and PFTs (see Eq. (2)). Consequently, a larger difference between the observed woody vegetation carbon stocks and the assimilated woody vegetation carbon stocks would indicate that the actual LULCC dynamics and/or the spatial distribution of PFTs are not captured well in BLUE. We call this difference the “biomass assimilation bias”. The average global biomass assimilation bias (±1 SD) amounts to 29 ± 4 PgC between 2000 and 2019. The agreement between the observational dataset by ref. 16 and the assimilated woody biomass carbon in terms of the trends in global biomass carbon is quantified as the number of years that show the same trend in both datasets related to the previous year divided by the total number of years. Following this definition, a temporal agreement of 100% would mean that the observed dataset and the assimilated dataset show the same trend in biomass carbon for all years. The regionally averaged agreement in the estimated biomass carbon trends is generally high (>80%) (Supplementary Fig. 8) but smaller in regions with strong LULCC dynamics (tropics and Europe). Some local hotspots exist (Fig. 3), where differences between the observed dataset and the assimilated dataset are larger. These are mainly located in South- and Southeast Asia, Europe, and Equatorial Africa (Fig. 3a), where clearing and wood harvesting rates of the forest as prescribed in the LULCC forcing (Fig. 3b) are very high, leading to much lower biomass carbon estimates in our assimilated woody biomass carbon estimates than in the observed time series. This suggests that the clearing and/or wood harvesting rates are overestimated in the LULCC forcing and/or the rate of vegetation regrowth is underestimated in BLUE, leading to a high biomass assimilation bias for the mentioned regions, which further affects E_LUC (see Results).

We further assessed the validity of our S_LAND,B estimates in terms of the high IAV shown in Canada, Russia, Brazil, and Europe. The comparison of our time series of assimilated woody biomass carbon to the original time series by ref. 16 shows that our assimilated dataset is very close to the original dataset in the respective regions and that the high IAV is also shown in the original time series¹⁶. Consequently, we conclude that the high IAV is not introduced by uncertainties in our model-data integration. Nevertheless, we acknowledge that the estimated IAV in NAM (especially Canada) may seem high compared to the IAV of carbon fluxes due to natural disturbances estimated by the National Inventory Report of Canada⁴⁶ or to specific disturbance events, such as the mountain pine beetle outbreak in British Columbia in the early 2000s⁴⁷. However, our estimated IAV of S_LAND,B of up to 2 PgC yr⁻¹ in NAM is not inconsistent with previous studies estimating annual changes of the total land sink⁴⁸ resp. the natural land sink^49,50 of up to 3 PgC yr⁻¹, including a switch in sign of the flux. Furthermore, ref. 51 combine atmospheric CO₂ measurements with inverse modelling and show that the North American net ecosystem exchange (NEE) between 2007 and 2015 and its variability was strongly driven by el Niño (more than average carbon uptake) and la Niña (less than average carbon uptake) conditions, with monthly anomalies (related to the mean for 2007–2015) of up to ±1.5 PgC yr⁻¹. They further find that the boreal coniferous forest is among the ecosystems with the largest difference in NEE anomalies between el Niño and la Niña periods. Our S_LAND,B estimates broadly follow the dynamics described by ref. 51, with higher than average carbon uptakes during el Niño conditions in 2010 and 2015 and lower than average carbon uptakes resp. carbon releases during la Niña conditions in 2011. Furthermore, we wish to clarify that we assume that the magnitude of fluxes to/from the atmosphere from/to the biomass is smaller than our estimated changes in S_LAND,B, since we include depositions to dead matter and soil in our estimates. According to the global carbon budget decomposition by ref. 49, around 50% of annual gross anthropogenic emissions (including natural fluxes on managed lands, as described in ref. 52) between 2000 and 2015 was due to direct emissions (i.e. in the year of the respective disturbance), while the rest was attributable to legacy fluxes (onsite decomposition and wood product degradation). Assuming a similar dynamic for (solely) natural disturbances, we expect that the magnitude of annual carbon fluxes to/from the atmosphere from/to the biosphere is lower - depending on the degree and type of disturbance - than the annual changes in S_LAND,B. The mentioned considerations highlight the need for future, independent estimates – especially on a regional scale – to foster our understanding of the IAV of terrestrial carbon fluxes. Furthermore, ref. 16 compared their estimated IAV of woody biomass carbon with FAO estimates of forest carbon, showing that there is no systematic overestimation of IAV in the mentioned regions.

(2) A general shortcoming of all accounting approaches based on observational datasets is their inability to capture all anthropogenic activities related to LULCC. Consequently, there might be anthropogenic carbon fluxes that are classified as environmental fluxes in our approach, simply because the LULCC forcing is not capturing the underlying anthropogenic activities completely. This caveat applies foremost to certain types of anthropogenic degradation: while the LULCC forcing covers logging (implemented as wood harvesting in BLUE) and rangeland degradation, it does not account for degradation caused by anthropogenic fires, which might lead to misattributions of the related fluxes towards S_LAND,B. However, there is currently no (global) dataset available that separates anthropogenic and natural degradation fires and that would allow us to provide an uncertainty estimate for the misattributed fluxes.

(3) Reference 16 defines errors in the order of ±0.5% (2 PgC) related to the 2000–2019 average global sum of carbon contained in woody vegetation (381 PgC). The error estimate includes pixel-level uncertainty and modeling uncertainty from parameter estimation. The global uncertainty range of ±0.5% is considered in all of our aggregated global estimates of woody biomass carbon. This means that in Table 1, the uncertainty range of ±2 PgC for the global living biomass (i.e., woody plus herbaceous vegetation) carbon stocks refers to the woody vegetation estimate only (357 PgC).

Global maps of absolute and relative pixel-level uncertainty (Supplementary Fig. 9) are provided and can be used as a reference to evaluate the accuracy of our estimates for different regions.

Data availability

The E_LUC data, S_LAND,B data, correlations between biomass carbon anomalies and climate anomalies, and the uncertainty data generated in this study have been deposited in World Data Centre for Climate (WDCC) database provided by the German Climate Computing Center (DKRZ: Deutsches Klimarechenzentrum GmbH) under https://doi.org/10.26050/WDCC/MoDataInToTr21stCLandFl. The LUH2 data were available at https://luh.umd.edu/data.shtml.

The woody biomass carbon time series by ref. 16 is available at https://doi.org/10.5281/zenodo.4161694.

The TRENDY v8 ensemble of simulation outputs is available upon request at https://sites.exeter.ac.uk/trendy.

Code availability

The code for the analysis in this paper is available on request to the corresponding author.

References

Friedlingstein, P. et al. Global carbon budget 2021. Earth Syst. Sci. Data 14, 1917–2005 (2022).
Article ADS Google Scholar
Walker, A. P. et al. Integrating the evidence for a terrestrial carbon sink caused by increasing atmospheric co2. N. Phytol. 229, 2413–2445 (2021).
Article CAS Google Scholar
Schwalm, C. R. et al. Reduction in carbon uptake during turn of the century drought in western North America. Nat. Geosci. 5, 551–556 (2012).
Article ADS CAS Google Scholar
Huntzinger, D. N. et al. Uncertainty in the response of terrestrial carbon sink to environmental drivers undermines carbon-climate feedback predictions. Sci. Rep. 7, 4765 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Hansis, E., Davis, S. J. & Pongratz, J. Relevance of methodological choices for accounting of land use change carbon fluxes. Glob. Biogeochem. Cycles 29, 1230–1246 (2015).
Article ADS CAS Google Scholar
Bastos, A. et al. Comparison of uncertainties in land-use change fluxes from bookkeeping model parameterisation. Earth Syst. Dyn. 12, 745–762 (2021).
Article ADS Google Scholar
Friedlingstein, P. et al. Global carbon budget 2020. Earth Syst. Sci. Data 12, 3269–3340 (2020).
Article ADS Google Scholar
Friedlingstein, P. et al. Global carbon budget 2019. Earth Syst. Sci. Data 11, 1783–1838 (2019).
Article ADS Google Scholar
Saatchi, S. S. et al. Benchmark map of forest carbon stocks in tropical regions across three continents. Proc. Natl Acad. Sci. USA 108, 9899–904 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Baccini, A. et al. Estimated carbon dioxide emissions from tropical deforestation improved by carbon-density maps. Nat. Clim. Change 2, 182–185 (2012).
Article ADS CAS Google Scholar
ESA. https://www.esa.int/Applications/Observing_the_Earth/FutureEO/Biomass (2021).
He, H. et al. Reference carbon cycle dataset for typical Chinese forests via colocated observations and data assimilation. Sci. Data 8, 42 (2021).
Article CAS PubMed PubMed Central Google Scholar
Raczka, B. et al. Improving clm5.0 biomass and carbon exchange across the western united states using a data assimilation system. J. Adv. Model. Earth Syst. 13, e2020MS002421 (2021).
Article ADS PubMed PubMed Central Google Scholar
Pongratz, J. et al. Land use effects on climate: current state, recent progress, and emerging topics. Curr. Clim. Change Rep. 7, 99–120 (2021).
Article Google Scholar
Davis, S. J., Burney, J. A., Pongratz, J. & Caldeira, K. Methods for attributing land-use emissions to products. Carbon Manag. 5, 233–245 (2014).
Article CAS Google Scholar
Xu, L. et al. Changes in global terrestrial live biomass over the 21st century. Sci. Adv. 7, eabe9829 (2021).
Article ADS CAS PubMed Google Scholar
Houghton, R. A. et al. Changes in the carbon content of terrestrial biota and soils between 1860 and 1980: A net release of co"2 to the atmosphere. Ecol. Monogr. 53, 235–262 (1983).
Article CAS Google Scholar
Hurtt, G. C. et al. Harmonization of global land use change and management for the period 850-2100 (luh2) for cmip6. Geosci. Model Dev. 13, 5425–5464 (2020).
Article ADS CAS Google Scholar
Obermeier, W. A. et al. Modelled land use and land cover change emissions - a spatio-temporal comparison of different approaches. Earth Syst. Dyn. 12, 635–670 (2021).
Article ADS Google Scholar
Gasser, T. et al. Historical co₂ emissions from land use and land cover change and their uncertainty. Biogeosciences 17, 4075–4101 (2020).
Article ADS CAS Google Scholar
Houghton, R. A. & Nassikas, A. A. Global and regional fluxes of carbon from land use and land cover change 1850-2015. Glob. Biogeochem. Cycles 31, 456–472 (2017).
Article ADS CAS Google Scholar
Tagesson, T. et al. Recent divergence in the contributions of tropical and boreal forests to the terrestrial carbon sink. Nat. Ecol. Evol. 4, 202–209 (2020).
Article PubMed Google Scholar
Erb, K. H. et al. Unexpectedly large impact of forest management and grazing on global vegetation biomass. Nature 553, 73–76 (2018).
Article ADS CAS PubMed Google Scholar
Spawn, S. A., Sullivan, C. C., Lark, T. J. & Gibbs, H. K. Harmonized global maps of above and belowground biomass carbon density in the year 2010. Sci. Data 7, 112 (2020).
Article PubMed PubMed Central Google Scholar
Ahlstrom, A. et al. Carbon cycle. the dominant role of semi-arid ecosystems in the trend and variability of the land co(2) sink. Science 348, 895–9 (2015).
Article ADS PubMed Google Scholar
Humphrey, V. et al. Soil moisture-atmosphere feedback dominates land carbon uptake variability. Nature 592, 65–69 (2021).
Article CAS PubMed PubMed Central Google Scholar
Teckentrup, L. et al. Assessing the representation of the Australian carbon cycle in global vegetation models. Biogeosciences 18, 5639–5668 (2021).
Hersbach, H. et al. The era5 global reanalysis. Q. J. R. Meteorological Soc. 146, 1999–2049 (2020).
Article ADS Google Scholar
Madani, N. et al. Recent amplified global gross primary productivity due to temperature increase is offset by reduced productivity due to water constraints. AGU Adv. 1, e2020AV000180 (2020).
Article ADS Google Scholar
D’Orangeville, L. et al. Northeastern North America as a potential refugium for boreal forests in a warming climate. Science 352, 1452–5 (2016).
Article ADS PubMed Google Scholar
Sulla-Menashe, D., Woodcock, C. E. & Friedl, M. A. Canadian boreal forest greening and browning trends: an analysis of biogeographic patterns and the relative roles of disturbance versus climate drivers. Environ. Res. Lett. 13, 014007 (2018).
Article ADS Google Scholar
Wang, X. et al. Soil respiration under climate warming: differential response of heterotrophic and autotrophic respiration. Glob. Chang Biol. 20, 3229–37 (2014).
Article ADS PubMed Google Scholar
Yang, H. et al. Comparison of forest above-ground biomass from dynamic global vegetation models with spatially explicit remotely sensed observation-based estimates. Glob. Chang. Biol. 26, 3997–4012 (2020).
Article ADS PubMed Google Scholar
Bastos, A. et al. Impact of the 2015/2016 el nino on the terrestrial carbon cycle constrained by bottom-up and top-down approaches. Philos. Trans. R. Soc. Lond. B Biol. Sci. 373, 20170304 (2018).
Article PubMed PubMed Central Google Scholar
Hanel, M. et al. Revisiting the recent European droughts from a long-term perspective. Sci. Rep. 8, 9499 (2018).
Article ADS PubMed PubMed Central Google Scholar
Kolus, H. R. et al. Land carbon models underestimate the severity and duration of drought’s impact on plant productivity. Sci. Rep. 9, 2758 (2019).
Article ADS PubMed PubMed Central Google Scholar
Powell, T. L. et al. Confronting model predictions of carbon fluxes with measurements of Amazon forests subjected to experimental drought. N. Phytol. 200, 350–365 (2013).
Article Google Scholar
Schewe, J. et al. State-of-the-art global models underestimate impacts from climate extremes. Nat. Commun. 10, 1005 (2019).
Article ADS PubMed PubMed Central Google Scholar
Qin, Y. et al. Carbon loss from forest degradation exceeds that from deforestation in the Brazilian amazon. Nat. Clim. Change 11, 442–448 (2021).
Article ADS Google Scholar
Chang, J. et al. Climate warming from managed grasslands cancels the cooling effect of carbon sinks in sparsely grazed and natural grasslands. Nat. Commun. 12, 118 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
NASA. https://gedi.umd.edu (2021).
NASA https://ecostress.jpl.nasa.gov (2021).
Moore Iii, B. et al. The potential of the geostationary carbon cycle observatory (geocarb) to provide multi-scale constraints on the carbon cycle in the Americas. Front. Environ. Sci. 6, (2018).
Gatti, L. V. et al. Amazonia as a carbon source linked to deforestation and climate change. Nature 595, 388–393 (2021).
Article ADS CAS PubMed Google Scholar
Hubau, W. et al. Asynchronous carbon sink saturation in African and amazonian tropical forests. Nature 579, 80–87 (2020).
Article ADS CAS PubMed Google Scholar
Kurz, W. A. et al. Quantifying the impacts of human activities on reported greenhouse gas emissions and removals in Canada’s managed forest: conceptual framework and implementation. Can. J. For. Res. 48, 1227–1240 (2018).
Kurz, W. A. et al. Mountain pine beetle and forest carbon feedback to climate change. Nature 452, 987–990 (2008).
Rödenbeck, C., Zaehle, S., Keeling, R. & Heimann, M. History of El Niño impacts on the global carbon cycle 1957-2017: a quantification from atmospheric CO₂ data. Philos Trans R Soc Lond B Biol Sci. 373, 20170303 (2018).
Yue, C., Ciais, P., Houghton, R. A. & Nassikas, A. A. Contribution of land use to the interannual variability of the land carbon cycle. Nat Commun. 11, 3170 (2020).
Lei, Z. et al. Decadal variability in land carbon sink efficiency. Carbon Balance Manag. 16, 15 (2021).
Lei, H. et al. Enhanced North American carbon uptake associated with El Niño. Sci Adv. 5, eaaw0076 (2019).
Loughran, T. F. et al. Past and Future Climate Variability Uncertainties in the Global Carbon Budget Using the MPI Grand Ensemble. Global Biogeochemical Cycles. 35, https://doi.org/10.1029/2021GB007019 (2021).
Yi, Y. et al. Recent reversal in loss of global terrestrial biomass. Nature Climate Change. 5, 470–474 (2015).

Download references

Acknowledgements

S.B. acknowledges support from the German Stifterverband für die Deutsche Wissenschaft e.V. in collaboration with Volkswagen AG (High-resolution monitoring of avoided carbon emissions and carbon restoration potentials from land use change). R.G. was supported by the European Commission through Horizon 2020 Framework Program (VERIFY, Grant No. 776810). L.X. and S.S. acknowledge support from the NASA Carbon Monitory System program (20-CMS20-0026). We acknowledge the TRENDY project and participating modeling groups for the DGVM data. This work used resources of the Deutsches Klimarechenzentrum (DKRZ) granted by its Scientific Steering Committee (WLA) under Project ID bm1240.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Kerstin Hartung
Present address: Deutsches Zentrum für Luft- und Raumfahrt, Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany
Liang Xu
Present address: Pachama Inc., San Francisco, CA, USA

Authors and Affiliations

Department of Geography, Ludwig-Maximilians-Universität, Munich, Germany
Selma Bultan, Kerstin Hartung, Raphael Ganzenmüller & Julia Pongratz
Max Planck Institute for Meteorology, Hamburg, Germany
Julia E. M. S. Nabel & Julia Pongratz
Max Planck Institute for Biogeochemistry, Jena, Germany
Julia E. M. S. Nabel
Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
Liang Xu & Sassan Saatchi

Authors

Selma Bultan
View author publications
You can also search for this author in PubMed Google Scholar
Julia E. M. S. Nabel
View author publications
You can also search for this author in PubMed Google Scholar
Kerstin Hartung
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Ganzenmüller
View author publications
You can also search for this author in PubMed Google Scholar
Liang Xu
View author publications
You can also search for this author in PubMed Google Scholar
Sassan Saatchi
View author publications
You can also search for this author in PubMed Google Scholar
Julia Pongratz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.P. and S.B. designed the study. L.X. and S.S. provided the time series of woody biomass carbon densities. S.B. conducted the model-data integration with input from J.P., J.E.M.S.N., and K.H. S.B. carried out the data analysis with additional input from J.P., J.E.M.S.N., K.H. and R.G. S.B. drafted the initial version of the manuscript, and all authors contributed to the interpretation of the results and to writing the manuscript.

Corresponding author

Correspondence to Selma Bultan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bultan, S., Nabel, J.E.M.S., Hartung, K. et al. Tracking 21^st century anthropogenic and natural carbon fluxes through model-data integration. Nat Commun 13, 5516 (2022). https://doi.org/10.1038/s41467-022-32456-0

Download citation

Received: 05 November 2021
Accepted: 01 August 2022
Published: 26 September 2022
DOI: https://doi.org/10.1038/s41467-022-32456-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Surprising stability of recent global carbon cycling enables improved fossil fuel emission verification

Vast CO2 release from Australian fires in 2019–2020 constrained by satellite

Synthesis of the land carbon fluxes of the Amazon region between 2010 and 2020

Introduction

Results

A model-data integration framework for separating anthropogenic and environmental carbon fluxes

Anthropogenic effects (LULCC) on global woody vegetation carbon

Environmental effects on global woody vegetation carbon

Discussion

Methods

External datasets

Woody biomass carbon data

ERA-5 data

TRENDY data

Assimilation of observed woody biomass carbon in BLUE

Carbon transfer in the default setup of BLUE

Calculation of woody biomass carbon densities for different land cover types and PFTs

Thresholds for excluding inconsistent woody biomass carbon densities

Model initialization

Handling of legacy carbon fluxes

Derivation of the terrestrial woody biomass carbon sink

Uncertainties

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links