Background & Summary

Global climate change mitigation plans and efforts require countries, regions, cities and companies worldwide to set regional and local carbon emission control targets and emission reduction plans1,2 and to monitor progress towards these targets over time. As the climate crisis becomes increasingly severe, countries and regions are expected to raise their climate ambitions and set a schedule for “carbon neutrality”. As a climate mitigation management measure, “carbon neutrality” can effectively mitigate the global greenhouse effect. Carbon emission data is not only a critical tool for monitoring the progress toward carbon neutrality, but also an essential basis for assessing national carbon peaking levels, addressing climate change, and formulating corresponding climate policies. Timely, fine-grained gridded carbon emission data sets are particularly important for global climate change research3,4. Often, fine-grained data are challenging to visualize over the globe, and clear visualization tools are also needed.

Current global gridded CO2 emissions datasets contain the Open Source Data Inventory for Anthropogenic CO2 (ODIAC), the Community Emissions Data System (CEDS), and the Emission Database for Global Atmospheric Research (EDGAR)5,6,7,8. However, most of the datasets mentioned above have a data lag of at least one year. They do not present sub-monthly temporal changes associated with weather, climate seasonality, economic activity, and market shocks on energy production and trade.

In a previous study, we described GRACED, a near-real-time Global Gridded Daily CO2 Emissions Dataset, for 2019 and 2020 for the first time, which can be updated at a 0.1° × 0.1° spatial resolution and 1-day temporal resolution9. In this study, we present GRACED for 2021, including gridded daily fossil fuel CO2 emissions on the basis of multi-source datasets: the near-real-time global daily national dataset of sectoral CO2 emission due to fossil fuel consumption and cement production (Carbon Monitor), Emissions Database for Global Atmospheric Research (EDGAR) and spatiotemporal patterns of nitrogen dioxide (NO2) retrieved from satellites. GRACED provides gridded CO2 emissions in the following seven sectors: industry sector (incl. cement process), power sector, residential consumption sector, ground transport sector, international shipping sector, international aviation sector, and domestic aviation sector. By comparing with other global gridded CO2 emission datasets, we further analyze spatiotemporal and sectoral uncertainties in CO2 emissions over the period 2019–2021 to test the reliability of the estimated results.

This research presents the latest analysis of near-real-time high-resolution gridded fossil CO2 emissions for the year 2021. One of the advantages of our dataset is that it provides worldwide near-real-time monitoring of CO2 emissions with different fine spatial scales at the sub-national level, such as cities, thus enhancing our comprehension of spatial and temporal changes in CO2 emissions and anthropogenic activities. With the new extension of GRACED time series, we present crucial daily-level input to analyze CO2 emission changes in the post-COVID era, which will ultimately facilitate and aid in designing more localized and adaptive management policies for the purpose of climate change mitigation in the post-COVID era.

Compared with our previous work, the new research on data quality verification of GRACED was first presented. The gridded uncertainty of the GRACED dataset is quantified for the first time. GRACED exhibited a grid-level two-sigma uncertainty value of ±23.1%, ±19.9%, and ±19.9%, corresponding to 2019, 2020, and 2021. The primary source of GRACED uncertainty is Carbon Monitor data. Also, comparison with other gridded CO2 emission datasets revealed a relatively low uncertainty of GRACED dataset. In addition, we also examined the distribution of emission in a grid-wise perspective for major emission datasets, and compared it with GRACED. The similarity in emission distribution was observed in GRACED and other datasets.

Methods

In this study, the framework diagram is shown in Fig. 1. It is a top-down approach that uses geographical patterns supplied by EDGAR, as well as a sub-national proxy according to TROPOMI NO2 retrievals, in order to distribute Carbon Monitor daily emissions at country-level to our finer grid.

Fig. 1
figure 1

Framework diagram of a top-down spatially gridding approach.

Input datasets

Carbon monitor national-level emissions inventory

Estimated GRACED emissions are based on the near-real-time daily CO2 emissions from fossil fuel combustion and cement production provided by Carbon Monitor since January 1, 2019 (data accessible at https://carbonmonitor.org/)3,4,10. Carbon Monitor provides emissions at a national and sectoral level, drawing on near-real-time activity data and inventories for the reference year of 20193.

Since January 1, 2019, Carbon Monitor has calculated daily national CO2 emissions in five sectors (electricity, industrial production, ground transportation, home consumption, and domestic aviation) as well as daily international aviation and shipping emissions. We collect data from Carbon Monitor dataset, including emissions for the nations, groups of countries, or regions listed below: China, the United States (US), India, the United Kingdom (UK), Italy, Germany, France, the rest of the European Union, Russia, Japan, Brazil, and the rest of the globe. Carbon Monitor’s input activity data covers about more than 70% of global industry and power emissions, 85% of ground transport emissions, and 100% of residential consumption and international bunker emissions, respectively. Carbon Monitor also provides the emissions as an aggregate for the rest of the world where data for some sectors (mainly electricity generation) is not directly available. National and worldwide daily CO2 emissions with detailed information in six sectors and major nations were eventually computed by using these high temporal resolution data.

EDGAR v5.0_FT2019 data

In GRACED, emissions are spatially allocated using EDGARv5.0_FT2019 gridded spatial activity data underlying CO2 emissions defined for sub-sectors in Intergovernmental Panel on Climate Change (IPCC) and novel geospatial proxies7,8.

The European Union Emissions Database for Global Atmospheric Research (EDGAR) is an open-source platform developed and maintained by the European Environment Agency (EEA) and is widely used as the default for emission estimates in inventory systems. EDGAR supports the monitoring of climate policy implementations, and in particular the Paris Agreement and significantly contributes to the quantification of IPCC national inventory guidelines and to the evaluation of the GHG budgets at various levels (http://verify.lsce.ipsl.fr/)8. EDGAR emissions data are split into sub-sectors specified by IPCC methodology and geospatial data/spatial proxy data like point source and line source location with the resolution of 0.1° × 0.1° 7,8. Driven by the rapid developments in scientific knowledge on the generation process of GHG emissions as well as the accessibility of more recent information, the EDGARv5.0_FT2019 dataset now includes new spatial proxies to allocate the emissions related to population based upon the Global Human Settlements Layer (GHSL) product8,11.

We used the version EDGARv5.0_FT2019, which includes new spatial proxies to allocate population-related emissions based upon the Global Human Settlements Layer product8.

EDGAR v5.0_FT2019 includes spatial activity data for all fossil CO2 sources, including fossil fuel combustion, metal (ferrous and non-ferrous) production processes, non-metallic mineral processes (such as cement production), agricultural liming and solvents use, urea production8. Data are presented for every country, plus bunker fuels, at monthly intervals provided each major source category, and spatially distributed at a 0.1° × 0.1° grid over the globe from 1970 to 2018 (https://edgar.jrc.ec.europa.eu/overview.php?v=50_GHG)7,8.

TROPOMI NO2 retrievals data

EDGAR employs static subnational patterns and currently do not extend after 2019. Static subnational patterns cannot reflect temporal changes in the spatial distribution of the emissions, for instance related to regional weather anomalies. COVID-19 has exacerbated this limitation for the year 2020 and 2021 as some regions experienced stronger confinements and emissions reductions than others even in the same country. NO2 is emitted together with CO2 from the burning of fossil fuels and other fuels. In particular, due to the relatively short lifetime of NO2 (generally less than 24 hours), atmospheric observations of NO2 are less affected by long-range transport of long-distance emissions than CO2 measurements12, and short-term changes in NO2 (such as daily changes) are mainly affected by anthropogenic emissions activities13. So it is more direct and robust to interpret the changes in the observed NO2 column concentration as indicators of CO2 emissions12. Therefore, changes in the atmospheric distribution of a short-lived pollutant (NO2), measured from satellite, co-emitted with fossil CO2 by combustion processes are used to reflect the changes in the human activities that cause CO2 emissions13,14.

We use the density NO2 tropospheric vertical column retrieved from TROPOMI Level-2 products, including versions 1 and 2. Up until 5 August 2019, TROPOMI NO2 retrievals had a ground resolution of 7 × 3.5 km2, and afterwards at 5.5 × 3.5 km2. Most of the cloudless locations of the globe are observed every day. As in our previous study, we perform aggregation in the form of daily 0.1° × 0.1° maps using a quality assurance flag higher than 0.75 from the official retrieval15 and average them over rolling 14-day periods. This procedure was performed to reduce the retrieval noises and reduce gaps16.

Method overview

Transforming the EDGAR categories into Carbon Monitor sectors

We begin by mapping the Carbon Monitor emission categories to the EDGAR sectors based on the correspondence described in Table 1. We depend on EDGAR as the monthly spatial patterns to the fullest extent.

Table 1 Correspondence between Carbon Monitor categories and EDGAR categories used in this research.

In addition, in order to understand the sector classification adopted by GRACED in this study, the corresponding relationship between GRACED sector and IPCC sector is shown in Table 2.

Table 2 The relationship between the 7 super-sectors of GRACED and the IPCC sectors.

Spatial gridding process

The second step entails a spatial gridding process. To downscale daily CO2 emissions from Carbon Monitor, we utilize global monthly spatial patterns of CO2 emission from EDGAR subsectors. The spatial distribution of CO2 emissions was set to be unchanged compared to those of EDGAR in 2019. The nation and the time horizon for the adjustment will determine the validity of this assumption. Despite the rapid changes in subnational emissions within each country from 2019 to 2021, the impact of COVID-19 varied widely in time and magnitude across subnational areas. Consequently, for large countries being major emitters, we employ a sub-national proxy according to NO2 tropospheric vertical column concentration retrieved from TROPOMI to assign national-level total CO2 emissions into regional (first-level administrative division) totals, followed by a 0.1° down-scaling according to the spatial patterns from EDGAR. The first-level administrative division used in this study is equivalent to the principal administrative division which refers to the biggest subordinate division of a nation, such as a state for the U.S., province for China or equivalent for other nations according to the definition from the United Nations17.

A comprehensive description of the methodology framework is elaborated as follows:

  1. (1)

    First, use daily emissions at national level of the Carbon Monitor’s seven sectors (see Table 1) to generate gridded daily emissions with monthly spatial patterns from EDGAR, additional correction can be performed under monthly spatial patterns. In particular, for the international aviation and international shipping sectors, daily emissions at the global level from Carbon Monitor are directly distributed with monthly spatial proxy from EDGAR.

    $$\begin{array}{c}GRACED\_v{1}_{c,d,s}=C{M}_{country,d,s}\times \frac{E\_EDGA{R}_{c,m,s}}{{\sum }_{i=1}^{n}E\_EDGA{R}_{i,m,s}}\end{array}$$
    (1)

    Where GRACED_v1c,d,s represents the first step/version value of estimated CO2 emission for cell c, date d and sector s, CMcountry,d,s represents the Carbon Monitor’s value of country country which grid cell c belongs to, for date d and sector s. n is the aggregate of number of grid cells within the country country. E_EDGARc,m,s represents the gridded CO2 emission value from EDGAR for grid cell c, month m which date d belongs to and sector s.

  2. (2)

    Subnational emission patterns for major emitters can change from year to year, with far-reaching implications for global emissions. This was apparent in 2021, with regional differences in changes induced by COVID-19, such as between the eastern and western United States and between southeastern and northwestern China. Capturing these emission changes at sub-national level is critical. However, these differences are not resolved by Eq. 1, which use static subnational emission patterns. According to reports, worldwide CO2 emission variations are broadly consistent with worldwide NO2 column changes derived from satellite data13,14. As a result, we assume that subnational emission changes follow the variations of satellite NO2 concentration retrievals variations from 2019 to 2021. In particular, we compute an index R for each first-level administrative unit in major emitting countries. R is the mean NO2 column concentration for each first-level administrative unit based on TROPOMI NO2 column concentration data in year y.

$$\begin{array}{c}{R}_{u,y}=N{O}_{{2}_{u,y}}\end{array}$$
(2)

Where u represents first-level administrative unit; y represents the year. \(N{O}_{{2}_{u,y}}\) represents the temporal average of satellite NO2 column concentration for the first administrative unit u in year y over a rolling 14-day period (as mentioned above) and spatially over the 5% of grid cells within each first-level administrative unit that have the highest annual NO2 concentration average. The selection of the 5% largest mean values enables the filtering of patterns in emitting areas or in their immediate vicinity from the long range transport of NO2. Next step, we delete negative spurious NO2 concentration values for the 5% largest grid cells over the year 2019 and 2021 that may be generated and re-attribute the mass gain to the other 5% grid cells. Using TROPOMI NO2 retrievals data, we then compute the index R for each first-level administrative unit in 2019 (2021).

Next, we generate GRACED_v2u, d, s1, 2021, the daily emission value of a first-level administrative unit for sector s1 in day d of 2021 modified by the TROPOMI NO2 concentration in day d, which corresponds to the daily emission at national level from Carbon Monitor according to Eq. 3. s1 includes the Industry sector, Power sector, Residential consumption sector, and Ground transport sector. s2 incorporates the International shipping sector, International aviation sector, and Domestic aviation sector. For sector s2 with more fixed spatial distribution patterns from 2019 to 2021, their emissions are not modified by TROPOMI NO2 concentration to avoid causing spatial discontinuities.

$$\begin{array}{c}GRACED\_v{2}_{u,d,s1,2021}=\frac{GRACED\_v{1}_{u,d,s1,2019}\times \frac{{R}_{u,2021}}{{R}_{u,2019}}}{{\sum }_{u=1}^{nu}GRACED\_v{1}_{u,d,s1,2019}\times \frac{{R}_{u,2021}}{{R}_{u,2019}}}\times C{M}_{country,d,s1,2021}\end{array}$$
(3)

Where GRACED_v1u,d,s1,2019 denotes the GRACED Version 1 emission estimate value for first-level administrative unit u in date d of 2019 and for sector s1. The total number of first-level administrative units in the country is denoted by nu. The first step is to compute \(\frac{{R}_{u,2021}}{{R}_{u,2019}}\), the change ratio of R index in 2021 over 2019. We multiply GRACED_v1u, d, s1, 2019, the first-level administrative unit emission estimate aggregated from GRACED Version 1 for year 2019, to update the first-level administrative unit emission value for 2021. The last step is to divide the updated first-level administrative unit emission estimate by the sum of the updated first-level administrative unit emission value \(\mathop{\sum }\limits_{u=1}^{nu}GRACED\_v{1}_{u,d,s1,2019}\times \frac{{R}_{u,2021}}{{R}_{u,2019}}\) in 2021 to perform the normalization in Eq. 3. As a result, the sum of emissions of the updated first-level administrative unit within a country matches the emission total at national-level from Carbon Monitor of 2021 after multiplying the emission value CMcountry, d, s1, 2021 at the national-level from Carbon Monitor.

Then, on the basis of updated first-level administrative unit emission GRACED_v2u, d, s1, 2021 in 2021, we finalise the disaggregation of emissions with the spatial patterns from EDGAR data to distribute the emission estimate of each first-level administrative unit for major emitters to calculate the final version of gridded emission value GRACED_v2c,d,s1.

$$\begin{array}{c}GRACED\_v{2}_{c,d,{s}_{1}}=GRACED\_v{2}_{u,d,s1,2021}\times \frac{E\_EDGA{R}_{c,m,s1}}{{\sum }_{i=1}^{n}E\_EDGA{R}_{i,m,s1}}\end{array}$$
(4)

Where n denotes the total number of cells belonging to this first-level administrative unit.

After revising and adjusting the gridded emissions of major emitters such as China, US, India, Japan, Brazil, UK, Germany, Spain, Italy, and France in 2021, GRACED of 2021 is finally produced.

Code description

Python code and NCL(The NCAR Command Language) code for data generation and visualization is provided (link in the Code Availability section). The codes in temp.py distribute the daily emission for nations from Carbon Monitor into gridded data around the globe. The visualization of the aggregated data can be performed by NCL file named 2021_total_panel_code_for_paper.

Data Records

The GRACED2021 products18 are now available at https://doi.org/10.6084/m9.figshare.21427437.v1. We have also provided an example of Python code to help users produce, read in and plot emissions for any grid in the dataset (https://github.com/xinyudou820/GRACED2021). The emission data are stored in netCDF files per sector with the unit of kg carbon per hour (kgC/h) for each grid. GRACED2021 provided gridded CO2 emissions with the spatial resolution of 0.1 degree and the temporal resolution of daily. For each sector, all daily files of the same month are merged into monthly *.zip files. After merging, there are 96 files of GRACED2021 provided at Figshare. Among them, a total of 84 files are sectoral gridded CO2 emissions and a total of 12 files are total gridded CO2 emissions. The monthly *.zip file size is from 6 MB to 239 MB, respectively.

Technical Validation

Spatial distribution of global CO2 emissions

Based on activity data of fossil fuel and cement production, GRACED provides fine-grained CO2 emissions data at a 0.1° × 0.1° spatial resolution and 1-day temporal resolution. Figure 2 reports the updated global gridded daily average CO2 emissions in 2021, which identifies specific emission variations induced by sub-national emission allocation. According to Fig. 2, we can find that the spatial distribution of the average daily CO2 emissions in 2021 is characterized by significant regional differences. The high values are concentrated in high-income or rapidly developing areas such as the south-eastern China, Western Europe, eastern U.S., India, Japan, and Korea. In contrast, low values are mainly found in eastern Russia, Africa, and central South America with low population density or low-income levels. In 2021, the emission grid in which the maximum value appears is 231741 tons of carbon per day per grid. The daily average total emissions are estimated to be about 4092 kg of carbon per day (kgC/d) per grid, which is higher than that of 2020 (3830 kgC/d per cell).

Fig. 2
figure 2

The spatial CO2 emissions distributions from GRACED in 2021 (unit: kgC/d per grid).

Temporal patterns of global CO2 emissions

Emission reductions between weekends and weekdays

Figure 3 presents the temporal changes of the difference in emissions between weekends and weekdays in 2021. According to Fig. 3, we can find that in general, the global average weekend emissions are lower than the weekday emissions, with an average difference of −241 kgC/d per grid. The difference between weekday and weekend emissions is more pronounced in high-income regions such as Europe (dark blue areas) than in low-income regions such as Africa (light red or light blue areas). It is noteworthy that there are dark red areas in some states in the U.S. and parts of southern China, where the average weekend emissions in these regions are higher than the average weekday emissions.

Fig. 3
figure 3

Emission difference between weekends and weekdays map in 2021 (weekend minus weekday emissions).

For both 2020 and 2019, the global average weekday CO2 emission was higher than weekend CO2 emission generally9. The global average emission difference is −286 kgC per day per cell in 2020, which is larger than in 2021 (−241 kgC/d). Due to the economic recovery in 2021, general human travel has greatly increased on weekends compared to 2020, thus reducing the emission difference between weekdays and weekends.

Difference between quarterly and yearly mean emissions

We define January, February, and March as the first quarter, and then define other months included in other quarters. Figure 4 reports the difference between the average daily emissions for each quarter and the average daily emissions for the year in 2021. Figure 4 shows the four quarterly emission difference maps after removing the annual average emission value from the average emissions of each quarter. In this case, the first quarter is well above the annual average emission level, with a global average gap of 174 kgC per cell per day; and the second quarter is well below the annual average emission level, with a global average gap of −179 kgC per cell per day. This is mainly because energy consumption due to heating demand in the Northern Hemisphere versus cooling demand in the Southern Hemisphere makes CO2 emissions from January to March higher than the rest of the year. From the national or regional perspective, the quarterly distribution of emissions in some countries is consistent with the global average for the year in 2021. For example, India, a big emitter, has the highest emissions in the first quarter, and the lowest in the second quarter. While some countries, such as EU27&UK, overall have the highest emissions in the fourth quarter and the lowest emissions in the third quarter.

Fig. 4
figure 4

Map of quarterly average minus yearly average emissions in (a) the first quarter, (b) the second quarter, (c) the third quarter and (d) the fourth quarter of 2021 (Note: remove the yearly average from each quarter).

Emission variations between 2021 and 2020

The maximum reduction date is defined as the date of the greatest emission decrease in 2021 compared with 2020, which reflects the day of the year when significant emission reduction was caused by the remaining impact of the new outbreak in 2021 (Fig. 5a). Similarly, the maximum rebound date is defined as the date of the largest emission increase in 2021 compared with 2020, the day of the year experiencing the largest emission recovery in 2021 (Fig. 5b).

Fig. 5
figure 5

The day of the year in 2021 with the maximum reduction (a), and the maximum rebound (b) of each grid in 2021 compared with 2020.

Uncertainty analysis

Uncertainty associated to each variable propagates to the final uncertainty of GRACED data. By applying the rule of error propagation and IPCC 200619 uncertainty analysis on Eqs. 1, 4, the following equations were obtained:

$$\begin{array}{c}{U}_{GRACED\_v{2}_{c,y,s1}}=\sqrt{{U}_{C{M}_{u,y,s1}}^{2}+{U}_{E\_EDGA{R}_{c,s1}}^{2}}\end{array}$$
(5)
$$\begin{array}{c}{U}_{C{M}_{u,y,s1}}=\sqrt{{U}_{C{M}_{country,y,s1}}^{2}+{U}_{{R}_{u,y}}^{2}}\end{array}$$
(6)
$$\begin{array}{c}{U}_{GRACED\_v{2}_{c,y,s1}}=\sqrt{{U}_{C{M}_{country,y,s1}}^{2}+{U}_{{R}_{u,y}}^{2}+{U}_{E\_EDGA{R}_{c,s1}}^{2}}\end{array}$$
(7)
$$\begin{array}{c}{U}_{GRACED\_v{1}_{c,y,s2}}=\sqrt{{U}_{C{M}_{country,y,s2}}^{2}+{U}_{E\_EDGA{R}_{c,s2}}^{2}}\end{array}$$
(8)
$$\begin{array}{c}{U}_{total}=\frac{\sqrt{{\sum }_{i=0}^{6}{\left({U}_{c,y,s{r}_{i}}\times {X}_{c,y,s{r}_{i}}\right)}^{2}}}{{\sum }_{i=0}^{6}{X}_{c,y,s{r}_{i}}}\end{array}$$
(9)

Where Ux format indicates the uncertainty of variable x, sr0 ~ sr6 indicates the seven individual sectors that consist the total emission, and Xc, y, sri is the annual sum of total emission for grid cell c, year y and sector sr in GRACED. Due to the nature of Carbon Monitor data, as mentioned4 correlation between sectors were not considered in the uncertainty analysis of this research.

The methods utilized to calculate each uncertainty variable in the Eqs. 7, 8 are presented below:

For \({U}_{E\_EDGA{R}_{c,s}}\), we utilized the method proposed by Dai, et al.20. Probability density functions (PDF) of the values of \(E\_EDGA{R}_{c,s}\) were fitted and the validity of the fitness were verified by Kolmogorov-Smirnov test. If the KS test showed significance in the fitted PDF, the fitted PDF was used to produce 100,000 simulated sample grids by Monte Carlo simulation. Subsequently, the cumulative distribution function (CDF) of the simulated grids and the original grids were compared and the difference between the two functions were used to calculate the uncertainty corresponding to the weighted grid value.

$$\begin{array}{c}{U}_{v}=\frac{SCD{F}_{v}-TCD{F}_{v}}{SCD{F}_{v}}\times 100( \% )\end{array}$$
(10)

Where Uv is the uncertainty corresponding to an emission amount v, SCDFv is the simulated CDF value for emission amount v and TCDFv is the true CDF value for emission amount v.

Based on the equation and the original grid value, uncertainty value of each grid was calculated:

$$\begin{array}{c}{U}_{E\_EDGA{R}_{c,s}}=\frac{SCD{F}_{E\_EDGA{R}_{c,s}}-TCD{F}_{E\_EDGA{R}_{c,s}}}{SCD{F}_{E\_EDGA{R}_{c,s}}}\times 100( \% )\end{array}$$
(11)

For \({U}_{C{M}_{country,y,s}}\), we assumed that the \({U}_{C{M}_{country,y,s}}\) will be equal to the uncertainty of the corresponding sector s reported by previous studies of Carbon Monitor performed on the global scale regardless of the nation3,10.

For \({U}_{{R}_{u,y}}\), as this research went through a pre-processing process for the TROPOMI NO2 data to calculate Ru,y, we stated that the uncertainty of Ru,y will be a constant value throughout the gridded dataset based on the result of Goldberg, et al.21, and van Geffen, et al.22. Cooper, et al.23 stated that the uncertainty of NO2 are less than 5%, van Geffen, et al.22 stated that the uncertainty originating from the slant column density (SCD) method in TROPOMI NO2 is 10%, and Goldberg, et al.21 mentioned that TROPOMI product have a low bias of 20% in urban areas.

$$\begin{array}{c}{U}_{{R}_{u,y}}=\sqrt[3]{{U}_{Cooper}\ast {U}_{Geffen}\ast {U}_{Goldberg}}=\sqrt[3]{5\ast 10\ast 20}=10( \% )\end{array}$$
(12)

Where \({U}_{Cooper},{U}_{Geffen},{U}_{Goldberg}\) denote the uncertainty mentioned in Cooper, et al.23, van Geffen, et al.22 and Goldberg, et al.21.

The uncertainty of each sector was calculated based on the equations mentioned previously. As shown in Fig. 6, the smallest uncertainty in 2021 was obtained from the power sector (±13.5%), where the majority of this uncertainty originates from the uncertainty of Carbon Monitor’s power sector data (±10%)3. The average value of uncertainty in grids for industry, residential consumption, and ground transport, other three sectors included in s1, were ±33.4%, ±42.0%, and ±15.5%, respectively. On the other hand, average gridded uncertainty in international shipping sector, international aviation sector, and domestic aviation sector had a smaller uncertainty of ±16.7%, ±16.0%, and ±18.6%. The uncertainty values of EDGAR’s spatial proxies calculated were all in the range of ±3.2% ~±11.7%. Relatively small uncertainty value of EDGAR’s spatial proxies compared to GRACED’s indicates that the main difference between GRACED’s sectoral uncertainty originates from the difference in Carbon Monitor uncertainty for each sector. For instance, residential consumption sector, the sector with the highest uncertainty of ±42.0% for the year 2020 and 2021, has more than 90% of its uncertainty originating from Carbon Monitor (±40%)3, while contribution of EDGAR’s residential consumption sector uncertainty (±4.3%) is minimal. Also, there was a distinguishable difference in the average uncertainty between s1 and s2 sectors, which originates from higher value of Carbon Monitor uncertainty of s1 sectors as well as R uncertainty included in the calculation procedure for s1 sectors.

Fig. 6
figure 6

Annual sectoral uncertainty of GRACED throughout 2019–2021.

Comparing the CO2 emission uncertainty of 2019, 2020, and 2021, the uncertainty value is highest in 2019 (Fig. 6), while similar in 2020 and 2021. Higher average uncertainty in 2019 is induced by a larger Carbon Monitor uncertainty value for international aviation in 2019 (±34.2%) compared to 2021 (±10.2%)3, resulting in the majority of grids located at sea having a significant uncertainty value difference from the other two years (Fig. 7). Closer observation indicates a similar grid distribution for 2020 and 2021 due to the spatial proxy distribution and constant Carbon Monitor uncertainty for 2020 and 2021. The areas with high uncertainty values are mainly distributed in Northwest Africa. These grids have small emission values for multiple sectors affecting the high uncertainty value of EDGAR for those corresponding sectors.

Fig. 7
figure 7

Uncertainty of CO2 emission of GRACED for every sector in (a) 2019, (b) 2020 and (c) 2021.

The gridded sectoral uncertainty of 2021 displays additional information about the distribution of the uncertainties (Fig. 8). International aviation, power, and domestic aviation sector exhibited a relatively constant uncertainty in every grid due to even distribution in the corresponding sectors’ EDGAR spatial proxy uncertainty. However, the ground transport and residential consumption sector displayed a relatively high uncertainty in parts of Africa. This issue originates from low emission values of EDGAR data grids, causing higher spatial proxy uncertainty.

Fig. 8
figure 8

Uncertainty of sectoral CO2 emission in 2021 for the industry, international aviation, domestic aviation, power, international shipping, residential consumption, ground transport and total sector.

Validation against other datasets

We also examined the distribution of emission in a grid-wise perspective for major emission datasets, GCP-GridFED24, ODIAC25 and EDGAR26, and compared it with GRACED (Fig. 9). As GRACED employs extensive use of the EDGAR dataset, utilizing it as the main spatial proxy during the calculation procedure, the distribution of emission is very similar between GRACED and EDGAR throughout every emission range. This also originates from the total emission of Carbon Monitor’s small difference from EDGAR’s total emission. GRACED’s distribution is displayed in accordance with GCP in the middle range (8–16) where the majority of the grids are included. The variance was only observed in the two extremes. The similarity in emission distribution and the number of non-zero emission grids were observed in GRACED, EDGAR and GCP-GRIDFED. However, ODIAC showed huge variance with the other three datasets in every range of emission. ODIAC showed a higher number of grids than the other datasets in every emission range. This huge variance could be originating from the calculation method of ODIAC which employs nighttime light data27. The nightlight-based approach of ODIAC can cause higher calculation values in the dataset compared to emission inventories for certain cities, due to a poor correlation between anthropogenic activities and nightlight intensity28.

Fig. 9
figure 9

Distribution of grids’ emission for multiple emission datasets.

A sectoral comparison between GRACED and data from International Energy Agency (IEA)29 was also performed to validate the emission values of GRACED (Table 3). GRACED’s sectoral emission showed consistency with IEA’s emission, where in majority of the sectors, differences in emission were less than 10%. However, there exists a notable difference between the two datasets in residential consumption sector. This difference can be originating from the fact that IEA’s building sector account both direct and indirect emissions.

Table 3 Comparison of sectoral emission of GRACED and IEA in 2021.

The chart below (Table 4) shows information about other published gridded CO2 emission datasets. The majority of these datasets, including GRACED, have a spatial resolution of 0.1° × 0.1°. GRACED’s spatial coverage also showed an equal level of other datasets. As mentioned previously, GRACED is the only dataset to provide near-real-time emission data in a temporal resolution of 1-day. Due to the relatively recent creation dataset of GRACED, the time coverage is shorter than other datasets. This issue will gradually be solved in the future as data accumulates.

Table 4 Basic information of other emission datasets including GRACED.

In order to further validate GRACED’s data, uncertainty values of GRACED were compared with other datasets. We conducted the comparison between the uncertainty value of GRACED, and the published uncertainty value of EDGAR emission inventory. We used the latest inventory uncertainty value that EDGAR published for comparison. GRACED’s overall uncertainty of ±21.0% is higher than EDGAR’s uncertainty value of ±7.1%30. Considering additional uncertainties originating from spatial proxies of TROPOMI and EDGAR harnessed to distribute national-level emission, it is reasonable that the average grid-level uncertainty of GRACED is higher than certain national or global level emission inventories.

As GRACED uses a top-down spatially gridding approach, errors and uncertainty in sectoral emission from Carbon Monitor, the national-level emission inventory, were evenly distributed in each grid. Therefore, we also compared the uncertainty of GCP-GridFED, another gridded carbon emission dataset that published its uncertainty results, with the uncertainty of GRACED. The uncertainty value for GCP-GridFED was of 2018, as the latest uncertainty results of GCP-GridFED released by GCP team is of 2018. The GCP-GridFED showed a consistent uncertainty in the past six decades, ranging from ±23.2% to ±29.6%31. Therefore, using the uncertainty value of GCP-GridFED in 2018 does not impact the validity of this comparison. Comparing GRACED to GCP-GridFED, GRACED has a lower average uncertainty of ±21.0% (Fig. 6) during 2019–2021. As GCP-GridFED uncertainty is computed considering multiple variables31, a similar calculation process with GRACED’s, GRACED’s lower uncertainty than GCP-GridFED indicates the higher accuracy of gridded carbon emission data that GRACED provides.