Abstract
We constructed a frequently updated, near-real-time global power generation dataset: CarbonMonitor-Power since January, 2016 at national levels with near-global coverage and hourly-to-daily time resolution. The data presented here are collected from 37 countries across all continents for eight source groups, including three types of fossil sources (coal, gas, and oil), nuclear energy and four groups of renewable energy sources (solar energy, wind energy, hydro energy and other renewables including biomass, geothermal, etc.). The global near-real-time power dataset shows the dynamics of the global power system, including its hourly, daily, weekly and seasonal patterns as influenced by daily periodical activities, weekends, seasonal cycles, regular and irregular events (i.e., holidays) and extreme events (i.e., the COVID-19 pandemic). The CarbonMonitor-Power dataset reveals that the COVID-19 pandemic caused strong disruptions in some countries (i.e., China and India), leading to a temporary or long-lasting shift to low carbon intensity, while it had only little impact in some other countries (i.e., Australia). This dataset offers a large range of opportunities for power-related scientific research and policy-making.
Measurement(s) | Power generation by different energy sources |
Technology Type(s) | computational modeling technique |
Factor Type(s) | geographic location • energy sources • temporal interval |
Sample Characteristic - Environment | power system |
Sample Characteristic - Location | global |
Similar content being viewed by others
Background & Summary
Power is a fundamental element of human society. Access to affordable, reliable, and sustainable energy, including access to reliable electricity and power produced by renewable sources, are listed as important aspects of the United Nations Sustainable Development Goals1. Tracking dynamics and status of power production and consumption is of great importance as it reflects the manufacturing, social activities as well as human impacts on the environment. Current power statistics are based on inventories of power production, consumption, trade, etc2. This work usually has a time lag of at least one year3,4,5,6,7. Timely and effective management of the power sector, including monitoring shifts from fossil to low carbon sources, is valuable for effectively mitigating global climate change policy-making8,9. Thus low-latency data on global and national power production with the high-temporal resolution is urgently needed10.
As a result, high-temporal resolution power data is increasingly important and has received an increasing focus from governments, companies, and academic institutes11,12,13. Daily and hourly power data are critical to developing power system models14, or to understanding the patterns of human behaviors15,16. With the increasing awareness of the importance of such datasets, there has been an increase in open access at regional levels. For example, the EU has created an open platform ENTSO-E for electricity generation, load, and transmission data for Europe12. United States’ Energy Information Administration (EIA) also provides free access to data for its electricity generation and consumption13. China’s electricity generation and consumption data are available through its national grid or China’s National Bureau of Statistics17. However, the temporal coverage often varies between datasets. In addition, energy sources are reported or aggregated differently11,17,18,19,20. These inconsistencies have made it challenging to compare and evaluate progress in decarbonizing power systems across countries and regions.
International Agency (IEA) and BP provide well-integrated and unified data for power generated from different sources and cover a wide range of spatial regions2,20. International Renewable Agency (IRENA) also provides reports on global renewable energy installed capacity and generation19. However, those datasets have a time lag of at least several months and have at best a monthly time step. Monthly datasets may not provide sufficient information on power systems’ rapid changes, due to 1) changes in human behavior as the COVID-19 pandemic or the effects of weekends, and holidays21,22,23,24, 2) the impact of climate variabilities such as winter storms, summer heatwaves, and other climate variabilities causing shifts of demand, and intermittency of renewable power supply25, and 3) economic shocks such as abrupt variations of fuel prices or shortfalls of supply since the war between Ukraine and Russia26.
Here we constructed the first global daily and hourly power generation dataset (CarbonMonitor-Power) for the period going from 2016-01 to 2022-07. This dataset can be updated in near-real-time with a latency of between 1 day to a maximum of 1 month, depending on the country/region. The dataset includes daily and hourly power generation data from fossil fuels (coal, natural gas, and oil), nuclear, hydro, wind, solar, geothermal, biomass, and other renewables for 37 countries, which covers around 70% of the global power production and 68% of global power-related CO2 emissions. CarbonMonitor-Power provides a data basis to the Carbon Monitor dataset, to estimate the near-real-time daily CO2 emissions from power generation21,22,23. CarbonMonitor-Power represents a new resource for exploring high-time frequency patterns of the global power system and monitoring monthly to annual changes relevant to emissions reduction pledges (Fig. 1).
Methods
We mostly collected data from national grid operators which provide open-access power generation and consumption data at high temporal frequency. In constructing a harmonized database with global coverage, special attention was given to filling the data gaps. Although it is possible to directly acquire high-time-frequency power generation data for the EU for example, such data does not exist for some other countries like China. China’s national grid provides detailed information on installed capacity and utilization hours for major power sources, but for every month.
The framework used to generate the CarbonMonitor-Power dataset is shown in Fig. 2. We acquired raw data from the national grids of the 37 countries/regions listed in Tables 1–3. Raw data are acquired at the highest possible time resolution (5-minute intervals, hourly, daily, or monthly, depending on the source availability). We then developed national-specific methods for data processing and simulation (details see country-specific method below).
Power generation data acquisition
We firstly collected raw data from 12 regions (or 37 countries, including Australia, Brazil, China, 26 countries in EU27, UK, India, Japan, Russia, South Africa, the United States, Mexico, and Chile) with various energy sources. The raw data are collected from publicly available sources at the national or subnational levels. Data sources used in this study are summarized in Tables 1–3.
More than two million records of raw data have been collected from these 12 data sources, with nearly two thousand records newly generated and collected per day. Considerable data cleaning preprocess was performed as part of the data processing, due to frequent extreme values and missing values detected from near-real-time data.
To filter out extreme values, we first examine the quality of these high-temporal power generation datasets using an Interquartile Range (IQR) threshold method27 and detect the ‘outliers’. The IQR range is defined as the range between the 75th percentile and the 25th percentile. The upper limit is calculated as adding 0.5*IQR to the 75th percentile. The lower limit is calculated as subtracting 1.5*IQR from the 25th percentile. Values fall beyond the upper and lower limits are labeled as potential ‘outliers’. Afterward, manual processing was applied to evaluate whether each extreme value should be removed or to be kept. As a general rule, we keep extremes in the data set when there was evidence of abrupt social changes (COVID-19 confinements) and/or natural disasters (e.g., storms), which are known to have a strong and sudden impact on the power system. Then the linear interpolation function from the Python Pandas package was used to fill missing values.
After such detections, no outliers or missing values have been found in the raw data from most countries, even some of which may have pre-processed their raw data before releasing them publicly (such as the United States EIA13). In the end, pre-processing of removing outliers and filling missing values was only conducted on the raw data from China and EU27. In summary, there were 6.9% data records missing in the raw data from China, and 3.4% data records missing in the raw data from EU27. For EU27 countries, the data quality varied between different countries, ranged from 0% (no missing data or outliers, Sweden) to 29.1% (highest missing and outlier ratio, Croatia). Only two countries had higher than 10% data missing or identified as outliers. The missing data and outliers in raw data from EU27 mostly were detected at sub-hourly level (one out of two or four records missing for an hour). We filled in the missing data or replaced the outliers by interpolation method to provide a best estimate for the data at hourly time resolution. Data records which are identified as outliers or missing values are labeled as F (Filtered). Others are labeled as N (Normal)
Country-specific data processing
After the data preprocessing, for each country/region, we aggregate and/or dis-aggregate the power generation to daily (or hourly if possible) according to data availability, and to eight categories of power generation sources: coal, gas (natural gas), oil, nuclear, hydro (hydro-power), wind, solar and other renewables (including biomass, geothermal, and power generated from residual industrial heat and from other non-specified sources). In addition to those national data, three additional datasets are used to disaggregate the total generation data into specific generation types when needed: Monthly Electricity Statistics by IEA20, Statistical Review of World Energy by BP2, and the Renewable Energy Statistics by IRENA19.
However, the original data often do not have the same format and do not cover the same energy sources across countries. To establish a harmonized dataset with all sources of power, we collected information from other databases with a lower time frequency and disaggregated them to daily steps (details see country-specific method below).
Australia
The original data is acquired at hourly resolution for the following categories: Wind, Hydro, Solar (Rooftop), Solar (utility), Gas (Waste Coal Mine), Gas (Reciprocating), Gas (OCGT), Gas (CCGT), Gas (Steam), Distillate (Energy source: Diesel), Bioenergy (Biomass), Bioenergy (Biogas), Coal (Black), Coal (Brown), while, CCGT refers to power generated by combined cycle gas turbine, and OCGT refers to power generated by open gas cycle turbine. We aggregate all power generated from all types of power used in this study as following:
Brazil
The raw power generation data from Brazil is acquired from the Operator of the National Electricity System (http://www.ons.org.br/Paginas/). The data acquisition and download are performed at a daily base, with up to a week of latency (occasional delay caused by site maintenance). The original data is acquired at hourly resolution for the following categories: Wind, Hydro, Nuclear, Solar, Thermal (including Coal, Coal Mineral, Gas, Natural Gas, Combustive Oil, Diesel, Petrol (Gasoline), Biomass, Industrial Residuals). We aggregate power generated from all types of coals to coal power, and similarly, power generated from all gas types to gas power. The further aggregation to the power generated by energy source (s) at each hour (Ps, h) used in this study is as following:
The power generation data is firstly produced at hourly time resolution, then further aggregated to daily resolution.
China
There are two types of core datasets for China’s power generation: Power Generation by Energy Type (P) and Coal Consumption data (CC). The Power Generation by Energy Type data is acquired from China’s Electricity Council (CEC, http://cec.org.cn), which provides information on power generation, consumption and usage on China’s national grid at monthly, seasonal and yearly time steps. The information on power generation is given primarily as installed capacity per energy source (IC), and cumulative utilization hour (CUH) per type of source (s). The power generation for month m from energy source s (Ps, m) is calculated as:
This allows the direct calculation of power generation from energy sources including thermal, coal, natural gas, nuclear, hydro, wind, solar, biomass and geothermal (the last two with compromised time frequency and latency). Power generation by oil and other non-fossil sources, such as waste, recovery energy from industrial processes is not provided separately by CEC. But they are accounted for in the total thermal power. Therefore, we separated the thermal power from coal and gas production using factors derived from Monthly Electricity Statistics by IEA (PIEA) from the corresponding month of the latest year available:
The power generated by oil and by other renewable energy are then simulated as:
The monthly power generation data are then disaggregated to daily values using daily coal consumption (CC) data by power plants from eight coastal provinces in China (https://www.cctd.com.cn/) with the following equation:
Where Ps, d being the power generation by energy source s on day d, Ps,m being the power generation of the month m. CCd is the coal consumption on day d, and CCm is the monthly coal consumption of month m. Day d is within month m. The correlation between China’s power generation and coal consumption was established at monthly time steps, and the detailed method can be found in our previous work21,22.
EU27&UK
For the year between 2016 and 2018, the power generation data for EU28 are acquired from ENTSO-E (https://www.entsoe.eu). From 2018 onwards, data for EU27 was continuously acquired from ENTSO-E, while data for the UK was acquired from BMRS (https://www.bmreports.com/bmrs/) and from Sheffield Solar (for solar power only, https://www.solar.sheffield.ac.uk/). The data acquisition is carried out on a daily basis. The data aggregation method is database specific instead of country specific.
ENTSO-E provides power generation capacity data at time resolution between every 15 minutes to every 60 minutes. Therefore, hourly power generation was firstly computed as
The power generation (P) from country c from source s during the hour h is computed as the average power generation capacity during the corresponding time, multiplied by the length of duration (1 hour). Then the power generation data for each country (c) was further aggregated to the eight energy sources used in this study with the following equations at desired time resolution (excluding energy sources where aggregation was not required):
BMRS provides power generation capacity data over the UK for 48 equally distributed time periods per day. The conversion from power generation capacity to hourly power generation follows the same methods as Eq. 13. Two energy sources were aggregated from the database to match the sources defined in this study, with the following equations:
Among which, ccgt refers to power generated by combined cycle gas turbine, and ocgt refers to power generated by an open gas cycle turbine. The sources aggregated hourly power generation data was then further aggregated to daily, monthly and yearly resolution. In the end, we aggregate all countries to EU27 and UK to an aggregated dataset for EU27&UK.
India
The power generation data from India is initially acquired from the Power System Operation Corporation Limited (POSOCO, https://posoco.in/) on a daily basis, with one to two days of latency. The original data is provided for aggregated sources as compared to our required eight sources (Fig. 2): Gas and oil produced power are aggregated and called Gas_Naptha_Diesel; RES aggregates power produced by wind, solar, biomass, and other energy sources. To disaggregate these energy sources, we developed factors (F) based on the reference monthly power generation dataset (PIEA)20 of the corresponding month (m) from the last available year (the current year or the previous year). for oil and gas:
For solar and wind:
For other renewable:
With these factors, we disaggregate the daily power production from its original aggregated sources as follows:
With day d being in the month m.
Japan
We acquire Japan’s hourly power generation data from the Organization for Cross-regional Coordination of Transmission Operators (OCCTO, https://www.occto.or.jp/en/), with a latency of one to two months. The data provides fossil power in one aggregated sector. Therefore, we disaggregate fossil power with factors derived from reference monthly power generation data28 for the coal, gas, and oil sectors:
Hour h being on day d in the month m. While for the hydro, wind, solar and other renewable categories, we apply the following aggregation:
Russia
Hourly power generation data from Russia is acquired from the United Power System of Russia (http://www.so-ups.ru/index.php). It provides hourly power generation for Thermal, Nuclear, Hydro, and Renewables. We develop factors (Fs) based on the reference yearly power generation dataset (Pbp, from BP Statistical Review of World Energy2) from the last available year (being previous year or the year before, y). for fossil energy sources (s):
With these factors, we disaggregate the hourly fossil power generation to coal, gas, and oil power as follows:
For wind and other renewables, we develop Fs based on reference yearly renewable power generation dataset (PIRENA, from International Renewable Energy Agency19) from the last available year (being previous year or the year before, y):
With these factors, we disaggregate the hourly power generation of the category Other to and other renewable power as following:
United States
Hourly power generation data from the United States acquired from EIA (https://www.eia.gov/beta/electricity/gridmonitor/). As the energy sources provided by EIA match the source categories used in this study, we did not apply further data aggregation steps following data acquisition and data cleaning. The data are acquired at the local time and aggregated to national totals.
South Africa
The hourly power generation data from South Africa is acquired from Eskom (https://www.eskom.co.za/dataportal/supply-side/station-build-up-for-the-last-7-days/). Eskom provides power generation data for the following categories: Coal (labeled as Thermal in the source data), Natural-gas, Oil (labeled as OCGT in the source data), Nuclear, Pumped Water Generation, Hydro Water Generation, Photovoltaic generation (PV), Concentrated Solar Power generation (CSP), Wind, and Other Renewable. We aggregate hydro power and solar power sources with the following equations:
Mexico
The hourly power generation data from Mexico is acquired from Gobierno De Mexico (https://www.cenace.gob.mx/Paginas/SIM/Reportes/EnergiaGeneradaTipoTec.aspx) for the following categories: Coal, Gas, Combined cycle, Internal Combustion (major fuel types including hydrocarbons such as paraffinic, olefinic, naphthenic, aromatic), Conventional Thermal, Nuclear, Hydro, Wind, Solar, Biomass, and Geothermal. We aggregate gas, oil power and other renewable power sources with the following equations:
Chile
The hourly power generation data from Chile is acquired from Coordinador Eléctrico Nacional (https://www.coordinador.cl/operacion/graficos/operacion-real/generacion-real/). Coordinador Eléctrico Nacional provides power generation data for the following categories: Coal, Petcoke, Biogas, Natural Gas, Diesel, Fuel Oil, Run-of-river, Storage, Wind, Solar, Geothermal, Biomass, and Cogeneration. The coal, gas, oil, hydro and other renewables power generation are calculated with the equations below:
Data Records
Currently, there are two data records provided in this dataset, which can be downloaded at https://github.com/KowComical/CM_Power_Data or figshare29. All data are available for 1857 days (from January 1st, 2016 to 30th June, 2022 for countries except for China and South Africa. For China, the data record starts from 1st January 2018, and for South Africa from 1st April 2018):
-
A record of 4,015 records are the daily total and source-specific power generation from 8 power sources (i.e., coal, gas, oil, hydro-power, solar-power, wind-power, other renewables (biomass, geothermal and other renewable sources)) and for 2 individual countries/regions (China (from January 2018), India).
-
A record of 2,415,102 records are the hourly total and source-specific power generation from 8 power sources (i.e., coal, gas, oil, hydro-power, solar-power, wind-power, other renewables (biomass, geothermal and other renewable sources)) and for 35 individual countries/regions (i.e., United States, EU27 & UK, Russia, Japan and Brazil, Australia, Chile, Mexico, South Africa)
-
Two label types are used: N (Normal) and F (Filtered). F stands for data records where the raw data are filtered out for outliers and missing values, then subsequently filled with methods stated in Power Generation Data Acquisition of the Method section. N stands for data records where no outliers and missing values are detected.
Technical Validation
Correlation with reference data
We compared our dataset with the reference database (IEA monthly electricity generation data20 and BP annual electricity data2) over the overlapping time period of 2019 to 2022, and the results show that our data in general agrees well with the reference data (Fig. 3). For most countries, we used the monthly electricity generation as the reference database. For countries that are not covered by IEA (Russia and South Africa), we used the BP Statistical Review of World Energy2. For countries compared to the IEA database, the overlapping period is 2019 to April 2022. For countries compared to BP’s database, the overlapping period is 2019 to 2021. The data are displayed as monthly averaged power generation. In general, for annual total power generation, the CarbonMonitor-Power database shows good agreement with the reference dataset for all countries (R2 > 0.95). This indicates that on aggregated terms, the power generation provided by CarbonMonitor-Power is in line with the reference databases. There are also strong correlations between these two databases for electricity generated by major energy sources, including coal, gas, nuclear, hydro, solar, and wind. These account for about 95.4% of total electricity generation.
There are two energy sources, however, that show lower performance, namely power generated from oil and other renewables. The lowest correlation coefficient is observed in oil-fired power (R2 = 0.661). The data in Fig. 3 shows that we have a systematic overestimation for China and a systematic underestimation for Brazil for oil-generated power. For other countries, the data points are distributed with a scatter but without a bias. As oil is the smallest energy source in the power sector, it has the largest uncertainty. As for other renewables (“biomass, geothermal and other renewables” in figure), the low correlation coefficient (R2 = 0.682) with other databases is mainly driven by the higher estimation in China by CarbonMonitor-Power. This is mainly related to the method used: subtracting coal and gas-fired power from thermal power and then distributing the non-coal and non-gas thermal power with a disaggregation factor. We have noticed that the non-coal and non-gas thermal power collected by this database is much higher than the data provided by IEA. This led to the result that we estimate higher power produced by oil and by other renewables.
Systematic bias with reference datasets for most countries
The time series of CarbonMonitor-Power are compared to other datasets in Fig. 4. The main result is that the CarbonMonitor-Power data agree well with the reference dataset in terms of the overall trend, yet with significantly shorter latency and higher temporal frequency. For most of the countries, apart from China, CarbonMonitor-Power shows lower values than the other dataset, ranging from 1% in Ember to 7% in IRENA (detailed comparison for different energy sources see Tables 4, 5. This is likely due to differences in input data sources. CarbonMonitor-Power uses national grids as the main data sources. Reference data sources use a combination of data sources, including national census data and self-derived estimation methods. It is very likely that most countries have off-grid power generation, which is accounted for by IRENA19, IEA20 and BP30, and Ember’s18 methods, but not by this study (we focus on the power grid, as stated in the method section). The exception of China is caused by the same reason. For China’s power generation, we acquired our raw data from China’s Electricity Council, which provides all the power distributed by the national electricity grid. The reference databases such as IEA acquired their data from China’s National Bureau of Statistics, which provides power generated by Power plants above the designated size (measured by annual income). Therefore, it is likely that the power generated by small power plants and some distributed photovoltaics are not accounted for by the IEA database. But in general, the discrepancy between IEA and CarbonMonitor-Power for China is not large (except for solar power generation).
Case study: UK’s grid data
Due to the lack of high-temporal resolution data, it is difficult to comprehensively compare all countries’ energy sources with other daily or hourly power generation data. It was nevertheless possible to find one additional data source (UK_ep https://electricityproduction.uk/from/all-sources/?t=10y) for the United Kingdom. Therefore, we compared the two databases for power generation from all eight sources at high time frequency as a case study (Fig. 5). From the comparison, we could find that although with some discrepancies, The hourly profiles agree very well between the two data sources except for Solar power, which is not covered by the UK-ep database. This shows that the CarbonMonitor-Power database also provides reliable information at high time frequency in addition to satisfying accuracy at aggregated time steps.
Usage Notes
As an expansion of current near-real-time data collection, it is our hope that our newly developed CarbonMonitor-Power data offers new opportunities to reveal the most up-to-date trends and variations of global power sector, especially at hourly to daily scales. We hope that CarbonMonitor-Power will facilitate a wide range of stakeholders with their research and decision making. To facilitate the usage of the CarbonMonitor-Power data, we provide the following examples to serve as a starting point on how the dataset could be used.
Hourly profile of power generation in major countries
The CarbonMonitor-Power database we present offers a wide range of possibilities in analyzing the hourly dynamics, daily patterns and seasonality of power generated in different countries. In addition to the strong holiday effect (Fig. 1c) and weekend effect (Fig. 1d) reflected by the single day hourly profiles and monthly average hourly profiles for the United States, power generation also show unique seasonal and geographical patterns in all major countries (Fig. 6). These patterns could be linked to different social-economical characteristics. For example, Japan (Fig. 6b) and United States (Fig. 6d) both show significantly increased power generation during mid-day time period in the 3rd quarter (July to September) of year 2021. However, such phenomenon is not observed for countries like France and South Africa. The increased power generation during summer mid-day in Japan and United States may be caused by increased consumption of power due to cooling demands, enhanced solar power production, or a combination of both. Our CarbonMonitor-Power dataset also provides source-specific power generation data, which allows further analysis of the energy mix for such distinctive seasonal and regional patterns. This example illustrates the advantages of a near-real-time, global-coverage, source-specific power dataset in facilitating analyses for countries across various geographical locations and with different social-economic status.
Usage case: increased use of coal as power source in EU27&UK under the impact of Russian’s invasion of Ukraine
The near-real-time and high time frequency features of this CarbonMonitor-Power dataset present a unique opportunity to closely follow the dynamics of the global and regional power system. Here we provide a usage case on how CarbonMonitor-Power may provide policy relevant information in a timely manner.
Short after the Russian invasion of Ukraine on the 24th February, 2022, with no significant increase in power demand (as compared to the same time period in 2021, Fig. 7a), power generation in Europe (EU27&UK) temporarily increased its reliance on coal as energy source (red area highlighted in Fig. 7b). Meanwhile natural gas as power source stayed at a similar level, followed by a further decline in April (Fig. 7c). From 24th February to 28th February, total power generation increased by 4.8% but coal power generation increased by 45.1%, while natural gas generated power decreased by 6.2% (all compared to 2021). From March to April, total power generation decreased by 1.3%. Among which, power generated from natural gas decreased by 6.3%, while power generated from coal increased by 23.1% (all compared to 2021). The immediate decline of gas and rise of coal as energy source in the power system may result from decreased Russian gas import, increased gas price due to speculation and panic buying, or a combination of both. The import of natural gas from Russia decreased sharply following the invasion of Ukraine, caused a further decrease of Europe’s natural gas supply26. A sudden increase of EU gas price was also observed short after the invasion: EU gas price (as shown by the Dutch TTF Natural Gas Calendar price) showed a sudden spike on the 24th February, and stayed at a high plateau with large variability for the following months31.
The disproportional and substantial increase in coal generated power immediately following the Russian invasion of Ukraine may point to a previously undiscussed increase in power associated emission increase in Europe under the impact of Russian’s invasion of Ukraine. This highlights the importance of designing a risk-resilient regional power system.
Extension of the CarbonMonitor-Power dataset
The current CarbonMonitor-Power dataset covers power generation data from three types of fossil sources (coal, gas, and oil), nuclear energy and four groups of renewable energy sources (solar energy, wind energy, hydro energy and other renewables including biomass, geothermal, etc.). The current coverage is from January, 2016 and the spatial coverage is for 37 countries across all continents. In addition to the continuous update of the existing dataset, the CarbonMonitor team is working on further extending the dataset including improved spatial resolution and coverage. To achieve these, we are collecting additional power generation data for regions and countries that are not covered yet. For missing data, we try to use proxy data including fossil fuel consumption data and climate reanalysis data to fill the data gap. In addition, daily and hourly power generation profiles of neighboring countries with similar climate and social-economic conditions could be used as proxy data. Combined with monthly/annual power generation data, it is possible to construct daily/hourly power generation dynamics for countries where no such local data is available.
Code availability
The generated datasets and the codes for producing the datasets are available from https://github.com/KowComical/CM_Power_Data and figshare29. The most up-to-date, continuously updated data can be visualized and uploaded from https://power.carbonmonitor.org. Codes are available upon reasonable requests.
References
United Nations Statistics Division. Goal 7: Ensure access to affordable, reliable, sustainable and modern energy for all. https://unstats.un.org/sdgs/report/2016/Goal-07/.
BP. Statistical Review of World Energy. bp global https://www.bp.com/en/global/corporate/energy-economics/statistical-review-of-world-energy.html (2022).
UNFCCC. National Communication submissions from Non-Annex I Parties. UNFCCC https://unfccc.int/non-annex-I-NCs.
UNFCCC. Biennial Update Report submissions from Non-Annex I Parties. UNFCCC https://unfccc.int/BURs.
UNFCCC. National Inventory Submissions 2021. UNFCCC https://unfccc.int/ghg-inventories-annex-i-parties/2021.
Deng, Z. et al. Comparing national greenhouse gas budgets reported in UNFCCC inventories against atmospheric inversions. Earth Syst. Sci. Data 14, 1639–1675 (2022).
Friedlingstein, P. et al. Global Carbon Budget 2021. https://doi.org/10.5194/essd-2021-386 (2021).
Markard, J. The next phase of the energy transition and its implications for research and policy. Nature Energy 3, 628–633 (2018).
Pfenninger, S. Energy scientists must show their workings. Nature 542, 393–393 (2017).
Pfenninger, S., DeCarolis, J., Hirth, L., Quoilin, S. & Staffell, I. The importance of open data and software: Is energy research lagging behind? Energy Policy 101, 211–215 (2017).
IEA. World Energy Outlook 2018: The Future is Electrifying. https://www.iea.org/events/world-energy-outlook-2018-the-future-is-electrifying (2018).
ENTSO-E Transparency Platform. https://transparency.entsoe.eu/.
Real-time operating grid - U.s. energy Information Administration (EIA). https://www.eia.gov/electricity/gridmonitor/dashboard/electric_overview/US48/US48.
Ruggles, T. H., Farnham, D. J., Tong, D. & Caldeira, K. Developing reliable hourly electricity demand data through screening and imputation. Sci Data 7, 155 (2020).
He, P., Liang, J., Qiu, Y. (lucy), Li, Q. & Xing, B. Increase in domestic electricity consumption from particulate air pollution. Nature Energy 5, 985–995 (2020).
Finenko, A. & Cheah, L. Temporal CO2 emissions associated with electricity generation: Case study of Singapore. Energy Policy 93, 70–79 (2016).
National Data. https://data.stats.gov.cn/english/.
Ember. Data Methodology for Open data on global electricity. https://ember-climate.org/app/uploads/2022/03/GER22-Methodology.pdf (2022).
IRENA. Renewable Energy Statistics 2022. (2022).
Monthly Electricity Statistics. IEA https://www.iea.org/data-and-statistics/data-product/monthly-electricity-statistics.
Liu, Z. et al. Global patterns of daily CO2 emissions reductions in the first year of COVID-19. Nat. Geosci. 15, 615–620 (2022).
Liu, Z. et al. Carbon Monitor, a near-real-time daily dataset of global CO2 emission from fossil fuel and cement production. Scientific Data 7, 1–12 (2020).
Liu, Z. et al. Near-real-time monitoring of global CO2 emissions reveals the effects of the COVID-19 pandemic. Nat. Commun. 11, 5172 (2020).
Huo, D. et al. Carbon Monitor Cities near-real-time daily estimates of CO2 emissions from 1500 cities worldwide. Sci. Data 9, 533 (2022).
Conway, D., Dalin, C., Landman, W. A. & Osborn, T. J. Hydropower plans in eastern and southern Africa increase risk of concurrent climate-related electricity supply disruption. Nature Energy 2, 946–953 (2017).
Zhou, C. et al. Natural gas supply from Russia derived from daily pipeline flow data and potential solutions for filling a shortage of Russian supply in the European Union (EU). https://doi.org/10.5194/essd-15-949-2023 (2022).
Dekking, F. M., Kraaikamp, C., Lopuhaä, H. P. & Meester, L. E. A Modern Introduction to Probability and Statistics: Understanding Why and How., https://doi.org/10.1007/1-84628-168-7 (Springer Science & Business Media, 2006).
IEA. Monthly Electricity Statistics. (2022).
Zhu, B. et al. CM-Power near-real-time monitoring of global power generation on hourly to daily scales. Figshare https://doi.org/10.6084/m9.figshare.21938102.v2 (2022).
British Petroleum, B. P. bp Statistical Review of World Energy. http://www.bp.com/statisticalreview (2022).
ICE, Dutch TTF Natural Gas Futures. https://www.theice.com/products/27996665/Dutch-TTF-Natural-Gas-Futures/data?marketId=5493476&span=3 (2023).
Data explorer. Ember https://ember-climate.org/data/data-explorer/.
Acknowledgements
The authors acknowledge the National Natural Science Foundation of China (grant 72140002 and 41921005), the Initiative Scientific Research Program (grant number: 20223080041) and Dushi Program from Tsinghua University. B.Z. is supported by China Postdoctoral Science Foundation (YJ20200099). P.C. receives funding from The Copernicus Atmospheric Core Service CAMS551a project.
Author information
Authors and Affiliations
Contributions
B.Z., P.C. and Z.L. designed the research and wrote the manuscript. B.Z. and Z.D. designed the methods. B.Z., X.S. and Z.D. conducted the main data processing and model developments. W.L. and C.H. developed the algorithm to acquire and reformatting, including time zone change and quality control for data from the United States; T.S. designed the algorithm for acquiring, translating and reformatting the raw data from Japan; D.C. performed data acquisition, analysis, interpretation of monthly power generation data from China; H.Z. and J.Q. contributed to the design of the China submodule, including conception and algorithm developments. P.K. and C.L. contributed to the early-stage data analysis for the reference monthly, yearly data and for factor developments. H.D., S.D. and P.G. contributed to the conception and design of the research, including algorithm developments, data interpretation and quality control. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhu, B., Deng, Z., Song, X. et al. CarbonMonitor-Power near-real-time monitoring of global power generation on hourly to daily scales. Sci Data 10, 217 (2023). https://doi.org/10.1038/s41597-023-02094-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02094-2
This article is cited by
-
Global carbon emissions in 2023
Nature Reviews Earth & Environment (2024)
-
Carbon Monitor Europe near-real-time daily CO2 emissions for 27 EU countries and the United Kingdom
Scientific Data (2023)