Harmonized and Open Energy Dataset for Modeling a Highly Renewable Brazilian Power System

Deng, Ying; Cao, Karl-Kiên; Hu, Wenxuan; Stegen, Ronald; von Krbek, Kai; Soria, Rafael; Rochedo, Pedro Rua Rodriguez; Jochem, Patrick

doi:10.1038/s41597-023-01992-9

Download PDF

Data Descriptor
Open access
Published: 22 February 2023

Harmonized and Open Energy Dataset for Modeling a Highly Renewable Brazilian Power System

Scientific Data volume 10, Article number: 103 (2023) Cite this article

4216 Accesses
3 Citations
13 Altmetric
Metrics details

Subjects

Abstract

Improvements in modelling energy systems of populous emerging economies are highly decisive for a successful global energy transition. The models used–increasingly open source–still need more appropriate open data. As an illustrative example, we take the Brazilian energy system, which has great potential for renewable energy resources but still relies heavily on fossil fuels. We provide a comprehensive open dataset for scenario analyses, which can be directly used with the popular open energy system model PyPSA and other modelling frameworks. It includes three categories: (1) time series data of variable renewable potentials, electricity load profiles, inflows for the hydropower plants, and cross-border electricity exchanges; (2) geospatial data on the administrative division of the Brazilian federal states; (3) tabular data, which contains power plant data with installed and planned generation capacities, aggregated grid network topology, biomass thermal plant potential, as well as scenarios of energy demand. Our dataset could enable further global or country-specific energy system studies based on open data relevant to decarbonizing Brazil’s energy system.

An all-Africa dataset of energy model “supply regions” for solar photovoltaic and wind power

Article Open access 31 October 2022

A Dataset for Electricity Market Studies on Western and Northeastern Power Grids in the United States

Article Open access 22 September 2023

Predictive mapping of the global power system using open data

Article Open access 15 January 2020

Background & Summary

The decarbonization of energy systems in developing countries, especially in the most populous ones, becomes a determinant factor for a global “well below 2 °C” target¹. Achieving climate neutrality requires complete or nearly complete decarbonization of the electricity system. This goal is attainable today through many technologies that provide low-carbon or even carbon-free electricity–renewable energy, nuclear power, and fossil-fueled electricity with carbon capture and storage. Low social acceptance and low economic viability make the latter two technologies more challenging to deploy on a large scale, and their timely installation questionable. However, the generation profile and production costs of variable renewable energy sources (vRES) vary with the weather, i.e., the spatial location and the availability of wind resources and solar radiation. Consequently, the decision problems in the operation and planning of reliable, stable, and carbon-neutral power systems rely on large-scale models and datasets.

Open science promotes using open models to support the transition to carbon-neutral energy systems. Typically, such open models are populated with datasets specific to the power system. However, energy data can come from different sources, and the accessibility and licensing conditions of energy data affect the degree of openness of the modelling workflows². For this reason, the open data can help drive and support the efforts of improving transparency and productivity³. In developed countries, especially in Europe, various energy system models are available as open source⁴. There are several platforms, for instance, the Open Energy Platform (https://openenergy-platform.org/) and Open Power System Data platform⁵, which coordinate various open datasets (such as climate, demand profiles, transmission grids, and scenarios) for modelling the European power system.

In contrast, energy system models for developing countries use opaque and, in most cases, inaccessible datasets. Using those datasets makes it difficult for global energy models to represent emerging nations accurately. Language barriers may further hinder researchers who belong to a different language region from utilizing available energy data.

As one of the five most populous countries, Brazil is a developing country with significant wind resources and solar radiation potential, albeit in the early stages of deployment. Brazil’s energy system is facing a strategic transition, and the rainforest constrains its capacity expansion. All this makes it valuable to understand the Brazilian energy system in detail and its potential contribution to the global energy transition. An important dataset for modelling the Brazilian energy system is published in the context of Brazil’s National Ten-Year Expansion Plan⁶. It contains the input data for the corresponding investment model⁷. However, modellers, who would like to use this dataset, must have Portuguese language skills and modelling experience. The latter is necessary, e.g., to understand the context behind certain abbreviations or numerical values, which may be either based on empirical data or generically made up to fill data gaps. In particular, the dataset is provided for four electric zones plus ten nodes, which limits analyses at higher spatial resolutions, for instance, on the federal state level.

In this context, our contribution is to make the existing energy data of Brazil better applicable for energy systems modelling. By providing the first publicly available, spatially explicit, harmonized, and English version of Brazil’s energy data, we enable researchers to replicate the Brazilian energy system and/or to improve the integration into global energy models starting from a common basis.

The assembled dataset comprises the following subcategories as detailed in the Methods: (i) geospatial data for Brazil, (ii) aggregated grid network topology, (iii) vRES potentials–profile and installable generation capacity, (iv) geographically installable capacity of biomass thermal plants, (v) hydropower plants inflow, (vi) existing and planned power generators with their capacity, (vii) electricity load profile, (viii) scenarios of sectoral energy demand and (ix) cross-border electricity exchanges. This dataset is resolved geographically by Brazilian federal states, and time series data are resolved by hours, spanning 2012–2020.

In this way, the presented dataset provides the essential information and foundation for the operational and expansion planning studies necessary to explore Brazil’s highly decarbonized energy future. For example, the dataset was used in the PyPSA-Brazil model⁸ to assess the impact of transmission grid expansion in the Brazilian power system. The dataset published in this paper has been updated and includes more years of data than the version used⁸.

Methods

This work aims to create consolidated open energy data for Brazil based on open and accessible original datasets.

Supplementary Table S1 summarizes the sources and licenses of the raw data used for each subcategory of the dataset in this paper. The following subsections elaborate on knowledge of energy data in the Brazilian context, how we obtain each dataset from its sources, and assumptions made in processing and constructing the datasets.

Geospatial data for Brazil

Brazil has five macroeconomic regions, four electric regions, 27 federal levels (26 states and one federal district–Brasília), and 5572 municipalities.

The spatial resolution of the dataset we provide is at ISO 3166-2 level⁹ and comprises 27 defined regions, i.e., federal level, illustrated in Fig. 1.

Data collection

Even though there are several map sources, the original dataset used is from the Brazilian Institute of Geography and Statistics (Portuguese: Instituto Brasileiro de Geografia e Estatística, IBGE)¹⁰. This choice is not only motivated by the licensing but also because IBGE is Brazil’s official map source and is considered the most credible source for the country’s borders and topography. The shapefile’s Coordinate Reference System (CRS) is SIRGAS 2000 (commonly known as EPSG:4674).

Data processing

These attributes in the original dataset¹⁰ are converted to English, and the CRS is re-projected to EPSG:4087. Only the federation state and the geometric information of the polygon are retained. In addition, representative coordinates (x, y) of the federal states are added and are considered as the centroid of the state polygon.

Aggregated grid network topology

The power grid connects all power generators and loads. In Brazil, the electricity grid is known as the National Interconnected Network (Portuguese: Sistema Interligado Nacional, SIN) and is managed by the National Electricity System Operator (Portuguese: Operador Nacional do Sistema Elétrico, ONS). ONS divides Brazil into four electric regions, including several federal states, as shown in Table 1. SIN has a total length of 167,000 km and connects almost the entire country (96.6% of the national territory), except for some isolated places in the northern region. Over the next few decades, 434 lines with a total length of 32,000 km are planned to be built¹¹.

Table 1 Electrical regions defined in the SIN and the federal states covered.

Subjects

Abstract

Similar content being viewed by others

An all-Africa dataset of energy model “supply regions” for solar photovoltaic and wind power

A Dataset for Electricity Market Studies on Western and Northeastern Power Grids in the United States

Predictive mapping of the global power system using open data

Background & Summary

Methods

Geospatial data for Brazil

Data collection

Data processing

Aggregated grid network topology

Data collection

Data processing

Power plants

Data collection

Data processing

Installable capacity for biomass thermal plants

Data collection

Data processing

Electricity load profiles

Data collection

Data processing

Scenarios of energy demand

Data collection

Data processing

Inflow of hydropower plants

Data collection

Data processing

Variable renewable potentials (wind and solar)

Data collection

Data processing

Cross-border electricity exchanges

Data collection

Data processing

Data Records

Geospatial data for Brazil

Grid network topology

Variable renewable potentials (wind and solar)

Installable capacity for biomass thermal plants

Inflow for hydropower plants

Power plants

Electricity load profiles

Scenarios of energy demand

Cross-border electricity exchanges

Technical Validation

Solar feed-in

Wind feed-in

Conclusion

Usage Notes

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

SECURES-Met: A European meteorological data set suitable for electricity modelling applications

Carbon-neutral power system enabled e-kerosene production in Brazil in 2050

Search

Quick links