Abstract
This paper provides an open dataset of measured energy use, solar energy production, and building air leakage data from a 328 m2 (3,531 ft2) all-electric, zero energy commercial building in Virginia, USA. Over two years of energy use data were collected at 1-hour intervals using circuit-level energy monitors. Over six years of solar energy production data were measured at 1-hour resolution by 56 microinverters (presented as daily and monthly data in this dataset). The building air leakage data was measured post-construction per ASTM-E779 Standard Test Method for Determining Air Leakage Rate by Fan Pressurization and the United States Army Corps (USACE) Building Enclosure Testing procedure; both pressurization and depressurization results are provided. The architectural and engineering (AE) documents are provided to aid researchers and practitioners in reliable modeling of building performance. The paper describes the data collection methods, cleaning, and convergence with weather data. This dataset can be employed to predict, benchmark, and calibrate operational outcomes in zero energy commercial buildings.
Measurement(s) | measured energy use • solar energy production • building air leakage |
Technology Type(s) | circuit-level energy monitor • microinverter • ASTM-E779 Standard Test Method for Determining Air Leakage Rate by Fan Pressurization |
Factor Type(s) | temporal interval |
Sample Characteristic - Environment | zero energy commercial building |
Sample Characteristic - Location | Commonwealth of Virginia |
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.16869407
Similar content being viewed by others
Background & Summary
Significant data is generated in the built environment. Design and construction documents, performance simulations, commissioning results, and building performance data (BPD) are generated on all modern buildings. Today, system advancements in energy efficiency, integration, monitoring, and connectivity are expanding our design and facility management options throughout the built environment. Internet-enabled monitoring, ubiquitous user interfaces, machine learning, and artificial intelligence are facilitating analytical approaches not previously afforded to our industry. As buildings become more complex, the need to measure, analyze, and share BPD has become salient.
Open datasets are critical for advancing building performance and analytical methods. Open BPD can be leveraged to calibrate design simulations and inform future design, construction, and operational decisions. Open BPD can facilitate benchmarking performance across building types, vintage, and geographic distributions1. Conversely, open BPD can be used to benchmark machine learning techniques for the built environment2. Simply put, open BPD is a critical component in the systems approach necessary to improve outcomes in the built environment including, but not limited to reducing carbon emissions and operating costs and improving human experiences in the built environment.
The Architecture, Engineering, Construction, and Operation (AECO) industry rarely systematically collects or shares BPD. Researchers increasingly collect BPD, yet rarely share BPD in open-source data repositories3. In recent years, however, there has been a trend to develop and share open-source BPD4,5,6,7,8,9,10,11,12. For example, there are recent open BPD contributions focused on occupant behavior impacts on energy use and indoor environmental quality (IEQ).
These contributions span human-building interactions with appliances5, heat pumps7, and natural ventilation systems8. Other recent contributions span energy use across building typologies (e.g., commercial and residential)9,10,11,12. This paper contributes to the recent trend toward open BPD datasets.
There are two novel contributions of this work. First, the longitudinal energy use and energy production data can be employed in the pursuit of zero energy building design and performance. As we race toward zero energy buildings, open BPD become critical for reducing performance gaps between expected and realized performance. The volume and variety of small commercial buildings compounds the need to go beyond simple benchmarking exercises and requires higher resolutions of data to achieve performance targets. The second contribution of this work is the building air leakage dataset combined with longitudinal energy use and energy production data. While building air leakage is an important factor in predicting energy use, estimating heating and cooling loads, and designing for indoor environmental quality (IEQ), it is rarely available in open BPD. The balance of this paper characterizes the case study project, data collection methods, and resulting data records.
Methods
Case study context
This dataset was developed from a single, non-random case study project. The building serves as a leasing office and community building for a national non-profit housing provider (referred hereafter as the “owner”). The owner’s mission is to create homes and communities that are healthy, sustainable, and affordable. The building was designed in 2013 and construction was completed in April 2014. The owner pursued EarthCraft Light Commercial (ECLC), a regional 3rd party green building program. The ECLC program was used to verify high performance design and construction targets were achieved. Table 1 provides an overview of the building specifications and the following section characterizes the data collection techniques for the energy use (demand over time – kWh), energy production, and building air leakage data.
Energy use
Energy use data (demand over time - kWh) were measured using a SiteSage energy monitoring system. 150 A current transformers (CTs) were installed on the building electrical mains, while 20 A or 50 A CTs were installed on the balance of the building’s circuits depending on the circuit load (e.g., water heating, lights, heat pump). Thirty discrete circuits are measured and reported in the dataset. Each CT transmitted energy use in one-hour intervals to an internet-enabled gateway located in the electrical panel. Data was accessed via an online user interface and downloaded as a .csv file for the purposes of this data descriptor. It is important to note that for safety reasons related to accessing a live electrical panel, a licensed electrician installed the CTs in the presence of the corresponding author.
Energy production
Energy production data were measured from the building’s roof mounted, 12.3 kWp solar photovoltaic (PV) system. The solar PV system was equipped with 56 microinverters that reported production data via power line communications to an Enphase internet-enabled gateway. The microinverter production data is aggregated and reported in daily intervals. Data was accessed through an online user interface and downloaded as .csv files for the purposes of this data descriptor. The energy production monitoring system was installed by a licensed solar contractor. Access to the user interface and data was provided to the authors by the owner. Figure 1 details the sensor architecture for the energy use and energy production data collection.
Building air leakage
The case study’s building enclosure was tested at the end of the project’s construction phase in April 2014. Building air leakage data were measured per the U.S. Army Corps of Engineers (USACE) Air Leakage Protocol for Building Envelopes13 and ASTM E779 - Standard Test Method for Determining Air Leakage Rate by Fan Pressurization14. Building air leakage rates were measured with two calibrated blower door fans, and two calibrated digital micromanometer gauges. Category 5 (CAT5) cable was used to connect the gauges to TECLOG3, a data logging software developed by the fan and gauge manufacturer. The software was also used to provide results in compliance with ASTM E779 (described in the Data Records section of this descriptor).
The USACE protocol requires two tests: a multi-point depressurization test and a multi-point pressurization test. During the test, all intentional enclosure penetrations (e.g., outdoor air, make-up air, and exhaust air penetrations) are sealed with masking. Following a pre-test baseline, ten points were collected from ±20 Pa (0.08 inH2O) to ±75 Pa (0.3 inH2O) at 5 Pa (0.02 inH2O) intervals. Following each test, a post-test baseline was measured. Each point was averaged over 10 seconds. Figure 2 provides a representation of the building air leakage tests per the USACE protocol.
Data Records
The primary and secondary data that accompany this data descriptor are provided at the Open Science Framework (OSF) respiratory15. The dataset is organized in five categories: (1) measured energy use, (2) measured energy production, (3) measured building air leakage, (4) weather, and (5) architectural and engineering (AE) documents. All data is provided in .xlsx, with the exception of the EnergyPlus weather files and AE documents. Table 2 characterizes the study’s primary data records.
To assist users of this dataset, the authors provide AE documents, publicly available weather data, and the EnergyPlus weather file (.EPW) for the case study building’s site. These data are considered as secondary data since they were not directly measured by authors. Table 3 provides an overview of the secondary data for the period of data measurements.
Energy consumption
Circuit-level energy consumption data (demand over time – kWh) were measured from June 18, 2014 to December 13, 2016. Data is organized into four datasets presenting energy use for the whole building and three end-use categories: lighting, plug & process loads (PPLs), HVAC & hot water energy use. Each dataset has circuit-level hourly data and aggregated data for total consumption for the whole building at hourly, daily, and monthly resolutions presented at separate tabs. In addition to the total energy use for the whole building and the three end-uses, energy use for first floor vs. second floor, interior vs. exterior lights, and HVAC vs. hot water are provided in the end-uses datasets in the tabs for hourly, daily, and monthly aggregated data.
Monthly energy use for the whole building varies from 985 kWh to 2,482 kWh over the measurement period (with the monthly average of 1,499 kWh). Average daily energy use over the whole measurement period is 49.4 kWh per day. Higher energy use for HVAC, hot water, and interior lighting is observed in colder months with longer night hours. Seasonal variations for HVAC energy use are over 1,000 kWh while interior lighting is about 100 kWh. No major seasonal or monthly variations were observed for exterior lightings. However, there is an increase in energy use for exterior lighting from December 2014 due to a change in the control system that resulted in some of the lights being on consistently during the day. Monthly energy use for PPLs were below 300 kWh per month with a fluctuation of 100 kWh between different months from July 2014 to October 2015. No major outliers were found in the datasets. A few outliers in PPLs were observed likely due to human-building interactions. Data for energy recovery ventilation (ERV) systems are included in the dataset, but it should be noted that the ERV was unplugged by the owner within the first month of operation due to concerns related to indoor humidity.
Energy production
Energy production data for the period of May 15, 2014 to December 31, 2020 is provided in a single .xlsx file. Data is organized in daily and monthly resolutions presented at separate tabs. Although data was measured at 1-hour intervals, the hourly resolution of data is not included in this dataset as the data export function on the solar PV user interface only exports measured data in daily intervals. Daily data was aggregated into monthly data in Excel and examined for missing data. Years 2015, 2016, and 2017 have missing data for two to eight months in the year due to accidental system shutoffs. Overall, 66 months have complete data for all days in the month. Missing data were shown in grey cells in the both daily and monthly data tabs. Monthly averages over the whole 7-year period and a 3-year period with complete data for all months of the year (2018 to 2020) are also provided in the monthly data tab. Minimum, maximum, and average values for daily energy production are provided in the daily data tab.
The Annual average of energy production is 1,218 kWh for years without missing data (2018 to 2020). More than 1,000 kWh energy is produced in a month from March through October. More energy production, however, was observed in April to August with July having the highest energy production on average over 7 years of data measurement. The highest monthly energy production was recorded as 2,213 kWh in May 2015. Figure 3 presents variations in daily energy production from August 2014 to May 2015.
Building air leakage
Building air leakage data is provided in a single .xlsx file. The USACE protocol and ASTM E779 standard require the reporting of both depressurization and pressurization tests. Results are separated by tab in the file. Each test (e.g., tab) reports three columns of data: (1) test point, (2) average induced enclosure pressure (+/− 75 Pa), and 3) total corrected fan flow (CFM).
Weather data
An EnergyPlus weather file (.EPW) for Charlottesville, VA is provided for building energy simulation applications16. Measured weather data for Charlottesville, VA provided by Weather Underground website17 from June 2014 to December 2016 is also available in the OSF respiratory. Daily and monthly resolutions of weather data are provided into two separate subfolders in the respiratory. Weather data includes temperature, dew point, humidity, wind speed, pressure, and precipitation. Daily weather data in the .xlsx file is provided in separate tabs for each month of the energy use measurement period. The average for each month is presented in the last row in the tab for that month. Monthly weather data includes heating degree days (HDDs) and cooling degree days (CDDs) and is provided in both SI and IP unit systems. The base temperature was 18 °C (65 °F) for both HDDs and CDDs. To facilitate investigating of the correlation between solar radiation and solar energy production, solar radiation data is provided from the National Oceanic and Atmospheric Administration (NOAA) website18 for Charlottesville, VA for the energy production measurement period (2014 to 2020) in hourly, daily, and monthly resolutions.
AE documents
Design documents are provided with permission from the design team in a single.pdf file. The architectural floor plans, elevations, sections, and details provide massing context and enclosure thermal performance context. Schedules and notes accompany the scaled mechanical, electrical, and plumbing plans. The corresponding author verified the field conditions against the plans during the project’s original commissioning process. Finally, a schematic plan of the solar PV system was developed by the research team and included in the AE documents folder.
Technical Validation
The authors recognize the importance of providing reliable measurements when reporting BPD. Measurement reliability is particularly important when data is sourced from a single, non-random case study since there are no other use cases to identify outliers. Table 4 summarizes the measurement instrumentation, associated standard(s), and accuracy of the study’s measurements.
The data presented in this descriptor required multiple instruments and represented multiple domains of BPD. A discrete experiment could not validate all data provided in this descriptor. During the measurement period, the research team periodically checked the energy use and production data via user interfaces for system status (e.g., on/off) and benchmarked the measured data against ASHRAE 90.1 and EnergyPlus simulations. Missing data were identified in the data cleaning process and further characterized in the dataset. Users of the dataset should recognize that the data is five years old and should account for changes over time in building technology, weather, and human-building interactions. The following section summarizes steps taken to ensure data reliability.
Energy use
The longitudinal collection of energy use data is an important feature of this dataset. The thirty months of energy use data provided in this descriptor can be used to identify monthly and yearly outliers. The authors also benchmarked the whole building site energy use intensity (EUI) with energy standard performance. For example, the case building site EUI is 56.3 kWh/m2-yr, and the ASHRAE 90.1-2016 benchmark for small office buildings is 82.0 kWh/m2-yr19. Finally, the authors validated monthly heating and cooling consumption with monthly HDDs and CDDs data as energy demand over time (kWh) for heating and cooling correlates with HDDs and CDDs for the building location. Figure 4 demonstrates the influence of monthly HDD/CDD on monthly heating and cooling energy use.
Energy production
The longitudinal collection of energy production data is an important feature of this dataset. The sixty-six months of energy production data provided in this descriptor can be used to identify monthly and yearly outliers. The provided weather data and solar radiation data can also be employed to correlate weather conditions with energy production outcomes.
Building air leakage
Beyond the fan and gauge calibration requirements, the building air leakage test must follow, and results must meet several reliability steps. Specifically, the test requires flow rates to be sampled at a minimum of 10 test pressure points (between 20–75 Pa). Flows are averaged for at least 10 seconds per test point. Per the USACE test protocol and ASTM E779, test results must achieve a squared correlation coefficient (r2) ≥0.98. Both depressurization and pressurization tests resulted in a r2 = 0.99, thus passing the standard. Finally, during the test, TECLOG3 (e.g., building air leakage software) records the multi-point test results and generates curve fit to the Power Law Eq. (1) in compliance with ASTM E779.
where Q is the infiltration/exfiltration rate, C is the air flow coefficient, ΔP is the pressure differences across the building enclosure, and n is the pressure exponent. The curve predicts infiltration rates from 0 Pa (0.0 inH2O) to 75 Pa (0.3 inH2O). See Fig. 5a for the pressurization curve and Fig. 5b for the depressurization curve.
Code availability
No code was used in the generation of this data. No code is required to access or analyze this dataset.
References
Roth, J., Lim, B., Jain, R. K. & Grueneich, D. Examining the feasibility of using open data to benchmark building energy usage in cities: A data science and policy perspective. Energy Policy 139, 111327 (2020).
Granderson, J. et al. Accuracy of automated measurement and verification (M&V) techniques for energy savings in commercial buildings. Applied Energy 173, 296–308 (2016).
Huebner, G. M. & Mahdavi, A. A structured open data collection on occupant behaviour in buildings. Sci. Data 6, 292 (2019).
Kriechbaumer, T. & Jacobsen, H.-A. BLOND, a building-level office environment dataset of typical electrical appliances. Sci. Data 5, 180048 (2018).
Kelly, J. & Knottenbelt, W. The UK-DALE dataset, domestic appliance-level electricity demand and whole-house demand from five UK homes. Sci. Data 2, 150007 (2015).
Mahdavi, A., Berger, C., Tahmasebi, F. & Schuss, M. Monitored data on occupants’ presence and actions in an office building. Sci. Data 6, 290 (2019).
Ruhnau, O., Hirth, L. & Praktiknjo, A. Time series of heat demand and heat pump efficiency for energy system modeling. Sci. Data 6, 189 (2019).
Schweiker, M., Kleber, M. & Wagner, A. Long-term monitoring data from a naturally ventilated office building. Sci. Data 6, 293 (2019).
Miller, C. et al. The Building Data Genome Project 2, energy meter data from the ASHRAE Great Energy Predictor III competition. Sci. Data 7, 368 (2020).
Rashid, H., Singh, P. & Singh, A. I-BLEND, a campus-scale commercial and residential buildings electrical energy dataset. Sci. Data 6, 190015 (2019).
Paige, F., Agee, P. & Jazizadeh, F. flEECe, an energy use and occupant behavior dataset for net-zero energy affordable senior residential buildings. Sci. Data 6, 291 (2019).
Klemenjak, C., Kovatsch, C., Herold, M. & Elmenreich, W. A synthetic energy dataset for non-intrusive load monitoring in households. Scie. Data 7, 108 (2020).
Zhivov, A., Bailey, D. & Herron, D. USACE Air Leakage Test Protocol for Building Envelopes- Version 3 (2012).
ASTM E779-10. Standard Test Method for Determining Air Leakage Rate by Fan Pressurization. http://www.astm.org/cgi-bin/resolver.cgi?E779-19 (2010).
Agee, P. & Nikdel, L. An energy use, energy production, and building air leakage dataset for a zero energy commercial building. Open Science Framework. https://doi.org/10.17605/OSF.IO/3KDHQ (2021).
Weather Data by Location. EnergyPlus https://www.energyplus.net/weather-location/north_and_central_america_wmo_region_4/USA/VA/USA_VA_Charlottesville-Albemarle.County.AP.724016_TMY3.
Charlottesville, VA Weather History. Weather Underground https://www.wunderground.com/history/monthly/us/va/charlottesville/KCHO.
NOAA NCEI. Integrated Surface Dataset (Global). NOAA National Centers for Environmental Information https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.ncdc:C00532 (2001).
PNNL. Energy Savings Analysis: ANSI/ASHRAE/IES Standard 90.1-2016. https://www.energycodes.gov/sites/default/files/documents/02222018_Standard_90.1-2016_Determination_TSD.pdf (2017).
Acknowledgements
The authors would like to acknowledge the assistance and support of the U.S. Department of Energy’s, Advanced Commercial Building Initiative (Contract Number: EE0006290, OSTI Identifier: 1351293), Community Housing Partners, Southface Energy Institute, Viridiant, and Arnold Design Studio.
Author information
Authors and Affiliations
Contributions
Philip Agee installed the energy monitoring equipment, performed the building air leakage testing, led the manuscript and dataset development, coordination, obtained the informed consent from the owner, and performed the final review. Leila Nikdel organized, cleaned, and converged the dataset. She also wrote the dataset description and reviewed the final manuscript. Sydney Roberts oversaw the energy use and building air leakage data collection and reviewed the final manuscript. https://guides.lib.vt.edu/oasf/request.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
About this article
Cite this article
Agee, P., Nikdel, L. & Roberts, S. A measured energy use, solar production, and building air leakage dataset for a zero energy commercial building. Sci Data 8, 299 (2021). https://doi.org/10.1038/s41597-021-01082-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-021-01082-8