Answers to frequently asked questions about the DOE Building Performance Database (BPD).
Database and Analysis Information
- What kinds of buildings does the BPD have?
- What are the data sources that populate the BPD?
- Does the BPD have time series data?
- How do you ensure that the data from these multiple sources is consistent and valid?
- What data format does the BPD utilize?
- Why are there no buildings in my zip code?
- I ran the same query twice and received different results. Is this an error?
- Can I export the results?
- Does the BPD conduct pre- and post-retrofit analysis of the same buildings?
- Is the BPD Weather Normalized?
- How can you make sure there aren't two records about the same building in the BPD?
Relationship to other Federal Government Efforts
- How does the BPD relate to ENERGY STAR Portfolio Manager?
- How does the BPD relate to CBECS and RECS?
- How does the BPD relate to the GreenButton initiative?
Data Privacy and Security
- Can I access individual records?
- Does the BPD include personally identifiable information (PII)?
- How does the BPD handle data privacy?
- How does the BPD handle data security?
What is the purpose of the DOE Buildings Performance Database?
The BPD provides access to empirical data on the actual energy performance and physical and operational characteristics of commercial and residential buildings. The database was developed to stimulate investment in energy-efficiency retrofit projects and services.
What building energy performance data is included in the BPD?
All records in the database include a minimum of one year of the building's energy usage, location, climate zone, building classification (residential or commercial), type (office, retail, etc), and floor area. Some records also contain age, operational characteristics (such as types of activities and associated floor areas, operating hours, and number of occupants) and/or asset information (such as lighting type and controls, air distribution configuration, heating and cooling equipment types, etc, wall, roof and window characteristics, etc).
How can I contribute data to the BPD?
The BPD is a crowd-sourcing effort that depends on the participation of the energy-efficiency industry to succeed. If you are interested in contributing a dataset to the database or have other recommended sources, please contact BPD@ee.doe.gov. Files can be sent in any electronic format and should be marked "PROPRIETARY." Once it is received, LBNL cleans and formats the dataset for upload into the BPD. Data contributors can request a copy of their cleansed and formatted dataset.
The BPD is seeking datasets of >50 records that include, at a minimum, one year of the building's energy usage, location at the zip code level, building classification (residential or commercial), type (office, retail, etc), and floor area. Additional fields that are useful are: age, operational characteristics (such as types of activities and associated floor areas, operating hours, and number of occupants) and/or asset information (such as lighting type and controls, air distribution configuration, heating and cooling equipment types, wall, roof and window characteristics, etc). The BPD accepts smaller datasets if they contain richer information.
Learn more on the BPD Contributing Data page.
Database and Analysis Information
What kinds of buildings does the BPD have?
The BPD has data on all types of commercial and residential buildings, but not campuses or industrial buildings. While there is not currently sufficient information to conduct analysis for some building types, it will become possible as the dataset grows.
What are the data sources that populate the BPD?
The BPD conducts ongoing outreach to public and private partners to contribute data, and new datasets are being added on a regular basis. Data contributors include dozens of federal, state and local governments, energy-efficiency programs, real estate owners, and product and service providers. Note that records are anonymized in the BPD, and their source is not distinguishable in the platform.
Does the BPD have time series data?
All of the buildings have at least one year of energy consumption data – as either an annual total, monthly, daily or interval. In the future, it will be possible to add more years or greater granularity of consumption data for buildings that are already in the database.
How do you ensure that the data from these multiple sources is consistent and valid?
There are two processes used to ensure consistent and valid data within the BPD. The data transformation process converts source data into a consistent format in the BPD. Data cleansing and validation procedures are then used to identify potentially suspect data, including in and out of range checks, statistical outliers, invalid values, etc.
Real data is messy and has collection and entry errors, but the BPD team is continually improving upon its data cleansing and error checking methodologies. As the dataset grows it will also become possible to filter the data to only view records that were professionally validated. Please contact us if you are interested in more detailed information about our data formatting and cleansing procedures.
What data format does the BPD utilize?
The Building Energy Data Exchange Specification is a data format designed to support analysis of the measured energy performance of commercial and residential buildings, with fields for building characteristics, efficiency measures and energy use. The BEDES defines and describes these data fields and their relationships. The DOE is currently in the process of refining the BEDES for more widespread use. If you are interested in more information, please visit the BEDES website or contact us.
Why are there no buildings in my zip code?
While the BPD has hundreds of thousands of buildings, there are still some zip codes for which no data are available. The BPD will include buildings in additional zip codes as further data are imported. If you know of any building performance data sources for your zip-code, please contact BPD@ee.doe.gov.
I ran the same query twice and received different results. Is this an error?
The database is updated when new data is available. If the number of records from your latest query is greater than the number of records from you initial query then more data has been added to the database. This will serve to increase the accuracy of your search.
Can I export the results?
The table allows users to generate and export characteristics of the peer group categorized by state or facility type. The table shows the building count, median EUI (kBtu/ft2), median floor area, median year built, and median hours occupied.
Does the BPD allow pre- and post-retrofit analysis of the same buildings?
The BPD does not rely on pre- and post- retrofit data of the same buildings. The large sample set enables comparison of the energy performance distribution of two sets of buildings that differ in one variable, such as the type of boiler. By comparing a group of single-family homes with boilers to a similar set of single-family homes that have heat pumps, the database can demonstrate the likely savings from an equipment change.
It is possible that this type of comparisons could include pre and post information for the same buildings. Each record in the BPD is a snapshot of a specific building, its characteristics and at least a year of energy consumption data. If a change is made in the building, it is entered as a new record with new characteristics and associated consumption. As the dataset grows, the BPD will add filters to look at buildings for which there are pre/post information or multi-year data.
Is the BPD Weather Normalized?
As a design principle, the raw BPD dataset includes empirical information that is not manipulated in any way.
How can you make sure there aren't two records about the same building in the BPD?
The BPD runs analysis to identify duplicates and ensure the same building isn't recorded twice. Duplicate records are identified by searching for records that have the same size, SF and EUI in the same zip code. But there is still a small chance that a building could be in the BPD twice if the records are inconsistent enough that the checks don't catch them. Real data is messy and has collection and entry errors, but the BPD team is continually improving upon its data cleansing and error checking methodologies.
Note that each record in BPD in a snapshot of a specific building and its characteristics that are associated with at least a year of consumption data. So if a change is made in the building, it is entered as a new record with new characteristics and associated consumption
Relationship to other Federal Government Efforts
How does the BPD relate to ENERGY STAR Portfolio Manager?
BPD and Portfolio Manager are complementary. Portfolio Manager is a benchmarking and tracking tool that collects information on your building in order to generate a whole-building energy-efficiency score. The BPD allows users to explore the relationship between energy performance and physical and operational characteristics of many similar buildings in order to identify and evaluate efficiency opportunities. In fact, the BPD contains data on all of the buildings in Portfolio Manager that have been engineer-verified.
How does the BPD relate to CBECS and RECS?
The BPD serves a complementary function to Energy Information Administration's Commercial Buildings Energy Consumption Survey (CBECS) and Residential Energy Consumption Survey (RECS). CBECS and RECS are statistically valid samples of the national commercial and residential building stocks, and are in fact two of the data sources for the BPD.
The BPD is intended to be an actuarial database (like what insurance companies use) that contains the critical mass of records needed for users to examine the performance of similar buildings, while analyzing and controlling for many factors. Because the BPD is collecting real data from a large number of sources in order to achieve scale quickly, it will never be a perfectly statistically valid sample. CBECS and RECS have many fewer buildings than the BPD, and the survey takes much longer to collect. As a result, CBECS or RECS may only have one record of a specific building type in a particular geographic area. By contrast, BPD users will be able to get a good sense of how certain types of buildings in specific local markets perform.
How does the BPD relate to the Green Button initiative?
The Green Button initiative is an industry-led effort that responds to a White House initiative to provide utility customers, such as building owners and managers and major tenants, with easy and secure access to their energy usage information in a standardized electronic format. Using Green Button Download My Data, customers are able to securely download their own detailed energy usage with a simple click of a literal "Green Button" on electric utilities' websites. The BPD is able to accept Green Button data but the minimum data requirements for the BPD include additional fields as well, such as square footage and building age.
Data Privacy and Security
Can I access individual records?
The BPD does not provide access to raw data – it serves as an interface to analyze the dataset. Data can only be analyzed in aggregate through the BPD interface. The BPD analyses will never return a small enough number of records that it reveals information that could be used to identify a building.
Does the BPD include personally identifiable information (PII)?
The BPD does not contain any personally identifiable information such as building owners and occupants, street address, owner, etc. The BPD team internally assigns a unique building identifier to each record in order to update the source datasets with new information, but these identifiers cannot be accessed by BPD users.
How does the BPD handle data privacy?
Individual building records or source datasets will not be released publicly. Any data file that is submitted and marked as "proprietary data" is protected by law from release under the Freedom of Information Act. Even data that are obtained from public sources without restrictions are treated according to the BPD's stringent use and privacy rules, and therefore cannot be distinguished from other records in the database or provided to outside entities. For a copy of our privacy terms, please contact us.
How does the BPD handle data security?
The BPD Databases security policies align with DOE policies and Information Security best-practices. The BPD employs 2048-bit Secure Sockets Layer (SSL) certificates and 256 bit encryption of data. If you would like more detailed information on our data security policies, please visit the BPD Developers' Security page or contact us.