| Literature DB >> 33110076 |
Clayton Miller1, Anjukan Kathirgamanathan2, Bianca Picchetti3, Pandarasamy Arjunan4, June Young Park5, Zoltan Nagy5, Paul Raftery6, Brodie W Hobson7, Zixiao Shi7, Forrest Meggers8.
Abstract
This paper describes an open data set of 3,053 energy meters from 1,636 non-residential buildings with a range of two full years (2016 and 2017) at an hourly frequency (17,544 measurements per meter resulting in approximately 53.6 million measurements). These meters were collected from 19 sites across North America and Europe, with one or more meters per building measuring whole building electrical, heating and cooling water, steam, and solar energy as well as water and irrigation meters. Part of these data was used in the Great Energy Predictor III (GEPIII) competition hosted by the American Society of Heating, Refrigeration, and Air-Conditioning Engineers (ASHRAE) in October-December 2019. GEPIII was a machine learning competition for long-term prediction with an application to measurement and verification. This paper describes the process of data collection, cleaning, and convergence of time-series meter data, the meta-data about the buildings, and complementary weather data. This data set can be used for further prediction benchmarking and prototyping as well as anomaly detection, energy analysis, and building type classification.Entities:
Year: 2020 PMID: 33110076 PMCID: PMC7591488 DOI: 10.1038/s41597-020-00712-x
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Overview of the sites from which the building energy meter data was collected.
| Site | UID | Kaggle | Actual Site Name | Location | Climate | Buildings | Meters |
|---|---|---|---|---|---|---|---|
| Panther | 1P4YFG | 0 | Univ. of Central Florida (UCF) | Orlando, FL | 2 A | 136 | 299 |
| Robin | 1TKL5P | 1 | Univ. College London (UCL) | London, UK | 4 A | 52 | 67 |
| Fox | 4QFLSM | 2 | Arizona State Univ. (ASU) | Tempe, AZ | 2B | 137 | 306 |
| Rat | 72SGIQ | 3 | Washington DC - City Buildings | Washington DC | 4 A | 305 | 305 |
| Bear | 7E44IQ | 4 | Univ. of California - Berkeley | Berkeley, CA | 3 C | 92 | 92 |
| Lamb | 9T5ZA2 | 5 | Cardiff - City Buildings | Cardiff, UK | 4 A | 147 | 265 |
| Eagle | EQDHIP | 6 | Anonymous | N/A | 4 A | 47 | 106 |
| Moose | H7PNXU | 7 | Ottawa - City Buildings | Ottawa, Ontario | 6 A | 15 | 43 |
| Gator | I9U4WZ | 8 | Anonymous | N/A | 2 A | 74 | 74 |
| Bull | JG98YH | 9 | Univ. of Texas - Austin | Austin, TX | 2 A | 124 | 308 |
| Bobcat | JP4TNW | 10 | Anonymous | N/A | 5B | 36 | 116 |
| Crow | JTM0LY | 11 | Carleton Univ. | Ottawa, Ontario | 6 A | 5 | 15 |
| Wolf | RFO3TV | 12 | Univ. College Dublin (UCD) | Dublin, Ireland | 5 A | 36 | 66 |
| Hog | SREOJG | 13 | Anonymous | Anonymous | 6 A | 163 | 336 |
| Peacock | WI83D6 | 14 | Princeton University | Princeton, NJ | 5 A | 106 | 298 |
| Cockatoo | YYAFES | 15 | Cornell University | Cornell, NY | 6 A | 124 | 282 |
| Shrew | L2HJLD | — | UK Parliment | London, UK | 4 A | 9 | 13 |
| Swan | N950XM | — | Anonymous | N/A | 3 C | 21 | 55 |
| Mouse | ZVJUMW | — | Ormand Street Hospital | London, UK | 4 A | 7 | 7 |
Each site is given an animal-like site code name, a UID that corresponds to some of the data convergence processes, the Kaggle Site ID that was included in the competition, and the Actual Site Name, Location and Climate Zone. Several of the sites are to remain anonymous based on discussions with the data donors. The last two columns indicate the number of buildings and meters where two years of hourly, whole building meter data were collected from each site.
Fig. 1Main features distribution in metadata file that describes the various buildings from which the meter data was collected. Several of the meta-data categories are available for all buildings including the Primary Use Category of the building (primaryspaceusage), the Sub-primary Use Category (subprimaryspaceusage), Gross Floor Area (sqm), Time Zone (timezone), Weather Data, and Meter Type.
Sites with data that are publicly available to download online.
| Site | Actual Site Name | Online source |
|---|---|---|
| Panther (0) | Univ. of Central Florida (UCF) | |
| Robin (1) | Univ. College London (UCL) | |
| Fox (2) | Arizona State Univ. (ASU) | |
| Bear (4) | UC Berkeley (UCB) | |
| Lamb (5) | Cardiff - City Buildings | |
| Cockatoo (15) | Cornell University | |
| Shrew | UK Parliment | |
| Mouse | Ormand Street Hospital |
The site name includes the Kaggle ID in parentheses.
ISD weather station data sources for the non-anonymous sites.
| Site | ISD Station Code |
|---|---|
| Panther (0) | 722050-12815 |
| Robin (1) | 037720-99999 |
| Fox (2) | 722780-23183 |
| Rat (3) | 724050-13743 |
| Bear (4) | 724930-23230 |
| Lamb (5) | 037150-99999 |
| Moose (7) | 710630-99999 |
| Bull (9) | 722544-13958 |
| Crow (11) | 710630-99999 |
| Wolf (12) | 039690-99999 |
| Peacock (14) | 724095-14792 |
| Cockatoo (15) | 725155-94761 |
| Shrew | 037720-99999 |
| Mouse | 037720-99999 |
The site name includes the Kaggle ID in parentheses.
Fig. 2Main feature distributions of the weather data set.
Overview of original measurement units for the raw data collected from each site.
| Site | Chilled water | Electricity | Gas | Hot water | Solar | Steam | Water | Irrigation |
|---|---|---|---|---|---|---|---|---|
| Panther (0) | kBTU | kBTU | kBTU | gallons | gallons | |||
| Robin (1) | kWh | |||||||
| Fox (2) | Tons | kW | mmBTU | |||||
| Rat (3) | kWh | |||||||
| Bear (4) | kW | |||||||
| Lamb (5) | kWh | kWh | ||||||
| Eagle (6) | mmBTU | kW | mmBTU | lbs | ||||
| Moose (7) | KJ | KJ | ||||||
| Gator (8) | kWh | |||||||
| Bull (9) | Tons | kWh | lbs | |||||
| Bobcat (10) | kBTU | kWh | kBTU | kBTU | kWh | gallons | ||
| Crow (11) | kWh | kWh | kWh | |||||
| Wolf (12) | kWh | m3 | liters | |||||
| Hog (13) | Tons | kWh | lbs | |||||
| Peacock (14) | Tons | kW | lbs | |||||
| Cockatoo (15) | Tons | kW | Tons | lbs | ||||
| Shrew | kWh | kWh | ||||||
| Swan | Tons | kWh | kBTU | |||||
| Mouse | kWh |
All data were subsequently converted to kWh or liters. The site name includes the Kaggle ID in parentheses.
Overview of measurement unit conversion process.
| Unit | Conversion Factor |
|---|---|
| kW | 1 kWh |
| tons | 1 kWh |
| kBTU | 1 kWh |
| MJ | 1 kWh |
| mmBTU | 1 kWh |
| therm | 1 kWh |
| cubic meter gas | 1 kWh |
| lb/hour steam | 1 kWh |
| gallons | 1 liter = gallons * 0.264172 |
All energy-related meters were converted to kWh or liters from the various raw data units.
Fig. 3Normalized meter consumption expressed as the daily energy consumption (kWh) per area unit (square feet) of the building that is then scaled to Min-max scaling (for a range of 0–1). Each heatmap corresponds to a meter type, the horizontal access for all graphics is the two year time range, and the vertical axis are the range of meters sorted anonymously from (bottom-to-top) from lowest to highest scaled daily normalized consumption.
Fig. 6Breakout detection heat map sorted (bottom-to-top) according to increasing number of breakouts detected. The more breakouts detected in a time-series data set, the more volatility is incurred in the data set.
Fig. 4Data quality plot of each meter type. Sorted (bottom-to-top) according to increasing number of good data.
Fig. 5Weather sensitivity plot of each meter type. Spearman rank coefficient was calculated between the meter reading (kWh or liters) and the outside air temperature (degrees Celsius) for each month. Sorted (bottom-to-top) according to increasing sum of coefficients.
| Measurement(s) | electrical energy • energy • solar energy • irrigation |
| Technology Type(s) | Gauge or Meter Device |
| Factor Type(s) | geographic location • temporal interval |
| Sample Characteristic - Environment | building |
| Sample Characteristic - Location | North America • Europe |