| Literature DB >> 28423039 |
Krzysztof Gajowniczek1, Tomasz Ząbkowski1.
Abstract
Leveraging smart metering solutions to support energy efficiency on the individual household level poses novel research challenges in monitoring usage and providing accurate load forecasting. Forecasting electricity usage is an especially important component that can provide intelligence to smart meters. In this paper, we propose an enhanced approach for load forecasting at the household level. The impacts of residents' daily activities and appliance usages on the power consumption of the entire household are incorporated to improve the accuracy of the forecasting model. The contributions of this paper are threefold: (1) we addressed short-term electricity load forecasting for 24 hours ahead, not on the aggregate but on the individual household level, which fits into the Residential Power Load Forecasting (RPLF) methods; (2) for the forecasting, we utilized a household specific dataset of behaviors that influence power consumption, which was derived using segmentation and sequence mining algorithms; and (3) an extensive load forecasting study using different forecasting algorithms enhanced by the household activity patterns was undertaken.Entities:
Mesh:
Year: 2017 PMID: 28423039 PMCID: PMC5396872 DOI: 10.1371/journal.pone.0174098
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Daily electricity demand load profiles across a 24 hr period on 21st July 2013 based on WikiEnergy data [2]; (A) for an individual dwelling; (B) aggregated for 46 households.
Appliances monitored in AMPds.
| Appliance | Cut-off value (Watts) |
|---|---|
| North Bedroom | 2 |
| Master and South Bedroom | 9 |
| Basement Plugs and Lights | 7 |
| Clothes Dryer | 7 |
| Clothes Washer | 1 |
| Dining Room Plugs | 1 |
| Dishwasher | 1 |
| Electronics Workbench | 1 |
| Security/Network Equipment | 1 |
| Kitchen Fridge | 1 |
| Forced Air Furnace: Fan and Thermostat | 1 |
| Garage | 1 |
| Heat Pump | 100 |
| Instant Hot Water Unit | 7 |
| Home Office | 20 |
| Outside Plug | 1 |
| Rental Suite Sub-Panel | 50 |
| Entertainment: TV, PVR, AMP | 30 |
| Utility Room Plug | 8 |
| Wall Oven | 1 |
The matrix with the probabilities of appliance turn ON events in each hour.
| hour | Clothes Dryer | Clothes Washer | Dishwasher | Heat Pump | Instant Hot Water Unit |
|---|---|---|---|---|---|
| 0.017 | 0 | 0.017 | 0.016 | 0.006 | |
| 0.002 | 0 | 0.005 | 0.028 | 0.001 | |
| 0.007 | 0 | 0 | 0.035 | 0.001 | |
| 0.006 | 0 | 0 | 0.038 | 0.001 | |
| 0.008 | 0 | 0 | 0.043 | 0.001 | |
| 0.008 | 0 | 0 | 0.047 | 0.005 | |
| 0.008 | 0.002 | 0 | 0.063 | 0.04 | |
| 0.013 | 0.008 | 0.005 | 0.059 | ||
| 0.01 | 0.042 | 0.016 | 0.061 | 0.05 | |
| 0.021 | 0.028 | 0.054 | 0.054 | ||
| 0.044 | 0.05 | 0.053 | 0.044 | ||
| 0.05 | 0.056 | 0.054 | 0.036 | ||
| 0.045 | 0.043 | 0.037 | |||
| 0.052 | 0.041 | 0.038 | |||
| 0.051 | 0.041 | 0.034 | |||
| 0.06 | 0.061 | 0.053 | 0.051 | 0.051 | |
| 0.065 | 0.057 | 0.057 | 0.052 | 0.068 | |
| 0.063 | 0.05 | 0.056 | 0.033 | ||
| 0.051 | 0.044 | 0.029 | |||
| 0.07 | 0.054 | 0.029 | |||
| 0.05 | 0.044 | 0.025 | |||
| 0.049 | 0.02 | 0.035 | 0.055 | ||
| 0.004 | 0.039 | 0.039 | |||
| 0.065 | 0 | 0.044 | 0.012 | 0.022 |
Fig 2Dendrogram for grouping the electrical appliances throughout the day.
Fig 3Dendrogram for grouping the electrical appliances throughout the week.
Fig 4The initial over-representation map.
Fig 5The final GCA over-representation map with clusters.
Fig 6The final GCA over-representation map including appliances and the days of their usage.
Selected sequential rules extracted from AMPDs data.
| Sequence rule | Support (%) | Confidence (%) | Lift |
|---|---|---|---|
| instant & clothes washer & dishwasher = = > clothes washer & dishwasher | 4 | 80 | 13.86 |
| instant & clothes washer = = > dryer & dishwasher = = > dryer & dishwasher | 3 | 56 | 9.77 |
| heat & clothes washer = = > instant & heat & clothes washer = = > heat & dryer | 3 | 67 | 6.68 |
| instant & heat & clothes washer = = > instant & heat & clothes washer = = > heat & dryer | 3 | 64 | 6.40 |
| instant = = > heat & clothes washer = = > instant & heat & dryer | 5 | 47 | 5.62 |
Feature vector used in forecasting.
| Attribute | Description | Formula |
|---|---|---|
| 1–24 | Hour indicator (dummy variable) | |
| 25–55 | Day of the month indicator (dummy variable) | |
| 56–62 | Day of the week indicator (dummy variable) | |
| 63–74 | Month indicator (dummy variable) | |
| 75 | Holiday indicator (dummy variable) | |
| 76 | Sunset indicator (dummy variable) | |
| 77–100 | Load of previous 24 hours | |
| 101–104 | Minimum load of previous 3, 6, 12, 24 hours | |
| 105–108 | Maximum load of previous 3, 6, 12, 24 hours | |
| 109–114 | Load in the same hour of the previous week (6 days) | |
| 115–118 | Load in the same hour of the same day in previous weeks (4 weeks) | |
| 119–122 | Average temperature observed over previous hourly periods | |
| 123–128 | Average temperature observed in the same hour over the previous week (6 days) | |
| 129–132 | Average weekly temperature observed in previous i-day periods | |
| 133–136 | Average humidity observed over previous hourly periods | |
| 137–142 | Average humidity observed in the same hour over the previous week (6 days) | |
| 143–146 | Average humidity observed in previous i-day periods |
Notation [+1] stands for the next element from the set of indices i {1,3,6,12,24} e.g. avg{T,…,T} or avg{T,…,T}.
Feature vector to describe hourly usage patterns.
| Attribute No. | Description | Formula |
|---|---|---|
| 147–166 | Number of switch on states (activations) for each appliance (Dryer, Wash, Dish, Heat, Instant) over previous hourly periods | ∑ |
| 167–176 | Number of switch on states (activations) for each appliance (Dryer, Wash, Dish, Heat, Instant) over previous daily periods | ∑ |
| 177–196 | Number of switch on states (activations) for each appliance (Dryer, Wash, Dish, Heat, Instant) in previous i-day periods | ∑ |
| 197–221 | Number of hours between the most recent five successive activations of each device | ∑ |
The average 24 hour forecasting results based on past usage data (without usage patterns variables).
| Model | MAPE (%) | r_MAPE (%) | Acc (%) | MSE | MAPE (%) | r_MAPE (%) | Acc (%) | MSE | MAPE (%) | r_MAPE (%) | Acc (%) | MSE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training dataset | Validation dataset | Testing dataset | ||||||||||
| Z24 | 42.94 | 33.79 | 40.90 | 0.61 | 40.33 | 31.58 | 43.68 | 0.59 | 37.69 | 29.78 | 45.83 | 0.44 |
| Fg | 51.86 | 41.66 | 34.28 | 0.72 | 45.74 | 37.89 | 37.59 | 0.77 | 56.27 | 42.66 | 37.80 | 0.63 |
| ARIMA | 38.24 | 35.27 | 31.01 | 0.35 | 50.60 | 49.34 | 18.57 | 0.49 | 41.14 | 39.39 | 20.83 | 0.26 |
| L_f | 36.08 | 33.67 | 34.37 | 0.32 | 33.72 | 32.08 | 35.81 | 0.43 | 36.50 | 34.25 | 35.71 | 0.23 |
| L_b | 36.03 | 33.64 | 34.51 | 0.32 | 34.19 | 32.37 | 35.07 | 0.40 | 36.05 | 34.07 | 33.63 | 0.22 |
| KNN | 33.36 | 31.42 | 36.15 | 0.31 | 35.01 | 33.67 | 33.28 | 0.39 | 30.27 | 28.89 | 41.37 | 0.22 |
| RPART | 37.08 | 33.66 | 36.36 | 0.32 | 38.69 | 35.33 | 37.00 | 0.42 | 38.50 | 34.20 | 35.12 | 0.24 |
| RF | 0.48 | 0.34 | 100.00 | 0.00 | 32.72 | 30.86 | 37.89 | 0.39 | 32.57 | 29.74 | 41.07 | 0.22 |
| NNET | 32.39 | 30.11 | 39.99 | 0.34 | 32.19 | 30.06 | 41.60 | 0.41 | 30.28 | 28.14 | 42.26 | 0.21 |
| SVR | 28.73 | 27.26 | 43.98 | 0.35 | 28.95 | 28.06 | 44.43 | 0.46 | 26.78 | 25.26 | 47.02 | 0.23 |
The average 24 hour forecasting results based on past usage data and enhanced data with usage pattern variables.
| Model | MAPE (%) | r_MAPE (%) | Acc (%) | MSE | MAPE (%) | r_MAPE (%) | Acc (%) | MSE | MAPE (%) | r_MAPE (%) | Acc (%) | MSE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training dataset | Validation dataset | Testing dataset | ||||||||||
| Z24 | 42.94 | 33.79 | 40.90 | 0.61 | 40.33 | 31.58 | 43.68 | 0.59 | 37.69 | 29.78 | 45.83 | 0.44 |
| Fg | 51.86 | 41.66 | 34.28 | 0.72 | 45.74 | 37.89 | 37.59 | 0.77 | 56.27 | 42.66 | 37.80 | 0.63 |
| ARIMA | 38.24 | 35.27 | 31.01 | 0.35 | 50.60 | 49.34 | 18.57 | 0.49 | 41.14 | 39.39 | 20.83 | 0.26 |
| L_f | 36.08 | 33.59 | 34.29 | 0.32 | 35.92 | 34.26 | 30.01 | 0.42 | 43.11 | 41.37 | 30.06 | 0.25 |
| L_b | 35.93 | 33.40 | 34.86 | 0.32 | 36.56 | 34.32 | 32.24 | 0.40 | 33.26 | 31.30 | 38.69 | 0.22 |
| KNN | 38.50 | 36.99 | 27.17 | 0.35 | 41.12 | 39.02 | 27.19 | 0.40 | 34.00 | 32.44 | 33.93 | 0.22 |
| RPART | 39.18 | 35.61 | 35.16 | 0.36 | 39.86 | 35.87 | 35.36 | 0.42 | 39.38 | 36.03 | 38.39 | 0.23 |
| RF | 0.48 | 0.36 | 100.00 | 0.00 | 33.26 | 31.13 | 37.15 | 0.39 | 29.41 | 27.33 | 48.21 | 0.21 |
| NNET | 29.65 | 28.10 | 42.05 | 0.36 | 27.61 | 26.28 | 46.21 | 0.45 | 23.62 | 22.54 | 54.46 | 0.25 |
| SVR | 32.94 | 31.53 | 34.67 | 0.34 | 32.39 | 31.39 | 35.51 | 0.46 | 40.49 | 38.80 | 23.51 | 0.24 |
Fig 7The real load vs. forecasts of the neural network and random forest models.
Aggregated results in terms of the MAPE for 46 households.
The number of households is presented in brackets.
| Results | Modeling method | ||||||
|---|---|---|---|---|---|---|---|
| L_f | L_b | KNN | RPART | RF | NNET | SVR | |
| 23.91% (11) | 10.87% (5) | 36.96% (17) | 39.13% (18) | 15.22% (7) | 82.61% (38) | 19.57% (9) | |
| 71.74% (33) | 84.78% (39) | 56.52% (26) | 26.09% (12) | 71.74% (33) | 6.52% (3) | 69.57% (32) | |
| 4.35% (2) | 4.35% (2) | 6.52% (3) | 34.78% (16) | 13.04% (6) | 10.87% (5) | 10.87% (5) | |
| 100.00% (46) | 100.00% (46) | 100.00% (46) | 100.00% (46) | 100.00% (46) | 100.00% (46) | 100.00% (46) | |
Fig 8Modelling techniques and their MAPE distributions observed on the testing dataset.
Duncan's Multiple Range Test to assess the differences between the forecasting algorithms.
The different capital letters within each column are significantly different (P < 0.05).
| Modelling method | Basic consumption dataset | Enhanced dataset including usage patterns |
|---|---|---|
| A | A | |
| A | B | |
| B | BC | |
| BC | BCD | |
| BC | BCDE | |
| BC | BCDE | |
| BCD | BCDE | |
| BCD | CDE | |
| CD | DE | |
| D | E |
The Kolmogorov-Smirnov (K-S) test to determine the significance of the forecasting results between the basic dataset and the enhanced dataset using the neural networks method, * denotes significance at P < 0.05.
| Household | Basic consumption dataset MAPE (%) | Enhanced dataset MAPE (%) | K-S p-value | Household | Basic consumption dataset MAPE (%) | Enhanced dataset MAPE (%) | K-S p-value |
|---|---|---|---|---|---|---|---|
| 1 | 46.07 | 36.13 | 0.039 * | 24 | 29.11 | 31.89 | 0.864 |
| 2 | 52.33 | 44.56 | 0.008 * | 25 | 33.18 | 30.51 | 0.097 |
| 3 | 99.46 | 97.68 | 0 * | 26 | 35.61 | 32.80 | 0.269 |
| 4 | 21.43 | 21.28 | 0.381 | 27 | 49.08 | 45.74 | 0 * |
| 5 | 67.71 | 39.69 | 0.000 * | 28 | 39.48 | 38.04 | 0.423 |
| 6 | 59.28 | 56.43 | 0.026 * | 29 | 37.98 | 35.13 | 0.114 |
| 7 | 37.21 | 32.15 | 0.269 | 30 | 44.62 | 45.13 | 0.269 |
| 8 | 45.20 | 39.14 | 0.207 | 31 | 60.89 | 50.11 | 0.304 |
| 9 | 45.96 | 45.65 | 0.511 | 32 | 46.18 | 46.10 | 0.651 |
| 10 | 40.13 | 39.24 | 0.010 * | 33 | 50.08 | 45.45 | 0.180 |
| 11 | 67.01 | 63.61 | 0.097 | 34 | 48.89 | 46.55 | 0.155 |
| 12 | 30.00 | 28.61 | 0.341 | 35 | 42.95 | 33.53 | 0 * |
| 13 | 48.33 | 45.27 | 0.010 * | 36 | 40.78 | 38.28 | 0.269 |
| 14 | 45.61 | 34.34 | 0.001 * | 37 | 29.74 | 26.71 | 0.01 * |
| 15 | 43.29 | 39.54 | 0.03 * | 38 | 42.08 | 37.00 | 0.047 * |
| 16 | 57.77 | 55.06 | 0.03 * | 39 | 45.54 | 70.94 | 0.864 |
| 17 | 50.16 | 40.31 | 0 * | 40 | 65.02 | 55.70 | 0 * |
| 18 | 31.67 | 27.67 | 0.001 * | 41 | 51.30 | 45.85 | 0.003 * |
| 19 | 41.24 | 42.89 | 0.097 | 42 | 38.69 | 36.28 | 0.207 |
| 20 | 38.62 | 37.49 | 0.697 | 43 | 39.54 | 27.72 | 0 * |
| 21 | 49.86 | 43.26 | 0.026 * | 44 | 27.60 | 25.13 | 0 * |
| 22 | 54.68 | 50.08 | 0.014 * | 45 | 46.61 | 42.56 | 0.133 |
| 23 | 38.51 | 34.87 | 0.017 * | 46 | 37.91 | 34.30 | 0.068 |