| Literature DB >> 34367876 |
Andrius Grybauskas1, Vaida Pilinkienė1, Alina Stundžienė1.
Abstract
As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in prior studies have lacked consensus, especially regarding the time-on-the-market variable, which exhibits an omnidirectional effect. However, with the rise of Big Data, this study used a web-scraping algorithm and collected a total of 18,992 property listings in the city of Vilnius during the first wave of the COVID-19 pandemic. Afterwards, 15 different machine learning models were applied to forecast apartment revisions, and the SHAP values for interpretability were used. The findings in this study coincide with the previous literature results, affirming that real estate is quite resilient to pandemics, as the price drops were not as dramatic as first believed. Out of the 15 different models tested, extreme gradient boosting was the most accurate, although the difference was negligible. The retrieved SHAP values conclude that the time-on-the-market variable was by far the most dominant and consistent variable for price revision forecasting. Additionally, the time-on-the-market variable exhibited an inverse U-shaped behaviour.Entities:
Keywords: Apartments; Big data; Machine learning; Pandemics; Real estate; TOM
Year: 2021 PMID: 34367876 PMCID: PMC8329615 DOI: 10.1186/s40537-021-00476-0
Source DB: PubMed Journal: J Big Data ISSN: 2196-1115
Fig. 1Research framework
Sample descriptive statistics for month August rent operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 1434 | 2.036960 | 0.917779 | 1.000000 | 6.0000 | 3.037466 |
| Sq.m | 1434 | 53.40817 | 29.878975 | 1.000000 | 330.00 | 4.514540 |
| Apartment floor | 1434 | 3.157601 | 1.921412 | 1.000000 | 9.0000 | 1.221069 |
| Number of floors in the building | 1434 | 4.905858 | 2.179056 | 1.000000 | 9.0000 | 1.295373 |
| Year | 1434 | 1986.223 | 40.109960 | 1092.0000 | 2020 | 1.129623 |
| Distance to shop | 1434 | 304.5955 | 348.687315 | 10.000000 | 8100 | 2.593864 |
| Distance to school | 1434 | 365.4741 | 363.647421 | 10.000000 | 5300 | 2.524234 |
| Distance to kinder | 1434 | 331.6806 | 325.059069 | 10.000000 | 5300 | 1.959995 |
| Built_type | 1434 | 0.122734 | 0.016241 | 0.083333 | 0.3333 | 1.023045 |
| Heating | 1434 | 0.122905 | 0.038970 | 0.000000 | 0.4666 | 1.010538 |
| Time on the market (TOM) | 1434 | 24.04184 | 25.421190 | 6.000000 | 175.0 | 1.039517 |
| Initial listing price | 1434 | 554.7672 | 375.138502 | 58.350000 | 3800 | 3.376621 |
| If located at city center | 1434 | 0.374477 | 0.484156 | 0.000000 | 1.000 | 1.409588 |
| If price change occurred | 1434 | 0.122734 | 0.328246 | 0.000000 | 1.0000 |
Sample descriptive statistics for month July rent operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 1474 | 2.035278 | 0.884415 | 1.0000 | 6.000000 | 2.819089 |
| Sq.m | 1474 | 53.59462 | 27.96496 | 8.0000 | 300.0000 | 4.012874 |
| Apartment floor | 1474 | 3.150611 | 1.930314 | 1.0000 | 9.000000 | 1.218704 |
| Number of floors in the building | 1474 | 4.954545 | 2.240156 | 1.0000 | 9.000000 | 1.282661 |
| Year | 1474 | 1988.978 | 28.75488 | 1850 | 2020 | 1.221040 |
| Distance to shop | 1474 | 293.2360 | 294.6002 | 10.0000 | 5300.0000 | 2.125102 |
| Distance to school | 1474 | 363.7516 | 362.1593 | 10.0000 | 4600.0000 | 2.067408 |
| Distance to kinder | 1474 | 330.9497 | 309.8768 | 20.0000 | 3400.0000 | 1.983364 |
| Built_type | 1474 | 0.162254 | 0.025613 | 0.1554 | 0.357143 | 1.025131 |
| Heating | 1474 | 0.162254 | 0.033542 | 0.0000 | 0.500000 | 1.023148 |
| Time on the market (TOM) | 1474 | 21.29036 | 27.06033 | 0.0000 | 178.0000 | 1.056918 |
| Initial listing price | 1474 | 535.6358 | 344.3829 | 95.000 | 3800.000 | 3.165490 |
| If located at city center | 1474 | 0.375170 | 0.484331 | 0.0000 | 1.000000 | 1.421617 |
| If price change occurred | 1474 | 0.162144 | 0.368708 | 0.0000 | 1.000000 |
Sample descriptive statistics for month June rent operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 1799 | 2.016676 | 0.88035 | 1.000000 | 6.000000 | 2.794912 |
| Sq.m | 1799 | 52.51318 | 27.8767 | 1.000000 | 300.000 | 3.501264 |
| Apartment floor | 1799 | 3.067815 | 1.7935 | 1.000000 | 9.000000 | 1.145140 |
| Number of floors in the building | 1799 | 4.801556 | 2.1573 | 1.000000 | 9.000000 | 1.198230 |
| Year | 1799 | 1985.669 | 38.7276 | 1521.00 | 2102.000 | 1.185635 |
| Distance to shop | 1799 | 298.4991 | 290.181 | 10.000000 | 5200.0000 | 1.906072 |
| Distance to school | 1799 | 371.4508 | 371.727 | 10.000000 | 4600.0000 | 1.944286 |
| Distance to kinder | 1799 | 328.0322 | 311.322 | 10.000000 | 4300.0000 | 1.645901 |
| Heating | 1799 | 0.186525 | 0.026446 | 0.000000 | 0.333333 | 1.022265 |
| Time on the market (TOM) | 1799 | 20.2779 | 26.6619 | 0.000000 | 176.0000 | 1.069220 |
| Initial listing price | 1799 | 518.7050 | 318.1839 | 95.000000 | 3000.000 | 2.857612 |
| If located at city center | 1799 | 0.375208 | 0.484311 | 0.000000 | 1.000000 | 1.395546 |
| If price change occurred | 1799 | 0.186215 | 0.389388 | 0.000000 | 1.000000 |
Sample descriptive statistics for month May rent operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 1799 | 1.986103 | 0.961625 | 1.000000 | 15.000 | 2.05927 |
| Sq.m | 1799 | 50.862696 | 25.88830 | 1.000000 | 196.000 | 2.35334 |
| Apartment floor | 1799 | 3.092273 | 1.820456 | 1.000000 | 9.000 | 1.11439 |
| Number of floors in the building | 1799 | 4.787104 | 2.161121 | 1.000000 | 9.000 | 1.14999 |
| TOM | 1799 | 21.105058 | 23.07058 | 1.000000 | 161.000 | 1.06832 |
| Time on the market (TOM) | 1799 | 519.745728 | 316.5270 | 89.0340 | 2500.0 | 2.55914 |
| Initial listing price | 1799 | 0.407449 | 0.491496 | 0.000000 | 1.0000 | 1.20428 |
| If located at city center | 1799 | 0.220122 | 0.414444 | 0.000000 | 1.00000 | 2.05927 |
Sample descriptive statistics for month August sell operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 3036 | 2.500000 | 1.076788 | 1.000000 | 20.00 | 3.2699 |
| Sq.m | 3036 | 63.12611 | 34.971339 | 11.34000 | 670.0 | 5.6493 |
| Apartment floor | 3036 | 3.14822 | 1.968558 | 1.000000 | 9.00 | 1.2531 |
| Number of floors in the building | 3036 | 4.97924 | 2.245383 | 1.000000 | 9.000 | 1.3894 |
| Year | 3036 | 1996.91 | 34.568066 | 1019.000 | 2021.0 | 1.3681 |
| Distance to shop | 3036 | 376.874 | 378.27094 | 10.00000 | 6000 | 2.3123 |
| Distance to school | 3036 | 455.737 | 429.79288 | 10.00000 | 4700 | 2.8402 |
| Distance to kinder | 3036 | 368.695 | 369.82084 | 10.00000 | 4900 | 2.2769 |
| Built_type | 3036 | 0.09947 | 0.030916 | 0.016129 | 0.222 | 1.0961 |
| Furnish | 3036 | 1.73583 | 0.531121 | 1.000000 | 4.000 | 1.2020 |
| Heating | 3036 | 0.09960 | 0.034632 | 0.000000 | 0.500 | 1.0803 |
| Time on the market (TOM) | 3036 | 45.0303 | 55.466930 | 2.000000 | 360.0 | 1.0184 |
| Initial listing price | 3036 | 136,281 | 118,123 | 5.900000e + 03 | 1,600,000 | 3.1374 |
| If located at city center | 3036 | 0.2249 | 0.417629 | 0.000000 | 1.000 | 1.3206 |
| If price change occurred | 3036 | 0.0994 | 0.299345 | 0.000000 | 1.000 |
Sample descriptive statistics for month July sell operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 3136 | 2.534439 | 1.037154 | 1.000000 | 15.0000 | 1.757259 |
| Sq.m | 3136 | 64.382672 | 47.938239 | 11.340000 | 1985 | 1.664433 |
| Apartment floor | 3136 | 3.164222 | 2.016394 | 1.000000 | 9.0000 | 1.285848 |
| Number of floors in the building | 3136 | 4.977679 | 2.270007 | 1.000000 | 9.000 | 1.427927 |
| Year | 3136 | 1997.441 | 34.164736 | 1061.000000 | 2021.000 | 1.423255 |
| Distance to shop | 3136 | 384.0082 | 399.657316 | 10.000000 | 6100.000 | 2.461288 |
| Distance to school | 3136 | 479.9489 | 462.888853 | 10.000000 | 6000.000 | 2.676897 |
| Distance to kinder | 3136 | 391.6422 | 426.655344 | 10.000000 | 6200.000 | 2.189394 |
| Built_type | 3136 | 0.100446 | 0.037032 | 0.029412 | 0.280000 | 1.210398 |
| Furnish | 3136 | 1.742666 | 0.609686 | 1.000000 | 4.000000 | 1.242468 |
| Heating | 3136 | 0.100510 | 0.047111 | 0.000000 | 0.500000 | 1.071377 |
| Time on the market (TOM) | 3136 | 43.45727 | 52.962261 | 1.000000 | 358.0000 | 1.041306 |
| Initial listing price | 3136 | 137,066 | 118,475 | 5.950000e + 03 | 1,600,000 | 1.967865 |
| If located at city center | 3136 | 0.221620 | 0.415403 | 0.000000 | 1.000000 | 1.344862 |
| If price change occurred | 3136 | 0.100446 | 0.300642 | 0.000000 | 1.000000 |
Sample descriptive statistics for month June sell operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 3335 | 2.517241 | 1.083325 | 1.0000 | 20.00 | 3.191328 |
| Sq.m | 3335 | 63.437358 | 35.274419 | 10.000 | 680 | 5.531747 |
| Apartment floor | 3335 | 3.184408 | 2.036823 | 1.0000 | 9 | 1.251125 |
| Number of floors in the building | 3335 | 4.930135 | 2.282712 | 1.0000 | 9.00 | 1.385276 |
| Year | 3335 | 1996 | 29.746126 | 1520 | 2021 | 1.536711 |
| Distance to shop | 3335 | 377 | 369.776639 | 10.000 | 5400 | 2.218792 |
| Distance to school | 3335 | 475.80 | 444.692904 | 10.000 | 6000 | 2.572198 |
| Distance to kinder | 3335 | 387.9460 | 398.456617 | 10.000 | 6200 | 1.907936 |
| Furnish | 3335 | 1.753523 | 0.608231 | 1.000 | 4.0000 | 1.284736 |
| Heating | 3335 | 0.111344 | 0.040510 | 0.0000 | 0.600 | 1.093226 |
| Time on the market (TOM) | 3335 | 39.294 | 50.558899 | 0.0000 | 354.00 | 1.045139 |
| Initial listing price | 3335 | 133,794 | 1.167724e + 05 | 5630 | 1,600,000 | 3.050077 |
| If located at city center | 3335 | 0.220390 | 0.414572 | 0.0000 | 1.00 | 1.379555 |
| If price change occurred | 3335 | 0.111244 | 0.314482 | 0.0000 | 1.00 |
Sample descriptive statistics for month May sell operations
| N | Mean | Std | Min | Max | VIF | |
|---|---|---|---|---|---|---|
| Number of rooms | 2979 | 2.478684 | 1.062739 | 1.000000 | 20.000 | 1.821010 |
| Sq.m | 2979 | 56.853595 | 28.135194 | 10.000000 | 200.0 | 1.433096 |
| Apartment floor | 2979 | 3.148708 | 1.984081 | 1.000000 | 9.000 | 1.199743 |
| Number of floors in the building | 2979 | 4.896274 | 2.232382 | 1.000000 | 9.000 | 1.227095 |
| Time on the market (TOM) | 2979 | 31.515609 | 32.932334 | 0.000000 | 319.00 | 1.005587 |
| Initial listing price | 2979 | 134,638 | 120,524 | 7770 | 1,849,000 | 1.798703 |
| If located at city center | 2979 | 0.207452 | 0.405550 | 0.000000 | 1.000 | 1.193170 |
| If price change occurred | 2979 | 0.119168 | 0.324040 | 0.000000 | 1.0000 | 1.821010 |
Fig. 2Vilnius city vacancy maps
Machine learning model results
| Models | 4 Months average in rent and sell operations | |||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | AUC | “Recall” | Prec | F1 | Kappa | MCC | ||
| 0.782 | ||||||||
| 2 | CatBoost classifier | 0.857 | 0.784 | 0.324 | 0.477 | 0.378 | 0.300 | 0.311 |
| 3 | Light gradient boosting machine | 0.835 | 0.769 | 0.320 | 0.452 | 0.372 | 0.295 | 0.302 |
| 4 | Random forest classifier | 0.851 | 0.779 | 0.323 | 0.453 | 0.370 | 0.289 | 0.297 |
| 5 | Ridge classifier | 0.725 | 0.000 | 0.584 | 0.279 | 0.367 | 0.223 | 0.252 |
| 6 | Linear discriminant analysis | 0.725 | 0.735 | 0.582 | 0.279 | 0.367 | 0.223 | 0.251 |
| 7 | Gradient boosting classifier | 0.834 | 0.767 | 0.351 | 0.385 | 0.364 | 0.268 | 0.270 |
| 8 | Logistic regression | 0.723 | 0.715 | 0.563 | 0.272 | 0.359 | 0.213 | 0.238 |
| 9 | Extra trees classifier | 0.851 | 0.780 | 0.305 | 0.455 | 0.358 | 0.278 | 0.287 |
| 10 | Ada boost classifier | 0.791 | 0.737 | 0.421 | 0.311 | 0.356 | 0.233 | 0.237 |
| 11 | Decision tree classifier | 0.787 | 0.627 | 0.408 | 0.303 | 0.346 | 0.221 | 0.225 |
| 12 | Naive Bayes | 0.520 | 0.666 | 0.692 | 0.198 | 0.293 | 0.101 | 0.139 |
| 13 | K neighbors Classifier | 0.671 | 0.614 | 0.478 | 0.209 | 0.288 | 0.116 | 0.133 |
| 14 | Quadratic discriminant analysis | 0.451 | 0.464 | 0.713 | 0.186 | 0.277 | 0.076 | 0.097 |
| 15 | SVM—linear kernel | 0.485 | 0.000 | 0.638 | 0.200 | 0.237 | 0.076 | 0.103 |
Bold values indicate the most consistent machine learning algorithm
Average feature importance of the variables according to SHAP values
| Variable | Sell | Rent | ||||||
|---|---|---|---|---|---|---|---|---|
| May | June | July | August | May | June | July | August | |
| Rooms | 1.15580 | 0.49947 | 0.54986 | 0.55172 | 0.42647 | 0.35931 | 0.32831 | 0.46382 |
| Sq_m_ | 1.60400 | 0.79154 | 0.77398 | 0.87373 | 0.44485 | 0.34983 | 0.21439 | 0.32318 |
| Floor | 0.52938 | 0.40655 | 0.37032 | 0.36983 | 0.17232 | 0.20210 | 0.20735 | 0.26256 |
| Nr_Floors | 0.65013 | 0.38285 | 0.46130 | 0.44349 | 0.18105 | 0.16328 | 0.35156 | 0.22225 |
| Int_prices | 2.28727 | 1.27575 | 1.30110 | 1.41305 | 1.26008 | 0.71070 | 0.62880 | 0.47666 |
| Center | 0.37411 | 0.30217 | 0.31365 | 0.37559 | 0.37768 | 0.41106 | 0.29141 | 0.31965 |
| Year | – | 1.54729 | 1.55409 | 1.48564 | – | 0.30344 | 0.29409 | 0.49686 |
| Shop | – | 0.76183 | 0.74752 | 0.94674 | – | 0.32995 | 0.38609 | 0.38636 |
| School | – | 0.74277 | 0.65204 | 0.65167 | – | 0.25219 | 0.31284 | 0.29746 |
| Kinder | – | 0.87993 | 0.75086 | 0.74460 | – | 0.28952 | 0.33553 | 0.37077 |
| Furnish | – | 0.20347 | 0.15533 | 0.16374 | – | – | – | – |
| Heating | – | 0.72057 | 0.66822 | 0.47049 | – | 0.40635 | 0.28597 | 0.47998 |
| Built_type | – | – | 1.13302 | 1.00620 | – | – | 0.57337 | 0.47470 |
Bold values indicate the most dominant variable in predicting price change
Fig. 3The TOM variable effect