| Literature DB >> 26339227 |
Abstract
Among the recent data mining techniques available, the boosting approach has attracted a great deal of attention because of its effective learning algorithm and strong boundaries in terms of its generalization performance. However, the boosting approach has yet to be used in regression problems within the construction domain, including cost estimations, but has been actively utilized in other domains. Therefore, a boosting regression tree (BRT) is applied to cost estimations at the early stage of a construction project to examine the applicability of the boosting approach to a regression problem within the construction domain. To evaluate the performance of the BRT model, its performance was compared with that of a neural network (NN) model, which has been proven to have a high performance in cost estimation domains. The BRT model has shown results similar to those of NN model using 234 actual cost datasets of a building construction project. In addition, the BRT model can provide additional information such as the importance plot and structure model, which can support estimators in comprehending the decision making process. Consequently, the boosting approach has potential applicability in preliminary cost estimations in a building construction project.Entities:
Mesh:
Year: 2015 PMID: 26339227 PMCID: PMC4538588 DOI: 10.1155/2015/149702
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Schematic of a boosting procedure.
Figure 2Gradient boosted decision tree ensemble.
Figure 3Training results of BRT.
Factors in construction cost estimation.
| Description | Min. | Max | Average | Remark |
|---|---|---|---|---|
| Input | ||||
| Budget | (1) BTL | Nominal | ||
| (2) National finance | ||||
| School levels | (1) Elementary | Nominal | ||
| (2) Middle | ||||
| (3) High | ||||
| Land acquisition | (1) Existing | Nominal | ||
| (2) Building lots | ||||
| (3) Green belts | ||||
| Class number | 12 | 48 | 31 | Numerical |
| Building area (m2) | 1,204 | 3,863 | 2,694 | Numerical |
| Gross floor area (m2) | 4,925 | 12,710 | 9,656 | Numerical |
| Storey | 3 | 7 | 4.7 | Numerical |
| Basement floor (storey) | 0 | 2 | 0.5 | Numerical |
| Floor Height (m) | 3.3 | 3.6 | 3.5 | Numerical |
| Output | ||||
| Total construction cost | 4,334,369 | 14,344,867 | 8,288,008 | Numerical |
Figure 4Fragment of cost dataset.
Figure 5Parameter setting for BRT.
Summary of results by estimation model.
| Error rate (%) | NN | BRT | ||
|---|---|---|---|---|
| Fre. (%) | Cum. (%) | Fre. (%) | Cum. (%) | |
| 0.0–2.5 | 3 (10.0) | 3 (10.0) | 6 (20.0) | 6 (20.0) |
| 2.5–5.0 | 11 (36.7) | 14 (46.7) | 10 (33.3) | 16 (53.3) |
| 5.0–7.5 | 6 (20.0) | 20 (66.7) | 6 (20.0) | 22 (73.3) |
| 7.5–10.0 | 8 (26.7) | 28 (93.3) | 2 (6.7) | 24 (80.0) |
| 10.0–12.5 | 1 (3.3) | 29 (96.7) | 3 (10.0) | 27 (90.0) |
| 12.5–15.0 | 1 (3.3) | 30 (100) | 2 (6.7) | 29 (96.7) |
| 15.0–17.5 | 0 (0) | 30 (100) | 1 (3.3) | 30 (100) |
|
| ||||
| MAERs | 6.05 | — | 5.80 | — |
Cost estimation results of each test set.
| Number |
Historical cost | Neural networks | Boosting regression tree | ||
|---|---|---|---|---|---|
| Predicted cost | Error rate (%) | Predicted cost | Error rate (%) | ||
| 1 | 6,809,450 | 7,704,034 | 13.14 | 7,206,795 | 5.84 |
| 2 | 9,351,716 | 10,015,906 | 7.10 | 9,805,656 | 4.85 |
| 3 | 6,656,230 | 7,251,317 | 8.94 | 6,322,112 | 5.02 |
| 4 | 7,119,470 | 7,128,513 | 0.13 | 7,418,373 | 4.20 |
| 5 | 7,304,747 | 7,978,990 | 9.23 | 7,349,178 | 0.61 |
| 6 | 9,729,392 | 9,516,946 | 2.18 | 9,259,162 | 4.83 |
| 7 | 10,801,826 | 9,817,999 | 9.11 | 9,682,119 | 10.37 |
| 8 | 7,944,318 | 7,246,763 | 8.78 | 7,136,773 | 10.17 |
| 9 | 10,879,004 | 10,136,431 | 6.83 | 10,572,777 | 2.81 |
| 10 | 7,552,814 | 7,764,300 | 2.80 | 7,683,295 | 1.73 |
| 11 | 8,845,099 | 8,558,536 | 3.24 | 8,370,497 | 5.37 |
| 12 | 10,690,800 | 10,001,503 | 6.45 | 10,015,284 | 6.32 |
| 13 | 8,694,721 | 8,258,452 | 5.02 | 8,446,796 | 2.85 |
| 14 | 6,582,636 | 6,810,406 | 3.46 | 6,954,507 | 5.65 |
| 15 | 7,583,680 | 8,312,216 | 9.61 | 8,194,292 | 8.05 |
| 16 | 7,099,220 | 7,955,966 | 12.07 | 8,292,381 | 16.81 |
| 17 | 8,145,147 | 8,604,444 | 5.64 | 8,522,009 | 4.63 |
| 18 | 8,652,810 | 7,853,765 | 9.23 | 8,270,169 | 4.42 |
| 19 | 10,527,278 | 10,040,039 | 4.63 | 9,611,194 | 8.70 |
| 20 | 6,679,924 | 6,467,344 | 3.18 | 7,397,923 | 10.75 |
| 21 | 8,383,830 | 9,203,887 | 9.78 | 8,487,286 | 1.23 |
| 22 | 7,298,932 | 8,018,225 | 9.85 | 8,294,895 | 13.65 |
| 23 | 7,505,428 | 7,749,053 | 3.25 | 7,967,265 | 6.15 |
| 24 | 7,710,921 | 7,622,053 | 1.15 | 7,795,563 | 1.10 |
| 25 | 6,196,652 | 6,503,022 | 4.94 | 5,940,634 | 4.13 |
| 26 | 8,897,861 | 8,554,455 | 3.86 | 8,714,123 | 2.06 |
| 27 | 7,840,787 | 8,535,617 | 8.86 | 8,863,975 | 13.05 |
| 28 | 8,023,067 | 7,666,898 | 4.44 | 6,900,068 | 14.00 |
| 29 | 7,495,213 | 7,270,806 | 2.99 | 7,695,613 | 2.67 |
| 30 | 7,653,005 | 8,003,292 | 4.58 | 7,775,139 | 1.60 |
|
| |||||
| MAERs | 6.05 | 5.80 | |||
Descriptive analysis of error rate estimation.
| MAERs | Std, deviation | Std, error |
95% confidence interval | ||
|---|---|---|---|---|---|
| Lower | Upper | ||||
| NN | 6.045 | 3.192 | 0.583 | 2.542 | 4.291 |
| BRT | 5.800 | 3.980 | 0.727 | 3.170 | 5.351 |
Figure 6Importance plot of dependent variables.
Figure 7An example of structure model.