| Literature DB >> 24489498 |
Amir H Gandomi1, Mark M Fridline2, David A Roke1.
Abstract
In the current study, the performances of some decision tree (DT) techniques are evaluated for postearthquake soil liquefaction assessment. A database containing 620 records of seismic parameters and soil properties is used in this study. Three decision tree techniques are used here in two different ways, considering statistical and engineering points of view, to develop decision rules. The DT results are compared to the logistic regression (LR) model. The results of this study indicate that the DTs not only successfully predict liquefaction but they can also outperform the LR model. The best DT models are interpreted and evaluated based on an engineering point of view.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24489498 PMCID: PMC3893014 DOI: 10.1155/2013/346285
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Different liquefaction models proposed in the literature.
| Type of technique | Reference | Year | Soil test | Classification technique | Number of records |
|---|---|---|---|---|---|
| Soft computing | [ | 1994 | SPT | ANN | 85 |
| [ | 1996 | CPT | ANN | 109 | |
| [ | 1998 | SPT | ANN | 105 | |
| [ | 2002 | CPT | ANN | 170 | |
| [ | 2006 | CPT and SPT | SVM | 109 and 85 | |
| [ | 2007 | CPT and SPT | SVM | 170 and 105 | |
| [ | 2007 | SPT | ANN | 620 | |
| [ | 2007 | CPT | SVM | 226 | |
| [ | 2009 | CPT | ANN | 226 | |
| [ | 2011 | CPT | GP | 170 | |
| [ | 2011 | SPT | GP and ANN | 569 | |
| [ | 2012 | CPT | GP | 170 | |
| [ | 2013 | CPT | GP | 170 | |
| [ | 2013 | CPT | GP | 170 | |
|
| |||||
| Statistical | [ | 2006 | CPT | LR | 396 |
| [ | 2008 | CPT | DT | 178 | |
Descriptive statistics of variables used in the model development.
| Predictor | Minimum | Mean | Maximum | Standard deviation |
|---|---|---|---|---|
|
| 0.8 | 7.66 | 19.8 | 4.90 |
| ( | 1 | 14.48 | 75 | 11.39 |
|
| 1 | 62.99 | 100 | 34.28 |
|
| 0.35 | 1.45 | 10 | 1.20 |
|
| 12.1 | 144.60 | 408.9 | 98.20 |
|
| 7.5 | 82.48 | 233.7 | 52.84 |
|
| 0 | 0.07 | 0.85 | 0.07 |
|
| 0.12 | 0.37 | 0.77 | 0.15 |
|
| 37 | 166.98 | 500 | 67.09 |
|
| 23.46 | 31.96 | 52.08 | 4.85 |
|
| 7.4 | 7.49 | 7.6 | 0.10 |
| PHA | 0.18 | 0.38 | 0.67 | 0.15 |
Correlation matrix.
|
| ( |
|
|
|
|
|
|
|
|
| PHA | Liq. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 1 | 0.39 | −0.25 | 0.08 | 1.00 | 0.98 | −0.23 | −0.12 | 0.58 | 0.53 | 0.49 | −0.11 | −0.30 |
| ( | 0.39 | 1 | −0.57 | 0.08 | 0.41 | 0.42 | 0.03 | 0.15 | 0.40 | 0.85 | 0.15 | 0.17 | −0.27 |
|
| −0.25 | −0.57 | 1 | −0.18 | −0.27 | −0.30 | −0.12 | −0.07 | −0.30 | −0.54 | −0.28 | −0.16 | −0.06 |
|
| 0.08 | 0.08 | −0.18 | 1 | 0.07 | 0.21 | 0.14 | −0.27 | 0.16 | 0.13 | 0.21 | −0.02 | −0.13 |
|
| 1.00 | 0.41 | −0.27 | 0.07 | 1 | 0.99 | −0.22 | −0.09 | 0.59 | 0.55 | 0.50 | −0.07 | −0.29 |
|
| 0.98 | 0.42 | −0.30 | 0.21 | 0.99 | 1 | −0.19 | −0.10 | 0.61 | 0.56 | 0.55 | −0.03 | −0.28 |
|
| −0.23 | 0.03 | −0.12 | 0.14 | −0.22 | −0.19 | 1 | −0.06 | 0.44 | −0.02 | 0.00 | 0.11 | −0.08 |
|
| −0.12 | 0.15 | −0.07 | −0.27 | −0.09 | −0.10 | −0.06 | 1 | −0.03 | 0.09 | −0.20 | 0.90 | 0.25 |
|
| 0.58 | 0.40 | −0.30 | 0.16 | 0.59 | 0.61 | 0.44 | −0.03 | 1 | 0.47 | 0.32 | 0.04 | −0.21 |
|
| 0.53 | 0.85 | −0.54 | 0.13 | 0.55 | 0.56 | −0.02 | 0.09 | 0.47 | 1 | 0.29 | 0.12 | −0.35 |
|
| 0.49 | 0.15 | −0.28 | 0.21 | 0.50 | 0.55 | 0.00 | −0.20 | 0.32 | 0.29 | 1 | −0.11 | −0.15 |
| PHA | −0.11 | 0.17 | −0.16 | −0.02 | −0.07 | −0.03 | 0.11 | 0.90 | 0.04 | 0.12 | −0.11 | 1 | 0.19 |
| Liq. | −0.30 | −0.27 | −0.06 | −0.13 | −0.29 | −0.28 | −0.08 | 0.25 | −0.21 | −0.35 | −0.15 | 0.19 | 1 |
Figure 1Importance of independent variables.
Risk estimation of the DT and LR models.
| Model | Testing | Training | All |
|---|---|---|---|
| CHAID | 0.234 |
|
|
| E-CHAID | 0.234 | 0.161 | 0.176 |
| CART | 0.194 | 0.163 | 0.169 |
| CHAID-SPT |
| 0.188 | 0.184 |
| E-CHAID-SPT | 0.177 | 0.161 | 0.165 |
| CART-SPT | 0.185 | 0.165 | 0.169 |
| LR | 0.258 | 0.270 | 0.268 |
aBold sets are the lowest risk values for each set.
Sensitivity and specificity of the DT and LR models.
| Model | Testing | Training | All | |||
|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Sensitivity | Specificity | Sensitivity | Specificity | |
| CHAID | 0.622 | 0.848 |
| 0.898 | 0.773 | 0.887 |
| E-CHAID | 0.644 | 0.835 | 0.754 | 0.902 | 0.734 | 0.887 |
| CART | 0.667 | 0.886 | 0.739 | 0.909 | 0.727 | 0.904 |
| CHAID-SPT |
| 0.810 | 0.763 | 0.849 | 0.781 | 0.841 |
| E-CHAID-SPT | 0.711 | 0.886 | 0.768 | 0.891 | 0.758 | 0.890 |
| CART-SPT | 0.800 | 0.823 | 0.796 | 0.863 |
| 0.854 |
| LR | 0.822 | 0.696 | 0.673 | 0.772 | 0.699 | 0.755 |
aBold sets are the highest sensitivity values for each set.
Figure 2Gain plot of the CHAID and CHAID-SPT models.
Figure 3Final CHAID-SPT tree.
Figure 4Final CHAID tree.