| Literature DB >> 35336390 |
Hyeongki Ahn1, Sangkyeum Kim1, Kyunghyun Lee1, Ahyeong Choi1, Kwanho You1,2.
Abstract
The discrimination between earthquakes and artificial explosions is a significant issue in seismic analysis to efficiently prevent and respond to seismic events. However, the discrimination of seismic events is challenging due to the low incidence rate. Moreover, the similarity between earthquakes and artificial explosions with a local magnitude derives a nonlinear data distribution. To improve the discrimination accuracy, this paper proposes machine-learning-based seismic discrimination methods-support vector machine, naive Bayes, and logistic regression. Furthermore, to overcome the nonlinear separation problem, the kernel functions and regularized logistic regression are applied to design seismic classifiers. To efficiently design the classifier, P- and S-wave amplitude ratios on the time domain and spectral ratios on the frequency domain, which is converted by fast Fourier transform and short-time Fourier transform are selected as feature vectors. Furthermore, an adaptive synthetic sampling algorithm is adopted to enhance the classifier performance against the seismic data imbalance issue caused by the non-equivalent number of occurrences. The comparisons among classifiers are evaluated by the binary classification performance analysis methods.Entities:
Keywords: artificial explosion; oversampling method; seismic discrimination; supervised machine learning
Year: 2022 PMID: 35336390 PMCID: PMC8948764 DOI: 10.3390/s22062219
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Peak amplitudes of P- and S-waves.
Figure 2A flowchart of the proposed seismic discrimination.
Figure 3The FFT and STFT results for the seismic signals in Figure 1.
Figure 4Test dataset and classification results of the ADAYSN-based machine-learning methods.
Numerical parameters for the classification models.
| Parameter | Method | Value |
|---|---|---|
|
| RLR | 0.01 |
|
| ADASYN | 5 |
|
| SVM with RBF | 0.3 |
|
|
| 1 |
|
|
| 5 |
|
|
| 6 |
|
|
| 10 |
|
| SVM | 0.7 |
The confusion matrix for binary classification.
| Confusion Matrix | Actual Class | ||
|---|---|---|---|
| Positive | Negative | ||
| Hypothesized class | Positive | TP | FP |
| Negative | FN | TN | |
Performance comparison of SVM with different kernel functions, NB, LR, and RLR trained based on an imbalanced seismic dataset.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
|
| ||||
|
| 0.9615 | 1.0000 | 0.9615 | 0.8462 | 0.9231 |
|
| 0.8125 | 0.5000 | 0.8750 | 0.9375 | 0.8750 |
|
| 0.8870 | 0.7500 | 0.9183 | 0.8918 | 0.8990 |
|
| 0.9259 | 0.8667 | 0.9434 | 0.8980 | 0.9231 |
|
| 0.9048 | 0.8095 | 0.9286 | 0.8810 | 0.9048 |
|
| 0.7974 | 0.6183 | 0.8478 | 0.7646 | 0.7981 |
|
| 0.7740 | 0.5000 | 0.8365 | 0.7837 | 0.7891 |
|
| 0.9266 | 0.8745 | 0.9336 | 0.8896 | 0.9231 |
Performance comparison of ADASYN-based SVM with different kernel functions, NB, LR, and RLR.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
|
| ||||
|
| 0.9615 | 1.0000 | 0.8846 | 0.4615 | 0.9615 |
|
| 0.8125 | 0.8750 | 0.9375 | 1.0000 | 0.8750 |
|
| 0.8870 | 0.9375 | 0.9111 | 0.7308 | 0.9183 |
|
| 0.9259 | 0.9630 | 0.9200 | 0.6316 | 0.9434 |
|
| 0.9048 | 0.9524 | 0.9048 | 0.6667 | 0.9286 |
|
| 0.7974 | 0.9014 | 0.8067 | 0.4961 | 0.8478 |
|
| 0.7740 | 0.8750 | 0.8221 | 0.4615 | 0.8365 |
|
| 0.8686 | 0.9354 | 0.8839 | 0.7184 | 0.9037 |
Figure 5ROC curve of machine-learning methods and ADASYN-based machine-learning methods.