| Literature DB >> 35161815 |
Abhilash Singh1, J Amutha2, Jaiprakash Nagar3, Sandeep Sharma4, Cheng-Chi Lee5,6.
Abstract
The dramatic increase in the computational facilities integrated with the explainable machine learning algorithms allows us to do fast intrusion detection and prevention at border areas using Wireless Sensor Networks (WSNs). This study proposed a novel approach to accurately predict the number of barriers required for fast intrusion detection and prevention. To do so, we extracted four features through Monte Carlo simulation: area of the Region of Interest (RoI), sensing range of the sensors, transmission range of the sensor, and the number of sensors. We evaluated feature importance and feature sensitivity to measure the relevancy and riskiness of the selected features. We applied log transformation and feature scaling on the feature set and trained the tuned Support Vector Regression (SVR) model (i.e., LT-FS-SVR model). We found that the model accurately predicts the number of barriers with a correlation coefficient (R) = 0.98, Root Mean Square Error (RMSE) = 6.47, and bias = 12.35. For a fair evaluation, we compared the performance of the proposed approach with the benchmark algorithms, namely, Gaussian Process Regression (GPR), Generalised Regression Neural Network (GRNN), Artificial Neural Network (ANN), and Random Forest (RF). We found that the proposed model outperforms all the benchmark algorithms.Entities:
Keywords: WSNs; feature learning; intrusion detection; machine learning; support vector regression
Mesh:
Year: 2022 PMID: 35161815 PMCID: PMC8838871 DOI: 10.3390/s22031070
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Illustration of 3-barrier coverage for each intrusion path.
Simulation parameters.
| Parameters | Values |
|---|---|
| Simulator | NS-2.35 |
| Network region | Rectangular RoI |
| Network area ( | 100 × 50 to 250 × 200 |
| Number of sensors ( | 100 to 400 |
| Sensing range ( | 15 to 40 |
| Transmission range ( | 30 to 80 |
| Sensor’s deployment type | Uniform distribution |
| Sensing model | Binary sensing model |
Figure 2Illustration of the support vector regression with all input features and the corresponding response variable.
Figure 3Flowchart of the proposed methodology.
Figure 4Bar graph illustrating each feature’s relative feature importance score estimated through regression tree ensemble approach.
Figure 5Feature sensitivity analysis through partial dependency plot. Two features are considered at a time (a total of six pairs from a–f). The left image shows the 2-D variation profile (with histogram) for each pair, and the right image shows the corresponding 3-D variation profile.
Figure 6(a) Linear regression curve between the predicted result of LT-ZM-SVR model and observed values. (b) Error distribution analysis through error histogram.
Comparison of the performance of Z-score scaling (i.e., LT-ZM-SVR) with other scaling methods (i.e., LT-NS-SVR, LT-CM-SVR, and LT-MM-SVR).
| Performance | LT-NS-SVR | LT-CM-SVR | LT-ZM-SVR | LT-MM-SVR |
|---|---|---|---|---|
| R | 0.96 | 0.94 | 0.98 | 0.97 |
| RMSE | 12.66 | 2.39 | 6.47 | 4.59 |
| MSE | 160.15 | 5.727 | 41.87 | 21.10 |
| Bias | 36.30 | 6.24 | 12.35 | 15.62 |
| Time (s) | 2.21 | 0.59 | 0.65 | 0.51 |
Comparison of the proposed model with the benchmark algorithms.
| Performance | Methods | ||||
|---|---|---|---|---|---|
| LT-ZM-SVR | ANN | GRNN | GPR | Random Forest | |
| R | 0.98 | 0.38 | 0.96 | 0.94 | 0.99 |
| RMSE | 6.47 | 46.37 | 57.56 | 63.83 | 32.15 |
| MSE | 41.87 | 2150.20 | 3312.00 | 4074.7 | 1033.6 |
| Bias | 12.35 | -36.12 | 49.62 | 50.96 | 28.62 |
| Time (s) | 0.65 | 1.81 | 2.02 | 1.71 | 2.70 |
Figure 7Sensitivity analysis of LT-ZM-SVR for and uncertainty in the input feature.