| Literature DB >> 35957249 |
Mohammed M Alani1,2, Ali Miri1.
Abstract
As IoT devices' adoption grows rapidly, security plays an important role in our daily lives. As part of the effort to counter these security threats in recent years, many IoT intrusion detection datasets were presented, such as TON_IoT, BoT-IoT, and Aposemat IoT-23. These datasets were used to build many machine learning-based IoT intrusion detection models. In this research, we present an explainable and efficient method for selecting the most effective universal features from IoT intrusion detection datasets that can help in producing highly-accurate and efficient machine learning-based intrusion detection systems. The proposed method was applied to TON_IoT, Aposemat IoT-23, and IoT-ID datasets and resulted in the selection of six universal network-flow features. The proposed method was tested and produced a high accuracy of 99.62% with a prediction time reduced by up to 70%. To provide better insight into the operation of the classifier, a Shapley additive explanation was used to explain the selected features and to prove the alignment of the explanation with current attack techniques.Entities:
Keywords: IoT; dataset; intrusion detection; machine-learning; security
Mesh:
Year: 2022 PMID: 35957249 PMCID: PMC9371123 DOI: 10.3390/s22155690
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Internet-Connected IoT Devices Growth [1].
Traffic included in the TON_IoT network-based train-test dataset.
| Traffic Category | Number of Packets |
|---|---|
| Benign | 300,000 |
| Backdoors | 20,000 |
| DoS | 20,000 |
| DDoS | 20,000 |
| Injection | 20,000 |
| Password | 20,000 |
| Ransomware | 20,000 |
| Scanning | 20,000 |
| Cross-Site Scripting | 20,000 |
| MITM | 1043 |
Figure 2Score change with change in number of features.
Classifiers’ Performance Before and After Feature Selection.
| 37 Features | 6 Features | |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||
| RF | 0.9992 | 0.9992 | 0.0007 | 0.0006 | 0.9966 | 0.9966 | 0.0025 | 0.0042 |
| LR | 0.6502 | 0.3946 | 0.0000 | 0.0001 | 0.6500 | 0.3940 | 0.0000 | 1.0000 |
| DT | 0.9989 | 0.9988 | 0.0009 | 0.0013 | 0.9962 | 0.9962 | 0.0027 | 0.0046 |
| GNB | 0.6501 | 0.3947 | 0.0003 | 1.0000 | 0.4246 | 0.3381 | 0.8700 | 0.0300 |
Figure 3Confusion Matrix Plot of DT Classifier with 6 Features.
Results of 10-fold Cross-Validation Using DT Classifier.
| Fold | Accuracy | Precision | Recall | |
|---|---|---|---|---|
| 1 | 0.997289 | 0.995432 | 0.996848 | 0.996139 |
| 2 | 0.997245 | 0.995867 | 0.996297 | 0.996082 |
| 3 | 0.996920 | 0.995136 | 0.996005 | 0.995570 |
| 4 | 0.996681 | 0.995882 | 0.994579 | 0.995230 |
| 5 | 0.997245 | 0.995903 | 0.996212 | 0.996057 |
| 6 | 0.996855 | 0.995465 | 0.995402 | 0.995434 |
| 7 | 0.996877 | 0.995075 | 0.995944 | 0.995509 |
| 8 | 0.997397 | 0.995525 | 0.997013 | 0.996268 |
| 9 | 0.996942 | 0.995100 | 0.996259 | 0.995679 |
| 10 | 0.997180 | 0.995502 | 0.996484 | 0.995993 |
|
| 0.997063 | 0.995489 | 0.996104 | 0.995796 |
|
| 0.000224 | 0.000304 | 0.000667 | 0.000336 |
Figure 4Confusion Matrix Plotfor Testing with IoT-ID Dataset.
Figure 5Confusion Matrix Plot for Testing with Aposemat IoT-23 Dataset.
Figure 6SHAP Values Summary Plot for the Selected Features.
Timing parameters of machine-learning models.
| 37 Features | 6 Features | |||
|---|---|---|---|---|
| RF | 23.6616 | 8.7462 | 10.2907 | 6.5594 |
| LR | 1.3017 | 1.4834 | 0.5439 | 0.4346 |
| DT | 1.6723 | 1.4851 | 0.3339 | 0.4549 |
| GNB | 0.6859 | 2.0967 | 0.2129 | 0.5418 |
Figure 7Change in training time after feature reduction.
Figure 8Change in testing time after feature reduction.
Comparison of proposed system with related works.
| Paper | Dataset | Features | Classifier | Accuracy (%) | Training T (s) | Testing T (s) |
|---|---|---|---|---|---|---|
| [ | IoT-BoT | 16 | JRip | 99.992 | 80.94 | - |
| [ | TON_IoT | 44 | CART | 88 | 6.308 | 0.022 |
| RF | 85 | 10.884 | 0.164 | |||
| KNN | 84 | 58.018 | 109.361 | |||
| LSTM | 81 | 1596 | 9.023 | |||
| [ | TON_IoT | 13 | ANN | 84.39 | - | - |
| and | GBM | 99.897 | - | - | ||
| Aposemat | RF | 99.931 | - | - | ||
| IoT-23 | MLP | 99.022 | - | - | ||
| [ | TON_IoT | 43 | Extra Tree | 97.86% | - | 8.93 |
| Our work | TON_IoT | 6 |
|
| 0.3339 |
|
| IoT-ID | 6 | DT | 99.63 | 0.3528 | 0.4663 | |
| Aposemat-IoT-23 | 6 | DT | 99.61 | 0.2973 | 0.4682 |