| Literature DB >> 36262124 |
İsmail Atacak1, Kazım Kılıç1, İbrahim Alper Doğru1.
Abstract
Background: Android is the most widely used operating system all over the world. Due to its open nature, the Android operating system has become the target of malicious coders. Ensuring privacy and security is of great importance to Android users.Entities:
Keywords: Convolutional neural network; Fuzzy logic; Malware detection; Mobile security; Permission
Year: 2022 PMID: 36262124 PMCID: PMC9575934 DOI: 10.7717/peerj-cs.1092
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Number of samples and sources used in the study.
| Class | Number of samples | Source | |
|---|---|---|---|
| First dataset | Malware | 250 | Drebin |
| Benign | 250 | Google Play Store | |
| Second dataset | Malware | 250 | CICMalDroid |
| Benign | 250 | Google Play Store |
Figure 1Architecture of the proposed method.
The feature extractor layers of the proposed model.
| Layer | Number of Kernels | Size of Kernel/ number of neuron | Stride | Hyperparameters | Activation |
|---|---|---|---|---|---|
| Conv2d_1 | 3 | 7 × 1 | 1 | RandomUniform Min:0–Max:1 | ReLU |
| Maxpooling2d_1 | 2 × 1 | 1 | |||
| Conv2d_2 | 1 | 5 × 1 | 1 | RandomUniform Min:0–Max:1 | ReLU |
| Maxpooling2d_2 | 2 × 1 | 2 | |||
| Flatten | – | – | – | – | |
| Dense | 5 | – | RandomUniform Min:0–Max:1 | ReLU |
Figure 2ANFIS architecture.
Figure 3The results of the proposed model for different membership functions.
Classification results.
Bold text shows the best results.
| Model | Accuracy | Precision | Recall | AUC | |
|---|---|---|---|---|---|
| LDA | 0.89 | 0.8933 | 0.8933 | 0.8933 | 0.8928 |
| SVM | 0.86 | 0.8776 | 0.8667 | 0.8665 | 0.8714 |
| Gaussian Naive Bayes | 0.59 | 0.6903 | 0.5867 | 0.4955 | 0.5589 |
| ExtraTreesClassifier | 0.89 | 0.8933 | 0.8933 | 0.8933 | 0,8928 |
| Decision Tree | 0.89 | 0.8941 | 0.8933 | 0.8931 | 0.8910 |
| KNN | 0.85 | 0.8677 | 0.8533 | 0.8530 | 0.8589 |
| Xgboost | 0.92 | 0.9200 | 0.9200 | 0.9200 | 0.9196 |
| ANFIS | 0,90 | 0,9098 | 0,9067 | 0,9068 | 0.9071 |
| Proposed Method |
|
|
|
|
|
Classification report of the proposed method.
|
|
|
| ||
|---|---|---|---|---|
| Proposed Method | 0 | 0.8919 | 0.9429 | 0.9167 |
| 1 | 0.9474 | 0.9000 | 0.9231 | |
| Macro avg. | 0.9196 | 0.9214 | 0.9199 | |
| Weighted avg. | 0.9215 | 0.9200 | 0.9201 |
Figure 4ROC Curve of proposed method.
Figure 5The results of the proposed model for different membership functions.
Classification results.
Bold text shows the best results.
| Model | Accuracy | Precision | Recall | AUC | |
|---|---|---|---|---|---|
| LDA | 0.7733 | 0.7739 | 0.7733 | 0.7735 | 0. 7732 |
| SVM | 0.8933 | 0.9132 | 0.8933 | 0.8929 | 0.9 |
| Gaussian Naive Bayes | 0.84 | 0.8681 | 0.8400 | 0.8386 | 0.8482 |
| ExtraTreesClassifier | 0.8533 | 0.8538 | 0.8533 | 0.8534 | 0,8535 |
| Decision Tree | 0.8267 | 0.8279 | 0.8267 | 0.8259 | 0. 8232 |
| KNN | 0.8667 | 0.8857 | 0.8667 | 0.8661 | 0.8732 |
| Xgboost | 0.8400 | 0. 8404 | 0. 8400 | 0.8396 | 0.8375 |
| ANFIS | 0,9333 | 0,9409 | 0,9333 | 0,9326 | 0.9431 |
| Proposed Method |
|
|
|
|
|
Classification report of the proposed method.
|
|
|
| ||
|---|---|---|---|---|
| Proposed Method | 0 | 0.9268 | 0.9744 | 0.9500 |
| 1 | 0.9706 | 0.9167 | 0.9429 | |
| Macro avg. | 0.9487 | 0.9455 | 0.9464 | |
| Weighted avg. | 0.9478 | 0.9467 | 0.9466 |
Figure 6ROC Curve of proposed method.
Fuzzy logic based studies.
| Author | Feature extraction | Feature selection | Classification model | Classification result |
|---|---|---|---|---|
| Juliza Muhamad Arif | Permission | İnformation Gain | Fuzzy AHP | %90.54 Acc |
| Altaher | Permission | İnformation Gain | EHNFC | %90 Acc |
| Afifi et al. | Network traffic | ClassifierSubsetEval | ANFIS + PSO | RMSE 0.4113 |
| Altaher & Barukap | Permission | İnformation Gain | FCM-ANFIS | %91 Acc |
| Abdulla & Altaher | Permission | İnformation Gain | k-ANFIS | %75 Acc |
| Proposed Method (First dataset) | Using Convolution layers from permission information | – | ANFIS | %92 Acc |
| Proposed Method (Second dataset) | Using Convolution layers from permission information | – | ANFIS | %94.66 Acc |
Machine learning based studies.
| Author | Classification model | Classification result |
|---|---|---|
|
| DNN | %98.16 |
|
| CNN(Efficient-B4) | %95.7 |
|
| CNN | %92 |
|
| Lineer Regression | %95,6 |
|
| LSTM | %93.7 |
|
| Naive Bayes | %91,1 |
|
| KNN | %91,95 |
| Proposed Method (First dataset) | ANFIS | %92 |
| Proposed Method (Second dataset) | ANFIS | %94,66 |