| Literature DB >> 35161762 |
Igor Kotenko1, Konstantin Izrailov1,2, Mikhail Buinevich3.
Abstract
Ensuring the security of modern cyberphysical devices is the most important task of the modern world. The reason for this is that such devices can cause not only informational, but also physical damage. One of the approaches to solving the problem is the static analysis of the machine code of the firmware of such devices. The situation becomes more complicated in the case of a Smart Home, since its devices can have different processor architectures (means instruction sets). In the case of cyberphysical devices of the Smart Home, the destruction of machine code due to physical influences is also possible. Therefore, the first step is to correctly identify the processor architecture. In the interests of this, a machine code model is proposed that has a formal notation and takes into account the possibility of code destruction. The article describes the full cycle of research (including experiment) in order to obtain this model. The model is based on byte-frequency machine code signatures. The experiment resulted in obtaining template signatures for the Top-16 processor architectures: Alpha, X32, Amd64, Arm64, Hppa64, I486, I686, Ia64, Mips, Mips64, Ppc, Ppc64, RiscV64, S390, S390x and Sparc64.Entities:
Keywords: byte-frequency allocation; code destruction; forensics; processor identification; signature
Year: 2022 PMID: 35161762 PMCID: PMC8840385 DOI: 10.3390/s22031017
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Ontological model of the domain.
Size of all executable files for each Top-16 Architecture for the CPDoSH.
| No. | Architecture | Image File Name | Machine Code Size | ELF Header | |
|---|---|---|---|---|---|
| Bytes | Megabytes | ||||
| #1 | Alpha | stage3-alpha-20200215T160133Z.tar.bz2 | 339.283.340 | 324 | 0x9026 |
| #2 | X32 | stage3-x32-20210516T214503Z.tar.xz | 222.718.092 | 212 | 0x3e |
| #3 | Amd64 | stage3-amd64-20210516T214503Z.tar.xz | 203.487.272 | 194 | 0x3e |
| #4 | Arm64 | stage3-arm64-20210323T005051Z.tar.xz | 179.237.608 | 171 | 0xb7 |
| #5 | Hppa64 | stage3-hppa2.0-20200319T011207Z.tar.bz2 | 303.729.040 | 290 | 0xf |
| #6 | I486 | stage3-i486-20210517T214503Z.tar.xz | 196.319.128 | 187 | 0x3 |
| #7 | I686 | stage3-i686-20210517T214503Z.tar.xz | 194.963.232 | 186 | 0x3 |
| #8 | Ia64 | stage3-ia64-20210519T033521Z.tar.xz | 224.309.636 | 214 | 0x32 |
| #9 | Mips | stage3-mips32r2-20140316.tar.bz2 | 164.983.958 | 157 | 0x8 |
| #10 | Mips64 | stage3-mips64r2_multilib-20140904.tar.bz2 | 210.324.411 | 201 | 0x8 |
| #11 | Ppc | stage3-ppc-20210516T102000Z.tar.xz | 187.421.256 | 179 | 0x14 |
| #12 | Ppc64 | stage3-ppc64-20210516T102000Z.tar.xz | 192.572.304 | 184 | 0x15 |
| #13 | RiscV64 | stage3-rv64_lp64-20210509T171126Z.tar.xz | 184.422.060 | 176 | 0xf3 |
| #14 | S390 | stage3-s390-20200531T164023Z.tar.xz | 189.763.308 | 181 | 0x16 |
| #15 | S390x | stage3-s390x-20200531T164536Z.tar.xz | 191.553.620 | 183 | 0x16 |
| #16 | Sparc64 | stage3-sparc64-20210420T135502Z.tar.xz | 164.675.652 | 157 | 0x2b |
For ease of reading, thousandths of MC sizes are separated by a dot.
Figure 2Byte-frequency signatures of MC classes for Top-16 CPDoSH Architectures (for Alpha, X32, Amd64, Arm64, Hppa64, I486, I686 and Ia64).
Figure 3Byte-frequency signatures of MC classes for Top-16 CPDoSH Architectures (for Mips, Mips64, Ppc, Ppc64, RiscV64, S390, S390x and Sparc64).
Comparative analysis of the proposed Model with existing analogues.
| Analytical Modeling of MC | Criteria | Score | |||||
|---|---|---|---|---|---|---|---|
| A | B | C | D | E | F | ||
| 1. Malware detection by data mining techniques based on positionally | +/− | + | +/− | +/− | + | +/− | 4 |
| 2. Fine-Grained Compiler Identification With Sequence-Oriented | + | + | − | + | + | +/− | 4.5 |
| 3. Function Identification in Android Binaries With Deep Learning [ | − | + | − | +/− | + | + | 3.5 |
| 4. Polymorphic Malware Detection Using Hierarchical Hidden | +/− | − | + | − | − | − | 1.5 |
| 5. Feature Extraction Method for Cross-Architecture Binary | + | +/− | +/− | − | +/− | +/− | 3 |
| 6. Vision-Based Malware Detection: A Transfer Learning Approach Using | − | + | − | − | − | +/− | 1.5 |
| 7. Cross-Architecture Intemet-of-Things Malware Detection Based on Graph | + | + | +/− | + | +/− | − | 4 |
| 8. Asteria: Deep Learning-based AST-Encoding for Cross-platform Binary Code | + | − | +/− | +/− | +/− | − | 2.5 |
| Proposed Model | + | + | + | + | + | + | 6 |
The following designations and points were used: “+”—full compliance with the criterion (1 point); “+/−”—partial compliance with the criterion (0.5 points); “−”—failure to meet the criterion. Criterion (0 points).