| Literature DB >> 34199274 |
Alexandre Augusto Giron1,2, Jean Everson Martina2, Ricardo Custódio2.
Abstract
Steganography is one of the ways to hide data between parties. Its use can be worrisome, e.g., to hide illegal communications. Researchers found that public blockchains can be an attractive place to hide communications; however, there is not much evidence of actual use in blockchains. Besides, previous work showed a lack of steganalysis methods for blockchains. In this context, we present a steganalysis approach for blockchains, evaluating it in Bitcoin and Ethereum, both popular cryptocurrencies. The main objective is to answer if one can find steganography in real case scenarios, focusing on LSB of addresses and nonces. Our sequential analysis included 253 GiB and 107 GiB of bitcoin and ethereum, respectively. We also analyzed up to 98 million bitcoin clusters. We found that bitcoin clusters could carry up to 360 KiB of hidden data if used for such a purpose. We have not found any concrete evidence of hidden data in the blockchains. The sequential analysis may not capture the perspective of the users of the blockchain network. In this case, we recommend clustering analysis, but it depends on the clustering method's accuracy. Steganalysis is an essential aspect of blockchain security.Entities:
Keywords: bitcoin; blockchain; ethereum; steganalysis; steganography
Mesh:
Year: 2021 PMID: 34199274 PMCID: PMC8231769 DOI: 10.3390/s21124078
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Examples of data insertion methods in popular blockchains.
| Example | Type/Application | Public? | Data Insertion Methods |
|---|---|---|---|
| Bitcoin | Cryptocurrency | Yes | Coinbase, OP_Return, Standard |
| Ethereum | Cryptocurrency, | Yes | Smart Contract state stored in a |
| HyperLedger | Blockchain Platform | No | Private data collection: inserts |
| NEO | Cryptocurrency, | Yes | NeoFS stores data off-chain |
| Helium | Cryptocurrency | Yes | State-Channels for off-chain |
Figure 1A classification of the blockchain steganography approaches in the literature.
Summary of data hiding approaches for blockchains accordingly to the cover medium.
| Transaction | Cryptographic | Address | |
|---|---|---|---|
| Data | Basuki and Rosiyadi, | Partala, 2018. [ | Fionov, 2019 [ |
Figure 2Steganalysis approach proposed for blockchains, instantiated with Bitcoin and Ethereum.
Summary of the datasets manipulated in each experiment.
| Blockchain | Experiment | Dataset | Chunk/Cluster | Quantity of |
|---|---|---|---|---|
| Bitcoin | Sequential | 253.38 GiB, | Variable | 23 chunks |
| Clustering | Blocksci: 3.33 GiB | Variable | 5763155 | |
| Clustering | Blocksci: 28 GiB | Variable | 98266113 | |
| Ethereum | Sequential | ~107 GiB, | 0.5 M nonces, | 119 chunks |
Figure 3Comparison of nonces of bitcoin (left) and ethereum (right) from the first chunk of parsed blockchain data. AM is the Arithmetic Mean of each nonce.
Figure 4Comparison of LSBytes of nonces of bitcoin (left) and ethereum (right), considering the first and the last chunk of parsed blockchain data.
Results of the statistical analysis of the chunk number 14.
| Address Data | Dataset | Entropy | Arithmetic | Monobit Failures |
|---|---|---|---|---|
| (Synthetic) | 1 Gbit | 7.999998 | 127.5034 | ~0.012% |
| 14.2 KiB | 7.966716 | 119.6772 | 100.00% | |
| 10.2 KiB KiB | 1.392768 | 98.0966 | 100% | |
| P2PKH LSB | 10.5 MiB | 7.982699 | 128.2420 | 27.87% |
| P2SH LSB | 625.8 KiB | 7.899532 | 137.4060 | 94.80% |
| (Synthetic) | 1 Gbit | 7.999999 | 127.5016 | ~0.068% |
| Ethereum EOA LSB | 12.5 KiB | 7.472802 | 107.0722 | 100% |
Summary of the results of the clustering analysis.
| Clusterer | Average Cluster | Max. LSB Data | Statistics |
|---|---|---|---|
| Blocksci LSB (1) | 721.56 bytes | 26 KiB | Entropy: 7.993215 |
| Blocksci LSB (2) | 584.92 bytes | 360 KiB | Entropy: 7.999426 |
Summary of the file carving results by analysis type, considering the extracted LSB of addresses.
| Analysis Type | Total Extracted | Scalpel | False Positives? |
|---|---|---|---|
| Sequential | 161.54 MiB | RPM, PGP | All |
| Sequential | 34.52 MiB | FWS, MPG, PGP | All |
| Clustering | 1.45 MiB | None | N/A |
| Clustering | 288.9 KiB | None | N/A |