| Literature DB >> 33267303 |
Franco Valencia1, Alfonso Gómez-Espinosa1, Benjamín Valdés-Aguirre1.
Abstract
Cryptocurrencies are becoming increasingly relevant in the financial world and can be considered as an emerging market. The low barrier of entry and high data availability of the cryptocurrency market makes it an excellent subject of study, from which it is possible to derive insights into the behavior of markets through the application of sentiment analysis and machine learning techniques for the challenging task of stock market prediction. While there have been some previous studies, most of them have focused exclusively on the behavior of Bitcoin. In this paper, we propose the usage of common machine learning tools and available social media data for predicting the price movement of the Bitcoin, Ethereum, Ripple and Litecoin cryptocurrency market movements. We compare the utilization of neural networks (NN), support vector machines (SVM) and random forest (RF) while using elements from Twitter and market data as input features. The results show that it is possible to predict cryptocurrency markets using machine learning and sentiment analysis, where Twitter data by itself could be used to predict certain cryptocurrencies and that NN outperform the other models.Entities:
Keywords: cryptocurrencies; machine learning; price movement; sentiment analysis
Year: 2019 PMID: 33267303 PMCID: PMC7515078 DOI: 10.3390/e21060589
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Description of collected Tweets.
| Cryptocurrency | Collected Tweets | Total Percentage |
|---|---|---|
| Bitcoin | 13,096,598 | 63% |
| Ethereum | 5,366,126 | 25.81% |
| Ripple | 1,143,634 | 5.5% |
| Litecoin | 1,183,214 | 5.69% |
Price movements in days for each cryptocurrency.
| Cryptocurrency | Price Increased | Price Decreased |
|---|---|---|
| Bitcoin | 28 | 32 |
| Ethereum | 28 | 32 |
| Ripple | 23 | 37 |
| Litecoin | 29 | 31 |
Results of applying multi-layer perceptron (MLP), support vector machine (SVM) and random forest (RF) using Twitter data, market data or both for predicting daily market movements for Bitcoin.
| Model | Accuracy (95% CI) | Precision | Recall | |
|---|---|---|---|---|
| MLP Twitter | 0.39 (±0.02) | 0.38 | 0.39 | 0.38 |
| MLP Market | 0.72 (±0.03) | 0.74 | 0.72 | 0.71 |
| MLP Twitter and Market | 0.72 (±0.06) | 0.76 | 0.72 | 0.72 |
| SVM Twitter | 0.50 (±0.03) | 0.29 | 0.50 | 0.37 |
| SVM Market | 0.55 (±0.03) | 0.53 | 0.56 | 0.47 |
| SVM Twitter and Market | 0.55 (±0.03) | 0.31 | 0.56 | 0.40 |
| RF Twitter | 0.44 (±0.04) | 0.50 | 0.80 | 0.62 |
| RF Market | 0.61 (±0.04) | 0.67 | 0.25 | 0.36 |
| RF Twitter and Market | 0.44 (±0.04) | 0.28 | 0.44 | 0.34 |
| Random | 0.50 (±0.28) | 0.49 | 0.50 | 0.50 |
| Majority | 0.55 (±0.0) | 0.31 | 0.56 | 0.40 |
Results of applying MLP, SVM and RF using Twitter data, market data or both for predicting daily market movements for Ethereum.
| Model | Accuracy (95% CI) | Precision | Recall | |
|---|---|---|---|---|
| MLP Twitter | 0.39 (±0.02) | 0.44 | 0.39 | 0.38 |
| MLP Market | 0.44 (±0.02) | 0.44 | 0.39 | 0.35 |
| MLP Twitter and Market | 0.44 (±0.03) | 0.56 | 0.44 | 0.39 |
| SVM Twitter | 0.39 (±0.03) | 0.15 | 0.39 | 0.22 |
| SVM Market | 0.39 (±0.03) | 0.15 | 0.39 | 0.22 |
| SVM Twitter and Market | 0.39 (±0.03) | 0.15 | 0.39 | 0.22 |
| RF Twitter | 0.33 (±0.03) | 0.14 | 0.33 | 0.19 |
| RF Market | 0.28 (±0.03) | 0.12 | 0.28 | 0.17 |
| RF Twitter and Market | 0.39 (±0.03) | 0.15 | 0.39 | 0.22 |
| Random | 0.50 (±0.28) | 0.54 | 0.50 | 0.49 |
| Majority | 0.61 (±0.0) | 0.37 | 0.61 | 0.46 |
Results of applying MLP, SVM and RF using Twitter data, market data or both for predicting daily market movements for Ripple.
| Model | Accuracy (95% CI) | Precision | Recall | |
|---|---|---|---|---|
| MLP Twitter | 0.54 (±0.03) | 0.50 | 0.50 | 0.50 |
| MLP Market | 0.64 (±0.04) | 0.68 | 0.67 | 0.66 |
| MLP Twitter and Market | 0.56 (±0.02) | 0.56 | 0.56 | 0.55 |
| SVM Twitter | 0.53 (±0.04) | 0.60 | 0.56 | 0.50 |
| SVM Market | 0.50 (±0.04) | 0.50 | 0.50 | 0.41 |
| SVM Twitter and Market | 0.50 (±0.04) | 0.25 | 0.50 | 0.33 |
| RF Twitter | 0.39 (±0.03) | 0.39 | 0.39 | 0.39 |
| RF Market | 0.50 (±0.03) | 0.50 | 0.50 | 0.41 |
| RF Twitter and Market | 0.44 (±0.03) | 0.44 | 0.44 | 0.44 |
| Random | 0.50 (±0.28) | 0.50 | 0.50 | 0.49 |
| Majority | 0.50 (±0.0) | 0.25 | 0.50 | 0.33 |
Results of applying MLP, SVM and RF using Twitter data, market data or both for predicting daily market movements for Litecoin.
| Model | Accuracy (95% CI) | Precision | Recall | |
|---|---|---|---|---|
| MLP Twitter | 0.59 (±0.05) | 0.61 | 0.61 | 0.61 |
| MLP Market | 0.61 (±0.04) | 0.78 | 0.61 | 0.54 |
| MLP Twitter and Market | 0.61 (±0.04) | 0.62 | 0.61 | 0.60 |
| SVM Twitter | 0.52 (±0.04) | 0.50 | 0.50 | 0.41 |
| SVM Market | 0.52 (±0.04) | 0.25 | 0.50 | 0.33 |
| SVM Twitter and Market | 0.66 (±0.04) | 0.80 | 0.67 | 0.62 |
| RF Twitter | 0.50 (±0.03) | 0.50 | 0.50 | 0.49 |
| RF Market | 0.50 (±0.03) | 0.50 | 0.50 | 0.49 |
| RF Twitter and Market | 0.61 (±0.03) | 0.66 | 0.61 | 0.58 |
| Random | 0.50 (±0.28) | 0.50 | 0.50 | 0.50 |
| Majority | 0.50 (±0.0) | 0.25 | 0.50 | 0.33 |