| Literature DB >> 34723128 |
Daniele Ballinari1, Simon Behrendt2.
Abstract
Given the increasing interest in and the growing number of publicly available methods to estimate investor sentiment from social media platforms, researchers and practitioners alike are facing one crucial question - which is best to gauge investor sentiment? We compare the performance of daily investor sentiment measures estimated from Twitter and StockTwits short messages by publicly available dictionary and machine learning based methods for a large sample of stocks. To determine their relevance for financial applications, these investor sentiment measures are compared by their effects on the cross-section of stocks (i) within a Fama and MacBeth (J Polit Econ 81:607-636, 1973) regression framework applied to a measure of retail investors' order imbalances and (ii) by their ability to forecast abnormal returns in a model-free portfolio sorting exercise. Interestingly, we find that investor sentiment measures based on finance-specific dictionaries do not only have a greater impact on retail investors' order imbalances than measures based on machine learning approaches, but also perform very well compared to the latter in our asset pricing application.Entities:
Keywords: Investor sentiment; Order imbalances; Portfolio returns; StockTwits; Twitter
Year: 2021 PMID: 34723128 PMCID: PMC8550489 DOI: 10.1007/s42521-021-00038-2
Source DB: PubMed Journal: Digit Finance ISSN: 2524-6186
Overview of investor sentiment estimation techniques
| Panel A: dictionary based approaches | |
|---|---|
| Dictionary | Selection of studies using the dictionary |
| Harvard-IV |
Tetlock ( |
| LM |
Loughran and McDonald ( |
| VADER |
Audrino et al. ( |
| L1 and L2 |
Renault ( |
Note: This table lists the dictionaries and machine learning approaches used in this study to estimate investor sentiment from short messages published on Twitter and StockTwits. For each approach, we report a selection of studies that make use of the respective dictionary or machine learning approach
Comparison of shared terms among dictionaries
| Harvard-IV | LM | L1 | L2 | VADER | |
|---|---|---|---|---|---|
| Harvard-IV | 3642 | 597 | 312 | 235 | 1291 |
| (100%) | (97.3%) | (75.6%) | (93.2%) | (97.7%) | |
| LM | 2709 | 191 | 217 | 870 | |
| (100%) | (96.9%) | (99.5%) | (97.6%) | ||
| L1 | 8000 | 805 | 418 | ||
| (100%) | (99.9%) | (81.6%) | |||
| L2 | 1311 | 373 | |||
| (100%) | (97.3%) | ||||
| VADER | 7517 | ||||
| (100%) |
Note: This table summarizes commonalities and differences among five publicly available sentiment dictionaries, i.e., Harvard-IV, LM (Loughran and McDonald 2011), the two dictionaries introduced by Renault (2017) (L1 and L2), and VADER (Hutto and Gilbert 2014). More precisely, the table shows the number of common terms occurring in two dictionaries. The diagonal elements report the total number of words in each dictionary and the off-diagonal elements report the number of shared terms between two dictionaries. The share of words having the same sentiment connotation (positive or negative) in both dictionaries is reported in parentheses below the respective number of common terms
Correlations of daily bullish sentiment across sentiment measures
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | 1.00 | 0.25 | 0.20 | 0.26 | 0.37 | 0.11 | 0.10 | 0.06 | 0.18 |
| LM (2) | 1.00 | 0.20 | 0.31 | 0.39 | 0.17 | 0.17 | 0.20 | 0.22 | |
| L1 (3) | 1.00 | 0.50 | 0.25 | 0.30 | 0.31 | 0.05 | 0.38 | ||
| L2 (4) | 1.00 | 0.33 | 0.34 | 0.34 | 0.11 | 0.41 | |||
| VADER (5) | 1.00 | 0.24 | 0.21 | 0.19 | 0.33 | ||||
| Naive Bayes (6) | 1.00 | 0.74 | 0.20 | 0.45 | |||||
| Max. entropy (7) | 1.00 | 0.16 | 0.45 | ||||||
| Deep-MLSA (8) | 1.00 | 0.15 | |||||||
| DeepMoji (9) | 1.00 | ||||||||
This table reports correlations between daily bullish sentiment scores estimated from short messages published on Twitter and StockTwits based on different approaches (dictionary based and machine learning techniques). Panel A and B report correlations between daily bullish sentiment scores estimated from Twitter and StockTwits short messages, respectively. Panel C reports correlations between daily bullish sentiment scores estimated from Twitter short messages with those estimated from StockTwits short messages
Summary statistics for daily bullish sentiment
| Panel A: Twitter | ||||||||
|---|---|---|---|---|---|---|---|---|
| Median | Mean | |||||||
| Harvard-IV | −1.61 | −0.69 | 0.00 | 0.56 | 0.51 | 1.10 | 1.61 | 2.56 |
| LM | −2.14 | −1.10 | −0.41 | 0.00 | 0.05 | 0.69 | 1.10 | 2.14 |
| L1 | −1.59 | −0.69 | 0.00 | 0.51 | 0.49 | 1.10 | 1.56 | 2.48 |
| L2 | −1.79 | −0.69 | 0.00 | 0.41 | 0.41 | 1.10 | 1.61 | 2.56 |
| VADER | −1.25 | 0.00 | 0.26 | 0.86 | 0.88 | 1.42 | 1.98 | 2.94 |
| Naive Bayes | −1.20 | −0.19 | 0.41 | 0.88 | 0.85 | 1.39 | 1.83 | 2.74 |
| Max. entropy | −1.39 | −0.41 | 0.12 | 0.69 | 0.62 | 1.10 | 1.61 | 2.48 |
| Deep-MLSA | −1.95 | −0.69 | 0.00 | 0.00 | 0.15 | 0.69 | 1.10 | 2.30 |
| DeepMoji | −1.10 | 0.00 | 0.58 | 1.10 | 1.06 | 1.61 | 2.08 | 3.00 |
Note: This table reports summary statistics for daily bullish sentiment constructed from Twitter (Panel A) and StockTwits (Panel B) data. More precisely, for each of the nine sentiment estimation approaches, the table reports the 1, 10, 25, 50, 75, 90, and 99%-quantiles as well as the mean daily bullish sentiment
Correlations of daily bullish sentiment across sentiment measures (unique cashtags)
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | 1.00 | 0.26 | 0.24 | 0.29 | 0.41 | 0.12 | 0.11 | 0.12 | 0.23 |
| LM (2) | 1.00 | 0.21 | 0.31 | 0.33 | 0.11 | 0.12 | 0.20 | 0.17 | |
| L1 (3) | 1.00 | 0.53 | 0.30 | 0.35 | 0.34 | 0.11 | 0.45 | ||
| L2 (4) | 1.00 | 0.32 | 0.38 | 0.38 | 0.16 | 0.47 | |||
| VADER (5) | 1.00 | 0.20 | 0.18 | 0.16 | 0.30 | ||||
| Naive Bayes (6) | 1.00 | 0.73 | 0.08 | 0.46 | |||||
| Max. entropy (7) | 1.00 | 0.08 | 0.45 | ||||||
| Deep-MLSA (8) | 1.00 | 0.10 | |||||||
| DeepMoji (9) | 1.00 | ||||||||
Note: This table reports correlations between daily bullish sentiment scores estimated from short messages that contain a unique cashtag published on Twitter and StockTwits based on different approaches (dictionary-based and machine learning techniques). Panel A and B report correlations between daily bullish sentiment scores estimated from Twitter and StockTwits short messages, respectively. Panel C reports correlations between daily bullish sentiment scores estimated from Twitter short messages with those estimated from StockTwits short messages
Fama-MacBeth (1973) regression coefficients for daily bullish sentiment
| Panel A: Twitter | ||||
|---|---|---|---|---|
| Harvard-IV | 0.0005 | |||
| (2.02) | (2.30) | (2.27) | (0.77) | |
| LM | ||||
| (4.30) | (3.90) | (3.50) | (3.60) | |
| L1 | 0.0013 | 0.0003 | 0.0002 | |
| (2.43) | (1.88) | (0.37) | (0.33) | |
| L2 | ||||
| (4.94) | (5.69) | (3.01) | (2.33) | |
| VADER | 0.0004 | 0.0008 | 0.0007 | |
| (2.00) | (0.60) | (1.17) | (0.92) | |
| Naive Bayes | ||||
| (4.84) | (4.85) | (2.46) | (2.55) | |
| Max. entropy | ||||
| (3.85) | (4.93) | (2.73) | (1.97) | |
| Deep-MLSA | 0.0007 | 0.0006 | 0.0008 | 0.0004 |
| (1.11) | (0.86) | (1.20) | (0.59) | |
| DeepMoji | 0.0011 | 0.0004 | 0.0006 | |
| (2.16) | (1.65) | (0.50) | (0.75) | |
Note: The table reports average cross-sectional regression coefficients (see Fama and MacBeth 1973) for daily bullish investor sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B). The rows refer to the respective investor sentiment measure. The columns represent the dependent variable, being the h-day ahead retail investors’ order imbalance, for . Newey-West (1987) standard errors are used to construct t-statistics, which are reported in parentheses below the respective coefficient estimate. All covariates are standardized such that the reported parameters can be interpreted as the effect of a one standard deviation change in the respective variable
Differences in Fama-MacBeth (1973) regression coefficients for daily bullish sentiment
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | −0.32 | −0.06 | −1.62 | 0.71 | −0.15 | |||
| LM (2) | – | 1.53 | −0.54 | −0.54 | 0.01 | 1.78 | |||
| L1 (3) | – | 0.25 | −1.39 | 1.08 | 0.19 | ||||
| L2 (4) | – | −0.11 | 0.50 | ||||||
| VADER (5) | – | −1.59 | 0.82 | −0.10 | |||||
| Naive Bayes (6) | – | 1.01 | |||||||
| Max. entropy (7) | – | 1.88 | |||||||
| Deep-MLSA (8) | – | −0.85 | |||||||
| DeepMoji (9) | – | ||||||||
Note: The table reports t-statistics for the difference in the average regression coefficients for daily bullish investor sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B). The dependent variable of the cross-sectional regressions is the 1-day ahead retail investors’ order imbalance. More precisely, we take the difference between coefficients obtained from methodologies reported in the rows with those reported in columns. The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers
Fama-MacBeth (1973) regression coefficients for daily bullish sentiment (unique cashtags)
| Panel A: Twitter | ||||
|---|---|---|---|---|
| Harvard-IV | 0.0010 | 0.0004 | 0.0003 | 0.0002 |
| (1.49) | (0.56) | (0.51) | (0.32) | |
| LM | 0.0012 | 0.0011 | ||
| (4.22) | (2.06) | (1.68) | (1.53) | |
| L1 | 0.0014 | 0.0005 | −0.0003 | 0.0003 |
| (1.83) | (0.63) | (−0.41) | (0.47) | |
| L2 | ||||
| (3.62) | (3.18) | (2.55) | (2.26) | |
| VADER | 0.0011 | 0.0007 | 0.0011 | 0.0007 |
| (1.50) | (1.00) | (1.53) | (0.99) | |
| Naive Bayes | 0.0006 | 0.0015 | ||
| (4.29) | (2.76) | (0.71) | (1.77) | |
| Max. entropy | 0.0006 | 0.0006 | ||
| (3.31) | (2.01) | (0.78) | (0.76) | |
| Deep-MLSA | −0.0004 | −0.0006 | −0.0003 | −0.0008 |
| (−0.68) | (−0.87) | (−0.41) | (−1.30) | |
| DeepMoji | 0.0013 | 0.0001 | −0.0007 | −0.0002 |
| (1.75) | (0.17) | (−0.73) | (−0.26) | |
Note: The table reports average cross-sectional regression coefficients (see Fama and MacBeth 1973) for daily bullish investor sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B) mentioning a unique cashtag. The rows refer to the respective investor sentiment measure. The columns represent the dependent variable, being the h-day ahead retail investors’ order imbalance, for . Newey-West (1987) standard errors are used to construct t-statistics, which are reported in parentheses below the respective coefficient estimate. All covariates are standardized such that the reported parameters can be interpreted as the effect of a one standard deviation change in that variable
Differences in Fama-MacBeth (1973) regression coefficients for daily bullish sentiment (unique cashtags)
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | -0.39 | −1.92 | −0.06 | −1.62 | 1.67 | −0.28 | ||
| LM (2) | – | 1.73 | 0.16 | −0.25 | 0.39 | 1.75 | |||
| L1 (3) | – | −1.92 | 0.30 | −1.36 | 0.10 | ||||
| L2 (4) | – | 1.92 | −0.45 | 0.27 | 1.94 | ||||
| VADER (5) | – | −1.51 | 1.74 | −0.23 | |||||
| Naive Bayes (6) | – | 1.01 | |||||||
| Max. entropy (7) | – | 1.52 | |||||||
| Deep-MLSA (8) | – | −1.86 | |||||||
| DeepMoji (9) | – | ||||||||
Note: The table reports t-statistics for the difference in the average regression coefficients for daily bullish investor sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B) with unique cashtags. The dependent variable of the cross-sectional regressions is the one-day ahead retail investors’ order imbalance. More precisely, we take the difference between coefficients obtained from methodologies reported in the rows with those reported in columns. The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers
Annualized portfolio returns based on daily bullish sentiment
| Panel A: Twitter | ||||||
|---|---|---|---|---|---|---|
| Raw return | Risk-adjusted return | |||||
| Short | Long | Long-Short | Short | Long | Long-Short | |
| Harvard-IV | −0.92 | 0.88 | 0.13 | −0.90 | ||
| (3.18) | (2.88) | (−0.50) | (0.61) | (0.09) | (−0.51) | |
| LM | 1.69 | −1.58 | −0.06 | 1.37 | ||
| (2.37) | (3.11) | (0.82) | (−1.10) | (−0.05) | (0.70) | |
| L1 | 1.29 | −0.61 | 1.01 | 1.47 | ||
| (2.68) | (3.36) | (0.65) | (−0.38) | (0.67) | (0.77) | |
| L2 | 3.62 | −2.65 | 1.35 | 3.84 | ||
| (2.17) | (3.28) | (1.66) | (−1.59) | (0.87) | (1.89) | |
| VADER | 0.07 | 1.09 | 0.99 | −0.26 | ||
| (2.95) | (3.02) | (0.04) | (0.77) | (0.65) | (−0.14) | |
| Naive Bayes | −0.41 | 0.71 | 0.80 | −0.05 | ||
| (2.90) | (3.16) | (−0.22) | (0.50) | (0.51) | (−0.03) | |
| Max. entropy | −0.41 | −0.41 | −0.32 | −0.06 | ||
| (2.72) | (2.82) | (−0.21) | (−0.28) | (−0.20) | (−0.03) | |
| Deep-MLSA | 2.31 | −1.34 | 0.93 | 2.12 | ||
| (2.59) | (3.12) | (1.23) | (−0.97) | (0.74) | (1.16) | |
| DeepMoji | 0.81 | 0.33 | 1.15 | 0.66 | ||
| (2.98) | (3.22) | (0.43) | (0.24) | (0.78) | (0.35) | |
Note: The table depicts the annualized raw and risk-adjusted returns (in %) of three portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B). The first portfolio (Short) contains the stocks for which the estimated online investor sentiment on the previous (trading) day is smaller than the 10% cross-sectional quantile. Conversely, the second portfolio (Long) contains the stocks for which the estimated online investor sentiment on the previous trading day is larger than the 90% cross-sectional quantile. A raw long-short portfolio return (Long-Short) is obtained as the difference between these two raw portfolio returns. The stocks are held in the portfolio for one trading day. The risk-adjusted returns are defined as the intercept of the regression of portfolio returns on the three risk factors introduced by Fama and French (1993) and the momentum factor of Carhart (1997). The t-statistics, reported in parenthesis, are constructed using Newey-West (1987) standard errors. Returns which are statistically significant at the 5% level are highlighted by boldfaced numbers
Differences in raw annualized returns of long-short portfolios
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | −1.07 | −0.97 | −1.93 | −0.47 | −0.21 | −0.22 | −1.35 | −0.76 |
| LM (2) | – | 0.16 | −0.81 | 0.76 | 0.84 | 0.86 | −0.26 | 0.35 | |
| L1 (3) | – | −1.29 | 0.55 | 0.84 | 0.80 | −0.41 | 0.23 | ||
| L2 (4) | – | 1.56 | 1.89 | 1.84 | 0.52 | 1.32 | |||
| VADER (5) | – | 0.22 | 0.22 | −0.94 | −0.39 | ||||
| Naive Bayes (6) | – | 0.00 | −1.18 | −0.61 | |||||
| Max. entropy (7) | – | −1.22 | −0.59 | ||||||
| Deep-MLSA (8) | – | 0.59 | |||||||
| DeepMoji (9) | – | ||||||||
Note: The table depicts the t-statistics of the average differences in raw returns of long-short portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B). More precisely, we take the difference between returns obtained from methodologies reported in the rows with those reported in columns. The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers
Differences in risk-adjusted annualized returns of long-short portfolios
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | −0.97 | −1.08 | −0.31 | −0.36 | −0.38 | −1.28 | −0.69 | |
| LM (2) | – | −0.04 | −1.04 | 0.78 | 0.59 | 0.59 | −0.32 | 0.29 | |
| L1 (3) | – | −1.33 | 0.80 | 0.76 | 0.72 | −0.27 | 0.39 | ||
| L2 (4) | – | 1.86 | 1.85 | 1.80 | 0.69 | 1.52 | |||
| VADER (5) | – | −0.09 | −0.09 | −1.01 | −0.48 | ||||
| Naive Bayes (6) | – | 0.00 | −0.95 | −0.36 | |||||
| Max. entropy (7) | – | −0.97 | −0.35 | ||||||
| Deep-MLSA (8) | – | 0.58 | |||||||
| DeepMoji (9) | – | ||||||||
Note: The table depicts the t-statistics of the average differences in risk-adjusted returns of long-short portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B). More precisely, we take the difference between returns obtained from methodologies reported in the rows with those reported in columns. Risk-adjusted returns are defined as the intercept plus the residuals of the regression in Equation (5). The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers
Annualized portfolios returns based on daily bullish sentiment (unique cashtags)
| Panel A: Twitter | ||||||
|---|---|---|---|---|---|---|
| Raw return | Risk-adjusted return | |||||
| Short | Long | Long-Short | Short | Long | Long-Short | |
| Harvard-IV | 1.45 | −1.17 | 0.47 | 1.49 | ||
| (2.61) | (3.10) | (0.82) | (−0.91) | (0.35) | (0.86) | |
| LM | 1.08 | |||||
| (2.13) | (3.21) | (2.27) | (−2.06) | (0.82) | (2.24) | |
| L1 | −2.31 | 1.72 | ||||
| (2.42) | (3.55) | (2.07) | (−1.55) | (1.19) | (2.10) | |
| L2 | −2.15 | |||||
| (2.37) | (3.72) | (2.81) | (−1.56) | (2.74) | (3.04) | |
| VADER | 2.72 | −0.59 | 1.99 | 2.42 | ||
| (2.66) | (3.36) | (1.49) | (−0.39) | (1.51) | (1.33) | |
| Naive Bayes | 3.50 | −1.11 | 2.55 | 3.51 | ||
| (2.67) | (3.68) | (1.86) | (−0.79) | (1.64) | (1.89) | |
| Max. entropy | 3.18 | −0.81 | 2.51 | 3.17 | ||
| (2.67) | (3.69) | (1.64) | (−0.59) | (1.57) | (1.69) | |
| Deep-MLSA | 1.42 | |||||
| (2.14) | (3.17) | (2.74) | (−2.72) | (1.14) | (2.81) | |
| DeepMoji | −1.85 | 2.70 | ||||
| (2.39) | (3.60) | (2.39) | (−1.21) | (1.70) | (2.18) | |
Note: The table depicts the annualized raw and risk-adjusted returns (in %) of three portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B) mentioning a unique cashtag. The first portfolio (Short) contains the stocks for which the estimated online investor sentiment on the previous (trading) day is smaller than the 10% cross-sectional quantile. Conversely, the second portfolio (Long) contains the stocks for which the estimated online investor sentiment on the previous trading day is larger than the 90% cross-sectional quantile. A raw long-short portfolio return (Long-Short) is obtained as the difference between these two raw portfolio returns. The stocks are held in the portfolio for 1 trading day. The risk-adjusted returns are defined as the intercept of the regression of portfolio returns on the three risk factors introduced by Fama and French (1993) and the momentum factor of Carhart (1997). The t-statistics, reported in parenthesis, are constructed using Newey-West (1987) standard errors. Returns which are statistically significant at the 5% level are highlighted by boldfaced numbers
Differences in raw annualized returns of long-short portfolios (unique cashtags)
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | −1.30 | −1.24 | −0.63 | −1.01 | −0.79 | −1.45 | −1.61 | |
| LM (2) | – | 0.13 | −0.73 | 0.72 | 0.34 | 0.46 | −0.34 | −0.29 | |
| L1 (3) | – | −1.04 | 0.59 | 0.24 | 0.39 | −0.45 | −0.49 | ||
| L2 (4) | – | 1.47 | 1.21 | 1.37 | 0.35 | 0.44 | |||
| VADER (5) | – | −0.36 | −0.21 | −0.96 | −1.00 | ||||
| Naive Bayes (6) | – | 0.22 | −0.63 | −0.74 | |||||
| Max. entropy (7) | – | −0.74 | −0.86 | ||||||
| Deep-MLSA (8) | – | 0.02 | |||||||
| DeepMoji (9) | – | ||||||||
The table depicts the t-statistics of the average differences in raw returns of long-short portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B) mentioning a unique cashtag. More precisely, we take the difference between returns obtained from methodologies reported in the rows with those reported in columns. The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers
Differences in risk-adjusted annualized returns of long-short portfolios (unique cashtags)
| Panel A: Twitter | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Harvard-IV (1) | – | −1.14 | −1.19 | −0.46 | −1.00 | −0.77 | −1.39 | −1.34 | |
| LM (2) | – | 0.02 | −0.91 | 0.71 | 0.18 | 0.31 | −0.44 | −0.20 | |
| L1 (3) | – | −1.09 | 0.70 | 0.19 | 0.35 | −0.43 | −0.25 | ||
| L2 (4) | – | 1.63 | 1.22 | 1.38 | 0.41 | 0.74 | |||
| VADER (5) | – | −0.51 | −0.35 | −1.06 | −0.88 | ||||
| Naive Bayes (6) | – | 0.23 | −0.58 | −0.44 | |||||
| Max. entropy (7) | – | −0.69 | −0.59 | ||||||
| Deep-MLSA (8) | – | 0.19 | |||||||
| DeepMoji (9) | – | ||||||||
The table depicts the t-statistics of the average differences in risk-adjusted returns of long-short portfolios based on daily bullish sentiment estimated from short messages published on Twitter (Panel A) and StockTwits (Panel B) mentioning a unique cashtag. More precisely, we take the difference between returns obtained from methodologies reported in the rows with those reported in columns. Risk-adjusted returns are defined as the intercept plus the residuals of the regression in Equation (5). The t-statistics are constructed using Newey-West (1987) standard errors. Differences which are statistically significant at the 5% level are highlighted by boldfaced numbers