| Literature DB >> 35058844 |
Paula Carolina Ciampaglia Nardi1, Evandro Marcos Saidel Ribeiro2, José Lino Oliveira Bueno3, Ishani Aggarwal4.
Abstract
The objective of this study was to jointly analyze the importance of cognitive and financial factors in the accuracy of profit forecasting by analysts. Data from publicly traded Brazilian companies in 2019 were obtained. We used text analysis to assess the cognitive biases from the qualitative reports of analysts. Further, we analyzed the data using statistical regression learning methods and statistical classification learning methods, such as Multiple Linear Regression (MRL), k-dependence Bayesian (k-DB), and Random Forest (RF). The Bayesian inference and classification methods allow an expansion of the research line, especially in the area of machine learning, which can benefit from the examples of factors addressed in this research. The results indicated that, among cognitive biases, optimism had a negative relationship with forecasting accuracy while anchoring bias had a positive relationship. Commonality, to a lesser extent, also had a positive relationship with the analyst's accuracy. Among financial factors, the most important aspects in the accuracy of analysts were volatility, indebtedness, and profitability. Age of the company, fair value, American Depositary Receipts (ADRs), performance, and loss were still important but on a smaller scale. The results of the RF models showed a greater explanatory power. This research sheds light on the cognitive as well as financial aspects that influence the analyst's accuracy, jointly using text analysis and machine learning methods, capable of improving the explanatory power of predictive models, together with the use of training models followed by testing.Entities:
Keywords: analysts’ accuracy; analysts’ forecast; cognitive biases; random forest; text analysis
Year: 2022 PMID: 35058844 PMCID: PMC8764190 DOI: 10.3389/fpsyg.2021.773894
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Research intersection. AF, analyst forecast; OA, others aspects; AF, financial aspects; BA, behavioral aspects; TA, temporal aspects.
Definition of the final sample in 2019.
| Procedures for selecting the final sample | Brazil |
| Initial sample | 338 |
| (–) Financial | 34 |
| (–) Without estimated LPA | 181 |
| (–) Lack of accounting data | 29 |
|
|
|
Spearman’s correlation.
| Ac | Overconf | Repres | Real | Common | Time | Size | Profit | Surp | Growth | Volat | Indeb | ROA | Age | Popul | SpecAna | ExperAna | Exper | |
| Overconf | 0.0364 | |||||||||||||||||
| Repres | –0.0306 | 0.4069 | ||||||||||||||||
| Real | –0.0450 | –0.0102 | 0.0150 | |||||||||||||||
| Common | 0.0454 | 0.0464 | –0.0078 | –0.0477 | ||||||||||||||
| Time | −0.0707 | –0.0206 | 0.0225 | 0.0374 | −0.0698 | |||||||||||||
| Size | 0.0409 | 0.0402 | –0.0316 | –0.0313 | −0.1141 | –0.0009 | ||||||||||||
| Profit | 0.2032 | 0.0370 | –0.0108 | 0.0693 | 0.0044 | 0.0421 | –0.0259 | |||||||||||
| Surp | 0.4125 | –0.0241 | –0.0104 | 0.0027 | 0.0587 | 0.0071 | 0.1192 | 0.1451 | ||||||||||
| Growth | 0.0241 | −0.0633 | −0.0623 | 0.0451 | 0.0207 | 0.0016 | –0.0164 | 0.0693 | 0.3043 | |||||||||
| Volat | 0.0062 | 0.0227 | –0.0441 | –0.0103 | 0.0501 | –0.0248 | 0.2050 | −0.0525 | −0.1219 | –0.0318 | ||||||||
| Indeb | −0.1504 | 0.0548 | 0.0079 | 0.0099 | –0.0508 | 0.0059 | 0.2907 | −0.1189 | −0.1932 | −0.0634 | 0.5660 | |||||||
| ROA | 0.2038 | −0.0773 | −0.0761 | 0.0296 | 0.0306 | 0.1281 | –0.0464 | 0.4657 | 0.2452 | 0.0237 | −0.2947 | −0.2829 | ||||||
| Age | −0.0490 | 0.0353 | –0.0483 | –0.0061 | –0.0006 | 0.0596 | 0.2130 | −0.0895 | 0.0392 | –0.0381 | 0.1640 | 0.1323 | 0.0836 | |||||
| Popul | −0.0888 | 0.0060 | 0.0054 | –0.0387 | −0.1443 | 0.0322 | 0.5890 | 0.0833 | −0.0589 | −0.1168 | −0.1460 | 0.0506 | 0.1361 | 0.1840 | ||||
| SpecAna | 0.0480 | –0.0127 | –0.0192 | 0.0121 | 0.2250 | 0.0268 | −0.3645 | 0.0160 | –0.0082 | 0.0922 | 0.1040 | –0.0018 | –0.0258 | 0.0676 | −0.3936 | |||
| ExperAna | 0.0278 | 0.0203 | 0.0230 | 0.0610 | 0.0498 | −0.0788 | 0.1526 | −0.1152 | 0.0473 | –0.0011 | 0.0692 | 0.1244 | −0.0738 | 0.0589 | 0.0450 | 0.0640 | ||
| Exper | 0.0610 | –0.0336 | –0.0337 | 0.0804 | 0.2475 | –0.0207 | −0.1852 | –0.0254 | –0.0341 | 0.0466 | 0.1709 | 0.0259 | 0.0263 | 0.0432 | −0.4011 | 0.6956 | 0.3723 | |
| Portf | 0.0729 | –0.0388 | –0.0423 | 0.0614 | 0.2171 | 0.0141 | −0.3296 | 0.0181 | –0.0312 | 0.0839 | 0.1021 | –0.0450 | 0.0443 | –0.0328 | −0.4946 | 0.7927 | 0.0996 | 0.8893 |
Being: *, **, *** significant at 1%, 5% and 10%, respectively.
Regression in Ordinary Least Squares (OLS), carried out in the Caret package, for 2019.
| General | Otim | Ancho | Overconf | |||||
| Coef. | T | Coef. | t | Coef. | t | Coef. | t | |
|
| –0.86 | −2.304 | –1.08 | −3.001 | ||||
|
| 0.84 | 1.997 | 1.17 | 2.919 | ||||
|
| –0.02 | –0.924 | –0.03 | –1.16 | ||||
|
| –0.06 | –0.572 | ||||||
|
| –0.02 | –0.274 | ||||||
|
| 0.07 | 1.11 | ||||||
|
| 0.00 | –0.819 | ||||||
|
| –0.11 | –0.18 | 0.26 | 0.43 | –0.09 | –0.14 | 0.22 | 0.36 |
|
| 1.69 | 2.355 | 1.45 | 2.029 | 1.58 | 2.204 | 1.42 | 1.974 |
|
| –0.84 | –1.02 | –0.85 | –1.044 | –0.83 | –1.02 | –0.86 | –1.05 |
|
| 34.92 | 3.096 | 34.85 | 3.106 | 38.13 | 3.414 | 39.03 | 3.470 |
|
| 0.00 | –0.348 | –0.01 | –0.43 | –0.01 | –0.45 | –0.01 | –0.63 |
|
| –0.99 | –0.952 | –1.18 | –1.133 | –0.73 | –0.70 | –1.03 | –0.99 |
|
| 0.13 | 1.60 | 0.13 | 1.667 | 0.13 | 1.58 | 0.14 | 1.764 |
|
| –5.77 | −6.031 | –5.83 | −6.140 | –5.68 | −5.956 | –5.96 | −6.245 |
|
| –28.53 | −1.943 | –26.65 | −1.833 | –29.47 | −2.031 | –30.99 | −2.118 |
|
| –0.01 | −1.798 | –0.01 | −1.845 | –0.01 | –1.72 | –0.01 | –1.65 |
|
| 0.62 | 1.39 | 0.81 | 1.849 | 0.59 | 1.32 | 0.84 | 1.906 |
|
| 0.59 | 1.06 | 0.47 | 0.85 | 0.77 | 1.39 | 0.61 | 1.10 |
|
| 0.29 | 0.658 | 0.23 | 0.53 | 0.24 | 0.55 | 0.18 | 0.42 |
|
| –0.05 | –0.43 | –0.13 | –1.22 | –0.05 | –0.42 | –0.13 | –1.25 |
|
| 0.13 | 0.807 | 0.15 | 0.93 | 0.16 | 1.02 | 0.16 | 1.02 |
|
| –0.06 | –0.334 | –0.04 | –0.26 | –0.06 | –0.34 | –0.03 | –0.20 |
|
| 0.07 | 1.33 | 0.07 | 1.44 | 0.07 | 1.37 | 0.07 | 1.42 |
|
| –0.23 | –1.45 | –0.25 | –1.60 | –0.23 | –1.50 | –0.25 | –1.59 |
|
| 3.51 | 0.647 | 1.82 | 0.98 | 1.28 | 0.69 | 2.60 | 1.23 |
|
| 0.1527 | 0.1398 | 0.1467 | 0.1274 | ||||
|
| 0.1755 | 0.1509 | 0.1679 | 0.1379 | ||||
|
| 3.8849 | 3.8850 | 3.8806 | 3.9053 | ||||
|
| 2.5818 | 2.6312 | 2.5987 | 2.6539 | ||||
Being: *, **, *** significant at 1%, 5% and 10%, respectively.
Regression in OLS, carried out in the Caret package, for 2019.
| Repres | Real | Common | Time | |||||
| Coef. |
| Coef. |
| Coef. |
| Coef. |
| |
|
| –0.12 | –1.23 | ||||||
|
| 0.05 | 1.25 | ||||||
|
| 0.06 | 1.680 | ||||||
|
| –0.002 | –1.30 | ||||||
|
| 0.18 | 0.31 | 0.11 | 0.19 | 0.08 | 0.13 | 0.21 | 034 |
|
| 1.39 | 1.936 | 1.50 | 2.081 | 1.52 | 2.108 | 1.42 | 1.982 |
|
| –0.99 | –1.21 | –0.95 | –1.16 | –0.93 | –1.14 | –0.83 | –1.01 |
|
| 37.55 | 3.342 | 37.67 | 3.354 | 37.79 | 3.369 | 37.06 | 3.292 |
|
| –0.01 | –0.58 | –0.01 | –0.55 | –0.01 | –0.57 | –0.01 | –0.49 |
|
| –0.96 | –0.92 | –1.04 | –0.99 | –1.04 | –1.00 | –0.97 | –0.93 |
|
| 0.14 | 1.768 | 0.14 | 1.740 | 0.14 | 1.724 | 0.13 | 1.679 |
|
| –5.90 | −6.188 | –6.04 | −6.307 | –6.08 | −6.352 | –5.86 | −6.133 |
|
| –3.12 | −2.132 | –3.01 | −2.064 | –3.05 | −2.090 | –2.74 | −1.863 |
|
| –0.01 | −1.742 | –0.01 | −1.843 | –0.01 | −1.830 | –0.01 | −1.728 |
|
| 0.81 | 1.849 | 0.79 | 1.802 | 0.77 | 1.768 | 0.79 | 1.814 |
|
| 0.61 | 1.09 | 0.51 | 0.92 | 0.49 | 0.89 | 0.59 | 1.06 |
|
| 0.15 | 0.35 | 0.17 | 0.39 | 0.14 | 0.33 | 0.10 | 0.24 |
|
| –0.13 | –1.23 | –0.12 | –1.12 | –0.11 | –1.06 | –0.13 | –1.20 |
|
| 0.16 | 0.99 | 0.14 | 0.91 | 0.13 | 0.80 | 0.16 | 1.02 |
|
| –0.04 | –0.25 | –0.03 | –0.20 | –0.04 | –0.25 | –0.05 | –0.29 |
|
| 0.07 | 1.46 | 0.07 | 1.41 | 0.07 | 1.44 | 0.07 | 1.40 |
|
| –0.25 | –1.63 | –0.24 | –1.55 | –0.24 | –1.53 | –0.24 | –1.53 |
|
| 7.32 | 1.43 | –0.26 | –0.11 | –0.97 | –0.41 | 1.84 | 0.97 |
|
| 0.1213 | 0.1348 | 0.1348 | 0.1336 | ||||
|
| 0.1380 | 0.1444 | 0.1495 | 0.1546 | ||||
|
| 3.9189 | 3.8948 | 3.8940 | 3.8959 | ||||
|
| 2.6475 | 2.6381 | 2.6262 | 2.6146 | ||||
Being: *, **, *** significant at 1%, 5% and 10%, respectively.
FIGURE 2Importance of variables in Ordinary Least Squares (OLS).
Regression in Random Forest (RF) in 2019.
| General | Otim | Ancho | Overnconf | Repres | Real | Common | Time | |
|
| 2.1240 | 2.0907 | 2.2585 | 2.2739 | 2.2611 | 2.2835 | 2.2800 | 2.2420 |
|
| 0.6965 | 0.6986 | 0.6737 | 0.6579 | 0.6627 | 0.6533 | 0.6544 | 0.6711 |
|
| 13 | 10 | 10 | 10 | 10 | 10 | 10 | 10 |
|
| 1.8320 | 1.7953 | 2.1536 | 2.1601 | 2.1584 | 2.2154 | 2.1202 | 2.1314 |
|
| 0.5772 | 0.5928 | 0.5005 | 0.4867 | 0.4878 | 0.4812 | 0.5045 | 0.5055 |
FIGURE 3Importance of variables in Random Forest (RF).
FIGURE 4List of variables in the k-dependence Bayesian (k-DB) classification model, with a value of k = 5.
Confusion matrix for the k-dependence Bayesian (k-DB) classification model in 2019.
| Reference – training | Reference – test | ||||||||
|
|
| ||||||||
| Prediction | Poor | Median | Good | High | Prediction | Poor | Median | Good | High |
| Poor | 189 | 18 | 12 | 2 | Poor | 82 | 14 | 5 | 4 |
| Median | 8 | 92 | 14 | 11 | Median | 4 | 36 | 6 | 1 |
| Good | 12 | 17 | 124 | 23 | Good | 7 | 6 | 34 | 20 |
| High | 7 | 5 | 26 | 151 | High | 1 | 3 | 23 | 61 |
| Accuracy | 0.7744 | Accuracy | 0.6938 | ||||||
| Kappa | 0.6956 | Kappa | 0.5847 | ||||||
Rating Model in RF in 2019.
| Reference – training | Reference – test | ||||||||
|
|
| ||||||||
| Prediction | Poor | Median | Good | High | Prediction | Poor | Median | Good | High |
| Poor | 82 | 14 | 5 | 4 | Poor | 82 | 14 | 5 | 4 |
| Median | 4 | 36 | 6 | 1 | Median | 4 | 36 | 6 | 1 |
| Good | 7 | 6 | 34 | 20 | Good | 7 | 6 | 34 | 20 |
| High | 1 | 3 | 23 | 61 | High | 1 | 3 | 23 | 61 |
| Accuracy | 0.6938 | Accuracy | 0.6938 | ||||||
| Kappa | 0.5847 | Kappa | 0.5847 | ||||||
Importance of variables using the RF classification model in 2019.
| Importance | Importance | Importance | |||
| Volat.c.Q | 100 | Size.c.L | 5.957 | Port.c.C | 1.39 |
| Surp.c.L | 65.127 | Surp.c.C | 5.845 | Exper.c.Q | 1.376 |
| Surp.c.Q | 58.25 | Age.c.L | 5.64 | Real.c.C | 1.265 |
| Loss1 | 45.761 | Indeb.c.L | 5.114 | Sizeroker1 | 1.257 |
| Volat.c.L | 45.313 | Size.c.C | 5.078 | Especia.c.L | 1.255 |
| Profit.c.L | 40.512 | ADR1 | 4.781 | Real.c.Q | 1.027 |
| Otim1 | 34.861 | Volat.c.C | 4516 | Especia.c.C | 1.024 |
| Sector1 | 33.513 | Growth.c.C | 4.417 | Overnconf.c.Q | 0.827 |
| Anco1 | 28.636 | Popul.c.Q | 4.041 | Experi.c.L | 0.786 |
| Profit.c.Q | 23.082 | ROA.c.L | 3.919 | Experi.c.Q | 0.731 |
| Popul.c.C | 22.743 | FV1 | 3.73 | Common.c.L | 0.652 |
| Growth.c.L | 19.636 | Port.c.L | 3.128 | Time.c.C | 0.64 |
| Indeb.c.Q | 19.061 | ROA.c.C | 2.621 | Overnconf.c.L | 0.601 |
| Popul.c.L | 17.777 | Port.c.Q | 2.58 | Repres.c.C | 0.465 |
| Size.c.Q | 12.206 | Growth.c.Q | 2.575 | Exper.c.C | 0.44 |
| Indeb.c.C | 11.694 | Time.c.L | 2.003 | Common.c.Q | 0.403 |
| Profit.c.C | 11.299 | Specia.c.Q | 1.964 | Real.c.L | 0.347 |
| ROA.c.Q | 7.447 | Exper.c.L | 1.451 | Repres.c.Q | 0.292 |
| Age.c.Q | 6.632 | Experi.c.C | 1.416 | Repres.c.L | 0.241 |
| Age c.C | 6.389 | Time.c.Q | 1.398 | Overnconf.c.C | 0.09 |
C, median; Q, good; L, high.