| Literature DB >> 26157397 |
Ulrich Hoffrage1, Sebastian Hafenbrädl1, Cyril Bouquet2.
Abstract
In Bayesian inference tasks, information about base rates as well as hit rate and false-alarm rate needs to be integrated according to Bayes' rule after the result of a diagnostic test became known. Numerous studies have found that presenting information in a Bayesian inference task in terms of natural frequencies leads to better performance compared to variants with information presented in terms of probabilities or percentages. Natural frequencies are the tallies in a natural sample in which hit rate and false-alarm rate are not normalized with respect to base rates. The present research replicates the beneficial effect of natural frequencies with four tasks from the domain of management, and with management students as well as experienced executives as participants. The percentage of Bayesian responses was almost twice as high when information was presented in natural frequencies compared to a presentation in terms of percentages. In contrast to most tasks previously studied, the majority of numerical responses were lower than the Bayesian solutions. Having heard of Bayes' rule prior to the study did not affect Bayesian performance. An implication of our work is that textbooks explaining Bayes' rule should teach how to represent information in terms of natural frequencies instead of how to plug probabilities or percentages into a formula.Entities:
Keywords: applied business statistics; bayesian inference; executives; management; natural frequency; representation format; updating beliefs
Year: 2015 PMID: 26157397 PMCID: PMC4475789 DOI: 10.3389/fpsyg.2015.00642
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Numerical information of the Skiwell Manufacturing Company task. (A) Information provided in percentages. Hit rate and false-alarm rate have been normalized with respect to the base rates of the two suppliers. (B) Natural frequencies with suppliers at the intermediate level. The frequencies of the four conjunctive events implicitly contain the base rate information about the suppliers. (C) Natural frequencies with diagnostic information at the intermediate level. From the perspective of (B), the tree in (C) represents a Bayesian update, in which the four distinct events are now conditioned on diagnostic information.
The four tasks used in the present study with the information provided and the Bayesian solution.
| Skiwell manufacturing | Percentages | 30 | 15 | 10 | 39.13 |
| Natural frequencies | 300 of 1000 | 45 of 300 | 70 of 700 | 45 of 115 | |
| IRS Audit | Percentages | 20 | 30 | 10 | 42.86 |
| Natural frequencies | 200 of 1000 | 60 of 200 | 80 of 800 | 60 of 140 | |
| Techtronics equipment | Percentages | 60 | 70 | 50 | 67.74 |
| Natural frequencies | 60 of 100 | 42 of 60 | 20 of 40 | 42 of 62 | |
| Varden soap | Percentages | 60 | 5 | 10 | 42.86 |
| Natural frequencies | 600 of 1000 | 30 of 600 | 40 of 400 | 30 of 70 |
FIGURE 2Percentages of Bayesian responses, depending on whether the numerical information has been communicated in terms of percentages or natural frequencies.
FIGURE 3Distribution of numerical estimates for the four tasks. The three straight lines indicate information that has been given in the task: Br indicates the base rate for the focal category, Hr indicates the hit rate (that is, diagnostic information conditioned on the focal category, p(D|H)), F indicates the false-alarm rate (that is, diagnostic information conditioned on the non-focal category, p(D|–H)). The two dotted lines indicate possible ways of combining this information: Bay indicates the Bayesian solution, p(H|D), and J stands for Joint Occurrence of D and H, p(D and H). The numbers on these lines (and those on the y-axes) denote the frequencies of the corresponding numerical estimates. For instance, 10 participants in the percentage condition of the IRS task provided a numerical estimate between 20 and 20.99% (in fact, all these 10 participants wrote exactly 20%, which was identical to the base rate of that task), and 12 participants provided an estimate that has been coded as a Bayesian response (three gave the exact Bayesian response, either as the ratio 45/115, or they wrote down the exact number including the decimal, 42.86%, most likely with the help of a pocket calculator, one responded with 42.8%, and one with 42.9%. These five responses are displayed in the bracket ranging from 42.0–42.99%. The remaining seven participants responded with 43%. These seven estimates are displayed in the adjacent bracket, namely 43.0–43.99%, but they were nevertheless coded as Bayesian because our classification criterion allowed for rounding within one percentage point, see above). Note that something similar could be observed for each of the four tasks: the responses that have been classified as Bayesian are spread across two adjacent brackets, and hence the number of Bayesian responses is not visualized by one single bar, but rather consists of two lower numbers visualized by two bars.
Use of cognitive strategies, split by representation format and participant sample.
| Bayesian | 14.57 | 26.6 | 40.53 | 36.21 | 29.2 | 1.47 | <0.001 | 0.22 | 0.39 | 1.0 | 0.012 |
| Base rate | 11.42 | 11.17 | 1.52 | 2.3 | 6.6 | –2.21 | 0.001 | 0.47 | 0.55 | –0.47 | 0.587 |
| Hit rate | 4.72 | 5.85 | 13.26 | 14.94 | 9.5 | 1.15 | 0.001 | 0.12 | 0.69 | 0.05 | 0.886 |
| False-alarm rate | 1.97 | 3.19 | 4.17 | 3.45 | 3.2 | 0.78 | 0.174 | 0.18 | 0.76 | 0.7 | 0.413 |
| Joint occurence | 12.6 | 15.43 | 4.55 | 11.49 | 10.6 | –1.11 | 0.009 | 1.0 | 0.02 | –0.77 | 0.158 |
| Total observations | 254 | 188 | 264 | 174 | 880 | ||||||
The coefficients (B) and p-values result from five different logistic regressions, one for each strategy, that were conducted to determine how representation format and participant sample affected strategy use (after controlling for task and order, and with standard errors clustered for each participant).
Total observations refer to the total number of responses on which the percentages reported in the cells are based, that is, the numbers in the cells denote column percentages.
FIGURE 4Percentages of Bayesian responses, separately for the different groups of participants.