| Literature DB >> 29943172 |
Dries Trippas1, David Kellen2, Henrik Singmann3, Gordon Pennycook4, Derek J Koehler5, Jonathan A Fugelsang5, Chad Dubé6.
Abstract
The belief-bias effect is one of the most-studied biases in reasoning. A recent study of the phenomenon using the signal detection theory (SDT) model called into question all theoretical accounts of belief bias by demonstrating that belief-based differences in the ability to discriminate between valid and invalid syllogisms may be an artifact stemming from the use of inappropriate linear measurement models such as analysis of variance (Dube et al., Psychological Review, 117(3), 831-863, 2010). The discrepancy between Dube et al.'s, Psychological Review, 117(3), 831-863 (2010) results and the previous three decades of work, together with former's methodological criticisms suggests the need to revisit earlier results, this time collecting confidence-rating responses. Using a hierarchical Bayesian meta-analysis, we reanalyzed a corpus of 22 confidence-rating studies (N = 993). The results indicated that extensive replications using confidence-rating data are unnecessary as the observed receiver operating characteristic functions are not systematically asymmetric. These results were subsequently corroborated by a novel experimental design based on SDT's generalized area theorem. Although the meta-analysis confirms that believability does not influence discriminability unconditionally, it also confirmed previous results that factors such as individual differences mediate the effect. The main point is that data from previous and future studies can be safely analyzed using appropriate hierarchical methods that do not require confidence ratings. More generally, our results set a new standard for analyzing data and evaluating theories in reasoning. Important methodological and theoretical considerations for future work on belief bias and related domains are discussed.Entities:
Keywords: Belief bias; Deductive reasoning; Hierarchical Bayesian; Meta-analysis; Signal detection theory; Syllogisms
Mesh:
Year: 2018 PMID: 29943172 PMCID: PMC6267550 DOI: 10.3758/s13423-018-1460-7
Source DB: PubMed Journal: Psychon Bull Rev ISSN: 1069-9384
The design of Evans et al. (1983, Experiment 1), example syllogisms, and endorsement rates
| Conclusion | ||
|---|---|---|
| Syllogism | Believable | Unbelievable |
| Valid | No cigarettes are inexpensive. | No addictive things are inexpensive. |
| Some addictive things are inexpensive. | Some cigarettes are inexpensive. | |
| Therefore, some addictive things are not | Therefore, some cigarettes are not addictive. | |
| cigarettes. | ||
| P(“valid”) = .92 | P(“valid”) = .46 | |
| Invalid | No addictive things are inexpensive. | No cigarettes are inexpensive. |
| Some cigarettes are inexpensive. | Some addictive things are inexpensive. | |
| Therefore, some addictive things are not | Therefore, some cigarettes are not addictive. | |
| cigarettes. | ||
| P(“valid”) = .92 | P(“valid”) = .08 | |
Fig. 1Left Panel: Examples of ROCs predicted by the linear model. Center Panel: Illustration of how differences in response bias and discriminability in the linear model are expressed in terms of ROCs. Right Panel: Example of data that according to the linear model imply differences in discriminability for believable and unbelievable syllogisms, but could be described in terms of different response biases if the predicted ROC was curvilinear
Fig. 2Illustration of the Gaussian SDT model. The top panel shows argument strength distributions for valid and invalid syllogisms (and their respective parameters) in the case of binary choices. The bottom panel illustrates the same model in the case of responses in a six-point confidence-rating scale (for clarity, some parameters and labels are omitted here)
Fig. 3Illustration of SDT predictions. The top row illustrates how ROC predictions change across values of μ (with μ = 0 and σ = 1). The middle row illustrates how ROC predictions change across values of σ. The bottom-left panel shows how hits and false alarms change due changes in the response criterion. The bottom-right panel illustrates how confidence-rating ROCs (4-point scale) are based on the cumulative probabilities associated with the different confidence responses
Fig. 4ROC data from Dube et al. (2010, Exp. 2)
Fig. 5Illustration of the effects caused by the aggregation of responses across heterogeneous participants and stimuli
Description of hierarchical linear model parameters and super/subscripts
| Parameter | Meaning |
|---|---|
|
| Grand mean |
|
| Study effect |
|
| Person effect |
|
| Item effect |
| Super/Subscript | Meaning |
| Valid/invalid | |
|
| Study |
|
| Participant in Study |
|
| Syllogistic forms |
Description of the data corpus
| Study ID | N participants | N trials | Study |
|---|---|---|---|
| 1 | 44 | 16 | Trippas et al., ( |
| 2 | 47 | 16 | Trippas et al., ( |
| 3 | 44 | 16 | Trippas ( |
| 4 | 42 | 16 | Trippas ( |
| 5 | 32 | 16 | Trippas ( |
| 6 | 34 | 16 | Trippas ( |
| 7 | 36 | 16 | Trippas et al., ( |
| 8 | 49 | 16 | Trippas et al., ( |
| 9 | 45 | 8 | Trippas (unpublished), complex-syllogisms, internal replication |
| 10 | 38 | 16 | Trippas et al., ( |
| 11 | 38 | 16 | Trippas et al., ( |
| 12 | 42 | 8 | Nuobaraite ( |
| 13 | 24 | 8 | Trippas (unpublished), complex-syllogisms, debias instructions |
| 14 | 191 | 16 | Trippas et al., ( |
| 15 | 38 | 8 | Dube et al., ( |
| 16 | 21 | 16 | Dube et al., ( |
| 17 | 24 | 16 | Dube et al., ( |
| 18 | 27 | 16 | Dube et al., ( |
| 19 | 45 | 8 | Heit and Rotello ( |
| 20 | 44 | 8 | Heit and Rotello ( |
| 21 | 44 | 8 | Heit and Rotello ( |
| 22 | 44 | 8 | Heit and Rotello ( |
Note. “N trials” gives the number of trials per participant and believability by validity cell (i.e., each participant responded to “N trials” times 4 syllogisms)
Fig. 6Believable-syllogism ROCs observed in each of the reanalyzed studies (for details, see Table 2). Note that these ROCs are based on the aggregated data. The shaded regions correspond to the hierarchical SDT model’s predictions based on its posterior parameter estimates
Fig. 7Unbelievable-syllogism ROCs observed in each of the reanalyzed studies (for details, see Table 2). Note that these ROCs are based on the aggregated data. The shaded regions correspond to the hierarchical SDT model’s predictions based on its posterior parameter estimates
Fig. 8Posterior estimates of group-level observed in each study (squares; ), and the posterior estimate obtained across studies (diamonds; alone). The bars and the width of the diamond correspond to the 95% credible intervals. The size of the squares reflects the width of the credible intervals
Fig. 9Forest plot with the posterior group-level estimates (and respective 95% credible intervals) of discriminability (d) for believable and unbelievable syllogisms. The size of the squares reflects the width of the credible intervals. The probability P (Unbel > Bel) corresponds to the posterior probability that the group-level d estimate for unbelievable syllogisms is larger than for believable syllogisms
Fig. 10Posterior estimates (and respective 95% credible intervals) of the stimulus-level deviations () for the SDT parameters x concerning valid and invalid syllogisms (s or s). The posterior estimates for the syllogisms used in study 2 are omitted here (see Foonote 5)
Validity Checks
| Model | Parameter / Derived Measure | |||
|---|---|---|---|---|
|
|
|
|
| |
| Original | 1.05 [.97, 1.14] | .62 [.54, .71] | .79 [.72, .86] | .32 [.25, .38] |
| Alternative | 1.05 [.97, 1.13] | .62 [.54, .70] | .78 [.72, .85] | .32 [.25, .39] |
| No | 1.05 [.98, 1.14] | .63 [.56, .71] | .78 [.72, .85] | .33 [.27, .39] |
| No | .98 [.92, 1.05] | .61 [.53, .68] | .76 [.69, .82] | .34 [.29, .40] |
|
| 1.05 [.98, 1.12] | .61 [.52, .70] | .77 [.70, .84] | .32 [.27, .38] |
|
|
|
|
| |
| Original | .50 [.46, .55] | .47 [.43, .53] | .51 [.45, .58] | .47 [.41, .54] |
| Alternative | .50 [.46, .54] | .47 [.43, .52] | .51 [.45, .57] | .47 [.41, .53] |
| No | .50 [.46, .55] | .48 [.44, .53] | .52 [.46, .58] | .49 [.44, .55] |
| No | .58 [.53, .62] | .56 [.52, .60] | .68 [.61, .76] | .56 [.52, .61] |
|
| .70 [.65, .76] | .48 [.43, .52] | .69 [.62, .76] | .47 [.41, .53] |
|
|
|
|
| |
| Original | 1.06 [.95, 1.17] | 1.09 [.96, 1.23] | .62 [.46, .78] | .67 [.52, .83] |
| Alternative | 1.06 [.95, 1.17] | 1.09 [.95, 1.23] | .63 [.47, .78] | .67 [.52, .83] |
| No | 1.05 [.97, 1.12] | 1.05 [.98, 1.13] | .60 [.47, .74] | .64 [.50, .78] |
| No | 1.03 [.99, 1.08] | 1.21 [1.14, 1.28] | .46 [.35, .58] | .47 [.35, .59] |
|
| 1.48 [1.35, 1.61] | 1.45 [1.30, 1.62] | .51 [.39, .64] | .54 [.42, .67] |
Note. Values in [ ] correspond to the 95% credible intervals of the posterior distributions. The “Original” model is referred to throughout the results section. The “Alternative” model has the same structure but a different prior distribution specification. The “No ” model has no stimulus effect (i.e., data aggregated within participants) and the “No , no ” has neither participant nor study effect (i.e., data aggregated within studies), both models are otherwise identical to the “Original” model. The “” model uses the same priors as the “Original” model, but is fitted to data generated from the parameters of the “Original” model with the sole difference that
Description of the variables in the probit regression analysis
| Variable | Description |
|---|---|
| subj | Identifier for each participant (factor) |
| syll | Identifier for each syllogistic structure (factor) |
| rsp | Response: 1 if endorsed as valid, 0 if rejected |
| logic | Logical Validity: valid or invalid (factor, first level = valid) |
| belief | Conclusion Believability: believable or unbelievable (factor, first level = believable) |
| crt | CRT-grouping: high or low (factor, first level = high) |