| Literature DB >> 35369253 |
Alaina Talboy1,2, Sandra Schneider2.
Abstract
This work examines the influence of reference dependence, including value selection bias and congruence effects, on diagnostic reasoning. Across two studies, we explored how dependence on the initial problem structure influences the ability to solve simplified precursors to the more traditional Bayesian reasoning problems. Analyses evaluated accuracy and types of response errors as a function of congruence between the problem presentation and question of interest, amount of information, need for computation, and individual differences in numerical abilities. Across all problem variations, there was consistent and strong evidence of a value selection bias in that incorrect responses almost always conformed to values that were provided in the problem rather than other errors including those related to computation. The most consistent and unexpected error across all conditions in the first experiment was that people were often more likely to utilize the superordinate value (N) as part of their solution rather than the anticipated reference class values. This resulted in a weakened effect of congruence, with relatively low accuracy even in congruent conditions, and a dominant response error of the superordinate value. Experiment 2 confirmed that the introduction of a new sample drew attention away from the provided reference class, increasing reliance on the overall sample size. This superordinate preference error, along with the benefit of repeating the PPV reference class within the question, demonstrated the importance of reference dependence based on the salience of information within the response prompt. Throughout, higher numerical skills were generally associated with higher accuracy, whether calculations were required or not.Entities:
Keywords: Bayesian reasoning; PPV; numeracy; problem presentation; problem solving; problem structure; reference dependence
Year: 2022 PMID: 35369253 PMCID: PMC8970303 DOI: 10.3389/fpsyg.2022.729285
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Bayesian reasoning task values organized in a 2 × 2 contingency table.
| Test positive | Test negative |
| |
|
| Condition positive and test positive (C+T+) | Condition positive and test negative (C+T–) | Total condition positive (C+) |
|
| Condition negative and test positive (C-T+) | Condition negative and test negative (C–T–) | Total condition negative (C–) |
|
| Total test positive (T+) | Total test negative (T–) | Superordinate set (N) |
Example condition-focus and test-focus presentations of the mammography problem.
| Condition-Focus (CF) | Condition-Focus (CF) |
| In this sample of 10,000 women, 100 have breast cancer. | In this sample of 10,000 women, 1,070 received a positive result on their mammogram. |
| Of the 100 women who have breast cancer: | Of the 1,070 women who received a positive result on their mammogram: |
| 80 received a positive result on their mammogram. | 80 have breast cancer. |
| Of the 9,900 women who do not have breast cancer: | Of the 8,930 women who received a negative result on there mammogram: |
| 990 received a positive result on their mammogram. | 20 have breast cancer. |
| Imagine another random sample of 10,000 women who had a mammogram. | Imagine another random sample of 10,000 women who had a mammogram. |
Examples show the classic presentation which provides only partial subset information.
Bayesian reasoning problems and relevant values.
| Base rate | True positive rate | False positive rate | PPV | |||
| Domain | Topic | C+ | N | (C+T+) | (C+) | (C–T+) | (C–) | % |
| Medical | Mammogram | 100 | 10,000 | 80 | 100 | 990 | 9,900 | 8 |
| Medical | Diabetes | 50 | 10,000 | 48 | 50 | 4,975 | 9,950 | 1 |
| Legal | Polygraph | 50 | 1,000 | 47 | 50 | 47 | 950 | 50 |
| Legal | Recidivism | 156 | 1,000 | 130 | 156 | 220 | 844 | 37 |
| Sports | Baseball | 185 | 250 | 130 | 185 | 15 | 65 | 90 |
| Sports | Tennis | 2,800 | 10,000 | 2,000 | 2,800 | 1,100 | 7,200 | 65 |
| College | Employment | 140 | 200 | 70 | 140 | 10 | 60 | 88 |
| College | Exam Prep | 350 | 500 | 275 | 350 | 25 | 150 | 92 |
Base rate, the number of condition occurrences (C+) within the specific sample size (N). PPV, positive predictive value (% who correctly test positive out of all those who test positive. (C-T+| C–), the number of people who test positive (erroneously) out of the number of people who are actually negative. PPV is rounded to the nearest whole percentage.
Example presentations with full subset information for the mammography problem.
| Incongruent condition-Focus problem – Full subset information |
| To determine whether a woman is at risk of breast cancer, doctors conduct mammogram screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive Here is some information for a random sample of 10,000 women who had a mammogram: |
| In this sample of 10,000 women, 100 have breast cancer. |
| Of the 100 women who have breast cancer: |
| 80 received a positive result on their mammogram. |
| 20 received a negative result on their mammogram. |
| Of the 9,900 women who do not have breast cancer: |
| 990 received a positive result on their mammogram. |
| 8,910 received a negative result on their mammogram. |
| Imagine another random sample of 10,000 women who had a mammogram. |
|
|
|
|
|
|
| To determine whether a woman is at risk of breast cancer, doctors conduct mammograms screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive. Here is some information for a random sample of 10,000 women who had a mammogram: |
| In this sample of 10,000 women, 1,070 received a positive result on their mammogram. |
| Of the 1,070 women who received a positive result on their mammogram: |
| 80 have breast cancer. |
| 990 do not have breast cancer. |
| Of the 8,930 women who received a negative result on their mammogram: |
| 20 have breast cancer. |
| 8,910 do not have breast cancer. |
| Imagine another random sample of 10,000 women who had a mammogram. |
FIGURE 1Experiment 1a condition accuracy means while controlling for numeracy. Error bars represent ± 1 SE.
FIGURE 2Proportion of participants who consistently used the incorrect denominator strategy on frequency responses. C± denotes the total number of people who have or do not have the condition. Total N denotes the total in the superordinate set (i.e., sample size).
Example presentations without reference class totals for the mammography problem.
| Incongruent condition-Focus problem without reference class totals |
| To determine whether a woman is at risk of breast cancer, doctors conduct mammogram screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive Here is some information for a random sample of 10,000 women who had a mammogram: |
| In this sample of 10,000 women: |
| Of those who have breast cancer: |
| 80 received a positive result on their mammogram. |
| 20 received a negative result on their mammogram. |
| Of those who do not have breast cancer: |
| 990 received a positive result on their mammogram. |
| 8,910 received a negative result on their mammogram. |
| Imagine another random sample of 10,000 women who had a mammogram. |
|
|
|
|
|
|
| To determine whether a woman is at risk of breast cancer, doctors conduct mammograms screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive. Here is some information for a random sample of 10,000 women who had a mammogram: |
| In this sample of 10,000 women: |
| Of those who received a positive result on their mammogram: |
| 80 have breast cancer. |
| 990 do not have breast cancer. |
| Of those who received a negative result on their mammogram: |
| 20 have breast cancer. |
| 8,910 do not have breast cancer. |
| Imagine another random sample of 10,000 women who had a mammogram. |
FIGURE 3Experiment 1b condition accuracy means while controlling for numeracy. Error bars represent ± 1 SE.
FIGURE 4Proportion of participants who consistently used an incorrect denominator strategy on the frequency response format. N denotes the total in the superordinate set (i.e., sample size), C± denotes the total number of people who have or do not have the condition.
Example presentations with superordinate set organization for the mammography problem.
| Incongruent condition-Focus problem with superordinate set organization |
| To determine whether a woman is at risk of breast cancer, doctors conduct mammogram screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive Here is some information for a random sample of 10,000 women who had a mammogram: |
| Of the 10,000 women in this sample, |
| 80 have breast cancer and received a positive result on their mammogram. |
| 20 have breast cancer and received a negative result on their mammogram. |
| 990 do not have breast cancer and received a positive result on their |
| mammogram. |
| 8,910 do not have breast cancer and received a negative result on their |
| mammogram. |
| Imagine another random sample of 10,000 women who had a mammogram. |
|
|
|
|
|
|
| To determine whether a woman is at risk of breast cancer, doctors conduct mammograms screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive. Here is some information for a random sample of 10,000 women who had a mammogram: |
| Of the 10,000 women in this sample, |
| 80 received a positive result on their mammogram and have breast cancer. |
| 990 received a positive result on their mammogram and do not have breast |
| cancer. |
| 20 received a negative result on their mammogram and have breast cancer. |
| 8,910 received a negative result on their mammogram and do not have |
| breast cancer. |
| Imagine another random sample of 10,000 women who had a mammogram. |
Example presentations with no explicit organization for the mammography problem.
| Incongruent condition-Focus problem with no explicit organization |
| To determine whether a woman is at risk of breast cancer, doctors conduct mammogram screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive Here is some information for a random sample of 10,000 women who had a mammogram: |
| 80 have breast cancer and received a positive result on their mammogram. |
| 20 have breast cancer and received a negative result on their mammogram. |
| 990 do not have breast cancer and received a positive result on their |
| mammogram. |
| 8,910 do not have breast cancer and received a negative result on their |
| mammogram. |
| Imagine another random sample of the same number of women who had a mammogram. |
|
|
|
|
|
|
| To determine whether a woman is at risk of breast cancer, doctors conduct mammograms screenings. Sometimes women test positive even when they should test negative or test negative when they should test positive. Here is some information for a random sample of 10,000 women who had a mammogram: |
| 80 received a positive result on their mammogram and have breast cancer. |
| 990 received a positive result on their mammogram and do not have breast |
| cancer. |
| 20 received a negative result on their mammogram and have breast cancer. |
| 8,910 received a negative result on their mammogram and do not have |
| breast cancer. |
| Imagine another random sample of the same number of women who had a mammogram. |
FIGURE 5Experiment 1c condition accuracy means while controlling for numeracy. Error bars indicate ± 1 SE.
FIGURE 6Proportion of participants who consistently used the incorrect denominator strategy on the frequency response format. N denotes the total in the superordinate set (i.e., sample size), C± denotes the total number of people who have or do not have the condition.
FIGURE 7Experiment 2 condition accuracy means while controlling for numeracy. Error bars indicate ± 1 SE.
FIGURE 8Proportion of participants who consistently used the incorrect denominator strategy on the frequency response format. N denotes the total in the superordinate set (i.e., sample size), C± denotes the total number of people who have or do not have the condition, other includes both selected and not readily identifiable values.