| Literature DB >> 35300611 |
Emily Day1, David Eldred-Evans2, A Toby Prevost3, Hashim U Ahmed2,4, Francesca Fiorentino5,6,7.
Abstract
INTRODUCTION: Novel screening tests used to detect a target condition are compared against either a reference standard or other existing screening methods. However, as it is not always possible to apply the reference standard on the whole population under study, verification bias is introduced. Statistical methods exist to adjust estimates to account for this bias. We extend common methods to adjust for verification bias when multiple tests are compared to a reference standard using data from a prospective double blind screening study for prostate cancer.Entities:
Keywords: Begg and Greenes; Multiple imputation; Sensitivity; Specificity; Verification bias
Mesh:
Substances:
Year: 2022 PMID: 35300611 PMCID: PMC8932251 DOI: 10.1186/s12874-021-01481-w
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Screening test results for participants who did not undergo condition verification in the IP1-PROSTAGRAM trial
| Verification Status | MRI | Ultrasound | PSA | Total |
|---|---|---|---|---|
| 0 | 0 | 0 | ||
| 1 | 0 | 0 | 5 | |
| 0 | 1 | 0 | 5 | |
| 0 | 0 | 1 | 1 | |
| 1 | 1 | 0 | 3 | |
| 1 | 0 | 1 | 1 | |
| 0 | 1 | 1 | 2 | |
| 1 | 1 | 1 | 0 | |
1MRI/ultrasound/PSA = 1: screen-positive result
2MRI/ultrasound/PSA = 0: screen-negative result
3V = 0: did not undergo condition verification (reference standard)
Begg and Greenes method for three screening tests (R, S, T)
| Screening Tests | Condition Status | |||||
|---|---|---|---|---|---|---|
| Verification Status | Total | |||||
| 0 | 0 | 0 | ||||
| 1 | 0 | 0 | ||||
| 0 | 1 | 0 | ||||
| 0 | 0 | 1 | ||||
| 1 | 1 | 0 | ||||
| 1 | 0 | 1 | ||||
| 0 | 1 | 1 | ||||
| 1 | 1 | 1 | ||||
| 0 | 0 | 0 | ||||
| 1 | 0 | 0 | ||||
| 0 | 1 | 0 | ||||
| 0 | 0 | 1 | ||||
| 1 | 1 | 0 | ||||
| 1 | 0 | 1 | ||||
| 0 | 1 | 1 | ||||
| 1 | 1 | 1 | ||||
1Three screening tests R, S, T:
Screen-positive result: R = 1, T = 1, and/or S = 1
Screen-negative result: R = 0, T = 0, and S = 0
2V – Verification status:
V = 1 for verified participants (those who underwent the reference standard)
V = 0 for non-verified participants (those who did not undergo the reference standard).
3D – Condition status:
D = 1 for those who have the target condition (according to the reference standard result)
D = 0 for those who do not have the target condition (according to the reference standard result).
4T1_000, T1_100, T1_010, T1_001, T1_110, T1_101, T1_011, T1_111 are the total numbers of verified (V = 1) participants with each combination of screening test results. These totals can be found from the data. To satisfy the assumption that the prevalence of the target condition estimated in the subset of participants who are screen-negative and undergo verification applies to all screen-negatives, T1_000 > 0 must hold
5T0_000, T0_100, T0_010, T0_001, T0_110, T0_101, T0_011, T0_111 are the total numbers of non-verified (V = 0) participants with each combination of screening test results. These totals can be found from the data
6a, b, c, d, e, f, g, h, I, j, k, i, m, n, o, p are the numbers of verified participants (V = 1) with each of the combinations of screening test results, with (D = 1) or without (D = 0) the target condition. These frequencies are known from the data
7a’, b’, c’, d’, e’, f’, g’, h’, i’, j’, k’, l’, m’, n’, o’, p’ are numbers of non-verified participants (V = 0) with each of the combinations of screening test results, with (D = 1) or without (D = 0) the target condition. These frequencies are missing from the data, but can be estimated from the known values for the verified patients and the total numbers of non-verified patients with each combination of the screening test results
Accuracy Measure (Sensitivity, Specificity, PPV and NPV) Estimates, with 95% Confidence Intervals, Adjusted for Verification Bias
| Statistical Methods for Verification Bias Adjustment | ||||||
|---|---|---|---|---|---|---|
| Complete Case Analysis (Unadjusted Approach) | Begg and Greenes using Three Screening Tests | Multiple Imputation using Three Screening Tests (using standard logistic regression) | Multiple Imputation using Three Screening Tests, using penalised logistic regression (Firth’s method) | |||
| Prevalence of clinically significant prostate cancer | 9.6% | 4.5% | 4.7% | 4.0% | ||
53.0% (45.4 to 60.5%) | 24.1% (20.1 to 28.5%) | 24.1% (20.1 to 28.5%) | 24.1% (20.1 to 28.5%) | |||
Sens = 87.5% (61.7 to 98.4%) Spec = 50.7% (42.4 to 58.9%) PPV = 15.9% (9.0 to 25.2%) NPV = 97.4% (91.0 to 99.7%) | Sens = 87.9% (68.8 to 100.0%) Spec = 78.9% (74.8 to 82.9%) PPV = 16.4% (9.8 to 24.8%) NPV = 99.3% (98.0 to 100.0%) | Sens = 82.2% (58.9 to 88.0%) Spec = 78.8% (74.4 to 82.6%) PPV = 16.0% (10.0 to 24.9%) NPV = 98.9% (97.0 to 99.6%) | Sens = 87.5% (64.0 to 96.5%) Spec = 78.6% (74.2 to 82.4%) PPV = 14.4% (8.8 to 22.8%) NPV = 99.3% (97.6 to 99.8%) | |||
51.2% (43.6 to 58.8%) | 23.6% (19.7 to 28.0%) | 23.6% (19.7 to 28.0%) | 23.6% (19.7 to 28.0%) | |||
Sens = 56.3% (29.9 to 80.2%) Spec = 49.3% (41.1 to 57.6%) PPV = 10.6% (5.0 to 19.2%) NPV = 91.4% (83.0 to 96.5%) | Sens = 56.3% (31.7 to 78.3%) Spec = 78.0% (73.8 to 81.6%) PPV = 10.7% (5.2 to 18.1%) NPV = 97.4% (95.3 to 99.0%) | Sens = 53.7% (31.6 to 74.3%) Spec = 77.9% (73.5 to 81.7%) PPV = 10.6% (5.9 to 18.7%) NPV = 97.2% (94.7 to 98.5%) | Sens = 56.3% (33.2 to 76.9%) Spec = 77.8% (73.4 to 81.6%) PPV = 9.5% (5.1 to 17.0%) NPV = 97.7% (95.4 to 98.9%) | |||
21.1% (15.5 to 28.0%) | 9.7% (7.1 to 13.0%) | 9.7% (7.1 to 13.0%) | 9.7% (7.1 to 13.0%) | |||
Sens = 37.5% (15.2 to 64.6%) Spec = 80.7% (73.4 to 86.7%) PPV = 17.1% (6.6 to 33.6%) NPV = 92.4% (86.4 to 96.3%) | Sens = 36.4% (14.2 to 63.8%) Spec = 91.6% (88.5 to 94.0%) PPV = 16.9% (5.4 to 29.3%) NPV = 96.8% (94.5 to 98.4%) | Sens = 35.6% (17.1 to 58.0%) Spec = 91.5% (88.4 to 93.9%) PPV = 16.6% (8.1 to 31.4%) NPV = 96.6% (94.2 to 98.0%) | Sens = 37.5% (18.5 to 61.4%) Spec = 91.5% (88.3 to 93.9%) PPV = 15.4% (7.2 to 29.7%) NPV = 97.3% (95.0 to 98.5%) | |||
Accuracy measure (sensitivity, specificity, PPV and NPV) estimates adjusted for verification bias, using complete cases analysis (unadjusted approach), Begg and Greenes using three screening tests, multiple imputation using three screening tests, and multiple imputation using three screening tests and penalised logistic regression (Firth’s correction) (IP1-PROSTAGRAM trial data [12])