| Literature DB >> 19014457 |
Angel M Cronin1, Andrew J Vickers.
Abstract
BACKGROUND: A common feature of diagnostic research is that results for a diagnostic gold standard are available primarily for patients who are positive for the test under investigation. Data from such studies are subject to what has been termed "verification bias". We evaluated statistical methods for verification bias correction when there are few false negatives.Entities:
Mesh:
Year: 2008 PMID: 19014457 PMCID: PMC2600821 DOI: 10.1186/1471-2288-8-75
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Example of data subject to verification bias.
Examples of data subject to verification bias and with a low number of false negatives
| Example | Number with normal imaging result and biopsied | Corrected Sensitivity* (%) | ||
| Diseased (False negatives) | Nondiseased (True negatives) | Total | ||
| 1 | 5 | 35 | 40 | 57 |
| 2 | 4 | 36 | 40 | 63 |
| 3 | 3 | 37 | 40 | 69 |
| 4 | 2 | 38 | 40 | 77 |
| 5 | 1 | 39 | 40 | 87 |
| 6 | 0 | 40 | 40 | 100 |
* Corrected for verification bias
Probability of verification used in the simulations for each decile of the diagnostic test result
| Decile of diagnostic test result | Probability of having the gold standard result (%) | ||
| 10% verified | 30% verified | 60% verified | |
| 1 | 0.6% | 3.0% | 16.0% |
| 2 | 0.9% | 4.9% | 23.9% |
| 3 | 1.5% | 7.8% | 34.2% |
| 4 | 2.5% | 12.2% | 46.1% |
| 5 | 4.1% | 18.7% | 58.5% |
| 6 | 6.5% | 27.5% | 69.9% |
| 7 | 10.3% | 38.5% | 79.3% |
| 8 | 15.9% | 50.7% | 86.3% |
| 9 | 23.8% | 62.9% | 91.2% |
| 10 | 34.0% | 73.7% | 94.5% |
| 1 | 6.4% | -- | 0.0% |
| 2 | 7.0% | -- | 0.0% |
| 3 | 7.7% | -- | 0.1% |
| 4 | 8.5% | -- | 7.6% |
| 5 | 9.3% | -- | 92.4% |
| 6 | 10.2% | -- | 99.9% |
| 7 | 11.1% | -- | 100.0% |
| 8 | 12.1% | -- | 100.0% |
| 9 | 13.2% | -- | 100.0% |
| 10 | 14.4% | -- | 100.0% |
Example of data generated under the setting of a screening study with 5000 participants and the underlying incidence of disease being 10%
| (1) | (2) | ||||
| Data Set | Number with negative diagnostic test and verified | Number of false negatives | Proportion of false negatives (2)/(1) | AUC | |
| Categorize in 10 bins | Alonzo-Pepe | ||||
| Fully verified | 2500 | 99 | 4.0% | 0.750 | 0.750 |
| With verification bias | |||||
| Replication 1 | 50 (2.0%) | 2 | 4.0% | 0.690 | 0.712 |
| Replication 2 | 64 (2.6%) | 4 | 6.3% | 0.852 | 0.857 |
| Replication 3 | 40 (1.6%) | 2 | 5.0% | 0.550 | 0.546 |
| Replication 4 | 61 (2.4%) | 0 | 0.0% | 0.812 | 0.826 |
| Replication 5 | 65 (2.6%) | 3 | 4.6% | 0.790 | 0.803 |
In the fully verified data set, definitive test results were known for all 5000 participants. In the replications with verification bias, only 500 (10%) participants underwent definitive testing. False negatives are defined as verified participants with a diagnostic test result less than the median of the diagnostic test results and with a positive gold standard result. The AUC of the fully verified data set was 0.750; all other estimates are with correction for verification bias.
Examples of estimates of verification bias corrected AUC when varying the true value of the AUC
| AUC | ||||
| Fully verified (True value) | 0.600 | 0.700 | 0.800 | 0.900 |
| With verification bias correction | ||||
| Replication 1 | 0.284 | 0.503 | 0.688 | 0.873 |
| Replication 2 | 0.754 | 0.838 | 0.742 | 0.894 |
| Replication 3 | 0.567 | 0.578 | 0.740 | 0.873 |
| Replication 4 | 0.689 | 0.779 | 0.883 | 0.920 |
| Replication 5 | 0.648 | 0.755 | 0.856 | 0.827 |
Simulation study with 2000 replications
| Percentage Verified | Proportion with | AUC over 2000 replications | Coverage Probability over 2000 replications | ||||
| Mean | 2.5th – 97.5th percentile | ||||||
| True | n/a | n/a | n/a | n/a | 0.750 | 0.728, 0.774 | 95% |
| 10% | 83% | 16% | 2% | 0% | 0.758 | 0.577, 0.860 | 77% |
| 30% | 41% | 37% | 17% | 6% | 0.752 | 0.677, 0.813 | 89% |
| 60% | 1% | 6% | 12% | 80% | 0.750 | 0.713, 0.786 | 93% |
| 10% | 25% | 35% | 23% | 16% | 0.751 | 0.677, 0.820 | 93% |
| 60% | 98% | 3% | 0% | 0% | 0.728 | 0.552, 0.819 | 67% |
5000 participants are enrolled in a screening study with an underlying incidence of disease of 10%. False negatives are defined as verified participants with a diagnostic test result less than the 20th percentile of the diagnostic test results and with a positive gold standard result.