| Literature DB >> 26315894 |
Joris Menten1,2, Emmanuel Lesaffre3.
Abstract
BACKGROUND: Selecting the most effective diagnostic method is essential for patient management and public health interventions. This requires evidence of the relative performance of alternative tests or diagnostic algorithms. Consequently, there is a need for diagnostic test accuracy meta-analyses allowing the comparison of the accuracy of two or more competing tests. The meta-analyses are however complicated by the paucity of studies that directly compare the performance of diagnostic tests. A second complication is that the diagnostic accuracy of the tests is usually determined through the comparison of the index test results with those of a reference standard. These reference standards are presumed to be perfect, i.e. allowing the classification of diseased and non-diseased subjects without error. In practice, this assumption is however rarely valid and most reference standards show false positive or false negative results. When an imperfect reference standard is used, the estimated accuracy of the tests of interest may be biased, as well as the comparisons between these tests.Entities:
Mesh:
Year: 2015 PMID: 26315894 PMCID: PMC4552463 DOI: 10.1186/s12874-015-0061-7
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Tabulation of an hypothetical diagnostic test accuracy meta-analysis. Columns T 1,T 2,T 3 indicate results for the 3 possible index tests. Columns T 4,T 5 indicate results for the 2 possible reference tests. + indicates a positive test result, - a negative test result. NA indicates that the test was not performed in that particular study. The observed frequency column report the number of subjects with a specific test result pattern in each study
| Study | Index tests | Reference | Observed | |||
|---|---|---|---|---|---|---|
| Nr. |
|
|
|
|
| Frequency |
| 1 | + | + | NA | + | NA | 30 |
| 1 | + | + | NA | - | NA | 1 |
| 1 | + | - | NA | + | NA | 3 |
| 1 | + | - | NA | - | NA | 6 |
| 1 | - | + | NA | + | NA | 0 |
| 1 | - | + | NA | - | NA | 3 |
| 1 | - | - | NA | + | NA | 8 |
| 1 | - | - | NA | - | NA | 160 |
| 2 | + | + | + | NA | + | 49 |
| … | … | … | … | … | … | … |
| 2 | - | - | - | NA | - | 99 |
| 3 | + | NA | NA | + | NA | 115 |
| … | … | … | … | … | … | … |
| 3 | - | NA | NA | - | NA | 244 |
| 4 | + | NA | NA | NA | + | 11 |
| … | … | … | … | … | … | … |
| 4 | - | NA | NA | NA | - | 19 |
| 5 | NA | + | NA | + | NA | 66 |
| … | … | … | … | … | … | … |
| 5 | NA | - | NA | - | NA | 29 |
| 6 | + | NA | + | NA | + | 27 |
| … | … | … | … | … | … | … |
| 6 | - | NA | - | NA | - | 56 |
| 7 | NA | + | + | NA | + | 77 |
| … | … | … | … | … | … | … |
| 7 | NA | - | - | NA | - | 13 |
| 8 | + | + | + | NA | NA | 143 |
| … | … | … | … | … | … | … |
| 8 | - | - | - | NA | NA | 85 |
Description of the different models. Example code for the models is given in Additional file 2. S and C represent the sensitivity and specificity of test j in study i
| Model | Reference standard | Model estimation |
|---|---|---|
| 1 | Assumed to be perfect | Independent estimation of |
| 2 | Assumed to be perfect | Direct comparisons only |
| 3 | Assumed to be perfect | Direct and indirect comparisons |
| 4 | Allowing for imperfect reference standards | Hierarchical latent class model |
| 5 | Allowing for imperfect reference standards | Network-based latent class model |
Overview of the real data example: a comparative meta-analysis of the RK39 dipstick and direct agglutination test (DAT) for the diagnosis of visceral leishmaniasis. The total sample size (N) and availability of test results (X) is given for all 10 studies. Other tests: IFAT=indirect fluorescent antibody test, KAtex =latex agglutination test, spleen =parasitological examination of tissue aspirates including spleen sample, no spleen: parasitological examination of tissue aspirates not including spleen sample
| Study information | Index tests | Reference test | ||||||
|---|---|---|---|---|---|---|---|---|
| Publication | Country | RK39 | DAT | KAtex | IFAT | Spleen | No Spleen | N |
| Boelaert-1999 | Sudan | X | X | X | 59 | |||
| Boelaert-2004 | Nepal | X | X | X | X | 309 | ||
| Boelaert-2008 | Nepal | X | X | X | X | 158 | ||
| Boelaert-2008 | India | X | X | X | X | 352 | ||
| Boelaert-2008 | Kenya | X | X | X | X | 307 | ||
| Boelaert-2008 | Ethiopia | X | X | X | X | 35 | ||
| Boelaert-2008 | Sudan | X | X | X | X | 291 | ||
| de Assis-2012 | Brazil | X | X | X | X | 407 | ||
| Toz-2004 | Turkey | X | X | X | 42 | |||
| Veeken-2003 | Sudan | X | X | X | 77 | |||
Fig. 1Forest plot for real data example. Estimated sensitivity and specificity of the RK39 dipstick (open circles) and DAT (closed squares) with 95 % confidence interval, using parasitology as gold standard
Fig. 2Summary of simulation results. Bias in estimates of the contrasts in diagnostic accuracy from the proposed meta-analytical models applied in the simulation study. The boxplots present the bias in and (first row), and (second row), and (third row). The first column presents Scenario 1 where a common imperfect reference standard with moderate S and high C was used, the second scenario 2 where systematic bias is induced by differing reference standards. Full explanation of the model is in the text; full explanation of the simulation setup and results in Additional file 3. Note for Scenario 1: For models 1 to 3 disease status was estimated from the results of T 4. Note for Scenario 2: Models 4 and 5 were applied both assuming it is known that the reference tests differ across studies (4a and 5a) and ignoring the difference in reference tests (4b and 5b)
Results of the meta-analysis of the diagnostic tests for visceral leishmaniasis
| Parasitology as Gold Standard | No Gold Standard | ||||
|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | |
| Parameter | Estimate | Estimate | Estimate | Estimate | Estimate |
|
| 94.2 | 95.5 | 94.8 | 95.9 | 94.7 |
|
| 85.2 | 84.0 | 86.5 | 84.6 | 88.1 |
|
| 95.7 | 97.2 | 96.4 | 96.3 | 96.4 |
|
| 94.4 | 93.5 | 95.4 | 96.1 | 96.5 |
|
| 78.6 | 75.5 | 78.3 | 90.2 | 91.0 |
|
| 80.1 | 80.9 | 81.5 | 93.0 | 94.1 |
|
| 1.5 | 1.7 | 1.6 | 0.4 | 1.7 |
|
| 9.2 | 9.6 | 8.9 | 11.5 | 8.4 |
|
| 1.5 | 5.3 | 3.2 | 2.8 | 3.1 |
|
| 1.02 | 1.02 | 1.02 | 1.00 | 1.02 |
|
| 1.11 | 1.12 | 1.10 | 1.14 | 1.10 |
|
| 1.02 | 1.07 | 1.04 | 1.03 | 1.03 |
|
| 1.3 | 1.7 | 1.5 | 1.1 | 1.6 |
|
| 2.7 | 2.8 | 3.3 | 4.7 | 3.9 |
|
| 1.1 | 1.4 | 1.2 | 1.5 | 1.7 |
S and C : sensitivity and specificity of Test i; S and D : difference in sensitivity and specificity between Test 1 and Test 2; S and D : relative sensitivity and specificity of Test 1 compared to Test 2 as a relative risk; S and D : relative sensitivity and specificity of Test 1 compared to Test 2 expressed as an odds-ratio. R1 and R2 indicate estimates obtained for East-Africa and the rest of the world, respectively