| Literature DB >> 26566275 |
Helen L Storey1, Ying Huang2, Chris Crudder1, Allison Golden1, Tala de los Santos1, Kenneth Hawkins1.
Abstract
Novel typhoid diagnostics currently under development have the potential to improve clinical care, surveillance, and the disease burden estimates that support vaccine introduction. Blood culture is most often used as the reference method to evaluate the accuracy of new typhoid tests; however, it is recognized to be an imperfect gold standard. If no single gold standard test exists, use of a composite reference standard (CRS) can improve estimation of diagnostic accuracy. Numerous studies have used a CRS to evaluate new typhoid diagnostics; however, there is no consensus on an appropriate CRS. In order to evaluate existing tests for use as a reference test or inclusion in a CRS, we performed a systematic review of the typhoid literature to include all index/reference test combinations observed. We described the landscape of comparisons performed, showed results of a meta-analysis on the accuracy of the more common combinations, and evaluated sources of variability based on study quality. This wide-ranging meta-analysis suggests that no single test has sufficiently good performance but some existing diagnostics may be useful as part of a CRS. Additionally, based on findings from the meta-analysis and a constructed numerical example demonstrating the use of CRS, we proposed necessary criteria and potential components of a typhoid CRS to guide future recommendations. Agreement and adoption by all investigators of a standardized CRS is requisite, and would improve comparison of new diagnostics across independent studies, leading to the identification of a better reference test and improved confidence in prevalence estimates.Entities:
Mesh:
Year: 2015 PMID: 26566275 PMCID: PMC4643909 DOI: 10.1371/journal.pone.0142364
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1PRISMA flowchart.
Study flow depicting search strategy, inclusion/exclusion criteria, and summary of systematic review.
Quality assessment of diagnostic accuracy studies.
| Domain | Criteria | Conclusion | Covariate |
|---|---|---|---|
| Patient selection | A consecutive or random sample of patients was enrolled | If no, then | High compared to low |
| A case-control design was avoided | If no, then | High compared to low | |
| The study avoided inappropriate exclusions | If no, then | High compared to low | |
| The included patients were individuals suspected of having typhoid fever and the diagnostics were used to diagnose the patients | If no, then | ||
| Index test | A threshold for the test result was pre-specified | If no, then | High compared to low |
| The test result was interpreted without knowledge of the reference test | If no, then | High compared to low | |
| The index test aimed to diagnose acute typhoid fever | If no, then | ||
| Reference test | The reference standard was likely to correctly classify the target condition of acute typhoid fever | If no, then | High compared to low |
| The reference test was interpreted without knowledge of the index test | If no, then | High compared to low | |
| The reference standard aimed to diagnose acute typhoid fever | If no, then | ||
| Flow and timing | There was an appropriate interval between index test and reference test | If no, then | High compared to low |
| All patients received a reference standard | If no, then | High compared to low | |
| All patients received the same reference standard | If no, then | High compared to low | |
| All patients included in the analysis | If no, then | High compared to low |
Summary of variables included in the QUADAS-2 tool assessing the quality of diagnostic accuracy studies. The criteria determined a study’s risk of bias or concern of applicability. When the domain-specific criteria were not met, the study had a high risk of bias or concern of applicability with respect to that domain. When the domain-specific criteria were all unclear, the risk of bias or concern of applicability was unclear.
1 The currently available tests to detect typhoid fever are not sufficiently accurate; therefore, this question was problematic.
2 “Unclear” = missing.
Fig 2Comparisons by reference test.
Summary of the 139 papers by reference test, including 413 index/reference comparisons. Of the culture reference tests, 80% were blood culture, making up 57% of all reference tests.
Meta-analysis results by study quality.
| Index test | # of comparisons (studies) | Sensitivity (95% CI) | p-value | Specificity (95% CI) | p-value | Diagnostic odds ratio (95% CI) | Any high concern of applicability (%) | |||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||
|
| 7 | 0.93 | (0.83, 1.00) | 0.69 | 0.95 | (0.88, 1.00) | 0.04 | |||
| No | 9 | 0.98 | (0.95, 1.00) | 0.76 | (0.52, 1.00) | |||||
|
| 1 | 0.4 | (-0.37, 1.00) | 0.02 | 0.93 | (0.67, 1.00) | 0.12 | |||
| No | 11 | 0.96 | (0.91, 1.00) | 0.92 | (0.82, 1.00) | |||||
|
| 1 | 0.99 | (0.95, 1.00) | 0.001 | 0.59 | (-0.37, 1.00) | 0.71 | |||
| No | 11 | 0.97 | (0.92, 1.00) | 0.82 | (0.64, 1.00) | |||||
|
| 2 | 0.83 | (0.43, 1.00) | 0.49 | 0.94 | (0.75, 1.00) | 0.11 | |||
| No | 14 | 0.97 | (0.94, 1.00) | 0.86 | (0.72, 1.00) | |||||
|
|
|
|
|
|
|
|
|
| ||
|
| 16 | 0.91 | (0.87, 0.95) | 0.25 | 0.91 | (0.86, 0.96) | 0.96 | |||
| No | 10 | 0.77 | (0.68, 0.86) | 0.74 | (0.61, 0.87) | |||||
|
| 12 | 0.89 | (0.84, 0.95) | 0.28 | 0.92 | (0.85, 0.98) | 0.17 | |||
| No | 18 | 0.76 | (0.69, 0.84) | 0.89 | (0.82, 0.95) | |||||
|
| 0 | |||||||||
| No | 16 | 0.83 | (0.75, 0.89) | 0.81 | (0.71, 0.88) | |||||
|
| 6 | 0.89 | (0.79, 0.98) | 0.37 | 0.89 | (0.78, 1.00) | 0.34 | |||
| No | 27 | 0.83 | (0.77, 0.89) | 0.89 | (0.83, 0.94) | |||||
|
|
|
|
|
|
|
|
|
| ||
|
| 6 | 0.77 | (0.60, 0.94) | 0.94 | 0.91 | (0.86, 0.96) | 0.01 | |||
| No | 6 | 0.71 | (0.51, 0.91) | 0.86 | (0.81, 0.92) | |||||
|
| 3 | 0.55 | (0.22, 0.88) | 0.13 | 0.95 | (0.91, 0.98) | 0.07 | |||
| No | 9 | 0.79 | (0.67, 0.91) | 0.85 | (0.80, 0.89) | |||||
|
| 0 | |||||||||
| No | 12 | 0.75 | (0.59, 0.85) | 0.88 | (0.84, 0.92) | |||||
|
| 5 | 0.67 | (0.44, 0.90) | 0.24 | 0.9 | (0.85, 0.96) | 0.01 | |||
| No | 7 | 0.79 | (0.64, 0.93) | 0.87 | (0.82, 0.92) | |||||
|
|
|
|
|
|
|
|
|
| ||
|
| 7 | 0.79 | (0.68, 0.89) | 0.79 | 0.8 | (0.70, 0.90) | 0.02 | |||
| No | 5 | 0.68 | (0.53, 0.83) | 0.86 | (0.78, 0.95) | |||||
|
| 3 | 0.89 | (0.82, 0.97) | 0.7 | 0.77 | (0.61, 0.94) | 0.05 | |||
| No | 10 | 0.69 | (0.61, 0.77) | 0.85 | (0.78, 0.92) | |||||
|
| 0 | |||||||||
| No | 6 | 0.67 | (0.62, 0.71) | 0.87 | (0.79, 0.92) | |||||
|
| 3 | 0.89 | (0.82, 0.97) | 0.7 | 0.77 | (0.61, 0.94) | 0.05 | |||
| No | 10 | 0.69 | (0.61, 0.77) | 0.85 | (0.78, 0.92) | |||||
|
|
|
|
|
|
|
|
|
| ||
|
| 9 | 0.84 | (0.68, 0.99) | 0.57 | 0.87 | (0.75, 0.98) | 0.96 | |||
| No | 9 | 0.85 | (0.71, 0.99) | 0.79 | (0.63, 0.95) | |||||
|
| 2 | 0.5 | (-0.02, 1.00) | 0.14 | 0.79 | (0.47, 1.00) | 1 | |||
| No | 13 | 0.84 | (0.72, 0.95) | 0.8 | (0.68, 0.92) | |||||
|
| 0 | |||||||||
| No | 14 | 0.78 | (0.62, 0.89) | 0.85 | (0.71, 0.93) | |||||
|
| 5 | 0.63 | (0.34, 0.91) | 0.02 | 0.88 | (0.74, 1.00) | 0.58 | |||
| No | 15 | 0.89 | (0.81, 0.96) | 0.77 | (0.63, 0.91) | |||||
|
|
|
|
|
|
|
|
|
| ||
|
| 29 | |||||||||
| No | 31 | |||||||||
|
| 19 | 0.68 | (0.57, 0.79) | 0.24 | 0.88 | (0.81, 0.95) | 0.21 | |||
| No | 34 | 0.65 | (0.57, 0.74) | 0.79 | (0.71, 0.87) | |||||
|
| 0 | |||||||||
| No | 42 | 0.66 | (0.57, 0.73) | 0.84 | (0.76, 0.90) | |||||
|
| 13 | 0.73 | (0.59, 0.87) | 0.41 | 0.8 | (0.67, 0.93) | 0.04 | |||
| No | 52 | 0.68 | (0.60, 0.75) | 0.84 | (0.78, 0.90) | |||||
Summary diagnostic accuracies of index tests with five or more comparisons and blood culture as the reference test. Meta-analysis performed using bivariate random effects binomial regression.
1 Could not be determined.
Fig 3Meta-analysis results.
Graphical illustration of sensitivities (y-axis) and specificities (x-axis) corresponding to comparisons included in the meta-analysis: PCR-based assays (A), anti-LPS assays (B), TUBEX® assays (C), anti-S. typhi assays (D), Typhidot assays (E), Widal assays (F). Meta-analysis was performed using bivariate random effects binomial regression (STATA command: metandi). Sizes of individual study estimates (grey circle) represent sample size. Summary point (red square), hierarchical summary receiver operating characteristic curves (green line), 95% confidence regions (yellow dashed line), and 95% prediction regions (grey dashed line) are depicted.
Constructed numerical example.
| All tests independent conditional on disease status | Index test conditionally dependent on test A among diseased (correlation = 0.4) and independent of test B | Index test conditionally dependent on test B among both diseased and non-diseased (correlation = 0.4) and independent of test A | Index test conditionally dependent on test B among both diseased and non-diseased (correlation = 0.7) and independent of test A | |||||
|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Sensitivity | Specificity | Sensitivity | Specificity | Sensitivity | Specificity | |
| Index test compared to test A | 80% | 82.2% | 96.0% | 84.0% | NA | NA | NA | NA |
| Index test compared to test B | 51.0% | 87.0% | NA | NA | 66.8% | 93.5% | 78.6% | 98.3% |
| Index test compared to CRS | 52.5% | 85.2% | 53.2% | 85.4% | 65.6% | 89.4% | 75.4% | 92.6% |
Assumed sensitivity and specificity of the three tests: index test, 80% and 90%; test A, 50% and 100%; test B, 85% and 85%. Comparing the index test to a CRS = (fever) AND ((test A positive) OR (test B positive)). Fever, test A, and test B are independent conditional on disease status. Index test is independent of fever conditional on disease status.