| Literature DB >> 31571244 |
Hayley E Jones1, Constantine A Gatsonsis2,3, Thomas A Trikalinos3, Nicky J Welton1, A E Ades1.
Abstract
Tests for disease often produce a continuous measure, such as the concentration of some biomarker in a blood sample. In clinical practice, a threshold C is selected such that results, say, greater than C are declared positive and those less than C negative. Measures of test accuracy such as sensitivity and specificity depend crucially on C, and the optimal value of this threshold is usually a key question for clinical practice. Standard methods for meta-analysis of test accuracy (i) do not provide summary estimates of accuracy at each threshold, precluding selection of the optimal threshold, and furthermore, (ii) do not make use of all available data. We describe a multinomial meta-analysis model that can take any number of pairs of sensitivity and specificity from each study and explicitly quantifies how accuracy depends on C. Our model assumes that some prespecified or Box-Cox transformation of test results in the diseased and disease-free populations has a logistic distribution. The Box-Cox transformation parameter can be estimated from the data, allowing for a flexible range of underlying distributions. We parameterise in terms of the means and scale parameters of the two logistic distributions. In addition to credible intervals for the pooled sensitivity and specificity across all thresholds, we produce prediction intervals, allowing for between-study heterogeneity in all parameters. We demonstrate the model using two case study meta-analyses, examining the accuracy of tests for acute heart failure and preeclampsia. We show how the model can be extended to explore reasons for heterogeneity using study-level covariates.Entities:
Keywords: Box-Cox transformation; ROC curve; evidence synthesis; sensitivity; specificity; test cutoff
Mesh:
Substances:
Year: 2019 PMID: 31571244 PMCID: PMC6856843 DOI: 10.1002/sim.8301
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Test accuracy data from a study, indexed i, providing estimates of sensitivity and specificity at T distinct thresholds (C ,…, )
| Population | Total number of patients | Number with test result > | ⋯ | Number with test result >
|
|---|---|---|---|---|
| Disease‐free |
|
| ⋯ |
|
| Diseased |
|
| ⋯ |
|
Figure 1Observed data on the accuracy of Brain Natriuretic Peptide (Triage assay only) in diagnosing acute heart failure across the full observed range of thresholds. Points from the same study are joined. tpr = true positive rate (sensitivity), fpr = false positive rate (1‐specificity). Also shown are point estimates with 95% credible intervals from a series of stratified bivariate meta‐analyses, in which similar thresholds are grouped and analysed together [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 2Observed data on the accuracy of spot urinary protein to creatinine ratio in detecting significant proteinuria in suspected preeclampsia. Points from the same study are joined. tpr = true positive rate (sensitivity), fpr = false positive rate (1‐specificity). Also shown are summary point estimates with 95% confidence intervals from an analysis by Riley et al15 [Colour figure can be viewed at http://wileyonlinelibrary.com]
Comparison of model fit to the Brain natriuretic peptide data, according to the Deviance Information Criteria (DIC). = mean residual deviance, pD = effective number of parameters, DIC = + pD
| Model | Correlation structure | Transformation,
| Study level covariates |
| pD | DIC |
|---|---|---|---|---|---|---|
| 1 | Full correlation matrix | log() | None | 216.7 | 41.0 | 257.7 |
| 2 | Structured correlation matrix | log() | None | 215.0 | 40.8 | 255.8 |
| 3 | Independence | log() | None | 214.1 | 42.5 | 256.6 |
| 4 | Independence | Box‐Cox with unknown | None | 211.3 | 43.6 | 254.9 |
| 5 | Independence | log() |
| 214.7 | 42.9 | 257.6 |
Figure 3Summary true positive rate (tpr) and false positive rate (fpr) estimates (Models 1‐4) for the Brain natriuretic peptide data across the full range of thresholds. 95% credible intervals and prediction intervals shown are from Model 3 [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 4Relationship between average patient age and false positive rate of Brain Natriuretic Peptide (Triage assay) in diagnosing acute heart failure (Model 5 results). Top panel: summary false positive rate across all thresholds for age 60 and age 80. Bottom panel: summary false positive rate at a threshold of 100 ng/litre, by average patient age. Shaded areas are 95% credible intervals [Colour figure can be viewed at http://wileyonlinelibrary.com]
Comparison of model fit to the protein to creatinine ratio data, according to the Deviance Information Criteria (DIC). = mean residual deviance, pD = effective number of parameters, DIC = + pD
| Model | Correlation structure | Transformation,
|
| pD | DIC |
|---|---|---|---|---|---|
| 1 | Full correlation matrix | log() | 143.5 | 33.7 | 177.2 |
| 2 | Structured correlation matrix | log() | 146.1 | 34.0 | 180.1 |
| 3 | Independence | log() | 144.9 | 33.5 | 178.4 |
| 4 | Independence | Box‐Cox with unknown | 148.1 | 34.4 | 182.5 |
Figure 5Summary true positive rate (tpr) and false positive rate (fpr) estimates (Models 1‐4) for the protein to creatinine ratio data across the full range of thresholds. 95% credible intervals and prediction intervals shown are from Model 3 [Colour figure can be viewed at http://wileyonlinelibrary.com]