| Literature DB >> 23413409 |
Penny Whiting1, Richard M Martin, Yoav Ben-Shlomo, David Gunnell, Jonathan A C Sterne.
Abstract
Interpreting information on diagnostic accuracy is an area that health professionals struggle with. In this paper, we use the example of Mr Samways, a 45-year-old man with joint symptoms, to illustrate how to apply the results of a diagnostic accuracy study in clinical practice. We consider the various measures used to quantify diagnostic accuracy and discuss their clinical utility. We provide an overview of potential biases to consider when evaluating a diagnostic accuracy study and consider how to determine whether the results can be applied to a particular patient.Entities:
Year: 2013 PMID: 23413409 PMCID: PMC3572661 DOI: 10.1258/shorts.2012.012089
Source DB: PubMed Journal: JRSM Short Rep ISSN: 2042-5333
Figure 1Fagan's nomogram[9]
| General table and formulae | Example: anti-CCP for diagnosing RA | ||
|---|---|---|---|
| True positives | People with the target condition who have a positive test result | TP | 82 |
| True negatives | People without the target condition who have a negative test result | TN | 301 |
| False-positives | People without the target condition who have a positive test result | FP | 13 |
| False-negatives | People with the target condition who have a negative test result | FN | 71 |
| Sensitivity | Proportion of patients with the target condition who have a positive test result | TP/(TP + FN) | 82/(82 + 71) = 54% |
| Specificity | Proportion of patients without the target condition who have a negative test result | TN/(FP + TN) | 301/(13 + 301) = 96% |
| Positive predictive value (PPV) | Probability that a patient with a positive test result has the target condition | TP/(TP + FP) | 82/(82 + 13) = 86% |
| Negative predictive value (NPV) | Probability that a patient with a negative test result does not have the target condition | TN/(FN + TN) | 301/(71 + 301) = 81% |
| Prevalence | The proportion of patients in the whole study population who have the target condition | (TP + FN)/(TP + FP + FN + TN) | (82 + 71)/(82 + 13 + 71 + 301) = 33% |
| Positive likelihood ratio (LR+) | The number of times more likely a person with the target condition is to have a positive test result compared with a person without the target condition | (TP/(TP + FN))/(FP/(FP + TN)) or sensitivity/(1 – specificity) | 0.54/(1–0.96) = 13.5 |
| Negative likelihood ratio (LR−) | The number of times more likely a person with the target condition is to have a negative test result compared with a person without the target condition | (FN/(TP + FN))/(TN/(FP + TN)) or (1 – sensitivity)/specificity | (1–0.54)/0.96 = 0.48 |
| General description of source of bias | Potential effects in anti-CCP example |
|---|---|
| Consecutive patients, or a random sample of patients, with suspected disease should be enrolled and criteria for enrolment should be clearly stated. Studies that avoid inclusion of ‘difficult to diagnose’ patients or ‘grey cases’ (in whom diagnostic tests are often most useful) may result in overoptimistic estimates of accuracy (‘spectrum bias’) (Supplementary Figure S1). | A study that enrols patients with definite RA and a control group of healthy people without symptoms may produce overoptimistic estimates of sensitivity and specificity that exaggerate the utility of the test in clinical practice. |
| The results of the index test should be interpreted without knowledge of the results of the reference standard, because knowledge of the reference standard may lead to inflated measures of accuracy (test review bias). The testing sequence and degree of subjectiveness in test interpretation will impact on the potential for bias. | As anti-CCP is a biochemical test the potential for bias is less than had it involved more subjective interpretation (e.g. X-ray). The anti-CCP test result will probably be interpreted without knowledge of whether included patients have RA, because the reference standard (ACR criteria) is applied after a period of follow-up. |
| Estimates of diagnostic accuracy assume that the reference standard is measured without error (100% sensitive and specific). Disagreements between the reference standard and index test are assumed to result from incorrect test results. In practice, a perfect reference standard may not exist. For example, interpretation of a reference standard such as this may be influenced by knowledge of the index test result (diagnostic review bias). A related source of bias is when the reference standard consists of compound criteria that include the index test (incorporation bias). | There is no definite way to make an early diagnosis of RA. The ACR criteria are applied some time after the anti-CCP test and could therefore be influenced by knowledge of test results; an explicit statement that the person interpreting the ACR criteria was blinded to the anti-CCP test results is therefore required. Incorporation bias was avoided as the ACR criteria did not include anti-CCP when the study was conducted. |
| Ideally all patients should undergo both the index test and reference standard within a short time, and all should be included in the analysis or accounted for. There is a potential for bias if the number of patients enrolled differs from the number included in the 2 × 2 table. A potential consequence of withdrawals is verification bias, which occurs when the index test result influences patients’ probability of receiving the reference standard, or receiving a different reference standard. If there is a delay between application of the index test and reference standard, misclassification due to recovery or progression to a more advanced stage of disease may occur (disease progression bias). | As the reference standard is not costly or invasive it is unlikely that the decision to apply it will be influenced by a patient's anti-CCP result. All patients enrolled into the study should be included in the 2 × 2 table, any omissions, should be explained. The reference standard incorporates a period of follow-up, so that a minimum rather than maximum period between index test and reference standard is required. |
CCP, cyclic citrullinated peptide; RA, rheumatoid arthritis; ACR, American College of Rheumatology
Interpretation of likelihood ratios (LRs) with some examples.
| LR | Effect on likelihood of the target condition | Example | Value | |
|---|---|---|---|---|
| Positive likelihood ratio | >10 | Strong increase | CAGE questionnaire, 4 positive | 25 |
| Positive anti-CCP (RA)2 | 13.5 | |||
| 5–10 | Moderate increase | High probability ventilation-perfusion | 7.3 | |
| 2–5 | Small increase | Ultrasound (pancreatic cancer)3 | 4.7 | |
| 1–2 | Minimal increase | Free/total (f/t) prostate-specific antigen | 1.7 | |
| 1 | No change | |||
| Negative likelihood ratio | 0.5–1.0 | Minimal decrease | No loss of urine with coughing/exercise | 0.74 |
| 0.2–0.5 | Small decrease | Negative anti-CCP (ruling out RA)2 | 0.48 | |
| <0.2 | Strong decrease | Normal ventilation-perfusion scan | 0.05 |