Literature DB >> 35615612

An Introduction to Statistics: Diagnostic Tests.

Abstract

Diagnostic tests are used to differentiate between those with and without disease. In this article, we examine some of the properties of diagnostic tests, such as sensitivity, specificity, predictive values, and receiver operating characteristic curves. How to cite this article: Ranganathan P. An Introduction to Statistics: Diagnostic Tests. Indian J Crit Care Med 2021; 25(Suppl 3):S283-S284.

Entities: Chemical

Keywords: Diagnostic test; Sensitivity; Specificity

Year: 2021 PMID： 35615612 PMCID： PMC9108787 DOI： 10.5005/jp-journals-10071-24070

Source DB: PubMed Journal: Indian J Crit Care Med ISSN： 0972-5229

Introduction

Various tests are used in the field of medicine to diagnose clinical conditions. For example, in a previous issue of this journal, Abdelshafey studied the role of presepsin levels in the early identification of sepsis.[1] In this article, we look at some of the statistical properties of diagnostic tests. Diagnostic tests are used to distinguish between individuals who have and do not have disease. Many conditions have a gold standard test—for example, liver biopsy and histopathology to diagnose liver cirrhosis. However, the test may have certain limitations: they may be invasive, risky, time-consuming, or expensive. Therefore, an alternate test that overcomes these limitations may be used in lieu of the gold standard—for example, ultrasound elastography of the liver. The new test needs to be compared to the gold standard to ensure adequate and acceptable performance.

Sensitivity of a Test

The sensitivity of a test refers to the ability of a test to detect disease when it is present. In other words, it is the probability that the test will be positive in an individual with disease. Sensitivity = Number of individuals with test positive/Number of individuals with disease A highly sensitive test is likely to be positive in almost all individuals with disease, and therefore if it is negative, it is useful to rule out disease. If a test has a sensitivity of 95%, it means that out of 100 individuals with disease, the test will be positive in 95 and only 5 of them will have a “false-negative” test. For such a test, a negative result is highly likely to be associated with the absence of disease. In the study by Abdelshafey, presepsin with a cutoff value >640 pg/mL had a diagnostic accuracy of identifying septic cases with sensitivity of 73%.[1] This means that out of 100 patients with early sepsis, 73 would have a presepsin value of 640 pg/mL or more whereas 27 would have a presepsin level lower than 640 pg/mL and be incorrectly labeled as not in early sepsis.

Specificity of a Test

The specificity of a test refers to the ability of the test to rule out disease when it is absent. It is the probability that the test will be negative in an individual without disease. Specificity = Number of individuals with test negative/Number of individuals without disease A highly specific test is likely to be negative in almost all individuals without disease and therefore, if it is positive, it is useful to rule in disease. A test with a specificity of 95% means that out of 100 individuals without disease, the test will be negative in 95 and only 5 individuals will have a “false-positive test.” For example, in the previously quoted study, presepsin level of 640 pg/mL or more had a specificity of 92% for detecting early sepsis.[1] This means that out of 100 patients without early sepsis, 92 would have a presepsin level less than 640 pg/mL, whereas only 8 of them would have a presepsin level higher than 640 pg/mL and be incorrectly labeled as at risk of sepsis. The choice of test would depend on whether sensitivity or specificity is the priority. For screening studies, where identification of the majority with disease is crucial, a highly sensitive test is preferred. However, for confirmation of diagnosis, especially if expensive or toxic treatment is planned, a highly specific test should be used. D-dimer is a sensitive test for ruling out pulmonary embolism. However, it is not specific and a confirmatory test such as a CT pulmonary angiography is required before starting thrombolytic therapy. Sensitivity and specificity are properties of the test and are useful parameters when we wish to determine the performance of the test against a gold standard—i.e., when we know whether the individual truly has disease or not. However, in clinical practice, we only know the results of the test, based on which we need to predict whether the individual has disease or not. For this, we use predictive values.

Positive Predictive Value

The positive predictive value (PPV) of a test is the probability that an individual with a positive test actually has disease. PPV = Number of individuals with disease/Number of individuals with test positive

Negative Predictive Value

The negative predictive value (NPV) of a test is the probability that an individual with a negative test does not have disease. NPV = Number of individuals without disease/Number of individuals with test negative Agarwal looked at the Sepsis in Obstetrics Score (SOS) value of 6 for predicting severe sepsis in pregnant women.[2] This cutoff had a PPV of 83% and a NPV of 65%. This means that of 100 women with a SOS of 6 or more, 83 would have severe sepsis (and 17 would not develop severe sepsis despite a high SOS). Similarly, out of 100 women with a SOS less than 6, 65 would not develop severe sepsis (and 35 would have severe sepsis despite a low SOS). Unlike sensitivity and specificity, which are properties of the test and are not affected by disease prevalence, NPV and PPV are dependent on disease prevalence.

Receiver Operating Characteristic (ROC) Curves

For tests where the results can give several values, the results of individuals with and without disease may have overlapping values. It becomes important to choose an appropriate cutoff to differentiate disease from no disease. In the presepsin example, it is essential to decide what cutoff value of presepsin will be chosen to define early sepsis. Choosing a very low cutoff will improve the sensitivity of the test (identify all with disease); however, some individuals without disease will also be classified as diseased (false positives). Choosing a high cutoff value will improve the specificity of the test (exclude those without disease) but may also result in some individuals with disease being missed (false negatives). As another example, procalcitonin levels are used to diagnose bacterial sepsis. A level of 2.0 ng/mL or more is very strongly predictive of sepsis (high specificity); however, this may not pick up all patients with sepsis (low sensitivity). Choosing a cutoff of 1.0 ng/mL will improve the sensitivity but may also include some patients who do not have sepsis (lower specificity). Lowering the cutoff even further to 0.5 ng/mL will result in very high sensitivity with poor specificity. Thus, the choice of cutoff is a play between the sensitivity and the specificity and is determined using a method known as the receiver operating characteristic (ROC) curve. The ROC curve plots sensitivity (on the Y-axis) vs 1—specificity (on the X-axis) and identifies the cutoff point that offers the best balance between sensitivity and specificity. The ability of a diagnostic test to differentiate between those with and without disease is given by the area under the ROC curve (AUROCC) and tests may be compared based on their AUROCC. A perfect test has an AUROCC of 1.0 and a worthless test has an AUROCC of 0.5. Values between 0.5 and 1.0 are considered moderate to excellent, with higher values representing a better test performance. In the study by Abdelshafey, presepsin with a cutoff value >640 pg/mL had an AUROCC of 0.85, which was higher than the SIRS (AUROCC 0.67) or qSOFA (AUROCC 0.65) criteria.[1] For further reading on this topic, readers are referred to more detailed articles.[3,4]

4 in total

1. Sepsis in Obstetrics Score (SOS) utility and validation for triaging patients with obstetric sepsis in the emergency department: Evidence from a low income health care setting.

Authors: Rachna Agarwal; Rajesh Kumar Yadav; Medha Mohta; Meera Sikka; Gita Radhakrishnan
Journal: Obstet Med Date: 2018-03-29