| Literature DB >> 30083119 |
Marco Tommasi1, Grazia Ferrara1, Aristide Saggino1.
Abstract
The validity of clinical diagnoses is a fundamental topic in clinical psychology, because now there are some political administrations, as the IOM or the U.K. government, which are focusing on best evidence-based practice in clinical psychology. The most problematic issue in clinical psychology is to avoid wrong diagnoses which can have negative consequences on individual life and on the utility of clinical treatments. In the case of diagnoses based on self-report tests, the diagnostic decision about individual health is based on the comparison between its score and the cutoff, according to the frequentist approach to probability. However, the frequentist approach underestimates the possible risks of incorrect diagnoses based on cutoffs only. The Bayesian approach is a valid alternative to make diagnoses on the basis of the scores from psychological tests. The Bayes' theorem estimates the posterior probability of the presence of a pathology on the basis of the knowledge about the diffusion of this pathology (prior probability) and of the knowledge of sensitivity and specificity values of the test. With all this information, it is possible to estimate the diagnostic accuracy of some self-report tests used for assessing depression. We analyzed the diagnostic accuracy of the most used psychological tests of depression (Zung's Self-Rating Depression Scale, Hamilton Rating Scale for Depression, Center for Epidemiological Studies for Depression and the Beck Depression Inventory), together with a new scale (Teate Depression Inventory) developed with the IRT procedure, by analyzing the published works in which data about sensitivity and specificity of these scales are reported. Except the TDI, none of these scales can reach a satisfactory level of diagnostic accuracy, probably for the absence of an optimal procedure to select test items and subjects with clearly defined pathological symptoms which could allow the reduction of false positives in test scoring.Entities:
Keywords: Bayes' theorem; clinical psychology; depression; diagnostic accuracy; sensitivity and specificity
Year: 2018 PMID: 30083119 PMCID: PMC6064972 DOI: 10.3389/fpsyg.2018.01240
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Diagnostic accuracy of an hypothetical psychological test in relation to the different levels of TPs (P(A|B) values of the horizontal axis) and of FPs (series of curves with different line styles). Diagnostic accuracy is estimated with the Bayes' theorem (Equation 1). The selected levels of FPs were 30, 20, 10, 5, 2.5, 1, and 0.1%.
Studies using the five depression scales by authors, publication year, sample typology, sample size, sensitivity, specificity, FPs percentage, positive predictive value (PPV), negative predictive value (NPV), recommended or used cutoff, P(B) and P(A|B) value.
| BDI | Aben et al., | Medical | 202 | 80 | 61.4 | 22.2 | 95.7 | 10 | 0.41 | 0.13 | |
| Arnarson et al., | Psychiatric | Normal = 1454; Clinical = 248 | 82 | 75 | n.r. | n.r. | 20/21 | 0.29 | 0.19 | ||
| Arnau et al., | Medical | 333 | 94 | 92 | 54 | 99 | 18 | 0.14 | 0.46 | ||
| Berg et al., | Medical | 100 | 80 | 76 | n.r. | n.r. | 10 | 0.28 | 0.19 | ||
| Bunevicius et al., | Medical | 522 | 89 | 74 | 29 | 98 | 14 | 0.30 | 0.20 | ||
| Carney et al., | Medical | 140 | 81 | 79 | n.r. | n.r. | 17 | 0.25 | 0.22 | ||
| Cho and Kim, | Psychiatric | 164 | 90 | n.r. | 24 | n.r. | n.r. | n.r. | 0.28 | 0.21 | |
| Dolle et al., | Psychiatric | 141 | 100 | 77 | n.r. | n.r. | 23 | 0.28 | 0.24 | ||
| Dozois et al., | Normal | 1022 | 81 | 92 | n.r. | n.r. | 13 | 0.13 | 0.42 | ||
| Dutton et al., | Medical | 220 | 87.7 | 83.9 | 69.5 | 94.2 | 14 | 0.21 | 0.28 | ||
| Frasure-Smith and Lespérance, | Medical | 804 | 91.2 | 77.5 | n.r. | n.r. | 14 | 0.27 | 0.23 | ||
| Gorenstein et al., | Normal | n.r. | 70 | 87 | 84.3 | 77 | 10 | 0.17 | 0.28 | ||
| Jones et al., | Medical | 174 | 95.7 | 78.3 | 42 | 99 | 11 | 0.27 | 0.24 | ||
| Kang et al., | Medical | 423 | 92 | 84 | n.r. | n.r. | 11 | 0.21 | 0.29 | ||
| Kapci et al., | Psychiatric | Normal = 362; Clinical = 151 | 77 | 76 | n.r. | n.r. | 19 | 0.28 | 0.19 | ||
| Krefetz et al., | Psychiatric | 100 | 74 | 70 | n.r. | n.r. | 24 | 0.33 | 0.15 | ||
| Kumar et al., | Psychiatric | 100 | 85 | 83 | 85 | 83 | 21 | 0.22 | 0.26 | ||
| Leentjens et al., | Medical | 53 | 67 | 88 | 62 | 90 | 13/14 | 0.16 | 0.29 | ||
| Lincoln et al., | Medical | 143 | 91 | 56 | n.r. | n.r. | 16 | 0.47 | 0.13 | ||
| Low and Hubley, | Medical | 119 | 83 | 88 | 21 | 100 | 14 | 0.17 | 0.33 | ||
| Lykouras et al., | Medical | 150 | 85.7 | 86.7 | 40 | 97.5 | 29 | 0.18 | 0.32 | ||
| Osman et al., | Psychiatric | Normal = 414; Clinical = 167 | 86.8 | 56.8 | n.r. | n.r. | 10 | 0.46 | 0.13 | ||
| Pérez-Stable et al., | Psychiatric | Normal = 195; Clinical = 70 | 91 | n.r. | 58 | n.r. | n.r. | 10 | 0.60 | 0.10 | |
| Perry and Gilbody, | Normal | 1166 | 65.9 | 67.9 | n.r. | n.r. | 21 | 0.34 | 0.13 | ||
| Pohjasvaara et al., | Medical | 390 | 73 | 67 | n.r. | n.r. | 10 | 0.36 | 0.14 | ||
| Rampling et al., | Medical | 266 | 93.8 | 78.9 | 49.5 | 98 | 15/16 | 0.26 | 0.24 | ||
| Psychiatric | Normal = 57; Clinical = 61 | 97 | 77 | n.r. | n.r. | 5 | 0.28 | 0.23 | |||
| Seignourel et al., | Psychiatric | 582 | 81 | 61 | 45 | 91 | 21 | 0.42 | 0.13 | ||
| Shean and Baldwin, | Normal | 395 | 73.3 | 84.4 | 47.8 | 94.2 | 10 | 0.19 | 0.25 | ||
| Sprinkle et al., | Normal | 137 | 90 | n.r. | 26 | n.r. | n.r. | 14 | 0.30 | 0.20 | |
| Strik et al., | Medical | 199 | 83.8 | 71.7 | 33.3 | 97.9 | 8 | 0.32 | 0.18 | ||
| Turner et al., | Medical | 72 | 92 | 71 | n.r. | n.r. | 11 | 0.33 | 0.19 | ||
| Uslu et al., | Psychiatric | Normal = 503; Clinical = 166 | 77.4 | 76.8 | 63.4 | 84.5 | 20 | 0.27 | 0.19 | ||
| Warmenhoven et al., | Medical | 61 | 90 | 69 | n.r. | n.r. | 16 | 0.35 | 0.17 | ||
| Williams et al., | Medical | 269 | 95 | 60 | 62 | 94 | 7 | 0.44 | 0.15 | ||
| Zich et al., | Medical | 31 | 100 | 89 | n.r. | n.r. | 16 | 0.17 | 0.39 | ||
| HAMD | *Aben et al., | Medical | 202 | 78.7 | 74.6 | 36.8 | 94.7 | 12 | 0.29 | 0.18 | |
| Agrell and Dehlin, | Medical | 40 | 71 | 87 | 60 | 80 | 10 | 0.17 | 0.28 | ||
| Berg et al., | Medical | 100 | 80 | 93 | n.r. | n.r. | 10 | 0.12 | 0.45 | ||
| Cho and Kim, | Psychiatric | 164 | 98 | n.r. | 6 | n.r. | n.r. | n.r. | 0.12 | 0.54 | |
| Kang et al., | Medical | 423 | 89 | 84 | n.r. | n.r. | 12 | 0.21 | 0.29 | ||
| *Leentjens et al., | Medical | 63 | 88 | 89 | 74 | 96 | 13/14 | 0.16 | 0.36 | ||
| *Leung et al., | Psychiatric | 93 | 79 | 80 | 77 | 82 | 15/16 | 0.24 | 0.22 | ||
| *Mottram et al., | Psychiatric | 433 | 87.5 | 99.1 | 99.1 | 97.1 | 16 | 0.07 | 0.87 | ||
| *Naarding et al., | Medical | 403 | 100 | 93 | 88 | 100 | 5/6 | 0.16 | 0.38 | ||
| Quaranta et al., | Medical | 143 | 84.9 | 84.1 | 89.8 | 77.3 | 11 | 0.21 | 0.28 | ||
| Roger and Johnson-Greene, | Medical | 67 | 65 | 56 | 28 | 47 | 2 | 0.45 | 0.10 | ||
| *Strik et al., | Medical | 206 | 76.3 | 86 | 58.8 | 98.2 | 12 | 0.18 | 0.28 | ||
| *Thompson et al., | Normal | 703 | 96 | 98 | n.r. | n.r. | n.r. | 0.08 | 0.78 | ||
| *Williams et al., | Medical | 269 | 77 | 76 | 69 | 83 | 7 | 0.28 | 0.19 | ||
| ZSDS | Agrell and Dehlin, | Medical | 40 | 76 | 96 | 93 | 84 | 45 | 0.09 | 0.58 | |
| Dunstan et al., | Psychiatric | Normal = 289; Clinical = 87 | 93 | 69 | n.r. | n.r. | 50 | 0.35 | 0.18 | ||
| Fountoulakis et al., | Psychiatric | normal = 120; clinical = 40 | 90 | 92.5 | n.r. | n.r. | 44/45 | 0.13 | 0.46 | ||
| Gabrys and Peters, | Psychiatric | Normal = 218; Clinical = 369 | 92 | n.r. | 23 | n.r. | n.r. | 40/50 | 0.28 | 0.22 | |
| Greenough and Fraser, | Medical | 274 | 92 | 73 | n.r. | n.r. | 56 | 0.31 | 0.20 | ||
| Magruder-Habib et al., | Psychiatric | Normal = 60; Clinical = 112 | 70 | 94 | n.r. | n.r. | 50 | 0.10 | 0.46 | ||
| Okimoto et al., | Medical | 55 | 76 | 82 | n.r. | n.r. | 60 | 0.22 | 0.23 | ||
| Passik et al., | Medical | 60 | 86.1 | 66.7 | n.r. | n.r. | 48 | 0.37 | 0.16 | ||
| Spitzer et al., | Medical | 337 | 86 | 74 | n.r. | n.r. | 50 | 0.30 | 0.19 | ||
| Zung and Green, | Psychiatric | n.r. | 88 | n.r. | 44 | n.r. | n.r. | 40 | 0.47 | 0.13 | |
| CES-D | Agrell and Dehlin, | Medical | 39 | 56 | 91 | 82 | 75 | 20 | 0.12 | 0.31 | |
| **Cheng and Chan, | Psychiatric | 398 | 76 (75) | 55 (51) | 57(55) | 74(72) | 12/13 (22/23) | 0.49 | 0.10 | ||
| Cho and Kim, | Psychiatric | 164 | 91.3 | 78.8 | 62.7 | 95.9 | 25 | 0.26 | 0.24 | ||
| Fechner-Bates et al., | Medical | 425 | 79.5 | 71.1 | n.r. | n.r. | 16 | 0.32 | 0.16 | ||
| Haringsma et al., | Psychiatric | 318 | 83.7 | 59.8 | 77 | n.r. | 22 | 0.47 | 0.12 | ||
| Hendrie et al., | Psychiatric | 125 | 82 | 88 | n.r. | n.r. | 16 | 0.17 | 0.33 | ||
| ***Irwin et al., | Psychiatric | 83 | 97 | 84 | 85 | n.r. | 4 | 0.19 | 0.34 | ||
| Jones et al., | Medical | 174 | 95.7 | 78.9 | 42.3 | 33.1 | 14 | 0.26 | 0.25 | ||
| Kirmayer et al., | Medical | 685 | 83.3 | n.r. | 14.4 | n.r. | n.r. | 16 | 0.19 | 0.29 | |
| Parikh et al., | Medical | 180 | 90 | 86 | 80 | n.r. | 16 | 0.19 | 0.32 | ||
| Pérez-Stable et al., | Medical | Normal = 195; Clinical = 70 | 83 | n.r. | 45 | n.r. | n.r. | 16 | 0.48 | 0.12 | |
| Roger and Johnson-Greene, | Medical | 67 | 66 | 68 | 34 | 35 | 15 | 0.34 | 0.13 | ||
| Schein and Koenig, | Medical | 76 | 73.1 | 84 | 70.4 | 85.7 | 16 | 0.20 | 0.25 | ||
| Schulberg et al., | Psychiatric | 1554 | 88.9 | 70.4 | 23.3 | 35.3 | 27 | 0.48 | 0.12 | ||
| Shean and Baldwin, | Normal | 395 | 86.7 | 76.6 | 41.9 | 96.7 | 16 | 0.28 | 0.21 | ||
| Shinar et al., | Medical | 27 | 73 | n.r. | 0 | 84 | n.r. | 16 | 0.05 | 1.00 | |
| Weissman et al., | Psychiatric | 235 | 99 | 56 | n.r. | n.r. | 16 | 0.33 | 0.18 | ||
| Williams et al., | Medical | 269 | 72 | 70 | 62 | 79 | 12 | 0.33 | 0.15 | ||
| Zich et al., | Medical | 34 | 100 | 81 | n.r. | n.r. | 27 | 0.24 | 0.27 | ||
| TDI | Balsamo and Saggino, | Psychiatric | 125 | 82 | 98 | 98 | 84 | 36 | 0.10 | 0.58 |
Authors used a 13 items version of the BDI.
Authors used the 17 items version of the HAMD.
Authors used two CES-D versions: one with 10 and the other with 20 items. Values relative to the 20 item version are in parentheses.
Authors used a 10 items version of the CES-D with binary items.
Stroke patients.
Patients with Alzheimer's disease.
Patients with Parkinson's disease.
Patients with major depressive disorder.
Patients with clinically relevant depression.
Psychiatric patients.
community-dwelling elders.
Primary care patients.
Psychiatric patients.
Acute and recovered depressives.
Drug addicts.
Alcoholics.
Schizophrenics.
n.r. not reported datum.
Figure 2Diagnostic accuracy of BDI, HAMD, ZSDS, CES-D, and TDI estimated with the Bayes' theorem (Equation 1) against the different levels of P(A|B) (horizontal axis) and of FPs (series of gray curves). Open symbols report data for each study, while filled symbols indicate the mean values for each depression scale. For the TDI there is currently one study which reports its sensitivity and specificity values. The selected levels of FPs were 10, 5, 2.5, and 1%.
Figure 3Mean values of sensitivity and specificity for the BDI, the HAMD, the ZSDS and the CES-D reported by different published works. For the TDI, there is currently one published work. Vertical lines are standard errors.
Mean values of specificity, sensitivity and cutoff values for each kind of sample (normal, psychiatric or medical) used in studies to estimate the criterion validity of depressive scales.
| Means | Normal | 76.04 | 81.06 | 13.60 | 96.00 | 98.00 | 12.00 | 86.70 | 76.60 | 16.00 | ||||||
| Psychiatric | 85.56 | 70.05 | 17.35 | 85.06 | 89.59 | 12.33 | 86.60 | 77.70 | 45.00 | 86.54 | 69.67 | 18.07 | 82.00 | 98.00 | 36.00 | |
| Medical | 87.25 | 76.57 | 13.75 | 79.03 | 81.71 | 10.50 | 83.22 | 78.33 | 51.80 | 79.23 | 79.14 | 16.73 | ||||
| Standard deviations | Normal | 9.57 | 9.86 | 4.51 | 0 | 0 | 0 | 0 | 0 | 0 | ||||||
| Psychiatric | 8.35 | 12.04 | 6.52 | 7.78 | 8.10 | 2.78 | 9.48 | 16.06 | 4.89 | 7.43 | 15.33 | 7.89 | 0 | 0 | 0 | |
| Medical | 8.14 | 10.31 | 4.72 | 7.81 | 11.06 | 4.21 | 7.03 | 11.28 | 6.10 | 13.01 | 12.44 | 3.90 | ||||
| Number of works in which samples are present | Normal | 5 | 1 | 0 | 1 | 0 | ||||||||||
| Psychiatric | 11 | 4 | 5 | 7 | 1 | |||||||||||
| Medical | 20 | 10 | 5 | 11 | 0 | |||||||||||
Samples cited in only one work have null standard deviation.