| Literature DB >> 20807445 |
Elizabeth Phelan1, Barbara Williams, Kathryn Meeker, Katie Bonn, John Frederick, James Logerfo, Mark Snowden.
Abstract
BACKGROUND: The diagnostic accuracy of the Patient Health Questionnaire-9 (PHQ-9) for assessment of depression in elderly persons in primary care settings in the United States has not been previously addressed. Thus, the purpose of this study was to evaluate the test performance of the PHQ-9 for detecting major and minor depression in elderly patients in primary care.Entities:
Mesh:
Year: 2010 PMID: 20807445 PMCID: PMC2940814 DOI: 10.1186/1471-2296-11-63
Source DB: PubMed Journal: BMC Fam Pract ISSN: 1471-2296 Impact factor: 2.497
Demographic and health characteristics of participants at enrollment (N = 71)
| Characteristic | |
|---|---|
| Age, years, mean ± SD* | 78 ± 7 |
| Female, % | 62 |
| Non-white, % | 32 |
| High school graduate, % | 82 |
| Chronic medical conditions | |
| Mean ± SD | 2.3 ± 1.5 |
| Median (interquartile range) | 2 (2) |
| Three or more chronic medical conditions, % | 42 |
| Patient Health Questionnaire-9 (PHQ-9) | |
| Score, mean ± SD | 5.9 ± 6.1 |
| Score ≥10, %† | 23 (16/71) |
| Patient Health Questionnaire-2 (PHQ-2) | |
| Score, mean ± SD | 1.2 ± 1.6 |
| Score ≥3, %‡ | 20 (14/71) |
| 15-Item Geriatric Depression Scale (GDS) | |
| Score, mean ± SD | 5.8 ± 1.8 |
| Score >5, %§ | 48 (33/69) |
| Structured Clinical Interview for Depression (SCID) | |
| Major depression, % | 12 (8/69) |
| Minor depression, % | 13 (9/69) |
* SD = standard deviation.
† Scores of 10 or greater on the PHQ-9 suggest likely major depression in general medical populations [18].
‡ Scores of 3 or greater on the PHQ-2 suggest possible depression in adults in primary care and obstetrics-gynecology settings [33].
§ Scores greater than 5 on the 15-item GDS suggest major depression in older adults in primary care [17].
Sensitivity and specificity of depression screening instruments for diagnosing major depression at various cutpoints*
| Instrument and Cutpoint | Sensitivity (%) | Specificity (%) | + Likelihood Ratio | - Likelihood Ratio |
|---|---|---|---|---|
| PHQ-9 Cutpoint | ||||
| ≥8 | 88 | 75 | 3.6 | 0.16 |
| (56-98) | (71-77) | |||
| 4.4 | 0.16 | |||
| (56-98) | (76-82) | |||
| 3.5 | 0.46 | |||
| (33-86) | (78-85) | |||
| ≥11 | 63 | 84 | 3.8 | 0.45 |
| (33-85) | (80-87) | |||
| ≥12 | 63 | 84 | 3.8 | 0.45 |
| (33-85) | (80-87) | |||
| PHQ-2 Cutpoint | ||||
| ≥1 | 88 | 61 | 2.2 | 0.21 |
| (55-98) | (56-62) | |||
| 2.3 | 0.37 | |||
| (43-93) | (63-70) | |||
| 4.2 | 0.44 | |||
| (33-85) | (81-88) | |||
| ≥4 | 38 | 93 | 5.7 | 0.67 |
| (15-62) | (91-97) | |||
| ≥5 | 38 | 98 | 22.9 | 0.64 |
| (16-48) | (96-100) | |||
| 15-item GDS Cutpoint | ||||
| ≥5 | 100 | 15 | 1.2 | 0.00 |
| (73-100) | (11-15) | |||
| 2.4 | 0.00 | |||
| (70-100) | (54-59) | |||
| 3.2 | 0.33 | |||
| (43-93) | (72-79) | |||
| ≥8 | 25 | 87 | 1.9 | 0.86 |
| (7-54) | (84-91) | |||
| ≥9 | 13 | 93 | 1.9 | 0.94 |
| (2-37) | (92-97) |
* Cutpoints are specific sum scores that distinguish between individuals with and without the disorder. Bolded cutpoints indicate the optimum balance between sensitivity and specificity, while italicized cutpoints are those that are typically cited in the literature as being those that optimize sensitivity and specificity for the detection of major depression for each respective instrument when applied to the general population.
† CI = confidence interval.
Receiver operating curve analyses for major depression for each screening instrument
| N* | PHQ-9 AUC† | PHQ-2 AUC | GDS AUC | |
|---|---|---|---|---|
| Overall | 69 | 0.87 (0.74-1.00) | 0.81 (0.65-0.98) | 0.81 (0.70-0.91) |
| Gender | ||||
| Male | 26 | 0.88 (0.72-1.00) | 0.91 (0.78-1.00) | 0.76 (0.58-0.93) |
| Female | 43 | 0.85 (0.61-1.00) | 0.70 (0.37-1.00) | 0.84 (0.70-0.98) |
| Race | ||||
| White | 47 | 0.88 (0.73-1.00) | 0.87 (0.75-0.99) | 0.85 (0.74-0.95) |
| Non-white‡ | 22 | 0.93 | 0.31 | 0.90 |
| Age | ||||
| < 80 years | 38 | 0.92 (0.81-1.00) | 0.81 (0.54-1.00) | 0.82 (0.68-0.96) |
| ≥80 years | 31 | 0.80 (0.52-1.00) | 0.83 (0.64-1.00) | 0.80 (0.62-0.98) |
| Comorbidities | ||||
| < 3 | 41 | 0.93 (0.82-1.00) | 0.92 (0.85-1.00) | 0.79 (0.55-1.00) |
| ≥3 | 28 | 0.80 (0.57-1.00) | 0.71 (0.43-0.99) | 0.82 (0.67-0.97) |
* N is for PHQ-9 and PHQ-2; GDS is missing for one participant.
† AUC = area under the receiver operating characteristic curve. Values range from ≤0.5 (no discriminatory ability) to 1.0 (perfect discrimination -- in this case, of depressed from non-depressed).
‡ CI = confidence interval.
§ Only one of the 22 non-white participants had a positive SCID, and therefore no confidence interval could be calculated.
Sensitivity and specificity of each screening instrument using a broadened definition of depression*
| Instrument and Cutpoint | Sensitivity (%) | Specificity (%) | + Likelihood Ratio | - Likelihood Ratio |
|---|---|---|---|---|
| PHQ-9 Cutpoint | ||||
| ≥6 | 77 | 69 | 2.5 | 0.34 |
| (56-90) | (63-74) | |||
| ≥7 | 77 | 77 | 3.3 | 0.31 |
| (56-90) | (70-81) | |||
| 4.4 | 0.28 | |||
| (57-89) | (76-87) | |||
| ≥9 | 71 | 87 | 5.2 | 0.34 |
| (51-85) | (80-91) | |||
| ≥10 | 59 | 89 | 5.1 | 0.47 |
| (40-74) | (82-93) | |||
| PHQ-2 Cutpoint | ||||
| 2.5 | 0.26 | |||
| (62-94) | (61-71) | |||
| ≥2 | 71 | 73 | 2.6 | 0.40 |
| (50-86) | (66-78) | |||
| ≥3 | 53 | 90 | 5.5 | 0.52 |
| (35-67) | (85-95) | |||
| ≥4 | 35 | 98 | 18.4 | 0.66 |
| (21-40) | (94-100) | |||
| ≥5 | 18 | 98 | 9.2 | 0.84 |
| (7-22) | (95-100) | |||
| 15-item GDS Cutpoint | ||||
| ≥4 | 94 | 4 | 0.98 | 1.63 |
| (85-99) | (1-5) | |||
| ≥5 | 88 | 14 | 1.0 | 0.93 |
| (71-96) | (8-16) | |||
| 2.1 | 0.30 | |||
| (60-93) | (55-65) | |||
| ≥7 | 56 | 79 | 2.7 | 0.55 |
| (36-74) | (73-84) | |||
| ≥8 | 31 | 90 | 3.3 | 0.76 |
| (16-46) | (86-95) |
* Bolded cutpoints indicate the optimum balance between sensitivity and specificity.
‡ CI = confidence interval.