| Literature DB >> 30038858 |
John A Gallis1,2, Joanna Maselko3, Karen O'Donnell2,4, Ke Song1,2, Kiran Saqib5, Elizabeth L Turner1,2, Siham Sikander5,6.
Abstract
BACKGROUND: Depression is one of the most prevalent, yet unrecognized but treatable mental disorders in low and middle income countries (LMICs). In such locations, screening tools that are easy-to-administer, valid, and reliable are needed to assist in detecting symptoms of depression. The Patient Health Questionnaire (PHQ-9) is one of the most widely used depression screeners. However, its applicability to community-based settings of Pakistan is limited by the lack of studies examining its validity and reliability in such settings. The current study aimed to demonstrate the criterion-related validity and internal reliability of the Urdu version of the PHQ-9 in a sample of community-based pregnant women in Pakistan compared to a diagnostic clinical interview, the Structured Clinical Interview for DSM disorders (SCID), using data from a depression treatment cluster randomized trial in rural Pakistan.Entities:
Keywords: Criterion-related validity; Depressive disorder; Pakistan; Patient health questionnaire; Perinatal; Pregnant; Reliability; Structured clinical interview; Thinking healthyprogramme; Urdu
Year: 2018 PMID: 30038858 PMCID: PMC6054083 DOI: 10.7717/peerj.5185
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Baseline participant demographic characteristics, by PHQ-9 and SCID, n = 1, 154
Demographic characteristics are from a 1:1 depressed/non-depressed sample (as determined by PHQ-9 ≥ 10) of n = 1, 154 women.
| PHQ-9 cutoff at 10 | SCID current depressive episode | Total ( | |||
|---|---|---|---|---|---|
| Not depressed ( | Depressed ( | Not depressed ( | Depressed ( | ||
| Mean (SD) | 26.4 (4.3) | 27.0 (4.8) | 26.5 (4.3) | 27.0 (4.8) | 26.7 (4.5) |
| Median (Q1, Q3) | 26.0 (23.0, 29.0) | 27.0 (24.0, 30.0) | 26.0 (24.0, 30.0) | 27.0 (23.0, 30.0) | 26.0 (23.0, 30.0) |
| Range | (18.0–40.0) | (18.0–45.0) | (18.0–40.0) | (18.0–45.0) | (18.0–45.0) |
| Mean (SD) | 2.5 (2.6) | 3.0 (2.7) | 2.6 (2.6) | 3.1 (2.7) | 2.8 (2.7) |
| Median (Q1, Q3) | 2.0 (1.0, 4.0) | 2.0 (1.0, 4.0) | 2.0 (1.0, 4.0) | 2.0 (1.0, 4.0) | 2.0 (1.0, 4.0) |
| Range | (0.0–20.0) | (0.0–21.0) | (0.0–20.0) | (0.0–21.0) | (0.0–21.0) |
| Mean (SD) | 2.8 (2.5) | 14.7 (3.7) | 4.7 (4.7) | 15.1 (4.0) | 8.7 (6.7) |
| Median (Q1, Q3) | 3.0 (0.0, 4.0) | 14.0 (12.0, 17.0) | 3.0 (1.0, 7.0) | 15.0 (12.0, 18.0) | 9.0 (3.0, 14.0) |
| Range | (0.0–9.0) | (10.0–27.0) | (0.0–25.0) | (2.0–27.0) | (0.0–27.0) |
| None (0) | 63 (10.8%) | 107 (18.8%) | 83 (11.7%) | 87 (19.7%) | 170 (14.7%) |
| Primary (1–5) | 87 (14.9%) | 139 (24.4%) | 124 (17.4%) | 102 (23.1%) | 226 (19.6%) |
| Middle (6–8) | 108 (18.5%) | 107 (18.8%) | 129 (18.1%) | 86 (19.5%) | 215 (18.6%) |
| Secondary (9–10) | 167 (28.6%) | 126 (22.1%) | 191 (26.8%) | 102 (23.1%) | 293 (25.4%) |
| Higher secondary (11–12) | 63 (10.8%) | 46 (8.1%) | 74 (10.4%) | 35 (7.9%) | 109 (9.4%) |
| Tertiary (>12) | 96 (16.4%) | 45 (7.9%) | 111 (15.6%) | 30 (6.8%) | 141 (12.2%) |
| None (0) | 33 (5.7%) | 55 (9.6%) | 42 (5.9%) | 46 (10.4%) | 88 (7.6%) |
| Primary (1–5) | 45 (7.7%) | 67 (11.8%) | 62 (8.7%) | 50 (11.3%) | 112 (9.7%) |
| Middle (6–8) | 105 (18.0%) | 137 (24.0%) | 132 (18.5%) | 110 (24.9%) | 242 (21.0%) |
| Secondary (9–10) | 286 (49.0%) | 243 (42.6%) | 347 (48.7%) | 182 (41.2%) | 529 (45.8%) |
| Higher secondary (11–12) | 65 (11.1%) | 47 (8.2%) | 73 (10.3%) | 39 (8.8%) | 112 (9.7%) |
| Tertiary (>12) | 50 (8.6%) | 21 (3.7%) | 56 (7.9%) | 15 (3.4%) | 71 (6.2%) |
Notes.
The baseline sample of n = 1, 154 is a subset of the screening sample of n = 1, 731.
PHQ-9 Score cutoffs by SCID reference standard; screening sample, n = 1, 731.
| PHQ-9 severity of depression | SCID depression status | Total ( | |
|---|---|---|---|
| Depressed ( | Non depressed ( | ||
| Minimal, 0–4 | 5 (1.1%) | 850 (66.6%) | 855 (49.4%) |
| Mild, 5–9 | 19 (4.2%) | 285 (22.3%) | 304 (17.6%) |
| Moderate, 10–14 | 202 (44.5%) | 112 (8.8%) | 314 (18.1%) |
| Moderately severe, 15–19 | 160 (35.2%) | 27 (2.1%) | 187 (10.8%) |
| Severe, 20+ | 68 (15.0%) | 3 (0.2%) | 71 (4.1%) |
Psychometric properties of PHQ-9 at several cutoffs, with SCID as the reference standard; screening sample, n = 1,731.
| PHQ-9 cutoffs | Sensitivity (95% CI) | Specificity (95% CI) | Positive predictive value (95% CI) | Negative predictive value (95% CI) |
|---|---|---|---|---|
| ≥5 | 98.9 (97.4–99.6) | 66.6 (63.9–69.1) | 51.3 (47.9–54.6) | 99.4 (98.6–99.8) |
| ≥6 | 98.0 (96.3–99.1) | 74.1 (71.6–76.5) | 57.3 (53.8–60.9) | 99.1 (98.2–99.6) |
| ≥7 | 97.4 (95.4–98.6) | 80.9 (78.6–83.0) | 64.4 (60.7–68.0) | 98.9 (98.0–99.4) |
| ≥8 | 96.5 (94.3–98.0) | 84.9 (82.8–86.8) | 69.4 (65.7–73.0) | 98.5 (97.6–99.2) |
| ≥9 | 95.8 (93.5–97.5) | 87.9 (85.9–89.6) | 73.7 (70.0–77.2) | 98.3 (97.4–99.0) |
| ≥11 | 89.0 (85.7–91.7) | 91.9 (90.3–93.4) | 79.7 (75.9–83.1) | 95.9 (94.6–97.0) |
| ≥12 | 81.1 (77.1–84.6) | 94.0 (92.6–95.3) | 82.9 (79.1–86.3) | 93.3 (91.8–94.6) |
| ≥13 | 70.0 (65.6–74.2) | 95.5 (94.3–96.6) | 84.8 (80.8–88.3) | 90.0 (88.2–91.5) |
| ≥14 | 59.9 (55.2–64.5) | 96.6 (95.4–97.5) | 86.1 (81.8–89.7) | 87.1 (85.3–88.8) |
| ≥15 | 50.2 (45.5–54.9) | 97.7 (96.7–98.4) | 88.4 (83.8–92.0) | 84.7 (82.7–86.5) |
| ≥16 | 39.2 (34.7–43.9) | 98.3 (97.4–98.9) | 89.0 (83.8–93.0) | 82.0 (80.0–83.9) |
| ≥17 | 33.9 (29.6–38.5) | 98.7 (98.0–99.3) | 90.6 (85.2–94.5) | 80.8 (78.7–82.7) |
| ≥18 | 26.2 (22.2–30.5) | 99.2 (98.6–99.6) | 92.2 (86.2–96.2) | 79.1 (77.0–81.1) |
| ≥19 | 19.6 (16.0–23.6) | 99.5 (98.9–99.8) | 92.7 (85.6–97.0) | 77.7 (75.6–79.7) |
| ≥20 | 15.0 (11.8–18.6) | 99.8 (99.3–100.0) | 95.8 (88.1–99.1) | 76.7 (74.6–78.8) |
Notes.
All numbers are percentages; using data from n = 1, 731 women screened for depression.
Bolded text highlights the psychometric properties at a cutoff of ≥10.
Figure 1Receiver operating characteristic (ROC) curve comparing different cutoffs of PHQ-9 score to the SCID reference standard; screening sample, n = 1, 731.
Abbreviation: AUC, area under the curve. Bolded point indicates the cutoff of ≥10.
Cronbach’s alpha (internal reliability) results; screening sample, n = 1, 731.
| PHQ-9 Items | Cronbach’s alpha if item deleted |
|---|---|
| 1. Feeling tired or having little energy | 0.828 |
| 2. Poor appetite or overeating | 0.842 |
| 3. Trouble falling or staying asleep, or sleeping too much | 0.836 |
| 4. Moving or speaking slowly | 0.820 |
| 5. Trouble concentrating | 0.836 |
| 6. Little interest or pleasure in doing things | 0.816 |
| 7. Feeling down, depressed, or hopeless | 0.809 |
| 8. Feeling bad about yourself | 0.824 |
| 9. Thoughts that you would better off dead, or of hurting yourself | 0.839 |
Notes.
Questions were asked in a different order than is standard.
Items were standardized.