| Literature DB >> 27491674 |
Daniel Hind1, Daphne Kaklamanou2, Dan Beever1, Rosie Webster3, Ellen Lee1, Michael Barkham4, Cindy Cooper1.
Abstract
BACKGROUND: The prevalence of depression in people with multiple sclerosis (PwMS) is high; however, symptoms common to both conditions makes measurement difficult. There is no high quality overview of validation studies to guide the choice of depression inventory for this population.Entities:
Keywords: Chronic Disease; Depression; Multiple Sclerosis; Psychometrics; Reproducibility of Results
Mesh:
Year: 2016 PMID: 27491674 PMCID: PMC4973535 DOI: 10.1186/s12888-016-0931-5
Source DB: PubMed Journal: BMC Psychiatry ISSN: 1471-244X Impact factor: 3.630
Reliability (Internal Consistency, Reliability & Measurement Error) Findings Based on Each Measure
| Authors (Date) | Measure | Reliability (Cronbach’s Alpha) | Internal consistency (Inter-Item Correlations) | Test-test Reliability | Measurement error |
|---|---|---|---|---|---|
| Moran & Mohr (2005) [ | BDI | BDI SEM = 3.26; HRSD SEM = 3.20 | |||
| Aikens et al. (1999) [ | BDI-II | Full BDI | |||
| Amtmann et al. (2014) [ | PHQ-9; CESD-10; | PHQ-9 = .35-.67; CESD-10 = .33-.67; PROMIS-D-8 = .75-.84 | |||
| Sjonnesen et al. (2012) [ | PHQ-9 |
| Item-total correlations (PwMS): anhedonia (.71); depressed mood (.65); fatigue (.57) and concentration (.55). | ||
| Chang et al. (2003) [ | CMDI | Sleep Disturbance | |||
| Solari et al. (2003) [ | CMDI | MS | Item-subscale: mood = .58-.80, evaluative = .37-.70, vegetative = .32-.57 | ICC = 0.78 (95 % CI 0.62–0.89); Subscales range 0.71–0.79 (2 weeks) | |
| Patten et al. (2010) [ | CES-D |
| |||
| Verdier-Taillefer et al. (2001) [ | CES-D |
|
Note: HRSD = Hamilton Rating Scale for Depression
Structural and Content Validity Findings by Measure
| Authors (Date) | Measure | Structural Validity (Factor Analysis) | Content Validity Item Inclusion |
|---|---|---|---|
| Mohr et al. (1997) [ | BDI | Fatigue, work difficulty, and concerns about health ( | |
| Moran & Mohr (2005) [ | BDI | All items showed significant reductions following treatment ( | |
| Aikens et al. (1999) [ | BDI-II | MS had higher scores than healthy controls on work difficulty, | |
| Strober & Arnett (2010) [ | mBDI | MS-NON-DEP: fatigue, indecision, loss of libido, work difficulty, irritability, loss of interest, crying, dissatisfaction, self-criticism. DEP-PwMS: Irritability, loss of interest, crying, dissatisfaction, self-criticism, sadness, pessimism, failure, guilt, appetite, disappointment, weight loss. MS (more severe in DEP-MS) or related to depressive symptoms: irritability, loss of interest, crying, dissatisfaction, self-criticism. | |
| Amtmann et al. (2014) [ | PHQ-9; CESD-10; | Fit indices from a one-factor CFA were acceptable. CFI for all models ≥ of 0.95. TLIs for PHQ-9 and CESD-10 < 0.95 (0.94 and 0.93, respectively). TLI > 0.95 for PROMIS-D-8. RMSEAs > .05 for any of the measures. | |
| Sjonnesen et al. (2012) [ | PHQ-9 | Exclusion of fatigue and concentration items no change in prevalence estimates ( | |
| Patten et al. (2005) [ | CES-D | Agreement between full and modified scales, | |
| Verdier-Taillefer et al. (2001) [ | CES-D | Factor 1 = depressed affect; Factor 4 = interpersonal relationships. Factor 2 = positive affect items (MS group), but contained somatic complaints (other groups); Factor 3 = somatic complaints (MS group) but positive affect (other groups) | |
| Chang et al. (2003) [ | CMDI | CFA confirmed the subscale/factor structure | Vegetative subscale showed misfitting in PwMS. ‘Fatigue’ and ‘useless’ items were endorsed more by MS than controls. |
| Mohr et al. (2007) [ | Two-item measure | The sensitivity and NPV of using either question alone significantly better than either question alone; the sensitivity of both questions was significantly lower than other methods |
Note: Studies that are not included in the table either did not assess content validity or did not report it. a ANOVAS but does not report f values. DEP = Depressed
Criterion Validity Findings by Measure
| Authors (Date) | Measure | Criterion Measure | Correlation | Sensitivity | Specificity | ROC Curve Analysis | PPV | NPV |
|---|---|---|---|---|---|---|---|---|
| Benedict et al (2003) [ | BDI-FS | BDI, CES-D, neuropsychiatric inventory (NPI) | BDI | |||||
| Sullivan et al. (1995) [ | BDI | Structured clinical interview (DSM) | 88 % (≥9), 71 % (≥13) | 46 % (≥9), 79 % (≥13) | 54 % (≥9), 70 % (≥13) | 84 % (≥9), 79 % (≥13) | ||
| Honarmand and Feinstein (2009) [ | HADS-D (≥8) | SCID-I (DSM) | 90 %, (95%CI 73–98)* 86.7 % (95%CI 68–96)* fatigue item excluded (≥6) | 87.3 %, (95%CI 81–92)* 86.7 % (95%CI 80–92)* fatigue item excluded (≥6) | AUC = 0.94, 0.93 fatigue item excluded (≥6) | 90.6 %, 90.6 % fatigue item excluded (≥6) | ||
| Nicholl et al. (2001) [ | HADS-D | BDI (Predictive) |
| 25 % - When using cut-off of 5/6 (identified by ROC curve), 75 % | 86 % - When using cut-off of 5/6 (identified by ROC curve), 69 % | Identified the optimum cut-off of 5/6 when compared to the BDI. | ||
| Avasarala et al. (2003) [ | YSQ | BDI (Cut-off of ≥13) | 65.3 % (95%CI 50–78) | 87.3 % (95%CI 77–94) | 78.0 % (95%CI 62–89) | 78.5 % (95%CI 68–87) | ||
| Mohr et al. (2007) [ | Two-item measure | SCID (DSM) | Both questions: 51 % (95%CI 38–63), either: 99 % (95%CI 91–100), 1 only: 75 % (95%CI 62–84), 2 only: 75 % (95%CI 62–84) | Both: 98 % (95%CI 94–99), either: 87 % (95%CI 81–91), 1 only: 94 % (95%CI 89–97), 2 only: 94 % (95%CI 89–97) | Both: 90 % (95%CI 74–97), either: 72 % (95%CI 61–80), 1 only: 73 % (95%CI 61–83), 2 only: 81 % (95%CI 68–89) | Both: 85 % (95%CI 80–89), either: 99 % (95%CI 96–100), 1 only: 91 % (95%CI 86–95), 2 only: 91 % (95%CI 86–95) | ||
| Pandya et al. (2005) [ | CES-D | DSM-IV psychiatric interview | 59.6 % (95%CI 45.0–74.1) for major depression; 74.5 % (95%CI 61.5–87.4) for any depressive disorder. | |||||
| Solari et al. (2003) [ | CMDI | Previously diagnosed according to DSM-IV | Total scale: 96.8 %; mood: 96.9 %; evaluative and vegetative: 75.0 % | |||||
| Vahter et al (2007) [ | One-item measure | BDI (>10 cut-off) and structured clinical interview | 81% | 89% | 93.5% |
Note: Studies that are not included in the table either did not assess criterion or did not report it.aExtracted from Manoj and Sivan (2007) [52] *Calculated by EL. SCID = Structured Clinical Interview for DSM Disorders; ROC = Receiver operating characteristic
Hypotheses Testing Findings by Measure
| Authors (Date) | Measure | Discriminant | Convergent |
|---|---|---|---|
| Aikens et al. (1999) [ | BDI-II | BDI totals differed by group, | BDI totals similar for MS and diabetes |
| Moran and Mohr (2005) [ | BDI | BDI total score reduced from 23.7 ( | |
| Sullivan et al. (1995) [ | BDI | Participants with major depression scored significantly higher on negative attitude towards self and performance components, | |
| Benedict et al. (2003) [ | BDI-FS | Groups differed in scores between treated and untreated groups (anti-depressants), | |
| Strober and Arnett (2010) [ | mBDI | PwMS (DEP, NON-DEP) endorsed fatigue ( | BDI and depression proneness rating scale, |
| Amtmann et al. (2014) [ | PHQ-9; CESD-10; PROMIS-D-8 | PHQ-9 with CESD-10 (.85); PROMIS-D-8 (.73). CESD-10 with PROMIS-D-8 (.80). | Fatigue scores correlations: PHQ 9 = .73; CESD-10 = .71; PROMIS-D-8 = .55. Sleep disturbance: PHQ-9 = .57; CESD-10 = .56; PROMIS-D-8 = .39. Pain interference: PHQ-9 = .60; CESD-10 = .55; PROMIS-D-8 = .47. |
| Pandya et al. (2005) [ | CES-D | MS with a major depressive disorder diagnosis had CES-D score (36.5) than those who did not (26.0), Mann-Whitney = 12.1, df = 1, | |
| Patten et al. (2005) [ | CES-D | CES-D scores did not correlate as highly with physical QoL, as with mental QoL, Spearman's | CES-D scores correlated highly with mental QoL, Spearman's |
| Verdier-Taillefer et al. (2001) [ | CES-D | PwMS endorsed more depressed affect and fatigue items than GP patients; MS and GP patients endorsed more depressive symptoms than healthy workers ( | |
| Beeney and Arnett (2008) [ | CMDI | Physical disability was correlated with vegetative subscale, | History of depression correlated with CMDI-Mood, |
| Chang et al. (2003) [ | CMDI | PwMS scored higher on all dimensions than controls (CMDI-Mood | |
| Nyenhuis et al (1995) [ | MDI | PwMS endorsed more vegetative symptoms than mood and evaluative, than the depressed group, controlling for total BDI score | BDI (30.5 %) estimated depression significantly greater than MDI-mood (17.7 %), |
| Solari et al. (2003) [ | CMDI | Significant differences in all subscale scores, | |
| Nicholl et al. (2001) [ | HADS-D | HADS-D correlated with both the GHQ-12 ( |
Note: studies that are not included in the table either did not assess construct validity or did not report it. aPaper talks about Mann-Whitney chi square – test does not exist. QoL = Quality of Life; GHQ = General Health Questionnaire
Methodological Quality of Each Article per Measurement Property and Instrument According to COSMIN Checklist
| PAPERS | Measure | Internal Consistency | Reliability | Measurement Error | Content Validity (Face Validity Incl.) | Structural Validity | Hypotheses testing | Cross Cultural Validity | Criterion Validity | Responsiveness |
|---|---|---|---|---|---|---|---|---|---|---|
| Mohr et al. (1997) [ | BDI | - | - | - | Excellent | - | Fair | - | - | - |
| Moran & Mohr (2005) [ | BDI | - | - | Fair | Excellent | - | Fair | - | - | Fair |
| Sullivan et al. (1995) [ | BDI | - | - | - | Fair | - | Fair | - | Fair | - |
| Aikens et al. (1999) [ | BDI-II | Fair | - | - | Excellent | - | Fair | - | - | - |
| Benedict et al. (2003) [ | BDI-FS | - | - | - | Fair | - | Fair | - | Fair | - |
| Strober & Arnett (2010) [ | mBDI | - | - | - | Excellent | - | Fair | - | - | - |
| Pandya et al. (2005) [ | CES-D | - | - | - | - | - | Fair | - | Poord | - |
| Patten et al. (2005) [ | CES-D | - | - | - | Excellent | - | Fair | - | - | - |
| Patten et al. (2010) [ | CES-D | - | Fair | Fair | - | - | Good | - | - | Fair |
| Verdier-Taillefer et al. (2001) [ | CES-D | Good | - | - | Excellent | Good | Good | - | - | - |
| Amtmann et al. (2014) [ | CESD-10; PHQ-9; PROMIS-D-8 | Poora | - | - | Fair | Fair | Fair | - | - | - |
| Sjonnesen et al. (2012) [ | PHQ-9 | Poorb | - | - | Fair | - | Fair | - | - | - |
| Beeney & Arnett (2008) [ | CMDI | - | - | - | Excellent | - | Fair | - | - | Fair |
| Chang et al. (2003) [ | CMDI | Fair | - | - | Excellent | Fair | Fair | - | - | - |
| Nyenhuis et al. (1995) [ | MDI | - | - | - | Fair | - | Fair | - | - | - |
| Solari et al. (2003) [ | CMDI | Fair | Fair | - | Excellent | - | Fair | Poorc | - | - |
| Honarmand & Feinstein (2009) [ | HADS | - | - | - | - | - | Fair | - | Fair | - |
| Nicholl et al. (2001) [ | HADS | - | - | - | - | - | Good | - | Good | - |
| Avasarala et al. (2003) [ | YSQ | - | - | - | - | - | Fair | - | Fair | - |
| Mohr et al. (2007) [ | Two-item measure | - | - | - | Excellent | - | Fair | - | Fair | - |
| Vahter et al (2007) [ | One-item measure | - | - | - | - | - | Fair | - | Fair | - |
Notes: a This paper was rated poor due to not calculating internal consistency for each subscale separately. b This paper was evaluated poor for not assessing the unidimensionality of the measure, although we understand that might not have been the purpose of the study. c Multiple group Confirmatory Factor Analysis or Differential Item Function not performed/assessed. d No correlations or AUC calculated
Studies Excluded at Full Text Screening, With Reasons
| Authors (Date) | Reason for Exclusion |
|---|---|
| Quaranta et al. (2012) [ | Measure was not self-report |
| Knox (2010) [ | Review/narrative |
| Manoj & Sivan (2007) [ | |
| Nocentini (2006) [ | |
| Doward et al. (2009) [ | Doesn’t assess depression |
| Fishman et al. (2004) [ | |
| Groom et al. (2003) [ | |
| McGuigan & Hutchinson (2004) [ | |
| Mueller & Girace (1988) [ | |
| O’Brien et al. (2007) [ | |
| Schwartz et al. (2011) [ | |
| Cook et al. (2012) [ | Doesn't assess validation of depression measure in MS |
| Gold et al. (2003) [ | |
| Horton et al. (2010) [ | |
| Provinciali et al. (1999) [ | |
| Alajbegovic et al. (2009) [ | Not a validation study |
| Good et al. (1992) [ | |
| Leon et al. (2001) [ | Sample not solely PwMS |
| Lykouras et al. (1998) [ |
Study Characteristics
| Authors (Date), Country | Samples | Baseline Characteristics | MS Diagnosis (years) | Disability Status | Recruitment Method | Measures | Validity/ Reliability Tested |
|---|---|---|---|---|---|---|---|
| Aikens et al. (1999) [ | MS=105, Depressed=34, Healthy=80, Diabetes=71, Chronic pain=80 | MS: |
| Expanded Disability Status Scale = 0.0 - 7.5 (Median = 3.8) Moderate Disability | University Hospital | BDI-II | Construct validity; content validity; internal reliability (cronbach's alpha) |
| Amtmann et al. (2014) [ | MS = 455 |
|
| Moderate level | University Hospital/ National Multiple Sclerosis Society charter | CESD–10, PHQ-9 PROMIS-D-8 | Evaluation of dimensionality; inter-item correlation; discriminant/convergent validity |
| Avasarala et al. (2003) [ | MS=120 | Ages: 20–29 | NA | NA | University Hospital | YSQ | Criterion validity |
| Beeney and Arnett (2008) [ | 3 year follow-up MS=52; cross-sectional analysis, MS=96. |
|
| EDSS (T1) =4.55 ( | University Hospital/ National Multiple Sclerosis Society charter | CMDI | Construct validity |
| Benedict et al. (2003) [ | MS=54 |
| NA | EDSS median = 2.5 ( | University Hospital | BDI-FS | Criterion and construct validity |
| Chang et al. (2003) [ | MS=433; plus 'standardisation sample' | MS: | NA | NA | University Hospital | CMDI | Content validity, internal reliability, construct validity |
| Honarmand and Feinstein (2009) [ | Study 1: MS=140; Study 2: MS=40, MD=21, Matched controls (no MD) = 19 |
|
| EDSS = 4.0 ( | Hospital | HADS | Criterion validity |
| Mohr et al. (1997) [ | MS =184, DEP =72, controls (college students) = 555 | MS: |
| NA | University Hospital | BDI | Face validity |
| Mohr et al. (2007) [ | MS = 260 |
|
| NA | University Hospital | Two-item measure | Criterion validity, construct validity |
| Moran and Mohr (2005) [ | MS (with depression)=42 | Mage=43.0 ( |
| NA | University Hospital/ referrals/ National Multiple Sclerosis Society Charter | BDI | Construct validity |
| Nicholl et al. (2001) [ | MS=88 |
|
| NA | Hospital/Rehabilitation ward | HADS | Criterion validity, construct validity |
| Nyenhuis et al. (1995) [ | MS=84, DEP=101, controls (MS matched)=87 | MS: | NA | EDSS = 4.74 ( | Community based | BDI, MDI | Construct validity |
| Pandya et al. (2005) [ | MS=47 |
| NA | EDSS = 3.0 | University hospital/referrals to Psychiatric care | CES-D | Criterion validity, construct validity |
| Patten et al. (2005) [ | MS=567 |
| NA | NA | University Hospital | CES-D | Construct validity |
| Patten et al. (2010) [ | year 0 | 15.9% aged 18–34, 29.5% aged 35–44, 33.6% aged 45–54, 21.0% aged 55+; 77.1% female | NA | EDSS (mode) = 4 ( | University Hospital | CES-D | Test-retest reliability |
| Sjonnesen et al. (2012) [ | MS=173, Controls (general population)=3304 | MS: |
| 27.1% (46/170) unable to work | Patient Registry/ Hospital | PHQ-9 | Content validity, construct validity, internal reliability |
| Solari et al. (2003) [ | MS=213, Healthy controls (matched to MS)=213, DEP=32 | MS: |
| EDSS = 2.9 ( | University Hospital | CMDI | Internal reliability, test-retest reliability, content validity, construct validity, criterion validity |
| Strober and Arnett (2010) [ | MS-DEP=17, MS-NON-DEP=67, healthy controls=22 | MS-DEP: |
| EDSS depressed = 5.18 ( | National Multiple Sclerosis Society charter | mBDI | Construct validity |
| Sullivan et al. (1995) [ | MS=46 |
| NA | NA | Hospital/ referrals | BDI | Criterion validity, construct validity |
| Vahter et al. (2007) [ | MS=134 |
|
| EDSS = 5.8 ( | Hospital | One-item measure | Criterion validity |
| Verdier-Taillefer et al. (2001) [ | MS=857, GP patients=1598, healthy workers=403 | MS: | NA | NA | CES-D | Content validity, internal reliability, construct validity |
Note: MS Multiple Sclerosis, MD Major Depression, DEP Depressed, NA Not Available