Ya-Hsin Chou1, Chin-Pang Lee2, Chia-Yih Liu2, Ching-I Hung2. 1. Department of Psychiatry, Chang-Gung Memorial Hospital at Linkou. 2. Department of Psychiatry, Chang-Gung Memorial Hospital at Linkou; School of Medicine, Chang Gung University College of Medicine, Taoyuan, Taiwan.
Abstract
OBJECTIVE: Previous studies of the Depression and Somatic Symptoms Scale (DSSS), a free scale, have been based on the classical test theory, and the construct validity and dimensionality of the DSSS are as yet uncertain. The aim of this study was to use Mokken scale analysis (MSA) to assess the dimensionality of the DSSS. METHODS: A sample of 214 psychiatric outpatients with mood and anxiety disorders were enrolled at a medical center in Taiwan (age: mean [SD] =38.3 [10.5] years; 63.1% female) and asked to complete the DSSS. MSA was used to assess the dimensionality of the DSSS. RESULTS: All 22 items of the DSSS formed a moderate unidimensional scale (Hs =0.403), supporting its construct validity. The DSSS was divided into 4 subscales (Hs ranged from 0.35 to 0.67), including a general somatic scale (GSS), melancholic scale (MS), muscular pain scale (MPS), and chest symptom scale (CSS). The GSS is a weak reliable Mokken scale; the other 3 scales are strong reliable Mokken scales. CONCLUSION: The DSSS is a psychometrically sound measure of depression and somatic symptoms in adult psychiatric outpatients with depression or anxiety. The summed score of the DSSS and its 4 subscales are valid statistics. Further research is required for replication of the 4 subscales of the DSSS.
OBJECTIVE: Previous studies of the Depression and Somatic Symptoms Scale (DSSS), a free scale, have been based on the classical test theory, and the construct validity and dimensionality of the DSSS are as yet uncertain. The aim of this study was to use Mokken scale analysis (MSA) to assess the dimensionality of the DSSS. METHODS: A sample of 214 psychiatric outpatients with mood and anxiety disorders were enrolled at a medical center in Taiwan (age: mean [SD] =38.3 [10.5] years; 63.1% female) and asked to complete the DSSS. MSA was used to assess the dimensionality of the DSSS. RESULTS: All 22 items of the DSSS formed a moderate unidimensional scale (Hs =0.403), supporting its construct validity. The DSSS was divided into 4 subscales (Hs ranged from 0.35 to 0.67), including a general somatic scale (GSS), melancholic scale (MS), muscular pain scale (MPS), and chest symptom scale (CSS). The GSS is a weak reliable Mokken scale; the other 3 scales are strong reliable Mokken scales. CONCLUSION: The DSSS is a psychometrically sound measure of depression and somatic symptoms in adult psychiatric outpatients with depression or anxiety. The summed score of the DSSS and its 4 subscales are valid statistics. Further research is required for replication of the 4 subscales of the DSSS.
Somatic symptoms among patients with depression are important;1,2 they may confound the diagnosis of depression,3 are often residual symptoms of depression, and might increase the risk of relapse.4,5 Somatic symptoms in patients with major depressive disorder are also associated with a negative treatment outcome and a poor quality of life.6,7 Although somatic symptoms have a negative impact on depression, most conventional scales for depression do not include appropriate items related to somatic symptoms, which may hinder the monitoring, evaluation, and quantification of somatic symptoms.8 For these reasons, the Depression and Somatic Symptoms Scale (DSSS) was developed.8,9 The DSSS is a free scale that can be used to evaluate depression and somatic severity simultaneously. The DSSS and its subscales are significantly correlated with the Hamilton Depression Rating Scale (HAMD) as well as the mental and physical subscale scores of a health-related quality of life scale.9 It is also sensitive to pharmacotherapy.9 Moreover, the predictive ability of the DSSS for the prognosis of depression is not inferior to that of the HAMD.10 Furthermore, previous studies have demonstrated that the DSSS has not only a good reliability, but also acceptable convergent, factorial, and discriminative validities.9–11Summed scores assume that all items are equally correlated with the measured underlying construct; in addition, the point intervals are equal on the scale. However, these assumptions are not always true.12 For example, the items on the DSSS are not linear and continuous measurements, which means that calculating item scores might be meaningless.There are 2 different approaches for evaluation of the psychometric properties of rating scales: the classical test theory (CTT) and the item response theory (IRT).12 The limitations of CTT include the summed score problems and sample- dependent statistics, which may result in different psychometric properties when based on different samples.13,14IRT provides item-level statistics that are not influenced by differences between samples.12–14 IRT assumes that scale items can be ordered along levels of a latent trait, with item “difficulty” demonstrating whether items are difficult (rare) or less difficult (common).15Mokken scale analysis (MSA) is a nonparametric form of IRT derived from Guttman scaling.16–18 On a Guttman scale, a single response can be used to predict responses to all items on the scale.16 In the field of health construct measurements, MSA can be used to scrutinize the appropriateness and performance of the measurements.19 Being a nonparametric analytical method, MSA is robust according to the underlying distribution of the data, avoiding the methodological limitations of previous studies.To the best of our knowledge, no study has used MSA to examine the psychometric properties of the DSSS. Therefore, the purposes of this study were as follows: 1) to examine the construct validity of the DSSS; 2) to assess the dimensionality of the DSSS; and 3) to examine the item hierarchy of the DSSS and determine whether the DSSS has an invariant item ordering (IIO) property, which means that the items of a scale have the same difficulty ordering.
Methods
Participants
This was a secondary analysis of data obtained for a cross- sectional study conducted at the Chang-Gung Memorial Hospital, Linkou, which is a tertiary medical center in Northern Taiwan. Participants were enrolled in the psychiatricoutpatient clinic between September 2007 and August 2009. Three inclusion criteria were established: (1) aged between 20 and 60 years; (2) consecutive outpatients with depression or anxiety; (3) patients who had not taken antidepressants within the previous 4 weeks. Four exclusion criteria were as follows: 1) a history of substance dependence or abuse without full remission in the index month; 2) psychotic disorders, such as schizophrenia, delusional disorder, and other psychotic disorders; 3) dementia, delirium, mental retardation, and mental disorders due to general medical conditions; and 4) patients with psychotic symptoms, catatonic features, severe psychomotor retardation, or a current manic episode in the previous month, which may cause difficulty in completing self-administered scales or cooperating with the study process. In total, we recruited 214 participants (age: mean [SD] =38.3 [10.5] years; 63.1% female) in this study. Table 1 shows the demographic characteristics of the sample.
Table 1
Demographic and clinical characteristics of the sample (N=214)
Characteristics
Statistics
Range
Age, mean (SD), years
38.3 (10.5)
20–58
Female, n (%)
135 (63.1)
Education, mean (SD), years
12.5 (2.9)
0–18
Paid employment, n (%)
142 (66.4)
Married, n (%)
121 (56.5)
With current major depressive episode, n (%)
95 (44.4)
Procedures
This study was approved by the Institutional Review Board of Chang-Gung Memorial Hospital. Participants provided explicit written informed consent to participate in the study. Participants were interviewed by a senior board-certified psychiatrist (C-I Hung), and psychiatric diagnoses were made according to the Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders (DSM)-IV–text revision (TR) Axis I Disorders (SCID-I).20 Patients also completed the DSSS during their intake visits. All information was protected by delinking identifying information from main data sets and sources, and the data were available only to the investigators.
The DSSS
The DSSS is a self-administered scale. It is composed of 12 items in the Depression Subscale (DS) and 10 items in the Somatic Subscale (SS), the latter including 5 pain and 5 nonpain somatic symptoms.9 Each question is rated on a 4-point Likert scale from 0 to 3, and the total score ranges from 0 to 66.
Statistical analysis
All analyses were conducted in R version 3.3.0 (R Foundation for Statistical Computing, Vienna, Austria). The P-values were 2 tailed, and the significance level was set at 0.05. This method was also applied in a previous publication.21
MSA
MSA consists of 2 parts: 1) an automated item selection algorithm that partitions a set of ordinal items into Mokken scales, possibly leaving some items unselected; and 2) methods to investigate assumptions of nonparametric IRT models. The underlying assumptions of Mokken models are unidimensionality, local independence, and latent monotonicity.16 Unidimensionality means that the scale under consideration measures a single latent trait θ. Local independence means that one’s response to an item is not affected by one’s responses to the other items in the same test.22 Latent monotonicity assumes that for each item, the probability of a particular response level is a monotonically nondecreasing function of the latent trait θ.19The 2 main Mokken models are as follows: the monotone homogeneity model (MHM) and the double monotonicity model (DMM).16 The MHM assumes unidimensionality, monotonicity, and local independence of the items within a scale. If these assumptions are met, respondents can be ordered according to the simple sum score of items.16,23,24 In addition to the features of the MHM, the DMM has the property of IIO.22IIO refers to items that have the same “difficulty ordering” irrespective of the value of the latent trait.23–25 The ordering of items is based on item difficulty and shows whether items are difficult (rare) or less difficult (common). IIO is therefore important in order to establish a scale hierarchy that is replicable across samples.19 The IIO property can be expected to hold in any subgroup from the same population and thus is considered to be in some sense “person free”.MSA was conducted using the mokken package.23,24 First, the assumption of unidimensionality was checked using Loevinger’s scalability coefficients H.16,23,26,27 Loevinger’s scalability coefficients comprise 3 indexes: item-pair (H), item (H), and scale (H) scalability coefficients. If H=1, then the scale is a perfect Guttman scale. The MHM implies that these 3 indexes should be between 0 and 1; higher H values hint a higher item discrimination power. The rules of thumb for Loevinger’s scalability coefficients are described as follows:A scale is considered weak if 0.3≤H<0.4, moderate if 0.4≤H<0.5, and strong if H ≥0.5.16,28First, an automated item selection procedure with a genetic algorithm was used to automatically search for a set of items with established cutoffs of H (lower bound c),19,29 which started from 0.30, increased subsequently in steps of 0.05, and stopped at 0.60.16,17,23,27 By executing multiple analyses in this sequence, the strategy can provide important insights into the relationships among items. A scale that contains fewer than 4 items was considered unfavorable.19 Second, to identify positive and negative dependence in the MHM, the assumption of local independence was checked using a conditional association procedure.29–31 Locally dependent items were removed. Third, the assumption of monotonicity was checked using item-rest regression.16,17,23,27,32 If there were violations of monotonicity, then their seriousness could be assessed via consideration of the crit statistic. Items with a crit statistic ≤40 can be safely included in any Mokken scale.27 Fourth, the method restscore was used to check the assumption of nonintersection.15,23,27 If there were violations of nonintersection, then their seriousness could be assessed via consideration of the crit statistic: crit values <40 indicate no serious violation; crit values between 40 and 80 indicate minor violations; and crit values >80 indicate serious violations.27 Finally, the method manifest IIO was used to check IIO.24 We used backward selection to remove items violating IIO. If there were an equal number of violations for ≥2 items, the item with the lowest scalability was removed. Subsequently, these selected items were checked for accuracy in IIO using the statistic H.33 If manifest IIO holds, then 0.30.5 as a strong ordering. Molenaar and Sijtsma’s ρ was calculated to measure the reliability of the Mokken scales.27 A scale with ρ>0.7 was considered highly reliable.34
Results
The demographic and clinical characteristics of the sample are shown in Table 1. The age range of the subjects was between 20 and 58 years; the range of years of education was between 0 and 18 years.
Results of MSA
All of the corresponding item scalability coefficients (H) of the 22 items were larger than 0.3, and the scale scalability H of the DSSS was 0.40, representing a moderate Guttman scale. The reliability of the DSSS was excellent (ρ=0.92). There was no violation of local independence for any of the 22 items. The items also demonstrated good monotonicity without violation of monotonicity for any of the DSSS items. Among the 22 items of the DSSS, only Item 8 had serious violations of the assumption of nonintersection (crit =83); Item 8 was therefore removed. The backward item selection procedure for the other 21 items revealed that Items 6 and 7 should be removed. The remaining 19 items formed a reliable moderate Mokken scale without IIO (H=0.41, H=0.17, ρ=0.92), which fit MHM but still did not fit DMM.Thus, we retained all 22 items to further explore the dimensions of the DSSS, and iterative automated item selection procedures were executed with lower bound c, which started from 0.30 and increased to 0.60 in increments of 0.05 (Table 2). Except for c=0.50 or 0.55, all other solutions included subscales that contained only 2 items. The solutions of c=0.50 and 0.55 were similar. So, the final solution to the Mokken scaling was set at c=0.55. Regarding the dimensionality of the DSSS, 4 reliable Mokken scales were identified (Table 3).
Table 2
Number of items in each subscale during the iterative automatic item selection procedures
Lower bound, ca
0.30
0.35
0.40
0.45
0.50
0.55
0.60
Subscale 1
22
3
2
3
6
7
9
Subscale 2
0
19
14
13
8
7
5
Subscale 3
0
0
2
7
4
4
2
Subscale 4
0
0
2
2
4
4
4
Subscale 5
0
0
2
0
0
0
2
Notes:
Cutoff values of H. H, item scalability coefficient.
Table 3
Four Mokken scales of the Depression and Somatic Symptoms Scale
Subscale
Item number
Label
Mean
Hi
Critical violationa
General somatic scale (GSS):Hs=0.35; ρ=0.76; HT=0.19
4
Insomnia
1.97
0.32
0
16
Anxious or nervous
1.85
0.39
0
6
Irritable mood
1.73
0.36
0
9
Dizziness
1.36
0.41
0
14
Loss of interest in sex
1.33
0.29
0
1
Headache
1.15
0.35
0
22
Decreased appetite or loss of appetite
1.12
0.32
0
Melancholic scale (MS):Hs=0.60; ρ =0.90; HT=0.27b
8
Unable to feel happy or decreased ability to feel happy
The number of significant violations of invariant item ordering (IIO) was calculated using the method of manifest IIO (MIIO). If 2 or more items had the maximum number of violations, then the item producing the lowest value of Loevinger’s H was removed. If there were an equal number of violations for 2 or more items, then the item with the lowest scalability (H) was removed.
Coefficient H for the remaining items.
The items were removed by the backward item selection procedure. H, item scalability coefficient; H, scale scalability coefficient. 0.3
0.7 was considered highly reliable scale.
Subscales
The first subscale, the general somatic scale (GSS), consisted of 7 items mainly concerning vegetative symptoms, headache, dizziness, and anxious/irritable mood. These symptoms are nonspecific to depression and are present in various physical and mental disorders. The GSS was a weak reliable Mokken scale (H=0.35, ρ=0.76). There was no violation of local independence for any of the 7 items; the items also all demonstrated good monotonicity without violation. All 7 items had either no serious violations or minor violations of the assumption of nonintersection. All 7 items were retained in the backward item selection procedure. The GSS did not demonstrate the IIO property (H=0.19).The second subscale, the melancholic scale (MS), consisted of 7 items and corresponded to the core symptoms of depression, as listed in the DSM-5 or International Statistical Classification of Diseases and Related Health Problems (ICD)-10 criteria of major depressive disorder. The MS was a strong Mokken scale (H=0.60, ρ=0.90). There was no violation of local independence for any of the 7 items, which also all demonstrated good monotonicity without any violation. All 7 items had no serious violations of the assumption of nonintersection. Item 10 was removed in the backward item selection procedure. The remaining 6 items formed a strong Mokken scale and did not demonstrate the IIO property (H=0.59, H=0.27, ρ=0.88).The third subscale, the muscular pain scale (MPS), consisted of 4 items related to painful symptoms. The MPS was a reliable strong Mokken scale (H=0.67, ρ=0.87). There was no violation of local independence for any of the 4 items, and all items also demonstrated good monotonicity without any violation. All 4 items had no serious violations of the assumption of nonintersection. Item 7 was removed in the backward item selection procedure. The remaining 3 items formed a strong Mokken scale with a moderate IIO property (H=0.70, H=0.41, ρ=0.85).The final subscale, the chest symptom scale (CSS), consisted of 4 items mainly concerning cardiorespiratory symptoms. This was a reliable strong Mokken scale (H=0.57, ρ=0.84). There was no violation of local independence for any of the 4 items, and all items also demonstrated good monotonicity without any violation. All 4 items had no serious violations of the assumption of nonintersection. All 4 items were retained in the backward item selection procedure. The CSS had a weak IIO property (H=0.35).
Discussion
Our study demonstrated that the 22 items of the DSSS formed a reliable moderate unidimensional scale, which met the criteria for a MHM. The simple sum score of these 22 items within the scale can be used for ordinal personal measurement of depression in psychiatric adult outpatients with depression or anxiety. This finding supports the construct validity of the DSSS. The finding of unidimensionality was different from that of the validation study of the DSSS, which, using exploratory factor analysis (EFA), found a 2-factor solution.9 As an exploratory study in nature, our study provides evidence to show that the sum scores of the DSSS and its 4 subscales are valid statistics. This study does not support the original bidimensional scoring. It is noteworthy that, being a parametric approach, the EFA is not a robust statistical method when its underlying assumption is not fulfilled. For example, the EFA requires the assumption of a normal distribution, which is frequently unrealistic with Likert-type scale data.35 Further research using confirmatory factor analysis is required to verify whether the DSSS is uni- or bidimensional.Using Mokken analysis, we identified 4 reliable Mokken scales among the 22 items of the DSSS. All 4 subscales of the DSSS met the MHM criteria, and 2 (the MPS and the CSS) further met the DHM criteria, indicating that their sum scores were sufficiently useful statistics. However, the GSS had a H of 0.35, whereas the other 3 subscales obtained H values of >0.50. Regarding item reduction, these 3 subscales (the MS, the CSS, and the MPS) could form a shorter scale with better scalability than the original DSSS.Our findings suggested that the MS might be particularly clinically relevant. The items in the MS were similar to those in the 6-item HAMD (HAMD-6), consisting of depressed mood, guilt feelings, work and interests, psychomotor retardation, psychic anxiety, and tiredness/pain. Recent studies have shown that the HAMD-6 is a strong unidimensional scale and is more suitable as an outcome measure than the traditional 17-item HAMD (HAMD-17).35–37 In this regard, the MS might be used along with the HAMD-6 when evaluating the effect of antidepressant therapy.35,37The GSS seemed less useful in measuring the severity of depression. The 7 items of the GSS cover to some degree the neurovegetative symptoms of depression such as appetite, sleep, sex, and anxiety symptoms. These items are traditionally considered to be atypical depressive features.38 Regarding measurement of atypical depressive features, they are covered much more accurately by the Inventory of Depressive Symptomatology.39Our study demonstrated IIO in the CSS and the MPS. Such item hierarchy will assist clinicians to more efficiently assess somatic symptoms. For example, Items 3 (chest tightness) and 13 (neck or shoulder pain) could be used as screening questions related to somatic symptoms. Patients who achieved high scores on Items 11 (chest pain) and 17 (soreness in more than half of the body’s muscles) probably suffered significant chest discomfort and muscular pain.Our study had several strengths. We used a validated questionnaire to measure depression and had a full range of data available for all participants. We applied MSA to assess the construct validity and dimensionality of the DSSS.Our study also had several limitations. First, this study was a cross-sectional analysis of a single-site sample. Future research should attempt to replicate these findings to determine whether the 4 subscales of the DSSS are reasonable and useful. Second, our sample only contained adult psychiatric outpatients with depression or anxiety. Further research is needed to clarify the utility of the DSSS for other psychiatric disorders. Third, this study used data that were not obtained specifically for MSA. Therefore, the study would merit replication with a larger sample size.
Conclusion
The DSSS is a psychometrically sound measure of depression in adult psychiatric outpatients with depression or anxiety. The DSSS may further be divided into 4 Mokken scales, 2 of which had IIO properties. Future research should be performed to attempt to replicate the dimensionality and to determine whether similar items demonstrate IIO.
Authors: Matthew J Bair; Rebecca L Robinson; George J Eckert; Paul E Stang; Thomas W Croghan; Kurt Kroenke Journal: Psychosom Med Date: 2004 Jan-Feb Impact factor: 4.312