Literature DB >> 35896905

Assessing sleep and pain among adults with myalgic encephalomyelitis/chronic fatigue syndrome: psychometric evaluation of the PROMIS® sleep and pain short forms.

Manshu Yang1, San Keller2, Jin-Mann S Lin3.   

Abstract

PURPOSE: To evaluate the psychometric properties of the patient-reported outcome measurement information system® (PROMIS) short forms for assessing sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among adults with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS).
METHODS: Data came from the Multi-Site ME/CFS study conducted between 2012 and 2020 at seven ME/CFS specialty clinics across the USA. Baseline and follow-up data from ME/CFS and healthy control (HC) groups were used to examine ceiling/floor effects, internal consistency reliability, differential item functioning (DIF), known-groups validity, and responsiveness.
RESULTS: A total of 945 participants completed the baseline assessment (602 ME/CFS and 338 HC) and 441 ME/CFS also completed the follow-up. The baseline mean T-scores of PROMIS sleep and pain measures ranged from 57.68 to 62.40, about one standard deviation above the national norm (T-score = 50). All four measures showed high internal consistency (ω = 0.92 to 0.97) and no substantial floor/ceiling effects. No DIF was detected by age or sex. Known-groups comparisons among ME/CFS groups with low, medium, and high functional impairment showed significant small-sized differences in scores (η2 = 0.01 to 0.05) for the two sleep measures and small-to-medium-sized differences (η2 = 0.01 to 0.15) for the two pain measures. ME/CFS participants had significantly worse scores than HC (η2 = 0.35 to 0.45) for all four measures. Given the non-interventional nature of the study, responsiveness was evaluated as sensitivity to change over time and the pain interference measure showed an acceptable sensitivity.
CONCLUSION: The PROMIS sleep and pain measures demonstrated satisfactory psychometric properties supporting their use in ME/CFS research and clinical practice.
© 2022. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Entities:  

Keywords:  Differential item functioning; Internal consistency reliability; Known-groups validity; Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS); Pain; Responsiveness; Sleep

Year:  2022        PMID: 35896905      PMCID: PMC9331042          DOI: 10.1007/s11136-022-03199-8

Source DB:  PubMed          Journal:  Qual Life Res        ISSN: 0962-9343            Impact factor:   3.440


Introduction

Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a debilitating long-term condition that has affected approximately 836,000 to 2.5 million Americans [1-4] yet remains poorly understood by the healthcare community [5, 6]. Individuals with moderate-to-severe ME/CFS are often confined to their homes, while those with very severe disease are mostly bedbound [7, 8]. Consequently, the illness poses tremendous burdens on patients, their caregivers, and the society, costing the US economy $18–$51 billion annually [9-11]. ME/CFS is characterized by inability to perform usual activities and profound fatigue that lasts for 6 months or longer. In addition, post-exertional malaise, sleep problems, and either orthostatic intolerance or cognitive problems are required for diagnosis. Additional symptoms may include pain, headaches, and gastrointestinal issues [12, 13]. Recent research has found that symptoms of Long COVID-19 or post-COVID-19 conditions were similar to those of ME/CFS [14-18]; thus, ME/CFS research may provide insights useful to the study of Long COVID-19 and vice versa. Identifying valid measures to characterize and track ME/CFS is a critical step toward understanding this illness and other post-infectious fatiguing conditions. The multi-site clinical assessment of myalgic encephalomyelitis/chronic fatigue syndrome (MCAM) study [6], aiming to improve how ME/CFS symptoms and their impact on quality of life could be measured, collected data using the Patient-Reported Outcome Measurement Information System® (PROMIS®) short form measures. While an earlier study supported the validity of the PROMIS Fatigue short form for ME/CFS [19], the illness presentation is known to be broader than the fatigue domain alone. Research and clinical case definitions for ME/CFS [12, 20–23] consistently identify sleep problems as one of the required symptoms for ME/CFS diagnosis, thus it is crucial to identify and evaluate a measure that assesses both sleep quality and its impact on quality of life for ME/CFS, with standardized scores comparable across studies. Findings have been mixed regarding the type and severity of sleep problems in ME/CFS [24, 25], largely due to a lack of standard measures that can be consistently adopted. Research using various self-report measures found that 87–95% of ME/CFS patients reported unrefreshing sleep [1, 25–28], and ME/CFS patients showed significantly poorer sleep quality and more daytime dysfunction compared to healthy controls [29-31]. In contrast, studies implementing objective sleep measures [32] often did not observe more sleep-related difficulties in those with ME/CFS than in healthy controls [33-35]. Pain is another characteristic of ME/CFS listed as either a required or additional symptom in case definitions and commonly reported by patients [36, 37]. Research found that 80–94% of people with ME/CFS experience some type of pain [1, 38–40]. Importantly, other than fatigue, pain has been identified as the most troublesome ME/CFS symptom, with 65% of severely ill patients identifying pain as the top three most troublesome [14]. Among people with ME/CFS, research found that pain was associated with reduced functioning and quality of life [41]; if accompanied with depression, pain was also associated with anxiety [42]. Several measures have been used in clinical settings to evaluate ME/CFS pain [43-49]; however, such measures often focus on the frequency and severity of pain [37], while the specific consequences of pain on daily functioning and interference with life have not been fully assessed. The primary goal of the present study is to evaluate the psychometric properties of four PROMIS® short forms to fully describe the experiences of sleep problems and pain for people suffering from ME/CFS. Using item response theory (IRT), the PROMIS short forms produce precise scores while reducing respondents’ burden. Moreover, the standardized PROMIS scores enable comparisons across studies or patient populations, which help bring meanings to the scores in both clinical and research settings.

Methods

Data source and study sample

Data came from the MCAM study [6]—a multiple-stage study with a rolling cohort design to examine the heterogeneity in patients. MCAM participants were recruited between 2012 and 2020 from ME/CFS specialty clinics across seven US states (CA, FL, NC, NJ, NV, NY, and UT). Patient eligibility was based on ME/CFS expert clinician diagnosis of the illness. Participants were aged 18–70 years at their baseline enrollment and had been diagnosed with CFS, ME, or post-infectious fatigue or managed as other ME/CFS patients in the clinical practice. The study was approved by the Institutional Review Boards of the Centers for Disease Control and Prevention and participating clinics. Of 602 participants with ME/CFS who completed the baseline assessment, 441 also completed a follow-up assessment approximately 10 to 14 months later. No specific intervention was delivered to participants between the baseline and follow-up assessments. Most of the analyses reported here were conducted using baseline data from ME/CFS participants, whereas both baseline and follow-up data were used to evaluate the responsiveness of the measures over time. In addition, baseline data from 338 healthy controls (HC) were used to examine known-groups validity.

Measures

Four PROMIS short forms related to sleep and pain were administered: web-based platforms were used at five clinics and paper forms at the other two clinics. Based on previous literature [50], we assumed comparable responses between electronic surveys and paper-and-pencil surveys. PROMIS was developed using a mixed-method approach and calibrated all items to 2000 US Census population using IRT methods [51] on a T-score metric where a score of 50 represents the mean score of the US general population and 10 is the standard deviation [52].

PROMIS sleep short forms

We administered PROMIS v1.0 sleep measures: the Sleep Disturbance Short Form 8b (SD-SF) and the Sleep-Related Impairment Short Form 8a (SRI-SF), both with eight items [53]. All items have response options on a five-point Likert scale. Four items from SD-SF and one item from SRI-SF were reverse coded so that higher scores indicate greater sleep disturbance or related impairment.

PROMIS pain short forms

We administered PROMIS v1.0 pain measures: the Pain Interference Short Form 6b (PI-SF, six items with five-point Likert scale) [54] and the Pain Behavior Short Form 7a (PB-SF, seven items with six-point Likert scale) [55]. For all items, higher scores indicate more impairment.

Patient characteristics

We collected data on the number of hours spent in vertical or horizontal activities per day and physical health measured by the SF-36 Health Survey (SF-36). These questions were administered to assess ME/CFS functional impairment along with the PROMIS sleep and pain measures at baseline and follow-up. We used these measures to define groups of participants differing in functional impairment for evaluating known-groups validity and responsiveness. Additionally, age and sex were used in the differential item functioning (DIF) analysis.

Analyses

Analyses were conducted using SAS Version 9.4 [56], Mplus Version 8.6 [57], and IRTPRO Version 5.1 [58].

Descriptive statistics

For each PROMIS sleep/pain short form, we calculated the mean and standard deviation of raw sum scores and examined the proportion of participants with ME/CFS at the lowest or highest possible score. A floor or ceiling effect was defined as 15% or more of respondents having the lowest or highest score [59, 60].

Unidimensionality

We evaluated the scale unidimensionality by fitting a 1-factor confirmatory factor analysis (CFA) model separately to each sleep/pain short form, assuming categorical indicators and using the WLSMV estimator [57]. If model fit was poor, exploratory factor analysis (EFA) and bi-factor CFA were further conducted to explore whether scales were at least essentially unidimensional [61-63]. Model fit was tested, with the Comparative Fit Index (CFI) > 0.95, the Tucker–Lewis Index (TLI) > 0.95, and the Root Mean Square Error of Approximation (RMSEA) < 0.06 considered as good fit [64].

IRT scoring

We used two-parameter, graded response models [65] (a type of IRT model) to obtain PROMIS T-scores for participants with ME/CFS and HC at baseline and at follow-up. Item parameters were fixed at the published values from PROMIS item banks v1.0, which were calibrated using a large sample representing 2000 US Census population. We employed response pattern scoring using IRTPRO Version 5.1 [58], based on item parameters available on Assessment Center (https://www.assessmentcenter.net/).

Reliability of scores

Internal consistency reliability for each of the four PROMIS sleep and pain measures was evaluated using Cronbach’s standardized alpha coefficient, categorical omega coefficient [66], and the amount of measurement error in the T-score under IRT. We computed omega based on item loadings and thresholds from 1-factor or bi-factor CFA models via Bayesian estimation [67]. The omega coefficient provides a more accurate estimate of internal consistency than alpha because it makes more realistic assumptions about the measurement models (e.g., does not assume equal factor loadings of all items or uncorrelated error variances) [68-70]. Although a reliability coefficient > 0.70 (or a T-score standard error of measurement [T-SEM] < 5.5) is considered acceptable for group-level analyses, a reliability coefficient > 0.90 (or a T-SEM < 3.2) is needed for precisely assessing individual patients [71, 72]. We hypothesized that, consistent with previous findings with other health conditions [73-77], the PROMIS sleep and pain scores among participants with ME/CFS would have reliability estimates exceeding those recommended for individual-level comparisons (i.e., omega > 0.90 and T-SEM < 3.2).

DIF analysis

We conducted DIF analysis to detect potential measurement bias across population subgroups differing in sex and age. Evidence of DIF occurs when respondent subgroups (e.g., male vs. female) differ in their probabilities of endorsing an item response category after controlling for the underlying trait being measured. DIF suggests that item score differences between subgroups (e.g., male vs. female) may be merely due to group membership or different interpretations of the item content, rather than reflecting true differences in the trait being measured. We examined the possibility of DIF by sex and age for each item in the four PROMIS sleep and pain short forms using the Wald test [78, 79]. The measurement properties of each item were compared across three age groups: 18–39, 40–59, and 60 or above, which allowed a sufficiently large number of participants in each group to support this analysis. Patients ages 40–59 were specifically combined as a category since ME/CFS is more prevalent in this age range [80, 81]. Two DIF comparisons were made for ages 18–39 vs. ages 40+ and ages 40–59 vs. ages 60+. For the Wald test, a non-significant χ2 value indicates no detectable DIF. We used the Benjamin–Hochberg [82, 83] procedure to control for the multiple comparisons involved in checking DIF for each item by sex and age. We hypothesized that there would be no evidence of DIF for items from the four PROMIS sleep and pain short forms in this sample of participants with ME/CFS.

Known-groups validity

Known-groups validity of each PROMIS sleep/pain short form was evaluated by comparing T-scores for groups that are known to differ in their trait levels. We hypothesized that the ME/CFS group with higher levels of functional impairment would have PROMIS scores indicating greater sleep disturbance/impairment or pain interference/behavior; and that the ME/CFS group would exhibit greater sleep disturbance/impairment or pain interference/behavior than HC. We measured the level of functional impairment based on three variables, including (1) hours spent in vertical activities (e.g., sitting, standing, or walking) per day, (2) hours spent in horizontal activities (e.g., resting in recliner with feet up, napping, sleeping in bed) per day, and (3) overall physical health (measured by the Physical Component Summary [PCS] T-scores of the SF-36). Using three different types of measures allows us to better define the known-groups with the former two measures specific to ME/CFS and commonly used by ME/CFS expert clinicians [84], while the third measure capturing generic functional status. Fewer hours of vertical activities, more hours of horizontal activities, and lower SF-36 PCS scores indicate more functional impairment. Additional detail regarding these measures is provided in the supplement materials of [6] and [19]. For vertical activities and physical health, participants with ME/CFS were divided into three similar-sized groups with “low,” “medium,” and “high” level of impairment based on tertiles. For horizontal activities, we divided ME/CFS participants into only two groups (15 h vs. < 15 h of horizontal activities per day), because too many participants (about 47%) were at the “ceiling” of 15 h of horizontal activities per day [19]. Analysis of variance (ANOVA) was used to examine mean differences in PROMIS T-scores for SD-SF, SRI-SF, PI-SF, and PB-SF among known-groups defined by the three aforementioned variables for participants with ME/CFS and between the ME/CFS and HC groups. The Tukey–Kramer method [85] was adopted for multiple comparison adjustment among known-groups. Known-groups validity was considered acceptable when the difference in mean T-scores across groups was significant at α = 0.05. We interpreted the size of these differences using η2 (dividing the sum of squares for the known-groups effect by the total sum of squares): following convention, η2 around 0.01, 0.09, and 0.25 represents small, medium, and large effect, respectively [86, 87].

Responsiveness

Responsiveness, or sensitivity to change, represents how well a measure’s scores reflect changes over time when true changes occur. We hypothesized that participants with ME/CFS with “improved,” “stable,” and “worsened” health would show a significant decrease, no significant changes, and a significant increase in their PROMIS sleep/pain scores, respectively. We initially categorized ME/CFS participants into three groups of “improved,” “stable,” and “worsened” using the three aforementioned measures of functional status. As detailed in [19], for vertical/horizontal activities, “stable” participants were defined as those who had < 1 h of change from baseline to follow-up, because time spent in these activities was reported in integer hours and there was no established threshold for the minimal clinically important difference (MCID). In terms of SF-36 PCS T-score, “stable” participants were those with ≤ 5 points of change. Previous literature suggests MCIDs of 2.5 to about 7 for SF-36 PCS across different patient populations [88-90], but its MCID for ME/CFS has not been established. Therefore, we used the half standard deviation approach [91] and chose an MCID of 5 as the threshold of categorizing participants when assessing the responsiveness. For horizontal activities, we further combined the “stable” and “worsened” groups into a group of “not improved” and compared those who “improved” versus “not improved.” This is because over 77% of “stable” participants were at the worst possible functional status (i.e., 15 h of daily horizontal activities) at both baseline and follow-up. Such participants may not be truly stable, as they might have experienced a worsening in horizontal activities that was undetectable. We used ANOVA to examine if changes in PROMIS sleep/pain scores significantly differed among the “improved,” “stable,” and “worsened” groups (or between the “improved” and “not improved” groups for horizontal activities). Additionally, we calculated Guyatt’s responsiveness statistic (GRS) to describe the effect size comparing the “improved” groups to the “stable,” “worsened,” or “not improved” groups. The GRS is the mean change in PROMIS sleep/pain scores for the target group (i.e., “improved”) divided by the standard deviation of the comparison group (e.g., “not improved”) [92] and is interpreted as small (≥ 0.2 and < 0.5), medium (≥ 0.5 and < 0.8), and large (≥ 0.8) [86].

Results

Most participants with ME/CFS were female (72.6%), white (88.4%), and not currently working (70.1%) (Table 1). Their mean age was 48.0 years, with an average illness duration of 14.2 years and more than half of them had sudden illness onset.
Table 1

Sample characteristics at baseline

ME/CFS (n = 602)HC (n = 338)
n%n%
Sex
 Female43772.622265.7
 Male16527.411534.0
 Missing0010.3
Race
 White53288.419256.8
 Black/African American122.0226.5
 All others284.68826.0
 Missing305.03610.7
Marital status
 Married/committed31652.516849.7
 Previously married10417.36017.7
 Never married17128.410129.9
 Missing111.892.7
Employment
 Full-time9115.117752.4
 Part-time6310.56017.8
 Not working42270.18926.3
 Missing264.3123.5
Educational attainment
 Less than high school40.751.5
 High school graduate13121.89829.0
 College graduate23739.412637.3
 Post college21135.010029.6
 Missing193.192.6
Illness onset status
 Gradual19231.9N.AN.A
 Sudden33054.8N.AN.A
 Missing8013.3N.AN.A

ME/CFS myalgic encephalomyelitis/chronic fatigue syndrome, HC healthy control, SD standard deviation

Sample characteristics at baseline ME/CFS myalgic encephalomyelitis/chronic fatigue syndrome, HC healthy control, SD standard deviation

Descriptive statistics

Table 2 summarizes the distribution of raw sum scores for each PROMIS sleep/pain short form among participants with ME/CFS. The proportions of participants at the highest possible or lowest possible impairment score were all below the threshold of 15%, suggesting no substantial floor/ceiling effects. Approximately 10% of participants reported “not at all/never” on all the pain interference items or reported “I had no pain” on all the pain behavior items, which indicated that a subgroup of ME/CFS participants may not have suffered from pain at the time that they answered the questions.
Table 2

Measure-level raw sum score distributions for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study

MeasureNMeanSDParticipants at the lowest possible raw sum scoreParticipants at the highest possible raw sum score
n%n%

PROMIS sleep disturbance

(raw sum score 8–40)

58328.306.9410.2244.0

PROMIS sleep-related impairment

(raw sum score 8–40)

57726.677.1320.3213.5

PROMIS pain interference

(raw sum score 6–30)

58518.837.496110.1355.8

PROMIS pain behavior

(raw sum score 7–42)

57823.687.466010.000.0

SD standard deviation

Measure-level raw sum score distributions for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study PROMIS sleep disturbance (raw sum score 8–40) PROMIS sleep-related impairment (raw sum score 8–40) PROMIS pain interference (raw sum score 6–30) PROMIS pain behavior (raw sum score 7–42) SD standard deviation

Unidimensionality

As shown in Table 3, the CFI and TLI demonstrated excellent fit (> 0.95) of the pain short forms to a unidimensional CFA model but marginal fit (> 0.90) of the sleep short forms. Therefore, we further conducted EFA and bi-factor analyses for the two sleep short forms and found that all model fit indices were greatly improved to the acceptable ranges. Although the RMSEAs for the pain measures were beyond the published criterion (> 0.06), we considered them as sufficiently unidimensional based on high CFI and TLI values. Moreover, the RMSEA estimates from the current study were consistent with those previously reported for PROMIS measures [54, 55, 93].
Table 3

Model fit indices from the confirmatory factor analyses (CFA) for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study

MeasureCFA modelModel Fit Indices
CFITLIRMSEA
PROMIS sleep disturbance1-Factor0.9220.8900.231
Bi-factor0.9990.9970.040
PROMIS sleep-related impairment1-Factor0.9420.9190.247
Bi-factor0.9990.9980.042
PROMIS pain interference1-Factor0.9970.9950.173
PROMIS pain behavior1-Factor0.9920.9880.113
Threshold of good fit > 0.95 > 0.95 < 0.06

CFI Comparative Fit Index; TLI Tucker–Lewis Index; RMSEA root mean square error of approximation

The values in italics are “thresholds” used to evaluate the model fit indices values reported

Model fit indices from the confirmatory factor analyses (CFA) for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study CFI Comparative Fit Index; TLI Tucker–Lewis Index; RMSEA root mean square error of approximation The values in italics are “thresholds” used to evaluate the model fit indices values reported

IRT scoring

The mean IRT-based T-scores of the four measures ranged from 57.68 to 62.40 for participants with ME/CFS at baseline (see Table 4). As such, the four measures had mean scores about one standard deviation (10 points on the T-score metric) above the national norm (T-score = 50). These scores indicate greater pain and sleep problems compared to the general population, consistent with clinical understanding of the illness and, thus, supporting the validity of using them for ME/CFS.
Table 4

Measure-Level T-score distributions and reliability estimates for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study

MeasurenIRT T-scoreReliability
MeanSDCronbach’s αωaSEM
PROMIS sleep disturbance60259.178.350.880.922.7
PROMIS sleep-related impairment60162.408.730.890.932.5
PROMIS pain interference60162.029.840.970.972.3
PROMIS pain behavior60157.688.930.920.922.1

IRT item response theory, SD standard deviation, SEM standard error of measurement, based on IRT T-scores

aCategorical ω were computed using Green and Yang’s approach [66], based on item loadings and thresholds estimates from 1-factor CFA models (i.e., total omega) for the pain interference and pain behavior short forms and from bi-factor CFA models (i.e., hierarchical omega) for the sleep disturbance and sleep-related impairment short forms

Measure-Level T-score distributions and reliability estimates for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, among participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) at baseline from the multi-site clinical assessment of ME/CFS (MCAM) study IRT item response theory, SD standard deviation, SEM standard error of measurement, based on IRT T-scores aCategorical ω were computed using Green and Yang’s approach [66], based on item loadings and thresholds estimates from 1-factor CFA models (i.e., total omega) for the pain interference and pain behavior short forms and from bi-factor CFA models (i.e., hierarchical omega) for the sleep disturbance and sleep-related impairment short forms

Reliability

As shown in Table 4, among participants with ME/CFS, all four PROMIS sleep/pain measures showed high internal consistency with ω ranging from 0.92 to 0.97. Under the IRT framework, the average standard errors of T-scores ranged from 2.1 to 2.7 (see Table 4), which were well below the hypothesized threshold of 3.2 (corresponding to a reliability of 0.9). Compared to Cronbach’s α (ranging from 0.88 to 0.97 in this study), the ω coefficient and the average standard error of IRT T-score are computed under a more lenient and realistic assumption, allowing each item to be linked to the underlying construct (e.g., pain interference) to differing degree. Therefore, we considered the ω coefficient or the standard error of T-score a more accurate estimate of reliability. Although the α coefficients were slightly below the hypothesized threshold of 0.9 for the two sleep measures, these estimates may reflect an underestimation of internal consistency, as the tau-equivalence assumption (i.e., equal factor loadings for all items) required for computing α was violated, with loadings ranging from 0.62 to 0.89 for sleep disturbance and from 0.57 to 0.93 for sleep-related impairment. Taken together, the four PROMIS measures provided highly reliable scores not only for group-level analysis but also for assessing individual ME/CFS patients.

DIF

Table 5 shows the Wald test results for detecting potential DIF by sex and by age. Before using the Benjamini–Hochberg correction for multiplicity, two PROMIS SD-SF items (“I was satisfied with my sleep” and “My sleep was refreshing”) and one item from PROMIS PB-SF (“When I was in pain I grimaced”) showed possible DIF by age. Another item from PROMIS PI-SF (“How much did pain interfere with your enjoyment of recreational activities?”) showed possible DIF by sex, with a p value < 0.05. However, after correction for multiplicity, none of the items exhibited significant DIF by sex or by age.
Table 5

Differential item functioning (DIF) statistics by sex and by age for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, based on baseline myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) participant data (n = 602) from the multi-site clinical assessment of ME/CFS (MCAM) study

LabelDIF by SexMale vs FemaleDIF by Age18–39 vs. 40+DIF by Age40–59 vs. 60+
χ2dfpχ2dfpχ2dfp
PROMIS sleep disturbance short form
 Item1: restless1.150.957450.5491150.051
 Item2: satisfied sleep4.350.5074.950.42511.450.044
 Item3: refreshing0.950.9732.540.63916.540.002
 Item4: falling asleep0.950.9712.850.7267.750.176
 Item5: staying asleep5.950.3147.550.1855.950.313
 Item6: trouble sleeping7.250.204450.5507.850.169
 Item7: enough sleep8.850.1163.550.6207.950.163
 Item8: sleep quality350.7022.440.6626.140.192
PROMIS sleep-related impairment short from
 Item1: getting things done4.950.4291.550.918550.423
 Item2: alert when woke up9.450.0931.450.9194.550.485
 Item3: tired5.550.3551.440.8437.740.104
 Item4: problems during day1.550.9113.250.667250.845
 Item5: hard time concentrating4.950.4307.450.1891.350.938
 Item6: irritable2.350.8133.450.6343.550.621
 Item7: sleepy during daytime0.450.997450.5559.950.079
 Item8: trouble staying awake8.250.1447.450.1923.850.581
PROMIS pain interference short form
 Item1: enjoyment of life5.250.3981050.0766.450.271
 Item2: ability to concentrate4.150.5397.150.2156.250.290
 Item3: day-to-day activities3.650.6041.450.9244.650.461
 Item4: recreational activities15.850.0081.650.8981.750.888
 Item5: tasks away from home6.250.2865.350.3805.550.358
 Item6: socializing with others550.4211.950.8617.150.211
PROMIS pain behavior short form
 Item1: irritable2.460.87410.960.0926.960.330
 Item2: grimaced3.960.6892.660.86314.860.022
 Item3: moved extremely slowly10.760.0969.360.1597.960.249
 Item4: moved stiffly3.560.7496.360.3883.160.793
 Item5: called out for help5.660.4708.950.1138.550.130
 Item6: isolated from others4.660.5973.860.7103.860.706
 Item7: thrashed0.660.9967.950.1633.550.619
Differential item functioning (DIF) statistics by sex and by age for PROMIS short forms of sleep disturbance, sleep-related impairment, pain interference, and pain behavior, based on baseline myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) participant data (n = 602) from the multi-site clinical assessment of ME/CFS (MCAM) study

Known-groups validity

Results in Tables 6 and 7 show that the omnibus hypothesis of no differences among known-groups was rejected with p < 0.05 for each of the four PROMIS measures (see table footnotes for details), providing supportive evidence for the validity of the sleep and pain short forms for ME/CFS. However, for the two sleep-related PROMIS measures, although mean scores appeared to increase monotonically across low, medium, and high functional impairment groups, the differences between medium and high functional impairment groups were not statistically significant (see Table 6 and its footnotes). For the two pain-related PROMIS measures, we found significant differences for all pairwise comparisons between groups defined by SF-36 PCS scores, with a monotonic increase in mean scores across low, medium, and high functional impairment groups (see Table 7 and its footnotes). When groups were defined by vertical activities, however, we only found significant difference between low and high functional impairment groups but could not differentiate the medium impairment group from the other two groups (see Table 7 and its footnotes). Participants with ME/CFS had significantly higher (i.e., worse) mean scores than HC for all four PROMIS measures. For the two sleep-related PROMIS measures, effect sizes, η2, were small for known-groups defined by functional impairment (i.e., vertical/horizontal activities and SF-36 PCS scores). For the two pain-related PROMIS measures, small-sized and medium-sized effects were found for known-groups defined by vertical/horizontal activities and SF-36 PCS scores, respectively. When comparing ME/CFS to HC participants, effect sizes were large for all four PROMIS measures.
Table 6

Mean T-scores of the PROMIS sleep disturbance (SD) and sleep-related impairment (SRI) short forms (SF), by three indicators of functional impairment level as well as between participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) and healthy controls (HC) from the multi-site clinical assessment of ME/CFS (MCAM) study

Known-groupsPROMIS SD-SFaPROMIS SRI-SFb
nMeanSDFη2cnMeanSDFη2c
ME/CFS functional impairment level defined by hours of vertical activities per day
 Low (≥ 10 h)21258.18.94.00.01421260.69.27.90.026
 Medium (≥ 5 and < 10 h)18758.97.618762.78.4
 High (< 5 h)18460.48.118464.08.1
ME/CFS functional impairment level defined by hours of horizontal activities per day
 Lower (< 15 h)25458.38.64.10.00725461.09.311.60.020
 Higher (15 h)33059.78.033063.48.1
ME/CFS functional impairment level defined by SF-36 PCS score
 Low (T-score ≥ 28.78)19657.28.69.20.03019659.89.313.80.045
 Medium (20.49 ≤ T-score < 28.78)20259.57.820263.78.1
 High (T-score < 20.49)19560.88.319463.88.2
ME/CFS vs. HC
 ME/CFS60259.28.3512.10.35360162.48.7733.30.439
 HC33845.79.533844.611.2

SD standard deviation

aOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 physical component summary (PCS) scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.05); only differences of low vs. medium and low vs. high functional impairment, defined by SF-36 PCS scores, were significant (p < 0.05)

bOverall differences across groups were significant at p < 0.001, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores, only differences of low vs. medium and low vs. high functional impairment were significant (p < 0.05)

cη2 is an effect size measure and was computed by dividing the sum of squares for the known-groups effect by the total sum of squares

Table 7

Mean T-scores of the PROMIS pain interference (PI) and pain behavior (PB) short forms (SF), by three indicators of functional impairment level as well as between participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) and healthy controls (HC) from the multi-site clinical assessment of ME/CFS (MCAM) study

Known-groupsPROMIS PI-SFaPROMIS PB-SFb
nMeanSDFη2cnMeanSDFη2c
ME/CFS functional impairment level defined by hours of vertical activities per day
 Low (≥ 10 h)21260.010.27.40.02521256.19.65.00.017
 Medium (≥ 5 and < 10 h)18762.09.318758.08.4
 High (< 5 h)18463.89.718458.88.6
ME/CFS functional impairment level defined by hours of horizontal activities per day
 Lower (< 15 h)25460.910.44.50.00825456.69.85.30.009
 Higher (15 h)33062.69.433058.38.2
ME/CFS functional impairment level defined by SF-36 PCS score
 Low (T-score ≥ 28.78)19657.010.450.80.14719653.510.840.20.120
 Medium (20.49 ≤ T-score < 28.78)20262.89.220258.58.3
 High (T-score < 20.49)19466.27.619461.04.9
ME/CFS vs. HC
 ME/CFS60162.09.8749.80.44560157.78.9630.30.403
 HC33545.27.233541.510.4

SD standard deviation

aOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.001); differences of low vs. medium, low vs. high, and medium vs. high functional impairment, defined by SF-36 PCS scores, were all significant (p < 0.001)

bOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. HC. For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.01); differences of low vs. medium, low vs. high, and medium vs. high functional impairment, defined by SF-36 PCS scores, were all significant (p < 0.01)

cη2 is an effect size measure and was computed by dividing the sum of squares for the known-groups effect by the total sum of squares

Mean T-scores of the PROMIS sleep disturbance (SD) and sleep-related impairment (SRI) short forms (SF), by three indicators of functional impairment level as well as between participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) and healthy controls (HC) from the multi-site clinical assessment of ME/CFS (MCAM) study SD standard deviation aOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 physical component summary (PCS) scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.05); only differences of low vs. medium and low vs. high functional impairment, defined by SF-36 PCS scores, were significant (p < 0.05) bOverall differences across groups were significant at p < 0.001, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores, only differences of low vs. medium and low vs. high functional impairment were significant (p < 0.05) cη2 is an effect size measure and was computed by dividing the sum of squares for the known-groups effect by the total sum of squares Mean T-scores of the PROMIS pain interference (PI) and pain behavior (PB) short forms (SF), by three indicators of functional impairment level as well as between participants with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) and healthy controls (HC) from the multi-site clinical assessment of ME/CFS (MCAM) study SD standard deviation aOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. healthy controls (HC). For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.001); differences of low vs. medium, low vs. high, and medium vs. high functional impairment, defined by SF-36 PCS scores, were all significant (p < 0.001) bOverall differences across groups were significant at p < 0.05, with groups defined by vertical/horizontal activities, SF-36 PCS scores, and ME/CFS vs. HC. For pairwise comparisons between any two functional impairment levels of vertical activities or SF-36 PCS scores: only difference of low vs. high functional impairment, defined by vertical activities, was significant (p < 0.01); differences of low vs. medium, low vs. high, and medium vs. high functional impairment, defined by SF-36 PCS scores, were all significant (p < 0.01) cη2 is an effect size measure and was computed by dividing the sum of squares for the known-groups effect by the total sum of squares

Responsiveness

For the two sleep-related PROMIS short forms, no significant differences were found among groups of participants with ME/CFS defined by whether they improved, remained stable, or worsened based on SF-36 PCS scores or horizontal activity. Overall, sleep-related impairment change scores significantly differed across groups defined by vertical activity (F = 6.7, p = 0.001), but in pairwise group comparisons, the “stable” group did not significantly differ from either the “improved” or “worsened” group. Therefore, there was insufficient evidence to support the responsiveness of the PROMIS sleep measures for ME/CFS. For the two pain-related PROMIS short forms, significant overall differences were found among groups of participants with ME/CFS defined by whether they improved, remained stable, or worsened with respect to SF-36 PCS scores. In pairwise group comparisons defined by SF-36 PCS scores, we found that all three groups significantly differed from each other for pain interference; whereas, only the difference between “improved” and “worsened” groups was significant for pain behavior. It is worth noting that although significant difference was found among groups defined by vertical activities for pain behavior, the difference was not in the expected direction and indicated greater decrease in pain behavior in the “stable” group than in the “improved” group. Therefore, the significant difference for pain behavior across vertical activity groups should not be considered as evidence supporting its responsiveness. Guyatt’s responsiveness statistics were generally small, except for pain interference that showed medium-to-large differences in change scores across “improved,” “stable,” and “worsened” groups defined by SF-36 PCS scores. To summarize, only the pain interference short form showed strong evidence supporting its responsiveness among ME/CFS participants (Tables 8 and 9).
Table 8

Mean changes in T-scores based on PROMIS sleep disturbance (SD) and sleep-related impairment (SRI) short forms (SF), among “improved,” “stable,” and “worsened” groups (or between “improved” and “not improved” groups) defined by three anchor measures from the multi-site clinical assessment of ME/CFS (MCAM) study

Change statusPROMIS SD-SFPROMIS SRI-SF
nMeanSDFaGuyatt’s Responsiveness StatisticnMeanSDFaGuyatt’s Responsiveness Statistic
ImprovedvsWorsened/Not ImprovedStablevsWorsenedImprovedvsWorsened/Not ImprovedStablevsWorsened
Change in hours of vertical activities per day, from baseline to follow-up
 Improved150− 1.077.192.7− 0.253− 0.030149− 1.417.976.7− 0.425− 0.220
 Stable1030.456.831030.026.29
 Worsened1690.666.821691.556.96
Change in hours of horizontal activities per day, from baseline to follow-up
 Improved114− 1.077.023.7− 0.212114− 0.348.180.7− 0.095
 Not improved3080.416.973070.326.93
Change in SF-36 physical component summary (PCS) score, from baseline to follow-up
 Improved107− 1.246.502.9− 0.362− 0.142107− 0.717.941.1− 0.230− 0.101
 Stable2610.297.022600.226.80
 Worsened551.286.94550.947.13

SD standard deviation

aBold values indicate significant overall differences across groups at p < 0.05

Table 9

Mean changes in T-scores based on PROMIS pain interference (PI) and pain behavior (PB) short forms (SF), among “improved,” “stable,” and “worsened” groups (or between “improved” and “not improved” groups) defined by three anchor measures from the multi-site clinical assessment of ME/CFS (MCAM) study

Change statusPROMIS PI-SFPROMIS PB-SF
nMeanSDFaGuyatt’s Responsiveness StatisticnMeanSDFaGuyatt’s Responsiveness Statistic
ImprovedvsWorsened/Not ImprovedStablevsWorsenedImprovedvsWorsened/Not ImprovedStablevsWorsened
Change in hours of vertical activities per day, from baseline to follow-up
 Improved150− 1.787.631.9− 0.218− 0.045150− 0.646.013.3− 0.174− 0.318
 Stable103− 0.606.58103− 1.547.04
 Worsened169− 0.296.831690.456.28
Change in hours of horizontal activities per day, from baseline to follow-up
 Improved114− 1.787.792.5− 0.182114− 1.037.171.5− 0.142
 Not Improved308− 0.546.80308− 0.176.12
Change in SF-36 PCS score, from baseline to follow-up
 Improved107− 3.267.4313.5− 0.874− 0.508107− 1.756.084.6− 0.522− 0.279
 Stable261− 0.796.67261− 0.306.51
 Worsened552.636.73551.375.98

SD standard deviation

aBold values indicate significant overall differences across groups at p < 0.05

Mean changes in T-scores based on PROMIS sleep disturbance (SD) and sleep-related impairment (SRI) short forms (SF), among “improved,” “stable,” and “worsened” groups (or between “improved” and “not improved” groups) defined by three anchor measures from the multi-site clinical assessment of ME/CFS (MCAM) study SD standard deviation aBold values indicate significant overall differences across groups at p < 0.05 Mean changes in T-scores based on PROMIS pain interference (PI) and pain behavior (PB) short forms (SF), among “improved,” “stable,” and “worsened” groups (or between “improved” and “not improved” groups) defined by three anchor measures from the multi-site clinical assessment of ME/CFS (MCAM) study SD standard deviation aBold values indicate significant overall differences across groups at p < 0.05

Discussion

The PROMIS sleep and pain short forms are generic (i.e., not condition-specific) measures that have been tested in various patient populations (e.g., fibromyalgia, hypertension, sleep disorder, rheumatoid arthritis, sickle cell disease) [53, 77, 94, 95]. The current study demonstrated that the four PROMIS short forms had strong reliability and validity to assess sleep and pain outcomes for individuals with ME/CFS, thus they are useful tools for researchers and clinicians to examine individuals with varying levels of functional impairment due to ME/CFS as well as comparing them to those with other illnesses. All four measures demonstrated essential unidimensionality and excellent internal consistency not only for group-level analyses but also for monitoring individuals with ME/CFS. Minimal floor/ceiling effects were shown at the total score level, suggesting that the PROMIS short forms exhibited minimal restriction on the measurement range for sleep and pain problems within the ME/CFS population. We observed that about 10% of participants had the lowest possible scores for the two pain short forms, indicating no pain or no pain interference. This is consistent with our expectation, given that pain is a common and troublesome symptom but not a required symptom for ME/CFS diagnosis. All four measures showed acceptable known-groups validity, with small-to-medium effect sizes when comparing across functional impairment groups and with large effect sizes when comparing participants with ME/CFS to HC. As expected, the T-scores generally had a monotonic increase across ME/CFS groups with low, medium, and high functional impairment, and participants with ME/CFS had significantly higher (i.e., worse) scores than HC. No DIF was detected by age or sex for any items, suggesting that all four measures provide unbiased measurement across these population subgroups. The two sleep short forms (sleep-related impairment and sleep disturbance) and the pain behavior short form did not show sufficient evidence to support their responsiveness to change, while the pain interference short form showed good responsiveness with medium-to-large effect sizes. Previous research has estimated the minimal clinically important difference to be 2 to 6 points for the PROMIS sleep and pain measures across different patient populations [96-99]. However, given the non-interventional nature of the present study, except for pain interference, the average changes in PROMIS T-scores from baseline to follow-up were mostly below 2 points, which dampens our ability to fully evaluate responsiveness. Furthermore, given that the PROMIS Fatigue short form was found to be responsive within the same study sample of individuals with ME/CFS [19], it is possible that the improvement or worsening of functioning (as defined by vertical/horizontal activity or SF-36 PCS score) within a 10 to 14-month period were mainly driven by changes in fatigue levels, while changes due to other symptoms, such as sleep problems and pain behavior, may be less tightly linked to functioning or may take longer to affect functioning. Therefore, further research examining a longer (e.g., 2 years) follow-up period is warranted to test the responsiveness of these scores. Moreover, a better test of responsiveness of scores would be to examine changes in scores among patients after intervention with a therapy of known efficacy. The present study is not without limitations. First, all MCAM study participants were receiving tertiary care and may not fully represent the broader U.S. ME/CFS population. Although the large number of participants recruited from clinics across seven states should form a diverse sample for psychometric analyses, future studies that include non-tertiary care participants are needed to evaluate the stability of parameter estimates. Second, the analyses for evaluating known-groups validity and responsiveness were compromised by the measurement of functional status, defined by hours of vertical/horizontal activity per day and the SF-36 PCS scores. While these measures were highly correlated with the leading ME/CFS symptom of fatigue, their associations with sleep problems or pain may be weaker. Studies using more relevant external criterion measures (e.g., global rating of sleep quality or pain) should be conducted to further examine the validity of PROMIS short forms. In addition, a large proportion of individuals with ME/CFS began the study with the worst possible functional impairment defined by horizontal activity. Consequently, those who were truly stable could not be distinguished from those who experienced an undetectable worsening in functional status when measured by horizontal activity. More research is needed to validate the vertical/horizontal activity measures and establish their cutoffs for MCID, so that stable participants can be consistently and precisely defined. These ME/CFS-specific measures, along with other indicators with greater individual differences and more variability, would help researchers triangulate findings to better evaluate the responsiveness and the test–retest reliability. Despite these limitations, the results of this study provide useful information about the reliability and validity of the four PROMIS sleep and pain short forms, both in general and when used in individuals with ME/CFS, in particular. This information will facilitate the selection of patient-reported outcome measures for ME/CFS and other similar illnesses (e.g., Long COVID-19) moving forward.

Conclusion

In summary, study findings support the reliability and validity of four PROMIS short forms for assessing sleep-related impairment, sleep disturbance, pain interference, and pain behavior, among individuals with ME/CFS. Such measures could be used in research and clinical settings to facilitate understanding of the symptomatology and clinical course of ME/CFS, which is an important step for evaluating treatment effect.
  73 in total

1.  Discriminant Ability, Concurrent Validity, and Responsiveness of PROMIS Health Domains Among Patients With Lumbar Degenerative Disease Undergoing Decompression With or Without Arthrodesis.

Authors:  Taylor E Purvis; Brian J Neuman; Lee H Riley; Richard L Skolasky
Journal:  Spine (Phila Pa 1976)       Date:  2018-11-01       Impact factor: 3.468

2.  The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection.

Authors:  J E Ware; C D Sherbourne
Journal:  Med Care       Date:  1992-06       Impact factor: 2.983

3.  Clinically important changes in short form 36 health survey scales for use in rheumatoid arthritis clinical trials: the impact of low responsiveness.

Authors:  Michael M Ward; Lori C Guthrie; Maria I Alba
Journal:  Arthritis Care Res (Hoboken)       Date:  2014-12       Impact factor: 4.794

4.  Assessing psychometric properties of the PROMIS Sleep Disturbance Scale in older adults in independent-living and continuing care retirement communities.

Authors:  Kelsie M Full; Atul Malhotra; Katie Crist; Kevin Moran; Jacqueline Kerr
Journal:  Sleep Health       Date:  2018-10-28

5.  Measuring change over time: assessing the usefulness of evaluative instruments.

Authors:  G Guyatt; S Walter; G Norman
Journal:  J Chronic Dis       Date:  1987

6.  The Revised Fibromyalgia Impact Questionnaire (FIQR): validation and psychometric properties.

Authors:  Robert M Bennett; Ronald Friend; Kim D Jones; Rachel Ward; Bobby K Han; Rebecca L Ross
Journal:  Arthritis Res Ther       Date:  2009-08-10       Impact factor: 5.156

Review 7.  Sleep in the chronic fatigue syndrome.

Authors:  An N Mariman; Dirk P Vogelaers; Els Tobback; Liesbeth M Delesie; Ignace P Hanoulle; Dirk A Pevernagie
Journal:  Sleep Med Rev       Date:  2012-10-06       Impact factor: 11.609

8.  The fibromyalgia impact questionnaire: development and validation.

Authors:  C S Burckhardt; S R Clark; R M Bennett
Journal:  J Rheumatol       Date:  1991-05       Impact factor: 4.666

9.  Psychometric properties of the PROMIS® Fatigue Short Form 7a among adults with myalgic encephalomyelitis/chronic fatigue syndrome.

Authors:  Manshu Yang; San Keller; Jin-Mann S Lin
Journal:  Qual Life Res       Date:  2019-09-10       Impact factor: 4.147

10.  Barriers to healthcare utilization in fatiguing illness: a population-based study in Georgia.

Authors:  Jin-Mann S Lin; Dana J Brimmer; Roumiana S Boneva; James F Jones; William C Reeves
Journal:  BMC Health Serv Res       Date:  2009-01-20       Impact factor: 2.655

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.