Literature DB >> 34608623

Psychometric validation and responder definition of the sleep disturbance numerical rating scale in moderate-to-severe atopic dermatitis.

J Puelles1, F Fofana2, D Rodriguez3, J I Silverberg4, A Wollenberg5, C Dias Barbosa6, M Vernon7, R Chavda1, S Gabriel1, C Piketty1.   

Abstract

BACKGROUND: Sleep disturbance (SD) is an important part of the burden of atopic dermatitis (AD), but patient-reported outcomes that are easy to understand and interpret in the target population have been lacking. A daily, single-item, self-reported SD 11-point numerical rating scale (NRS) was recently developed to assess SD for patients with moderate-to-severe AD, but its psychometric properties have not yet been described.
OBJECTIVES: To assess the psychometric properties of the SD NRS in patients with moderate-to-severe AD.
METHODS: The psychometric properties of the SD NRS were assessed using data from a phase IIb clinical trial in 218 adults with moderate-to-severe AD.
RESULTS: Test-retest reliability of the SD NRS was substantial to almost perfect (interclass correlation 0·66-1·00) in participants who had stable SD or stable pruritus scores over 1 week. Baseline correlations were moderate to large (r > 0·30) between SD NRS and pruritus or sleep loss scores, but were small (r = -0·11 to 0·17) between SD NRS and EQ-5D-3L index and visual analogue scores, Hospital Anxiety and Depression Scale, Scoring Atopic Dermatitis, and Investigator's Global Assessment. The SD NRS could discriminate groups of participants in the expected direction according to different quality-of-life scores but not according to different clinician-reported disease severity scores. SD NRS scores significantly decreased as sleep loss, itch and quality-of-life scores improved. Analysis of meaningful change suggested a 2-5-point improvement as the initial range of responder definition in the SD NRS score.
CONCLUSIONS: The SD NRS is a reliable, valid and responsive measure of SD in adults with moderate-to-severe AD.
© 2021 The Authors. British Journal of Dermatology published by John Wiley & Sons Ltd on behalf of British Association of Dermatologists.

Entities:  

Mesh:

Year:  2021        PMID: 34608623      PMCID: PMC9299666          DOI: 10.1111/bjd.20783

Source DB:  PubMed          Journal:  Br J Dermatol        ISSN: 0007-0963            Impact factor:   11.113


Although sleep disturbance (SD) is an important part of the burden of atopic dermatitis (AD), few clinical trials of AD treatments have assessed it as a treatment outcome. Objective measures of sleep, especially actigraphy and polysomnography, have been included in some trials, , , but they do not capture the patient’s perspective, including how they feel or function in daily life. Patient‐reported outcomes (PROs) are available for assessing sleep quality from the patient’s perspective. , Some have been employed as outcome measures in clinical trials of AD treatments, including the Pittsburgh Sleep Quality Index for adults and the PROMIS‐SD and PROMIS‐SRI (sleep‐related impairment) item banks. SD has been captured for many years using a visual analogue scale (VAS) included in Scoring Atopic Dermatitis (SCORAD), a clinician‐reported score for assessing AD signs and symptoms, and more recently it has been captured using a VAS included in the patient‐reported version, PO‐SCORAD. , Although these scales have been validated in AD, some have been designed to monitor SD only at specific timepoints or use VAS scales whose design or administration limit their usefulness. , Others do not allow an SD score to be derived, and none capture day‐to‐day fluctuations in SD. Further, patients prefer numerical rating scales (NRSs) over VASs and find them easier to use and interpret. , , , Because of these issues, we developed a self‐reported 11‐point SD NRS for the daily assessment of SD in adults and adolescents with moderate‐to‐severe AD. In accordance with US Food and Drug Administration guidance on PROs, the SD NRS was first demonstrated to have content validity for assessing SD in moderate‐to‐severe AD through cognitive interviews. The interviews, which included 20 adults and 10 adolescents with moderate‐to‐severe AD, moderate‐to‐severe pruritis, and SD, confirmed that the SD NRS and its anchors were easily understood as intended. Daily morning administration of the SD NRS was recommended to provide accurate recall of the number or duration of night‐time awakenings. The interviews also showed that most participants would consider a 1‐ or 2‐point decrease in the SD NRS a meaningful improvement in AD‐associated SD. As a next step in demonstrating that the SD NRS is easy to understand and fit for purpose for assessing SD in patients with AD, the current study examined its other psychometric properties (test–retest reliability, convergent–divergent validity, known‐groups validity, and ability to detect change) using data collected in a phase IIb clinical trial in patients with moderate‐to‐severe AD. In addition, the study used anchor‐based approaches, supported by distribution‐based approaches and the previously collected qualitative information, to establish preliminary responder definitions for meaningful within‐patient change in the SD NRS score for this target population.

Patients and methods

Study design

This post hoc analysis examined the psychometric properties of the SD NRS using data from a multicentre, randomized, placebo‐controlled, double‐blinded phase IIb clinical trial assessing the efficacy and safety of nemolizumab in patients with moderate‐to‐severe AD and severe pruritus (NCT03100344). Participants were eligible to participate in the trial if they were ≥ 18 years old, experienced chronic AD (according to the consensus criteria of the American Academy of Dermatology) for ≥ 2 years before study entry, had confirmed moderate‐to‐severe AD based on clinical assessments, had severe itch on at least three of the last 7 days before study entry, and had a documented history of an inadequate response to topical medications. The trial’s primary outcome was the percentage change in Eczema Area and Severity Index (EASI) from baseline at week 24. Secondary outcomes included the SD NRS, Peak Pruritis (PP) NRS, Average Pruritus (AP) NRS, Pruritus Categorical Score (PCS), EQ 5‐Dimensions 5‐Levels (EQ‐5D‐3L; using the UK value set), Dermatology Life Quality Index (DLQI), Hospital Anxiety and Depression Scale (HADS), Investigator’s Global Assessment (IGA) and SCORAD. A list of outcome measures, recall periods and frequencies of assessments is provided in Table S1 (see Supporting Information). The SD NRS asks the following question to the participant: ‘On a scale of 0–10, with 0 being "no sleep loss related to the symptoms of atopic dermatitis" and 10 being "I did not sleep at all due to the symptoms of atopic dermatitis", how would you rate your sleep last night?'. The SD NRS was completed by participants on an electronic device once daily in the morning throughout the clinical trial. Previous work supported the importance and relevance of SD in participants with moderate‐to‐severe AD. Furthermore, the SD NRS question and its anchors were found to be easy or very easy to understand in participants with moderate‐to‐severe AD, and the evidence of its content validity has been previously described.

Analysis of the psychometric properties of the sleep disturbance numerical rating scale

Test–retest reliability was assessed in ‘stable’ participants, who were defined as having no change or minimal change in the concept measured over 1 week based on various anchors. The primary anchor for assessing test–retest reliability was the SCORAD sleep loss VAS, a component of SCORAD (see Table S1). Other anchors included the PCS (average weekly score), PP NRS (average weekly score), AP NRS (average weekly score) or IGA. For primary analyses, stable participants were defined as having no change or minimal change in the anchor from baseline (test) to week 1 (retest). In exploratory analyses, they were defined as having no change in the anchor from week 15 (test) to week 16 (retest) or from week 23 (test) to week 24 (retest). Paired t‐tests and intraclass correlation coefficients (ICCs) were computed comparing SD NRS values using a two‐way mixed‐effects anova model for absolute agreement following Shrout and Fleiss. ICCs were categorized according to Koo and Li as no agreement for < 0, slight agreement for 0·00–0·20, fair agreement for 0·21–0·40, moderate agreement for 0·41–0·60, substantial agreement for 0·61–0·80 and almost perfect agreement for 0·81–1·00. Construct validity was assessed through convergent–divergent validity and known‐groups validity. Convergent–divergent validity was assessed by calculating Spearman rank‐order correlation coefficients for the SD NRS average weekly score at baseline vs. the baseline values for all clinical outcome assessments measured in the trial: 5‐D Itch sleep item score, SCORAD sleep loss VAS score, PCS average weekly score, PP NRS average weekly score, AP NRS average weekly score, 5‐D Itch total score, DLQI total score, EQ‐5D‐3L index score, EQ‐5D‐3L VAS score, HADS anxiety score, HADS depression score, SCORAD total score, SCORAD body surface area (BSA; a component of SCORAD; see Table S1) and EASI. A priori hypotheses were that correlations would be stronger between the SD NRS and scales measuring a similar construct (i.e. scales measuring SD: 5‐D Itch sleep item, SCORAD sleep loss VAS) than scores from scales measuring dissimilar constructs (i.e. scales not measuring SD). Known‐groups validity of the SD NRS was assessed by comparing the mean SD NRS average weekly scores at baseline between severity groups of participants categorized according to 5‐D Itch sleep item score, DLQI total score, IGA score and EASI score. Comparisons were made by two‐sample t‐test (two groups) or anova (three or more groups) adjusted for multiple comparisons based on the Scheffe method. Raw and refined categories, grouping and explanations are provided in Table S2 (see Supporting Information). As a first step in assessing the responsiveness of the SD NRS, Spearman rank‐order correlation coefficients were calculated between the change from baseline to week 24 in SD NRS weekly scores and the changes from baseline to week 24 in SCORAD sleep loss VAS score, PCS average weekly score, PP NRS average weekly score, NRS average weekly score, DLQI total score, EQ‐5D‐3L index score, EQ‐5D‐3L VAS score, HADS anxiety score, HADS depression score, SCORAD total score, BSA score, IGA score and EASI. In a second step, responsiveness was further tested at week 16 (exploratory) and week 24 (primary) using paired t‐tests and an ancova adjusted to baseline score to compare the change in mean weekly SD NRS score in participants who were categorized as ‘improved’ or ‘not improved’ according to change from baseline in clinical outcome assessments having a large correlation with the SD NRS change from baseline (definitions in Table S3; see Supporting Information). Effect‐size statistics were calculated for each group of participants as the mean difference between the baseline and week 16 or 24 average weekly score divided by the standard deviation of the baseline average weekly score of the SD NRS. The standardized response mean was also calculated for each group of participants as the mean difference between the baseline average weekly score and week 16 or 24 average weekly score divided by the standard deviation of the change in average weekly score.

Responder definition

As recommend by the US Food and Drug Administration, multiple anchor‐based methods were used to establish meaningful within‐patient change and to derive responder definition estimates for the SD NRS score. The anchor‐based responder definitions were estimated as the mean change from baseline in the SD NRS average weekly score to week 24 based on the following criteria: (i) change from baseline in SCORAD sleep loss VAS (≥ 1‐point decrease), , (ii) PCS average weekly score ≤ 1 (rounded value), (iii) change from baseline in PP NRS average weekly score (≥ 4‐point decrease) and (iv) change from baseline in DLQI total score (≥ 4‐point decrease). As an additional exploratory analysis, the anchor‐based responder definitions were estimated based on differences between baseline and week 16. To help interpretation, the anchor‐based methods were supported by distribution‐based methods (standard error of measurement and the half‐ and quarter‐standard deviation) and the previous qualitative findings of the smallest improvement in the SD NRS considered satisfactory (1–3‐point decrease) and meaningful change (1‐ or 2‐point decrease) in the SD NRS in patients with moderate‐to‐severe AD. The standard error of measurement was computed as the standard deviation of an observed score related to its reliability [standard deviation × square root (1 − ICC)], where the ICC was from the SD NRS test–retest reliability in participants defined as stable based on the SCORAD sleep loss VAS.

Statistical considerations

All analyses were conducted on all participants randomized in the phase IIb clinical trial who had SD NRS data at baseline. Because these are post hoc analyses, the sample size was not calculated prospectively. All available data, irrespective of the treatment arm, were included as this is a standard approach when validating a PRO measure using clinical trial data. , Analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC, USA). According to Cohen’s conventions, absolute values of correlations are considered large if ≥ 0·50, moderate if 0·30–0·49 and small if 0·10–0·29.

Results

Participant characteristics

In total, 226 patients were randomized in the phase IIb study. Of these, 218 (96% of the initial study sample) had an SD NRS score at baseline and constituted the sample analysed in this work (Table 1). SD NRS data were available for 175 patients at week 16 and for 154 patients at week 24. The mean (standard deviation) age at baseline was 39·2 (15·2) years, and just over half of the participants (52%) were male. The majority of participants were white (75%) and non‐Hispanic (95%). Investigators assessed the global severity as moderate (IGA = 3) for 66% of participants and severe (IGA = 4) for 34%, and most participants (87%) reported having had severe pruritus for at least three of the last 7 days. The mean (standard deviation) baseline SD NRS score was 7·8 (1·6) (Table 1).
Table 1

Participant demographic and clinical characteristics at baseline

Characteristic/measure N = 218
Age (years)
Mean (SD)39·2 (15·2)
Range18–82
Sex, n (%)
Male113 (51·8)
Female105 (48·2)
Race, n (%)
White164 (75·2)
African American/black26 (11·9)
Asian24 (11·0)
American Indian/Alaska native1 (0·5)
Other3 (1·4)
Ethnicity, n (%)
Hispanic/Latino11 (5·0)
Not Hispanic/Latino207 (95·0)
EASI score, mean (SD)25·6 (10·9)
IGA score, n (%) a
3 (moderate)144 (66·1)
4 (severe)74 (33·9)
Body surface area score, mean (SD)b 41·7 (18·6)
SCORAD total score, mean (SD)b 66·9 (11·6)
≥ 3 days with severe pruritus in the past 7 days, n (%)190 (87·2)
SD NRS, mean (SD)7·8 (1·6)

EASI, Eczema Area and Severity Index; IGA, investigator’s Global Assessment; SCORAD, Scoring Atopic Dermatitis; SD NRS, sleep disturbance numerical rating scale. a N = 215; b N = 217.

Participant demographic and clinical characteristics at baseline EASI, Eczema Area and Severity Index; IGA, investigator’s Global Assessment; SCORAD, Scoring Atopic Dermatitis; SD NRS, sleep disturbance numerical rating scale. a N = 215; b N = 217.

Psychometric properties of the sleep disturbance numerical rating scale

Test–retest reliability

ICCs (range 0·66–0·98) indicated substantial to almost perfect agreement from baseline to week 1. Furthermore, ICCs exceeded the recommended threshold of 0·70 for three out of the four anchors (Table 2). In exploratory analyses, agreement was almost perfect between week 15 and week 16 and between week 23 and week 24 when using PCS, PP NRS or AP NRS as the anchor (ICC 0·97–1·00) (Table 2).
Table 2

Test–retest reliability of the sleep disturbance numerical rating scale (SD NRS) in stable participants

Anchor N SD NRS t‐testICCa
Week n, mean (SD)Week n + 1, mean (SD)Mean difference t‐value P‐value
Baseline to week 1
SCORAD sleep loss VAS score567·81 (1·70)7·37 (1·91)−0·113·7< 0·0010·96
PCS average weekly score1587·74 (1·71)6·84 (1·80)−0·908·97< 0·0010·66
PP NRS average weekly score168·19 (1·14)8·10 (1·22)−0·090·920·370·95
AP NRS average weekly score177·58 (1·86)7·58 (1·77)0·01−0·060·950·98
IGA score1467·90 (1·56)6·40 (2·07)−1·59·61< 0·0010·36
Week 15–16
PCS average weekly score1722·75 (2·63)2·70 (2·60)−0·041·010·320·97
PP NRS average weekly score442·35 (3·04)2·30 (3·02)−0·051·20·241.00
AP NRS average weekly score522·38 (2·99)2·34 (2·97)−0·041·120·271.00
Week 23–24
PCS average weekly score1422·45 (2·52)2·48 (2·53)0·03−0·630·530·98
PP NRS average weekly score332·48 (3·26)2·47 (3·30)−0·010·130·901.00
AP NRS average weekly score182·61 (2·97)2·62 (2·98)0·01−0·130·900·99

Stable participants were defined as having no change or minimal change over a week in the indicated anchor. AP NRS, Average Pruritus Numerical Rating Scale; ICC, interclass correlation coefficient; IGA, Investigator’s Global Assessment; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aICCs were categorized as no agreement for < 0, slight agreement for 0·00–0·20, fair agreement for 0·21–0·40, moderate agreement for 0·41–0·60, substantial agreement for 0·61–0·80 and almost perfect agreement for 0·81–1·00.

Test–retest reliability of the sleep disturbance numerical rating scale (SD NRS) in stable participants Stable participants were defined as having no change or minimal change over a week in the indicated anchor. AP NRS, Average Pruritus Numerical Rating Scale; ICC, interclass correlation coefficient; IGA, Investigator’s Global Assessment; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aICCs were categorized as no agreement for < 0, slight agreement for 0·00–0·20, fair agreement for 0·21–0·40, moderate agreement for 0·41–0·60, substantial agreement for 0·61–0·80 and almost perfect agreement for 0·81–1·00.

Convergent and divergent validity

As expected, correlations at baseline were large between the SD NRS average weekly score and scores assessing SD (r = 0·52 for 5‐D Itch sleep item and r = 0·58 for SCORAD sleep loss VAS) (Table 3). Correlations were small between the SD NRS average weekly score and scores assessing a dissimilar construct (r = −0·11 to 0·17) except for those assessing itch (r = 0·42–0·84 for 5‐D itch, PP NRS, AP NRS and PCS) and quality of life (r = 0·42 for DLQI) (Table 3).
Table 3

Convergent and divergent validity: relationship between the sleep disturbance numerical rating scale (SD NRS) average weekly score and other outcome scores at baseline

Scale score N Spearman rank‐order correlation
r a P‐value
5‐D Itch sleep item2070·52< 0·001
SCORAD sleep loss VAS2180·58< 0·001
PCS (weekly average)2170·47< 0·001
PP NRS (weekly average)2170·84< 0·001
AP NRS (weekly average)2170·81< 0·001
5‐D Itch (total score)2070·42< 0·001
DLQI (total score)2070·42< 0·001
EQ‐5D‐3L index207−0·110·116
EQ‐5D‐3L VAS207−0·070·310
HADS anxiety2060·160·024
HADS depression2060·120·099
SCORAD (total score)2170·170·010
Body surface area2170·080·254
Investigator’s Global Assessment218−0·050·435
Eczema Area and Severity Index2180·140·034

AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; EQ‐5D‐3L, EuroQol 5‐Dimensions 3‐Levels; HADS, Hospital Anxiety and Depression Scale; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aThe correlation was considered small for |r| < 0·30, moderate for 0·30 ≤ |r| < 0·50, and large for |r| ≥ 0·50.

Convergent and divergent validity: relationship between the sleep disturbance numerical rating scale (SD NRS) average weekly score and other outcome scores at baseline AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; EQ‐5D‐3L, EuroQol 5‐Dimensions 3‐Levels; HADS, Hospital Anxiety and Depression Scale; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aThe correlation was considered small for |r| < 0·30, moderate for 0·30 ≤ |r| < 0·50, and large for |r| ≥ 0·50.

Known‐groups validity

The SD NRS was able to discriminate participants in the expected direction according to groups defined by the 5‐D Itch sleep item score and DLQI total score at baseline (P < 0·001 for both) (Table 4). Known‐groups validity was not supported when using clinician‐reported scales such as IGA (P = 0·25) or EASI (P = 0·11).
Table 4

Known‐groups validity: comparison of mean SD NRS average weekly scores at baseline between participants categorized by 5‐D Itch sleep item score, DLQI total score, IGA score and EASI

Scale N SD NRS average weekly score, mean (SD)Overall F‐test
Test value P‐value
5‐D Itch sleep item22·9< 0·001
1 + 2 (never affects sleep + sleep onset delay)336·7 (2·0)
3 (both sleep onset delay and night awakenings)1748·1 (1·4)
DLQI (total score)14·5< 0·001
0–1 (no effect)0
2–10 (small‐to‐moderate effect)477·2 (1·8)
11–20 (very large effect)1007·7 (1·6)
21–30 (extremely large effect)608·7 (1·1)
IGA1·350·247
3 (moderate)1447·9 (1·5)
4 (severe)747·7 (1·7)
EASI2·530·113
12–21·0 (moderate AD)937·6 (1·6)
21·1–72·0 (severe/very severe AD)1258·0 (1·6)

AD, atopic dermatitis; DLQI, Dermatology Life Quality Index; EASI, Eczema Area and Severity Index; IGA, Investigator’s Global Assessment; SD NRS, sleep disturbance numerical rating scale.

Known‐groups validity: comparison of mean SD NRS average weekly scores at baseline between participants categorized by 5‐D Itch sleep item score, DLQI total score, IGA score and EASI AD, atopic dermatitis; DLQI, Dermatology Life Quality Index; EASI, Eczema Area and Severity Index; IGA, Investigator’s Global Assessment; SD NRS, sleep disturbance numerical rating scale.

Responsiveness

As a first step in assessing responsiveness, the change in SD NRS weekly score between baseline and week 24 was compared with the change in the other outcomes over the same period. Correlations were large between the change in SD NRS and the change in AP NRS average weekly score (r = 0·91), PP NRS weekly average score (r = 0·88), SCORAD sleep loss VAS score (r = 0·80), PCS weekly average score (r = 0·75) and SCORAD total score (r = 0·51) (Table 5). Correlations were moderate between the change in SD NRS and change in DLQI total score (r = 0·41). Correlations for all other outcome measures were small (range, r = −0·12 to 0·28).
Table 5

Correlations between the change in sleep disturbance numerical rating scale (SD NRS) weekly scores from baseline and changes in other outcome scores from baseline

Outcome scaleSpearman rank‐order correlation
Week 24Week 16
N r a P‐value N r a P‐value
SCORAD sleep loss VAS1540·80< 0·0011670·77< 0·001
PCS (average weekly)1460·75< 0·0011720·79< 0·001
PP NRS (average weekly)1460·88< 0·0011710·88< 0·001
AP NRS (average weekly)1460·91< 0·0011710·90< 0·001
DLQI (total score)1370·41< 0·001
EQ‐5D‐3L index137−0·120·157
EQ‐5D‐3L VAS137−0·120·152
HADS anxiety1360·150·072
HADS depression1360·150·082
SCORAD (total score)1540·51< 0·001
Body surface area1540·170·0411710·190·012
Investigator’s Global Assessment1540·28< 0·0011710·34< 0·001
Eczema Area and Severity Index1540·210·0091710·29< 0·001

AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; HADS, Hospital Anxiety and Depression Scale; EQ‐5D‐3L, EuroQol 5‐Dimensions 3‐Levels; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aThe correlation was considered small for |r| < 0·30, moderate for 0·30 ≤ |r| < 0·50 and large for |r| ≥ 0·50.

Correlations between the change in sleep disturbance numerical rating scale (SD NRS) weekly scores from baseline and changes in other outcome scores from baseline AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; HADS, Hospital Anxiety and Depression Scale; EQ‐5D‐3L, EuroQol 5‐Dimensions 3‐Levels; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aThe correlation was considered small for |r| < 0·30, moderate for 0·30 ≤ |r| < 0·50 and large for |r| ≥ 0·50. Responsiveness of the SD NRS was further assessed based on clinical outcomes for which correlations were large or moderate (|r| > 0·30 in Table 5). In all cases, the SD NRS average weekly mean score decreased significantly more between baseline and week 24 in participants classified in the ‘improved’ group than in participants classified in the ‘not improved’ group (Table 6). Also, in all cases, effect sizes and standardized response means were larger for patients classified as ‘improved’ than in those classified as ‘not improved’ (effect sizes 3·31–4·51 in the ‘improved’ group and 0·16–2·45 in the ‘not improved’ group; standardized response mean 2·02–3·11 in the ‘improved’ group and 0·36–1·46 in the ‘not improved’ group). Results were similar using week 16 data (Table 6).
Table 6

Responsiveness: association between change in sleep disturbance numerical rating scale (SD NRS) average weekly scores from baseline and changes in other measures from baseline in participants who improved vs. participants who did not improvea

Anchor N SD NRS, mean (SD)Effect sizeb Standardized response meanc P‐value (paired t‐test)Overall F‐test
BaselineFollow‐upMean changeTest value P‐value
Week 16
SCORAD sleep loss VAS29·3< 0·001
Improved (change < 0)1457·8 (1·6)2·2 (2·3)−5·5 (2·6)3·412·17< 0·001
Not improved (change ≥ 0)97·8 (1·6)6·6 (2·0)−0·9 (1·2)0·560·740·057
PCS (average weekly)156< 0·001
Improved (change ≤ −1)1127·9 (1·4)1·7 (1·7)−6·3 (2·0)4·543·22< 0·001
Not improved (change −1 to < 1)347·2 (2·2)5·4 (2·5)−1·7 (1·7)0·770·98< 0·001
PP NRS (average weekly) score80·4< 0·001
Improved (change ≤ −1)1307·8 (1·5)2·0 (2·0)−5·8 (2·3)3·812·55< 0·001
Not improved (change −1 to < 1)167·4 (2·4)6·6 (2·5)−0·6 (1·3)0·250·440·10
AP NRS (average weekly)
Improved (change ≤ −1)1337·8 (1·5)2·1 (2·1)−5·7 (2·4)3·792·41< 0·001
Not improved (change −1 to < 1)137·2 (2·6)6·5 (2·7)−0·5 (1·4)0·180·350·23
DLQI (total score)6·980·009
Improved (change < 0)1267·8 (1·6)2·3 (2·3)−5·5 (2·6)3·42·11< 0·001
Not improved (change ≥ 0)118·0 (1·4)4·6 (3·2)−3·3 (3·5)2·330·930·012
Week 24
SCORAD sleep loss VAS33·6< 0·001
Improved (change < 0)1577·8 (1·6)2·5 (2·4)−5·3 (2·6)3·382·06< 0·001
Not improved (change = 0)107·5 (2·5)6·7 (2·5)−0·5 (1·0)0·220·540·12
PCS (average weekly)179< 0·001
Improved (change ≤ −1)1287·9 (1·4)1·6 (1·7)−6·2 (2·0)4·513·11< 0·001
Not improved (change −1 to < 1)447·4 (2·2)5·7 (2·4)−1·7 (1·8)0·740·91< 0·001
PP NRS (average weekly)90·9< 0·001
Improved (change ≤ −1)1527·8 (1·5)2·2 (2·1)−5·6 (2·3)3·812·42< 0·001
Not improved (change −1 to < 1)197·1 (2·6)6·7 (2·7)−0·4 (1·2)0·170·370·13
AP NRS (average weekly)91·5< 0·001
Improved (change ≤ −1)1527·9 (1·4)2·2 (2·1)−5·6 (2·3)3·972·42< 0·001
Not improved (change −1 to < 1)196·7 (2·7)6·3 (2·8)−0·4 (1·2)0·160·360·14
IGA12·3< 0·001
Improved (change < 0)1167·7 (1·7)2·2 (2·5)−5·5 (2·7)3·312·02< 0·001
Not improved (change ≥ 0)557·9 (1·6)4·0 (2·7)−3·9 (2·7)2·451·46< 0·001

AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; IGA, Investigator’s Global Assessment; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aDefinitions of ‘worsened’ and ‘improved’ for responsiveness testing are provided in Table  S3 (see Supporting Information). bMean difference between the baseline and week 24 average weekly score divided by the standard deviation of the baseline average weekly score of the SD NRS. cMean difference between the baseline and week 24 average weekly score divided by the standard deviation of the change from baseline average weekly score of the SD NRS.

Responsiveness: association between change in sleep disturbance numerical rating scale (SD NRS) average weekly scores from baseline and changes in other measures from baseline in participants who improved vs. participants who did not improvea AP NRS, Average Pruritus Numerical Rating Scale; DLQI, Dermatology Life Quality Index; IGA, Investigator’s Global Assessment; PCS, pruritus categorical score; PP NRS, Peak Pruritus Numerical Rating Scale; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aDefinitions of ‘worsened’ and ‘improved’ for responsiveness testing are provided in Table  S3 (see Supporting Information). bMean difference between the baseline and week 24 average weekly score divided by the standard deviation of the baseline average weekly score of the SD NRS. cMean difference between the baseline and week 24 average weekly score divided by the standard deviation of the change from baseline average weekly score of the SD NRS.

Responder definition estimate

At week 24, a responder definition of the SD NRS score was estimated based on the SCORAD sleep loss VAS, PCS, PP NRS and DLQI. Anchor‐based responder definition estimates ranged from 5·6 to 6·7 (Table 7). Anchor‐based responder definition estimates were similar at week 16, ranging from 5·4 to 6·7. Distribution‐based estimates were calculated using week 24 SD NRS data. These estimates included a standard error of measurement of 1·58, a quarter‐standard deviation of 0·40 and half‐standard deviation of 0·81.
Table 7

Responder definitions for the change in sleep disturbance numerical rating scale between baseline and week 24

Definition typeStudyDefinitionMeaningful or detectable change threshold estimate
Anchor basedPhase IIb≥ 1‐point decrease in SCORAD sleep loss VAS
Week 245·6
Week 165·4
Phase IIb≥ 4‐point decrease in PCS average weekly score
Week 246·4
Week 166·3
Phase IIb≥ 4‐point decrease in PCS average weekly score
Week 246·7
Week 166·7
Phase IIb≥ 4‐point decrease in DLQI total score
Week 245·7
Week 16ND
Distribution basedPhase IIbQuarter‐standard deviation0·40
Phase IIbHalf‐standard deviation0·81
Standard error of measurement1·58
QualitativeQualitative researcha Smallest change considered satisfactory3
Meaningful improvement2

DLQI, Dermatology Life Quality Index; ND, not determined; PCS, pruritus categorical score; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aDias‐Barbosa et al.

Responder definitions for the change in sleep disturbance numerical rating scale between baseline and week 24 DLQI, Dermatology Life Quality Index; ND, not determined; PCS, pruritus categorical score; SCORAD, Scoring Atopic Dermatitis; VAS, visual analogue score. aDias‐Barbosa et al. Final responder definitions were based on the anchor‐based estimates, with the support of the distribution‐based estimates along with previous qualitative findings exploring meaningful change. Taken together, the results suggested that a 2–5‐point reduction in the SD NRS represents a meaningful improvement for the target population.

Discussion

A variety of PROs are available to assess SD, especially the PO‐SCORAD sleep loss VAS, , but some of these monitor SD only at specific timepoints or use VAS scales whose design or administration limit their usefulness. , Others do not allow an SD score to be derived, and none capture day‐to‐day fluctuations in SD. Further, patients prefer NRSs over VASs and find them easier to use and interpret. , , , As SD fluctuates daily in patients with AD, the SD NRS was therefore developed to provide clinicians and patients with a simple, easy‐to‐interpret alternative to a VAS that can be used to measure daily fluctuations in SD. This post hoc analysis of data from a phase IIb trial examined the psychometric properties of the SD NRS and established an initial responder definition for meaningful change in patients with moderate‐to‐severe AD. The results provided strong support that the SD NRS is reliable, valid and responsive in patients with moderate‐to‐severe AD. They also extend evidence for the content validity of the SD NRS based on cognitive interviews in patients with moderate‐to‐severe AD. The SD NRS had very good test–retest reliability, with ICCs in almost all cases above the recommended threshold of 0·70, indicating that it stably measures the SD concept over time. The study also confirmed that the SD NRS measured the targeted concept of SD associated with AD. As hypothesized, correlations between the SD NRS and sleep‐related measures (SCORAD sleep loss VAS and 5‐D Itch sleep item) were large. However, the correlations were not exact, indicating that, although the SD NRS measures SD, the three PRO measures are not interchangeable. This is likely due to differences in formats (NRS vs. VAS) and recall periods (24 h for SD NRS, 3 days for SCORAD sleep loss VAS, and 2 weeks for 5‐D Itch sleep item). Correlations with scales measuring concepts other than SD, such as HADS scores, EQ‐5D scores and clinician‐reported disease severity scores (SCORAD, BSA, IGA and EASI), were weak, confirming that the SD NRS can complement standard clinician‐reported disease severity and other disease impacts (e.g. health status and emotional impact) when assessing the benefit of AD treatments. Of the non‐SD PROs, correlations were strongest between the SD NRS and the itch severity scales (PP NRS and AP NRS), confirming that SD is a proximal impact of the primary symptom of AD. , , The SD NRS was able to discriminate between severity groups of participants according to PRO sleep and quality‐of‐life scales, but it was not able to discriminate between severity groups of participants according to clinician‐reported measures (IGA and EASI for AD severity). This is not surprising because AD severity reported by clinicians does not necessarily translate to symptom severity or impact as perceived and reported by patients; indeed, correlations were low between baseline SD NRS and clinician‐reported scales. This should not diminish the ability of the SD NRS to discriminate among severity groups of participants. The study further showed that the SD NRS was responsive to change. Correlations were large between the changes in SD NRS score and changes in itch, sleep loss and disease severity scores, and they were moderate between the changes in SD NRS score and changes in quality‐of‐life scores. Further, the SD NRS score changes were able to significantly differentiate between participants whose pruritus, quality of life and disease severity improved and those whose did not. Even though both groups of participants experienced improvements in SD, as indicated by a decrease in their SD NRS score, decreases in the SD NRS were larger in the ‘improved’ group than in the ‘not improved’ group. This study allowed a preliminary responder definition of SD NRS score for meaningful change (2–5 points) to be established in patients with moderate‐to‐severe AD. This is consistent with responder definitions for other patient‐reported 11‐point itch NRSs in participants with moderate‐to‐severe plaque psoriasis (4 points) and moderate‐to‐severe AD (2–4 points). The relatively wide range in SD NRS for the responder definition was due to the trial including participants with severe itch and severe SD at baseline, which allows substantial room for change. The current study had some limitations. Firstly, the ability of the SD NRS to assess the less severe spectrum of SD was not examined in the current analysis. However, based on qualitative findings, the SD NRS is expected to perform as well among participants with less severe SD. Secondly, over the course of the trial, approximately 30% of the SD NRS data were missing after 24 weeks of follow‐up. Nonetheless, the initial analysis sample of 218 participants, the final sample size (n = 154) and the subject‐to‐item ratio were adequate to allow the SD NRS data to be analysed at baseline and at follow‐up, for the psychometric properties to be confirmed, and for an initial responder definition to be established. , A third limitation was that, of the various anchors included in the current analysis, a few did not cover the 7‐day recall period covered by the SD NRS weekly score. For example, the SCORAD sleep loss VAS has a 3‐day recall period, and the IGA does not have a recall period. The slight differences in the recall periods of the SD NRS and these two anchors should not jeopardize the stability assessment because the test–retest analyses were conducted using a short time window, which should prevent substantial changes in the concepts measured by these different anchors. In conclusion, the current quantitative analysis suggests that the SD NRS is a reliable, fit‐for‐purpose and well‐defined measure of the overall severity of SD specifically adapted to patients with moderate‐to‐severe AD. The SD NRS should provide clinicians with an alternative to VAS because it is simple and easy to interpret and can be used to assess day‐to‐day changes in SD related to AD. Along with the core outcome measures recommended by the Harmonising Outcome Measures for Eczema initiative, the SD NRS can be used to assess impact in AD clinical trials. Finally, the current study confirmed a close link between SD and pruritus in patients with AD, and it highlighted the importance of adequately measuring SD as an outcome of AD treatment.

Author Contribution

Jorge Puelles: Funding acquisition (equal); Methodology (equal); Resources (equal); Writing‐review & editing (equal). Fatoumata Fofana: Conceptualization (equal); Formal analysis (equal); Methodology (equal); Resources (equal); Writing‐review & editing (equal). Danielle Nicole Rodriguez: Conceptualization (equal); Formal analysis (equal); Methodology (equal); Resources (equal); Writing‐review & editing (equal). Jonathan I Silverberg: Methodology (equal); Writing‐review & editing (equal). Andreas Wollenberg: Methodology (equal); Writing‐review & editing (equal). Carla Dias‐Barbosa: Conceptualization (equal); Formal analysis (equal); Methodology (equal); Resources (equal); Writing‐review & editing (equal). Margaret Vernon: Methodology (equal); Writing‐review & editing (equal). Rajeev Chavda: Conceptualization (equal); Funding acquisition (equal); Writing‐review & editing (equal). Sylvie Gabriel: Conceptualization (equal); Funding acquisition (equal); Writing‐review & editing (equal). Christophe Piketty: Conceptualization (equal); Funding acquisition (equal); Writing‐review & editing (equal). Table S1 Outcome measures, recall periods, scales, and frequencies of assessments. Click here for additional data file. Table S2 Raw and refined categories, grouping, and explanations for known‐groups validity testing. Click here for additional data file. Table S3 Definitions of ‘worsened’ and ‘improved’ for responsiveness testing. Click here for additional data file.
  40 in total

1.  Onset of pruritus relief with pimecrolimus cream 1% in adult patients with atopic dermatitis: a randomized trial.

Authors:  R Kaufmann; T Bieber; A L Helgesen; B Lasthein Andersen; T Luger; Y Poulin; J Al-Hafidh; C Paul
Journal:  Allergy       Date:  2006-03       Impact factor: 13.146

2.  What is sufficient evidence for the reliability and validity of patient-reported outcome measures?

Authors:  Marlene H Frost; Bryce B Reeve; Astra M Liepa; Joseph W Stauffer; Ron D Hays
Journal:  Value Health       Date:  2007 Nov-Dec       Impact factor: 5.725

3.  Anti-Interleukin-31 Receptor A Antibody for Atopic Dermatitis.

Authors:  Thomas Ruzicka; Ryosuke Mihara
Journal:  N Engl J Med       Date:  2017-05-25       Impact factor: 91.245

4.  Characteristics of Pruritus in Relation to Self-assessed Severity of Atopic Dermatitis.

Authors:  Flavien Huet; Marie-Sarah Faffa; Florence Poizeau; Stéphanie Merhand; Laurent Misery; Emilie Brenaut
Journal:  Acta Derm Venereol       Date:  2019-03-01       Impact factor: 4.437

5.  Comparison of selected pain assessment tools for use with the elderly.

Authors:  K A Herr; P R Mobily
Journal:  Appl Nurs Res       Date:  1993-02       Impact factor: 2.257

6.  A Comprehensive Conceptual Model of the Experience of Chronic Itch in Adults.

Authors:  Jonathan I Silverberg; Robert W Kantor; Prarthana Dalal; Catherine Hickey; Sara Shaunfield; Karen Kaiser; Jin-Shei Lai; David Cella
Journal:  Am J Clin Dermatol       Date:  2018-10       Impact factor: 7.403

7.  The Assessment of Acute Pain in Pre-Hospital Care Using Verbal Numerical Rating and Visual Analogue Scales.

Authors:  Ahmad K Ismail; Muhammad A Abdul Ghafar; Noor Shaza A Shamsuddin; Nurul A Roslan; Hilwani Kaharuddin; Nik A Nik Muhamad
Journal:  J Emerg Med       Date:  2015-05-26       Impact factor: 1.484

8.  Assessment of pruritus intensity: prospective study on validity and reliability of the visual analogue scale, numerical rating scale and verbal rating scale in 471 patients with chronic pruritus.

Authors:  Ngoc Quan Phan; Christine Blome; Fleur Fritz; Joachim Gerss; Adam Reich; Toshiya Ebata; Matthias Augustin; Jacek C Szepietowski; Sonja Ständer
Journal:  Acta Derm Venereol       Date:  2012-09       Impact factor: 4.437

Review 9.  Practical issues on interpretation of scoring atopic dermatitis: the SCORAD index, objective SCORAD and the three-item severity score.

Authors:  A P Oranje; E J Glazenburg; A Wolkerstorfer; F B de Waard-van der Spek
Journal:  Br J Dermatol       Date:  2007-08-21       Impact factor: 9.302

10.  The Harmonizing Outcome Measures for Eczema (HOME) roadmap: a methodological framework to develop core sets of outcome measurements in dermatology.

Authors:  Jochen Schmitt; Christian Apfelbacher; Phyllis I Spuls; Kim S Thomas; Eric L Simpson; Masutaka Furue; Joanne Chalmers; Hywel C Williams
Journal:  J Invest Dermatol       Date:  2014-09-04       Impact factor: 8.551

View more
  2 in total

1.  Comparing the psychometric properties of the EQ-5D-3L and EQ-5D-5L descriptive systems and utilities in atopic dermatitis.

Authors:  Kamilla Koszorú; Krisztina Hajdu; Valentin Brodszky; Alex Bató; L Hunor Gergely; Anikó Kovács; Zsuzsanna Beretzky; Miklós Sárdy; Andrea Szegedi; Fanni Rencz
Journal:  Eur J Health Econ       Date:  2022-04-12

Review 2.  Patient-reported outcome (PRO) instruments for disease severity and quality of life in patients with atopic dermatitis: a systematic review of English and Chinese literature.

Authors:  Aihua Li; Minlu Zhang; Yating Yang; Joshua Zhang; Xiaoping Xie; Xiaoxian Li; Hui Zhang
Journal:  Ann Transl Med       Date:  2022-08
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.