Literature DB >> 32753991

A Systematic Review of Instruments for the Assessment of Insomnia in Adults.

Raja Mahamade Ali1, Monica Zolezzi1, Ahmed Awaisu1.   

Abstract

INTRODUCTION: Self-reported sleep instruments remain the most practical methods for the assessment of insomnia in clinical practice. This systematic review aims to identify, describe and summarize the psychometric properties of questionnaires available for the assessment of insomnia in the adult population. In addition, the review also aimed to identify sleep instruments available in the Arabic language.
METHODS: A systematic literature search was conducted using the following electronic databases: PubMed, EMBASE, ProQuest Central, SCOPUS, and Google Scholar. The quality assessment of the instruments was conducted using two established international criteria.
RESULTS: One hundred and seven articles were selected for inclusion, from which 31 instruments were identified and categorized based on the constructs they assess as: (1) screening for insomnia (n=14); (2) measuring the consequences of insomnia (n=8); (3) assessing the cognitive aspects of insomnia (n= 5); and (4) assessing sleep hygiene (n= 4). The review of the psychometric properties showed that the Insomnia Severity Index and the Functional Outcomes of Sleep Questionnaire were the most extensively evaluated instrument. Criterion validity and reliability measures were the most commonly reported properties. Only four of the identified instruments were available in Arabic. DISCUSSION: Overall, the findings of this study indicate ample availability of sleep instruments. However, psychometric testing for several of the available sleep instruments remains incomplete, particularly responsiveness and interpretability. Our findings suggest that future studies should focus on reporting more psychometric measures to ensure the trustworthiness of these instruments.
© 2020 Ali et al.

Entities:  

Keywords:  insomnia; psychometric properties; questionnaires; sleep hygiene; sleep quality

Year:  2020        PMID: 32753991      PMCID: PMC7342485          DOI: 10.2147/NSS.S250918

Source DB:  PubMed          Journal:  Nat Sci Sleep        ISSN: 1179-1608


Introduction

Worldwide reports suggest that around one-third of the adult population complains of insomnia symptoms.1–2 Insomnia is characterized by persistent sleep difficulty despite adequate sleep opportunity and associated daytime dysfunction.3 Insomnia has been associated with increased rates of road accidents, lower productivity and work absenteeism.4,5 Several factors are known to contribute to the development or worsening of insomnia symptoms. Current findings from the literature suggest that negative cognitive processes including worry, rumination and catastrophizing thoughts are associated with worse sleep quality and insomnia.6 These processes have been shown to fuel anxiety and arousal resulting in delayed sleep onset and shorter sleep duration.7 Similarly, poor sleep hygiene, which consists of a combination of behavioral practices and environmental conditions which improves sleep, is common among insomniacs.8–11 Studies suggest that chronic insomnia is an independent risk factor for developing mental illnesses in otherwise healthy individuals.12,13 Chronic insomnia has also been reported to increase the probability of developing chronic medical conditions, such as hypertension, cardiovascular disease and type 2 diabetes.14,15 Therefore, early detection and management of insomnia is important to minimize these associated risks. Although polysomnography (PSG) is considered the gold standard method for evaluating insomnia, it is not routinely used as it requires a specialized setting and equipment, and it is often labour intensive. Wrist actigraphy is another objective tool for the assessment of insomnia, but is limited by its lack of specificity.16,17 Consequently, self-reported sleep instruments remain the most practical methods for the assessment of insomnia in clinical practice.16,18 There are several systematic reviews on a range of sleep instruments utilized for the assessment of sleep dysfunction in a variety of medical and neuropsychiatric disorders, or in specialized populations.19–21 In these reviews, there was insufficient evaluation of the psychometric properties of the instruments that were identified, and none of them included instruments which assess sleep hygiene practices in the adult population. Therefore, the main objective of this study is to present an updated systematic review of the literature on validated self-reported instruments used for the assessment of different dimensions of sleep in the adult population. In addition, the review also aimed to identify sleep instruments available in Arabic language.

Methods

Search methods

Data Sources and Search Strategy

A comprehensive systematic search was conducted to identify studies reporting the development and/or validation of instruments for the assessment of self-reported sleep and sleep hygiene in the adult population. The five databases and search engines utilized included PubMed (1966 - April 2018), EMBASE (1980 - April 2018), ProQuest Central (1947–2018), SCOPUS (1966 - April 2018), and Google Scholar (till April 2018). Grey literature was also searched by reviewing conference proceedings and abstracts of the Canadian Sleep Society and the American Academy of Sleep Medicine published in the period from January 2014 to December 2017. Additionally, a hand search of the bibliographies of the articles identified through the electronic databases search was undertaken. Search terms used in the electronic databases search were classified into three categories related to: sleep dysfunction (Category A), the instrument for assessment (Category B), and validation and psychometric properties (Category C). Terms from Category C were not used in databases such as PubMed which offered “validation studies” as one of the limits or filters. Publication language was limited to English and no limits were imposed on the publication year. Examples of full search strategies for two of the electronic databases can be found under the . In addition to the English search, two separate searches were conducted to identify sleep instruments developed in Arabic language or translated into Arabic. The first used the same search terms as those used in the original search, in addition to the word “Arabic”. The second was conducted in Arabic language in the following databases: PubMed, Google Scholar and Dar Al-mandumah using the same search terms as those used in the original search.

Inclusion Criteria

Articles selected for full review were those providing results of validation studies or reporting psychometric properties of instruments and questionnaires assessing characteristics of sleep (quality, quantity, nocturnal awakenings), daytime consequences of poor sleep, or sleep hygiene in the adult population. The inclusion criteria were limited to instruments completed by self-report and to instruments written in English or Arabic.

Exclusion Criteria

Studies were excluded from the review if they included evaluating instruments designed to measure sleep disorders other than insomnia (eg obstructive sleep apnea, restless leg syndrome, etc.), those using sleep items as subdomains of an instrument assessing a condition other than insomnia, studies which focused on pediatric/geriatric populations, studies describing instruments designed to be completed by clinician or caregiver and not by the patient, using instruments developed in languages other than English or Arabic, or describing instruments not psychometrically validated.

Data Collection and Analysis

Data Collection Process and Data Extraction

Duplicate citations were removed after obtaining the initial records of relevant articles from different databases. Titles and abstracts of the articles were then screened for relevance. Full-text of eligible articles were obtained and screened against the inclusion and exclusion criteria. Data were extracted from the eligible studies according to eight key attributes, as established by the Scientific Advisory Committee, Medical Outcomes Trust (SAC-MOT).22 The extraction tool used in this study included the following elements: instrument’s name, authors, conceptual framework (domains and purpose), psychometric properties, validation population, general description of the instrument (number of items, scale, scoring, response format, burden), and cultural and linguistic adaptation. The psychometric properties were extracted from studies describing the validation of the original (English or Arabic) version of the instruments. The psychometric properties extracted from the validation studies included validity, reliability, responsiveness, and interpretability. In addition, instruments’ attributes related to the validation sample were extracted. Data extraction was done by one reviewer (RA) and reviewed by a second author (MZ). The psychometric properties for instruments validated in specific populations were not extracted.

Quality Assessment

A modified version of the criteria developed by Terwee et al23 was used to assess the quality of the sleep instruments. The Terwee at al. criteria evaluates 8 psychometric properties including validity (content, construct and criterion), reliability (internal consistency and reproducibility), responsiveness, interpretability and floor and ceiling effects.23 For the purposes of this study, the criteria for the floor and ceiling effects and the agreement component of the reproducibility were not used, primarily because, for the most part, these measures were not reported in the validation studies included in this review. In addition, Pearson’s Correlation Coefficient was used for rating the reliability, responsiveness, criterion and construct validity.

Results

As illustrated in Figure 1, a total of 4453 citations were retrieved from the search. One hundred and seven articles were deemed suitable for inclusion in the review. These articles included 31 distinct sleep instruments. Of the 107 articles included, 47 discussed the validation process of the instruments in English, while the remaining 60 articles reviewed the translation and cultural adaptation of these instruments into a variety of languages and populations. The two additional searches to identify sleep instruments in the Arabic language did not identify any additional results than those derived from the original search. Table 1 summarizes the results of the validation studies (n=47) related to the 31 sleep instruments included in this review. The table also describes the psychometric properties of the instruments and the characteristics of the populations in which they were validated. Table 2 provides a detailed description of the characteristics of the 31 instruments.
Figure 1

Flow chart of the systematic review process.

Table 1

Studies Testing the Psychometric Properties of Extracted Sleep Instruments

Author, yearInstrumentsSampleValidityReliabilityResponsiveness and Interpretability
Pallesen et al, 200839The Bergen insomnia scale(BIS)University students: (n=320)Community sample: (n=5,000)Patient sample: (n=225)Convergent and discriminant:BIS significantly associated with Athens Insomnia scale (r=0.79) and the PSQI (r=0.73).BIS had lower correlation with Beck depression and anxiety inventories (r=0.55 and 0.32 respectively).-Internal consistency:Students: α= 0.79Community: α= 0.87Patient sample: α= 0.80-Test retest (after 2 weeks)Students (n=200): r = 0.77NR
Yeh et al, 201240The Insomnia Screening Scale (ISS)Study 1: (n=162)-Primary insomniacs-Healthy participantsStudy 2: (n=262)Community sample (paediatrics, adults and elderly)Concurrent/criterion validity:- ISS significantly correlated with Insomnia severity index (ISI) and PSQI (PSQI): (r =0.87 and r =0.85 respectively).- Daytime functioning domain significantly correlated with ISI and PSQI (r =0.63 and 0.68 respectively).- Sleep environment domain negatively correlated with PSQI and ISI: (r = - 0.31 and - 0.25 respectively).- Sleep opportunity negatively correlated with ISI and PSQI: (r = – 0.37 and r = – 0.41, respectively).- External validity:Community sample: 13.36% (inadequate sleep opportunity and sleep environment), and 37.02% met the criteria of insomnia and daytime function.Internal consistency:-Insomnia symptoms: α =0.98-Daytime function: α=0.94-Sleep environment: α = 0.90-Sleep opportunity: α=0.87Interpretability:ISS demonstrated sensitivity and specificity of 0.89 and 0.59 respectivelyCut-off scores:-Sum of sleep environment and opportunity subscales scores= 27-Sum of insomnia symptoms and daytime functioning subscales scores=42
Kato T., 201370Sleep Quality Questionnaire (SQQ)Full-time employees and college studentsContent validity: confirmed by two Japanese experts in stress research.Convergent validity: (n=370)SQQ subscales (Sleep difficulty and daytime sleepiness) with MOS sleep scale (rs = 0.37 and 0.43, p< 0.001).The Daytime Sleepiness subscale score with Epworth Sleepiness Scale (ESS) score (r = 0.47, p < 0.001).Incremental validity: (n=346)SQQ subscales with the GHQ-12, CES-D Scale, FSS, and SWLS scores significant ΔR scores (0.403, 0.313, 0.408 and 0.054 respectively) with p< 0.001.Internal consistency:Daytime sleepiness subscale: -Student sample: α= 0.83-Employees sample: α=0.84Sleep Difficulty subscale:-Students sample: α=0.74-Employees sample: α=0.77Test-retest (over 8 weeks)Daytime Sleepiness subscale: (r =0.76)Sleep Difficulty subscale:(r =0.79)NR
Levine et al, 2003 (study a)36Levine et al, 2003 (study b)37Women’s Health Initiative Insomnia Rating Scale (WHIIRS)Postmenopausal womenContent analysis:The WHIIRS items corresponded to most of insomnia characteristics noted in the nosologies (eg International Classification of Sleep Disorders) and the literature.Construct validity:- Small correlation between the CES-D and the WHIIRS: r =0.29-The WHIIRS mean in the largest CES-D category (M =10.3 for Category 12) was 1.8 times that in the smallest depression category (ie, M =5.7 for Category 0)-The SF-36 subscales were linearly correlated with the WHIIRS: Cohen’s f value was 0.273, (p < 0.0001).-For night sweats and hot flushes with WHIIRS, the values of Cohen’s f were, respectively, 0.205 and 0.157.Internal consistency: α = 0.786.

89.3% of the samples had reliability coefficients ≥ 0.75

Test-retest:Same day administration (r =0.96)Tests after >1 year (r =0.66)
NR
Drake et al, 201438Restorative sleep questionnaire (RSQ)Community based samplePatients with primary insomniaNon-refreshed sleepers (NRS)Content validity:-Key concepts developed through patient focus groups and patient interviews. Concepts reviewed by two expert panels.Convergent/divergent:-RSQ-Daily (RSQ-D) with Leeds Sleep Evaluation Questionnaire scores: positive and significant correlation with all domains (r ≥0.40, p ≤0.006) except for Getting to Sleep scores (r = 0.27; p= 0.079).-RSQ-D with Subjective Sleep questionnaire:Sleep quality (r = 0.59; p < 0.001); TST (r = 0.32; p= 0.036)-RSQ-D with vitality questionnaire: (r = 0.61; p < 0.001)-RSQ with PSG: Latency to persistent sleep (r ≥0.20), TST and sleep efficiency (r ≥0.26), WASO (r ≤0.020) and total wake time (r ≤0.26) significant p-value.Internal consistency:RSQ-D: α= 0.91RSQ-W: α= 0.90Test-retest (responses on consecutive days measures)RSQ-D and RSQ-W: r >0.80NR
Buysse et al, 198967Grandner et al, 200668Backhaus et al, 200269Pittsburgh sleep qualityindex (PSQI)- Group I: healthy control subjects (n=52)- Group 2: patients with major depressive disorder (n=34).- Group 3: Clinical sample physician-referred outpatients at the Sleep Evaluation Center(n = 45)-Non-clinical sample(younger and older adults)-Primary insomniacs- Healthy controlsDiscriminating between patients and controls:-Global PSQI scores differed significantly between subject groups and control group subjects differed from all patient groups.-Patients with DIMS had significantly higher scores than patients with DOES patients. A significant difference in PSQI components’ scores were found between the control group and both DIMS and depressed groups. PSQI scores also differed on 3 components (sleep disturbances, daytime dysfunction, and sleep quality) with DOES patients.- A significant difference in all PSQI component scores except sleep disturbance were identified between DOES and DIMS patients. While DOES and depressed patients differed on all components’ scores except sleep disturbance and daytime dysfunction.Diagnostic validity:(Buysse et al, 1989)A cutoff score of 5 correctly determined the sleep quality for 88.5 of all participants (kappa = 0.75, p < 0.001) reporting a sensitivity of 89.6% and a specificity of 86.5%.(Backhaus et al, 2002)- PSQI global score > 5 resulted in a sensitivity of 98.7% and specificity of 84.4%Criterion/concurrent validity:(Buysse et al, 1989)PSQI scores and PSG:- Significant positive correlation between PSG and PSQI only for sleep latency (r =0.33, p <0.001)PSQI global score and PSG:- Objective sleep latency (r = 0.20, p< 0.01), weak correlation- The global PSQI score correlated only with REM% in controls (r = 0.34, p <0.006) and number of arousals in depressives (r = 0.47, p< 0.002).(Backhaus et al, 2002)- PSQI was highly correlated with sleep log data (r =0.81, p< 0.001 for sleep duration and r =0.71, p= 0.000 for sleep onset latency.-The correlations between PSQI and PSG data was significant and lower ranging between r = (−0.32 to −0.33).Construct validity:-PSQI scores did not correlate significantly with actigraphic measures of sleep (r ≤0.13).PSQI with sleep diary: (significant correlation)-Sleep efficiency (r = −0.562, p< 0.01), TST (r = −0.307, p< 0.01), WASO: (r = 0.262, p< 0.01).-Sleep latency: r = 0.480, p <0.01-CESD: r = 0.305, p <0.01Internal consistency:Overall: α = 0.83 (Buysse et al, 1989)Overall: α = 0.85 (Backhaus et al, 2002)Test-retest: (Buysse et al, 1989)-Test retest (28 days apart):The correlation between the PSQI scores was r = 0.85 (p <0.001).- Component scores:r = 0.84–0.65 (p< 0.001 for each component score).Test retest: (Backhaus et al, 2002)Overall: (r =0.87, p <0.001)-Short interval (2 days):r =0.90, p=0.00-Longer interval (45.6 ± 18 days):r =0.86, p <0.001Interpretability:Can distinguish between “good” and “poor” sleepers. The cut-off point is 5.
Bastien et al, 200129Morin et al, 201130Insomnia severity index (ISI)Study 1:Patients with insomnia complaint (n=145).Study 2: (n=78)Insomnia patient involved in a study assessing the efficacy of cognitive behavioral therapy.Morin et al, 2011

Community sample

(n= 959)

Clinical sample (n=183)

Criterion/concurrent validity:ISI items with sleep diary variables:-Sleep onset latency (r = 0.38), WASO (r = 0.35) and Early morning awakening (r =0.35)-Total ISI score and the sleep efficiency variable (r = - 0.19)

ISI and sleep diary components

r = 0.32–0.55 at baseline and r = 0.50 to 0.91 post-therapy (all p values < 0.05).

ISI variables with PSG variable:

Correlation ranged between r = 0.07 to 0.45 at pretreatment, and from 0.23 to 0.45 at post-treatment. Only the correlation for SOL was significant at pretreatment, whereas all correlations, but one (EMA) were significant at post-treatment (p< 0.05).-The correlations between the patient’s and the clinician’s versions of the ISI at the two assessment periods were all significant (p values < 0.01). Furthermore, the correlations between the patient’s and the significant other versions of the ISI were also significant at the two assessment periods (p values <0.01).Predictive validity:The clinician’s ratings predicted best the patient’s ISI total score at baseline, while at post-treatment, both the clinician and the sleep diary data were reliable predictors of the patient’s total ISI score.R2= 0.37 (p< 0.05) at pretreatment, clinician: (β= 0.52)R2= 0.61 (p< 0.05) at post-treatment, clinician: (β= - 0.52)Sleep diary: (β= - 0.34)Content validity:A principal component analysis, using varimax rotation, explored the ISI content validity and the extent to which its components corresponded to insomnia’s diagnostic criteria.Diagnostic validity: Morin et al, 2011- Subthreshold insomnia:A cutoff score of 8:Sensitivity of 95.8% and 99.4% in the Community and Clinical samples respectively, and with specificity of 78.3% and 91.8%.- Moderate to severe insomnia:-A cutoff score of 15:Specificity of 98.3% and 100% in the Community and Clinical samples, respectively, with a sensitivity of 47.7% and 78.1% respectively.- Community sample: cut-point of 10 (86.1% sensitivity and 87.7% specificity).- Clinical sample: cut-point of 11 was associated with 97.2% sensitivity and a perfect 100% specificity.Construct validity (convergent):- Insomnia severity on the ISI was positively correlated with the corresponding diary variable.-The ISI total score was significantly correlated with the PSQI total score, r = 0.80, p< 0.05.- Significant relationships were found with measures of anxiety and depression, different dimensions of fatigue. Significant correlations were also discovered with the SF-12, with a stronger association identified with the Mental components of the measure than with the Physical Health component.
Study 1:Internal consistency: α = 0.74.Study 2:Internal consistency:The internal reliability coefficients did not change significantly from baseline to follow up (0.76 to 0.78, respectively).Morin et al, 2011Community (Cronbach α = 0.90)Clinical samples (Cronbach α = 0.91).Responsiveness:A significant reduction in ISI score was detected post-treatment in patients’ (8.9 vs 15.4) and clinician’s versions (7.7 vs 17.7).ISI sensitivity to detect clinical improvement (n=146)- Moderate improvement in insomnia was identified as a reduction in ISI score by > 7 points (60% sensitivity, 70% specificity).-Marked/higher improvement in sleep was identified as a decrease in ISI score by > 8 points (64% sensitivity, 80% specificity).Interpretability:The scores of the ISI represent the following:0–7: lack of insomnia,8–14: Subthreshold insomnia15–21: Moderate insomnia22–28: Severe insomnia.Morin et al (2011) suggested clinically significant insomnia could be identified by ISI scores >10 in community samples and scores >11 in clinical settings.
Jenkins et al, 198827Nassermoaddeli et al, 200573Jerlock et al, 200672The Jenkins Sleep Scale (JSS)-Air traffic controllers (n=250 men; 25–49 years old)-Patients recovering after from cardiac surgery (n=500 patients).Construct validity:Spielberger’s state anxiety: r =0.37POMS-Depression: r =0.35POMS-Hostility: r =0.29POMS-Vigor: r = - 0.24 POMS-Fatigue: r = 0.46Positive well-being: r = - 0.22Internal consistency: α= 0.79Test-retest reliability: r =0.59Nassermoaddeli et al, 2005Internal consistency: α= 0.77Jerlock et al, 2006Internal consistency: α= 0.80NR
Soldatos et al, 200065Soldatos et al, 200165Athens Insomnia Scale (AIS)299 subjectsConsisting of:1-Primary insomniacs2- Psychiatric patients (both inpatients and outpatients)3-Healthy subjectsExternal validity:Sleep problems scale with:AIS-8: r = 0.90, p <0.001AIS-5: r = 0.85, p <0.001Soldatos et al, 2001Diagnostic validity:- AIS in general population, for a Cut-off score of 6 only 1% of those responders with insomnia are misdiagnosed (ie NPV= 99%) and (PPV=41%).-Psychiatric population: 8% (NPV=92%) and (PPV=86%).-A total score of 6 or higher in the AIS was shown to correctly identify 90% of the study subjects as suffering or not from “nonorganic insomnia” according to ICD-10.-the AIS in overall case identification corresponded to high indices of both sensitivity (93%) and specificity (85%).Internal consistency:AIS-8: α= 0.89, AIS-5: α=0.87Test retest (n=194; within one week):AIS-8: r = 0.89, p<0.001AIS-5: r = 0.88, p<0.001Interpretability:Cut-off score of 6General population (NPV= 99%, PPV=41%)Psychiatric population (NPV=92%, PPV=86%) with sensitivity (93%) and specificity (85%).-A total score of 6 or higher in the AIS was shown to correctly identify 90% of the subjects’ sleep quality.
Broman et al, 200825Westergren et al, 201526Minimal Insomnia Symptom Scale (MISS)Subjects selected randomly (n=1379)Age range: 20–64 (Sweden).Criterion validity:The correlation between BNSQ question about sleep quality and MISS total score was high with r =0.76.Westergren et al, 2015The difference between the adult and elderly samples was lower for the originally recommended ≥6 points cut-off (0.09 logits).Internal consistency:Total: α=0.73Test retest reliability: (6 months apart)High correlation was found with an ICC of 0.79.Responsiveness:Sensitivity to changePaired t tests revealed that there was a strong trend for increase in score in subjects who deteriorated (m=+0.80; t=2.0; p=0.053).There was also a significant decrease in MISS scores among subjects who improved (m =−1.06; t =2.9; p< 0.01).Interpretability:A cut-off score of ≥6 on the MISS identify insomniacs in the general adult population (sensitivity 0.82; specificity 0.86), PPV 0.44; NPV 0.97.
Espie et al, 201464Sleep ConditionIndicator (SCI)Samples from 5 validation studies:- The Great British Sleep Survey (GBSS): adults > 18-The GBSS+-TV sample-Glasgow Science Centre data (n=256)- A randomized controlled trial (RCT) sample (n=164) recruited into a placebo-controlled evaluation of CBT Content validation:

The DSM-5 was used to develop the questionnaire, consultations were conducted, and a draft was published on the American psychiatric association website.

Concurrent/criterion validity:

SCI was negatively associated with the score of sleep quality questionnaires including PSQI (r = −0.734) and the ISI (r = −0.793).

Sleep condition was significantly associated with physical and mental health (r =0.222 and r =0.335 respectively).

Using the HADS scale, SCI was negatively correlated with symptoms of depression (r =−0.426) and anxiety (r =−0.400). But was higher than the associated detected in the RCT sample study (depression (r = −0.267), anxiety (r =−0.236) and stress (r = −0.263)).

Concurrent and diagnostic validity:

A cut-off score ≤16 was diagnostic for 89% of individuals who were identified as insomniacs on the ISI scale (scores of ≥15), with a capacity of correctly classifying 82% of non-insomniacs.

These findings support the concurrent validity for the SCI and confirming that a score of ≤16 on the SCI could identify insomniac patients.

Internal consistency:

Cronbach’s α = 0.857 (the GBSS

(range of α-if-item-deleted 0.822–0.860).

GBSS+ sample

(α=0.865). The mean corrected item-total correlation was moderate (r =0.620).

Interpretability:A cut-off ≤16, was able to identify 89% of patients who had insomnia (ISI scores of ≥15).However, an SCI score of >16 was able to exclude 82% of individuals without insomnia
Lee, 199241Lee, 200742General Sleep Disturbance Scale (GSDS)Female nurses in different shifts (permanent day shift, permanent night shift and rotating shifts)N=760Parents of hospitalized infants in the intensive care unit n=44Lee, 2007Content validity: Evaluated by multidisciplinary reviewers.Criterion/concurrent validity:1 -Sleep diary:For both the Chinese and English versions, higher GSDS scores were correlated with self-reported lower sleep quality in sleep logs (r = 0.41), higher morning fatigue levels (Chinese version: r = 0.42, p= 0.05; English version: r = 0.56, p = 0.006).2-Wrist actigraphy:less sleep efficiency (Chinese version: r = - 0.26; English version: r = - 0.42).Criterion/predictive validity:The participants’ GSDS mean scores were positively correlated with their morning fatigue levels (Chinese version, r = 0.42, p = 0.05; English version, r = 0.56, p = 0.006), supporting the predictive validity of the GSDS.Lee, 1992Internal consistency:-Overall: α = 0.88-Subscales of quality of sleep, daytime sleepiness, and use of sleep aids were 0.62, 0.79, and 0.82, respectively.Lee, 2007Internal consistencies:Overall: Chinese version (α= 0.81)English version (α= 0.85)NR
Fung et al, 201441Daily Cognitive Communication and Sleep Profile (DCCASP)University students’ university of Toronto (n=59).Criterion validity:DCCASP and PSQI:- Adequate criterion validity for the Sleep Quality domain of the DCCASP was established by comparing it to the Sleep Quality domain of the PSQI, with rs = 0.398 (p< 0.001).-Positive correlation between sleep quality and each of the DCCASP domains: rs (0.38–0.55) (p<0.0001).-Internal consistency:Cronbach’s α ranged between 0.864 and 0.938 across the seven domains of the DCCASP.-Test-retest (repeated after 2 weeks)Concordance Correlation Coefficient (CCC) of each domain of the DCCASP was moderate, ranging from r = 0.548 to 0.742.NR
Abdel-khalek, 200835The Arabic Scale of insomnia (ASI)Students and employeesContent validity: The initial draft of the ASI was reviewed by PhD holding faculty members and master students.Construct:Convergent validity: AIS was significantly correlated with Arabic sleep disorders scale and Jenkins sleep scale a correlation range of (r= 0.56–0.94, p<0.001).Internal consistency:α= 0.84–0.87Test-retest reliability (1 week apart):α= 0.70–0.83NR
Morrone et al, 201752Maugeri Sleep Quality and Distress Inventory (MaSQuDI-17)Outpatients evaluated for sleep disorders insleep centers of Northern ItalyConvergent validity and discriminant:(p <0.001 for all)r2 = 0.5 with the PSQIr2 = 0.15 with ESSr2 = 0.39 with anxiety as measured by A-D scheduler2 = 0.52 with depression as measured by A-D schedule-Discriminating capacity:The difference in mean score between the healthy group and patients (9.31 and 14.15 respectively) was statistically significant (p< 0.001).- Differences were consistent with the type of pathological condition present (normal subject vs OSAS or INS or BSD group of subjects p <0.001).Internal consistency:α= 0.896NR
Johns M., 199144Johns M., 199245Johns M., 199446Epworth Sleepiness scale (ESS)-Control (n=30)-Individuals with sleep disorders (n=150)-Third year medical students (n=104)-OSA patients treated with CPAP-Patients who hadMSLT (n=44)-Spouses of participants who completed ESS (n=50)Discriminating capacity:-Significant differences in ESS scores between the seven diagnostic groups were detected (F= 50.00; df= 6,173; p< 0.0001).-Scores for OSA, narcolepsy and idiopathic hypersomnia were significantly higher than for controls (p <0.001) or primary snorers (p< 0.001).- The insomniacs had significantly lower scores (p< 0.01) than all groups except controls.-Criterion validity:ESS (total score) with MSLT:-Mean Sleep latency (SL): (rho = −0.42, p< 0.01). [Significant negative correlation]-When the correlation between mean SL with ESS individual item scores was assessed only 3 items were significantly correlated (p <0.05) which were:

The likelihood of dosing off “when sitting, inactive in a public place” (rho= −0.44), when “sitting quietly after lunch without alcohol” (rho= −0.34) and when “in a car, stopped for a few minutes in the traffic” (rho= −0.41).

Predictive validity:-All eight item scores as predictors of the mean SL in the MSLT were statistically significant (r = 0.639, p = 0.01).- Items 3 (r = −0.44, p <0.01) and 8 (r = −0.41, p= 0.01) were significant independent predictors.
Internal consistency:α = 0.88 (patients)α = 0.73 (students)Test-retest reliability: (5 months apart)-For 87 students: r = 0.822 (p< 0.001).-Patient-Spouse paired item correlation: (mean rho = 0.57, p< 0.001).- The paired (patient-Spouse) total ESS scores correlation (high)(rho = 0.74, n = 50, p< 0.001).Responsiveness:After 3 months of treatment with CPAP.Treatment with nasal CPAP in 54 individuals with OSAS was associated with a change in ESS scores by 7.0 ± 5.2 (SD) following therapy, which was statistically significant (t =−9.59, df = 53, p< 0.001).
Akerstedt et al, 199024Reyner, et al, 199847Karolinska sleepiness scale (KSS)8 male subjectsCriterion validity:KSS with VAS:-The maximum score on the KSS scale corresponds with the verbal anchor “extremely sleepy, fighting sleep,” while the minimum value corresponds with the rating “alert.”- The association between subjective sleepiness and the EEG/EOG variables was significant: (r =0.29–0.65, p<0.05).-Significant differences in various levels of subjective sleepiness were identified for all variables, except theta activity- during the test conducted under ambulatory conditions. A significant difference between maximum and minimum sleepiness was detected (Wilcoxon, z > 2.20, p<0.05).-No changes were identified in the EEG/EOG before level 7 was reached on the KSS scale. For the test session with closed eyes, a significant variation was identified only with slow rolling eye movement (x2 = 13.6, p<0.01).- Increasing KSS levels were highly significantly correlated with an increased likelihood of falling asleep (Pearson’s r = 0.78; d = 1043; p= 0.001).NRNR
Weaver et al, 199774Weaver et al, 200928Functional outcomes of sleep questionnaire (FOSQ)FOSQ-30:Sample 1 (n = 153)Healthy individuals presenting with sleep complaintSamples 2 (n = 24) and 3 (n = 51): Patients with OSAFOSQ-10:Sample 1: (n=155)Participants with moderate to severe OSA on CPAPSample 2: (n = 51)CPAP-treated OSA patientsNormal subjectsFOSQ-30:Face validity:Seven judges with expertise in the areas of functional status instrument development and sleep problems rated the clinical relevance of each item and the instrument to DOES. Content validity:Determined by the proportion of items receiving a rating of at least three or four across all judges.Construct validity:-Subscale-to-subscale correlations range: r = 0.52–0.86-Subscale-to-global FOSQ score intercorrelations ranged from r = 0.78–0.86.Concurrent/criterion validity:-FOSQ-30 global score with SIP total score (n=24): (r = −0.50, p≤0.05)-FOSQ-30 global with SF-36 sub-scales (n=51): no significant correlation-FOSQ-30 global score with SF36 role emotional functioning subscale (r =0.46, p ≤0.01)-FOSQ activity level subscale significantly correlated with SF36 physical functioning subscale.- The FOSQ social outcome subscale was significantly correlated with the SF-36 social function subscale (r =0.36, p ≤0.05) and SF36 mental health subscale (r =0.38, p≤0.01).-Discriminant validity:-Mean of FOSQ global score discriminated between normal sleepers and those with sleeping problems: (68.05 ± 21.24 and 89.59 ± 8.64 respectively with p= 0.0004)(T157 = −5.88, p= 0.0001).FOSQ-10:Criterion validity:Global scores comparison:-Before treatment with CPAP the FOSQ-10 global score was robustly associated with the FOSQ-30 Total score (r = 0.96, p< 0.0001).-The correlation between the post-treatment total scores of FOSQ-10 and the FOSQ-30 was r =0.97 (p< 0.0001).Known group validity:-Baseline total score on the FOSQ-10 for the OSA group (mean = 12.48 ± 3.23) statistically differed (t = 8.65, p< 0.0001) from the normal values (mean = 17.81 ± 3.10).FOSQ-30:Internal consistency(n=153):-Total: α = 0.95-Subscales (α = 0.86 to α = 0.91)- Item to total correlation range:0.35–0.73Test retest reliability (n=32):(Within 1 week)- Global score: r = 0.90- Individual subscales ranged fromr = 0.81 to r = 0.90FOSQ-10:Internal consistency:α = 0.87Responsiveness:Following CPAP treatment, both the FOSQ-30 and the FOSQ-10 detected a large clinically meaningful change in the total score (p< 0.0001).
David, 200850Occupational impact of sleep questionnaire (OISQ)Community sample of 86 participants (age: 25–50 years)

43 meeting DSM IV criteria for primary insomnia (26 women & 17 men)

43 controls

Criterion:- At each time point OISQ scores positively correlate with PSQI (mean r = 0.59, p<0.001).- Sleep diary variable:Work assessment scores negatively correlated with mean TST (r = −0.47, p<0.001) and mean SE (r = −0.56, p<0.001), and significantly and positively with mean WASO (r =0.66, p<0.001).-Significant negative correlation with SF-36 subscales ranging from r =- 0.21 to −0.62, p< 0.001.Internal consistency:α = 0.93Test-retest:A mean difference of 10.82 between groups was found resulting in a significant main group effect (F = 12.52, p< 0.001).A consistent decrease of 10% is seen in insomniacs compared to control.NR
Bell et al, 201131The Sleep Functional Impact Scale (SFIS)Primary insomniacsHealthy volunteersContent validity:Was assessed though face to face interviews with patients.Convergent/divergent validity: (p <0.001)-SFIS with ISI composite score: (r =0.82), SFIS with PSQI composite: (r =0.78), SFIS with FOSQ composite: (r = −0.69), SFIS with ESS total: (r =0.46), SFIS with MOS- sleep problems indices (I and II): (r ≥0.74), SFIS with MOS breathing and Snoring subscales: (r =0.26 and 0.27 respectively) and with WPAI–GH Subscales (r ≥0.31).Known group validity (n=430):-Higher SFIS score in the insomnia group compared to control (t=19.36, p< 0.0001), effect size=2.05-Lower SFIS scores for those reporting good sleep compared not reporting good sleep.(t = −12.18, p< 0.001) (effect size = −1.04).-Higher SFIS score for participants reporting worse sleep quality on PSQI (t =17.06, p< 0.0001) (effect size = 2.33).Internal consistency:All the sample: α= 0.97Insomniacs= α= 0.95NR
Espie et al,200051Sleep Disturbance Questionnaire (SDQ)Chronic insomniacsThe factors score of SDQ correlated with the SDQ total (0.31 to 0.88).Internal consistency:α= 0.67NR
Regestein et al, 199349Pavlova et al, 200147Hyperarousal Scale (H-scale)-Primary insomnia patients-Hypersomnia-Delayed sleep syndrome-Normal subjects-Patients with refractory insomniaCriterion validity:-H-Scale with neuroticism scale and extroversion-introversion scale: no significant correlation-Higher EEG activity in insomniacs compared to normal subjects (p <0.05).-Significant correlation between hyperarousal score with alpha and non-alpha EEG activity: (r ≥0.38, p≤ 0.03).Discriminating capacity: (Pavlovaa et al, 2001)- The insomnia group had a mean Hyperarousal total score significantly higher than the normal group (F = 20.7; p <0.001).NRNR
Espie et al, 200051Edinger et al, 200158Chung et al, 201659Dysfunctional Beliefs and Attitudes about Sleep Scale (DBAS)Chronic insomniacsNormal subjectsInsomniacsIndividuals with insomnia derived for a randomized controlled trial of self-help CBT-I.Construct validity:- DBAS 10 with DBAS-30: good correlation (r =0.826).-Total score: DBAS-10 and SDQ (r = 0.28)Construct/convergent validity: Edinger et al, 2001-DBAS with DBAS-10: Normal sleepers: r =0.84, Insomniacs: r ≥0.80Discriminative validity:Insomniac subjects had significantly higher scores on the full-length DBAS scale (F (1, 136) = 28.2, p< 0:0001) with mean of (mean DBAS 35.5, SD 10.5) as compared to normal sleepers (mean DBAS 26.4, SD 9.7).- DBAS-10 scores of our insomnia sufferers (mean DBAS-10 43.3, SD 15.0) were significantly (F (1,136) = 15.9, p< 0:0001) higher than those shown by our normal sleepers (mean DBAS-10= 33.5, SD 14.0).Criterion/concurrent validity: Chung et al, 2016

DBAS total score significantly correlated with several subscale scores of the ISI, PSQI total scores and HADS-anxiety and HADS-depression subscale scores.

The DBAS-10 “immediate consequences” subscale had significant correlation with ISI and HADS-anxiety subscale but was not related to HADS-depression subscale.

The DBAS-30 and DBAS-16 “sleep expectation” subscales were only significantly correlated with ISI total score.
DBAS 30:Internal consistency:-Overall: α = 0.72-Only two subscales achieved significance:1) “misattributions or amplifications of the consequences of insomnia”; α= 0.772)Subscale 4: “diminished perceptions of control and predictability of sleep;” α= 0.41.DBAS-10:-Internal consistency: α = 0.69-Internal consistency for factors I and II were 0.73 and 0.60.Edinger et al, 2001Internal consistency: DBAS 30: Normal subjects: α = 0.81Insomniacs; α ≥0.71DBAS-10: Normal subjects: α= 0.70, Insomniacs: α ≥ 0.53Chung et al, 2016Internal consistency: DBAS-30, DBAS-16, and DBAS-10 with Cronbach α of 0.81, 0.80, and 0.73, respectively.Responsiveness:-Changes in DBAS score following treatment were statistically comparable (F (2,68) = 2.33, p< 0.10) for all three treatment groups CBT, relaxation training (RT) or placebo control (PC).- DBAS-10 changes did differ statistically across the three treatment groups (F (2, 68) = 4.69, p< 0.025).-Post hoc comparisons, showed that the CBT-treated insomniacs showed significantly greater decreases on the DBAS-10.- Significantly greater reduction in DBAS scores was found in participants allocated to CBT-I, except the DBAS-30 “attributions” subscale and DBAS-16 “medication”-Improvement in ISI scores by ≥ 8 points was associated with significant changes in DBAS total scores and on DBAS-30 and DBAS-10 subscale scores.
Kallestad et al, 201058Insomnia daytime worry scale (IDWS)Undergraduate students and senior high school students (Norway)Predictive validity:The IDWS predicted insomnia severity over and above the other variables, accounting for an additional 12% of the variance.-In IDWS factor 1 (lack of energy) and IDWS factor 2 (danger) both predicted insomnia severity.The internal consistency:Total scale α =0.93lack of energy subscale: α = 0.94Danger subscale: α =0.75.NR
Tang & Harvey, 200456Jansson- Fröjmark et al 201155Anxiety and Preoccupation about Sleep Questionnaire (APSQ)110 university students (41 of them have insomnia as determined by PSQI score).Community dwelling sample from two counties in Sweden.-Classified according to sleep patterns into:

Insomniacs

Poor sleepers

Normal sleepers

Criterion/concurrent validity:The correlation with PSQI (r = 0.44, p< 0.0001)The correlation with BAI (r = 0.37, p< 0.0001)This indicates that higher scorers on the APSQ were associated with higher scores on PSQI (poorer sleep quality) and BAI (worse anxiety).Frojmark et al, 2011Discriminant validity: Frojmark et al, 2011The 10 APSQ items, the total APSQ scale, and the two retained factors discriminated the three sleep status groups.In all the 10 items (F = 97.6–245.12, p<0.001 in all instances), the total scale (F = 296.99, p<0.001), and the two factors (F = 215.60–328.29, p< 0.001 in both instances), the insomnia disorder group had higher scores than the other two groups, and the poor sleepers scored higher than the normal sleepers.The between group effect sizes for the 10 items ranged from 0.18 and 0.35, for the total scale 0.39, and for the two factors 0.33 and 0.41.Convergent validity:The APSQ and its two factors were significantly related to the Pre-Sleep Arousal Scale –Cognitive(PSAS-C) at (r =0.45–0.52, to the DBAS-10 at a moderate to good level (r = 0.50–0.61), to the HADS Scale -Anxiety at fair level (r = 0.3–0.49), and to the HADS –Depression at a fair level (r = 0.34–0.40).ASPQ with daytime parameters:APSQ and its two subscales were moderately associated with:

Sleep-onset latency: r =0.28–0.34

WASO: r =0.32–0.37, TST: r = 0.26–0.31

Early morning awakening: r = 0.27–0.30

The APSQ and the two subscales were correlated with sleep quality (r = 0.40–0.48).

ASPQ with daytime impairment:-APSQ and its two factors were significantly correlated with daytime impairment (r = 0.41–0.56)-Correlations when the impairment item is removed from the composite score: APSQ, r = 0.53; first subscale, r = 0.53; second subscale, r = 0.38.
Internal consistency:Tang & Harvey, 2004Cronbach’s alpha for total scale= 0.92Frojmark et al, 2011Cronbach’s alpha coefficients were:For total APSQ scale =0.93For first factor= 0.91For the second factor = 0.86NR
Ellis et al, 200757Sleep preoccupation scale (SPS)Validity study:University students and community sampleCriterion/concurrent validity:

SPS with the Sleep Associated Monitoring Index (SAMI):

Good correlation between total scores of SPS and SAMI; Poor (r = 0.67, p< 0.001), average (r =0.58, p< 0.001) and good (r =0.62, p< 0.001) sleepers.

Construct/convergent validity:SPS subscales with Global PSQI-score: p <0.001CBC subscale: r =0.37AC subscale: r =0.55Discriminant validity:[F (2, 721) =57.27, p <0.001], poor sleeper reported higher levels of preoccupation scores than average and good sleepers.The tool significantly differentiated between poor and good sleepers (as identified through the PSQI questionnaire); as poor sleepers reported more preoccupation than normal sleepers. [t (454 =8.78, p<0.001)].
Internal consistency:α= 0.91 (overall)For subscales:CBC=0.93 and AC=0.89NR
Tan et al, 201660Catastrophic thoughts about insomnia scale (CTIS)University studentsContent validity:Expert panel reviewed the scale.Face validity (n=523): The tool was given to participants to complete.Criterion validity:

The correlation between CTIS and PSQI scores was statistically significant, r (137) = 0.643, p< 0.001

The correlation between scores on the CTIS and scores on the ISI was also statistically significant, r (137) = 0.703, p <0.001

Predictive validity:

CTIS along with DBAS16, IDWS, nBFI, and CESD-103, age, gender, and length of education predicted only 51% of the variance in PSQI score F (8, 128) = 17.07, p< 0.001, R2 = 0.51. CTIS, CESD-103, and gender significantly predicted changes in the PSQ scores.

CTIS, DBAS-16, IDWS, nBFI, and CESD-103 and age, gender, and level of education predicted around two-thirds of the changes seen in ISIS scores F (8, 128) = 32.07, p< 0.001, R2 =0.67.Each of the three instruments CTIS, the CESD-103, and the IDWS independently predicted the changes in ISI scores.

Construct/convergent validity:CTIS scores were significantly associated with DBAS-16 scores, r (137) = 0.722, p< 0.001, and IDWS scores, r (137) = 0.753, p< 0.001.
Internal consistent (α = 0.94)Subscales:Helplessness: α = 0.84Rumination: α = 0.88NR
Blake & Gomez, 199863Sleep Hygiene Self-test. (SHS)Male war-zone veteransCriterion validity:Sleep Hygiene self-test with Combat exposure scale and Mississippi scale (r =0.20 and 0.10 respectively).Internal consistency:Overall: α =0.54Responsiveness:After 5 sessions of education on sleep disturbance management (1 hour/week), a significant change of 7.6 points scores was detected
Mastin et al, 200661Sleep Hygiene Index (SHI)Psychology studentsCriterion:The SHI was positively correlated (p< 0.01) with the inadequate sleep hygiene criteria identified by the American Sleep Disorders Association, 1990. (r values = 0.371 to 0.458).Construct:-A positive association was identified between SHI score and both ESS and PSQI scoresr (599) = 0.244 and r (269) = 0.481 respectively with p <0.01-The SHI scores were significantly correlated with PSQI components scores(p ≤ 0.05).Internal consistency: α = 0.66Test-retest reliability (repeated over 4–5 weeks)r (139) = 0.71, p <0.01NR
Brown et al, 200262Sleep Hygiene Awareness and Practice Scale(SHAPS)University students (n=124)Construct:Correlation with sleep quality (as rated by the PSQI):

Sleep hygiene awareness with PSQI rating: (r =0.21)

Sleep hygiene practice with PSQI rating: (r =0.49, p <0.012)

Sleep hygiene practice with sleep hygiene awareness (r =0.30, p <0.012).

Variable sleep length, noise disturbance, going to bed thirsty, and worrying about the ability to fall asleep at bedtime were identified as significant predictors of sleep quality predictors (R2 = 0.24, adjusted R2 = 0.22, F (1, 118) = 5.30, p= 0.023).

Internal consistency:Cronbach’s a = 0.78The caffeine knowledge and sleep-hygiene practice subscales: Cronbach’s a = 0.55 and a = 0.47, respectively.Test retest (4 weeks later):The sleep-hygiene awareness activities and sleep-hygiene practice: (r = 0.76, p <0.001 and r = 0.74, p <0.001, respectively).The caffeine knowledge subsection had poor test-retest reliability (r = 0.50, p <0.001).NR
Grandner et al, 201775Sleep Practicesand Attitudes Questionnaire (SPAQ)General population (Age range:18–80)(n=124)Face validity: evaluated through group conversations (eg focus group) between individuals from the community and research participants. The participants provided their feedback on the content of the instrument and discussed some specific items.Content validity: questions derived from theoretical framework were discussed by a group of professionals in sleep medicine and community members to ensure the comprehensiveness and representativeness of items.Concurrent/criterion validity:- A significant correlation was identified between PSQI sleep duration and average sleep duration (r =0.53, p< 0.001).- Sleepiness with the ESS (rs = 0.39, p< 0.001).- Coping with acute insomnia correlation with the SHI was moderate (r = 0.29, p <0.001).- Activities in bed subscale with the SHI (r =0.53, p <0.001).- Sleep environment with SHI (r = - 0.34, p< 0.001).- Impact of external factors on sleep scores on this subscale significantly differed between good and poor sleepers (p= 0.004).- Sleep quality and global PSQI score: (r = 0.36, p <0.001)Construct validity:SPAQ with DBAS: DBAS was significantly correlated with subscales 2 (r = 0.18, p <0.05), 3 (r =0.28, p <0.01), 4 (r =0.31, p<0.001), 8 (r =0.23, p <0.05), 12 (r =0.30, p <0.001), 13 (r =0.45, p <0.001), 15 (r =0.30, p <0.001), and 16 (r =0.26, p <0.01).Internal consistency:Cronbach’s alpha range:0.251–0.864(for the different subscales).NR

Abbreviations: MOS, Medical Outcomes Study; GHQ-12, General Health Questionnaire-12; CES-D Scale, Centre for Epidemiologic Studies Depression; FSS, Fatigue Severity Scale; SWLS, Satisfaction with Life Scale; DIMS, Disorders of Initiating and Maintaining Sleep; DOES, Disorders of Excessive Somnolence; SOL, Sleep Onset Latency; SF-12, Short Form 12; POMS, Profile of Mood States; NPV, Negative Predictive Value; PPV, Positive Predictive Value; BNSQ, Basic Nordic Sleep Questionnaire; ICC, Inter-Correlation Coefficient; SD, Standard Deviation; CBT-I, Cognitive Behavioral Therapy for insomnia; HADS, Hospital Anxiety and Depression Scale; GBSS+, Great British Sleep Survey extended; A-D schedule, this consists of the State-Trait Anxiety Inventory (STAI-X1) and the Depfnression Questionnaire (DQ); OSAS, Obstructive Sleep Apnea Syndrome; INS, Insomnia; BSD, Behavioral Sleep Disorders; OSA, obstructive sleep apnea; CPAP, Continuous Positive Airway Pressure; MSLT, Multiple Sleep Latency Test; VAS, Visual Analogue Scale; EEG, Electroencephalogram; EOG, Electro-Oculogram; SIP, Sickness Impact Profile; SF-36, Short Form 36; TST, Total Sleep Time; WASO, Wake After Sleep Onset; WPAI–GH, Work Productivity and Activity Impairment–General Health Questionnaire; BAI, Beck Anxiety Inventory; CBC, cognitive/behavioral consequences; AC, Affective Consequences; nBFI, Neuroticism Subscale of the Big Five Inventory; CASQ, Cognitive-somatic Anxiety Questionnaire.

Table 2

Characteristics of Sleep Instruments Extracted from Selected Studies (n=112)

InstrumentsMeasurement ModelBurden (1)Response Format (2)Cultural andLanguageAdaptationPopulation in Which the Tool Was Validated
Number of QsScaleScoringRecall Period
The Bergen insomnia scale (BIS)68-points scale representing the frequency (number of days/week) ranging from 0 to 7Total= 0 to 42Last monthNRSelf- reportEnglishNR
The Insomnia Screening Scale (ISS)26NRInsomnia symptoms: 0–50Daytime function: 0–35Sleep environment: 0–25Sleep opportunity 0–20NRNRSelf- reportNR-Community sample: (6–72 years)-Primary insomnia
Sleep Quality Questionnaire (SQQ)105-point Likert-type scale (0–4)(0= strongly disagree, 4 = strongly agree).Total: 0–40Last month20 minutesSelf-reportJapaneseAdults
Women’s Health Initiative Insomnia Rating Scale (WHIIRS)55-points Likert scale (except 1 question 6 points Likert scale)Total: 0–20last monthNRSelf-reportEnglishPostmenopausal women
Restorative sleep questionnaire (RSQ)RSQ-daily: 11 itemsRSQ-weekly: 9 itemsLikert scale 1–5, in which 1 corresponds with “not at all” and 5 with “completely”.Total score:The average of the questionnaire items then converted to 0–100 scale.A minimum of 5 items must be completed.RSQ-D: One dayRSQ-W: Past weekNRSelf-reportEnglishNR
Pittsburg sleep quality index (PSQI)19 items (+ 5 questions for bed partner (optional)First 4 items (open ended),PSQI items use varyingresponse categories that include recording usual bedtime, usual wake time, number of actual hours slept, and number of minutes to fall asleep, as well asforced-choice four-point Likert scale different options including (“not during the past month”, “less than once a week”, “once or twice a week” and “three or more times a week”).Global PSQI score =0- 21Question score=0 to 3Past month5–10 minutesSelf-reportEnglishFrenchIndiaJapaneseChineseSpanishHungarianPersianSerbianThaiCzechKoreanKurdishArabicHebrewBrazilian PortuguesePortugueseSinhalaItalianMalay-US College students-Pregnant women-Lower back pain-University students-Psychiatric disordered people- Cancer patients-Breast cancer-Fibromyalgia (the Spanish version validated)-Older adults with Generalized anxiety disorders-Community dwelling Ethiopian adults
Insomnia severity index (ISI)7 items5-point Likert scaleTotal: 0 to 28Last 2 weeks<5 minutesSelf-reportClinicianSignificant others (usually spouse).EnglishGermanFrenchSpanishChineseItalianTurkishPersianArabicKoreanDanish-Veterans with traumatic brain injury-Lower back pain-Cancer patients-Community-dwelling older people
The Jenkins Sleep Scale (JSS)4 items6-point Likert scale, reflecting the frequency of having symptoms in the past month: (Not at all = 0; 1–3 days = 1, 4–7 days = 2; 8–14 days = 3; 15–21 days = 4; 22–28 days = 5).Total: 0–20(Higher scores indicate worse sleep quality; score >4 indicates the presence of sleep problem).Previous 1 monthNRSelf- reportEnglishTurkishPortuguese-Post cardiac surgery-Air traffic controllers- Patients with rheumatoid arthritis-Patients with Psoriatic arthritis-Ankylosing spondylitis-Shift workers-Patients with chest pain
Athens Insomnia scale (AIS)AIS-8: 8 itemsAIS-5: 5 itemsEach item is scored on a scale 0–3 with 0 indicating “no problem” and 3 indicating a “very serious problem”.An item is rated positive if the described problem’s frequency was ≥ 3 times/weekTotal score range:AIS-8= 0–24AIS-5= 0–15Flexible based on the study purpose (mostly 1 month)NRSelf-reportAdministered through an interviewerEnglishGreekSpanishJapanese (simplified version: 1 item)-Psychiatric patients-Individuals with primary insomnia- Chronic pain patients
Minimal insomnia Symptoms Scale (MISS)3 items5-point Likert scale (0= no, 4= very severe problems)Total= 0–12NRNRSelf-reportEnglish-Elderly
Sleep ConditionIndicator (SCI)8 items5-point scale (0–4)Total: 0 to 32-Scores can be transformed into a more intuitive 0–10 SCI range: by dividingthe total by 3.2, or by using an online version with automatedscoring.Typical night in the last monthNRSelf-reportEnglishFrenchSwedishCommunity dwelling adults (French population)
General Sleep Disturbance Scale (GSDS)21 items8-point Likert scale ranging from 0 (never) to 7 (every day); higher scores indicate more severely disturbed sleep.Total: 0–147Previous week5 minutes or lessSelf-reportEnglishChineseKoreanItalian-Female nurses-Parents of infants admitted to ICU
Daily Cognitive Communication and Sleep Profile (DCCASP)7 items7-point Likert scale (1=worst function and 7=best function)Total: 7–49DailyNR-Self- report- Therapist-Significant othersEnglishFrenchNR
Arabic Scale of Insomnia (ASI)12 items5-point Likert scale (0= No, 4=very much)Total 0–48Last monthNRSelf-reportedEnglishArabicNR
Maugeri Sleep Quality and Distress Inventory (MaSQuDI-17)173-points Likert scale(1=Never, 2= sometimes and 3= frequently)Total: 17 to 51NRNRSelf-reportEnglishNR
Epworth sleepiness scale (ESS)8Measured on 4-point Likert scale (0–3, corresponding to never dose off and high chance of dosing respectively)Total score (0–24).Recent timesNR-Self-report-Completed by roommateArabicEnglishEthiopianGermanGreekChinese (Eastern China)ItalianTurkishSpanish-Patients with sleep disorders-Patients with neurological disorders-Patients with psychiatric disorders-Workers (truck drivers)-Obstructive sleep apnea patients-African American population-Truck drivers
Karolinska sleepiness scale (KSS)1 item10 points Likert scale1 (extremely alert) to 9 (extremely sleepy, fighting sleep)Total score:7–95 minutes before answering the questionNRNREnglishJapaneseSpanish (for Mexican population)Registered nurses(2)
Functional outcomes of sleep questionnaire (FOSQ)FOSQ-30: 30 itemsFOSQ-10: 10 itemsFOSQ-30: Six-point Likert scale (how frequently an activity is performed, 0 = never did it, 5 = three or more times a week)Subscale score calculation:A response score of 0 = a N/A or missing response. Thus, the potential range of scores for any item is 1–4. Calculate the mean of the answered items with responses equal to or greater than 1 for each subscale. This is the weighted mean item total or subscale score.Global score:FOSQ-30:The mean of the subscale scores and multiply that by the number of subscales for which there is a score.- Scores range: 5–20FOSQ-10:Total: the mean of the subscale scores multiplied by 5.Subscale score: 1–3.Performance on a typical day (or generally how the person feels during the day).FOSQ-30:15 minutesFOSQ-10: NRSelf-administered paper-and-pencil questionnaire at a fifth-grade reading level.EnglishFOSQ-30:ThaiNorwegianSwedishFOSQ-10:PersianMandarin ChineseSpanishFOSQ-30:-Obstructive sleep apnea patients-Veterans with type 2 diabetes-Women with breast cancer (stages I–III)-Pregnant womenFOSQ-10:Family members of critically ill infants
Occupational impact of sleep questionnaire (OISQ)24 items5-point Likert scale. Scores on each question range from 0 (“never/not applicable to 4 “all of the time”).Total: 0–96Last monthNRNREnglishDutchPersianNR
The Sleep Functional Impact Scale (SFIS)26 items5-points Likert scaleTotal: 26–130≥13 items must be completed7 daysNRSelf-reportEnglish (US)Spanish (US)FrenchNR
Sleep disturbance questionnaire (SDQ)12 items5-point Likert scale(“never true,” “seldom true,” “sometimes true,” “often true,” “very often true”)NRTypical night with poor sleepNRNRNRNR
Hyperarousal Scale (H-scale)26Measured on a four-point Likert-type scale from 0 “not at all,” 1 “a little,” 2 “quite a bit” and 3 “Extremely”.The scale measures the summation score (HSUM); with 3 scores per item.NRNRSelf-reportEnglishSwedish-Insomniacs-Patients with hypersomnia
Dysfunctional Beliefs and Attitudes about Sleep Scale (DBAS)-DBAS: 30 or 31 items-DBAS-16: 16 items-DBAS 10: 10 itemsPlace a mark on a 10 cm long line within poles marking “strongly agree” and “strongly disagree”.DBAS full and DBAS 10:Average score across the itemsNot applicableNRSelf-reportEnglishChineseTaiwaneseGerman (DBAS-16)PersianNR
Insomnia daytime worry scale (IDWS)11 items5-point Likert scale.NRLast weekNRSelf-reportEnglishNR
Anxiety and Preoccupation about Sleep Questionnaire (APSQ)10 items-Original (2004): 10 points Likert scale(1= Not true to 10=very true)-Modified (2011): 5-point Likert scale. (1 = strongly disagree, 5= strongly agree)].Total:Original:10–100Modified: 10 to 50Original: past 3 daysModified: last monthNRSelf-reportEnglishNR
Sleep preoccupation scale (SPS)22 items7-point Likert scale (0= never to 6= all of the time)Total: 0–132NRNRSelf-reportEnglishNR
Catastrophic thoughts about insomnia scale (CTIS)18 items7-points Likert scaleRanges from 0=strongly disagree to 6= strongly agree and 3= neutralTotal: 0–108NRNRSelf-reportEnglishNR
Sleep hygiene-self test (SHS)30 itemsDichotomous response (yes or No)Total score(Yes = 1 point and No.=0 point). Every fifth item is scored in the reverse directionPrevious monthNRSelf-reportEnglishNR
Sleep hygiene index (SHI)13-item5–point scale ranging from 0 (never) to 4 (always).Total score: 0–52 or 13–65(higher score indicates worse sleep hygiene)NRNRSelf-reportEnglishPersianTurkishKorean-Individuals with chronic pain-Elderly-University students (Nigerian and Saudi)
Sleep Hygiene Awareness and Practice Scale (SHAPS)50 itemsIt asks respondents to rate the behavior on a scale from behavior is very beneficial to sleep (1) to behavior is very harmful to sleep (7). A rating of 4= behavior has no effect- 1 point= correct, 2 points if omitted and 3 points if incorrect (original developers lack et al, 1987)-Subsection 2: Score range: 0–7Respondents are asked to write y if the substance contains caffeine, n if it does not, and x if they have never heard of it. (brown et al, 2002).Awareness:Subsection 1:Total: Scores can range from 13 to 39,Subsection 2:This score can range from 0 to 100.Practice:The total: scores can range from 0 to 133; higher scores indicate behaviors more indicative of bad sleep hygieneNRNRSelf-reportNRInsomniacsGood sleepersUniversity students
Sleep Practicesand Attitudes Questionnaire (SPAQ)151 itemsThe sleep duration subscale: number of hours of habitual sleep, and the sleepdebt subscale: the percentage of difference between need and habitual sleepduration (with values < 0 reflecting more obtained than needed and values > 0 reflecting less obtained than needed).-All other subscales scores range from 0 to 1.NRNR10 minutesSelf-reportEnglishNR

Note: NR: Not reported; (1): Administration time; (2): Individuals who need to complete the questionnaire.

Studies Testing the Psychometric Properties of Extracted Sleep Instruments 89.3% of the samples had reliability coefficients ≥ 0.75 Community sample Clinical sample (n=183) ISI and sleep diary components ISI variables with PSG variable: The DSM-5 was used to develop the questionnaire, consultations were conducted, and a draft was published on the American psychiatric association website. SCI was negatively associated with the score of sleep quality questionnaires including PSQI (r = −0.734) and the ISI (r = −0.793). Sleep condition was significantly associated with physical and mental health (r =0.222 and r =0.335 respectively). Using the HADS scale, SCI was negatively correlated with symptoms of depression (r =−0.426) and anxiety (r =−0.400). But was higher than the associated detected in the RCT sample study (depression (r = −0.267), anxiety (r =−0.236) and stress (r = −0.263)). A cut-off score ≤16 was diagnostic for 89% of individuals who were identified as insomniacs on the ISI scale (scores of ≥15), with a capacity of correctly classifying 82% of non-insomniacs. These findings support the concurrent validity for the SCI and confirming that a score of ≤16 on the SCI could identify insomniac patients. Cronbach’s α = 0.857 (the GBSS (range of α-if-item-deleted 0.822–0.860). GBSS+ sample (α=0.865). The mean corrected item-total correlation was moderate (r =0.620). The likelihood of dosing off “when sitting, inactive in a public place” (rho= −0.44), when “sitting quietly after lunch without alcohol” (rho= −0.34) and when “in a car, stopped for a few minutes in the traffic” (rho= −0.41). 43 meeting DSM IV criteria for primary insomnia (26 women & 17 men) 43 controls DBAS total score significantly correlated with several subscale scores of the ISI, PSQI total scores and HADS-anxiety and HADS-depression subscale scores. The DBAS-10 “immediate consequences” subscale had significant correlation with ISI and HADS-anxiety subscale but was not related to HADS-depression subscale. Insomniacs Poor sleepers Normal sleepers Sleep-onset latency: r =0.28–0.34 WASO: r =0.32–0.37, TST: r = 0.26–0.31 Early morning awakening: r = 0.27–0.30 The APSQ and the two subscales were correlated with sleep quality (r = 0.40–0.48). SPS with the Sleep Associated Monitoring Index (SAMI): Good correlation between total scores of SPS and SAMI; Poor (r = 0.67, p< 0.001), average (r =0.58, p< 0.001) and good (r =0.62, p< 0.001) sleepers. The correlation between CTIS and PSQI scores was statistically significant, r (137) = 0.643, p< 0.001 The correlation between scores on the CTIS and scores on the ISI was also statistically significant, r (137) = 0.703, p <0.001 CTIS along with DBAS16, IDWS, nBFI, and CESD-103, age, gender, and length of education predicted only 51% of the variance in PSQI score F (8, 128) = 17.07, p< 0.001, R2 = 0.51. CTIS, CESD-103, and gender significantly predicted changes in the PSQ scores. CTIS, DBAS-16, IDWS, nBFI, and CESD-103 and age, gender, and level of education predicted around two-thirds of the changes seen in ISIS scores F (8, 128) = 32.07, p< 0.001, R2 =0.67.Each of the three instruments CTIS, the CESD-103, and the IDWS independently predicted the changes in ISI scores. Sleep hygiene awareness with PSQI rating: (r =0.21) Sleep hygiene practice with PSQI rating: (r =0.49, p <0.012) Sleep hygiene practice with sleep hygiene awareness (r =0.30, p <0.012). Variable sleep length, noise disturbance, going to bed thirsty, and worrying about the ability to fall asleep at bedtime were identified as significant predictors of sleep quality predictors (R2 = 0.24, adjusted R2 = 0.22, F (1, 118) = 5.30, p= 0.023). Abbreviations: MOS, Medical Outcomes Study; GHQ-12, General Health Questionnaire-12; CES-D Scale, Centre for Epidemiologic Studies Depression; FSS, Fatigue Severity Scale; SWLS, Satisfaction with Life Scale; DIMS, Disorders of Initiating and Maintaining Sleep; DOES, Disorders of Excessive Somnolence; SOL, Sleep Onset Latency; SF-12, Short Form 12; POMS, Profile of Mood States; NPV, Negative Predictive Value; PPV, Positive Predictive Value; BNSQ, Basic Nordic Sleep Questionnaire; ICC, Inter-Correlation Coefficient; SD, Standard Deviation; CBT-I, Cognitive Behavioral Therapy for insomnia; HADS, Hospital Anxiety and Depression Scale; GBSS+, Great British Sleep Survey extended; A-D schedule, this consists of the State-Trait Anxiety Inventory (STAI-X1) and the Depfnression Questionnaire (DQ); OSAS, Obstructive Sleep Apnea Syndrome; INS, Insomnia; BSD, Behavioral Sleep Disorders; OSA, obstructive sleep apnea; CPAP, Continuous Positive Airway Pressure; MSLT, Multiple Sleep Latency Test; VAS, Visual Analogue Scale; EEG, Electroencephalogram; EOG, Electro-Oculogram; SIP, Sickness Impact Profile; SF-36, Short Form 36; TST, Total Sleep Time; WASO, Wake After Sleep Onset; WPAI–GH, Work Productivity and Activity Impairment–General Health Questionnaire; BAI, Beck Anxiety Inventory; CBC, cognitive/behavioral consequences; AC, Affective Consequences; nBFI, Neuroticism Subscale of the Big Five Inventory; CASQ, Cognitive-somatic Anxiety Questionnaire. Characteristics of Sleep Instruments Extracted from Selected Studies (n=112) Note: NR: Not reported; (1): Administration time; (2): Individuals who need to complete the questionnaire. Flow chart of the systematic review process. As summarized in Tables 1 and 2, the majority of the instruments identified contain less than 20 items. The longest of these instruments is the Sleep Practices and Attitudes Questionnaire (SPAQ), which contains 151 questions divided into 16 different domains.75 While the Karolinska Sleepiness Scale (KSS),24 the Minimal Insomnia Symptoms Scale (MISS),25,26 and the Jenkins Sleep Scale (JSS)27–73 are the shortest instruments identified, consisting of one, three, and four questions, respectively. Some of the instruments have multiple versions, each consisting of a different number of questions [eg, the Functional Outcomes of Sleep Questionnaire (FOSQ)-30 and the FOSQ-10 have 30 and 10 questions, respectively].74,28 The majority of the instruments use Likert-type scales as response options to generate scores. The time needed to complete an instrument was not reported in the majority of the studies reviewed. However, the response burden, wherever reported, did not exceed 10 minutes. The recall period for the majority of the instruments was one month, except for the Insomnia Severity Index (ISI)29,30 and the the Sleep Functional Impact Scale (SFIS)31 which had a recall period of 2 weeks and 1 week, respectively. Only four of the identified instruments were available in Arabic, three of which (PSQI,32 ISI,33 and ESS34) were originally developed in English, but translated and validated in Arabic-speaking populations. The Arabic Scale of Insomnia (ASI)35 is the only instrument which was originally developed in Arabic. The instruments were classified into four categories based on the outcomes that were assessed in the 107 studies, as follows: (1) instruments screening for insomnia symptoms (n=14); (2) instruments assessing consequences of poor sleep (n=8); (3) instruments assessing the cognitive aspect of insomnia (n=5) and; (4) instruments evaluating sleep hygiene (n=4). A detailed description of the reported psychometric properties for the four categories of sleep instruments retrieved from the studies included in this review is provided in Table 3.
Table 3

Quality Assessment of Extracted Sleep Instruments* (n=31)

Sleep InstrumentReliabilityValidityResponsiveness bInterpretability b
Internal ConsistencyaTest Re-testbContentcConstructbCriterionb
Instruments screening for insomnia symptoms
 BIS++N/A+N/AN/AN/A
 ISS+N/AN/AN/A±N/A+
 SQQ++++N/AN/AN/A
 WHIIRS+++-N/AN/AN/A
 RSQ++++N/AN/AN/A
 PSQI++N/A++N/A+
 ISI+N/A+++++
 JSS+-N/A-N/AN/AN/A
 AIS++N/AN/AN/AN/A+
 MISS++N/AN/A+++
 SCI+N/A+++N/A+
 GSDS+N/AN/AN/A-N/AN/A
 DCCASP++N/AN/A-N/AN/A
 ASI++++?N/AN/A
Instruments evaluating consequences of poor sleep
 MaSQuDI-17+N/AN/A±N/AN/AN/A
 ESS++N/AN/A++N/A
 KSSN/AN/AN/AN/A+N/AN/A
 FOSQ++++++N/A
 OISQ++N/AN/A+N/AN/A
 SFIS+N/A++N/AN/AN/A
 SDQ+N/AN/AN/AN/AN/AN/A
 H-scaleN/AN/AN/AN/A-N/AN/A
Instruments assessing the cognitive aspect of insomnia
 DBAS+N/AN/A+?+N/A
 IDWS+N/AN/AN/A+N/AN/A
 APSQ+N/AN/A-N/AN/AN/A
 SPS+N/AN/A--N/AN/A
 CTIS+N/A+++N/AN/A
Instruments measuring sleep hygiene
 SHS-N/AN/AN/A-+N/A
 SHI-+N/A--N/AN/A
 SHAPS++N/A-N/AN/AN/A
 SPAQ-N/A+--N/AN/A

Notes: *Based on a modified version of the quality criteria developed by Terwee et al.23 aCronbach’s alpha: + if 0.7–0.95; - if (< 0.7 or >0.95). bPearson’s correlation coefficient (r): + if r≥0.70; - if r<0.70. cContent validity: + if reported. N/A: No information available to establish rating; ±, conflicting results in studies or inconsistent results across subscales? questionable/not rated.

Abbreviations: BIS, Bergen insomnia scale; ISS, The Insomnia Screening Scale; SQQ, Sleep Quality Questionnaire, WHIIRS, Women’s Health Initiative Insomnia Rating Scale; RSQ, Restorative sleep questionnaire; PSQI, Pittsburgh sleep quality index; ISI, Insomnia severity Index; JSS, Jenkins Sleep Scale; AIS, Athens Insomnia Scale; MISS, Minimal Insomnia Symptom Scale; SCI, Sleep Condition Indicator; GSDS, General Sleep Disturbance Scale; DCCASP, Daily Cognitive-Communication and Sleep Profile; AIS, Arabic Scale of Insomnia; MaSQuDI-17, Maugeri Sleep Quality and Distress Inventory; ESS, Epworth Sleepiness scale; KSS, Karolinska sleepiness scale; FOSQ, Functional outcomes of sleep questionnaire; OISQ, Occupational Impact of sleep questionnaire; SFIS, The Sleep Functional Impact Scale; SDQ, Sleep Disturbance Questionnaire; H-Scale, Hyperarousal scale; DBAS, Dysfunctional Beliefs and Attitudes about Sleep Scale; IDWS, Insomnia daytime worry scale; APSQ, Anxiety and Preoccupation about Sleep Questionnaire; SPS, Sleep preoccupation scale; CTIS, Catastrophic thoughts about insomnia scale; SHS, Sleep Hygiene Self-test; SHI, Sleep Hygiene Index; SHAPS, Sleep Hygiene Awareness and Practice Scale; SPAQ, Sleep Practices and Attitudes Questionnaire.

Quality Assessment of Extracted Sleep Instruments* (n=31) Notes: *Based on a modified version of the quality criteria developed by Terwee et al.23 aCronbach’s alpha: + if 0.7–0.95; - if (< 0.7 or >0.95). bPearson’s correlation coefficient (r): + if r≥0.70; - if r<0.70. cContent validity: + if reported. N/A: No information available to establish rating; ±, conflicting results in studies or inconsistent results across subscales? questionable/not rated. Abbreviations: BIS, Bergen insomnia scale; ISS, The Insomnia Screening Scale; SQQ, Sleep Quality Questionnaire, WHIIRS, Women’s Health Initiative Insomnia Rating Scale; RSQ, Restorative sleep questionnaire; PSQI, Pittsburgh sleep quality index; ISI, Insomnia severity Index; JSS, Jenkins Sleep Scale; AIS, Athens Insomnia Scale; MISS, Minimal Insomnia Symptom Scale; SCI, Sleep Condition Indicator; GSDS, General Sleep Disturbance Scale; DCCASP, Daily Cognitive-Communication and Sleep Profile; AIS, Arabic Scale of Insomnia; MaSQuDI-17, Maugeri Sleep Quality and Distress Inventory; ESS, Epworth Sleepiness scale; KSS, Karolinska sleepiness scale; FOSQ, Functional outcomes of sleep questionnaire; OISQ, Occupational Impact of sleep questionnaire; SFIS, The Sleep Functional Impact Scale; SDQ, Sleep Disturbance Questionnaire; H-Scale, Hyperarousal scale; DBAS, Dysfunctional Beliefs and Attitudes about Sleep Scale; IDWS, Insomnia daytime worry scale; APSQ, Anxiety and Preoccupation about Sleep Questionnaire; SPS, Sleep preoccupation scale; CTIS, Catastrophic thoughts about insomnia scale; SHS, Sleep Hygiene Self-test; SHI, Sleep Hygiene Index; SHAPS, Sleep Hygiene Awareness and Practice Scale; SPAQ, Sleep Practices and Attitudes Questionnaire.

Instruments Screening for Insomnia Symptoms

From the 14 instruments screening for insomnia symptoms, two [the Women’s Health Initiative Insomnia Rating Scale (WHIIRS)36,37 and the Restorative Sleep Questionnaire (RSQ)38] were dimension-specific and focus on evaluating the sleep quality through assessing problems with initiation and maintenance of sleep. The remaining 12 instruments were multidimensional, assessing sleep quality and consequences of poor sleep. Some of the identified instruments also measure other specific dimensions such as satisfaction with sleep in the Bergen Insomnia Scale (BIS)39 and sleep environment in the Insomnia Screening Scale (ISS).40 While the Daily Cognitive-Communication and Sleep Profile (DCCASP)41 measures the daily fluctuation in sleep quality and evaluates the impact of these fluctuations on the individual’s cognitive functioning and communication, while the General Sleep Disturbance Scale (GSDS)42,43 assesses sleeping patterns and the use of sleep aids during the past month.

Instruments Assessing Consequences of Poor Sleep

These instruments measure an individual’s functioning and daytime performance. Two of these instruments [the Epworth Sleepiness Scale (ESS44–46 and the KSS)24,47 focus on assessing sleepiness only, whereas the Hyperarousal Scale (H-Scale)48,49 also assesses daytime alertness. Three instruments, the FOSQ,74,28 Occupational Impact of Sleep Questionnaire (OISQ),50 and the Sleep Functional Impact Scale (SFIS),31 evaluate the effects of insomnia on overall functioning. Conversely, the Sleep Disturbance Questionnaire (SDQ)51 evaluates factors which contribute to poor sleep, and the Maugeri Sleep Quality and Distress Inventory (MaSQuDI-17)52 measures the emotional burden of insomnia.

Instruments Assessing Cognitive Aspects of Insomnia

These were in line with the ICD-10 criteria which identifies preoccupation with sleepiness and excessive worry about the consequences of insomnia during the day as one of the clinical features for insomnia.53 Four of these instruments [Insomnia Daytime Worry Scale (IDWS),54 Anxiety and Preoccupation about Sleep Questionnaire (APSQ),55,56 Sleep Preoccupation Scale (SPS)57] evaluate the extent of worry about insomnia. The Dysfunctional Beliefs and Attitudes about Sleep Scale (DBAS) explores individuals’ perceptions and beliefs regarding insomnia.51,58,59 While the Catastrophic Thoughts about Insomnia Scale (CTIS)60 assess exaggerated thinking about insomnia and the severity of the consequences.

Instruments Assessing Sleep Hygiene

Out of the four instruments included under this category [Sleep Hygiene Index (SHI),61 Sleep Hygiene Awareness and Practice Scale (SHAPS),62 Sleep Hygiene Self-test (SHS)63 and the SPAQ75], only the SPAQ is a comprehensive instrument which evaluates sleep quality, quantity, consequences of poor sleep, and sleep hygiene. The SHS also evaluates the effects of interventions which are expected to improve sleep hygiene, and is the only sleep-hygiene instrument which has been validated for responsiveness. The SHAPS measures knowledge about sleep hygiene in addition to behaviors surrounding sleep. Some of the sleep instruments which were validated in the 107 articles included in this review, such as the BIS,39 Sleep Condition Indicator (SCI),64 ISS, Athens Insomnia Scale (AIS)65,66 and Sleep Hygiene Index (SHI)61 were developed in accordance with international diagnostic criteria for insomnia such as the International Classification of Sleep Disorders 2nd edition (ICSD-II), International Classification of Diseases 10th edition (ICD-10), and the Diagnostic and Statistical Manual of Mental Disorders (DSM). The majority of these instruments were validated among university students and in primary insomniacs. Many of the identified sleep instruments have been translated and culturally adapted in variety of languages, such as the Pittsburgh Sleep Quality Index (PSQI),67–69 ISI, and ESS which are available in 18, 10, and 6 languages, respectively.

Quality Assessment

The majority of the instruments used in the 107 studies included in this review did not meet all of the eight criteria set by the SAC-MOT.22 However, the conceptual and measurement models were specified for all instruments.Additionally, the reliability of sleep measures was usually reported in the validation studies, particularly the internal consistency reliability was reported for all instruments, except for the H-scale and the KSS. The reported internal consistency values for the sleep and sleep hygiene instruments across the studies included in this systematic review ranged between 0.53 and 0.97. As outlined in Table 3, the test-retest reliability was only reported for 15 of the 31 instruments. Validity measures were reported for almost all of the instruments identified (29 out of 31), of which criterion validity was the most commonly reported and content validity the least reported validity measure. The generalizability of the psychometric properties reported was examined for three instruments (ISS, ISI and AIS) in community samples and primary insomniacs. Only six instruments (MaSQuDI-17, ESS, FOSQ, DBAS, APSQ, SPS) provided evidence of discriminating capacity between healthy individuals and poor sleepers. As summarized in Table 1, some of the sleep instruments were tested for other validity measures such as incremental validity, diagnostic validity, and external validity.40,64-67,70,76 Additionally, the SHS was the only instrument that was assessed for responsiveness to change in sleep hygiene after interventions.46 With the exception of SHAPS, all of these instruments had low internal consistency (α < 0.7) and only the SHI and the SHAPS were evaluated for their reproducibility.75,61–63

Discussion

This study identified 31 sleep instruments and described in detail their psychometric properties as described in 107 validation and cultural adaptation studies. In the validation studies included in this review, only the ISI, FOSQ and MISS were evaluated for all four psychometric properties such as responsiveness, interpretability, reliability and validity.25,73-28,69,76 Because most validation studies try measuring the same construct in different settings, it was expected that the sleep instruments in these studies were mostly tested for their reliability. For the most part, the sleep instruments identified in this systematic review had good internal consistency with Cronbach’s alpha values ≥0.7, which is in line with the recommended threshold for adequate reliability reported in the literature.77 However, overall lower Cronbach’s alpha values were reported for instruments measuring sleep hygiene. It has been suggested that the low internal consistency identified for sleep hygiene measures could be due to the definition of sleep hygiene, which consists of different factors (not necessarily related) that have the potential for negatively affecting sleep.61 Thus, items in sleep hygiene instruments may appear to be poorly related in the internal consistency test. This suggests that when assessing the psychometric properties of sleep hygiene instruments, the use of other validity and reliability measures is recommended to avoid depending on Cronbach’s alpha values alone. The validation studies included in this systematic review reported on the reproducibility of only 15 out of the 31 instruments identified. Mostly were assessed through the test-retest reliability method. However, the time frame between the two measurements was highly variable, ranging between 2 days and 6 months.27,69 There is still no consensus in the literature on the best time interval for reproducibility tests. Findings from studies that compared between different time intervals suggests that test results from an interval of 2 days and 2 months are similar to those resulting from a 2-weeks’ interval.78,79 Criterion validity was the most commonly reported validity measure in the studies included in this review. Concurrent validity (which compares the score of the instrument to that of a gold standard which assesses the same construct) was evaluated for 18 out of the 31 sleep instruments identified, whereas the predictive validity (which provides evidence of an instrument’s ability to forecast a particular outcome in the future) was assessed only for 5 instruments. Considering that most of the validation studies included in this review were done on newly developed instruments, not prioritizing on assessing their predictive validity is understandable. Some of the validation studies such as those done on ISI, RSQ and PSQI, compared the scores generated by the instruments to non-subjective assessment tools such as the PSG.29,37,65 The quality criteria for questionnaires developed by Terwee et al recommends providing the reasons for selecting a measure to be a gold standard in a validation study.23 However, in the studies included in this systematic review, the choice of the instrument to be used as a comparator or as a gold standard was rarely justified. In situations where a gold standard measure is not available, construct validity could be used to assess the instrument’s validity.80 This is the case for some of the instruments included in this review, such as the RSQ, APSQ and SHAPS, which evaluate new sleep-related concepts for which gold standards are not available. The diagnostic validity of an instrument is an important measure, particularly for clinicians, as it examines the extent to which the instrument can accurately differentiate between healthy individuals and insomniacs at a specific score.81 Diagnostic validity was reported for only five of the instruments identified in this review (specifically the PSQI, ISI, AIS, MISS and the SCI). Incremental validity (which describes the ability of an instrument to predict a variable of interest beyond what is possible by other existing instruments) was only tested for the Sleep Quality Questionnaire (SQQ).62 This measure is important as it facilitates the comparison between different instruments and provides evidence of which instrument is superior. The selection of an insomnia instrument in practice is also affected by factors such as the length of the instrument, the time required for completion and the languages in which it is available. Around half of the instruments identified in this review consisted of ten questions or less, and those assessing the cognitive aspects of insomnia or sleep hygiene were usually longer. However, the duration required for completion was not commonly reported. Only six instruments were available in more than three languages, and only four in Arabic language.34 The limited availability of instruments in Arabic language offers a valuable opportunity for researchers interested in sleep medicine in the Arab region to translate and validate these instruments. Despite the comprehensiveness of this systematic review, the time elapsed between the initial database search and the publication of the results is an important limitation. To address this, a supplementary literature search using the same search strategy and databases was undertaken in April 2020, which identified additional validation studies and 7 new instruments including: the Insomnia Catastrophising Scale (ICS),82,83 the Single-item sleep quality scale (SQS),84 the Lebanese insomnia scale (LIS-18),85 the Daytime Sleepiness Perception Scale-4 (DSPS-4),86 the Indian Sleepiness Scale (InSS),87 Athlete Sleep Behavior Questionnaire (ASBQ),88 and the Non-restorative Sleep Scale (NRS)89. These findings are indicative that the improvement in scientists’ understanding of insomnia pathophysiology has been associated with a parallel increase in the development of instruments which measure different aspects of sleep dysfunction and related risk factors. Researchers also appear to be acknowledging the cultural effects on sleep behaviors and insomnia perception, as two of the newly developed instruments (LIS-18 and InSS) were developed to describe sleep from a non-Western perspective. Comparing between the psychometric properties of insomnia instruments across different populations is interesting and could be the focus of a future investigation. In addition, because this systematic review was focused on identifying instruments that assess the symptoms of insomnia, daytime consequences and those related to sleep hygiene, instruments assessing arousal before sleep only such as the Pre-Sleep Arousal Scale (PSAP)90,91 were excluded.

Conclusion

Different instruments are available for evaluating various aspects of sleep and sleep hygiene. The validity and reliability of most of these instruments have been tested and are well established. However, psychometric testing for several of the available sleep instruments remains incomplete, particularly responsiveness and interpretability. Our findings suggest that future studies should focus on reporting more psychometric measures to ensure the trustworthiness of the findings generated by these instruments. The number of sleep instruments available in languages other than English including Arabic are limited, indicating a need to translate and culturally adapt many of these instruments into various languages to be available for use in clinical practice and research in different populations.
  81 in total

Review 1.  Assessing health status and quality-of-life instruments: attributes and review criteria.

Authors:  Neil Aaronson; Jordi Alonso; Audrey Burnam; Kathleen N Lohr; Donald L Patrick; Edward Perrin; Ruth E Stein
Journal:  Qual Life Res       Date:  2002-05       Impact factor: 4.147

2.  A scale for the estimation of sleep problems in clinical research.

Authors:  C D Jenkins; B A Stanton; S J Niemcryk; R M Rose
Journal:  J Clin Epidemiol       Date:  1988       Impact factor: 6.437

Review 3.  Subjective sleep measures for adolescents: a systematic review.

Authors:  X Ji; J Liu
Journal:  Child Care Health Dev       Date:  2016-08-06       Impact factor: 2.508

4.  Sleep hygiene practices in a population-based sample of insomniacs.

Authors:  Catherine D Jefferson; Christopher L Drake; Holly M Scofield; Eric Myers; Tara McClure; Timothy Roehrs; Thomas Roth
Journal:  Sleep       Date:  2005-05       Impact factor: 5.849

5.  Psychometric Comparison of the Full and Abbreviated Versions of the Dysfunctional Beliefs and Attitudes about Sleep Scale.

Authors:  Ka-Fai Chung; Fiona Yan-Yee Ho; Wing-Fai Yeung
Journal:  J Clin Sleep Med       Date:  2016-06-15       Impact factor: 4.062

6.  Psychometric properties of an insomnia-specific measure of worry: the Anxiety and Preoccupation about Sleep Questionnaire.

Authors:  Markus Jansson-Frojmark; Allison G Harvey; Lars-Gunnar Lundh; Annika Norell-Clarke; Steven J Linton
Journal:  Cogn Behav Ther       Date:  2011

7.  Prevalence and consequences of insomnia disorders in the general population of Italy.

Authors:  Maurice M Ohayon; Salvatore Smirne
Journal:  Sleep Med       Date:  2002-03       Impact factor: 3.492

8.  Reliability and validity of the Women's Health Initiative Insomnia Rating Scale.

Authors:  Douglas W Levine; Daniel F Kripke; Robert M Kaplan; Megan A Lewis; Michelle J Naughton; Deborah J Bowen; Sally A Shumaker
Journal:  Psychol Assess       Date:  2003-06

Review 9.  Quantity and quality of sleep and incidence of type 2 diabetes: a systematic review and meta-analysis.

Authors:  Francesco P Cappuccio; Lanfranco D'Elia; Pasquale Strazzullo; Michelle A Miller
Journal:  Diabetes Care       Date:  2009-11-12       Impact factor: 19.112

10.  The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research.

Authors:  D J Buysse; C F Reynolds; T H Monk; S R Berman; D J Kupfer
Journal:  Psychiatry Res       Date:  1989-05       Impact factor: 3.222

View more
  1 in total

1.  Prevalence and variables associated with insomnia among COVID-19 survivors in Colombia.

Authors:  John Carlos Pedrozo-Pupo; Carmen Cecilia Caballero-Domínguez; Adalberto Campo-Arias
Journal:  Acta Biomed       Date:  2022-03-14
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.