| Literature DB >> 26808674 |
Reinie Cordier1, Yu-Wei Chen2, Renée Speyer3, Rebekah Totino1, Kenji Doma3, Anthony Leicht3, Nicole Brown3, Belinda Cuomo1.
Abstract
INTRODUCTION: Improving occupational performance is a key service of occupational therapists and client-centred approach to care is central to clinical practice. As such it is important to comprehensively evaluate the quality of psychometric properties reported across measures of occupational performance; in order to guide assessment and treatment planning.Entities:
Mesh:
Year: 2016 PMID: 26808674 PMCID: PMC4726555 DOI: 10.1371/journal.pone.0147751
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Search Terms.
| English Language; Human, Preschool age (2–5 years), School Age (6–12 years), Adolescence (13–17 years) | |
| English language; Human, Preschool age (2–5 years), School Age (6–12 years), Adolescence (13–17 years | |
| English language; Human, Preschool age (2–5 years), School Age (6–12 years), Adolescence (13–17 years), | |
| English language; Human, birth-18 years, | |
| English, Preschool age (2–5 years), School Age (6–12 years), Adolescence (13–17 years), human, Publication year: 20130401–20140531 | |
| English, Preschool age (2–5 years), School Age (6–12 years), Adolescence (13–17 years), human, Publication year 2013–2014 | |
| English, Preschool age (<1–5 years), School Age (6–12 years), Adolescence (13–17 years), human, Publication year 2013–2014 | |
| English, Child: birth-18 years, human, Publication year 20130425–20141231 | |
| English Language; Human | |
| English language; Human | |
| English language; Human | |
| English language; Human | |
| Publication year: 2013–2014 | |
| None |
Fig 1Flowchart of included studies, manuals and measures.
COSMIN: Definitions of domains, psychometric properties, and aspects of psychometric properties for Health-Related Patient-Reported Outcomes (adapted from Mokkink, Terwee [42]).
| Psychometric property | Domain: Definition |
|---|---|
| The degree that the content of an instrument adequately reflects the construct to be measured. | |
| Face validity | The degree to which instrument (items) appear to be an adequate reflection of the construct to be measured. |
| The extent to which the scores of an instrument are consistent with hypotheses, based on the assumption that the instrument is a valid measure of the construct being measured. | |
| Structural validity | The extent to which instrument scores adequately reflect the dimensionality of the construct to be measured. |
| Hypothesis testing | Item construct validity. |
| Cross-cultural validity | The extent that performance of the items on a translated or culturally adapted instrument adequately replicates the performance of the items of the original version of the instrument. |
| The degree to which the scores of an instrument satisfactorily reflect a “gold standard”. | |
| The level of correlation amongst items. | |
| The proportion of total variance in the measurements due to “true” differences amongst patients. | |
| The error of a patient’s score, systematic and random, |
Notes.
aApplies to Health-Related Patient-Reported Outcomes (HR-PRO) instruments.
bAspect of content validity under the domain of validity.
cAspects of construct validity under the domain of validity.
dInterpretability is not considered a psychometric property
Criteria of psychometric quality rating (adapted from Terwee et al. [40]).
| Psychometric property | Score | Quality Criteria |
|---|---|---|
| + | A clear description is provided of the measurement aim, the target population, the concepts that are being measured, and the item selection AND target population and (investigators OR experts) were involved in item selection | |
| ? | A clear description of above-mentioned aspects is lacking OR only target population involved OR doubtful design or method | |
| - | No target population involvement | |
| ± | Conflicting results | |
| NR | No information found on target population involvement | |
| NE | Not evaluated | |
| + | Factors should explain at least 50% of the variance | |
| ? | Explained variance not mentioned | |
| - | Factors explain < 50% of the variance | |
| ± | Conflicting results | |
| NR | No information found on structural validity | |
| NE | Not evaluated | |
| + | Specific hypotheses were formulated AND at least 75% of the results are in accordance with these hypotheses | |
| ? | Doubtful design or method (e.g., no hypotheses) | |
| - | Less than 75% of hypotheses were confirmed, despite adequate design and methods | |
| ± | Conflicting results between studies within the same manual | |
| NR | No information found on hypotheses testing | |
| NE | Not evaluated | |
| + | Convincing arguments that gold standard is “gold” AND correlation with gold standard ≥0.70 | |
| ? | No convincing arguments that gold standard is “gold” OR doubtful design or method | |
| - | Correlation with gold standard <0.70, despite adequate design and method | |
| ± | Conflicting results | |
| NR | No information found on criterion validity | |
| NE | Not evaluated | |
| + | Factor analyses performed on adequate sample size (7 * # items and ≥ 100) ANDCronbach’s alpha(s) calculated per dimension AND Cronbach’s alpha(s) between 0.70 and 0.95 | |
| ? | No factor analysis OR doubtful design or method | |
| - | Cronbach’s alpha(s) <0.70 or >0.95, despite adequate design and method | |
| ± | Conflicting results | |
| NR | No information found on internal consistency | |
| NE | Not evaluated | |
| + | ICC or weighted Kappa ≥ 0.70 | |
| ? | Doubtful design or method (e.g., time interval not mentioned) | |
| - | ICC or weighted Kappa < 0.70, despite adequate design and method | |
| ± | Conflicting results | |
| NR | No information found on reliability | |
| NE | Not evaluated | |
| + | MIC < SDC OR MIC outside the LOA OR convincing arguments that agreement is acceptable | |
| ? | Doubtful design or method OR (MIC not defined AND no convincing arguments that agreement is acceptable) | |
| - | MIC ≥ SDC OR MIC equals or inside LOA, despite adequate design and method; | |
| ± | Conflicting results | |
| NR | No information found on measurement error | |
| NE | Not evaluated |
Notes.
aScores: + = positive rating,? = indeterminate rating,— = negative rating, ± = conflicting data, NR = not reported, NE = not evaluated (for study of poor methodological quality according to COSMIN rating [Table 6], data are excluded from further evaluation [Tables 7 and 8])
bDoubtful design or method is assigned when a clear description of the design or methods of the study is lacking, sample size smaller than 50 subjects (should be at least 50 in every subgroup analysis), or any important methodological weakness in the design or execution of the study
cHypothesis testing: all correlations should be statistically significant (if not, these hypotheses are not confirmed) AND these correlations should be at least moderate (r > 0.5)
dMeasurement error: MIC = minimal important change, SDC = smallest detectable change, LOA = limits of agreement.
Overview of the psychometric properties and methodological quality of occupational performance instruments.
| Instrument & Author(s) | Year | Internal consistency | Reliability | Measurement error | Content validity | Structural validity | Hypotheses testing | Cross-cultural validity | Criterion validity |
|---|---|---|---|---|---|---|---|---|---|
| Ricon et al. [ | 2013 | Fair (42.9) | NR | NR | NR | NR | Fair (29.2) | NR | NR |
| Vroland-Nordstrom & Krumlinde-Sundholm [ | 2012 | NR | Excellent (79.5) | NR | NR | NR | Good (56.5) | Excellent (80.5) | NR |
| King et al. [ | 2004 | Good (61.9) | Good (70.7) | NR | Good (64.3) | NR | NR | NR | NR |
| King et al. [ | 2006 | NR | NR | NR | NR | NR | Good (74.6) | NR | NR |
| Colón et al. [ | 2008 | Excellent (81.0) | NR | NR | NR | NR | Excellent (86.3) | Good (51.5) | NR |
| Ullenhag et al. [ | 2012 | Excellent (76.2) | NR | NR | Excellent (78.6) | NR | Good (64.8) | Good (54.4) | NR |
| Potvin et al. [ | 2013 | NR | Good (69.0) | NR | Poor (21.4) | NR | NR | NR | NR |
| Nordtorp et al. [ | 2013 | Good (71.5) | Excellent (75.9) | Excellent (82.7) | NR | NR | NR | NR | NR |
| Longo et al. [ | 2012 | NR | Good (72.4) | NR | NR | NR | Good (55.8) | Good (51.5) | NR |
| NR | |||||||||
| King et al. [ | 2004 | Good (61.9) | NR | NR | Good (64.3) | Good (58.3) | NR | NR | NR |
| King et al. [ | 2006 | NR | NR | NR | NR | NR | Good (49.7) | NR | NR |
| Colón et al. [ | 2008 | Excellent (81.0) | NR | NR | NR | NR | Excellent (86.3) | Good (51.5) | NR |
| Potvin et al. [ | 2013 | NR | Good (72.4) | NR | Good (71.4) | NR | NR | NR | NR |
| Nordtorp et al. [ | 2013 | Good (71.5) | Good (73.8) | Excellent (82.7) | NR | NR | NR | NR | NR |
| Keller et al. (I)[ | 2005 | NR | NR | NR | NR | Fair (33.3) | Fair (41.3) | NR | NR |
| Keller et al. (II)[ | 2005 | NR | NR | NR | NR | Fair (50.0) | Fair (41.3) | NR | NR |
| Romero Ayuso et al. [ | 2009 | Fair (38.1) | NR | NR | NR | NR | Poor (17.4) | Poor (24.3) | NR |
| Kramer et al. [ | 2010 | Excellent (90.0) | NR | NR | NR | Good (66.7) | Good (64.8) | NR | NR |
| Asgari & Kramer [ | 2008 | NR | NR | NR | NR | Good (72.2) | NR | Fair (46.2) | NR |
| Taylor et al. [ | 2011 | NR | Excellent (88.6) | NR | NR | NR | Good (73.9) | NR | NR |
Notes.
*The quality of the studies that evaluated the psychometric properties of each instrument were evaluated according to the COSMIN rating. Four-point scale was used (1 = Poor, 2 = Fair, 3 = Good, 4 = Excellent) and the outcome was presented as percentage of rating (Poor = 0–25.0%, Fair = 25.1%-50.0%, Good = 50.1%-75.0%, Excellent = 75.1%-100.0%)
A = Article
M = Manual
NR: not reported.
Quality of psychometric properties based on the criteria by Terwee et al. [40].
| Assessment Reference | Internal consistency | Reliability | Measurement Error | Content validity | Structural validity | Hypothesis testing | Criterion validity |
|---|---|---|---|---|---|---|---|
| Ricon et al. [ | ? | NR | NR | NR | NR | - | NR |
| Vroland-Nordstrom & studyKrumlinde-Sundholm [ | NR | - | NR | NR | NR | ? | NR |
| King et al. [ | - | - | NR | + | NR | NR | NR |
| King et al. [ | NR | NR | NR | NR | NR | + | NR |
| Colón et al. [ | ? | NR | NR | NR | NR | + | NR |
| Ullenhag et al. [ | ? | NR | NR | + | NR | + | NR |
| Potvin et al. [ | NR | - | NR | NE | NR | NR | NR |
| Nordtorp et al. [ | ? | - | - | NR | NR | NR | NR |
| Longo et al. [ | NR | ± | NR | NR | NR | ± | NR |
| King et al. [ | + | NR | NR | + | + | NR | NR |
| King et al. [ | NR | NR | NR | NR | NR | + | NR |
| Colón et al. [ | + | NR | NR | NR | NR | + | NR |
| Potvin et al. [ | NR | - | NR | ? | NR | NR | NR |
| Nordtorp et al. [ | ? | + | - | NR | NR | NR | NR |
| Keller et al. (I)[ | NR | NR | NR | NR | + | - | NR |
| Keller et al. (II)[ | NR | NR | NR | NR | + | + | NR |
| Romero Ayuso et al. [ | ? | NR | NR | NR | NR | NE | NR |
| Kramer et al. [ | ? | NR | NR | NR | + | - | NR |
| Asgari & Kramer [ | NR | NR | NR | NR | ? | NR | NR |
| Taylor et al. [ | NR | ? | NR | NR | NR | + | NR |
Notes. Quality criteria [40, 41] + = positive rating;? = indeterminate rating;— = negative rating; ± = conflicting data; NR = not reported; NE = not evaluated (study of poor methodological quality according to COSMIN rating—data are excluded from further analyses).
Overall quality score of assessments for each psychometric property based on levels of evidence by Schellingerhout et al. [41].
| Assessment | Internal consistency | Reliability | Measurement Error | Content validity | Structural validity | Hypothesis testing | Criterion validity |
|---|---|---|---|---|---|---|---|
| Indeterminate | NR | NR | NR | NR | Limited (negative result) | NR | |
| NR | Strong (negative result) | NR | NR | NR | Indeterminate | NR | |
| Moderate (negative result) | Strong (negative result) | Strong (negative result) | Strong (positive result) | NR | Strong (positive result) | NR | |
| Strong (positive result) | Conflicting | Strong (negative result) | Moderate (positive result) | Moderate (positive result) | Strong (positive result) | NR | |
| Indeterminate | NR | NR | NR | Moderate (positive result) | Conflicting | NR | |
| NR | Indeterminate | NR | NR | Indeterminate | Moderate (positive result) | NR |
Notes. Level of Evidence [41]: Strong evidence positive/negative result (consistent findings in multiple studies of good methodological quality OR in one study of excellent methodological quality); Moderate evidence positive/negative results (consistent findings in multiple studies of fair methodological quality OR in one study of good methodological quality); Limited evidence positive/negative result (one study of fair methodological quality); Conflicting findings; Indeterminate = only indeterminate ratings on the measurement property (i.e., score =? in Table 7); NR = not reported.
Characteristics of the instrument for the assessment of occupational performance.
| Instrument (Acronym) | Purpose of instrument | Published year | Number of subscales/ forms | Total number of items | Response options |
|---|---|---|---|---|---|
| Perceived Efficacy and Goal Setting System (PEGS) [ | To enable children with disabilities to set and prioritise their own intervention goals based on their perception regarding competence in performing daily tasks | 2004 | 3 categories | 24 | 1 –a lot like the less competent child; 2 –a little like the less competent child; 3 –a little like the competent child; 4 –a lot like the competent child |
| Make My Day (MMD) [ | To collect information regarding young children’s perceptions of their performance, degree of independence and satisfaction from daily activities | 2013 | 10 subscales | 34 | 4-point Likert scale |
| Children's Assessment of Participation and Enjoyment (CAPE) [ | A measure of children’s participation in recreation and leisure activities as a construct consisting of multiple domains and dimensions | 2004 | 5 sections (diversity, intensity, whom, where, enjoyment). Three levels of scoring: (i) overall participation; (ii) domain scores reflecting participation in formal and informal activities; and (iii) scores reflecting participation in five types of activities (i.e. recreational, active physical, social, skill-based and self-improvement activities) | 55 | Likert scales are used for each dimension (diversity, intensity, where, with whom, enjoyment) Diversity: Have you done the activity within the last 4 months? Answers “Yes” or “No”. Intensity/Frequency: How often have you performed the activity? Answers from 1- once in the past 4 months, to 7—once a day or more Where: Where do the activities take place? Answers from 1 –at home to 6 –beyond your community. With Whom: With whom do you do the activity? Answers, from 1 –alone to 5 –with others. Enjoyment: (How much do you like or enjoy this activity? Answers from 1- not at all to 5—love it |
| Preferences for Activities of Children (PAC) [ | Extension of CAPE. Measures child preferences for activities. | 2004 | As for CAPE | 55 | A three-point rating ranging from 1 = I would not like to do at all, to 3 = I would really like to do |
| Child Occupational Self-Assessment (COSA) [ | A self-report of occupational competence and value for activities. | 2004 (v 2.0) | 2 subscales | 24 | Competence ratings are given the scores: 1 = I have a big problem doing this; 2 = I have a little problem doing this; 3 = I do this ok; 4 = I am really good at doing this. Values ratings are scored as: 1 = Not really important to me 2 = Important to me 3 = Really important to me 4 = Most important of all to me |
| Occupational Self-Assessment (OSA) [ | To capture individuals' perceptions of how illness and disability affect their occupations and rate their own competence and assign value to performance/participation. | 2006 (v 2.2) | 2 subscales | 21 | Competence ratings are given the scores: 1 = I have a lot of problem doing this; 2 = I have some difficulty doing this; 3 = I do this well; 4 = I do this extremely well. Values ratings are scored as: 1 = this is not so important to me; 2 = this is important to me; 3 = this is more important to me; 4 = this is most important to me. |
Description of studies and manuals for the development and validation of instrument for the assessment of occupational performance.
| Instrument | Reference | Purpose of study | Study population | Age (range[R] and/or Mean[M] Standard deviation[SD]) |
|---|---|---|---|---|
| Vroland-Nordstrom & Krumlinde-Sundholm (I) [ | To translate, adapt, and assess a Swedish-language version of the PEGS | N = 64; (I) pre-testing: | Total sample: R = 5-12y, M = 8.5, SD = 2; (I) R = 5-12y, M = 9y, SD = 3; (II) R = 5-12y, M = 8y, SD = 2 | |
| Vroland-Nordstrom & Krumlinde-Sundholm (II) [ | Study 1: To evaluate the test–retest reliability of child perceptions using the Swedish version of the PEGS. Study 2: To evaluate agreement between children’s and parents’ perceptions of the child’s competence and choices of intervention goals | N = 44; Test-retest samples: Study 1: | Total sample: R = 5-12y, M = 8y, SD = 2; Study 1: R = 5-12y, M = 8y, SD = 2, Study 2: R = 5-12y, M = 8y, SD = 2 | |
| Ricon et al. [ | Internal consistency; Concurrent validity between PEGS and MMD; Hypothesis testing the concurrent validity of the child- and parent-report MMD measures | N = 62; 3 age groups of participants: (I) group 1: | Total sample: R = 4-7y, M = 5.16y, SD = 0.92; (I) R = 4-5y, M = NR, SD = NR; (II) R = 5-6y, M = NR, SD = NR; (III) R = 6-7y, M = NR, SD = NR | |
| Colón et al. [ | Internal consistency; Hypothesis testing to compare CAPE scores of children with and without disability; To assess the validity and reliability of the newly developed Spanish version | N = 249; (I) 6-10y group: | Total sample: R = 6-15y, M = NR, SD = NR; (I) R = 6-10y, M = NR, SD = NR; (II) R = 11-15y, M = NR, SD = NR; (III) R = 6-15y, M = NR, SD = NR; (IV) R = 6-15y, M = NR, SD = NR; (V) R = 6-15y, M = NR, SD = NR; (VI) R = 6-15y, M = NR, SD = NR | |
| King et al. [ | Hypothesis testing to test group differences on CAPE activity type enjoyment scores | N = 427; (I) 6- 8y11m: | Total sample: R = 6-15y, M = NR, SD = NR; (I) R = 6- 8y11m, M = NR, SD = NR; (II) R = 9-11y11m, M = NR, SD = NR; (III) 12-15y, M = NR, SD = NR; (IV) R = 6-15y, M = NR, SD = NR; (V) R = 6-15y, M = NR, SD = NR | |
| Longo et al. [ | To translate, adapt, and assess the cross-cultural validity and test-retest reliability of a Spanish version of the CAPE; Hypothesis testing to test whether the Spanish version of the CAPE discriminates between TD children and adolescents and those with CP | N = 398; (I) TD: | Total sample: R = 8-18y, M = NR, SD = NR; (I) R = 8-18y, M = 13.21y, SD = 3.13; R = 8;0–18;0; (II) R = 8-18y, M = 12.12y, SD = 3.02 | |
| Potvin et al. [ | To assess the test-retest reliability and content validity of the CAPE for HFA sample | N = 61; (I) Without HFA: | Total sample: R = 7-13y, M = NR, SD = NR; (I) R = 7-13y, M = NR, SD = NR; (II) R = 7-13y, M = NR, SD = NR | |
| Ullenhag et al. [ | To translate, adapt, and assess the cross-cultural validity and internal consistency of a Swedish version of the CAPE; Hypothesis testing to compare responses to both the CAPE and Swedish- version CAPE | N = 336; | Total sample: R = 6-17y, M = 12y, SD = 2; (I) R = 6-17y, M = 12y, SD = 2; (II) R = 6-17y, M = 12y, SD = 2; (III) R = 6-8y, M = NR, SD = NR; (IV) R = 9-11y, M = NR, SD = NR; (V) R = 12-15y, M = NR, SD = NR | |
| Nordtorp et al. [ | To examine the test-retest reliability, measurement error, and internal consistency of the Norwegian version of the CAPE | N = 141; (I) TD: | Total sample: R = 7-17y, M = NR, SD = NR; (I) R = 7-14y, M = 11.1y, SD = 2.5; (II) R = 8-17y, M = 14.2y, SD = 2.3 | |
| Manual [ | Internal consistency; Test-retest reliability; Content validity | N = 427; (I) age group 1 (n = 125); (II) age group 2 (n = 176); (III) age group 3 (n = 126); English-speaking children with disabilities. Data also available for family type, family income, number of children living at home, and ethnicity of respondent | Total sample: R = 6-15y, M = NR, SD = NR; (I) R = 6-8y, M = NR, SD = NR; (II) R = 9-11y, M = NR, SD = NR; (III) R = 12-15y, M = NR, SD = NR | |
| Manual [ | Internal consistency; Test-retest reliability; Structural validity; Content validity | N = 427; (I) age group 1 (n = 125); (II) age group 2 (n = 176); (III) age group 3 (n = 126); English-speaking children with disabilities. Data also available for family type, family income, number of children living at home, and ethnicity of respondent | Total sample: R = 6-15y, M = NR, SD = NR; (I) R = 6-8y, M = NR, SD = NR; (II) R = 9-11y, M = NR, SD = NR; (III) R = 12-15y, M = NR, SD = NR | |
| King et al. [ | Hypothesis testing to test group differences in PAC activity type preference scores | N = 427; (I) 6- 8y11m: | Total sample: R = 6-15y, M = NR, SD = NR; (I) R = 6- 8y11m, M = NR, SD = NR; (II) R = 9-11y11m, M = NR, SD = NR; (III) 12-15y, M = NR, SD = NR; (IV) R = 6-15y, M = NR, SD = NR; (V) R = 6-15y, M = NR, SD = NR | |
| Nordtorp et al. [ | To examine the test-retest reliability, measurement error, and internal consistency of the Norwegian version of the PAC | N = 141; (I) TD: | Total sample: R = 7-17y, M = NR, SD = NR; (I) R = 7-14y, M = 11.1y, SD = 2.5; (II) R = 8-17y, M = 14.2y, SD = 2.3 | |
| Potvin et al. [ | To assess the test-retest reliability and content validity of the CAPE for HFA sample | N = 61; (I) Without HFA: | Total sample: R = 7-13y, M = NR, SD = NR; (I) R = 7-13y, M = NR, SD = NR; (II) R = 7-13y, M = NR, SD = NR | |
| Keller et al. 2005 (I) [ | To assess the structural validity of the COSA Hypothesis testing to assess whether the COSA discriminates between groups | N = 62; (I) male: | Total sample: R = 8-17y, M = 11.35y, SD = NR; (I) R = NR, M = NR, SD = NR; (II) R = NR, M = NR, SD = NR; (III) R = NR, M = NR, SD = NR; (IV) R = NR, M = NR, SD = NR | |
| Keller & Kielhofner 2005 (II) [ | To assess the structural validity of a refined version of the COSA Hypothesis testing to assess whether the refined COSA’s properties are psychometrically sound | N = 43; Children who are occupational therapy clients | Total sample: R = 8-17y, M = 12.21y, SD = NR | |
| Kramer et al. 2010 [ | To assess the structural validity of the COSA; Hypothesis testing to test the external validity of the COSA | N = 502; Child clients of occupational therapist and physical therapist researchers and clinicians internationally | Total sample: R = 6-17y 10m, M = 11y 11.7m, SD = 2 y10.4m | |
| Romero-Ayuso & Kramer [ | To assess the internal consistency and cross-cultural validity of the Spanish version of COSA for children with ADHD Hypothesis testing to test whether COSA is an appropriate measure for children with ADHD | N = 30; (I) ADHD diagnosis: | Total sample: R = 7-11y, M = 8.7y, SD = 1.16; (I) R = NR, M = NR, SD = NR; (II) R = NR, M = NR, SD = NR | |
| Asgari & Kramer [ | To assess the structural validity and hypothesis test a second order model of the Persian version of the OSA | N = 336; Students from Tehran junior high schools | Total sample: R = 11-16y, M = 13, SD = 1.2 | |
| Taylor et al. [ | Hypothesis testing to test the validity of OSA for Adolescents recently diagnosed with acute infectious mononucleosis (MONO) discriminant validity of the OSA to measure for non-recovered and recovered adolescents at 12 m follow-up; Reliability testing | N = 296; (I) non-recovered: | Total sample: R = 12-18y, M = NR, SD = NR; (I) R = 12-19y, M = NR, SD = NR; (II) R = 12-19y, M = NR, SD = NR |
Notes. ADHD = Attention deficit hyperactivity disorder; TD = typically developing; R = range; M = Mean; SD = standard deviation; NR = not reported; OT = occupational therapy.