| Literature DB >> 30191453 |
Marijke Hopman-Rock1, Helmi van Hirtum2,3, Paul de Vreede4,5, Ellen Freiberger6.
Abstract
BACKGROUND: Activities of daily living (ADL) are often used as predictors of health and function in older persons. This systematic review is part of a series initiated by the European Network for Action on Ageing and Physical Activity (EUNAAPA). AIM: To assess psychometric properties of ADL instruments for use in older populations.Entities:
Keywords: Activities of daily living; Aging; Assessment; Community dwelling; Function; Health status
Mesh:
Year: 2018 PMID: 30191453 PMCID: PMC6589141 DOI: 10.1007/s40520-018-1034-6
Source DB: PubMed Journal: Aging Clin Exp Res ISSN: 1594-0667 Impact factor: 3.636
Quality criteria for properties of ADL instruments (Table source: Terwee et al. [9])
| Property | Definition | Quality criteriaa,b |
|---|---|---|
| Content validity | The extent to which the domain of interest is comprehensively sampled by the items in the instruments | +: Positive rating −: Poor rating ?: Moderate rating |
| Predictive validity | The extent to which the instrument had the ability to predict onset of difficulties in functioning or negative health outcomes over time (e.g., mortality) | +: High scores with regard to methods, design and results (OR or AUC) ?: Doubtful design or method (e.g., small sample size) −: Inappropriate methods or lack of significant results |
| Construct validityc | The ability to discriminate between subgroups e.g., age groups, gender | +: High scores with regard to methods, design and results (clear group definitions and significant results) ?: Doubtful design or method (e.g., small sample size) −: Inappropriate methods or lack of significant results |
| Concurrent validity | Established by simultaneously applying a previously validated tool or test, and comparing the results | +: Comparison to other instrument with significant results ( ?: Doubtful design or method (e.g., small sample size); significant but small results −: Inappropriate methods or lack of significant results |
| Reliabilityd | An indicator of the consistency of a measurement in terms of internal consistency with stability over time (reproducibility) and the degree of which the measurement is free of measurement error (internal consistency) | +: (good) Intraclass Correlation Coefficient (ICC) or Kappa > 0.70 ?: (moderate) CC 0.70 − 0.60 or −: (poor) ICC or Kappa < 0.70, despite adequate design and method |
| Responsiveness | The instrument’s ability to detect important change over time in the concept being measured, and may be defined as the extent to which a method detects minimal clinically relevant change over time | +: A power calculation for sample size presented adequate design and sufficiently described ?: Doubtful design or method (e.g., no hypotheses) |
| Floor- and ceiling effects | The number of respondents who achieved the lowest or highest possible score | +: ≤ 15% of the respondents achieved the highest or lowest possible scores ?: Doubtful design or method −: > 15% of the respondents achieved the highest or lowest possible scores, despite adequate design and methods |
| Overall quality of individual study | The degree to which one can assign qualitative meaning to quantitative scores | +: Clear description of study population, adequate description of instrument, adequate design for evaluating psychometric properties ?: Doubtful description of either study population, or instrument but with reference given or method −: Poor description of study population, OR instrument and no reference given, and poor method |
AUC area under the curve; ICC intraclass correlation coefficient; OR odds ratio
a + = Positive rating; − = poor rating; ? = moderate rating
bDoubtful design or method = lacking of a clear description of the design or methods of the study, sample size smaller than 30 subjects (e.g., subgroup analysis), or any important methodological weakness in the design or execution of the study
cConvergent and discriminant validity are usually both considered subcategories or subtypes of construct validity
dWe added Cronbach’s alpha as a measure of internal consistency: 0.00 to 0.69 = poor; 0.70 to 0.79 = fair ; 0.80 to 0.89 = good ; 0.90 to 0.99 = excellent/strong
Summary of reviewed outcome measure’s properties (scores summarized by first author, best instrument outcomes reported in text)
| Instrument (described in Refs.) | Ordered by number of positively rated (quality) articles | Reliability | Validity | Responsiveness |
|---|---|---|---|---|
| Katz 6 items [11–17,18,19–20] | 10 | +? | ? | ? |
| OARS (Older Americans Resources and Services) ADL scale [21–22,23,24–27] | 7 | − | ? | − |
| Barthel Indexa [28–34] | 7 | +? | ? | ++ |
| Katz 7 items [35–41] | 7 | + | ? | +? |
| Katz 5 items [42–46,47,48] | 7 | + | + | +? |
| SMAFb functional autonomy measurement system [49–54] | 6 | ++ | ++ | ++ |
| Katz unspecified [55–57] | 3 | + | − | − |
| NHIS ADLc National health Interview Survey [58–59, 60] | 3 | +? | + | − |
| FIM (functional independence measure) [61–63] FIM phone | 2 | + (Motor component) | ? | + |
| FIM observation | 1 | + | − | − |
| FIDS (Functional Independence and Difficulty Scale) [64–66] | 3 | + | +? | + |
+ = positive rating; ? = moderate rating or do not know ; − = poor rating
aBarthel Index Phone version failed to measure reliable in moderate and severe disabled patients
bSMAF clinical version is the most valid
cQualitative study (Jobe et al. [60]) revealed problems with interpreting the questions
Fig. 1Flow chart of the selection process