| Literature DB >> 32788186 |
Sonia Lorente1,2, Carme Viladrich1, Jaume Vives3,4, Josep-Maria Losilla1,4.
Abstract
OBJECTIVE: This meta-review aims to discuss the methodological, research and practical applications of tools that assess the measurement properties of instruments evaluating health-related quality of life (HRQoL) that have been reported in systematic reviews.Entities:
Keywords: qualitative research; quality in health care; statistics & research methods
Mesh:
Year: 2020 PMID: 32788186 PMCID: PMC7422655 DOI: 10.1136/bmjopen-2019-036038
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1PRISMA flow chart. Flow diagram for search results (from Moher et al9). COSMIN, Consensus-based Standards for the selection of health Measurement Instruments; HRQOL, health-related quality of life; PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.
Reporting and methodological quality of studies
| 2008–2020 | 2014–2020 | |||
| N | % | N | % | |
| Protocol registered prospectively | ||||
Yes, PROSPERO | 27 | 10.9 | 26 | 20.5 |
No registered | 219 | 89.1 | 100 | 79.3 |
| Standards of systematic review reporting and/or quality assessment | ||||
Yes (AMSTAR, PRISMA, QUOROM…) | 78 | 31.7 | 64 | 50.8 |
No | 168 | 68.3 | 62 | 49.2 |
| Standards to assess reporting and/or quality assessment of primary studies | ||||
Yes (QUADAS, STARD…) | 42 | 17.0 | 30 | 23.8 |
No | 204 | 83.0 | 96 | 76.2 |
| No of databases searched | ||||
1–3 | 96 | 39.1 | 50 | 39.6 |
4–6 | 107 | 43.4 | 61 | 48.4 |
7–9 | 22 | 8.9 | 8 | 6.3 |
≥10 | 18 | 7.3 | 6 | 4.7 |
Not reported | 3 | 1.2 | 1 | 0.8 |
| Other sources | ||||
Official websites/internet | 25 | 10.1 | 7 | 5.5 |
Virtual libraries | 24 | 9.7 | 12 | 9.4 |
Google/google scholar | 25 | 10.1 | 14 | 11.0 |
Scientific journals/thesis | 6 | 2.4 | 2 | 1.6 |
| Search strategy | ||||
| Terms, databases, time period | ||||
Yes | 238 | 96.7 | 123 | 97.6 |
No | 8 | 3.3 | 3 | 2.4 |
| Search syntax | ||||
Detailed syntax reported (Truncations, Booleans…) | 115 | 46.7 | 79 | 62.7 |
Syntax not reported or not detailed enough to be replicable | 125 | 50.8 | 46 | 36.5 |
Supplementary file under request (not available) | 5 | 2.1 | 1 | 0.8 |
| Inclusion/exclusion selection criteria | ||||
Reported and well-defined | 229 | 93.1 | 122 | 96.8 |
Not reported or not clearly stated | 17 | 6.9 | 4 | 3.2 |
| Article selection | ||||
By two or more independent reviewers | 134 | 54.4 | 87 | 69.0 |
Not reported or not clearly stated | 112 | 45.6 | 39 | 31.0 |
| Flow chart | ||||
Yes | 166 | 67.5 | 108 | 85.7 |
No | 80 | 32.5 | 18 | 14.1 |
| Funding | ||||
Reported | 132 | 53.7 | 69 | 54.8 |
Not reported or not clearly stated | 114 | 46.3 | 57 | 45.2 |
| Total | 246 | 100 | 126 | 100 |
%, percentage; AMSTAR, assessment of multiple systematic reviews; n, frequency; PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses; PROSPERO, Prospective Register of Systematic Reviews; QUADAS, Quality Assessment of Diagnostic Accuracy Studies; QUOROM, quality of reporting of meta-analysis; STARD, Standards for the Reporting of Diagnostic Accuracy Studies.
Tools to assess measurement properties. characteristics and comparison to testing standards
| Tools | Cosmin | Terwee’s criteria | Attributes and criteria | Economic evaluation | Guidance for industry | Fitzpatrick’s criteria | ICF ICFCY | EMPRO | SCI criteria | Andresen’s tool | Canchild outcomes | Omeract | Testing standards |
| Development | Delphi | Author criteria | Expert panel | Literature | Consensus | Literature | Expert panel | Expert panel literature | Literature | Expert panel | Expert panel Delphi | Consensus | |
| Sponsor/s | COSMIN initiative | Author | SACMOT working group | Standing group of health technology | FDA staff | Standing group of health technology | WHO member states | IRYSS committee | SCIRE working group | Author | CanChild centre staff | OMERACT initiative | AERA, APA, NCME |
| Approval updates | 2010, 2018 | 2007 | 1996, 2002, 2013 | 1999, 2017 | 2006, 2009 | 1998 | 2001, 2019* | 2008 | 2008, 2016 | 2000 | 1987†, 2004 | 1992, 1998,2007,2014, 2019 | 1954, 1966, 1974, 1985, 1999, 2014 |
| Items (scoring) | 5–18 items/box (+/−/?) | 8–9 items total (+/−/?) | Not item structured (no scoring) | Not item structured (no scoring) | Not item structured (no scoring) | Not item structured (no scoring) | Not item structured (no scoring) | 39 items(strongly agree, agree, disagree, strongly disagree) | 3–5 items/box (++++/+++/++/+) | Eleven items total (A, B, C) | 2–6 items/box (excellent, adequate, poor) | 2–5 items/box (Green, amber, red, white) | Not item structured (no scoring) |
| Measurement properties | Content construct (Int. structure cross-cultural hypotheses test) | Content construct (Hypotheses test) | Conceptual and measurement model | Descriptive (Content Face Construct) | Conceptual model | Use Content/face construct (convergent, discriminant, int. structure) | Content | Conceptual and measurement model | Content criterion (concurrent predictive ‘discriminant’) | Conceptual and measurement model | Use scale construction | Content, face construct (Convergent, divergent) | Content response process Int. structure (Dimensions, DIF) |
| | Int. consistency measurement error (Test retest, agreement) | Int. consistency reproducibility (Agreement, relative measurement error) | Int. consistency reproducibility (Test retest, inter-rater) | Test retest Inter-rater | Test retest Inter-rater Int. consistency | Int. consistency reproducibility (Test retest) | Int. consistency reproducibility (Test retest, inter-rater) | Int. consistency test retest | Int. consistency test retest | Int. consistency intra/inter-rater test retest | Reproducibility test retest | Int. consistency test retest alternate forms scorers and decision consistency/accuracy | |
| | Equivalence of accommodations | ||||||||||||
| Other characteristics | Norms | Norms, standard values | Norms standardisation | Scales, norms, Score comparability | |||||||||
| Interpretability | Interpretability | Interpretability | Interpretability | Interpretability | Interpretability | Test development and revision | |||||||
| Burden | Burden | Acceptability (Burden) | Burden | Burden | Burden | ||||||||
| Administration accessible forms | Administration accessible forms | Administration | Administration accessible forms | Administration accessible forms | |||||||||
| Feasibility | Cultural adaptations | Practicality | Feasibility cultural adaptations | Cultural adaptations | Applicability cultural adaptations | Cultural adaptations | Clinical utility (Feasibility) | Feasibility | |||||
| Frequency of use (%) | 61 (30.4) | 45 (22.4) | 33 (16.4) | 17 (8.4) | 14 (6.9) | 14 (6.9) | 7 (3.4) | 4 (2.0) | 2 (1.0) | 2 (1.0) | 1 (0.5) | 1 (0.5) | 0 |
*Updated version at website.
†Reference at 2004.
AERA, american educational research association; APA, American Psychological Association; COSMIN, Consensus-based Standards for the selection of health Measurement Instruments; DIF, differential item functioning; EMPRO, Evaluating Measures of Patient Reported Outcomes; FDA, Food and Drug Administration; ICF, international classification of functioning; ICFCY, international classification of functioning for children and youth; IRYSS, Investigation Network for Health and Health Service Outcomes Research; NCME, National Council on Measurement in Education; OMERACT, Outcomes Measures in Rheumatology Clinical Trials; SACMOT, Scientific Advisory Committee Medical Outcomes Trust; SCI, spinal cord injury; SCIRE, Spinal Cord Injury Rehabilitation Evidence.;
Intended use of instruments and their association to measurement properties
| Intended use of instruments identified across the systematic reviews | Frequency | % (over 246) |
| Evaluative (Change scores pre and poststudies. Effectiveness of an intervention) | 178 | 72.3 |
| Impact of disease on HRQoL (disease symptoms, burden…) | 138 | 55.1 |
| Analytic (health policies. Cost-effectiveness. Funding) | 35 | 14.2 |
| Diagnostic (Distinguish between groups, levels of severity…) | 16 | 6.5 |
| Descriptive (Health measures in surveys. Needs of groups of people) | 4 | 1.6 |
| Predictive (Anticipation of future health status. Risk factors. Risk profiles) | 2 | 0.8 |
| Intended use is no reported or no clearly stated | 6 | 2.4 |
| Yes, reviewers made specific conclusions | 68 | 27.6 |
| No, reviewers made general conclusions | 178 | 72.4 |
| Evaluative | ||
| Responsiveness/Conceptual and Measurement Model/Content validity/Reliability (internal consistency, test retest)/Respondent Burden/Convergent validity/Cross cultural validity | 41 | 60.3 |
| Impact | ||
| Conceptual and Measurement Model/Content validity | 29 | 42.6 |
| Analytic | ||
| Preference-based valuation/agreement | 11 | 16.2 |
| Diagnostic | ||
| Known groups validity/test–retest | 7 | 10.3 |
| Predictive | ||
| Sensivity and specificity | 1 | 1.5 |
(%), percentage.
HRQoL, health-related quality of life.