| Literature DB >> 29550964 |
C B Terwee1, C A C Prinsen2, A Chiarotto2, M J Westerman3, D L Patrick4, J Alonso5,6, L M Bouter2,7, H C W de Vet2, L B Mokkink2.
Abstract
BACKGROUND: Content validity is the most important measurement property of a patient-reported outcome measure (PROM) and the most challenging to assess. Our aims were to: (1) develop standards for evaluating the quality of PROM development; (2) update the original COSMIN standards for assessing the quality of content validity studies of PROMs; (3) develop criteria for what constitutes good content validity of PROMs, and (4) develop a rating system for summarizing the evidence on a PROM's content validity and grading the quality of the evidence in systematic reviews of PROMs.Entities:
Keywords: COSMIN; Content validity; Patient outcome assessment; Patient-reported outcome; Systematic review; Validation studies
Mesh:
Year: 2018 PMID: 29550964 PMCID: PMC5891557 DOI: 10.1007/s11136-018-1829-0
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 4.147
Fig. 1Design of the Delphi study
Characteristics of the respondents to each round of the Delphi study
| Round 1 | Round 2 | Round 3 | Round 4 | |
|---|---|---|---|---|
| Number of participants | 158 | 122 | 84 | 69 |
| Country/region ( | ||||
| US | 24 | 21 | 18 | 13 |
| Canada | 15 | 7 | 6 | 6 |
| UK | 30 | 20 | 14 | 11 |
| Netherlands | 25 | 18 | 8 | 6 |
| Europe other (11 countries) | 37 | 32 | 24 | 21 |
| Australia/New Zealand | 16 | 12 | 9 | 8 |
| Asia | 3 | 1 | 1 | 1 |
| Middle East | 1 | 1 | 1 | 1 |
| South America | 1 | 1 | 1 | 1 |
| Unknown | 6 | 9 | 2 | 1 |
| Professional background ( | ||||
| Allied health care professional | 69 | 38 | 32 | 29 |
| Medical doctor | 19 | 9 | 4 | 4 |
| Clinimetrician/psychometrician | 33 | 19 | 15 | 12 |
| Epidemiologist | 30 | 19 | 13 | 12 |
| Statistician | 6 | 2 | 2 | 2 |
| Other | 54 | 27 | 23 | 19 |
| Unknown | 36 | 16 | 9 | |
| Current professional activity ( | ||||
| Clinician | 35 | 16 | 12 | 10 |
| Researcher | 146 | 81 | 64 | 57 |
| Journal editor | 8 | 6 | 4 | 3 |
| Other | 27 | 14 | 12 | 12 |
| Unknown | 36 | 16 | 9 | |
| Experience in qualitative research | ||||
| A lot-some/a little-none (%) | 65/35 | 67/33 | 71/29 | 70/30 |
| Unknown ( | 36 | 16 | 9 | |
| Experience in development of PROMs | ||||
| A lot-some/a little-none (%) | 58/42 | 60/40 | 66/34 | 61/39 |
| Unknown ( | 37 | 17 | 10 | |
| Experience in evaluation of measurement properties of PROMs | ||||
| A lot-some/a little-none (%) | 85/15 | 92/8 | 90/10 | 90/10 |
| Unknown ( | 37 | 17 | 10 | |
| Experience in evaluation of content validity of PROMs | ||||
| A lot-some/a little-none (%) | 75/25 | 76/24 | 79/21 | 75/25 |
| Unknown ( | 36 | 16 | 9 | |
| Experience in systematic reviews of PROMs | 70/30 | 72/28 | 72/28 | 70/30 |
| A lot-some/a little-none (%) | 36 | 16 | 9 | |
| Unknown ( | ||||
| Ever used the COSMIN checklist | ||||
| Yes/no (%) | 82/18 | 80/20 | 79/21 | 82/18 |
| Unknown ( | 36 | 16 | 9 | |
PROMs patient-reported outcome measures
aMultiple responses allowed
Number of issues on which consensus was reached in relation to the number of issues discussed in each round
| Topic | Round 1 | Round 2 | Round 3 | Round 4 |
|---|---|---|---|---|
| General considerations in the evaluation of content validity of PROMS in systematic reviews of PROMs | 6/8 | 2/2 | NA | NA |
| Standards for evaluating the methodological quality of studies on the development of a PROM (box 1) | ||||
| Standards for evaluating the methodological quality of qualitative research performed to identify relevant items for a new PROM (box 1, part 1) | 14/20 | 30/30a | 3/3 | 2/2 |
| Standards for evaluating the quality of a cognitive interview study performed to evaluate comprehensibility and comprehensiveness of a PROM (box 1, part 2) | 11/14 | 25/25a | NA | NA |
| Standards for evaluating the quality of studies on content validity of PROMs (box 2) | ||||
| Standards for asking patients to rate the relevance, comprehensiveness, and comprehensibility of the items for the population of interest (box 2 part 1) | 8/8 | 9/9 | 1/1 | NA |
| Standards for asking professionals to rate the relevance of the items for the construct of interest (box 2 part 2) | 6/7 | 6/6 | NA | NA |
| Criteria for what constitutes good content validity of PROMs | 20/21 | NA | 7/8 | 1/1 |
| Rating system for rating the content validity of PROMs in a systematic review | NA | NA | 6/6 | 3/3 |
| Total | 65/78 (82%) | 71/71 (100%) | 17/18 (94%) | 6/6 (100%) |
NA Not applicable (not discussed in the round)
aNew issues concerned the 4-point rating scale for the standards
General recommendation on how to perform a systematic review on the content validity of PROMs
| Authors of a systematic review of PROMs should clearly define the scopea of their review. This scope should be the reference point for evaluating content validity of the included PROMs |
| Content validity should be evaluated by at least two reviewers, independently |
| We recommend that the review team includes reviewers with at least some knowledge of the construct of interest; experience with the target population of interest; and some knowledge or experience with PROM development and evaluation, including qualitative research |
| The review team should also consider the content of the PROMs themselves |
See Prinsen et al. for further details [13]
aBy scope we mean the construct, target population, and measurement aim (e.g., evaluation) of interest in the review
COSMIN criteria and rating system for evaluating the content validity of PROMs
Grading the quality of evidence on content validity (modified GRADE approach)
| Study design | Quality of evidence | Lower if |
|---|---|---|
| At least 1 content validity study | High | Risk of bias |
| No content validity studies | Moderate | |
| Low | Inconsistency | |
| Very low | ||
| Indirectness |
The level of evidence indicates how confident we are that the overall ratings are trustworthy. The starting point is the assumption that the evidence is of high quality. The quality of evidence is subsequently downgraded with one or two levels per factor to moderate, low, or very low when there is risk of bias (low study quality), (unexplained) inconsistency in results, or indirect results
Fig. 2Supplementary flow chart for grading the quality of evidence