| Literature DB >> 33289063 |
Tessa Peasgood1,2, Clara Mukuria3, Jill Carlton3, Janice Connell3, John Brazier3.
Abstract
Preference-based measures allow patients to report their level of health, and the responses are then scored using preference weights from a representative general population sample for use in cost utility analysis. The development process of new preference-based measures should ensure that valid items are selected to reflect the constructs of interest included in the measure and that are suitable for use in preference-elicitation exercises. Existing criteria on patient-reported outcome measures (PROMs) development were reviewed, and additional considerations were taken into account in order to generate criteria to support development of new preference-based measures. Criteria covering 22 different aspects related to item selection for preference-based measures are presented. These include criteria related to how items are phrased to ensure accurate completion, the coverage of items in terms of range of domains as well as focus on current outcomes and whether items are suitable for valuation. The criteria are aimed at supporting the development of new preference-based measures with discussion to ensure that even where there is conflict between criteria, issues have been considered at the item selection stage. This would minimize problems at valuation stage by harmonizing established criteria and expanding lists to reflect the unique characteristics of preference-based measures.Entities:
Keywords: Item selection; PROMs; Preference-based measures; Quality of life; Question selection; Utility
Mesh:
Year: 2020 PMID: 33289063 PMCID: PMC8068712 DOI: 10.1007/s11136-020-02718-9
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 4.147
Fig. 1Sample of a Health State from OAB-5D classification system [9]
Criteria for item selection
| No | Criteria | Explanation |
|---|---|---|
| 1 | Reading level should be appropriate | The item should be easy to read and understand |
| 2 | Avoid questions that are very long | Items should be as short as possible whilst maintaining comprehensibility |
| 3 | Avoid double negative | Avoid phrases in which a negated construct (e.g. no control, not coping) requires a negative answer (e.g. none of the time) |
| 4 | Avoid ambiguity | Avoid questions which have a potentially ambiguous interpretation |
| 5 | Avoid jargon | The vocabulary throughout (including any labelling linked to items) should not be technical |
| 6 | Avoid terms that are colloquial | Excessively colloquial language may not be understood by all respondents and may be hard to translate to other languages |
| 7 | Avoid asking a combination of two or more questions within one item | Where items contain two or more questions at the same time (e.g. anxiety or depression), it is not clear which the respondent is answering. Multiple terms tapping into the same construct may sometimes be required to improve comprehension |
| 8 | Avoid excessively personal questions | Excessively personal or intrusive items may lead to missing values or annoy responders |
| 9. | Avoid ethically inappropriate questions | Consideration should be given to the appropriateness of asking items for potentially vulnerable sub-groups. Questions which might leave people in a worse frame of mind after completion should only be used where no good alternative is available |
| 10 | Avoid questions that are not relevant to all responders | Avoid items that refer to circumstances, situation or lifestyle that may not be universal across all responders |
| 11 | Avoid items that draw on knowledge beyond the individual’s experience | Items that relate to another piece of knowledge, such as what other people think, may be difficult to complete if the responder is not confident in that knowledge |
| 12 | Avoid value laden terms | Items which are value laden may lead people to believe there is a right or wrong way of answering the question |
| 13 | Avoid items that are too extreme or too mild | Items that only tap onto the severe or the mild end of a domain would need to be supplemented by other items on that sub-domain |
| 14 | Avoid items that suffer from Differential Item Functioning (DIF) | DIF identifies sub-groups of people who, despite having the same underlying level of an attribute (or latent construct), answer an item differently |
| 15 | Avoid items that make comparisons to other people | Items that make comparisons to other people depend upon whom the individual chooses to use for a comparison |
| 16 | Avoid items that make comparisons to expectations | Items which make comparisons to a person’s expectations or personal norms are problematic due to lack of inter-personal comparability of responses |
| 17 | Avoid items that make comparisons over time | Items which ask the respondent to make a comparison to another time period or to ‘usual’ are not suitable as they depend upon what the past or ‘usual’ is like for the individual |
| 18 | Avoid items that do not lend themselves to the specified time period | Items need to clearly tap into the current situation as restricted by the time period given within the questionnaire |
| 19 | Avoid items with response options that do not have clear order | Items, and underlying constructs, should have clear order, such that a better response is judged as better quality of life |
| 20 | In preference-elicitation exercises, respondents do trade-off between improvement in the item vs. improvement in another domain | All items included within a classified for valuation should be domains of life that are important |
| 21 | The item does not attribute | Avoid items where a decrement or problem is attributed to a particular circumstance |
| 22 | Response options are in the same direction | Within the description of the state for valuation, responders to valuation exercises are likely to get confused if ‘often’ feeling X is sometimes a good thing and sometimes a bad thing |