| Methodologically/Method Oriented | A: Discussion of “What is Validity and Reliability” in Qualitative Research:
Winter15: for example, points to. a “realist” approach – an account is valid if it reflects the perspectives of the actors in that situation (p. 7) Creswell & Miller16: stress importance of exploring validity from the lens of participants, for example, via member checking and peer (and cognitive) debriefing Golafshani17: inter alia, argues (p601) that validity, and reliability, in qualitative paradigms are assessable in terms of Credibility, Neutrality or Confirmability, Consistency or Dependability and Applicability or Transferability18 and of the importance of triangulation from multiple perspectives Frost et al19: focus on what are the psychometric properties to generate “sufficient evidence” for the validity and reliability of a PROM; argues for importance of establishing content validity as primary task, and use of qualitative research to do this, in particular, focus group and cognitive interviews; and then use of psychometric validation approaches
|
| Theoretical Discussions of Validity | B. Theoretical/Philosophical DiscussionsZumbo20: what is validity, and particularly construct validity, and its implications for process of validation? Presents a “contextualised and pragmatic explanation” of validity; construct validity ‘‘should provide an explanation for the test scores for the observed variation in test scores’’ (p. 69); validity as “establish (ing) the “why” and “how…” (p. 70) and as “support (ing) inferences … (made) from test scores…” (p. 70); validation is a “higher order integrative process…involving…concept formation….” (p. 69); argues for “multilevel testing and measurement” for a multilevel construct (p. 78). Gadermann et al21: importance of asking, “what are the underlying cognitive processes that result in respondents providing responses to self-report questions” (p. 39) in the way they do; use cognitive interviewing to do this. Hubley & Zumbo22: explore meaning of 'response processes'; argue that response processes should be considered as "mechanisms that underlie what people, do, think or feel...when...responding to an item" (p. 2); research in this area should "become more explanation-based" (p. 8) and explore "the broader context (i.e. purpose of testing, setting, culture)" (p. 8) when the response to an item is completed.
|
| Guides to Best Practice for Measure Development | Wild et al23: exploring best practice for cultural adaptation and translation of a PROM; includes use of persons in new cultural context with experience in qualitative interviewing and/or cognitive interviewing to explore translation and cultural adaptation Brod et al24 present best practice guide in use of qualitative research and exploration of content validity
Luyt25: drawing on Adcock and Collier26 presents a framework for measure development comprising three inter-connected stages: (i) measure development (background concept; developing concept definition; devising indicators); (ii) measure validation and (iii) measure revision. Advocating use of qualitative (in stage 1) and quantitative (for stage 2).
|
| Use and Importance of Qualitative Research in Development of a PROM | Lasch et al27: qualitative research as providing sound and rigorous basis for PROM development; role of theoretical saturation (in coding categories) and triangulation, to explore from multiple perspectives Cheung and Clark28: major role of qualitative research in PROM development and also cultural adaptation
|
| Use of Qualitative Approaches in Constructing a Measure and Generating the Item Pool | Mallinson29: importance of exploring the meaning and interpretation that respondents place on items in a PROM, focusing here on the SF-36, a fixed-choice measure; item interpretation and meanings attached may interact with a range of social and cultural factors affecting the respondent; use of face-to-face (cognitive, debriefing) interviews while respondent completes the measure Viswanathan et al30: explore measurement implications of scale responses, depending on whether the primary concern is maximizing discrimination between scale responses, whilst retaining reliability) and meaningful discrimination from the perspective of the respondent; argues for greater emphasis to be placed on meaningful discrimination from the perspective of the potential respondent, and not measure developer/researcher. Luyt25: explores measure development phase, in particular, the “constellation of meanings and understandings associated with a given concept” (p. 4), using focus groups Cheung and Clark28: significance in ensuring explicit focus on patient perspectives; critical role lies in both item generation and establishing content validity (eg, concept elicitation, cognitive debriefing) Breyer et al31: develop a patient-grounded measure on the symptoms, functions and impacts of urethral stricture disease; use of concept elicitation and cognitive interviews, followed by patients prioritizing items in terms of their bothersome-ness
|
| Use of Qualitative Approaches in Establishing Construct Validity and, In particular, Content Validity of a Patient Reported Outcome Measure (PROM) | Hardesty and Bearden32: explore the use of expert judges in assessing the face and content validity of items Cremenns et al33: present a literature review of health self-report measures for children aged 3–8 years; range of measures found, using formats of Likert scales, graphical (pictorial), facial (cartoon) or visual analog; in 40% of measures children involved in item development (researcher talking with child) and in 47% pilot testing with children, where authors reported on content validity, in 40% of children themselves informed this; argues that measure developers should draw on the child’s perspective from the child, and not just rely on researcher/expert panel
|
| Use of Qualitative Approach to Explore the Validity or Reliability of a Qualitative Measure | Golafshani17: advocates the use of quality criteria drawn from Lincoln & Guba18 – credibility, confirmability, consistency transferability; and exploring from multiple perspectives Cremenns et al34 : explore the development of a generic quality of life measure for school-age (6–9 year old) children; advocate and use "think aloud"/cognitive interviews; develop coding categories to select 30 items for the measure; and strategy used by children to answer items for the measure.; use of two independent raters, leading to exploration of intra- and inter-rater reliability; use of both qualitative measures (measure development) and quantitative (for reliability testing and comparison of strategy use) Gadermann et al21: develop coding and sub-coding categories, guided by research purposes (in the paper, strategies employed by children to respond to measure’s items); present in tree diagram format; illustrate category content; add frequency counts; compare tree diagrams with counts attached (here by strategy categories in item response – absolute, relative, general positive, unclear) Luyt25: suggests the use of multiple (two or more) coders for qualitative data analysis, then exploring of inter- and intra-rater reliability quantitatively
|