| Literature DB >> 33888077 |
Jessica Sheringham1, Isla Kuhn2, Jenni Burt2.
Abstract
BACKGROUND: Identifying how unwarranted variations in healthcare delivery arise is challenging. Experimental vignette studies can help, by isolating and manipulating potential drivers of differences in care. There is a lack of methodological and practical guidance on how to design and conduct these studies robustly. The aim of this study was to locate, methodologically assess, and synthesise the contribution of experimental vignette studies to the identification of drivers of unwarranted variations in healthcare delivery.Entities:
Keywords: Healthcare variation; Inequalities; Vignettes
Mesh:
Year: 2021 PMID: 33888077 PMCID: PMC8061048 DOI: 10.1186/s12874-021-01247-4
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Methodological framework for assessment of experimental vignette studies
| A. Vignette design | |
| 1. Credibility | • The degree to which vignettes credibly represent critical aspects of a clinical scenario or patient to potential participants is crucial to the success of an experimental vignette study [ • Lens model approaches (studies which compare optimal versus actual decisions in a given situation, originally developed by Brunswick in 1950) have demonstrated empirically that the decision-making performance of participants is improved when situations are realistic [ • Basing vignettes on real-life data, clinical expertise, and existing guidelines are recommended ways of enhancing credibility [ |
| 2. Number | • Presenting participants with more than one vignette enables examination of variations in judgement within individuals as well as between them – that is, the extent to which each participant is differentially influenced by each experimental factor in making their decisions. Where this is required to address study aims, for example, in vignette approaches based on the lens model [ • Additional considerations are needed when several vignettes are used,, such as controlling for the order in which vignettes are presented and taking account of clustering within individuals in the analysis (see: wider study design). |
| 3. Variability | • Developing or using a number of different representations of each experimental factor may increase study generalisability, by reducing the possibility that idiosyncrasies in one particular representation are responsible for findings. For example, using one female and one male actor in video vignettes may lead not to participants responding to the constructs of gender, but to • Where participants do view more than one vignette, analysis must account for clustering of vignettes by respondent, to avoid over-estimating the statistical significance of any effect [ |
| 4. Mode | • The mode through which vignettes are delivered has an important influence on the research question an experimental vignette study can answer. • Vignette mode has historically been textual only, with participants presented with a written scenario. Text-based vignettes may constrain not just the information the respondent is given, but how this information is framed. • More recently the use of pictures, videos, actors, and interactive environments have been developed [ • Pictorial modes are particularly suited to examination of characteristics, such as ethnicity, where visual representation removes the need for explicit statement (and prior framing) of the characteristic. • Studies using video vignettes extend this still further by enabling participants to form judgements on body language and speech patterns in addition to visual cues. • Interactive formats, such as unannounced standardised patients or virtual reality set-ups, have the potential to mimic real delivery which enables exploration of how inequalities may unfold |
| 5. Evaluation | • Evaluation of vignettes’ face validity – during vignette construction and once data are collected – is key to understanding the validity of findings in studies using vignettes. • Thinking through in advance what is needed to make particular vignettes ‘successful’ for their target audience will guide the nature of and approach to evaluation. • Options include assessment by an expert panel, feedback from participants, or comparing responses to the vignettes to an additional data source such as clinical data [ |
| 6. Description | • Readers of vignette study papers need to be able to form their own judgments of vignette credibility. An entire vignette should be provided to enable them to do so. |
| B. Wider study design | |
| 1. Concealment | • When investigating unwarranted variations in care, it is important to conceal the purpose of such studies, given that few people will volunteer behaviours or attitudes that they recognise as poor or biased. • If the study’s purpose is not adequately masked it can bias results, even with carefully constructed vignettes [ |
| 2. Realism | • External validity of vignette studies is enhanced when studies are conducted in a setting as close as possible to the natural ecology of decision-making [ • The generalisability of studies to investigate unwarranted variation in healthcare may be improved by collecting data in a setting that mimics key aspects of clinical settings, whether that be the actual environment, other inclusion of features such as the imposition of time constraints. |
| 3. Sampling & response | • The representativeness of any survey rests on sampling, coverage, and nonresponse. • This is particularly important for studies of healthcare variations, where a biased sample or responses – for physician or patient participants – may lead to over- or under-estimation of variations. • Studies need to justify their sample design, sample size, approach to recruitment, response and completion rates, and reasons for excluding data [ • The implications of low or biased responses should be considered. |
| 4. Analysis | • Experimental vignette studies are often complex in how data are structured. Analysis must appropriately account for hierarchies within the data [ |
Assessment of included studies according to methodological framework *
| Study | Vignette design | Wider study design | Score | Rating | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Credibility | Variability | Evaluation | Description | Concealment | Realism | Sampling & response | Analysis | |||||
| Lutfey 2009 & 2010 [ | 3 | 1 | 3 | 1 | 3 | 1 | 5 | 1 | 18 | |||
| Samuelsson 2014 [ | 3 | 1 | 2 | 1 | 2 | 1 | 6 | 1 | 17 | |||
| Adams 2014 [ | 2 | 1 | 2 | 1 | 3 | 1 | 5 | 1 | 16 | |||
| Elliott 2016 [ | 3 | 1 | 3 | 0 | 1 | 0 | 5 | 1 | 14 | |||
| Tinkler 2018 [ | 2 | 1 | 1 | 1 | 3 | 1 | 6 | n/a | 15 | |||
| Sheringham 2017 [ | 2 | 0 | 2 | 1 | 2 | 1 | 4 | 1 | 13 | |||
| Burt 2016 [ | 3 | 1 | 3 | 0 | 1 | 0 | 3 | 1 | 12 | |||
| Fischer 2017 [ | 3 | 1 | 2 | 0 | 2 | 1 | 2 | n/a | 11 | |||
| Hirsh 2009 [ | 3 | 1 | 2 | 1 | 0 | 0 | 2 | 2 | 11 | |||
| Burgess 2014 [ | 2 | 0 | 0 | 0 | 1 | 1 | 6 | n/a | 10 | |||
| Green 2007 [ | 2 | 0 | 2 | 0 | 0 | 1 | 4 | n/a | 9 | |||
| Daugherty 2017 [ | 2 | 0 | 2 | 1 | 0 | 0 | 4 | n/a | 9 | |||
| Wiltshire 2018 [ | 2 | 1 | 0 | 1 | 0 | 0 | 3 | 1 | 8 | |||
| McKinlay 2012 [ | 2 | 0 | 2 | 0 | 0 | 0 | 4 | n/a | 8 | |||
| Begeer 2008 [ | 1 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 6 | |||
| Bories 2018 [ | 1 | 0 | 1 | 1 | 0 | 0 | 2 | 1 | 6 | |||
| Shapiro 2018 [ | 1 | 0 | 0 | 1 | 0 | 0 | 4 | n/a | 6 | |||
| Johnson-Jennings 2018 [ | 2 | 0 | 1 | 1 | 0 | 0 | 0 | n/a | 4 | |||
| Bernardes 2013 [ | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 4 | |||
| Papaleontiou 2017 [ | 1 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 3 | |||
| Gao 2019 [ | 0 | 0 | 0 | 1 | 0 | 0 | 0 | n/a | 1 | |||
* Scoring system (more detail in supplementary file 2): credibility 0-3 (3= construction well described, 2= described to some extent 0/1 = little or no description); (no score); variability 0-1 (1= more than one variant of an experimental factor produced, 0= no); (no score); evaluation 0-3 (3= well described, 2= described to some extent 0/1 = little or no description); description 0-1 (1= full vignette available to view, as much as is practically possible, 0= no); concealment 0-3 (3= concealment strategies clearly described or analysis considered effects of awareness, 2= described to some extent, 1 = purpose was not shared but no description of how concealment attempted 0 = no/not stated); realism 0-1 (1= attempt to introduce realism into data collection conditions, 0- no); sampling & response 0-6 (NB: each score is doubled to account for both sampling and response: 3= random sampling, response & completion rate high, justified exclusions; 2= sample strategy described & justified (purposive or random); response or completion rates fully reported and risk of bias considered; 1= sampling strategy inadequately or not described, inadequate consideration of bias; 0= response rates not given & inadequate consideration of bias); analysis 0-2 (2= accounted for clustering & individual/aggregated analysis performed 1= accounted for clustering OR individual/aggregated analysis performed 0= neither n/a = only one vignette shown to participants)
Fig. 1PRISMA Flow diagram
Included studies – descriptive characteristics and main findings
| Study | Research question | Geographical setting | Healthcare setting | Participants | Process or decision | Patient characteristics | Possible drivers of variation | Condition | Findings |
|---|---|---|---|---|---|---|---|---|---|
| Adams et al, 2014 [ | Identification of mechanisms driving differential diagnoses and disparities that are common to black and white people in both countries; examination ofbetween-country variations due to cultural and health care system differences | UK and USA | Primary care | Physician | Diagnosis, referral, prescription | Ethnicity (black, white) | Information processing, patient cues, knowledge used, healthcare system. | Depression | There was little bias in doctors' decisions overall. UK doctors had greater clinical uncertainty in diagnosing depression amongst black than white patients . Doctors focused more on black patients' physical than psychological symptoms and more often tended to identify endocrine problems. |
| Begeer et al, 2008 [ | Whether ethnic background influences the likelihood of pediatricians’ references to Autism when using clinical judgments versus ratings of explicit diagnostic categories | Netherlands | Child health setting | Physician | Diagnosis | Ethnicity (Dutch vs Moroccan or Turkish) | Spontaneous vs prompted likelihood; physician characteristics | Autism | Spontaneous clinical judgements resulted in ethnic bias; this bias disappeared when doctors were prompted to consider autism. |
| Bernardes et al, 2013 [ | Whether physician sex moderates the effects of patient (distressed) pain behaviours and diagnostic evidence of pathology on treatment prescriptions and referrals; explore the mediating role of pain credibility judgments and psychological attributions on these effects | Portugal | Primary care | Physician | Referral, prescription, assessment | Gender | Physician sex, clinical cues (evidence of pathology, distress) | Chronic lower back pain | Confirming the hypothesis, physician sex moderates the influence of clinical cues on pain management practices: evidence of pathology had a larger effect on male than on female physicians’ referrals to psychology/psychiatry. |
| Bories et al, 2018 [ | To test the hypothesis that physician uncertainty aversion impacts medical decision making for older patients with acute myeloid leukaemia | France | Acute | Physician | Prescription | Age (note clinically relevant) | Physician demographic, occupational, behavioural characteristics | Acute myeloid leukaemia | Physician attitudes to risk influenced chemotherapy decisions for older patients. Physicians opting for intensive chemotherapy (IC) had higher aversion to uncertainty and treated fewer patients annually, than the low IC group but were similar in age, hierarchical status or years of experience. |
| Burgess et al, 2014 [ | To test the hypothesis that racial biases in opioid prescribing would be more likely under high levels of cognitive load | USA | Primary care | Physician | Prescription | Ethnicity (black, white) | Physician cognitive load | Chronic low back pain | Hypotheses were partially confirmed. Cognitive load altered ethnic inequalities in prescribing patterns in different ways for male and female physicians. Under high cognitive load, male physicians were more likely to prescribe opioids for White patients; while under low cognitive load, they were more likely to prescribe opioids for Black patients. Female physicians’ bias toward prescribing opioids to Black patients was stronger under greater cognitive load. |
| Burt et al, 2016 [ | To examine whether South Asian people rate GP consultations similarly to White British people, in order to understand why minority ethnic groups often give poorer evaluations of primary care | England | Primary care | Public | Consultation style | Ethnicity (South Asian, white) | Patients' ratings of quality | Persistent cough, perforated ear drum, painful elbow generalised numbness | Respondents from a Pakistani background rated communication in simulated GP consultations significantly more positively than their White British counterparts (contrary to the hypothesis that South Asians’ poorer evaluations of primary care experience is due to higher expectations of care). |
| Daugherty et al, 2017 [ | To test the hypotheses that physician gender bias would have little effect on treatment decisions for the male patient and would result in lower use of cardiovascular tests among gender-biased physicians for female patients | USA | Physician | Diagnosis | Gender | Implicit bias | Coronary artery disease | Hypotheses were partially confirmed; cardiologists who associated risk taking more with men than with women were more likely to view angiography as useful to diagnose male versus female patients but equally likely to recommend stress testing. Physicians were less certain of diagnosis in women than men. | |
| Elliott et al, 2016 [ | To test whether hospital-based physicians use different verbal and/or nonverbal communication with black and white simulated patients and their surrogates. | USA | Acute | Physician | Consultation style | Ethnicity (black, white) | Verbal and non-verbal communication between patient & physician | Metastatic gastric and pancreatic cancer | Physicians used similar verbal but different nonverbal communication behaviours with black and white patients. |
| Fischer et al, 2017 [ | To test whether patient requests for specific opioid pain medication would lead physicians to classify them as drug-seeking and change management decisions | USA | Primary care | Physician | Prescription | Ethnicity (black, white) | Patient (drug seeking) behaviour | Pain (sciatica) | Physician suspicion of drug-seeking behaviour was much higher when patients requested opioid medication. Physician suspicion of drug-seeking behaviour did not vary by patient characteristics, including gender and race. |
| Gao et al, 2019 [ | To test whether Chinese favour family-centred decision making while European Americans favour shared decision making in depression care | USA | Other - mental health | Public | Mode of decision making - hospital or community care | Race, nationality | Acculturation, preferences for care | Depression | Hypotheses were confirmed; Chinese preferred family-centred decision making while Americans preferred shared decision making. Chinese living in America paralleled European Americans. |
| Green et al, 2007 [ | To test whether implicit or explicit race biases predict physicians' decisions to give thrombolysis for acute myocardial infarction | USA | Acute and primary care | Physician | Diagnosis, prescription | Ethnicity (black, white) | Physician implicit bias | Acute myocardial infarction | Hypothesis was confirmed. As physicians’ pro-white implicit bias increased, so did their likelihood of treating white patients and not treating black patients with thrombolysis. |
| Hirsh et al, 2009 [ | To test whether gendered expectations of pain and facial pain expressions influenced pain assessment and treatment disparities in nurses | USA | Acute | Nurse | Prescription | Age, sex, race | Gender role expectations of pain (sensitivity, endurance, willingness to report), high/low pain facial expression | Pain appendectomy | Hypotheses were partially confirmed; nurses’ gender role expectations of pain didn’t influence decisions but pain expression did. Nurses generally rated female, African American, older patients’ pain higher and were more ready to prescribe opioids. |
| Johnson-Jennings et al, 2018 [ | To test whether patient-provider racial concordance and patient ethnic salience is associated with 1) provider pain assessment 2) attitudes toward referral for traditional healing practices for indigenous patients | USA | Primary care | Other clinical professional | 1) Prescription 2) referral | Ethnicity (Indigenous American - high/low ethnic salience) | Racial concordance (patient & physician) | Chronic lower back pain | 1) Indigenous providers rated patient with higher Indigenous ethnic salience more congruently with the self-reported pain ratings 2) Provider–patient racial concordance increased likelihood of consulting with and referring patients to traditional healing practices. |
| Lutfey et al, 2009 & 2010 [ | 1) Whether physician certainty is associated with decision making. Explore variations, by health care system, patient characteristics 2) whether observed disparities in CHD decision making are influenced by priming physicians to consider CHD. | USA, Germany and England | Primary care | Physician | Diagnosis, referral, prescription, lifestyle recommendations | Age, gender, ethnicity (black, white), SES | Diagnostic certainty, healthcare system, physician priming | CHD | 1) Certainty was positively correlated with test ordering, prescriptions and specialist referrals. Physicians were least certain of CHD diagnoses when patients were younger and female. 2) Primed physicians were more likely to order CHD-related tests and prescriptions than unprimed but main effects for patient, gender and age remained. |
| Mckinlay et al, 2012 [ | Whether physicians’ decisions to diagnose diabetes vary by race/ethnicity (after controlling for SES, age, and gender). | USA | Primary care | Physician | Diagnosis | Age, gender, ethnicity (black, Hispanic, white), SES | Effects of SES on ethnicity | Diabetes | Primary care physicians’ vignette diagnosis was patterned by race/ethnicity (rather than by SES). [Undiagnosed signs of T2DM in the community was patterned by SES rather than race/ethnicity.] |
| Papaleontiou, et al 2017 [ | Understanding why older thyroid cancer patients are not being referred to high-volume surgeons. | USA | Primary care | Physician | Referral | Age | Physician training, patient volume, discipline & patient preferences | Cancer | Endocrinologists and physicians treating more than 10 thyroid cancer patients per year were more likely to refer older thyroid cancer patients than primary care physicians. Patient preference, transportation barriers and confidence in local surgeon were commonly reported reasons to decrease likelihood of referral. |
| Samuelsson et al, 2014 [ | Disentangle a number of determinants on addiction care practitioners' perceptions of the severity of alcohol and drug consumption in clients. | Sweden | Addiction | Other | Referral (eligibility for services), perceptions of severity | Age, gender, ethnicity, SES, family circumstances | % variance due to vignette, professional and work unit | Substance use | Practitioners of different professional backgrounds and workplaces judge alcohol and drug consumption by different norms, and this was also influenced by characteristics of the users. |
| Shapiro, et al. 2018 [ | Whether neonatologists show implicit racial and/or socioeconomic biases and whether these are predictive of recommendations at extreme periviability | USA | Acute | Physician | Care: comfort vs intensive (e.g. resuscitation) | Race, SES | Implicit bias | Periviability | Hypotheses were in part confirmed. Physicians with implicit socioeconomic bias were more likely to recommend comfort care to high than low SES vignettes but did not appear influenced by implicit racial bias. |
| Sheringham et al, 2017 [ | How patients' clinical and sociodemographic characteristics influence GPs’ decisions to initiate lung cancer investigations | England | Primary care | Physician | Diagnosis | Age, gender, ethnicity (black, South Asian, white), SES | Information elicited, physician attributes | Respiratory symptoms | The information GPs elicited from patient vignettes influenced their decisions but did not explain observed ethnic inequalities in cancer investigations |
| Tinkler et al, 2018 [ | Whether appointment offers to new US primary care patients who mention concerns about smoking or weight differ from offers to patients with no health concerns (healthy patients) | USA | Primary care | Other | Appointment offer | Insurance status, race/ethnicity, and gender | Health concerns (smoking/weight concerns vs healthy); state-level Medicaid expansion status | Disease prevention | Patients with smoking concerns were no more likely to be offered new patient appointments than healthy patients and less likely than those with weight concerns. Insurance status influenced access. |
| Wiltshire,et al. 2018 [ | Whether concordance leads to higher ratings of trust in physicians amongst African American women race, gender and age | USA | Primary care | Public | Trust | Race - age, gender | Concordance | Breast exam | Older African-American women did not rate race, gender or age-concordant doctors higher on trust; instead they rated white, older male higher on competence than African-American older females. |
Key: SES socioeconomic status