| Literature DB >> 26202326 |
Michelle Beattie1, Douglas J Murphy2, Iain Atherton3, William Lauder4.
Abstract
BACKGROUND: Improving and sustaining the quality of hospital care is an international challenge. Patient experience data can be used to target improvement and research. However, the use of patient experience data has been hindered by confusion over multiple instruments (questionnaires) with unknown psychometric testing and utility.Entities:
Mesh:
Year: 2015 PMID: 26202326 PMCID: PMC4511995 DOI: 10.1186/s13643-015-0089-0
Source DB: PubMed Journal: Syst Rev ISSN: 2046-4053
Fig. 1Quality critique procedure
Quality Criteria for Measurement Properties (Terwee et al. 2007) [50]
| Property | Rating | Quality criteria |
|---|---|---|
| Reliability | ||
| Internal consistency | (+) | (Sub)scale unidimensional AND Cronbach’s alpha(s) ≥0.70 |
| ? | Dimensionality not known OR Cronbach’s alpha not determined | |
| (−) | (Sub)scale not unidimensional OR Cronbach’s alpha(s) <0.70 | |
| Measurement error | (+) | MIC > SDC OR MIC outside the LOA |
| ? | MIC not defined | |
| (−) | MIC ≤ SDC OR MIC equals or inside LOA | |
| Reliability | (+) | ICC/weighted Kappa ≥0.70 OR Pearson’s |
| ? | Neither ICC/weighted Kappa, nor Pearson’s | |
| (−) | ICC/weighted Kappa <0.70 OR Pearson’s | |
| Validity | ||
| Content validity | (+) | The target population considers all items in the questionnaire to be relevant AND considers the questionnaire to be complete |
| ? | No target population involvement | |
| (−) | The target population considers all items in the questionnaire to be irrelevant OR considers the questionnaire to be incomplete | |
| Construct validity | ||
| Structural validity | (+) | Factors should explain at least 50 % of the variance |
| ? | Explained variance not mentioned | |
| (−) | Factors explain <50 % of the variance | |
| Hypothesis testing | (+) | Correlation with an instrument measuring the same construct ≥50 % OR atleast 75 % of the results is in accordance with the hypotheses) AND correlation with related constructs is higher than with unrelated constructs |
| ? | Solely correlations determined with unrelated constructs | |
| (−) | Correlation with an instrument measuring the same construct <50 % OR <75 % of the results is in accordance with the hypotheses OR correlation with related constructs is lower than with unrelated constructs |
+ positive, − negative, ? indeterminate, AUC area under the curve, MIC minimal important change, ICC intraclass correlation, SDC smallest detectable change, LOA limits of agreement
Additional aspects of utility scoring criteria
| Excellent (****) | Good (***) | Fair (**) | Poor (*) | |
|---|---|---|---|---|
| Questions for cost efficiency | ||||
| 1. What are the number of observations (patients, raters, times) needed to reach the required level of reliability for the purpose of the instrument? | Only a small sample needed (<30) | A moderate sample size (30–49) | Not explicit but can be assumed or (50–99 assessments needed) | No details given or (≥100 assessments needed) |
| 2. How long does an assessment take to complete | ≤15 min | ≤ 30 min | 30–60 min | >60 min |
| 3. What are the administrative costs of completing the assessment? | Easily embedded within existing resource. Little additional support required | Some administrative resource but no specialist resource required | Large amount of resource required to assess and administer | Significant specialist expertise and administrative time required to assess and administer |
| 4. What is the cost to complete a reliable sample? | Minimal | Moderate | Considerable | Extensive |
| Questions for acceptability | ||||
| 1. Is there evidence of subjects understanding of the instrument/assessment? | Investigations of subjects understanding (i.e. cognitive testing of instruments) | Estimated evidence of subjects understanding (i.e. high number of questions missed) | Subject understanding not explicitly stated but some can be assumed (i.e. student guide to OSCE) | No evidence of subject understanding |
| 2. How many assessments are not completed? | There are low numbers of missing items (<10 %) and adequate response rates ( | There are a high number of missing items ( | There are low numbers of missing items or poor (<10 %) and an inadequate response rate (<40 %) | There are high numbers of missing items ( |
| 3. Has the instrument/assessment been tested in an appropriate context? | Evidence of successful administration/use within an appropriate setting | Tested in vivo and changes recommended would be achievable | Testing in vivo and changes recommended would be difficult or only partially tested in vivo | Testing has only been conducted in vitro/simulation |
| Questions for educational impact | ||||
| 1. There is evidence of the instruments intended purpose being achieved (i.e. if aim is to enable hospital ranking for patient selection, is there evidence that the results are actually influencing patient choice?) | Clear evidence of intended purpose being fulfilled | Explanatory or theoretical link between intended and actual use but no clear evidence | Evidence of theoretical work but relationship between intended and actual purpose poorly or not described | No evidence of intended purpose becoming actual |
| 2. The scoring system is easily translated or available in an easy to use format? | Explicitly stated and easy to calculate | Explicitly stated but not easy to calculate | Scoring only calculated by resource with statistical knowledge | Scoring not explained well enough to calculate |
| 3. The feedback from the results can be readily used for action where necessary? | Feedback is readily available in a format that enables necessary action | Feedback is readily available but not drilled down enough to enable targeted action | Minimal feedback available or delay results in limited impact | No explanation to determine adequacy of feedback. No direct feedback could be readily used without additional expertise |
Fig. 2Modified PRISMA flow diagram
Instrument overview
| Instrument/abbreviation | Associated papers | Country of origin | Domains covered | Conceptual framework | No. of items | Mode of administration | Timing of administration |
|---|---|---|---|---|---|---|---|
| Hospital Consumer Assessment of Healthcare Providers and Systems (HCAHPS) | Sofaer et al. [ | USA | Nurse communication | Reflective | 24 | 48 h—6 weeks of discharge | |
| Keller et al. [ | Doctor communication | Telephone | |||||
| O’Malley [ | Physical comfort | Mail with telephone follow-up | |||||
| Levine et al. [ | Pain control | ||||||
| Giordano et al. [ | Medicine communication | ||||||
| Agency for Healthcare Research and Quality [ | Discharge information | Interactive Voice Recognition (IVR) | |||||
| Centers for Medicare and Medicaid [ | Responsiveness to patient | ||||||
| Quality from the Patients' Perspective (QPP) | Wilde et al. [ | Sweden | Medical-technical competence | Reflective | 68 | Self-completion questionnaire | At discharge |
| Wilde et al. [ | Physical technical conditions | ||||||
| Larsson et al. [ | Personal necessities | ||||||
| Characteristics | |||||||
| Identity-orientated approach | |||||||
| Situation | |||||||
| Participation | |||||||
| Commitment | |||||||
| Socio-cultural atmosphere | |||||||
| Positive treatment of significant others | |||||||
| Quality from the Patients' Perspective Shortened (QPPS) | Larsson et al. [ | Sweden | Medical-technical competence | Reflective | 24 | Self-completion questionnaire | At discharge |
| Physical technical conditions | |||||||
| Identity-orientated approach | |||||||
| Socio-cultural atmosphere | |||||||
| Picker Patient Experience Questionnaire (PPE-15) | Jenkinson et al. [ | England | Information and education | Reflective | 15 | Self-completion postal questionnaire | Within 1 month of discharge |
| Jenkinson et al. [ | Coordination of care | ||||||
| Reeves et al. [ | Physical comfort | ||||||
| Emotional support | |||||||
| Respect for patient preferences | |||||||
| Involvement of family and friends | |||||||
| Continuity and transition | |||||||
| Overall impression | |||||||
| NHS Inpatient Survey (NHSIP) | Boyd [ | England (originated in the USA) | Admission to hospital | Formative | 70 | Postal survey | Between 4 and 5 months of discharge |
| Sizmur and Redding [ | The hospital and ward | ||||||
| Picker Institute Europe [ | Doctors | ||||||
| Decourcy et al. [ | Nurses | ||||||
| Your care and treatment | |||||||
| Operations and procedures | |||||||
| Leaving hospital | |||||||
| Scottish Inpatient Patient Experience Survey (SIPES) | Scottish Government [ | Scotland | Admission to hospital | Formative | 30 | Postal survey, questionnaire (also available online, by telephone and via text phone) | Between 4 and 5 months of discharge |
| The hospital and ward | |||||||
| Scottish Government [ | |||||||
| Care and treatment | |||||||
| Hospital staff | |||||||
| Arrangements for leaving hospital | |||||||
| Care and support services after leaving hospital | |||||||
| Hong Kong Inpatient Experience Questionnaire (HKIEQ) | Hospital Authority [ | Hong Kong | Prompt access | Reflective | 62 | Mixed | 48 h—1 month after discharge |
| Information provision | 92 % interviewed by telephone | ||||||
| Wong et al. [ | |||||||
| Care and involvement in decision-making | |||||||
| Physical and emotional needs | 8 % face-to-face home interviews | ||||||
| Coordination of care | |||||||
| Respect and privacy | |||||||
| Environment and facilities | |||||||
| Handling patient feedback | |||||||
| Overall care of health professionals and quality of care | |||||||
| Patient Experience Questionnaire (PEQ) | Pettersen et al. [ | Norway | Information future complaints | Reflective | 35 | Postal self-completion questionnaire | 6 weeks after discharge |
| Nursing services | |||||||
| Communication | |||||||
| Information examinations | |||||||
| Contact with next-of-kin | |||||||
| Doctor services | |||||||
| Hospital and equipment | |||||||
| Information medication | |||||||
| Organisation | |||||||
| General satisfaction | |||||||
| Norwegian Patient Experience Questionnaire (NORPEQ) | Oltedal [ | Norway | Whether doctors were understandable | Reflective | 8 | Self-completion | Within 7 weeks of discharge |
| Doctors professional skills | |||||||
| Nurses professional skills | |||||||
| Nursing care | |||||||
| Whether doctors and nurses were interested in the patients problems | |||||||
| Information on tests | |||||||
| Patient Experiences with Inpatient Care (I-PAHC) | Webster et al. [ | Ethiopia | Nurse communication | Reflective | 16 | Interviewer-assisted completion | After first day of admission |
| Doctor communication | |||||||
| Physical environment | |||||||
| Pain management | |||||||
| Medication and symptom communication | |||||||
| Patient Perceptions of Quality (PPQ) | Rao et al. [ | India | Medicine availability | Reflective | 16 | Interviewer-assisted completion | Not specified |
| Medical information | |||||||
| Staff behaviour | |||||||
| Doctor behaviour | |||||||
| Hospital infrastructure |
Quality of methods and results of psychometric studies
| Instrument/abbreviation | Associated papers | Measurement property | Result | Quality rating of results | Quality rating of methods |
|---|---|---|---|---|---|
| Hospital Consumer Assessment of Healthcare Providers and Systems (HCAHPS) | Sofaer et al. [ | Content validity | Patients considered other aspects of hospital care which appear to have not been included | Negative | Poor |
| Keller et al. [ | Internal consistency | Cronbach’s alpha 0.70 | Positive | Excellent | |
| Keller et al. [ | Reliability | ICC 0.70 | Positive | Excellent | |
| Keller et al. [ | Structural validity | 7 categorises for 16 items. Factor loadings 0.57–91. Uniqueness of error reported | Indeterminate | Excellent | |
| O’Malley [ | Measurement error | Correlation between same composites different services | Indeterminate | Good | |
| Surgery 0.76 | |||||
| Obstetrics 0.73 | |||||
| Medical 0.85 | |||||
| Quality from the Patients' Perspective (QPP) | Wilde et al. [ | Content validity | 35 patient interviews—development of relevant questionnaire | Positive | Excellent |
| Wilde et al. [ | Internal consistency | Cronbach’s alpha 0.80 | Positive | Excellent | |
| Wilde et al. [ | Content validity | High patient ratings of item clarity and comprehensiveness | Positive | Excellent | |
| Wilde et al. [ | Structural validity | Factor solutions | Positive | Good | |
| Medical/technical competence 50.4 % | |||||
| Physical/technical conditions 44.8 % | |||||
| Identity-orientated approach 66.9 % | |||||
| Socio-cultural atmosphere 65.8 % | |||||
| Wilde et al. [ | Criterion validity | Correlation between long and short version in their entirety was 0.90 | Positive | Poor | |
| Larsson et al. [ | Structural validity | RMSEA of 0.050 was obtained indicating the model was an acceptable fit | Indeterminate | Good | |
| Quality from the Patients' Perspective Shortened (QPPS) | Larsson et al. [ | Internal consistency | Cronbach’s alpha 0.74 for overall scale | Positive | Excellent |
| Larsson et al. [ | Criterion validity | Pearson correlation coefficients all results statistically significant 0.0025 when Bonferroni corrections made | Positive | Excellent | |
| Picker Patient Experience Questionnaire(PPE-15) | Jenkinson et al. [ | Internal consistency | Cronbach’s alpha 0.8 | Positive | Good |
| Jenkinson et al. [ | Internal consistency | 0.89 for 4 pages | Positive | Excellent | |
| 0.87 for 12 pages | |||||
| Reeves et al. [ | Content validity | Focus groups, cognitive testing, amendments—research did not identify any missing items from patients’ perspective | Positive | Excellent | |
| Jenkinson et al. [ | Criterion validity | Correlations between short and long version between 0.93 ( | Positive | Good | |
| Jenkinson et al. [ | Hypothesis testing | Item correlations were above recommended levels for all PPE items in both survey versions (0.37–0.61) | Positive | Excellent | |
| NHS Inpatient Survey (NHSIP) | Boyd [ | Content validity | Tested and modified with group of inpatients | Positive | Excellent |
| Sizmur and Redding [ | Internal consistency | Item correlations given but Cronbach’s alpha not reported | Indeterminate | Fair | |
| Scottish Inpatient Patient Experience Survey (SIPES) | Scottish Government [ | Content validity | Extensive work with patient groups: survey, focus groups, stakeholder consultations, cognitive testing. Findings, the patient found the items relevant and comprehensive | Positive | Excellent |
| Scottish Government [ | Internal consistency | Cronbach’s alpha over 0.70 for each survey section | Positive | Poor | |
| Hong Kong Inpatient Experience Questionnaire (HKIEQ) | Hospital Authority [ | Internal consistency | Cronbach’s alpha 0.75 for overall scale | Positive | Fair |
| Hospital Authority [ | Reliability | Intraclass correlation 0.42–0.96 and test re-test 0.78 | Positive | Fair | |
| Hospital Authority [ | Content validity | Participants found the questionnaire to be clear, understandable, and appropriate | Positive | Excellent | |
| Hospital Authority [ | Structural validity | 17 factors explained 74 % of the variance | Positive | Fair | |
| Wong et al. [ | Internal consistency | Cronbach’s alpha 0.75 for overall scale | Positive | Fair | |
| Wong et al. [ | Structural validity | 18 factors explained 75.5 % of the variance | Positive | Fair | |
| Hospital Authority [ | Cross-cultural validity | Translated but not cross-culturally validated | Indeterminate | Fair | |
| Patient Experience Questionnaire (PEQ) | Pettersen et al. [ | Internal consistency | Cronbach’s alpha greater than 0.70 for overall scale | Positive | Fair |
| Pettersen et al. [ | Reliability | Test re-test 0.62–0.85 with ICC exceeding 0.7 | Positive | Fair | |
| Pettersen et al. [ | Content validity | Grouped more than 600 m written comments and held focus groups with previous inpatients to ensure relevant and sufficient items were covered | Positive | Good | |
| Pettersen et al. [ | Structural validity | 20 items, 6 factors accounted for 67 % total variance | Positive | Excellent | |
| Pettersen et al. [ | Hypothesis testing | Associations between rating scale and external measures, i.e. gender, age, fulfilment of expectations. Only mean differences computed | Indeterminate | Poor | |
| Norwegian Patient Experience Questionnaire (NORPEQ) | Oltedal [ | Internal consistency | Item correlation 0.59–0.71 and Cronbach’s alpha 0.85 | Positive | Fair |
| Oltedal [ | Reliability | Intraclass correlation 0.45–0.79 and test re-test 0.88 | Positive | Good | |
| Oltedal [ | Content validity | Patient interviews found questions and scaling easy to understand and all relevant questions covered | Positive | Good | |
| Oltedal [ | Structural validity | 6 items explained 57.7 % variance | Positive | Good | |
| Oltedal [ | Construct validity | Hypothesised scales scores would correlate 0.6–0.8 with satisfaction (correlation significant, range from high to low) | Positive | Good | |
| Scale scores would correlate 0.4–0.6 perceptions of incorrect treatment (moderate result) | |||||
| Scores would correlate 0.1–0.3 with patient health and physical health. (Result 0.19–0.27) | |||||
| Patient Experiences with Inpatient Care (I-PAHC) | Webster et al. [ | Internal consistency | Cronbach’s alpha >0.78 | Positive | Excellent |
| Webster et al. [ | Content validity | Focus groups, revisions by stakeholders, translated, cognitively tested and patient groups reported clear questions covering all aspects important to them | Positive | Excellent | |
| Webster et al. [ | Structural validity | Kept if item loadings greater than 0.40. Variance not reported | Indeterminate | Excellent | |
| Webster et al. [ | Construct validity | 5 factors with loadings 0.48–0.86. Results in accordance with priori hypothesis | Positive | Excellent | |
| Webster et al. [ | Cross-cultural validity | Translation done but not empirically tested | Indeterminate | Fair | |
| Patient Perceptions of Quality (PPQ) | Rao et al. [ | Internal consistency | Cronbach’s alpha >0.70 | Positive | Excellent |
| Rao et al. [ | Content validity | Questionnaire devised from qualitative interviews with patients | Positive | Excellent | |
| Rao et al. [ | Structural validity | 5 dimensions explained 73 % variance | Positive | Excellent |
Results of additional aspects of utility
| HCAHPS | QPP | QPPS | PPE-15 | NHSIP | SIPE | HKIEQ | PEQ | NORPEQ | I-PAHC | PPQ | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F | R | F | R | F | R | F | R | F | R | F | R | F | R | F | R | F | R | F | R | F | R | |
|
| ||||||||||||||||||||||
| 1. What are the number of observations (patients, raters, times) needed to reach the required level of reliability for the purpose of the instrument? | ≥300 [ | Poor | Not reported | Poor | Not reported | Poor | 330 per group [ | Poor | Not Reported | Poor | Variable but >100 | Poor | 300–500 [ | Poor | Not specified | Poor | Not specified | Poor | ≥230 [ | Poor | Not specified | Poor |
| 2. How long does an assessment take to complete? | 8 min [ | Good | 30 min [ | Good | ≤15 min | Excellent | 12 min [ | Excellent | 20 min (estimate) | Good | 20 min [ | Good | 25 min [ | Good | <30 min (estimate) | Good | >15 min (estimate) | Excellent | 15 min [ | Excellent | <30 min (estimate) | Good |
| 3. What are the administrative costs of completing the assessment? | V large numbers and expertise [ | Poor | Considerable [ | Fair | Brief and easy scoring [ | Excellent | Large no. and standardised data | Fair | Large no. and standardised | Fair | V large numbers and expertise | Poor | V large numbers and expertise | Poor | Considerable | Fair | Brief and simple scoring | Good | Interviewers required | Fair | Interviewer required [ | Fair |
| 4. What is the cost to complete a reliable sample? | Extensive | Poor | Considerable | Fair | Minimal | Good | Considerable | Fair | Extensive | Poor | Extensive | Poor | Extensive | Poor | Considerable | Fair | Moderate | Good | Moderate | Good | Considerable | Fair |
| Overall Rating | POOR | FAIR | GOOD | FAIR | POOR | POOR | POOR | FAIR | GOOD | GOOD | FAIR | |||||||||||
|
| ||||||||||||||||||||||
| 1. Is there evidence of subjects understanding of the instrument/assessment? | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent |
| 2. How many assessments are not completed? | 25 % miss RR 47 % | Good | 13 % miss RR 68 % [ | Good | 25 % miss RR 79 % [ | Good | 29 % miss RR 68 % [ | Good | No info RR 49 % [ | Good | No info RR 50 % [ | Good | 21 % miss RR 49 % [ | Good | >10 % mis RR 53 % [ | Excellent | 42.5 %mis RR 48 % [ | Excellent | High No RR 95 % [ | Good | 0 % miss RR 85 % [ | Excellent |
| 3. Has the instrument/assessment been tested in an appropriate context? | Yes [ | Excellent | Tested in simulation [ | Fair | Yes [ | Good | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes [ | Excellent | Yes | Excellent | Yes | Excellent | Yes | Excellent |
| Overall Rating | Good | Fair | Good | Good | Good | Good | Good | Excellent | Excellent | Good | Excellent | |||||||||||
|
| ||||||||||||||||||||||
| 1. Is there evidence of the instrument being used for its intended purpose? (i.e. if aim is to provide hospital ranking for patient selection, is there evidence that the results are influencing patient choice?) | Evidence of purpose [ | Excellent | Discussion of purpose but no evidence [ | Fair | Discussion of purpose but no evidence [ | Fair | Explanatory use for national comparison | Good | Clear evidence of purpose [ | Excellent | Explanatory use for national comparison [ | Good | Explanatory use for national benchmarking [ | Good | Clear evidence of purpose [ | Excellent | Explanatory use described [ | Good | Explanatory use described [ | Good | Explanatory use described [ | Good |
| 2. Is the scoring system easily translated or available in an easy to use format? | Easy scoring | Excellent | Easy scoring | Excellent | Easy scoring | Excellent | Easily scored | Excellent | Statistical knowledge | Fair | Easy colour coding | Excellent | Statistical expertise | Fair | Not explained | Poor | Easy scoring | Excellent | Easy scoring | Excellent | Easy scoring | Excellent |
| 3. Can the results be readily used for action where necessary? | Available but not at unit/team level | Good | Results actionable at local level | Excellent | Results actionable at local level | Excellent | Adjustments needed (Jenkinson comparison) | Fair | Expertise required to enable local action | Fair | Results at hospital level | Good | Results at hospital level | Good | No information | Poor | Readily available | Excellent | Readily available | Excellent | Readily available | Excellent |
| Overall Rating | Good | Fair | Fair | Fair | Fair | Good | Fair | Poor | Good | Good | Good | |||||||||||
F findings, R ratings
Results of Beattie and Murphy instrument utility matrix
| Instrument | Primary purpose | Validity | Reliability | Cost efficiency | Acceptability | Educational impact | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Content/theoretical development | Construct (structural, cross-cultural) | Criterion validity | Internal consistency | Other reliability | Was the correct error source investigated? | Rating | Rating | Rating | ||
| HCAHPS | National comparisons | *(−) | ****(?) | ****(+) | ***/****(+ ?) | Y | * | *** | *** | |
| QPP | Quality improvement | ****(+) | ***(+ ?) | *(+) | ****(+) | Y | ** | ** | ** | |
| QPPS | Quality improvement | ****(+) | ****(+) | P | *** | *** | ** | |||
| PPE-15 | National performance indicators | ****(+) | ****(+) | ***(+) | ***/****(+) | P | ** | *** | ** | |
| NHSIP | National performance indicators | ****(+) | **(?) | N | * | *** | ** | |||
| SIPES | National comparisons | ****(+) | *(+) | N | * | **** | *** | |||
| HKIEQ | National comparisons | ****(+) | **(+ ?) | **(+) | **(+) | Y | * | *** | ** | |
| PEQ | Quality improvement and national surveillance | ***(+) | **/***(+ ?) | **(+) | **(+) | Y | ** | **** | * | |
| NORPEQ | Cross-national comparisons in Nordic countries | ***(+) | ***(+) | **(+) | ***(+) | Y | *** | **** | *** | |
| I-PAHC | Quality improvement in low-income settings | ****(+) | ***/****(+ ?) | ****(+) | P | *** | *** | *** | ||
| PPQ | Local quality improvement | ****(+) | ****(+) | ****(+) | P | ** | **** | *** | ||
Ratings of study quality: *poor, ** fair, ***good, ****excellent. Ratings of measurement results: (+) positive rating, (−) negative rating, (?) indeterminate rating, (+ ?) mixed. Correct source of error: Y yes, N no, P partial