| Literature DB >> 35977255 |
Matthew E Barclay1,2, Mary Dixon-Woods1, Georgios Lyratzopoulos2.
Abstract
Importance: The Centers for Medicare & Medicaid Services (CMS) Hospital Compare star ratings are widely used summaries of hospital quality that can influence patient choice and organizational reputation. Objective: To identify the changes in hospital ratings and rankings associated with alternative methodological choices in the calculation of the 2021 CMS Hospital Compare star ratings. Design Setting and Participants: This cross-sectional study used publicly available 2021 Hospital Compare data for 3339 US hospitals from the October 2020 data release. Change in apparent hospital performance was assessed, arising when plausible alternatives to current methods are used for calculating star ratings in relation to individual measure standardization, domain derivation, and domain weighting. Three example changes were examined in detail, with more comprehensive changes considered using Monte Carlo simulation. Changes in centile of hospital ranks and in star rating overall were examined, as well as separately in CMS peer groupings defined by the number of reported quality domains. Main Outcomes and Measures: Proportion of hospitals receiving a different star rating under the alternative technical specifications than under the current (2021) CMS approach and mean absolute change in centile rank of hospitals under alternative technical specifications compared with the current (2021) CMS approach.Entities:
Mesh:
Year: 2022 PMID: 35977255 PMCID: PMC9107028 DOI: 10.1001/jamahealthforum.2022.1006
Source DB: PubMed Journal: JAMA Health Forum ISSN: 2689-0186
Baseline (2021) Technical Specifications of How Star Ratings Are Derived by CMS, Alongside Their Possible Limitations, and Justification of Alternatives
| Technical specification | Baseline (2021) CMS approach | Limitations of 2021 approach | Plausible alternative approach(es) considered in this study | Likely strengths of the alternative approach |
|---|---|---|---|---|
| 1. Standardization of individual measures | z Scoring used to standardize individual measures so that standardized performance represents the number of standard deviations above or below the mean | Discards contextual information about each measure: certain differences in performance may appear similar after | Use of individual measure-specific rules for transforming hospital performance on each measure to a 0-100 scale | Choice of measure-specific rules allows for identification of “good” (as opposed to “above average”) care quality |
| 2. Grouping of the 49 individual measures into higher-level domains | Individual measures are grouped into 5 domains, which align with those used by CMS to group measures | Measures in the same domain may be measuring different constructs, making interpretation more challenging because poor performance on some measures may be averaged out by better performance on other, uncorrelated measures grouped into the same domain | Assign measures to the same domain if they measure similar empirical constructs, using exploratory factor analysis | Makes it easier to understand domain scores because it avoids combining empirically unrelated measures |
| 3. Weighting of different quality domains before combining them into the composite score | A weight of 0.22 is assigned to each of the 4 outcome domains and a weight of 0.12 to the single-process domain | The weights given to each domain lack justification; the outcome of choice of weights on hospital performance is unclear | Give each domain the same weight | Allows use of weight distributions in Monte Carlo simulation, which in turn allows for assessment of uncertainty owing to choice of weights, while treating the 2021 weights as the ones most likely to be optimal |
| Draw plausible alternative weights from independent log-normal distributions centered on the 2021 domain weights |
Abbreviation: CMS, Centers for Medicare & Medicaid Services.
Performance of Hospitals Under the Baseline (2021) CMS Approach to Assigning Star Ratings and Change in Performance Under Each of the 3 Alternative Technical Specifications Considered Separately
| Technical specification | Baseline (2021) rating | Hospitals, No. (%) | Proportion of hospitals receiving each rating under this alternative design, % | Hospitals reclassified, % (95% CI) | Absolute change in centile of ranks, mean (95% CI) | ||||
|---|---|---|---|---|---|---|---|---|---|
| 5 Stars | 4 Stars | 3 Stars | 2 Stars | 1 Star | |||||
| Alternative standardization (external reference) | Any | 3339 (100) | 15.3 | 31.4 | 31.5 | 17.4 | 4.4 | 55.4 (53.7-57.1) | 15.4 (15.0-15.9) |
| 5 Stars | 369 (11.1) | 60.4 | 32.5 | 7.0 | 0.0 | 0.0 | 39.6 (34.7-44.6) | 11.3 (9.9-12.7) | |
| 4 Stars | 805 (24.1) | 25.3 | 49.4 | 23.1 | 1.9 | 0.2 | 50.6 (47.1-54.0) | 16.7 (15.7-17.7) | |
| 3 Stars | 1007 (30.2) | 7.6 | 37.8 | 44.4 | 9.4 | 0.7 | 55.6 (52.5-58.7) | 17.7 (16.9-18.4) | |
| 2 Stars | 839 (25.1) | 0.8 | 17.0 | 41.2 | 37.3 | 3.6 | 62.7 (59.4-65.9) | 16.0 (15.0-16.9) | |
| 1 Star | 319 (9.6) | 0.3 | 1.9 | 14.4 | 49.2 | 34.2 | 65.8 (60.5-70.8) | 8.7 (7.4-10.0) | |
| Alternative domain grouping (factor analysis) | Any | 3323 (100) | 11.9 | 29.0 | 30.9 | 20.2 | 8.0 | 31.9 (30.3-33.5) | 6.6 (6.4-6.9) |
| 5 Stars | 368 (11.1) | 85.1 | 14.9 | 0.0 | 0.0 | 0.0 | 14.9 (11.7-18.9) | 2.7 (2.4-3.0) | |
| 4 Stars | 799 (24.0) | 10.4 | 79.0 | 10.6 | 0.0 | 0.0 | 21.0 (18.3-24.0) | 6.9 (6.5-7.3) | |
| 3 Stars | 1003 (30.2) | 0.1 | 27.4 | 63.5 | 9.0 | 0.0 | 36.5 (33.6-39.5) | 8.8 (8.4-9.3) | |
| 2 Stars | 835 (25.1) | 0.0 | 0.4 | 36.0 | 56.8 | 6.8 | 43.2 (39.9-46.6) | 7.0 (6.5-7.5) | |
| 1 Star | 318 (9.6) | 0.0 | 0.0 | 0.9 | 33.3 | 65.7 | 34.3 (29.3-39.7) | 2.6 (2.2-3.0) | |
| Alternative domain weights (equal weights) | Any | 3339 (100) | 13.8 | 30.0 | 30.2 | 19.3 | 6.6 | 24.5 (23.0-26.0) | 2.6 (2.5-2.7) |
| 5 Stars | 369 (11.1) | 98.1 | 1.9 | 0.0 | 0.0 | 0.0 | 1.9 (0.9-3.9) | 0.9 (0.8-1.0) | |
| 4 Stars | 805 (24.1) | 12.4 | 86.8 | 0.7 | 0.0 | 0.0 | 13.2 (11.0-15.7) | 2.7 (2.6-2.9) | |
| 3 Stars | 1007 (30.2) | 0.0 | 29.5 | 69.8 | 0.7 | 0.0 | 30.2 (27.4-33.1) | 3.5 (3.3-3.7) | |
| 2 Stars | 839 (25.1) | 0.0 | 0.0 | 35.8 | 64.2 | 0.0 | 35.8 (32.6-39.1) | 2.7 (2.5-2.8) | |
| 1 Star | 319 (9.6) | 0.0 | 0.0 | 0.0 | 31.3 | 68.7 | 31.3 (26.5-36.6) | 0.9 (0.8-1.0) | |
Abbreviation: CMS, Centers for Medicare & Medicaid Services.
Wilson score method.
Normal approximation.
A total of 16 hospitals rated under the current methods could not be assigned scores under the alternative domain groupings owing to missing measure information.
Summary of Changes in Hospital Rating and Centile Rank Associated With the Different Alternative Technical Specifications Considered in the Monte Carlo Simulation by Baseline (2021) Star Rating, Overall, and Hospital Peer Group, Defined by the Number of Domains of Quality Reported
| Peer group | Baseline (2021) rating | Hospitals, No. (%) | Ratings across all Monte Carlo simulations, mean, % | Across Monte Carlo simulations, mean (IQR) | |||||
|---|---|---|---|---|---|---|---|---|---|
| 5 Stars | 4 Stars | 3 Stars | 2 Stars | 1 Star | Hospitals reclassified, % | Absolute change in centile of ranks | |||
| All hospitals | Any | 3339 (100) | 14.0 | 29.3 | 30.5 | 19.6 | 6.5 | 51.8 (44.2-59.7) | 15.0 (11.7-17.7) |
| 5 Stars | 369 (11.1) | 60.8 | 31.3 | 6.5 | 1.3 | 0.1 | 39.2 (31.4-47.4) | 11.2 (6.4-14.2) | |
| 4 Stars | 805 (24.1) | 20.3 | 52.8 | 22.7 | 3.9 | 0.4 | 47.2 (40.4-55.0) | 15.7 (12.3-18.7) | |
| 3 Stars | 1007 (30.2) | 6.2 | 32.4 | 46.2 | 14.0 | 1.2 | 53.8 (46.7-60.9) | 17.0 (14.6-19.5) | |
| 2 Stars | 839 (25.1) | 2.0 | 12.0 | 36.5 | 42.2 | 7.3 | 57.8 (49.5-66.6) | 15.5 (12.2-18.3) | |
| 1 Star | 319 (9.6) | 0.6 | 3.2 | 12.7 | 39.1 | 44.3 | 55.7 (42.9-67.7) | 9.8 (5.8-12.4) | |
| All 5 domains reported | Any | 2472 (100) | 15.1 | 29.8 | 30.0 | 18.9 | 6.2 | 52.0 (44.7-59.8) | 14.9 (11.9-17.4) |
| 5 Stars | 263 (10.6) | 65.5 | 29.1 | 4.6 | 0.7 | 0.1 | 34.5 (25.9-43.0) | 10.6 (6.4-13.2) | |
| 4 Stars | 576 (23.3) | 23.9 | 53.2 | 19.6 | 3.0 | 0.3 | 46.8 (41.1-54.0) | 15.7 (12.5-18.5) | |
| 3 Stars | 769 (31.1) | 6.6 | 34.6 | 45.3 | 12.6 | 0.9 | 54.7 (48.0-61.8) | 17.0 (14.7-19.4) | |
| 2 Stars | 635 (25.7) | 1.7 | 12.7 | 37.6 | 41.0 | 7.0 | 59.0 (50.1-68.3) | 15.3 (12.3-17.9) | |
| 1 Star | 229 (9.3) | 0.3 | 3.0 | 13.5 | 39.8 | 43.3 | 56.7 (43.2-69.4) | 9.4 (5.9-11.6) | |
| 4 Domains reported | Any | 767 (100) | 10.9 | 28.2 | 31.9 | 21.5 | 7.4 | 50.1 (41.6-59.5) | 14.8 (11.0-18.3) |
| 5 Stars | 100 (13.0) | 48.3 | 37.6 | 11.1 | 2.8 | 0.2 | 51.7 (36.0-68.0) | 12.5 (6.4-16.8) | |
| 4 Stars | 209 (27.2) | 11.1 | 52.1 | 30.6 | 5.7 | 0.4 | 47.9 (36.4-59.3) | 15.4 (11.7-19.0) | |
| 3 Stars | 219 (28.6) | 4.2 | 24.3 | 50.0 | 19.0 | 2.5 | 50.0 (41.1-58.9) | 16.8 (14.0-20.0) | |
| 2 Stars | 166 (21.6) | 1.5 | 8.7 | 32.0 | 48.6 | 9.1 | 51.4 (41.6-62.0) | 15.2 (11.4-18.6) | |
| 1 Star | 73 (9.5) | 0.6 | 2.6 | 9.9 | 38.6 | 48.3 | 51.7 (38.4-65.8) | 9.4 (4.5-13.2) | |
| 3 Domains reported | Any | 100 (100) | 12.2 | 24.5 | 31.1 | 22.9 | 9.3 | 60.0 (47.0-74.0) | 19.2 (10.8-27.3) |
| 5 Stars | 6 (6.0) | 60.3 | 23.2 | 12.6 | 3.7 | 0.3 | 39.7 (16.7-50.0) | 14.1 (1.7-26.0) | |
| 4 Stars | 20 (20.0) | 12.4 | 47.6 | 29.4 | 10.0 | 0.6 | 52.4 (30.0-75.0) | 20.9 (8.4-32.0) | |
| 3 Stars | 19 (19.0) | 12.6 | 32.9 | 39.2 | 14.1 | 1.2 | 60.8 (47.4-78.9) | 18.3 (13.8-22.3) | |
| 2 Stars | 38 (38.0) | 7.7 | 15.4 | 38.9 | 32.8 | 5.2 | 67.2 (55.3-81.6) | 20.6 (13.6-27.4) | |
| 1 Star | 17 (17.0) | 4.6 | 8.6 | 13.4 | 32.6 | 40.8 | 59.2 (35.3-76.5) | 16.8 (5.9-26.9) | |
Figure. Differences in Hospital Ranks and Star Ratings Associated With the 3 Main Alternative Specifications Considered
CMS indicates the Centers for Medicare & Medicaid Services.