| Literature DB >> 34841400 |
Eric T Roberts1, Zirui Song2, Lin Ding3, J Michael McWilliams4.
Abstract
IMPORTANCE: Medicare's Merit-Based Incentive Payment System (MIPS), a public reporting and pay-for-performance program, adjusts clinician payments based on publicly reported measures that are chosen primarily by clinicians or their practices. However, measure selection raises concerns that practices could earn bonuses or avoid penalties by selecting measures on which they already perform well, rather than by improving care-a form of gaming. This has prompted calls for mandatory reporting on a smaller set of measures including patient experiences.Entities:
Mesh:
Year: 2021 PMID: 34841400 PMCID: PMC8623747 DOI: 10.1001/jamahealthforum.2021.3105
Source DB: PubMed Journal: JAMA Health Forum ISSN: 2689-0186
Figure 1. Proportions of Practices Including CAHPS Patient Experience Scores in the VM, by Quintile of Mean Baseline Scoresa
CAHPS Indicates Healthcare Providers and Systems; VM, Value-Based Payment Modifier.
aAmong 301 large physician practices (≥100 clinicians) that publicly reported CAHPS patient experience measures in 2016 (the last year of the Physician Quality Reporting System [PQRS] and VM) and started reporting them in either 2014 (140 practices) or 2015 (161 practices). Practices were categorized into quintiles of their baseline performance, which we calculated as an equally weighted average of scores on 11 patient experience domains in the first year a practice reported CAHPS measures for the PQRS (2014 or 2015). Practice-level scores in these 11 domains were reported by CMS in the VM Practice File.
bPercentage of practices voluntarily including CAHPS patient experience measures, assessed in the baseline year, in their overall VM quality score in the baseline year.
cPercentage of practices voluntarily including CAHPS measures, assessed 1 year after baseline, in their overall VM quality score 1 year after baseline.
dPercentage of practices voluntarily including CAHPS measures, assessed 2 years after baseline, in their overall VM quality score two years after baseline.
Before Intervention Characteristics and Changes Among Survey Respondents in Large and Smaller Physician Practices
| Respondent characteristics | Before intervention (2011-2013), % | Change from before to after intervention (2011-2013 to 2015-2016) | ||||
|---|---|---|---|---|---|---|
| Large practices (111-150 clinicians) [n = 4351 respondents in 344 practices] | Smaller practices (50-89 clinicians) [n = 9399 respondents in 945 practices] | % | Differential change | |||
| Large practices (111-150 clinicians) | Smaller practices (50-89 clinicians) | Estimate (95% CI) | ||||
| Age, y | 74.1 | 74.0 | −0.3 | −0.3 | 0.0 (−0.7 to 0.7) | .99 |
| Sex | ||||||
| Female | 56.8 | 59.1 | 1.1 | −2.4 | 3.5 (0.6 to 6.5) | .02 |
| Male | 43.2 | 40.9 | −1.1 | 2.4 | −3.5 (−6.5 to −0.6) | .02 |
| Race and ethnicity | ||||||
| Asian | 1.1 | 0.9 | 0.1 | 0.3 | −0.2 (−1.0 to 0.6) | .62 |
| Black | 4.6 | 5.5 | 2.2 | 0.1 | 2.1 (−2.8 to 7.0) | .39 |
| Hispanic | 2.7 | 2.6 | −0.3 | 0.4 | −0.7 (−2.1 to 0.6) | .28 |
| White | 89.3 | 89.4 | −3.2 | −2.2 | −1.0 (−6.4 to 4.4) | .72 |
| Other | 2.4 | 1.6 | 1.2 | 1.4 | −0.2 (−2.0 to 1.6) | .81 |
| Disabled | 16.1 | 14.7 | −1.7 | 1.8 | −3.5 (−6.2 to −0.8) | .01 |
| End-stage bladder disease | 0.6 | 0.6 | 0.2 | −0.1 | 0.3 (−0.2 to 0.8) | .18 |
| HCC score | 1.3 | 1.3 | 0.0 | 0.1 | −0.1 (−0.1 to 0.0) | .08 |
| CCW chronic conditions, number | 6.6 | 6.6 | 0.1 | 0.1 | 0.0 (−0.3 to 0.2) | .71 |
| Enrolled in Medicaid | 6.3 | 6.2 | 1.1 | −0.1 | 1.2 (−1.4 to 3.9) | .36 |
| Enrolled in a Medicare Savings Program | 3.5 | 3.7 | −0.2 | 0.2 | −0.4 (−1.7 to 0.9) | .56 |
| Education | ||||||
| Less than high school | 11.5 | 11.8 | −2.7 | −2.7 | 0.0 (−2.3 to 2.4) | .98 |
| High school graduate | 31.1 | 31.7 | −4.1 | −3.6 | −0.5 (−4.0 to 2.9) | .77 |
| Some college | 27.2 | 26.5 | 0.0 | 1.0 | −1.0 (−4.1 to 2.0) | .51 |
| College graduate | 11.3 | 10.7 | 2.1 | 1.7 | 0.4 (−2.1 to 2.8) | .78 |
| Graduate education | 15.6 | 15.3 | 3.5 | 3.5 | 0.0 (−2.9 to 3.0) | .98 |
| Current smoker | 9.0 | 9.1 | −1.5 | −0.5 | −1.0 (−2.8 to 0.8) | .28 |
| Use of helper to complete survey | 8.0 | 8.5 | 0.1 | 0.8 | −0.7 (−2.5 to 1.0) | .42 |
| Any functional limitations | 5.2 | 5.6 | 0.6 | −0.7 | 1.3 (0.0 to 2.6) | .04 |
| Self-reported general health score | 3.1 | 3.1 | 0.1 | 0.1 | 0.0 (−0.1 to 0.1) | .60 |
| Self-reported mental health score | 3.7 | 3.7 | 0.0 | 0.0 | 0.0 (−0.1 to 0.1) | .97 |
Abbreviation: CCW, Chronic Condition Data Warehouse; HCC, Hierarchical Condition Category.
Characteristics of respondents to the 2012 to 2014 and 2016 to 2017 fee-for-service Medicare Healthcare Providers and Systems (CAHPS) surveys, as assessed from the survey and Medicare enrollment and claims data from the year prior to the survey (unless otherwise noted). Respondents attributed to practices where they received the majority of primary care visits in the year prior to the survey. All estimates adjusted for CAHPS survey weights. Respondent characteristics for categorical variables are reported as percentages, which allows us to appropriately display survey-weighted distributions of patient characteristics in the population, rather than numerical frequencies which would not incorporate survey weighting.
Practice size calculated as the number of unique clinicians that billed under a practice’s taxpayer identification number in the year prior to the survey.
Number of respondents and practices in the preintervention period (2011-2013). In the postintervention period (2015-2016), the sample included 2989 respondents in 262 large practices (111-150 clinicians) and 4999 respondents in 635 smaller practices (50-89 clinicians). Across the preintervention and postintervention periods, the sample included 21 738 respondents in 2186 practices.
That is, changes in the mean or proportion of patients with the characteristic shown in the table row between large and smaller practices from the preintervention to the postintervention periods as defined in the table columns. We estimated these differential changes by fitting a respondent-level linear difference-in-differences model for each characteristic as a function of a postintervention period indicator, an indicator that a patient’s practice had 111 to 150 clinicians, and an interaction between these indicators. Differential changes are given by the regression coefficient on the interaction term. The 95% CIs and P values were calculated using robust standard errors clustered by practice (taxpayer identification number).
Category consists of beneficiaries whose race and ethnicity were classified as American Indian, Alaska Native, or Other in the Medicare beneficiary summary file.
Disability was original reason for Medicare entitlement.
Hierarchical Condition Category (HCC) scores constructed from Medicare beneficiaries’ demographic characteristics in the year prior to the survey and diagnoses claims from 2 years prior to the survey. Higher HCC scores indicate higher predicted spending in the following year.
Count of chronic conditions from the Medicare Chronic Condition Data Warehouse (CCW), which draws from claims since 1999 to measure the presence of 27 chronic diseases among Medicare beneficiaries. We assessed the presence of chronic conditions reported on claims prior to the survey year.
That is, enrollment in full Medicaid.
Enrollment in 1 of the Medicare Savings Programs, which are partial Medicaid benefits that pay for Medicare Part B premiums, and in some cases Parts A and B cost sharing, for Medicare beneficiaries with low incomes and assets (Section 5 in the Supplement).
A small proportion (approximately 4%) of respondents did not report their education. We retained these observations in regression analyses by including an indicator variable for missing education status.
Proportion reporting difficulty with 1 or more activities of daily living (bathing, dressing, eating, using chairs, walking, and using the toilet).
Assessed on a scale of 1 to 5, where 1 indicates poor self-rated health or mental health and 5 indicates excellent self-rated general or mental health.
Figure 2. Mean Annual Composite Patient Experience Scores in Large vs Smaller Practices
Plotted are unadjusted mean composite scores reflecting patient experiences with care from 2011 to 2013 and 2015 to 2016 in large practices (111-150 clinicians) and smaller practices (50-89 clinicians). Patient experiences with care in 2011 to 2013 and 2015 to 2016 assessed from the 2012 to 2014 and 2016 to 2017 fee-for-service Medicare Healthcare Providers and Systems surveys, respectively. We omitted the 2015 survey, pertaining to patient experiences in 2014, as a transitional year. Practice size was calculated as the number of unique clinicians that billed under a practice’s taxpayer identification number in the year prior to the survey. Scores are standardized to a 0 to 100 scale, with higher scores representing better patient experiences with care (Section 3 in the Supplement). Error bars represent 95% CIs for annual mean scores and were calculated using robust standard errors clustered by practice (taxpayer identification number). The unadjusted difference-in-differences estimate was −0.19 points of the composite score, equivalent to −0.12 practice-level standard deviations (SDs) of the composite score (95% CI, −0.73 to 0.50 SDs; P = .72).
Difference-in-Differences Estimates for Association Between Mandatory Public Reporting and Patient Experiences With Care
| Patient experience scores | Before intervention (2011-2013) | Difference-in-differences estimates | |||
|---|---|---|---|---|---|
| Mean scores among large practices (111-150 clinicians) | SDs of practice scores | Regression estimate | Effect size, SDs (95% CI) | ||
| Composite score | 80.6 | 1.6 | −0.05 | −0.03 (−0.64 to 0.58) | .92 |
| Domain-specific scores | |||||
| Rating of primary physician | 90.1 | 1.0 | −0.17 | −0.16 (−1.12 to 0.79) | .74 |
| Physician communication | 88.2 | 1.3 | 0.19 | 0.15 (−0.68 to 0.97) | .73 |
| Timely access to care | 66.2 | 2.3 | −0.24 | −0.10 (−0.55 to 0.34) | .65 |
| Access to specialists | 84.8 | 2.5 | 0.95 | 0.38 (−0.49 to 1.25) | .39 |
| Care coordination | 80.4 | 1.5 | 0.48 | 0.32 (−0.38 to 1.01) | .37 |
Abbreviations: CAHPS, Consumer Assessment of Healthcare Providers and Systems; HRR, Hospital Referral Region; MA, Medicare Advantage.
Patient experiences with care assessed from the fee-for-service Medicare CAHPS survey. We used responses to surveys administered from 2012 to 2014 and 2016 to 2017 to assess patient experiences with care from 2011 to 2013 and 2015 to 2016, respectively. We omitted the 2015 survey, pertaining to patient experiences in 2014, as a transitional year.
Mean scores among practices with 111 to 150 clinicians in the preintervention period, adjusted for respondent characteristics in Table 1, annual county-level MA penetration rates, HRR fixed effects, year fixed effects, and survey weights. Scores are standardized to a 0 to 100 scale, with higher scores representing better patient experiences with care (Section 3 in the Supplement).
Standard deviation (SD) of the practice-level distribution of patient experience scores, estimated among all practices in the preintervention period (Section 5 in the Supplement for details of this calculation).
Difference-in-differences estimates represent the differential change in composite or domain-specific patient experience scores between large practices (111-150 clinicians) and smaller practices (50-89 clinicians) from the preintervention period (2011-2013) to the postintervention period (2015-2016), adjusted for respondent characteristics in Table 1 and survey weights.
Effect sizes are difference-in-differences estimates scaled by the practice-level SD of each score. An effect size of −0.16 SDs is equivalent to the difference between the median practice (50th percentile) and a practice at the 44th percentile of performance. The corresponding 95% CIs are also scaled by the practice-level SD in each score.
95% CIs and P values were calculated using robust standard errors clustered by practice (taxpayer identification number).
Calculated at the patient level as an equally weighted average of items comprising all domain-specific scores (Section 3 in the Supplement).
Calculated at the patient level as an equally weighted average (at the patient level) of items within each domain-specific score (Section 3 in the Supplement).