| Literature DB >> 35659234 |
Brenda M McGrath1, Linda Takamine1, Cainnear K Hogan1, Timothy P Hofer1,2, Amy K Rosen3,4, Jeremy B Sussman1,2, Wyndy L Wiitala1, Andrew M Ryan5, Hallie C Prescott6,7.
Abstract
BACKGROUND: Hospital-specific template matching (HS-TM) is a newer method of hospital performance assessment.Entities:
Keywords: Benchmarking; Hospital mortality; Quality of health care; Risk adjustment
Mesh:
Year: 2022 PMID: 35659234 PMCID: PMC9166576 DOI: 10.1186/s12913-022-08124-w
Source DB: PubMed Journal: BMC Health Serv Res ISSN: 1472-6963 Impact factor: 2.908
Six items included in each survey vignette
Fig. 1Accuracy, Confidence and Trust in the HS-TM-based vs Regression-Based Performance Assessments. Accuracy indicates whether the participant correctly classified the hospital as lower than average, average, or higher than average mortality. Confidence indicates how confident they were in their rating: Highly Confident, Moderately Confident, Slightly Confident, or Not at all Confident. Confidence is then dichotomized into Not Confident (Not at all Confident, Slightly Confident) or Confident (Moderately Confident, Highly Confident) and the p-value is the significance level of the difference in the percent Confident for HS-TM versus regression. Trust indicates their level of agreement with the following statement: I trust that the results of this performance report accurately reflect the mortality at my hospital relative to other hospitals. (Strongly Agree, Agree, Somewhat Agree, Neither Agree nor Disagree, Somewhat Disagree, Disagree, Strongly Disagree). The p-value indicates the significance level of the difference in the percent that trust the rating (Strongly Agree, Agree, or Somewhat Agree) using HS-TM versus regression
Serial logistic regression models assessing the association between approach (HS-TM vs regression) and correct interpretation of performance assessment vignettes
| Odds Ratio | 95% CI | p | Mean Percent Correct | 95% CI | |
|---|---|---|---|---|---|
| Approach | |||||
| Regression | ref | 0.56 | (0.47, 0.65) | ||
| HS-TM | 3.62 | (2.08, 6.28) | < .0001 | 0.82 | (0.75, 0.88) |
| Approach | |||||
| Regression | ref | 0.63 | (0.51, 0.74) | ||
| HS-TM | 6.26 | (3.10, 12.64) | < .0001 | 0.91 | (0.85, 0.95) |
| Scenario‡ | |||||
| Below-Average Mortality | 17.70 | (5.70, 54.98) | < .0001 | 0.95 | (0.88, 0.98) |
| Average Mortality | ref | 0.54 | (0.40, 0.67) | ||
| High-Average Mortality | 0.52 | (0.24, 1.13) | 0.10 | 0.38 | (0.25, 0.53) |
| Above-Average Mortality | 19.28 | (5.75, 64.72) | 0.91 | 0.96 | (0.88, 0.99) |
| Approach | |||||
| Regression | ref | 0.59 | (0.44, 0.72) | ||
| HS-TM | 6.33 | (3.10, 12.92) | < .0001 | 0.90 | (0.81, 0.95) |
| Scenario | |||||
| Below-Average Mortality | 19.41 | (6.08, 61.91) | < .0001 | 0.95 | (0.86, 0.98) |
| Average Mortality | ref | 0.48 | (0.33, 0.64) | ||
| High-Average Mortality | 0.52 | (0.24, 1.13) | < .0001 | 0.33 | (0.20, 0.49) |
| Above-Average Mortality | 21.06 | (6.12, 72.52) | 0.10 | 0.95 | (0.86, 0.98) |
| Self-Reported Statistical Knowledge | |||||
| Poor | ref | 0.78 | (0.65, 0.87) | ||
| Good | 1.05 | (0.46, 2.42) | 0.90 | 0.79 | (0.63, 0.89) |
| Confidence in Assessment | |||||
| Not Confident | ref | 0.72 | (0.51, 0.86) | ||
| Confident | 1.99 | (0.76, 5.21) | 0.16 | 0.83 | (0.75, 0.90) |
CI Confidence Interval
Across all 3 models the HS-TM approach was consistently associated with increased odds of correctly interpreting the performance assessment vignette
Concerns about the credibility of hospital performance assessments identified through thematic analysis of interview transcripts
| Concern | Representative Comments | |
|---|---|---|
| Concerns inherent to both regression and HS-TM | ||
| 1 | “I could look at a hospital that is two standard deviations above and say they’re just not doing a good job of coding.” “it’s nice to verify elements periodically…if our folks aren’t documenting with accuracy. A few years back, we had 75% of our pneumonias being coded as pneumonias not otherwise specified, so the bulk of that was our providers not being appropriately detailed consistently in their documentation, so we addressed that.” “we may find that we’re not doing a good job documenting our comorbidities, right? The model is dependent on comorbidities, and so, if we’re not doing a good job documenting those, maybe that’s a factor that’s driving our performance as opposed to saying oh there’s something wrong with the quality of our care.” | |
| 2 | “as a transplant hospital, you cannot account for a very select population of patients that are five times the standard deviation beyond the mean when they come into your hospital essentially dying and they’re there to try to get a lifesaving action, so I do need to consider some of the nuances of our hospital.” “we are a major cancer center, so we have a lot of patients referred to us for cancer care throughout the state. Literally we are physically attached to the only accredited cancer center in the state, so …you certainly have to take those nuances into consideration.” “one hospital’s cardiovascular admissions might be quite different from hospital to hospital depending on the sort of services that are being offered. We’re a tertiary or eve quaternary facility here, and so, even if you look at some of the cases we do of coronary interventions or [other advanced cardiac procedure], you know the surgery that we do here that isn’t offered at some of the other facilities, the admission diagnostic group might be the same, but the complexity of the patients might be different.” | |
| Concerns pertinent to both regression and HS-TM, but inherent to neither method | ||
| 3 | “we have that problem locally already where we get benchmarked against other VA hospitals, but not necessarily against our local cohort … So, for instance, my hospital isn’t necessarily benchmarked against the hospital down the street from me, they’re benchmarked against other VA hospitals with similar cohorts of things that you’re showing me here, but I lose that sort of local flavor which might actually make a difference…I still am convinced that there is a local flair to certain data that may not necessarily get accounted for.” “a patient with diabetes in [city a] may not be the equivalent of a patient with diabetes and [city b], because of other confounding factors like being in a rural area or being poor or—who knows—I could come up with probably dozens of potential confounders, so how well the model accounts for those things is important” | |
| 4 | “Well, I wouldn’t say that [the performance report’s] necessarily effective at describing the quality of care. It’s just telling me what my mortality rate is compared to the mean. So, I think it might be a leap to conclude that the problem may be quality of care” … | |
| 5 | “what we’re really doing is we’re seeing how we’re compared to everybody else, which means there’s going to be—I don’t want to say it—but there’s going to be a winner and there’s going to be a loser, right. There’s going to be a number one and there’s going to be somebody who’s at the end. The way I likened stuff like this is, if you took the top 10 violinists on the planet, the top 10 people who you felt were the greatest violinists, they were just amazing, and you applied SAIL to them, one of them would be a one-star violinist right?” “I prefer to say: here is the standard, and let’s all measure ourselves against what we think is the standard.” “if you keep doing norm referencing…someone’s always going to end up on top, and on the bottom, and they are going to feel terrible or really great about it, and it’s not necessarily true… you know our standard wait times are so much less than places in the Community that it’s embarrassing for the Community…but my wait time may be the worst in the country for VA…so this starts to not make sense at some point.” | |
| Concerns more pertinent to HS-TM than regression | ||
| 6 | “Well, 500 hospitalizations. This you know isn’t all that many in my mind…for each of these groupings, you end up with a relatively smaller number of patients to look at, and you know if you’re looking at a mortality rate of let’s just say it’s 3% and 500 you know that’s 15 people that died, right?” “So, it’s just that in a year’s period of time, the number of deaths we will have is not that great, and so even though I’m trying to adjust for all these factors, if in the next six month period for any random reasons I have three or four more deaths, I could suddenly be very close to the you know the 30 day mean with just a small with a relatively small number” | |
| Concerns more pertinent to regression that HS-TM | ||
| 7 | “[hospital-specific template matching is] getting closer to this idea of comparing yourself to other hospitals that are that are of the same ilk” “It just seems to make more sense to me to compare like facilities that are similar more so than trying to adjust facilities that are dissimilar and make them similar.” | |
| 8 | “you don’t know sometimes how tightly things are grouped. We had one quarter, where we move from fifth quintile to second in an acute MI measure… we looked and everybody was packed in there and it was four thousandths of a point difference that moved us about 80 ranking points…” “when the VA chooses to rank things from the top performing hospital to the lowest performing hospital, being a low performer in that type of ranking doesn't necessarily mean that you're performing badly” “I kind of look at the tails more than if I’m clustering the middle, I really don't know if I can distinguish whether facilities within that interquartile range of 25/75 are materially different from one another.” | |
| 9 | “there is certainly a habit of data coming down from above telling us we have a problem to fix it without us really having the ability to see what the data really is that goes into that assessment. We’re in the midst of that right now, where I’m told, something that I don’t agree with because, you know, as the person who looks at every admission every day I don’t see this, and so I don’t understand how our metric is so bad when I’m actually looking every day at what’s walking in the door and either I’m not measuring it correctly or I don’t understand what goes into the metric. And I need that information so that I can better understand whether there truly is a problem and how we would go about addressing it.” | |
Usability of Performance Assessments
| 1 | “if people were dying at a higher rate at my hospital, I wouldn't say that means that we're providing poor quality care. I think what it is, is it's a trigger to say, why are people dying. It's a trigger for a deeper dive.” “it's a little bit like ‘check oil’. There's a lot of reasons why that light may come on and so you need to get under the hood to understand why that that check oil light is coming on. So, I would never devote a huge amount of resources without having a good understanding of why we might be an outlier.” “[these data are] a flag or an indicator for something that that we might need to respond to” “in and of itself, the data doesn't say you're good, bad, or indifferent” “this would be enough for me to start trying to understand why does this exist” … are their service lines, care processes that that we need to be focusing improvement efforts on to bring these numbers down | |
| 2 | “it's possible that I might use this to impress upon certain stakeholders that this is indeed something that we need to devote some energy to” “it's been very well shown that if you want to motivate physician performance just show them where they stand as compared to their peers and they don't like to be [at the bottom]. You know, it's like lake Wobegon, 90% want to be in the top 10% of their class.” “what changes physician behavior in my experience, more than anything is a comparison to your peers in your hospital. I’ve been struck by how that's been true in different organizations, because you can look at a study and say, well, my patients are sicker or older, or they live further away, so I can't discharge them as early, but when you look at how your peers, are doing with the same patients in the same organization you start having to own the differences more, so some providers-specific data on outcomes could be an asset in prompting change” |