Robert W Krell1, Ahmed Hozain2, Lillian S Kao3, Justin B Dimick1. 1. Department of Surgery, University of Michigan Health System, Ann Arbor. 2. Department of Surgery, Michigan State University College of Human Medicine, East Lansing. 3. Department of Surgery, The University of Texas at Houston Medical School, Houston.
Abstract
IMPORTANCE: Quality improvement platforms commonly use risk-adjusted morbidity and mortality to profile hospital performance. However, given small hospital caseloads and low event rates for some procedures, it is unclear whether these outcomes reliably reflect hospital performance. OBJECTIVE: To determine the reliability of risk-adjusted morbidity and mortality for hospital performance profiling using clinical registry data. DESIGN, SETTING, AND PARTICIPANTS: A retrospective cohort study was conducted using data from the American College of Surgeons National Surgical Quality Improvement Program, 2009. Participants included all patients (N = 55,466) who underwent colon resection, pancreatic resection, laparoscopic gastric bypass, ventral hernia repair, abdominal aortic aneurysm repair, and lower extremity bypass. MAIN OUTCOMES AND MEASURES: Outcomes included risk-adjusted overall morbidity, severe morbidity, and mortality. We assessed reliability (0-1 scale: 0, completely unreliable; and 1, perfectly reliable) for all 3 outcomes. We also quantified the number of hospitals meeting minimum acceptable reliability thresholds (>0.70, good reliability; and >0.50, fair reliability) for each outcome. RESULTS: For overall morbidity, the most common outcome studied, the mean reliability depended on sample size (ie, how high the hospital caseload was) and the event rate (ie, how frequently the outcome occurred). For example, mean reliability for overall morbidity was low for abdominal aortic aneurysm repair (reliability, 0.29; sample size, 25 cases per year; and event rate, 18.3%). In contrast, mean reliability for overall morbidity was higher for colon resection (reliability, 0.61; sample size, 114 cases per year; and event rate, 26.8%). Colon resection (37.7% of hospitals), pancreatic resection (7.1% of hospitals), and laparoscopic gastric bypass (11.5% of hospitals) were the only procedures for which any hospitals met a reliability threshold of 0.70 for overall morbidity. Because severe morbidity and mortality are less frequent outcomes, their mean reliability was lower, and even fewer hospitals met the thresholds for minimum reliability. CONCLUSIONS AND RELEVANCE: Most commonly reported outcome measures have low reliability for differentiating hospital performance. This is especially important for clinical registries that sample rather than collect 100% of cases, which can limit hospital case accrual. Eliminating sampling to achieve the highest possible caseloads, adjusting for reliability, and using advanced modeling strategies (eg, hierarchical modeling) are necessary for clinical registries to increase their benchmarking reliability.
IMPORTANCE: Quality improvement platforms commonly use risk-adjusted morbidity and mortality to profile hospital performance. However, given small hospital caseloads and low event rates for some procedures, it is unclear whether these outcomes reliably reflect hospital performance. OBJECTIVE: To determine the reliability of risk-adjusted morbidity and mortality for hospital performance profiling using clinical registry data. DESIGN, SETTING, AND PARTICIPANTS: A retrospective cohort study was conducted using data from the American College of Surgeons National Surgical Quality Improvement Program, 2009. Participants included all patients (N = 55,466) who underwent colon resection, pancreatic resection, laparoscopic gastric bypass, ventral hernia repair, abdominal aortic aneurysm repair, and lower extremity bypass. MAIN OUTCOMES AND MEASURES: Outcomes included risk-adjusted overall morbidity, severe morbidity, and mortality. We assessed reliability (0-1 scale: 0, completely unreliable; and 1, perfectly reliable) for all 3 outcomes. We also quantified the number of hospitals meeting minimum acceptable reliability thresholds (>0.70, good reliability; and >0.50, fair reliability) for each outcome. RESULTS: For overall morbidity, the most common outcome studied, the mean reliability depended on sample size (ie, how high the hospital caseload was) and the event rate (ie, how frequently the outcome occurred). For example, mean reliability for overall morbidity was low for abdominal aortic aneurysm repair (reliability, 0.29; sample size, 25 cases per year; and event rate, 18.3%). In contrast, mean reliability for overall morbidity was higher for colon resection (reliability, 0.61; sample size, 114 cases per year; and event rate, 26.8%). Colon resection (37.7% of hospitals), pancreatic resection (7.1% of hospitals), and laparoscopic gastric bypass (11.5% of hospitals) were the only procedures for which any hospitals met a reliability threshold of 0.70 for overall morbidity. Because severe morbidity and mortality are less frequent outcomes, their mean reliability was lower, and even fewer hospitals met the thresholds for minimum reliability. CONCLUSIONS AND RELEVANCE: Most commonly reported outcome measures have low reliability for differentiating hospital performance. This is especially important for clinical registries that sample rather than collect 100% of cases, which can limit hospital case accrual. Eliminating sampling to achieve the highest possible caseloads, adjusting for reliability, and using advanced modeling strategies (eg, hierarchical modeling) are necessary for clinical registries to increase their benchmarking reliability.
Authors: Peter K Lindenauer; Denise Remus; Sheila Roman; Michael B Rothberg; Evan M Benjamin; Allen Ma; Dale W Bratzler Journal: N Engl J Med Date: 2007-01-26 Impact factor: 91.245
Authors: John D Birkmeyer; David M Shahian; Justin B Dimick; Samuel R G Finlayson; David R Flum; Clifford Y Ko; Bruce Lee Hall Journal: J Am Coll Surg Date: 2008-09-19 Impact factor: 6.113
Authors: J Daley; M G Forbes; G J Young; M P Charns; J O Gibbs; K Hur; W Henderson; S F Khuri Journal: J Am Coll Surg Date: 1997-10 Impact factor: 6.113
Authors: Darrell A Campbell; William G Henderson; Michael J Englesbe; Bruce L Hall; Michael O'Reilly; Dale Bratzler; E Patchen Dellinger; Leigh Neumayer; Barbara L Bass; Matthew M Hutter; James Schwartz; Clifford Ko; Kamal Itani; Steven M Steinberg; Allan Siperstein; Robert G Sawyer; Douglas J Turner; Shukri F Khuri Journal: J Am Coll Surg Date: 2008-10-10 Impact factor: 6.113
Authors: Sarah Hudson Scholle; Joachim Roski; John L Adams; Daniel L Dunn; Eve A Kerr; Donna Pillittere Dugan; Roxanne E Jensen Journal: Am J Manag Care Date: 2008-12 Impact factor: 2.229
Authors: Jacob V Spertus; Sharon-Lise T Normand; Robert Wolf; Matt Cioffi; Ann Lovett; Sherri Rose Journal: Circ Cardiovasc Qual Outcomes Date: 2016-11-08
Authors: Bradley N Reames; Daniel Bacal; Robert W Krell; John D Birkmeyer; Nancy J O Birkmeyer; Jonathan F Finks Journal: Surg Obes Relat Dis Date: 2014-03-28 Impact factor: 4.734
Authors: Michael P Thompson; Cameron M Kaplan; Yu Cao; Gloria J Bazzoli; Teresa M Waters Journal: Health Serv Res Date: 2016-10-21 Impact factor: 3.402
Authors: Justin T Brady; Bona Ko; Samuel F Hohmann; Benjamin P Crawshaw; Jennifer A Leinicke; Scott R Steele; Knut M Augestad; Conor P Delaney Journal: Surg Endosc Date: 2017-12-27 Impact factor: 4.584
Authors: Michael S Calderwood; Ken Kleinman; Susan S Huang; Michael V Murphy; Deborah S Yokoe; Richard Platt Journal: Med Care Date: 2017-01 Impact factor: 2.983