| Literature DB >> 28330409 |
Ilona Wm Verburg1,2, Rebecca Holman1,2,3, Niels Peek4, Ameen Abu-Hanna1, Nicolette F de Keizer1,2.
Abstract
Funnel plots are graphical tools to assess and compare clinical performance of a group of care professionals or care institutions on a quality indicator against a benchmark. Incorrect construction of funnel plots may lead to erroneous assessment and incorrect decisions potentially with severe consequences. We provide workflow-based guidance for data analysts on constructing funnel plots for the evaluation of binary quality indicators, expressed as proportions, risk-adjusted rates or standardised rates. Our guidelines assume the following steps: (1) defining policy level input; (2) checking the quality of models used for case-mix correction; (3) examining whether the number of observations per hospital is sufficient; (4) testing for overdispersion of the values of the quality indicator; (5) testing whether the values of quality indicators are associated with institutional characteristics; and (6) specifying how the funnel plot should be constructed. We illustrate our guidelines using data from the Dutch National Intensive Care Evaluation registry. We expect that our guidelines will be useful to data analysts preparing funnel plots and to registries, or other organisations publishing quality indicators. This is particularly true if these people and organisations wish to use standard operating procedures when constructing funnel plots, perhaps to comply with the demands of certification.Entities:
Keywords: Funnel plot; benchmarking; case-mix correction; intensive care unit; mortality; overdispersion; prediction models; quality indicators; sample size; workflow diagram
Mesh:
Year: 2017 PMID: 28330409 PMCID: PMC6193208 DOI: 10.1177/0962280217700169
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Workflow diagram as UML activity diagram: funnel plots for proportions, risk-adjusted rates and standard rates.
Figure 2.Flowchart illustrating the inclusion and exclusion criteria for entry into the study for proportion of mortality, standardised mortality rate and standardised readmission rate.
The results of the first five steps when producing a funnel plot for the NICE quality indicators.
| Step in the process | Outcome or test | Proportion mortality full population | SMR full population | SMR medical admissions | SMR emergency surgery | SMR elective surgery |
|---|---|---|---|---|---|---|
| Step 1 | Total admissions | 81,828 | 75,315 | 34,829 | 9,032 | 31,454 |
| Median admissions (range ICUs | 684 (222–3,546) | 643 (214–3,425) | 352 (76–1,179) | 75 (22–362) | 182 (13–2253) | |
| Overall percentage of deaths (range ICUs) | 11.9 (3.6–21.4) | 11 (3.7–20.6) | 17.7 (7.7–28.9) | 14.4 (4.4–28.6) | 27.2 (0.0–10.7) | |
| Overall number of deaths (range ICUs) | 9,705 (11–337) | 8,265 (11–310) | 6,178 (8–206) | 1,300 (2–69) | 787 (0–63) | |
| Overall standardised rate (range ICUs) | – | 1.00 (0.51–1.51) | 1.00 (0.50–1.62) | 1.00 (0.25–1.99) | 0.99 (0–3.32) | |
| Overall risk adjusted rate (range ICUs) | – | 0.11 (0.02–0.27) | 0.18 (0.04–0.38) | 0.14 (0.01–0.54) | 0.02 (0–0.27) | |
| Step2 | Moderate calibration[ | – | α = 0.00 (−0.00 to 0.01); β= 0.99 (0.97–1.01) | α = −0.00 (−0.06 to 0.00); β = 1.00 (0.98–1.02) | α = 0.01 (−0.00 to 0.01); β = 0.96 (0.94–0.99)[ | α = 0.02 (−0.00 to 0.05); β = 0.77 (0.68–0.86)[ |
| Center-based calibration[ | – | α = 0.01 (−0.00 to 0.03); β = 0.87 (0.72–1.03) | α = 0.04 (0.02–0.07); β = 0.74 (0.60–0.88)[ | α = 0.04 (0.00–0.07); β = 0.75 (0.51–0.98)[ | α = 0.01 (0.00–0.02); β = 0.75 (0.51–0.98)[ | |
| Patient level scaled brier score | – | 0.33 | 0.32 | 0.27 | 0.12[ | |
| Concordance statistic | – | 0.90 | 0.87 | 0.85 | 0.85 | |
| Step 3[ | Required sample size for 95% control (number and percentage ICUs with insufficient sample size) | 274 (4; 5%) | 304 (8; 9%) | 181 (9; 11%) | 168 (68; 80%)[ | 1,359 (79; 93%)[ |
| Required sample size for 99% control limits (number and percentage ICUs with insufficient sample size) | 410 (16; 19%) | 445 (24; 28%) | 269 (20; 24%) | 251 (74; 87%)[ | 2,028 (84; 99%)[ | |
| Step 4[ | Winsorised estimate φ ( | 5.50 ( | 2.45 ( | 1.71 ( | 1.03 (p = 0.41) | 1.10 (p = 0.24) |
| Step 5 | Number of admissions (divided by 100): relative odds ratio
(exp(β (CI))[ | 1.00 (1.00–1.00)[ | 1.00 (1.00–1.00) | 1.00 (0.99–1.01) | 0.96 (0.90–1.01) | 1.00 (0.99–1.01) |
| Average predicted probability: relative odds ratio (exp(β (CI))[ | – | 0.64 (0.29–1.40) | 0.23 (0.12–0.42)[ | 0.25 (0.06–1.10) | 0.00 (0.00–0.06)[ | |
| Hospital type: specialised academic, teaching or general
hospital: relative odds ratio (exp(β (CI))[ | 0.95 (0.93–0.98)[ | 0.99 (0.96–1.02) | 1.01 (0.97–1.05) | 0.96 (0.89–1.03) | 0.94 (0.85–1.03) |
SMR: standardised mortality rate; ICU: intensive care unit; CI: confidence interval.
Step 3 and 4 are examined using risk adjusted rates in state of standardised rates.
Ordinary least square regression for 50 subgroups of predicted mortality (x = mean predicted and y = mean observed).
Ordinary least square regression for 85 ICUs (x = mean predicted and y = mean observed).
H0:there is no significant relationship between outcome measure and number of admissions or average predicted probability.
H0:distribution of the outcome measure is identical for each hospital type.
Significant p-value < 0.05.
Relative odds ratio significant different from 1, i.e. the confidence interval of relative odds ratios does not contain 1.
The confidence interval around α does not contain 0 or the confidence interval around β does not contain 1.
The scaled brier score < 0.16.
Sample size not sufficient enough for more than 50% of the hospitals.
Grey cells: stop the process, results are not satisfactory according to the statistical analyses plan in chapter 4
Figure 3.Funnel plot for the crude proportion of in-hospital mortality. The value of the quality indicator is presented on the vertical axis and the number of ICU admissions included when calculating the quality indicator is presented on the horizontal axis. Small dots represent ICUs and the solid line represent the benchmark value. Dashed lines represent control limits, different types of dashed line are used to differentiate between the 95% and 99% control limits.
Figure 4.Funnel plot for the APACHE IV standardised in-hospital mortality rate for all ICU admissions, control limits inflated for overdispersion. The value of the quality indicator is presented on the vertical axis and the number of ICU admissions included when calculating the quality indicator is presented on the horizontal axis. Small dots represent ICUs and the solid line represents the benchmark value. Dashed lines represent control limits, different types of dashed lines are used to differentiate between the 95% and 99% control limits.