| Literature DB >> 34115418 |
Yuxin Zhu1,2, Zheyu Wang1,2, Ava L Liberman3, Tzu-Pu Chang4,5, David Newman-Toker6,7,8.
Abstract
In longitudinal event data, a crude rate is a simple quantification of the event rate, defined as the number of events during an evaluation window, divided by the at-risk population size at the beginning or mid-time point of that window. The crude rate recently received revitalizing interest from medical researchers who aimed to improve measurement of misdiagnosis-related harms using administrative or billing data by tracking unexpected adverse events following a "benign" diagnosis. The simplicity of these measures makes them attractive for implementation and routine operational monitoring at hospital or health system level. However, relevant statistical inference procedures have not been systematically summarized. Moreover, it is unclear to what extent the temporal changes of the at-risk population size would bias analyses and affect important conclusions concerning misdiagnosis-related harms. In this article, we present statistical inference tools for using crude-rate based harm measures, as well as formulas and simulation results that quantify the deviation of such measures from those based on the more sophisticated Nelson-Aalen estimator. Moreover, we present results for a generalized multibin version of the crude rate, for which the usual crude rate is a single-bin special case. The generalized multibin crude rate is more straightforward to compute than the Nelson-Aalen estimator and can reduce potential biases of the single-bin crude rate. For studies that seek to use multibin measures, we provide simulations to guide the choice regarding number of bins. We further bolster these results using a worked example of stroke after "benign" dizziness from a large data set.Entities:
Keywords: crude rate; cumulative hazard; deviation bound; misdiagnosis-related harm
Mesh:
Year: 2021 PMID: 34115418 PMCID: PMC8365112 DOI: 10.1002/sim.9039
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.497
Simulation settings
|
| log10 |
| Expected number of incidences in (0,1.5] | Expected number of incidences in [3,6)×0.5 | Average cumulative hazard difference× | Average cumulative hazard log ratio |
|---|---|---|---|---|---|---|
| 0.6 | 3 | 3405.7 | 5 | 1.9 | 6.11 | 0.939 |
| 168.31 | 30 | 11.2 | 37.1 | 0.939 | ||
| 6.09 | 200 | 57.1 | 272 | 0.939 | ||
| 4 | 158 673.96 | 5 | 2 | 6.09 | 0.939 | |
| 7992.47 | 30 | 11.7 | 36.6 | 0.939 | ||
| 333.65 | 200 | 76 | 246 | 0.939 | ||
| 5 | 7 367 755.99 | 5 | 2 | 6.09 | 0.939 | |
| 371 814.22 | 30 | 11.7 | 36.6 | 0.939 | ||
| 15 722.76 | 200 | 77.9 | 244 | 0.939 | ||
| 0.8 | 3 | 374.88 | 5 | 3.2 | 3.56 | 0.438 |
| 39.29 | 30 | 18.3 | 21.6 | 0.438 | ||
| 3.26 | 200 | 84.8 | 158 | 0.438 | ||
| 4 | 6685.31 | 5 | 3.2 | 3.55 | 0.438 | |
| 710.81 | 30 | 19.2 | 21.3 | 0.438 | ||
| 65.65 | 200 | 124.2 | 143 | 0.438 | ||
| 5 | 118 917 | 5 | 3.2 | 3.55 | 0.438 | |
| 12 661.55 | 30 | 19.3 | 21.3 | 0.438 | ||
| 1180.7 | 200 | 128.5 | 142 | 0.438 | ||
| 1 | 3 | 99.75 | 5 | 4.9 | 0 | 0 |
| 16.42 | 30 | 27.8 | 0 | 0 | ||
| 2.24 | 200 | 115.2 | 0 | 0 | ||
| 4 | 999.75 | 5 | 5 | 0 | 0 | |
| 166.42 | 30 | 29.8 | 0 | 0 | ||
| 24.75 | 200 | 190.2 | 0 | 0 | ||
| 5 | 9999.75 | 5 | 5 | 0 | 0 | |
| 1666.42 | 30 | 30 | 0 | 0 | ||
| 249.75 | 200 | 199 | 0 | 0 |
FIGURE 1Simulation results for crude rate difference estimation and hypothesis testing when we have 10% censoring. Biases decrease with increased number of bins, decreased number of expected events, and larger sample sizes; type-I errors are generally all close to 0.05; power increases with more expected incidences, but not much with increase in sample size or number of bins [Colour figure can be viewed at wileyonlinelibrary.com]
FIGURE 2Simulation results for log crude rate ratio estimation and hypothesis testing when we have 10% censoring. Biases decrease with increased number of bins, decreased number of expected events, and larger sample sizes; type-I errors are generally all close to 0.05; power increases with increased numbers of expected incidences, but not much with increase in sample size or number of bins [Colour figure can be viewed at wileyonlinelibrary.com]
FIGURE 3Comparison of analysis results of the Taiwan NHIRD using difference number of bins. We observe that increasing the number of bins from 1 to 15 results in less than 0.2% of increase in crude rate difference, less than 0.03% of increase in crude rate ratio, and ignorable changes in test P-values. These observations suggest that using a small number of bins has little effect on analysis results