| Literature DB >> 31240171 |
Shabbar I Ranapurwala1,2, Joseph E Cavanaugh3,4, Tracy Young3, Hongqian Wu4, Corinne Peek-Asa3, Marizen R Ramirez3,5.
Abstract
BACKGROUND: The goal of predictive modelling is to identify the likelihood of future events, such as the predictive modelling used in climate science to forecast weather patterns and significant weather occurrences. In public health, increasingly sophisticated predictive models are used to predict health events in patients and to screen high risk individuals, such as for cardiovascular disease and breast cancer. Although causal modelling is frequently used in epidemiology to identify risk factors, predictive modelling provides highly useful information for individual risk prediction and for informing courses of treatment. Such predictive knowledge is often of great utility to physicians, counsellors, health education specialists, policymakers or other professionals, who may then advice course correction or interventions to prevent adverse health outcomes from occurring. In this manuscript, we use an example dataset that documents farm vehicle crashes and conventional statistical methods to forecast the risk of an injury or death in a farm vehicle crash for a specific individual or a scenario.Entities:
Keywords: Decision support techniques; Forecasting; Motor vehicles; Predictions
Year: 2019 PMID: 31240171 PMCID: PMC6572740 DOI: 10.1186/s40621-019-0208-9
Source DB: PubMed Journal: Inj Epidemiol ISSN: 2197-1714
Distribution of farm vehicle crashes and resulting injuries and deaths by calendar year and state: 2005–2010
| Farm vehicle crashes | Injuries and deaths | Deaths | |
|---|---|---|---|
| Total | 7084 | 2087 (29.5) | 163 (2.3) |
| Calendar Year | |||
| 2005 | 1166 (16.5) | 338 (29.0) | 21 (1.8) |
| 2006 | 1114 (15.7) | 318 (28.5) | 29 (2.6) |
| 2007 | 1198 (16.9) | 336 (28.0) | 26 (2.2) |
| 2008 | 1160 (16.4) | 316 (27.2) | 22 (1.9) |
| 2009 | 1196 (16.9) | 407 (34.0) | 37 (3.1) |
| 2010 | 1250 (17.6) | 372 (29.8) | 28 (2.2) |
| State | |||
| Iowa | 1178 (16.6) | 421 (35.7) | 35 (3.0) |
| Illinois | 1214 (17.1) | 421 (34.7) | 27 (2.2) |
| Kansas | 700 (9.9) | 186 (26.6) | 19 (2.7) |
| Minnesota | 850 (12.0) | 199 (23.4) | 22 (2.6) |
| Missouri | 1084 (15.3) | 207 (19.1) | 12 (1.1) |
| North Dakota | 253 (3.6) | 51 (20.2) | 12 (4.7) |
| Nebraska | 536 (7.6) | 189 (35.3) | 10 (1.9) |
| South Dakota | 232 (3.3) | 74 (31.9) | 8 (3.4) |
| Wisconsin | 1037 (14.6) | 339 (32.7) | 18 (1.7) |
Injury / death status, and estimated regression coefficients for non-modifiable, non + semi-modifiable, and non + semi+modifiable risk factors to predict the risk of injury or death in a farm crash: 2005–2010
| Variables | Categories | Injured or died | Model coefficients (std. error) | |||
|---|---|---|---|---|---|---|
| Yes | No | Model 1a | Model 2b | Model 3c | ||
| Intercept | Intercept (constant) | 2087 | 12,747 | −1.75 (0.18) | −1.73 (0.24) | −0.97 (0.25) |
| Non-modifiable factors | ||||||
| State | Iowa | 421 | 1858 | 0.39 (0.10) | 0.46 (0.10) | 0.42 (0.10) |
| Illinois | 421 | 2230 | 0.19 (0.10) | 0.20 (0.11) | 0.33 (0.11) | |
| Kansas | 186 | 1291 | −0.19 (0.12) | −0.21 (0.12) | − 0.26 (0.13) | |
| Minnesota | 199 | 1688 | −0.30 (0.11) | −0.24 (0.12) | − 0.08 (0.12) | |
| Missouri | 207 | 1686 | −0.09 (0.11) | −0.21 (0.12) | − 0.34 (0.12) | |
| North Dakota | 51 | 431 | −0.43 (0.21) | −0.50 (0.21) | − 0.56 (0.21) | |
| Nebraska | 189 | 970 | 0.16 (0.12) | 0.11 (0.12) | 0.08 (0.12) | |
| South Dakota | 74 | 379 | 0.16 (0.17) | 0.14 (0.18) | −0.06 (0.19) | |
| Wisconsin (referent) | 339 | 2214 | 0 | 0 | 0 | |
| Season | Winter (Jan-Mar) (referent) | 192 | 1510 | 0 | 0 | 0 |
| Planting (Apr-May) | 341 | 2307 | 0.06 (0.12) | 0.20 (0.12) | 0.17 (0.12) | |
| Growing (Jun-Aug) | 635 | 3492 | 0.30 (0.11) | 0.42 (0.11) | 0.38 (0.11) | |
| Harvesting (Sep-Dec) | 919 | 5438 | 0.19 (0.10) | 0.16 (0.10) | 0.15 (0.10) | |
| Weather at the time of crash | Clear (referent) | 1618 | 9802 | 0 | 0 | 0 |
| Cloudy | 329 | 2081 | −0.01 (0.08) | − 0.09 (0.08) | − 0.11 (0.08) | |
| Rain | 78 | 376 | 0.17 (0.15) | −0.03 (0.16) | 0.03 (0.16) | |
| Snow/sleet/hail/freezing rain/drizzle | 22 | 289 | −0.74 (0.28) | −0.89 (0.28) | − 0.87 (0.28) | |
| Fog/smog/smoke/other | 40 | 199 | 0.15 (0.22) | 0.01 (0.22) | −0.08 (0.23) | |
| Time of crash | 12:00–5:59 am (referent) | 131 | 739 | 0 | 0 | 0 |
| 6:00–11:59 am | 501 | 3478 | −0.11 (0.13) | 0.13 (0.14) | 0.17 (0.14) | |
| 12:00–5:59 pm | 885 | 6218 | −0.14 (0.13) | 0.08 (0.13) | 0.08 (0.14) | |
| 6:00–11:59 pm | 570 | 2312 | 0.41 (0.14) | 0.12 (0.14) | 0.11 (0.14) | |
| Number of vehicles | Single vehicle | 238 | 827 | 1.37 (0.10) | 1.22 (0.13) | 1.22 (0.14) |
| Two or more vehicles (referent) | 1849 | 11,920 | 0 | 0 | 0 | |
| Equipment type | Farm vehicle/equipment | 672 | 6875 | −1.14 (0.06) | −1.02 (0.06) | −1.63 (0.08) |
| Non-farm vehicle (referent) | 1415 | 5872 | 0 | 0 | 0 | |
| Age | < 16 years age | 159 | 799 | 0.24 (0.11) | 0.04 (0.13) | 0.13 (0.14) |
| 16–24 years age | 388 | 2120 | 0.24 (0.09) | 0.21 (0.09) | 0.22 (0.10) | |
| 25–34 years age (referent) | 234 | 1839 | 0 | 0 | 0 | |
| 35–44 years age | 288 | 1851 | 0.15 (0.10) | 0.15 (0.10) | 0.20 (0.10) | |
| 45–54 years age | 325 | 2326 | 0.11 (0.09) | 0.10 (0.10) | 0.13 (0.10) | |
| 55–64 years age | 278 | 1794 | 0.22 (0.10) | 0.24 (0.10) | 0.30 (0.10) | |
| 65+ years age | 415 | 2018 | 0.49 (0.09) | 0.51 (0.09) | 0.59 (0.10) | |
| Sex | Female (referent) | 626 | 2663 | 0 | 0 | 0 |
| Male | 1461 | 10,084 | −0.19 (0.06) | −0.14 (0.06) | −0.26 (0.06) | |
| Semi-modifiable factors | ||||||
| Light | Daylight (referent) | 1407 | 10,195 | 0 | 0 | |
| Dark-street lights on | 27 | 223 | −0.20 (0.25) | −0.27 (0.26) | ||
| Dark-no street lights | 553 | 1882 | 0.57 (0.10) | 0.59 (0.10) | ||
| Other | 100 | 447 | 0.50 (0.14) | 0.42 (0.15) | ||
| Manner of collision | Non collision (referent) | 294 | 1301 | 0 | 0 | |
| Head-on | 127 | 415 | 0.40 (0.17) | 0.40 (0.18) | ||
| Rear-end | 716 | 2747 | 0.28 (0.13) | 0.28 (0.13) | ||
| Angle, oncoming left turn | 361 | 2343 | 0.04 (0.14) | 0.10 (0.14) | ||
| Sideswipe, same direction | 192 | 2698 | −0.85 (0.15) | −0.73 (0.16) | ||
| Sideswipe, opposite direction | 152 | 1462 | −0.65 (0.16) | −0.55 (0.16) | ||
| Other | 245 | 1781 | −0.21 (0.14) | −0.21 (0.14) | ||
| Vehicle action | Heading straight (referent) | 1510 | 6934 | 0 | 0 | |
| Turning | 133 | 2489 | −0.55 (0.11) | −0.62 (0.11) | ||
| Overtaking/ passing/ changing lanes | 280 | 1750 | −0.25 (0.09) | −0.34 (0.10) | ||
| Slowing/stopping | 43 | 608 | −0.89 (0.17) | −0.84 (0.18) | ||
| Other | 121 | 966 | −0.30 (0.11) | −0.35 (0.11) | ||
| Multiple passengers | No (referent) | 1392 | 9908 | 0 | 0 | |
| Yes | 695 | 2839 | 0.28 (0.08) | 0.34 (0.08) | ||
| Driver | No (referent) | 433 | 1710 | 0 | 0 | |
| Yes | 1654 | 11,037 | −0.16 (0.09) | −0.08 (0.09) | ||
| Modifiable factors | ||||||
| Driver contributing circumstance | No contributing action (referent) | 871 | 6508 | 0 | ||
| Disregarded traffic regulation | 302 | 1487 | 0.35 (0.09) | |||
| Reckless, careless, negligent, aggressive driving | 407 | 2510 | 0.16 (0.08) | |||
| Inattentive/distracted driver | 256 | 1093 | 0.25 (0.09) | |||
| Other contributing action | 251 | 1149 | 0.25 (0.10) | |||
| Occupant Protection | None (referent) | 870 | 4534 | 0 | ||
| Seat belt | 1052 | 7439 | −1.19 (0.09) | |||
| Child safety restraint | 59 | 384 | −1.25 (0.20) | |||
| Other restraint/ protection | 106 | 390 | 0.18 (0.16) | |||
| Quasi-likelihood Information Criterion (QIC) | 10,844.4 | 10,077.4 | 9676.8 | |||
| AUC (95% CI) | 0.69 (0.68, 0.71) | 0.75 (0.74, 0.76) | 0.78 (0.76, 0.79) | |||
Abbreviations: AUC Area under the receiver operating characteristic (ROC) curve
amodel 1 includes non-modifiable factors
bmodel 2 includes non-modifiable and semi-modifiable factors
cmodel 3 includes non-modifiable, semi-modifiable, and modifiable factors
Comparison of the expected (model-based) to observed number of injuries in the nine states from 2005 to 2010
| State | Total occupants ( | Observed injuries or deaths ( | Non-modifiable (Model 1) | Non + semi-modifiable (Model 2) | Non + semi + modifiable (Model 3) | |||
|---|---|---|---|---|---|---|---|---|
| Avg. pred. Prob. | Expected injuries/ deaths ( | Avg. pred. Prob. | Expected injuries/ deaths ( | Avg. pred. Prob. | Expected injuries/ deaths ( | |||
| IA | 2279 | 421 | 0.1804 | 411 | 0.1820 | 415 | 0.1818 | 414 |
| IL | 2651 | 421 | 0.1600 | 424 | 0.1602 | 425 | 0.1594 | 422 |
| KS | 1477 | 186 | 0.1251 | 185 | 0.1246 | 184 | 0.1242 | 183 |
| MN | 1887 | 199 | 0.1008 | 190 | 0.1044 | 197 | 0.1017 | 192 |
| MO | 1893 | 207 | 0.1135 | 215 | 0.1169 | 221 | 0.1197 | 227 |
| ND | 482 | 51 | 0.1052 | 51 | 0.1066 | 51 | 0.1064 | 51 |
| NE | 1159 | 189 | 0.1634 | 189 | 0.1641 | 190 | 0.1619 | 188 |
| SD | 453 | 74 | 0.1549 | 70 | 0.1586 | 72 | 0.1592 | 72 |
| WI | 2553 | 339 | 0.1383 | 353 | 0.1367 | 349 | 0.1365 | 348 |
| Total | 14,834 | 2087 | 0.1408 | 2089 | 0.1419 | 2104 | 0.1415 | 2099 |
Abbreviations: Avg. pred. Prob. Average predicted probability, AUC Area under the receiver operating curve, QIC Quasi-likelihood information criteria
Validation of the predictive models
| Validation data years (Training data years) | Total validation data occupants (training data occupant) | Observed injuries or deaths in validation data ( | Non-modifiable (Model 1) | Non + semi-modifiable | Non + semi + modifiable | |||
|---|---|---|---|---|---|---|---|---|
| Avg. pred. Prob. | Expected injuries/ deaths ( | Avg. pred. Prob. | Expected injuries/ deaths ( | Avg. pred. Prob. | Expected injuries/ deaths ( | |||
| 2008-‘10 (‘05-‘07) | 7624 (7210) | 1095 | 0.1407b | 1073 | 0.1445e | 1102 | 0.1437h | 1095 |
| 2009-‘10 (‘05-‘08) | 5216 (9618) | 779 | 0.1383c | 721a | 0.1405f | 733 | 0.1397i | 729 |
| 2010 (‘05-‘09) | 2615 (12,219) | 372 | 0.1424d | 372 | 0.1410g | 369 | 0.1401j | 366 |
Abbreviations: Avg. pred. Prob. Average predicted probability
ap-value = 0.0327 (Chi Sq = 4.56, df = 1), suggesting that expected injuries and deaths (n = 721) were significantly different than the observed (n = 779). All other expected to observed differences were non-significant
bAUC = 0.69 based on training data 2005–2007
cAUC = 0.70 based on training data 2005–2008
dAUC = 0.69 based on training data 2005–2009
eAUC = 0.75 based on training data 2005–2007
fAUC = 0.75 based on training data 2005–2008
gAUC = 0.74 based on training data 2005–2009
hAUC = 0.77 based on training data 2005–2007
iAUC = 0.77 based on training data 2005–2008
jAUC = 0.77 based on training data 2005–2009
Predicted probabilities of injury or death for drivers and passengers in varying farm crash scenarios using the model 3 estimated coefficients from Table 1
| The risk or injury or death in a farm crash in Iowa for a 25–34 year old, male, in growing season, clear weather, between 6:00–11:59 am, daylight, heading straight, passengers on board (for single vehicle crash, manner of collision = non collision; for multiple vehicle crash, manner of collision = rear end) | ||||
|---|---|---|---|---|
| Vehicle/ occupant type | Seat belt | Driver contributing circumstances | Risk of injury or death (%) | |
| Single vehicle crash | Multiple vehicle crash | |||
| Farm vehicle driver | Yes | None | 16.7% | 7.3% |
| Farm vehicle passenger | 17.8% | 7.8% | ||
| Non-farm vehicle driver | 28.7% | |||
| Non-farm vehicle passenger | 30.3% | |||
| Farm vehicle driver | Yes | Disregarded traffic regulations | 22.2% | 10.1% |
| Farm vehicle passenger | 23.6% | 10.8% | ||
| Non-farm vehicle driver | 36.4% | |||
| Non-farm vehicle passenger | 38.2% | |||
| Farm vehicle driver | No | None | 39.8% | 20.6% |
| Farm vehicle passenger | 41.6% | 21.8% | ||
| Non-farm vehicle driver | 57.0% | |||
| Non-farm vehicle passenger | 58.8% | |||
| Farm vehicle driver | No | Disregarded traffic regulations | 48.5% | 26.9% |
| Farm vehicle passenger | 50.4% | 28.4% | ||
| Non-farm vehicle driver | 65.3% | |||
| Non-farm vehicle passenger | 67.0% | |||
Fig. 1Screenshot of the online tool to calculate risk of injury or death in a farm crash