Literature DB >> 15607273

Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory.

Dominique Lord1, Simon P Washington, John N Ivan.   

Abstract

There has been considerable research conducted over the last 20 years focused on predicting motor vehicle crashes on transportation facilities. The range of statistical models commonly applied includes binomial, Poisson, Poisson-gamma (or negative binomial), zero-inflated Poisson and negative binomial models (ZIP and ZINB), and multinomial probability models. Given the range of possible modeling approaches and the host of assumptions with each modeling approach, making an intelligent choice for modeling motor vehicle crash data is difficult. There is little discussion in the literature comparing different statistical modeling approaches, identifying which statistical models are most appropriate for modeling crash data, and providing a strong justification from basic crash principles. In the recent literature, it has been suggested that the motor vehicle crash process can successfully be modeled by assuming a dual-state data-generating process, which implies that entities (e.g., intersections, road segments, pedestrian crossings, etc.) exist in one of two states-perfectly safe and unsafe. As a result, the ZIP and ZINB are two models that have been applied to account for the preponderance of "excess" zeros frequently observed in crash count data. The objective of this study is to provide defensible guidance on how to appropriate model crash data. We first examine the motor vehicle crash process using theoretical principles and a basic understanding of the crash process. It is shown that the fundamental crash process follows a Bernoulli trial with unequal probability of independent events, also known as Poisson trials. We examine the evolution of statistical models as they apply to the motor vehicle crash process, and indicate how well they statistically approximate the crash process. We also present the theory behind dual-state process count models, and note why they have become popular for modeling crash data. A simulation experiment is then conducted to demonstrate how crash data give rise to "excess" zeros frequently observed in crash data. It is shown that the Poisson and other mixed probabilistic structures are approximations assumed for modeling the motor vehicle crash process. Furthermore, it is demonstrated that under certain (fairly common) circumstances excess zeros are observed-and that these circumstances arise from low exposure and/or inappropriate selection of time/space scales and not an underlying dual state process. In conclusion, carefully selecting the time/space scales for analysis, including an improved set of explanatory variables and/or unobserved heterogeneity effects in count regression models, or applying small-area statistical methods (observations with low exposure) represent the most defensible modeling approaches for datasets with a preponderance of zeros.

Entities:  

Mesh:

Year:  2005        PMID: 15607273     DOI: 10.1016/j.aap.2004.02.004

Source DB:  PubMed          Journal:  Accid Anal Prev        ISSN: 0001-4575


  44 in total

1.  Where do bike lanes work best? A Bayesian spatial model of bicycle lanes and bicycle crashes.

Authors:  Michelle C Kondo; Christopher Morrison; Erick Guerra; Elinore J Kaufman; Douglas J Wiebe
Journal:  Saf Sci       Date:  2018-03       Impact factor: 4.877

2.  Socioeconomic determinants of exposure to alcohol outlets.

Authors:  Christopher Morrison; Paul J Gruenewald; William R Ponicki
Journal:  J Stud Alcohol Drugs       Date:  2015-05       Impact factor: 2.582

3.  Race, Ethnicity, and Exposure to Alcohol Outlets.

Authors:  Christopher Morrison; Paul J Gruenewald; William R Ponicki
Journal:  J Stud Alcohol Drugs       Date:  2016-01       Impact factor: 2.582

4.  Impact of texting laws on motor vehicular fatalities in the United States.

Authors:  Alva O Ferdinand; Nir Menachemi; Bisakha Sen; Justin L Blackburn; Michael Morrisey; Leonard Nelson
Journal:  Am J Public Health       Date:  2014-06-12       Impact factor: 9.308

5.  Disease mapping and regression with count data in the presence of overdispersion and spatial autocorrelation: a Bayesian model averaging approach.

Authors:  Mohammadreza Mohebbi; Rory Wolfe; Andrew Forbes
Journal:  Int J Environ Res Public Health       Date:  2014-01-09       Impact factor: 3.390

6.  Spatial panel analyses of alcohol outlets and motor vehicle crashes in California: 1999-2008.

Authors:  William R Ponicki; Paul J Gruenewald; Lillian G Remer
Journal:  Accid Anal Prev       Date:  2013-03-13

7.  Relating off-premises alcohol outlet density to intentional and unintentional injuries.

Authors:  Christopher Morrison; Karen Smith; Paul J Gruenewald; William R Ponicki; Juliet P Lee; Peter Cameron
Journal:  Addiction       Date:  2015-10-09       Impact factor: 6.526

8.  Protection from annual flooding is correlated with increased cholera prevalence in Bangladesh: a zero-inflated regression analysis.

Authors:  Margaret Carrel; Paul Voss; Peter K Streatfield; Mohammad Yunus; Michael Emch
Journal:  Environ Health       Date:  2010-03-22       Impact factor: 5.984

9.  A micro-temporal geospatial analysis of medical marijuana dispensaries and crime in Long Beach, California.

Authors:  Bridget Freisthler; William R Ponicki; Andrew Gaidus; Paul J Gruenewald
Journal:  Addiction       Date:  2016-02-18       Impact factor: 6.526

10.  The role of risk avoidance and locus of control in workers' near miss experiences: Implications for improving safety management systems.

Authors:  Emily J Haas; Patrick L Yorio
Journal:  J Loss Prev Process Ind       Date:  2019-05       Impact factor: 3.660

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.