Literature DB >> 26945472

A semiparametric negative binomial generalized linear model for modeling over-dispersed count data with a heavy tail: Characteristics and applications to crash data.

Mohammadali Shirazi1, Dominique Lord2, Soma Sekhar Dhavala3, Srinivas Reddy Geedipally4.   

Abstract

Crash data can often be characterized by over-dispersion, heavy (long) tail and many observations with the value zero. Over the last few years, a small number of researchers have started developing and applying novel and innovative multi-parameter models to analyze such data. These multi-parameter models have been proposed for overcoming the limitations of the traditional negative binomial (NB) model, which cannot handle this kind of data efficiently. The research documented in this paper continues the work related to multi-parameter models. The objective of this paper is to document the development and application of a flexible NB generalized linear model with randomly distributed mixed effects characterized by the Dirichlet process (NB-DP) to model crash data. The objective of the study was accomplished using two datasets. The new model was compared to the NB and the recently introduced model based on the mixture of the NB and Lindley (NB-L) distributions. Overall, the research study shows that the NB-DP model offers a better performance than the NB model once data are over-dispersed and have a heavy tail. The NB-DP performed better than the NB-L when the dataset has a heavy tail, but a smaller percentage of zeros. However, both models performed similarly when the dataset contained a large amount of zeros. In addition to a greater flexibility, the NB-DP provides a clustering by-product that allows the safety analyst to better understand the characteristics of the data, such as the identification of outliers and sources of dispersion.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Crash data; Dirichlet process; Generalized linear model; Negative binomial

Mesh:

Year:  2016        PMID: 26945472     DOI: 10.1016/j.aap.2016.02.020

Source DB:  PubMed          Journal:  Accid Anal Prev        ISSN: 0001-4575


  2 in total

1.  Mapping wind erosion hazard with regression-based machine learning algorithms.

Authors:  Hamid Gholami; Aliakbar Mohammadifar; Dieu Tien Bui; Adrian L Collins
Journal:  Sci Rep       Date:  2020-11-24       Impact factor: 4.379

2.  The Application of Non-Parametric Count Models for the Modeling of Female's Accident Rates in Hamadan Province from 2009 to 2016.

Authors:  Mostafa Eghbalian; Abbas Moghimbeigi; Marzieh Mahmoodi; Iraj Mohamadfam; Razieh Sadat Mirmoeini
Journal:  Iran J Public Health       Date:  2020-04       Impact factor: 1.429

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.