| Literature DB >> 32988410 |
Reneta Slikboer1, Samuel D Muir2, S S M Silva2, Denny Meyer2.
Abstract
BACKGROUND: Expenditure on driver-related behavioral interventions and road use policy is often justified by their impact on the frequency of fatal and serious injury crashes. Given the rarity of fatal and serious injury crashes, offense history, and crash history of drivers are sometimes used as an alternative measure of the impact of interventions and changes to policy. The primary purpose of this systematic review was to assess the rigor of statistical modeling used to predict fatal and serious crashes from offense history and crash history using a purpose-made quality assessment tool. A secondary purpose was to explore study outcomes.Entities:
Keywords: Crash; Crash history; Driver offenses; Offense; Quality assessment tool; Statistical modeling; Statistics; Systematic review; Traffic
Mesh:
Year: 2020 PMID: 32988410 PMCID: PMC7523043 DOI: 10.1186/s13643-020-01475-7
Source DB: PubMed Journal: Syst Rev ISSN: 2046-4053
Quality assessment tool
| Sub-section and area of interest | Items |
|---|---|
| Section 1—Reporting of statistical analysis | |
| Data quality | Does the study report information about the research population? Does the study report what type of data has been used for the analysis, whether it is primary data (collected initially for the study) or secondary data (from a different source)? If the study used secondary data does the study name, the databases which have been used for the analysis? If the study has used different databases, does the study describe any linkage between the databases? Does the study report whether the data set is a representative data set? |
| Preliminary analyses | Does the study report any statistical procedures used to modify raw data before the analysis? |
| Primary analyses | Does the study describe about the purpose of the analysis? Does the study identify the variables used in the analysis and summarize each with descriptive statistics? Does the study describe the main methods fully, for analyzing the primary objectives of the study? Is the study clear on which method is used for each analysis, rather than just listing all statistical methods used in one place? If the method includes any assumptions, does the study verify that the data conformed to the assumptions of the test used to analyze them? Does the study indicate whether and how any allowance or adjustments were made for multiple comparisons (performing multiple hypothesis tests on the same data)? Does the study report how it deals with missing data? If relevant, does the study report how any outlying data were treated in the analysis? Does the study report the alpha level (e.g., 0.05) that defines statistical significance? Does the study report the name of the statistical package or program used in the analysis? If the study needs to control any variables for its objective, does the study report it properly? |
| Supplementary analyses | Does the study describe sensitivity analyses if applicable? Does the study test for the underlying assumptions of the methods used in the analysis? Does the study identify post hoc analyses, including unplanned subgroup analyses, as exploratory? If there is an imbalance that exists in the outcome variable of the data set, does the study report how the training phase overcome this issue? |
| Section 2—Reporting of results | |
| Reporting numbers and descriptive stat avg | Does the study report numbers—especially measurements—with an appropriate degree of precision. For ease of comprehension and simplicity, rounded to a reasonable extent? Does the study report total sample and group sizes for each analysis? Does the study report numerators and denominators for all percentages? Does the study summarize data that are approximately normally distributed with means and standard deviations (SD)? Use the form: mean (SD), not mean ± SD? Does the study summarize data that are not normally distributed with medians and interpercentile ranges, ranges, or both (report the upper and lower boundaries of interpercentile ranges and the minimum and maximum values of ranges, not just the size of Does the study report the variability of the data set using either standard deviations, inter-percentile ranges, or ranges (the SE is an inferential statistic—it is about a 68% confidence interval—not a descriptive statistic)? Does the study display summarized or exact data in tables? Does the study display data in figures? Tables present exact values, and figures provide an overall assessment of the data? |
| Reporting risk and ratios | Does the study describe the type of rate (e.g., incidence rates; survival rates), ratio (e.g., odds ratios; hazard ratios), or risk (e.g., absolute risks; relative risk differences), being reported? Does the study describe the quantities represented in the numerator and denominator? Does the study report the time period over with each rate applies? Does the study report any unit of population (that is, the unit multiplier: e.g., × 100; × 10,000) associated with the rate? Does the study consider reporting a measure of precision (a confidence interval) for estimated risks, rates, and ratios? |
| Validation | Does the study describe methods of validation used in the training phase (e.g., cross validation, use of test/hold-out sample)? Does the study describe the attempts to generalize the model beyond the immediate context? |
| Section 3—Method specific quality indicators | |
| Regression analysis | Does the study describe the purpose of the analysis? Does the study confirm that the assumptions of the analysis were met? For example, in linear regression indicate whether an analysis of residuals confirmed the assumptions of linearity. Does the study report the regression equation for either simple or multiple (multivariable) regression analyses? For primary comparisons analyzed with simple linear regression analysis, does the study consider reporting the results graphically, in a scatter plot showing the regression line and its confidence bounds? Does the study report the alpha level used in the univariate analysis? Does the study report whether the variables were assessed for collinearity? Does the study report whether variables were assessed for interactions? Does the study describe the variable selection process by which the final model was developed (e.g., forward stepwise; best subset). Does the study report the regression coefficients (beta weights) of each explanatory variable and the associated confidence intervals and Does the study provide a measure of the model’s “goodness-of-fit” to the data (the coefficient of determination, r2, for simple regression and the coefficient of multiple determination, R2, for multiple regression)? |
| Survival analysis | Does the study describe the purpose of the analysis? Does the study describe the dates or events that mark the beginning and the end of the time period analyzed? Does the study specify the circumstances under which data were censored? Does the study specify the statistical methods used to estimate the survival rate? Does the study confirm that the assumptions of survival analysis were met? For each group, give the estimated survival probability at appropriate follow-up times, with confidence intervals, and the number of participants at risk for death at each time. It is often more helpful to plot the cumulative probability of not surviving. For each group, give the estimated survival probability at appropriate follow-up times, with confidence intervals, and the number of participants at risk for death at each time. It is often more helpful to plot the cumulative probability of not surviving. Reporting median survival times, with confidence intervals, is often useful to allow the results to be compared with those of other studies? Does the study present the full results in a graph (e.g., a Kaplan-Meier plot) or table? Does the study specify the statistical methods used to compare two or more survival curves? Does the study report the Does the study report the regression model used to assess the associations between the explanatory variables and survival or time-to-event? Does the study report a measure of risk (e.g., a hazard ratio) for each explanatory variable, with a confidence interval? |
| SEM models | Does the study report all the parameters and their standard errors? Does the study report the reason for the choice of a clear and complete form of path model structure? Does the study report the global indices of fit? Does the study provide reasons as justification for omitted directed and non-directed arcs? Does the study report alternative and equivalent models? |
Fig. 1PRISMA flowchart
Details of study characteristics
| Study, country, vehicle type, quality | Design, data type, number of drivers/crashes | Independent predictor | Confounders controlled | Dependent variable | Statistical technique used, measure of risk | Main finding of interest |
|---|---|---|---|---|---|---|
| Lui and Marchbanks [ | Longitudinal, FARS (1984‑1986), population, drivers that have had a speeding conviction ( | Date of prior crash, suspension or conviction | None | Date of fatal crash | Survival analysis | Involvement in a fatal crash will occur by 5 years post the prior crash, suspension, or conviction. |
| Perneger and Smith [ | Matched-pairs design, FARS (1986), population, two-car crashes ( | Invalid license, prior DWI, prior suspension, prior speeding, prior crash within 12 months, 13‑24 and 25‑36 months ago | Environment, exposure to traffic, differences in case fatality | Culpable or non-culpable drivers | Logistic regression, odds ratio | Invalid driving license, prior DWI, and prior license suspension increases the likelihood of initiating a fatal crash. |
| Rajalin [ | Case-control design, (Study 1) VALT and traffic offense register, sample, drivers involved in fatal crash ( | Offense rate | Distance driven | Drivers in fatal crashes or control | Logistic regression, odds ratio | Drivers involved in a fatal crash have an offense rate 1.51 times higher than control drivers. |
| Cooper [ | Case-control design, police crash data and driver records in British Columbia (1991‑1994), population, with speeding conviction ( | Thirteen types of prior traffic convictions (e.g., speeding, not obeying signal, failure to yield) | None | Rate of crashes per driver | Logistic regression, estimates | All conviction types contribute to increasing the chance of being in a subsequent FSI crash, convictions associated with speeding and DWI were the most important predictors. |
| Wundersitz et al. [ | Matched-pairs design (drivers paired from same fatal crash group), TARS (1999 to 2002), South Australian population, Culpable drivers ( | Four offense types and grouped offenses over 5 years prior to the fatal crash, crashes and culpable crashes over 5 years prior to fatal crash | None | Culpable vs non-culpable involvement in fatal crash | Not reported, odds ratio | Not significant |
| Kim et al. [ | Case-control design, NPTS and FARS (1995‑1997) population, | Previous traffic offense yes/no | None | Probability of survival, probability of crash | Logistic regression, odds ratio | If a previous traffic offense is present it decreases the likelihood of surviving a one or two-vehicle crash but increases the likelihood of surviving a multi-vehicle crash. A previous traffic offense increases the likelihood of having a crash for one, two, and multi-vehicle crashes. |
| Blower and Green [ | Case-control design, BIFA from FARS (1999‑2005), population, drivers ( | Previous violation or not in previous 3 years, previous crashes or not in previous 3 years | Type of bus service: school, intercity, charter/tour, other | Driver error in fatal crash or not | Logistic regression, odds ratio | Previous violations and crashes each increase the chance of the driver making an error by 1.3 times. Drivers with a violation on their record are 27% more likely to commit an error in a fatal crash than those without a violation. |
| Malchose and Vachal [ | Case-control design (involved in FSI crash or not), NDDL, (2006 to 2009), North Dakota population, crashes ( | No-risk convictions (e.g., parking), risk convictions (e.g., speeding), previous property damage only crash, in the first year or prior to first fatal or injury crash, previous at-fault property damage only crash, in the first year or prior to first fatal | Gender, age risk convictions, rural/urban | Involvement in a fatal or injury crash in the first year of licensure or not | Logistic regression, log odds | Drivers with previous convictions are 0.5 times; and drivers with previous property damage only crash histories are 25.5 times likely to be involved in a fatal or injury crash. |
| Lueck and Murry [ | Case-control design, MCMIS and CDLIS, population, crashes ( | The ability of those with and without a violation/conviction and crash history (2008) | None | Crashes (2009) | Not reported, reported percentages | Twenty-three of the 34 independent variables significantly increased the chance of being in a crash, increases ranged from 18% to 96% Any previous conviction increased the chance of being in a crash by 56% Prior crash increased the chance of being in a future crash by 88% |
| Gates et al. [ | Case-control design, FARS (1993‑2008), population, driver > 20 years old involved in a fatal crash ( | Stimulants present or absent | Past 3 years, prior crashes, DWI, speeding infractions, other infractions, suspensions | Responsibility for crash measured by UDA preceding the crash | Logistic regression, reported percentages | Odds of an UDA increased by: 30% one or more prior crash 24% other moving conviction 26% one prior suspension 14% previous speeding conviction |
| Factor [ | Case-control design, ICBS (2002‑2008), 20% representative sample, drivers ( | Traffic tickets per year (over 13 years) | Daily distance traveled, gender, age, religion, education, social class, vehicle type | Driver involve in FSI crash or not | Logistic regression, odds ratio | As traffic tickets increase so too do the odds of being in an FSI crash. |
| Reguly et al. [ | Case-control design, FARS (1993‑2008), population, negative drug test ( | Opioid analgesic (drivers positive or negative drug test) | Collisions, DWI convictions, other convictions, speeding, license suspensions | Responsibility for crash measured by UDA preceding the crash | Logistic regression, estimate result, odds ratio | Odds of an unsafe driver action increased if the driver had a previous crash, a driving infraction or speeding infraction in the past 3 years. |
| Dubois et al. [ | Case-control design, FARS (1991‑2008), population, driver > 20 years old involved in a fatal crash ( | BAC, cannabis, BAC and cannabis (drivers positive or negative drug test) | History of one, two, or three or more of crashes, DWI, speeding, suspensions | Responsibility for crash measured by an UDA preceding the crash | Logistic regression, odds ratio | Odds of an UDA increased by: 13% one prior crash 39% three or more prior crashes 26% one prior suspension 33% three or more prior suspensions |
| Kumfer et al. [ | Case-control design, FARS (2010‑2012) California, Michigan, New York, North Carolina, Texas, and Washington, representative sample of USA, driver in a single-vehicle crash ( | History of suspensions or revocations (none, one, other), year of last crash or license suspension (no record, two since 2005, other) | None | Multivehicle or single vehicle | Logistic regression, odds ratio | Not significant |
| Feng et al. [ | Case-control design, BIFA from FARS (2006 to 2010) population, drivers ( | Cluster 1 “middle age drivers with history of driving violations” Cluster 2 “young and elderly drivers with history of driving violations” Cluster 3 “drivers without history of driving violations” Valid license or not | None | Level 1 crashes < = 2 fatalities Level 2 crashes > 2, < 3 fatalities Level 3 crashes > = 3 fatalities | Logistic regression, estimate result and odds ratio | Cluster 1, low chance of being involved in level 2 and 3 crashes; cluster 2, high chance of being in level 1 and 2 crashes; cluster 3, ns |
| Li et al. [ | Matched-pairs design (drivers paired from same fatal two-vehicle crash), FARS (1993 to 2014), population, culpable driver ( | Concurrent alcohol and marijuana use | Previous 3-year crash history, previous 3-year DWI conviction, previous 3-year speeding conviction | Culpability or non-culpability in a fatal crash | Logistic regression, odds ratio | Having a crash history, a DWI conviction, a speeding conviction, and license suspension in the previous 3 years increases the likelihood of culpability in a fatal crash. |
| Hamzeie et al. [ | Case control design, FARS fatal crashes (2010 to 2014), population, drivers with known injury severity ( | Cannabis use, five levels of injury severity* | Number of previous license suspensions, number of previous speeding violations | Cannabis use yes/no | Logistic regression, odds ratio | Those with speeding violations and those who have had their license suspended are associated with higher levels of injury in fatal crashes. |
| Stringer [ | Longitudinal, FARS and GSS (1993 to 2015), population, drivers ( | Normative behavior, values and beliefs, local attitudes | Non-DUI fatal crashes, repeat DUI offender crashes | Total frequency of DUI fatal crashes in each county | Poisson multi-level growth curve | Non-DUI fatal crashes and repeat DUI offender crashes significantly predict future DUI fatal crashes |
| Mashhadi et al. [ | Case-control design, CARE and WCRVD (2011‑2014), Wyoming data from 3 interstate highways, single truck crash at fault ( | Violation record | None | Fatality/injury level, being in a single truck crash | Logistic regression | Not significant |
| Yuan et al. [ | Within group (involved in FSI crashes), TIFA and FARS (2010), population, crashes ( | Driver factors (latent factor) made up of five observed measures; belt use, driving experience, history of conviction, history of crash, gender, valid license or not | Other latent variables | Truck occupant injury factors (latent), accident size (latent) | Structural equation modeling, standardized regression weights | Crash history (measured) has a large effect on driver factors (latent) which decreases accident size (latent). Prior suspension, speeding, and convictions (measured) make a fatal accident more severe. |
Country the country from which the data was collected; TIFA database trucks involved in fatal accidents database, FARS fatality analysis reporting system, BIFA buses involved in fatal accidents database, * the KABCO scale, ns not significant; MCMIS motor carrier management information system database, CDLIS commercial driver’s license information system, NDDL North Dakota Drivers’ License data; TARS Traffic Accident Reporting System, NPTS Nationwide Personal Transportation Survey, VALT Traffic Safety Committee of Insurance Companies, GSS General Social Survey from National Opinion Research Center, ICBS Israel Central Bureau of Statistics, CARE Critical Analysis Reporting Environment, WCRVD Wyoming court reported violation database, DUI driving under the influence, DWI driving while intoxicated, FSI fatal or serious injury crash, UDA unsafe driving action (used as a proxy measure)
Quality tool assessment results
| Studies grouped by statistical technique | Reporting of statistical methods | Reporting of statistical results | Method specific quality indicators | Overall quality score |
|---|---|---|---|---|
| Perneger and Smith 1991 | 0.704 | 0.552 | 0.299 | 0.463 |
| Rajalin 1994 | 0.193 | 0.391 | 0.000 | 0.146 |
| Cooper 1997 | 0.372 | 0.188 | 0.444 | 0.362 |
| Wundersitz et al. 2004 | 0.435 | 0.525 | 0.167 | 0.323 |
| Kim et al. 2006 | 0.470 | 0.432 | 0.278 | 0.365 |
| Blower and Green 2010 | 0.675 | 0.590 | 0.556 | 0.594 |
| Malchose and Vachal 2011 | 0.646 | 0.449 | 0.500 | 0.524 |
| Lueck and Murry 2011 | 0.333 | 0.382 | 0.111 | 0.234 |
| Gates et al. 2013 | 0.741 | 0.617 | 0.611 | 0.645 |
| Factor 2014 | 0.880 | 0.576 | 0.389 | 0.558 |
| Reguly et al. 2014 | 0.907 | 0.636 | 0.556 | 0.664 |
| Dubois et al. 2015 | 0.869 | 0.750 | 0.611 | 0.710 |
| Kumfer et al. 2015 | 0.938 | 0.651 | 0.611 | 0.703 |
| Feng et al. 2016 | 0.750 | 0.617 | 0.500 | 0.592 |
| Li et al. 2017 | 0.697 | 0.464 | 0.997 | 0.755 |
| Hamzeie et al. 2017 | 0.741 | 0.571 | 0.500 | 0.578 |
| Mashhadi et al. 2018 | 0.696 | 0.569 | 0.500 | 0.566 |
| Yuan et al. 2019 | 1.000 | 0.387 | 0.600 | 0.647 |
| Lui and Marchbanks 1990 | 0.481 | 0.625 | 0.808 | 0.680 |
| Stringer 2018 | 0.958 | 0.504 | 0.778 | 0.755 |
Fig 2Quality assessment sub-section scores for included studies