| Literature DB >> 32626645 |
Ourania Kounadi1, Alina Ristea2,3, Adelson Araujo4, Michael Leitner2,5.
Abstract
BACKGROUND: Predictive policing and crime analytics with a spatiotemporal focus get increasing attention among a variety of scientific communities and are already being implemented as effective policing tools. The goal of this paper is to provide an overview and evaluation of the state of the art in spatial crime forecasting focusing on study design and technical aspects.Entities:
Keywords: Crime; Forecasting; Hotspots; Prediction; Predictive policing; Spatial analysis; Spatiotemporal
Year: 2020 PMID: 32626645 PMCID: PMC7319308 DOI: 10.1186/s40163-020-00116-7
Source DB: PubMed Journal: Crime Sci ISSN: 2193-7680
Fig. 1The three phases of the study selection process: identification, screening, and eligibility
Data items analyzed at different study stages
| Study stage | Data items |
|---|---|
| Identification | Authors; year; title; data source |
| Screening | Relevance_1; availability |
| Eligibility | Publication type; empirical data; performance evaluation; spatial size; temporal size; purpose, relevance_2 |
| Results “ | Year; title; discipline; journal/conference; study area country, institution |
| Results “ | Study area; scale; sampling period; months; type; sample; inference; task; spatial unit; temporal unit |
| Results “ | Proposed method; best proposed method; baseline method; proposed algorithm type; proposed method input, variables |
| Results “ | Evaluation metric; validation strategy |
A summary of the papers’ general characteristics such as journal or conference, country of study area, institution, and scientific discipline of the first author
| Top 5 journals or conferences (no of papers) | Top 3 countries (count) | |
|---|---|---|
| International Journal of Forecasting (3) | USA (23) | |
| Applied Geography (2) | Brazil (2) | |
European Journal on Criminal Policy and Research (2) EPJ Data Science (2) International Conference on Systems, Man, & Cybernetics (2) | UK (2) |
Fig. 2A yearly count of eligible and selected papers from 2001 to 2018
An overview of the 32 selected papers with information about the space and time of the research, the crime data, and forecasting details
| No* | Authors and date | Space | Time | Crime Data | Forecasting | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Study area | Scale | Sampling period | Months | Type | Sample | Inference | Task | Spatial unit | Temporal unit | ||
| 1 | Araujo Junior et al. ( | Natal, Brazil | City | 2006–2016 | 132 | # of crimes | Regression | Rectangular grid ( | Week | ||
| 2 | Araújo et al. ( | Natal, Brazil | City | 2006–2016 | 132 | Hotspots | Binary classification | k-means cells of varying size ( | Week | ||
| 3 | Bowen et al. ( | DeKalb, USA | County | 2011–2014 | 48 | Violent crime | Hotspots | Binary classification | Census block groups | Month | |
| 4 | Brown and Oxford ( | Richmond, USA | City | 1994–1999 | 72 | Break and enter | ≈ over 24,000 | # of crimes | Regression | Grid cells of 1.66 km2, precincts | Week, month |
| 5 | Cohen et al. ( | Pittsburgh, USA | City | 1991–1998 | 96 | 2 crime types | 1.3 million | # of crimes | Regression | 1219 m × 1219 m grid cells | Month |
| 6 | Dash et al. ( | Chicago, USA | City | 2011–2015 | 60 | 34 crime types | 6.6 million | # of crimes | regression | Communities | Month, year |
| 7 | Drawve et al. ( | Little Rock, USA | City | 2008–2009 | 18 | Gun crime | 1429 | Hotspots | Binary classification | 91 m × 91 m grid cells | 6 months |
| 8 | Dugato et al. ( | Milan, Italy | City | 2012–2014 | 36 | Residential burglary | 20,921 | Hotspots | Binary classification | Grid cells of 2500 m2 | Year |
| 9 | Gimenez-Santana et al. ( | Bogota, Colombia | city | 2012–2013 | 24 | 3 crime types | Hotspots | Binary classification | 75 m × 75 m grid cells | Year | |
| 10 | Gorr et al. ( | Pittsburgh, USA | City | 1991–1998 | 96 | 5 crime types | ≈ 1 million | # of crimes | Regression | Police precincts | Month |
| 11 | Hart and Zandbergen ( | Arlington, USA | City | 2007–2008 | 24 | 4 crime types | 6295 | Hotspots | Binary classification | Grid cells of 3 different sizes ( | Year |
| 12 | Hu et al. ( | Baton Rouge, USA | City | 2011 | 12 | Residential burglary | 3706 | Hotspots | Binary classification | 100 m × 100 m grid cells | Week |
| 13 | Huang et al. ( | New York, USA | City | 2014 | 12 | 4 crime types | 103,913 | Category of crime | Binary classification | Districts | Day, month |
| 14 | Ivaha et al. ( | Cardiff, UK | City | 2001–2003 | 26 | Criminal damage | Percent of crime in clusters | Regression | Clusters of varying size ( | Day | |
| 15 | Johansson et al. ( | Sweden three cities: Stockholm, Gothenburg, and Malmö | Cities | 2013–2014 | 12 | Residential burglary | 5681 | Hotspots | binary Classification | Grid cells ( | 3 months |
| 16 | Kadar and Pletikosa ( | New York, USA | City | 2014–2015 | 24 | All crime and 5 crime types | 174,682 | # of crimes | Regression | Census tract | Year |
| 17 | Liesenfeld et al. ( | Pittsburgh, USA | City | 2008–2013 | 72 | All crime | 9936 | # of crimes | Regression | Census tracts | Month, year |
| 18 | Lin et al. ( | Taoyuan City, Taiwan | City | 2015–2018 | 39 | Motor vehicle thefts | ≈ 8580 | Hotspots | Binary classification | 5 to 100 × 5 to 100 grid cells | Month |
| 19 | Malik et al. ( | Tippecanoe, USA | County | 2004–2014 | 120 | all crime | ≈ 310,000 | Hotspots | Binary classification | Grid cells ( | Week |
| 20 | Mohler ( | Chicago, USA | City | 2007–2012 | 72 | 2 crime types | 78,852 | Hotspots | Binary classification | 75 m × 75 m, 150 m × 150 m grid cells | Day |
| 21 | Mohler and Porter ( | Portland, USA | City | 2012–2017 | 60 | 4 crime types | Hotspots | Binary Classification | Grid cells of 5806 m2 | Week, 2 weeks, month, 2 months, 3 months | |
| 22 | Mohler et al. ( | Indianapolis, USA | City | 2012–2013 | 24 | 4 crime types | Hotspots | Binary classification | 300 m × 300 m grid cells | Day | |
| 23 | Mu et al. ( | Boston, USA | City | 2006–2007 | 24 | Residential burglary | Hotspots | Binary classification | 20 × 20 grid cells ( | Month | |
| 24 | Rodríguez et al. ( | San Francisco, USA | City | 2003–2013 | 120 | Burglary | Properties of clusters | Regression | Clusters ( | Day | |
| 25 | Rosser et al. ( | “Major city”, UK ( | City | 2013–2014 | 13 | Residential burglary | 5862 | Hotspots | Binary classification | Street segments ( | Day |
| 26 | Rumi et al. ( | Brisbane, Australia; New York City, USA | Cities | 2013–2013 (AUS); 2012–2013 (USA) | 9 and 12 | 6 crime types | Hotspots | Binary classification | Census regions | 3 h | |
| 27 | Rummens et al. ( | “Large city”, Belgium ( | city | 2011–2014 | 48 | 3 crime types | 163,800 | Hotspots | Binary classification | 200 m by 200 m grid cells | 2 weeks, daytime month, night time month |
| 28 | Shoesmith ( | USA | Country | 1960–2009 | 600 | 2 crime types | Crime rate | Regression | USA regions | Year | |
| 29 | Yang et al. ( | New York, USA | city | January 2014–April 2015 | 16 | 7 crime types | Hotspots | Binary classification | 0.01 latitude × 0.01 longitude grid cell size | Day, week, month | |
| 30 | Yu et al. ( | “City in the Northeast”, USA | City | Residential burglary | Hotspots | binary Classification | grid cells ( | Month | |||
| 31 | Zhao and Tang ( | New York, USA | City | 2012–2013 | 12 | # of crimes | Regression | 2 km × 2 km grid cells | Day, week | ||
| 32 | Zhuang et al. ( | Portland, USA | City | March 2012–December 2016 | 58 | All crime | Hotspots | Binary classification | 183 m × 183 m grid cells | 2 weeks | |
U = undefined or unclear information
* No: Reference number of the paper that are used in Fig. 5
Fig. 3Percentages of all publications (n = 32) for describing basic information when reporting a spatial crime forecasting study. Blue: the item was properly defined; orange: the item was poorly defined or undefined
Fig. 5Overview of forecasting methods (see “Spatial crime forecasting methods” section) and their performance evaluation (see “Considerations when analysing forecasting performance” section) linked to the 32 selected papers. The papers’ references linked to their number are shown in Table 3. The letter denotes an evaluation metric. The letter “U” denotes an undefined item
Top four proposed, best proposed, and baseline methods applied in the 32 selected papers
| Top 4 proposed methods (# of papers) | Top 4 best proposed methods (# of papers) | Top 4 baseline methods (# of papers) |
|---|---|---|
| Random Forest (7) | Random Forest (5) | Autoregressive model-based (5) |
| Multilayer Perceptron (6) | Multilayer Perceptron (5) | Logistic Regression (3) |
Kernel Density Estimation-based (5) Support Vector Machines (5) | Kernel Density Estimation-based (5) Risk Terrain Modelling (3) | Autoregressive integrated moving average, Multilayer Perceptron, Linear Regression, KDE-based, KNN: (2) |
Fig. 4Comparable surveillance plots for evaluation metrics visualization in space (using dummy data). a ROC-like accuracy curve, b PAI curve, and c Hit rate curve
| Dt (i.e., crime data in time t) is modelled to derive Et+1 (i.e., estimated crime information in time t + 1) that is evaluated with Dt+1 (i.e., crime information in time t + 1). |
| 1. Dt is modelled to derive Et+1 that is evaluated with Dt+1. |
| 2. Dt and Vstatic are modelled to derive Et+1 that is evaluated with Dt+1. Where Vstatic is an explanatory variable or variables that do not change between t and t + 1. |
| 3. Dt and Vdynamic_lag are modelled to derive Et+1 that is evaluated with Dt+1. Where Vdynamic_lag is an explanatory variable or variables that change between t and t + 1 and lag is a period of time earlier than the time of the dependent variable. |
| 4. Dt, Vstatic, and Vdynamic_lag are modelled to derive Et+1 that is evaluated with Dt+1. |
| 1. Hotspots and non-hotspots are defined using a statistical approach that separates space between high and low crime areas. |
| 2. Hotspots and non-hotspots are defined using an arbitrary threshold that separates space between high and low crime areas. |
| 3. Crime and non-crime are defined using a statistical approach that separates space between areas where crime is likely to occur and areas crime is not likely to occur. |
| 4. Crime and non-crime are defined using a statistical approach that separates space between areas where there is at least one estimated crime and areas where there is no estimated crime. |