| Literature DB >> 34966551 |
Abstract
Crime analysis/mapping techniques have been developed and applied for crime detection and prevention to predict where and when crime occurs, leveraging historical crime records over a spatial area and covariates for the spatial domain. Some of these techniques may provide insights for understanding crime and disorder, especially, via interpreting the weights for the spatial covariates based on regression modelling. However, to date, the use of temporal covariates for the time domain has not played a significant role in the analysis. In this work, we collect time-stamped crime-related news articles, infer crime topics or themes based on the collection and associate the topics with the historical numeric crime counts. We provide a proof-of-concept study, where instead of adopting spatial covariates, we focus on temporal (or dynamic) covariates and assess their utility. We present a novel joint model tailored for the crime articles and counts such that the temporal covariates (latent variables, more generally) are inferred based on the data sources. We apply the model for violent crime in London.Entities:
Keywords: crime analysis; matrix factorization; temporal/dynamic methods; topic modelling
Year: 2021 PMID: 34966551 PMCID: PMC8633794 DOI: 10.1098/rsos.210750
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 3Recent rise of violent crime.
Figure 8Police actions.
Figure 1Graphical plate diagram of the model. The latent variables depend on , affect and are shared between the dynamic topic model (nodes below the left edge of ) and Poisson matrix factorization model (nodes below the right edge of ). The temporal plate indexing t is dashed to emphasize dynamic dependence.
Model selection for our model for different component numbers. The values show mean and s.d. for predictive log likelihood and mean absolute error (MAE) for held-out observations over the folds. Similarly, we also show values for WAIC but using in-sample observations.
| K | 100 | 200 | 300 | 400 |
|---|---|---|---|---|
| log likelihood | −4509 ± 35 | −4496 ± 48 | −4480 ± 48 | −4483 ± 40 |
| WAIC (×103) | −16.70 ± 0.12 | −16.72 ± 0.091 | −16.72 ± 0.097 | −16.84 ± 0.15 |
| MAE | 29.55 ± 0.5 | 29.27 ± 0.71 | 29.18 ± 0.51 | 29.04 ± 0.47 |
Model comparison against MF and PR approaches. The values show mean and s.d. for predictive log likelihood and mean absolute error (MAE) for held-out observations over the folds. Similarly, we also show values for WAIC but using in-sample observations.
| model | our | MF | PR |
|---|---|---|---|
| log likelihood | −4480 ± 48 | −4454 ± 75 | −5347 ± 156 |
| WAIC (×103) | −16.76 ± 0.11 | −16.68 ± 0.09 | −113.21 ± 9.49 |
| MAE | 29.18 ± 0.51 | 29.03 ± 0.51 | 41.23 ± 1.6 |
Figure 2Crime count predictions for each method. The held-out values are indicated by orange squares. The top row is for Tower Hamlets, middle for Camden and bottom for Barnet.
Figure 4Violence-related topics.
Figure 5London riots.
Figure 6Terror attacks.
Figure 7Court and government actions.
Figure 9Theories and explanations of violent crime.