| Literature DB >> 30716110 |
Masoomali Fatehkia1, Dan O'Brien2,3, Ingmar Weber4.
Abstract
Much research has examined how crime rates vary across urban neighborhoods, focusing particularly on community-level demographic and social characteristics. A parallel line of work has treated crime at the individual level as an expression of certain behavioral patterns (e.g., impulsivity). Little work has considered, however, whether the prevalence of such behavioral patterns in a neighborhood might be predictive of local crime, in large part because such measures are hard to come by and often subjective. The Facebook Advertising API offers a special opportunity to examine this question as it provides an extensive list of "interests" that can be tabulated at various geographic scales. Here we conduct an analysis of the association between the prevalence of interests among the Facebook population of a ZIP code and the local rate of assaults, burglaries, and robberies across 9 highly populated cities in the US. We fit various regression models to predict crime rates as a function of the Facebook and census demographic variables. In general, models using the variables for the interests of the whole adult population on Facebook perform better than those using data on specific demographic groups (such as Males 18-34). In terms of predictive performance, models combining Facebook data with demographic data generally have lower error rates than models using only demographic data. We find that interests associated with media consumption and mating competition are predictive of crime rates above and beyond demographic factors. We discuss how this might integrate with existing criminological theory.Entities:
Mesh:
Year: 2019 PMID: 30716110 PMCID: PMC6361434 DOI: 10.1371/journal.pone.0211350
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Factor analysis of the Facebook interest related variables.
| Games | F1 | F2 | Music | F1 | F2 | F3 | Movies | F1 | F2 |
|---|---|---|---|---|---|---|---|---|---|
| Games | 0.811 | 0.408 | Music | 0.402 | 0.317 | 0.486 | Movies | 0.599 | 0.474 |
| Word | 0.021 | 0.329 | Music videos | 0.871 | 0.046 | 0.225 | Animated | 0.686 | 0.233 |
| Action | 0.525 | 0.816 | Classical | -0.096 | 0.160 | 0.540 | Comedy | 0.766 | 0.107 |
| Gambling | 0.530 | 0.399 | Gospel | 0.943 | 0.148 | -0.092 | Fantasy | 0.730 | 0.110 |
| Online poker | 0.076 | -0.339 | Soul | 0.942 | 0.135 | 0.070 | Science fiction | 0.597 | 0.606 |
| First-person shooter | 0.811 | 0.140 | Jazz | 0.742 | 0.109 | 0.186 | Thriller | 0.836 | 0.266 |
| Online | 0.661 | 0.496 | Dance | 0.750 | 0.008 | 0.319 | Action | 0.902 | 0.120 |
| Simulation | 0.678 | 0.417 | Rhythm & blues | 0.975 | 0.176 | 0.045 | Musical theatre | -0.020 | 0.473 |
| Casino | 0.416 | 0.709 | Hip hop | 0.962 | -0.027 | 0.092 | Documentary | 0.383 | 0.882 |
| Board | 0.250 | -0.027 | Blues | 0.249 | 0.915 | 0.143 | Drama | 0.577 | 0.248 |
| Role-playing | 0.627 | 0.043 | Country | -0.227 | 0.914 | 0.070 | Anime | 0.567 | -0.335 |
| Racing | 0.693 | 0.266 | Rock | 0.281 | 0.698 | 0.472 | Horror | 0.838 | 0.362 |
| Browser | 0.319 | 0.588 | Heavy metal | 0.193 | 0.772 | 0.442 | |||
| Shooter | 0.768 | 0.195 | Electronic | 0.405 | 0.216 | 0.779 | |||
| Sports | 0.707 | 0.379 | |||||||
| Strategy | 0.456 | 0.393 | |||||||
| Card | 0.330 | 0.746 | |||||||
| Puzzle video | 0.411 | 0.907 | |||||||
| Video games | 0.836 | 0.410 |
Factors and their loadings on the various Facebook interest related variables for the categories of games, music and movies.
Factor analysis of the Facebook relationship related variables.
| Relationship | F1 | F2 | F3 |
|---|---|---|---|
| female/male ratio with “dating” status | 0.980 | 0.136 | 0.126 |
| female/male ratio with “single” status | 0.741 | 0.401 | 0.142 |
| female/male ratio with “in a relationship” status | 0.967 | 0.112 | 0.095 |
| female/male ratio with “unspecified” status | 0.255 | 0.954 | -0.143 |
| Fraction of males “in a relationship” w. interest in “Online dating service” | 0.005 | -0.107 | 0.436 |
| Fraction of females “in a relationship” w. interest in “Online dating service” | 0.143 | 0.047 | 0.514 |
Columns indicate the factors and their loadings on the various Facebook relationship variables.
Fig 1Correlation matrix of Facebook and demographic factor variables.
Parameter estimates and fit statistics for regression models predicting assault crimes.
| Demog. Model | FB Model | Demog. & FB Model | |
|---|---|---|---|
| -0.100 (0.123) | -0.187 (0.131) | -0.016 (0.123) | |
| -0.075 (0.042) | -0.021 (0.043) | ||
| -0.208 | -0.129 | ||
| 0.553 | 0.262 | ||
| 0.220 | |||
| 0.590 | 0.332 | ||
| -0.528 | |||
| -0.143 | |||
| -0.170 | |||
| Adjusted R-squared | 0.639 | 0.604 | 0.656 |
| Marginal adjusted R-squared (from city dummies) | 0.488 | 0.437 | 0.511 |
| RSS | 151.50 | 166.78 | 143.17 |
| MAE (train) | 437.77 | 450.89 | 423.50 |
| MAE (CV) | 450.18 | 465.38 | 439.08 |
| F-statistics | 70.44 | 66.70 | 55.76 |
| df | 420 | 421 | 416 |
| N | 432 | 432 | 432 |
*** p < 0.001,
** p < 0.01,
* p < 0.05
Demog. = demographic, FB = Facebook. Coefficient standard errors are in parenthesis. All variables were standardized before the regression. City-level fixed effects are accounted for by models.
Parameter estimates and fit statistics for regression models predicting burglary crimes.
| Demog. Model | FB Model | Demog. & FB Model | |
|---|---|---|---|
| 0.780 | 0.922 | 0.899 | |
| -0.086 (0.047) | -0.039 (0.046) | ||
| -0.071 (0.044) | -0.023 (0.044) | ||
| 0.199 | 0.144 | ||
| 0.524 | 0.338 | ||
| -0.152 | |||
| -0.249 | -0.275 | ||
| -0.174 | -0.149 | ||
| Adjusted R-squared | 0.562 | 0.601 | 0.598 |
| Marginal adjusted R-squared (from city dummies) | 0.083 | 0.163 | 0.157 |
| RSS | 183.77 | 167.22 | 167.62 |
| MAE (train) | 168.14 | 161.68 | 161.72 |
| MAE (CV) | 174.29 | 167.62 | 168.12 |
| F-statistics | 51.37 | 55.08 | 46.80 |
| df | 420 | 419 | 417 |
| N | 432 | 432 | 432 |
*** p < 0.001,
** p < 0.01,
* p < 0.05
Demog. = demographic, FB = Facebook. Coefficient standard errors are in parenthesis. All variables were standardized before the regression. City-level fixed effects are accounted for by models.
Parameter estimates and fit statistics for regression models predicting robbery crimes.
| Demog. Model | FB Model | Demog. & FB Model | |
|---|---|---|---|
| 0.761 | 0.820 | 0.906 | |
| -0.119 | -0.077 (0.047) | ||
| -0.292 | -0.272 | ||
| -0.170 | |||
| 0.482 | 0.250 | ||
| 0.6 | 0.283 | ||
| -0.645 | |||
| -0.240 | |||
| -0.171 | |||
| -0.139 | |||
| Adjusted R-squared | 0.558 | 0.528 | 0.581 |
| Marginal adjusted R-squared (from city dummies) | 0.411 | 0.371 | 0.441 |
| RSS | 185.63 | 198.17 | 174.46 |
| MAE (train) | 111.61 | 116.16 | 109.85 |
| MAE (CV) | 115.18 | 120.32 | 114.55 |
| F-statistics | 50.47 | 44.86 | 40.78 |
| df | 420 | 420 | 416 |
| N | 432 | 432 | 432 |
*** p < 0.001,
** p < 0.01,
* p < 0.05
Demog. = demographic, FB = Facebook. Coefficient standard errors are in parenthesis. All variables were standardized before the regression. City-level fixed effects are accounted for by models.
Fig 2Predicted assault crime rates.
Assault crime rates predicted by the Facebook model for ZIP codes in New York City.
Fig 3Assault crime rates 2017.
2017 Assault crime rates for ZIP codes in New York City.