| Literature DB >> 36078307 |
Guido Vittorio Travaini1, Federico Pacchioni1, Silvia Bellumore1, Marta Bosia1,2, Francesco De Micco3,4.
Abstract
Recent evolution in the field of data science has revealed the potential utility of machine learning (ML) applied to criminal justice. Hence, the literature focused on finding better techniques to predict criminal recidivism risk is rapidly flourishing. However, it is difficult to make a state of the art for the application of ML in recidivism prediction. In this systematic review, out of 79 studies from Scopus and PubMed online databases we selected, 12 studies that guarantee the replicability of the models across different datasets and their applicability to recidivism prediction. The different datasets and ML techniques used in each of the 12 studies have been compared using the two selected metrics. This study shows how each method applied achieves good performance, with an average score of 0.81 for ACC and 0.74 for AUC. This systematic review highlights key points that could allow criminal justice professionals to routinely exploit predictions of recidivism risk based on ML techniques. These include the presence of performance metrics, the use of transparent algorithms or explainable artificial intelligence (XAI) techniques, as well as the high quality of input data.Entities:
Keywords: artificial intelligence; crime prediction; explainable artificial intelligence; machine learning; recidivism
Mesh:
Year: 2022 PMID: 36078307 PMCID: PMC9517748 DOI: 10.3390/ijerph191710594
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Flowchart showing the process of inclusion of publications.
Dataset combined and ML techniques applied.
| Authors | Dataset Combined | ML Techniques |
|---|---|---|
| Butsara et al. (2019) [ | Data by central correctional institution for drug addicts and central women correctional institution in Thailand | Data standardization + Feature selection and CV |
| Duwe and Kim (2017) [ | Minnesota Screening Tool Assessing Recidivism Risk (MnSTARR) + Minnesota Sex Offender Screening Tool-3 (MnSOST-3) | CV |
| Ghasemi et al. (2021) [ | Level of Service/Case Management Inventory (LS/CMI) | CV |
| Haarsma et al. (2020) [ | NeuroCognitive Risk Assessment (NCRA) + demographic feature set | Feature selection + CV |
| Karimi-Haghighi and Castillo (2021) [ | RisCanvi | CV |
| Ozkan et al. (2019) [ | Florida Department of Juvenile Justice (FDJJ) | Feature selection |
| Salo et al. (2019) [ | Finnish Risk and Needs Assessment Form (Riski-ja tarvearvio [RITA]) Finnish Prisoner Database + static predictors | CV |
| Singh and Mohapatra (2021) [ | HCR-20 + clinical and non-clinical risk assessment factors | ANOVA + CV |
| Ting et al. (2018) [ | Youth Level of Service/Case Management Inventory 2.0 (YLS/CMI) | |
| Tolan et al. (2019) [ | Structured Assessment of Violence Risk in Youth (SAVRY) + static features | CV |
| Tollenaar et al. (2013) [ | StatRec with Dutch Offender’s Index | |
| Tollenaar et al. (2019) [ | Dutch Offender’s Index (DOI) | CV |
CV: cross validation; ANOVA: analysis of variance.
Purpose of datasets, ML models and their evaluation.
| Dataset | Type of Recurrence | Purpose | ML Model | Evaluation Metrics | Evaluation Value |
|---|---|---|---|---|---|
| Thailand | Other | Recidivism in drug distribution | Logistic Regression | ACC | 0.90 |
| MnSTARR+ | General | General recidivism | LogitBoost | ACC | 0.82 |
| LS/CMI | General | General recidivism | Random Forest | ACC | 0.74 |
| NCRA+ | General | General recidivism | Glmnet | AUC | 0.70 |
| RisCanvi | Violent | Violent Recidivism | MLP | AUC | 0.78 |
| FDJJ | Sexual | Sexual recidivism in Youth | Random Forest | AUC | 0.71 |
| RITA+ | Other | General and violent recidivism in male | Random Forest | AUC | 0.78 |
| HCR-20+ | General | General recidivism | Ensemble model with NBC, kNN, MLP, PNN, SVM | ACC | 0.87 |
| YLS/CMI | Other | General recidivism in Youth | Random Forest | ACC | 0.65 |
| SAVRY+ | Other | Violent recidivism in youth | Logistic Regression | AUC | 0.71 |
| StatRec | General | General Recidivism | Logistic Regression | ACC | 0.73 |
| Sexual | Sexual recidivism | LDA | ACC | 0.96 | |
| Violent | Violent recidivism | Logistic regression | ACC | 0.78 | |
| DOI | General | General recidivism | L1–Logistic Regression | ACC | 0.78 |
| Sexual | Sexual recidivism | L1–Logistic Regression | ACC | 0.96 | |
| Violent | Violent recidivism | Penalized LDA | ACC | 0.78 |
ACC: accuracy; AUC: area under the curve; Thailand: data by central correctional institution for drug addicts and central women correctional institution in Thailand; MnSTARR+: Minnesota Screening Tool Assessing Recidivism Risk + Minnesota Sex Offender Screening Tool-3; LS/CMI: Level of Service/Case Management Inventory; NCRA+: NeuroCognitive Risk.
Evaluation general recidivism.
| MnSTARR+ | LS/CMI | NCRA+ | HCR-20+ | StatRec | DOI | |
|---|---|---|---|---|---|---|
| ACC | 0.82 | 0.74 |
| 0.74 | 0.78 | |
| AUC |
| 0.75 | 0.70 |
| 0.73 |
AUC: area under curve; ACC: accuracy; MnSTARR+: Minnesota Screening Tool Assessing Recidivism Risk + Minnesota Sex Offender Screening Tool-3; LS/CMI: Level of Service/Case Management Inventory; NCRA+: NeuroCognitive Risk Assessment + demographic feature set; HCR-20+: Historical, Clinical and Risk Management–20 + clinical and non-clinical risk assessment factors; StatRec: static recidivism risk (Static Recidiverisico); DOI: Dutch Offender’s Index. The highest scores are highlighted in bold.
Evaluation sexual recidivism.
| StatRec | DOI | |
|---|---|---|
| ACC |
|
|
| AUC | 0.73 |
|
AUC: area under curve; ACC: accuracy; StatRec: static recidivism risk (Static Recidiverisico); DOI: Dutch Offender’s Index. The highest scores are highlighted in bold.
Evaluation violent recidivism.
| RisCanvi | StatRec | DOI | |
|---|---|---|---|
| ACC |
|
| |
| AUC |
| 0.74 | 0.74 |
AUC: area under curve; ACC: accuracy; RisCanvi: risk assessment protocol for violence prevention introduced in the Catalan prison; StatRec: static recidivism risk (Static Recidiverisico); DOI: Dutch Offender’s Index. The highest scores are highlighted in bold.
Evaluation all the other recidivism.
| Thailand | FDJJ | RITA+ | YLS/CMI | SAVRY+ | |
|---|---|---|---|---|---|
| ACC |
| 0.65 | |||
| AUC | 0.71 |
| 0.69 | 0.71 |
AUC: area under curve; ACC: accuracy; Thailand: Thailand: data by central correctional institution for drug addicts and central women correctional institution in Thailand; FDJJ: Florida Department of Juvenile Justice; RITA+: Finnish Risk and Needs Assessment Form + static predictors; YLS/CMI: Youth Level of Service/Case Management Inventory 2.0; SAVRY+: Structured Assessment of Violence Risk in Youth + static features. The highest scores are highlighted in bold.
Tabular presentation for ROBIS results.
| Phase 2 | Phase 3 | ||||
|---|---|---|---|---|---|
| Review | 1. | 2. Identification and Selection of Studies | 3. Data Collection and Study Appraisal | 4. Synthesis and Findings | Risk of Bias in the Review |
| Butsara et al. (2019) [ |
|
|
|
|
|
| Duwe and Kim (2017) [ |
|
|
|
|
|
| Ghasemi et al. (2021) [ |
|
| ? |
|
|
| Haarsma et al. (2020) [ |
|
|
|
|
|
| Karimi-Haghighi and Castillo (2021) [ |
|
|
|
|
|
| Ozkan et al. (2019) [ |
|
|
|
| ? |
| Salo et al. (2019) [ |
|
| ? |
|
|
| Singh and Mohapatra (2021) [ |
|
|
|
|
|
| Ting et al. (2018) [ |
|
|
|
|
|
| Tolan et al. (2019) [ |
|
|
|
| ? |
| Tollenaar et al. (2013) [ |
|
|
|
|
|
| Tollenaar et al. (2019) [ |
|
|
|
|
|
☺ = low risk; ☹ = high risk and ? = unclear risk.
Figure 2Graphical display for ROBIS results.