| Literature DB >> 30849094 |
Nikolaj Tollenaar1, Peter G M van der Heijden2,3.
Abstract
In a recidivism prediction context, there is no consensus on which modeling strategy should be followed for obtaining an optimal prediction model. In previous papers, a range of statistical and machine learning techniques were benchmarked on recidivism data with a binary outcome. However, two important tree ensemble methods, namely gradient boosting and random forests were not extensively evaluated. In this paper, we further explore the modeling potential of these techniques in the binary outcome criminal prediction context. Additionally, we explore the predictive potential of classical statistical and machine learning methods for censored time-to-event data. A range of statistical manually specified statistical and (semi-)automatic machine learning models is fitted on Dutch recidivism data, both for the binary outcome case and censored outcome case. To enhance generalizability of results, the same models are applied to two historical American data sets, the North Carolina prison data. For all datasets, (semi-) automatic modeling in the binary case seems to provide no improvement over an appropriately manually specified traditional statistical model. There is however evidence of slightly improved performance of gradient boosting in survival data. Results on the reconviction data from two sources suggest that both statistical and machine learning should be tried out for obtaining an optimal model. Even if a flexible black-box model does not improve upon the predictions of a manually specified model, it can serve as a test whether important interactions are missing or other misspecification of the model are present and can thus provide more security in the modeling process.Entities:
Mesh:
Year: 2019 PMID: 30849094 PMCID: PMC6407787 DOI: 10.1371/journal.pone.0213245
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample characteristics 2006* DOI data.
| General recidivism | Violent recidivism | Sexual recidivism | |
|---|---|---|---|
| Total | 20,000 | 20,000 | 2,058 |
| average observation time in months | 58.8 | 71.6 | 80.7 |
| % experiencing event | 44.8 | 28.7 | 4.1 |
| 4 year base rate (%) | 37.7 | 22.6 | 3.1 |
| Gender: female(%) | 15.6 | 10.3 | - |
| Age in years (mean) | 35.8 | 34.8 | 39.5 |
| Age of first conviction (mean) | 27.9 | 25.5 | 31.2 |
| Most serious offence type (%) | |||
| Violence | 14.5 | 100.0 | 0.5 |
| Sexual | 0.7 | 0.5 | 97.6 |
| Property with violence | 1.2 | 1.2 | 0.8 |
| Property without violence | 22.7 | 4.2 | 0 |
| Public order | 10.3 | 13.8 | 0 |
| Drug offence | 6.9 | 1.0 | 0 |
| Motoring offence | 30.8 | 1.5 | 0 |
| Misc. offence | 13.0 | 6.3 | 1.0 |
| Country of birth (%) | |||
| Netherlands | 71.7 | 72.0 | 73.6 |
| Morocco | 2.8 | 3.9 | 2.7 |
| Neth. Antilles/Aruba | 2.7 | 3.5 | 3.2 |
| Surinam | 4.5 | 5.0 | 4.5 |
| Turkey | 3.0 | 3.7 | 2.2 |
| Other Western countries | 8.1 | 4.9 | 5.6 |
| Other non-Western countries | 7.2 | 7.0 | 8.2 |
| Offence type present in index case (%) | |||
| Violence component (0/1) | 15.9 | 100.0 | 12.6 |
| Sexual component | 0.7 | 0.5 | 100.0 |
| Property with violence | 1.2 | 1.2 | 1.7 |
| Property without violence | 23.3 | 4.2 | 3.1 |
| Public order | 13.0 | 13.8 | 6.2 |
| Drug offence | 8.2 | 1.0 | 0.9 |
| Motoring offence | 31.5 | 1.5 | 0.3 |
| Misc. offence | 15.1 | 6.3 | 9.4 |
| Criminal history counts (mean) | |||
| Conviction density | 0.4 | 0.5 | 0.3 |
| Number of previous convictions | 4.4 | 5.4 | 3.7 |
| Previous violent offences | 0.5 | 1.0 | 0.5 |
| Previous sexual offences | 0.0 | 0.0 | 0.2 |
| Previous property with | |||
| violence offences | 0.1 | 0.2 | 0.2 |
| Previous property offences | 2.3 | 2.4 | 1.5 |
| Previous public order offences | 0.6 | 1.0 | 0.6 |
| Previous drug offences | 0.2 | 0.2 | 0.1 |
| Previous motoring offences | 0.8 | 0.8 | 0.6 |
| Previous misc. offences | 0.3 | 0.4 | 0.3 |
| Previous prison terms | 0.9 | 1.0 | 0.7 |
| Previous community service orders | 0.3 | 0.5 | 0.3 |
| Previous fines | 0.9 | 1.2 | 0.8 |
| Previous PPDs | 0.5 | 0.5 | 0.3 |
Note: The sexual recidivism data also contain data from 2007, to enlarge the sample size.
†Public prosecutor's disposals.
Sample characteristics North Carolina prison data.
| 1978 | 1980 | |||
|---|---|---|---|---|
| Estimation | Validation | Estimation | Validation | |
| Observation time in months (mean) | 56.0 | 56.1 | 39.4 | 38.7 |
| % experiencing event | 37.0 | 37.4 | 37.0 | 37.5 |
| 4 year base rate (%) | 32.1 | 32.3 | 36.1 | 37.1 |
| TSERVD (mean) | 18.8 | 19.8 | 19.5 | 19.2 |
| AGE (mean) | 346.1 | 342.3 | 339.0 | 341.9 |
| PRIORS (mean) | 1.4 | 1.4 | 1.4 | 1.3 |
| RULE (mean) | 1.1 | 1.3 | 1.5 | 1.5 |
| SCHOOL (mean) | 9.7 | 9.7 | 9.6 | 9.6 |
| WHITE (%) | 50.9 | 52.2 | 51.0 | 51.1 |
| MALE (%) | 93.8 | 94.3 | 94.6 | 94.4 |
| ALCHY (%) | 21.0 | 19.6 | 35.7 | 36.0 |
| JUNKY (%) | 23.9 | 27.2 | 21.8 | 19.6 |
| MARRIED (%) | 25.6 | 26.8 | 23.4 | 23.3 |
| SUPER (%) | 69.8 | 69.6 | 80.1 | 81.6 |
| WORKREL (%) | 45.5 | 46.5 | 43.3 | 42.9 |
| FELON (%) | 31.2 | 32.1 | 43.1 | 41.3 |
| PERSON (%) | 6.0 | 5.3 | 11.3 | 11.2 |
| PROPTY (%) | 25.1 | 25.7 | 44.7 | 44.0 |
Used models and tuning parameter values used for the binary outcome data.
| Parameter 1 | Parameter 2 | Parameter 3 | Parameter 4 | |
|---|---|---|---|---|
| Linear logistic regression [ | - | - | - | - |
| Linear discriminant analysis [ | - | - | - | - |
| λ1 = 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.5, 1, 10, 20, 30, 50, 70, 100, 500, 1,000 | - | - | - | |
| λ2 = 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.5, 1, 10, 20, 30, 50, 70, 100, 500, 1,000 | - | - | - | |
| Penalized discriminant analysis | λ = 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.5, 1, 10, 20, 30, 50, 70, 100, 500, 1,000 | - | - | - |
| Random forest [ | Ntrees: 1,000 | Npredictors: 2, 3, 4, 5, 6, 7, 9, 11, 13, 16 | Node size = 1 | - |
| Stochastic gradient boosting [ | Max. Ntrees: 1,000 | Interaction depth: 2, 3, 4, 5, 6, 7, 8, 9 | Bag fraction = 0.5 | |
| BART [ | Ntrees: 200 | Niter:1,000 | Number of burn-in iterations: 100 |
† Models were tried out both with standardized and unstandardized input data.
Tuning parameters used for the survival data.
| Parameter 1 | Parameter2 | Parameter 3 | |
|---|---|---|---|
| Cox regression [ | - | - | - |
| Aalen regression | - | - | - |
| Cox-cure model | - | - | - |
| Parametric survival models | - | - | - |
| Random survival forest [ | Ntrees: 1,000 | Npredictors: 2, 3, 4, 5, 6, 7, 9, 11, 13, 16 | Node size = 3 |
| Ridge Cox boosting§ [ | Max. number of iterations: 1,000 | csmf = 1 | |
| Stochastic gradient boosting survival analysis [ | Niter: 1,000 | Interaction depth: 2, 3, 4, 5, 6, 7, 8, 9 | Bag fraction = .5 |
| λ1 = 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.5, 1, 10, 20, 30, 50, 70, 100, 500, 1,000 | - | - | |
| λ2 = 0.00001, 0.0001, 0.001, 0.01, 0.1, 0.5, 1, 10, 20, 30, 50, 70, 100, 500, 1,000 | - | - | |
| Single hidden layer Cox neural network | Size of hidden layer = | Weight decay = 0, 0.1, 0.01, 0.001, 0.0001 | - |
| Single hidden layer parametric Neural network models | Size of hidden layer = | Weight decay = 0, 0.1, 0.01, 0.001, 0.0001 | - |
| PLS§ [ | Ncomponents: | - | - |
†Fully non-parametric Aalen models were used, as opposed to semi-parametric Cox-Aalen models [79].
‡The cure parameter was specified to depend on the same variables as the Cox part of the model.
§ Models were tried out both with standardized and unstandardized input data.
ǁ Prior to modeling, the data were rescaled between 0–1 to equate the influence of weight decay on each covariate. The weight decay is identical to the λ in L2-Cox-regression.
Fig 1ROC-curves general recidivism test data.
Fig 2Calibration plots general recidivism test data.
Fig 3Accuracy plot general recidivism test data.
Fig 4ROC-curves violent recidivism test data.
Fig 5Calibration plots violent recidivism test data.
Fig 6Accuracy plot violent recidivism test data.
Fig 7ROC-curves sexual recidivism test data.
Fig 9Accuracy plot sexual recidivism test data.
Fig 8Calibration plots sexual recidivism test data.
Fig 10ROC-curves Schmidt & Witte 1978 test data.
Fig 11Calibration plot Schmidt & Witte 1978 test data.
Fig 12Accuracy plot Schmidt & Witte 1978 test data.
Fig 13ROC-curves Schmidt & Witte 1980 test data.
Fig 14Calibration plot Schmidt & Witte 1980 test data.
Fig 15Accuracy plot Schmidt & Witte 1980 test data.
Fig 16Brier scores general recidivism survival models over time, test data.
Fig 17Brier scores violent recidivism survival models over time, test data.
Fig 18Brier scores sexual recidivism survival models over time, test data.
Fig 19Brier scores survival models Schmidt Witte 1978 data over time, test data.
Fig 20Brier scores survival models Schmidt Witte 1978 data over time, test data.