| Literature DB >> 32038355 |
Gabe Haarsma1, Sasha Davenport1, Devonte C White1,2, Pablo A Ormachea1, Erin Sheena1, David M Eagleman1,3.
Abstract
We seek to address current limitations of forensic risk assessments by introducing the first mobile, self-scoring, risk assessment software that relies on neurocognitive testing to predict reoffense. This assessment, run entirely on a tablet, measures decision-making via a suite of neurocognitive tests in less than 30 minutes. The software measures several cognitive and decision-making traits of the user, including impulsivity, empathy, aggression, and several other traits linked to reoffending. Our analysis measured whether this assessment successfully predicted recidivism by testing probationers in a large urban city (Houston, TX, United States) from 2017 to 2019. To determine predictive validity, we used machine learning to yield cross-validated receiver-operator characteristics. Results gave a recidivism prediction value of 0.70, making it comparable to commonly used risk assessments. This novel approach diverges from traditional self-reporting, interview-based, and criminal-records-based approaches, and can also add a protective layer against bias, while strengthening model accuracy in predicting reoffense. In addition, subjectivity is eliminated and time-consuming administrative efforts are reduced. With continued data collection, this approach opens the possibility of identifying different levels of recidivism risk, by crime type, for any age, or gender, and seeks to steer individuals appropriately toward rehabilitative programs. Suggestions for future research directions are provided.Entities:
Keywords: machine learning; neurocognitive; neurolaw; predictive validity; risk assessment
Year: 2020 PMID: 32038355 PMCID: PMC6992536 DOI: 10.3389/fpsyg.2019.02926
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Commonly used risk assessments, their stated purpose, and their median area under the curve (AUC) (Singh et al., 2011; Desmarais et al., 2016).
| COMPAS | 0.67 | General and violent recidivism, pretrial misconduct |
| IRAS | 0.63 | General recidivism |
| LSI-R | 0.64 | General recidivism |
| ORAS | 0.66 | General recidivism |
| PCRA | 0.71 | Post-conviction reoffense, under supervision |
| PSA | 0.66 | Pretrial risk assessment |
| RMS | 0.67 | General recidivism |
| SARA | 0.70 | Domestic violence |
| SAVRY | 0.71 | Violent risk in youth |
| SORAG | 0.75 | Sex offender |
| SPIn-W | 0.73 | Gender-responsive (for women) |
| Static-99 | 0.70 | Sex offenders, pre-release |
| STRONG | 0.74 | General recidivism |
| SVR-20 | 0.78 | Sexual violence |
| TRAS | 0.67 | General recidivism |
| VRAG | 0.74 | Violent risk |
| WRN | 0.67 | General recidivism |
FIGURE 1Screenshots of the NeuroCognitive Risk Assessment (NCRA): (A) the Eriksen Flanker, (B) Balloon Analog Risk Task, (C) Go/No-Go, (D) Point-Subtraction Aggression Paradigm, (E) Reading the Mind Through the Eyes, (F) Emotional Stroop, and (G) Tower of London (Ormachea et al., 2017).
Overview of probationers (age and gender) and recidivism, by current offense category.
| DWI | 182 | 15 | 8.2 | 78.0 | 31.9 | 10.7 |
| Drug | 187 | 38 | 20.3 | 78.6 | 28.4 | 10.0 |
| Non-violent | 35 | 13 | 37.1 | 85.7 | 29.1 | 9.3 |
| Property | 122 | 23 | 18.9 | 59.8 | 27.7 | 10.2 |
| Sexual non-violent | 8 | 1 | 12.5 | 75.0 | 29.4 | 8.9 |
| Sexual violent | 8 | 1 | 12.5 | 87.5 | 38.6 | 10.1 |
| Violent | 188 | 35 | 18.6 | 77.1 | 27.9 | 8.6 |
| Total | 730 | 126 | 17.3 | 75.3 | 28.8 | 10.0 |
Self-reported race/ethnicity and number of previous arrests.
| DWI | 182 | 3.3 | 23.1 | 41.8 | 27.5 | 4.4 | 10 | 73 | 41 | 27 | 31 | 0 |
| Drug | 187 | 3.2 | 32.1 | 34.8 | 22.5 | 7.5 | 5 | 49 | 40 | 50 | 41 | 2 |
| Non-violent | 35 | 0.0 | 65.7 | 28.6 | 2.9 | 2.9 | 2 | 8 | 9 | 9 | 7 | 0 |
| Property | 122 | 1.6 | 44.3 | 31.1 | 22.1 | 0.8 | 8 | 46 | 26 | 20 | 22 | 0 |
| Sexual non-violent | 8 | 0.0 | 50.0 | 25.0 | 25.0 | 0.0 | 0 | 3 | 2 | 1 | 1 | 1 |
| Sexual violent | 8 | 0.0 | 25.0 | 25.0 | 25.0 | 25.0 | 0 | 5 | 1 | 1 | 1 | 0 |
| Violent | 188 | 1.1 | 44.7 | 32.4 | 16.5 | 5.3 | 11 | 53 | 42 | 48 | 31 | 3 |
| Total | 730 | 2.2 | 36.8 | 34.8 | 21.2 | 4.9 | 36 | 237 | 161 | 156 | 134 | 6 |
Self-reported education and employment.
| Middle or junior high | 17 | 2.3 | Not employed | 152 | 20.8 |
| Some high school | 143 | 19.6 | Homemaker | 14 | 1.9 |
| High school or GED | 323 | 44.2 | Student | 47 | 6.4 |
| Some/in college | 154 | 21.1 | Employed | 380 | 52.1 |
| College graduate | 36 | 4.9 | Student/employed | 12 | 1.6 |
| Graduate school | 7 | 1.0 | Other | 125 | 17.1 |
| Vocational school | 50 | 6.8 |
Definitions of the machine learning features used in each NCRA test.
| Time median | Median response time |
| Time standard deviation | Standard deviation response time |
| Exec effect | Median congruent trials – median incongruent trials |
| Frac correct | Percent of correct trials |
| NIH score | National Institute of Health Flanker score |
| Pop | Number of popped balloons |
| Time collected (*) | Total time/points collected from unpopped balloons |
| Pressed time median | Median time/points collected |
| Pressed count median | Median number of balloon inflate presses |
| Duration time median | Median time/duration of inflate presses |
| Correct go | Correct number of Go’s (carrot) |
| Correct no go | Correct number of No-Go’s (eggplant) |
| Time correct go | Mean response time of correct Go’s |
| Grow (*) | Number of individual grow taps/50 |
| Protect ratio | Protect taps/all taps |
| Punish ratio (*) | Punish taps/all taps |
| Correct (*) | Number of correct trials |
| Time median (*) | Median response time |
| Time standard deviation | Standard deviation response time |
| Dict lookup | Number of trials any trial word is looked up |
| Test correct | Test round with feedback number of correct trials |
| Test time (*) | Test round with feedback mean response time |
| Black correct | Std Stroop color words in black number of correct trials |
| Black time (*) | Std Stroop color words in black mean response time |
| Con color correct | Std Stroop color words congruent color number of correct trials |
| Con color time | Std Stroop color words congruent color mean response time |
| Incon color correct | Std Stroop color words incongruent color number of correct trials |
| Incon color time (*) | Std Stroop color words incongruent color mean response time |
| Neutral correct | Neutral words number of correct trials |
| Neutral time | Neutral words mean response time |
| Pos Neg correct | Positive and negative words number of correct trials |
| Pos Neg time (*) | Positive and negative words mean response time |
| Solved | Number of trials solved |
| Aborted (*) | Number of trials aborted (giving up) |
| All moves | Number of total moves |
| Dup moves (*) | Number of duplicated moves |
| Extra moves | Number of extra moves to solve |
| Illegal moves (*) | Number of illegal moves |
| Mean time | Mean trial time |
| Solved mean time | Mean trial time for solved trials |
| Solved median time | Median trial time for solved trials |
| First move time | Mean time waited before moving a disk in a trial |
| First move frac (*) | Mean fraction of time waited before moving a disk in a trial |
| Final time | Time the last disk was moved |
| Test moves | Number moves in the test round |
| Test time | Time spend in the test round |
| Test solved | Was the test round solved |
| Disk speed | Mean time between start and stop of moving a disk |
Feature sets defined, as used in machine learning modeling analysis.
| Full NCRA | NCRA test data, without any other information |
| RFE NCRA | Recursive feature elimination producing the top 13 most predictive features of the NCRA, with no other information |
| Full NCRA + Demographics | NCRA test data combined with demographics (age, gender, and the current crime category at the time of testing) |
| RFE NCRA + Demographics | Recursive feature elimination producing the top 13 most predictive features of the NCRA combined with demographics |
Machine learning models used with corresponding R statistical analysis package.
| GLM | Generalized linear models | stats version 3.6.1 |
| LDA | Linear discriminant analysis | MASS version 7.3-51.4 |
| k-NN | k-Nearest neighbors | class version 7.3-15 |
| SVM | Support vector machines (polynomial) | kernlab version 0.9-27 |
| GMB | Generalized boosted modeling | gbm version 2.1.5 |
| RF | Random forest | ranger version 0.11.2 |
| Glmnet | GLM with ridge and lasso regularization | glmnet version 2.0-18 |
ROC curve AUCs by feature sets along with machine learning algorithms used.
| Full NCRA | 0.60 | 0.61 | 0.56 | 0.64 | 0.63 | 0.60 | 0.64 |
| RFE NCRA | 0.64 | 0.65 | 0.58 | 0.66 | 0.64 | 0.62 | |
| Full NCRA + Demographics | 0.65 | 0.66 | 0.59 | 0.65 | 0.66 | 0.63 | 0.69 |
| RFE NCRA + Demographics | 0.68 | 0.69 | 0.60 | 0.67 | 0.67 | 0.66 |
FIGURE 2Receiver operating characteristic curves illustrating predictive performance of all machine learning algorithms when looking at the RFE NCRA + Demographics feature set.
FIGURE 3Receiver operating characteristic curves illustrating predictive performance of the Glmnet machine learning method over all feature sets.
FIGURE 4Receiver operating characteristic curves illustrating predictive performance of the Glmnet machine learning method for the RFE NCRA + Demographics feature set.