| Literature DB >> 33126735 |
David A Huber1, Steffen Lau1, Martina Sonnweber1, Moritz P Günther2, Johannes Kirchebner1.
Abstract
Migrants diagnosed with schizophrenia are overrepresented in forensic-psychiatric clinics. A comprehensive characterization of this offender subgroup remains to be conducted. The present exploratory study aims at closing this research gap. In a sample of 370 inpatients with schizophrenia spectrum disorders who were detained in a Swiss forensic-psychiatric clinic, 653 different variables were analyzed to identify possible differences between native Europeans and non-European migrants. The exploratory data analysis was conducted by means of supervised machine learning. In order to minimize the multiple testing problem, the detected group differences were cross-validated by applying six different machine learning algorithms on the data set. Subsequently, the variables identified as most influential were used for machine learning algorithm building and evaluation. The combination of two childhood-related factors and three therapy-related factors allowed to differentiate native Europeans and non-European migrants with an accuracy of 74.5% and a predictive power of AUC = 0.75 (area under the curve). The AUC could not be enhanced by any of the investigated criminal history factors or psychiatric history factors. Overall, it was found that the migrant subgroup was quite similar to the rest of offender patients with schizophrenia, which may help to reduce the stigmatization of migrants in forensic-psychiatric clinics. Some of the predictor variables identified may serve as starting points for studies aimed at developing crime prevention approaches in the community setting and risk management strategies tailored to subgroups of offenders with schizophrenia.Entities:
Keywords: ethnicity; machine learning; minorities; risk factors for criminal behavior; stigmatization
Mesh:
Year: 2020 PMID: 33126735 PMCID: PMC7663465 DOI: 10.3390/ijerph17217922
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Data processing and statistical analysis. Step 1—Data preparation: Variables with more than 33% missing values eliminated; multiple categorical variables one-hot encoded; continuous variables not manipulated. Step 2—Data split: Split in training dataset with 70% of cases and test dataset with 30% of cases. Step 3—Imputation: Training and test dataset separately imputed via multivariate imputation by chained equations (MICE). Step 4a,b—Variable reduction: Identification of most influential variables in training dataset via different machine learning algorithms. Step 5—Variable importance: Variables ranked in order of importance, based on the number of times identified as most important by machine learning algorithms. Step 6—Variable assessment: Check for multicollinearity and selection of predictor variables identified at least 3x by algorithms. Step 7a,b—Model selection: Calculation and selection of best machine learning algorithm with identified predictor variables on test dataset embedded in 5-fold cross-validation. Step 8—Model assessment: Accuracy, area under the curve (AUC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), extraction of best model.
Description of study sample.
| Variable | Total | European Nationality | Non-European Nationality |
|---|---|---|---|
| Male sex | 339 (91.6) | 243 (91.4) | 96 (92.3) |
| Age at admission (mean, SD) | 34.15 (10.226) | 34.37 (10.695) | 33.60 (8.942) |
| Single (at offence) | 297 (80.3) | 224 (84.2) | 73 (70.2) |
| Diagnosis schizophrenia (ICD-9/10) | 294 (79.5) | 219 (82.3) | 75 (72.1) |
| Birth country Switzerland | 167 (45.1) | ||
| Other European country | 99 (26.8) | ||
| Middle East | 25 (6.8) | ||
| Africa | 54 (14.6) | ||
| Other Country | 25 (6.8) |
Note. SD = Standard deviation.
Most important variables to differentiate between European/non-European identified by at least three algorithms.
| Variable Description | Variable Code | Frequency of Identification |
|---|---|---|
| Mean dose equivalent of olanzapine at discharge | R9e | 5 |
| Religious confession of the patient: Islam | SD4b | 5 |
| Migration experienced | SD20 | 5 |
| Patient suffering from poverty in childhood/ adolescence | CJ16 | 4 |
| Social isolation in childhood/ adolescence | CJ1 | 4 |
| Only engaged in most basic tasks in ergotherapy | R17b | 3 |
| Language problems during psychotherapy | R16g | 3 |
Absolute and relative distribution of predictor variables.
| Variable | European | Non-European |
|---|---|---|
| Mean dose equivalent of olanzapine at discharge (with SD) | 18.46 (12.93) | 21.22 (16.52) |
| Patient suffering from poverty in childhood/ adolescence | 68/217 (31.3) | 39/66 (59.1) |
| Social isolation in childhood/ adolescence | 126/220 (57.3) | 16/58 (27.6) |
| Only engaged in most basic tasks in ergotherapy | 110/263 (41.8) | 70/102 (68) |
| Language problems during psychotherapy | 3/265 (1.2) | 10/97 (10.31) |
Note. SD = standard deviation.
Figure 2Receiver operating characteristic (ROC) curve of the final selected model (tree algorithm).
Final tree model performance measures.
| Performance measures | % 95% Confidence Interval |
|---|---|
| Accuracy | 74.46 [69.92, 78.76] |
| AUC | 0.7500 [0.5389, 0.8610] |
| Sensitivity | 75.00 [70.01, 79.42] |
| Specificity | 69.23 [48.10, 84.91] |
| PPV | 96.99 [93.94, 98.59] |
| NPV | 17.31 [10.85, 26.24] |
Note. AUC = area under the curve (level of discrimination); PPV = positive predictive value; NPV = negative predictive value.
Predictor variables identified by at least one machine learning algorithm.
| Variable Code | Variable Description | Algorithm |
|---|---|---|
| SD4a | religious confession of the patient: Catholic | NHST, naive Bayes |
| SD4b | religious confession of the patient: Islam | NHST, backward selection, trees, SVM, naive Bayes |
| SD5a | marital status | NHST |
| SD14 | own children | NHST, tree |
| SD18a | legal guardian: birth parents | NHST, SVM |
| SD18b | legal guardian: single parent | tree |
| SD20 | migration experienced | NHST, backward selection, trees, SVM, naive Bayes |
| CJ1 | social isolation in childhood/ adolescence | NHST, backward selection, logistic regression, tree |
| CJ8 | professional help by a psychiatrist/psychologist sought in patient´s childhood/ adolescence | NHST |
| CJ12 | severe conflicts between the parents in patient´s childhood/ adolescence | NHST, logistic regression |
| CJ16 | patient suffering from poverty in childhood/ adolescence | NHST, backward selection, tree, SVM |
| CJ29 | school failure | NHST, logistic regression |
| PH4 | symptom hallucinations in patient’s history | NHST |
| PH14c | opioid abuse/dependency in patient’s history | NHST |
| PH14d | cocaine abuse/addiction in patient’s history | NHST |
| PH14e | stimulants, amphetamines, ecstasy abuse/addiction in patient´s history | NHST |
| PH17a | persistent refusal of occupational therapy during inpatient treatment | NHST |
| PH18a | outpatient psychiatric treatment(s) before investigated offence | NHST |
| PH19c | number of inpatient treatment(s) before investigated offence | NHST |
| PH19d | time period between release out of last inpatient treatment und the investigated offence in weeks | SVM, naive Bayes |
| PH25e | homeless at time of the investigated offence | NHST |
| CH4k | criminal record: traffic offence | NHST, backward selection |
| CH10b | stay in forensic institution mandated by jurisdiction | NHST |
| S4 | psychosocial variable: low self esteem | NHST |
| J1 | time spent in prison | Tree |
| J3b | type of imprisonment | Logistic regression, SVM |
| D1 | amount of index offences | SVM, naive Bayes |
| D2g | index offence: property crime without violence | NHST |
| D2k | index offence: traffic offence | NHST |
| D10a | in relationship with victim | NHST, backward selection, |
| D11c | location of offence: mutual home with victim | NHST, SVM, naive Bayes |
| D11j | location of offence: public space | NHST |
| D22b | patient´s subjective statements correspond with the facts described in the police records | NHST |
| D25a | patient touched victim’s genital/breast | NHST |
| D25b | patient performed vaginal intercourse at offence | NHST |
| R1c | current psychiatric F2X diagnosis - acute psychotic disorder | NHST |
| R4d | assigned from prison | NHST |
| R5 | patient in legal measure | NHST |
| R8l | clozapine medication in current hospitalization | NHST |
| R9e | mean dose equivalent of olanzapine at discharge * | Backward selection, logistic regression, tree, SVM, naive Bayes |
| R15a | main content of psychotherapy/ conversations “more loosening within forensic setting” | NHST |
| R16g | language problems during psychotherapy | NHST, tree, naive Bayes |
| R17b | only engaged in most basic tasks in ergotherapy | NHST, backward selection, SVM |
| R19 | insight into wrongfulness of offence | NHST |
| R22a | time spent in current forensic hospitalization (weeks) | NHST |
| R25b | Presumably close contact to family at release of current forensic hospitalization | NHST, backward selection |
| R26 | insight into illness and its treatment | NHST |
| PANSS3 | PANSS at admission: Scale Hallucinations | NHST |
| PANSS23 | PANSS at admission: Scale Unusual thought content | NHST |
Note. NHST = null hypothesis significance testing; SVM = support vector machines; PANNS = Positive and negative syndrome scale; * Note that for conversion of cumulative antipsychotic dosages into olanzapine equivalents, conversion factors provided by the classical weighted mean dose method [71] were used with a very small number of exceptions where older antipsychotics were prescribed. In these cases, the minimum effective dose method [72] or international experts’ consensus-based olanzapine equivalents [73] provided the necessary converting factors.