Duy Van Le1, James Montgomery2, Kenneth C Kirkby3, Joel Scanlan2. 1. School of Technology, Environments and Design, College of Sciences and Engineering, University of Tasmania, Private Bag 87, Hobart 7001, TAS, Australia. Electronic address: duyvan.le@utas.edu.au. 2. School of Technology, Environments and Design, College of Sciences and Engineering, University of Tasmania, Private Bag 87, Hobart 7001, TAS, Australia. 3. School of Medicine, College of Health and Medicine, University of Tasmania, Private Bag 87, Hobart 7001, TAS, Australia.
Abstract
OBJECTIVE: Instruments rating risk of harm to self and others are widely used in inpatient forensic psychiatry settings. A potential alternate or supplementary means of risk prediction is from the automated analysis of case notes in Electronic Health Records (EHRs) using Natural Language Processing (NLP). This exploratory study rated presence or absence and frequency of words in a forensic EHR dataset, comparing four reference dictionaries. Seven machine learning algorithms and different time periods of EHR analysis were used to probe which dictionary and which time period were most predictive of risk assessment scores on validated instruments. MATERIALS AND METHODS: The EHR dataset comprised de-identified forensic inpatient notes from the Wilfred Lopes Centre in Tasmania. The data comprised unstructured free-text case note entries and serial ratings of three risk assessment scales: Historical Clinical Risk Management-20 (HCR-20), Short-Term Assessment of Risk and Treatability (START) and Dynamic Appraisal of Situational Aggression (DASA). Four NLP dictionary word lists were selected: 6865 mental health symptom words from the Unified Medical Language System (UMLS), 455 DSM-IV diagnoses from UMLS repository, 6790 English positive and negative sentiment words, and 1837 high frequency words from the Corpus of Contemporary American English (COCA). Seven machine learning methods Bagging, J48, Jrip, Logistic Model Trees (LMT), Logistic Regression, Linear Regression and Support Vector Machine (SVM) were used to identify the combination of dictionaries and algorithms that best predicted risk assessment scores. RESULTS: The most accurate prediction was attained on the DASA dataset using the sentiment dictionary and the LMT and SVM algorithms. CONCLUSIONS: NLP, used in conjunction with NLP dictionaries and machine learning, predicted risk ratings on the HCR-20, START, and DASA, based on EHR content. Further research is required to ascertain the utility of NLP approaches in predicting endpoints of actual self-harm, harm to others or victimisation.
OBJECTIVE: Instruments rating risk of harm to self and others are widely used in inpatient forensic psychiatry settings. A potential alternate or supplementary means of risk prediction is from the automated analysis of case notes in Electronic Health Records (EHRs) using Natural Language Processing (NLP). This exploratory study rated presence or absence and frequency of words in a forensic EHR dataset, comparing four reference dictionaries. Seven machine learning algorithms and different time periods of EHR analysis were used to probe which dictionary and which time period were most predictive of risk assessment scores on validated instruments. MATERIALS AND METHODS: The EHR dataset comprised de-identified forensic inpatient notes from the Wilfred Lopes Centre in Tasmania. The data comprised unstructured free-text case note entries and serial ratings of three risk assessment scales: Historical Clinical Risk Management-20 (HCR-20), Short-Term Assessment of Risk and Treatability (START) and Dynamic Appraisal of Situational Aggression (DASA). Four NLP dictionary word lists were selected: 6865 mental health symptom words from the Unified Medical Language System (UMLS), 455 DSM-IV diagnoses from UMLS repository, 6790 English positive and negative sentiment words, and 1837 high frequency words from the Corpus of Contemporary American English (COCA). Seven machine learning methods Bagging, J48, Jrip, Logistic Model Trees (LMT), Logistic Regression, Linear Regression and Support Vector Machine (SVM) were used to identify the combination of dictionaries and algorithms that best predicted risk assessment scores. RESULTS: The most accurate prediction was attained on the DASA dataset using the sentiment dictionary and the LMT and SVM algorithms. CONCLUSIONS: NLP, used in conjunction with NLP dictionaries and machine learning, predicted risk ratings on the HCR-20, START, and DASA, based on EHR content. Further research is required to ascertain the utility of NLP approaches in predicting endpoints of actual self-harm, harm to others or victimisation.
Authors: Christophe Lemey; Aziliz Le Glaz; Yannis Haralambous; Deok-Hee Kim-Dufor; Philippe Lenca; Romain Billot; Taylor C Ryan; Jonathan Marsh; Jordan DeVylder; Michel Walter; Sofian Berrouiguet Journal: J Med Internet Res Date: 2021-05-04 Impact factor: 5.428
Authors: Jeffrey Thompson; Jinxiang Hu; Dinesh Pal Mudaranthakam; David Streeter; Lisa Neums; Michele Park; Devin C Koestler; Byron Gajewski; Roy Jensen; Matthew S Mayo Journal: Sci Rep Date: 2019-06-25 Impact factor: 4.379
Authors: Carlos R Oliveira; Patrick Niccolai; Anette Michelle Ortiz; Sangini S Sheth; Eugene D Shapiro; Linda M Niccolai; Cynthia A Brandt Journal: JMIR Med Inform Date: 2020-11-03