| Literature DB >> 32729840 |
Jihad S Obeid1, Jennifer Dahne1, Sean Christensen1, Samuel Howard1, Tami Crawford1, Lewis J Frey1, Tracy Stecker1, Brian E Bunnell2.
Abstract
BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum.Entities:
Keywords: deep learning; electronic health records; machine learning; natural language processing; suicide; suicide, attempted
Year: 2020 PMID: 32729840 PMCID: PMC7426805 DOI: 10.2196/17784
Source DB: PubMed Journal: JMIR Med Inform
The top 10 words in each group were compared with controls, along with the chi-square statistic for each.
| Concurrent with ISHa,b | Before ISHc | ||
| Keyword | Chi-square ( | Keyword | Chi-square ( |
| suicide | 1.3E+5 | disorder | 1.2E+4 |
| attempt | 8.2E+4 | sid | 8.5E+3 |
| overdose | 6.7E+4 | suicidal | 6.0E+3 |
| si | 6.5E+4 | mood | 5.8E+3 |
| disorder | 5.2E+4 | use | 4.7E+3 |
| suicidal | 5.2E+4 | alcohol | 4.6E+3 |
| psychiatry | 4.0E+4 | qhse | 4.5E+3 |
| iopf | 3.6E+4 | safety | 4.2E+3 |
| interview | 3.5E+4 | interview | 3.9E+3 |
| mood | 2.9E+4 | cocaine | 3.9E+3 |
aKeywords from clinical notes from visits concurrent with ISH events.
bISH: intentional self-harm.
cKeywords from clinical notes from visits before the first ISH events.
dsi: suicidal ideation.
eiop: Institute of Psychiatry.
fqhs: every bedtime (from Latin quaque hora somni).
Figure 1A visualization of a sample of relevant words derived from the Word2vec model reduced into two dimensions using t-distributed stochastic neighbor embedding. V1=variable 1; V2=variable 2.
Words semantically similar to the words attempt and ideation and their cosine similarity in the 200-dimension vector space as identified by the Word2vec analysis.
| Term | Cos sima | |
|
|
| |
|
| attempt | 1.000 |
|
| suicide | 0.730 |
|
| overdose | 0.696 |
|
| osteoarthrithis | 0.679 |
|
| gesture | 0.643 |
|
| sucicide | 0.625 |
|
| benzodiaspines | 0.619 |
|
| intentional | 0.617 |
|
| Cos sima | |
|
| ideation | 1.000 |
|
| suicidal | 0.872 |
|
| homicidal | 0.837 |
|
| ideations | 0.736 |
|
| intent | 0.681 |
|
| ideaiton | 0.651 |
|
| sib | 0.648 |
|
| sucidial | 0.619 |
aCos sim: cosine similarity.
bsi: suicidal ideation.
The metrics for training and cross-validation on the 2012 to 2017 data set.
| Model | AUCa (95% CIb) | Accuracy (95% CI) | Precision | Recall | F1 score |
| NBc | 0.908 (0.882-0.934) | 0.870 (0.839-0.898) | 0.734 | 0.865 | 0.794 |
| DTd | 0.870 (0.839-0.901) | 0.865 (0.833-0.893) | 0.715 | 0.885 | 0.791 |
| RFe | 0.961 (0.944-0.978) | 0.896 (0.867-0.921) | 0.794 | 0.865 | 0.828 |
| SVMf | 0.947 (0.925-0.969) | 0.900 (0.872-0.924) | 0.859 | 0.782 | 0.819 |
| MLPg | 0.957 (0.938-0.976) | 0.917 (0.890-0.939) | 0.828 | 0.897 | 0.862 |
| CNNrh | 0.984 (0.972-0.995) | 0.946 (0.924-0.964) | 0.938 | 0.872 | 0.904 |
| CNNwi | 0.988 (0.977-0.999) | 0.959 (0.939-0.974) | 0.947 | 0.910 | 0.928 |
| LSTMrj | 0.982 (0.972-0.992) | 0.943 (0.920-0.961) | 0.919 | 0.878 | 0.898 |
| LSTMwk | 0.975 (0.960-0.990) | 0.937 (0.913-0.956) | 0.918 | 0.859 | 0.887 |
aAUC: area under the receiver operating characteristic curve.
bCI: 95% confidence intervals for the AUC.
cNB: naïve Bayes.
dDT: decision tree.
eRF: random forest.
fSVM: support vector machine.
gMLP: multilayer perceptron.
hCNNr: convolutional neural network with randomly initialized word embeddings.
iCNNw: convolutional neural network with Word2vec word embeddings.
jLSTMr: long short-term memory with randomly initialized word embeddings.
kLSTMw: long short-term memory with Word2vec word embeddings.
The metrics for training on the 2012 to 2017 data set and testing on the 2018 to 2019 holdout test set using both International Classification of Diseases labels and gold standard labels.
| Model | AUCa (95% CIb) | Accuracy (95% CI) | Precision | Recall | F1 score | ||||||
|
| |||||||||||
|
| CNNrd | 0.999 (0.998-1.000) | 0.985 (0.957-0.997) | 0.980 | 0.990 | 0.985 | |||||
|
| CNNwe | 0.998 (0.996-1.000) | 0.970 (0.936-0.989) | 0.980 | 0.960 | 0.970 | |||||
|
| LSTMrf | 0.997 (0.991-1.000) | 0.980 (0.950-0.995) | 0.990d | 0.970 | 0.980 | |||||
|
| LSTMwg | 0.997 (0.994-1.000) | 0.960 (0.923-0.983) | 0.989 | 0.930 | 0.959 | |||||
|
| |||||||||||
|
| CNNrc | 0.981 (0.966-0.997) | 0.915 (0.867-0.950) | 0.832 | 1.000 | 0.908 | |||||
|
| CNNwe | 0.981 (0.965-0.997) | 0.920 (0.873-0.954) | 0.847 | 0.988 | 0.912 | |||||
|
| LSTMrf | 0.968 (0.946-0.989) | 0.910 (0.861-0.946) | 0.837 | 0.976 | 0.901 | |||||
|
| LSTMwg | 0.967 (0.945-0.989) | 0.920 (0.873-0.954) | 0.862 | 0.964 | 0.910 | |||||
aAUC: area under the receiver operating characteristic curve.
bCI: 95% confidence intervals for the AUC.
cICD: International Classification of Diseases.
dCNNr: convolutional neural network with randomly initialized word embeddings.
eCNNw: convolutional neural network with Word2vec word embeddings.
fLSTMr: long short-term memory with randomly initialized word embeddings.
gLSTMw: long short-term memory with Word2vec word embeddings.
Figure 2The area under the receiver operating characteristic curve for training on the 2012 to 2017 data set and testing on the holdout test set (2018-2019) using (1) International Classification of Diseases labels and (2) gold standard labels. AUC: area under the receiver operating characteristic curve; ICD: International Classification of Diseases; CNNr: convolutional neural network with randomly initialized word embeddings; CNNw: convolutional neural network with Word2vec word embeddings; LSTMr: long short-term memory with randomly initialized word embedding; LSTMw: long short-term memory with Word2vec word embedding.
The metrics for models trained on notes preceding the first intentional self-harm visits in patients presenting during the 2012 to 2017 time frame and tested on notes preceding the first intentional self-harm visits in patients presenting during the 2018 to 2019 time frame.
| Model | AUCa (95% CIb) | Accuracy (95% CI) | Precision | Recall | F1 score |
| CNNrc | 0.882 (0.871-0.891) | 0.792 (0.774-0.807) | 0.863 | 0.694 | 0.769 |
| CNNwd | 0.869 (0.858-0.879) | 0.782 (0.766-0.792) | 0.860 | 0.673 | 0.755 |
| LSTMre | 0.850 (0.827-0.877) | 0.758 (0.729-0.788) | 0.830 | 0.656 | 0.729 |
| LSTMwf | 0.846 (0.819-0.871) | 0.750 (0.717-0.778) | 0.822 | 0.644 | 0.720 |
aAUC: area under the receiver operating characteristic curve.
bCI: 95% confidence intervals for the AUC.
bCNNr: convolutional neural network with randomly initialized word embeddings.
dCNNw: convolutional neural network with Word2vec word embeddings.
eLSTMr: long short-term memory with randomly initialized word embeddings.
fLSTMw: long short-term memory with Word2vec word embeddings.
Figure 3The mean area under the receiver operating characteristic curve and 95% CI for models trained on notes preceding the first intentional self-harm visits in patients presenting during the 2012 to 2017 time frame and tested on notes preceding the first intentional self-harm visits in patients presenting during the 2018 to 2019 time frame. The differences in performance were all significant (P<.001) except for the difference between the LSTMr and LSTMw. AUC: area under the receiver operating characteristic curve; CNNr: convolutional neural network with randomly initialized word embeddings; CNNw: convolutional neural network with Word2vec word embeddings; LSTMr: long short-term memory with randomly initialized word embedding; LSTMw: long short-term memory with Word2vec word embedding.