| Literature DB >> 28739578 |
Joseph Geraci1,2,3, Pamela Wilansky1, Vincenzo de Luca1, Anvesh Roy1, James L Kennedy1, John Strauss1,3,4.
Abstract
BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression.Entities:
Keywords: deep learning; depression; neural network; phenotyping; youth
Mesh:
Year: 2017 PMID: 28739578 PMCID: PMC5566092 DOI: 10.1136/eb-2017-102688
Source DB: PubMed Journal: Evid Based Ment Health ISSN: 1362-0347
Example of the Document Term Matrix data used to train our models
| Patient | Frequency of | Frequency of | Frequency of | Frequency of |
| 1 | 0 | 0 | 0.014249584 | 0.02089797 |
| 3 | 0 | 0 | 0 | 0.000758773 |
| 4 | 0 | 0.01683432 | 0 | 0 |
| 5 | 0.00742017 | 0 | 0 | 0 |
Each column provides a frequency measure for the given word. The most predictive words make their way into the neural network model.
Figure 1The more sensitive DL1 method was initially applied. Following DL1, the more specific DL0 model was then used on the documents selected with DL1. DL, deep learning paradigm.
Performance of DL0 considering a fivefold cross-validation
| Predicted 0s | Predicted 1s | |
| True 0s | 639 | 18 |
| True 1s | 56 | 45 |
Sensitivity 44.5%; specificity 97%.
Note that it performs very well with rejecting unsuitable patients accurately, but it does not perform well with predicting suitable participants (the true 1s).
DL, deep learning paradigm.
Performance of DL1 considering a fivefold cross-validation
| Predicted 0s | Predicted 1s | |
| True 0s | 47 | 53 |
| True 1s | 11 | 90 |
Sensitivity 89%; specificity 53%.
In contrast to model DL0, this model is excellent at accurately predicting participants (true 1s) but is poor at rejecting inappropriate patients.
DL, deep learning paradigm.
Performance of DL0_2 considering a fivefold cross-validation
| Predicted 0s | Predicted 1s | |
| True 0s | 570 | 87 |
| True 1s | 25 | 76 |
Sensitivity 75%; specificity 87%.
DL, deep learning paradigm.
Performance of DL1+0 considering a fivefold cross-validation
| Predicted 0s | Predicted 1s | |
| True 0s | 73 | 5 |
| True 1s | 8 | 17 |
Sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%.
At first it appears that there is not a significant improvement obtained via this model but the user can be more certain that the output recommended candidates are more reliable than DL1 or DL0 alone.
DL, deep learning paradigm.