| Literature DB >> 24489669 |
Chris Poulin1, Brian Shiner2, Paul Thompson1, Linas Vepstas3, Yinong Young-Xu2, Benjamin Goertzel4, Bradley Watts2, Laura Flashman5, Thomas McAllister5.
Abstract
We developed linguistics-driven prediction models to estimate the risk of suicide. These models were generated from unstructured clinical notes taken from a national sample of U.S. Veterans Administration (VA) medical records. We created three matched cohorts: veterans who committed suicide, veterans who used mental health services and did not commit suicide, and veterans who did not use mental health services and did not commit suicide during the observation period (n = 70 in each group). From the clinical notes, we generated datasets of single keywords and multi-word phrases, and constructed prediction models using a machine-learning algorithm based on a genetic programming framework. The resulting inference accuracy was consistently 65% or more. Our data therefore suggests that computerized text analytics can be applied to unstructured medical records to estimate the risk of suicide. The resulting system could allow clinicians to potentially screen seemingly healthy patients at the primary care level, and to continuously evaluate the suicide risk among psychiatric patients.Entities:
Mesh:
Year: 2014 PMID: 24489669 PMCID: PMC3904866 DOI: 10.1371/journal.pone.0085733
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Possible Relationships of Key Words and Known Domains of Suicide Risk Factors.
| Domain | Association of Domain with Suicide (word frequency) | ||
| Known Link | Possible Link | Unknown | |
|
| Agitation (24) | ||
| Frightened (18) | |||
| Delusional (11) | |||
| Tense (7) | |||
| Aggravated (5) | |||
|
| Vtach (15) | ||
| Tach (9) | |||
|
| Quadrants (11) | ||
| ALOH (10) | |||
| Subsalicylate (9) | |||
| MGOH (7) | |||
| Pylori (5) | |||
|
| Nebulizer (8) | ||
| Secretions (5) | |||
| Rhonchi (5) | |||
|
| Terminal (10) | ||
| Unresectable (3) | |||
| Cancers (2) | |||
|
| Analgesia (13) | ||
| Demerol (12) | |||
| Lumbago (5) | |||
|
| Integrated (5) | Adequately (23) | |
| Standards (14) | |||
| Clarify (7) | |||
|
| Format (8) | ||
| Happens (8) | |||
| Camera (7) | |||
| Bottom (7) | |||
Figure 1N-gram performance of the machine-learning algorithm applied to clinical notes. Where Count = Number of Models, Score = Accuracy, and the colors coordinate to model type.
Figure 2Terms displayed are those single words that were predictive for the suicide group (2).
Figure 3Terms displayed are those single words that were predictive for the psychiatric group (3).
Figure 4Terms displayed are those single words that were predictive for the control group (1).