| Literature DB >> 31672168 |
Eben Holderness1,2, Nicholas Miller1, Philip Cawkwell1, Kirsten Bolton1, Marie Meteer2, James Pustejovsky2, Mei-Hua Hall3.
Abstract
BACKGROUND: Readmission after discharge from a hospital is disruptive and costly, regardless of the reason. However, it can be particularly problematic for psychiatric patients, so predicting which patients may be readmitted is critically important but also very difficult. Clinical narratives in psychiatric electronic health records (EHRs) span a wide range of topics and vocabulary; therefore, a psychiatric readmission prediction model must begin with a robust and interpretable topic extraction component.Entities:
Keywords: Electronic health record; Machine learning; Natural language processing; Psychotic disorders; Risk prediction
Mesh:
Year: 2019 PMID: 31672168 PMCID: PMC6823956 DOI: 10.1186/s13326-019-0210-8
Source DB: PubMed Journal: J Biomed Semantics
Demographic breakdown of the target cohort
| Mean Age (2014) | 20.7 |
| Gender (Male) | 79% |
| Race | |
| Asian | 6% |
| Black | 7% |
| Caucasian | 77% |
| Latino | 5% |
| Multiracial | 5% |
| Insurance (Public) | 5.5% |
| 30-day Inpatient Readmission Rate | 14% |
Annotation scheme for the domain classification task
| Domain | Description | Example | Example Keywords |
|---|---|---|---|
| Appearance | Physical appearance, gestures, and mannerisms | “A well-appearing, clean young woman appearing her stated age, pleasant and cooperative. Eye contact was good." | Disheveled, clothing, groomed, wearing, clean |
| Thought Content | Suicidal/homicidal ideation, obsessions, phobias, delusions, hallucinations | “No SI, No HI, No hallucinations, Ideas of reference, Paranoid delusions" | Obsession, delusion, grandiose, ideation, suicidal, paranoid |
| Interpersonal | Family situation, friendships, and other social relationships | “Pt. overall appears to be functioning very well despite this conflict with a romantic interest of hers." | Boyfriend, relationship, peers, family, parents, social |
| Mood | Feelings and overall disposition | “Pt. indicates that his mood is becoming more ‘depressed.’" | Anxious, calm, depressed, labile, confused, cooperative |
| Occupation | School and/or employment | “Pt. followed through with decision to leave college at this point in time." | Boss, employed, job, school, class, homework, work |
| Thought Process | Pace and coherence of thoughts. Includes linear, goal-directed, perseverative, tangential, and flight of ideas | “Disorganized (Difficult to communicate with patient.), Paucity of thought, Thought-blocking." | Linear, tangential, prosody, blocking, goal-directed, perseverant |
| Substance | Drug and/or alcohol use | “Patient used marijuana once which he believes triggered the current episode." | Cocaine, marijuana, ETOH, addiction, narcotic |
| Other | Any example that does not fall into any of the other seven domains | “Maintain mood stabilization, prevent future episodes of mania, improve self-monitoring skills." | – |
Distribution of gold standard sentences and tokens across risk factor domains
| Total Sentences | Total Tokens | |
|---|---|---|
| Appearance | 670 | 11648 |
| Mood | 793 | 17672 |
| Interpersonal | 574 | 11674 |
| Occupation | 664 | 14166 |
| Thought Content | 756 | 18785 |
| Thought Process | 663 | 11203 |
| Substance Use | 727 | 14793 |
| Totals | 4847 | 99941 |
| Sentences With >1 Domain | 222 | 8912 |
Inter-annotator agreement
| Labels | Fleiss’s Kappa | Cohen’s Multi-Kappa | Mean Accuracy |
|---|---|---|---|
| Overall | 0.575 | 0.571 | 0.746 |
| First Domain Only | 0.536 | 0.528 | 0.805 |
Fig. 1Data pipeline for training and evaluating our risk factor domain classifiers
Architectures of our highest-performing MLP and RBF networks
| Network | MLP | RBF |
|---|---|---|
| Input Layer | ||
| Nodes | 512 | 512 |
| Dropout | 0.2 | 0.2 |
| Activation | ReLU | ReLU |
| Hidden Layer | ||
| Nodes | 250 | 700 |
| Dropout | 0.5 | 0.0 |
| Activation | ReLU | RBF |
| Output Layer | ||
| Nodes | 7 | 7 |
| Activation | Sigmoid | Linear |
| Optimizer | Adam | Adam |
| Loss Function | Categorical Cross Entropy | Mean Squared Error |
| Training Epochs | 60 | 50 |
| Batch Size | 128 | 128 |
Overall and domain-specific Precision, Recall, and F1 scores for our models
| Precision | Recall | F1 | |
| Aggregate Cosine Similarity Scores | 0.626 | 0.692 | 0.657 |
| MLP Baseline (No MWEs) | 0.816 | 0.830 | 0.823 |
| RBF Baseline (No MWEs) | 0.795 | 0.808 | 0.801 |
| MLP (w/ MWEs) | 0.821 | 0.835 | 0.828 |
| Appearance | 0.953 | 0.825 | 0.884 |
| Interpersonal | 0.843 | 0.897 | 0.869 |
| Mood | 0.723 | 0.816 | 0.767 |
| Occupation | 0.945 | 0.834 | 0.886 |
| Substance | 0.898 | 0.946 | 0.921 |
| Thought Content | 0.830 | 0.685 | 0.751 |
| Thought Process | 0.792 | 0.878 | 0.833 |
| Other | 0.509 | 0.614 | 0.557 |
| RBF (w/ MWEs) | 0.814 | 0.799 | 0.806 |
| Appearance | 0.952 | 0.803 | 0.871 |
| Interpersonal | 0.929 | 0.882 | 0.905 |
| Mood | 0.748 | 0.759 | 0.754 |
| Occupation | 0.956 | 0.847 | 0.898 |
| Substance | 0.826 | 0.927 | 0.874 |
| Thought Content | 0.866 | 0.685 | 0.765 |
| Thought Process | 0.958 | 0.818 | 0.883 |
| Other | 0.405 | 0.411 | 0.408 |
Fig. 22-component linear discriminant analysis of the RPDR training data