| Literature DB >> 25379126 |
Stephanie W Haas1, Debbie Travers2, Anna Waller3, Deepika Mahalingam4, John Crouch3, Todd A Schwartz5, Javed Mostafa6.
Abstract
OBJECTIVE: Automated syndrome classification aims to aid near real-time syndromic surveillance to serve as an early warning system for disease outbreaks, using Emergency Department (ED) data. We present a system that improves the automatic classification of an ED record with triage note into one or more syndrome categories using the vector space model coupled with a 'learning' module that employs a pseudo-relevance feedback mechanism.Entities:
Keywords: Disease outbreaks; electronic health records/classification; machine learning; natural language processing; public health informatics; public health surveillance/methods
Year: 2014 PMID: 25379126 PMCID: PMC4221085 DOI: 10.5210/ojphi.v6i2.5469
Source DB: PubMed Journal: Online J Public Health Inform ISSN: 1947-2579
Examples of chief complaints and triage notes
| | | |
| Fever | | Gastrointestinal |
| Fever | Pt c/o wheezy
| Respiratory |
| Fever | Mom reports child awoke
w/fine | Fever-Rash |
Baseline NC DETECT Performance
|
|
|
| |
|
| 3353 | 3699 | 3640 |
|
| 0.28 | 0.23 | 0.45 |
|
| 0.97 | 0.99 | 0.99 |
Figure 1EMT-C Architecture
Figure 2Creation of Master Term List (MTL) Vector
Distribution of records in pilot and final test sets
|
|
|
| ||
|
|
|
|
| |
| Gastrointestinal | 80 (16.5%) | 405 (83.5%) | 23 (0.8%) | 3030 (99.2%) |
| Respiratory | 87 (17.9%) | 398 (82.1%) | 171 (5.6%) | 2882 (94.4%) |
| Fever-Rash | 5 (1.0%) | 480 (99.0%) | 249 (8.2%) | 2804 (91.8%) |
Maximum weighted Sensitivity and Specificity values, and the variation settings for the configurations that produced them, on Final Test Set (n = 3053 weighted to 1.34 million).
|
|
|
| |
|
| 0.97 | 0.91 | >0.99 |
|
| 0.94 | 0.91 | 0.93 |
Sensitivity and specificity values for all system configurations of EMT-C on Pilot Test Set (N = 485). Maximum values for sensitivity (Se) and specificity (Sp) are bolded. Configuration settings based on 4 system variations 1) Augmentation of Master Term List, 2) Source of terms for record vector, 3) Use of exclusion terms, 4) Use of pseudo-relevance feedback.
|
|
|
|
| |||||||||
|
| ||||||||||||
|
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 1. MTL terms: | 0.89 | 0.86 | 0.90 | 0.71 | 0.87 | 0.86 |
| 0.73 |
|
|
| .086 |
| 1. MTL terms: | 0.76 | 0.88 |
| 0.56 | 0.83 | 0.86 |
| 0.73 |
|
|
|
|
| 1. MTL terms: | 0.88 | 0.86 | 0.94 | 0.57 | 0.86 | 0.86 |
| 0.73 |
| 0.58 |
| 0.57 |
| 1. MTL terms | 0.75 | 0.88 | 0.90 | 0.59 | 0.82 |
| 0.95 | 0.73 |
| 0.59 |
| 0.58 |
|
| ||||||||||||
| 1. MTL terms: | 0.83 | 0.88 | 0.84 | 0.78 | 0.87 | 0.86 |
| 0.73 |
|
|
|
|
| 1. MTL terms: | 0.71 |
| 0.90 | .063 | 0.83 | 0.86 |
| 0.73 |
|
|
|
|
| 1. MTL terms: | 0.83 | 0.88 | 0.88 | 0.67 | 0.86 | 0.86 |
| 0.73 |
| 0.59 |
| 0.58 |
| 1. MTL terms | 0.71 |
| 0.85 | 0.66 | 0.82 |
| 0.95 | 0.73 |
| 0.60 |
| 0.58 |
Weighted sensitivity and specificity for all system configurations of EMT-C on Final Test Set (N=3053 weighted to 1.34 million). Maximum values for sensitivity (Se) and specificity (Sp) are bolded. Configuration settings based on 4 system variations 1) Augmentation of Master Term List, 2) Source of terms for record vector, 3) Use of exclusion terms, 4) Use of pseudo-relevance feedback.
|
|
|
|
| |||||||||
|
| ||||||||||||
|
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 1. MTL terms: | 0.83 | 0.92 |
| 0.76 | 0.65 | 0.89 |
| 0.62 |
| 0.91 |
| 0.69 |
| 1. MTL terms: | 0.77 | 0.93 | 0.86 | 0.79 | 0.54 |
| 0.87 | 0.64 | 0.96 |
| 0.98 | 0.79 |
| 1. MTL terms: | 0.94 | 0.76 | 0.94 | 0.78 | 0.64 | 0.90 | 0.81 | 0.80 |
| 0.91 |
| 0.73 |
| 1. MTL terms | 0.92 | 0.80 | 0.86 | 0.81 | 0.54 |
| 0.76 | 0.80 | 0.96 |
| 0.98 | 0.76 |
|
| ||||||||||||
| 1. MTL terms: | 0.77 | 0.93 | 0.91 | 0.79 | 0.65 | 0.89 |
| 0.62 |
| 0.91 |
| 0.69 |
| 1. MTL terms: | 0.72 |
| 0.80 | 0.81 | 0.54 |
| 0.87 | 0.64 | 0.96 |
| 0.98 | 0.79 |
| 1. MTL terms: | 0.89 | 0.80 | 0.89 | 0.81 | 0.64 | 0.90 | 0.81 | 0.80 |
| 0.91 |
| 0.73 |
| 1. MTL terms | 0.86 | 0.83 | 0.83 | 0.80 | 0.54 |
| 0.76 | 0.80 | 0.96 |
| 0.98 | 0.76 |