| Literature DB >> 25006878 |
Hongzhang Zheng1, Holly Gaff2, Gary Smith3, Sylvain DeLisle1.
Abstract
BACKGROUNDS: Electronic medical records (EMR) form a rich repository of information that could benefit public health. We asked how structured and free-text narrative EMR data should be combined to improve epidemic surveillance for acute respiratory infections (ARI).Entities:
Mesh:
Year: 2014 PMID: 25006878 PMCID: PMC4090236 DOI: 10.1371/journal.pone.0100845
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Description of ARI CDAs.
| Category | Subcategory | Case-Detection Algorithm Number | |||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | ||
|
| CDC ICD-9 Codes | • | |||||||
| VA ICD-9 Codes | • | • | • | • | • | • | |||
| OR Cough Remedies | • | • | • | ||||||
| OR Temperature ≥38°C | • | ||||||||
| OR Text of Clinical Note | • | • | • | ||||||
| AND Text of Clinical Note | • | • | |||||||
|
| Sensitivity (%) | 63 | 79 | 85 | 88 | 97 | 99 | 69 | 75 |
| (57, 69) | (74, 84) | (80, 88) | (83, 91) | (94, 99) | (96, 100) | (64, 75) | (70, 80) | ||
| Specificity (%) | 92 | 97 | 96 | 93 | 90 | 89 | 99 | 99 | |
| (91, 92) | (96, 97) | (95, 96) | (92, 93) | (90, 91) | (89, 90) | (99, 99) | (98, 99) | ||
| PPV (%) | 13 | 31 | 25 | 18 | 16 | 14 | 54 | 49 | |
| (11, 15) | (28, 34) | (22, 27) | (16, 20) | (14, 18) | (13, 16) | (49, 59) | (44, 54) | ||
| Area under the ROC | 78 | 88 | 90 | 90 | 94 | 94 | 84 | 87 | |
| (75, 80) | (85, 90) | (88, 92) | (88, 92) | (93, 95) | (93, 95) | (81, 87) | (84, 89) | ||
Composition and performance of the eight (8) ARI CDAs used in this study. Individual CDAs are numbered in the first row. Black dot in columns 3–11 indicates that a component (column 2) is included in the corresponding CDA. Note that “performance” refers to ability to detect single cases with possible ARI. Performance numbers in parenthesis indicate 95% confidence limits.
Figure 1Simulated prospective surveillance cycle.
Upper panel displays daily counts time series of authentic cases identified by CDA 2, either alone (black diamonds) or combined with simulated cases provided by the epidemic model for a community influenza outbreak that began at day zero (red circles). Lower panel shows the corresponding EARS W2c statistic for both time series (authentic cases alone (black diamonds) or combined with simulated epidemic cases (red circles)). True positive alarms occur when the value of the W2c statistic exceeds a threshold in the combined dataset while remaining sub threshold in the background dataset. For this 80-day surveillance cycle, at the arbitrarily set threshold of 3.2 (blue horizontal line), the time to the first true-positive alarm (detection delay) is 19 days. A false positive alarm occurs at day 50, when the statistic originating from the background-only dataset exceeds threshold.
Figure 2System performance using alternative case-detection methods.
AMOC curves displaying epidemic Detection Delay (days) as a function of daily false alert rate (FAR) (upper panel) or yearly caseload (lower panel). Each curve represent an alternative CDA: CDA 1 (grey stars), CDA 2 (black circles), CDA 3 (red triangles), CDA 4 (green crosses), CDA 5 (blue x's), CDA 6 (teal diamonds), CDA 7 (purple triangles), CDA 8 (yellow stars).