| Literature DB >> 28380048 |
Ye Ye1,2, Michael M Wagner1,2, Gregory F Cooper1,2, Jeffrey P Ferraro3,4, Howard Su1, Per H Gesteland3,4,5, Peter J Haug3,4, Nicholas E Millett1, John M Aronis1, Andrew J Nowalk6, Victor M Ruiz1, Arturo López Pineda7, Lingyun Shi1, Rudy Van Bree4, Thomas Ginter8, Fuchiang Tsui1,2.
Abstract
OBJECTIVES: This study evaluates the accuracy and transferability of Bayesian case detection systems (BCD) that use clinical notes from emergency department (ED) to detect influenza cases.Entities:
Mesh:
Year: 2017 PMID: 28380048 PMCID: PMC5381795 DOI: 10.1371/journal.pone.0174970
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study design.
(A) Development—Two BCDs were developed at IH and UPMC, respectively. At each site, a rule-based parser was manually built by a knowledge engineer based on expert-annotated sample notes. A Bayesian network classifier was machine-learned from a local training set. (B) Test—Test datasets were created using local encounters (continuous arrows) or non-local encounters (dashed arrows) to evaluate both local performance and transferability. Bayesian network classifier’s abilities to discriminate a case of (1) influenza from non-influenza and (2) influenza from non-influenza influenza-like illness (NI-ILI) were evaluated. Not shown—an algorithm limited encounter data included in the test dataset based on time since registration.
Summary of training and test datasets.
| Datasets | Measures | IH | UPMC |
|---|---|---|---|
| Encounters dates | 1/2008 to 5/2010 | 1/2008 to 5/2010 | |
| # of encounters | 47,504 | 41,189 | |
| # of | 1,858 | 915 | |
| # of | 15,989 | 3,040 | |
| # of | 29,657 | 37,234 | |
| # of clinical notes | 60,344 (1.2 per encounter) | 76,467 (1.9 per encounter) | |
| # of finding extracted by UPMC parser | 877,377 (18 per encounter) | 1,031,134 (25 per encounter) | |
| # of finding extracted by IH parser | 934,414 (20 per encounter) | 849,932 (21 per encounter) | |
| Encounters dates | 6/2010 to 5/2011 | 6/2010 to 5/2011 | |
| # of encounters | 182,386 | 238,722 | |
| # of | 661 | 339 | |
| # of | 5,722 | 1,567 | |
| # of | 176,003 | 236,816 | |
| # of clinical notes | 220,276 (1.2 per encounter) | 480,059 (2 per encounter) | |
| # of findings extracted by UPMC parser | 2,822,282 (15 per encounter) | 6,305,782 (26 per encounter) | |
| # of findings extracted by IH parser | 2,950,928 (16 per encounter) | 5,361,241 (22 per encounter) |
aFor training purposes, we only used other encounters during the summer period from July 1, 2009 to August 31, 2009.
Transferability studies.
| Objectives | System and location used to measure transferability | Training dataset (source of notes / parser) | Resulting Bayesian network classifier | Test dataset (source of notes / parser) |
|---|---|---|---|---|
| BCDIH at IH | IH / IH | BNIH&NLPIH | IH / IH | |
| BCDIH transferred to UPMC | IH / IH | BNIH&NLPIH | UPMC / IH | |
| BCDIH transferred to UPMC with relearning of BN with UPMC data | UPMC / IH | BNUPMC&NLPIH | UPMC / IH | |
| BCDUPMC at UPMC | UPMC / UPMC | BNUPMC&NLPUPMC | UPMC / UPMC | |
| BCDUPMC transferred to IH | UPMC / UPMC | BNUPMC&NLPUPMC | IH / UPMC | |
| BCDUPMC transferred to IH with relearning of BN with IH data | IH / UPMC | BNIH&NLPUPMC | IH / UPMC |
aThe subscript associated with BN refers to the source of training dataset.
Seven factors affecting the performance of a Bayesian case detection system.
| Factor | Meaning of the factor | Candidate Configurations |
|---|---|---|
| Development Institution | The institution that provides training data for BCD development | IH, UPMC |
| Development Parser | The parser that is used to extract training findings for BCD development | IH parser, UPMC parser |
| Application Institution | The institution where a developed BCD is applied | IH, UPMC |
| Application Parser | The parser that is used to extract findings when applying a BCD to the application institution | IH parser, UPMC parser |
| NLP Transfer | The condition of whether a parser has been developed locally or not: if a parser has been developed in another institution, then the parser is transferred. Otherwise, the parser is not transferred. | Yes, No |
| BN Transfer | The condition of whether a Bayesian network has been developed locally or not: if a Bayesian network had been developed in another institution, then it is transferred. Otherwise, it is not transferred. | Yes, No |
| Classification Task | Two classification tasks: 1) influenza vs. non-influenza, and 2) influenza vs. non-influenza influenza-like illness (NI-ILI) | FLU_NONFLU, FLU_NI-ILI |
Clinical findings included in the four Bayesian network classifiers.
| Clinical findings in networks | IH training notes | UPMC training notes | ||
|---|---|---|---|---|
| BNIH&NLPIH (11 features) | BNIH&NLPUPMC (12 features) | BNUPMC&NLPUPMC (13 features) | BNUPMC&NLPIH (8 features) | |
| X | X | X | X | |
| X | X | X | X | |
| X | X | X | X | |
| X | X | X | X | |
| X | X | X | ||
| X | X | X | ||
| X | X | |||
| X | X | X | ||
| X | X | |||
| X | ||||
| X | ||||
| X | ||||
| X | X | |||
| X | X | |||
| X | ||||
| X | ||||
| X | ||||
| X | ||||
| X | ||||
| X | ||||
| X | ||||
| X | ||||
The design and purpose of these Bayesian network classifiers have been listed in Table 2.
The name of each classifier labels the source of training note and the NLP parser.
For example, BNIH&NLPIH represents the Bayesian network learned with IH clinical findings extracted by the IH parser.
“X” indicates that a Bayesian network classifier includes the clinical finding listed on the same row.
Fig 2Four Bayesian network classifiers developed using datasets distinguished by data resources and NLP parsers.
GeNIe visualization [62].
Performance and transferability of the influenza detection systems.
| Discrimination | Time Delay | BCDIH at IH [Local BCD] | BCDIH transferred to UPMC | BCDIH transferred with relearning | BCDUPMC at UPMC [Local BCD] | BCDUPMC transferred to IH | BCDUPMC transferred with relearning |
|---|---|---|---|---|---|---|---|
| Day 0 | 0.74 (0.73,0.76) | 0.80 (0.78,0.82) | 0.77 (0.75,0.8) | 0.80 (0.78,0.82) | 0.65 (0.63,0.67) | 0.74 (0.73,0.76) | |
| Day 1 | 0.92 (0.91,0.93) | 0.83 (0.8,0.85) | 0.83 (0.8,0.85) | 0.90 (0.89,0.92) | 0.92 (0.91,0.93) | 0.93 (0.92,0.93) | |
| Day 2 | 0.92 (0.92,0.93) | 0.82 (0.79,0.84) | 0.82 (0.79,0.84) | 0.93 (0.92,0.94) | 0.93 (0.92,0.94) | 0.93 (0.92,0.94) | |
| Day 3 | 0.93 (0.92,0.93) | 0.83 (0.8,0.85) | 0.82 (0.79,0.84) | 0.94 (0.93,0.95) | 0.93 (0.93,0.94) | 0.93 (0.92,0.94) | |
| Day 4 | 0.93 (0.92,0.94) | 0.84 (0.81,0.87) | 0.83 (0.8,0.85) | 0.94 (0.93,0.96) | 0.93 (0.93,0.94) | 0.93 (0.93,0.94) | |
| Day 5 | 0.93 (0.92,0.94) | 0.85 (0.82,0.87) | 0.83 (0.8,0.85) | 0.95 (0.93,0.96) | 0.93 (0.93,0.94) | 0.93 (0.93,0.94) | |
| Day 6 | 0.93 (0.92,0.94) | 0.86 (0.83,0.88) | 0.83 (0.81,0.86) | 0.95 (0.94,0.96) | 0.94 (0.93,0.94) | 0.93 (0.93,0.94) | |
| Day 0 | 0.48 (0.46,0.50) | 0.54 (0.51,0.58) | 0.65 (0.62,0.68) | 0.65 (0.62,0.68) | 0.63 (0.61,0.65) | 0.52 (0.49,0.54) | |
| Day 1 | 0.67 (0.65,0.70) | 0.57 (0.54,0.61) | 0.65 (0.62,0.68) | 0.70 (0.67,0.74) | 0.74 (0.71,0.76) | 0.73 (0.71,0.75) | |
| Day 2 | 0.68 (0.66,0.71) | 0.57 (0.54,0.61) | 0.63 (0.6,0.67) | 0.73 (0.7,0.76) | 0.74 (0.72,0.77) | 0.74 (0.72,0.76) | |
| Day 3 | 0.68 (0.66,0.71) | 0.59 (0.55,0.62) | 0.63 (0.59,0.66) | 0.75 (0.71,0.78) | 0.75 (0.73,0.77) | 0.74 (0.72,0.76) | |
| Day 4 | 0.69 (0.67,0.71) | 0.60 (0.56,0.64) | 0.63 (0.6,0.67) | 0.75 (0.72,0.78) | 0.75 (0.73,0.77) | 0.74 (0.72,0.76) | |
| Day 5 | 0.69 (0.67,0.71) | 0.60 (0.57,0.64) | 0.63 (0.6,0.67) | 0.75 (0.72,0.78) | 0.75 (0.72,0.77) | 0.74 (0.72,0.76) | |
| Day 6 | 0.69 (0.67,0.71) | 0.61 (0.58,0.65) | 0.64 (0.61,0.67) | 0.75 (0.72,0.79) | 0.75 (0.73,0.77) | 0.74 (0.72,0.76) | |
Parentheses indicate the 95% C.I.s for the areas under the ROC curves.
Information delay of all ED encounters between June 1, 2010 and May 31, 2011 at UPMC and IH.
| Day information becomes available relative to registration day | First encounter notes | Complete set of encounter notes | ||
|---|---|---|---|---|
| UPMC | IH | UPMC | IH | |
| 59.6% | 38.2% | 50.6% | 35.1% | |
| 80.8% | 94.4% | 74.6% | 90.4% | |
| 87.6% | 97.6% | 83.5% | 94.7% | |
| 92.0% | 98.7% | 89.0% | 96.5% | |
| 94.7% | 99.1% | 92.3% | 97.5% | |
| 96.4% | 99.4% | 94.4% | 98.1% | |
| 97.5% | 99.5% | 95.9% | 98.5% | |
Fig 3AUCs of BCDIH and BCDUPMC for discriminating between influenza and non-influenza over different time delays.
Fig 4AUCs of BCDIH and BCDUPMC for discriminating between influenza and NI-ILI over different time delays.