Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Comparison of machine learning classifiers for influenza detection from emergency department free-text reports.

Literature DB >> 26385375

Comparison of machine learning classifiers for influenza detection from emergency department free-text reports.

Arturo López Pineda¹, Ye Ye², Shyam Visweswaran², Gregory F Cooper², Michael M Wagner², Fuchiang Rich Tsui³.

Abstract

Influenza is a yearly recurrent disease that has the potential to become a pandemic. An effective biosurveillance system is required for early detection of the disease. In our previous studies, we have shown that electronic Emergency Department (ED) free-text reports can be of value to improve influenza detection in real time. This paper studies seven machine learning (ML) classifiers for influenza detection, compares their diagnostic capabilities against an expert-built influenza Bayesian classifier, and evaluates different ways of handling missing clinical information from the free-text reports. We identified 31,268 ED reports from 4 hospitals between 2008 and 2011 to form two different datasets: training (468 cases, 29,004 controls), and test (176 cases and 1620 controls). We employed Topaz, a natural language processing (NLP) tool, to extract influenza-related findings and to encode them into one of three values: Acute, Non-acute, and Missing. Results show that all ML classifiers had areas under ROCs (AUC) ranging from 0.88 to 0.93, and performed significantly better than the expert-built Bayesian model. Missing clinical information marked as a value of missing (not missing at random) had a consistently improved performance among 3 (out of 4) ML classifiers when it was compared with the configuration of not assigning a value of missing (missing completely at random). The case/control ratios did not affect the classification performance given the large number of training cases. Our study demonstrates ED reports in conjunction with the use of ML and NLP with the handling of missing value information have a great potential for the detection of infectious diseases.

Entities: Chemical Disease Species

Keywords: Bayesian; Case detection; Emergency department reports; Influenza; Machine learning

Mesh：

Year: 2015 PMID： 26385375 PMCID： PMC4684714 DOI： 10.1016/j.jbi.2015.08.019

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

25 in total

1. The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors: Olivier Bodenreider
Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971

2. Automated encoding of clinical documents based on natural language processing.

Authors: Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal: J Am Med Inform Assoc Date: 2004-06-07 Impact factor: 4.497

Review 3. Real-time PCR in clinical microbiology: applications for routine laboratory testing.

Authors: M J Espy; J R Uhl; L M Sloan; S P Buckwalter; M F Jones; E A Vetter; J D C Yao; N L Wengenack; J E Rosenblatt; F R Cockerill; T F Smith
Journal: Clin Microbiol Rev Date: 2006-01 Impact factor: 26.132

4. An efficient bayesian method for predicting clinical outcomes from genome-wide data.

Authors: Gregory F Cooper; Pablo Hennings-Yeomans; Shyam Visweswaran; Michael Barmada
Journal: AMIA Annu Symp Proc Date: 2010-11-13

5. Design and performance of the CDC real-time reverse transcriptase PCR swine flu panel for detection of 2009 A (H1N1) pandemic influenza virus.

Authors: Bo Shu; Kai-Hui Wu; Shannon Emery; Julie Villanueva; Roy Johnson; Erica Guthrie; LaShondra Berman; Christine Warnes; Nathelia Barnes; Alexander Klimov; Stephen Lindstrom
Journal: J Clin Microbiol Date: 2011-05-18 Impact factor: 5.948

6. Performance of six influenza rapid tests in detecting human influenza in clinical specimens.

Authors: Aeron C Hurt; Robert Alexander; Jan Hibbert; Nicola Deed; Ian G Barr
Journal: J Clin Virol Date: 2007-04-23 Impact factor: 3.168

7. [Comparison study of a real-time reverse transcription polymerase chain reaction assay with an enzyme immunoassay and shell vial culture for influenza A and B virus detection in adult patients].

Authors: Jordi Reina; Virginia Plasencia; Maria Leyes; Antonio Nicolau; Antonia Galmés; Gabriel Arbona
Journal: Enferm Infecc Microbiol Clin Date: 2009-05-23 Impact factor: 1.731

8. Infection and death from influenza A H1N1 virus in Mexico: a retrospective analysis.

Authors: Santiago Echevarría-Zuno; Juan Manuel Mejía-Aranguré; Alvaro J Mar-Obeso; Concepción Grajales-Muñiz; Eduardo Robles-Pérez; Margot González-León; Manuel Carlos Ortega-Alvarez; Cesar Gonzalez-Bonilla; Ramón Alberto Rascón-Pacheco; Víctor Hugo Borja-Aburto
Journal: Lancet Date: 2009-11-11 Impact factor: 79.321

9. Naïve Bayesian Classifier and Genetic Risk Score for Genetic Risk Prediction of a Categorical Trait: Not so Different after all!

Authors: Paola Sebastiani; Nadia Solovieff; Jenny X Sun
Journal: Front Genet Date: 2012-02-29 Impact factor: 4.599

10. Estimates of the prevalence of pandemic (H1N1) 2009, United States, April-July 2009.

Authors: Carrie Reed; Frederick J Angulo; David L Swerdlow; Marc Lipsitch; Martin I Meltzer; Daniel Jernigan; Lyn Finelli
Journal: Emerg Infect Dis Date: 2009-12 Impact factor: 6.883

23 in total

1. Controlling testing volume for respiratory viruses using machine learning and text mining.

Authors: Mark V Mai; Michael Krauthammer
Journal: AMIA Annu Symp Proc Date: 2017-02-10

Review 2. Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors: D Demner-Fushman; N Elhadad
Journal: Yearb Med Inform Date: 2016-11-10

3. The effects of natural language processing on cross-institutional portability of influenza case detection for disease surveillance.

Authors: Jeffrey P Ferraro; Ye Ye; Per H Gesteland; Peter J Haug; Fuchiang Rich Tsui; Gregory F Cooper; Rudy Van Bree; Thomas Ginter; Andrew J Nowalk; Michael Wagner
Journal: Appl Clin Inform Date: 2017-05-31 Impact factor: 2.342

4. Automatic prediction of coronary artery disease from clinical narratives.

Authors: Kevin Buchan; Michele Filannino; Özlem Uzuner
Journal: J Biomed Inform Date: 2017-06-27 Impact factor: 6.317

5. Automated influenza case detection for public health surveillance and clinical diagnosis using dynamic influenza prevalence method.

Authors: Fuchiang Tsui; Ye Ye; Victor Ruiz; Gregory F Cooper; Michael M Wagner
Journal: J Public Health (Oxf) Date: 2018-12-01 Impact factor: 2.341

6. Longitudinal analysis of social and behavioral determinants of health in the EHR: exploring the impact of patient trajectories and documentation practices.

Authors: Daniel J Feller; Jason Zucker; Oliver Bear Don't Walk; Michael T Yin; Peter Gordon; Noémie Elhadad
Journal: AMIA Annu Symp Proc Date: 2020-03-04