Literature DB >> 26385375

Comparison of machine learning classifiers for influenza detection from emergency department free-text reports.

Arturo López Pineda1, Ye Ye2, Shyam Visweswaran2, Gregory F Cooper2, Michael M Wagner2, Fuchiang Rich Tsui3.   

Abstract

Influenza is a yearly recurrent disease that has the potential to become a pandemic. An effective biosurveillance system is required for early detection of the disease. In our previous studies, we have shown that electronic Emergency Department (ED) free-text reports can be of value to improve influenza detection in real time. This paper studies seven machine learning (ML) classifiers for influenza detection, compares their diagnostic capabilities against an expert-built influenza Bayesian classifier, and evaluates different ways of handling missing clinical information from the free-text reports. We identified 31,268 ED reports from 4 hospitals between 2008 and 2011 to form two different datasets: training (468 cases, 29,004 controls), and test (176 cases and 1620 controls). We employed Topaz, a natural language processing (NLP) tool, to extract influenza-related findings and to encode them into one of three values: Acute, Non-acute, and Missing. Results show that all ML classifiers had areas under ROCs (AUC) ranging from 0.88 to 0.93, and performed significantly better than the expert-built Bayesian model. Missing clinical information marked as a value of missing (not missing at random) had a consistently improved performance among 3 (out of 4) ML classifiers when it was compared with the configuration of not assigning a value of missing (missing completely at random). The case/control ratios did not affect the classification performance given the large number of training cases. Our study demonstrates ED reports in conjunction with the use of ML and NLP with the handling of missing value information have a great potential for the detection of infectious diseases.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Bayesian; Case detection; Emergency department reports; Influenza; Machine learning

Mesh:

Year:  2015        PMID: 26385375      PMCID: PMC4684714          DOI: 10.1016/j.jbi.2015.08.019

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  25 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

Review 3.  Real-time PCR in clinical microbiology: applications for routine laboratory testing.

Authors:  M J Espy; J R Uhl; L M Sloan; S P Buckwalter; M F Jones; E A Vetter; J D C Yao; N L Wengenack; J E Rosenblatt; F R Cockerill; T F Smith
Journal:  Clin Microbiol Rev       Date:  2006-01       Impact factor: 26.132

4.  An efficient bayesian method for predicting clinical outcomes from genome-wide data.

Authors:  Gregory F Cooper; Pablo Hennings-Yeomans; Shyam Visweswaran; Michael Barmada
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

5.  Design and performance of the CDC real-time reverse transcriptase PCR swine flu panel for detection of 2009 A (H1N1) pandemic influenza virus.

Authors:  Bo Shu; Kai-Hui Wu; Shannon Emery; Julie Villanueva; Roy Johnson; Erica Guthrie; LaShondra Berman; Christine Warnes; Nathelia Barnes; Alexander Klimov; Stephen Lindstrom
Journal:  J Clin Microbiol       Date:  2011-05-18       Impact factor: 5.948

6.  Performance of six influenza rapid tests in detecting human influenza in clinical specimens.

Authors:  Aeron C Hurt; Robert Alexander; Jan Hibbert; Nicola Deed; Ian G Barr
Journal:  J Clin Virol       Date:  2007-04-23       Impact factor: 3.168

7.  [Comparison study of a real-time reverse transcription polymerase chain reaction assay with an enzyme immunoassay and shell vial culture for influenza A and B virus detection in adult patients].

Authors:  Jordi Reina; Virginia Plasencia; Maria Leyes; Antonio Nicolau; Antonia Galmés; Gabriel Arbona
Journal:  Enferm Infecc Microbiol Clin       Date:  2009-05-23       Impact factor: 1.731

8.  Infection and death from influenza A H1N1 virus in Mexico: a retrospective analysis.

Authors:  Santiago Echevarría-Zuno; Juan Manuel Mejía-Aranguré; Alvaro J Mar-Obeso; Concepción Grajales-Muñiz; Eduardo Robles-Pérez; Margot González-León; Manuel Carlos Ortega-Alvarez; Cesar Gonzalez-Bonilla; Ramón Alberto Rascón-Pacheco; Víctor Hugo Borja-Aburto
Journal:  Lancet       Date:  2009-11-11       Impact factor: 79.321

9.  Naïve Bayesian Classifier and Genetic Risk Score for Genetic Risk Prediction of a Categorical Trait: Not so Different after all!

Authors:  Paola Sebastiani; Nadia Solovieff; Jenny X Sun
Journal:  Front Genet       Date:  2012-02-29       Impact factor: 4.599

10.  Estimates of the prevalence of pandemic (H1N1) 2009, United States, April-July 2009.

Authors:  Carrie Reed; Frederick J Angulo; David L Swerdlow; Marc Lipsitch; Martin I Meltzer; Daniel Jernigan; Lyn Finelli
Journal:  Emerg Infect Dis       Date:  2009-12       Impact factor: 6.883

View more
  23 in total

1.  Controlling testing volume for respiratory viruses using machine learning and text mining.

Authors:  Mark V Mai; Michael Krauthammer
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

Review 2.  Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors:  D Demner-Fushman; N Elhadad
Journal:  Yearb Med Inform       Date:  2016-11-10

3.  The effects of natural language processing on cross-institutional portability of influenza case detection for disease surveillance.

Authors:  Jeffrey P Ferraro; Ye Ye; Per H Gesteland; Peter J Haug; Fuchiang Rich Tsui; Gregory F Cooper; Rudy Van Bree; Thomas Ginter; Andrew J Nowalk; Michael Wagner
Journal:  Appl Clin Inform       Date:  2017-05-31       Impact factor: 2.342

4.  Automatic prediction of coronary artery disease from clinical narratives.

Authors:  Kevin Buchan; Michele Filannino; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2017-06-27       Impact factor: 6.317

5.  Automated influenza case detection for public health surveillance and clinical diagnosis using dynamic influenza prevalence method.

Authors:  Fuchiang Tsui; Ye Ye; Victor Ruiz; Gregory F Cooper; Michael M Wagner
Journal:  J Public Health (Oxf)       Date:  2018-12-01       Impact factor: 2.341

6.  Longitudinal analysis of social and behavioral determinants of health in the EHR: exploring the impact of patient trajectories and documentation practices.

Authors:  Daniel J Feller; Jason Zucker; Oliver Bear Don't Walk; Michael T Yin; Peter Gordon; Noémie Elhadad
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

7.  Utilizing Chinese Admission Records for MACE Prediction of Acute Coronary Syndrome.

Authors:  Danqing Hu; Zhengxing Huang; Tak-Ming Chan; Wei Dong; Xudong Lu; Huilong Duan
Journal:  Int J Environ Res Public Health       Date:  2016-09-13       Impact factor: 3.390

8.  Detection and Prevention of Virus Infection.

Authors:  Ying Wang; Bairong Shen
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 2.622

Review 9.  Natural Language Processing for EHR-Based Computational Phenotyping.

Authors:  Zexian Zeng; Yu Deng; Xiaoyu Li; Tristan Naumann; Yuan Luo
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2018-06-25       Impact factor: 3.710

10.  An AI Approach for Identifying Patients With Cirrhosis.

Authors:  Jihad S Obeid; Ali Khalifa; Brandon Xavier; Halim Bou-Daher; Don C Rockey
Journal:  J Clin Gastroenterol       Date:  2021-07-08       Impact factor: 3.062

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.