Literature DB >> 22539080

Pneumonia identification using statistical feature selection.

Cosmin Adrian Bejan1, Fei Xia, Lucy Vanderwende, Mark M Wurfel, Meliha Yetisgen-Yildiz.   

Abstract

OBJECTIVE: This paper describes a natural language processing system for the task of pneumonia identification. Based on the information extracted from the narrative reports associated with a patient, the task is to identify whether or not the patient is positive for pneumonia.
DESIGN: A binary classifier was employed to identify pneumonia from a dataset of multiple types of clinical notes created for 426 patients during their stay in the intensive care unit. For this purpose, three types of features were considered: (1) word n-grams, (2) Unified Medical Language System (UMLS) concepts, and (3) assertion values associated with pneumonia expressions. System performance was greatly increased by a feature selection approach which uses statistical significance testing to rank features based on their association with the two categories of pneumonia identification.
RESULTS: Besides testing our system on the entire cohort of 426 patients (unrestricted dataset), we also used a smaller subset of 236 patients (restricted dataset). The performance of the system was compared with the results of a baseline previously proposed for these two datasets. The best results achieved by the system (85.71 and 81.67 F1-measure) are significantly better than the baseline results (50.70 and 49.10 F1-measure) on the restricted and unrestricted datasets, respectively.
CONCLUSION: Using a statistical feature selection approach that allows the feature extractor to consider only the most informative features from the feature space significantly improves the performance over a baseline that uses all the features from the same feature space. Extracting the assertion value for pneumonia expressions further improves the system performance.

Entities:  

Mesh:

Year:  2012        PMID: 22539080      PMCID: PMC3422830          DOI: 10.1136/amiajnl-2011-000752

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  19 in total

1.  Automatic identification of pneumonia related concepts on chest x-ray reports.

Authors:  M Fiszman; W W Chapman; S R Evans; P J Haug
Journal:  Proc AMIA Symp       Date:  1999

2.  Automatic detection of acute bacterial pneumonia from chest X-ray reports.

Authors:  M Fiszman; W W Chapman; D Aronsky; R S Evans; P J Haug
Journal:  J Am Med Inform Assoc       Date:  2000 Nov-Dec       Impact factor: 4.497

3.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

4.  A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia.

Authors:  W W Chapman; M Fizman; B E Chapman; P J Haug
Journal:  J Biomed Inform       Date:  2001-02       Impact factor: 6.317

5.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

6.  MediClass: A system for detecting and classifying encounter-based clinical events in any electronic medical record.

Authors:  Brian Hazlehurst; H Robert Frost; Dean F Sittig; Victor J Stevens
Journal:  J Am Med Inform Assoc       Date:  2005-05-19       Impact factor: 4.497

7.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

8.  Combining decision support methodologies to diagnose pneumonia.

Authors:  D Aronsky; M Fiszman; W W Chapman; P J Haug
Journal:  Proc AMIA Symp       Date:  2001

9.  Extracting information on pneumonia in infants using natural language processing of radiology reports.

Authors:  Eneida A Mendonça; Janet Haas; Lyudmila Shagina; Elaine Larson; Carol Friedman
Journal:  J Biomed Inform       Date:  2005-03-30       Impact factor: 6.317

10.  Unlocking clinical data from narrative reports: a study of natural language processing.

Authors:  G Hripcsak; C Friedman; P O Alderson; W DuMouchel; S B Johnson; P D Clayton
Journal:  Ann Intern Med       Date:  1995-05-01       Impact factor: 25.391

View more
  33 in total

1.  A systematic comparison of feature space effects on disease classifier performance for phenotype identification of five diseases.

Authors:  Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-08-01       Impact factor: 6.317

2.  Open Globe Injury Patient Identification in Warfare Clinical Notes.

Authors:  Emilia Apostolova; Helen A White; Patty A Morris; David A Eliason; Tom Velez
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

3.  Learning to identify treatment relations in clinical text.

Authors:  Cosmin A Bejan; Joshua C Denny
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

4.  Assessing the role of a medication-indication resource in the treatment relation extraction from clinical text.

Authors:  Cosmin Adrian Bejan; Wei-Qi Wei; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2014-10-21       Impact factor: 4.497

5.  Feature extraction for phenotyping from semantic and knowledge resources.

Authors:  Wenxin Ning; Stephanie Chan; Andrew Beam; Ming Yu; Alon Geva; Katherine Liao; Mary Mullen; Kenneth D Mandl; Isaac Kohane; Tianxi Cai; Sheng Yu
Journal:  J Biomed Inform       Date:  2019-02-07       Impact factor: 6.317

Review 6.  Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.

Authors:  Michael Simmons; Ayush Singhal; Zhiyong Lu
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

Review 7.  "Big data" and the electronic health record.

Authors:  M K Ross; W Wei; L Ohno-Machado
Journal:  Yearb Med Inform       Date:  2014-08-15

8.  On-time clinical phenotype prediction based on narrative reports.

Authors:  Cosmin A Bejan; Lucy Vanderwende; Heather L Evans; Mark M Wurfel; Meliha Yetisgen-Yildiz
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

9.  Reviewing 741 patients records in two hours with FASTVISU.

Authors:  Jean-Baptiste Escudié; Anne-Sophie Jannot; Eric Zapletal; Sarah Cohen; Georgia Malamut; Anita Burgun; Bastien Rance
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

10.  Robust clinical marker identification for diabetic kidney disease with ensemble feature selection.

Authors:  Xing Song; Lemuel R Waitman; Yong Hu; Alan S L Yu; David C Robbins; Mei Liu
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.