Literature DB >> 24786209

N-gram support vector machines for scalable procedure and diagnosis classification, with applications to clinical free text data from the intensive care unit.

Ben J Marafino1, Jason M Davies2, Naomi S Bardach3, Mitzi L Dean1, R Adams Dudley4.   

Abstract

BACKGROUND: Existing risk adjustment models for intensive care unit (ICU) outcomes rely on manual abstraction of patient-level predictors from medical charts. Developing an automated method for abstracting these data from free text might reduce cost and data collection times.
OBJECTIVE: To develop a support vector machine (SVM) classifier capable of identifying a range of procedures and diagnoses in ICU clinical notes for use in risk adjustment.
MATERIALS AND METHODS: We selected notes from 2001-2008 for 4191 neonatal ICU (NICU) and 2198 adult ICU patients from the MIMIC-II database from the Beth Israel Deaconess Medical Center. Using these notes, we developed an implementation of the SVM classifier to identify procedures (mechanical ventilation and phototherapy in NICU notes) and diagnoses (jaundice in NICU and intracranial hemorrhage (ICH) in adult ICU). On the jaundice classification task, we also compared classifier performance using n-gram features to unigrams with application of a negation algorithm (NegEx).
RESULTS: Our classifier accurately identified mechanical ventilation (accuracy=0.982, F1=0.954) and phototherapy use (accuracy=0.940, F1=0.912), as well as jaundice (accuracy=0.898, F1=0.884) and ICH diagnoses (accuracy=0.938, F1=0.943). Including bigram features improved performance on the jaundice (accuracy=0.898 vs 0.865) and ICH (0.938 vs 0.927) tasks, and outperformed NegEx-derived unigram features (accuracy=0.898 vs 0.863) on the jaundice task. DISCUSSION: Overall, a classifier using n-gram support vectors displayed excellent performance characteristics. The classifier generalizes to diverse patient populations, diagnoses, and procedures.
CONCLUSIONS: SVM-based classifiers can accurately identify procedure status and diagnoses among ICU patients, and including n-gram features improves performance, compared to existing methods. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

Entities:  

Keywords:  ICU; Risk adjustment; SVM; classification; n-gram

Mesh:

Year:  2014        PMID: 24786209      PMCID: PMC4147615          DOI: 10.1136/amiajnl-2014-002694

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  20 in total

1.  The effect of race and ethnicity on outcomes among patients in the intensive care unit: a comprehensive study involving socioeconomic status and resuscitation preferences.

Authors:  Sara E Erickson; Eduard E Vasilevskis; Michael W Kuzniewicz; Brian A Cason; Rondall K Lane; Mitzi L Dean; Deborah J Rennie; R Adams Dudley
Journal:  Crit Care Med       Date:  2011-03       Impact factor: 7.598

2.  Approaches to conventional mechanical ventilation of the patient with acute respiratory distress syndrome.

Authors:  Dean R Hess
Journal:  Respir Care       Date:  2011-10       Impact factor: 2.258

3.  Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database.

Authors:  Mohammed Saeed; Mauricio Villarroel; Andrew T Reisner; Gari Clifford; Li-Wei Lehman; George Moody; Thomas Heldt; Tin H Kyaw; Benjamin Moody; Roger G Mark
Journal:  Crit Care Med       Date:  2011-05       Impact factor: 7.598

4.  Heuristics to determine ventilation times of ICU patients from the MIMIC-II database.

Authors:  Hanqing Cao; K P Lee; Colleen M Ennett; Larry Eshelman; Larry Nielsen; Mohammed Saeed; Brian Gross
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2010

5.  Variation in ICU risk-adjusted mortality: impact of methods of assessment and potential confounders.

Authors:  Michael W Kuzniewicz; Eduard E Vasilevskis; Rondall Lane; Mitzi L Dean; Nisha G Trivedi; Deborah J Rennie; Ted Clay; Pamela L Kotler; R Adams Dudley
Journal:  Chest       Date:  2008-04-10       Impact factor: 9.410

6.  Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Authors:  Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

7.  Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm.

Authors:  Brian E Chapman; Sean Lee; Hyunseok Peter Kang; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2011-04-01       Impact factor: 6.317

8.  Use of a support vector machine for categorizing free-text notes: assessment of accuracy across two institutions.

Authors:  Adam Wright; Allison B McCoy; Stanislav Henkin; Abhivyakti Kale; Dean F Sittig
Journal:  J Am Med Inform Assoc       Date:  2013-03-30       Impact factor: 4.497

9.  Predictors of early postdischarge mortality in critically ill patients: a retrospective cohort study from the California Intensive Care Outcomes project.

Authors:  Eduard E Vasilevskis; Michael W Kuzniewicz; Brian A Cason; Rondall K Lane; Mitzi L Dean; Ted Clay; Deborah J Rennie; R Adams Dudley
Journal:  J Crit Care       Date:  2010-08-16       Impact factor: 3.425

Review 10.  Clinical review: Critical care management of spontaneous intracerebral hemorrhage.

Authors:  Fred Rincon; Stephan A Mayer
Journal:  Crit Care       Date:  2008-12-10       Impact factor: 9.097

View more
  21 in total

1.  A systematic comparison of feature space effects on disease classifier performance for phenotype identification of five diseases.

Authors:  Christopher Kotfila; Özlem Uzuner
Journal:  J Biomed Inform       Date:  2015-08-01       Impact factor: 6.317

2.  Trends in biomedical informatics: automated topic analysis of JAMIA articles.

Authors:  Dong Han; Shuang Wang; Chao Jiang; Xiaoqian Jiang; Hyeon-Eui Kim; Jimeng Sun; Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2015-11       Impact factor: 4.497

3.  Application of Machine Learning in Intensive Care Unit (ICU) Settings Using MIMIC Dataset: Systematic Review.

Authors:  Mahanazuddin Syed; Shorabuddin Syed; Kevin Sexton; Hafsa Bareen Syeda; Maryam Garza; Meredith Zozus; Farhanuddin Syed; Salma Begum; Abdullah Usama Syed; Joseph Sanford; Fred Prior
Journal:  Informatics (MDPI)       Date:  2021-03-03

4.  Application of clinical text data for phenome-wide association studies (PheWASs).

Authors:  Scott J Hebbring; Majid Rastegar-Mojarad; Zhan Ye; John Mayer; Crystal Jacobson; Simon Lin
Journal:  Bioinformatics       Date:  2015-02-04       Impact factor: 6.937

5.  Automatic Classification of Structured Product Labels for Pregnancy Risk Drug Categories, a Machine Learning Approach.

Authors:  Laritza M Rodriguez; Dina Demner Fushman
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

6.  Clinical Informatics Researcher's Desiderata for the Data Content of the Next Generation Electronic Health Record.

Authors:  Timothy I Kennell; James H Willig; James J Cimino
Journal:  Appl Clin Inform       Date:  2017-12-21       Impact factor: 2.342

7.  Content Coding of Psychotherapy Transcripts Using Labeled Topic Models.

Authors:  Garren Gaut; Mark Steyvers; Zac E Imel; David C Atkins; Padhraic Smyth
Journal:  IEEE J Biomed Health Inform       Date:  2015-11-25       Impact factor: 5.772

8.  Interpretable Predictions of Clinical Outcomes with An Attention-based Recurrent Neural Network.

Authors:  Ying Sha; May D Wang
Journal:  ACM BCB       Date:  2017-08

9.  Extracting Smoking Status from Electronic Health Records Using NLP and Deep Learning.

Authors:  Suraj Rajendran; Umit Topaloglu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2020-05-30

10.  Automated information extraction from free-text EEG reports.

Authors:  Siddharth Biswal; Zarina Nip; Valdery Moura Junior; Matt T Bianchi; Eric S Rosenthal; M Brandon Westover
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2015
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.