Literature DB >> 17947623

Five-way smoking status classification using text hot-spot identification and error-correcting output codes.

Aaron M Cohen1.   

Abstract

We participated in the i2b2 smoking status classification challenge task. The purpose of this task was to evaluate the ability of systems to automatically identify patient smoking status from discharge summaries. Our submission included several techniques that we compared and studied, including hot-spot identification, zero-vector filtering, inverse class frequency weighting, error-correcting output codes, and post-processing rules. We evaluated our approaches using the same methods as the i2b2 task organizers, using micro- and macro-averaged F1 as the primary performance metric. Our best performing system achieved a micro-F1 of 0.9000 on the test collection, equivalent to the best performing system submitted to the i2b2 challenge. Hot-spot identification, zero-vector filtering, classifier weighting, and error correcting output coding contributed additively to increased performance, with hot-spot identification having by far the largest positive effect. High performance on automatic identification of patient smoking status from discharge summaries is achievable with the efficient and straightforward machine learning techniques studied here.

Entities:  

Mesh:

Year:  2007        PMID: 17947623      PMCID: PMC2274879          DOI: 10.1197/jamia.M2434

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  1 in total

1.  An effective general purpose approach for automated biomedical document classification.

Authors:  Aaron M Cohen
Journal:  AMIA Annu Symp Proc       Date:  2006
  1 in total
  23 in total

1.  Mayo clinic smoking status classification system: extensions and improvements.

Authors:  Sunghwan Sohn; Guergana K Savova
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

2.  Comparison of UMLS terminologies to identify risk of heart disease using clinical notes.

Authors:  Chaitanya Shivade; Pranav Malewadkar; Eric Fosler-Lussier; Albert M Lai
Journal:  J Biomed Inform       Date:  2015-09-12       Impact factor: 6.317

3.  Risk factor detection for heart disease by applying text analytics in electronic medical records.

Authors:  Manabu Torii; Jung-Wei Fan; Wei-Li Yang; Theodore Lee; Matthew T Wiley; Daniel S Zisook; Yang Huang
Journal:  J Biomed Inform       Date:  2015-08-14       Impact factor: 6.317

4.  Creating a Synthetic Clinical Trial: Comparative Effectiveness Analyses Using an Electronic Medical Record.

Authors:  Marjorie G Zauderer; Aleksandr Grigorenko; Paul May; Nicholas Kastango; Isaac Wagner; Aryeh Caroline; Mark G Kris
Journal:  JCO Clin Cancer Inform       Date:  2019-06

5.  A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection.

Authors:  Kyle H Ambert; Aaron M Cohen
Journal:  J Am Med Inform Assoc       Date:  2009-04-23       Impact factor: 4.497

6.  SYRIAC: The systematic review information automated collection system a data warehouse for facilitating automated biomedical text classification.

Authors:  Jianji J Yang; Aaron M Cohen; Aaron Cohen; Marian S McDonagh
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

7.  eQuality for all: Extending automated quality measurement of free text clinical narratives.

Authors:  Steven H Brown; Peter L Elkin; S Trent Rosenbloom; Elliot Fielstein; Ted Speroff
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

8.  Use of semantic features to classify patient smoking status.

Authors:  Patrick J McCormick; Noémie Elhadad; Peter D Stetson
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

9.  Ontology-guided feature engineering for clinical text classification.

Authors:  Vijay N Garla; Cynthia Brandt
Journal:  J Biomed Inform       Date:  2012-05-09       Impact factor: 6.317

10.  Automatic lymphoma classification with sentence subgraph mining from pathology reports.

Authors:  Yuan Luo; Aliyah R Sohani; Ephraim P Hochberg; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2014-01-15       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.