Literature DB >> 17947622

Mayo clinic NLP system for patient smoking status identification.

Guergana K Savova1, Philip V Ogren, Patrick H Duffy, James D Buntrock, Christopher G Chute.   

Abstract

This article describes our system entry for the 2006 I2B2 contest "Challenges in Natural Language Processing for Clinical Data" for the task of identifying the smoking status of patients. Our system makes the simplifying assumption that patient-level smoking status determination can be achieved by accurately classifying individual sentences from a patient's record. We created our system with reusable text analysis components built on the Unstructured Information Management Architecture and Weka. This reuse of code minimized the development effort related specifically to our smoking status classifier. We report precision, recall, F-score, and 95% exact confidence intervals for each metric. Recasting the classification task for the sentence level and reusing code from other text analysis projects allowed us to quickly build a classification system that performs with a system F-score of 92.64 based on held-out data tests and of 85.57 on the formal evaluation data. Our general medical natural language engine is easily adaptable to a real-world medical informatics application. Some of the limitations as applied to the use-case are negation detection and temporal resolution.

Entities:  

Mesh:

Year:  2007        PMID: 17947622      PMCID: PMC2274870          DOI: 10.1197/jamia.M2437

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  3 in total

1.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

2.  Identifying patient smoking status from medical discharge records.

Authors:  Ozlem Uzuner; Ira Goldstein; Yuan Luo; Isaac Kohane
Journal:  J Am Med Inform Assoc       Date:  2007-10-18       Impact factor: 4.497

3.  Automatic document classification of biological literature.

Authors:  David Chen; Hans-Michael Müller; Paul W Sternberg
Journal:  BMC Bioinformatics       Date:  2006-08-07       Impact factor: 3.169

  3 in total
  63 in total

1.  Automated discovery of drug treatment patterns for endocrine therapy of breast cancer within an electronic medical record.

Authors:  Guergana K Savova; Janet E Olson; Sean P Murphy; Victoria L Cafourek; Fergus J Couch; Matthew P Goetz; James N Ingle; Vera J Suman; Christopher G Chute; Richard M Weinshilboum
Journal:  J Am Med Inform Assoc       Date:  2011-12-01       Impact factor: 4.497

2.  A translational engine at the national scale: informatics for integrating biology and the bedside.

Authors:  Isaac S Kohane; Susanne E Churchill; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2011-11-10       Impact factor: 4.497

3.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

4.  Mayo clinic smoking status classification system: extensions and improvements.

Authors:  Sunghwan Sohn; Guergana K Savova
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

5.  Health services research and data linkages: issues, methods, and directions for the future.

Authors:  Cathy J Bradley; Lynne Penberthy; Kelly J Devers; Debra J Holden
Journal:  Health Serv Res       Date:  2010-08-02       Impact factor: 3.402

6.  Creating a Synthetic Clinical Trial: Comparative Effectiveness Analyses Using an Electronic Medical Record.

Authors:  Marjorie G Zauderer; Aleksandr Grigorenko; Paul May; Nicholas Kastango; Isaac Wagner; Aryeh Caroline; Mark G Kris
Journal:  JCO Clin Cancer Inform       Date:  2019-06

7.  eQuality for all: Extending automated quality measurement of free text clinical narratives.

Authors:  Steven H Brown; Peter L Elkin; S Trent Rosenbloom; Elliot Fielstein; Ted Speroff
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

8.  Use of semantic features to classify patient smoking status.

Authors:  Patrick J McCormick; Noémie Elhadad; Peter D Stetson
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

9.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

Authors:  Jyotishman Pathak; Abel N Kho; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-12       Impact factor: 4.497

10.  Identification of inactive medications in narrative medical text.

Authors:  Eugene M Breydo; Julia T Chu; Alexander Turchin
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.