A study of transportability of an existing smoking status detection module across institutions.

Mei Liu, Anushi Shah, Min Jiang, Neeraja B Peterson, Qi Dai, Melinda C Aldrich, Qingxia Chen, Erica A Bowton, Hongfang Liu, Joshua C Denny, Hua Xu.

Abstract

Electronic Medical Records (EMRs) are valuable resources for clinical observational studies. A patient's smoking status is a key factor in many diseases, but it is often embedded in narrative text. Natural language processing (NLP) systems have been developed for this specific task, such as the smoking status detection module in the clinical Text Analysis and Knowledge Extraction System (cTAKES). This study examined the transportability of the cTAKES smoking module to the Vanderbilt University Hospital's EMR data. Our evaluation demonstrated that a modest amount of customization is necessary to achieve desirable performance. We modified the system by filtering notes, annotating new data to train the machine learning classifier, and adding rules to the rule-based classifiers. The customized module achieved significantly higher F-measures at all levels of classification (i.e., sentence, document, patient) than the direct application of the cTAKES module to the Vanderbilt data.
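
The abstract reports F-measures but does not spell out how they were computed. As a minimal sketch, assuming the standard definition (the harmonic mean of precision and recall for one class), the per-class score at any one level could be computed as follows. The label names and the example data are hypothetical and are not taken from the study.

```python
def f_measure(gold, predicted, label):
    """Return (precision, recall, F-measure) for a single class label.

    `gold` and `predicted` are parallel lists of labels, one entry per
    unit being classified (a sentence, a document, or a patient).
    """
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical gold and predicted labels for five units.
gold = ["CURRENT", "PAST", "NON", "CURRENT", "UNKNOWN"]
predicted = ["CURRENT", "CURRENT", "NON", "PAST", "UNKNOWN"]
print(f_measure(gold, predicted, "CURRENT"))
```

The same function applies unchanged at the sentence, document, or patient level; only the unit that each list entry represents differs.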

Year:  2012        PMID: 23304330      PMCID: PMC3540509     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


