Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge.

James Cormack, Chinmoy Nath, David Milward, Kalpana Raja, Siddhartha R Jonnalagadda.

Abstract

This paper describes the use of an agile text mining platform (Linguamatics' Interactive Information Extraction Platform, I2E) to extract document-level cardiac risk factors in patient records as defined in the i2b2/UTHealth 2014 challenge. The approach uses a data-driven rule-based methodology with the addition of a simple supervised classifier. We demonstrate that agile text mining allows for rapid optimization of extraction strategies, while post-processing can leverage annotation guidelines, corpus statistics and logic inferred from the gold standard data. We also show how data imbalance in a training set affects performance. Evaluation of this approach on the test data gave an F-Score of 91.7%, one percent behind the top-performing system.
Copyright © 2015 Elsevier Inc. All rights reserved.

Keywords:  Clinical natural language processing; Information extraction; Text mining

Year: 2015; PMID: 26209007; PMCID: PMC4737484; DOI: 10.1016/j.jbi.2015.06.030

Source DB: PubMed; Journal: J Biomed Inform; ISSN: 1532-0464; Impact factor: 6.317


