Literature DB >> 20819860

Improving textual medication extraction using combined conditional random fields and rule-based systems.

Domonkos Tikk1, Illés Solt.   

Abstract

OBJECTIVE: In the i2b2 Medication Extraction Challenge, medication names together with details of their administration were to be extracted from medical discharge summaries.
DESIGN: The task of the challenge was decomposed into three pipelined components: named entity identification, context-aware filtering and relation extraction. For named entity identification, first a rule-based (RB) method that was used in our overall fifth place-ranked solution at the challenge was investigated. Second, a conditional random fields (CRF) approach is presented for named entity identification (NEI) developed after the completion of the challenge. The CRF models are trained on the 17 ground truth documents, the output of the rule-based NEI component on all documents, a larger but potentially inaccurate training dataset. For both NEI approaches their effect on relation extraction performance was investigated. The filtering and relation extraction components are both rule-based. MEASUREMENTS: In addition to the official entry level evaluation of the challenge, entity level analysis is also provided.
RESULTS: On the test data an entry level F(1)-score of 80% was achieved for exact matching and 81% for inexact matching with the RB-NEI component. The CRF produces a significantly weaker result, but CRF outperforms the rule-based model with 81% exact and 82% inexact F(1)-score (p<0.02).
CONCLUSION: This study shows that a simple rule-based method is on a par with more complicated machine learners; CRF models can benefit from the addition of the potentially inaccurate training data, when only very few training documents are available. Such training data could be generated using the outputs of rule-based methods.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20819860      PMCID: PMC2995683          DOI: 10.1136/jamia.2010.004119

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  16 in total

1.  Extracting medication information from clinical text.

Authors:  Ozlem Uzuner; Imre Solti; Eithon Cadag
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

Review 2.  Text-mining approaches in molecular biology and biomedicine.

Authors:  Martin Krallinger; Ramon Alonso-Allende Erhardt; Alfonso Valencia
Journal:  Drug Discov Today       Date:  2005-03-15       Impact factor: 7.851

Review 3.  A survey of current work in biomedical text mining.

Authors:  Aaron M Cohen; William R Hersh
Journal:  Brief Bioinform       Date:  2005-03       Impact factor: 11.622

4.  AliBaba: PubMed as a graph.

Authors:  Conrad Plake; Torsten Schiemann; Marcus Pankalla; Jörg Hakenberg; Ulf Leser
Journal:  Bioinformatics       Date:  2006-07-26       Impact factor: 6.937

5.  Rapidly retargetable approaches to de-identification in medical records.

Authors:  Ben Wellner; Matt Huyck; Scott Mardis; John Aberdeen; Alex Morgan; Leonid Peshkin; Alex Yeh; Janet Hitzeman; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

6.  Evaluating the state-of-the-art in automatic de-identification.

Authors:  Ozlem Uzuner; Yuan Luo; Peter Szolovits
Journal:  J Am Med Inform Assoc       Date:  2007-06-28       Impact factor: 4.497

Review 7.  Extracting interactions between proteins from the literature.

Authors:  Deyu Zhou; Yulan He
Journal:  J Biomed Inform       Date:  2007-12-15       Impact factor: 6.317

8.  Automating concept identification in the electronic medical record: an experiment in extracting dosage information.

Authors:  D A Evans; N D Brownlow; W R Hersh; E M Campbell
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

9.  Identifying gene and protein mentions in text using conditional random fields.

Authors:  Ryan McDonald; Fernando Pereira
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

10.  Automatic construction of rule-based ICD-9-CM coding systems.

Authors:  Richárd Farkas; György Szarvas
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

View more
  10 in total

1.  MedXN: an open source medication extraction and normalization tool for clinical text.

Authors:  Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2014-03-17       Impact factor: 4.497

2.  Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study.

Authors:  Ghada Alfattni; Maksim Belousov; Niels Peek; Goran Nenadic
Journal:  JMIR Med Inform       Date:  2021-05-05

3.  A cascade of classifiers for extracting medication information from discharge summaries.

Authors:  Scott Russell Halgrim; Fei Xia; Imre Solti; Eithon Cadag; Ozlem Uzuner
Journal:  J Biomed Semantics       Date:  2011-07-14

4.  Clinical research informatics: a conceptual perspective.

Authors:  Michael G Kahn; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2012-04-20       Impact factor: 4.497

5.  Extracting and standardizing medication information in clinical text - the MedEx-UIMA system.

Authors:  Min Jiang; Yonghui Wu; Anushi Shah; Priyanka Priyanka; Joshua C Denny; Hua Xu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2014-04-07

Review 6.  Semantic annotation in biomedicine: the current landscape.

Authors:  Jelena Jovanović; Ebrahim Bagheri
Journal:  J Biomed Semantics       Date:  2017-09-22

7.  Transformation of Pathology Reports Into the Common Data Model With Oncology Module: Use Case for Colon Cancer.

Authors:  Borim Ryu; Eunsil Yoon; Seok Kim; Sejoon Lee; Hyunyoung Baek; Soyoung Yi; Hee Young Na; Ji-Won Kim; Rong-Min Baek; Hee Hwang; Sooyoung Yoo
Journal:  J Med Internet Res       Date:  2020-12-09       Impact factor: 5.428

8.  Using nanoinformatics methods for automatically identifying relevant nanotoxicology entities from the literature.

Authors:  Miguel García-Remesal; Alejandro García-Ruiz; David Pérez-Rey; Diana de la Iglesia; Víctor Maojo
Journal:  Biomed Res Int       Date:  2012-12-27       Impact factor: 3.411

9.  A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction.

Authors:  Qi Li; Haijun Zhai; Louise Deleger; Todd Lingren; Megan Kaiser; Laura Stoutenborough; Imre Solti
Journal:  J Am Med Inform Assoc       Date:  2012-12-25       Impact factor: 4.497

10.  NOBLE - Flexible concept recognition for large-scale biomedical natural language processing.

Authors:  Eugene Tseytlin; Kevin Mitchell; Elizabeth Legowski; Julia Corrigan; Girish Chavan; Rebecca S Jacobson
Journal:  BMC Bioinformatics       Date:  2016-01-14       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.