SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

Chih-Hsuan Wei, Robert Leaman, Zhiyong Lu.   

Abstract

One particular challenge in biomedical named entity recognition (NER) and normalization is the identification and resolution of composite named entities, where a single span refers to more than one concept (e.g., BRCA1/2). Previous NER and normalization studies have either ignored composite mentions, used simple ad hoc rules, or only handled coordination ellipsis, making a robust approach for handling multitype composite mentions greatly needed. To this end, we propose a hybrid method integrating a machine-learning model with a pattern identification strategy to identify the individual components of each composite mention. Our method, which we have named SimConcept, is the first to systematically handle many types of composite mentions. The technique achieves high performance in identifying and resolving composite mentions for three key biological entities: genes (90.42% in F-measure), diseases (86.47% in F-measure), and chemicals (86.05% in F-measure). Furthermore, our results show that using our SimConcept method can subsequently improve the performance of gene and disease concept recognition and normalization. SimConcept is available for download at: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/SimConcept/.
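
To make the notion of a composite mention concrete, the following is a minimal rule-based sketch of how a mention such as BRCA1/2 could be expanded into its components. It is not SimConcept itself: the abstract describes a hybrid of a machine-learning model and pattern identification, while this toy covers only one simple pattern (a shared alphabetic stem followed by several numeric suffixes). The regular expression and the function name `simplify_composite` are assumptions made for illustration.

```python
import re
from typing import List

# Hypothetical pattern (an assumption, not taken from SimConcept): an alphabetic
# stem, a first numeric suffix, then one or more further numeric suffixes
# separated by "/", ",", "and", or "or".
_COMPOSITE = re.compile(
    r"^(?P<stem>[A-Za-z][A-Za-z-]*?)"
    r"(?P<first>\d+)"
    r"(?P<rest>(?:\s*(?:/|,|and|or)\s*\d+)+)$"
)


def simplify_composite(mention: str) -> List[str]:
    """Split a composite mention such as 'BRCA1/2' into its component names.

    Returns the mention unchanged, as a one-element list, when the toy
    pattern does not apply.
    """
    match = _COMPOSITE.match(mention.strip())
    if match is None:
        return [mention]
    stem = match.group("stem")
    # Collect the first suffix plus every numeric suffix in the remainder.
    suffixes = [match.group("first")] + re.findall(r"\d+", match.group("rest"))
    return [stem + suffix for suffix in suffixes]


if __name__ == "__main__":
    for text in ("BRCA1/2", "HSP70 and 90", "TP53"):
        print(text, "->", simplify_composite(text))
```

A rule like this misses most composite types (for example, coordination ellipsis over whole words or range expressions), which is the gap that motivates combining patterns with a learned model.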

Year: 2015; PMID: 25879978; PMCID: PMC4543296; DOI: 10.1109/JBHI.2015.2422651

Source DB: PubMed; Journal: IEEE J Biomed Health Inform; ISSN: 2168-2194; Impact factor: 5.772


