Biomedical named entity recognition using two-phase model based on SVMs.

Ki-Joong Lee, Young-Sook Hwang, Seonho Kim, Hae-Chang Rim.

Abstract

Named entity (NE) recognition has become one of the most fundamental tasks in biomedical knowledge acquisition. In this paper, we present a two-phase named entity recognizer based on SVMs, which consists of a boundary identification phase and a semantic classification phase of named entities. When adapting SVMs to named entity recognition, the multi-class problem and the unbalanced class distribution problem become very serious in terms of training cost and performance. We try to solve these problems by separating the NE recognition task into two subtasks, where we use appropriate SVM classifiers and relevant features for each subtask. In addition, by employing a hierarchical classification method based on ontology, we effectively solve the multi-class problem concerning semantic classification. The experimental results on the GENIA corpus show that the proposed method is effective not only in reducing computational cost but also in improving performance. The F-score (beta=1) for the boundary identification is 74.8 and the F-score for the semantic classification is 66.7.
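The two-phase decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the B/I/O tag scheme, the function names (`spans_from_bio`, `recognize`), and the toy rule-based stand-ins for the two SVM stages are all assumptions made for illustration, and the ontology-based hierarchical classification is not modeled. The sketch only shows how a boundary phase hands candidate spans to a separate semantic classification phase.

```python
from typing import Callable, List, Optional, Tuple

Span = Tuple[int, int]  # [start, end) token indices


def spans_from_bio(tags: List[str]) -> List[Span]:
    """Collect [start, end) entity spans from a B/I/O tag sequence."""
    spans: List[Span] = []
    start: Optional[int] = None
    for i, tag in enumerate(tags):
        if tag == "B" or (tag == "I" and start is None):
            # A new entity starts here (a stray "I" is treated like "B").
            if start is not None:
                spans.append((start, i))
            start = i
        elif tag == "O" and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(tags)))
    return spans


def recognize(
    tokens: List[str],
    tag_boundaries: Callable[[List[str]], List[str]],
    classify_span: Callable[[List[str]], str],
) -> List[Tuple[int, int, str]]:
    """Phase 1 finds entity boundaries; phase 2 labels each span."""
    tags = tag_boundaries(tokens)
    return [(s, e, classify_span(tokens[s:e])) for s, e in spans_from_bio(tags)]


# Hypothetical stand-ins for the two trained classifiers.
ENTITY_WORDS = {"IL-2", "gene", "T", "cells"}


def toy_boundary_tagger(tokens: List[str]) -> List[str]:
    """Tag tokens from a tiny lexicon; a real system would use a classifier."""
    tags, inside = [], False
    for tok in tokens:
        if tok in ENTITY_WORDS:
            tags.append("I" if inside else "B")
            inside = True
        else:
            tags.append("O")
            inside = False
    return tags


def toy_semantic_classifier(span_tokens: List[str]) -> str:
    """Label a span by its head word; class names are illustrative only."""
    head = span_tokens[-1]
    return {"gene": "DNA", "cells": "cell_type"}.get(head, "other")


tokens = ["IL-2", "gene", "expression", "in", "T", "cells"]
entities = recognize(tokens, toy_boundary_tagger, toy_semantic_classifier)
```

Because the semantic classifier only ever sees spans that phase 1 proposed, it never has to model the dominant non-entity class, which is one way to read the abstract's point about the unbalanced class distribution.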

Year:  2004        PMID: 15542017     DOI: 10.1016/j.jbi.2004.08.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  12 in total

1.  Biomedical language processing: what's beyond PubMed? (Review)

Authors:  Lawrence Hunter; K Bretonnel Cohen
Journal:  Mol Cell       Date:  2006-03-03       Impact factor: 17.970

2.  Analysis of sampling techniques for imbalanced data: An n = 648 ADNI study.

Authors:  Rashmi Dubey; Jiayu Zhou; Yalin Wang; Paul M Thompson; Jieping Ye
Journal:  Neuroimage       Date:  2013-10-29       Impact factor: 6.556

3.  A bioinformatics analysis of the cell line nomenclature.

Authors:  Sirarat Sarntivijai; Alexander S Ade; Brian D Athey; David J States
Journal:  Bioinformatics       Date:  2008-10-10       Impact factor: 6.937

4.  Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.

Authors:  Jun Xu; Hee-Jin Lee; Jia Zeng; Yonghui Wu; Yaoyun Zhang; Liang-Chin Huang; Amber Johnson; Vijaykumar Holla; Ann M Bailey; Trevor Cohen; Funda Meric-Bernstam; Elmer V Bernstam; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2016-03-24       Impact factor: 4.497

5.  Named entity recognition for bacterial Type IV secretion systems.

Authors:  Sophia Ananiadou; Dan Sullivan; William Black; Gina-Anne Levow; Joseph J Gillespie; Chunhong Mao; Sampo Pyysalo; Balakrishna Kolluru; Junichi Tsujii; Bruno Sobral
Journal:  PLoS One       Date:  2011-03-29       Impact factor: 3.240

6.  Contextual weighting for Support Vector Machines in literature mining: an application to gene versus protein name disambiguation.

Authors:  Tapio Pahikkala; Filip Ginter; Jorma Boberg; Jouni Järvinen; Tapio Salakoski
Journal:  BMC Bioinformatics       Date:  2005-06-22       Impact factor: 3.169

7.  Using contextual and lexical features to restructure and validate the classification of biomedical concepts.

Authors:  Jung-Wei Fan; Hua Xu; Carol Friedman
Journal:  BMC Bioinformatics       Date:  2007-07-24       Impact factor: 3.169

8.  Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation.

Authors:  Kimberly Van Auken; Joshua Jaffery; Juancarlos Chan; Hans-Michael Müller; Paul W Sternberg
Journal:  BMC Bioinformatics       Date:  2009-07-21       Impact factor: 3.169

9.  Extracting laboratory test information from biomedical text.

Authors:  Yanna Shen Kang; Mehmet Kayaalp
Journal:  J Pathol Inform       Date:  2013-08-31

10.  Microbial phenomics information extractor (MicroPIE): a natural language processing tool for the automated acquisition of prokaryotic phenotypic characters from text sources.

Authors:  Jin Mao; Lisa R Moore; Carrine E Blank; Elvis Hsin-Hui Wu; Marcia Ackerman; Sonali Ranade; Hong Cui
Journal:  BMC Bioinformatics       Date:  2016-12-13       Impact factor: 3.169
