Literature DB >> 19535010

Feature selection techniques for maximum entropy based biomedical named entity recognition.

Sujan Kumar Saha1, Sudeshna Sarkar, Pabitra Mitra.   

Abstract

Named entity recognition is an extremely important and fundamental task of biomedical text mining. Biomedical named entities include mentions of proteins, genes, DNA, RNA, etc which often have complex structures, but it is challenging to identify and classify such entities. Machine learning methods like CRF, MEMM and SVM have been widely used for learning to recognize such entities from an annotated corpus. The identification of appropriate feature templates and the selection of the important feature values play a very important role in the success of these methods. In this paper, we provide a study on word clustering and selection based feature reduction approaches for named entity recognition using a maximum entropy classifier. The identification and selection of features are largely done automatically without using domain knowledge. The performance of the system is found to be superior to existing systems which do not use domain knowledge.

Mesh:

Year:  2009        PMID: 19535010     DOI: 10.1016/j.jbi.2008.12.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  9 in total

1.  Improving deep learning method for biomedical named entity recognition by using entity definition information.

Authors:  Ying Xiong; Shuai Chen; Buzhou Tang; Qingcai Chen; Xiaolong Wang; Jun Yan; Yi Zhou
Journal:  BMC Bioinformatics       Date:  2021-12-17       Impact factor: 3.169

2.  Assessing the impact of case sensitivity and term information gain on biomedical concept recognition.

Authors:  Tudor Groza; Karin Verspoor
Journal:  PLoS One       Date:  2015-03-19       Impact factor: 3.240

3.  A kernel-based approach for biomedical named entity recognition.

Authors:  Rakesh Patra; Sujan Kumar Saha
Journal:  ScientificWorldJournal       Date:  2013-12-29

4.  Semi-Supervised Bidirectional Long Short-Term Memory and Conditional Random Fields Model for Named-Entity Recognition Using Embeddings from Language Models Representations.

Authors:  Min Zhang; Guohua Geng; Jing Chen
Journal:  Entropy (Basel)       Date:  2020-02-22       Impact factor: 2.524

5.  Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.

Authors:  Hong Wu; Jiatong Ji; Haimei Tian; Yao Chen; Weihong Ge; Haixia Zhang; Feng Yu; Jianjun Zou; Mitsuhiro Nakamura; Jun Liao
Journal:  JMIR Med Inform       Date:  2021-12-01

6.  Deep learning-based methods for natural hazard named entity recognition.

Authors:  Junlin Sun; Yanrong Liu; Jing Cui; Handong He
Journal:  Sci Rep       Date:  2022-03-17       Impact factor: 4.996

7.  Recognizing scientific artifacts in biomedical literature.

Authors:  Tudor Groza; Hamed Hassanzadeh; Jane Hunter
Journal:  Biomed Inform Insights       Date:  2013-04-02

8.  Unregistered biological words recognition by Q-learning with transfer learning.

Authors:  Fei Zhu; Quan Liu; Hui Wang; Xiaoke Zhou; Yuchen Fu
Journal:  ScientificWorldJournal       Date:  2014-02-19

9.  Biomedical named entity extraction: some issues of corpus compatibilities.

Authors:  Asif Ekbal; Sriparna Saha; Utpal Kumar Sikdar
Journal:  Springerplus       Date:  2013-11-12
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.