Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases.

Literature DB >> 27634494

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases.

Balu Bhasuran¹, Gurusamy Murugesan², Sabenabanu Abdulkadhar², Jeyakumar Natarajan³.

Abstract

Biomedical Named Entity Recognition (Bio-NER) is the crucial initial step in the information extraction process and a majorly focused research area in biomedical text mining. In the past years, several models and methodologies have been proposed for the recognition of semantic types related to gene, protein, chemical, drug and other biological relevant named entities. In this paper, we implemented a stacked ensemble approach combined with fuzzy matching for biomedical named entity recognition of disease names. The underlying concept of stacked generalization is to combine the outputs of base-level classifiers using a second-level meta-classifier in an ensemble. We used Conditional Random Field (CRF) as the underlying classification method that makes use of a diverse set of features, mostly based on domain specific, and are orthographic and morphologically relevant. In addition, we used fuzzy string matching to tag rare disease names from our in-house disease dictionary. For fuzzy matching, we incorporated two best fuzzy search algorithms Rabin Karp and Tuned Boyer Moore. Our proposed approach shows promised result of 94.66%, 89.12%, 84.10%, and 76.71% of F-measure while on evaluating training and testing set of both NCBI disease and BioCreative V CDR Corpora. Copyright Â

Entities: Disease

Keywords: Biomedical named entity recognition; Fuzzy matching; Machine learning; Stacked ensemble; Text mining

Mesh：

Substances：
Proteins

Year: 2016 PMID： 27634494 DOI： 10.1016/j.jbi.2016.09.009

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

Keyword Cloud
Cited

8 in total

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases.

1. Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.

2. BioBERT and Similar Approaches for Relation Extraction.

3. A Text Mining Protocol for Mining Biological Pathways and Regulatory Networks from Biomedical Literature.

4. A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience.

5. Weighted Random Forests to Improve Arrhythmia Classification.

6. Automatic extraction of gene-disease associations from literature using joint ensemble learning.

Review 7. Artificial Intelligence (AI) in Rare Diseases: Is the Future Brighter?

8. Multi-step ahead meningitis case forecasting based on decomposition and multi-objective optimization methods.