Literature DB >> 33508086

HunFlair: An Easy-to-Use Tool for State-of-the-Art Biomedical Named Entity Recognition.

Leon Weber1,2, Mario Sänger1, Jannes Münchmeyer1,3, Maryam Habibi1, Ulf Leser1, Alan Akbik1.   

Abstract

SUMMARY: Named entity recognition (NER) is an important step in biomedical information extraction pipelines. Tools for NER should be easy to use, cover multiple entity types, be highly accurate, and be robust towards variations in text genre and style. We present HunFlair, an NER tagger fulfilling these requirements. HunFlair is integrated into the widely-used NLP framework Flair, recognizes five biomedical entity types, reaches or overcomes state-of-the-art performance on a wide set of evaluation corpora, and is trained in a cross-corpus setting to avoid corpus-specific bias. Technically, it uses a character-level language model pretrained on roughly 24 million biomedical abstracts and three million full texts. It outperforms other off-the-shelf biomedical NER tools with an average gain of 7.26 pp over the next best tool in a cross-corpus setting and achieves on-par results with state-of-the-art research prototypes in in-corpus experiments. HunFlair can be installed with a single command and is applied with only four lines of code. Furthermore, it is accompanied by harmonized versions of 23 biomedical NER corpora. AVAILABILITY: HunFlair ist freely available through the Flair NLP framework (https://github.com/flairNLP/flair) under an MIT license and is compatible with all major operating systems. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 33508086     DOI: 10.1093/bioinformatics/btab042

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  RegEl corpus: identifying DNA regulatory elements in the scientific literature.

Authors:  Samuele Garda; Freyda Lenihan-Geels; Sebastian Proft; Stefanie Hochmuth; Markus Schülke; Dominik Seelow; Ulf Leser
Journal:  Database (Oxford)       Date:  2022-06-27       Impact factor: 4.462

2.  Assigning species information to corresponding genes by a sequence labeling framework.

Authors:  Ling Luo; Chih-Hsuan Wei; Po-Ting Lai; Qingyu Chen; Rezarta Islamaj; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2022-10-13       Impact factor: 4.462

3.  BERN2: an advanced neural biomedical named entity recognition and normalization tool.

Authors:  Mujeen Sung; Minbyul Jeong; Yonghwa Choi; Donghyeon Kim; Jinhyuk Lee; Jaewoo Kang
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

4.  Hierarchical shared transfer learning for biomedical named entity recognition.

Authors:  Zhaoying Chai; Han Jin; Shenghui Shi; Siyan Zhan; Lin Zhuo; Yu Yang
Journal:  BMC Bioinformatics       Date:  2022-01-04       Impact factor: 3.169

5.  HIT 2.0: an enhanced platform for Herbal Ingredients' Targets.

Authors:  Deyu Yan; Genhui Zheng; Caicui Wang; Zikun Chen; Tiantian Mao; Jian Gao; Yu Yan; Xiangyi Chen; Xuejie Ji; Jinyu Yu; Saifeng Mo; Haonan Wen; Wenhao Han; Mengdi Zhou; Yuan Wang; Jun Wang; Kailin Tang; Zhiwei Cao
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

Review 6.  Text Mining for Building Biomedical Networks Using Cancer as a Case Study.

Authors:  Sofia I R Conceição; Francisco M Couto
Journal:  Biomolecules       Date:  2021-09-29
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.