Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A BERT-based ensemble learning approach for the BioCreative VII challenges: full-text chemical identification and multi-label classification in PubMed articles.

Literature DB >> 35849027

A BERT-based ensemble learning approach for the BioCreative VII challenges: full-text chemical identification and multi-label classification in PubMed articles.

Sheng-Jie Lin¹, Wen-Chao Yeh², Yu-Wen Chiu¹, Yung-Chun Chang^1,3,4, Min-Huei Hsu¹, Yi-Shin Chen², Wen-Lian Hsu^4,5.

Abstract

In this research, we explored various state-of-the-art biomedical-specific pre-trained Bidirectional Encoder Representations from Transformers (BERT) models for the National Library of Medicine - Chemistry (NLM CHEM) and LitCovid tracks in the BioCreative VII Challenge, and propose a BERT-based ensemble learning approach to integrate the advantages of various models to improve the system's performance. The experimental results of the NLM-CHEM track demonstrate that our method can achieve remarkable performance, with F1-scores of 85% and 91.8% in strict and approximate evaluations, respectively. Moreover, the proposed Medical Subject Headings identifier (MeSH ID) normalization algorithm is effective in entity normalization, which achieved a F1-score of about 80% in both strict and approximate evaluations. For the LitCovid track, the proposed method is also effective in detecting topics in the Coronavirus disease 2019 (COVID-19) literature, which outperformed the compared methods and achieve state-of-the-art performance in the LitCovid corpus. Database URL: https://www.ncbi.nlm.nih.gov/research/coronavirus/.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35849027 PMCID： PMC9290865 DOI： 10.1093/database/baac056

Source DB: PubMed Journal: Database (Oxford) ISSN： 1758-0463 Impact factor: 4.462

Keyword Cloud
References

16 in total

1. ML-Net: multi-label classification of biomedical texts with deep neural networks.

Authors: Jingcheng Du; Qingyu Chen; Yifan Peng; Yang Xiang; Cui Tao; Zhiyong Lu
Journal: J Am Med Inform Assoc Date: 2019-11-01 Impact factor: 4.497

2. BioCreative V CDR task corpus: a resource for chemical disease relation extraction.

Authors: Jiao Li; Yueping Sun; Robin J Johnson; Daniela Sciaky; Chih-Hsuan Wei; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Thomas C Wiegers; Zhiyong Lu
Journal: Database (Oxford) Date: 2016-05-09 Impact factor: 3.451

3. Exploiting syntactic and semantics information for chemical-disease relation extraction.

Authors: Huiwei Zhou; Huijie Deng; Long Chen; Yunlong Yang; Chen Jia; Degen Huang
Journal: Database (Oxford) Date: 2016-04-14 Impact factor: 3.451

4. Chemical-induced disease relation extraction via attention-based distant supervision.

Authors: Jinghang Gu; Fuqing Sun; Longhua Qian; Guodong Zhou
Journal: BMC Bioinformatics Date: 2019-07-22 Impact factor: 3.169

5. Chemlistem: chemical named entity recognition using recurrent neural networks.

Authors: Peter Corbett; John Boyle
Journal: J Cheminform Date: 2018-12-06 Impact factor: 5.514

6. Modeling Spatiotemporal Pattern of Depressive Symptoms Caused by COVID-19 Using Social Media Data Mining.

Authors: Diya Li; Harshita Chaudhary; Zhe Zhang
Journal: Int J Environ Res Public Health Date: 2020-07-10 Impact factor: 3.390

7. LitCovid: an open database of COVID-19 literature.

Authors: Qingyu Chen; Alexis Allot; Zhiyong Lu
Journal: Nucleic Acids Res Date: 2020-11-09 Impact factor: 16.971

8. Overview of BioCreative II gene mention recognition.

Authors: Larry Smith; Lorraine K Tanabe; Rie Johnson nee Ando; Cheng-Ju Kuo; I-Fang Chung; Chun-Nan Hsu; Yu-Shi Lin; Roman Klinger; Christoph M Friedrich; Kuzman Ganchev; Manabu Torii; Hongfang Liu; Barry Haddow; Craig A Struble; Richard J Povinelli; Andreas Vlachos; William A Baumgartner; Lawrence Hunter; Bob Carpenter; Richard Tzong-Han Tsai; Hong-Jie Dai; Feng Liu; Yifei Chen; Chengjie Sun; Sophia Katrenko; Pieter Adriaans; Christian Blaschke; Rafael Torres; Mariana Neves; Preslav Nakov; Anna Divoli; Manuel Maña-López; Jacinto Mata; W John Wilbur
Journal: Genome Biol Date: 2008-09-01 Impact factor: 13.583

9. DTranNER: biomedical named entity recognition with deep learning-based label-label transition model.

Authors: S K Hong; Jae-Gil Lee
Journal: BMC Bioinformatics Date: 2020-02-11 Impact factor: 3.169