Literature DB >> 35849027

A BERT-based ensemble learning approach for the BioCreative VII challenges: full-text chemical identification and multi-label classification in PubMed articles.

Sheng-Jie Lin1, Wen-Chao Yeh2, Yu-Wen Chiu1, Yung-Chun Chang1,3,4, Min-Huei Hsu1, Yi-Shin Chen2, Wen-Lian Hsu4,5.   

Abstract

In this research, we explored various state-of-the-art biomedical-specific pre-trained Bidirectional Encoder Representations from Transformers (BERT) models for the National Library of Medicine - Chemistry (NLM CHEM) and LitCovid tracks in the BioCreative VII Challenge, and propose a BERT-based ensemble learning approach to integrate the advantages of various models to improve the system's performance. The experimental results of the NLM-CHEM track demonstrate that our method can achieve remarkable performance, with F1-scores of 85% and 91.8% in strict and approximate evaluations, respectively. Moreover, the proposed Medical Subject Headings identifier (MeSH ID) normalization algorithm is effective in entity normalization, which achieved a F1-score of about 80% in both strict and approximate evaluations. For the LitCovid track, the proposed method is also effective in detecting topics in the Coronavirus disease 2019 (COVID-19) literature, which outperformed the compared methods and achieve state-of-the-art performance in the LitCovid corpus. Database URL: https://www.ncbi.nlm.nih.gov/research/coronavirus/.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2022        PMID: 35849027      PMCID: PMC9290865          DOI: 10.1093/database/baac056

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   4.462


  16 in total

1.  ML-Net: multi-label classification of biomedical texts with deep neural networks.

Authors:  Jingcheng Du; Qingyu Chen; Yifan Peng; Yang Xiang; Cui Tao; Zhiyong Lu
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  BioCreative V CDR task corpus: a resource for chemical disease relation extraction.

Authors:  Jiao Li; Yueping Sun; Robin J Johnson; Daniela Sciaky; Chih-Hsuan Wei; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Thomas C Wiegers; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-05-09       Impact factor: 3.451

3.  Exploiting syntactic and semantics information for chemical-disease relation extraction.

Authors:  Huiwei Zhou; Huijie Deng; Long Chen; Yunlong Yang; Chen Jia; Degen Huang
Journal:  Database (Oxford)       Date:  2016-04-14       Impact factor: 3.451

4.  Chemical-induced disease relation extraction via attention-based distant supervision.

Authors:  Jinghang Gu; Fuqing Sun; Longhua Qian; Guodong Zhou
Journal:  BMC Bioinformatics       Date:  2019-07-22       Impact factor: 3.169

5.  Chemlistem: chemical named entity recognition using recurrent neural networks.

Authors:  Peter Corbett; John Boyle
Journal:  J Cheminform       Date:  2018-12-06       Impact factor: 5.514

6.  Modeling Spatiotemporal Pattern of Depressive Symptoms Caused by COVID-19 Using Social Media Data Mining.

Authors:  Diya Li; Harshita Chaudhary; Zhe Zhang
Journal:  Int J Environ Res Public Health       Date:  2020-07-10       Impact factor: 3.390

7.  LitCovid: an open database of COVID-19 literature.

Authors:  Qingyu Chen; Alexis Allot; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2020-11-09       Impact factor: 16.971

8.  Overview of BioCreative II gene mention recognition.

Authors:  Larry Smith; Lorraine K Tanabe; Rie Johnson nee Ando; Cheng-Ju Kuo; I-Fang Chung; Chun-Nan Hsu; Yu-Shi Lin; Roman Klinger; Christoph M Friedrich; Kuzman Ganchev; Manabu Torii; Hongfang Liu; Barry Haddow; Craig A Struble; Richard J Povinelli; Andreas Vlachos; William A Baumgartner; Lawrence Hunter; Bob Carpenter; Richard Tzong-Han Tsai; Hong-Jie Dai; Feng Liu; Yifei Chen; Chengjie Sun; Sophia Katrenko; Pieter Adriaans; Christian Blaschke; Rafael Torres; Mariana Neves; Preslav Nakov; Anna Divoli; Manuel Maña-López; Jacinto Mata; W John Wilbur
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

9.  DTranNER: biomedical named entity recognition with deep learning-based label-label transition model.

Authors:  S K Hong; Jae-Gil Lee
Journal:  BMC Bioinformatics       Date:  2020-02-11       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.