Literature DB >> 33569575

Unsupervised and self-supervised deep learning approaches for biomedical text mining.

Mohamed Nadif1, François Role1.   

Abstract

Biomedical scientific literature is growing at a very rapid pace, which makes increasingly difficult for human experts to spot the most relevant results hidden in the papers. Automatized information extraction tools based on text mining techniques are therefore needed to assist them in this task. In the last few years, deep neural networks-based techniques have significantly contributed to advance the state-of-the-art in this research area. Although the contribution to this progress made by supervised methods is relatively well-known, this is less so for other kinds of learning, namely unsupervised and self-supervised learning. Unsupervised learning is a kind of learning that does not require the cost of creating labels, which is very useful in the exploratory stages of a biomedical study where agile techniques are needed to rapidly explore many paths. In particular, clustering techniques applied to biomedical text mining allow to gather large sets of documents into more manageable groups. Deep learning techniques have allowed to produce new clustering-friendly representations of the data. On the other hand, self-supervised learning is a kind of supervised learning where the labels do not have to be manually created by humans, but are automatically derived from relations found in the input texts. In combination with innovative network architectures (e.g. transformer-based architectures), self-supervised techniques have allowed to design increasingly effective vector-based word representations (word embeddings). We show in this survey how word representations obtained in this way have proven to successfully interact with common supervised modules (e.g. classification networks) to whose performance they greatly contribute.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; self-supervised learning; text mining; unsupervised learning

Year:  2021        PMID: 33569575     DOI: 10.1093/bib/bbab016

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.

Authors:  Balu Bhasuran
Journal:  Methods Mol Biol       Date:  2022

2.  BioBERT and Similar Approaches for Relation Extraction.

Authors:  Balu Bhasuran
Journal:  Methods Mol Biol       Date:  2022

Review 3.  Recent developments in application of single-cell RNA sequencing in the tumour immune microenvironment and cancer therapy.

Authors:  Pei-Heng Li; Xiang-Yu Kong; Ya-Zhou He; Yi Liu; Xi Peng; Zhi-Hui Li; Heng Xu; Han Luo; Jihwan Park
Journal:  Mil Med Res       Date:  2022-09-26
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.