Literature DB >> 34042769

Transfer Learning for Classifying Spanish and English Text by Clinical Specialties.

Alexandra Pomares-Quimbaya1, Pilar López-Úbeda2, Stefan Schulz3.   

Abstract

Transfer learning has demonstrated its potential in natural language processing tasks, where models have been pre-trained on large corpora and then tuned to specific tasks. We applied pre-trained transfer models to a Spanish biomedical document classification task. The main goal is to analyze the performance of text classification by clinical specialties using state-of-the-art language models for Spanish, and compared them with the results using corresponding models in English and with the most important pre-trained model for the biomedical domain. The outcomes present interesting perspectives on the performance of language models that are pre-trained for a particular domain. In particular, we found that BioBERT achieved better results on Spanish texts translated into English than the general domain model in Spanish and the state-of-the-art multilingual model.

Keywords:  Classification; Natural Language Processing; Spanish; Transfer learning

Mesh:

Year:  2021        PMID: 34042769     DOI: 10.3233/SHTI210184

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  1 in total

1.  Detecting racism and xenophobia using deep learning models on Twitter data: CNN, LSTM and BERT.

Authors:  José Alberto Benítez-Andrades; Álvaro González-Jiménez; Álvaro López-Brea; Jose Aveleira-Mata; José-Manuel Alija-Pérez; María Teresa García-Ordás
Journal:  PeerJ Comput Sci       Date:  2022-03-01
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.