Literature DB >> 33379231

An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian.

Marco Pota1, Mirko Ventura1, Rosario Catelli1,2, Massimo Esposito1.   

Abstract

Over the last decade industrial and academic communities have increased their focus on sentiment analysis techniques, especially applied to tweets. State-of-the-art results have been recently achieved using language models trained from scratch on corpora made up exclusively of tweets, in order to better handle the Twitter jargon. This work aims to introduce a different approach for Twitter sentiment analysis based on two steps. Firstly, the tweet jargon, including emojis and emoticons, is transformed into plain text, exploiting procedures that are language-independent or easily applicable to different languages. Secondly, the resulting tweets are classified using the language model BERT, but pre-trained on plain text, instead of tweets, for two reasons: (1) pre-trained models on plain text are easily available in many languages, avoiding resource- and time-consuming model training directly on tweets from scratch; (2) available plain text corpora are larger than tweet-only ones, therefore allowing better performance. A case study describing the application of the approach to Italian is presented, with a comparison with other Italian existing solutions. The results obtained show the effectiveness of the approach and indicate that, thanks to its general basis from a methodological perspective, it can also be promising for other languages.

Entities:  

Keywords:  BERT; Italian language; NLP; language models; sentiment analysis

Mesh:

Year:  2020        PMID: 33379231      PMCID: PMC7796054          DOI: 10.3390/s21010133

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  3 in total

1.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

2.  Monitoring the environment and human sentiment on the Great Barrier Reef: Assessing the potential of collective sensing.

Authors:  Susanne Becken; Bela Stantic; Jinyan Chen; Ali Reza Alaei; Rod M Connolly
Journal:  J Environ Manage       Date:  2017-08-03       Impact factor: 6.789

3.  Sentiment of Emojis.

Authors:  Petra Kralj Novak; Jasmina Smailović; Borut Sluban; Igor Mozetič
Journal:  PLoS One       Date:  2015-12-07       Impact factor: 3.240

  3 in total
  5 in total

1.  A survey on text classification: Practical perspectives on the Italian language.

Authors:  Andrea Gasparetto; Alessandro Zangari; Matteo Marcuzzo; Andrea Albarelli
Journal:  PLoS One       Date:  2022-07-06       Impact factor: 3.752

2.  A novel LSTM-CNN-grid search-based deep neural network for sentiment analysis.

Authors:  Ishaani Priyadarshini; Chase Cotton
Journal:  J Supercomput       Date:  2021-05-05       Impact factor: 2.474

3.  A deep learning approach in predicting products' sentiment ratings: a comparative analysis.

Authors:  Vimala Balakrishnan; Zhongliang Shi; Chuan Liang Law; Regine Lim; Lee Leng Teh; Yue Fan
Journal:  J Supercomput       Date:  2021-11-05       Impact factor: 2.557

4.  Sentimental and spatial analysis of COVID-19 vaccines tweets.

Authors:  Areeba Umair; Elio Masciari
Journal:  J Intell Inf Syst       Date:  2022-04-15       Impact factor: 1.888

5.  Human sentiments monitoring during COVID-19 using AI-based modeling.

Authors:  Areeba Umair; Elio Masciari
Journal:  Procedia Comput Sci       Date:  2022-08-12
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.