Literature DB >> 33733159

Using Topic Modeling Methods for Short-Text Data: A Comparative Analysis.

Rania Albalawi1, Tet Hin Yeap1, Morad Benyoucef2.   

Abstract

With the growth of online social network platforms and applications, large amounts of textual user-generated content are created daily in the form of comments, reviews, and short-text messages. As a result, users often find it challenging to discover useful information or more on the topic being discussed from such content. Machine learning and natural language processing algorithms are used to analyze the massive amount of textual social media data available online, including topic modeling techniques that have gained popularity in recent years. This paper investigates the topic modeling subject and its common application areas, methods, and tools. Also, we examine and compare five frequently used topic modeling methods, as applied to short textual social data, to show their benefits practically in detecting important topics. These methods are latent semantic analysis, latent Dirichlet allocation, non-negative matrix factorization, random projection, and principal component analysis. Two textual datasets were selected to evaluate the performance of included topic modeling methods based on the topic quality and some standard statistical evaluation metrics, like recall, precision, F-score, and topic coherence. As a result, latent Dirichlet allocation and non-negative matrix factorization methods delivered more meaningful extracted topics and obtained good results. The paper sheds light on some common topic modeling methods in a short-text context and provides direction for researchers who seek to apply these methods.
Copyright © 2020 Albalawi, Yeap and Benyoucef.

Entities:  

Keywords:  natural language processing; online social networks; short text; topic modeling; user-generated content

Year:  2020        PMID: 33733159      PMCID: PMC7861298          DOI: 10.3389/frai.2020.00042

Source DB:  PubMed          Journal:  Front Artif Intell        ISSN: 2624-8212


  1 in total

Review 1.  An overview of topic modeling and its current applications in bioinformatics.

Authors:  Lin Liu; Lin Tang; Wen Dong; Shaowen Yao; Wei Zhou
Journal:  Springerplus       Date:  2016-09-20
  1 in total
  9 in total

1.  A Topic Modeling Comparison Between LDA, NMF, Top2Vec, and BERTopic to Demystify Twitter Posts.

Authors:  Roman Egger; Joanne Yu
Journal:  Front Sociol       Date:  2022-05-06

2.  A novel multiple kernel fuzzy topic modeling technique for biomedical data.

Authors:  Junaid Rashid; Jungeun Kim; Amir Hussain; Usman Naseem; Sapna Juneja
Journal:  BMC Bioinformatics       Date:  2022-07-12       Impact factor: 3.307

3.  Comparison of public discussions of gene editing on social media between the United States and China.

Authors:  Jiaojiao Ji; Matthew Robbins; Jieyu Ding Featherstone; Christopher Calabrese; George A Barnett
Journal:  PLoS One       Date:  2022-05-02       Impact factor: 3.752

4.  Perception of the Food and Drug Administration Electronic Cigarette Flavor Enforcement Policy on Twitter: Observational Study.

Authors:  Xinyi Lu; Li Sun; Zidian Xie; Dongmei Li
Journal:  JMIR Public Health Surveill       Date:  2022-03-29

5.  More than a feeling? What does compassion in healthcare 'look like' to patients?

Authors:  Sofie I Baguley; Alina Pavlova; Nathan S Consedine
Journal:  Health Expect       Date:  2022-06-03       Impact factor: 3.318

6.  Corporate Social Responsibility Activities Through Twitter: From Topic Model Analysis to Indexes Measuring Communication Characteristics.

Authors:  Camilla Salvatore; Silvia Biffignandi; Annamaria Bianchi
Journal:  Soc Indic Res       Date:  2022-08-20

7.  Online Brand Community User Segments: A Text Mining Approach.

Authors:  Ruichen Ge; Hong Zhao; Sha Zhang
Journal:  Front Artif Intell       Date:  2022-07-18

8.  Microlearning in Diverse Contexts: A Bibliometric Analysis.

Authors:  Rajagopal Sankaranarayanan; Javier Leung; Victoria Abramenka-Lachheb; Grace Seo; Ahmed Lachheb
Journal:  TechTrends       Date:  2022-10-13

9.  Resilience in Web-Based Mental Health Communities: Building a Resilience Dictionary With Semiautomatic Text Analysis.

Authors:  Yong-Bin Kang; Anthony McCosker; Peter Kamstra; Jane Farmer
Journal:  JMIR Form Res       Date:  2022-09-22
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.