Literature DB >> 33644469

Preprocessing Arabic text on social media.

Mohamed Osman Hegazi1, Yasser Al-Dossari1, Abdullah Al-Yahy1, Abdulaziz Al-Sumari1, Anwer Hilal2.   

Abstract

Currently, social media plays an important role in daily life and routine. Millions of people use social media for different purposes. Large amounts of data flow through online networks every second, and these data contain valuable information that can be extracted if the data are properly processed and analyzed. However, most of the processing results are affected by preprocessing difficulties. This paper presents an approach to extract information from social media Arabic text. It provides an integrated solution for the challenges in preprocessing Arabic text on social media in four stages: data collection, cleaning, enrichment, and availability. The preprocessed Arabic text is stored in structured database tables to provide a useful corpus to which, information extraction and data analysis algorithms can be applied. The experiment in this study reveals that the implementation of the proposed approach yields a useful and full-featured dataset and valuable information. The resultant dataset presented the Arabic text in three structured levels with more than 20 features. Additionally, the experiment provides valuable information and processed results such as topic classification and sentiment analysis.
© 2021 The Authors.

Entities:  

Keywords:  Arabic text; Data analysis; Database; Document and text processing; Information extraction; Information retrieval; Knowledge discovery; Natural language processing; Sentiment analysis

Year:  2021        PMID: 33644469      PMCID: PMC7895730          DOI: 10.1016/j.heliyon.2021.e06191

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


  1 in total

1.  Arabic Sentiment Classification Using Convolutional Neural Network and Differential Evolution Algorithm.

Authors:  Abdelghani Dahou; Mohamed Abd Elaziz; Junwei Zhou; Shengwu Xiong
Journal:  Comput Intell Neurosci       Date:  2019-02-26
  1 in total
  1 in total

Review 1.  Emotion Analysis of Arabic Tweets: Language Models and Available Resources.

Authors:  Ghadah Alqahtani; Abdulrahman Alothaim
Journal:  Front Artif Intell       Date:  2022-03-30
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.