Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Human-annotated dataset for social media sentiment analysis for Albanian language.

Literature DB >> 35832321

Human-annotated dataset for social media sentiment analysis for Albanian language.

Fatbardh Kadriu¹, Doruntina Murtezaj¹, Fatbardh Gashi¹, Lule Ahmedi¹, Arianit Kurti², Zenun Kastrati².

Abstract

Social media was a heavily used platform by people in different countries to express their opinions about different crises, especially during the Covid-19 pandemics. This dataset is created through collecting people's comments in the news items on the official Facebook site of the National Institute of Public Health of Kosovo. The dataset contains a total of 10,132 comments that are human-annotated in the Albanian language as a low-resource language. The dataset was collected from March 12, 2020, and this coincides with the emergence of the first confirmed Covid-19 case in Kosovo until August 31, 2020, when the second wave started. Due to the scarcity of labeled data for low-resource languages, the dataset can be used by the research community in the field of machine learning, information retrieval, affective computing, as well as by the public agencies and decision makers.

Entities: Chemical

Keywords: Affective computing; Machine/deep learning; NLP; Sentiment analysis; Text classification

Year: 2022 PMID： 35832321 PMCID： PMC9272335 DOI： 10.1016/j.dib.2022.108436

Source DB: PubMed Journal: Data Brief ISSN： 2352-3409

Keyword Cloud
References

3 in total

1. The (ir)rational consideration of the cost of science in transition economies.

Authors: Quan-Hoang Vuong
Journal: Nat Hum Behav Date: 2018-01

2. Reaping the benefits of Open Data in public health.

Authors: P Huston; V L Edge; E Bernier
Journal: Can Commun Dis Rep Date: 2019-10-03

3 in total