Literature DB >> 28660257

Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora.

William L Hamilton1, Kevin Clark1, Jure Leskovec1, Dan Jurafsky1.   

Abstract

A word's sentiment depends on the domain in which it is used. Computational social science research thus requires sentiment lexicons that are specific to the domains being studied. We combine domain-specific word embeddings with a label propagation framework to induce accurate domain-specific sentiment lexicons using small sets of seed words. We show that our approach achieves state-of-the-art performance on inducing sentiment lexicons from domain-specific corpora and that our purely corpus-based approach outperforms methods that rely on hand-curated resources (e.g., WordNet). Using our framework, we induce and release historical sentiment lexicons for 150 years of English and community-specific sentiment lexicons for 250 online communities from the social media forum Reddit. The historical lexicons we induce show that more than 5% of sentiment-bearing (non-neutral) English words completely switched polarity during the last 150 years, and the community-specific lexicons highlight how sentiment varies drastically between different communities.

Entities:  

Year:  2016        PMID: 28660257      PMCID: PMC5483533          DOI: 10.18653/v1/D16-1057

Source DB:  PubMed          Journal:  Proc Conf Empir Methods Nat Lang Process


  4 in total

1.  Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD.

Authors:  John A Bullinaria; Joseph P Levy
Journal:  Behav Res Methods       Date:  2012-09

2.  A Unified Framework for Creating Domain Dependent Polarity Lexicons from User Generated Reviews.

Authors:  Muhammad Zubair Asghar; Aurangzeb Khan; Shakeel Ahmad; Imran Ali Khan; Fazal Masud Kundi
Journal:  PLoS One       Date:  2015-10-14       Impact factor: 3.240

3.  Norms of valence, arousal, and dominance for 13,915 English lemmas.

Authors:  Amy Beth Warriner; Victor Kuperman; Marc Brysbaert
Journal:  Behav Res Methods       Date:  2013-12

4.  Characterizing the Google Books Corpus: Strong Limits to Inferences of Socio-Cultural and Linguistic Evolution.

Authors:  Eitan Adam Pechenick; Christopher M Danforth; Peter Sheridan Dodds
Journal:  PLoS One       Date:  2015-10-07       Impact factor: 3.240

  4 in total
  6 in total

1.  Construct validity of six sentiment analysis methods in the text of encounter notes of patients with critical illness.

Authors:  Gary E Weissman; Lyle H Ungar; Michael O Harhay; Katherine R Courtright; Scott D Halpern
Journal:  J Biomed Inform       Date:  2018-12-14       Impact factor: 6.317

2.  Two is better than one: Using a single emotion lexicon can lead to unreliable conclusions.

Authors:  Gabriela Czarnek; David Stillwell
Journal:  PLoS One       Date:  2022-10-14       Impact factor: 3.752

3.  Tracking COVID-19 Discourse on Twitter in North America: Infodemiology Study Using Topic Modeling and Aspect-Based Sentiment Analysis.

Authors:  Hyeju Jang; Emily Rempel; David Roth; Giuseppe Carenini; Naveed Zafar Janjua
Journal:  J Med Internet Res       Date:  2021-02-10       Impact factor: 5.428

4.  Augmenting Semantic Lexicons Using Word Embeddings and Transfer Learning.

Authors:  Thayer Alshaabi; Colin M Van Oort; Mikaela Irene Fudolig; Michael V Arnold; Christopher M Danforth; Peter Sheridan Dodds
Journal:  Front Artif Intell       Date:  2022-01-24

5.  BengSentiLex and BengSwearLex: creating lexicons for sentiment analysis and profanity detection in low-resource Bengali language.

Authors:  Salim Sazzed
Journal:  PeerJ Comput Sci       Date:  2021-11-16

6.  Machine learning to support social media empowered patients in cancer care and cancer treatment decisions.

Authors:  Daswin De Silva; Weranja Ranasinghe; Tharindu Bandaragoda; Achini Adikari; Nishan Mills; Lahiru Iddamalgoda; Damminda Alahakoon; Nathan Lawrentschuk; Raj Persad; Evgeny Osipov; Richard Gray; Damien Bolton
Journal:  PLoS One       Date:  2018-10-18       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.