Literature DB >> 33816997

A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas.

Bhavana R Bhamare1, Jeyanthi Prabhu2.   

Abstract

Due to the massive progression of the Web, people post their reviews for any product, movies and places they visit on social media. The reviews available on social media are helpful to customers as well as the product owners to evaluate their products based on different reviews. Analyzing structured data is easy as compared to unstructured data. The reviews are available in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and further determines sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used for this work are SemEval restaurant review dataset, Yelp and Kaggle datasets. In the first method a multivariate filter-based approach for feature selection is proposed. This method support to select significant features and reduces redundancy among selected features. It shows improvement in F1-score compared to a method that uses only relevant features selected using Term Frequency weight. In another method, selective dependency relations are used to extract features. This is done using Stanford NLP parser. The results gained using features extracted by selective dependency rules are better as compared to features extracted by using all dependency rules. In the hybrid approach, both lemma features and selective dependency relation based features are extracted. Using the hybrid feature set, 94.78% accuracy and 85.24% F1-score is achieved in the aspect category prediction task.
© 2021 Bhamare and Prabhu.

Entities:  

Keywords:  Aspect based sentiment analysis; Feature extraction; Machine learning; Natural language processing; Support vector machine

Year:  2021        PMID: 33816997      PMCID: PMC7959606          DOI: 10.7717/peerj-cs.347

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  2 in total

1.  Supervised and Unsupervised Aspect Category Detection for Sentiment Analysis with Co-occurrence Data.

Authors:  Kim Schouten; Onne van der Weijde; Flavius Frasincar; Rommert Dekker
Journal:  IEEE Trans Cybern       Date:  2017-04-14       Impact factor: 11.448

2.  Lexicon-enhanced sentiment analysis framework using rule-based classification scheme.

Authors:  Muhammad Zubair Asghar; Aurangzeb Khan; Shakeel Ahmad; Maria Qasim; Imran Ali Khan
Journal:  PLoS One       Date:  2017-02-23       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.