Beyond the topics: how deep learning can improve the discriminability of probabilistic topic modelling.

Noura Al Moubayed, Stephen McGough, Bashar Awwad Shiekh Hasan.

Abstract

The article presents a discriminative approach that complements the unsupervised, probabilistic nature of topic modelling. The framework transforms the per-document topic probabilities into class-dependent deep learning models that extract highly discriminative features suitable for classification, and is then applied to sentiment analysis with minimal feature engineering. The approach moves the sentiment-analysis problem from the word/document domain to the topic domain, making it more robust to noise and able to incorporate complex contextual information that is not otherwise represented. A stacked denoising autoencoder (SDA) then models the complex relationships among the topics for each sentiment with minimal assumptions. To achieve this, a distinct topic model and SDA are built per sentiment polarity, with an additional decision layer for classification. The framework is tested on a comprehensive collection of benchmark datasets that vary in sample size, class bias and classification task. A significant improvement over the state of the art is achieved without the need for sentiment lexica or over-engineered features. A further analysis is carried out to explain the observed improvement in accuracy.
© 2020 Al Moubayed et al.
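The per-class architecture described in the abstract can be sketched in a few lines. This is not the authors' implementation: as assumptions, documents are stood in for by synthetic topic-proportion vectors drawn from two Dirichlet distributions (one per sentiment polarity), and a tied-weight linear autoencoder (a PCA subspace, in the spirit of Bourlard & Kamp, reference 6 below) replaces the stacked denoising autoencoder. The decision layer is simply "lowest reconstruction error wins".

```python
# Minimal sketch of one-model-per-polarity classification on topic vectors.
# Hypothetical setup: topic proportions are simulated; a linear autoencoder
# per class stands in for the paper's per-class SDA.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic topic proportions over 6 topics: each polarity concentrates
# its probability mass on a different subset of topics.
pos = rng.dirichlet([8, 8, 8, 1, 1, 1], size=200)   # "positive" documents
neg = rng.dirichlet([1, 1, 1, 8, 8, 8], size=200)   # "negative" documents

def fit_autoencoder(X, k=2):
    """Fit a linear autoencoder for one class: mean + top-k directions."""
    mu = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, Vt[:k]              # encoder and decoder share tied weights

def recon_error(model, X):
    """Squared reconstruction error of X under one class model."""
    mu, V = model
    Z = (X - mu) @ V.T             # encode into the class subspace
    Xh = mu + Z @ V                # decode back to topic space
    return ((X - Xh) ** 2).sum(axis=1)

# One autoencoder per sentiment polarity, trained only on its own class.
models = [fit_autoencoder(pos), fit_autoencoder(neg)]

X = np.vstack([pos, neg])
y = np.array([0] * 200 + [1] * 200)

# Decision layer: assign each document to the class whose autoencoder
# reconstructs its topic vector best.
errors = np.stack([recon_error(m, X) for m in models], axis=1)
pred = errors.argmin(axis=1)
print("accuracy:", (pred == y).mean())
```

In the paper's full pipeline, the Dirichlet stand-in would be replaced by topic proportions inferred from a per-class topic model, and the linear autoencoder by a trained SDA; the decision logic, choosing the polarity whose model explains the document best, is the same.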

Keywords:  Sentiment analysis; Stacked denoising autoencoders; Text classification; Topic modelling

Year:  2020        PMID: 33816904      PMCID: PMC7924555          DOI: 10.7717/peerj-cs.252

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


References:  8 in total

1.  Nonlinear autoassociation is not equivalent to PCA.

Authors:  N Japkowicz; M A Gluck
Journal:  Neural Comput       Date:  2000-03       Impact factor: 2.026

2.  Linear recursive distributed representations.

Authors:  Thomas Voegtlin; Peter F Dominey
Journal:  Neural Netw       Date:  2005-09

3.  A fast learning algorithm for deep belief nets.

Authors:  Geoffrey E Hinton; Simon Osindero; Yee-Whye Teh
Journal:  Neural Comput       Date:  2006-07       Impact factor: 2.026

4.  Reducing the dimensionality of data with neural networks.

Authors:  G E Hinton; R R Salakhutdinov
Journal:  Science       Date:  2006-07-28       Impact factor: 47.728

5.  [Review] Representation learning: a review and new perspectives.

Authors:  Yoshua Bengio; Aaron Courville; Pascal Vincent
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-08       Impact factor: 6.226

6.  Auto-association by multilayer perceptrons and singular value decomposition.

Authors:  H Bourlard; Y Kamp
Journal:  Biol Cybern       Date:  1988       Impact factor: 2.086

7.  Probabilistic Topic Models: A focus on graphical model design and applications to document and image analysis.

Authors:  David Blei; Lawrence Carin; David Dunson
Journal:  IEEE Signal Process Mag       Date:  2010-11-01       Impact factor: 12.551

8.  How inherently noisy is human sensory processing?

Authors:  Peter Neri
Journal:  Psychon Bull Rev       Date:  2010-12
