| Literature DB >> 25208128 |
Shusen Zhou1, Qingcai Chen2, Xiaolong Wang2.
Abstract
In this paper, we develop a novel semi-supervised learning algorithm called active hybrid deep belief networks (AHD), to address the semi-supervised sentiment classification problem with deep learning. First, we construct the previous several hidden layers using restricted Boltzmann machines (RBM), which can reduce the dimension and abstract the information of the reviews quickly. Second, we construct the following hidden layers using convolutional restricted Boltzmann machines (CRBM), which can abstract the information of reviews effectively. Third, the constructed deep architecture is fine-tuned by gradient-descent based supervised learning with an exponential loss function. Finally, active learning method is combined based on the proposed deep architecture. We did several experiments on five sentiment classification datasets, and show that AHD is competitive with previous semi-supervised learning algorithm. Experiments are also conducted to verify the effectiveness of our proposed method with different number of labeled reviews and unlabeled reviews respectively.Entities:
Mesh:
Year: 2014 PMID: 25208128 PMCID: PMC4160211 DOI: 10.1371/journal.pone.0107122
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Architecture of HDBN.
Figure 2Architecture of CRBM.
Algorithm of HDBN.
|
|
| data |
| number of training data |
| number of layers |
| number of units in every hidden layer |
| number of groups in every convolutional hidden layer |
| hidden layer |
| convolutional hidden layer |
| parameter space |
| biases |
|
|
| deep architecture with parameter space |
| 1. Greedy layer-wise unsupervised learning |
|
|
|
|
|
|
| Calculate the non-linear positive and negative phase: |
|
|
| Normal calculation. |
|
|
| Convolutional calculation according to Eq. 6 and Eq. 7. |
|
|
| Update the weights and biases: |
|
|
|
|
|
|
|
|
| 2. Supervised learning based on gradient descent |
|
|
Algorithm of AHD.
|
|
| data |
| number of training data |
| number of iterations |
| number of active choosing data for every iteration |
| parameter space |
|
|
| deep architecture with parameter space |
|
|
| Train HDBN with labeled dataset |
| Choose |
| Add |
|
|
| Train HDBN with labeled dataset |
HDBN structure used in experiment.
| Dataset | Structure |
| MOV | 100-100-4-2 |
| KIT | 50-50-3-2 |
| ELE | 50-50-3-2 |
| BOO | 50-50-5-2 |
| DVD | 50-50-5-2 |
Test accuracy with 100 labeled reviews for semi-supervised learning.
| Type | MOV | KIT | ELE | BOO | DVD |
| Spectral | 67.3 | 63.7 | 57.7 | 55.8 | 56.2 |
| TSVM | 68.7 | 65.5 | 62.9 | 58.7 | 57.3 |
| DBN | 71.3 | 72.6 | 73.6 | 64.3 | 66.7 |
| PIV | – |
| 70.0 | 60.1 | 49.5 |
| HDBN |
| 74.8 |
|
|
|
Test accuracy with 100 labeled reviews for active semi-supervised learning.
| Type | MOV | KIT | ELE | BOO | DVD |
| Active | 68.9 | 68.1 | 63.3 | 58.6 | 58.0 |
| MECH | 76.2 | 74.1 | 70.6 | 62.1 | 62.7 |
| ADN |
|
|
| 69.0 | 71.6 |
| AFD | 75 | 77 |
|
|
|
Figure 3Test accuracy of HDBN with different number of unlabeled reviews on five datasets.
Figure 4Test accuracy of ADN and AHD with different number of labeled reviews on five datasets.