| Literature DB >> 33266773 |
Jindong Chen1,2, Yuxuan Du1,2, Linlin Liu1,2, Pinyi Zhang1,2, Wen Zhang3,4.
Abstract
The modeling and forecasting of BBS (Bulletin Board System) posts time series is crucial for government agencies, corporations and website operators to monitor public opinion. Accurate prediction of the number of BBS posts will assist government agencies or corporations in making timely decisions and estimating the future number of BBS posts will help website operators to allocate resources to deal with the possible hot events pressure. By combining sample entropy (SampEn) and deep neural networks (DNN), an approach (SampEn-DNN) is proposed for BBS posts time series modeling and forecasting. The main idea of SampEn-DNN is to utilize SampEn to decide the input vectors of DNN with smallest complexity, and DNN to enhance the prediction performance of time series. Selecting Tianya Zatan new posts as the data source, the performances of SampEn-DNN were compared with auto-regressive integrated moving average (ARIMA), seasonal ARIMA, polynomial regression, neural networks, etc. approaches for prediction of the daily number of new posts. From the experimental results, it can be found that the proposed approach SampEn-DNN outperforms the state-of-the-art approaches for BBS posts time series modeling and forecasting.Entities:
Keywords: BBS posts; deep neural networks; sample entropy; time series
Year: 2019 PMID: 33266773 PMCID: PMC7514164 DOI: 10.3390/e21010057
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Procedures for the calculation of SampEn(m, r, δ).
Figure 2DNN (deep neural networks) model consists of DBN (deep belief network) and FNN (feedforward neural network).
SampEn (sample entropy) results of BBS (Bulletin Board System) posts time series with different m and δ.
|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||||
|
| 1.19 | 1.22 | 1.20 | 1.24 | 1.31 | 1.32 | 1.05 | 1.32 | 1.36 | 1.26 | 1.30 | 1.38 | |
|
| 1.06 | 1.06 | 1.13 | 1.14 | 1.09 | 1.18 | 0.93 | 1.20 | 1.14 | 1.12 | 1.18 | 1.20 | |
|
| 0.96 | 0.95 | 1.01 | 1.05 | 0.96 | 1.09 | 0.85 | 1.10 | 1.06 | 1.04 | 1.06 | 1.12 | |
|
| 0.90 | 0.80 | 0.91 | 0.97 | 0.92 | 0.95 | 0.81 | 1.07 | 1.00 | 0.93 | 0.97 | 1.15 | |
|
| 0.83 | 0.75 | 0.87 | 0.96 | 0.82 | 0.93 | 0.78 | 1.00 | 0.89 | 0.99 | 0.94 | 1.09 | |
|
| 0.81 | 0.72 | 0.80 | 0.90 | 0.73 | 0.93 | 0.76 | 0.87 | 0.83 | 1.13 | 0.83 | 0.92 | |
|
| 0.68 | 0.72 | 0.90 | 0.73 | 0.70 | 0.85 | 0.71 | 0.88 | 0.97 | 1.10 | 0.76 | 0.78 | |
|
| 0.73 | 0.72 | 0.92 | 0.78 | 0.61 | 1.30 | 0.76 | 1.14 | 0.98 | 0.96 | 1.10 | 0.89 | |
|
| 0.63 | 0.74 | 0.69 | 0.98 | 0.50 | 1.50 | 0.74 | 0.98 | 1.10 | 0.92 | 1.20 | 1.95 | |
|
| 0.54 | 0.85 | 0.59 | 1.10 | 0.41 | NaN | 0.45 | NaN | 1.11 | NaN | NaN | NaN | |
|
| 0.56 | 1.50 | 0.92 | NaN | 0.47 | NaN | 0.48 | NaN | NaN | NaN | NaN | NaN | |
|
| 0.98 | NaN | NaN | NaN | 0.69 | NaN | 0.62 | NaN | NaN | NaN | NaN | NaN | |
Figure 3Procedure of determination of the optimal dimension m∗.
Figure 4Procedure for determination of the optimal skipping parameter .
Figure 5The time series of daily new post number on Tianya Zatan broad.
MMREs (mean magnitude of relative error) of ARIMA (auto-regressive integrated moving average), seasonal ARIMA, polynomial (polynomial regression), ANN (artificial neural networks) and SampEn-DNN (sample entropy-deep neural networks) on BBS post time series.
| Subset | ARIMA | Seasonal ARIMA | Polynomial | ANN | SampEn-DNN |
|---|---|---|---|---|---|
| 1 | 0.2355 ± 0.0090 | 0.1968 ± 0.0119 | 0.2772 ± 0.0109 | 0.2003 ± 0.0129 | 0.1419 ± 0.0078 |
| 2 | 0.1895 ± 0.0103 | 0.1691 ± 0.0126 | 0.4735 ± 0.0179 | 0.1694 ± 0.0107 | 0.1241 ± 0.0078 |
| 3 | 0.1915 ± 0.0117 | 0.1704 ± 0.0107 | 0.3535 ± 0.0135 | 0.1934 ± 0.0092 | 0.1748 ± 0.0080 |
| 4 | 0.1325 ± 0.0049 | 0.1188 ± 0.0066 | 0.3637 ± 0.0212 | 0.1450 ± 0.0098 | 0.0878 ± 0.0036 |
| 5 | 0.1653 ± 0.0083 | 0.1451 ± 0.0102 | 0.2401 ± 0.0126 | 0.1494 ± 0.0101 | 0.1256 ± 0.0043 |
Wilcoxon signed rank test on MREs (magnitude of relative error) for the five methods.
| Model Pair | Seasonal ARIMA | Polynomial Regression | ANN | SampEn-DNN |
|---|---|---|---|---|
| ARIMA | < | ≫ | < | ≪ |
| Seasonal ARIMA | ∼ | ≫ | ∼ | < |
| Polynomial Regression | ≪ | ∼ | ≪ | ≪ |
| ANN | ∼ | ≫ | ∼ | < |