| Literature DB >> 33808525 |
Qiang Zhao1, Qing Li2, Deshui Yu2, Yinghua Han2.
Abstract
In many industrial domains, there is a significant interest in obtaining temporal relationships among multiple variables in time-series data, given that such relationships play an auxiliary role in decision making. However, when transactions occur frequently only for a period of time, it is difficult for a traditional time-series association rules mining algorithm (TSARM) to identify this kind of relationship. In this paper, we propose a new TSARM framework and a novel algorithm named TSARM-UDP. A TSARM mining framework is used to mine time-series association rules (TSARs) and an up-to-date pattern (UDP) is applied to discover rare patterns that only appear in a period of time. Based on the up-to-date pattern mining, the proposed TSAR-UDP method could extract temporal relationship rules with better generality. The rules can be widely used in the process industry, the stock market, etc. Experiments are then performed on the public stock data and real blast furnace data to verify the effectiveness of the proposed algorithm. We compare our algorithm with three state-of-the-art algorithms, and the experimental results show that our algorithm can provide greater efficiency and interpretability in TSARs and that it has good prospects.Entities:
Keywords: association rules mining; data mining; temporal relationships; time-series; up-to-date pattern
Year: 2021 PMID: 33808525 PMCID: PMC8003227 DOI: 10.3390/e23030365
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1The framework of TSARM-UDP.
Figure 2The difference of calculating Support between Formula (2) and Formula (5).
Figure 3The flowchart of the proposed TSARM-UDP.
The log database in this example.
|
|
|
|
| 1 | 2018/9/1 10:00 | b, d, f |
| 2 | 2018/9/1 10:05 | b, d, f |
| 3 | 2018/9/1 10:10 | d, f |
| 4 | 2018/9/1 10:15 | a, d |
| 5 | 2018/9/1 10:20 | a, b, d |
| 6 | 2018/9/1 10:25 | d |
| 7 | 2018/9/1 10:30 | c |
| 8 | 2018/9/1 10:35 | a, b, c |
| 9 | 2018/9/1 10:40 | c, f, e |
| 10 | 2018/9/1 10:45 | b, d |
The results of and of each item in D.
|
|
|
|
| a | 4, 5, 8 | 3 |
| b | 1, 2, 5, 8, 10 | 5 |
| c | 7, 8, 9 | 3 |
| d | 1, 2, 3, 4, 5, 6, 10 | 7 |
| e | 9 | 1 |
| f | 1, 2, 3, 9 | 4 |
The results of Timelist(i) and TSupport of each item in D.
|
|
|
|
| a | 4, 5, 8 | 0.3 |
| b | 1, 2, 5, 8, 10 | 0.5 |
| c | 7, 8, 9 | 0.3 |
| d | 1, 2, 3, 4, 5, 6, 10 | 0.7 |
| e | 9 | 0.1 |
| f | 1, 2, 3, 9 | 0.4 |
The results of Timelist(i) and Count(i) of candidate T.
|
|
|
|
|
|
|
| ( | 1 | {5} | ( | 1 | {6} |
| ( | 2 | {1, 2} | ( | 1 | {6} |
| ( | 0 | Null | ( | 0 | Null |
| ( | 0 | Null | ( | 0 | Null |
| ( | 0 | Null | ( | 0 | Null |
| ( | 0 | Null | ( | 0 | Null |
| ( | 0 | Null | ( | 1 | {2} |
| ( | 0 | Null | ( | 0 | Null |
| ( | 2 | {2, 5} | ( | 3 | {1, 2, 3} |
| ( | 3 | {4, 5, 6} | ( | 0 | Null |
The results of TConfidence and Lift in .
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 4Comparisons of mining results on the stock dataset (, ).
Figure 5Comparisons of the rule numbers and on the stock dataset ( and ).
The accuracy rate of the rules.
|
|
|
| Rule 1 | 75 |
| Rule 2 | 100 |
| Rule 3 | 100 |
| Rule 4 | 100 |
Comparisons of the running time of the four algorithms on the stock dataset.
|
|
|
|
|
|
| 0.3 | 146.6700 s | 4.1460 s | 3.7000 s | 2.9920 s |
| 0.4 | 58.5040 s | 3.4150 s | 3.4640 s | 2.8190 s |
| 0.5 | 41.7840 s | 3.2090 s | 3.2700 s | 2.7840 s |
| 0.6 | 27.6320 s | 3.1430 s | 3.1900 s | 2.7400 s |
| 0.7 | 11.3200 s | 2.8100 s | 2.9940 s | 2.6320 s |
Input variables and their corresponding discretization intervals.
|
|
|
|
|
| Blast wind volume | <3400 | 3400∼3500 | ≥3500 |
| Blast wind temperature | <1170 | 1170∼1190 | ≥1190 |
| Blast wind pressure | <335 | 335∼350 | ≥350 |
| Oxygen enrichment | <4400 | 4400∼5000 | ≥5000 |
| Top temperature | <100 | 100∼140 | ≥140 |
| Normal blast velocity | <190 | 190∼200 | ≥200 |
| Actual blast velocity | <220 | 220∼230 | ≥230 |
| Permeability index (PI) | <23 | 23∼26 | ≥26 |
| Blast furnace bosh gas volume | <4400 | 4400∼4500 | ≥4500 |
| Theoretical combustion temperature | <2200 | 2200∼2300 | ≥2300 |
| Permeability coefficient | <6 | 6∼7 | ≥7 |
Interval division and coding.
|
|
|
|
|
| Coding | 1 | 2 | 3 |
Variable coding.
|
|
|
| Blast wind volume | 1 |
| Blast wind temperature | 2 |
| Blast wind pressure | 3 |
| Oxygen enrichment | 4 |
| Top temperature | 5 |
| Normal blast velocity | 6 |
| Actual blast velocity | 7 |
| Permeability index (PI) | 8 |
| Blast furnace bosh gas volume | 9 |
| Theoretical combustion temperature | 10 |
| Permeability coefficient | 11 |
Figure 6, , and rule numbers comparison on the BF dataset ( and ).
Figure 7Rule numbers and comparison on the BF dataset (, ).
Figure 8Rule numbers and comparison on the BF dataset (, ).
Rule numbers mined by the proposed algorithm with different T.
|
|
|
|
|
| ( | 25 | ( | 25 |
| ( | 27 | ( | 29 |
| ( | 31 | ( | 28 |
| ( | 29 | ( | 27 |
| ( | 27 | ( | 25 |
Comparisons of the running time of the four algorithms on the BF dataset.
|
|
|
|
|
|
|
| 1.1809 × | 7.0400 s | 6.8650 s | 5.7770 s |
|
| 438.1290 s | 5.6700 s | 5.8360 s | 4.5320 s |
|
| 162.0220 s | 4.9260 s | 5.1850 s | 4.0640 s |
|
| 45.2970 s | 4.1600 s | 4.6900 s | 3.7080 s |
|
| 29.0880 s | 3.4410 s | 3.4440 s | 3.6820 s |
Example rules mined from the blast furnace data with TSARM-UDP.
|
|
|
|
|
|
| 0.97952 | 1.1013 |
|
|
| 1 | 1.1395 |
|
|
| 0.98868 | 1.1266 |
|
|
| 0.90352 | 1.0766 |
|
|
| 0.98305 | 1.1053 |
|
|
| 1 | 1.1395 |
|
|
| 0.97952 | 1.1612 |
|
|
| 1 | 1.1395 |
|
|
| 0.95819 | 1.1359 |
|
|
| 0.98675 | 1.1094 |
|
|
| 0.98817 | 1.1715 |
|
|
| 0.97895 | 1.1605 |
|
Rules mined from the blast furnace data with LTARM.
|
|
|
|
|
|
| 0.95491 | 1.0881 |
|
|
| 0.95049 | 1.0686 |
|
|
| 0.98489 | 1.0599 |
|
|
| 0.93668 | 1.0673 |
|
|
| 0.94573 | 1.0633 |
|
|
| 0.91852 | 1.0466 |
|
|
| 0.93654 | 1.053 |
|
|
| 0.78473 | 1.0429 |
|
|
| 0.95038 | 1.0829 |
|
|
| 0.91674 | 1.0307 |
|
|
| 0.80067 | 1.0641 |
|