YanRong Liu, LiJun Wang, Rong Miao, HengNi Ren.
Abstract
Apriori, an association rule algorithm, is the main data mining algorithm used in the treatment and prevention of chronic diseases. As currently applied to mining associations between chronic diseases in China's medical field, it has several problems: it must scan the case transaction database multiple times, and it produces a large data set and many redundant rules. To address these problems, an association rule mining algorithm combining a clustering matrix with a pruning strategy is proposed. The algorithm uses the clustering matrix method to compress the stored transaction database and, on the basis of added constraint conditions, introduces prepruning and postpruning strategies. Experimental results show that the optimized algorithm reduces the number of database scans and the number of candidate item sets generated, which in turn greatly reduces the running time and I/O load and improves the running efficiency of the algorithm.
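The idea of compressing the transaction database into a matrix and pruning candidates early can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names (`build_matrix`, `support_count`, `frequent_itemsets`), the merging of identical transactions into weighted rows, and the use of an absolute count `min_count` as the support threshold are all assumptions made for illustration. The raw transactions are read once to build the matrix, and every later support count is answered from the matrix alone.

```python
from collections import Counter
from itertools import combinations


def build_matrix(transactions):
    """Encode transactions as weighted Boolean rows (one column per item).

    Identical transactions are merged into one row with a weight, which is
    an assumed, simplified stand-in for the paper's clustering matrix.
    """
    items = sorted({item for t in transactions for item in t})
    merged = Counter(frozenset(t) for t in transactions)
    rows = [([item in t for item in items], w) for t, w in merged.items()]
    return items, rows


def support_count(rows, cols):
    """Sum the weights of rows that contain every column in `cols`."""
    return sum(w for bits, w in rows if all(bits[c] for c in cols))


def frequent_itemsets(transactions, min_count):
    """Return {itemset tuple: support count} for all frequent item sets."""
    items, rows = build_matrix(transactions)  # single pass over the raw data
    frequent = {}

    # Pre-pruning (assumed form): discard infrequent single-item columns.
    current = []
    for c in range(len(items)):
        n = support_count(rows, (c,))
        if n >= min_count:
            current.append((c,))
            frequent[(items[c],)] = n

    k = 2
    while current:
        previous = set(current)
        cols = sorted({c for itemset in current for c in itemset})
        next_level = []
        for cand in combinations(cols, k):
            # Apriori property: every (k-1)-subset must already be frequent.
            if all(sub in previous for sub in combinations(cand, k - 1)):
                n = support_count(rows, cand)
                if n >= min_count:
                    next_level.append(cand)
                    frequent[tuple(items[c] for c in cand)] = n
        current = next_level
        k += 1
    return frequent
```

For example, calling `frequent_itemsets([{"a", "b"}, {"a", "b"}, {"a", "c"}], 2)` counts the pair `("a", "b")` from a single merged row of weight 2 rather than from two separate transactions. Post-pruning of redundant rules and user-supplied constraints, which the abstract also mentions, are left out of this sketch.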
Year: 2022 PMID: 36052052 PMCID: PMC9427230 DOI: 10.1155/2022/8526256
Source DB: PubMed Journal: Comput Intell Neurosci
Symbol definitions.
| Symbol | Definition |
|---|---|
| | The set of |
| | min_supp item sets |
| | Candidate item sets |
Figure 1. Mining with added user constraints.
Database D.
| Item | Item sets |
|---|---|
| | a1, |
| | a1, |
| | a1, |
Figure 2. Candidate sets generated at different support thresholds.
Figure 3. Candidate item sets at different support thresholds.
Figure 4. Execution time for different numbers of data sets.