| Literature DB >> 33267371 |
Taopin Mu1,2, Xianyong Zhang1,2, Zhiwen Mo1,2.
Abstract
Rough set theory is an important approach for data mining, and it refers to Shannon's information measures for uncertainty measurements. The existing local conditional-entropies have both the second-order feature and application limitation. By improvements of hierarchical granulation, this paper establishes double-granule conditional-entropies based on three-level granular structures (i.e., micro-bottom, meso-middle, macro-top ), and then investigates the relevant properties. In terms of the decision table and its decision classification, double-granule conditional-entropies are proposed at micro-bottom by the dual condition-granule system. By virtue of successive granular summation integrations, they hierarchically evolve to meso-middle and macro-top, to respectively have part and complete condition-granulations. Then, the new measures acquire their number distribution, calculation algorithm, three bounds, and granulation non-monotonicity at three corresponding levels. Finally, the hierarchical constructions and achieved properties are effectively verified by decision table examples and data set experiments. Double-granule conditional-entropies carry the second-order characteristic and hierarchical granulation to deepen both the classical entropy system and local conditional-entropies, and thus they become novel uncertainty measures for information processing and knowledge reasoning.Entities:
Keywords: conditional entropy; granular computing; information theory; rough set theory; three-level granular structures; uncertainty
Year: 2019 PMID: 33267371 PMCID: PMC7515153 DOI: 10.3390/e21070657
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Three-level granular structures based on condition granulation of the decision table.
| Structure Naming | Composition System | Granular Scale | Granular Level | Number of Parallel Patterns |
|---|---|---|---|---|
| Micro-Bottom |
| Micro | Bottom |
|
| Meso-Middle |
| Meso | Middle |
|
| Macro-Top |
| Macro | Top | 1 |
Figure 1Schematic diagram of three-level granular structures.
Matrix distribution of double-granule conditional-entropies at micro-bottom.
|
|
| ⋯ |
| ⋯ |
|
|
|
| ⋯ |
| ⋯ |
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
Figure 2Convex figure of information function .
Three bounds of double-granule conditional-entropies at micro-bottom.
|
|
| ⋯ |
| ⋯ |
|
|
|
| ⋯ |
| ⋯ |
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
Marginal distribution of double-granule conditional-entropies at meso-middle and macro-top.
|
|
| ⋯ |
| ⋯ |
| Meso-Middle |
|
|
| ⋯ |
| ⋯ |
|
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
|
| ⋮ | ⋮ | ⋱ | ⋮ | ⋱ | ⋮ | ⋮ |
|
|
| ⋯ |
| ⋯ |
|
|
| Meso-Middle |
| ⋯ |
| ⋯ |
| Macro-Top: |
Three bounds of double-granule conditional-entropies at meso-middle and macro-top.
|
|
|
|
|
|
|
|
|
|
|
|
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
|
|
|
|
|
|
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
|
|
|
|
|
|
| Macro-Top |
|
|
|
|
A decision table.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 | 2 | 4 | 4 | 4 | 3 | 4 | 4 | 4 | 2 | 4 | 1 |
|
| 2 | 2 | 4 | 4 | 2 | 2 | 4 | 4 | 4 | 2 | 3 | 1 |
|
| 3 | 4 | 3 | 4 | 2 | 4 | 2 | 4 | 4 | 2 | 2 | 0 |
|
| 2 | 4 | 2 | 3 | 2 | 4 | 2 | 4 | 2 | 2 | 4 | 0 |
|
| 4 | 4 | 2 | 4 | 2 | 4 | 3 | 4 | 4 | 4 | 3 | 0 |
|
| 2 | 4 | 2 | 4 | 2 | 2 | 2 | 4 | 4 | 4 | 4 | 0 |
|
| 2 | 2 | 4 | 4 | 2 | 2 | 2 | 3 | 4 | 4 | 2 | 0 |
|
| 2 | 2 | 4 | 4 | 2 | 2 | 2 | 4 | 4 | 3 | 4 | 1 |
Information values of double-granule conditional-entropies in the example.
|
|
|
|
|
|
|
| Meso-Middle |
|
| 0 |
| 0 | 0 | 0 | 0 |
|
|
|
|
|
|
|
|
|
|
|
| 0 |
| 0 | 0 | 0 | 0 |
|
|
| 0 |
| 0 | 0 | 0 | 0 |
|
|
| 0 |
| 0 | 0 | 0 | 0 |
|
|
| 0 |
| 0 | 0 | 0 | 0 |
|
| Meso-Middle |
|
|
|
|
|
| Macro-Top: |
Three bounds of double-granule conditional-entropies in the example.
|
|
|
|
|
|
|
| Meso-Middle |
|
|
|
| |||||
|
|
|
|
| ||||
|
|
| ||||||
|
|
| ||||||
|
|
| ||||||
|
|
| ||||||
| Meso-Middle |
|
|
|
|
|
| Macro-Top: |
Double-granule conditional-entropies based on an attribute-enlargement chain in the example.
| Level | Measure |
|
|
|
|
|
|
|
|
|
|
|
| Micro-Bottom |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||||
| Meso-Middle |
|
|
|
|
|
|
|
| 0 | 0 | 0 | 0 |
| Macro-Top |
|
|
|
|
|
|
|
| 0 | 0 | 0 | 0 |
Double-granule conditional-entropies regarding in the example.
|
|
|
|
|
| Meso-Middle |
|
|
|
|
|
|
|
|
|
| 0 | 0 | 0 |
|
|
|
| 0 | 0 | 0 |
|
|
|
| 0 | 0 | 0 |
|
| Meso-Middle |
|
|
|
| Macro-Top: |
Figure 3Macro-top’s double-granule conditional-entropies and their three bounds based on an attribute-enlargement chain in the example.
Three UCI data sets.
| Label | Name |
|
|
|
|
|
|---|---|---|---|---|---|---|
| (1) | VOTING | 435 | 16 | 342 | 1 | 2 |
| (2) | SPECT | 187 | 22 | 169 | 1 | 2 |
| (3) | Tic-Tac-Toe | 958 | 9 | 958 | 1 | 2 |
Double-granule conditional-entropies in the VOTING data set.
|
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
| 0.9867 | 0.9782 | 0.8369 | 2.8018 | ⋯ |
| 0 | ⋯ | 0 | 0 |
|
| 0.9782 | 0.8113 | 0.6578 | 2.4473 | ⋯ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ |
|
| 0.8369 | 0.6578 | 0.6479 | 2.1427 | ⋯ |
| 0 | ⋯ | 0 | 0 |
| Meso-Middle | 2.8018 | 2.4473 | 2.1427 | Macro-Top: | ⋯ | Meso-Middle | 0 | ⋯ | 0 | Macro-Top: |
Three information bounds in the VOTING data set.
|
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
|
|
|
|
| ⋯ |
| ⋯ |
| ||
|
|
|
|
|
| ⋮ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ |
|
|
|
|
|
| ⋯ |
| ⋯ |
| ||
| Meso-Middle |
|
|
| Macro-Top: | ⋯ | Meso-Middle |
| ⋯ |
| Macro-Top: |
Double-granule conditional-entropies in the SPECT data set.
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
| 0.2108 | 0.3815 | 0.5924 | ⋯ |
| 0 | ⋯ | 0 | 1.5335 |
| ⋮ | ⋮ | ⋱ | ⋮ | ⋮ | |||||
|
| 0.3815 | 0.5399 | 0.9215 | ⋯ |
| 0 | ⋯ | 0 | 1.5335 |
| Meso-Middle | 0.5924 | 0.9215 | Macro-Top: | ⋯ | Meso-Middle | 1.5335 | ⋯ | 1.5335 | Macro-Top: |
Three information bounds in the SPECT data set.
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
|
|
|
| ⋯ |
| ⋯ |
| ||
| ⋮ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ | ||||
|
|
|
|
| ⋯ |
| ⋯ |
| ||
| Meso-Middle |
|
| Macro-Top: | ⋯ | Meso-Middle |
| ⋯ |
| Macro-Top: |
Double-granule conditional-entropies in the Tic-Tac-Toe data set.
|
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
| 0.8742 | 0.9248 | 0.8794 | 2.6784 | ⋯ |
| 0 | ⋯ | 0 | 0 |
|
| 0.9248 | 0.9881 | 0.9509 | 2.8638 | ⋯ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ |
|
| 0.8794 | 0.9509 | 0.8901 | 2.7203 | ⋯ |
| 0 | ⋯ | 0 | 0 |
| Meso-Middle | 2.6784 | 2.8638 | 2.7203 | Macro-Top: | ⋯ | Meso-Middle | 0 | ⋯ | 0 | Macro-Top: |
Three information bounds in the Tic-Tac-Toe data set.
|
|
|
|
| Meso-Middle | ⋯ |
|
| ⋯ |
| Meso-Middle |
|
|
|
|
|
| ⋯ |
| ⋯ | |||
|
|
|
|
|
| ⋮ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ |
|
|
|
|
|
| ⋯ |
| ⋯ | |||
| Meso-Middle |
|
|
| Macro-Top: | ⋯ | Meso-Middle | ⋯ | Macro-Top: |
Figure 4Macro-top’s double-granule conditional-entropies and their three information bounds based on an attribute-enlargement chain in data experiments.