| Literature DB >> 35756086 |
Min Fan1,2, Shan Luo1,2, Jinhai Li1,2.
Abstract
Knowledge discovery combined with network structure is an emerging field of network data analysis and mining. Three-way concept analysis is a method that can fit the human mind in uncertain decisions and analysis. In reality, when three-way concept analysis is placed in the background of a network, not only the three-way rules need to be obtained, but also the network characteristic values of these rules should be obtained, which is of great significance for concept cognition in the network. This paper mainly combines complex network analysis with the formal context of three-way decision. Firstly, the network formal context of three-way decision (NFC3WD) is proposed to unify the two studies mentioned above into one data framework. Then, the network weaken-concepts of three-way decision (NWC3WD) and their corresponding sub-networks are studied. Therefore, we can not only find out the network weaken-concepts but also know the average influence of the sub-network, as well as the influence difference within the sub-network. Furthermore, the concept logic of network and the properties of its operators are put forward, which lays a foundation for designing the algorithm of rule extraction. Subsequently, the bidirectional rule extraction algorithm and reduction algorithm based on confidence degree are also explored. Meanwhile, these algorithms are applied to the diagnosis examples of COVID-19 from which we can not only get diagnostic rules, but also know the importance of the population corresponding to these diagnostic rules in the network through network eigenvalues. Finally, experimental analysis is made to show the superiority of the proposed method.Entities:
Keywords: Concept logic of network; Granular computing; Network rule extraction; Network weaken-concept; Three-way concept analysis; Three-way decision
Year: 2022 PMID: 35756086 PMCID: PMC9205655 DOI: 10.1007/s10489-022-03672-4
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
A network formal context of three-way decision
| ⋯ | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | |||||||||||||
| 0 | 1 | ⋯ | 1 | ⋯ | 0 | 0 | ⋯ | 0 | 0 | 1 | ⋯ | -1 | 0 | 1 | ⋯ | -1 | |
| 1 | 0 | ⋯ | 0 | ⋯ | 0 | 0 | ⋯ | 1 | 1 | -1 | ⋯ | 0 | 0 | 1 | ⋯ | 0 | |
| ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋅ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ | ⋯ |
| 1 | 0 | ⋯ | 0 | ⋯ | 0 | 1 | ⋯ | 0 | 0 | 1 | ⋯ | 0 | -1 | 0 | ⋯ | 1 | |
Fig. 1Infectious disease network
The network formal context of three-way decision of infectious diseases
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | -1 | -1 | -1 | -1 | -1 | -1 | -1 | 1 | -1 | 1 | -1 | -1 |
| 2 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | -1 | -1 | -1 | -1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | -1 | -1 | -1 | 1 |
| 3 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | -1 | -1 | -1 | 1 | 1 | -1 | -1 | 1 | 0 | 1 | 1 | 0 | -1 | 1 | 0 |
| 4 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | -1 | -1 | -1 | 1 | 1 | 1 | -1 | 0 | 0 | 0 | 1 | 1 | -1 | 1 | 0 |
| 5 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | -1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | -1 |
| 6 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | -1 | -1 | -1 | -1 | -1 | -1 | 1 | 0 | 0 | 1 | -1 | 0 | -1 | 0 | 0 |
| 7 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | -1 | -1 | -1 | 1 | 1 | -1 | 1 | 0 | -1 | -1 | 0 | 0 | -1 | 0 | 0 |
| 8 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | -1 | 1 | 1 | -1 | 0 | 1 | 1 | -1 | -1 | -1 | 1 | -1 | 0 | -1 | 1 |
| 9 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | -1 | -1 | -1 | -1 | -1 | -1 | 0 | 0 | -1 | 0 | 1 | -1 | -1 |
| 10 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | -1 | -1 | -1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | -1 | 1 | 0 |
| 11 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | -1 | -1 | -1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | -1 | 0 | -1 | 0 | 0 |
| 12 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | -1 | -1 | -1 | 1 | 1 | -1 | -1 | 1 | 1 | 0 | 1 | 0 | -1 | 1 | -1 |
| 13 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | -1 | -1 | -1 | 1 | 1 | 1 | 0 | -1 | -1 | 1 | -1 | 0 | -1 | 1 |
| 14 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | -1 | -1 | -1 | 0 | -1 | 0 | 0 | -1 | 0 | 1 | -1 | -1 |
| 15 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | -1 | -1 | -1 | 1 | 1 | 1 | 1 | 0 | -1 | -1 | 1 | 0 | -1 | 0 | 0 |
Fig. 2The main idea of bidirectional rule extraction
Fig. 3Sequential decision making for COVID-19
The basic information of experimental data sets
| Data sets | Nodes | Attributes |
|---|---|---|
| Heart disease | 303 | 14 |
| Breast cancer wisconsin (Original) | 699 | 11 |
| Iris | 150 | 5 |
| Acute inflammations | 120 | 8 |
| Seeds | 210 | 7 |
| Breast cancer coimbra | 116 | 11 |
| Hepatitis | 155 | 20 |
| Lymphography | 148 | 19 |
| Spect heart | 267 | 23 |
The transformation of attribute values of the Heart Disease data set
| Attribute name | Marker name | Attribute value partition | Mapped values |
|---|---|---|---|
| Age | age< 52 | -1 | |
| age> 65 | 1 | ||
| 52≤ | 0 | ||
| Sex | sex= 1 | 1 | |
| sex= 0 | -1 | ||
| cp | cp= 0 | -1 | |
| cp= 1 | 1 | ||
| cp= 2,3 | 0 | ||
| trestbps | trestbps≥ 90 | 1 | |
| trestbps< 90 | -1 | ||
| chol | chol ≥ 250 | 1 | |
| chol < 200 | -1 | ||
| 200≤chol< 250 | 0 | ||
| fbs | fbs ≥ 200 | 1 | |
| fbs< 200 | -1 | ||
| restecg | restecg= 1,2 | 1 | |
| restecg= 0 | -1 | ||
| thalach | thalach> 100,thalach< 60 | 1 | |
| 60≤ thalach≤ 100 | -1 | ||
| exang | exang= 1 | 1 | |
| exang= 0 | -1 | ||
| oldpeak | oldpeak≥ 0.5 | 1 | |
| oldpeak< 0.5 | -1 | ||
| slope | slope= 1 | 1 | |
| slope= 2 | -1 | ||
| slope= 3 | 0 | ||
| ca | ca= 1,2,3,4 | 1 | |
| ca= 0 | -1 | ||
| thal | thal= 0,1,2 | 1 | |
| thal= 3 | -1 | ||
| target | target= 1 | 1 | |
| target= 0 | -1 |
Rules for Heart Disease data set
| Rule type | Total number of rules | Main rules |
| ||||
|---|---|---|---|---|---|---|---|
| 1 | ( | 1 | 1 | 39.7057 | 14.8919 | ||
| 1 | ( | 1 | 1 | 39.462 | 14.4613 | ||
| 40 | ( | 0.8303 | 0.7366 | 37.4108 | 11.2108 | ||
| ( | 0.5879 | 0.6218 | 29.3431 | 8.3118 | |||
| ( | 0.4727 | 0.8041 | 26.8052 | 5.7455 | |||
| 71 | ( | 0.7536 | 0.7273 | 35.3282 | 7.6181 | ||
| ( | 0.6449 | 0.7607 | 32.3924 | 7.7250 | |||
| ( | 0.5725 | 0.5374 | 28.5641 | 7.0872 | |||
Experimental results of the chosen data sets
| Data sets | Total number of rules | Reduction rules | Main rules |
|
| ||
|---|---|---|---|---|---|---|---|
| Breast cancer wisconsin | 133 | 70 | ( | 0.8967 | 0.8127 | 90.7643 | 46.8699 |
| ( | 0.6405 | 0.9568 | 92.2786 | 23.8341 | |||
| ( | 0.6157 | 0.9675 | 76.5135 | 24.6198 | |||
| Iris | 23 | 12 | ( | 1 | 1 | 9.619 | 5.6871 |
| ( | 0.98 | 0.875 | 8.5139 | 2.2049 | |||
| ( | 0.82 | 0.6721 | 5.8417 | 1.8458 | |||
| Acute inflammations | 28 | 18 | ( | 0.8305 | 1 | 14.2865 | 2.8125 |
| ( | 0.8 | 1 | 15.9808 | 1.1987 | |||
| ( | 0.6 | 1 | 13.3448 | 2.17241 | |||
| Seeds | 58 | 55 | ( | 0.9286 | 0.8667 | 46.0313 | 8.2483 |
| ( | 0.8857 | 0.8732 | 47.7632 | 6.5574 | |||
| ( | 0.7143 | 0.9615 | 40.0952 | 3.6689 |
The experimental results for comparison
| Dataset | Our algorithm (BiR) | Literature [ | Bayes net | Random forest | Decision tree | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Auc | Accuracy(%) | Auc | Accuracy(%) | Auc | Accuracy(%) | Auc | Accuracy(%) | Auc | Accuracy(%) | |
| Acute inflammations | ||||||||||
| Breast cancer wisconsin | 0.99 | 95.85 | 0.99 | 0.99 | 97.14 | 0.98 | 93.14 | |||
| Breast cancer coimbra | 0.72 | 72.45 | 0.64 | 62.07 | 0.68 | 58.62 | 0.70 | 62.07 | ||
| Heart disease | 0.78 | 80.26 | 82.89 | 0.84 | 76.32 | 0.81 | 78.95 | |||
| Hepatitis | 0.98 | 80.68 | 0.91 | 84.62 | 0.87 | 82.05 | 0.79 | 71.79 | ||
| Lymphography | 90.60 | 0 | 97.30 | 0.99 | 97.30 | |||||
| Spect heart | 0.93 | 84.59 | 0.82 | 79.10 | 0.83 | 91.04 | 0.75 | 77.61 | ||
Bold entries represent better Auc and accuracy values
Fig. 4Roc curve comparison chart