| Literature DB >> 21362183 |
Thorsten Lehr1, Jing Yuan, Dirk Zeumer, Supriya Jayadev, Marylyn D Ritchie.
Abstract
BACKGROUND: Several methods have been presented for the analysis of complex interactions between genetic polymorphisms and/or environmental factors. Despite the available methods, there is still a need for alternative methods, because no single method will perform well in all scenarios. The aim of this work was to evaluate the performance of three selected rule based classifier algorithms, RIPPER, RIDOR and PART, for the analysis of genetic association studies.Entities:
Year: 2011 PMID: 21362183 PMCID: PMC3060133 DOI: 10.1186/1756-0381-4-4
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Case-Control Models
| Model A | Model B | Model C |
|---|---|---|
| If rs5 = AA and rs10 = AB then Case | If rs5 = BB and rs10 = AA then Case | If random number |
| If rs5 = AB and rs10 = AA then Case | If rs15 = AA and AUC >105 then Case | Else Control |
| If rs5 = AB and rs10 = BB then Case | Else Control | |
| If rs5 = BB and rs10 = AB then Case | ||
| Else Control |
Datasets investigated
| Dataset | Model | # SNPs | Ratio Control/Case | FP [%] | FN [%] | |
|---|---|---|---|---|---|---|
| 1 | A | 500 | 300 | 2 | 5 | 5 |
| 2 | A | 1500 | 300 | 2 | 5 | 5 |
| 3 | A | 3000 | 300 | 2 | 5 | 5 |
| 4 | A | 500 | 600 | 2 | 5 | 5 |
| 5 | A | 1500 | 600 | 2 | 5 | 5 |
| 6 | A | 3000 | 600 | 2 | 5 | 5 |
| 7 | A | 500 | 300 | 2 | 10 | 10 |
| 8 | A | 1500 | 300 | 2 | 10 | 10 |
| 9 | A | 3000 | 300 | 2 | 10 | 10 |
| 10 | A | 500 | 600 | 2 | 10 | 10 |
| 11 | A | 1500 | 600 | 2 | 10 | 10 |
| 12 | A | 3000 | 600 | 2 | 10 | 10 |
| 13 | A | 500 | 300 | 2 | 20 | 20 |
| 14 | A | 1500 | 300 | 2 | 20 | 20 |
| 15 | A | 3000 | 300 | 2 | 20 | 20 |
| 16 | A | 500 | 600 | 2 | 20 | 20 |
| 17 | A | 1500 | 600 | 2 | 20 | 20 |
| 18 | A | 3000 | 600 | 2 | 20 | 20 |
| 19 | B | 500 | 150+150 | 2 | 5 | 5 |
| 20 | B | 1500 | 150+150 | 2 | 5 | 5 |
| 21 | B | 3000 | 150+150 | 2 | 5 | 5 |
| 22 | B | 500 | 300+300 | 2 | 5 | 5 |
| 23 | B | 1500 | 300+300 | 2 | 5 | 5 |
| 24 | B | 3000 | 300+300 | 2 | 5 | 5 |
| 25 | B | 500 | 150+150 | 2 | 10 | 10 |
| 26 | B | 1500 | 150+150 | 2 | 10 | 10 |
| 27 | B | 3000 | 150+150 | 2 | 10 | 10 |
| 28 | B | 500 | 300+300 | 2 | 10 | 10 |
| 29 | B | 1500 | 300+300 | 2 | 10 | 10 |
| 30 | B | 3000 | 300+300 | 2 | 10 | 10 |
| 31 | B | 500 | 150+150 | 2 | 20 | 20 |
| 32 | B | 1500 | 150+150 | 2 | 20 | 20 |
| 33 | B | 3000 | 150+150 | 2 | 20 | 20 |
| 34 | B | 500 | 300+300 | 2 | 20 | 20 |
| 35 | B | 1500 | 300+300 | 2 | 20 | 20 |
| 36 | B | 3000 | 300+300 | 2 | 20 | 20 |
| 37 | C | 500 | 300 | 2 | n.a. | n.a. |
| 38 | C | 1500 | 300 | 2 | n.a. | n.a. |
| 39 | C | 3000 | 300 | 2 | n.a. | n.a. |
| 40 | C | 500 | 600 | 2 | n.a. | n.a. |
| 41 | C | 1500 | 600 | 2 | n.a. | n.a. |
| 42 | C | 3000 | 600 | 2 | n.a. | n.a. |
$ For case rule A patients are equally distributed for the 4 case rules; for case rule B the first number indicates the number for rule 1, the second number for rule 2; n.a: not applicable.
Settings of Algorithm Options
| Nr | RIPPER | RIDOR | PART |
|---|---|---|---|
| 1 | -F 3 -N 2.0 -O 10 | -F 3 -S 1 -N 2.0 -A | -R -B -M 2 -N 3 |
| 2 | -F 3 -N 5.0 -O 10 | -F 3 -S 1 -N 5.0 -A | -R -B -M 5 -N 3 |
| 3 | -F 3 -N 10.0 -O 10 | -F 3 -S 1 -N 10.0 -A | -R -B -M 10 -N 3 |
| 4 | -F 10 -N 2.0 -O 10 | -F 10 -S 1 -N 2.0 -A | -R -B -M 2 -N 10 |
| 5 | -F 10 -N 5.0 -O 10 | -F 10 -S 1 -N 5.0 -A | -R -B -M 5 -N 10 |
| 6 | -F 10 -N 10.0 -O 10 | -F 10 -S 1 -N 10.0 -A | -R -B -M 10 -N 10 |
| 7 | -F 100 -N 2.0 -O 10 | -F 20 -S 1 -N 2.0 -A | -R -B -M 2 -N 100 |
| 8 | -F 100 -N 5.0 -O 10 | -F 20 -S 1 -N 5.0 -A | -R -B -M 5 -N 100 |
| 9 | -F 100 -N 10.0 -O 10 | -F 20 -S 1 -N 10.0 -A | -R -B -M 10 -N 100 |
| 10 | -F 3 -N 2.0 -O 100 | -R -M 2 -N 3 | |
| 11 | -F 3 -N 5.0 -O 100 | -R -M 5 -N 3 | |
| 12 | -F 3 -N 10.0 -O 100 | -R -M 10 -N 3 | |
| 13 | -F 10 -N 2.0 -O 100 | -R -M 2 -N 10 | |
| 14 | -F 10 -N 5.0 -O 100 | -R -M 5 -N 10 | |
| 15 | -F 10 -N 10.0 -O 100 | -R -M 10 -N 10 | |
| 16 | -F 100 -N 2.0 -O 100 | -R -M 2 -N 100 | |
| 17 | -F 100 -N 5.0 -O 100 | -R -M 5 -N 100 | |
| 18 | -F 100 -N 10.0 -O 100 | -R -M 10 -N 100 | |
| 19 | -B -M 2 -C 0.25 | ||
| 20 | -B -M 2 -C 0.1 | ||
| 21 | -B -M 5 -C 0.25 | ||
| 22 | -B -M 5 -C 0.1 | ||
| 23 | -B -M 10 -C 0.25 | ||
| 24 | -B -M 10 -C 0.1 | ||
| 25 | -M 2 -C 0.25 | ||
| 26 | -M 2 -C 0.1 | ||
| 27 | -M 5 -C 0.25 | ||
| 28 | -M 5 -C 0.1 | ||
| 29 | -M 10 -C 0.25 | ||
| 30 | -M 10 -C 0.1 |
RIPPER: F: number of folds for reduced error pruning; N: minimal weights of instances within a split; O: number of optimization runs
RIDOR: F: number of folds for reduced error pruning; S: number of shuffles for randomization; A: Flag set to use the error rate of all the data to select the default class in each step. N: minimal weight of instances within a split.
PART: C: confidence threshold for pruning; M: minimum number of instances per leaf; R: use reduced error pruning; N: number of folds for reduced error pruning; B: Use binary splits for nominal attributes
Grading System
| Grade | Model | Rules | Attribute |
|---|---|---|---|
| A | 100% accordance | 100% accordance | All attributes were present (i.e. detected) and ranked as most frequent, i.e. top 2 for model A and top 4 for model B. |
| B | One attribute was missing or an additional attribute was identified by the generated model | One attribute was missing or an additional attribute was identified by the generated model | All attributes were present but not ranked as most frequent |
| C | Two attributes were different between the generated and the true model, | Two attributes were different between the generated and the true model | One attribute was not present, remaining attributes were present and rank was not considered |
| D | Three attributes were different between the generated and the true model | Three attributes were different between the generated and the true model | Two attributes were not present, remaining attributes were present and ranked as most frequent |
Comparison of the generated versus true model, rules and attributes. (Model B was used for the example.)
Statistics
| RIPPER | RIDOR | PART | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Model A | Model B | Model C | Model A | Model B | Model C | Model A | Model B | Model C | ||
| # | Min | 1 | 3 | 1 | 2 | 3 | 1 | 1 | 1 | 2 |
| Attributes* per model | 5th Percentile | 2 | 3 | 1 | 4 | 5 | 4 | 2 | 4 | 7 |
| Median | 7 | 4 | 15 | 16 | 17 | 28 | 24 | 24 | 47 | |
| 95th Percentile | 28 | 13 | 42 | 46 | 50 | 70 | 102 | 109 | 144 | |
| Max | 43 | 21 | 63 | 65 | 62 | 73 | 245 | 240 | 259 | |
| # rules$ per model | Min | 2 | 3 | 2 | 3 | 3 | 1 | 1 | 1 | 1 |
| 5th Percentile | 3 | 3 | 2 | 4 | 4 | 2 | 2 | 2 | 1 | |
| Median | 6 | 3 | 6 | 8 | 8 | 9.5 | 7 | 6 | 9 | |
| 95th Percentile | 11 | 7 | 14 | 16 | 17 | 19 | 25.5 | 24.5 | 37.5 | |
| Max | 16 | 9 | 18 | 20 | 20 | 20 | 45 | 43 | 46 | |
* unique SNP or unique environmental variable; $ Case and control rules combined
Qualitative Results - Summary
| RIPPER | RIDOR | PART | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Evaluation Level | Grade | Model A | Model B | Model C | Model A | Model B | Model C | Model A | Model B | Model C |
| Models | A | 33% (6) | 56% (10) | 0% (0) | 0% (0) | 11% (2) | 0% (0) | 0% (0) | 33% (6) | 0% (0) |
| B | 33% (6) | 50% (9) | 0% (0) | 0% (0) | 6% (1) | 0% (0) | 0% (0) | 33% (6) | 0% (0) | |
| C | 22% (4) | 56% (10) | 0% (0) | 0% (0) | 6% (1) | 0% (0) | 0% (0) | 22% (4) | 0% (0) | |
| D | 6% (1) | 22% (4) | 0% (0) | 0% (0) | 17% (3) | 0% (0) | 6% (1) | 11% (2) | 0% (0) | |
| Rules | A | 83% (15) | 100% (18) | 0% (0) | 83% (15) | 94% (17) | 0% (0) | 44% (8) | 78% (14) | 0% (0) |
| B | 89% (16) | 72% (13) | 0% (0) | 94% (17) | 100% (18) | 0% (0) | 50% (9) | 100% (18) | 0% (0) | |
| C | 67% (12) | 67% (12) | 0% (0) | 56% (10) | 100% (18) | 0% (0) | 78% (14) | 89% (16) | 0% (0) | |
| D | 50% (9) | 56% (10) | 0% (0) | 67% (12) | 100% (18) | 0% (0) | 56% (10) | 94% (17) | 0% (0) | |
| Attributes | A | 83% (15) | 56% (10) | 0% (0) | 50% (9) | 72% (13) | 0% (0) | 11% (2) | 0% (0) | 0% (0) |
| B | 17% (3) | 28% (5) | 0% (0) | 50% (9) | 17% (3) | 0% (0) | 61% (11) | 78% (14) | 0% (0) | |
| C | 0% (0) | 17% (3) | 0% (0) | 0% (0) | 11% (2) | 0% (0) | 6% (1) | 22% (4) | 0% (0) | |
| D | 0% (0) | 0% (0) | 0% (0) | 0% (0) | 0% (0) | 0% (0) | 22% (4) | 0% (0) | 0% (0) | |
Summary of the results, separated by algorithm, case-control model, and grading. Number represents the percent frequency of datasets where a respective grade was achieved at least once (absolute number is in brackets).
Quantitative Results - Models Level
| RIPPER | RIDOR | PART | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Dataset | SNPs | Patients | Error [%] | A | B | C | D | A | B | C | D | A | B | C | D |
| A | 1 | 500 | 300 | 5 | 0 | 1 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 2 | 1500 | 300 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| A | 3 | 3000 | 300 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 4 | 500 | 600 | 5 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 5 | 1500 | 600 | 5 | 7 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 6 | 3000 | 600 | 5 | 1 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 7 | 500 | 300 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 8 | 1500 | 300 | 10 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 9 | 3000 | 300 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 10 | 500 | 600 | 10 | 8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 11 | 1500 | 600 | 10 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 12 | 3000 | 600 | 10 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 13 | 500 | 300 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 14 | 1500 | 300 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 15 | 3000 | 300 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 16 | 500 | 600 | 20 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 17 | 1500 | 600 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 18 | 3000 | 600 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 19 | 500 | 150+150 | 5 | 11 | 7 | 0 | 0 | 2 | 0 | 1 | 0 | 1 | 1 | 0 | 0 |
| B | 20 | 1500 | 150+150 | 5 | 15 | 0 | 3 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 0 |
| B | 21 | 3000 | 150+150 | 5 | 4 | 4 | 2 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 1 | 1 |
| B | 22 | 500 | 300+300 | 5 | 14 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 |
| B | 23 | 1500 | 300+300 | 5 | 18 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
| B | 24 | 3000 | 300+300 | 5 | 17 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| B | 25 | 500 | 150+150 | 10 | 7 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 2 | 0 |
| B | 26 | 1500 | 150+150 | 10 | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 2 | 1 |
| B | 27 | 3000 | 150+150 | 10 | 0 | 3 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 28 | 500 | 300+300 | 10 | 18 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| B | 29 | 1500 | 300+300 | 10 | 17 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 |
| B | 30 | 3000 | 300+300 | 10 | 13 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 31 | 500 | 150+150 | 20 | 0 | 0 | 7 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| B | 32 | 1500 | 150+150 | 20 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 33 | 3000 | 150+150 | 20 | 0 | 0 | 4 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 34 | 500 | 300+300 | 20 | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 35 | 1500 | 300+300 | 20 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| B | 36 | 3000 | 300+300 | 20 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 37 | 500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 38 | 1500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 39 | 3000 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 40 | 500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 41 | 1500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 42 | 3000 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Summary of the results at the model level, separated by algorithm, case-control model, dataset and grading. Number expresses the absolute frequency of the respective grading assignment.
Quantitative Results - Rules Level
| RIPPER | RIDOR | PART | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Dataset | SNPs | Patients | Error [%] | A | B | C | D | A | B | C | D | A | B | C | D |
| A | 1 | 500 | 300 | 5 | 44 | 14 | 3 | 0 | 4 | 6 | 2 | 3 | 12 | 13 | 2 | 2 |
| A | 2 | 1500 | 300 | 5 | 0 | 36 | 0 | 0 | 4 | 16 | 0 | 3 | 3 | 4 | 1 | 2 |
| A | 3 | 3000 | 300 | 5 | 0 | 1 | 7 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 4 | 500 | 600 | 5 | 46 | 14 | 0 | 0 | 30 | 20 | 0 | 0 | 25 | 28 | 27 | 10 |
| A | 5 | 1500 | 600 | 5 | 72 | 0 | 0 | 0 | 12 | 29 | 4 | 1 | 2 | 4 | 3 | 2 |
| A | 6 | 3000 | 600 | 5 | 28 | 32 | 0 | 0 | 8 | 27 | 0 | 0 | 12 | 6 | 2 | 1 |
| A | 7 | 500 | 300 | 10 | 23 | 27 | 4 | 7 | 2 | 5 | 0 | 1 | 2 | 3 | 2 | 0 |
| A | 8 | 1500 | 300 | 10 | 19 | 13 | 0 | 3 | 2 | 12 | 0 | 2 | 2 | 0 | 2 | 0 |
| A | 9 | 3000 | 300 | 10 | 1 | 11 | 12 | 11 | 0 | 1 | 3 | 3 | 0 | 0 | 0 | 0 |
| A | 10 | 500 | 600 | 10 | 46 | 16 | 0 | 0 | 11 | 23 | 7 | 0 | 25 | 20 | 11 | 2 |
| A | 11 | 1500 | 600 | 10 | 35 | 10 | 15 | 6 | 6 | 19 | 11 | 2 | 0 | 11 | 16 | 9 |
| A | 12 | 3000 | 600 | 10 | 48 | 5 | 20 | 0 | 7 | 7 | 0 | 0 | 0 | 0 | 3 | 1 |
| A | 13 | 500 | 300 | 20 | 11 | 21 | 8 | 3 | 1 | 4 | 3 | 6 | 0 | 0 | 4 | 2 |
| A | 14 | 1500 | 300 | 20 | 5 | 1 | 6 | 9 | 0 | 1 | 3 | 0 | 0 | 0 | 0 | 0 |
| A | 15 | 3000 | 300 | 20 | 3 | 0 | 1 | 0 | 1 | 0 | 0 | 7 | 0 | 0 | 0 | 0 |
| A | 16 | 500 | 600 | 20 | 48 | 17 | 7 | 1 | 8 | 6 | 17 | 4 | 0 | 2 | 1 | 0 |
| A | 17 | 1500 | 600 | 20 | 25 | 18 | 26 | 2 | 8 | 4 | 15 | 4 | 0 | 0 | 2 | 4 |
| A | 18 | 3000 | 600 | 20 | 0 | 19 | 1 | 3 | 1 | 6 | 2 | 4 | 0 | 0 | 1 | 0 |
| B | 19 | 500 | 150+150 | 5 | 29 | 4 | 3 | 0 | 8 | 13 | 5 | 5 | 31 | 17 | 9 | 2 |
| B | 20 | 1500 | 150+150 | 5 | 30 | 6 | 0 | 0 | 8 | 8 | 7 | 5 | 12 | 13 | 19 | 3 |
| B | 21 | 3000 | 150+150 | 5 | 27 | 5 | 13 | 8 | 7 | 5 | 11 | 7 | 4 | 21 | 8 | 8 |
| B | 22 | 500 | 300+300 | 5 | 36 | 0 | 0 | 0 | 8 | 35 | 9 | 3 | 41 | 41 | 15 | 10 |
| B | 23 | 1500 | 300+300 | 5 | 36 | 0 | 0 | 0 | 9 | 21 | 12 | 5 | 20 | 27 | 11 | 4 |
| B | 24 | 3000 | 300+300 | 5 | 35 | 1 | 0 | 0 | 9 | 19 | 10 | 10 | 13 | 22 | 25 | 8 |
| B | 25 | 500 | 150+150 | 10 | 25 | 11 | 1 | 1 | 4 | 4 | 17 | 20 | 12 | 20 | 14 | 1 |
| B | 26 | 1500 | 150+150 | 10 | 16 | 5 | 19 | 9 | 1 | 9 | 14 | 7 | 17 | 13 | 3 | 3 |
| B | 27 | 3000 | 150+150 | 10 | 18 | 4 | 11 | 14 | 0 | 14 | 3 | 9 | 0 | 5 | 1 | 1 |
| B | 28 | 500 | 300+300 | 10 | 36 | 0 | 0 | 0 | 17 | 8 | 9 | 8 | 28 | 18 | 14 | 10 |
| B | 29 | 1500 | 300+300 | 10 | 35 | 1 | 0 | 0 | 7 | 14 | 16 | 6 | 7 | 27 | 7 | 4 |
| B | 30 | 3000 | 300+300 | 10 | 31 | 7 | 2 | 0 | 7 | 9 | 12 | 9 | 2 | 17 | 14 | 7 |
| B | 31 | 500 | 150+150 | 20 | 18 | 0 | 13 | 10 | 4 | 3 | 9 | 5 | 0 | 1 | 3 | 2 |
| B | 32 | 1500 | 150+150 | 20 | 10 | 8 | 21 | 11 | 2 | 8 | 5 | 6 | 0 | 1 | 0 | 0 |
| B | 33 | 3000 | 150+150 | 20 | 9 | 7 | 18 | 2 | 1 | 2 | 7 | 9 | 0 | 1 | 0 | 1 |
| B | 34 | 500 | 300+300 | 20 | 18 | 17 | 3 | 13 | 4 | 5 | 18 | 13 | 2 | 1 | 6 | 1 |
| B | 35 | 1500 | 300+300 | 20 | 18 | 0 | 9 | 31 | 4 | 3 | 12 | 13 | 2 | 4 | 7 | 2 |
| B | 36 | 3000 | 300+300 | 20 | 16 | 2 | 23 | 20 | 5 | 3 | 5 | 8 | 1 | 3 | 1 | 1 |
| C | 37 | 500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 38 | 1500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 39 | 3000 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 40 | 500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 41 | 1500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 42 | 3000 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Summary of the results at the rules level, separated by algorithm, case-control model, dataset and grading. Number expresses the absolute frequency of the respective grading assignment.
Quantitative Results - Attributes Level
| RIPPER | RIDOR | PART | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Dataset | SNPs | Patients | Error [%] | A | B | C | D | A | B | C | D | A | B | C | D |
| A | 1 | 500 | 300 | 5 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 2 | 1500 | 300 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 3 | 3000 | 300 | 5 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| A | 4 | 500 | 600 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
| A | 5 | 1500 | 600 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 6 | 3000 | 600 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 7 | 500 | 300 | 10 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 8 | 1500 | 300 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 9 | 3000 | 300 | 10 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| A | 10 | 500 | 600 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| A | 11 | 1500 | 600 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 12 | 3000 | 600 | 10 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 13 | 500 | 300 | 20 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 14 | 1500 | 300 | 20 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| A | 15 | 3000 | 300 | 20 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| A | 16 | 500 | 600 | 20 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 17 | 1500 | 600 | 20 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| A | 18 | 3000 | 600 | 20 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| B | 19 | 500 | 150+150 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 20 | 1500 | 150+150 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 21 | 3000 | 150+150 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 22 | 500 | 300+300 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 23 | 1500 | 300+300 | 5 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 24 | 3000 | 300+300 | 5 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 25 | 500 | 150+150 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| B | 26 | 1500 | 150+150 | 10 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 27 | 3000 | 150+150 | 10 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 |
| B | 28 | 500 | 300+300 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 29 | 1500 | 300+300 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 30 | 3000 | 300+300 | 10 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 31 | 500 | 150+150 | 20 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| B | 32 | 1500 | 150+150 | 20 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| B | 33 | 3000 | 150+150 | 20 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| B | 34 | 500 | 300+300 | 20 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 35 | 1500 | 300+300 | 20 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| B | 36 | 3000 | 300+300 | 20 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| C | 37 | 500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 38 | 1500 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 39 | 3000 | 300 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 40 | 500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 41 | 1500 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C | 42 | 3000 | 600 | n.a. | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Summary of the results at the attribute level, separated by algorithm, case-control model, dataset and grading. A "1" reflects an affiliation to the respective grading, whereas a "0" reflects no affiliation.