| Literature DB >> 30069067 |
Bijan Parsia1, Nicolas Matentzoglu1, Rafael S Gonçalves2, Birte Glimm3, Andreas Steigmiller3.
Abstract
The OWL Reasoner Evaluation competition is an annual competition (with an associated workshop) that pits OWL 2 compliant reasoners against each other on various standard reasoning tasks over naturally occurring problems. The 2015 competition was the third of its sort and had 14 reasoners competing in six tracks comprising three tasks (consistency, classification, and realisation) over two profiles (OWL 2 DL and EL). In this paper, we discuss the design, execution and results of the 2015 competition with particular attention to lessons learned for benchmarking, comparative experiments, and future competitions.Entities:
Keywords: OWL; Ontologies; Reasoning
Year: 2017 PMID: 30069067 PMCID: PMC6044265 DOI: 10.1007/s10817-017-9406-8
Source DB: PubMed Journal: J Autom Reason ISSN: 0168-7433 Impact factor: 0.944
Fig. 1The number of ontologies sampled for each size bin
Breakdown of user-submitted ontologies in the ORE 2015 corpus
| Ontology | TBox | ABox | Ontology | TBox | ABox |
|---|---|---|---|---|---|
| CO | 7527 | 0 | VFB-EPN | 33,612 | 63,295 |
| DINTO | 123,930 | 0 | VFB-KB | 20,187 | 147,996 |
| DPO | 917 | 0 | VFB-NCT | 33,612 | 63,295 |
| GO+ | 150,955 | 0 |
Breakdown of user-submitted ontologies in the ORE 2014 corpus
| Ontology | TBox | ABox | Ontology | TBox | ABox |
|---|---|---|---|---|---|
| CB-BERN | 428 | 209,932 | MSC-D | 9532 | 0 |
| CB-CORK | 428 | 20,393 | MSC | 9532 | 318 |
| DCHARS | 1925 | 1728 | FHKB-V3 | 425 | 3307 |
| DGO | 233 | 47,603 | RMO-A | 1925 | 15,759 |
| DMKB | 1925 | 1606 | FHKB-V1 | 355 | 3296 |
| DMOP | 1986 | 765 | DPC-OLY | 122 | 35,866 |
| DPC-1 | 122 | 54,898 | PD | 1930 | 973 |
| DPC-2 | 122 | 79,955 | FHKB-V2 | 419 | 3304 |
| FMA-CPFNS | 123,024 | 86 | USDA10 | 174 | 3602 |
| FTC | 140,799 | 0 | USDA15 | 176 | 5948 |
| GALEN-FU | 37,411 | 0 | USDA20 | 176 | 8600 |
| GALEN-H | 10,628 | 0 | USDA25 | 177 | 9785 |
| G-CDS | 4322 | 0 | USDA5 | 174 | 1226 |
| G-CDS-D | 4322 | 140 | CB-VIENNA | 428 | 584,266 |
| HP | 123 | 17,027 |
Participant list with OWL 2 DL reasoners in the top and OWL 2 EL reasoners in the bottom part
| Reasoner | New 2015 | Consistency | Classification | Realisation | Language | License |
|---|---|---|---|---|---|---|
| OWL DL | ||||||
| Chainsaw [ | – |
|
|
| Java | LGPL 2.0 |
| FaCT++ [ | – |
|
|
| C++ | LGPL 2.0 |
| HermiT | – |
|
|
| Java | LGPL 3.0 |
| Jfact [ | – |
|
|
| Java | LGPL 2.0 |
| Konclude [ | – |
|
|
| C++ | LGPL 2.1 |
| MORe [ | – |
|
| – | Java | LGPL 3.0 |
| PAGOdA [ |
| – | – |
| Java | Academic license |
| Pellet-OA4 [ |
|
|
|
| Java | AGPL v3 |
| Racer [ |
|
|
|
| LISP | BSD 3-clause license |
| TrOWL [ | – |
|
|
| Java | AGPL v3 |
| OWL EL | ||||||
| ELepHant [ | – |
|
|
| C++ | Apache Licence 2.0 |
| ELK [ | – |
|
|
| Java | Apache Licence 2.0 |
| jcel [ | – |
|
| – | Java | Apache Licence 2.0 |
HermiT was submitted with OWL API 3 and OWL API 4 bindings
Breakdown of the competition by track
| Task | Competitors | Problems |
|---|---|---|
| OWL DL | ||
| Consistency | 10 | 306 |
| Classification | 10 | 306 |
| Realisation | 10 | 264 |
| OWL EL | ||
| Consistency | 13 | 298 |
| Classification | 13 | 298 |
| Realisation | 12 | 109 |
Fig. 2Results of the competition by track as displayed in the live competition display. Score indicates the number of problems solved out of the total problems for that track. The number of unsolved problems (whether by timeout, crash, or “wrong” results) are displayed in the next column. Time indicates the time actually taken to complete solved problems. Time is used to resolve ties for solved problems
Fig. 3Reasoning time of the three winning reasoners in each category for the DL and EL profile (ordered separately by speed of the reasoner). The top horizontal line illustrates the timeout of 180 s
Full break-down of solved problems by reasoner and task over the 306 ontologies for DL consistency and classification, 264 for DL realisation, 298 for EL consistency and classification, and 109 for EL realisation
| Reasoner | Solved | Timeout | Error | Wrong | Solved | Timeout | Error | Wrong |
|---|---|---|---|---|---|---|---|---|
| DL consistency | EL consistency | |||||||
| Chainsaw | 291 | 3 | 11 | 1 | 276 | 19 | 3 | 0 |
| ELepHant | – | – | – | – | 296 | 2 | 0 | 0 |
| ELK | – | – | – | – | 298 | 0 | 0 | 0 |
| FaCT++ | 276 | 16 | 13 | 1 | 270 | 22 | 6 | 0 |
| HermiT | 294 | 8 | 3 | 1 | 282 | 16 | 0 | 0 |
| HermiT-OA4 | 293 | 8 | 4 | 1 | 282 | 16 | 0 | 0 |
| jcel | – | – | – | – | 261 | 35 | 2 | 0 |
| Jfact | 166 | 83 | 52 | 5 | 227 | 71 | 0 | 0 |
| Konclude | 303 | 1 | 0 | 2 | 298 | 0 | 0 | 0 |
| MORe | 253 | 43 | 2 | 8 | 295 | 3 | 0 | 0 |
| Pellet-OA4 | 278 | 26 | 0 | 2 | 285 | 13 | 0 | 0 |
| Racer | 239 | 48 | 1 | 18 | 256 | 40 | 0 | 2 |
| TrOWL | 266 | 0 | 36 | 4 | 273 | 0 | 25 | 0 |
| DL classification | EL classification | |||||||
| Chainsaw | 119 | 171 | 16 | 0 | 191 | 94 | 13 | 0 |
| ELepHant | – | – | – | – | 291 | 6 | 0 | 1 |
| ELK | – | – | – | – | 298 | 0 | 0 | 0 |
| FaCT++ | 200 | 87 | 17 | 2 | 244 | 51 | 3 | 0 |
| HermiT | 236 | 67 | 2 | 1 | 272 | 26 | 0 | 0 |
| HermiT-OA4 | 237 | 66 | 2 | 1 | 272 | 26 | 0 | 0 |
| jcel | – | – | – | – | 133 | 158 | 6 | 1 |
| Jfact | 128 | 106 | 59 | 13 | 189 | 89 | 2 | 18 |
| Konclude | 288 | 7 | 1 | 10 | 294 | 0 | 0 | 4 |
| MORe | 247 | 41 | 2 | 16 | 294 | 2 | 0 | 2 |
| Pellet-OA4 | 187 | 105 | 14 | 0 | 261 | 28 | 9 | 0 |
| Racer | 164 | 86 | 2 | 54 | 237 | 38 | 0 | 23 |
| TrOWL | 201 | 0 | 35 | 70 | 275 | 0 | 23 | 0 |
| DL realisation | EL realisation | |||||||
| Chainsaw | 79 | 166 | 16 | 3 | 43 | 64 | 2 | 0 |
| ELepHant | – | – | – | – | 84 | 1 | 0 | 24 |
| ELK | – | – | – | – | 102 | 0 | 0 | 7 |
| FaCT++ | 172 | 58 | 25 | 9 | 79 | 27 | 3 | 0 |
| HermiT | 163 | 93 | 5 | 3 | 57 | 52 | 0 | 0 |
| HermiT-OA4 | 162 | 93 | 6 | 3 | 57 | 52 | 0 | 0 |
| Jfact | 109 | 89 | 47 | 19 | 63 | 43 | 0 | 3 |
| Konclude | 247 | 2 | 1 | 14 | 104 | 0 | 0 | 5 |
| PAGOdA | 104 | 51 | 95 | 14 | 86 | 15 | 0 | 8 |
| Pellet-OA4 | 136 | 54 | 24 | 50 | 60 | 32 | 2 | 15 |
| Racer | 46 | 75 | 3 | 140 | 32 | 31 | 0 | 46 |
| TrOWL | 150 | 0 | 43 | 71 | 86 | 0 | 22 | 1 |
Fig. 4Number of reasoning problems (y-axis) by total number of reasoners solving them. For example, 5 DL classification tasks were not solved by any reasoner, and 123 EL classification tasks were solved by all reasoners
Results for the user submitted ontologies (times are reported in seconds)
| Reasoner | CO | DINTO | DPO | GO+ |
|---|---|---|---|---|
| Consistency | ||||
| Chainsaw | 1.80 | 4.86 | 0.76 | 6.05 |
| FaCT++ | 2.32 |
| 0.81 | 1254.24 |
| HermiT |
| 9.68 | 2.98 | 202.42 |
| HermiT-OA4 |
| 12.66 | 3.64 | 71.52 |
| Jfact |
|
| 3.08 |
|
| Konclude | 0.42 | 3.60 | 0.18 | 6.38 |
| MORe |
|
| 44.74 | 1103.87 |
| Pellet-OA4 |
| 35.53 | 2.55 |
|
| Racer | 7.72 |
| 0.93 | 79.88 ( |
| TrOWL | 4.10 | 10.94 | 2.38 | 38.02 |
| Classification | ||||
| Chainsaw |
|
| 5.82 |
|
| FaCT++ | 6652.28 |
| 8.32 |
|
| HermiT |
|
| 84.29 | 6227.97 |
| HermiT-OA4 |
|
| 93.51 | 2351.83 |
| Jfact |
|
| 49.30 ( |
|
| Konclude | 201.68 | 6.84 | 0.46 | 69.75 |
| MORe |
|
| 43.73 | 1023.11 |
| Pellet-OA4 |
|
| 9.45 |
|
| Racer |
|
|
| 78.92 ( |
| TrOWL | 3.87 ( | 13.06 | 2.73 ( | 42.05 ( |
Italics indicate a failed attempt, either timeout, thrown error, or wrong (results)
Results for the user submitted ontologies
| Reasoner | VFB-EPNT | VFB-KB | VFB-NCT |
|---|---|---|---|
| Consistency | |||
| Chainsaw | 12.82 |
| 12.71 |
| FaCT++ | 10.20 |
| 10.27 |
| HermiT | 11.11 |
| 12.00 |
| HermiT-OA4 | 10.43 |
| 9.83 |
| Jfact | 121.92 |
| 120.33 |
| Konclude | 3.73 | 3.81 | 3.93 |
| MORe |
|
|
|
| Pellet-OA4 | 9.06 |
| 8.86 |
| Racer | 134.28 ( | 741.89 ( | 129.21 ( |
| TrOWL | 10.49 |
| 9.10 |
| Classification | |||
| Chainsaw |
|
|
|
| FaCT++ | 10.24 |
| 9.93 |
| HermiT | 9.59 |
| 9.19 |
| HermiT-OA4 | 9.93 | 9.10 ( | 9.95 |
| Jfact | 121.71 | 9.22 ( | 121.38 |
| Konclude | 3.87 | 24.45 | 3.40 |
| MORe |
|
|
|
| Pellet-OA4 |
|
|
|
| Racer | 132.82 ( | 768.53 ( | 132.41 ( |
| TrOWL | 8.93 |
| 9.46 |
| Realisation | |||
| Chainsaw |
|
|
|
| FaCT++ | 10.19 |
| 10.04 |
| HermiT | 9.37 |
| 10.07 |
| HermiT-OA4 | 9.40 |
| 9.77 |
| Jfact | 121.42 |
| 124.17 |
| Konclude | 3.92 | 4.24 | 3.93 |
| PAGOdA |
|
|
|
| Pellet-OA4 |
| 8.76 |
|
| Racer | 160.64 ( | 812.88 ( | 134.55 ( |
| TrOWL | 8.89 |
| 8.78 |
Italics indicate a failed attempt, either timeout, thrown error, or wrong (results)
Overall results for the user submitted ontologies with numbers in brackets showing %; sorting is by task, supported profile (EL reasoners are shown last) and % of solved problems; only ontologies with an ABox are used in the realisation track
| Reasoner | Completed | Error | Timeout | All |
|---|---|---|---|---|
| Consistency | ||||
| Konclude | 33 (91.67) | 1 (2.78) | 2 (5.56) | 36 |
| HermiT-OA4 | 29 (80.56) | 5 (13.89) | 2 (5.56) | 36 |
| HermiT | 27 (75.00) | 4 (11.11) | 5 (13.89) | 36 |
| TrOWL | 27 (75.00) | 9 (25.00) | 0 (0.00) | 36 |
| Chainsaw | 25 (69.44) | 8 (22.22) | 3 (8.33) | 36 |
| Racer | 24 (66.67) | 0 (0.00) | 12 (33.33) | 36 |
| Pellet-OA4 | 23 (63.89) | 8 (22.22) | 5 (13.89) | 36 |
| Jfact | 22 (61.11) | 9 (25.00) | 5 (13.89) | 36 |
| FaCT++ | 21 (58.33) | 12 (33.33) | 3 (8.33) | 36 |
| MORe | 19 (52.78) | 9 (25.00) | 8 (22.22) | 36 |
| ELepHant | 1 (100.00) | 0 (0.00) | 0 (0.00) | 1 |
| ELK | 1 (100.00) | 0 (0.00) | 0 (0.00) | 1 |
| jcel | 1 (100.00) | 0 (0.00) | 0 (0.00) | 1 |
| Classification | ||||
| Konclude | 29 (80.56) | 2 (5.56) | 5 (13.89) | 36 |
| TrOWL | 27 (75.00) | 9 (25.00) | 0 (0.00) | 36 |
| HermiT-OA4 | 24 (66.67) | 6 (16.67) | 6 (16.67) | 36 |
| HermiT | 23 (63.89) | 6 (16.67) | 7 (19.44) | 36 |
| Jfact | 22 (61.11) | 9 (25.00) | 5 (13.89) | 36 |
| FaCT++ | 19 (52.78) | 13 (36.11) | 4 (11.11) | 36 |
| Racer | 19 (52.78) | 0 ( 0.00) | 17 (47.22) | 36 |
| MORe | 17 (47.22) | 10 (27.78) | 9 (25.00) | 36 |
| Pellet-OA4 | 14 (38.89) | 9 (25.00) | 13 (36.11) | 36 |
| Chainsaw | 7 (19.44) | 21 (58.33) | 8 (22.22) | 36 |
| ELepHant | 1 (100.00) | 0 (0.00) | 0 (0.00) | 1 |
| ELK | 1 (100.00) | 0 (0.00) | 0 (0.00) | 1 |
| jcel | 0 (0.00) | 1 (100.00) | 0 (0.00) | 1 |
| Realisation | ||||
| Konclude | 23 (79.31) | 3 (10.34) | 3 (10.34) | 29 |
| TrOWL | 21 (72.41) | 8 (27.59) | 0 (0.00) | 29 |
| Jfact | 19 (65.52) | 9 (31.03) | 1 (3.45) | 29 |
| FaCT++ | 17 (58.62) | 10 (34.48) | 2 (6.90) | 29 |
| HermiT-OA4 | 17 (58.62) | 7 (24.14) | 5 (17.24) | 29 |
| HermiT | 16 (55.17) | 5 (17.24) | 8 (27.59) | 29 |
| Racer | 14 (48.28) | 0 ( 0.00) | 15 (51.72) | 29 |
| Pellet-OA4 | 13 (44.83) | 9 (31.03) | 7 (24.14) | 29 |
| Chainsaw | 9 (31.03) | 17 (58.62) | 3 (10.34) | 29 |