| Literature DB >> 27747590 |
Dominic Girardi1, Josef Küng2, Raimund Kleiser3, Michael Sonnberger3, Doris Csillag3, Johannes Trenkler3, Andreas Holzinger4.
Abstract
Established process models for knowledge discovery find the domain-expert in a customer-like and supervising role. In the field of biomedical research, it is necessary to move the domain-experts into the center of this process with far-reaching consequences for both their research output and the process itself. In this paper, we revise the established process models for knowledge discovery and propose a new process model for domain-expert-driven interactive knowledge discovery. Furthermore, we present a research infrastructure which is adapted to this new process model and demonstrate how the domain-expert can be deeply integrated even into the highly complex data-mining process and data-exploration tasks. We evaluated this approach in the medical domain for the case of cerebral aneurysms research.Entities:
Keywords: Doctor-in-the-loop; Expert-in-the-loop; Interactive machine learning; Knowledge discovery; Medical research; Process model
Year: 2016 PMID: 27747590 PMCID: PMC4999567 DOI: 10.1007/s40708-016-0038-2
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Fig. 2An ontology-guided nonlinear mapping of 1032 cerebral aneurysms with a distance calculation based on the following features: Aneurysm.Presentation, Aneurysm.Width, Aneurysm.Location, Patient.Number of Aneurysms, and Patient.Age. a The aneurysms are colored according to their presentation: green is incidental, blue is coincidental, and red is after a subarachnoid bleeding. b The aneurysms are colored according to their rupture state red are ruptured, white are non-ruptured. (Color figure online)
Fig. 1A new process model for domain-expert-centered knowledge discovery in biomedical research
The area under the ROC curve for the best configuration of each algorithm for the features: Aneurysm.Presentation, Aneurysm.Location, Aneurysm.Width, Patient.Number of Aneurysms, and Patient.Age
| NB | RF | MLP | LR | SVM |
|---|---|---|---|---|
| 0.987 | 0.992 | 0.988 | 0.984 | 0.994 |
NB Naive Bayes, RF Random Forest, MLPMulti-Layer Perceptron, LR Logistic Regression, SVM Support Vector Machine
The areas under the ROC curves for the best configuration of each algorithm for the features: Aneurysm.Location, Aneurysm.Width, Patient.Number of Aneurysms, and Patient.Age
| NB | RF | MLP | LR | SVM |
|---|---|---|---|---|
| 0.779 | 0.814 | 0.776 | 0.793 | 0.809 |
NB Naive Bayes, RF Random Forest, MLP Multi-Layer Perceptron, LR Logistic Regression, SVM - Support Vector Machine
The areas under the ROC curves for the best configuration of each algorithm: Aneurysm.Location, Aneurysm.Width, Patient.Number of Aneurysms, Patient.Age, and Patient.Sex
| NB | RF | MLP | LR | SVM |
|---|---|---|---|---|
| 0.789 | 0.820 | 0.773 | 0.800 | 0.812 |
NB Naive Bayes, RF Random Forest, MLP Multi-Layer Perceptron, LR Logistic Regression, SVM Support Vector Machine
Fig. 3An ontology-guided nonlinear mapping of 1032 cerebral aneurysms with a distance calculation based on the following features: Aneurysm.Width, Aneurysm.Location, Patient.Number of Aneurysms, and Patient.Age. a The aneurysms that are colored according to their location. b The aneurysms that are colored according to their rupture state red are ruptured, white are non-ruptured. (Color figure online)