| Literature DB >> 28203277 |
Hossein Mohammadhassanzadeh1, William Van Woensel1, Samina Raza Abidi2, Syed Sibte Raza Abidi1.
Abstract
BACKGROUND: Capturing complete medical knowledge is challenging-often due to incomplete patient Electronic Health Records (EHR), but also because of valuable, tacit medical knowledge hidden away in physicians' experiences. To extend the coverage of incomplete medical knowledge-based systems beyond their deductive closure, and thus enhance their decision-support capabilities, we argue that innovative, multi-strategy reasoning approaches should be applied. In particular, plausible reasoning mechanisms apply patterns from human thought processes, such as generalization, similarity and interpolation, based on attributional, hierarchical, and relational knowledge. Plausible reasoning mechanisms include inductive reasoning, which generalizes the commonalities among the data to induce new rules, and analogical reasoning, which is guided by data similarities to infer new facts. By further leveraging rich, biomedical Semantic Web ontologies to represent medical knowledge, both known and tentative, we increase the accuracy and expressivity of plausible reasoning, and cope with issues such as data heterogeneity, inconsistency and interoperability. In this paper, we present a Semantic Web-based, multi-strategy reasoning approach, which integrates deductive and plausible reasoning and exploits Semantic Web technology to solve complex clinical decision support queries.Entities:
Keywords: Analogical reasoning; Inductive generalization; Medical knowledge bases; Plausible reasoning; Semantic Web reasoning
Year: 2017 PMID: 28203277 PMCID: PMC5303296 DOI: 10.1186/s13040-017-0123-y
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Plausible patterns
Fig. 1Semantic medical knowledge bases and plausible reasoning – mentioning the three core building blocks of the Semantic Web, namely RDF, OWL, and SPARQL
Fig. 2SeMS-KBS process diagram
Fig. 3Initial justification tree of the query live(p116,true)
Fig. 4Part of justification tree extended by analogical reasoning
Fig. 5Concept hierarchy of the hepatitis data with corresponding conceptual similarity values-presenting the similarity of clinicial medicines to diagnosis and demographic infomation
Fig. 6Part of justification tree extended by inductive generalization
Fig. 9Justification tree generated by induction, and a view of ontology representing the location of Antivirals in the concept hierarchy and the value of conceptual similarity (0.8) between Antivirals and ascites
Fig. 7SeMS-KBS User Interface: Main query interface
Fig. 8Justification tree generated by analogical reasoning
Summary of reference data - attribute information and distribution
| Attribute | Values/Distribution | Description | |
|---|---|---|---|
| 1 | Age | [7 … 78]a | Age of the patient |
| 2 | Sex | male(139)b, female(16) | Gender of the patient |
| 3 | Steroids | posc (78), neg (76) | A class of medication used to provide relief for inflamed areas of the body |
| 4 | Antivirals | pos (131), neg (24) | A class of medication used specifically for treating viral infections |
| 5 | Fatigue | pos (54), neg (100) | Extreme tiredness, typically resulting from mental or physical illness |
| 6 | Malaise | pos (93), neg (61) | A general feeling of discomfort, illness, or uneasiness |
| 7 | Anorexia | pos (122), neg (32) | Eating disorder causing people to obsess about weight and what they eat |
| 8 | Big Liver | pos (120), neg (25) | Liver is swollen beyond its normal size |
| 9 | Firm Liver | pos (84), neg (60) | Liver tissue is harder than normal |
| 10 | Spleen Palpable | pos (120), neg (30) | Spleen becomes touchable or bigger than normal size |
| 11 | Spiders | pos (99), neg (51) | Small angiomata which appear on the surface of the skin |
| 12 | Ascites | pos (130), neg (20) | Accumulation of fluid in the peritoneal cavity, causing abdominal swelling |
| 13 | Varices | pos (132), neg (18) | Abnormal veins in lower part of the tube running from throat to stomach |
| 14 | Bilirubin | [0.3 … 8.0] | A yellowish pigment found in bile, a fluid made by the liver |
| 15 | Alkaline Phosphate | [26 … 295] | A protein found in all body tissues, including the liver, bile ducts, and bone |
| 16 | SGOT | [14 … 648] | One of the enzymes that helps liver build and break down proteins |
| 17 | Albumin | [2.1 … 6.4] | The main protein of human blood plasma |
| 18 | ProTime INR | [0 … 100] | The test that is used to determine the clotting tendency of blood |
| 19 | Histology | pos (70), neg (85) | The study of the microscopic anatomy of cells and tissues |
| 20 | Patient Label | die (32), live (123) | Entity label, required for evaluating machine learning systems |
arange of attributes with continuous values
bdistribution of attributes with categorical values
cpos: positive, neg: negative
The numeric ranges show the intervals for continuous values
For categorical values, the numbers of patients with/without that attribute are shown
Average number of missing values among test datasets with same percentage of missing values
| Original Dataset | Datasets w. 5% Missing Data | Datasets w.10% Missing Data | Datasets w.15% Missing Data | Datasets w.20% Missing Data | |
|---|---|---|---|---|---|
| Total # missing values (average) | 167 | 306 | 438 | 577 | 720 |
Fig. 10A partial view of the developed ontology after the data enrichment phase
Experimental results of the original dataset
| Reasoning Strategy | |||
|---|---|---|---|
| Deductive Only | Deductive + Plausible (Without Expert's Verification) | Deductive + Plausible (With Expert’s Verification) | |
| #Queries answered correctly | 144 | 149 | 147 |
| #Queries answered incorrectly | 7 | 5 | 6 |
| #Queries unanswered | 4 | 1 | 2 |
| # Plausible rules used | N/A | 3 | 2 |
| # Plausible facts used | N/A | 1 | 1 |
| Coverage | 97% | 99% | 99% |
| Correctness | 95% | 97% | 96% |
Experimental results of the datasets with different percentages of missing data
| Reasoning Strategy | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset #5–1 | Dataset #5–2 | Dataset #5–3 | Dataset #5–4 | Dataset #5–5 | ||||||||||||
| 5% Missing Data | #Queries answered correctly | 141 | 150 | 144 | 136 | 148 | 138 | 139 | 150 | 144 | 140 | 150 | 145 | 142 | 149 | 144 |
| #Queries answered incorrectly | 7 | 4 | 7 | 8 | 6 | 8 | 8 | 4 | 9 | 8 | 4 | 8 | 7 | 3 | 7 | |
| #Queries unanswered | 7 | 1 | 4 | 11 | 1 | 9 | 8 | 1 | 2 | 7 | 1 | 2 | 6 | 3 | 4 | |
| # Plausible Rules | N/A | 3 | 2 | N/A | 5 | 1 | N/A | 6 | 3 | N/A | 2 | 2 | N/A | 3 | 1 | |
| # Plausible Facts | N/A | 4 | 1 | N/A | 1 | 4 | N/A | 3 | 3 | N/A | 2 | 2 | N/A | 1 | 2 | |
| Coverage | 95% | 99% | 97% | 93% | 99% | 94% | 95% | 99% | 99% | 95% | 99% | 99% | 96% | 98% | 97% | |
| Correctness | 95% | 97% | 95% | 94% | 96% | 95% | 95% | 97% | 94% | 95% | 97% | 95% | 95% | 98% | 95% | |
| Dataset #10–1 | Dataset #10–2 | Dataset #10–3 | Dataset #10–4 | Dataset #10–5 | ||||||||||||
| 10% Missing Data | #Queries answered correctly | 128 | 155 | 144 | 135 | 150 | 142 | 133 | 151 | 144 | 131 | 152 | 141 | 137 | 151 | 143 |
| #Queries answered incorrectly | 8 | 0 | 8 | 6 | 3 | 8 | 7 | 3 | 7 | 10 | 3 | 8 | 6 | 3 | 6 | |
| #Queries unanswered | 19 | 0 | 3 | 14 | 2 | 5 | 15 | 1 | 4 | 14 | 0 | 6 | 12 | 1 | 6 | |
| # Plausible Rules | N/A | 4 | 3 | N/A | 4 | 1 | N/A | 7 | 3 | N/A | 4 | 2 | N/A | 6 | 2 | |
| # Plausible Facts | N/A | 3 | 4 | N/A | 3 | 2 | N/A | 1 | 2 | N/A | 1 | 8 | N/A | 3 | 3 | |
| Coverage | 88% | 100% | 98% | 91% | 99% | 97% | 90% | 99% | 97% | 91% | 100% | 96% | 92% | 99% | 96% | |
| Correctness | 94% | 100% | 95% | 96% | 98% | 95% | 95% | 98% | 95% | 93% | 98% | 95% | 96% | 98% | 96% | |
| Dataset #15–1 | Dataset #15–2 | Dataset #15–3 | Dataset #15–4 | Dataset #15–5 | ||||||||||||
| 15% Missing Data | #Queries answered correctly | 119 | 150 | 146 | 128 | 151 | 140 | 124 | 147 | 139 | 113 | 147 | 141 | 128 | 150 | 142 |
| #Queries answered incorrectly | 9 | 4 | 7 | 9 | 4 | 10 | 9 | 5 | 9 | 8 | 8 | 9 | 9 | 4 | 8 | |
| #Queries unanswered | 27 | 1 | 2 | 18 | 0 | 5 | 22 | 3 | 7 | 34 | 0 | 5 | 18 | 1 | 5 | |
| # Plausible Rules | N/A | 6 | 5 | N/A | 5 | 2 | N/A | 5 | 4 | N/A | 8 | 4 | N/A | 6 | 3 | |
| # Plausible Facts | N/A | 1 | 1 | N/A | 3 | 2 | N/A | 0 | 0 | N/A | 3 | 3 | N/A | 1 | 3 | |
| Coverage | 83% | 99% | 99% | 88% | 100% | 97% | 86% | 98% | 95% | 78% | 100% | 97% | 88% | 99% | 97% | |
| Correctness | 93% | 97% | 95% | 93% | 97% | 93% | 93% | 97% | 94% | 93% | 95% | 94% | 93% | 97% | 95% | |
| Dataset #20–1 | Dataset #20–2 | Dataset #20–3 | Dataset #20–4 | Dataset #20–5 | ||||||||||||
| 20% Missing Data | #Queries answered correctly | 111 | 143 | 135 | 115 | 152 | 138 | 113 | 150 | 140 | 111 | 148 | 135 | 113 | 148 | 129 |
| #Queries answered incorrectly | 9 | 10 | 10 | 9 | 3 | 10 | 12 | 5 | 10 | 10 | 7 | 13 | 11 | 7 | 13 | |
| #Queries unanswered | 35 | 2 | 10 | 31 | 0 | 7 | 30 | 0 | 5 | 34 | 0 | 7 | 31 | 0 | 13 | |
| # Plausible Rules | N/A | 6 | 5 | N/A | 8 | 3 | N/A | 7 | 3 | N/A | 6 | 3 | N/A | 8 | 4 | |
| # Plausible Facts | N/A | 4 | 5 | N/A | 1 | 1 | N/A | 4 | 4 | N/A | 3 | 4 | N/A | 4 | 4 | |
| Coverage | 77% | 99% | 94% | 80% | 100% | 95% | 81% | 100% | 97% | 78% | 100% | 95% | 80% | 100% | 92% | |
| Correctness | 93% | 93% | 93% | 93% | 98% | 93% | 90% | 97% | 93% | 92% | 95% | 91% | 91% | 95% | 91% | |
Summary of the experimental results of original dataset and datasets with different percentages of missing data-values are average of the corresponding values in Table 5
| Reasoning Strategy | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) | Deductive Only | Deductive + Plausible (Without Expert’s Verification) | Deductive + Plausible (With Expert’s Verification) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Original Dataset | 5% Missing | 10% Missing | 15% Missing | 20% Missing | |||||||||||
| #Queries answered correctly (Avg.) | 144 | 149 | 147 | 140 | 149 | 143 | 133 | 152 | 143 | 122 | 149 | 142 | 113 | 148 | 135 |
| #Queries answered incorrectly (Avg.) | 7 | 5 | 6 | 8 | 4 | 8 | 7 | 2 | 7 | 9 | 5 | 9 | 10 | 6 | 11 |
| #Queries unanswered (Avg.) | 4 | 1 | 2 | 8 | 1 | 4 | 15 | 1 | 5 | 24 | 1 | 5 | 32 | 0 | 8 |
| # Plausible Rules (Avg.) | N/A | 3 | 2 | N/A | 4 | 2 | N/A | 5 | 2 | N/A | 6 | 4 | N/A | 7 | 4 |
| # Plausible Facts (Avg.) | N/A | 1 | 1 | N/A | 2 | 2 | N/A | 2 | 4 | N/A | 2 | 2 | N/A | 3 | 4 |
| Coverage (Avg.) | 97% | 99% | 99% | 95% | 99% | 97% | 90% | 99% | 97% | 85% | 99% | 97% | 79% | 100% | 95% |
| Correctness (Avg.) | 95% | 97% | 96% | 95% | 97% | 95% | 95% | 98% | 95% | 93% | 97% | 94% | 92% | 96% | 92% |
Verified knowledge extension, inferred via induction and analogical reasoning through Experiment #10–1 and #20–1
Summary of the verified inductively inferred rules, aggregated based on the experiments with same percentage of missing values
| Dataset | Verified Inductive Rule | Inferred in more than one Experiment |
|---|---|---|
| Original Dataset | noPhysicalSign(P,bigLiver) :- onMed(P, (type Antiviral)). | * |
| physicalSign(P,firmLiver) :- symptom(P, malaise). | * | |
| 5% Missing | physicalSign(P,ascites) :- physicalSign(P,spleenPalpable). | - |
| physicalSign(P,firmLiver) :- onMed(P, (type Antiviral)). | - | |
| physicalSign(P,firmLiver) :- symptom(P, malaise). | * | |
| noPhysicalSign(P,spiders) :- noPhysicalSign(P,firmLiver), noPathology(P,histology). | - | |
| 10% Missing | noPhysicalSign(P,ascites) :- onMed(P, (type Antiviral)). | * |
| physicalSign(P,ascites) :- onMed(P, (type Antiviral)), physicalSign(P,spleenPalpable). | * | |
| noPhysicalSign(P,bigLiver) :- onMed(P, (type Antiviral)). | * | |
| noPhysicalSign(P,firmLiver) :- onMed(P, (type Antiviral)), PhysicalSign(P,ascites). | - | |
| noPhysicalSign(P,firmLiver) :- onMed(P, (type Antiviral)). | - | |
| physicalSign(P,firmLiver) :- onMed(P, (type Antiviral)). | * | |
| physicalSign(P,spiders) :- physicalSign(P,ascites). | * | |
| physicalSign(P,spiders) :- physicalSign(P,bigLiver). | * | |
| physicalSign(P,spiders) :- physicalSign(P,spleenPalpable). | * | |
| 15% Missing | noPhysicalSign(P,ascites) :- onMed(P, (type Antiviral)). | * |
| physicalSign(P,ascites) :- physicalSign(P,firmLiver). | - | |
| physicalSign(P,ascites) :- onMed(P, (type Steroid)). | * | |
| physicalSign(P,ascites) :- physicalSign(P,varices). | - | |
| noPhysicalSign(P,bigLiver) :- onMed(P, (type Antiviral)). | * | |
| physicalSign(P,firmLiver) :- onMed(P, (type Antiviral)). | * | |
| physicalSign(P,firmLiver) :- physicalSign(P,varices). | * | |
| physicalSign(P,spiders) :- physicalSign(P,ascites). | * | |
| physicalSign(P,spiders) :- physicalSign(P,bigLiver). | * | |
| physicalSign(P,spiders) :- onMed(P, (type Antiviral)). | - | |
| 20%Missing | noPhysicalSign(P,ascites) :- onMed(P, (type Antiviral)). | * |
| physicalSign(P,ascites) :- onMed(P, (type Antiviral)), physicalSign(P,spleenPalpable). | * | |
| physicalSign(P,ascites) :- physicalSign(P,firmLiver), physicalSign(P,varices). | - | |
| physicalSign(P,ascites) :- physicalSign(P,spiders) | - | |
| physicalSign(P,ascites) :- onMed(P, (type Steroid)). | * | |
| physicalSign(P,ascites) :- physicalSign(P,varices), physicalSign(P,spleenPalpable). | - | |
| physicalSign(P,firmLiver) :- onMed(P, (type Antiviral)). | * | |
| physicalSign(P,firmLiver) :- physicalSign(P,bigLiver), physicalSign(P,spleenPalpable). | - | |
| physicalSign(P,firmLiver) :- physicalSign(P,spiders). | - | |
| physicalSign(P,firmLiver) :- physicalSign(P,spleenPalpable). | - | |
| physicalSign(P,firmLiver) :- physicalSign(P,varices). | * | |
| physicalSign(P,spiders) :- physicalSign(P,bigLiver). | * | |
| physicalSign(P,spiders) :- physicalSign(P,ascites),physicalSign(P,varices). | - | |
| physicalSign(P,spiders) :- physicalSign(P,ascites). | * | |
| physicalSign(P,spiders) :- physicalSign(P,spleenPalpable). | * |
Symbol (*) means yes, and (−) means no
Speed performance of the SeMS-KBS
| Reasoning Method | Minimum | Maximum | Average | STDEV |
|---|---|---|---|---|
| Inductive Generalization (seconds) | 5.5 s | 14.5 s | 10.3 s | 3.8 s |
| Analogical Reasoning (milliseconds) | 6 ms | 164 ms | 41 ms | 40 ms |