| Literature DB >> 31196129 |
Ying Shen1, Yaliang Li2, Hai-Tao Zheng3, Buzhou Tang4, Min Yang5.
Abstract
BACKGROUND: Ontology has attracted substantial attention from both academia and industry. Handling uncertainty reasoning is important in researching ontology. For example, when a patient is suffering from cirrhosis, the appearance of abdominal vein varices is four times more likely than the presence of bitter taste. Such medical knowledge is crucial for decision-making in various medical applications but is missing from existing medical ontologies. In this paper, we aim to discover medical knowledge probabilities from electronic medical record (EMR) texts to enrich ontologies. First, we build an ontology by identifying meaningful entity mentions from EMRs. Then, we propose a symptom-dependency-aware naïve Bayes classifier (SDNB) that is based on the assumption that there is a level of dependency among symptoms. To ensure the accuracy of the diagnostic classification, we incorporate the probability of a disease into the ontology via innovative approaches.Entities:
Keywords: Ontology; Probability; Uncertainty reasoning; naïve Bayes classifier
Mesh:
Year: 2019 PMID: 31196129 PMCID: PMC6567606 DOI: 10.1186/s12859-019-2924-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Examples of the diseases and their syndromes and conditional probabilities
| Disease | Syndrome and Conditional Probability |
|---|---|
| Acute pyelonephritis | (fever, 0.2), (shaking, 0.1), (frequent urination, 0.1), (urinary incontinence, 0.1), (odynuria, 0.1), (stomachache, 0.1), (urine turbidity and urinary smell, 0.1), (nausea, 0.05), (vomiting, 0.05), (headache, 0.05), and (sore all over, 0.05) |
| Acute interstitial nephritis | (oliguria, 0.6), (fever, 0.1), (rash, 0.1), and (joint pain, 0.1) |
| Chronic interstitial nephritis | (night time urination, 0.1), (foam in urine, 0.5), (blaze, 0.2), and (white nails, 0.2) |
Fig. 1Ontology class: Gastric ulcer
Fig. 2ROC chart and AUC for classifier evaluations
Experimental results in four scenarios: (a) without the naïve Bayes classifier; (b) with the original naïve Bayes classifier; (c) with an improved naïve Bayes classifier that is based on the co-occurrence frequency; and (d) with the symptom-dependency-aware weighted naïve Bayes classifier
| SDNB ontology | SDNB ontology + NB | SDNB ontology + improved NB | SDNB ontology + SDNB classifier | |
|---|---|---|---|---|
| Area under the ROC curve | 0.7574 | 0.8392 | 0.8753 | 0.8876 |
| Std. of the error | 0.03865 | 0.03063 | 0.01628 | 0.01264 |
| 95% confidence interval | 0.6817 to 0.8331 | 0.7792 to 0.8993 | 0.8434 to 0.9073 | 0.8437 to 0.9281 |
| < 0.0001 | < 0.0001 | < 0.0001 | < 0.0001 |
Diagnostic reasoning results in four scenarios: (a) without any naïve Bayes classifier; (b) with the original naïve Bayes classifier; (c) with the improved naïve Bayes classifier that is based on the co-occurrence frequency; and (d) with the symptom-dependency-aware weighted naïve Bayes classifier
| Disease | Case | Symptom set | SDNB ontology | SDNB ontology + NB | SDNB ontology + improved NB | SDNB ontology + SDNB classifier |
|---|---|---|---|---|---|---|
| Jaundice | Case 1 | {Nausea, Vomiting, Yellow sclera, Weary, Pale stools, Dark urine, Itchiness, Fatigue, Abdominal pain, Weight loss, Vomiting, Fever, Pale stools, Dark urine} | 0.67 | 0.71 | 0.83 | 0.862 |
| Pancreatic Cancer | Case 2 | {Yellow sclera, Jaundice, Abdominal pain, Back pain, Bloating, Nausea, Vomiting} | 0.54 | 0.61 | 0.64 | 0.646 |
| Liver disease | Case 3 | {Dizziness, Body skin yellow dyeing, Abdominal pain and swelling, Itchy skin} | 0.42 | 0.48 | 0.55 | 0.567 |
Fig. 3Diagnosis of cirrhosis based on the generated SDNB ontology and the proposed SDNB classifier
Fig. 4Distribution of the number of diseases diagnosed by doctors in all involved medical record data
Example of Chinese EMR data that has been translated into English
| Item | Content |
|---|---|
| GENDER | Male |
| AGE | 48 |
| ILLNESS_DESC | The patient complained of abdominal discomfort after meals, especially high-fat meals. He also had aching in his right shoulder and back. |
| BODY_EXAM | An ultrasound of the upper abdomen revealed cholelithiasis. |
| DIAG_DESC | Cholecystitis |
Fig. 5Subgraph of the generated ontology
Fig. 6Flow diagram of disease probability calculation using the improved naïve Bayes classifier based on attribute relevance