| Literature DB >> 27540498 |
Chaveevan Pechsiri1, Rapepun Piriyakul2.
Abstract
This paper aims to extract a group-pair relation as a Problem-Solving relation, for example a DiseaseSymptom-Treatment relation and a CarProblem-Repair relation, between two event-explanation groups, a problem-concept group as a symptom/CarProblem-concept group and a solving-concept group as a treatment-concept/repair concept group from hospital-web-board and car-repair-guru-web-board documents. The Problem-Solving relation (particularly Symptom-Treatment relation) including the graphical representation benefits non-professional persons by supporting knowledge of primarily solving problems. The research contains three problems: how to identify an EDU (an Elementary Discourse Unit, which is a simple sentence) with the event concept of either a problem or a solution; how to determine a problem-concept EDU boundary and a solving-concept EDU boundary as two event-explanation groups, and how to determine the Problem-Solving relation between these two event-explanation groups. Therefore, we apply word co-occurrence to identify a problem-concept EDU and a solving-concept EDU, and machine-learning techniques to solve a problem-concept EDU boundary and a solving-concept EDU boundary. We propose using k-mean and Naïve Bayes to determine the Problem-Solving relation between the two event-explanation groups involved with clustering features. In contrast to previous works, the proposed approach enables group-pair relation extraction with high accuracy.Entities:
Keywords: Elementary discourse unit; Problem-solving relation; Semantic relation; Word co-occurrence
Year: 2016 PMID: 27540498 PMCID: PMC4975736 DOI: 10.1186/s40064-016-2864-3
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1An example of a web-board document showing the DiseaseSymptom–Treatment relation expression (where the […] symbol means ellipsis)
Fig. 2The problem-solving-map representation of the DiseaseSymptom–treatment relation
Fig. 3System overview where the input is text or downloaded documents and the output is the problem-solving relation i.e. a DiseaseSymptom–treatment relation and a CarProblem–repair relation
Fig. 4DiseaseSymptom–treatment relation annotation
Fig. 5The problem-solving relation extraction algorithm to extract the DiseaseSymptom–treatment relation
Fig. 6The PSM representation of the DiseaseSymptom–treatment
The accuracy of word-co identification and the accuracy of boundary determination
| Disease categoriesand car-problem category (500 EDUs per category) | # of different problem word-CO | # of different solving word-CO | Correctness of problem word-CO Identification | Correctness of solving word-CO Identification | % Correctness of determining boundary of each EDU group; i.e. Dsym, AT/RT | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Precision | Recall | Precision | Recall | Problem-Concept-EDU boundary | Solving-Concept-EDU boundary | |||||||
| ME | LR | SVM | ME | LR | SVM | |||||||
| Childhood-disease | 74 | 39 | 0.893 | 0.762 | 0.882 | 0.857 | 80.8 | 82.1 | 81.5 | 91.7 | 90.4 | 89.7 |
| Abdominal disease | 73 | 41 | 0.875 | 0.700 | 0.913 | 0.848 | 80.0 | 81.8 | 80.9 | 87.8 | 87.5 | 87.1 |
| Heart/brain disease | 50 | 38 | 0.901 | 0.846 | 0.894 | 0.850 | 81.6 | 85.5 | 85.0 | 89.4 | 89.0 | 88.6 |
| GeneralCar-problem | 37 | 68 | 0.881 | 0.804 | 0.906 | 0.894 |
| 91.9 | 91.3 | 87.5 | 88.7 | 88.3 |
Italic value indicates the highest achieved %correctness among all experiments
The accuracy of problem-solving relation extraction
| Testing corpora (500 EDUs per corpus) | Problem-solving relation extraction | |||
|---|---|---|---|---|
| By Naïve Bayes with clustering | By Naïve Bayes without clustering | |||
| Precision | Recall | Precision | Recall | |
| Medical-healthcare corpus 197 problem features, 118 solving features | 0.875 | 0.754 | 0.840 | 0.720 |
| Car-repair corpus 37 problem features, 68 solving features | 0.822 | 0.742 | 0.852 | 0.790 |