| Literature DB >> 28699547 |
Muhammad Amith1, Hsing-Yi Song1, Yaoyun Zhang1, Hua Xu1, Cui Tao2.
Abstract
BACKGROUND: Knowledge engineering for ontological knowledgebases is resource and time intensive. To alleviate these issues, especially for novices, automated tools from the natural language domain can assist in the development process of ontologies. We focus towards the development of ontologies for the public health domain and use patient-centric sources from MedlinePlus related to HPV-causing cancers.Entities:
Keywords: Natural language processing; Ontology learning; Open information extraction; Public health; Semi-automated ontology development
Mesh:
Year: 2017 PMID: 28699547 PMCID: PMC5506564 DOI: 10.1186/s12911-017-0465-x
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Transforming patient information to ontological format
Fig. 2Pearl Information Extraction Kitchen (PIE KIT)
Dataset options for extraction
| Representation | Default | +SVOA | +SVOA_NVERB | Default (n-ary) | +SVOA (n-ary) | +SVOA_NVERB (n-ary) |
|---|---|---|---|---|---|---|
| n-ary | No | No | No | Yes | Yes | Yes |
| Clause detection | ||||||
| Process modifiers | Yes | Yes | Yes | Yes | Yes | Yes |
| Conservative SVOA | No | Yes | Yes | No | Yes | Yes |
| Conservative SVA | Yes | Yes | Yes | Yes | Yes | Yes |
| Conjugate verbs | Yes | Yes | Yes | Yes | Yes | Yes |
| Conjugate non-verbs | No | No | Yes | No | No | Yes |
Fig. 3Screenshot of the sentence selection
Contextual accuracy results for outputted triples
| Correct | Incorrect | Accuracy (%) | |
|---|---|---|---|
| Default | 243 | 60 | 80.2 |
| Default+SVOA | 223 | 39 | 85.1 |
| Default+SVOA+NVERB_CC | 288 | 57 | 83.5 |
Contextual accuracy results for outputted n-ary tuples
| Correct | Incorrect | Accuracy (%) | |
|---|---|---|---|
| Default | 170 | 21 | 89.0 |
| Default+SVOA | 170 | 21 | 89.0 |
| Default+SVOA+NVERB_CC | 231 | 28 | 89.2 |