Literature DB >> 28244549

Relating Complexity and Error Rates of Ontology Concepts. More Complex NCIt Concepts Have More Errors.

Hua Min1, Ling Zheng, Yehoshua Perl, Michael Halper, Sherri De Coronado, Christopher Ochs.   

Abstract

OBJECTIVES: Ontologies are knowledge structures that lend support to many health-information systems. A study is carried out to assess the quality of ontological concepts based on a measure of their complexity. The results show a relation between complexity of concepts and error rates of concepts.
METHODS: A measure of lateral complexity defined as the number of exhibited role types is used to distinguish between more complex and simpler concepts. Using a framework called an area taxonomy, a kind of abstraction network that summarizes the structural organization of an ontology, concepts are divided into two groups along these lines. Various concepts from each group are then subjected to a two-phase QA analysis to uncover and verify errors and inconsistencies in their modeling. A hierarchy of the National Cancer Institute thesaurus (NCIt) is used as our test-bed. A hypothesis pertaining to the expected error rates of the complex and simple concepts is tested.
RESULTS: Our study was done on the NCIt's Biological Process hierarchy. Various errors, including missing roles, incorrect role targets, and incorrectly assigned roles, were discovered and verified in the two phases of our QA analysis. The overall findings confirmed our hypothesis by showing a statistically significant difference between the amounts of errors exhibited by more laterally complex concepts vis-à-vis simpler concepts.
CONCLUSIONS: QA is an essential part of any ontology's maintenance regimen. In this paper, we reported on the results of a QA study targeting two groups of ontology concepts distinguished by their level of complexity, defined in terms of the number of exhibited role types. The study was carried out on a major component of an important ontology, the NCIt. The findings suggest that more complex concepts tend to have a higher error rate than simpler concepts. These findings can be utilized to guide ongoing efforts in ontology QA.

Entities:  

Keywords:  National Cancer Institute thesaurus; Ontology quality assurance; abstraction network; ontology complexity; ontology modeling

Mesh:

Year:  2017        PMID: 28244549     DOI: 10.3414/ME16-01-0085

Source DB:  PubMed          Journal:  Methods Inf Med        ISSN: 0026-1270            Impact factor:   2.176


  5 in total

Review 1.  Assessing the practice of biomedical ontology evaluation: Gaps and opportunities.

Authors:  Muhammad Amith; Zhe He; Jiang Bian; Juan Antonio Lossio-Ventura; Cui Tao
Journal:  J Biomed Inform       Date:  2018-02-17       Impact factor: 6.317

2.  Training a Convolutional Neural Network with Terminology Summarization Data Improves SNOMED CT Enrichment.

Authors:  Ling Zheng; Hao Liu; Yehoshua Perl; James Geller
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

3.  Missing lateral relationships in top-level concepts of an ontology.

Authors:  Ling Zheng; Yan Chen; Hua Min; P Lloyd Hildebrand; Hao Liu; Michael Halper; James Geller; Sherri de Coronado; Yehoshua Perl
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-15       Impact factor: 2.796

4.  Outlier concepts auditing methodology for a large family of biomedical ontologies.

Authors:  Ling Zheng; Hua Min; Yan Chen; Vipina Keloth; James Geller; Yehoshua Perl; George Hripcsak
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-15       Impact factor: 2.796

5.  Taxonomy-Based Approaches to Quality Assurance of Ontologies.

Authors:  Michael Halper; Yehoshua Perl; Christopher Ochs; Ling Zheng
Journal:  J Healthc Eng       Date:  2017-10-11       Impact factor: 2.682

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.