Literature DB >> 29854100

Quality Assurance of NCI Thesaurus by Mining Structural-Lexical Patterns.

Rashmie Abeysinghe1, Michael A Brooks2, Jeffery Talbert3, Cui Licong1.   

Abstract

Quality assurance of biomedical terminologies such as the National Cancer Institute (NCI) Thesaurus is an essential part of the terminology management lifecycle. We investigate a structural-lexical approach based on non-lattice subgraphs to automatically identify missing hierarchical relations and missing concepts in the NCI Thesaurus. We mine six structural-lexical patterns exhibiting in non-lattice subgraphs: containment, union, intersection, union-intersection, inference-contradiction, and inference union. Each pattern indicates a potential specific type of error and suggests a potential type of remediation. We found 809 non-lattice subgraphs with these patterns in the NCI Thesaurus (version 16.12d). Domain experts evaluated a random sample of 50 small non-lattice subgraphs, of which 33 were confirmed to contain errors and make correct suggestions (33/50 = 66%). Of the 25 evaluated subgraphs revealing multiple patterns, 22 were verified correct (22/25 = 88%). This shows the effectiveness of our structurallexical-pattern-based approach in detecting errors and suggesting remediations in the NCI Thesaurus.

Entities:  

Mesh:

Year:  2018        PMID: 29854100      PMCID: PMC5977579     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  17 in total

1.  The NCI Thesaurus quality assurance life cycle.

Authors:  Sherri de Coronado; Lawrence W Wright; Gilberto Fragoso; Margaret W Haber; Elizabeth A Hahn-Dantona; Francis W Hartel; Sharon L Quan; Tracy Safran; Nicole Thomas; Lori Whiteman
Journal:  J Biomed Inform       Date:  2009-06       Impact factor: 6.317

Review 2.  A review of auditing methods applied to the content of controlled biomedical terminologies.

Authors:  Xinxin Zhu; Jung-Wei Fan; David M Baorto; Chunhua Weng; James J Cimino
Journal:  J Biomed Inform       Date:  2009-03-12       Impact factor: 6.317

3.  Auditing the NCI thesaurus with semantic web technologies.

Authors:  Fleur Mougin; Olivier Bodenreider
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

Review 4.  Literature review of SNOMED CT use.

Authors:  Dennis Lee; Nicolette de Keizer; Francis Lau; Ronald Cornet
Journal:  J Am Med Inform Assoc       Date:  2013-07-04       Impact factor: 4.497

5.  A terminological and ontological analysis of the NCI Thesaurus.

Authors:  W Ceusters; B Smith; L Goldberg
Journal:  Methods Inf Med       Date:  2005       Impact factor: 2.176

6.  Scalable quality assurance for large SNOMED CT hierarchies using subject-based subtaxonomies.

Authors:  Christopher Ochs; James Geller; Yehoshua Perl; Yan Chen; Junchuan Xu; Hua Min; James T Case; Zhi Wei
Journal:  J Am Med Inform Assoc       Date:  2014-10-21       Impact factor: 4.497

7.  Large-scale, Exhaustive Lattice-based Structural Auditing of SNOMED CT.

Authors:  Guo-Qiang Zhang; Olivier Bodenreider
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

8.  Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.

Authors:  Licong Cui; Wei Zhu; Shiqiang Tao; James T Case; Olivier Bodenreider; Guo-Qiang Zhang
Journal:  J Am Med Inform Assoc       Date:  2017-07-01       Impact factor: 4.497

9.  Ontology quality assurance through analysis of term transformations.

Authors:  Karin Verspoor; Daniel Dvorkin; K Bretonnel Cohen; Lawrence Hunter
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

10.  Preliminary Analysis of Difficulty of Importing Pattern-Based Concepts into the National Cancer Institute Thesaurus.

Authors:  Zhe He; James Geller
Journal:  Stud Health Technol Inform       Date:  2016
View more
  11 in total

1.  Identifying Similar Non-Lattice Subgraphs in Gene Ontology based on Structural Isomorphism and Semantic Similarity of Concept Labels.

Authors:  Rashmie Abeysinghe; Xufeng Qu; Licong Cui
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  A lexical-based approach for exhaustive detection of missing hierarchical IS-A relations in SNOMED CT.

Authors:  Fengbo Zheng; Jay Shi; Licong Cui
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

3.  A Comparison of Exhaustive and Non-lattice-based Methods for Auditing Hierarchical Relations in Gene Ontology.

Authors:  Rashmie Abeysinghe; Fengbo Zheng; Licong Cui
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

4.  Leveraging non-lattice subgraphs for suggestion of new concepts for SNOMED CT.

Authors:  Xubing Hao; Rashmie Abeysinghe; Fengbo Zheng; Licong Cui
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2021-12

5.  An evidence-based lexical pattern approach for quality assurance of Gene Ontology relations.

Authors:  Rashmie Abeysinghe; Yuntao Yang; Mason Bartels; W Jim Zheng; Licong Cui
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

6.  An efficient, large-scale, non-lattice-detection algorithm for exhaustive structural auditing of biomedical ontologies.

Authors:  Guo-Qiang Zhang; Guangming Xing; Licong Cui
Journal:  J Biomed Inform       Date:  2018-03-13       Impact factor: 6.317

7.  Leveraging Non-lattice Subgraphs to Audit Hierarchical Relations in NCI Thesaurus.

Authors:  Rashmie Abeysinghe; Michael A Brooks; Licong Cui
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

8.  Detecting missing IS-A relations in the NCI Thesaurus using an enhanced hybrid approach.

Authors:  Fengbo Zheng; Rashmie Abeysinghe; Nicholas Sioutos; Lori Whiteman; Lyubov Remennik; Licong Cui
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-15       Impact factor: 2.796

9.  Enhancing the Quality of Hierarchic Relations in the National Cancer Institute Thesaurus to Enable Faceted Query of Cancer Registry Data.

Authors:  Licong Cui; Rashmie Abeysinghe; Fengbo Zheng; Shiqiang Tao; Ningzhou Zeng; Isaac Hands; Eric B Durbin; Lori Whiteman; Lyubov Remennik; Nicholas Sioutos; Guo-Qiang Zhang
Journal:  JCO Clin Cancer Inform       Date:  2020-05

10.  A transformation-based method for auditing the IS-A hierarchy of biomedical terminologies in the Unified Medical Language System.

Authors:  Fengbo Zheng; Jay Shi; Yuntao Yang; W Jim Zheng; Licong Cui
Journal:  J Am Med Inform Assoc       Date:  2020-10-01       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.