Literature DB >> 29854265

Spark-MCA: Large-scale, Exhaustive Formal Concept Analysis for Evaluating the Semantic Completeness of SNOMED CT.

Zhu Wei1,2, Cui Licong3, Zhang Guo-Qiang1.   

Abstract

The completeness of a medical terminology system consists of two parts: complete content coverage and complete semantics. In this paper, we focus on semantic completeness and present a scalable approach, called Spark-MCA, for evaluating the semantic completeness of SNOMED CT. We formulate the SNOMED CT contents into an FCA-based formal context, in which SNOMED CT concepts are used for extents, while their attributes are used as intents. We applied Spark-MCA to the 201403 US edition of SNOMED CT to exhaustively compute all the formal concepts and sub concept relationships in about 2 hours with 96 processors using an Amazon Web Service cluster. We found a total of 799,868 formal concepts, within which 500,583 are not contained in the 201403 release. We compared these concepts with the cumulative addition of 22,687 concepts from the 5 "delta" files from the 201403 release to the 201609 release. 3,231 matches were found between those suggested by FCA and those from cumulative concept addition by the SNOMED CT Editorial Panel. This result provides encouraging evidence that our approach could be useful for enhancing the semantic completeness of SNOMED CT.

Mesh:

Year:  2018        PMID: 29854265      PMCID: PMC5977568     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  9 in total

1.  SNOMED-CT: The advanced terminology and coding system for eHealth.

Authors:  Kevin Donnelly
Journal:  Stud Health Technol Inform       Date:  2006

2.  Biomedical ontologies in action: role in knowledge management, data integration and decision support.

Authors:  O Bodenreider
Journal:  Yearb Med Inform       Date:  2008

3.  Auditing the semantic completeness of SNOMED CT using formal concept analysis.

Authors:  Guoqian Jiang; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2008-10-24       Impact factor: 4.497

Review 4.  Literature review of SNOMED CT use.

Authors:  Dennis Lee; Nicolette de Keizer; Francis Lau; Ronald Cornet
Journal:  J Am Med Inform Assoc       Date:  2013-07-04       Impact factor: 4.497

5.  Large-scale, Exhaustive Lattice-based Structural Auditing of SNOMED CT.

Authors:  Guo-Qiang Zhang; Olivier Bodenreider
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

6.  MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT.

Authors:  Guo-Qiang Zhang; Wei Zhu; Mengmeng Sun; Shiqiang Tao; Olivier Bodenreider; Licong Cui
Journal:  Proc IEEE Int Conf Big Data       Date:  2014-10

7.  Using SPARQL to Test for Lattices: application to quality assurance in biomedical ontologies.

Authors:  Guo-Qiang Zhang; Olivier Bodenreider
Journal:  Semant Web ISWC       Date:  2010

8.  FEDRR: fast, exhaustive detection of redundant hierarchical relations for quality improvement of large biomedical ontologies.

Authors:  Guangming Xing; Guo-Qiang Zhang; Licong Cui
Journal:  BioData Min       Date:  2016-10-10       Impact factor: 2.522

9.  Mining non-lattice subgraphs for detecting missing hierarchical relations and concepts in SNOMED CT.

Authors:  Licong Cui; Wei Zhu; Shiqiang Tao; James T Case; Olivier Bodenreider; Guo-Qiang Zhang
Journal:  J Am Med Inform Assoc       Date:  2017-07-01       Impact factor: 4.497

  9 in total
  2 in total

Review 1.  Assessing the practice of biomedical ontology evaluation: Gaps and opportunities.

Authors:  Muhammad Amith; Zhe He; Jiang Bian; Juan Antonio Lossio-Ventura; Cui Tao
Journal:  J Biomed Inform       Date:  2018-02-17       Impact factor: 6.317

2.  Extending import detection algorithms for concept import from two to three biomedical terminologies.

Authors:  Vipina K Keloth; James Geller; Yan Chen; Julia Xu
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-15       Impact factor: 2.796

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.