Literature DB >> 22289420

A hybrid knowledge-based and data-driven approach to identifying semantically similar concepts.

Rimma Pivovarov1, Noémie Elhadad.   

Abstract

An open research question when leveraging ontological knowledge is when to treat different concepts separately from each other and when to aggregate them. For instance, concepts for the terms "paroxysmal cough" and "nocturnal cough" might be aggregated in a kidney disease study, but should be left separate in a pneumonia study. Determining whether two concepts are similar enough to be aggregated can help build better datasets for data mining purposes and avoid signal dilution. Quantifying the similarity among concepts is a difficult task, however, in part because such similarity is context-dependent. We propose a comprehensive method, which computes a similarity score for a concept pair by combining data-driven and ontology-driven knowledge. We demonstrate our method on concepts from SNOMED-CT and on a corpus of clinical notes of patients with chronic kidney disease. By combining information from usage patterns in clinical notes and from ontological structure, the method can prune out concepts that are simply related from those which are semantically similar. When evaluated against a list of concept pairs annotated for similarity, our method reaches an AUC (area under the curve) of 92%.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22289420      PMCID: PMC3345313          DOI: 10.1016/j.jbi.2012.01.002

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  14 in total

1.  Aggregating UMLS semantic types for reducing conceptual complexity.

Authors:  A T McCray; A Burgun; O Bodenreider
Journal:  Stud Health Technol Inform       Date:  2001

2.  Towards the development of a conceptual distance metric for the UMLS.

Authors:  Jorge E Caviedes; James J Cimino
Journal:  J Biomed Inform       Date:  2004-04       Impact factor: 6.317

3.  Measures of semantic similarity and relatedness in the biomedical domain.

Authors:  Ted Pedersen; Serguei V S Pakhomov; Siddharth Patwardhan; Christopher G Chute
Journal:  J Biomed Inform       Date:  2006-06-10       Impact factor: 6.317

Review 4.  Empirical distributional semantics: methods and biomedical applications.

Authors:  Trevor Cohen; Dominic Widdows
Journal:  J Biomed Inform       Date:  2009-02-14       Impact factor: 6.317

5.  Towards a framework for developing semantic relatedness reference standards.

Authors:  Serguei V S Pakhomov; Ted Pedersen; Bridget McInnes; Genevieve B Melton; Alexander Ruggieri; Christopher G Chute
Journal:  J Biomed Inform       Date:  2010-10-31       Impact factor: 6.317

6.  UMLS-Interface and UMLS-Similarity : open source software for measuring paths and semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Serguei V S Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

7.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

8.  The measurement of observer agreement for categorical data.

Authors:  J R Landis; G G Koch
Journal:  Biometrics       Date:  1977-03       Impact factor: 2.571

9.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1983-09       Impact factor: 11.105

10.  Ontology quality assurance through analysis of term transformations.

Authors:  Karin Verspoor; Daniel Dvorkin; K Bretonnel Cohen; Lawrence Hunter
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

View more
  16 in total

Review 1.  Medication-indication knowledge bases: a systematic review and critical appraisal.

Authors:  Hojjat Salmasian; Tran H Tran; Herbert S Chase; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2015-09-02       Impact factor: 4.497

2.  Characterizing the sublanguage of online breast cancer forums for medications, symptoms, and emotions.

Authors:  Noémie Elhadad; Shaodian Zhang; Patricia Driscoll; Samuel Brody
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

3.  Survival Analysis with Electronic Health Record Data: Experiments with Chronic Kidney Disease.

Authors:  Yolanda Hagar; David Albers; Rimma Pivovarov; Herbert Chase; Vanja Dukic; Noémie Elhadad
Journal:  Stat Anal Data Min       Date:  2014-08-19       Impact factor: 1.051

4.  U-path: An undirected path-based measure of semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Ying Liu; Genevieve B Melton; Serguei V Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

5.  A hierarchical knowledge-based approach for retrieving similar medical images described with semantic annotations.

Authors:  Camille Kurtz; Christopher F Beaulieu; Sandy Napel; Daniel L Rubin
Journal:  J Biomed Inform       Date:  2014-03-12       Impact factor: 6.317

6.  Speculation detection for Chinese clinical notes: Impacts of word segmentation and embedding models.

Authors:  Shaodian Zhang; Tian Kang; Xingting Zhang; Dong Wen; Noémie Elhadad; Jianbo Lei
Journal:  J Biomed Inform       Date:  2016-02-26       Impact factor: 6.317

7.  Lessons learned in replicating data-driven experiments in multiple medical systems and patient populations.

Authors:  Samantha Kleinberg; Noémie Elhadad
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

8.  Lessons learned from developing a drug evidence base to support pharmacovigilance.

Authors:  J C Smith; J C Denny; Q Chen; H Nian; A Spickard; S T Rosenbloom; R A Miller
Journal:  Appl Clin Inform       Date:  2013-12-18       Impact factor: 2.342

9.  Multi-Ontology Refined Embeddings (MORE): A hybrid multi-ontology and corpus-based semantic representation model for biomedical concepts.

Authors:  Steven Jiang; Weiyi Wu; Naofumi Tomita; Craig Ganoe; Saeed Hassanpour
Journal:  J Biomed Inform       Date:  2020-10-01       Impact factor: 6.317

10.  Unsupervised biomedical named entity recognition: experiments with clinical and biological texts.

Authors:  Shaodian Zhang; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2013-08-15       Impact factor: 6.317

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.