Literature DB >> 21044697

Towards a framework for developing semantic relatedness reference standards.

Serguei V S Pakhomov1, Ted Pedersen, Bridget McInnes, Genevieve B Melton, Alexander Ruggieri, Christopher G Chute.   

Abstract

Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized semantic relatedness measurement approaches is not sufficient to reproduce the results. Our proposed framework is based on the experiences of medical informatics and computational linguistics communities and addresses practical and theoretical issues with creating reference standards for semantic relatedness. We demonstrate the use of the framework on a pilot set of 101 medical term pairs rated for semantic relatedness by 13 medical coding experts. While the reliability of this particular reference standard is in the "moderate" range; we show that using clustering and factor analyses offers a data-driven approach to finding systematic differences among raters and identifying groups of potential outliers. We test two ontology-based measures of relatedness and provide both the reference standard containing individual ratings and the R program used to analyze the ratings as open-source. Currently, these resources are intended to be used to reproduce and compare results of studies involving computerized measures of semantic relatedness. Our framework may be extended to the development of reference standards in other research areas in medical informatics including automatic classification, information retrieval from medical records and vocabulary/ontology development.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2010        PMID: 21044697      PMCID: PMC3063326          DOI: 10.1016/j.jbi.2010.10.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  28 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation.

Authors:  P W Lord; R D Stevens; A Brass; C A Goble
Journal:  Bioinformatics       Date:  2003-07-01       Impact factor: 6.937

3.  Enhancing MEDLINE document clustering by incorporating MeSH semantic similarity.

Authors:  Shanfeng Zhu; Jia Zeng; Hiroshi Mamitsuka
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

4.  Comparison of ontology-based semantic-similarity measures.

Authors:  Wei-Nchih Lee; Nigam Shah; Karanjot Sundlass; Mark Musen
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

5.  Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases.

Authors:  Hui Cao; Genevieve B Melton; Marianthi Markatou; George Hripcsak
Journal:  J Biomed Inform       Date:  2008-03-22       Impact factor: 6.317

6.  Semantic Similarity and Relatedness between Clinical Terms: An Experimental Study.

Authors:  Serguei Pakhomov; Bridget McInnes; Terrence Adam; Ying Liu; Ted Pedersen; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

7.  UMLS-Interface and UMLS-Similarity : open source software for measuring paths and semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Serguei V S Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

8.  Influence of the MedDRA hierarchy on pharmacovigilance data mining results.

Authors:  Ronald K Pearson; Manfred Hauben; David I Goldsmith; A Lawrence Gould; David Madigan; Donald J O'Hara; Stephanie J Reisinger; Alan M Hochberg
Journal:  Int J Med Inform       Date:  2009-02-18       Impact factor: 4.046

9.  Predicting judged similarity of natural categories from their neural representations.

Authors:  Matthew Weber; Sharon L Thompson-Schill; Daniel Osherson; James Haxby; Lawrence Parsons
Journal:  Neuropsychologia       Date:  2008-12-31       Impact factor: 3.139

10.  EDGAR: extraction of drugs, genes and relations from the biomedical literature.

Authors:  T C Rindflesch; L Tanabe; J N Weinstein; L Hunter
Journal:  Pac Symp Biocomput       Date:  2000
View more
  19 in total

1.  A hybrid knowledge-based and data-driven approach to identifying semantically similar concepts.

Authors:  Rimma Pivovarov; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2012-01-25       Impact factor: 6.317

2.  A comparison of word embeddings for the biomedical natural language processing.

Authors:  Yanshan Wang; Sijia Liu; Naveed Afzal; Majid Rastegar-Mojarad; Liwei Wang; Feichen Shen; Paul Kingsbury; Hongfang Liu
Journal:  J Biomed Inform       Date:  2018-09-12       Impact factor: 6.317

3.  U-path: An undirected path-based measure of semantic similarity.

Authors:  Bridget T McInnes; Ted Pedersen; Ying Liu; Genevieve B Melton; Serguei V Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

4.  Evaluating semantic relatedness and similarity measures with Standardized MedDRA Queries.

Authors:  Robert W Bill; Ying Liu; Bridget T McInnes; Genevieve B Melton; Ted Pedersen; Serguei Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

5.  Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.

Authors:  Zhiguo Yu; Byron C Wallace; Todd Johnson; Trevor Cohen
Journal:  Stud Health Technol Inform       Date:  2017

6.  Unsupervised low-dimensional vector representations for words, phrases and text that are transparent, scalable, and produce similarity metrics that are not redundant with neural embeddings.

Authors:  Neil R Smalheiser; Aaron M Cohen; Gary Bonifield
Journal:  J Biomed Inform       Date:  2019-01-14       Impact factor: 6.317

7.  Semantic similarity in the biomedical domain: an evaluation across knowledge sources.

Authors:  Vijay N Garla; Cynthia Brandt
Journal:  BMC Bioinformatics       Date:  2012-10-10       Impact factor: 3.169

8.  User centered and ontology based information retrieval system for life sciences.

Authors:  Mohameth-François Sy; Sylvie Ranwez; Jacky Montmain; Armelle Regnault; Michel Crampes; Vincent Ranwez
Journal:  BMC Bioinformatics       Date:  2012-01-25       Impact factor: 3.169

9.  tESA: a distributional measure for calculating semantic relatedness.

Authors:  Maciej Rybinski; José Francisco Aldana-Montes
Journal:  J Biomed Semantics       Date:  2016-12-28

10.  A Word Pair Dataset for Semantic Similarity and Relatedness in Korean Medical Vocabulary: Reference Development and Validation.

Authors:  Sanghoun Song; Hyung Joon Joo; Yunjin Yum; Jeong Moon Lee; Moon Joung Jang; Yoojoong Kim; Jong-Ho Kim; Seongtae Kim; Unsub Shin
Journal:  JMIR Med Inform       Date:  2021-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.