Literature DB >> 14728239

A data-driven approach for extracting "the most specific term" for ontology development.

Guergana K Savova1, Marcelline Harris, Thomas Johnson, Serguei V Pakhomov, Christopher G Chute.   

Abstract

We present a data-driven approach to extract the "most specific" terms relevant to an ontology of functioning, disability and health. The algorithm is a combination of statistical and linguistic approaches. The statistical filter is based on the frequency of the content words in a given text string; the linguistic heuristic is an extension of existing algorithms but goes beyond noun phrases and is formulated as a "complete syntactic node". Thus, it can be applied to any syntactic node of interest in the particular domain. Two test sets were marked by three experts. Test set 1 is a well-constructed text from pain abstracts; test set 2 is actual medical reports. Results are reported as recall, precision, F-score and rate of valid terms in false positives. A limitation of the current research is the relatively small test set.

Entities:  

Mesh:

Year:  2003        PMID: 14728239      PMCID: PMC1480306     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  3 in total

1.  Terminology extraction from text to build an ontology in surgical intensive care.

Authors:  Sophie Le Moigno; Jean Charlet; Didier Bourigault; Patrice Degoulet; Marie-Christine Jaulent
Journal:  Proc AMIA Symp       Date:  2002

2.  Nursing intervention lexicon and taxonomy study: language and classification methods.

Authors:  S J Grobe
Journal:  ANS Adv Nurs Sci       Date:  1990-12       Impact factor: 1.824

3.  Functional status and the forward progress of merry-go-rounds: toward a coherent analytical framework.

Authors:  N K Leidy
Journal:  Nurs Res       Date:  1994 Jul-Aug       Impact factor: 2.381

  3 in total
  4 in total

1.  Evaluating measures of redundancy in clinical texts.

Authors:  Rui Zhang; Serguei Pakhomov; Bridget T McInnes; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Text-mining approach to evaluate terms for ontology development.

Authors:  Lam C Tsoi; Ravi Patel; Wenle Zhao; W Jim Zheng
Journal:  J Biomed Inform       Date:  2009-03-24       Impact factor: 6.317

3.  TEPAPA: a novel in silico feature learning pipeline for mining prognostic and associative factors from text-based electronic medical records.

Authors:  Frank Po-Yen Lin; Adrian Pokorny; Christina Teng; Richard J Epstein
Journal:  Sci Rep       Date:  2017-07-31       Impact factor: 4.379

4.  Terminology extraction from medical texts in Polish.

Authors:  Małgorzata Marciniak; Agnieszka Mykowiecka
Journal:  J Biomed Semantics       Date:  2014-05-31
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.