Literature DB >> 15988041

Comparison of vector space model methodologies to reconcile cross-species neuroanatomical concepts.

P R Srinivas1, Shang-Heng Wei, Nello Cristianini, E G Jones, F A Gorin.   

Abstract

Generating informational thesauri that classify, cross-reference, and retrieve diverse and highly detailed neuroscientific information requires identifying related neuroanatomical terms and acronyms within and between species (Gorin et al., 2001) Manual construction of such informational thesauri is laborious, and we describe implementing and evaluating a neuroanatomical term and acronym reconciliation (NTAR) system to assist domain experts with this task. NTAR is composed of two modules. The neuroanatomical term extraction (NTE) module employs a hidden Markov model (HMM) in conjunction with lexical rules to extract neuroanatomical terms (NT) and acronyms (NA) from textual material. The output of the NTE is formatted into collections of term- or acronym-indexed documents composed of sentences and word phrases extracted from textual material. The second information retrieval (IR) module utilizes a vector space model (VSM) and includes a novel, automated relevance feedback algorithm. The IR module retrieves statistically related neuroanatomical terms and acronyms in response to queried neuroanatomical terms and acronyms. Neuroanatomical terms and acronyms retrieval obtained from term-based inquiries were compared with (1) term retrieval obtained by including automated relevance feedback and with (2) term retrieval using "document-to-document" comparisons (context-based VSM). The retrieval of synonymous and similar primate and macaque thalamic terms and acronyms in response to a query list of human thalamic terminology by these three IR approaches was compared against a previously published, manually constructed concordance table of homologous cross-species terms and acronyms. Term-based VSM with automated relevance feedback retrieved 70% and 80% of these primate and macaque terms and acronyms, respectively, listed in the concordance table. Automated feedback algorithm correctly identified 87% of the macaque terms and acronyms that were independently selected by a domain expert as being appropriate for manual relevance feedback. Context-based VSM correctly retrieved 97% and 98% of the primate and macaque terms and acronyms listed in the term homology table. These results indicate that the NTAR system could assist neuroscientists with thesauri creation for closely related, highly detailed neuroanatomical domains.

Entities:  

Mesh:

Year:  2005        PMID: 15988041     DOI: 10.1385/NI:3:2:115

Source DB:  PubMed          Journal:  Neuroinformatics        ISSN: 1539-2791


  10 in total

1.  Mining molecular binding terminology from biomedical text.

Authors:  T C Rindflesch; L Hunter; A R Aronson
Journal:  Proc AMIA Symp       Date:  1999

Review 2.  The challenges and rewards of integrating diverse neuroscience information.

Authors:  F Gorin; M Hogarth; M Gertz
Journal:  Neuroscientist       Date:  2001-02       Impact factor: 7.519

3.  Methods for semi-automated indexing for high precision information retrieval.

Authors:  Daniel C Berrios; Russell J Cucina; Lawrence M Fagan
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

4.  Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Authors:  Hongfang Liu; Stephen B Johnson; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

5.  Free-text medical document retrieval via phrase-based vector space model.

Authors:  Wenlei Mao; Wesley W Chu
Journal:  Proc AMIA Symp       Date:  2002

6.  A multi-aspect comparison study of supervised word sense disambiguation.

Authors:  Hongfang Liu; Virginia Teller; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

7.  Neuroanatomical term generation and comparison between two terminologies.

Authors:  Prashanti R Srinivas; Daniel Gusfield; Oliver Mason; Michael Gertz; Michael Hogarth; James Stone; Edward G Jones; Fredric A Gorin
Journal:  Neuroinformatics       Date:  2003

8.  NeuroNames 2002.

Authors:  Douglas M Bowden; Mark F Dubach
Journal:  Neuroinformatics       Date:  2003

9.  A shallow parser based on closed-class words to capture relations in biomedical text.

Authors:  Gondy Leroy; Hsinchun Chen; Jesse D Martinez
Journal:  J Biomed Inform       Date:  2003-06       Impact factor: 6.317

Review 10.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

  10 in total
  1 in total

1.  Using text mining to link journal articles to neuroanatomical databases.

Authors:  Leon French; Paul Pavlidis
Journal:  J Comp Neurol       Date:  2012-06-01       Impact factor: 3.215

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.