Literature DB >> 23304369

Deterministic binary vectors for efficient automated indexing of MEDLINE/PubMed abstracts.

Manuel Wahle1, Dominic Widdows, Jorge R Herskovic, Elmer V Bernstam, Trevor Cohen.   

Abstract

The need to maintain accessibility of the biomedical literature has led to development of methods to assist human indexers by recommending index terms for newly encountered articles. Given the rapid expansion of this literature, it is essential that these methods be scalable. Document vector representations are commonly used for automated indexing, and Random Indexing (RI) provides the means to generate them efficiently. However, RI is difficult to implement in real-world indexing systems, as (1) efficient nearest-neighbor search requires retaining all document vectors in RAM, and (2) it is necessary to maintain a store of randomly generated term vectors to index future documents. Motivated by these concerns, this paper documents the development and evaluation of a deterministic binary variant of RI. The increased capacity demonstrated by binary vectors has implications for information retrieval, and the elimination of the need to retain term vectors facilitates distributed implementations, enhancing the scalability of RI.

Entities:  

Mesh:

Year:  2012        PMID: 23304369      PMCID: PMC3540485     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  11 in total

1.  The NLM Indexing Initiative.

Authors:  A R Aronson; O Bodenreider; H F Chang; S M Humphrey; J G Mork; S J Nelson; T C Rindflesch; W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

2.  Automatic MeSH term assignment and quality assessment.

Authors:  W Kim; A R Aronson; W J Wilbur
Journal:  Proc AMIA Symp       Date:  2001

3.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  The NLM Indexing Initiative's Medical Text Indexer.

Authors:  Alan R Aronson; James G Mork; Clifford W Gay; Susanne M Humphrey; Willie J Rogers
Journal:  Stud Health Technol Inform       Date:  2004

5.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

6.  Reflective random indexing for semi-automatic indexing of the biomedical literature.

Authors:  Vidya Vasuki; Trevor Cohen
Journal:  J Biomed Inform       Date:  2010-04-09       Impact factor: 6.317

7.  Optimal training sets for Bayesian prediction of MeSH assignment.

Authors:  Sunghwan Sohn; Won Kim; Donald C Comeau; W John Wilbur
Journal:  J Am Med Inform Assoc       Date:  2008-04-24       Impact factor: 4.497

8.  MEDRank: using graph-based concept ranking to index biomedical texts.

Authors:  Jorge R Herskovic; Trevor Cohen; Devika Subramanian; M Sriram Iyengar; Jack W Smith; Elmer V Bernstam
Journal:  Int J Med Inform       Date:  2011-03-25       Impact factor: 4.046

9.  An application of Expert Network to clinical classification and MEDLINE indexing.

Authors:  Y Yang; C G Chute
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1994

10.  Recommending MeSH terms for annotating biomedical articles.

Authors:  Minlie Huang; Aurélie Névéol; Zhiyong Lu
Journal:  J Am Med Inform Assoc       Date:  2011-05-25       Impact factor: 4.497

View more
  9 in total

1.  Reasoning with Vectors: A Continuous Model for Fast Robust Inference.

Authors:  Dominic Widdows; Trevor Cohen
Journal:  Log J IGPL       Date:  2014-11-19       Impact factor: 0.861

2.  Classification-by-Analogy: Using Vector Representations of Implicit Relationships to Identify Plausibly Causal Drug/Side-effect Relationships.

Authors:  Justin Mower; Devika Subramanian; Ning Shang; Trevor Cohen
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

3.  Predicting MeSH Beyond MEDLINE.

Authors:  Adam K Kehoe; Vetle I Torvik; Matthew B Ross; Neil R Smalheiser
Journal:  Proc 1st Workshop Sch Web Min (2017)       Date:  2017-02

4.  Hyperdimensional computing approach to word sense disambiguation.

Authors:  Bjoern-Toby Berster; J Caleb Goodwin; Trevor Cohen
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

5.  Identifying plausible adverse drug reactions using knowledge extracted from the literature.

Authors:  Ning Shang; Hua Xu; Thomas C Rindflesch; Trevor Cohen
Journal:  J Biomed Inform       Date:  2014-07-19       Impact factor: 6.317

6.  Comparison and combination of several MeSH indexing approaches.

Authors:  Antonio Jose Jimeno Yepes; James G Mork; Dina Demner-Fushman; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

7.  Word Sense Disambiguation of clinical abbreviations with hyperdimensional computing.

Authors:  Sungrim Moon; Bjoern-Toby Berster; Hua Xu; Trevor Cohen
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

8.  Discovering discovery patterns with Predication-based Semantic Indexing.

Authors:  Trevor Cohen; Dominic Widdows; Roger W Schvaneveldt; Peter Davies; Thomas C Rindflesch
Journal:  J Biomed Inform       Date:  2012-07-26       Impact factor: 6.317

9.  An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.

Authors:  George Tsatsaronis; Georgios Balikas; Prodromos Malakasiotis; Ioannis Partalas; Matthias Zschunke; Michael R Alvers; Dirk Weissenborn; Anastasia Krithara; Sergios Petridis; Dimitris Polychronopoulos; Yannis Almirantis; John Pavlopoulos; Nicolas Baskiotis; Patrick Gallinari; Thierry Artiéres; Axel-Cyrille Ngonga Ngomo; Norman Heino; Eric Gaussier; Liliana Barrio-Alvers; Michael Schroeder; Ion Androutsopoulos; Georgios Paliouras
Journal:  BMC Bioinformatics       Date:  2015-04-30       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.