Literature DB >> 18998999

A fast document classification algorithm for gene symbol disambiguation in the BITOLA literature-based discovery support system.

Andrej Kastrin1, Dimitar Hristovski.   

Abstract

Gene symbol disambiguation is an important problem for biomedical text mining systems. When detecting gene symbols in MEDLINE citations one of the biggest challenges is the fact that many gene symbols also denote other, more general biomedical concepts (e.g. CT, MR). Our approach to this problem is first to classify the citations into genetic and non-genetic domains and then to detect gene symbols only in the genetic domain. We used ontological information provided by Medical Subject Headings (MeSH) for this classification task. The proposed algorithm is fast and is able to process the full MEDLINE distribution in a few hours. It achieves predictive accuracy of 0.91. The algorithm is currently implemented in the BITOLA literature-based discovery support system (http://www.mf.uni-lj.si/bitola/).

Mesh:

Year:  2008        PMID: 18998999      PMCID: PMC2655979     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  12 in total

1.  Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  J Biomed Inform       Date:  2001-08       Impact factor: 6.317

2.  Supporting discovery in medicine by association rule mining in Medline and UMLS.

Authors:  D Hristovski; J Stare; B Peterlin; S Dzeroski
Journal:  Stud Health Technol Inform       Date:  2001

3.  Gene name ambiguity of eukaryotic nomenclatures.

Authors:  Lifeng Chen; Hongfang Liu; Carol Friedman
Journal:  Bioinformatics       Date:  2004-08-27       Impact factor: 6.937

4.  Using literature-based discovery to identify disease candidate genes.

Authors:  Dimitar Hristovski; Borut Peterlin; Joyce A Mitchell; Susanne M Humphrey
Journal:  Int J Med Inform       Date:  2005-03       Impact factor: 4.046

5.  Gene symbol disambiguation using knowledge-based profiles.

Authors:  Hua Xu; Jung-Wei Fan; George Hripcsak; Eneida A Mendonça; Marianthi Markatou; Carol Friedman
Journal:  Bioinformatics       Date:  2007-02-21       Impact factor: 6.937

6.  Fish oil, Raynaud's syndrome, and undiscovered public knowledge.

Authors:  D R Swanson
Journal:  Perspect Biol Med       Date:  1986       Impact factor: 1.416

7.  Ambiguity of human gene symbols in LocusLink and MEDLINE: creating an inventory and a disambiguation test collection.

Authors:  Marc Weeber; Bob J Schijvenaars; Erik M Van Mulligen; Barend Mons; Rob Jelier; Christian C Van Der Eijk; Jan A Kors
Journal:  AMIA Annu Symp Proc       Date:  2003

8.  Word Sense Disambiguation by Selecting the Best Semantic Type Based on Journal Descriptor Indexing: Preliminary Experiment.

Authors:  Susanne M Humphrey; Willie J Rogers; Halil Kilicoglu; Dina Demner-Fushman; Thomas C Rindflesch
Journal:  J Am Soc Inf Sci Technol       Date:  2006-01-01

9.  Underexpression of mineralocorticoid receptor in colorectal carcinomas and association with VEGFR-2 overexpression.

Authors:  Francesco Di Fabio; Carlos Alvarado; Agnieszka Majdan; Adrian Gologan; Linda Voda; Elliot Mitmaker; Lenore K Beitel; Philip H Gordon; Mark Trifiro
Journal:  J Gastrointest Surg       Date:  2007-08-17       Impact factor: 3.452

10.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2006-12-05       Impact factor: 16.971

View more
  2 in total

1.  A literature search tool for intelligent extraction of disease-associated genes.

Authors:  Jae-Yoon Jung; Todd F DeLuca; Tristan H Nelson; Dennis P Wall
Journal:  J Am Med Inform Assoc       Date:  2013-09-02       Impact factor: 4.497

2.  Networks of neuroinjury semantic predications to identify biomarkers for mild traumatic brain injury.

Authors:  Michael J Cairelli; Marcelo Fiszman; Han Zhang; Thomas C Rindflesch
Journal:  J Biomed Semantics       Date:  2015-05-18
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.