Literature DB >> 14728264

Ambiguity of human gene symbols in LocusLink and MEDLINE: creating an inventory and a disambiguation test collection.

Marc Weeber1, Bob J Schijvenaars, Erik M Van Mulligen, Barend Mons, Rob Jelier, Christian C Van Der Eijk, Jan A Kors.   

Abstract

Genes are discovered almost on a daily basis and new names have to be found. Although there are guidelines for gene nomenclature, the naming process is highly creative. Human genes are often named with a gene symbol and a longer, more descriptive term; the short form is very often an abbreviation of the long form. Abbreviations in biomedical language are highly ambiguous, i.e., one gene symbol often refers to more than one gene. Using an existing abbreviation expansion algorithm,we explore MEDLINE for the use of human gene symbols derived from LocusLink. It turns out that just over 40% of these symbols occur in MEDLINE, however, many of these occurrences are not related to genes. Along the process of making an inventory, a disambiguation test collection is constructed automatically.

Entities:  

Mesh:

Year:  2003        PMID: 14728264      PMCID: PMC1480234     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  15 in total

1.  Disambiguating proteins, genes, and RNA in text: a machine learning approach.

Authors:  V Hatzivassiloglou; P A Duboué; A Rzhetsky
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  J Biomed Inform       Date:  2001-08       Impact factor: 6.317

3.  Mapping abbreviations to full forms in biomedical articles.

Authors:  Hong Yu; George Hripcsak; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 May-Jun       Impact factor: 4.497

4.  A study of abbreviations in the UMLS.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  Proc AMIA Symp       Date:  2001

5.  Creating an online dictionary of abbreviations from MEDLINE.

Authors:  Jeffrey T Chang; Hinrich Schütze; Russ B Altman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

6.  Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Authors:  Hongfang Liu; Stephen B Johnson; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

7.  A study of abbreviations in MEDLINE abstracts.

Authors:  Hongfang Liu; Alan R Aronson; Carol Friedman
Journal:  Proc AMIA Symp       Date:  2002

8.  Tagging gene and protein names in biomedical text.

Authors:  Lorraine Tanabe; W John Wilbur
Journal:  Bioinformatics       Date:  2002-08       Impact factor: 6.937

Review 9.  The nature of lexical knowledge.

Authors:  A T McCray
Journal:  Methods Inf Med       Date:  1998-11       Impact factor: 2.176

10.  Developing a test collection for biomedical word sense disambiguation.

Authors:  M Weeber; J G Mork; A R Aronson
Journal:  Proc AMIA Symp       Date:  2001
View more
  12 in total

1.  Generating quality word sense disambiguation test sets based on MeSH indexing.

Authors:  Jung-Wei Fan; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

2.  A multivariate approach for integrating genome-wide expression data and biological knowledge.

Authors:  Sek Won Kong; William T Pu; Peter J Park
Journal:  Bioinformatics       Date:  2006-07-28       Impact factor: 6.937

3.  A fast document classification algorithm for gene symbol disambiguation in the BITOLA literature-based discovery support system.

Authors:  Andrej Kastrin; Dimitar Hristovski
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

4.  Challenges in the association of human single nucleotide polymorphism mentions with unique database identifiers.

Authors:  Philippe E Thomas; Roman Klinger; Laura I Furlong; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  BMC Bioinformatics       Date:  2011-07-05       Impact factor: 3.169

5.  CoPub Mapper: mining MEDLINE based on search term co-publication.

Authors:  Blaise T F Alako; Antoine Veldhoven; Sjozef van Baal; Rob Jelier; Stefan Verhoeven; Ton Rullmann; Jan Polman; Guido Jenster
Journal:  BMC Bioinformatics       Date:  2005-03-11       Impact factor: 3.169

6.  Thesaurus-based disambiguation of gene symbols.

Authors:  Bob J A Schijvenaars; Barend Mons; Marc Weeber; Martijn J Schuemie; Erik M van Mulligen; Hester M Wain; Jan A Kors
Journal:  BMC Bioinformatics       Date:  2005-06-16       Impact factor: 3.169

7.  Retrieval with gene queries.

Authors:  Aditya K Sehgal; Padmini Srinivasan
Journal:  BMC Bioinformatics       Date:  2006-04-21       Impact factor: 3.169

8.  Text mining of full-text journal articles combined with gene expression analysis reveals a relationship between sphingosine-1-phosphate and invasiveness of a glioblastoma cell line.

Authors:  Jeyakumar Natarajan; Daniel Berrar; Werner Dubitzky; Catherine Hack; Yonghong Zhang; Catherine DeSesa; James R Van Brocklyn; Eric G Bremer
Journal:  BMC Bioinformatics       Date:  2006-08-10       Impact factor: 3.169

9.  Gene and protein nomenclature in public databases.

Authors:  Katrin Fundel; Ralf Zimmer
Journal:  BMC Bioinformatics       Date:  2006-08-09       Impact factor: 3.169

10.  Biomedical term mapping databases.

Authors:  Jonathan D Wren; Jeffrey T Chang; James Pustejovsky; Eytan Adar; Harold R Garner; Russ B Altman
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.