Literature DB >> 12855479

Extracting synonymous gene and protein terms from biological literature.

Hong Yu1, Eugene Agichtein.   

Abstract

MOTIVATION: Genes and proteins are often associated with multiple names. More names are added as new functional or structural information is discovered. Because authors can use any one of the known names for a gene or protein, information retrieval and extraction would benefit from identifying the gene and protein terms that are synonyms of the same substance.
RESULTS: We have explored four complementary approaches for extracting gene and protein synonyms from text, namely the unsupervised, partially supervised, and supervised machine-learning techniques, as well as the manual knowledge-based approach. We report results of a large scale evaluation of these alternatives over an archive of biological journal articles. Our evaluation shows that our extraction techniques could be a valuable supplement to resources such as SWISSPROT, as our systems were able to capture gene and protein synonyms not listed in the SWISSPROT database.

Mesh:

Substances:

Year:  2003        PMID: 12855479     DOI: 10.1093/bioinformatics/btg1047

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  Identification of related gene/protein names based on an HMM of name variations.

Authors:  L Yeganova; L Smith; W J Wilbur
Journal:  Comput Biol Chem       Date:  2004-04       Impact factor: 2.877

2.  Biomedical negation scope detection with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

3.  Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts.

Authors:  A M Cohen; W R Hersh; C Dubay; K Spackman
Journal:  BMC Bioinformatics       Date:  2005-04-22       Impact factor: 3.169

4.  Quantitative assessment of dictionary-based protein named entity tagging.

Authors:  Hongfang Liu; Zhang-Zhi Hu; Manabu Torii; Cathy Wu; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

5.  PubMedMiner: Mining and Visualizing MeSH-based Associations in PubMed.

Authors:  Yucan Zhang; Indra Neil Sarkar; Elizabeth S Chen
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

6.  Detecting hedge cues and their scope in biomedical text with conditional random fields.

Authors:  Shashank Agarwal; Hong Yu
Journal:  J Biomed Inform       Date:  2010-08-13       Impact factor: 6.317

7.  Automatic extraction of mutations from Medline and cross-validation with OMIM.

Authors:  Dietrich Rebholz-Schuhmann; Stephane Marcel; Sylvie Albert; Ralf Tolle; Georg Casari; Harald Kirsch
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

Review 8.  Recent advances in biomedical literature mining.

Authors:  Sendong Zhao; Chang Su; Zhiyong Lu; Fei Wang
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

9.  Connecting the dots between PubMed abstracts.

Authors:  M Shahriar Hossain; Joseph Gresock; Yvette Edmonds; Richard Helm; Malcolm Potts; Naren Ramakrishnan
Journal:  PLoS One       Date:  2012-01-03       Impact factor: 3.240

10.  Contextual weighting for Support Vector Machines in literature mining: an application to gene versus protein name disambiguation.

Authors:  Tapio Pahikkala; Filip Ginter; Jorma Boberg; Jouni Järvinen; Tapio Salakoski
Journal:  BMC Bioinformatics       Date:  2005-06-22       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.