Literature DB >> 19345281

The value of parsing as feature generation for gene mention recognition.

Larry H Smith1, W John Wilbur.   

Abstract

We measured the extent to which information surrounding a base noun phrase reflects the presence of a gene name, and evaluated seven different parsers in their ability to provide information for that purpose. Using the GENETAG corpus as a gold standard, we performed machine learning to recognize from its context when a base noun phrase contained a gene name. Starting with the best lexical features, we assessed the gain of adding dependency or dependency-like relations from a full sentence parse. Features derived from parsers improved performance in this partial gene mention recognition task by a small but statistically significant amount. There were virtually no differences between parsers in these experiments.

Entities:  

Mesh:

Year:  2009        PMID: 19345281      PMCID: PMC2757476          DOI: 10.1016/j.jbi.2009.03.011

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  7 in total

1.  Boosting naïve Bayesian learning on a large subset of MEDLINE.

Authors:  W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

2.  Tagging gene and protein names in biomedical text.

Authors:  Lorraine Tanabe; W John Wilbur
Journal:  Bioinformatics       Date:  2002-08       Impact factor: 6.937

3.  MedPost: a part-of-speech tagger for bioMedical text.

Authors:  L Smith; T Rindflesch; W J Wilbur
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

4.  Toward information extraction: identifying protein names from biological papers.

Authors:  K Fukuda; A Tamura; T Tsunoda; T Takagi
Journal:  Pac Symp Biocomput       Date:  1998

5.  Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Authors: 
Journal:  Neural Comput       Date:  1998-09-15       Impact factor: 2.026

6.  BioCreAtIvE task 1A: gene mention finding evaluation.

Authors:  Alexander Yeh; Alexander Morgan; Marc Colosimo; Lynette Hirschman
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

7.  Benchmarking natural-language parsers for biological applications using dependency graphs.

Authors:  Andrew B Clegg; Adrian J Shepherd
Journal:  BMC Bioinformatics       Date:  2007-01-25       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.