GAPSCORE: finding gene and protein names one word at a time.

Jeffrey T Chang, Hinrich Schütze, Russ B Altman.

Abstract

MOTIVATION: New high-throughput technologies have accelerated the accumulation of knowledge about genes and proteins. However, much knowledge is still stored as written natural language text. Therefore, we have developed a new method, GAPSCORE, to identify gene and protein names in text. GAPSCORE scores words based on a statistical model of gene names that quantifies their appearance, morphology and context.
RESULTS: We evaluated GAPSCORE against the Yapex data set and achieved an F-score of 82.5% (83.3% recall, 81.5% precision) for partial matches and 57.6% (58.5% recall, 56.7% precision) for exact matches. Since the method is statistical, users can choose score cutoffs that adjust the performance according to their needs.
AVAILABILITY: GAPSCORE is available at http://bionlp.stanford.edu/gapscore/
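
The abstract does not say which F-score variant was used. Assuming it is the balanced F-score (F1, the harmonic mean of precision and recall), the reported figures can be recomputed from the quoted precision and recall values. The function name `f_score` below is illustrative and is not part of GAPSCORE.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score (F1): harmonic mean of precision and recall.

    Assumption: the abstract's "F-score" is the balanced F1 measure.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as quoted in the abstract, expressed as fractions.
partial = f_score(precision=0.815, recall=0.833)
exact = f_score(precision=0.567, recall=0.585)

print(f"partial matches: {partial:.3f}")
print(f"exact matches:   {exact:.3f}")
```

Because the inputs are already rounded to one decimal place, the recomputed values agree with the reported 82.5% and 57.6% only to within about 0.2 percentage points.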

Substances: Proteins

Year: 2004 PMID: 14734313 DOI: 10.1093/bioinformatics/btg393

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

Cited: 22 in total

1. NLProt: extracting protein names and sequences from papers.

2. A statistical approach to scanning the biomedical literature for pharmacogenetics knowledge.

3. Quantitative assessment of dictionary-based protein named entity tagging.

4. BioTagger-GM: a gene/protein name recognition system.

5. Recent progress in automatically extracting information from the pharmacogenomic literature. (Review)

6. eFIP: a tool for mining functional impact of phosphorylation from literature.

7. Systematic identification of pharmacogenomics information from clinical trials.

8. BIOADI: a machine learning approach to identifying abbreviations and definitions in biological literature.

9. EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts.

Authors: Süveyda Yeniterzi; Ugur Sezerman
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

10. 3D-footprint: a database for the structural analysis of protein-DNA complexes.

Authors: Bruno Contreras-Moreira
Journal: Nucleic Acids Res Date: 2009-09-18 Impact factor: 16.971