Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 High-performance gene name normalization with GeNo.

Literature DB >> 19188193

High-performance gene name normalization with GeNo.

Joachim Wermter¹, Katrin Tomanek, Udo Hahn.

Abstract

MOTIVATION: The recognition and normalization of textual mentions of gene and protein names is both particularly important and challenging. Its importance lies in the fact that they constitute the crucial conceptual entities in biomedicine. Their recognition and normalization remains a challenging task because of widespread gene name ambiguities within species, across species, with common English words and with medical sublanguage terms.
RESULTS: We present GeNo, a highly competitive system for gene name normalization, which obtains an F-measure performance of 86.4% (precision: 87.8%, recall: 85.0%) on the BioCreAtIvE-II test set, thus being on a par with the best system on that task. Our system tackles the complex gene normalization problem by employing a carefully crafted suite of symbolic and statistical methods, and by fully relying on publicly available software and data resources, including extensive background knowledge based on semantic profiling. A major goal of our work is to present GeNo's architecture in a lucid and perspicuous way to pave the way to full reproducibility of our results. AVAILABILITY: GeNo, including its underlying resources, will be available from www.julielab.de. It is also currently deployed in the Semedico search engine at www.semedico.org.

Entities: Chemical

Mesh：

Substances：
Proteins

Year: 2009 PMID： 19188193 DOI： 10.1093/bioinformatics/btp071

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

43 in total

1. Cross-species gene normalization by species inference.

Authors: Chih-Hsuan Wei; Hung-Yu Kao
Journal: BMC Bioinformatics Date: 2011-10-03 Impact factor: 3.169

2. Soft tagging of overlapping high confidence gene mention variants for cross-species full-text gene normalization.

Authors: Cheng-Ju Kuo; Maurice H T Ling; Chun-Nan Hsu
Journal: BMC Bioinformatics Date: 2011-10-03 Impact factor: 3.169

3. Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature.

Authors: Emily Doughty; Attila Kertesz-Farkas; Olivier Bodenreider; Gary Thompson; Asa Adadey; Thomas Peterson; Maricel G Kann
Journal: Bioinformatics Date: 2010-12-07 Impact factor: 6.937

High-performance gene name normalization with GeNo.

1. Cross-species gene normalization by species inference.

2. Soft tagging of overlapping high confidence gene mention variants for cross-species full-text gene normalization.

3. Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature.

4. Beyond accuracy: creating interoperable and scalable text-mining web services.

5. A literature search tool for intelligent extraction of disease-associated genes.

Review 6. Recent progress in automatically extracting information from the pharmacogenomic literature.

7. SimConcept: A Hybrid Approach for Simplifying Composite Named Entities in Biomedicine.

8. Moara: a Java library for extracting and normalizing gene and protein mentions.

9. SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

10. Biomedical text mining and its applications.