MutationFinder: a high-performance system for extracting point mutation mentions from text.

J Gregory Caporaso, William A Baumgartner, David A Randolph, K Bretonnel Cohen, Lawrence Hunter.

Abstract

Discussion of point mutations is ubiquitous in biomedical literature, and manually compiling databases or literature on mutations in specific genes or proteins is tedious. We present an open-source, rule-based system, MutationFinder, for extracting point mutation mentions from text. On blind test data, it achieves nearly perfect precision and markedly improved recall over a baseline. AVAILABILITY: MutationFinder, along with a high-quality gold standard data set and a scoring script for mutation extraction systems, has been made publicly available. Implementations, source code and unit tests are available in Python, Perl and Java. MutationFinder can be used as a stand-alone script or imported by other applications. PROJECT URL: http://bionlp.sourceforge.net.
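
To make the idea of rule-based extraction concrete, the following is a minimal sketch of how point mutation mentions might be matched with regular expressions. It is not MutationFinder's actual rule set, which the abstract does not describe. The names `PATTERNS` and `extract_point_mutations` are hypothetical, and the two mention styles covered (one-letter forms such as A42G and three-letter forms such as Ala42Gly) are assumptions chosen for illustration.

```python
import re

# The 20 standard amino acids as one-letter codes.
ONE_LETTER = "ACDEFGHIKLMNPQRSTVWY"

# Three-letter abbreviations mapped to one-letter codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
_THREE = "|".join(THREE_TO_ONE)

# Illustrative patterns only (an assumption, not the published rules):
# wild-type residue, 1-based position, mutant residue.
PATTERNS = [
    re.compile(rf"\b(?P<wt>[{ONE_LETTER}])(?P<pos>[1-9]\d*)(?P<mut>[{ONE_LETTER}])\b"),
    re.compile(rf"\b(?P<wt>{_THREE})(?P<pos>[1-9]\d*)(?P<mut>{_THREE})\b"),
]


def extract_point_mutations(text):
    """Return sorted, de-duplicated (wild_type, position, mutant) tuples."""
    found = set()
    for pattern in PATTERNS:
        for match in pattern.finditer(text):
            # Normalize three-letter abbreviations to one-letter codes;
            # one-letter codes are not dictionary keys and pass through.
            wt = THREE_TO_ONE.get(match.group("wt"), match.group("wt"))
            mut = THREE_TO_ONE.get(match.group("mut"), match.group("mut"))
            if wt != mut:  # a residue replaced by itself is not a mutation
                found.add((wt, int(match.group("pos")), mut))
    return sorted(found, key=lambda m: (m[1], m[0], m[2]))


if __name__ == "__main__":
    print(extract_point_mutations("The A42G and Trp87Cys variants were studied."))
```

Normalizing both styles to the same tuple means that A42G and Ala42Gly count as one mutation. A naive one-letter pattern like this one also matches strings that are not mutations, for example the cell line name T47D, which is one reason a practical system needs more careful rules than this sketch.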

Year:  2007        PMID: 17495998      PMCID: PMC2516306          DOI: 10.1093/bioinformatics/btm235

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  70 in total

Review 1.  Bioinformatics for personal genome interpretation.

Authors:  Emidio Capriotti; Nathan L Nehrt; Maricel G Kann; Yana Bromberg
Journal:  Brief Bioinform       Date:  2012-01-13       Impact factor: 11.622

2.  An examination of the OMIM database for associating mutation to a consensus reference sequence.

Authors:  Zuofeng Li; Beili Ying; Xingnan Liu; Xiaoyan Zhang; Hong Yu
Journal:  Protein Cell       Date:  2012-04-04       Impact factor: 14.870

3.  Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature.

Authors:  Emily Doughty; Attila Kertesz-Farkas; Olivier Bodenreider; Gary Thompson; Asa Adadey; Thomas Peterson; Maricel G Kann
Journal:  Bioinformatics       Date:  2010-12-07       Impact factor: 6.937

4.  Beyond accuracy: creating interoperable and scalable text-mining web services.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  Bioinformatics       Date:  2016-02-16       Impact factor: 6.937

5.  Intrinsic evaluation of text mining tools may not predict performance on realistic tasks.

Authors:  J Gregory Caporaso; Nita Deshpande; J Lynn Fink; Philip E Bourne; K Bretonnel Cohen; Lawrence Hunter
Journal:  Pac Symp Biocomput       Date:  2008

Review 6.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

7.  Extraction of genotype-phenotype-drug relationships from text: from entity recognition to bioinformatics application.

Authors:  Adrien Coulet; Nigam Shah; Lawrence Hunter; Chitta Barral; Russ B Altman
Journal:  Pac Symp Biocomput       Date:  2010

8.  Improved mutation tagging with gene identifiers applied to membrane protein stability prediction.

Authors:  Rainer Winnenburg; Conrad Plake; Michael Schroeder
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

9.  Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.

Authors:  Yael Garten; Russ B Altman
Journal:  BMC Bioinformatics       Date:  2009-02-05       Impact factor: 3.169

10.  Extraction of human kinase mutations from literature, databases and genotyping studies.

Authors:  Martin Krallinger; Jose M G Izarzugaza; Carlos Rodriguez-Penagos; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169
