Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies?

Rainer Winnenburg, Thomas Wächter, Conrad Plake, Andreas Doms, Michael Schroeder.

Abstract

The biomedical literature can be seen as a large, integrated but unstructured data repository. Extracting facts from literature and making them accessible is approached from two directions. Manual curation efforts develop ontologies and vocabularies to annotate gene products based on statements in papers. Text mining aims to automatically identify entities and their relationships in text using information retrieval and natural language processing techniques. Manual curation is highly accurate but time-consuming, and does not scale with the ever-increasing growth of literature. Text mining as a high-throughput computational technique scales well, but is error-prone due to the complexity of natural language. How can both be married to combine scalability and accuracy? Here, we review the state-of-the-art text mining approaches that are relevant to annotation and discuss available online services that analyse biomedical literature by means of text mining techniques and could also be utilised by annotation projects. We then examine how far text mining has already been utilised in existing annotation projects and conclude how these techniques could be tightly integrated into the manual annotation process through novel authoring systems to scale up high-quality manual curation.

Year: 2008    PMID: 19060303    DOI: 10.1093/bib/bbn043

Source DB: PubMed    Journal: Brief Bioinform    ISSN: 1467-5463    Impact factor: 11.622


  32 in total

Review 1.  Computational tools for prioritizing candidate genes: boosting disease gene discovery.

Authors:  Yves Moreau; Léon-Charles Tranchevent
Journal:  Nat Rev Genet       Date:  2012-07-03       Impact factor: 53.242

2.  Mapping plant interactomes using literature curated and predicted protein-protein interaction data sets.

Authors:  KiYoung Lee; David Thorneycroft; Premanand Achuthan; Henning Hermjakob; Trey Ideker
Journal:  Plant Cell       Date:  2010-04-06       Impact factor: 11.277

3.  Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction.

Authors:  Aurélie Névéol; Rezarta Islamaj Doğan; Zhiyong Lu
Journal:  J Biomed Inform       Date:  2010-11-20       Impact factor: 6.317

Review 4.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

5.  Plasma Proteome Signature of Sepsis: a Functionally Connected Protein Network.

Authors:  Genaro Pimienta; Douglas M Heithoff; Alexandre Rosa-Campos; Minerva Tran; Jeffrey D Esko; Michael J Mahan; Jamey D Marth; Jeffrey W Smith
Journal:  Proteomics       Date:  2019-02-20       Impact factor: 3.984

6.  Improved mutation tagging with gene identifiers applied to membrane protein stability prediction.

Authors:  Rainer Winnenburg; Conrad Plake; Michael Schroeder
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

7.  Semi-automated ontology generation within OBO-Edit.

Authors:  Thomas Wächter; Michael Schroeder
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

8.  A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature.

Authors:  Domonkos Tikk; Philippe Thomas; Peter Palaga; Jörg Hakenberg; Ulf Leser
Journal:  PLoS Comput Biol       Date:  2010-07-01       Impact factor: 4.475

9.  Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).

Authors:  Thomas C Wiegers; Allan Peter Davis; K Bretonnel Cohen; Lynette Hirschman; Carolyn J Mattingly
Journal:  BMC Bioinformatics       Date:  2009-10-08       Impact factor: 3.169

Review 10.  Calling International Rescue: knowledge lost in literature and data landslide!

Authors:  Teresa K Attwood; Douglas B Kell; Philip McDermott; James Marsh; Steve R Pettifer; David Thorne
Journal:  Biochem J       Date:  2009-12-10       Impact factor: 3.857
