Literature DB >> 19348628

Creating reference datasets for systems biology applications using text mining.

Martin Krallinger1, Ana María Rojas, Alfonso Valencia.   

Abstract

High-throughput experimental techniques are generating large data collections with the aim of identifying novel entities involved in fundamental cellular processes as well as drawing a systematic picture of the relationships between individual components. Determining the accuracy of the resulting data and the selection of a subset of targets for more careful characterizations often requires relying on information provided by manually annotated data repositories. These repositories are incomplete and cover only a small fraction of the knowledge contained in the literature. We propose in this paper the use of text-mining technologies to extract, organize, and present information relevant for a particular biological topic. The aims of the resulting approach are (1) to enable topic-centric biological literature navigation, (2) to assist in the construction of manually revised data repositories, (3) to provide prioritization of biological entities for experimental studies, and (4) to enable human interpretation of large-scale experiments by providing direct links of bio-entities to relevant descriptions in the literature.

Entities:  

Mesh:

Year:  2009        PMID: 19348628     DOI: 10.1111/j.1749-6632.2008.03750.x

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


  5 in total

Review 1.  Recent advances in biomedical literature mining.

Authors:  Sendong Zhao; Chang Su; Zhiyong Lu; Fei Wang
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

2.  Targeted Therapy Database (TTD): a model to match patient's molecular profile with current knowledge on cancer biology.

Authors:  Simone Mocellin; Jeff Shrager; Richard Scolyer; Sandro Pasquali; Daunia Verdi; Francesco M Marincola; Marta Briarava; Randy Gobbel; Carlo Rossi; Donato Nitti
Journal:  PLoS One       Date:  2010-08-10       Impact factor: 3.240

3.  The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

Authors:  Martin Krallinger; Miguel Vazquez; Florian Leitner; David Salgado; Andrew Chatr-Aryamontri; Andrew Winter; Livia Perfetto; Leonardo Briganti; Luana Licata; Marta Iannuccelli; Luisa Castagnoli; Gianni Cesareni; Mike Tyers; Gerold Schneider; Fabio Rinaldi; Robert Leaman; Graciela Gonzalez; Sergio Matos; Sun Kim; W John Wilbur; Luis Rocha; Hagit Shatkay; Ashish V Tendulkar; Shashank Agarwal; Feifan Liu; Xinglong Wang; Rafal Rak; Keith Noto; Charles Elkan; Zhiyong Lu; Rezarta Islamaj Dogan; Jean-Fred Fontaine; Miguel A Andrade-Navarro; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

4.  PIE the search: searching PubMed literature for protein interaction information.

Authors:  Sun Kim; Dongseop Kwon; Soo-Yong Shin; W John Wilbur
Journal:  Bioinformatics       Date:  2011-12-22       Impact factor: 6.937

5.  Cost sensitive hierarchical document classification to triage PubMed abstracts for manual curation.

Authors:  Emily Seymour; Rohini Damle; Alessandro Sette; Bjoern Peters
Journal:  BMC Bioinformatics       Date:  2011-12-19       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.