Literature DB >> 19957157

Analysis of biological processes and diseases using text mining approaches.

Martin Krallinger1, Florian Leitner, Alfonso Valencia.   

Abstract

A number of biomedical text mining systems have been developed to extract biologically relevant information directly from the literature, complementing bioinformatics methods in the analysis of experimentally generated data. We provide a short overview of the general characteristics of natural language data, existing biomedical literature databases, and lexical resources relevant in the context of biomedical text mining. A selected number of practically useful systems are introduced together with the type of user queries supported and the results they generate. The extraction of biological relationships, such as protein-protein interactions as well as metabolic and signaling pathways using information extraction systems, will be discussed through example cases of cancer-relevant proteins. Basic strategies for detecting associations of genes to diseases together with literature mining of mutations, SNPs, and epigenetic information (methylation) are described. We provide an overview of disease-centric and gene-centric literature mining methods for linking genes to phenotypic and genotypic aspects. Moreover, we discuss recent efforts for finding biomarkers through text mining and for gene list analysis and prioritization. Some relevant issues for implementing a customized biomedical text mining system will be pointed out. To demonstrate the usefulness of literature mining for the molecular oncology domain, we implemented two cancer-related applications. The first tool consists of a literature mining system for retrieving human mutations together with supporting articles. Specific gene mutations are linked to a set of predefined cancer types. The second application consists of a text categorization system supporting breast cancer-specific literature search and document-based breast cancer gene ranking. Future trends in text mining emphasize the importance of community efforts such as the BioCreative challenge for the development and integration of multiple systems into a common platform provided by the BioCreative Metaserver.

Entities:  

Mesh:

Year:  2010        PMID: 19957157     DOI: 10.1007/978-1-60327-194-3_16

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  29 in total

1.  A literature search tool for intelligent extraction of disease-associated genes.

Authors:  Jae-Yoon Jung; Todd F DeLuca; Tristan H Nelson; Dennis P Wall
Journal:  J Am Med Inform Assoc       Date:  2013-09-02       Impact factor: 4.497

2.  Prediction of similarities among rheumatic diseases.

Authors:  Pinar Yildirim; Cinar Ceken; Reza Hassanpour; Mehmet Resit Tolun
Journal:  J Med Syst       Date:  2010-11-03       Impact factor: 4.460

3.  NCBI disease corpus: a resource for disease name recognition and concept normalization.

Authors:  Rezarta Islamaj Doğan; Robert Leaman; Zhiyong Lu
Journal:  J Biomed Inform       Date:  2014-01-03       Impact factor: 6.317

4.  Enabling enrichment analysis with the Human Disease Ontology.

Authors:  Paea LePendu; Mark A Musen; Nigam H Shah
Journal:  J Biomed Inform       Date:  2011-04-29       Impact factor: 6.317

5.  A gene ontology inferred from molecular networks.

Authors:  Janusz Dutkowski; Michael Kramer; Michal A Surma; Rama Balakrishnan; J Michael Cherry; Nevan J Krogan; Trey Ideker
Journal:  Nat Biotechnol       Date:  2013-01       Impact factor: 54.908

6.  Discriminative and informative features for biomolecular text mining with ensemble feature selection.

Authors:  Sofie Van Landeghem; Thomas Abeel; Yvan Saeys; Yves Van de Peer
Journal:  Bioinformatics       Date:  2010-09-15       Impact factor: 6.937

7.  Semantically linking molecular entities in literature through entity relationships.

Authors:  Sofie Van Landeghem; Jari Björne; Thomas Abeel; Bernard De Baets; Tapio Salakoski; Yves Van de Peer
Journal:  BMC Bioinformatics       Date:  2012-06-26       Impact factor: 3.169

8.  iBBiG: iterative binary bi-clustering of gene sets.

Authors:  Daniel Gusenleitner; Eleanor A Howe; Stefan Bentink; John Quackenbush; Aedín C Culhane
Journal:  Bioinformatics       Date:  2012-07-12       Impact factor: 6.937

9.  Chapter 9: Analyses using disease ontologies.

Authors:  Nigam H Shah; Tyler Cole; Mark A Musen
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

10.  Using cited references to improve the retrieval of related biomedical documents.

Authors:  Francisco M Ortuño; Ignacio Rojas; Miguel A Andrade-Navarro; Jean-Fred Fontaine
Journal:  BMC Bioinformatics       Date:  2013-03-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.