Literature DB >> 16204115

Expert knowledge without the expert: integrated analysis of gene expression and literature to derive active functional contexts.

Robert Küffner1, Katrin Fundel, Ralf Zimmer.   

Abstract

MOTIVATION: The interpretation of expression data without appropriate expert knowledge is difficult and usually limited to exploratory data analysis, such as clustering and detecting differentially regulated genes. However, comparing experimental results against manually compiled knowledge resources might limit or bias the perspective on the data. Thus, manual analysis by experts is required to obtain confident predictions about involved processes.
RESULTS: We present an algorithm to simultaneously derive interpretations of expression measurements together with biological hypotheses from biomedical publications. It identifies active functional contexts ('concepts'), i.e. gene clusters that exhibit both a significant gene expression as well as a coherent literature profile. Manual intervention by an expert in specifying prior knowledge is not required. The approach scales to realistic applications and does not rely on controlled vocabularies or pathway resources. We validated our algorithm by analyzing a current juvenile arthritis dataset. A number of gene clusters and accompanying literature topics are identified as an interpretation of the data that coincide well with the phenotype and biological processes known to be involved in the disease. We demonstrate that generated clusters are both more sensitive and more specific than Gene Ontology categories detected on the same data. The method allows for in-depth investigation of subsets of genes, the associated literature topics and publications. AVAILABILITY: Supplementary data on clusters is available upon request.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16204115     DOI: 10.1093/bioinformatics/bti1143

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

Review 1.  Cardiovascular genomics: a biomarker identification pipeline.

Authors:  John H Phan; Chang F Quo; May Dongmei Wang
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-05-16

Review 2.  Text-mining solutions for biomedical research: enabling integrative biology.

Authors:  Dietrich Rebholz-Schuhmann; Anika Oellrich; Robert Hoehndorf
Journal:  Nat Rev Genet       Date:  2012-11-14       Impact factor: 53.242

3.  Identifying overrepresented concepts in gene lists from literature: a statistical approach based on Poisson mixture model.

Authors:  Xin He; Moushumi Sen Sarma; Xu Ling; Brant Chee; Chengxiang Zhai; Bruce Schatz
Journal:  BMC Bioinformatics       Date:  2010-05-20       Impact factor: 3.169

4.  Rewriting and suppressing UMLS terms for improved biomedical term identification.

Authors:  Kristina M Hettne; Erik M van Mulligen; Martijn J Schuemie; Bob Ja Schijvenaars; Jan A Kors
Journal:  J Biomed Semantics       Date:  2010-03-31

5.  Improving the efficiency of biomarker identification using biological knowledge.

Authors:  John H Phan; Qiqin Yin-Goen; Andrew N Young; May D Wang
Journal:  Pac Symp Biocomput       Date:  2009

6.  The Text-mining based PubChem Bioassay neighboring analysis.

Authors:  Lianyi Han; Tugba O Suzek; Yanli Wang; Steve H Bryant
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

7.  Discovering semantic features in the literature: a foundation for building functional associations.

Authors:  Monica Chagoyen; Pedro Carmona-Saez; Hagit Shatkay; Jose M Carazo; Alberto Pascual-Montano
Journal:  BMC Bioinformatics       Date:  2006-01-26       Impact factor: 3.169

8.  Text-derived concept profiles support assessment of DNA microarray data for acute myeloid leukemia and for androgen receptor stimulation.

Authors:  Rob Jelier; Guido Jenster; Lambert C J Dorssers; Bas J Wouters; Peter J M Hendriksen; Barend Mons; Ruud Delwel; Jan A Kors
Journal:  BMC Bioinformatics       Date:  2007-01-18       Impact factor: 3.169

9.  Novel protein-protein interactions inferred from literature context.

Authors:  Herman H H B M van Haagen; Peter A C 't Hoen; Alessandro Botelho Bovo; Antoine de Morrée; Erik M van Mulligen; Christine Chichester; Jan A Kors; Johan T den Dunnen; Gert-Jan B van Ommen; Silvère M van der Maarel; Vinícius Medina Kern; Barend Mons; Martijn J Schuemie
Journal:  PLoS One       Date:  2009-11-18       Impact factor: 3.240

10.  Martini: using literature keywords to compare gene sets.

Authors:  Theodoros G Soldatos; Seán I O'Donoghue; Venkata P Satagopam; Lars J Jensen; Nigel P Brown; Adriano Barbosa-Silva; Reinhard Schneider
Journal:  Nucleic Acids Res       Date:  2009-10-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.