Literature DB >> 20059365

Biological cluster evaluation for gene function prediction.

Sebastian Klie1, Zoran Nikoloski, Joachim Selbig.   

Abstract

Recent advances in high-throughput omics techniques render it possible to decode the function of genes by using the "guilt-by-association" principle on biologically meaningful clusters of gene expression data. However, the existing frameworks for biological evaluation of gene clusters are hindered by two bottleneck issues: (1) the choice for the number of clusters, and (2) the external measures which do not take in consideration the structure of the analyzed data and the ontology of the existing biological knowledge. Here, we address the identified bottlenecks by developing a novel framework that allows not only for biological evaluation of gene expression clusters based on existing structured knowledge, but also for prediction of putative gene functions. The proposed framework facilitates propagation of statistical significance at each of the following steps: (1) estimating the number of clusters, (2) evaluating the clusters in terms of novel external structural measures, (3) selecting an optimal clustering algorithm, and (4) predicting gene functions. The framework also includes a method for evaluation of gene clusters based on the structure of the employed ontology. Moreover, our method for obtaining a probabilistic range for the number of clusters is demonstrated valid on synthetic data and available gene expression profiles from Saccharomyces cerevisiae. Finally, we propose a network-based approach for gene function prediction which relies on the clustering of optimal score and the employed ontology. Our approach effectively predicts gene function on the Saccharomyces cerevisiae data set and is also employed to obtain putative gene functions for an Arabidopsis thaliana data set.

Entities:  

Keywords:  NP-completeness; algorithms; biochemical networks; combinatorics; computational molecular biology; databases; functional genomics; gene expression

Mesh:

Year:  2010        PMID: 20059365     DOI: 10.1089/cmb.2009.0129

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  8 in total

1.  PlaNet: combined sequence and expression comparisons across plant networks derived from seven species.

Authors:  Marek Mutwil; Sebastian Klie; Takayuki Tohge; Federico M Giorgi; Olivia Wilkins; Malcolm M Campbell; Alisdair R Fernie; Björn Usadel; Zoran Nikoloski; Staffan Persson
Journal:  Plant Cell       Date:  2011-03-25       Impact factor: 11.277

2.  Functional genome annotation of Drosophila seminal fluid proteins using transcriptional genetic networks.

Authors:  Julien F Ayroles; Brooke A Laflamme; Eric A Stone; Mariana F Wolfner; Trudy F C Mackay
Journal:  Genet Res (Camb)       Date:  2011-12       Impact factor: 1.588

3.  Metabolomic and transcriptomic stress response of Escherichia coli.

Authors:  Szymon Jozefczuk; Sebastian Klie; Gareth Catchpole; Jedrzej Szymanski; Alvaro Cuadros-Inostroza; Dirk Steinhauser; Joachim Selbig; Lothar Willmitzer
Journal:  Mol Syst Biol       Date:  2010-05-11       Impact factor: 11.429

4.  Conserved co-functional network between maize and Arabidopsis aid in the identification of seed defective genes in maize.

Authors:  Xiangbo Zhang; Yang Cui; Juxuan Wang; Yonghong Huang; Yongwen Qi
Journal:  Genes Genomics       Date:  2021-03-02       Impact factor: 1.839

5.  Speeding up the Consensus Clustering methodology for microarray data analysis.

Authors:  Raffaele Giancarlo; Filippo Utro
Journal:  Algorithms Mol Biol       Date:  2011-01-14       Impact factor: 1.405

6.  The Choice between MapMan and Gene Ontology for Automated Gene Function Prediction in Plant Science.

Authors:  Sebastian Klie; Zoran Nikoloski
Journal:  Front Genet       Date:  2012-06-28       Impact factor: 4.599

7.  Evolution of herbivore-induced early defense signaling was shaped by genome-wide duplications in Nicotiana.

Authors:  Wenwu Zhou; Thomas Brockmöller; Zhihao Ling; Ashton Omdahl; Ian T Baldwin; Shuqing Xu
Journal:  Elife       Date:  2016-11-04       Impact factor: 8.140

8.  Unified feature association networks through integration of transcriptomic and proteomic data.

Authors:  Ryan S McClure; Jason P Wendler; Joshua N Adkins; Jesica Swanstrom; Ralph Baric; Brooke L Deatherage Kaiser; Kristie L Oxford; Katrina M Waters; Jason E McDermott
Journal:  PLoS Comput Biol       Date:  2019-09-17       Impact factor: 4.475

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.