Literature DB >> 19758426

Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information.

Marcio L Acencio1, Ney Lemke.   

Abstract

BACKGROUND: The identification of essential genes is important for the understanding of the minimal requirements for cellular life and for practical purposes, such as drug design. However, the experimental techniques for essential genes discovery are labor-intensive and time-consuming. Considering these experimental constraints, a computational approach capable of accurately predicting essential genes would be of great value. We therefore present here a machine learning-based computational approach relying on network topological features, cellular localization and biological process information for prediction of essential genes.
RESULTS: We constructed a decision tree-based meta-classifier and trained it on datasets with individual and grouped attributes-network topological features, cellular compartments and biological processes-to generate various predictors of essential genes. We showed that the predictors with better performances are those generated by datasets with integrated attributes. Using the predictor with all attributes, i.e., network topological features, cellular compartments and biological processes, we obtained the best predictor of essential genes that was then used to classify yeast genes with unknown essentiality status. Finally, we generated decision trees by training the J48 algorithm on datasets with all network topological features, cellular localization and biological process information to discover cellular rules for essentiality. We found that the number of protein physical interactions, the nuclear localization of proteins and the number of regulating transcription factors are the most important factors determining gene essentiality.
CONCLUSION: We were able to demonstrate that network topological features, cellular localization and biological process information are reliable predictors of essential genes. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing essentiality.

Entities:  

Mesh:

Year:  2009        PMID: 19758426      PMCID: PMC2753850          DOI: 10.1186/1471-2105-10-290

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  47 in total

1.  Predicting essential genes in fungal genomes.

Authors:  Michael Seringhaus; Alberto Paccanaro; Anthony Borneman; Michael Snyder; Mark Gerstein
Journal:  Genome Res       Date:  2006-08-09       Impact factor: 9.043

2.  Currency and commodity metabolites: their identification and relation to the modularity of metabolic networks.

Authors:  M Huss; P Holme
Journal:  IET Syst Biol       Date:  2007-09       Impact factor: 1.615

3.  The THP1-SAC3-SUS1-CDC31 complex works in transcription elongation-mRNA export preventing RNA-mediated genome instability.

Authors:  Cristina González-Aguilera; Cristina Tous; Belén Gómez-González; Pablo Huertas; Rosa Luna; Andrés Aguilera
Journal:  Mol Biol Cell       Date:  2008-07-30       Impact factor: 4.138

4.  Functional genomics of genes with small open reading frames (sORFs) in S. cerevisiae.

Authors:  James P Kastenmayer; Li Ni; Angela Chu; Lauren E Kitchen; Wei-Chun Au; Hui Yang; Carole D Carter; David Wheeler; Ronald W Davis; Jef D Boeke; Michael A Snyder; Munira A Basrai
Journal:  Genome Res       Date:  2006-03       Impact factor: 9.043

Review 5.  What are decision trees?

Authors:  Carl Kingsford; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2008-09       Impact factor: 54.908

6.  A global view of pleiotropy and phenotypically derived gene function in yeast.

Authors:  Aimée Marie Dudley; Daniel Maarten Janse; Amos Tanay; Ron Shamir; George McDonald Church
Journal:  Mol Syst Biol       Date:  2005-03-29       Impact factor: 11.429

7.  Towards the identification of essential genes using targeted genome sequencing and comparative analysis.

Authors:  Adam M Gustafson; Evan S Snitkin; Stephen C J Parker; Charles DeLisi; Simon Kasif
Journal:  BMC Genomics       Date:  2006-10-19       Impact factor: 3.969

8.  Why do hubs in the yeast protein interaction network tend to be essential: reexamining the connection between the network topology and essentiality.

Authors:  Elena Zotenko; Julian Mestre; Dianne P O'Leary; Teresa M Przytycka
Journal:  PLoS Comput Biol       Date:  2008-08-01       Impact factor: 4.475

9.  The BioGRID Interaction Database: 2008 update.

Authors:  Bobby-Joe Breitkreutz; Chris Stark; Teresa Reguly; Lorrie Boucher; Ashton Breitkreutz; Michael Livstone; Rose Oughtred; Daniel H Lackner; Jürg Bähler; Valerie Wood; Kara Dolinski; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2007-11-13       Impact factor: 16.971

10.  StAR: a simple tool for the statistical comparison of ROC curves.

Authors:  Ismael A Vergara; Tomás Norambuena; Evandro Ferrada; Alex W Slater; Francisco Melo
Journal:  BMC Bioinformatics       Date:  2008-06-05       Impact factor: 3.169

View more
  44 in total

1.  Both the hydrophobicity and a positively charged region flanking the C-terminal region of the transmembrane domain of signal-anchored proteins play critical roles in determining their targeting specificity to the endoplasmic reticulum or endosymbiotic organelles in Arabidopsis cells.

Authors:  Junho Lee; Hyunkyung Lee; Jinho Kim; Sumin Lee; Dae Heon Kim; Sanguk Kim; Inhwan Hwang
Journal:  Plant Cell       Date:  2011-04-22       Impact factor: 11.277

2.  Topological properties of robust biological and computational networks.

Authors:  Saket Navlakha; Xin He; Christos Faloutsos; Ziv Bar-Joseph
Journal:  J R Soc Interface       Date:  2014-04-30       Impact factor: 4.118

3.  Characteristics of Plant Essential Genes Allow for within- and between-Species Prediction of Lethal Mutant Phenotypes.

Authors:  John P Lloyd; Alexander E Seddon; Gaurav D Moghe; Matthew C Simenc; Shin-Han Shiu
Journal:  Plant Cell       Date:  2015-08-18       Impact factor: 11.277

4.  Identifying essential genes in bacterial metabolic networks with machine learning methods.

Authors:  Kitiporn Plaimas; Roland Eils; Rainer König
Journal:  BMC Syst Biol       Date:  2010-05-03

5.  Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks.

Authors:  Wei Peng; Jianxin Wang; Weiping Wang; Qing Liu; Fang-Xiang Wu; Yi Pan
Journal:  BMC Syst Biol       Date:  2012-07-18

6.  From hub proteins to hub modules: the relationship between essentiality and centrality in the yeast interactome at different scales of organization.

Authors:  Jimin Song; Mona Singh
Journal:  PLoS Comput Biol       Date:  2013-02-21       Impact factor: 4.475

7.  Examination of the relationship between essential genes in PPI network and hub proteins in reverse nearest neighbor topology.

Authors:  Kang Ning; Hoong Kee Ng; Sriganesh Srihari; Hon Wai Leong; Alexey I Nesvizhskii
Journal:  BMC Bioinformatics       Date:  2010-10-12       Impact factor: 3.169

8.  A machine learning approach for genome-wide prediction of morbid and druggable human genes based on systems-level data.

Authors:  Pedro R Costa; Marcio L Acencio; Ney Lemke
Journal:  BMC Genomics       Date:  2010-12-22       Impact factor: 3.969

9.  Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning.

Authors:  Tulio L Campos; Pasi K Korhonen; Neil D Young
Journal:  Int J Mol Sci       Date:  2021-05-11       Impact factor: 5.923

10.  A new method for the discovery of essential proteins.

Authors:  Xue Zhang; Jin Xu; Wang-xin Xiao
Journal:  PLoS One       Date:  2013-03-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.