Literature DB >> 16899653

Predicting essential genes in fungal genomes.

Michael Seringhaus1, Alberto Paccanaro, Anthony Borneman, Michael Snyder, Mark Gerstein.   

Abstract

Essential genes are required for an organism's viability, and the ability to identify these genes in pathogens is crucial to directed drug development. Predicting essential genes through computational methods is appealing because it circumvents expensive and difficult experimental screens. Most such prediction is based on homology mapping to experimentally verified essential genes in model organisms. We present here a different approach, one that relies exclusively on sequence features of a gene to estimate essentiality and offers a promising way to identify essential genes in unstudied or uncultured organisms. We identified 14 characteristic sequence features potentially associated with essentiality, such as localization signals, codon adaptation, GC content, and overall hydrophobicity. Using the well-characterized baker's yeast Saccharomyces cerevisiae, we employed a simple Bayesian framework to measure the correlation of each of these features with essentiality. We then employed the 14 features to learn the parameters of a machine learning classifier capable of predicting essential genes. We trained our classifier on known essential genes in S. cerevisiae and applied it to the closely related and relatively unstudied yeast Saccharomyces mikatae. We assessed predictive success in two ways: First, we compared all of our predictions with those generated by homology mapping between these two species. Second, we verified a subset of our predictions with eight in vivo knockouts in S. mikatae, and we present here the first experimentally confirmed essential genes in this species.

Entities:  

Mesh:

Year:  2006        PMID: 16899653      PMCID: PMC1557763          DOI: 10.1101/gr.5144106

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  31 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome.

Authors:  A Drawid; M Gerstein
Journal:  J Mol Biol       Date:  2000-08-25       Impact factor: 5.469

Review 3.  Rational identification of new antibacterial drug targets that are essential for viability using a genomics-based approach.

Authors:  Alison F Chalker; R Dwayne Lunsford
Journal:  Pharmacol Ther       Date:  2002-07       Impact factor: 12.310

4.  Predicting subcellular localization of proteins using machine-learned classifiers.

Authors:  Z Lu; D Szafron; R Greiner; P Lu; D S Wishart; B Poulin; J Anvik; C Macdonell; R Eisner
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

5.  A hidden Markov model for predicting transmembrane helices in protein sequences.

Authors:  E L Sonnhammer; G von Heijne; A Krogh
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1998

6.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

7.  A simple method for displaying the hydropathic character of a protein.

Authors:  J Kyte; R F Doolittle
Journal:  J Mol Biol       Date:  1982-05-05       Impact factor: 5.469

8.  Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae.

Authors:  Zoltán Dezso; Zoltán N Oltvai; Albert-László Barabási
Journal:  Genome Res       Date:  2003-10-14       Impact factor: 9.043

9.  A genome-based approach for the identification of essential bacterial genes.

Authors:  F Arigoni; F Talabot; M Peitsch; M D Edgerton; E Meldrum; E Allet; R Fish; T Jamotte; M L Curchod; H Loferer
Journal:  Nat Biotechnol       Date:  1998-09       Impact factor: 54.908

10.  Bioinformatics for whole-genome shotgun sequencing of microbial communities.

Authors:  Kevin Chen; Lior Pachter
Journal:  PLoS Comput Biol       Date:  2005-07       Impact factor: 4.475

View more
  39 in total

1.  Gene Expression Evolves under a House-of-Cards Model of Stabilizing Selection.

Authors:  Andrea Hodgins-Davis; Daniel P Rice; Jeffrey P Townsend
Journal:  Mol Biol Evol       Date:  2015-04-20       Impact factor: 16.240

2.  ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes.

Authors:  Zhi-Gang Hua; Yan Lin; Ya-Zhou Yuan; De-Chang Yang; Wen Wei; Feng-Biao Guo
Journal:  Nucleic Acids Res       Date:  2015-05-14       Impact factor: 16.971

3.  Topological properties of robust biological and computational networks.

Authors:  Saket Navlakha; Xin He; Christos Faloutsos; Ziv Bar-Joseph
Journal:  J R Soc Interface       Date:  2014-04-30       Impact factor: 4.118

4.  Characteristics of Plant Essential Genes Allow for within- and between-Species Prediction of Lethal Mutant Phenotypes.

Authors:  John P Lloyd; Alexander E Seddon; Gaurav D Moghe; Matthew C Simenc; Shin-Han Shiu
Journal:  Plant Cell       Date:  2015-08-18       Impact factor: 11.277

5.  Drug target prediction and prioritization: using orthology to predict essentiality in parasite genomes.

Authors:  Maria A Doyle; Robin B Gasser; Ben J Woodcroft; Ross S Hall; Stuart A Ralph
Journal:  BMC Genomics       Date:  2010-04-03       Impact factor: 3.969

6.  Identifying essential genes in bacterial metabolic networks with machine learning methods.

Authors:  Kitiporn Plaimas; Roland Eils; Rainer König
Journal:  BMC Syst Biol       Date:  2010-05-03

7.  Protein network prediction and topological analysis in Leishmania major as a tool for drug target selection.

Authors:  Andrés F Flórez; Daeui Park; Jong Bhak; Byoung-Chul Kim; Allan Kuchinsky; John H Morris; Jairo Espinosa; Carlos Muskus
Journal:  BMC Bioinformatics       Date:  2010-09-27       Impact factor: 3.169

8.  Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information.

Authors:  Marcio L Acencio; Ney Lemke
Journal:  BMC Bioinformatics       Date:  2009-09-16       Impact factor: 3.169

9.  Integrated assessment of genomic correlates of protein evolutionary rate.

Authors:  Yu Xia; Eric A Franzosa; Mark B Gerstein
Journal:  PLoS Comput Biol       Date:  2009-06-12       Impact factor: 4.475

10.  Computational prediction of essential genes in an unculturable endosymbiotic bacterium, Wolbachia of Brugia malayi.

Authors:  Alexander G Holman; Paul J Davis; Jeremy M Foster; Clotilde K S Carlow; Sanjay Kumar
Journal:  BMC Microbiol       Date:  2009-11-28       Impact factor: 3.605

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.