Literature DB >> 18326660

Prokaryotic gene finding based on physicochemical characteristics of codons calculated from molecular dynamics simulations.

Poonam Singhal1, B Jayaram, Surjit B Dixit, David L Beveridge.   

Abstract

An ab initio model for gene prediction in prokaryotic genomes is proposed based on physicochemical characteristics of codons calculated from molecular dynamics (MD) simulations. The model requires a specification of three calculated quantities for each codon: the double-helical trinucleotide base pairing energy, the base pair stacking energy, and an index of the propensity of a codon for protein-nucleic acid interactions. The base pairing and stacking energies for each codon are obtained from recently reported MD simulations on all unique tetranucleotide steps, and the third parameter is assigned based on the conjugate rule previously proposed to account for the wobble hypothesis with respect to degeneracies in the genetic code. The third interaction propensity parameter values correlate well with ab initio MD calculated solvation energies and flexibility of codon sequences as well as codon usage in genes and amino acid composition frequencies in approximately 175,000 protein sequences in the Swissprot database. Assignment of these three parameters for each codon enables the calculation of the magnitude and orientation of a cumulative three-dimensional vector for a DNA sequence of any length in each of the six genomic reading frames. Analysis of 372 genomes comprising approximately 350,000 genes shows that the orientations of the gene and nongene vectors are well differentiated and make a clear distinction feasible between genic and nongenic sequences at a level equivalent to or better than currently available knowledge-based models trained on the basis of empirical data, presenting a strong support for the possibility of a unique and useful physicochemical characterization of DNA sequences from codons to genomes.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18326660      PMCID: PMC2480686          DOI: 10.1529/biophysj.107.116392

Source DB:  PubMed          Journal:  Biophys J        ISSN: 0006-3495            Impact factor:   4.033


  30 in total

1.  EcoGene: a genome sequence database for Escherichia coli K-12.

Authors:  K E Rudd
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  On the convergence of a clustering algorithm for protein-coding regions in microbial genomes.

Authors:  P Baldi
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

Review 3.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

Review 4.  Computational prediction of eukaryotic protein-coding genes.

Authors:  Michael Q Zhang
Journal:  Nat Rev Genet       Date:  2002-09       Impact factor: 53.242

5.  Molecular dynamics simulations of the 136 unique tetranucleotide sequences of DNA oligonucleotides. I. Research design and results on d(CpG) steps.

Authors:  David L Beveridge; Gabriela Barreiro; K Suzie Byun; David A Case; Thomas E Cheatham; Surjit B Dixit; Emmanuel Giudice; Filip Lankas; Richard Lavery; John H Maddocks; Roman Osman; Eleanore Seibert; Heinz Sklenar; Gautier Stoll; Kelly M Thayer; Péter Varnai; Matthew A Young
Journal:  Biophys J       Date:  2004-08-23       Impact factor: 4.033

6.  A hidden Markov model that finds genes in E. coli DNA.

Authors:  A Krogh; I S Mian; D Haussler
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

7.  How to interpret an anonymous bacterial genome: machine learning approach to gene identification.

Authors:  W S Hayes; M Borodovsky
Journal:  Genome Res       Date:  1998-11       Impact factor: 9.043

8.  Beyond the wobble: the rule of conjugates.

Authors:  B Jayaram
Journal:  J Mol Evol       Date:  1997-12       Impact factor: 2.395

Review 9.  Computational methods for the identification of genes in vertebrate genomic sequences.

Authors:  J M Claverie
Journal:  Hum Mol Genet       Date:  1997       Impact factor: 6.150

10.  Microbial gene identification using interpolated Markov models.

Authors:  S L Salzberg; A L Delcher; S Kasif; O White
Journal:  Nucleic Acids Res       Date:  1998-01-15       Impact factor: 16.971

View more
  12 in total

1.  DNA-energetics-based analyses suggest additional genes in prokaryotes.

Authors:  Garima Khandelwal; Jalaj Gupta; B Jayaram
Journal:  J Biosci       Date:  2012-07       Impact factor: 1.826

2.  Toward a Universal Structural and Energetic Model for Prokaryotic Promoters.

Authors:  Akhilesh Mishra; Priyanka Siwach; Pallavi Misra; Bhyravabhotla Jayaram; Manju Bansal; Wilma K Olson; Kelly M Thayer; David L Beveridge
Journal:  Biophys J       Date:  2018-08-08       Impact factor: 4.033

3.  A statistical thermodynamic model for investigating the stability of DNA sequences from oligonucleotides to genomes.

Authors:  Garima Khandelwal; Rebecca A Lee; B Jayaram; David L Beveridge
Journal:  Biophys J       Date:  2014-06-03       Impact factor: 4.033

Review 4.  The ABCs of molecular dynamics simulations on B-DNA, circa 2012.

Authors:  David L Beveridge; Thomas E Cheatham; Mihaly Mezei
Journal:  J Biosci       Date:  2012-07       Impact factor: 1.826

5.  A phenomenological model for predicting melting temperatures of DNA sequences.

Authors:  Garima Khandelwal; Jayaram Bhyravabhotla
Journal:  PLoS One       Date:  2010-08-26       Impact factor: 3.240

6.  Physico-chemical fingerprinting of RNA genes.

Authors:  Ankita Singh; Akhilesh Mishra; Ali Khosravi; Garima Khandelwal; B Jayaram
Journal:  Nucleic Acids Res       Date:  2017-04-20       Impact factor: 16.971

7.  Investigating dynamic and energetic determinants of protein nucleic acid recognition: analysis of the zinc finger zif268-DNA complexes.

Authors:  Rubben Torella; Elisabetta Moroni; Michele Caselle; Giulia Morra; Giorgio Colombo
Journal:  BMC Struct Biol       Date:  2010-11-24

8.  An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Authors:  Ulrich Omasits; Adithi R Varadarajan; Michael Schmid; Sandra Goetze; Damianos Melidis; Marc Bourqui; Olga Nikolayeva; Maxime Québatte; Andrea Patrignani; Christoph Dehio; Juerg E Frey; Mark D Robinson; Bernd Wollscheid; Christian H Ahrens
Journal:  Genome Res       Date:  2017-11-15       Impact factor: 9.043

9.  The Leader Peptide peTrpL Forms Antibiotic-Containing Ribonucleoprotein Complexes for Posttranscriptional Regulation of Multiresistance Genes.

Authors:  Hendrik Melior; Sandra Maaß; Siqi Li; Konrad U Förstner; Saina Azarderakhsh; Adithi R Varadarajan; Maximilian Stötzel; Muhammad Elhossary; Susanne Barth-Weber; Christian H Ahrens; Dörte Becher; Elena Evguenieva-Hackenberg
Journal:  mBio       Date:  2020-06-16       Impact factor: 7.867

Review 10.  Genomes to hits in silico - a country path today, a highway tomorrow: a case study of chikungunya.

Authors:  Anjali Soni; Khushhali M Pandey; Pratima Ray; B Jayaram
Journal:  Curr Pharm Des       Date:  2013       Impact factor: 3.116

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.