Literature DB >> 10575363

A symbolic-numeric approach to find patterns in genomes. Application to the translation initiation sites of E. coli.

C Delamarche1, P Guerdoux-Jamet, R Gras, J Nicolas.   

Abstract

DNA sequence data provided by genome sequencing programs open new research prospects. In this respect, computational investigations are of major importance to discover new 'functional/structural patterns' and to improve biological process knowledge. For example, even though the principal steps of translation initiation in prokaryotes are known, it is difficult to point out the exact pattern of the mRNA that is recognized by the ribosome. In this study, we have carried out a systematic context analysis of the complete genome of E. coli, around codons in competition for translation initiation. Using a combinatorial approach, we first show that it is possible to accurately define the initiation site by looking for the localization of patterns representing various combinations of trinucleotides. We have combined this approach with a statistical analysis based on the frequencies of these patterns. This leads to a decision tree, able to discriminate true and false starts with a recognition level near 90%. Our method may help to precisely localize the beginning of open reading frames, and point to likely mistakes for some genes in the database. The method may be included as a component of a gene recognition system, is not restricted to a particular genome or a two-classes discrimination, and may be applied to a broader class of biological patterns.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10575363     DOI: 10.1016/s0300-9084(99)00328-4

Source DB:  PubMed          Journal:  Biochimie        ISSN: 0300-9084            Impact factor:   4.079


  3 in total

1.  In vivo evidence for the prokaryotic model of extended codon-anticodon interaction in translation initiation.

Authors:  Donna Esposito; Julien P Fey; Stephan Eberhard; Amanda J Hicks; David B Stern
Journal:  EMBO J       Date:  2003-02-03       Impact factor: 11.598

2.  Prediction of solvent accessibility and sites of deleterious mutations from protein sequence.

Authors:  Huiling Chen; Huan-Xiang Zhou
Journal:  Nucleic Acids Res       Date:  2005-06-03       Impact factor: 16.971

3.  Current awareness.

Authors:  R Drysdale; L Bayraktaroglu
Journal:  Yeast       Date:  2000-06-30       Impact factor: 3.239

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.