Literature DB >> 15724279

Using amino acid patterns to accurately predict translation initiation sites.

Huiqing Liu1, Hao Han, Jinyan Li, Limsoon Wong.   

Abstract

The translation initiation site (TIS) prediction problem is about how to correctly identify TIS in mRNA, cDNA, or other types of genomic sequences. High prediction accuracy can be helpful in a better understanding of protein coding from nucleotide sequences. This is an important step in genomic analysis to determine protein coding from nucleotide sequences. In this paper, we present an in silico method to predict translation initiation sites in vertebrate cDNA or mRNA sequences. This method consists of three sequential steps as follows. In the first step, candidate features are generated using k-gram amino acid patterns. In the second step, a small number of top-ranked features are selected by an entropy-based algorithm. In the third step, a classification model is built to recognize true TISs by applying support vector machines or ensembles of decision trees to the selected features. We have tested our method on several independent data sets, including two public ones and our own extracted sequences. The experimental results achieved are better than those reported previously using the same data sets. Our high accuracy not only demonstrates the feasibility of our method, but also indicates that there might be "amino acid" patterns around TIS in cDNA and mRNA sequences.

Mesh:

Substances:

Year:  2004        PMID: 15724279

Source DB:  PubMed          Journal:  In Silico Biol        ISSN: 1386-6338


  6 in total

1.  Amino acid biases in the N- and C-termini of proteins are evolutionarily conserved and are conserved between functionally related proteins.

Authors:  Peter M Palenchar
Journal:  Protein J       Date:  2008-08       Impact factor: 2.371

2.  Improvement in the prediction of the translation initiation site through balancing methods, inclusion of acquired knowledge and addition of features to sequences of mRNA.

Authors:  Lívia Márcia Silva; Felipe Carvalho de Souza Teixeira; José Miguel Ortega; Luis Enrique Zárate; Cristiane Neri Nobre
Journal:  BMC Genomics       Date:  2011-12-22       Impact factor: 3.969

3.  MetWAMer: eukaryotic translation initiation site prediction.

Authors:  Michael E Sparks; Volker Brendel
Journal:  BMC Bioinformatics       Date:  2008-09-18       Impact factor: 3.169

4.  Feature selection for the prediction of translation initiation sites.

Authors:  Guo Liang Li; Tze Yun Leong
Journal:  Genomics Proteomics Bioinformatics       Date:  2005-05       Impact factor: 7.691

5.  Dragon TIS Spotter: an Arabidopsis-derived predictor of translation initiation sites in plants.

Authors:  Arturo Magana-Mora; Haitham Ashoor; Boris R Jankovic; Allan Kamau; Karim Awara; Rajesh Chowdhary; John A C Archer; Vladimir B Bajic
Journal:  Bioinformatics       Date:  2012-10-30       Impact factor: 6.937

6.  A comprehensive software suite for the analysis of cDNAs.

Authors:  Kazuharu Arakawa; Haruo Suzuki; Kosuke Fujishima; Kenji Fujimoto; Sho Ueda; Motomu Matsui; Masaru Tomita
Journal:  Genomics Proteomics Bioinformatics       Date:  2005-08       Impact factor: 7.691

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.