Literature DB >> 8872383

Gene structure prediction using information on homologous protein sequence.

I B Rogozin1, L Milanesi, N A Kolchanov.   

Abstract

In this paper a new approach for the prediction of protein coding gene structures is described. The principal scheme of prediction is as follows: first, the exons with the best potential are predicted in a sequence with unknown functions and a list of potential amino acid fragments coded by these exons is formed. Second, testing the homology between each amino acid fragment from the list and proteins from the SWISS-PROT database of amino acid sequences. One protein with the best homology is chosen out of all the homologous sequences. Third, reconstruction of the exon-intron structure, basing it on its homology with the chosen protein sequences. The method was tested on an independent control set (20 genes). The results were as follows: 21% of real exons were lost and 3% of non-real exons were found. This system can be used to refine the results of gene prediction systems, especially if highly homologous proteins are found in the amino acid sequence database.

Mesh:

Substances:

Year:  1996        PMID: 8872383     DOI: 10.1093/bioinformatics/12.3.161

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  4 in total

Review 1.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

2.  Identification of TSIX, encoding an RNA antisense to human XIST, reveals differences from its murine counterpart: implications for X inactivation.

Authors:  B R Migeon; A K Chowdhury; J A Dunston; I McIntosh
Journal:  Am J Hum Genet       Date:  2001-09-12       Impact factor: 11.025

3.  A method of precise mRNA/DNA homology-based gene structure prediction.

Authors:  Alexander Churbanov; Mark Pauley; Daniel Quest; Hesham Ali
Journal:  BMC Bioinformatics       Date:  2005-10-21       Impact factor: 3.169

4.  Gene identification in novel eukaryotic genomes by self-training algorithm.

Authors:  Alexandre Lomsadze; Vardges Ter-Hovhannisyan; Yury O Chernoff; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-11-28       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.