Literature DB >> 10764574

Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoring.

J Usuka1, V Brendel.   

Abstract

Gene identification in genomic DNA from eukaryotes is complicated by the vast combinatorial possibilities of potential exon assemblies. If the gene encodes a protein that is closely related to known proteins, gene identification is aided by matching similarity of potential translation products to those target proteins. The genomic DNA and protein sequences can be aligned directly by scoring the implied residues of in-frame nucleotide triplets against the protein residues in conventional ways, while allowing for long gaps in the alignment corresponding to introns in the genomic DNA. We describe a novel method for such spliced alignment. The method derives an optimal alignment based on scoring for both sequence similarity of the predicted gene product to the protein sequence and intrinsic splice site strength of the predicted introns. Application of the method to a representative set of 50 known genes from Arabidopsis thaliana showed significant improvement in prediction accuracy compared to previous spliced alignment methods. The method is also more accurate than ab initio gene prediction methods, provided sufficiently close target proteins are available. In view of the fast growth of public sequence repositories, we argue that close targets will be available for the majority of novel genes, making spliced alignment an excellent practical tool for high-throughput automated genome annotation. Copyright 2000 Academic Press.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10764574     DOI: 10.1006/jmbi.2000.3641

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  18 in total

1.  The maize genome contains a helitron insertion.

Authors:  Shailesh K Lal; Michael J Giroux; Volker Brendel; C Eduardo Vallejos; L Curtis Hannah
Journal:  Plant Cell       Date:  2003-02       Impact factor: 11.277

2.  Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping.

Authors:  Wei Zhu; Shannon D Schlueter; Volker Brendel
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

Review 3.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

4.  Genomic shotgun array: a procedure linking large-scale DNA sequencing with regional transcript mapping.

Authors:  Ling-Hui Li; Jian-Chiuan Li; Yung-Feng Lin; Chung-Yen Lin; Chung-Yung Chen; Shih-Feng Tsai
Journal:  Nucleic Acids Res       Date:  2004-02-11       Impact factor: 16.971

5.  A novel class of Helitron-related transposable elements in maize contain portions of multiple pseudogenes.

Authors:  Smriti Gupta; Andrea Gallavotti; Gabrielle A Stryker; Robert J Schmidt; Shailesh K Lal
Journal:  Plant Mol Biol       Date:  2005-01       Impact factor: 4.076

6.  Evaluation of five ab initio gene prediction programs for the discovery of maize genes.

Authors:  Hong Yao; Ling Guo; Yan Fu; Lisa A Borsuk; Tsui-Jung Wen; David S Skibbe; Xiangqin Cui; Brian E Scheffler; Jun Cao; Scott J Emrich; Daniel A Ashlock; Patrick S Schnable
Journal:  Plant Mol Biol       Date:  2005-02       Impact factor: 4.076

7.  Two large Arabidopsis thaliana gene families are homologous to the Brassica gene superfamily that encodes pollen coat proteins and the male component of the self-incompatibility response.

Authors:  V Vanoosthuyse; C Miege; C Dumas; J M Cock
Journal:  Plant Mol Biol       Date:  2001-05       Impact factor: 4.076

8.  Helitron mediated amplification of cytochrome P450 monooxygenase gene in maize.

Authors:  Natalie Jameson; Nikolaos Georgelis; Eric Fouladbash; Sara Martens; L Curtis Hannah; Shailesh Lal
Journal:  Plant Mol Biol       Date:  2008-06       Impact factor: 4.076

9.  Cooperation of Spaln and Prrn5 for Construction of Gene-Structure-Aware Multiple Sequence Alignment.

Authors:  Osamu Gotoh
Journal:  Methods Mol Biol       Date:  2021

10.  Comparative plant genomics resources at PlantGDB.

Authors:  Qunfeng Dong; Carolyn J Lawrence; Shannon D Schlueter; Matthew D Wilkerson; Stefan Kurtz; Carol Lushbough; Volker Brendel
Journal:  Plant Physiol       Date:  2005-10       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.