
Identifying coding exons by similarity search: Alu-derived and other potentially misleading protein sequences.

J M Claverie

Abstract

The search for significant local similarities with known protein sequences is a powerful method for interpreting anonymous cDNA sequences or locating coding exons within genomic DNA sequences at a stage where the average contig size is still very small. The BLASTx program, implemented on the National Center for Biotechnology Information server, allows a sensitive search of all putative translations of a nucleotide query sequence against all known proteins in a matter of seconds. From an analysis of the current databases, I report a set of protein sequences exhibiting high local similarity to Alu repeat or vector sequences. These entries can lead to misleading interpretations of similarity searches. During the course of this study, the protease of a human spumaretrovirus was found to have integrated the 3' end half of the U2 snRNA.
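The phrase "all putative translations of a nucleotide query sequence" refers to conceptual translation in the six possible reading frames (three per strand). The following is a minimal sketch of that step only, assuming the standard genetic code; it is not the BLASTx implementation, and the function names are hypothetical, chosen for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Map frame labels (+1..+3, -1..-3) to conceptual translations."""
    frames = {}
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(s[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCCTAA").items():
        print(label, protein)
```

In a similarity search of the kind described, each of the six translations would then be compared against a protein database; a query overlapping an Alu repeat or vector sequence could match the misleading entries the abstract warns about in any of these frames.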

Year: 1992  PMID: 1572661  DOI: 10.1016/0888-7543(92)90321-i

Source DB: PubMed  Journal: Genomics  ISSN: 0888-7543  Impact factor: 5.736


