Literature DB >> 16201917

Correcting BLAST e-values for low-complexity segments.

Itai Sharon1, Aaron Birkland, Kuan Chang, Ran El-Yaniv, Golan Yona.   

Abstract

The statistical estimates of BLAST and PSI-BLAST are of extreme importance to determine the biological relevance of sequence matches. While being very effective in evaluating most matches, these estimates usually overestimate the significance of matches in the presence of low complexity segments. In this paper, we present a model, based on divergence measures and statistics of the alignment structure, that corrects BLAST e-values for low complexity sequences without filtering or excluding them and generates scores that are more effective in distinguishing true similarities from chance similarities. We evaluate our method and compare it to other known methods using the Gene Ontology (GO) knowledge resource as a benchmark. Various performance measures, including ROC analysis, indicate that the new model improves upon the state of the art. The program is available at biozon.org/ftp/ and www.cs.technion.ac.il/ approximately itaish/lowcomp/.

Mesh:

Year:  2005        PMID: 16201917     DOI: 10.1089/cmb.2005.12.980

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches.

Authors:  Yi-Kuo Yu; E Michael Gertz; Richa Agarwala; Alejandro A Schäffer; Stephen F Altschul
Journal:  Nucleic Acids Res       Date:  2006-10-26       Impact factor: 16.971

2.  In silico characterization of tandem repeats in Trichophyton rubrum and related dermatophytes provides new insights into their role in pathogenesis.

Authors:  Matheus Eloy Franco; Tamires Aparecida Bitencourt; Mozart Marins; Ana Lúcia Fachin
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

3.  Exploiting a Reference Genome in Terms of Duplications: The Network of Paralogs and Single Copy Genes in Arabidopsis thaliana.

Authors:  Mara Sangiovanni; Alessandra Vigilante; Maria Luisa Chiusano
Journal:  Biology (Basel)       Date:  2013-12-09
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.