Literature DB >> 15290755

Optimal spaced seeds for homologous coding regions.

Broña Brejová1, Daniel G Brown, Tomás Vinar.   

Abstract

Optimal spaced seeds were developed as a method to increase sensitivity of local alignment programs similar to BLASTN. Such seeds have been used before in the program PatternHunter, and have given improved sensitivity and running time relative to BLASTN in genome-genome comparison. We study the problem of computing optimal spaced seeds for detecting homologous coding regions in unannotated genomic sequences. By using well-chosen seeds, we are able to improve the sensitivity of coding sequence alignment over that of TBLASTX, while keeping runtime comparable to BLASTN. We identify good seeds by first giving effective hidden Markov models of conservation in alignments of homologous coding regions. We give an efficient algorithm to compute the optimal spaced seed when conservation patterns are generated by these models. Our results offer the hope of improved gene finding due to fewer missed exons in DNA/DNA comparison, and more effective homology search in general, and may have applications outside of bioinformatics.

Mesh:

Substances:

Year:  2004        PMID: 15290755     DOI: 10.1142/s0219720004000326

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  9 in total

1.  A unifying framework for seed sensitivity and its application to subset seeds.

Authors:  Gregory Kucherov; Laurent Noé; Mikhail Roytberg
Journal:  J Bioinform Comput Biol       Date:  2006-04       Impact factor: 1.122

2.  Global, highly specific and fast filtering of alignment seeds.

Authors:  Matthis Ebel; Giovanna Migliorelli; Mario Stanke
Journal:  BMC Bioinformatics       Date:  2022-06-10       Impact factor: 3.307

3.  Hit integration for identifying optimal spaced seeds.

Authors:  Won-Hyoung Chung; Seong-Bae Park
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

4.  Massive fungal biodiversity data re-annotation with multi-level clustering.

Authors:  Duong Vu; Szániszló Szöke; Christian Wiwie; Jan Baumbach; Gianluigi Cardinali; Richard Röttger; Vincent Robert
Journal:  Sci Rep       Date:  2014-10-30       Impact factor: 4.379

5.  rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison.

Authors:  Lars Hahn; Chris-André Leimeister; Rachid Ounit; Stefano Lonardi; Burkhard Morgenstern
Journal:  PLoS Comput Biol       Date:  2016-10-19       Impact factor: 4.475

6.  Best hits of 11110110111: model-free selection and parameter-free sensitivity calculation of spaced seeds.

Authors:  Laurent Noé
Journal:  Algorithms Mol Biol       Date:  2017-02-14       Impact factor: 1.405

7.  Using multiple alignments to improve seeded local alignment algorithms.

Authors:  Jason Flannick; Serafim Batzoglou
Journal:  Nucleic Acids Res       Date:  2005-08-12       Impact factor: 16.971

8.  Universal seeds for cDNA-to-genome comparison.

Authors:  Leming Zhou; Jonathan Stanton; Liliana Florea
Journal:  BMC Bioinformatics       Date:  2008-01-23       Impact factor: 3.169

9.  Improved search heuristics find 20,000 new alignments between human and mouse genomes.

Authors:  Martin C Frith; Laurent Noé
Journal:  Nucleic Acids Res       Date:  2014-02-03       Impact factor: 16.971

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.