Literature DB >> 12490453

A hierarchical approach to aligning collinear regions of genomes.

Mikhail A Roytberg1, Aleksey Y Ogurtsov, Svetlana A Shabalina, Alexey S Kondrashov.   

Abstract

MOTIVATION: As a first approximation, similarity between two long orthologous regions of genomes can be represented by a chain of local similarities. Within such a chain, pairs of successive similarities are collinear (non-conflicting), i.e. segments involved in the nth similarity precede in both sequences segments involved in the (n+1)th similarity. However, when all similarities between two long sequences are considered, usually there are many conflicts between them. Although some conflicts can be avoided by masking transposons or low-complexity sequences, selecting only those similarities that reflect orthology and, thus, belong to the evolutionarily true chain is not trivial.
RESULTS: We propose a simple, hierarchical algorithm of finding the true chain of local similarities. Starting from similarities with low P-values, we resolve each pairwise conflict by deleting a similarity with a higher P-value. This greedy approach constructs a chain of similarities faster than when a chain optimal with respect to some global criterion is sought, and makes more sense biologically.

Entities:  

Mesh:

Year:  2002        PMID: 12490453     DOI: 10.1093/bioinformatics/18.12.1673

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  Multiple sequence alignment with user-defined anchor points.

Authors:  Burkhard Morgenstern; Sonja J Prohaska; Dirk Pöhler; Peter F Stadler
Journal:  Algorithms Mol Biol       Date:  2006-04-19       Impact factor: 1.405

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.