Literature DB >> 12855437

Glocal alignment: finding rearrangements during alignment.

Michael Brudno1, Sanket Malde, Alexander Poliakov, Chuong B Do, Olivier Couronne, Inna Dubchak, Serafim Batzoglou.   

Abstract

MOTIVATION: To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. The two main classes of pairwise alignments are global alignment, where one string is transformed into the other, and local alignment, where all locations of similarity between the two strings are returned. Global alignments are less prone to demonstrating false homology as each letter of one sequence is constrained to being aligned to only one letter of the other. Local alignments, on the other hand, can cope with rearrangements between non-syntenic, orthologous sequences by identifying similar regions in sequences; this, however, comes at the expense of a higher false positive rate due to the inability of local aligners to take into account overall conservation maps.
RESULTS: In this paper we introduce the notion of glocal alignment, a combination of global and local methods, where one creates a map that transforms one sequence into the other while allowing for rearrangement events. We present Shuffle-LAGAN, a glocal alignment algorithm that is based on the CHAOS local alignment algorithm and the LAGAN global aligner, and is able to align long genomic sequences. To test Shuffle-LAGAN we split the mouse genome into BAC-sized pieces, and aligned these pieces to the human genome. We demonstrate that Shuffle-LAGAN compares favorably in terms of sensitivity and specificity with standard local and global aligners. From the alignments we conclude that about 9% of human/mouse homology may be attributed to small rearrangements, 63% of which are duplications.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12855437     DOI: 10.1093/bioinformatics/btg1005

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  142 in total

1.  Evening expression of arabidopsis GIGANTEA is controlled by combinatorial interactions among evolutionarily conserved regulatory motifs.

Authors:  Markus C Berns; Karl Nordström; Frédéric Cremer; Réka Tóth; Martin Hartke; Samson Simon; Jonas R Klasen; Ingmar Bürstel; George Coupland
Journal:  Plant Cell       Date:  2014-10-31       Impact factor: 11.277

2.  Identification of a dopaminergic enhancer indicates complexity in vertebrate dopamine neuron phenotype specification.

Authors:  Esther Fujimoto; Tamara J Stevenson; Chi-Bin Chien; Joshua L Bonkowsky
Journal:  Dev Biol       Date:  2011-01-27       Impact factor: 3.582

3.  Mauve: multiple alignment of conserved genomic sequence with rearrangements.

Authors:  Aaron C E Darling; Bob Mau; Frederick R Blattner; Nicole T Perna
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

Review 4.  Detecting genomic islands using bioinformatics approaches.

Authors:  Morgan G I Langille; William W L Hsiao; Fiona S L Brinkman
Journal:  Nat Rev Microbiol       Date:  2010-05       Impact factor: 60.633

Review 5.  Enhancer identification through comparative genomics.

Authors:  Axel Visel; James Bristow; Len A Pennacchio
Journal:  Semin Cell Dev Biol       Date:  2007-01-05       Impact factor: 7.727

6.  Extreme genomic variation in a natural population.

Authors:  Kerrin S Small; Michael Brudno; Matthew M Hill; Arend Sidow
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-19       Impact factor: 11.205

7.  Distribution and intensity of constraint in mammalian genomic sequence.

Authors:  Gregory M Cooper; Eric A Stone; George Asimenos; Eric D Green; Serafim Batzoglou; Arend Sidow
Journal:  Genome Res       Date:  2005-06-17       Impact factor: 9.043

8.  Multiple whole-genome alignments without a reference organism.

Authors:  Inna Dubchak; Alexander Poliakov; Andrey Kislyuk; Michael Brudno
Journal:  Genome Res       Date:  2009-01-28       Impact factor: 9.043

9.  Gene function prediction based on genomic context clustering and discriminative learning: an application to bacteriophages.

Authors:  Jason Li; Saman K Halgamuge; Christopher I Kells; Sen-Lin Tang
Journal:  BMC Bioinformatics       Date:  2007-05-22       Impact factor: 3.169

10.  Genomic regulatory blocks underlie extensive microsynteny conservation in insects.

Authors:  Pär G Engström; Shannan J Ho Sui; Oyvind Drivenes; Thomas S Becker; Boris Lenhard
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.