Literature DB >> 15979044

GAME: a simple and efficient whole genome alignment method using maximal exact match filtering.

Jeong-Hyeon Choi1, Hwan-Gue Cho, Sun Kim.   

Abstract

In this paper, we present a simple and efficient whole genome alignment method using maximal exact match (MEM). The major problem with the use of MEM anchor is that the number of hits in non-homologous regions increases exponentially when shorter MEM anchors are used to detect more homologous regions. To deal with this problem, we have developed a fast and accurate anchor filtering scheme based on simple match extension with minimum percent identity and extension length criteria. Due to its simplicity and accuracy, all MEM anchors in a pair of genomes can be exhaustively tested and filtered. In addition, by incorporating the translation technique, the alignment quality and speed of our genome alignment algorithm have been further improved. As a result, our genome alignment algorithm, GAME (Genome Alignment by Match Extension), performs competitively over existing algorithms and can align large whole genomes, e.g., A. thaliana, without the requirement of typical large memory and parallel processors. This is shown using an experiment which compares the performance of BLAST, BLASTZ, PatternHunter, MUMmer and our algorithm in aligning all 45 pairs of 10 microbial genomes. The scalability of our algorithm is shown in another experiment where all pairs of five chromosomes in A. thaliana were compared.

Entities:  

Mesh:

Year:  2005        PMID: 15979044     DOI: 10.1016/j.compbiolchem.2005.04.004

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  7 in total

1.  A practical algorithm for finding maximal exact matches in large sequence datasets using sparse suffix arrays.

Authors:  Zia Khan; Joshua S Bloom; Leonid Kruglyak; Mona Singh
Journal:  Bioinformatics       Date:  2009-04-23       Impact factor: 6.937

2.  Genome-wide DNA methylation maps in follicular lymphoma cells determined by methylation-enriched bisulfite sequencing.

Authors:  Jeong-Hyeon Choi; Yajun Li; Juyuan Guo; Lirong Pei; Tibor A Rauch; Robin S Kramer; Simone L Macmil; Graham B Wiley; Lynda B Bennett; Jennifer L Schnabel; Kristen H Taylor; Sun Kim; Dong Xu; Arun Sreekumar; Gerd P Pfeifer; Bruce A Roe; Charles W Caldwell; Kapil N Bhalla; Huidong Shi
Journal:  PLoS One       Date:  2010-09-29       Impact factor: 3.240

3.  De novo identification of LTR retrotransposons in eukaryotic genomes.

Authors:  Mina Rho; Jeong-Hyeon Choi; Sun Kim; Michael Lynch; Haixu Tang
Journal:  BMC Genomics       Date:  2007-04-03       Impact factor: 3.969

4.  CGAT: a comparative genome analysis tool for visualizing alignments in the analysis of complex evolutionary changes between closely related genomes.

Authors:  Ikuo Uchiyama; Toshio Higuchi; Ichizo Kobayashi
Journal:  BMC Bioinformatics       Date:  2006-10-24       Impact factor: 3.169

5.  NucDiff: in-depth characterization and annotation of differences between two sets of DNA sequences.

Authors:  Ksenia Khelik; Karin Lagesen; Geir Kjetil Sandve; Torbjørn Rognes; Alexander Johan Nederbragt
Journal:  BMC Bioinformatics       Date:  2017-07-12       Impact factor: 3.169

6.  Long read alignment based on maximal exact match seeds.

Authors:  Yongchao Liu; Bertil Schmidt
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

7.  DART: a fast and accurate RNA-seq mapper with a partitioning strategy.

Authors:  Hsin-Nan Lin; Wen-Lian Hsu
Journal:  Bioinformatics       Date:  2018-01-15       Impact factor: 6.937

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.