Literature DB >> 12611807

Alignment-free sequence comparison-a review.

Susana Vinga1, Jonas Almeida.   

Abstract

MOTIVATION: Genetic recombination and, in particular, genetic shuffling are at odds with sequence comparison by alignment, which assumes conservation of contiguity between homologous segments. A variety of theoretical foundations are being used to derive alignment-free methods that overcome this limitation. The formulation of alternative metrics for dissimilarity between sequences and their algorithmic implementations are reviewed.
RESULTS: The overwhelming majority of work on alignment-free sequence has taken place in the past two decades, with most reports published in the past 5 years. Two main categories of methods have been proposed-methods based on word (oligomer) frequency, and methods that do not require resolving the sequence with fixed word length segments. The first category is based on the statistics of word frequency, on the distances defined in a Cartesian space defined by the frequency vectors, and on the information content of frequency distribution. The second category includes the use of Kolmogorov complexity and Chaos Theory. Despite their low visibility, alignment-free metrics are in fact already widely used as pre-selection filters for alignment-based querying of large applications. Recent work is furthering their usage as a scale-independent methodology that is capable of recognizing homology when loss of contiguity is beyond the possibility of alignment. AVAILABILITY: Most of the alignment-free algorithms reviewed were implemented in MATLAB code and are available at http://bioinformatics.musc.edu/resources.html

Mesh:

Year:  2003        PMID: 12611807     DOI: 10.1093/bioinformatics/btg005

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  193 in total

1.  Local homology recognition and distance measures in linear time using compressed amino acid alphabets.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-01-16       Impact factor: 16.971

2.  Alignment-free detection of horizontal gene transfer between closely related bacterial genomes.

Authors:  Mirjana Domazet-Lošo; Bernhard Haubold
Journal:  Mob Genet Elements       Date:  2011-09-01

3.  Separating significant matches from spurious matches in DNA sequences.

Authors:  Hugo Devillers; Sophie Schbath
Journal:  J Comput Biol       Date:  2011-12-09       Impact factor: 1.479

4.  An alignment-free method to identify candidate orthologous enhancers in multiple Drosophila genomes.

Authors:  Manonmani Arunachalam; Karthik Jayasurya; Pavel Tomancak; Uwe Ohler
Journal:  Bioinformatics       Date:  2010-07-11       Impact factor: 6.937

5.  Protein sequence analysis based on hydropathy profile of amino acids.

Authors:  Xiao-li Xie; Li-fei Zheng; Ying Yu; Li-ping Liang; Man-cai Guo; John Song; Zhi-fa Yuan
Journal:  J Zhejiang Univ Sci B       Date:  2012-02       Impact factor: 3.066

6.  Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet.

Authors:  Denis A Malyshev; Kirandeep Dhami; Henry T Quach; Thomas Lavergne; Phillip Ordoukhanian; Ali Torkamani; Floyd E Romesberg
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-06       Impact factor: 11.205

7.  A study of bias and increasing organismal complexity from their post-translational modifications and reaction site interplays.

Authors:  Oliver Bonham-Carter; Ishwor Thapa; Steven From; Dhundy Bastola
Journal:  Brief Bioinform       Date:  2016-01-13       Impact factor: 11.622

8.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions.

Authors:  Gregory E Sims; Se-Ran Jun; Guohong A Wu; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-02       Impact factor: 11.205

9.  A genomic distance based on MUM indicates discontinuity between most bacterial species and genera.

Authors:  Marc Deloger; Meriem El Karoui; Marie-Agnès Petit
Journal:  J Bacteriol       Date:  2008-10-31       Impact factor: 3.490

10.  Support Values for Genome Phylogenies.

Authors:  Fabian Klötzl; Bernhard Haubold
Journal:  Life (Basel)       Date:  2016-03-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.