Literature DB >> 24162172

Sequence analysis by iterated maps, a review.

Jonas S Almeida1.   

Abstract

Among alignment-free methods, Iterated Maps (IMs) are on a particular extreme: they are also scale free (order free). The use of IMs for sequence analysis is also distinct from other alignment-free methodologies in being rooted in statistical mechanics instead of computational linguistics. Both of these roots go back over two decades to the use of fractal geometry in the characterization of phase-space representations. The time series analysis origin of the field is betrayed by the title of the manuscript that started this alignment-free subdomain in 1990, 'Chaos Game Representation'. The clash between the analysis of sequences as continuous series and the better established use of Markovian approaches to discrete series was almost immediate, with a defining critique published in same journal 2 years later. The rest of that decade would go by before the scale-free nature of the IM space was uncovered. The ensuing decade saw this scalability generalized for non-genomic alphabets as well as an interest in its use for graphic representation of biological sequences. Finally, in the past couple of years, in step with the emergence of BigData and MapReduce as a new computational paradigm, there is a surprising third act in the IM story. Multiple reports have described gains in computational efficiency of multiple orders of magnitude over more conventional sequence analysis methodologies. The stage appears to be now set for a recasting of IMs with a central role in processing nextgen sequencing results.

Keywords:  alignment-free; big data; chaos game; iterated maps; mapreduce; sequence analysis

Mesh:

Year:  2013        PMID: 24162172      PMCID: PMC4017330          DOI: 10.1093/bib/bbt072

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  37 in total

1.  Analysis of genomic sequences by Chaos Game Representation.

Authors:  J S Almeida; J A Carriço; A Maretzek; P A Noble; M Fletcher
Journal:  Bioinformatics       Date:  2001-05       Impact factor: 6.937

Review 2.  Alignment-free sequence comparison-a review.

Authors:  Susana Vinga; Jonas Almeida
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

Review 3.  Graphical representation of proteins.

Authors:  Milan Randić; Jure Zupan; Alexandru T Balaban; Drazen Vikić-Topić; Dejan Plavsić
Journal:  Chem Rev       Date:  2010-10-12       Impact factor: 60.622

4.  How long is the coast of britain? Statistical self-similarity and fractional dimension.

Authors:  B Mandelbrot
Journal:  Science       Date:  1967-05-05       Impact factor: 47.728

5.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

6.  Exceptional motifs in different Markov chain models for a statistical analysis of DNA sequences.

Authors:  S Schbath; B Prum; E de Turckheim
Journal:  J Comput Biol       Date:  1995       Impact factor: 1.479

7.  Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent species.

Authors:  K A Hill; N J Schisler; S M Singh
Journal:  J Mol Evol       Date:  1992-09       Impact factor: 2.395

8.  Biological sequences as pictures: a generic two dimensional solution for iterated maps.

Authors:  Jonas S Almeida; Susana Vinga
Journal:  BMC Bioinformatics       Date:  2009-03-31       Impact factor: 3.169

9.  Chaos game representation for comparison of whole genomes.

Authors:  Jijoy Joseph; Roschen Sasikumar
Journal:  BMC Bioinformatics       Date:  2006-05-05       Impact factor: 3.169

10.  Local Renyi entropic profiles of DNA sequences.

Authors:  Susana Vinga; Jonas S Almeida
Journal:  BMC Bioinformatics       Date:  2007-10-16       Impact factor: 3.169

View more
  15 in total

Review 1.  Lung cancer-a fractal viewpoint.

Authors:  Frances E Lennon; Gianguido C Cianci; Nicole A Cipriani; Thomas A Hensing; Hannah J Zhang; Chin-Tu Chen; Septimiu D Murgu; Everett E Vokes; Michael W Vannier; Ravi Salgia
Journal:  Nat Rev Clin Oncol       Date:  2015-07-14       Impact factor: 66.675

2.  Interpreting alignment-free sequence comparison: what makes a score a good score?

Authors:  Martin T Swain; Martin Vickers
Journal:  NAR Genom Bioinform       Date:  2022-09-05

3.  An efficient numerical representation of genome sequence: natural vector with covariance component.

Authors:  Nan Sun; Xin Zhao; Stephen S-T Yau
Journal:  PeerJ       Date:  2022-06-16       Impact factor: 3.061

4.  An investigation into inter- and intragenomic variations of graphic genomic signatures.

Authors:  Rallis Karamichalis; Lila Kari; Stavros Konstantinidis; Steffen Kopecki
Journal:  BMC Bioinformatics       Date:  2015-08-07       Impact factor: 3.169

Review 5.  Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data.

Authors:  Ivo D Dinov
Journal:  Gigascience       Date:  2016-02-25       Impact factor: 6.524

6.  RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

Authors:  Ying Li; Xiaohu Shi; Yanchun Liang; Juan Xie; Yu Zhang; Qin Ma
Journal:  BMC Bioinformatics       Date:  2017-01-21       Impact factor: 3.169

7.  Benchmarking of alignment-free sequence comparison methods.

Authors:  Andrzej Zielezinski; Hani Z Girgis; Guillaume Bernard; Chris-Andre Leimeister; Kujin Tang; Thomas Dencker; Anna Katharina Lau; Sophie Röhling; Jae Jin Choi; Michael S Waterman; Matteo Comin; Sung-Hou Kim; Susana Vinga; Jonas S Almeida; Cheong Xin Chan; Benjamin T James; Fengzhu Sun; Burkhard Morgenstern; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2019-07-25       Impact factor: 13.583

8.  Additive methods for genomic signatures.

Authors:  Rallis Karamichalis; Lila Kari; Stavros Konstantinidis; Steffen Kopecki; Stephen Solis-Reyes
Journal:  BMC Bioinformatics       Date:  2016-08-22       Impact factor: 3.169

Review 9.  Alignment-free sequence comparison: benefits, applications, and tools.

Authors:  Andrzej Zielezinski; Susana Vinga; Jonas Almeida; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2017-10-03       Impact factor: 13.583

Review 10.  ALUminating the Path of Atherosclerosis Progression: Chaos Theory Suggests a Role for Alu Repeats in the Development of Atherosclerotic Vascular Disease.

Authors:  Miguel Hueso; Josep M Cruzado; Joan Torras; Estanislao Navarro
Journal:  Int J Mol Sci       Date:  2018-06-12       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.