Literature DB >> 24096012

Pattern recognition and probabilistic measures in alignment-free sequence analysis.

Isabel Schwende1, Tuan D Pham.   

Abstract

With the massive production of genomic and proteomic data, the number of available biological sequences in databases has reached a level that is not feasible anymore for exact alignments even when just a fraction of all sequences is used. To overcome this inevitable time complexity, ultrafast alignment-free methods are studied. Within the past two decades, a broad variety of nonalignment methods have been proposed including dissimilarity measures on classical representations of sequences like k-words or Markov models. Furthermore, articles were published that describe distance measures on alternative representations such as compression complexity, spectral time series or chaos game representation. However, alignments are still the standard method for real world applications in biological sequence analysis, and the time efficient alignment-free approaches are usually applied in cases when the accustomed algorithms turn out to fail or be too inconvenient.

Keywords:  alignment-free; distance measures; distortion measures; pattern classification; sequence comparison; signal processing

Mesh:

Year:  2013        PMID: 24096012     DOI: 10.1093/bib/bbt070

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  10 in total

1.  On the Natural Structure of Amino Acid Patterns in Families of Protein Sequences.

Authors:  Pablo Turjanski; Diego U Ferreiro
Journal:  J Phys Chem B       Date:  2018-10-08       Impact factor: 2.991

2.  CLAP: a web-server for automatic classification of proteins with special reference to multi-domain proteins.

Authors:  Mutharasu Gnanavel; Prachi Mehrotra; Ramaswamy Rakshambikai; Juliette Martin; Narayanaswamy Srinivasan; Ramachandra M Bhaskara
Journal:  BMC Bioinformatics       Date:  2014-10-04       Impact factor: 3.169

3.  A weighted string kernel for protein fold recognition.

Authors:  Saghi Nojoomi; Patrice Koehl
Journal:  BMC Bioinformatics       Date:  2017-08-25       Impact factor: 3.169

4.  Reconstructing evolutionary trees in parallel for massive sequences.

Authors:  Quan Zou; Shixiang Wan; Xiangxiang Zeng; Zhanshan Sam Ma
Journal:  BMC Syst Biol       Date:  2017-12-14

5.  String kernels for protein sequence comparisons: improved fold recognition.

Authors:  Saghi Nojoomi; Patrice Koehl
Journal:  BMC Bioinformatics       Date:  2017-02-28       Impact factor: 3.169

6.  Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix.

Authors:  Lulu Yu; Yusen Zhang; Ivan Gutman; Yongtang Shi; Matthias Dehmer
Journal:  Sci Rep       Date:  2017-04-10       Impact factor: 4.379

7.  A new graph-theoretic approach to determine the similarity of genome sequences based on nucleotide triplets.

Authors:  Subhram Das; Arijit Das; D K Bhattacharya; D N Tibarewala
Journal:  Genomics       Date:  2020-08-19       Impact factor: 5.736

8.  Additive methods for genomic signatures.

Authors:  Rallis Karamichalis; Lila Kari; Stavros Konstantinidis; Steffen Kopecki; Stephen Solis-Reyes
Journal:  BMC Bioinformatics       Date:  2016-08-22       Impact factor: 3.169

Review 9.  Alignment-free sequence comparison: benefits, applications, and tools.

Authors:  Andrzej Zielezinski; Susana Vinga; Jonas Almeida; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2017-10-03       Impact factor: 13.583

10.  SSAW: A new sequence similarity analysis method based on the stationary discrete wavelet transform.

Authors:  Jie Lin; Jing Wei; Donald Adjeroh; Bing-Hua Jiang; Yue Jiang
Journal:  BMC Bioinformatics       Date:  2018-05-02       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.