Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping.

Literature DB >> 29684140

Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping.

Daniele Raimondi^1,2,3,4, Gabriele Orlando^1,2,4, Yves Moreau^3,5, Wim F Vranken^1,2.

Abstract

Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The amount of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing amount of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations.
Results: We developed a discriminative, alignment-free algorithm for homology detection with quasi-linear complexity, enabling theoretically much faster homology searches. To reach this goal, we convert the protein sequence into numeric biophysical representations. These are shrunk to a fixed length using a novel vector quantization method which uses a Discrete Cosine Transform compression. We then compute, for each compressed representation, similarity scores between proteins with the Dynamic Time Warping algorithm and we feed them into a Random Forest. The WARP performances are comparable with state of the art methods. Availability and implementation: The method is available at http://ibsquare.be/warp. Supplementary information: Supplementary data are available at Bioinformatics online.

Mesh：

Substances：
Proteins

Year: 2018 PMID： 29684140 DOI： 10.1093/bioinformatics/bty309

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

3 in total

1. ShiftCrypt: a web server to understand and biophysically align proteins through their NMR chemical shift values.

Authors: Gabriele Orlando; Daniele Raimondi; Luciano Porto Kagami; Wim F Vranken
Journal: Nucleic Acids Res Date: 2020-07-02 Impact factor: 16.971

2. Insight into the protein solubility driving forces with neural attention.

Authors: Daniele Raimondi; Gabriele Orlando; Piero Fariselli; Yves Moreau
Journal: PLoS Comput Biol Date: 2020-04-30 Impact factor: 4.475

3. Exploring the limitations of biophysical propensity scales coupled with machine learning for protein sequence analysis.

Authors: Daniele Raimondi; Gabriele Orlando; Wim F Vranken; Yves Moreau
Journal: Sci Rep Date: 2019-11-15 Impact factor: 4.379

3 in total