Literature DB >> 16837522

Remote homology detection based on oligomer distances.

Thomas Lingner1, Peter Meinicke.   

Abstract

MOTIVATION: Remote homology detection is among the most intensively researched problems in bioinformatics. Currently discriminative approaches, especially kernel-based methods, provide the most accurate results. However, kernel methods also show several drawbacks: in many cases prediction of new sequences is computationally expensive, often kernels lack an interpretable model for analysis of characteristic sequence features, and finally most approaches make use of so-called hyperparameters which complicate the application of methods across different datasets.
RESULTS: We introduce a feature vector representation for protein sequences based on distances between short oligomers. The corresponding feature space arises from distance histograms for any possible pair of K-mers. Our distance-based approach shows important advantages in terms of computational speed while on common test data the prediction performance is highly competitive with state-of-the-art methods for protein remote homology detection. Furthermore the learnt model can easily be analyzed in terms of discriminative features and in contrast to other methods our representation does not require any tuning of kernel hyperparameters. AVAILABILITY: Normalized kernel matrices for the experimental setup can be downloaded at www.gobics.de/thomas. Matlab code for computing the kernel matrices is available upon request. CONTACT: thomas@gobics.de, peter@gobics.de.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16837522     DOI: 10.1093/bioinformatics/btl376

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

1.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

2.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

3.  UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

Authors:  Peter Meinicke
Journal:  BMC Genomics       Date:  2009-09-02       Impact factor: 3.969

4.  Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics.

Authors:  Nico Pfeifer; Andreas Leinenbach; Christian G Huber; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2007-11-30       Impact factor: 3.169

5.  Using amino acid physicochemical distance transformation for fast protein remote homology detection.

Authors:  Bin Liu; Xiaolong Wang; Qingcai Chen; Qiwen Dong; Xun Lan
Journal:  PLoS One       Date:  2012-09-28       Impact factor: 3.240

6.  MS4--Multi-Scale Selector of Sequence Signatures: an alignment-free method for classification of biological sequences.

Authors:  Eduardo Corel; Florian Pitschi; Ivan Laprevotte; Gilles Grasseau; Gilles Didier; Claudine Devauchelle
Journal:  BMC Bioinformatics       Date:  2010-07-30       Impact factor: 3.169

7.  A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models.

Authors:  Juliana S Bernardes; Alessandra Carbone; Gerson Zaverucha
Journal:  BMC Bioinformatics       Date:  2011-03-23       Impact factor: 3.169

8.  A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2008-12-01       Impact factor: 3.169

9.  HIV-1 coreceptor usage prediction without multiple alignments: an application of string kernels.

Authors:  Sébastien Boisvert; Mario Marchand; François Laviolette; Jacques Corbeil
Journal:  Retrovirology       Date:  2008-12-04       Impact factor: 4.602

10.  Word correlation matrices for protein sequence analysis and remote homology detection.

Authors:  Thomas Lingner; Peter Meinicke
Journal:  BMC Bioinformatics       Date:  2008-06-03       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.