Literature DB >> 12855449

Sensitive pattern discovery with 'fuzzy' alignments of distantly related proteins.

Andreas Heger1, Liisa Holm.   

Abstract

MOTIVATION: Evolutionary comparison leads to efficient functional characterisation of hypothetical proteins. Here, our goal is to map specific sequence patterns to putative functional classes. The evolutionary signal stands out most clearly in a maximally diverse set of homologues. This diversity, however, leads to a number of technical difficulties. The targeted patterns-as gleaned from structure comparisons-are too sparse for statistically significant signals of sequence similarity and accurate multiple sequence alignment.
RESULTS: We address this problem by a fuzzy alignment model, which probabilistically assigns residues to structurally equivalent positions (attributes) of the proteins. We then apply multivariate analysis to the 'attributes x proteins' matrix. The dimensionality of the space is reduced using non-negative matrix factorization. The method is general, fully automatic and works without assumptions about pattern density, minimum support, explicit multiple alignments, phylogenetic trees, etc. We demonstrate the discovery of biologically meaningful patterns in an extremely diverse superfamily related to urease.

Mesh:

Substances:

Year:  2003        PMID: 12855449     DOI: 10.1093/bioinformatics/btg1017

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

2.  Theme discovery from gene lists for identification and viewing of multiple functional groups.

Authors:  Petri Pehkonen; Garry Wong; Petri Törönen
Journal:  BMC Bioinformatics       Date:  2005-06-29       Impact factor: 3.169

3.  Discovering semantic features in the literature: a foundation for building functional associations.

Authors:  Monica Chagoyen; Pedro Carmona-Saez; Hagit Shatkay; Jose M Carazo; Alberto Pascual-Montano
Journal:  BMC Bioinformatics       Date:  2006-01-26       Impact factor: 3.169

4.  Biclustering of gene expression data by Non-smooth Non-negative Matrix Factorization.

Authors:  Pedro Carmona-Saez; Roberto D Pascual-Marqui; F Tirado; Jose M Carazo; Alberto Pascual-Montano
Journal:  BMC Bioinformatics       Date:  2006-02-17       Impact factor: 3.169

5.  Fuzzy logic in medicine and bioinformatics.

Authors:  Angela Torres; Juan J Nieto
Journal:  J Biomed Biotechnol       Date:  2006

6.  bioNMF: a versatile tool for non-negative matrix factorization in biology.

Authors:  Alberto Pascual-Montano; Pedro Carmona-Saez; Monica Chagoyen; Francisco Tirado; Jose M Carazo; Roberto D Pascual-Marqui
Journal:  BMC Bioinformatics       Date:  2006-07-28       Impact factor: 3.169

7.  PairsDB atlas of protein sequence space.

Authors:  Andreas Heger; Eija Korpelainen; Taavi Hupponen; Kimmo Mattila; Vesa Ollikainen; Liisa Holm
Journal:  Nucleic Acids Res       Date:  2007-11-05       Impact factor: 16.971

Review 8.  Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.

Authors:  Karthik Devarajan
Journal:  PLoS Comput Biol       Date:  2008-07-25       Impact factor: 4.475

9.  bioNMF: a web-based tool for nonnegative matrix factorization in biology.

Authors:  E Mejía-Roa; P Carmona-Saez; R Nogales; C Vicente; M Vázquez; X Y Yang; C García; F Tirado; A Pascual-Montano
Journal:  Nucleic Acids Res       Date:  2008-05-30       Impact factor: 16.971

10.  Identification of an ideal-like fingerprint for a protein fold using overlapped conserved residues based approach.

Authors:  Amit Goyal; Sriram Sokalingam; Kyu-Suk Hwang; Sun-Gu Lee
Journal:  Sci Rep       Date:  2014-07-10       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.