Literature DB >> 11807944

Singular value decomposition analysis of protein sequence alignment score data.

F Fogolari1, S Tessari, H Molinari.   

Abstract

One of the standard tools for the analysis of data arranged in matrix form is singular value decomposition (SVD). Few applications to genomic data have been reported to date mainly for the analysis of gene expression microarray data. We review SVD properties, examine mathematical terms and assumptions implicit in the SVD formalism, and show that SVD can be applied to the analysis of matrices representing pairwise alignment scores between large sets of protein sequences. In particular, we illustrate SVD capabilities for data dimension reduction and for clustering protein sequences. A comparison is performed between SVD-generated clusters of proteins and annotation reported in the SWISS-PROT Database for a set of protein sequences forming the calycin superfamily, entailing all entries corresponding to the lipocalin, cytosolic fatty acid-binding protein, and avidin-streptavidin Prosite patterns. Copyright 2001 Wiley-Liss, Inc.

Entities:  

Mesh:

Year:  2002        PMID: 11807944     DOI: 10.1002/prot.10032

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  Probabilistic assembly of human protein interaction networks from label-free quantitative proteomics.

Authors:  Mihaela E Sardiu; Yong Cai; Jingji Jin; Selene K Swanson; Ronald C Conaway; Joan W Conaway; Laurence Florens; Michael P Washburn
Journal:  Proc Natl Acad Sci U S A       Date:  2008-01-24       Impact factor: 11.205

2.  A singular value decomposition approach for improved taxonomic classification of biological sequences.

Authors:  Anderson R Santos; Marcos A Santos; Jan Baumbach; John A McCulloch; Guilherme C Oliveira; Artur Silva; Anderson Miyoshi; Vasco Azevedo
Journal:  BMC Genomics       Date:  2011-12-22       Impact factor: 3.969

3.  Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

Authors:  Karel Zimmermann; Jean-François Gibrat
Journal:  BMC Bioinformatics       Date:  2010-01-04       Impact factor: 3.169

4.  Subfamily specific conservation profiles for proteins based on n-gram patterns.

Authors:  John K Vries; Xiong Liu
Journal:  BMC Bioinformatics       Date:  2008-01-30       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.