Literature DB >> 22081520

Representing and comparing protein folds and fold families using three-dimensional shape-density representations.

Lazaros Mavridis1, Anisah W Ghoorah, Vishwesh Venkatraman, David W Ritchie.   

Abstract

The question of how best to compare and classify the (three-dimensional) structures of proteins is one of the most important unsolved problems in computational biology. To help tackle this problem, we have developed a novel shape-density superposition algorithm called 3D-Blast which represents and superposes the shapes of protein backbone folds using the spherical polar Fourier correlation technique originally developed by us for protein docking. The utility of this approach is compared with several well-known protein structure alignment algorithms using receiver-operator-characteristic plots of queries against the "gold standard" CATH database. Despite being completely independent of protein sequences and using no information about the internal geometry of proteins, our results from searching the CATH database show that 3D-Blast is highly competitive compared to current state-of-the-art protein structure alignment algorithms. A novel and potentially very useful feature of our approach is that it allows an average or "consensus" fold to be calculated easily for a given group of protein structures. We find that using consensus shapes to represent entire fold families also gives very good database query performance. We propose that using the notion of consensus fold shapes could provide a powerful new way to index existing protein structure databases, and that it offers an objective way to cluster and classify all of the currently known folds in the protein universe.
Copyright © 2011 Wiley Periodicals, Inc.

Keywords:  protein alignment; protein classification; protein clustering; protein comparison; protein indexing; three-dimensional protein shapes; three-dimensional superpositions

Mesh:

Substances:

Year:  2011        PMID: 22081520     DOI: 10.1002/prot.23218

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  PFClust: an optimised implementation of a parameter-free clustering algorithm.

Authors:  Khadija Musayeva; Tristan Henderson; John Bo Mitchell; Lazaros Mavridis
Journal:  Source Code Biol Med       Date:  2014-02-04

2.  Real time structural search of the Protein Data Bank.

Authors:  Dmytro Guzenko; Stephen K Burley; Jose M Duarte
Journal:  PLoS Comput Biol       Date:  2020-07-08       Impact factor: 4.475

3.  Virtual interactomics of proteins from biochemical standpoint.

Authors:  Jaroslav Kubrycht; Karel Sigler; Pavel Souček
Journal:  Mol Biol Int       Date:  2012-08-08

4.  PFClust: a novel parameter free clustering algorithm.

Authors:  Lazaros Mavridis; Neetika Nath; John B O Mitchell
Journal:  BMC Bioinformatics       Date:  2013-07-03       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.