Literature DB >> 12176833

Simplifying amino acid alphabets by means of a branch and bound algorithm and substitution matrices.

Nicola Cannata1, Stefano Toppo, Chiara Romualdi, Giorgio Valle.   

Abstract

MOTIVATION: Protein and DNA are generally represented by sequences of letters. In a number of circumstances simplified alphabets (where one or more letters would be represented by the same symbol) have proved their potential utility in several fields of bioinformatics including searching for patterns occurring at an unexpected rate, studying protein folding and finding consensus sequences in multiple alignments. The main issue addressed in this paper is the possibility of finding a general approach that would allow an exhaustive analysis of all the possible simplified alphabets, using substitution matrices like PAM and BLOSUM as a measure for scoring.
RESULTS: The computational approach presented in this paper has led to a computer program called AlphaSimp (Alphabet Simplifier) that can perform an exhaustive analysis of the possible simplified amino acid alphabets, using a branch and bound algorithm together with standard or user-defined substitution matrices. The program returns a ranked list of the highest-scoring simplified alphabets. When the extent of the simplification is limited and the simplified alphabets are maintained above ten symbols the program is able to complete the analysis in minutes or even seconds on a personal computer. However, the performance becomes worse, taking up to several hours, for highly simplified alphabets. AVAILABILITY: AlphaSimp and other accessory programs are available at http://bioinformatics.cribi.unipd.it/alphasimp

Entities:  

Mesh:

Year:  2002        PMID: 12176833     DOI: 10.1093/bioinformatics/18.8.1102

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  A method for computing the inter-residue interaction potentials for reduced amino acid alphabet.

Authors:  Abhinav Luthra; Anupam Nath Jha; G K Ananthasuresh; Saraswathi Vishveswara
Journal:  J Biosci       Date:  2007-08       Impact factor: 1.826

2.  Assessing side-chain perturbations of the protein backbone: a knowledge-based classification of residue Ramachandran space.

Authors:  David B Dahl; Zach Bohannan; Qianxing Mo; Marina Vannucci; Jerry Tsai
Journal:  J Mol Biol       Date:  2008-02-29       Impact factor: 5.469

3.  Optimal neighborhood indexing for protein similarity search.

Authors:  Pierre Peterlongo; Laurent Noé; Dominique Lavenier; Van Hoa Nguyen; Gregory Kucherov; Mathieu Giraud
Journal:  BMC Bioinformatics       Date:  2008-12-16       Impact factor: 3.169

4.  Prediction of Metal Ion Binding Sites in Proteins from Amino Acid Sequences by Using Simplified Amino Acid Alphabets and Random Forest Model.

Authors:  Suresh Kumar
Journal:  Genomics Inform       Date:  2017-12-29

5.  Unearthing the root of amino acid similarity.

Authors:  James D Stephenson; Stephen J Freeland
Journal:  J Mol Evol       Date:  2013-06-07       Impact factor: 2.395

Review 6.  Research progress of reduced amino acid alphabets in protein analysis and prediction.

Authors:  Yuchao Liang; Siqi Yang; Lei Zheng; Hao Wang; Jian Zhou; Shenghui Huang; Lei Yang; Yongchun Zuo
Journal:  Comput Struct Biotechnol J       Date:  2022-07-04       Impact factor: 6.155

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.