Literature DB >> 29211823

Identifying functionally informative evolutionary sequence profiles.

Nelson Gil1, Andras Fiser1.   

Abstract

Motivation: Multiple sequence alignments (MSAs) can provide essential input to many bioinformatics applications, including protein structure prediction and functional annotation. However, the optimal selection of sequences to obtain biologically informative MSAs for such purposes is poorly explored, and has traditionally been performed manually.
Results: We present Selection of Alignment by Maximal Mutual Information (SAMMI), an automated, sequence-based approach to objectively select an optimal MSA from a large set of alternatives sampled from a general sequence database search. The hypothesis of this approach is that the mutual information among MSA columns will be maximal for those MSAs that contain the most diverse set possible of the most structurally and functionally homogeneous protein sequences. SAMMI was tested to select MSAs for functional site residue prediction by analysis of conservation patterns on a set of 435 proteins obtained from protein-ligand (peptides, nucleic acids and small substrates) and protein-protein interaction databases. Availability and implementation: A freely accessible program, including source code, implementing SAMMI is available at https://github.com/nelsongil92/SAMMI.git. Contact: andras.fiser@einstein.yu.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2018        PMID: 29211823      PMCID: PMC5905606          DOI: 10.1093/bioinformatics/btx779

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  55 in total

1.  AltAVisT: comparing alternative multiple sequence alignments.

Authors:  Burkhard Morgenstern; Sachin Goel; Alexander Sczyrba; Andreas Dress
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

2.  Amino acid sequence analysis of the annexin super-gene family of proteins.

Authors:  G J Barton; R H Newman; P S Freemont; M J Crumpton
Journal:  Eur J Biochem       Date:  1991-06-15

3.  Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information.

Authors:  Cristina Marino Buslje; Javier Santos; Jose Maria Delfino; Morten Nielsen
Journal:  Bioinformatics       Date:  2009-03-10       Impact factor: 6.937

4.  Multiple sequence threading: an analysis of alignment quality and stability.

Authors:  W R Taylor
Journal:  J Mol Biol       Date:  1997-06-27       Impact factor: 5.469

5.  Functional classification of immune regulatory proteins.

Authors:  Rotem Rubinstein; Udupi A Ramagopal; Stanley G Nathenson; Steven C Almo; Andras Fiser
Journal:  Structure       Date:  2013-04-11       Impact factor: 5.006

6.  Functional clustering of immunoglobulin superfamily proteins with protein-protein interaction information calibrated hidden Markov model sequence profiles.

Authors:  Eng-Hui Yap; Tyler Rosche; Steve Almo; Andras Fiser
Journal:  J Mol Biol       Date:  2013-11-16       Impact factor: 5.469

7.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.

Authors:  David T Jones; Tanya Singh; Tomasz Kosciolek; Stuart Tetchner
Journal:  Bioinformatics       Date:  2014-11-26       Impact factor: 6.937

8.  CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.

Authors:  Stefan Seemayer; Markus Gruber; Johannes Söding
Journal:  Bioinformatics       Date:  2014-07-26       Impact factor: 6.937

9.  BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions.

Authors:  Jianyi Yang; Ambrish Roy; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-10-18       Impact factor: 16.971

10.  Protein interface classification by evolutionary analysis.

Authors:  Jose M Duarte; Adam Srebniak; Martin A Schärer; Guido Capitani
Journal:  BMC Bioinformatics       Date:  2012-12-22       Impact factor: 3.169

View more
  3 in total

1.  The choice of sequence homologs included in multiple sequence alignments has a dramatic impact on evolutionary conservation analysis.

Authors:  Nelson Gil; Andras Fiser
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

2.  Discovery of receptor-ligand interfaces in the immunoglobulin superfamily.

Authors:  Nelson Gil; Eduardo J Fajardo; Andras Fiser
Journal:  Proteins       Date:  2019-07-29

3.  Protein-protein binding supersites.

Authors:  Raji Viswanathan; Eduardo Fajardo; Gabriel Steinberg; Matthew Haller; Andras Fiser
Journal:  PLoS Comput Biol       Date:  2019-01-07       Impact factor: 4.779

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.