Literature DB >> 24766258

Simultaneous alignment and folding of protein sequences.

Jérôme Waldispühl1, Charles W O'Donnell, Sebastian Will, Srinivas Devadas, Rolf Backofen, Bonnie Berger.   

Abstract

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise and multiple sequence alignment tools in the most difficult low-sequence homology case. It also improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families (partiFold-Align is available at http://partifold.csail.mit.edu/ ).

Mesh:

Substances:

Year:  2014        PMID: 24766258      PMCID: PMC4082353          DOI: 10.1089/cmb.2013.0163

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  36 in total

1.  Decision tree-based formation of consensus protein secondary structure prediction.

Authors:  J Selbig; T Mevissen; T Lengauer
Journal:  Bioinformatics       Date:  1999-12       Impact factor: 6.937

2.  BETAWRAP: successful prediction of parallel beta -helices from primary sequence reveals an association with many microbial pathogens.

Authors:  P Bradley; L Cowen; M Menke; J King; B Berger
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-18       Impact factor: 11.205

3.  Predicting reliable regions in protein sequence alignments.

Authors:  Melissa Cline; Richard Hughey; Kevin Karplus
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

4.  Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations.

Authors:  P Fariselli; O Olmea; A Valencia; R Casadio
Journal:  Proteins       Date:  2001

5.  Dynalign: an algorithm for finding the secondary structure common to two RNA sequences.

Authors:  David H Mathews; Douglas H Turner
Journal:  J Mol Biol       Date:  2002-03-22       Impact factor: 5.469

6.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

7.  BATMAS30: amino acid substitution matrix for alignment of bacterial transporters.

Authors:  Roman A Sutormin; Aleksandra B Rakhmaninova; Mikhail S Gelfand
Journal:  Proteins       Date:  2003-04-01

8.  LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.

Authors:  Michael Brudno; Chuong B Do; Gregory M Cooper; Michael F Kim; Eugene Davydov; Eric D Green; Arend Sidow; Serafim Batzoglou
Journal:  Genome Res       Date:  2003-03-12       Impact factor: 9.043

9.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

10.  Pfam: the protein families database.

Authors:  Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-11-27       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.