Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Simultaneous alignment and folding of protein sequences.

Literature DB >> 24766258

Simultaneous alignment and folding of protein sequences.

Jérôme Waldispühl¹, Charles W O'Donnell, Sebastian Will, Srinivas Devadas, Rolf Backofen, Bonnie Berger.

Abstract

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise and multiple sequence alignment tools in the most difficult low-sequence homology case. It also improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families (partiFold-Align is available at http://partifold.csail.mit.edu/ ).

Mesh：

Substances：
Membrane Proteins

Year: 2014 PMID： 24766258 PMCID： PMC4082353 DOI： 10.1089/cmb.2013.0163

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
References

36 in total

1. Decision tree-based formation of consensus protein secondary structure prediction.

Authors: J Selbig; T Mevissen; T Lengauer
Journal: Bioinformatics Date: 1999-12 Impact factor: 6.937

2. BETAWRAP: successful prediction of parallel beta -helices from primary sequence reveals an association with many microbial pathogens.

Authors: P Bradley; L Cowen; M Menke; J King; B Berger
Journal: Proc Natl Acad Sci U S A Date: 2001-12-18 Impact factor: 11.205

3. Predicting reliable regions in protein sequence alignments.

Authors: Melissa Cline; Richard Hughey; Kevin Karplus
Journal: Bioinformatics Date: 2002-02 Impact factor: 6.937

4. Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations.

Authors: P Fariselli; O Olmea; A Valencia; R Casadio
Journal: Proteins Date: 2001

5. Dynalign: an algorithm for finding the secondary structure common to two RNA sequences.

Authors: David H Mathews; Douglas H Turner
Journal: J Mol Biol Date: 2002-03-22 Impact factor: 5.469

6. T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors: C Notredame; D G Higgins; J Heringa
Journal: J Mol Biol Date: 2000-09-08 Impact factor: 5.469

7. BATMAS30: amino acid substitution matrix for alignment of bacterial transporters.

Authors: Roman A Sutormin; Aleksandra B Rakhmaninova; Mikhail S Gelfand
Journal: Proteins Date: 2003-04-01

8. LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.

Authors: Michael Brudno; Chuong B Do; Gregory M Cooper; Michael F Kim; Eugene Davydov; Eric D Green; Arend Sidow; Serafim Batzoglou
Journal: Genome Res Date: 2003-03-12 Impact factor: 9.043

9. MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors: Robert C Edgar
Journal: Nucleic Acids Res Date: 2004-03-19 Impact factor: 16.971

10. Pfam: the protein families database.

Authors: Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal: Nucleic Acids Res Date: 2013-11-27 Impact factor: 16.971