Literature DB >> 29889553

Adaptive Local Realignment of Protein Sequences.

Dan DeBlasio1, John Kececioglu2.   

Abstract

While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein's entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising, which finds global parameter settings for an aligner, to now adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment has been implemented within the Opal aligner using the Facet accuracy estimator.

Keywords:  alignment accuracy; iterative refinement; local mutation rates; multiple sequence alignment; parameter advising

Mesh:

Substances:

Year:  2018        PMID: 29889553      PMCID: PMC6067105          DOI: 10.1089/cmb.2018.0045

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  23 in total

1.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors:  Chuong B Do; Mahathi S P Mahabhashyam; Michael Brudno; Serafim Batzoglou
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

4.  Multiple alignment by aligning alignments.

Authors:  Travis J Wheeler; John D Kececioglu
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

5.  Accuracy estimation and parameter advising for protein multiple sequence alignment.

Authors:  John Kececioglu; Dan DeBlasio
Journal:  J Comput Biol       Date:  2013-03-14       Impact factor: 1.479

6.  A method for estimating the number of invariant amino acid coding positions in a gene using cytochrome c as a model case.

Authors:  W M Fitch; E Margoliash
Journal:  Biochem Genet       Date:  1967-06       Impact factor: 1.890

7.  Learning Parameter-Advising Sets for Multiple Sequence Alignment.

Authors:  Dan DeBlasio; John Kececioglu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2017 Sep-Oct       Impact factor: 3.710

8.  Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites.

Authors:  Z Yang
Journal:  Mol Biol Evol       Date:  1993-11       Impact factor: 16.240

9.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

10.  M-Coffee: combining multiple sequence alignment methods with T-Coffee.

Authors:  Iain M Wallace; Orla O'Sullivan; Desmond G Higgins; Cedric Notredame
Journal:  Nucleic Acids Res       Date:  2006-03-23       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.