Literature DB >> 27559428

A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery.

Ian E H Yen1, Xin Lin1, Jiong Zhang2, Pradeep Ravikumar3, Inderjit S Dhillon3.   

Abstract

Multiple Sequence Alignment and Motif Discovery, known as NP-hard problems, are two fundamental tasks in Bioinformatics. Existing approaches to these two problems are based on either local search methods such as Expectation Maximization (EM), Gibbs Sampling or greedy heuristic methods. In this work, we develop a convex relaxation approach to both problems based on the recent concept of atomic norm and develop a new algorithm, termed Greedy Direction Method of Multiplier, for solving the convex relaxation with two convex atomic constraints. Experiments show that our convex relaxation approach produces solutions of higher quality than those standard tools widely-used in Bioinformatics community on the Multiple Sequence Alignment and Motif Discovery problems.

Entities:  

Year:  2016        PMID: 27559428      PMCID: PMC4993214     

Source DB:  PubMed          Journal:  JMLR Workshop Conf Proc        ISSN: 1938-7288


  16 in total

1.  Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set.

Authors:  K Karplus; B Hu
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

Review 2.  Recent progress in multiple sequence alignment: a survey.

Authors:  Cédric Notredame
Journal:  Pharmacogenomics       Date:  2002-01       Impact factor: 2.533

3.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

4.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

5.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

Review 6.  Inching toward reality: an improved likelihood model of sequence evolution.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1992-01       Impact factor: 2.395

7.  Settling the intractability of multiple alignment.

Authors:  Isaac Elias
Journal:  J Comput Biol       Date:  2006-09       Impact factor: 1.479

8.  PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.

Authors:  Rahul Siddharthan; Eric D Siggia; Erik van Nimwegen
Journal:  PLoS Comput Biol       Date:  2005-12-09       Impact factor: 4.475

9.  Kalign--an accurate and fast multiple sequence alignment algorithm.

Authors:  Timo Lassmann; Erik L L Sonnhammer
Journal:  BMC Bioinformatics       Date:  2005-12-12       Impact factor: 3.169

10.  MEME: discovering and analyzing DNA and protein sequence motifs.

Authors:  Timothy L Bailey; Nadya Williams; Chris Misleh; Wilfred W Li
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

View more
  2 in total

1.  Dual Decomposed Learning with Factorwise Oracles for Structural SVMs of Large Output Domain.

Authors:  Ian E H Yen; Xiangru Huang; Kai Zhong; Ruohan Zhang; Pradeep Ravikumar; Inderjit S Dhillon
Journal:  Adv Neural Inf Process Syst       Date:  2016-12

2.  Scalable Convex Multiple Sequence Alignment via Entropy-Regularized Dual Decomposition.

Authors:  Jiong Zhang; Ian E H Yen; Pradeep Ravikumar; Inderjit S Dhillon
Journal:  JMLR Workshop Conf Proc       Date:  2017-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.