Literature DB >> 11861921

BALSA: Bayesian algorithm for local sequence alignment.

Bobbie-Jo M Webb1, Jun S Liu, Charles E Lawrence.   

Abstract

The Smith-Waterman algorithm yields a single alignment, which, albeit optimal, can be strongly affected by the choice of the scoring matrix and the gap penalties. Additionally, the scores obtained are dependent upon the lengths of the aligned sequences, requiring a post-analysis conversion. To overcome some of these shortcomings, we developed a Bayesian algorithm for local sequence alignment (BALSA), that takes into account the uncertainty associated with all unknown variables by incorporating in its forward sums a series of scoring matrices, gap parameters and all possible alignments. The algorithm can return both the joint and the marginal optimal alignments, samples of alignments drawn from the posterior distribution and the posterior probabilities of gap penalties and scoring matrices. Furthermore, it automatically adjusts for variations in sequence lengths. BALSA was compared with SSEARCH, to date the best performing dynamic programming algorithm in the detection of structural neighbors. Using the SCOP databases PDB40D-B and PDB90D-B, BALSA detected 19.8 and 41.3% of remote homologs whereas SSEARCH detected 18.4 and 38% at an error rate of 1% errors per query over the databases, respectively.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11861921      PMCID: PMC101229          DOI: 10.1093/nar/30.5.1268

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  15 in total

1.  Bayesian inference on biopolymer models.

Authors:  J S Liu; C E Lawrence
Journal:  Bioinformatics       Date:  1999-01       Impact factor: 6.937

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

4.  Comparison of methods for searching protein sequence databases.

Authors:  W R Pearson
Journal:  Protein Sci       Date:  1995-06       Impact factor: 6.725

Review 5.  A sequence similarity search algorithm based on a probabilistic interpretation of an alignment scoring system.

Authors:  P Bucher; K Hofmann
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1996

6.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

7.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

8.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

9.  Applications and statistics for multiple high-scoring segments in molecular sequences.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1993-06-15       Impact factor: 11.205

10.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

View more
  13 in total

1.  ldpA encodes an iron-sulfur protein involved in light-dependent modulation of the circadian period in the cyanobacterium Synechococcus elongatus PCC 7942.

Authors:  Mitsunori Katayama; Takao Kondo; Jin Xiong; Susan S Golden
Journal:  J Bacteriol       Date:  2003-02       Impact factor: 3.490

2.  Gibbs Recursive Sampler: finding transcription factor binding sites.

Authors:  William Thompson; Eric C Rouchka; Charles E Lawrence
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Aligning sequences by minimum description length.

Authors:  John S Conery
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

4.  Sequence alignment as hypothesis testing.

Authors:  Lu Meng; Fengzhu Sun; Xuegong Zhang; Michael S Waterman
Journal:  J Comput Biol       Date:  2011-05       Impact factor: 1.479

5.  BAYESIAN PROTEIN STRUCTURE ALIGNMENT.

Authors:  Abel Rodriguez; Scott C Schmidler
Journal:  Ann Appl Stat       Date:  2014-12-19       Impact factor: 2.083

6.  RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences.

Authors:  Donglai Wei; Lauren V Alpert; Charles E Lawrence
Journal:  Bioinformatics       Date:  2011-07-24       Impact factor: 6.937

7.  Dynamic use of multiple parameter sets in sequence alignment.

Authors:  Xiaoqiu Huang; Douglas L Brutlag
Journal:  Nucleic Acids Res       Date:  2006-12-19       Impact factor: 16.971

8.  Optimizing amino acid substitution matrices with a local alignment kernel.

Authors:  Hiroto Saigo; Jean-Philippe Vert; Tatsuya Akutsu
Journal:  BMC Bioinformatics       Date:  2006-05-05       Impact factor: 3.169

9.  Measuring global credibility with application to local sequence alignment.

Authors:  Bobbie-Jo M Webb-Robertson; Lee Ann McCue; Charles E Lawrence
Journal:  PLoS Comput Biol       Date:  2008-05-16       Impact factor: 4.475

10.  A probabilistic model of local sequence alignment that simplifies statistical significance estimation.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2008-05-30       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.