Literature DB >> 11465063

Making sense of score statistics for sequence alignments.

M Pagni1, C V Jongeneel.   

Abstract

The search for similarity between two biological sequences lies at the core of many applications in bioinformatics. This paper aims to highlight a few of the principles that should be kept in mind when evaluating the statistical significance of alignments between sequences. The extreme value distribution is first introduced, which in most cases describes the distribution of alignment scores between a query and a database. The effects of the similarity matrix and gap penalty values on the score distribution are then examined, and it is shown that the alignment statistics can undergo an abrupt phase transition. A few types of random sequence databases used in the estimation of statistical significance are presented, and the statistics employed by the BLAST, FASTA and PRSS programs are compared. Finally the different strategies used to assess the statistical significance of the matches produced by profiles and hidden Markov models are presented.

Mesh:

Substances:

Year:  2001        PMID: 11465063     DOI: 10.1093/bib/2.1.51

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  10 in total

1.  Recent improvements to the PROSITE database.

Authors:  Nicolas Hulo; Christian J A Sigrist; Virginie Le Saux; Petra S Langendijk-Genevaux; Lorenza Bordoli; Alexandre Gattiker; Edouard De Castro; Philipp Bucher; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Querying pathways in protein interaction networks based on hidden Markov models.

Authors:  Xiaoning Qian; Sing-Hoi Sze; Byung-Jun Yoon
Journal:  J Comput Biol       Date:  2009-02       Impact factor: 1.479

3.  A Puzzling Anomaly in the 4-Mer Composition of the Giant Pandoravirus Genomes Reveals a Stringent New Evolutionary Selection Process.

Authors:  Olivier Poirot; Sandra Jeudy; Chantal Abergel; Jean-Michel Claverie
Journal:  J Virol       Date:  2019-11-13       Impact factor: 5.103

4.  Zona pellucida-binding protein 2 (ZPBP2) and several proteins containing BX7B motifs in human sperm may have hyaluronic acid binding or recognition properties.

Authors:  F Torabi; O A Bogle; J M Estanyol; R Oliva; D Miller
Journal:  Mol Hum Reprod       Date:  2017-12-01       Impact factor: 4.025

5.  Effective identification of conserved pathways in biological networks using hidden Markov models.

Authors:  Xiaoning Qian; Byung-Jun Yoon
Journal:  PLoS One       Date:  2009-12-07       Impact factor: 3.240

6.  A widespread family of polymorphic contact-dependent toxin delivery systems in bacteria.

Authors:  Stephanie K Aoki; Elie J Diner; Claire T'kint de Roodenbeke; Brandt R Burgess; Stephen J Poole; Bruce A Braaten; Allison M Jones; Julia S Webb; Christopher S Hayes; Peggy A Cotter; David A Low
Journal:  Nature       Date:  2010-11-18       Impact factor: 49.962

7.  Accelerating pairwise statistical significance estimation for local alignment by harvesting GPU's power.

Authors:  Yuhong Zhang; Sanchit Misra; Ankit Agrawal; Md Mostofa Ali Patwary; Wei-Keng Liao; Zhiguang Qin; Alok Choudhary
Journal:  BMC Bioinformatics       Date:  2012-04-12       Impact factor: 3.169

8.  Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty.

Authors:  Ankit Agrawal; Xiaoqiu Huang
Journal:  BMC Bioinformatics       Date:  2009-03-19       Impact factor: 3.169

9.  Density-based hierarchical clustering of pyro-sequences on a large scale--the case of fungal ITS1.

Authors:  Marco Pagni; Hélène Niculita-Hirzel; Loïc Pellissier; Anne Dubuis; Ioannis Xenarios; Antoine Guisan; Ian R Sanders; Jérôme Goudet; Nicolas Guex
Journal:  Bioinformatics       Date:  2013-03-28       Impact factor: 6.937

10.  HAMAP in 2013, new developments in the protein family classification and annotation system.

Authors:  Ivo Pedruzzi; Catherine Rivoire; Andrea H Auchincloss; Elisabeth Coudert; Guillaume Keller; Edouard de Castro; Delphine Baratin; Béatrice A Cuche; Lydie Bougueleret; Sylvain Poux; Nicole Redaschi; Ioannis Xenarios; Alan Bridge
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.