Literature DB >> 20365196

Finite-temperature local protein sequence alignment: percolation and free-energy distribution.

S Wolfsheimer1, O Melchert, A K Hartmann.   

Abstract

Sequence alignment is a tool in bioinformatics that is used to find homological relationships in large molecular databases. It can be mapped on the physical model of directed polymers in random media. We consider the finite-temperature version of local sequence alignment for proteins and study the transition between the linear phase and the biologically relevant logarithmic phase, where the free energy grows linearly or logarithmically with the sequence length. By means of numerical simulations and finite-size-scaling analysis, we determine the phase diagram in the plane that is spanned by the gap costs and the temperature. We use the most frequently used parameter set for protein alignment. The critical exponents that describe the parameter-driven transition are found to be explicitly temperature dependent. Furthermore, we study the shape of the (free-) energy distribution close to the transition by rare-event simulations down to probabilities on the order 10(-64). It is well known that in the logarithmic region, the optimal score distribution (T=0) is described by a modified Gumbel distribution. We confirm that this also applies for the free-energy distribution (T>0). However, in the linear phase, the distribution crosses over to a modified Gaussian distribution.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 20365196     DOI: 10.1103/PhysRevE.80.061913

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  4 in total

1.  Objective method for estimating asymptotic parameters, with an application to sequence alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-09-13

2.  Conotoxin protein classification using free scores of words and support vector machines.

Authors:  Nazar Zaki; Stefan Wolfsheimer; Gregory Nuel; Sawsan Khuri
Journal:  BMC Bioinformatics       Date:  2011-05-29       Impact factor: 3.169

3.  Community detection in sequence similarity networks based on attribute clustering.

Authors:  Janamejaya Chowdhary; Frank E Löffler; Jeremy C Smith
Journal:  PLoS One       Date:  2017-07-24       Impact factor: 3.240

4.  ProbPFP: a multiple sequence alignment algorithm combining hidden Markov model optimized by particle swarm optimization with partition function.

Authors:  Qing Zhan; Nan Wang; Shuilin Jin; Renjie Tan; Qinghua Jiang; Yadong Wang
Journal:  BMC Bioinformatics       Date:  2019-11-25       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.