Literature DB >> 19808039

Several appropriate background distributions for entropy-based protein sequence conservation measures.

Yongchao Dou1, Xiaoqi Zheng, Jun Wang.   

Abstract

Amino acid background distribution is an important factor for entropy-based methods which extract sequence conservation information from protein multiple sequence alignments (MSAs). However, MSAs are usually not large enough to allow a reliable observed background distribution. In this paper, we propose two new estimations of background distribution. One is an integration of the observed background distribution and the position-specific residue distribution, and the other is a normalized square root of observed background frequency. To validate these new background distributions, they are applied to the relative entropy model to find catalytic sites and ligand binding sites from protein MSAs. Experimental results show that they are superior to the observed background distribution in predicting functionally important residues.

Mesh:

Substances:

Year:  2009        PMID: 19808039     DOI: 10.1016/j.jtbi.2009.09.030

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  Sequence conservation in the prediction of catalytic sites.

Authors:  Yongchao Dou; Xingbo Geng; Hongyun Gao; Jialiang Yang; Xiaoqi Zheng; Jun Wang
Journal:  Protein J       Date:  2011-04       Impact factor: 2.371

2.  L1pred: a sequence-based prediction tool for catalytic residues in enzymes with the L1-logreg classifier.

Authors:  Yongchao Dou; Jun Wang; Jialiang Yang; Chi Zhang
Journal:  PLoS One       Date:  2012-04-27       Impact factor: 3.240

3.  New methods to measure residues coevolution in proteins.

Authors:  Hongyun Gao; Yongchao Dou; Jialiang Yang; Jun Wang
Journal:  BMC Bioinformatics       Date:  2011-05-26       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.