| Literature DB >> 19808039 |
Yongchao Dou1, Xiaoqi Zheng, Jun Wang.
Abstract
Amino acid background distribution is an important factor for entropy-based methods which extract sequence conservation information from protein multiple sequence alignments (MSAs). However, MSAs are usually not large enough to allow a reliable observed background distribution. In this paper, we propose two new estimations of background distribution. One is an integration of the observed background distribution and the position-specific residue distribution, and the other is a normalized square root of observed background frequency. To validate these new background distributions, they are applied to the relative entropy model to find catalytic sites and ligand binding sites from protein MSAs. Experimental results show that they are superior to the observed background distribution in predicting functionally important residues.Mesh:
Substances:
Year: 2009 PMID: 19808039 DOI: 10.1016/j.jtbi.2009.09.030
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691