| Literature DB >> 10064709 |
D C Torney1, C C Whittaker, G Xie.
Abstract
We introduce a generally applicable method for the discovery and quantitation of all of the characteristic statistical properties of a class of biological sequences, given examples from the class. This method employs a reversible binary encoding of sequences into the binary digits -1 and +1. Then, provided that the sample is sufficient, the sample cumulants on the subsets of digit positions will manifest all of the statistical properties of the class. As an illustration, we present the main results of a complete characterization of the stationary statistical properties of human coding sequences, in terms of their sample cumulants. Many of the telling sample cumulants are described.Entities:
Mesh:
Substances:
Year: 1999 PMID: 10064709 DOI: 10.1006/jmbi.1998.2567
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469