| Literature DB >> 11410030 |
Abstract
We outline numerical characterization of DNA primary sequence based on calculation of the average distance between pairs of nucleic acid bases. This leads to a representation of DNA by a condensed 4 x 4 symmetrical matrix, the elements of which give the average separation between pair of bases X, Y in DNA (X, Y = A, C, G, T). As an invariant of choice we consider the leading eigenvalue of the derived 4 x 4 matrix. Additional structurally related invariants were obtained by constructing additional "higher order" 4 x 4 matrices derived from the initial 4 x 4 matrix by raising its elements to higher powers. Suitably normalized leading eigenvalue of these matrices offer a novel characterization of DNA primary sequences, referred to as "DNA profiles". The approach is illustrated on exon 1 of human beta-globin gene.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11410030 DOI: 10.1021/ci0000981
Source DB: PubMed Journal: J Chem Inf Comput Sci ISSN: 0095-2338