| Literature DB >> 8808582 |
Abstract
We designed a new probabilistic algorithm, named PAGEC (probabilistic algorithm for genome comparison), which allowed a highly interactive study of long genomic strings. The comparison between two nucleic acid sequences is based on the creation of multiple index tables, which drastically reduces processing time for huge genomes, e.g. 13 min for a 4 Mb/4 Mb comparison. PAGEC lowered the need for memory when compared with other types of algorithm and took into account the low resolution of the final representation (paper or computer screen). Considering that standard printers permit a 300 d.p.i. resolution, the loss of computed information due to the probabilistic conception of the algorithm was not usually noticeable in the present study, mainly due to increased genomic sizes. Refinement was possible through an interactive zooming system, which enabled the visualization of the lexical base sequences of a considered part of both of the studied genomes. Biological examples of computation based on yeast and animal nucleic acid sequences presented in this paper reveal the flexibility of the PAGEC program, which is a valuable tool for genetic studies as it offers a solution to an important problem that will become even more important as time passes.Entities:
Mesh:
Substances:
Year: 1995 PMID: 8808582 DOI: 10.1093/bioinformatics/11.6.657
Source DB: PubMed Journal: Comput Appl Biosci ISSN: 0266-7061