Literature DB >> 15838121

Visualization of K-tuple distribution in procaryote complete genomes and their randomized counterparts.

Huimin Xie1, Bailin Hao.   

Abstract

A few years ago we developed a simple scheme to visualize the string composition of long DNA sequences in terms of two- and one-dimensional (2D and 1D) histograms. While the patterns in the 2D histograms have been well understood, the structure of the 1D histograms has not been analyzed in details. It turns out that the structure of the 1D histograms of the genomic sequences and their randomized counterparts varies significantly depending on the g+c content of the genomes. In particular, the 1D histograms of some randomized sequences may show rich structure, a seemingly anti-intuitive result. Three approaches are used to explain the phenomenon: (1) Monte Carlo simulation, (2) exact computation by using the Goulden-Jackson cluster method, and (3) a Poisson approximation method. The multi-modal phenomena in K-histograms are well elucidated by the last approach.

Mesh:

Year:  2002        PMID: 15838121

Source DB:  PubMed          Journal:  Proc IEEE Comput Soc Bioinform Conf        ISSN: 1555-3930


  3 in total

1.  Universal global imprints of genome growth and evolution--equivalent length and cumulative mutation density.

Authors:  Hong-Da Chen; Wen-Lang Fan; Sing-Guan Kong; Hoong-Chien Lee
Journal:  PLoS One       Date:  2010-04-14       Impact factor: 3.240

2.  Diminishing return for increased Mappability with longer sequencing reads: implications of the k-mer distributions in the human genome.

Authors:  Wentian Li; Jan Freudenberg; Pedro Miramontes
Journal:  BMC Bioinformatics       Date:  2014-01-03       Impact factor: 3.169

3.  SeeDNA: a visualization tool for K-string content of long DNA sequences and their randomized counterparts.

Authors:  Junjie Shen; Shuyu Zhang; Hoong-Chien Lee; Bailin Hao
Journal:  Genomics Proteomics Bioinformatics       Date:  2004-08       Impact factor: 7.691

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.