Literature DB >> 19669432

Super paramagnetic clustering of DNA sequences.

Sugiarto Radjiman1, Lianyi Han, Jian-Sheng Wang, Yu Zong Chen.   

Abstract

An unsupervised clustering of 4541 DNA sequences containing active promoter regions from vertebrate and arthropod classes (including their viral genes) was performed. All necessary information was solely gathered a priori from the DNA sequences by measuring frequencies of tri-nucleotides and tetra-nucleotides. We employed Super Paramagnetic Clustering, a novel clustering algorithm based on physical properties of an inhomogeneous granular ferromagnet. This method utilizes Swendsen-Wang cluster Monte Carlo simulations to distinguish clusters by measuring pairs of correlation functions from different resolutions. We identified two strongly separated clusters of human viral genes corresponding to the Epstein-Barr virus and the Herpes Simplex virus type 1. In addition, vertebrate and arthropod sequences were successfully separated into two different classes with merely 9.25% of arthropod sequences being misclassified. From a functional perspective, these sequences have high gene function correlations with sequences from the vertebrate cluster. By tuning a clustering parameter, Super Paramagnetic Clustering was able to classify vertebrate class further into two major clusters, from where a large number of housekeeping genes and tissue-specific genes were found respectively. The indications came from observation of gene expression function and consensus transcription factors which were found grouped together in specific positions of the DNA sequences.

Entities:  

Year:  2006        PMID: 19669432      PMCID: PMC3022498          DOI: 10.1007/s10867-006-2120-0

Source DB:  PubMed          Journal:  J Biol Phys        ISSN: 0092-0606            Impact factor:   1.365


  29 in total

1.  The TRANSFAC system on gene expression regulation.

Authors:  E Wingender; X Chen; E Fricke; R Geffers; R Hehl; I Liebich; M Krull; V Matys; H Michael; R Ohnhäuser; M Prüss; F Schacherer; S Thiele; S Urbach
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Superparamagnetic clustering of data.

Authors: 
Journal:  Phys Rev Lett       Date:  1996-04-29       Impact factor: 9.161

Review 3.  The RNA polymerase II core promoter: a key component in the regulation of gene expression.

Authors:  Jennifer E F Butler; James T Kadonaga
Journal:  Genes Dev       Date:  2002-10-15       Impact factor: 11.361

4.  Computational identification of transcription factor binding sites via a transcription-factor-centric clustering (TFCC) algorithm.

Authors:  Zhou Zhu; Yitzhak Pilpel; George M Church
Journal:  J Mol Biol       Date:  2002-04-19       Impact factor: 5.469

5.  The Eukaryotic Promoter Database EPD: the impact of in silico primer extension.

Authors:  Christoph D Schmid; Viviane Praz; Mauro Delorenzi; Rouaïda Périer; Philipp Bucher
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  Human housekeeping genes are compact.

Authors:  Eli Eisenberg; Erez Y Levanon
Journal:  Trends Genet       Date:  2003-07       Impact factor: 11.639

Review 7.  Transcription by RNA polymerase II: initiator-directed formation of transcription-competent complexes.

Authors:  L Weis; D Reinberg
Journal:  FASEB J       Date:  1992-11       Impact factor: 5.191

8.  Nonuniversal critical dynamics in Monte Carlo simulations.

Authors: 
Journal:  Phys Rev Lett       Date:  1987-01-12       Impact factor: 9.161

9.  Compilation of vertebrate-encoded transcription factors.

Authors:  S Faisst; S Meyer
Journal:  Nucleic Acids Res       Date:  1992-01-11       Impact factor: 16.971

10.  Clustering of DNA sequences in human promoters.

Authors:  Peter C FitzGerald; Andrey Shlyakhtenko; Alain A Mir; Charles Vinson
Journal:  Genome Res       Date:  2004-07-15       Impact factor: 9.043

View more
  2 in total

1.  Scaling behavior of nucleotide cluster in DNA sequences.

Authors:  Jun Cheng; Zi-shuang Tong; Lin-xi Zhang
Journal:  J Zhejiang Univ Sci B       Date:  2007-05       Impact factor: 3.066

2.  EvoluCode: Evolutionary Barcodes as a Unifying Framework for Multilevel Evolutionary Data.

Authors:  Benjamin Linard; Ngoc Hoan Nguyen; Francisco Prosdocimi; Olivier Poch; Julie D Thompson
Journal:  Evol Bioinform Online       Date:  2011-12-21       Impact factor: 1.625

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.