Literature DB >> 10745986

SEGMENT: identifying compositional domains in DNA sequences.

J L Oliver1, R Román-Roldán, J Pérez, P Bernaola-Galván.   

Abstract

MOTIVATION: DNA sequences are formed by patches or domains of different nucleotide composition. In a few simple sequences, domains can simply be identified by eye; however, most DNA sequences show a complex compositional heterogeneity (fractal structure), which cannot be properly detected by current methods. Recently, a computationally efficient segmentation method to analyse such nonstationary sequence structures, based on the Jensen-Shannon entropic divergence, has been described. Specific algorithms implementing this method are now needed.
RESULTS: Here we describe a heuristic segmentation algorithm for DNA sequences, which was implemented on a Windows program (SEGMENT). The program divides a DNA sequence into compositionally homogeneous domains by iterating a local optimization procedure at a given statistical significance. Once a sequence is partitioned into domains, a global measure of sequence compositional complexity (SCC), accounting for both the sizes and compositional biases of all the domains in the sequence, is derived. SEGMENT computes SCC as a function of the significance level, which provides a multiscale view of sequence complexity.

Mesh:

Year:  1999        PMID: 10745986     DOI: 10.1093/bioinformatics/15.12.974

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models.

Authors:  Pierre Nicolas; Laurent Bize; Florence Muri; Mark Hoebeke; François Rodolphe; S Dusko Ehrlich; Bernard Prum; Philippe Bessières
Journal:  Nucleic Acids Res       Date:  2002-03-15       Impact factor: 16.971

2.  Effects of coarse-graining on the scaling behavior of long-range correlated and anti-correlated signals.

Authors:  Yinlin Xu; Qianli D Y Ma; Daniel T Schmitt; Pedro Bernaola-Galván; Plamen Ch Ivanov
Journal:  Physica A       Date:  2011-11-01       Impact factor: 3.263

3.  Low-complexity regions in Plasmodium falciparum proteins.

Authors:  E Pizzi; C Frontali
Journal:  Genome Res       Date:  2001-02       Impact factor: 9.043

Review 4.  Genomics, morphogenesis and biophysics: triangulation of Purkinje cell development.

Authors:  Malcolm J Simons; András J Pellionisz
Journal:  Cerebellum       Date:  2006       Impact factor: 3.648

5.  Current awareness on comparative and functional genomics.

Authors: 
Journal:  Yeast       Date:  2000-09-30       Impact factor: 3.239

Review 6.  Investigating genomic structure using changept: A Bayesian segmentation model.

Authors:  Manjula Algama; Jonathan M Keith
Journal:  Comput Struct Biotechnol J       Date:  2014-08-27       Impact factor: 7.271

7.  NGSmethDB 2017: enhanced methylomes and differential methylation.

Authors:  Ricardo Lebrón; Cristina Gómez-Martín; Pedro Carpena; Pedro Bernaola-Galván; Guillermo Barturen; Michael Hackenberg; José L Oliver
Journal:  Nucleic Acids Res       Date:  2016-10-27       Impact factor: 16.971

8.  Interpreting genomic data via entropic dissection.

Authors:  Rajeev K Azad; Jing Li
Journal:  Nucleic Acids Res       Date:  2012-10-03       Impact factor: 16.971

9.  Comparing segmentations by applying randomization techniques.

Authors:  Niina Haiminen; Heikki Mannila; Evimaria Terzi
Journal:  BMC Bioinformatics       Date:  2007-05-23       Impact factor: 3.169

10.  Driven progressive evolution of genome sequence complexity in Cyanobacteria.

Authors:  Andrés Moya; José L Oliver; Miguel Verdú; Luis Delaye; Vicente Arnau; Pedro Bernaola-Galván; Rebeca de la Fuente; Wladimiro Díaz; Cristina Gómez-Martín; Francisco M González; Amparo Latorre; Ricardo Lebrón; Ramón Román-Roldán
Journal:  Sci Rep       Date:  2020-11-04       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.