Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SEGMENT: identifying compositional domains in DNA sequences.

Literature DB >> 10745986

SEGMENT: identifying compositional domains in DNA sequences.

J L Oliver¹, R Román-Roldán, J Pérez, P Bernaola-Galván.

Abstract

MOTIVATION: DNA sequences are formed by patches or domains of different nucleotide composition. In a few simple sequences, domains can simply be identified by eye; however, most DNA sequences show a complex compositional heterogeneity (fractal structure), which cannot be properly detected by current methods. Recently, a computationally efficient segmentation method to analyse such nonstationary sequence structures, based on the Jensen-Shannon entropic divergence, has been described. Specific algorithms implementing this method are now needed.
RESULTS: Here we describe a heuristic segmentation algorithm for DNA sequences, which was implemented on a Windows program (SEGMENT). The program divides a DNA sequence into compositionally homogeneous domains by iterating a local optimization procedure at a given statistical significance. Once a sequence is partitioned into domains, a global measure of sequence compositional complexity (SCC), accounting for both the sizes and compositional biases of all the domains in the sequence, is derived. SEGMENT computes SCC as a function of the significance level, which provides a multiscale view of sequence complexity.

Mesh：

Year: 1999 PMID： 10745986 DOI： 10.1093/bioinformatics/15.12.974

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

10 in total

1. Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models.

Authors: Pierre Nicolas; Laurent Bize; Florence Muri; Mark Hoebeke; François Rodolphe; S Dusko Ehrlich; Bernard Prum; Philippe Bessières
Journal: Nucleic Acids Res Date: 2002-03-15 Impact factor: 16.971

2. Effects of coarse-graining on the scaling behavior of long-range correlated and anti-correlated signals.

Authors: Yinlin Xu; Qianli D Y Ma; Daniel T Schmitt; Pedro Bernaola-Galván; Plamen Ch Ivanov
Journal: Physica A Date: 2011-11-01 Impact factor: 3.263

3. Low-complexity regions in Plasmodium falciparum proteins.

Authors: E Pizzi; C Frontali
Journal: Genome Res Date: 2001-02 Impact factor: 9.043

Review 4. Genomics, morphogenesis and biophysics: triangulation of Purkinje cell development.

Authors: Malcolm J Simons; András J Pellionisz
Journal: Cerebellum Date: 2006 Impact factor: 3.648

5. Current awareness on comparative and functional genomics.

Authors:
Journal: Yeast Date: 2000-09-30 Impact factor: 3.239

Review 6. Investigating genomic structure using changept: A Bayesian segmentation model.

Authors: Manjula Algama; Jonathan M Keith
Journal: Comput Struct Biotechnol J Date: 2014-08-27 Impact factor: 7.271

7. NGSmethDB 2017: enhanced methylomes and differential methylation.

Authors: Ricardo Lebrón; Cristina Gómez-Martín; Pedro Carpena; Pedro Bernaola-Galván; Guillermo Barturen; Michael Hackenberg; José L Oliver
Journal: Nucleic Acids Res Date: 2016-10-27 Impact factor: 16.971

8. Interpreting genomic data via entropic dissection.

Authors: Rajeev K Azad; Jing Li
Journal: Nucleic Acids Res Date: 2012-10-03 Impact factor: 16.971

9. Comparing segmentations by applying randomization techniques.

Authors: Niina Haiminen; Heikki Mannila; Evimaria Terzi
Journal: BMC Bioinformatics Date: 2007-05-23 Impact factor: 3.169

10. Driven progressive evolution of genome sequence complexity in Cyanobacteria.

Authors: Andrés Moya; José L Oliver; Miguel Verdú; Luis Delaye; Vicente Arnau; Pedro Bernaola-Galván; Rebeca de la Fuente; Wladimiro Díaz; Cristina Gómez-Martín; Francisco M González; Amparo Latorre; Ricardo Lebrón; Ramón Román-Roldán
Journal: Sci Rep Date: 2020-11-04 Impact factor: 4.379

10 in total