Literature DB >> 10745987

Segmentation of yeast DNA using hidden Markov models.

L Peshkin1, M S Gelfand.   

Abstract

MOTIVATION: Compositionally homogeneous segments of genomic DNA often correspond to meaningful biological units. Simple sliding window analysis is usually insufficient for compositional segmentation of natural sequences. Hidden Markov models (HMM) with a small number of states are a natural language for description of compositional properties of chromosome-size DNA sequences.
RESULTS: The algorithms were applied to yeast Saccharomyces cerevisiae chromosomes (YC) I, III, IV, VI and IX. The optimal number of HMM states is found to be four. The optimal four-state HMMs for all chromosomes are very similar, as well as the reconstructed segmentations. In most cases the models with k + 1 states are obtained by 'splitting' one of the states in the model with k states, and the corresponding increase of the level of detail in segmentation. The high AT states usually correspond to intergenic regions. We also explore the model's likelihood landscape and analyze the dynamics of the optimization process, thus addressing the problem of reliability of the obtained optima and efficiency of the algorithms.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10745987     DOI: 10.1093/bioinformatics/15.12.980

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models.

Authors:  Pierre Nicolas; Laurent Bize; Florence Muri; Mark Hoebeke; François Rodolphe; S Dusko Ehrlich; Bernard Prum; Philippe Bessières
Journal:  Nucleic Acids Res       Date:  2002-03-15       Impact factor: 16.971

2.  Isochore structures in the genome of the plant Arabidopsis thaliana.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  J Mol Evol       Date:  2004-08       Impact factor: 2.395

3.  In silico segmentations of lentivirus envelope sequences.

Authors:  Aurélia Boissin-Quillon; Didier Piau; Caroline Leroux
Journal:  BMC Bioinformatics       Date:  2007-03-21       Impact factor: 3.169

4.  GC-Profile: a web-based tool for visualizing and analyzing the variation of GC content in genomic sequences.

Authors:  Feng Gao; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

5.  Current awareness on comparative and functional genomics.

Authors: 
Journal:  Yeast       Date:  2000-09-30       Impact factor: 3.239

Review 6.  Investigating genomic structure using changept: A Bayesian segmentation model.

Authors:  Manjula Algama; Jonathan M Keith
Journal:  Comput Struct Biotechnol J       Date:  2014-08-27       Impact factor: 7.271

7.  Comparing segmentations by applying randomization techniques.

Authors:  Niina Haiminen; Heikki Mannila; Evimaria Terzi
Journal:  BMC Bioinformatics       Date:  2007-05-23       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.