Literature DB >> 17037964

Segmenting eukaryotic genomes with the Generalized Gibbs Sampler.

Jonathan M Keith1.   

Abstract

Eukaryotic genomes display segmental patterns of variation in various properties, including GC content and degree of evolutionary conservation. DNA segmentation algorithms are aimed at identifying statistically significant boundaries between such segments. Such algorithms may provide a means of discovering new classes of functional elements in eukaryotic genomes. This paper presents a model and an algorithm for Bayesian DNA segmentation and considers the feasibility of using it to segment whole eukaryotic genomes. The algorithm is tested on a range of simulated and real DNA sequences, and the following conclusions are drawn. Firstly, the algorithm correctly identifies non-segmented sequence, and can thus be used to reject the null hypothesis of uniformity in the property of interest. Secondly, estimates of the number and locations of change-points produced by the algorithm are robust to variations in algorithm parameters and initial starting conditions and correspond to real features in the data. Thirdly, the algorithm is successfully used to segment human chromosome 1 according to GC content, thus demonstrating the feasibility of Bayesian segmentation of eukaryotic genomes. The software described in this paper is available from the author's website (www.uq.edu.au/ approximately uqjkeith/) or upon request to the author.

Entities:  

Mesh:

Year:  2006        PMID: 17037964     DOI: 10.1089/cmb.2006.13.1369

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  9 in total

1.  Stochastic models for large interacting systems and related correlation inequalities.

Authors:  Thomas M Liggett
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-08       Impact factor: 11.205

2.  Computational characterization of 3' splice variants in the GFAP isoform family.

Authors:  Sarah E Boyd; Betina Nair; Sze Woei Ng; Jonathan M Keith; Jacqueline M Orian
Journal:  PLoS One       Date:  2012-03-30       Impact factor: 3.240

3.  Discovery of putative small non-coding RNAs from the obligate intracellular bacterium Wolbachia pipientis.

Authors:  Megan Woolfit; Manjula Algama; Jonathan M Keith; Elizabeth A McGraw; Jean Popovici
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

Review 4.  Investigating genomic structure using changept: A Bayesian segmentation model.

Authors:  Manjula Algama; Jonathan M Keith
Journal:  Comput Struct Biotechnol J       Date:  2014-08-27       Impact factor: 7.271

5.  Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

Authors:  Manjula Algama; Edward Tasker; Caitlin Williams; Adam C Parslow; Robert J Bryson-Richardson; Jonathan M Keith
Journal:  BMC Genomics       Date:  2017-03-27       Impact factor: 3.969

6.  Detection and identification of cis-regulatory elements using change-point and classification algorithms.

Authors:  Mirana Ramialison; Jonathan Keith; Dominic Maderazo; Jennifer A Flegg; Manjula Algama
Journal:  BMC Genomics       Date:  2022-01-25       Impact factor: 3.969

7.  Interpreting genomic data via entropic dissection.

Authors:  Rajeev K Azad; Jing Li
Journal:  Nucleic Acids Res       Date:  2012-10-03       Impact factor: 16.971

8.  Drosophila 3' UTRs are more complex than protein-coding sequences.

Authors:  Manjula Algama; Christopher Oldmeadow; Edward Tasker; Kerrie Mengersen; Jonathan M Keith
Journal:  PLoS One       Date:  2014-05-13       Impact factor: 3.240

9.  Bayesian change-point modeling with segmented ARMA model.

Authors:  Farhana Sadia; Sarah Boyd; Jonathan M Keith
Journal:  PLoS One       Date:  2018-12-31       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.