Literature DB >> 25385081

Supervised learning method for predicting chromatin boundary associated insulator elements.

Paweł Bednarz1, Bartek Wilczyński.   

Abstract

In eukaryotic cells, the DNA material is densely packed inside the nucleus in the form of a DNA-protein complex structure called chromatin. Since the actual conformation of the chromatin fiber defines the possible regulatory interactions between genes and their regulatory elements, it is very important to understand the mechanisms governing folding of chromatin. In this paper, we show that supervised methods for predicting chromatin boundary elements are much more effective than the currently popular unsupervised methods. Using boundary locations from published Hi-C experiments and modEncode tracks as features, we can tell the insulator elements from randomly selected background sequences with great accuracy. In addition to accurate predictions of the training boundary elements, our classifiers make new predictions. Many of them correspond to the locations of known insulator elements. The key features used for predicting boundary elements do not depend on the prediction method. Because of its miniscule size, chromatin state cannot be measured directly, we need to rely on indirect measurements, such as ChIP-Seq and fill in the gaps with computational models. Our results show that currently, at least in the model organisms, where we have many measurements including ChIP-Seq and Hi-C, we can make accurate predictions of insulator positions.

Keywords:  Bayesian network; Chromatin boundary; insulator element; random forest

Mesh:

Substances:

Year:  2014        PMID: 25385081     DOI: 10.1142/S0219720014420062

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  7 in total

1.  MOCCA: a flexible suite for modelling DNA sequence motif occurrence combinatorics.

Authors:  Bjørn André Bredesen; Marc Rehmsmeier
Journal:  BMC Bioinformatics       Date:  2021-05-07       Impact factor: 3.169

2.  Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers.

Authors:  Julia Herman-Izycka; Michal Wlasnowolski; Bartek Wilczynski
Journal:  BMC Med Genomics       Date:  2017-05-24       Impact factor: 3.063

3.  Quantifying the similarity of topological domains across normal and cancer human cell types.

Authors:  Natalie Sauerwald; Carl Kingsford
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

4.  Semi-nonparametric modeling of topological domain formation from epigenetic data.

Authors:  Emre Sefer; Carl Kingsford
Journal:  Algorithms Mol Biol       Date:  2019-03-05       Impact factor: 1.405

5.  Gnocis: An integrated system for interactive and reproducible analysis and modelling of cis-regulatory elements in Python 3.

Authors:  Bjørn André Bredesen-Aa; Marc Rehmsmeier
Journal:  PLoS One       Date:  2022-09-09       Impact factor: 3.752

6.  A comparative analysis of health surveillance strategies for administrative video display terminal employees.

Authors:  Saki Gerassis; Alberto Abad; Javier Taboada; Ángeles Saavedra; Eduardo Giráldez
Journal:  Biomed Eng Online       Date:  2019-12-11       Impact factor: 2.819

Review 7.  Insulators in Plants: Progress and Open Questions.

Authors:  Amina Kurbidaeva; Michael Purugganan
Journal:  Genes (Basel)       Date:  2021-09-16       Impact factor: 4.096

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.