Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Supervised learning method for predicting chromatin boundary associated insulator elements.

Literature DB >> 25385081

Supervised learning method for predicting chromatin boundary associated insulator elements.

Abstract

In eukaryotic cells, the DNA material is densely packed inside the nucleus in the form of a DNA-protein complex structure called chromatin. Since the actual conformation of the chromatin fiber defines the possible regulatory interactions between genes and their regulatory elements, it is very important to understand the mechanisms governing folding of chromatin. In this paper, we show that supervised methods for predicting chromatin boundary elements are much more effective than the currently popular unsupervised methods. Using boundary locations from published Hi-C experiments and modEncode tracks as features, we can tell the insulator elements from randomly selected background sequences with great accuracy. In addition to accurate predictions of the training boundary elements, our classifiers make new predictions. Many of them correspond to the locations of known insulator elements. The key features used for predicting boundary elements do not depend on the prediction method. Because of its miniscule size, chromatin state cannot be measured directly, we need to rely on indirect measurements, such as ChIP-Seq and fill in the gaps with computational models. Our results show that currently, at least in the model organisms, where we have many measurements including ChIP-Seq and Hi-C, we can make accurate predictions of insulator positions.

Keywords: Bayesian network; Chromatin boundary; insulator element; random forest

Mesh：

Substances：
Chromatin

Year: 2014 PMID： 25385081 DOI： 10.1142/S0219720014420062

Source DB: PubMed Journal: J Bioinform Comput Biol ISSN： 0219-7200 Impact factor: 1.122

Keyword Cloud
Cited

7 in total

1. MOCCA: a flexible suite for modelling DNA sequence motif occurrence combinatorics.

Authors: Bjørn André Bredesen; Marc Rehmsmeier
Journal: BMC Bioinformatics Date: 2021-05-07 Impact factor: 3.169

2. Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers.

Authors: Julia Herman-Izycka; Michal Wlasnowolski; Bartek Wilczynski
Journal: BMC Med Genomics Date: 2017-05-24 Impact factor: 3.063

3. Quantifying the similarity of topological domains across normal and cancer human cell types.

Authors: Natalie Sauerwald; Carl Kingsford
Journal: Bioinformatics Date: 2018-07-01 Impact factor: 6.937

4. Semi-nonparametric modeling of topological domain formation from epigenetic data.

Authors: Emre Sefer; Carl Kingsford
Journal: Algorithms Mol Biol Date: 2019-03-05 Impact factor: 1.405

5. Gnocis: An integrated system for interactive and reproducible analysis and modelling of cis-regulatory elements in Python 3.

Authors: Bjørn André Bredesen-Aa; Marc Rehmsmeier
Journal: PLoS One Date: 2022-09-09 Impact factor: 3.752

6. A comparative analysis of health surveillance strategies for administrative video display terminal employees.

Authors: Saki Gerassis; Alberto Abad; Javier Taboada; Ángeles Saavedra; Eduardo Giráldez
Journal: Biomed Eng Online Date: 2019-12-11 Impact factor: 2.819

Review 7. Insulators in Plants: Progress and Open Questions.

Authors: Amina Kurbidaeva; Michael Purugganan
Journal: Genes (Basel) Date: 2021-09-16 Impact factor: 4.096

7 in total