Literature DB >> 23427986

From binding motifs in ChIP-Seq data to improved models of transcription factor binding sites.

Ivan Kulakovskiy1, Victor Levitsky, Dmitry Oshchepkov, Leonid Bryzgalov, Ilya Vorontsov, Vsevolod Makeev.   

Abstract

Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) became a method of choice to locate DNA segments bound by different regulatory proteins. ChIP-Seq produces extremely valuable information to study transcriptional regulation. The wet-lab workflow is often supported by downstream computational analysis including construction of models of nucleotide sequences of transcription factor binding sites in DNA, which can be used to detect binding sites in ChIP-Seq data at a single base pair resolution. The most popular TFBS model is represented by positional weight matrix (PWM) with statistically independent positional weights of nucleotides in different columns; such PWMs are constructed from a gapless multiple local alignment of sequences containing experimentally identified TFBSs. Modern high-throughput techniques, including ChIP-Seq, provide enough data for careful training of advanced models containing more parameters than PWM. Yet, many suggested multiparametric models often provide only incremental improvement of TFBS recognition quality comparing to traditional PWMs trained on ChIP-Seq data. We present a novel computational tool, diChIPMunk, that constructs TFBS models as optimal dinucleotide PWMs, thus accounting for correlations between nucleotides neighboring in input sequences. diChIPMunk utilizes many advantages of ChIPMunk, its ancestor algorithm, accounting for ChIP-Seq base coverage profiles ("peak shape") and using the effective subsampling-based core procedure which allows processing of large datasets. We demonstrate that diPWMs constructed by diChIPMunk outperform traditional PWMs constructed by ChIPMunk from the same ChIP-Seq data. Software website: http://autosome.ru/dichipmunk/

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23427986     DOI: 10.1142/S0219720013400040

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  29 in total

1.  The BaMM web server for de-novo motif discovery and regulatory sequence analysis.

Authors:  Anja Kiesel; Christian Roth; Wanwan Ge; Maximilian Wess; Markus Meier; Johannes Söding
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

2.  HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.

Authors:  Ivan V Kulakovskiy; Ilya E Vorontsov; Ivan S Yevshin; Ruslan N Sharipov; Alla D Fedorova; Eugene I Rumynskiy; Yulia A Medvedeva; Arturo Magana-Mora; Vladimir B Bajic; Dmitry A Papatsenko; Fedor A Kolpakov; Vsevolod J Makeev
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

Review 3.  Noncoding Variants Functional Prioritization Methods Based on Predicted Regulatory Factor Binding Sites.

Authors:  Haoyue Fu; Xiangde Zhang
Journal:  Curr Genomics       Date:  2017-08       Impact factor: 2.236

4.  Mocap: large-scale inference of transcription factor binding sites from chromatin accessibility.

Authors:  Xi Chen; Bowen Yu; Nicholas Carriero; Claudio Silva; Richard Bonneau
Journal:  Nucleic Acids Res       Date:  2017-05-05       Impact factor: 16.971

5.  Varying levels of complexity in transcription factor binding motifs.

Authors:  Jens Keilwagen; Jan Grau
Journal:  Nucleic Acids Res       Date:  2015-06-26       Impact factor: 16.971

6.  Architectural proteins Pita, Zw5,and ZIPIC contain homodimerization domain and support specific long-range interactions in Drosophila.

Authors:  Nikolay Zolotarev; Anna Fedotova; Olga Kyrchanova; Artem Bonchuk; Aleksey A Penin; Andrey S Lando; Irina A Eliseeva; Ivan V Kulakovskiy; Oksana Maksimenko; Pavel Georgiev
Journal:  Nucleic Acids Res       Date:  2016-05-02       Impact factor: 16.971

7.  Disentangling transcription factor binding site complexity.

Authors:  Ralf Eggeling
Journal:  Nucleic Acids Res       Date:  2018-11-16       Impact factor: 16.971

8.  Bayesian Markov models consistently outperform PWMs at predicting motifs in nucleotide sequences.

Authors:  Matthias Siebert; Johannes Söding
Journal:  Nucleic Acids Res       Date:  2016-06-09       Impact factor: 16.971

9.  HOCOMOCO: expansion and enhancement of the collection of transcription factor binding sites models.

Authors:  Ivan V Kulakovskiy; Ilya E Vorontsov; Ivan S Yevshin; Anastasiia V Soboleva; Artem S Kasianov; Haitham Ashoor; Wail Ba-Alawi; Vladimir B Bajic; Yulia A Medvedeva; Fedor A Kolpakov; Vsevolod J Makeev
Journal:  Nucleic Acids Res       Date:  2015-11-19       Impact factor: 16.971

10.  THiCweed: fast, sensitive detection of sequence features by clustering big datasets.

Authors:  Ankit Agrawal; Snehal V Sambare; Leelavati Narlikar; Rahul Siddharthan
Journal:  Nucleic Acids Res       Date:  2018-03-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.