Literature DB >> 34988437

OLOGRAM-MODL: mining enriched n-wise combinations of genomic features with Monte Carlo and dictionary learning.

Quentin Ferré1, Cécile Capponi2, Denis Puthier1.   

Abstract

Most epigenetic marks, such as Transcriptional Regulators or histone marks, are biological objects known to work together in n-wise complexes. A suitable way to infer such functional associations between them is to study the overlaps of the corresponding genomic regions. However, the problem of the statistical significance of n-wise overlaps of genomic features is seldom tackled, which prevent rigorous studies of n-wise interactions. We introduce OLOGRAM-MODL, which considers overlaps between n ≥ 2 sets of genomic regions, and computes their statistical mutual enrichment by Monte Carlo fitting of a Negative Binomial distribution, resulting in more resolutive P-values. An optional machine learning method is proposed to find complexes of interest, using a new itemset mining algorithm based on dictionary learning which is resistant to noise inherent to biological assays. The overall approach is implemented through an easy-to-use CLI interface for workflow integration, and a visual tree-based representation of the results suited for explicability. The viability of the method is experimentally studied using both artificial and biological data. This approach is accessible through the command line interface of the pygtftk toolkit, available on Bioconda and from https://github.com/dputhier/pygtftk.
© The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2021        PMID: 34988437      PMCID: PMC8693575          DOI: 10.1093/nargab/lqab114

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  26 in total

Review 1.  CTCF: master weaver of the genome.

Authors:  Jennifer E Phillips; Victor G Corces
Journal:  Cell       Date:  2009-06-26       Impact factor: 41.582

2.  Cell signaling can direct either binary or graded transcriptional responses.

Authors:  S R Biggar; G R Crabtree
Journal:  EMBO J       Date:  2001-06-15       Impact factor: 11.598

3.  Differential oestrogen receptor binding is associated with clinical outcome in breast cancer.

Authors:  Caryn S Ross-Innes; Rory Stark; Andrew E Teschendorff; Kelly A Holmes; H Raza Ali; Mark J Dunning; Gordon D Brown; Ondrej Gojis; Ian O Ellis; Andrew R Green; Simak Ali; Suet-Feung Chin; Carlo Palmieri; Carlos Caldas; Jason S Carroll
Journal:  Nature       Date:  2012-01-04       Impact factor: 49.962

4.  PC-TraFF: identification of potentially collaborating transcription factors using pointwise mutual information.

Authors:  Cornelia Meckbach; Rebecca Tacke; Xu Hua; Stephan Waack; Edgar Wingender; Mehmet Gültas
Journal:  BMC Bioinformatics       Date:  2015-12-01       Impact factor: 3.169

5.  Inferring chromatin-bound protein complexes from genome-wide binding assays.

Authors:  Eugenia G Giannopoulou; Olivier Elemento
Journal:  Genome Res       Date:  2013-04-03       Impact factor: 9.043

Review 6.  Nanog Dynamics in Mouse Embryonic Stem Cells: Results from Systems Biology Approaches.

Authors:  Lucia Marucci
Journal:  Stem Cells Int       Date:  2017-06-08       Impact factor: 5.443

7.  Discover context-specific combinatorial transcription factor interactions by integrating diverse ChIP-Seq data sets.

Authors:  Li Teng; Bing He; Peng Gao; Long Gao; Kai Tan
Journal:  Nucleic Acids Res       Date:  2013-11-11       Impact factor: 16.971

8.  Denoising genome-wide histone ChIP-seq with convolutional neural networks.

Authors:  Pang Wei Koh; Emma Pierson; Anshul Kundaje
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

9.  Coloc-stats: a unified web interface to perform colocalization analysis of genomic features.

Authors:  Boris Simovski; Chakravarthi Kanduri; Sveinung Gundersen; Dmytro Titov; Diana Domanska; Christoph Bock; Lara Bossini-Castillo; Maria Chikina; Alexander Favorov; Ryan M Layer; Andrey A Mironov; Aaron R Quinlan; Nathan C Sheffield; Gosia Trynka; Geir K Sandve
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

10.  Colocalization analyses of genomic elements: approaches, recommendations and challenges.

Authors:  Chakravarthi Kanduri; Christoph Bock; Sveinung Gundersen; Eivind Hovig; Geir Kjetil Sandve
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

View more
  1 in total

1.  Epigenetic regulation of transcription factor binding motifs promotes Th1 response in Chagas disease cardiomyopathy.

Authors:  Pauline Brochet; Barbara Maria Ianni; Laurie Laugier; Amanda Farage Frade; João Paulo Silva Nunes; Priscila Camillo Teixeira; Charles Mady; Ludmila Rodrigues Pinto Ferreira; Quentin Ferré; Ronaldo Honorato Barros Santos; Andreia Kuramoto; Sandrine Cabantous; Samuel Steffen; Antonio Noedir Stolf; Pablo Pomerantzeff; Alfredo Inacio Fiorelli; Edimar Alcides Bocchi; Cristina Wide Pissetti; Bruno Saba; Darlan da Silva Cândido; Fabrício C Dias; Marcelo Ferraz Sampaio; Fabio Antônio Gaiotto; José Antonio Marin-Neto; Abílio Fragata; Ricardo Costa Fernandes Zaniratto; Sergio Siqueira; Giselle De Lima Peixoto; Vagner Oliveira-Carvalho Rigaud; Fernando Bacal; Paula Buck; Rafael Ribeiro Almeida; Hui Tzu Lin-Wang; André Schmidt; Martino Martinelli; Mario Hiroyuki Hirata; Eduardo Antonio Donadi; Alexandre Costa Pereira; Virmondes Rodrigues Junior; Denis Puthier; Jorge Kalil; Lionel Spinelli; Edecio Cunha-Neto; Christophe Chevillard
Journal:  Front Immunol       Date:  2022-08-22       Impact factor: 8.786

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.