Gaia Ceddia1, Liuba Nausicaa Martino2, Alice Parodi2, Piercesare Secchi2,3, Stefano Campaner4, Marco Masseroli1. 1. Dipartimento di Elettronica, Informazione e Bioingegneria, Italy. 2. MOX - Dipartimento di Matematica, Politecnico di Milano, Milan 20133, Italy. 3. Center for Analysis, Decisions and Society, Human Technopole, Milan 20157, Italy. 4. Center for Genomic Science of IIT@SEMM, Istituto Italiano di Tecnologia (IIT), Milan 20139, Italy.
Abstract
MOTIVATION: Genome regulatory networks have different layers and ways to modulate cellular processes, such as cell differentiation, proliferation, and adaptation to external stimuli. Transcription factors and other chromatin-associated proteins act as combinatorial protein complexes that control gene transcription. Thus, identifying functional interaction networks among these proteins is a fundamental task to understand the genome regulation framework. RESULTS: We developed a novel approach to infer interactions among transcription factors in user-selected genomic regions, by combining the computation of association rules and of a novel Importance Index on ChIP-seq datasets. The hallmark of our method is the definition of the Importance Index, which provides a relevance measure of the interaction among transcription factors found associated in the computed rules. Examples on synthetic data explain the index use and potential. A straightforward pre-processing pipeline enables the easy extraction of input data for our approach from any set of ChIP-seq experiments. Applications on ENCODE ChIP-seq data prove that our approach can reliably detect interactions between transcription factors, including known interactions that validate our approach. AVAILABILITY AND IMPLEMENTATION: A R/Bioconductor package implementing our association rules and Importance Index-based method is available at http://bioconductor.org/packages/release/bioc/html/TFARM.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Genome regulatory networks have different layers and ways to modulate cellular processes, such as cell differentiation, proliferation, and adaptation to external stimuli. Transcription factors and other chromatin-associated proteins act as combinatorial protein complexes that control gene transcription. Thus, identifying functional interaction networks among these proteins is a fundamental task to understand the genome regulation framework. RESULTS: We developed a novel approach to infer interactions among transcription factors in user-selected genomic regions, by combining the computation of association rules and of a novel Importance Index on ChIP-seq datasets. The hallmark of our method is the definition of the Importance Index, which provides a relevance measure of the interaction among transcription factors found associated in the computed rules. Examples on synthetic data explain the index use and potential. A straightforward pre-processing pipeline enables the easy extraction of input data for our approach from any set of ChIP-seq experiments. Applications on ENCODE ChIP-seq data prove that our approach can reliably detect interactions between transcription factors, including known interactions that validate our approach. AVAILABILITY AND IMPLEMENTATION: A R/Bioconductor package implementing our association rules and Importance Index-based method is available at http://bioconductor.org/packages/release/bioc/html/TFARM.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Yijun Zhao; Yi Ding; Yangqian Shen; Samuel Failing; Jacqueline Hwang Journal: Int J Environ Res Public Health Date: 2022-02-19 Impact factor: 3.390