| Literature DB >> 30445760 |
Deepti Vipin1, Lingfei Wang2, Guillaume Devailly3, Tom Michoel4,5, Anagha Joshi6,7.
Abstract
Transcription control plays a crucial role in establishing a unique gene expression signature for each of the hundreds of mammalian cell types. Though gene expression data have been widely used to infer cellular regulatory networks, existing methods mainly infer correlations rather than causality. We developed statistical models and likelihood-ratio tests to infer causal gene regulatory networks using enhancer RNA (eRNA) expression information as a causal anchor and applied the framework to eRNA and transcript expression data from the FANTOM Consortium. Predicted causal targets of transcription factors (TFs) in mouse embryonic stem cells, macrophages and erythroblastic leukaemia overlapped significantly with experimentally-validated targets from ChIP-seq and perturbation data. We further improved the model by taking into account that some TFs might act in a quantitative, dosage-dependent manner, whereas others might act predominantly in a binary on/off fashion. We predicted TF targets from concerted variation of eRNA and TF and target promoter expression levels within a single cell type, as well as across multiple cell types. Importantly, TFs with high-confidence predictions were largely different between these two analyses, demonstrating that variability within a cell type is highly relevant for target prediction of cell type-specific factors. Finally, we generated a compendium of high-confidence TF targets across diverse human cell and tissue types.Entities:
Keywords: causal inference; enhancer activity; gene expression; transcription regulation
Mesh:
Substances:
Year: 2018 PMID: 30445760 PMCID: PMC6274755 DOI: 10.3390/ijms19113609
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Overview of the Findr framework. (A) The schematic representation of causal gene regulatory network inference using enhancer activity as a causal anchor. (B) Five statistical tests used by Findr for causal inference. (C) Workflow of the Findr-A framework. eRNA, enhancer RNA; TF, transcription factor; B, binary; C, continuous.
Figure 2Recall-precision curves for target predictions by Findr-B and Findr-C using ChIP-seq. The dotted line represents the background or the the random classifier precision.
Figure 3Robustness of Findr performance demonstrated by using different score thresholds.
Figure 4Comparison of Findr-adaptive (A) predictions using mouse ES cells and all cell types samples. (A) Bar plots representing enrichments for Findr-B, -C and -A predictions using ChIP-seq data as a validation dataset for ES cells (left) and all cell types (right). (B) Bar plots representing enrichments for Findr-B, -C and -A predictions using knock-out data as a validation dataset for ES cells (left) and all cell types (right). (C) Overlap of factors between ES and all cell types using ChIP-seq (left) and knock-out (right) as validation datasets.
Figure 5Hierarchical representation of similarities between transcription factor-target sets predicted using Findr-A causal inference on the FANTOM5 human dataset.