| Literature DB >> 30304373 |
Florian Schmidt1,2,3, Fabian Kern1,2,4, Peter Ebert2,3, Nina Baumgarten1,2,5,6, Marcel H Schulz1,2,5,6.
Abstract
SUMMARY: Prediction of transcription factor (TF) binding from epigenetics data and integrative analysis thereof are challenging. Here, we present TEPIC 2 a framework allowing for fast, accurate and versatile prediction, and analysis of TF binding from epigenetics data: it supports 30 species with binding motifs, computes TF gene and scores up to two orders of magnitude faster than before due to improved implementation, and offers easy-to-use machine learning pipelines for integrated analysis of TF binding predictions with gene expression data allowing the identification of important TFs.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30304373 PMCID: PMC6499243 DOI: 10.1093/bioinformatics/bty856
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(a) Runtime comparison of TEPIC to TEPIC 2 using a subset of 458 human TFs. While the original implementation ran up to 1300 minutes to compute TFBS, TEPIC 2 is able to compute TF affinities for peaks in the vicinity of genes in at most 15 minutes. We used four cell line samples and three primary human hepatocyte samples (LiHe1–3) to conduct the runtime experiments. (b) We compared TEPIC TF affinities computed in footprints called with HINT-BC (Gusmao ) in four different cell-lines in terms of AUPR against PIQ (Sherwood ) and an extension of the widely used method Fimo, called Fimo-Prior (Cuellar-Partida ). Notably, TF affinities computed with TEPIC outperform both PIQ and Fimo-Prior