| Literature DB >> 35639508 |
Li Tang1, Zhizhou Zhong1, Yisheng Lin1, Yifei Yang1, Jun Wang2, James F Martin3,4,5, Min Li1.
Abstract
Long distance enhancers can physically interact with promoters to regulate gene expression through formation of enhancer-promoter (E-P) interactions. Identification of E-P interactions is also important for profound understanding of normal developmental and disease-associated risk variants. Although the state-of-art predictive computation methods facilitate the identification of E-P interactions to a certain extent, currently there is no efficient method that can meet various requirements of usage. Here we developed EPIXplorer, a user-friendly web server for efficient prediction, analysis and visualization of E-P interactions. EPIXplorer integrates 9 robust predictive algorithms, supports multiple types of 3D contact data and multi-omics data as input. The output from EPIXplorer is scored, fully annotated by regulatory elements and risk single-nucleotide polymorphisms (SNPs). In addition, the Visualization and Downstream module provide further functional analysis, all the output files and high-quality images are available for download. Together, EPIXplorer provides a user-friendly interface to predict the E-P interactions in an acceptable time, as well as understand how the genome-wide association study (GWAS) variants influence disease pathology by altering DNA looping between enhancers and the target gene promoters. EPIXplorer is available at https://www.csuligroup.com/EPIXplorer.Entities:
Year: 2022 PMID: 35639508 PMCID: PMC9252822 DOI: 10.1093/nar/gkac397
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
The predictive algorithms integrated in EPIXplorer
| Strategy | Input | Prediction output | Downstream analysis/ Visualization | Advantage | Disadvantage | |
|---|---|---|---|---|---|---|
| PreSTIGE | Unsupervised (Distance-based) | Distance, H3K4me1, RNA-seq | E-P interaction | No/No | Low running times, does not need 3D contact | Low accuracy |
| Ernst et al. | Unsupervised (Correlation-based) | CTCF, histone marks, TF binding | E-P interaction | No/No | Low running times, does not need 3D contact | Low accuracy |
| Thurman et al. | Unsupervised (Correlation-based) | DHS | E-P interaction | No/No | Low running times, does not need 3D contact | Low accuracy |
| EpiTensor | Unsupervised (Decomposition-based) | DHS, histone marks, RNA-seq | 3D interactions | No/No | does not need 3D contact | Low accuracy, slow speed |
| IM-PET | Supervised (Random Forest) | DNA, histone marks, TFBSs, RNA-seq + ChIA-PET | E-P interaction | No/No | High accuracy | Need enhancer locus and signals, classification only |
| JEME | Supervised (Linear Regression) | DHS, distance, eRNA, histone marks + ChIA-PET/Hi-C/eQTL | E-P interaction | No/No | High accuracy, does not need 3D contact | Slow speed, classification only |
| TargetFinder | Supervised (Gradient Tree Boosting) | DHS, DNA methylation, TFBSs, histone marks, CAGE + Hi-C | E-P interaction | No/No | High accuracy | Need 3D contact, classification only |
| 3DPredictor | Supervised (Gradient Boosting) | CTCF, distance, RNA-seq + Hi-C | 3D interactions | No/No | High accuracy | Need 3D contact, slow speed, classification only |
| LoopPredictor | Supervised (Random Forest, Gradient Boosted Regression Trees) | RNA-seq, ChIP-seq, ATAC-seq, RRBS + HiChIP | E-P interaction, Enhancer-Enhancer (E-E) interaction, Promoter-Promoter (P-P) interaction | No/No | High accuracy, both classification and regression | Need 3D contact, slow speed |
Figure 1.The overall design of EPIXplorer. Multi-omics datasets and/or 3D contact data can be prepared as input. EPIXplorer provides a practical guidance from three aspects: By Model Type (BMT), By Input Type (BIT), and By Bio Sample (BBS). Downstream and visualization modules perform further analysis for predicted loops.
Figure 2.Performance of EPIXplorer. (A) The AUPR score and ACC of 9 integrated algorithms in EPIXplorer evaluated with K562 and GM12878 BENGI datasets and gold standard loop sets. (B) The running time of 9 integrated algorithms in EPIXplorer.
Figure 3.Example application of EPIXplorer implemented with LoopPredictor. (A) The genome-wide distribution of K562 predicted loops. (B) Validation of predicted loops by published CRISPRi contacts. (C) Ranking list of significantly enriched motifs. (D) Top enriched GO terms for predicted loop anchors.