Literature DB >> 31318408

Integrating distal and proximal information to predict gene expression via a densely connected convolutional neural network.

Wanwen Zeng1, Yong Wang2,3, Rui Jiang1.   

Abstract

MOTIVATION: Interactions among cis-regulatory elements such as enhancers and promoters are main driving forces shaping context-specific chromatin structure and gene expression. Although there have been computational methods for predicting gene expression from genomic and epigenomic information, most of them neglect long-range enhancer-promoter interactions, due to the difficulty in precisely linking regulatory enhancers to target genes. Recently, HiChIP, a novel high-throughput experimental approach, has generated comprehensive data on high-resolution interactions between promoters and distal enhancers. Moreover, plenty of studies suggest that deep learning achieves state-of-the-art performance in epigenomic signal prediction, and thus promoting the understanding of regulatory elements. In consideration of these two factors, we integrate proximal promoter sequences and HiChIP distal enhancer-promoter interactions to accurately predict gene expression.
RESULTS: We propose DeepExpression, a densely connected convolutional neural network, to predict gene expression using both promoter sequences and enhancer-promoter interactions. We demonstrate that our model consistently outperforms baseline methods, not only in the classification of binary gene expression status but also in regression of continuous gene expression levels, in both cross-validation experiments and cross-cell line predictions. We show that the sequential promoter information is more informative than the experimental enhancer information; meanwhile, the enhancer-promoter interactions within ±100 kbp around the TSS of a gene are most beneficial. We finally visualize motifs in both promoter and enhancer regions and show the match of identified sequence signatures with known motifs. We expect to see a wide spectrum of applications using HiChIP data in deciphering the mechanism of gene regulation.
AVAILABILITY AND IMPLEMENTATION: DeepExpression is freely available at https://github.com/wanwenzeng/DeepExpression. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2020        PMID: 31318408     DOI: 10.1093/bioinformatics/btz562

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning-based neural network.

Authors:  Xiang Zhou; Hua Chai; Huiying Zhao; Ching-Hsing Luo; Yuedong Yang
Journal:  Gigascience       Date:  2020-07-01       Impact factor: 6.524

2.  Interpreting Neural Networks for Biological Sequences by Learning Stochastic Masks.

Authors:  Johannes Linder; Alyssa La Fleur; Zibo Chen; Ajasja Ljubeti; David Baker; Sreeram Kannan; Georg Seelig
Journal:  Nat Mach Intell       Date:  2022-01-25

3.  Prediction of single-cell gene expression for transcription factor analysis.

Authors:  Fatemeh Behjati Ardakani; Kathrin Kattler; Tobias Heinen; Florian Schmidt; David Feuerborn; Gilles Gasparoni; Konstantin Lepikhov; Patrick Nell; Jan Hengstler; Jörn Walter; Marcel H Schulz
Journal:  Gigascience       Date:  2020-10-30       Impact factor: 6.524

Review 4.  Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.

Authors:  Emre Arslan; Jonathan Schulz; Kunal Rai
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2021-07-07       Impact factor: 10.680

5.  Accurate and highly interpretable prediction of gene expression from histone modifications.

Authors:  Fabrizio Frasca; Matteo Matteucci; Michele Leone; Marco J Morelli; Marco Masseroli
Journal:  BMC Bioinformatics       Date:  2022-04-26       Impact factor: 3.307

6.  Integrating Long-Range Regulatory Interactions to Predict Gene Expression Using Graph Convolutional Networks.

Authors:  Jeremy Bigness; Xavier Loinaz; Shalin Patel; Erica Larschan; Ritambhara Singh
Journal:  J Comput Biol       Date:  2022-03-21       Impact factor: 1.549

7.  intePareto: an R package for integrative analyses of RNA-Seq and ChIP-Seq data.

Authors:  Yingying Cao; Simo Kitanovski; Daniel Hoffmann
Journal:  BMC Genomics       Date:  2020-12-29       Impact factor: 3.969

8.  A machine learning framework for the prediction of chromatin folding in Drosophila using epigenetic features.

Authors:  Michal B Rozenwald; Aleksandra A Galitsyna; Grigory V Sapunov; Ekaterina E Khrameeva; Mikhail S Gelfand
Journal:  PeerJ Comput Sci       Date:  2020-11-30

Review 9.  Towards the Genetic Architecture of Complex Gene Expression Traits: Challenges and Prospects for eQTL Mapping in Humans.

Authors:  Chaeyoung Lee
Journal:  Genes (Basel)       Date:  2022-01-26       Impact factor: 4.096

10.  DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers.

Authors:  Shengquan Chen; Mingxin Gan; Hairong Lv; Rui Jiang
Journal:  Genomics Proteomics Bioinformatics       Date:  2021-02-11       Impact factor: 6.409

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.