Literature DB >> 31598637

Expectation pooling: an effective and interpretable pooling method for predicting DNA-protein binding.

Xiao Luo1, Xinming Tu2, Yang Ding2, Ge Gao2, Minghua Deng1,3.   

Abstract

MOTIVATION: Convolutional neural networks (CNNs) have outperformed conventional methods in modeling the sequence specificity of DNA-protein binding. While previous studies have built a connection between CNNs and probabilistic models, simple models of CNNs cannot achieve sufficient accuracy on this problem. Recently, some methods of neural networks have increased performance using complex neural networks whose results cannot be directly interpreted. However, it is difficult to combine probabilistic models and CNNs effectively to improve DNA-protein binding predictions.
RESULTS: In this article, we present a novel global pooling method: expectation pooling for predicting DNA-protein binding. Our pooling method stems naturally from the expectation maximization algorithm, and its benefits can be interpreted both statistically and via deep learning theory. Through experiments, we demonstrate that our pooling method improves the prediction performance DNA-protein binding. Our interpretable pooling method combines probabilistic ideas with global pooling by taking the expectations of inputs without increasing the number of parameters. We also analyze the hyperparameters in our method and propose optional structures to help fit different datasets. We explore how to effectively utilize these novel pooling methods and show that combining statistical methods with deep learning is highly beneficial, which is promising and meaningful for future studies in this field.
AVAILABILITY AND IMPLEMENTATION: All code is public in https://github.com/gao-lab/ePooling. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Year:  2020        PMID: 31598637     DOI: 10.1093/bioinformatics/btz768

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Authors:  Jalil Villalobos-Alva; Luis Ochoa-Toledo; Mario Javier Villalobos-Alva; Atocha Aliseda; Fernando Pérez-Escamirosa; Nelly F Altamirano-Bustamante; Francine Ochoa-Fernández; Ricardo Zamora-Solís; Sebastián Villalobos-Alva; Cristina Revilla-Monsalve; Nicolás Kemper-Valverde; Myriam M Altamirano-Bustamante
Journal:  Front Bioeng Biotechnol       Date:  2022-07-07

Review 2.  Obtaining genetics insights from deep learning via explainable artificial intelligence.

Authors:  Gherman Novakovsky; Nick Dexter; Maxwell W Libbrecht; Wyeth W Wasserman; Sara Mostafavi
Journal:  Nat Rev Genet       Date:  2022-10-03       Impact factor: 59.581

3.  MAResNet: predicting transcription factor binding sites by combining multi-scale bottom-up and top-down attention and residual network.

Authors:  Ke Han; Long-Chen Shen; Yi-Heng Zhu; Jian Xu; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

4.  SAResNet: self-attention residual network for predicting DNA-protein binding.

Authors:  Long-Chen Shen; Yan Liu; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

5.  Modern deep learning in bioinformatics.

Authors:  Haoyang Li; Shuye Tian; Yu Li; Qiming Fang; Renbo Tan; Yijie Pan; Chao Huang; Ying Xu; Xin Gao
Journal:  J Mol Cell Biol       Date:  2020-10-30       Impact factor: 6.216

6.  Genomics enters the deep learning era.

Authors:  Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ       Date:  2022-06-24       Impact factor: 3.061

7.  LW-CovidNet: Automatic covid-19 lung infection detection from chest X-ray images.

Authors:  Noor Ahmed; Xin Tan; Lizhuang Ma
Journal:  IET Image Process       Date:  2022-09-24       Impact factor: 1.773

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.