Sirajul Salekin1, Jianqiu Michelle Zhang1, Yufei Huang1,2. 1. Electrical and Computer Engineering Department, University of Texas at San Antonio, San Antonio, TX, USA. 2. Department of Epidemiology and Biostatistics, University of Texas Health Science Center, San Antonio, TX, USA.
Abstract
Motivation: Transcription factor (TF) binds to the promoter region of a gene to control gene expression. Identifying precise TF binding sites (TFBSs) is essential for understanding the detailed mechanisms of TF-mediated gene regulation. However, there is a shortage of computational approach that can deliver single base pair resolution prediction of TFBS. Results: In this paper, we propose DeepSNR, a Deep Learning algorithm for predicting TF binding location at Single Nucleotide Resolution de novo from DNA sequence. DeepSNR adopts a novel deconvolutional network (deconvNet) model and is inspired by the similarity to image segmentation by deconvNet. The proposed deconvNet architecture is constructed on top of 'DeepBind' and we trained the entire model using TF-specific data from ChIP-exonuclease (ChIP-exo) experiments. DeepSNR has been shown to outperform motif search-based methods for several evaluation metrics. We have also demonstrated the usefulness of DeepSNR in the regulatory analysis of TFBS as well as in improving the TFBS prediction specificity using ChIP-seq data. Availability and implementation: DeepSNR is available open source in the GitHub repository (https://github.com/sirajulsalekin/DeepSNR). Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: Transcription factor (TF) binds to the promoter region of a gene to control gene expression. Identifying precise TF binding sites (TFBSs) is essential for understanding the detailed mechanisms of TF-mediated gene regulation. However, there is a shortage of computational approach that can deliver single base pair resolution prediction of TFBS. Results: In this paper, we propose DeepSNR, a Deep Learning algorithm for predicting TF binding location at Single Nucleotide Resolution de novo from DNA sequence. DeepSNR adopts a novel deconvolutional network (deconvNet) model and is inspired by the similarity to image segmentation by deconvNet. The proposed deconvNet architecture is constructed on top of 'DeepBind' and we trained the entire model using TF-specific data from ChIP-exonuclease (ChIP-exo) experiments. DeepSNR has been shown to outperform motif search-based methods for several evaluation metrics. We have also demonstrated the usefulness of DeepSNR in the regulatory analysis of TFBS as well as in improving the TFBS prediction specificity using ChIP-seq data. Availability and implementation: DeepSNR is available open source in the GitHub repository (https://github.com/sirajulsalekin/DeepSNR). Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: David Martin; Cristina Pantoja; Ana Fernández Miñán; Christian Valdes-Quezada; Eduardo Moltó; Fuencisla Matesanz; Ozren Bogdanović; Elisa de la Calle-Mustienes; Orlando Domínguez; Leila Taher; Mayra Furlan-Magaril; Antonio Alcina; Susana Cañón; María Fedetz; María A Blasco; Paulo S Pereira; Ivan Ovcharenko; Félix Recillas-Targa; Lluís Montoliu; Miguel Manzanares; Roderic Guigó; Manuel Serrano; Fernando Casares; José Luis Gómez-Skarmeta Journal: Nat Struct Mol Biol Date: 2011-05-22 Impact factor: 15.369
Authors: Troy W Whitfield; Jie Wang; Patrick J Collins; E Christopher Partridge; Shelley Force Aldred; Nathan D Trinklein; Richard M Myers; Zhiping Weng Journal: Genome Biol Date: 2012-09-26 Impact factor: 13.583