Literature DB >> 34962264

Detection of transcription factors binding to methylated DNA by deep recurrent neural network.

Hongfei Li1, Yue Gong1, Yifeng Liu2, Hao Lin3, Guohua Wang1.   

Abstract

Transcription factors (TFs) are proteins specifically involved in gene expression regulation. It is generally accepted in epigenetics that methylated nucleotides could prevent the TFs from binding to DNA fragments. However, recent studies have confirmed that some TFs have capability to interact with methylated DNA fragments to further regulate gene expression. Although biochemical experiments could recognize TFs binding to methylated DNA sequences, these wet experimental methods are time-consuming and expensive. Machine learning methods provide a good choice for quickly identifying these TFs without experimental materials. Thus, this study aims to design a robust predictor to detect methylated DNA-bound TFs. We firstly proposed using tripeptide word vector feature to formulate protein samples. Subsequently, based on recurrent neural network with long short-term memory, a two-step computational model was designed. The first step predictor was utilized to discriminate transcription factors from non-transcription factors. Once proteins were predicted as TFs, the second step predictor was employed to judge whether the TFs can bind to methylated DNA. Through the independent dataset test, the accuracies of the first step and the second step are 86.63% and 73.59%, respectively. In addition, the statistical analysis of the distribution of tripeptides in training samples showed that the position and number of some tripeptides in the sequence could affect the binding of TFs to methylated DNA. Finally, on the basis of our model, a free web server was established based on the proposed model, which can be available at https://bioinfor.nefu.edu.cn/TFPM/.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep recurrent neural network; methylated DNA; transcription factors; tripeptide; tripeptide word vector

Mesh:

Substances:

Year:  2022        PMID: 34962264     DOI: 10.1093/bib/bbab533

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  5 in total

1.  SCMRSA: a New Approach for Identifying and Analyzing Anti-MRSA Peptides Using Estimated Propensity Scores of Dipeptides.

Authors:  Phasit Charoenkwan; Sakawrat Kanthawong; Nalini Schaduangrat; Pietro Li'; Mohammad Ali Moni; Watshara Shoombuatong
Journal:  ACS Omega       Date:  2022-09-01

2.  Heterogeneity Analysis of Bladder Cancer Based on DNA Methylation Molecular Profiling.

Authors:  Shuyu Wang; Dali Xu; Bo Gao; Shuhan Yan; Yiwei Sun; Xinxing Tang; Yanjia Jiao; Shan Huang; Shumei Zhang
Journal:  Front Oncol       Date:  2022-06-07       Impact factor: 5.738

3.  Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.

Authors:  Md Mehedi Hasan; Sho Tsukiyama; Jae Youl Cho; Hiroyuki Kurata; Md Ashad Alam; Xiaowen Liu; Balachandran Manavalan; Hong-Wen Deng
Journal:  Mol Ther       Date:  2022-05-06       Impact factor: 12.910

4.  Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network.

Authors:  Shang Gao; Jalees Rehman; Yang Dai
Journal:  Comput Struct Biotechnol J       Date:  2022-07-13       Impact factor: 6.155

5.  Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced G-Gap Dipeptide Composition.

Authors:  Quang H Nguyen; Hoang V Tran; Binh P Nguyen; Trang T T Do
Journal:  ACS Omega       Date:  2022-08-30
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.