Literature DB >> 33371507

pcPromoter-CNN: A CNN-Based Prediction and Classification of Promoters.

Muhammad Shujaat1,2, Abdul Wahab1, Hilal Tayara3, Kil To Chong1,4.   

Abstract

A promoter is a small region within the DNA structure that has an important role in initiating transcription of a specific gene in the genome. Different types of promoters are recognized by their different functions. Due to the importance of promoter functions, computational tools for the prediction and classification of a promoter are highly desired. Promoters resemble each other; therefore, their precise classification is an important challenge. In this study, we propose a convolutional neural network (CNN)-based tool, the pcPromoter-CNN, for application in the prediction of promotors and their classification into subclasses σ70, σ54, σ38, σ32, σ28 and σ24. This CNN-based tool uses a one-hot encoding scheme for promoter classification. The tools architecture was trained and tested on a benchmark dataset. To evaluate its classification performance, we used four evaluation metrics. The model exhibited notable improvement over that of existing state-of-the-art tools.

Keywords:  bioinformatics; computational biology; convolution neural network (CNN); non-promoters; promoters

Year:  2020        PMID: 33371507      PMCID: PMC7767505          DOI: 10.3390/genes11121529

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


  29 in total

Review 1.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

2.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

Review 3.  Promoter structure, promoter recognition, and transcription activation in prokaryotes.

Authors:  S Busby; R H Ebright
Journal:  Cell       Date:  1994-12-02       Impact factor: 41.582

4.  iSuc-PseOpt: Identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset.

Authors:  Jianhua Jia; Zi Liu; Xuan Xiao; Bingxiang Liu; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2015-12-23       Impact factor: 3.365

5.  iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC.

Authors:  Pengmian Feng; Hui Ding; Hui Yang; Wei Chen; Hao Lin; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2017-03-29

6.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

7.  DNC4mC-Deep: Identification and Analysis of DNA N4-Methylcytosine Sites Based on Different Encoding Schemes By Using Deep Learning.

Authors:  Abdul Wahab; Omid Mahmoudi; Jeehong Kim; Kil To Chong
Journal:  Cells       Date:  2020-07-22       Impact factor: 6.600

8.  The primary σ factor in Escherichia coli can access the transcription elongation complex from solution in vivo.

Authors:  Seth R Goldman; Nikhil U Nair; Christopher D Wells; Bryce E Nickels; Ann Hochschild
Journal:  Elife       Date:  2015-09-15       Impact factor: 8.713

9.  RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond.

Authors:  Socorro Gama-Castro; Heladia Salgado; Alberto Santos-Zavaleta; Daniela Ledezma-Tejeida; Luis Muñiz-Rascado; Jair Santiago García-Sotelo; Kevin Alquicira-Hernández; Irma Martínez-Flores; Lucia Pannier; Jaime Abraham Castro-Mondragón; Alejandra Medina-Rivera; Hilda Solano-Lira; César Bonavides-Martínez; Ernesto Pérez-Rueda; Shirley Alquicira-Hernández; Liliana Porrón-Sotelo; Alejandra López-Fuentes; Anastasia Hernández-Koutoucheva; Víctor Del Moral-Chávez; Fabio Rinaldi; Julio Collado-Vides
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

10.  Classifying Promoters by Interpreting the Hidden Information of DNA Sequences via Deep Learning and Combination of Continuous FastText N-Grams.

Authors:  Nguyen Quoc Khanh Le; Edward Kien Yee Yapp; N Nagasundaram; Hui-Yuan Yeh
Journal:  Front Bioeng Biotechnol       Date:  2019-11-05
View more
  5 in total

1.  Identification of piRNA disease associations using deep learning.

Authors:  Syed Danish Ali; Hilal Tayara; Kil To Chong
Journal:  Comput Struct Biotechnol J       Date:  2022-03-03       Impact factor: 7.271

2.  Generative Adversarial Networks for Creating Synthetic Nucleic Acid Sequences of Cat Genome.

Authors:  Debapriya Hazra; Mi-Ryung Kim; Yung-Cheol Byun
Journal:  Int J Mol Sci       Date:  2022-03-28       Impact factor: 5.923

3.  PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model.

Authors:  Daryl Hernández; Nicolás Jara; Mauricio Araya; Roberto E Durán; Carlos Buil-Aranda
Journal:  Genes (Basel)       Date:  2022-06-23       Impact factor: 4.141

4.  Database of Potential Promoter Sequences in the Capsicum annuum Genome.

Authors:  Valentina Rudenko; Eugene Korotkov
Journal:  Biology (Basel)       Date:  2022-07-26

5.  UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites.

Authors:  Arslan Siraj; Dae Yeong Lim; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-05-11       Impact factor: 4.096

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.