Literature DB >> 25361964

iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.

Hao Lin1, En-Ze Deng2, Hui Ding2, Wei Chen3, Kuo-Chen Chou4.   

Abstract

The σ(54) promoters are unique in prokaryotic genome and responsible for transcripting carbon and nitrogen-related genes. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the σ(54) promoters. Here, a predictor called 'iPro54-PseKNC' was developed. In the predictor, the samples of DNA sequences were formulated by a novel feature vector called 'pseudo k-tuple nucleotide composition', which was further optimized by the incremental feature selection procedure. The performance of iPro54-PseKNC was examined by the rigorous jackknife cross-validation tests on a stringent benchmark data set. As a user-friendly web-server, iPro54-PseKNC is freely accessible at http://lin.uestc.edu.cn/server/iPro54-PseKNC. For the convenience of the vast majority of experimental scientists, a step-by-step protocol guide was provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented in this paper just for its integrity. Meanwhile, we also discovered through an in-depth statistical analysis that the distribution of distances between the transcription start sites and the translation initiation sites were governed by the gamma distribution, which may provide a fundamental physical principle for studying the σ(54) promoters.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25361964      PMCID: PMC4245931          DOI: 10.1093/nar/gku1019

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  89 in total

1.  Protein-length distributions for the three domains of life.

Authors:  J Zhang
Journal:  Trends Genet       Date:  2000-03       Impact factor: 11.639

2.  Predicting subcellular localization of proteins in a hybridization space.

Authors:  Yu-Dong Cai; Kuo-Chen Chou
Journal:  Bioinformatics       Date:  2004-02-05       Impact factor: 6.937

3.  Generic eukaryotic core promoter prediction using structural features of DNA.

Authors:  Thomas Abeel; Yvan Saeys; Eric Bonnet; Pierre Rouzé; Yves Van de Peer
Journal:  Genome Res       Date:  2007-12-20       Impact factor: 9.043

4.  An optimization approach to predicting protein structural class from amino acid composition.

Authors:  C T Zhang; K C Chou
Journal:  Protein Sci       Date:  1992-03       Impact factor: 6.725

5.  An Improved Position Weight Matrix method based on an entropy measure for the recognition of prokaryotic promoters.

Authors:  Qinqin Wu; Jiajun Wang; Hong Yan
Journal:  Int J Data Min Bioinform       Date:  2011       Impact factor: 0.667

Review 6.  Structure and function of bacterial sigma factors.

Authors:  J D Helmann; M J Chamberlin
Journal:  Annu Rev Biochem       Date:  1988       Impact factor: 23.643

7.  SLLE for predicting membrane protein types.

Authors:  Meng Wang; Jie Yang; Zhi-Jie Xu; Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2005-01-07       Impact factor: 2.691

8.  iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition.

Authors:  Yan Xu; Xin Wen; Li-Shu Wen; Ling-Yun Wu; Nai-Yang Deng; Kuo-Chen Chou
Journal:  PLoS One       Date:  2014-08-14       Impact factor: 3.240

9.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

10.  ProSOM: core promoter prediction based on unsupervised clustering of DNA physical profiles.

Authors:  Thomas Abeel; Yvan Saeys; Pierre Rouzé; Yves Van de Peer
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more
  109 in total

1.  Prediction of Protein-Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures.

Authors:  Guang-Hui Liu; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-11-12       Impact factor: 1.843

2.  repRNA: a web server for generating various feature vectors of RNA sequences.

Authors:  Bin Liu; Fule Liu; Longyun Fang; Xiaolong Wang; Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2015-06-18       Impact factor: 3.291

3.  TargetFreeze: Identifying Antifreeze Proteins via a Combination of Weights using Sequence Evolutionary Information and Pseudo Amino Acid Composition.

Authors:  Xue He; Ke Han; Jun Hu; Hui Yan; Jing-Yu Yang; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-06-10       Impact factor: 1.843

4.  iN6-methylat (5-step): identifying DNA N6-methyladenine sites in rice genome using continuous bag of nucleobases via Chou's 5-step rule.

Authors:  Nguyen Quoc Khanh Le
Journal:  Mol Genet Genomics       Date:  2019-05-04       Impact factor: 3.291

5.  Bastion3: a two-layer ensemble predictor of type III secreted effectors.

Authors:  Jiawei Wang; Jiahui Li; Bingjiao Yang; Ruopeng Xie; Tatiana T Marquez-Lago; André Leier; Morihiro Hayashida; Tatsuya Akutsu; Yanju Zhang; Kuo-Chen Chou; Joel Selkrig; Tieli Zhou; Jiangning Song; Trevor Lithgow
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

6.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

7.  Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  Mol Biol Rep       Date:  2018-09-20       Impact factor: 2.316

Review 8.  Structural Variability in the RLR-MAVS Pathway and Sensitive Detection of Viral RNAs.

Authors:  Qiu-Xing Jiang
Journal:  Med Chem       Date:  2019       Impact factor: 2.745

9.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

10.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.

Authors:  Weizhong Lin; Dong Xu
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.