Literature DB >> 30649179

MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Meng Zhang1, Fuyi Li2,3, Tatiana T Marquez-Lago4,5, André Leier4,5, Cunshuo Fan6, Chee Keong Kwoh7, Kuo-Chen Chou8, Jiangning Song2,3,9, Cangzhi Jia1,6.   

Abstract

MOTIVATION: Promoters are short DNA consensus sequences that are localized proximal to the transcription start sites of genes, allowing transcription initiation of particular genes. However, the precise prediction of promoters remains a challenging task because individual promoters often differ from the consensus at one or more positions.
RESULTS: In this study, we present a new multi-layer computational approach, called MULTiPly, for recognizing promoters and their specific types. MULTiPly took into account the sequences themselves, including both local information such as k-tuple nucleotide composition, dinucleotide-based auto covariance and global information of the entire samples based on bi-profile Bayes and k-nearest neighbour feature encodings. Specifically, the F-score feature selection method was applied to identify the best unique type of feature prediction results, in combination with other types of features that were subsequently added to further improve the prediction performance of MULTiPly. Benchmarking experiments on the benchmark dataset and comparisons with five state-of-the-art tools show that MULTiPly can achieve a better prediction performance on 5-fold cross-validation and jackknife tests. Moreover, the superiority of MULTiPly was also validated on a newly constructed independent test dataset. MULTiPly is expected to be used as a useful tool that will facilitate the discovery of both general and specific types of promoters in the post-genomic era.
AVAILABILITY AND IMPLEMENTATION: The MULTiPly webserver and curated datasets are freely available at http://flagshipnt.erc.monash.edu/MULTiPly/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30649179      PMCID: PMC6736106          DOI: 10.1093/bioinformatics/btz016

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  68 in total

1.  Compilation and analysis of sigma(54)-dependent promoter sequences.

Authors:  H Barrios; B Valderrama; E Morett
Journal:  Nucleic Acids Res       Date:  1999-11-15       Impact factor: 16.971

2.  Sigma70 promoters in Escherichia coli: specific transcription in dense regions of overlapping promoter-like signals.

Authors:  Araceli M Huerta; Julio Collado-Vides
Journal:  J Mol Biol       Date:  2003-10-17       Impact factor: 5.469

3.  Energetic contributions to the initiation of transcription in E. coli.

Authors:  Jayanthi Ramprakash; Frederick P Schwarz
Journal:  Biophys Chem       Date:  2008-09-18       Impact factor: 2.352

4.  A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation.

Authors:  Qiwen Dong; Shuigeng Zhou; Jihong Guan
Journal:  Bioinformatics       Date:  2009-08-25       Impact factor: 6.937

5.  The recognition and prediction of sigma70 promoters in Escherichia coli K-12.

Authors:  Qian-Zhong Li; Hao Lin
Journal:  J Theor Biol       Date:  2006-04-17       Impact factor: 2.691

6.  Prediction of supertype-specific HLA class I binding peptides using support vector machines.

Authors:  Guang Lan Zhang; Ivana Bozic; Chee Keong Kwoh; J Thomas August; Vladimir Brusic
Journal:  J Immunol Methods       Date:  2007-01-25       Impact factor: 2.303

7.  DiProDB: a database for dinucleotide properties.

Authors:  Maik Friedel; Swetlana Nikolajewa; Jürgen Sühnel; Thomas Wilhelm
Journal:  Nucleic Acids Res       Date:  2008-09-19       Impact factor: 16.971

8.  The cross-species prediction of bacterial promoters using a support vector machine.

Authors:  Michael Towsey; Peter Timms; James Hogan; Sarah A Mathews
Journal:  Comput Biol Chem       Date:  2008-07-15       Impact factor: 2.877

9.  Computational identification of protein methylation sites through bi-profile Bayes feature extraction.

Authors:  Jianlin Shao; Dong Xu; Sau-Na Tsai; Yifei Wang; Sai-Ming Ngai
Journal:  PLoS One       Date:  2009-03-17       Impact factor: 3.240

10.  Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.

Authors:  Yanzhi Guo; Lezheng Yu; Zhining Wen; Menglong Li
Journal:  Nucleic Acids Res       Date:  2008-04-04       Impact factor: 16.971

View more
  21 in total

1.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors:  Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m^6 A) based on multiple weights and feature stitching.

Authors:  Jing Li; Shida He; Fei Guo; Quan Zou
Journal:  RNA Biol       Date:  2021-02-12       Impact factor: 4.652

4.  TSSFinder-fast and accurate ab initio prediction of the core promoter in eukaryotic genomes.

Authors:  Mauro de Medeiros Oliveira; Igor Bonadio; Alicia Lie de Melo; Glaucia Mendes Souza; Alan Mitchell Durham
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

5.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

6.  Positive-unlabelled learning of glycosylation sites in the human proteome.

Authors:  Fuyi Li; Yang Zhang; Anthony W Purcell; Geoffrey I Webb; Kuo-Chen Chou; Trevor Lithgow; Chen Li; Jiangning Song
Journal:  BMC Bioinformatics       Date:  2019-03-06       Impact factor: 3.169

7.  Its2vec: Fungal Species Identification Using Sequence Embedding and Random Forest Classification.

Authors:  Chao Wang; Ying Zhang; Shuguang Han
Journal:  Biomed Res Int       Date:  2020-05-27       Impact factor: 3.411

8.  pcPromoter-CNN: A CNN-Based Prediction and Classification of Promoters.

Authors:  Muhammad Shujaat; Abdul Wahab; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2020-12-21       Impact factor: 4.096

9.  Predicting ATP-Binding Cassette Transporters Using the Random Forest Method.

Authors:  Ruiyan Hou; Lida Wang; Yi-Jun Wu
Journal:  Front Genet       Date:  2020-03-25       Impact factor: 4.599

10.  iCrotoK-PseAAC: Identify lysine crotonylation sites by blending position relative statistical features according to the Chou's 5-step rule.

Authors:  Sharaf Jameel Malebary; Muhammad Safi Ur Rehman; Yaser Daanial Khan
Journal:  PLoS One       Date:  2019-11-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.