Literature DB >> 30529532

iPSW(2L)-PseKNC: A two-layer predictor for identifying promoters and their strength by hybrid features via pseudo K-tuple nucleotide composition.

Xuan Xiao1, Zhao-Chun Xu2, Wang-Ren Qiu3, Peng Wang4, Hui-Ting Ge4, Kuo-Chen Chou5.   

Abstract

The promoter is a regulatory DNA region about 81-1000 base pairs long, usually located near the transcription start site (TSS) along upstream of a given gene. By combining a certain protein called transcription factor, the promoter provides the starting point for regulated gene transcription, and hence plays a vitally important role in gene transcriptional regulation. With explosive growth of DNA sequences in the post-genomic age, it has become an urgent challenge to develop computational method for effectively identifying promoters because the information thus obtained is very useful for both basic research and drug development. Although some prediction methods were developed in this regard, most of them were limited at merely identifying whether a query DNA sequence being of a promoter or not. However, based on their strength-distinct levels for transcriptional activation and expression, promoter should be divided into two categories: strong and weak types. Here a new two-layer predictor, called "iPSW(2L)-PseKNC", was developed by fusing the physicochemical properties of nucleotides and their nucleotide density into PseKNC (pseudo K-tuple nucleotide composition). Its 1st-layer serves to predict whether a query DNA sequence sample is of promoter or not, while its 2nd-layer is able to predict the strength of promoters. It has been observed through rigorous cross-validations that the 1st-layer sub-predictor is remarkably superior to the existing state-of-the-art predictors in identifying the promoters and non-promoters, and that the 2nd-layer sub-predictor can do what is beyond the reach of the existing predictors. Moreover, the web-server for iPSW(2L)-PseKNC has been established at http://www.jci-bioinfo.cn/iPSW(2L)-PseKNC, by which the majority of experimental scientists can easily get the results they need.
Copyright © 2018 Elsevier Inc. All rights reserved.

Keywords:  Chou's intuitive metrics; Nucleotide density; Promoter strength; PseKNC; SVM; iPSW(2L)-PseKNC

Mesh:

Year:  2018        PMID: 30529532     DOI: 10.1016/j.ygeno.2018.12.001

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  13 in total

Review 1.  Recent advances in genetic engineering tools based on synthetic biology.

Authors:  Jun Ren; Jingyu Lee; Dokyun Na
Journal:  J Microbiol       Date:  2020-01-02       Impact factor: 3.422

Review 2.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

3.  Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors:  Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

4.  iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species.

Authors:  Pengyu Zhang; Hongming Zhang; Hao Wu
Journal:  Nucleic Acids Res       Date:  2022-10-14       Impact factor: 19.160

5.  iTTCA-MFF: identifying tumor T cell antigens based on multiple feature fusion.

Authors:  Hongliang Zou; Fan Yang; Zhijian Yin
Journal:  Immunogenetics       Date:  2022-03-05       Impact factor: 3.330

6.  Plant protection product dose rate estimation in apple orchards using a fuzzy logic system.

Authors:  Peter Berk; Denis Stajnko; Marko Hočevar; Aleš Malneršič; Viktor Jejčič; Aleš Belšak
Journal:  PLoS One       Date:  2019-04-24       Impact factor: 3.240

7.  iMethylK_pseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou's 5-steps Rule.

Authors:  Sarah Ilyas; Waqar Hussain; Adeel Ashraf; Yaser Daanial Khan; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Curr Genomics       Date:  2019-05       Impact factor: 2.236

8.  iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network.

Authors:  Ang Sun; Xuan Xiao; Zhaochun Xu
Journal:  Comput Math Methods Med       Date:  2021-01-05       Impact factor: 2.238

9.  SNARE-CNN: a 2D convolutional neural network architecture to identify SNARE proteins from high-throughput sequencing data.

Authors:  Nguyen Quoc Khanh Le; Van-Nui Nguyen
Journal:  PeerJ Comput Sci       Date:  2019-02-25

10.  Classifying Promoters by Interpreting the Hidden Information of DNA Sequences via Deep Learning and Combination of Continuous FastText N-Grams.

Authors:  Nguyen Quoc Khanh Le; Edward Kien Yee Yapp; N Nagasundaram; Hui-Yuan Yeh
Journal:  Front Bioeng Biotechnol       Date:  2019-11-05
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.