Literature DB >> 33232970

Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

Balachandran Manavalan1, Shaherin Basith1, Tae Hwan Shin1, Gwang Lee1.   

Abstract

Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to the regulation of gene expression. Considering the important tasks performed by ORIs, several experimental and computational approaches have been developed in the prediction of such sites. However, existing computational predictors for ORIs have certain curbs, such as building only single-feature encoding models, limited systematic feature engineering efforts and failure to validate model robustness. Hence, we developed a novel species-specific yeast predictor called yORIpred that accurately identify ORIs in the yeast genomes. To develop yORIpred, we first constructed optimal 40 baseline models by exploring eight different sequence-based encodings and five different machine learning classifiers. Subsequently, the predicted probability of 40 models was considered as the novel feature vector and carried out iterative feature learning approach independently using five different classifiers. Our systematic analysis revealed that the feature representation learned by the support vector machine algorithm (yORIpred) could well discriminate the distribution characteristics between ORIs and non-ORIs when compared with the other four algorithms. Comprehensive benchmarking experiments showed that yORIpred achieved superior and stable performance when compared with the existing predictors on the same training datasets. Furthermore, independent evaluation showcased the best and accurate performance of yORIpred thus underscoring the significance of iterative feature representation. To facilitate the users in obtaining their desired results without undergoing any mathematical, statistical or computational hassles, we developed a web server for the yORIpred predictor, which is available at: http://thegleelab.org/yORIpred.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  iterative feature representation; machine learning; replication origin; support vector machine

Year:  2021        PMID: 33232970      PMCID: PMC8294535          DOI: 10.1093/bib/bbaa304

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  61 in total

1.  Using extreme gradient boosting to identify origin of replication in Saccharomyces cerevisiae via hybrid features.

Authors:  Duyen Thi Do; Nguyen Quoc Khanh Le
Journal:  Genomics       Date:  2020-01-24       Impact factor: 5.736

2.  iMRM: a platform for simultaneously identifying multiple kinds of RNA modifications.

Authors:  Kewei Liu; Wei Chen
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

3.  The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1982-04       Impact factor: 11.105

4.  PseAAC-Builder: a cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions.

Authors:  Pufeng Du; Xin Wang; Chao Xu; Yang Gao
Journal:  Anal Biochem       Date:  2012-03-27       Impact factor: 3.365

5.  iProEP: A Computational Predictor for Predicting Promoter.

Authors:  Hong-Yan Lai; Zhao-Yue Zhang; Zhen-Dong Su; Wei Su; Hui Ding; Wei Chen; Hao Lin
Journal:  Mol Ther Nucleic Acids       Date:  2019-06-13

6.  Initiation of DNA replication from non-canonical sites on an origin-depleted chromosome.

Authors:  Naomi L Bogenschutz; Jairo Rodriguez; Toshio Tsukiyama
Journal:  PLoS One       Date:  2014-12-08       Impact factor: 3.240

7.  Ori-Finder 2, an integrated tool to predict replication origins in the archaeal genomes.

Authors:  Hao Luo; Chun-Ting Zhang; Feng Gao
Journal:  Front Microbiol       Date:  2014-09-15       Impact factor: 5.640

8.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

9.  AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2019-07-03       Impact factor: 7.271

10.  SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models.

Authors:  Xiaochuan Wang; Chen Li; Fuyi Li; Varun S Sharma; Jiangning Song; Geoffrey I Webb
Journal:  BMC Bioinformatics       Date:  2019-11-21       Impact factor: 3.169

View more
  4 in total

1.  TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization.

Authors:  Young-Jun Jeon; Md Mehedi Hasan; Hyun Woo Park; Ki Wook Lee; Balachandran Manavalan
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations.

Authors:  Firda Nurul Auliah; Andi Nur Nilamyani; Watshara Shoombuatong; Md Ashad Alam; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Int J Mol Sci       Date:  2021-02-20       Impact factor: 5.923

3.  LGFC-CNN: Prediction of lncRNA-Protein Interactions by Using Multiple Types of Features through Deep Learning.

Authors:  Lan Huang; Shaoqing Jiao; Sen Yang; Shuangquan Zhang; Xiaopeng Zhu; Rui Guo; Yan Wang
Journal:  Genes (Basel)       Date:  2021-10-24       Impact factor: 4.096

4.  IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.

Authors:  Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hiroyuki Kurata
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.