Literature DB >> 27153623

iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework.

Bin Liu1, Ren Long2, Kuo-Chen Chou3.   

Abstract

MOTIVATION: Regulatory DNA elements are associated with DNase I hypersensitive sites (DHSs). Accordingly, identification of DHSs will provide useful insights for in-depth investigation into the function of noncoding genomic regions.
RESULTS: In this study, using the strategy of ensemble learning framework, we proposed a new predictor called iDHS-EL for identifying the location of DHS in human genome. It was formed by fusing three individual Random Forest (RF) classifiers into an ensemble predictor. The three RF operators were respectively based on the three special modes of the general pseudo nucleotide composition (PseKNC): (i) kmer, (ii) reverse complement kmer and (iii) pseudo dinucleotide composition. It has been demonstrated that the new predictor remarkably outperforms the relevant state-of-the-art methods in both accuracy and stability.
AVAILABILITY AND IMPLEMENTATION: For the convenience of most experimental scientists, a web server for iDHS-EL is established at http://bioinformatics.hitsz.edu.cn/iDHS-EL, which is the first web-server predictor ever established for identifying DHSs, and by which users can easily get their desired results without the need to go through the mathematical details. We anticipate that IDHS-EL: will become a very useful high throughput tool for genome analysis. CONTACT: bliu@gordonlifescience.org or bliu@insun.hit.edu.cn SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27153623     DOI: 10.1093/bioinformatics/btw186

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  55 in total

1.  An information-based network approach for protein classification.

Authors:  Xiaogeng Wan; Xin Zhao; Stephen S T Yau
Journal:  PLoS One       Date:  2017-03-28       Impact factor: 3.240

2.  Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  Mol Biol Rep       Date:  2018-09-20       Impact factor: 2.316

3.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

4.  Evolutionary mechanism and biological functions of 8-mers containing CG dinucleotide in yeast.

Authors:  Yan Zheng; Hong Li; Yue Wang; Hu Meng; Qiang Zhang; Xiaoqing Zhao
Journal:  Chromosome Res       Date:  2017-02-09       Impact factor: 5.239

5.  pDHS-ELM: computational predictor for plant DNase I hypersensitive sites based on extreme learning machines.

Authors:  Shanxin Zhang; Minjun Chang; Zhiping Zhou; Xiaofeng Dai; Zhenghong Xu
Journal:  Mol Genet Genomics       Date:  2018-03-29       Impact factor: 3.291

6.  Sparse Bayesian classification and feature selection for biological expression data with high correlations.

Authors:  Xian Yang; Wei Pan; Yike Guo
Journal:  PLoS One       Date:  2017-12-27       Impact factor: 3.240

7.  RicENN: Prediction of Rice Enhancers with Neural Network Based on DNA Sequences.

Authors:  Yujia Gao; Yiqiong Chen; Haisong Feng; Youhua Zhang; Zhenyu Yue
Journal:  Interdiscip Sci       Date:  2022-02-21       Impact factor: 2.233

8.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.

Authors:  Weizhong Lin; Dong Xu
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

9.  BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors:  Hong-Liang Li; Yi-He Pang; Bin Liu
Journal:  Nucleic Acids Res       Date:  2021-12-16       Impact factor: 16.971

10.  Evaluating machine learning methodologies for identification of cancer driver genes.

Authors:  Sharaf J Malebary; Yaser Daanial Khan
Journal:  Sci Rep       Date:  2021-06-10       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.