Literature DB >> 28552554

pDHS-SVM: A prediction method for plant DNase I hypersensitive sites based on support vector machine.

Shanxin Zhang1, Zhiping Zhou2, Xinmeng Chen3, Yong Hu2, Lindong Yang2.   

Abstract

DNase I hypersensitive sites (DHSs) are accessible chromatin regions hypersensitive to cleavages by DNase I endonucleases. DHSs are indicative of cis-regulatory DNA elements (CREs), all of which play important roles in global gene expression regulation. It is helpful for discovering CREs by recognition of DHSs in genome. To accelerate the investigation, it is an important complement to develop cost-effective computational methods to identify DHSs. However, there is a lack of tools used for identifying DHSs in plant genome. Here we presented pDHS-SVM, a computational predictor to identify plant DHSs. To integrate the global sequence-order information and local DNA properties, reverse complement kmer and dinucleotide-based auto covariance of DNA sequences were applied to construct the feature space. In this work, fifteen physical-chemical properties of dinucleotides were used and Support Vector Machine (SVM) was employed. To further improve the performance of the predictor and extract an optimized subset of nucleotide physical-chemical properties positive for the DHSs, a heuristic nucleotide physical-chemical property selection algorithm was introduced. With the optimized subset of properties, experimental results of Arabidopsis thaliana and rice (Oryza sativa) showed that pDHS-SVM could achieve accuracies up to 87.00%, and 85.79%, respectively. The results indicated the effectiveness of proposed method for predicting DHSs. Furthermore, pDHS-SVM could provide a helpful complement for predicting CREs in plant genome. Our implementation of the novel proposed method pDHS-SVM is freely available as source code, at https://github.com/shanxinzhang/pDHS-SVM.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  DHS; Dinucleotide-based auto covariance; Prediction; SVM

Mesh:

Substances:

Year:  2017        PMID: 28552554     DOI: 10.1016/j.jtbi.2017.05.030

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  pDHS-ELM: computational predictor for plant DNase I hypersensitive sites based on extreme learning machines.

Authors:  Shanxin Zhang; Minjun Chang; Zhiping Zhou; Xiaofeng Dai; Zhenghong Xu
Journal:  Mol Genet Genomics       Date:  2018-03-29       Impact factor: 3.291

2.  iDHS-FFLG: Identifying DNase I Hypersensitive Sites by Feature Fusion and Local-Global Feature Extraction Network.

Authors:  Lei-Shan Wang; Zhan-Li Sun
Journal:  Interdiscip Sci       Date:  2022-09-27       Impact factor: 3.492

3.  Applications of Support Vector Machine in Genomic Prediction in Pig and Maize Populations.

Authors:  Wei Zhao; Xueshuang Lai; Dengying Liu; Zhenyang Zhang; Peipei Ma; Qishan Wang; Zhe Zhang; Yuchun Pan
Journal:  Front Genet       Date:  2020-12-03       Impact factor: 4.599

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.