Literature DB >> 28688491

Sequence based predictor for discrimination of enhancer and their types by applying general form of Chou's trinucleotide composition.

Muhammad Tahir1, Maqsood Hayat2, Muhammad Kabir3.   

Abstract

BACKGROUND AND OBJECTIVES: Enhancers are pivotal DNA elements, which are widely used in eukaryotes for activation of transcription genes. On the basis of enhancer strength, they are further classified into two groups; strong enhancers and weak enhancers. Due to high availability of huge amount of DNA sequences, it is needed to develop fast, reliable and robust intelligent computational method, which not only identify enhancers but also determines their strength. Considerable progress has been achieved in this regard; however, timely and precisely identification of enhancers is still a challenging task.
METHODS: Two-level intelligent computational model for identification of enhancers and their subgroups is proposed. Two different feature extraction techniques including di-nucleotide composition and tri-nucleotide composition were adopted for extraction of numerical descriptors. Four classification methods including probabilistic neural network, support vector machine, k-nearest neighbor and random forest were utilized for classification.
RESULTS: The proposed method yielded 77.25% of accuracy for dataset S1 contains enhancers and non-enhancers, whereas 64.70% of accuracy for dataset S2 comprises of strong enhancer and weak enhancer sequences using jackknife cross-validation test.
CONCLUSION: The predictive results validated that the proposed method is better than that of existing approaches so far reported in the literature. It is thus highly observed that the developed method will be useful and expedient for basic research and academia.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Dinucleotide composition; KNN; PNN; SVM; Trinucleotide composition

Mesh:

Substances:

Year:  2017        PMID: 28688491     DOI: 10.1016/j.cmpb.2017.05.008

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  4 in total

1.  i4mC-Deep: An Intelligent Predictor of N4-Methylcytosine Sites Using a Deep Learning Approach with Chemical Properties.

Authors:  Waleed Alam; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-07-23       Impact factor: 4.096

2.  iRSpot-Pse6NC: Identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC.

Authors:  Hui Yang; Wang-Ren Qiu; Guoqing Liu; Feng-Biao Guo; Wei Chen; Kuo-Chen Chou; Hao Lin
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

3.  iPseU-CNN: Identifying RNA Pseudouridine Sites Using Convolutional Neural Networks.

Authors:  Muhammad Tahir; Hilal Tayara; Kil To Chong
Journal:  Mol Ther Nucleic Acids       Date:  2019-04-11

4.  Enhancer-LSTMAtt: A Bi-LSTM and Attention-Based Deep Learning Method for Enhancer Recognition.

Authors:  Guohua Huang; Wei Luo; Guiyang Zhang; Peijie Zheng; Yuhua Yao; Jianyi Lyu; Yuewu Liu; Dong-Qing Wei
Journal:  Biomolecules       Date:  2022-07-17
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.