Literature DB >> 28239713

EnhancerPred2.0: predicting enhancers and their strength based on position-specific trinucleotide propensity and electron-ion interaction potential feature selection.

Wenying He1, Cangzhi Jia1.   

Abstract

Enhancers are cis-acting elements that play major roles in upregulating eukaryotic gene expression by providing binding sites for transcription factors and their complexes. Because enhancers are highly cell/tissue specific, lack common motifs, and are far from the target gene, the systematic and precise identification of enhancer regions in DNA sequences is a big challenge. In this study, we developed an enhancer prediction method called EnhancerPred2.0 by combining position-specific trinucleotide propensity (PSTNP) information with the electron-ion interaction potential (EIIP) values for trinucleotides, to predict enhancers and their subgroups. To obtain the optimal combination of features, F-score values were used in a two-step wrapper-based feature selection method, which was applied in a high dimensional feature vector from PSTNP and EIIP. Finally, 126 optimized features from PSTNP combined with 32 optimized features from EIIP yielded the best performance for identifying enhancers and non-enhancers, with an overall accuracy (Acc) of 88.27% and a Matthews correlation coefficient (MCC) of 0.77. Additionally, 198 features from PSTNP combined with 37 features from EIIP yielded the best performance for identifying strong and weak enhancers, with an overall Acc of 98.05% and a MCC of 0.96. Rigorous jackknife tests indicated that EnhancerPred2.0 was significantly better than the existing enhancer prediction methods in both overall accuracy and stability.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28239713     DOI: 10.1039/c7mb00054e

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  9 in total

1.  Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.

Authors:  Haodong Xu; Peilin Jia; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

2.  mRNALocater: Enhance the prediction accuracy of eukaryotic mRNA subcellular localization by using model fusion strategy.

Authors:  Qiang Tang; Fulei Nie; Juanjuan Kang; Wei Chen
Journal:  Mol Ther       Date:  2021-04-03       Impact factor: 12.910

3.  Ensemble of Deep Recurrent Neural Networks for Identifying Enhancers via Dinucleotide Physicochemical Properties.

Authors:  Kok Keng Tan; Nguyen Quoc Khanh Le; Hui-Yuan Yeh; Matthew Chin Heng Chua
Journal:  Cells       Date:  2019-07-23       Impact factor: 6.600

4.  A machine learning technique for identifying DNA enhancer regions utilizing CIS-regulatory element patterns.

Authors:  Ahmad Hassan Butt; Tamim Alkhalifah; Fahad Alturise; Yaser Daanial Khan
Journal:  Sci Rep       Date:  2022-09-07       Impact factor: 4.996

5.  Enhancer-LSTMAtt: A Bi-LSTM and Attention-Based Deep Learning Method for Enhancer Recognition.

Authors:  Guohua Huang; Wei Luo; Guiyang Zhang; Peijie Zheng; Yuhua Yao; Jianyi Lyu; Yuewu Liu; Dong-Qing Wei
Journal:  Biomolecules       Date:  2022-07-17

6.  70ProPred: a predictor for discovering sigma70 promoters based on combining multiple features.

Authors:  Wenying He; Cangzhi Jia; Yucong Duan; Quan Zou
Journal:  BMC Syst Biol       Date:  2018-04-24

7.  Sc-ncDNAPred: A Sequence-Based Predictor for Identifying Non-coding DNA in Saccharomyces cerevisiae.

Authors:  Wenying He; Ying Ju; Xiangxiang Zeng; Xiangrong Liu; Quan Zou
Journal:  Front Microbiol       Date:  2018-09-12       Impact factor: 5.640

8.  Identification of DNA N6-methyladenine sites by integration of sequence features.

Authors:  Hao-Tian Wang; Fu-Hui Xiao; Gong-Hua Li; Qing-Peng Kong
Journal:  Epigenetics Chromatin       Date:  2020-02-24       Impact factor: 4.954

9.  4mCpred-EL: An Ensemble Learning Framework for Identification of DNA N4-methylcytosine Sites in the Mouse Genome.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Da Yeon Lee; Leyi Wei; Gwang Lee
Journal:  Cells       Date:  2019-10-28       Impact factor: 6.600

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.