| Literature DB >> 32505881 |
Zhe Ju1, Shi-Yun Wang2.
Abstract
Lysine 2-hydroxyisobutyrylation (Khib) is a new type of histone mark, which has been found to affect the association between histone and DNA. To better understand the molecular mechanism of Khib, it is important to identify 2-hydroxyisobutyrylated substrates and their corresponding Khib sites accurately. In this study, a novel bioinformatics tool named KhibPred is proposed to predict Khib sites in human HeLa cells. Three kinds of effective features, the composition of k-spaced amino acid pairs, binary encoding and amino acid factors, are incorporated to encode Khib sites. Moreover, an ensemble support vector machine is employed to overcome the imbalanced problem in the prediction. As illustrated by 10-fold cross-validation, the performance of KhibPred achieves a satisfactory performance with an area under receiver operating characteristic curve of 0.7937. Therefore, KhibPred can be a useful tool for predicting protein Khib sites. Feature analysis shows that the polarity factor features play significant roles in the prediction of Khib sites. The conclusions derived from this study might provide useful insights for in-depth investigation into the molecular mechanisms of Khib.Entities:
Keywords: 2-Hydroxyisobutyrylation; Ensemble support vector machine; Feature extraction; Post-translational modification
Year: 2020 PMID: 32505881 DOI: 10.1016/j.compbiolchem.2020.107280
Source DB: PubMed Journal: Comput Biol Chem ISSN: 1476-9271 Impact factor: 2.877