| Literature DB >> 27889654 |
Masaki Banno1, Yusuke Komiyama2, Wei Cao1, Yuya Oku1, Kokoro Ueki1, Kazuya Sumikoshi1, Shugo Nakamura1, Tohru Terada1, Kentaro Shimizu3.
Abstract
Several methods have been proposed for protein-sugar binding site prediction using machine learning algorithms. However, they are not effective to learn various properties of binding site residues caused by various interactions between proteins and sugars. In this study, we classified sugars into acidic and nonacidic sugars and showed that their binding sites have different amino acid occurrence frequencies. By using this result, we developed sugar-binding residue predictors dedicated to the two classes of sugars: an acid sugar binding predictor and a nonacidic sugar binding predictor. We also developed a combination predictor which combines the results of the two predictors. We showed that when a sugar is known to be an acidic sugar, the acidic sugar binding predictor achieves the best performance, and showed that when a sugar is known to be a nonacidic sugar or is not known to be either of the two classes, the combination predictor achieves the best performance. Our method uses only amino acid sequences for prediction. Support vector machine was used as a machine learning algorithm and the position-specific scoring matrix created by the position-specific iterative basic local alignment search tool was used as the feature vector. We evaluated the performance of the predictors using five-fold cross-validation. We have launched our system, as an open source freeware tool on the GitHub repository (https://doi.org/10.5281/zenodo.61513).Entities:
Keywords: Carbohydrate; Machine learning; Sugar-binding proteins; Sugar-binding residue prediction; Support vector machine
Mesh:
Substances:
Year: 2016 PMID: 27889654 DOI: 10.1016/j.compbiolchem.2016.10.009
Source DB: PubMed Journal: Comput Biol Chem ISSN: 1476-9271 Impact factor: 2.877