Shiming Wang1, Jie Li2, Yadong Wang3. 1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China. 2. School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China. jieli@hit.edu.cn. 3. School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, 150001, China. ydwang@hit.edu.cn.
Abstract
BACKGROUND: Detecting pathogenic proteins is the origin way to understand the mechanism and resist the invasion of diseases, making pathogenic protein prediction develop into an urgent problem to be solved. Prediction for genome-wide proteins may be not necessarily conducive to rapidly cure diseases as developing new drugs specifically for the predicted pathogenic protein always need major expenditures on time and cost. In order to facilitate disease treatment, computational method to predict pathogenic proteins which are targeted by existing drugs should be exploited. RESULTS: In this study, we proposed a novel computational model to predict drug-targeted pathogenic proteins, named as M2PP. Three types of features were presented on our constructed heterogeneous network (including target proteins, diseases and drugs), which were based on the neighborhood similarity information, drug-inferred information and path information. Then, a random forest regression model was trained to score unconfirmed target-disease pairs. Five-fold cross-validation experiment was implemented to evaluate model's prediction performance, where M2PP achieved advantageous results compared with other state-of-the-art methods. In addition, M2PP accurately predicted high ranked pathogenic proteins for common diseases with public biomedical literature as supporting evidence, indicating its excellent ability. CONCLUSIONS: M2PP is an effective and accurate model to predict drug-targeted pathogenic proteins, which could provide convenience for the future biological researches.
BACKGROUND: Detecting pathogenic proteins is the origin way to understand the mechanism and resist the invasion of diseases, making pathogenic protein prediction develop into an urgent problem to be solved. Prediction for genome-wide proteins may be not necessarily conducive to rapidly cure diseases as developing new drugs specifically for the predicted pathogenic protein always need major expenditures on time and cost. In order to facilitate disease treatment, computational method to predict pathogenic proteins which are targeted by existing drugs should be exploited. RESULTS: In this study, we proposed a novel computational model to predict drug-targeted pathogenic proteins, named as M2PP. Three types of features were presented on our constructed heterogeneous network (including target proteins, diseases and drugs), which were based on the neighborhood similarity information, drug-inferred information and path information. Then, a random forest regression model was trained to score unconfirmed target-disease pairs. Five-fold cross-validation experiment was implemented to evaluate model's prediction performance, where M2PP achieved advantageous results compared with other state-of-the-art methods. In addition, M2PP accurately predicted high ranked pathogenic proteins for common diseases with public biomedical literature as supporting evidence, indicating its excellent ability. CONCLUSIONS: M2PP is an effective and accurate model to predict drug-targeted pathogenic proteins, which could provide convenience for the future biological researches.
Authors: Michael Kloth; Vanessa Ruesseler; Christoph Engel; Katharina Koenig; Martin Peifer; Erika Mariotti; Helen Kuenstlinger; Alexandra Florin; Ursula Rommerscheidt-Fuss; Ulrike Koitzsch; Claudia Wodtke; Frank Ueckeroth; Stefanie Holzapfel; Stefan Aretz; Peter Propping; Markus Loeffler; Sabine Merkelbach-Bruse; Margarete Odenthal; Nicolaus Friedrichs; Lukas Carl Heukamp; Thomas Zander; Reinhard Buettner Journal: Gut Date: 2015-04-28 Impact factor: 23.059
Authors: Fiona H Blackhall; Melania Pintilie; Michael Michael; Natasha Leighl; Ronald Feld; Ming-Sound Tsao; Frances A Shepherd Journal: Clin Cancer Res Date: 2003-06 Impact factor: 12.531
Authors: Soyeon Jeong; Bu Gyeom Kim; Dae Yeong Kim; Bo Ram Kim; Jung Lim Kim; Seong Hye Park; Yoo Jin Na; Min Jee Jo; Hye Kyeong Yun; Yoon A Jeong; Hong Jun Kim; Sun Il Lee; Han Do Kim; Dae Hyun Kim; Sang Cheul Oh; Dae-Hee Lee Journal: Cancers (Basel) Date: 2019-06-05 Impact factor: 6.639
Authors: Heather Spencer Feigelson; Lauren R Teras; W Ryan Diver; Weining Tang; Alpa V Patel; Victoria L Stevens; Eugenia E Calle; Michael J Thun; Mark Bouzyk Journal: Breast Cancer Res Date: 2008-07-08 Impact factor: 6.466