Zi-Han Guo 1 , Lei Chen 1,2 , Xian Zhao 1 . Show Affiliations »
Abstract
AIM AND OBJECTIVE: A metabolic pathway is an important type of biological pathway, which is composed of a series of chemical reactions. It provides essential molecules and energies for living organisms. To date, several metabolic pathways have been uncovered. However, their completeness is still on the way. A number of prediction methods have been built to assign chemicals into certain metabolic pathway, which can further be used to predict novel latent chemicals for a given metabolic pathway. However, they did not make use of chemical properties in a system level to construct prediction models. METHOD: In this study, we applied a network integration method, which can extract topological features from different chemical networks, representing chemical associations from their different properties, and fused several high-dimension vector representations into a low-dimension vector representation for each chemical. The compact vector representations were fed into the Support Vector Machine (SVM) to construct the prediction model. To tackle the problem that one chemical can participate in more than one pathway type, we construct an SVM-based binary prediction model for each pathway type to determine whether a given chemical can participate in the pathway type. Furthermore, the Synthetic Minority Over-sampling Technique (SMOTE) was adopted to weaken the influence of imbalanced dataset. RESULTS AND CONCLUSION: Each binary model gave a quite good performance and was superior to the classic prediction model, indicating that the proposed models can be useful tools for integrating heterogeneous information to assign chemicals into correct metabolic pathways. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.
AIM AND OBJECTIVE: A metabolic pathway is an important type of biological pathway, which is composed of a series of chemical reactions. It provides essential molecules and energies for living organisms. To date, several metabolic pathways have been uncovered. However, their completeness is still on the way. A number of prediction methods have been built to assign chemicals into certain metabolic pathway, which can further be used to predict novel latent chemicals for a given metabolic pathway. However, they did not make use of chemical properties in a system level to construct prediction models. METHOD: In this study, we applied a network integration method, which can extract topological features from different chemical networks, representing chemical associations from their different properties, and fused several high-dimension vector representations into a low-dimension vector representation for each chemical. The compact vector representations were fed into the Support Vector Machine (SVM) to construct the prediction model. To tackle the problem that one chemical can participate in more than one pathway type, we construct an SVM-based binary prediction model for each pathway type to determine whether a given chemical can participate in the pathway type. Furthermore, the Synthetic Minority Over-sampling Technique (SMOTE) was adopted to weaken the influence of imbalanced dataset. RESULTS AND CONCLUSION: Each binary model gave a quite good performance and was superior to the classic prediction model, indicating that the proposed models can be useful tools for integrating heterogeneous information to assign chemicals into correct metabolic pathways. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.
Keywords:
Mashup; Metabolic pathway; Synthetic Minority Over-sampling Technique; feature extraction; feature fusion; support vector machine.
Mesh: See more »
Substances: See more »
Year: 2018
PMID: 30520371 DOI: 10.2174/1386207322666181206112641
Source DB: PubMed Journal: Comb Chem High Throughput Screen ISSN: 1386-2073 Impact factor: 1.339