Bing Niu, Guohua Huang, Linfeng Zheng, Xueyuan Wang, Fuxue Chen, Yuhui Zhang, Tao Huang.
Abstract
It is important to predict substrate-enzyme interactions, and the products they yield, in metabolic pathways correctly and efficiently. In this work, a novel approach was introduced that encodes substrate/product molecules with molecular descriptors and enzyme molecules with physicochemical properties. Based on this encoding, the k-nearest neighbor (KNN) algorithm was adopted to build the substrate-enzyme-product interaction network. Feature selection retained 160 of the 290 features as the optimal set representing the main factors of substrate-enzyme-product interaction. These features fall into ten categories: elemental analysis, geometry, chemistry, amino acid composition, predicted secondary structure, hydrophobicity, polarizability, solvent accessibility, normalized van der Waals volume, and polarity. The resulting prediction model achieved an MCC of 0.423 and an overall prediction accuracy of 89.1% in a 10-fold cross-validation test.
Year: 2013 PMID: 24455714 PMCID: PMC3881445 DOI: 10.1155/2013/674215
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1. The curve of the 290 prediction models using IFS.
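The curve in Figure 1 can be pictured with a small sketch. It assumes that IFS stands for incremental feature selection: features are added one at a time in ranked order, a KNN model is cross-validated on each growing subset, and the subset with the best score is kept. The helper names, the toy data, the choice of k = 1, and the use of accuracy as the selection score are all illustrative assumptions, not details taken from the paper.

```python
import math
import random


def knn_predict(train_X, train_y, x, k=1):
    """Classify x by majority vote among its k nearest training rows."""
    nearest = sorted(
        (math.dist(row, x), label) for row, label in zip(train_X, train_y)
    )[:k]
    votes = [label for _, label in nearest]
    return max(set(votes), key=votes.count)


def cv_accuracy(X, y, n_folds=10, k=1):
    """Accuracy of KNN under n-fold cross-validation (interleaved folds)."""
    correct = 0
    for fold in range(n_folds):
        test_idx = set(range(fold, len(X), n_folds))
        train_X = [X[i] for i in range(len(X)) if i not in test_idx]
        train_y = [y[i] for i in range(len(X)) if i not in test_idx]
        for i in test_idx:
            correct += knn_predict(train_X, train_y, X[i], k) == y[i]
    return correct / len(X)


def incremental_feature_selection(X, y, ranked, n_folds=10, k=1):
    """Score the top-1, top-2, ... feature subsets; return (best_size, curve)."""
    curve = []
    for m in range(1, len(ranked) + 1):
        cols = ranked[:m]
        X_m = [[row[c] for c in cols] for row in X]
        curve.append(cv_accuracy(X_m, y, n_folds, k))
    # First subset size that reaches the maximum score.
    best_size = max(range(len(curve)), key=curve.__getitem__) + 1
    return best_size, curve


# Toy data (hypothetical): feature 0 separates the classes, 1 and 2 are noise.
random.seed(0)
y = [0] * 20 + [1] * 20
X = [
    [label * 5 + random.random(), random.uniform(0, 50), random.uniform(0, 50)]
    for label in y
]
best_size, curve = incremental_feature_selection(X, y, ranked=[0, 1, 2])
```

With 290 ranked features, `curve` would hold 290 scores, one per candidate model, which matches the shape of the plot the caption describes.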
Prediction accuracies of the different datasets with KNN (10-fold cross-validation test).
| Dataset | SN (%) | SP (%) | ACC (%) | MCC |
|---|---|---|---|---|
| Original dataset | 53.71 | 92.4 | 88.9 | 0.412 |
| Optimal dataset | 55.2 | 92.4 | 89.1 | 0.423 |
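The four columns above are standard confusion-matrix statistics: sensitivity (SN), specificity (SP), accuracy (ACC), and the Matthews correlation coefficient (MCC). A minimal sketch of how they are computed follows; the function name and the counts are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Return (SN, SP, ACC, MCC) from confusion-matrix counts."""
    sn = tp / (tp + fn)                      # sensitivity: recall on positives
    sp = tn / (tn + fp)                      # specificity: recall on negatives
    acc = (tp + tn) / (tp + fn + tn + fp)    # overall accuracy
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sn, sp, acc, mcc


# Hypothetical counts for an imbalanced dataset (not from the paper).
sn, sp, acc, mcc = binary_metrics(tp=50, fn=50, tn=900, fp=100)
```

When negatives far outnumber positives, ACC can stay high while SN is low, as in the table above; MCC takes all four counts into account and is therefore a useful companion figure.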
Figure 2. Feature distribution.
Top 80 features, ranked by their correlation with the target.
| No. | Name | Categories | No. | Name | Categories |
|---|---|---|---|---|---|
| 1 | Polarity | Polarity | 41 | Amino Acids Composition Cys | Amino acids composition |
| 2 | Substrate_ | Chemical | 42 | Polarizability | Polarizability |
| 3 | Solvent accessibility | Solvent accessibility | 43 | Polarizability | Polarizability |
| 4 | Solvent accessibility | Solvent accessibility | 44 | Amino Acids Composition Ile | Amino acids composition |
| 5 | Secondary structure | Secondary structure | 45 | Hydrophobicity | Hydrophobicity |
| 6 | Normalized Van Der Waals volume | Normalized Van Der Waals volume | 46 | Secondary structure | Secondary structure |
| 7 | Normalized Van Der Waals volume | Normalized Van Der Waals volume | 47 | Substrate_Stereo | Geometry |
| 8 | Secondary structure | Secondary structure | 48 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
| 9 | Secondary structure | Secondary structure | 49 | Substrate_Smallest | Geometry |
| 10 | Substrate_ | Chemical | 50 | Substrate_Smallest | Geometry |
| 11 | Substrate_ | Elemental analysis | 51 | Substrate_Rotatable | Geometry |
| 12 | Amino Acids Composition Asn | Amino acids composition | 52 | Substrate_H | Elemental analysis |
| 13 | Polarity | Polarity | 53 | Amino Acids Composition Thr | Amino acids composition |
| 14 | Hydrophobicity | Hydrophobicity | 54 | Polarizability | Polarizability |
| 15 | Substrate_MinZ | Geometry | 55 | Amino Acids Composition Leu | Amino acids composition |
| 16 | Solvent accessibility | Solvent accessibility | 56 | Amino Acids Composition His | Amino acids composition |
| 17 | Polarity | Polarity | 57 | Substrate_CarboAliphatic | Geometry |
| 18 | Hydrophobicity | Hydrophobicity | 58 | Product_HComposition | Elemental analysis |
| 19 | Substrate_VanDerWaals | Chemical | 59 | Polarizability | Polarizability |
| 20 | Amino Acids Composition Asp | Amino acids composition | 60 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
| 21 | Hydrophobicity | Chemical | 61 | Amino Acids Composition Gln | Amino acids composition |
| 22 | Substrate_ | Elemental analysis | 62 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
| 23 | Solvent accessibility | Solvent accessibility | 63 | Polarizability | Polarizability |
| 24 | Secondary structure | Secondary structure | 64 | Amino Acids Composition Lys | Amino acids composition |
| 25 | Amino Acids Composition Ser | Amino acids composition | 65 | Polarizability | Polarizability |
| 26 | Substrate_Water | Chemical | 66 | Amino Acids Composition Tyr | Amino acids composition |
| 27 | Secondary structure | Secondary structure | 67 | Amino Acids Composition Arg | Amino acids composition |
| 28 | Hydrophobicity | Hydrophobicity | 68 | Secondary structure | Secondary structure |
| 29 | Substrate_FusedRingCount | Geometry | 69 | Polarizability | Polarizability |
| 30 | Substrate_Carbo | Geometry | 70 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
| 31 | Amino Acids Composition Glu | Amino acids composition | 71 | Polarity | Polarity |
| 32 | Hydrophobicity | Hydrophobicity | 72 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
| 33 | Polarizability | Polarizability | 73 | Product_NComposition | Elemental analysis |
| 34 | Polarity | Polarity | 74 | Solvent accessibility | Solvent accessibility |
| 35 | Normalized Van Der Waals volume | Normalized Van Der Waals volume | 75 | Product_Hetero | Geometry |
| 36 | Substrate_Fused | Geometry | 76 | Substrate_CarboAromatic | Geometry |
| 37 | Polarizability | Polarizability | 77 | Substrate_PComposition | Elemental analysis |
| 38 | Secondary structure | Secondary structure | 78 | Hydrophobicity | Hydrophobicity |
| 39 | Substrate_RingCount | Geometry | 79 | Product_CComposition | Elemental analysis |
| 40 | Amino Acids Composition Pro | Amino acids composition | 80 | Normalized Van Der Waals volume | Normalized Van Der Waals volume |
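Many of the ranked enzyme features are amino acid composition terms (for example "Amino Acids Composition Cys"). Assuming the conventional definition, each such feature is the fraction of residues in the enzyme sequence that are a given amino acid. The sketch below illustrates that definition only; the function name and the example peptide are hypothetical, and this record does not state how the paper computed the feature.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def amino_acid_composition(sequence):
    """Return {residue: fraction} over the 20 standard amino acids."""
    seq = sequence.upper()
    # Non-standard symbols (e.g. X, gaps) are ignored in this sketch.
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


# Hypothetical peptide, not from the paper's dataset.
composition = amino_acid_composition("MKCCLA")
```

Here `composition["C"]` corresponds to the kind of value that a feature such as "Amino Acids Composition Cys" would hold for one enzyme.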