| Literature DB >> 22962471 |
Yasuo Tabei1, Edouard Pauwels, Véronique Stoven, Kazuhiro Takemoto, Yoshihiro Yamanishi.
Abstract
MOTIVATION: Drug effects are mainly caused by the interactions between drug molecules and their target proteins including primary targets and off-targets. Identification of the molecular mechanisms behind overall drug-target interactions is crucial in the drug design process.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22962471 PMCID: PMC3436839 DOI: 10.1093/bioinformatics/bts412
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Part of the extracted substructure–domain association network. Pink circle and blue rectangle represent a chemical substructure and a protein domain, respectively. Node size represents a node degree. Edge width represents the weight of substructure and domain pair
Examples of extracted chemogenomic features by the L1LOG method
| Rank | Weight | Substrucure ID | PubChem substructure definition |
|---|---|---|---|
| 1 | 2.1468 | SUB158 | |
| 1 | 2.1468 | PF00106 | |
| 2 | 2.1118 | SUB414 | |
| 2 | 2.1118 | PF00255 | |
| 3 | 1.9413 | SUB158 | |
| 3 | 1.9413 | PF01126 | |
| 4 | 1.8035 | SUB686 | |
| 4 | 1.8035 | PF01094 | |
| 5 | 1.7707 | SUB687 | |
| 5 | 1.7707 | PF03171 | |
| 6 | 1.7514 | SUB348 | |
| 6 | 1.7514 | PF03414 | |
| 7 | 1.6343 | SUB387 | |
| 7 | 1.6343 | PF00042 | |
| 8 | 1.6299 | SUB409 | |
| 8 | 1.6299 | PF00167 | |
| 9 | 1.5807 | SUB32 | |
| 9 | 1.5807 | PF00348 | |
| 10 | 1.5797 | SUB567 | |
| 10 | 1.5797 | PF00464 | |
| 11 | 1.5105 | SUB309 | |
| 11 | 1.5105 | PF00102 | |
| 12 | 1.5065 | SUB433 | |
| 12 | 1.5065 | PF02518 | |
| 13 | 1.5033 | SUB449 | |
| 13 | 1.5033 | PF00107 | |
| 14 | 1.4956 | SUB695 | |
| 14 | 1.4956 | PF00551 | |
| 15 | 1.4784 | SUB433 | |
| 15 | 1.4784 | PF07884 |
Fig. 2.Comparison of the number of extracted features between different methods
Fig. 3.Effect of negative examples on the number of extracted features
Fig. 4.Distribution of the number of extracted features across different feature extraction methods
AUC scores on pair-wise cross validation experiments
| Ratio | L1-Log | L1-SVM | L2-Log | L2-SVM | KSVM | SCCA |
|---|---|---|---|---|---|---|
| 1 | 0.8285±0.0009 | 0.8301±0.0006 | 0.8366±0.0010 | 0.8461±0.0009 | 0.8339±0.0005 | 0.7975±0.0018 |
| 5 | 0.8379±0.0008 | 0.8551±0.0008 | 0.8464±0.0008 | 0.8659±0.0008 | NA | 0.7975±0.0018 |
| 10 | 0.8437±0.0010 | 0.8654±0.0010 | 0.8512±0.0010 | 0.8728±0.0009 | NA | 0.7975±0.0018 |
| 50 | 0.8419±0.0010 | 0.8677±0.0009 | 0.8514±0.0011 | 0.8736±0.0010 | NA | 0.7975±0.0018 |
| 100 | 0.8418±0.0010 | 0.8677±0.0009 | 0.8516±0.0010 | 0.8740±0.0010 | NA | 0.7975±0.0018 |
| ALL | 0.8411±0.0006 | 0.8658±0.0006 | 0.8483±0.0007 | 0.8659±0.0004 | NA | 0.7975±0.0018 |
The number of negative examples is varied from the same number of positive examples to the number of all possible negative examples. NA means that it was not computationally feasible.
AUC scores on block-wise cross validation experiments
| Ratio | L1-Log | L1-SVM | L2-Log | L2-SVM | KSVM | SCCA |
|---|---|---|---|---|---|---|
| 1 | 0.7071±0.0010 | 0.7061±0.0015 | 0.7222±0.0009 | 0.7316±0.0011 | 0.7325±0.0006 | 0.7496±0.0042 |
| 5 | 0.7318±0.0004 | 0.7286±0.0007 | 0.7368±0.0005 | 0.7505±0.0005 | NA | 0.7496±0.0042 |
| 10 | 0.7254±0.0003 | 0.7339±0.0005 | 0.7370±0.0004 | 0.7479±0.0003 | NA | 0.7496±0.0042 |
| 50 | 0.7243±0.0004 | 0.7366±0.0004 | 0.7378±0.0005 | 0.7479±0.0004 | NA | 0.7496±0.0042 |
| 100 | 0.7244±0.0005 | 0.7352±0.0006 | 0.7361±0.0005 | 0.7496±0.0003 | NA | 0.7496±0.0042 |
| ALL | 0.7244±0.0004 | 0.7377±0.0006 | 0.7371±0.0005 | 0.7481±0.0004 | NA | 0.7496±0.0042 |
The number of negative examples is varied from the same number of positive examples to the number of all possible negative examples. NA means that it was not computationally feasible.