| Literature DB >> 29551932 |
Choon Sen Seah1, Shahreen Kasim1, Mohd Farhan Md Fudzee1, Jeffrey Mark Law Tze Ping1, Mohd Saberi Mohamad2, Rd Rohmat Saedudin3, Mohd Arfian Ismail4.
Abstract
Microarray technology has become one of the elementary tools for researchers to study the genome of organisms. As the complexity and heterogeneity of cancer is being increasingly appreciated through genomic analysis, cancerous classification is an emerging important trend. Significant directed random walk is proposed as one of the cancerous classification approach which have higher sensitivity of risk gene prediction and higher accuracy of cancer classification. In this paper, the methodology and material used for the experiment are presented. Tuning parameter selection method and weight as parameter are applied in proposed approach. Gene expression dataset is used as the input datasets while pathway dataset is used to build a directed graph, as reference datasets, to complete the bias process in random walk approach. In addition, we demonstrate that our approach can improve sensitive predictions with higher accuracy and biological meaningful classification result. Comparison result takes place between significant directed random walk and directed random walk to show the improvement in term of sensitivity of prediction and accuracy of cancer classification.Entities:
Keywords: Directed random walk algorithm; Group specific tuning parameter; Pathway
Year: 2017 PMID: 29551932 PMCID: PMC5851940 DOI: 10.1016/j.sjbs.2017.11.024
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 1319-562X Impact factor: 4.219
Fig. 1Biological pathway of Leukocyte Transendothelial Migration (KEGG PATHWAY, 2017).
Fig. 2Simple illustration of single pathway data.
Fig. 3Simple illustration of the relationship between genes.
Fig. 4Simple illustration of relationship of weight among genes.
Fig. 5Highlighted genes to represent a single pathway.
Weight of highlighted gene in Fig. 5.
| Nodes | EPAC | Rap1 | ITGAL | Pyk2 | Vav | RhoA |
|---|---|---|---|---|---|---|
| Weight | 2.338914 | 8.47301 | 6.1441 | 3.102989 | 11.38365 | 5.149393 |
Fig. 6Flowchart of sDRW.
Fig. 7Pseudo code of tuning parameter selection method in sDRW.
Fig. 8Simple illustration of pathway dataset.
Adjacency matrix of relationship of gene.
| A | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| 1 | 0 | 0 | 1 | 0 | 0 |
| 2 | 0 | 0 | 0 | 1 | 0 |
| 3 | 0 | 1 | 0 | 0 | 1 |
| 4 | 0 | 0 | 0 | 0 | 0 |
| 5 | 1 | 0 | 0 | 0 | 0 |
Result of vector from first node to sixth node.
| Vector, | Significant Directed Random Walk |
|---|---|
| 0 | |
| 3.243577 | |
| 5.682564 | |
| 5.047153 | |
| 6.364853 | |
| 7.505854 |
Fig. 9Complex illustration of pathway dataset.
Name of risk pathway that predicted by sDRW.
| Datasets | Restart | Significant Directed Random Walk, sDRW |
|---|---|---|
| Lung | 0.1 | Endocytosis, |
| 0.2 | Pancreatic selection, | |
| 0.3 | Focal adhesion | |
| 0.4 | ECM-receptor interaction | |
| 0.5 | Leukocyte transendothelial migration, | |
| 0.6 | Focal adhesion | |
| 0.7 | Focal adhesion | |
| 0.8 | Pancreatic secretion, | |
| 0.9 | ECM-receptor interaction | |
| Stomach | 0.1 | TGF-beta signaling pathway |
| 0.2 | Hedgehog signaling pathway, | |
| 0.3 | Wnt signaling pathway, | |
| 0.4 | Hedgehog signaling pathway, | |
| 0.5 | Notch signaling pathway, | |
| 0.6 | Regulation of actin cytoskeleton | |
| 0.7 | Hedgehog signaling pathway | |
| 0.8 | Alanine, aspartate and glutamate metabolism, | |
| 0.9 | TGF-beta signaling pathway | |
| Liver | 0.1 | Sphingolipid metabolism |
| 0.2 | Focal adhesion, | |
| 0.3 | Tight junction | |
| 0.4 | Sphingolipid metabolism, | |
| 0.5 | Bacterial invasion of epithelial cells | |
| 0.6 | Glycerolipid metabolism, | |
| 0.7 | Focal adhesion, | |
| 0.8 | Glycerolipid metabolism | |
| 0.9 | Sphingolipid metabolism | |
| Tyroid | 0.1 | Tight junction |
| 0.2 | Cell adhesion molecules (CAMs) | |
| 0.3 | Tight junctionCell adhesion molecules | |
| 0.4 | Fatty acid metabolism | |
| 0.5 | Regulation of actin cytoskeleton, | |
| 0.6 | Wnt signaling pathwayCell adhesion molecules | |
| 0.7 | Fatty acid metabolism | |
| 0.8 | MAPK signaling pathway & Fatty acid metabolism | |
| 0.9 | Focal adhesion | |
| Kidney | 0.1 | Endocytosis, |
| 0.2 | Regulation of actin cytoskeleton | |
| 0.3 | Calcium signaling pathway, | |
| 0.4 | Endocytosis | |
| 0.5 | Phosphatidylinositol signaling system, | |
| 0.6 | Protein processing in endoplasmic reticulum, | |
| 0.7 | Endocytosis, | |
| 0.8 | PPAR signaling pathway | |
| 0.9 | Calcium signaling pathway | |
| Breast | 0.1 | Neuroactive ligand-receptor interaction |
| 0.2 | Glycerophospholipid metabolism | |
| 0.3 | Neuroactive ligand-receptor interaction | |
| 0.4 | Adipocytokine signaling pathway, | |
| 0.5 | Cytokine-cytokine receptor interaction, | |
| 0.6 | Jak-STAT signaling pathway | |
| 0.7 | Neuroactive ligand-receptor interaction | |
| 0.8 | Chemokine signaling pathway | |
| 0.9 | Adipocytokine signaling pathway, | |
Name of risk pathway that predicted by sDRW and DRW.
| Datasets | Restart | Significant Directed Random Walk, sDRW | Directed Random Walk, DRW |
|---|---|---|---|
| Lung | 0.1 | Endocytosis, | Tight junction |
| 0.2 | Pancreatic selection, | ECM-receptor interaction | |
| 0.3 | Focal adhesion | ECM-receptor interaction | |
| 0.4 | ECM-receptor interaction | ECM-receptor interaction, | |
| 0.5 | Leukocyte transendothelial migration, | ECM-receptor interaction, | |
| 0.6 | Focal adhesion | Leukocyte transendothelial migration | |
| 0.7 | Focal adhesion | Focal adhesion | |
| 0.8 | Pancreatic secretion, | Focal adhesion | |
| 0.9 | ECM-receptor interaction | Pancreatic secretion | |
| Stomach | 0.1 | TGF-beta signaling pathway | TGF-beta signaling pathway |
| 0.2 | Hedgehog signaling pathway, | Hedgehog signaling pathway | |
| 0.3 | Wnt signaling pathway, | Wnt signaling pathway | |
| 0.4 | Hedgehog signaling pathway, | Hedgehog signaling pathway, | |
| 0.5 | Notch signaling pathway, | Notch signaling pathway | |
| 0.6 | Regulation of actin cytoskeleton | Regulation of actin cytoskeleton | |
| 0.7 | Hedgehog signaling pathway | Hedgehog signaling pathway | |
| 0.8 | Alanine, aspartate and glutamate metabolism, | Alanine, aspartate and glutamate metabolism, | |
| 0.9 | TGF-beta signaling pathway | TGF-beta signaling pathway | |
| Liver | 0.1 | Sphingolipid metabolism | Sphingolipid metabolism |
| 0.2 | Focal adhesion, | Focal adhesion, | |
| 0.3 | Tight junction | Sphingolipid metabolism | |
| 0.4 | Sphingolipid metabolism, | Sphingolipid metabolism, | |
| 0.5 | Bacterial invasion of epithelial cells | Bacterial invasion of epithelial cells | |
| 0.6 | Glycerolipid metabolism, | Glycerolipid metabolism | |
| 0.7 | Focal adhesion, | Focal adhesion | |
| 0.8 | Glycerolipid metabolism | Sphingolipid metabolism | |
| 0.9 | Sphingolipid metabolism | Glycerolipid metabolism | |
| Tyroid | 0.1 | Tight junction | Cell adhesion molecules (CAMs) |
| 0.2 | Cell adhesion molecules (CAMs) | Cell adhesion molecules (CAMs) | |
| 0.3 | Tight junctionCell adhesion molecules | Tight junction,Cell adhesion molecules | |
| 0.4 | Fatty acid metabolism | Tight junction, | |
| 0.5 | Regulation of actin cytoskeleton, | Regulation of actin cytoskeleton, | |
| 0.6 | Wnt signaling pathwayCell adhesion molecules | Wnt signaling pathway,Cell adhesion molecules | |
| 0.7 | Fatty acid metabolism | Fatty acid metabolism | |
| 0.8 | MAPK signaling pathway & Fatty acid metabolism | MAPK signaling pathway | |
| 0.9 | Focal adhesion | Focal adhesion | |
| Kidney | 0.1 | Endocytosis, | Regulation of actin cytoskeleton |
| 0.2 | Regulation of actin cytoskeleton | Regulation of actin cytoskeleton | |
| 0.3 | Calcium signaling pathway, | Regulation of actin cytoskeleton, | |
| 0.4 | Endocytosis | Endocytosis | |
| 0.5 | Phosphatidylinositol signaling system, | Phosphatidylinositol signaling system, | |
| 0.6 | Protein processing in endoplasmic reticulum, | Regulation of actin cytoskeleton | |
| 0.7 | Endocytosis, | Endocytosis, | |
| 0.8 | PPAR signaling pathway | PPAR signaling pathway | |
| 0.9 | Calcium signaling pathway | Endocytosis | |
| Breast | 0.1 | Neuroactive ligand-receptor interaction | Adipocytokine signaling pathway |
| 0.2 | Glycerophospholipid metabolism | Glycerophospholipid metabolism | |
| 0.3 | Neuroactive ligand-receptor interaction | Neuroactive ligand-receptor interaction | |
| 0.4 | Adipocytokine signaling pathway, | Fatty acid metabolism | |
| 0.5 | Cytokine-cytokine receptor interaction, | Cytokine-cytokine receptor interaction | |
| 0.6 | Jak-STAT signaling pathway | Jak-STAT signaling pathway | |
| 0.7 | Neuroactive ligand-receptor interaction | Adipocytokine signaling pathway, | |
| 0.8 | Chemokine signaling pathway | Chemokine signaling pathway | |
| 0.9 | Adipocytokine signaling pathway, | Adipocytokine signaling pathway, | |
Number of risk pathway detected by sDRW and DRW.
| Datasets | Method | Restart probabilities, | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | ||
| Lung, | sDRW | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | |
| GSE10072 | DRW | 1 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 |
| Detected Extra pathway | 2 | 1 | 0 | −1 | 0 | 0 | 0 | 1 | 0 | |
| Stomach, | sDRW | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | |
| GSE13911 | DRW | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 1 |
| Detected Extra pathway | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | |
| Liver, | sDRW | 1 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | |
| GSE17856 | DRW | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 1 |
| Detected Extra pathway | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | |
| Tyroid, | sDRW | 1 | 2 | 2 | 2 | 2 | 1 | 2 | 1 | |
| GSE5364 | DRW | 1 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 1 |
| Detected Extra pathway | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 1 | 0 | |
| Kidney, | sDRW | 2 | 1 | 2 | 1 | 2 | 2 | 1 | 1 | |
| GSE17895 | DRW | 1 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 2 |
| Detected Extra pathway | 1 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | −1 | |
| Breast, | sDRW | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | |
| GSE1456 | DRW | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 2 |
| Detected Extra pathway | 0 | 0 | 0 | 2 | 1 | 0 | −1 | 0 | 0 | |
*The bold r is the optimum restart probability for sDRW.
Fig. 10Comparison of number of detected risk pathways between sDRW and DRW in six different cancer datasets.
AUC of every datasets against restart probabilities from 0.1 to 0.9 in sDRW.
| Datasets | Restart Probabilities, | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | |
| Lung | 0.9676 | 0.9702 | 0.9818 | 0.9764 | 0.9819 | 0.9877 | 0.9877 | 0.9582 | 0.9871 |
| Stomach | 0.9472 | 0.9749 | 0.9362 | 0.8935 | 0.9356 | 0.9642 | 0.9215 | 0.9784 | 0.95478 |
| Liver | 0.9469 | 0.9844 | 0.9427 | 0.9629 | 0.9428 | 0.9525 | 0.9635 | 0.9836 | 0.9684 |
| Tyroid | 0.9426 | 0.9579 | 0.9869 | 0.9258 | 0.9538 | 0.9125 | 0.9312 | 0.9216 | 0.9528 |
| Kidney | 0.9615 | 0.9472 | 0.9637 | 0.9578 | 0.9472 | 0.9478 | 0.9573 | 0.9268 | 0.9637 |
| Breast | 0.8493 | 0.7042 | 0.7296 | 0.9508 | 0.8941 | 0.8251 | 0.8466 | 0.9943 | 0.9467 |
Number of cancerous gene detected by sDRW and DRW.
| Datasets | Method | Restart Probabilities, | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | ||
| Lung, | sDRW | 160 | 118 | 49 | 112 | 118 | 118 | 118 | 49 | |
| GSE10072 | DRW | 63 | 49 | 49 | 167 | 167 | 63 | 118 | 118 | 45 |
| Increment of percentage, % | 325.3968 | 226.5306 | 140.8163 | −70.6597 | −32.9341 | 87.3016 | 0 | 0 | 8.8889 | |
| Stomach, | sDRW | 41 | 53 | 109 | 65 | 70 | 108 | 24 | 41 | |
| GSE13911 | DRW | 41 | 24 | 80 | 65 | 29 | 108 | 24 | 48 | 41 |
| Increment of percentage, % | 0 | 120.8333 | 36.25 | 0 | 141.3793 | 0 | 0 | 85.4167 | 0 | |
| Liver, | sDRW | 21 | 170 | 61 | 40 | 67 | 136 | 109 | 21 | |
| GSE17856 | DRW | 21 | 130 | 21 | 82 | 40 | 27 | 109 | 21 | 109 |
| Increment of percentage, % | 0 | 30.7692 | 190.4762 | −10.9756 | 0 | 148.1481 | 24.7706 | 4.1905 | −80.7339 | |
| Tyroid, | sDRW | 23 | 29 | 39 | 33 | 52 | 13 | 76 | 51 | |
| GSE5364 | DRW | 16 | 16 | 39 | 43 | 9 | 52 | 13 | 63 | 51 |
| Increment of percentage, % | 43.75 | 81.25 | 0 | −23.2558 | 988.8889 | 0 | 0 | 20.6349 | 0 | |
| Kidney, | sDRW | 73 | 39 | 175 | 34 | 53 | 73 | 19 | 161 | |
| GSE17895 | DRW | 39 | 39 | 53 | 34 | 53 | 39 | 73 | 19 | 73 |
| Increment of percentage, % | 87.1795 | 0 | 230.1887 | 0 | 0 | 141.0256 | 0 | 0 | 120.5479 | |
| Breast, | sDRW | 19 | 12 | 19 | 35 | 21 | 19 | 23 | 26 | |
| GSE1456 | DRW | 14 | 12 | 19 | 9 | 26 | 21 | 33 | 23 | 23 |
| Increment of percentage, % | 35.7143 | 0 | 0 | 388.8889 | 34.6138 | 0 | −42.4242 | 0 | 13.0435 | |
*The bold r is the optimum restart probability for sDRW.
Comparison of AUC between sDRW and DRW.
| Dataset | Method | Restart Probabilities, | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | ||
| Lung, | sDRW | 0.9702 | 0.9818 | 0.9764 | 0.9819 | 0.9877 | 0.9877 | 0.9582 | 0.9871 | |
| GSE10072 | DRW | 0.9636 | 0.9761 | 0.9699 | 0.976 | 0.963 | 0.9817 | 0.9764 | 0.9764 | 0.9816 |
| Stomach, | sDRW | 0.9472 | 0.9749 | 0.9362 | 0.8935 | 0.9356 | 0.9642 | 0.9215 | 0.95478 | |
| GSE13911 | DRW | 0.9362 | 0.9235 | 0.9424 | 0.9531 | 0.9235 | 0.9642 | 0.9148 | 0.9548 | 0.9642 |
| Liver, | sDRW | 0.9469 | 0.9844 | 0.9427 | 0.9428 | 0.9525 | 0.9635 | 0.9836 | 0.9684 | |
| GSE17856 | DRW | 0.9225 | 0.9528 | 0.9483 | 0.9468 | 0.9241 | 0.9216 | 0.9574 | 0.9748 | 0.9425 |
| Tyroid, | sDRW | 0.9426 | 0.9579 | 0.9869 | 0.9258 | 0.9125 | 0.9312 | 0.9216 | 0.9528 | |
| GSE5364 | DRW | 0.9461 | 0.9472 | 0.9572 | 0.9462 | 0.9136 | 0.8467 | 0.9318 | 0.9127 | 0.9424 |
| Kidney, | sDRW | 0.9615 | 0.9472 | 0.9637 | 0.9578 | 0.9472 | 0.9573 | 0.9268 | 0.9637 | |
| GSE17895 | DRW | 0.9437 | 0.9426 | 0.9259 | 0.9471 | 0.9421 | 0.9431 | 0.9841 | 0.9144 | 0.9258 |
| Breast, | sDRW | 0.8493 | 0.7042 | 0.7296 | 0.8941 | 0.8251 | 0.8466 | 0.9943 | 0.9467 | |
| GSE1456 | DRW | 0.6379 | 0.7821 | 0.6872 | 0.9496 | 0.9135 | 0.7258 | 0.5984 | 0.9546 | 0.9268 |
*The bold r is the optimum restart probability for sDRW.
Fig. 11Comparison number of detected significant genes between sDRW and DRW in 8 different cancer datasets.