| Literature DB >> 25972728 |
Fan Zhang1, Youping Deng2, Mu Wang3, Li Cui4, Renee Drabier5.
Abstract
Genes do not function alone but through complex biological pathways. Pathway-based biomarkers may be a reliable diagnostic tool for early detection of breast cancer due to the fact that breast cancer is not a single homogeneous disease. We applied Integrated Pathway Analysis Database (IPAD) and Gene Set Enrichment Analysis (GSEA) approaches to the study of pathway-based biomarker discovery problem in breast cancer proteomics. Our strategy for identifying and analyzing pathway-based biomarkers are threefold. Firstly, we performed pathway analysis with IPAD to build the gene set database. Secondly, we ran GSEA to identify 16 pathway-based biomarkers. Lastly, we built a Support Vector Machine model with three-way data split and fivefold cross-validation to validate the biomarkers. The approach-unraveling the intricate pathways, networks, and functional contexts in which genes or proteins function-is essential to the understanding molecular mechanisms of pathway-based biomarkers in breast cancer.Entities:
Keywords: breast cancer; machine learning; pathway analysis; proteomics
Year: 2015 PMID: 25972728 PMCID: PMC4324474 DOI: 10.4137/CIN.S14069
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
16 pathway-based biomarkers identified.
| PATHWAY ID | PATHWAY NAME | AE | N | ES | FDR |
|---|---|---|---|---|---|
| h_classicPathway | Classical complement pathway | 14 | 14 | 0.70549 | 0.07245 |
| 500792 | GPCR ligand binding | 16 | 409 | 0.60928 | 0.08991 |
| hsa05322 | Systemic lupus erythematosus | 17 | 141 | 0.68254 | 0.10214 |
| 166663 | Initial triggering of complement | 13 | 63 | 0.64665 | 0.10883 |
| h_lectinPathway | Lectin-induced complement pathway | 10 | 12 | 0.76798 | 0.1105 |
| h_compPathway | Complement pathway | 16 | 19 | 0.66422 | 0.12402 |
| 200149 | PDGFR-beta signaling pathway | 12 | 129 | 0.70522 | 0.16918 |
| hsa04610 | Complement and coagulation cascades | 42 | 71 | 0.50477 | 0.17074 |
| 373076 | Class A/1 (Rhodopsin-like receptors) | 10 | 304 | 0.7634 | 0.1732 |
| 375276 | Peptide ligand–binding receptors | 10 | 185 | 0.7634 | 0.1732 |
| 392499 | Metabolism of proteins | 18 | 378 | 0.56802 | 0.21487 |
| 381150 | Diabetes pathways | 15 | 137 | 0.57968 | 0.21726 |
| 388396 | GPCR downstream signaling | 27 | 866 | 0.48634 | 0.22031 |
| hsa05133 | Pertussis | 18 | 84 | 0.5792 | 0.22171 |
| hsa05150 | 16 | 69 | 0.58669 | 0.22746 | |
| 418594 | G alpha (i) signaling events | 12 | 199 | 0.67202 | 0.23845 |
Notes: AE is the number of genes presented in the pathway. N is the total number of genes in the pathway.
Validation with SVM for the 16 pathway-based biomarkers.
| PATHWAY ID | TRAINING SET | TESTING SET | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PRECISION (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ROC (%) | PRECISION (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ROC (%) | |
| h_classicPathway | 80.56 | 77.50 | 72.50 | 82.50 | 87.88 | 71.43 | 72.50 | 75.00 | 70.00 | 78.63 |
| 500792 | 79.49 | 78.75 | 77.50 | 80.00 | 90.63 | 74.47 | 78.75 | 87.50 | 70.00 | 89.00 |
| hsa05322 | 73.17 | 73.75 | 75.00 | 72.50 | 88.88 | 69.39 | 73.75 | 85.00 | 62.50 | 82.19 |
| 166663 | 77.27 | 80.00 | 85.00 | 75.00 | 88.94 | 69.39 | 73.75 | 85.00 | 62.50 | 77.81 |
| h_lectinPathway | 73.91 | 77.50 | 85.00 | 70.00 | 89.56 | 64.29 | 65.00 | 67.50 | 62.50 | 75.63 |
| h_compPathway | 80.00 | 80.00 | 80.00 | 80.00 | 90.44 | 70.45 | 72.50 | 77.50 | 67.50 | 79.19 |
| 200149 | 72.22 | 70.00 | 65.00 | 75.00 | 78.69 | 57.14 | 53.75 | 30.00 | 77.50 | 65.19 |
| hsa04610 | 86.67 | 91.25 | 97.50 | 85.00 | 96.94 | 79.59 | 86.25 | 97.50 | 75.00 | 88.81 |
| 375276 | 80.56 | 77.50 | 72.50 | 82.50 | 89.69 | 75.56 | 78.75 | 85.00 | 72.50 | 86.75 |
| 373076 | 80.56 | 77.50 | 72.50 | 82.50 | 89.69 | 75.56 | 78.75 | 85.00 | 72.50 | 86.75 |
| 392499 | 82.05 | 81.25 | 80.00 | 82.50 | 93.25 | 78.26 | 82.50 | 90.00 | 75.00 | 91.88 |
| 381150 | 81.40 | 83.75 | 87.50 | 80.00 | 93.44 | 76.47 | 83.75 | 97.50 | 70.00 | 88.38 |
| 388396 | 92.50 | 92.50 | 92.50 | 92.50 | 98.19 | 82.98 | 88.75 | 97.50 | 80.00 | 94.81 |
| hsa05133 | 81.58 | 80.00 | 77.50 | 82.50 | 90.94 | 73.81 | 75.00 | 77.50 | 72.50 | 81.44 |
| hsa05150 | 84.09 | 87.50 | 92.50 | 82.50 | 93.81 | 73.91 | 77.50 | 85.00 | 70.00 | 81.88 |
| 418594 | 86.11 | 82.50 | 77.50 | 87.50 | 91.13 | 80.00 | 83.75 | 90.00 | 77.50 | 87.75 |
| Mean | 80.76 | 80.70 | 80.63 | 80.78 | 90.75 | 73.29 | 76.56 | 82.03 | 71.09 | 83.50 |
Prediction result for the GPCR downstream signaling.
| PREDICTED | TRAINING SET | TESTING SET | ||
|---|---|---|---|---|
| CANCER | NORMAL | CANCER | NORMAL | |
| Cancer | 37 | 3 | 39 | 8 |
| Normal | 3 | 37 | 1 | 32 |
| Precision | 92.50% | 82.98% | ||
| Accuracy | 92.50% | 88.75% | ||
| Sensitivity | 92.50% | 97.50% | ||
| Specificity | 92.50% | 80.00% | ||
Figure 1Seventeen genes pathway-based biomarkers predicting the healthy and breast cancer samples in testing set. X axis shows the 17 genes in the GPCR downstream signaling pathway. Y-axis shows the 40 breast cancer and 40 healthy samples.
Notes: H, healthy; P, cancer; blue, predicted as healthy; yellow, predicted as cancer.
Prediction performance after removing all “insignificant” proteins in pathways.
| PATHWAY ID | TRAINING SET | TESTING SET | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PRECISION (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ROC (%) | PRECISION (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ROC (%) | |
| h_classicPathway | 71.25 | 67.50 | 75.00 | 85.44 | 65.85 | 66.25 | 67.50 | 65.00 | 75.00 | 71.25 |
| 500792 | 70.00 | 77.50 | 62.50 | 76.75 | 57.14 | 56.25 | 50.00 | 62.50 | 63.00 | 70.00 |
| hsa05322 | 73.75 | 75.00 | 72.50 | 86.31 | 69.39 | 73.75 | 85.00 | 62.50 | 75.25 | 73.75 |
| 166663 | 75.00 | 67.50 | 82.50 | 84.06 | 60.00 | 55.00 | 30.00 | 80.00 | 74.06 | 75.00 |
| h_lectinPathway | 70.00 | 65.00 | 75.00 | 85.00 | 62.86 | 61.25 | 55.00 | 67.50 | 74.25 | 70.00 |
| h_compPathway | 71.25 | 67.50 | 75.00 | 85.44 | 65.85 | 66.25 | 67.50 | 65.00 | 75.00 | 71.25 |
| 200149 | 70.00 | 60.00 | 80.00 | 71.41 | 57.89 | 53.75 | 27.50 | 80.00 | 47.28 | 70.00 |
| hsa04610 | 77.50 | 75.00 | 80.00 | 88.50 | 71.79 | 71.25 | 70.00 | 72.50 | 78.81 | 77.50 |
| 375276 | 70.00 | 77.50 | 62.50 | 76.75 | 57.14 | 56.25 | 50.00 | 62.50 | 63.00 | 70.00 |
| 373076 | 70.00 | 77.50 | 62.50 | 76.75 | 57.14 | 56.25 | 50.00 | 62.50 | 63.00 | 70.00 |
| 392499 | 73.75 | 70.00 | 77.50 | 71.16 | 66.67 | 58.75 | 35.00 | 82.50 | 60.06 | 73.75 |
| 381150 | 71.25 | 75.00 | 67.50 | 83.38 | 56.00 | 53.75 | 35.00 | 72.50 | 71.88 | 71.25 |
| 388396 | 70.00 | 77.50 | 62.50 | 76.75 | 57.14 | 56.25 | 50.00 | 62.50 | 63.00 | 70.00 |
| hsa05133 | 75.00 | 70.00 | 80.00 | 85.00 | 62.07 | 58.75 | 45.00 | 72.50 | 74.00 | 75.00 |
| hsa05150 | 73.75 | 65.00 | 82.50 | 84.00 | 64.52 | 61.25 | 50.00 | 72.50 | 74.06 | 73.75 |
| 418594 | 70.00 | 77.50 | 62.50 | 76.75 | 57.14 | 56.25 | 50.00 | 62.50 | 63.00 | 70.00 |
| MEAN | 72.03 | 71.56 | 72.50 | 80.84 | 61.79 | 60.08 | 51.09 | 69.06 | 68.42 | 72.03 |