| Literature DB >> 30382159 |
Srinivasulu Yerukala Sathipati1, Shinn-Ying Ho2,3,4.
Abstract
Breast cancer is a heterogeneous disease and one of the most common cancers among women. Recently, microRNAs (miRNAs) have been used as biomarkers due to their effective role in cancer diagnosis. This study proposes a support vector machine (SVM)-based classifier SVM-BRC to categorize patients with breast cancer into early and advanced stages. SVM-BRC uses an optimal feature selection method, inheritable bi-objective combinatorial genetic algorithm, to identify a miRNA signature which is a small set of informative miRNAs while maximizing prediction accuracy. MiRNA expression profiles of a 386-patient cohort of breast cancer were retrieved from The Cancer Genome Atlas. SVM-BRC identified 34 of 503 miRNAs as a signature and achieved a 10-fold cross-validation mean accuracy, sensitivity, specificity, and Matthews correlation coefficient of 80.38%, 0.79, 0.81, and 0.60, respectively. Functional enrichment of the 10 highest ranked miRNAs was analysed in terms of Kyoto Encyclopedia of Genes and Genomes and Gene Ontology annotations. Kaplan-Meier survival analysis of the highest ranked miRNAs revealed that four miRNAs, hsa-miR-503, hsa-miR-1307, hsa-miR-212 and hsa-miR-592, were significantly associated with the prognosis of patients with breast cancer.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30382159 PMCID: PMC6208346 DOI: 10.1038/s41598-018-34604-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of SVM-BRC with the some classifiers for the 386-patient breast cancer cohort.
| Method | 10-CV accuracy (%) | Sensitivity | Specificity | MCC |
|---|---|---|---|---|
| SVM-BRC-Mean | 80.38 ± 1.55 | 0.79 ± 2.7 | 0.81 ± 2.26 | 0.60 ± 0.03 |
| SVM-BRC-Best | 83.16 | 0.84 | 0.81 | 0.66 |
| Random forest | 66.83 | 0.66 | 0.67 | 0.33 |
| Multilayer perceptron | 57.25 | 0.57 | 0.57 | 0.14 |
| SMO | 62.69 | 0.62 | 0.63 | 0.25 |
| Naïve Bayes | 64.50 | 0.63 | 0.65 | 0.29 |
| Decision tree | 50.25 | 0.50 | 0.50 | 0.01 |
Figure 1SVM-BRC performance evaluation using the ROC curve. The area under the ROC curve is 0.87 using a 386-patient breast cancer cohort.
Ten highest ranked miRNAs and feature knockout analysis of individual miRNAs.
| Rank | miRNA | MED scores | Accuracy difference (%) |
|---|---|---|---|
| 1 | hsa-miR-200c | 69.68 | 20.99 |
| 2 | hsa-miR-503 | 65.02 | 20.73 |
| 3 | hsa-miR-1307 | 48.44 | 21.25 |
| 4 | hsa-miR-361 | 47.92 | 21.25 |
| 5 | hsa-miR-212 | 46.89 | 20.99 |
| 6 | hsa-miR-592 | 46.89 | 19.95 |
| 7 | hsa-miR-1185-1 | 43.26 | 20.73 |
| 8 | hsa-miR-146b | 43.26 | 19.69 |
| 9 | hsa-miR-1468 | 34.45 | 21.25 |
| 10 | hsa-miR-769 | 30.82 | 20.47 |
Figure 2Feature knockout analysis. Prediction performance difference for individual miRNAs using feature knockout analysis.
Enriched KEGG pathways and the corresponding target genes for the 10 highest ranked miRNAs.
| KEGG pathway | p-value | Target genes |
|---|---|---|
| Fatty acid biosynthesis (hsa00061) | <1e-325 | FASN |
| Fatty acid metabolism (hsa01212) | <1e-325 | FASN |
| Adherens junction (hsa04520) | 4.47E-06 | TGFBR1, MET, WASL, SMAD2, ACTG1, IQGAP1, IGF1R, VCL, RHOA, TJP1, MLLT4, CDH1, CTNNB1, CTNNA1, WASF2, ACTN4, CREBBP |
| Protein processing in endoplasmic reticulum (hsa04141) | 0.00083483 | HSPA1A, EIF2AK1, SSR1, RAD23B, AMFR, UGGT1, YOD1, SEL1L, HSP90AA1, DNAJC10, UBE2E2, STT3B, HSPH1, PDIA6, RAD23A, PRKCSH, VCP, HSPA8, LMAN1, RPN2, DERL1, HSPA1B |
| Cytokine-cytokine receptor interaction (hsa04060) | 0.002767508 | IL6ST |
| Bacterial invasion of epithelial cells (hsa05100) | 0.01255968 | ARPC5L, MET, ITGB1, WASL, SEPT11, ACTG1, VCL, RHOA, CD2AP, CDH1, CLTA, WASF2, FN1, ARPC2 |
| Spliceosome (hsa03040) | 0.02884541 | RBM25, HSPA1A, HNRNPA1, DDX23, PPIL1, U2SURP, PRPF8, SRSF1, HNRNPM, DHX15, HSPA8, DHX16, SRSF3, HSPA1B, SNRPC, SNRNP200, SRSF8 |
| Proteoglycans in cancer (hsa05205) | 0.03666157 | PDCD4, MET, ITGB1, EZR, ARHGEF12, ACTG1, FRS2, IQGAP1, RHOA, ERBB3, ITGAV, LUM, HOXD10, FN1, MAP2K1, SDC4, TWIST1, VEGFA, MDM2, SMAD2, WNT5A, PPP1CC, ACTG1, TIAM1, IGF1R, AKT2, PTK2, CTNNB1, ITGA2, DDX5, GAB1 |
Figure 3KEGG pathway analysis of the 10 highest ranked miRNAs.
Figure 4Gene ontology (GO) annotations for the 10 highest ranked miRNAs. GO enrichment analysis was performed for the 10 highest ranked miRNAs at three levels: biological process (a), molecular functions (b), and cellular component (c).
Figure 5Kaplan-Meier plots of hsa-miR-503, hsa-miR-1307, hsa-miR-212, and hsa-miR-592 for the systemically treated breast cancer cohort.