| Literature DB >> 31296219 |
Sungjoon Park1, Doyeong Hwang1, Yoon Sun Yeo1, Hyunggee Kim2,3, Jaewoo Kang4,5.
Abstract
BACKGROUND: Gene expression data is widely used for identifying subtypes of diseases such as cancer. Differentially expressed gene analysis and gene set enrichment analysis are widely used for identifying biological mechanisms at the gene level and gene set level, respectively. However, the results of differentially expressed gene analysis are difficult to interpret and gene set enrichment analysis does not consider the interactions among genes in a gene set.Entities:
Keywords: Breast cancer subtype; Context specific regulatory module; Feature importance score; Gene regulatory network inference; Single sample GSEA
Mesh:
Year: 2019 PMID: 31296219 PMCID: PMC6624175 DOI: 10.1186/s12920-019-0515-6
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Overview of CONFIGURE
Fig. 2An illustration of a regulatory module
Classifying breast cancer subtypes according to the expression status of three breast cancer biomarkers
| ER | PR | HER2 | Ki67 | ||
|---|---|---|---|---|---|
| Luminal A | + | + | - | - | |
| Luminal B | HER2 + | + | + | + | |
| HER2 - | + | + | - | + | |
| HER2 | - | - | + | ||
| Basal-like | - | - | - |
The number of samples of each breast cancer subtype
| Luminal A | Luminal B | Her2 | Basal-like | Total | |
|---|---|---|---|---|---|
| # of Samples | 391 | 370 | 109 | 202 | 1072 |
Performance of multi-class classifiers
| Accuracy | F1-macro | F1-weighted | |
|---|---|---|---|
|
| 0.8983 | 0.8924 | 0.8986 |
| SVM-Gene expression | 0.8899 | 0.8917 | 0.8898 |
| SVM-Gene expression (Hallmarks) | 0.8834 | 0.8923 | 0.8831 |
| COSSY | 0.8657 | 0.8225 | 0.8723 |
| Dominant Class Prediction | 0.3451 | 0.1283 | 0.5132 |
Performance of one-vs-rest binary classifiers
| Luminal A | Luminal B | HER2 | Basal-like | Average | |
|---|---|---|---|---|---|
| Accuracy | |||||
|
| 0.9366 | 0.8722 | 0.9627 | 0.9907 | 0.9405 |
| SVM-Gene expression | 0.9104 | 0.8741 | 0.9664 | 0.9841 | 0.9338 |
| SVM-Gene expression (Hallmarks) | 0.9291 | 0.8657 | 0.958 | 0.9888 | 0.9354 |
| COSSY | 0.8871 | 0.7836 | 0.9067 | 0.9813 | 0.8897 |
| Dominant Class Prediction | 0.6353 | 0.6549 | 0.8983 | 0.8116 | 0.75 |
| F1-Score | |||||
|
| 0.913 | 0.8105 | 0.7959 | 0.9747 | 0.8736 |
| SVM-Gene expression | 0.8772 | 0.8143 | 0.8378 | 0.958 | 0.8719 |
| SVM-Gene expression (Hallmarks) | 0.9033 | 0.8 | 0.7887 | 0.9698 | 0.8655 |
| COSSY | 0.8428 | 0.8542 | 0.3101 | 0.9506 | 0.7394 |
| Dominant Class Prediction | 0 | 0 | 0 | 0 | 0 |
The results of basal-like type specific regulatory modules obtained by CONFIGURE
|
|
|
|
|
|
|---|---|---|---|---|
| POU5F1(-) | TOX3 | 0.1328 | down-regulated | [ |
| RALGPS2 | ||||
| FUT8 | ||||
| HMGCR | ||||
| FOXA1 | ||||
| ZIC1(-) | XBP1 | 0.1072 | down-regulated | |
| OVOL1 | ||||
| SLC1A4 | ||||
| SMAD7 | ||||
| CNTN1 | ||||
| RARA(+) | RARA | 0.0827 | down-regulated | |
| STARD3 | ||||
| PLEKHH3 | ||||
| MAG | ||||
| PCGF2 | ||||
| E2F3(+) | E2F3 | 0.0667 | up-regulated | |
| ANP32E | ||||
| GEN1 | ||||
| SYNCRIP | ||||
| BEND3 | ||||
| GATA6(-) | MAST4 | 0.058 | down-regulated | [ |
| PDE6B | ||||
| ROBO2 | ||||
| KIF5A | ||||
| ABI2 | ||||
| PHOX2B(+) | PHOX2B | 0.0454 | up-regulated | |
| DDC | ||||
| MSGN1 | ||||
| AKR1D1 | ||||
| FABP7 | ||||
| GLI3(-) | PPIF | 0.0447 | up-regulated | [ |
| ELF5 | ||||
| ORAI1 | ||||
| POR | ||||
| HMGA1 | ||||
| ETV6(+) | PHB2 | 0.0346 | up-regulated | |
| NCAPD2 | ||||
| VANGL2 | ||||
| PLEKHA5 | ||||
| ETV6 | ||||
| SRF(-) | PAIP2 | 0.0344 | down-regulated | [ |
| ERLEC1 | ||||
| NECAP1 | ||||
| SCRN3 | ||||
| ZFP62 | ||||
| PLAGL1(-) | SLC25A17 | 0.0327 | down-regulated | |
| NPBWR2 | ||||
| PTK6 | ||||
| SYCE2 | ||||
| HN1L |