| Literature DB >> 33138797 |
Gyeong Back Lim1,2, Young-Ae Kim2, Jeong-Han Seo2, Hee Jin Lee2, Gyungyub Gong3, Sung Hee Park4,5.
Abstract
BACKGROUND: Triple-Negative Breast Cancer (TNBC) is an aggressive and complex subtype of breast cancer. The current biomarkers used in the context of breast cancer treatment are highly dependent on the targeting of oestrogen receptor, progesterone receptor, or HER2, resulting in treatment failure and disease recurrence and creating clinical challenges. Thus, there is still a crucial need for the improvement of TNBC treatment; the discovery of effective biomarkers that can be easily translated to the clinics is essential.Entities:
Keywords: Differential expression; NanoString nCounter immune panel; Pathological complete response; Prediction model; Prognostic marker; Random Forest; Relapse; Triple negative breast cancer
Mesh:
Substances:
Year: 2020 PMID: 33138797 PMCID: PMC7607642 DOI: 10.1186/s12885-020-07399-8
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1A Work flow diagram
Dataset
| Training | Test | |
|---|---|---|
| anon-pCR | 26 | 11 |
| bpCR | 12 | 6 |
| cnon-relapase | 22 | 13 |
| drelapse | 16 | 4 |
a, b: The number of patients in non-pCR and pCR corresponding to training and test groups
c, d: The number of patents in non-relapse and relapse corresponding to training and test groups
DEGs in pCR and relapse groups
| Gene | a |
|---|---|
| bpCR DEG | |
| IL2RA | 2.34E-07 |
| CCL5 | 1.17E-06 |
| SELE | 1.71E-04 |
| CCL20 | 1.67E-05 |
| FCER1A | 7.49E-04 |
| CD1A | 8.26E-05 |
| HAMP | 3.35E-04 |
| CD7 | 5.12E-04 |
| C4A.B | 1.34E-04 |
| cRELAPSE DEG | |
| CCL5 | 9.49E-06 |
| vCCL7 | 9.85E-06 |
| TNFSF13B | 5.80E-05 |
| CSF2RB | 1.60E-04 |
| CLEC4E | 2.73E-04 |
| CCL8 | 3.58E-04 |
| SELE | 3.97E-04 |
| EDNRB | 4.34E-04 |
| IL17B | 5.75E-04 |
| IL2RA | 7.44E-04 |
| FCER1A | 8.09E-04 |
| TGFBI | 1.07E-03 |
| GZMB | 1.12E-03 |
Only genes at FDR < 0.05 after multiple hypothesis testing are presented
a: Original P value calculated in edgeR
b, c: DEGs in pCR and relapse conditions respectively
DEGs in combinations of pCR and relapse with POST treatment
| Gene | a |
|---|---|
| KLRG2 | 0.000048 |
| HLA-DQA1 | 1.45E-25 |
| HLA-DQB1 | 6.26E-19 |
| CEACAM6 | 2.36E-04 |
a: Original P value calculated in edgeR
b: Combination of a POST and pCR group compared with a POST and relapse group
c: Combination of a POST and relapse group with a POST and non-relapse group
Performance comparison with our model and Elastic Net model
| Model | aAUC | bPPV |
|---|---|---|
| cpCR model | 0.84 | 0.7 |
| d RELAPSE model | 0.88 | 0.69 |
| EEN pCR model | 0.64 | 0 |
| fEN RELAPSE model | 0.68 | 0.23 |
aAUC: Receiver Operating Characteristic Area Under Curve
bPPV: Predictive Positive Value (TP / TP + FP)
cOur model pCR: Random Forest analysis using pCR DEG.
dOur model RELAPSE: Random Forest analysis using RELAPSE DEG.
eEN model pCR: Random Forest analysis using EN pCR genes (alpha value < 0.95)
fEN model RELAPSE: Random Forest analysis using EN RELAPSE genes (alpha value < 0.2)
Fig. 2Comparison of our prediction models with EN-based prediction models. a AUCs between ours and the EN-based prediction model of pCR. Blue represents our RF model of pCR constructed on the features selected using edgeR, which is more robust than that the model represented in green, based on features selected using EN. b AUCs between ours and EN-based prediction model of relapse. Blue represents our RF model of relapse constructed on the features selected using edgeR
Fig. 3A randomization test for pCR. Plots present the distributions of the AUC and PPV values for all 1000 permutations. Histograms represent (a) AUCs and (b) PPV of permutations. Red represents AUC or PPV of permutations below (Table 4) whilst green represents permutations superior to our thresholds
Fig. 4A randomization test for relapse. Plots present the distribution of the AUC and PPV values for all 1000 permutations. Histograms represent (a) AUC and (b) PPV of permutations. Red represents AUC or PPV below (Table 4) whilst green represents permutations superior to our thresholds
Fig. 5Survival analysis using the Cox proportional hazards regression model. a Stands for the cox model built on pCR genes. Red indicates pCR and blue non-pCR. b Stands for the cox model built on relapse genes. Red represents relapse and blue non-relapse
Significant genes in overall survival analysis for the pCR model
| Gene | aHazard Ratio(95%CI) | b |
|---|---|---|
| CD1A | 0.103439 | 0.00755 |
| FCER1A | 0.47464 | 0.00128 |
a: The Hazard Ratio calculated in the Cox proportional hazards regression model. A Hazard ratio lower than 1 indicates non-risk factor; a ratio higher than 1 indicates risk factor
b: Original P value (< 0.05) calculated in the Cox proportional hazards regression model
Fig. 6Violin plots for significant genes in the Cox model of pCR. a and b represent CD1A and FCER1A, respectively. CD1A, and FCER1A have high expression in non-pCR
Significant genes in disease free survival analysis for the relapse model
| Gene | aHazard Ratio(95%CI) | b |
|---|---|---|
| FCER1A | 0.14087 | 0.0458 |
| EDNRB | 0.35933 | 0.0337 |
| TGFBI | 0.52611 | 0.0262 |
a: The Hazard Ratio calculated in the Cox proportional hazards regression model. A Hazard ratio lower than 1 indicates non-risk factor; a ratio higher than 1 indicates risk factor
b: Original P value (< 0.05) calculated in the Cox proportional hazards regression model
Fig. 7Violin plots for significant genes in the Cox model of relapse. Violin plots for three significant genes in the Cox proportional hazards regression model. a Corresponds to EDNRB with high expression at relapse, b corresponds to FCER1A with high expression at relapse, and c corresponds to TGFBI with high expression at relapse
Functional enrichment analysis for pCR and RELAPSE models
| Category | Term | Genes | a |
|---|---|---|---|
| bpCR | |||
| GOTERM_BP_DIRECT | GO:0006955 ~ immune response | IL2RA, CCL20, HAMP, CD1A, CCL5, CD7 | 1.95E-07 |
| UP_SEQ_FEATURE | disulfide bond | FCER1A, IL2RA, CCL20, HAMP, CD1A, CCL5, SELE, CD7 | 1.36E-06 |
| UP_SEQ_FEATURE | signal peptide | FCER1A, IL2RA, CCL20, HAMP, CD1A, CCL5, SELE, CD7 | 3.57E-06 |
| UP_KEYWORDS | Disulfide bond | FCER1A, IL2RA, CCL20, HAMP, CD1A, CCL5, SELE, CD7 | 3.58E-06 |
| UP_KEYWORDS | Signal | FCER1A, IL2RA, CCL20, HAMP, CD1A, CCL5, SELE, CD7 | 1.37E-05 |
| KEGG_PATHWAY | hsa04640:Hematopoietic cell lineage | IL2RA, CD1A, CD7 | 0.002295 |
| KEGG_PATHWAY | hsa04668:TNF signaling pathway | CCL20, CCL5, SELE | 0.003452 |
| KEGG_PATHWAY | hsa04060:Cytokine-cytokine receptor interaction | IL2RA, CCL20, CCL5 | 0.016968 |
| KEGG_PATHWAY | hsa05323:Rheumatoid arthritis | CCL20, CCL5 | 0.074368 |
| cRELAPSE | |||
| UP_SEQ_FEATURE | disulfide bond | FCER1A, IL2RA, CCL8, GZMB, CCL5, CCL7, EDNRB, IL17B, TNFSF13B, CLEC4E, TGFBI, CSF2RB, SELE | 8.75E-11 |
| UP_KEYWORDS | Disulfide bond | FCER1A, IL2RA, CCL8, GZMB, CCL5, CCL7, EDNRB, IL17B, TNFSF13B, CLEC4E, TGFBI, CSF2RB, SELE | 4.58E-10 |
| UP_SEQ_FEATURE | signal peptide | FCER1A, EDNRB, IL17B, IL2RA, TGFBI, CCL8, CSF2RB, GZMB, CCL5, SELE, CCL7 | 7.83E-07 |
| UP_KEYWORDS | Cytokine | IL17B, TNFSF13B, CCL8, CCL5, CCL7 | 3.29E-06 |
| GOTERM_BP_DIRECT | GO:0006954 ~ inflammatory response | IL17B, IL2RA, CCL8, CCL5, SELE, CCL7 | 3.97E-06 |
| UP_KEYWORDS | Signal | FCER1A, EDNRB, IL17B, IL2RA, TGFBI, CCL8, CSF2RB, GZMB, CCL5, SELE, CCL7 | 4.97E-06 |
| UP_KEYWORDS | Glycoprotein | FCER1A, EDNRB, IL17B, IL2RA, CLEC4E, TNFSF13B, CSF2RB, GZMB, CCL5, SELE, CCL7 | 1.17E-05 |
| KEGG_PATHWAY | hsa04060:Cytokine-cytokine receptor interaction | IL17B, IL2RA, TNFSF13B, CCL8, CSF2RB, CCL5, CCL7 | 7.28E-07 |
| KEGG_PATHWAY | hsa04062:Chemokine signaling pathway | CCL8, CCL5, CCL7 | 0.034067 |
Only annotations with a FDR < 0.05 after multiple hypothesis testing are presented from DAVID outputs
a: Original P value (FDR < 0.05) calculated in DAVID
Fig. 8An interaction network for genes in the pCR model. The interaction network is derived from STRING for the nine significant genes in the pCR model. FCER1A did not form any interaction as per the STRING Database. Red represents interactions derived from gene fusion, purple from experiments, and yellow from text mining
Fig. 9An interaction network for genes in the relapse model. Of 13 genes in the relapse model, eight genes formed interactions. Five genes on the top left corner did not form any interaction as per the STRING database. The interaction network is derived from STRING for the nine significant genes. Red represents interactions derived from gene fusion, purple from experiments, and yellow from text mining
Molecular function enrichment of genes in interaction networks
| pCR | RELAPSE | ||
|---|---|---|---|
| GO-term | term description | GO-term | term description |
| GO:0019955 | cytokine binding | GO:0005125 | cytokine activity |
| GO:0004896 | cytokine receptor activity | GO:0001664 | G protein-coupled receptor binding |
| GO:0004911 | interleukin-2 receptor activity | GO:0005102 | signaling receptor binding |
| GO:0019976 | interleukin-2 binding | GO:0005126 | cytokine receptor binding |
| GO:0098772 | molecular function regulator | GO:0048020 | CCR chemokine receptor binding |
| GO:0005515 | protein binding | GO:0098772 | molecular function regulator |
| GO:0016493 | C-C chemokine receptor activity | GO:0008009 | chemokine activity |
| GO:0019957 | C-C chemokine binding | GO:0031726 | CCR1 chemokine receptor binding |
| GO:0038023 | signaling receptor activity | GO:0016004 | phospholipase activator activity |
| GO:0001848 | complement binding | GO:0004896 | cytokine receptor activity |
| GO:0004435 | phosphatidylinositol phospholipase C activity | GO:0038023 | signaling receptor activity |
| GO:0048018 | receptor ligand activity | GO:0004435 | phosphatidylinositol phospholipase C activity |
| GO:0015026 | coreceptor activity | GO:0005515 | protein binding |
| GO:0048020 | CCR chemokine receptor binding | GO:0005088 | Ras guanyl-nucleotide exchange factor activity |
| GO:0005125 | cytokine activity | GO:0030246 | carbohydrate binding |
| GO:0008009 | chemokine activity | GO:0019209 | kinase activator activity |
| GO:0005088 | Ras guanyl-nucleotide exchange factor activity | GO:0019955 | cytokine binding |
| GO:0001664 | G protein-coupled receptor binding | GO:0004888 | transmembrane signaling receptor activity |
| GO:0005126 | cytokine receptor binding | GO:0008528 | G protein-coupled peptide receptor activity |
| GO:0030246 | carbohydrate binding | GO:0008201 | heparin binding |
| GO:0004888 | transmembrane signaling receptor activity | GO:0008047 | enzyme activator activity |
| GO:0019209 | kinase activator activity | ||
| GO:0005102 | signaling receptor binding | ||
| GO:0005488 | binding | ||
| GO:0030234 | enzyme regulator activity | ||
| GO:0004866 | endopeptidase inhibitor activity | ||
| GO:0004252 | serine-type endopeptidase activity | ||
Molecular Function analysis presents. GO Gene Ontology. GO terms at FDR < 0.05 present