| Literature DB >> 33708832 |
Wenning Cao1,2, Yike Jiang3,4, Xiang Ji2,5, Xuejiao Guan2,5, Qianyu Lin2,4, Lan Ma2,3,4,6.
Abstract
BACKGROUND: Triple-negative breast cancer (TNBC) is an aggressive subtype of breast cancer with high rates of metastasis and recurrence. Conventional clinical treatments are ineffective for it as it lacks therapeutic biomarkers. Figuring out the biomarkers related to TNBC will be beneficial for its clinical treatment and prognosis.Entities:
Keywords: Triple-negative breast cancer (TNBC); differentially expressed genes; meta-analysis; prognostic biomarkers; weighted gene co-expressed network analysis (WGCNA)
Year: 2021 PMID: 33708832 PMCID: PMC7940929 DOI: 10.21037/atm-20-5989
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Characteristics of individual studies included in meta-analysis
| GEO accession | Chip | Number of probes | Sample size | Number of TNBC | Number of non-TNBC |
|---|---|---|---|---|---|
| GSE27447 | [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array | 33297 | 19 | 5 | 14 |
| GSE43358 | [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array | 54675 | 57 | 17 | 40 |
| GSE36295 | [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array | 33297 | 38 | 11 | 27 |
| GSE61724 | [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array | 33297 | 64 | 16 | 48 |
| GSE75678 | Agilent-014850 Whole Human Genome Microarray | 45220 | 54 | 14 | 40 |
| Total | 232 | 63 | 169 |
GEO, gene expression omnibus; TNBC, triple-negative breast cancer.
MetaQC quantitative quality control measures for selected datasets.
| Dataset | IQC | EQC | CQCg | CQCp | AQCg | AQCp | Rank |
|---|---|---|---|---|---|---|---|
| GSE43353 | 4.91 | 4 | 59.48 | 23.64 | 41.5 | 25.54 | 2.00 |
| GSE75678 | 0.61* | 3.82 | 97.4 | 62.92 | 68.53 | 36.21 | 2.08 |
| GSE27447 | 3.14 | 4 | 17.81 | 3.39 | 11.45 | 8.21 | 3.42 |
| GSE36295 | 4.91 | 3.82 | 14.14 | 17.3 | 5.53 | 10.24 | 3.67 |
| GSE61724 | 5.31 | 3.7 | 15.31 | 7.95 | 8.53 | 5.38 | 3.83 |
*, low performance. IQC, internal quality control; EQC, external quality control; CQCg, consistency quality control of differential expression ranking in genes; CQCp, consistency quality control of differential expression ranking in pathways; AQCg, accuracy quality control of differentially expressed gene detection; AQCp, accuracy quality control of pathway identification.
Figure 1PPI network of DEGs. Red nodes represent higher expression in TNBC samples than non-TNBC samples, while green nodes represent lower expression in TNBC samples than non-TNBC sample. PPI, protein-protein interaction; DEGs, differentially expressed genes; TNBC, triple-negative breast cancer.
Top 11 genes with degree >10 in PPI network
| Gene | Expression | Degree | Q.value | Qpval | τ2 | FDR |
|---|---|---|---|---|---|---|
|
| Down | 42 | 1.33884 | 0.85475 | 0 | 0.0053 |
|
| Up | 23 | 2.55752 | 0.63437 | 0 | 0.0006 |
|
| Up | 18 | 3.96701 | 0.41049 | 0 | 8.89E-03 |
|
| Down | 18 | 1.67509 | 0.79524 | 0 | 3.25E-19 |
|
| Down | 15 | 3.70639 | 0.44720 | 0 | 3.25E-19 |
|
| Down | 15 | 0.47565 | 0.97583 | 0 | 0.0074 |
|
| Down | 14 | 2.23518 | 0.69259 | 0 | 0.0029 |
|
| Down | 13 | 2.37068 | 0.66793 | 0 | 0.0011 |
|
| Down | 13 | 2.95618 | 0.56519 | 0 | 0.0099 |
|
| Up | 11 | 1.77908 | 0.77631 | 0 | 0.0023 |
|
| Up | 11 | 2.86412 | 0.58082 | 0 | 0.0023 |
PPI, protein interaction; Qpval: Q P value; FDR: false discovery rate.
Figure 2The process of WGCNA analysis. (A) Hierarchical clustering dendrograms of identified co-expressed genes in modules. (B) Module-trait relationships. The Y axis is the modules which are represented by “ME + color”. The X axis is the clinical features. Patho_T, Patho_M, and Patho_N stand for the TNM system. Patho_stage is stage of cancer. Lym_nodes_pos means lymph node positive. ER_Neg, PR_Neg, Her2_Neg represent ER negative, PR negative, HER2 negative, respectively. (C) Boxplots showing GS (Y axis) across each module (X axis). (D) Cluster analysis of modules and TNBC to find TNBC-related modules. (E) Scatterplots of correlation between MM (X axis) and GS (Y axis) in cyan module. (F) Scatterplots of correlation between intramodule connectivity (X axis) and GS (Y axis) in cyan module. (G) Scatterplots of correlation between MM (X axis) and GS (Y axis) in yellow module. (H) Scatterplots of correlation between intramodule connectivity (X axis) and GS (Y axis) in yellow module. (I) List of hub genes. WGCNA, weighted gene co-expressed network analysis; MM, module membership.
Figure 3The results of GO enrichment analysis and KEGG enrichment analysis in yellow and cyan modules (P<0.05). (A) GO analysis of genes in cyan module. (B) GO analysis of genes in yellow module. (C) KEGG analysis of genes in cyan module. (D) KEGG analysis of genes in yellow module. GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Top 10 enriched pathways of DEGs
| KEGG term | KEGG ID | P value | Gene symbol |
|---|---|---|---|
| PI3K-Akt signaling pathway | hsa04151 | 6.51E-06 |
|
| Metabolic pathways | hsa01100 | 6.10E-06 |
|
| Protein processing in endoplasmic reticulum | hsa04141 | 6.10E-06 |
|
| Viral carcinogenesis | hsa05203 | 0.00026 |
|
| Huntington's disease | hsa05016 | 0.00076 |
|
| Oxidative phosphorylation | hsa00190 | 0.00163 |
|
| Parkinson's disease | hsa05012 | 0.00216 |
|
| Epstein-Barr virus infection | hsa05169 | 0.00371 |
|
| Proteoglycans in cancer | hsa05205 | 0.00371 |
|
| Alzheimer's disease | hsa05010 | 0.00412 |
|
DEG, differentially expressed gene; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 4Kaplan-Meier survival curves, which show that expression levels of 3 deregulated genes (A) HSPB1, (B) TPX2 and (C) IFI16 were associated with prognosis in TNBC patients (unadjusted P value <0.05). Y axis is the overall survival rate and X axis is the survival time. TNBC, triple-negative breast cancer.