| Literature DB >> 29137596 |
Liying Yang1, Yunyan Shen2, Xiguo Yuan2, Junying Zhang2, Jianhua Wei3.
Abstract
BACKGROUND: Gene expression profiling has led to the definition of breast cancer molecular subtypes: Basal-like, HER2-enriched, LuminalA, LuminalB and Normal-like. Different subtypes exhibit diverse responses to treatment. In the past years, several traditional clustering algorithms have been applied to analyze gene expression profiling. However, accurate identification of breast cancer subtypes, especially within highly variable LuminalA subtype, remains a challenge. Furthermore, the relationship between DNA methylation and expression level in different breast cancer subtypes is not clear.Entities:
Keywords: Biclustering; Breast cancer; Classification; Gene expression profiles; Methylation; Subtype
Mesh:
Year: 2017 PMID: 29137596 PMCID: PMC5686903 DOI: 10.1186/s12859-017-1926-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
AP-ISA biclusters composition comparing to PAM-50 labels
| Basal-like | HER2+ | LuminalA | LuminalB | Normal-like | Totalnum | |
|---|---|---|---|---|---|---|
| Bicluster1 | 0 | 0 | 6 | 0 | 24 | 30 |
| Bicluster2 | 5 | 4 | 9 | 1 | 3 | 22 |
| Bicluster3 | 0 | 42 | 5 | 8 | 0 | 55 |
| Bicluster4 | 90 | 1 | 0 | 0 | 0 | 91 |
| Bicluster5 | 0 | 0 | 33 | 25 | 1 | 59 |
| Bicluster6 | 0 | 0 | 31 | 17 | 0 | 49 |
| Bicluster7 | 0 | 3 | 22 | 33 | 1 | 59 |
| Bicluster8 | 37 | 16 | 22 | 14 | 3 | 92 |
| Bicluster9 | 0 | 0 | 97 | 19 | 0 | 117 |
Sample number of ER and PR status in biclusters from AP-ISA
| Class type | ER+ | ER- | PR+ | PR- |
|---|---|---|---|---|
| Bicluster 1 | 27 | 3 | 23 | 6 |
| Bicluster 2 | 15 | 6 | 14 | 7 |
| Bicluster 3 | 33 | 19 | 23 | 31 |
| Bicluster 4 | 11 | 75 | 6 | 79 |
| Bicluster 5 | 57 | 1 | 47 | 10 |
| Bicluster 6 | 49 | 0 | 45 | 4 |
| Bicluster 7 | 57 | 0 | 50 | 7 |
| Bicluster 8 | 50 | 40 | 43 | 46 |
| Bicluster 9 | 112 | 2 | 108 | 6 |
Fig. 1Gene comparsion between biclsutering and GeneCards database. Left side represents the number of genes in GeneCards, right side represents the result of biclsutering in our study, while the middle column stand for intersection number. Four Luminal subgroups in our study all intersect with Luminal in GeneCards
Intersection genes between AP-ISA biclusters and GeneCards database
| Subtype | Intersection gene number | Genes |
|---|---|---|
| HER2+ | 8 | GRB7;ERBB2;CASP3;SDC1;STARD3;ABCC3; |
| Basal-like | 8 | GABRP;MSH2;CDKN2A;EN1;YBX1;VGLL1; |
| Luminal-5 | 6 | BCL2;GATA3;RERG;ESR1; BAG1;CCND1 |
| Luminal-6 | 5 | BCL2;ESR1;DACH1;BAG1;XBP1 |
| Luminal-7 | 4 | SLC9A3R1;KRT19;CANX;YWHAZ |
| Luminal-9 | 11 | PGR;EPHX2;BCL2;CYB5A;MUC1;RAB31;MYB; |
Significant genes in AP-ISA biclusters and the most distinct gene enrichment pathways by Gene Ontology and KEGG
| Class type | Term (Enrichiment type) |
| significant genes |
|---|---|---|---|
| Normal-like | regulation of cell proliferation (Gene Ontology) | 4.38E-10 | CDKN1C;TXNIP;DPT;EDNRB; KL; |
| regulation of multicellular organismal process (Gene Ontology) | 1.03E-09 | ||
| cell differentiation (Gene Ontology) | 1.07E-08 | ||
| PPAR signaling pathway (KEGG Pathways) | 4.58E-04 | ||
| HER2+ | single-organism process | 1.29E-04 | ERBB2;FGFR4;GRB7;GSK3B; FA2H; |
| epidermal growth factor receptor signaling pathway (Gene Ontology) | 6.944E-03 | ||
| ERBB signaling pathway (Gene Ontology) | 7.514E-03 | ||
| Basal-like | cell cycle process | 1.09E-05 | CDK6;CDKN2A;MSH2;FZD9; FABP7; |
| lymphocyte differentiation (Gene Ontology) | 1.019E-03 | ||
| B cell activation (Gene Ontology) | 8.673E-03 | ||
| p53 signaling pathway (KEGG Pathways) | 3.15E-04 | ||
| Pathways in cancer (KEGG Pathways) | 4.489E-03 | ||
| Luminal-5 | mammary gland epithelium development | 1.15E-05 | CCND1;ESR1;GATA3;TBX3; BTF3; |
| Wnt signaling pathway (Gene Ontology) | 7.896E-03 | ||
| CD8-positive, alpha-beta T cell lineage commitmen (Gene Ontology) | 4.294E-03 | ||
| Luminal-6 | Glutamate receptor signaling pathway (Gene Ontology) | 2.316E-03 | BCL2;WNT3;ESR1;SERP1;PIGT; TLE3;STC1;ARNT2;PKIB;ZFX; HAGH; |
| CD8-positive, alpha-beta T cell lineage commitment (Gene Ontology) | 3.87E-03 | ||
| response to insulin-like growth factor stimulus (Gene Ontology) | 7.726E-03 | ||
| Retinol metabolism (KEGG Pathways) | 4.07E-03 | ||
| Luminal-9 | CD8-positive, alpha-beta T cell lineage commitment (Gene Ontology) | 4.717E-03 | XBP1;BCL2;C3orf18;CIRBP;GAD1; |
| response to insulin-like growth factor stimulus (Gene Ontology) | 9.412E-03 | ||
| beta-Alanine metabolism (KEGG Pathways) | 5.476E-03 |
Fig. 2Over-expressed genes of Basal-like samples in p53 signaling pathway. Some over-expressed genes in Basal-like were found to be significantly enriched for the pathway genes (p = 3.15E-05). Pathway and graphics were taken from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database
Fig. 3Methylation analysis in six methylation areas exhibits differential methylation level among biclusters. The blue lines, red lines, black and gray lines respectively display TSS200, TSS1500, 5’UTR and 1st Exon area which represent promoter region. The green lines represent genebody and the pink lines 3’UTR. Horizontal axis indicates samples in AP-ISA biclusters. Values of vertical axis were calculated by averaging the methylaiton values in the same sample
Fig. 4Bicluster size
Comparison of total number of biclusters, effective number of biclusters and the ratio of the effective number to the total number of biclusters
| Method | Total number of biclsuters | Eff. number of biclusters | Ratio |
|---|---|---|---|
| AP-ISA | 9 | 6.743 | 0.749 |
| ISA | 12 | 8.489 | 0.707 |
| LAS | 10 | 4.799 | 0.479 |
| CC | 10 | 10 | 1 |
| Sparse BC | 70 | 70 | 1 |
| SSVD | 10 | 1.57 | 0.157 |
Biclusters in each method that match with PAM50
| PAM50 | AP-ISA | ISA | LAS | CC | SparseBC | SSVD |
|---|---|---|---|---|---|---|
| Basal-like | 4 | 5 | 1, 10 | 5, 7 | 3 | – |
| ERBB2+ | 3 | 6 | 8 | – | – | – |
| LuminalA | 5, 6, 9 | 1, 4 | 2, 3, 7 | 1 | 4, 6 | – |
| LuminalB | 5, 7 | – | 3 | – | 1, 4 | – |
| Normal-like | 1 | 2 | 2, 6 | 1 | 7 | – |