| Literature DB >> 28589855 |
Garam Lee1, Lisa Bang2, So Yeon Kim1, Dokyoon Kim3,4, Kyung-Ah Sohn5.
Abstract
BACKGROUND: Breast cancer is a complex disease in which different genomic patterns exists depending on different subtypes. Recent researches present that multiple subtypes of breast cancer occur at different rates, and play a crucial role in planning treatment. To better understand underlying biological mechanisms on breast cancer subtypes, investigating the specific gene regulatory system via different subtypes is desirable.Entities:
Mesh:
Year: 2017 PMID: 28589855 PMCID: PMC5461552 DOI: 10.1186/s12920-017-0268-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Overview of the proposed framework for identifying subtype-specific association patterns. For target gene estimation, our weighted lasso framework requires a covariate matrix and subtype information on samples. Note that four colors in Breast Cancer Subtype field correspond to subtypes, Luminal A, Luminal B, HER2 positive, and Triple Negative, respectively. With two inputs mixed from a kernel method, target genes in each of the subtypes are estimated based on DNA methylation features
Number of samples per subtype
| HER2 Positive | Luminal A | Luminal B | Triple Negative | Total |
|---|---|---|---|---|
| 16 | 306 | 42 | 73 | 437 |
Fig. 2Density plot for the number of DNA methylation features across all target genes. The number of methylation features ranges from 10 to 1698. For most of target genes, around 200 ~ 300 number of features are used for estimation
Top 10 well-estimated gene list
| Overall | HER2 positive | Luminal A | Luminal B | Triple negative |
|---|---|---|---|---|
| TRA2B | MMP1 | PSMD3 | PSMD3 | ABCC12 |
| HNRNPK | SPDYC | GPD1 | CDC6 | SLC18A2 |
| RAB5B | ERBB2 | ERBB2 | RPL19 | IVD |
| HNRNPK.1 | ELOVL2 | UGT1A6 | ERBB2 | ABCA12 |
| HNRNPK.2 | SERPINA5 | BMPR1B | CACNG6 | DNALI1 |
| SEC11A | NPY1R | CTSE | PCK1 | DEGS2 |
| SF3A1 | SEMA3E | CCL21 | PSMB3 | UCHL1 |
| SRP14 | AKR1B10 | TAT | PIP4K2B | NEIL1 |
| CDC42 | UGT8 | RPL19 | CALML3 | MAGOH |
| NRF1 | EPO | ATP6V0A4 | ABCC12 | HGD |
Genes having the smallest prediction error over all target genes are shown in column Overall, and genes that show prediction improvement when estimated using kernel weighted lasso over the plain lasso are shown for each subtype in the remaining columns
Top 20 well-estimated KEGG pathways
| Overall | |
|---|---|
| SPLICEOSOME | SNARE_INTERACTIONS_IN_VESICULAR_TRANSPORT |
| PROTEIN_EXPORT | VALINE_LEUCINE_AND_ISOLEUCINE_BIOSYNTHESIS |
| AMINOACYL_TRNA_BIOSYNTHESIS | UBIQUITIN_MEDIATED_PROTEOLYSIS |
| NON_HOMOLOGOUS_END_JOINING | DNA_REPLICATION |
| RNA_DEGRADATION | REGULATION_OF_AUTOPHAGY |
| NUCLEOTIDE_EXCISION_REPAIR | RENAL_CELL_CARCINOMA |
| BASAL_TRANSCRIPTION_FACTORS | GLYOXYLATE_AND_DICARBOXYLATE_METABOLISM |
| MISMATCH_REPAIR | OXIDATIVE_PHOSPHORYLATION |
| RNA_POLYMERASE | NOTCH_SIGNALING_PATHWAY |
| PROTEASOME | PARKINSONS_DISEASE |
Top 10 better-estimated KEGG pathways per subtype
| HER2 positive | Triple negative |
|---|---|
| GLYCOLYSIS_GLUCONEOGENESIS | GLYCOLYSIS_GLUCONEOGENESIS |
| CITRATE_CYCLE_TCA_CYCLE | CITRATE_CYCLE_TCA_CYCLE |
| PENTOSE_PHOSPHATE_PATHWAY | PENTOSE_PHOSPHATE_PATHWAY |
| FRUCTOSE_AND_MANNOSE_METABOLISM | PENTOSE_AND_GLUCURONATE_INTERCONVERSIONS |
| GALACTOSE_METABOLISM | FRUCTOSE_AND_MANNOSE_METABOLISM |
| FATTY_ACID_METABOLISM | GALACTOSE_METABOLISM |
| STEROID_BIOSYNTHESIS | ASCORBATE_AND_ALDARATE_METABOLISM |
| PRIMARY_BILE_ACID_BIOSYNTHESIS | FATTY_ACID_METABOLISM |
| OXIDATIVE_PHOSPHORYLATION | STEROID_BIOSYNTHESIS |
| PURINE_METABOLISM | PRIMARY_BILE_ACID_BIOSYNTHESIS |
| Luminal A | Luminal B |
| GLYCOLYSIS_GLUCONEOGENESIS | GLYCOLYSIS_GLUCONEOGENESIS |
| PENTOSE_PHOSPHATE_PATHWAY | CITRATE_CYCLE_TCA_CYCLE |
| PENTOSE_AND_GLUCURONATE_INTERCONVERSIONS | PENTOSE_PHOSPHATE_PATHWAY |
| FRUCTOSE_AND_MANNOSE_METABOLISM | PENTOSE_AND_GLUCURONATE_INTERCONVERSIONS |
| GALACTOSE_METABOLISM | FRUCTOSE_AND_MANNOSE_METABOLISM |
| ASCORBATE_AND_ALDARATE_METABOLISM | GALACTOSE_METABOLISM |
| FATTY_ACID_METABOLISM | ASCORBATE_AND_ALDARATE_METABOLISM |
| STEROID_BIOSYNTHESIS | FATTY_ACID_METABOLISM |
| PRIMARY_BILE_ACID_BIOSYNTHESIS | STEROID_BIOSYNTHESIS |
| STEROID_HORMONE_BIOSYNTHESIS | PRIMARY_BILE_ACID_BIOSYNTHESIS |
Fig. 3Subtype-specific Root Mean Squared Error from 5-fold cross validation. Each bar represents the average prediction error obtained from the proposed method (red), separate estimation that uses only the corresponding subtype data (yellow), and a single common estimation ignoring the subtype information (gray). Our proposed method shows significantly improved performance over the separate estimation approach, and slightly better or comparable performance over single common estimation
Fig. 4Subtype-specific association networks between DNA methylation and gene expression, and Venn diagram for the number of edges in the network. An edge in a subtype-specific association network is drawn if methylation node A and gene expression node B have non-zero a regression coefficient resulted from kernel weighted lasso. The edges are colored based on their subtype-specific association. Venn diagram represents the number of edges occurring in each association network where intersection region stands for the number of edges appearing in more than two networks
Top 5 hub methylation features in subtype-specific association network and their degrees
| Gene Name | Total | HER2 positive | Luminal A | Luminal B | Triple negative | Literature |
|---|---|---|---|---|---|---|
| LEP | 9 | 2 | 1 | 6 | 0 | [ |
| RET | 6 | 2 | 1 | 3 | 0 | [ |
| FGFR3 | 6 | 1 | 0 | 5 | 0 | [ |
| PLA2G2A | 6 | 1 | 3 | 2 | 0 | |
| ADCY5 | 6 | 2 | 0 | 4 | 0 |
For each methylation node, the total number of connected edges that are present over four subtype-specific association networks is shown in the column Total. Remaining columns represent the degrees in the corresponding subtype-specific association network