| Literature DB >> 33968140 |
Bingbo Wang1, Xiujuan Ma1, Minghui Xie1, Yue Wu1, Yajun Wang2, Ran Duan1, Chenxing Zhang1, Liang Yu1, Xingli Guo1, Lin Gao1.
Abstract
Multi-omics molecules regulate complex biological processes (CBPs), which reflect the activities of various molecules in living organisms. Meanwhile, the applications to represent disease subtypes and cell types have created an urgent need for sample grouping and associated CBP-inferring tools. In this paper, we present CBP-JMF, a practical tool primarily for discovering CBPs, which underlie sample groups as disease subtypes in applications. Differently from existing methods, CBP-JMF is based on a joint non-negative matrix tri-factorization framework and is implemented in Python. As a pragmatic application, we apply CBP-JMF to identify CBPs for four subtypes of breast cancer. The result shows significant overlapping between genes extracted from CBPs and known subtype pathways. We verify the effectiveness of our tool in detecting CBPs that interpret subtypes of disease.Entities:
Keywords: complex biological processes; disease; multi-dimensional genomic data; non-negative matrix factorization; subtype
Year: 2021 PMID: 33968140 PMCID: PMC8103031 DOI: 10.3389/fgene.2021.665416
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Illustration of the framework and optimization objective function of complex biological processes–joint matrix tri-factorization.
Figure 2Complex biological processes of luminal B and basal-like subtype. We mapped the genes and miRNAs obtained from luminal B's module and basal-like's module to an integrated gene regulation network. The network was obtained through integrating three databases including Reactom, Kyoto Encyclopedia of Genes and Genomes, and Nci-PID Pathway Interaction Database. The interactions between genes and miRNAs were obtained from miRTarBase. The size of the node is proportional to the size of the degree. The thickness of the edges indicates the strength of the regulatory relationship expressed by the Pearson correlation coefficient between microRNA and gene.
Enrichment analysis of the extracted module gene across six datasets.
| Total | 51 | 43 | 947 | 516 | 61 | 102 |
| Overlapped nodes | 2 | 5 | 13 | 6 | 3 | 6 |
| 0.049 | 0.0003 | 0.007 | 0.008 | 0.010 | 0.012 |
Figure 3Part of complex biological processes luminal B and basal-like. The edges with checkmarks are the interactions that have been documented. (A) Luminal B's biological processes: luminal subtypes are driven by the estrogen/ER pathway. Among all nodes, ERBB2, ERBB3, and ESR1 are involved in the estrogen/ER pathway. (B) Basal-like's biological processes: basal-like subtype is driven by the deregulation of various signaling pathways (Notch, MAPK, FoxO signaling pathway, and Wnt/beta-catenin). Among all nodes, MAPKAPK2, CDC25B, CCNB1, CCNB2, PAK1, and STMN1 are known to exist in multiple signaling pathways.
Evidences of luminal B's complex biological processes.
| miR-34a->ERBB2 | Wang et al., | MiR-34a modulates ErbB2 in breast cancer |
| ERBB2->VAV2 | Wang et al., | ErbB2 colocalizes with Vav2 |
| VAV2->RAC3 | Rosenberg et al., | Vav2 promotes Rac3 activation at invadopodia |
| miR-200b->JUN | Jin et al., | MiR-200b upregulates JUN in breast cancer |
| JUN->CCND1 | Cicatiello et al., | CCND1 promoter activation by estrogens in human breast cancer cells is mediated by the recruitment of a c-Jun/c-Fos/estrogen receptor |
| JUN->ESR1 | Stossi et al., | The activation of ESR1 gene locus in a process that was dependent upon activation and recruitment of the c-Jun transcription factor |
| miR-26a->ESR1 | Howard and Yang, | MiR-26a modulates ESR1 in breast cancer |
| ESR1->VAV2 | Grassilli et al., | ESR1 upregulates VAV2 in breast cancer cell lines |
Evidences of basal-like's complex biological processes.
| CCNB1(CCNB2)->PLK1->CDK1 | Li et al., | CCNB1 (CCNB2), PLK1, and CDK1 have interactions in chicken breast muscle |
| miR221->FOS | Yao et al., | miR221 modulates FOS |
| miR221->PAK1 | Ergun et al., | miR221 modulates PAK1 in breast cancer cell lines |
| PAK1->PLK1 | Maroto et al., | PAK1 regulates PLK1 |
| MAPKAPK2->CDC25B | MAPK signaling pathway | MAPKAPK2 and CDC25B are involved in MAPK signaling pathway |
| CDC25B->CDK1 | Timofeev et al., | Timely assembly of CDK1 required CDC25B |
Figure 4Kaplan–Meier (K–M) survival analysis for patients which are classified using different methods. (A) KM survival curve for labeled samples, whose subtypes are known in advance. (B) KM survival analysis for unlabeled samples, which are classified using complex biological processes–joint matrix tri-factorization (CBP-JMF) on mRNA expression and miRNA expression data. (C) KM survival analysis for unlabeled patients, which are classified using CBP-JMF only on mRNA expression. (D) KM survival analysis for unlabeled patients, which are classified on mRNA expression and miRNA expression data without graph embedding regularization.
The CBP-JMF algorithm.
| 1: |
| 2: Initialize |
| 3: Initialize |
| 4: |
| 5: |
| 6: Fix V, update |
| 7: |
| 8: Fix |
| 9: Fix |
| 10: |
| 11: Fix |
| 12: |
| 13: |
| 14: |
| 15: |