| Literature DB >> 29931238 |
Jungwoo Lee, Sejoon Oh, Lee Sael.
Abstract
Motivation: Given multi-platform genome data with prior knowledge of functional gene sets, how can we extract interpretable latent relationships between patients and genes? More specifically, how can we devise a tensor factorization method which produces an interpretable gene factor matrix based on functional gene set information while maintaining the decomposition quality and speed?Entities:
Mesh:
Year: 2018 PMID: 29931238 PMCID: PMC6289137 DOI: 10.1093/bioinformatics/bty490
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.An overview of a PanCan12 tensor factorization via GIFT. (A) shows a PanCan12 tensor factorization and (B) shows a factor matrix (FM) computation at time t + 1 constrained on a mask matrix M()
Characteristic comparisons of GIFT with P-Tucker and Silenced-TF
| Method | P-Tuckera | Silenced-TF |
|
|---|---|---|---|
|
| Low |
|
|
|
|
| Low |
|
|
|
|
|
|
Oh .
Summary of dataset
| Dataset | Order | Size | Observable entries |
|---|---|---|---|
| PanCan12 tensor | 3 | (4555 × 14 351 × 5) | 180M |
| Sampled-PanCan12 | 3 | (4555 × 14 351 × 5) | 36–144M |
| Mask matrix | 2 | (14 351 × 50) | 7K |
M, million; K, thousand.
Fig. 2.Row-wise update rule of GIFT
Fig. 3.Distributions of values in a gene factor matrix derived by GIFT (λ = 10) for unmasked (A) and masked (B) entries
Fig. 4.Mask matrix and gene factor matrices (FM) of GIFT, P-Tucker and Silenced-TF. Subset of genes are shown for better visualization
Fig. 5.Top-K ratios based on descending order of absolute factor values
Fig. 6.Performance comparisons of GIFT, Silenced-TF and P-Tucker. (A) is a reconstruction error plot. (B) is a test RMSE plot
Fig. 7.Convergence and scalability of GIFT. (A) Total running time of GIFT with respect to the number of non-zeros. (B) GIFT shows faster convergence than P-Tucker and has higher accuracy than Silenced-TF
Significant relations found on the PanCan12 dataset via GIFT
| Cancer | Gene set | Genes | Evidence |
|---|---|---|---|
| HNSC, LUAD, LUSC, BLCA | TGF beta signaling | SKIL* | Encodes the SNON, negative regulators of |
| FKBP1A* | Interacts with a type I | ||
| LEFTY2* | Encodes a secreted ligand of the | ||
| GBM | Angiogenesis | PF4* | Inhibits cell proliferation and |
| VCAN* | Encodes a protein involving in celladhesion, and | ||
| BRCA | Estrogen response late | IL17RB* | Involved in development and progression of |
| TFF3* | Promotes invasion and migration of | ||
| Bile acid metabolism | APOA1* | ||
| OV, UCEC | Interferon-gamma response | IRF7* | Encodes |
| BST2* | High levels of BST2 have been identified in | ||
| Apoptosis | CASP8AP2+ | Associated with | |
| Encoded protein plays a regulatory role in Fas-mediated | |||
| READ, COAD | Protein secretion | STX7* | Controls vesicle trafficking events involved in |
| KIRC, LAML | Mitotic spindle | LATS1* | Binds phosphorylated zyxin and moves it to the |
Note: GIFT extracts significant gene sets and notable relations between cancer, gene sets and genes. Evidence column lists supporting evidence for either gene to gene set or gene to cancer relations (*: important gene, +: not included in a gene set, but related).