| Literature DB >> 34408773 |
Nasim Vahabi1, Caitrin W McDonough2, Ankit A Desai3, Larisa H Cavallari2, Julio D Duarte2, George Michailidis1.
Abstract
BACKGROUND: The development of high-throughput techniques has enabled profiling a large number of biomolecules across a number of molecular compartments. The challenge then becomes to integrate such multimodal Omics data to gain insights into biological processes and disease onset and progression mechanisms. Further, given the high dimensionality of such data, incorporating prior biological information on interactions between molecular compartments when developing statistical models for data integration is beneficial, especially in settings involving a small number of samples.Entities:
Keywords: cis-regulatory quantitative; multi-block PLS; multi-omics; supervised Integration; survival analysis
Year: 2021 PMID: 34408773 PMCID: PMC8366414 DOI: 10.3389/fgene.2021.701405
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Illustration of the multi-blocks data structure and the supervised Cox-sMBPLS algorithm. (A) three blocks of multi-Omics profiles: mRNA expression, genotypes, and DNA methylation. The response variable (y) is a vector of survival times of size n = 1. (B) splitting the Omics blocks based on the cis-regulatory quantitative effects (eQTL, eQTM, meQTL). (C) updating Omics blocks with QT-residuals, and y with reweighted times. (D) Multi-Omics modules which are combinations of all three Omics profiles.
FIGURE 3Post-hoc analyzes for multi-mics modules 10 and 13 resulted from the Cox-sMBPLS model. (A) gene-set network analysis. (B) disease ontology for PDPK1, TAB2, PRICKLE3, and HRC genes which are presented in both modules. (C) disease enrichment analysis. (D) chromosomal map. Dashed boxes show the Omics profiles, which are located in a small window on the same chromosome.
Simulation settings based on the different number of latent components (k), censoring rate (δ) and p number of features in block b.
| Scenario # | Censoring | Dimensionality | Number of Components |
| 1** | δ = 10% (Low) | ||
| 2 | |||
| 3 | |||
| 4 | δ = 40% (Moderate) | ||
| 5 | |||
| 6 | |||
| 7 | δ = 60% (High) | ||
| 8 | |||
| 9 |
Simulation results for the scenarios with a low level of dimensionality (total of 2230 features, and n = 91).
| Censoring % | Measure | Number of components | |||||||||
| k = 2 | k = 5 | ||||||||||
| Cox-sMBPLS | El-net Cox | RSF | Block forest | MCIA | Cox-sMBPLS | El-net Cox | RSF | Block forest | MCIA | ||
| 10% (Low) | C-index | 0.49 (0.08) | 0.50 (0.10) | 0.49 (0.09) | 0.51 (0.09) | 0.51 (0.07) | 0.53 (0.10) | 0.51 (0.10) | 0.51 (0.09) | ||
| C/D AUC* | 0.38 (0.30) | 0.46 (0.06) | 0.87 (0.13) | 0.91 (0.12) | 0.36 (0.29) | 0.46 (0.09) | 0.87 (0.12) | 0.91 (0.12) | |||
| I/D AUC** | 0.57 (0.06) | 0.57 (0.07) | 0.57 (0.07) | 0.57 (0.07) | 0.57 (0.08) | 0.57 (0.07) | 0.57 (0.07) | ||||
| Uno’s AUC*** | 0.46 (0.18) | 0.48 (0.22) | 0.46 (0.23) | 0.49 (0.23) | 0.44 (0.18) | 0.46 (0.23) | 0.46 (0.23) | 0.45 (0.23) | |||
| 40% (Moderate) | C-index | 0.49 (0.09) | 0.51 (0.10) | 0.49 (0.10) | 0.51 (0.11) | 0.49 (0.09) | 0.51 (0.10) | 0.49 (0.11) | 0.51 (0.11) | ||
| C/D AUC | 0.35 (0.27) | 0.39 (0.19) | 0.84 (0.19) | 0.90 (0.20) | 0.36 (0.27) | 0.37 (0.19) | 0.84 (0.19) | 0.90 (0.20) | |||
| I/D AUC | 0.57 (0.07) | 0.57 (0.08) | 0.56 (0.07) | 0.57 (0.07) | 0.58 (0.07) | 0.58 (0.08) | 0.57 (0.07) | 0.57 (0.07) | |||
| Uno’s AUC | 0.44 (0.22) | 0.45 (0.23) | 0.44 (0.25) | 0.42 (0.24) | 0.45 (0.22) | 046 (0.23) | 0.45 (0.25) | 0.42 (0.24) | |||
| 60% (High) | C-index | 0.50 (0.10) | 0.51 (0.13) | 0.49 (0.14) | 0.48 (0.13) | 0.50 (0.10) | 0.50 (0.12) | 0.50 (0.16) | 0.48 (0.12) | ||
| C/D AUC | 0.30 (0.27) | 0.35 (0.21) | 0.81 (0.20) | 0.29 (0.26) | 0.34 (0.21) | 0.82 (0.20) | |||||
| I/D AUC | 0.58 (0.07) | 0.59 (0.06) | 0.57 (0.09) | 0.58 (0.07) | 0.60 (0.08) | 0.59 (0.07) | 0.59 (0.06) | 0.58 (0.07) | |||
| Uno’s AUC | 0.45 (0.24) | 0.47 (0.27) | 0.40 (0.28) | 0.41 (0.29) | 0.45 (0.24) | 0.46 (0.27) | 0.40 (0.30) | 0.41 (0.29) | |||
FIGURE 2Simulations results for scenarios with a low level of dimensionality. Boxplots for (A) Harrell’s C-index, (B) I/D AUC, (C) C/D AUC, and (D) Uno’s AUC values. Results are shown for different censoring rates (δ = 10,40, 60%) and number of components (k = 2, 5, 10).