| Literature DB >> 34238766 |
Yong Jin Heo1,2, Chanwoong Hwa1, Gang-Hee Lee1, Jae-Min Park1, Joon-Yong An1,2.
Abstract
Multi-omics approaches are novel frameworks that integrate multiple omics datasets generated from the same patients to better understand the molecular and clinical features of cancers. A wide range of emerging omics and multi-view clustering algorithms now provide unprecedented opportunities to further classify cancers into subtypes, improve the survival prediction and therapeutic outcome of these subtypes, and understand key pathophysiological processes through different molecular layers. In this review, we overview the concept and rationale of multi-omics approaches in cancer research. We also introduce recent advances in the development of multi-omics algorithms and integration methods for multiple-layered datasets from cancer patients. Finally, we summarize the latest findings from large-scale multi-omics studies of various cancers and their implications for patient subtyping and drug development.Entities:
Keywords: cancer research; genomics; multi-omics approach; proteogenomics; proteomics; systems biology
Mesh:
Year: 2021 PMID: 34238766 PMCID: PMC8334347 DOI: 10.14348/molcells.2021.0042
Source DB: PubMed Journal: Mol Cells ISSN: 1016-8478 Impact factor: 5.034
Fig. 1Overview of multi-omics approaches in cancer research.
The integration of omics datasets is a crucial step in multi-omics studies. Datasets such as somatic mutations, CNV, gene expression, methylation, and proteome datasets are merged using various computational frameworks with distinct methods. The integration enables the comparison of molecular features across multiple viewpoints and the clustering of patients with relevant clinical features. Possible outcomes include enhanced identification of clinical subtypes, understanding of cancer pathophysiology, prediction of potential drug targets, and clinical decision support.
List of computational frameworks for multi-omics cancer studies
| Study | Findings | Dataset | Principles |
|---|---|---|---|
| iCluster ( | Novel subgroups from 2,000 breast tumors | mRNA expression | Joint latent variable model-based clustering method |
| iOmicsPASS ( | Novel transcriptional regulatory network from TCGA/CPTAC breast cancer data | mRNA expression | Network construction using a modified nearest shrunken centroid algorithm |
| SALMON ( | Improved survival analysis | Mutation | Deep learning based on co-expression modules |
| SNF ( | Subtype classification of clinical relevance | mRNA | Patient similarity networks using an iterative procedure based on message passing |
| NEMO ( | Novel subtypes from even partial AML datasets | mRNA | Sample clustering from partial datasets using an adjusted Rand index |
| MONET ( | Module detection of patient subtypes and improved survival analysis | mRNA | Detect similar modules commonly present across multi-omics datasets |
| PARADIGM ( | Detection of pathways affected by cancer with fewer false positives | mRNA expression | Pathway recognition algorithm applied to multi-omics datasets |
| LRAcluster ( | Subtype detection in both pan-cancer analysis and single cancer types | Mutation | Performance of low-rank approximation from probabilistic models |
| BCC ( | Detection of patient subtypes in response to survival rates and driver mutation signatures | mRNA | Bayesian framework for estimation of an integrative clustering model |
Gene expression data with normalization (e.g., quantile normalization, fragment per kilobase of transcript per million mapped reads [FPKM]).
Quantification of miRNA expression.
Circular binary segmentation-based copy number segmented means.
Affymetrix 6.0 SNP arrays.
Protein quantification by iTRAQ (isobaric Tags for Relative and Absolute Quantification) protein quantification.
Reverse phase protein array (RPPA).
Illumina Human Methylation arrays.
In the SALMON method, the copy number burden (CNB) is calculated using the total gene length (Kb) from SNP 6 data, and the tumor mutation burden (TMB) is calculated using the total number of mutated genes reported in Mutation Annotation Format (MAF) files.
The LRAcluster method uses somatic mutation data converted into a binary form.
Fig. 2Latest findings in cancer multi-omics research.
Multi-omics approaches integrate various high-throughput sequencing datasets across a range of molecular layers. Biological features are subject to multi-view clustering methods and account for distinct subtypes of cancer patients based on relevant clinical features.