| Literature DB >> 30580496 |
Benjamin Balluff1, Achim Buck2, Marta Martin-Lorenzo1, Frédéric Dewez1, Rupert Langer3, Liam A McDonnell4, Axel Walch2, Ron M A Heeren1.
Abstract
SCOPE: In biomedical research, mass spectrometry imaging (MSI) can obtain spatially-resolved molecular information from tissue sections. Especially matrix-assisted laser desorption/ionization (MALDI) MSI offers, depending on the type of matrix, the detection of a broad variety of molecules ranging from metabolites to proteins, thereby facilitating the collection of multilevel molecular data. Lately, integrative clustering techniques have been developed that make use of the complementary information of multilevel molecular data in order to better stratify patient cohorts, but which have not yet been applied in the field of MSI.Entities:
Keywords: cancer; integrative clustering; mass spectrometry imaging; prognosis
Mesh:
Year: 2019 PMID: 30580496 PMCID: PMC6590511 DOI: 10.1002/prca.201800137
Source DB: PubMed Journal: Proteomics Clin Appl ISSN: 1862-8346 Impact factor: 3.494
Figure 1Integrative clustering by Similarity Network Fusion (SNF). Clustering by SNF was performed on the individual A) metabolite and B) peptide datasets. Differences in survival of the resulting patient groups were statistically evaluated by a Kaplan–Meier/log‐rank analysis. This was repeated for the C) concatenated metabolite/peptide data and D) the SNF‐fused data. Only the latter one resulted in the detection of patient groups with statistically significant differences in survival (p = 0.025). SNF also allows ranking features according to their relevance for the clustering. The top 10% E) most relevant metabolite and F) peptide signals are shown as heat maps where patients are in the rows and the features in the columns.
Figure 2Integrative clustering by moCluster. A) First, four latent variables were found to represent sufficiently the concordant structures in the data based on their relative contribution of variance. The consensus‐PCA was then run with those four latent variables. B) The optimum number of clusters was found to be five where the Gap statistic shows the biggest positive change. C) The consensus‐PCA scores were submitted to hierarchical clustering and the resulting tree was cut at the height (≈30) where five patient groups are obtained. D) These patient groups differed statistically significant in their survival (p = 0.027) as calculated by Kaplan–Meier/log‐rank analysis. moCluster also provides a ranking of the features according to their loading. The top 10% E) most relevant metabolite and F) peptide signals are shown as heatmaps where patients are in the rows and the features in the columns.
Clustering and survival analysis results
| Clustering method | Dataset | Optimal number of clusters | Log rank |
|---|---|---|---|
| SNF | Metabolites | 3 | 0.259 |
| Peptides | 2 | 0.512 | |
| Concatenated | 2 | 0.100 | |
| Fused | 2 | 0.025a) | |
| iCluster | Fused | 2 | 0.134 |
| moCluster | Metabolites | 5 | 0.152 |
| Peptides | 5 | 0.249 | |
| Concatenated | 4 | 0.100 | |
| Fused | 5 | 0.027 |
Considered statistically significant (p ≤ 0.05)