| Literature DB >> 29430923 |
Walid M Abdelmoula1,2, Nicola Pezzotti3, Thomas Hölt3, Jouke Dijkstra1, Anna Vilanova3, Liam A McDonnell4, Boudewijn P F Lelieveldt1,3.
Abstract
Technological advances in mass spectrometry imaging (MSI) have contributed to growing interest in 3D MSI. However, the large size of 3D MSI data sets has made their efficient analysis and visualization and the identification of informative molecular patterns computationally challenging. Hierarchical stochastic neighbor embedding (HSNE), a nonlinear dimensionality reduction technique that aims at finding hierarchical and multiscale representations of large data sets, is a recent development that enables the analysis of millions of data points, with manageable time and memory complexities. We demonstrate that HSNE can be used to analyze large 3D MSI data sets at full mass spectral and spatial resolution. To benchmark the technique as well as demonstrate its broad applicability, we have analyzed a number of publicly available 3D MSI data sets, recorded from various biological systems and spanning different mass-spectrometry ionization techniques. We demonstrate that HSNE is able to rapidly identify regions of interest within these large high-dimensionality data sets as well as aid the identification of molecular ions that characterize these regions of interest; furthermore, through clearly separating measurement artifacts, the HSNE analysis exhibits a degree of robustness to measurement batch effects, spatially correlated noise, and mass spectral misalignment.Entities:
Keywords: 3D MSI; HSNE; data analysis; nonlinear dimensionality reduction; proteomics; segmentation; t-SNE
Mesh:
Year: 2018 PMID: 29430923 PMCID: PMC5838640 DOI: 10.1021/acs.jproteome.7b00725
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Summary of the 3D MSI Data Sets And Their Computational Processing Time Using HSNE
| data set | preservation | mass range (kDa) | no. tissue sections; (tissue thickness μm) | spatial resolution (μm) | data set
size (no. voxels × no. | HSNE running time (min) |
|---|---|---|---|---|---|---|
| 3D DESI-MSI colorectal carcinoma | fresh frozen | 0.2–1.05 | 26; (10) | 100 | 148 044 × 8073 | ∼10 |
| 3D MALDI-MSI mouse kidney | PAXgene | 2–20 | 73; (3.5) | 50 | 1 362 830 × 7680 | ∼43 |
| 3D MALDI-MSI mouse pancreas | PAXgene | 1.6–15 | 29; (5) | 60 | 497 255 × 13 312 | >25 |
| 3D MALDI-MSI OSCC | fresh frozen | 2–20 | 58; (10) | 60 | 825 558 × 7680 | ∼30 |
| 3D MALDI-MSI atherosclerotic plaques | fresh frozen | <1 | 5; (10) | 100 | 10 185 × 20 | ∼5 |
Figure 1Hierarchical analysis of 3D DESI-MSI of colorectal carcinoma data set using the HSNE reveals structural patterns at different hierarchical scales. The overview embedding represents the coarsest level in which generic dominant structures are revealed, namely: background and foreground tissue. Detailed embedding on the tissue foreground reveals two major structures that represent colorectal cancer and connective tissues. At the finest embedding level, more structures are uncovered within each of the colorectal cancer and muscle tissues. The Pearson correlation distribution between HSNE segmentation maps at Level 2 and all of the spectra is presented for cancer and muscle tissue, showing the most localized m/z feature in both tissue classes.
Figure 2Analysis of 3D MALDI-MSI data of a mouse kidney using the HSNE: (a) HSNE scatter plot showing the spectral similarities as landmarks in a low-dimensional representation and (b) HSNE spatial structures based on the landmarks selection in panel a. The identified four anatomical structures with distinct spectral signatures were merged into a single 3D image (c,d) representing: renal cortex (red), renal medulla (green), renal pelvis (blue), and surrounding of renal pelvis (yellow). The multiorthoslice view in panel d allows in-depth visualization of the identified features.
Figure 3Visualization of the most colocalized 3D m/z features with respect to the associated HSNE spatial segmentation maps of the 3D MALDI-MSI mouse kidney data set.
Figure 4Analysis of 3D MALDI-MSI of mouse pancreas data set using the HSNE reveals structural patterns at different hierarchical scales. The detailed embedding at level 2 reveals three spectrally distinct clusters given in panel b and colored red, green, and blue. The spatial correlation between each of the clusters identified in panel b and the spectral information was computed (c), and the highest localized m/z features were identified (d). The m/z value of 5805.54, which is colocalized with the red cluster given in panel b, was previously identified as insulin.
Figure 5Analysis of 3D MALDI-MSI of human oral squamous cell carcinoma data set using the HSNE reveals structural patterns on different hierarchical scales. The correlation analysis (c) allows us to identify the most colocalized m/z features (d) with the HSNE spatial structures (b).