| Literature DB >> 33431000 |
Gerard Baquer1, Lluc Sementé1, María García-Altares2,3, Young Jin Lee4, Pierre Chaurand5, Xavier Correig1,6,7, Pere Ràfols1,6,7.
Abstract
Mass spectrometry imaging (MSI) has become a mature, widespread analytical technique to perform non-targeted spatial metabolomics. However, the compounds used to promote desorption and ionization of the analyte during acquisition cause spectral interferences in the low mass range that hinder downstream data processing in metabolomics applications. Thus, it is advisable to annotate and remove matrix-related peaks to reduce the number of redundant and non-biologically-relevant variables in the dataset. We have developed rMSIcleanup, an open-source R package to annotate and remove signals from the matrix, according to the matrix chemical composition and the spatial distribution of its ions. To validate the annotation method, rMSIcleanup was challenged with several images acquired using silver-assisted laser desorption ionization MSI (AgLDI MSI). The algorithm was able to correctly classify m/z signals related to silver clusters. Visual exploration of the data using Principal Component Analysis (PCA) demonstrated that annotation and removal of matrix-related signals improved spectral data post-processing. The results highlight the need for including matrix-related peak annotation tools such as rMSIcleanup in MSI workflows.Entities:
Keywords: Mass spectrometry imaging; Matrix annotation; Overlapping-signal detection; Silver-assisted laser/desorption ionization; Spatial metabolomics; Spectral processing
Year: 2020 PMID: 33431000 PMCID: PMC7374922 DOI: 10.1186/s13321-020-00449-0
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
List of the 14 AgLDI MSI datasets used for validation
| No | Species | Tissue type | Ag deposition system and estimated layer thickness | Lateral res. | m/z range | Mass spectrometer | Acq. mode | Refs. |
|---|---|---|---|---|---|---|---|---|
| 1 | Mouse | Pancreas | ATC Orion 8-HV Sputtering system, 5 nm | 30 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 2 | Mouse | Pancreas | ATC Orion 8-HV Sputtering system, 5 nm | 30 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 3 | Mouse | Kidney | ATC Orion 8-HV Sputtering system, 5 nm | 100 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 4 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 5 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 6 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 70–1200 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 7 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 80–1000 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 8 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 80–1000 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 9 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 80–1000 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 10 | Mouse | Brain | ATC Orion 8-HV Sputtering system, 5 nm | 80 | 80–1000 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | – |
| 11 | Mouse | Brain | Cressington Sputter Coater, 23 ± 2 nm | 75 | 100–1100 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | [ |
| 12 | Homo sapiens | Fingermark | Cressington Sputter Coater, 14 ± 2 nm | 75 | 100–1100 | Bruker ultrafleXtreme™ MALDI-TOF/TOF | Positive/profile | [ |
| 13 | B73 inbred corn | Root | Cressington 108Auto, 5 s | 10 | 50–970 | Thermo Finnigan™ MALDI-LTQ-Orbitrap Discovery | Positive/centroid | [ |
| 14 | B73 inbred corn | Root | Cressington 108Auto, 5 s | 10 | 50–900 | Thermo Finnigan™ MALDI-LTQ-Orbitrap Discovery | Negative/centroid | [ |
Sample type, sample preparation and LDI-MSI acquisition parameters. Datasets from 1 to 10 were acquired in-house. Datasets 11–14 were provided by external laboratories
“Validation list” used for validation
| Chemical formula | Validation list | Type | Monoisotopic mass (n = 1) | PubChem CID | Refs |
|---|---|---|---|---|---|
| Positive class | Silver cluster | 106.9051 | 104755 | [ | |
| Negative class | Neutral salt | 125.903 | 62656 | [ | |
| 141.8734 | 24561 | ||||
| 185.8229 | 66199 | ||||
| 233.809 | 24563 | ||||
| Synthetic compound | 107.9124 | 139654 | |||
| 108.9202 | 92028350 | ||||
| 110.9072 | 71348557 | ||||
| 168.8924 | 24470 | ||||
| 570.9807 | 71351869 | ||||
| 144.9014 | 82221 | ||||
| 192.9111 | 159722 | ||||
| Plants, wax, insects’ pheromones | 487.3428 | – | [ | ||
| 515.3741 | – | ||||
| 543.4054 | – | ||||
| Plant wax | 489.322 | – | |||
| 517.3533 | – | ||||
| 545.3846 | – | ||||
| Wax | 503.3013 | – | |||
| 559.3639 | – |
The “positive class” consists of silver clusters. The “negative class” consists of neutral silver salts, synthetic silver compounds and silver adducts that are not expected to be found in animal samples. The index n denotes the number of atoms or molecules inside the cluster. The minimum and maximum value of n depend on the monoisotopic mass of the atom or molecule and the mass range of the dataset
Fig. 1Similarity scores performance a Spectral similarity S1 vs. Intra-cluster morphological similarity S2 scatter plot. Each point represents a potential cluster classified by the algorithm. All clusters shown in Table 2 are evaluated for all 14 datasets presented in Table 1. Blue points represent the “positive class” (should be present in the sample) while the red points correspond to the negative class (should not be present in the sample). Most “positive class” points are located in the top right corner well separated from the negative class points. This indicates proper classification power. b Precision and recall (PR) curve computed according to Davis et al. 2006 [30]. c Similarity score S1·S2 vs. Cluster number. Clusters are arranged in decreasing order of mean similarity score. A clear gap between an S of 0.5 and 0.7 separates the “positive class” from the negative class. Refer to Additional file 1: Table S1 for a mapping of cluster numbers to cluster chemical formula
Fig. 2Overlapping detection algorithm performance when searching for the cluster in Dataset 1. a Comparison between the mean experimental spectra and the theoretical cluster at the cluster masses within a tolerance of 4 scans. Red and blue represent theoretical and experimental profiles, respectively. As can be seen, while the peaks in the centre of the cluster perfectly match the theoretical ratios, the peaks on the edges differ considerably. b Spatial distributions of the experimental cluster peaks. After performing the overlapping detection only the four ion images in the centre in green are classified as Ag-related. The remaining ion images in red are classified as suffering from overlapping. The morphologies of the overlapped ions (red) differ from the ones without overlapping (green) due to ion overlapping. c Correlation matrix between the experimental ion images of the cluster. The ion image number corresponds to the position of the ion in the isotopic pattern in ascending order of m/z. The first two images are clearly not correlated with the remaining images of the cluster. The last image also shows a considerably lower correlation. d Zoom-in of experimental mean spectra. Peaks m/z 641.43 and m/z 643.43 show clear overlapping
Fig. 3Exploratory analysis with PCA before and after removing matrix-related peaks. Red, green and blue are used to represent the spatial distribution of PC1, PC2 and PC3, respectively. The last column uses the Red Green Blue colour model (RGB) to represent the first three principal components in a single image. The annotation and removal of the matrix-related peaks lead to a generalized improvement in the contrast of morphological structures in all principal components. a Pancreas tissue from Dataset 2. b Brain tissue from Dataset 11 [18]