Walid M Abdelmoula1,2, Sylwia A Stopka1,3, Elizabeth C Randall3, Michael Regan1, Jeffrey N Agar4, Jann N Sarkaria5, William M Wells3,6, Tina Kapur3, Nathalie Y R Agar1,3,7. 1. Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA. 2. Invicro LLC, Boston, MA, 02210, USA. 3. Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA. 4. Department of Chemistry and Chemical Biology, Northeastern University, 412 TF (140 The Fenway), Boston, MA, 02111, USA. 5. Department of Radiation Oncology, Mayo Clinic, 200 First St SW, Rochester, MN, 55902, USA. 6. Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA, 02139, USA. 7. Department of Cancer Biology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02115, USA.
Abstract
MOTIVATION: Mass spectrometry imaging (MSI) provides rich biochemical information in a label-free manner and therefore holds promise to substantially impact current practice in disease diagnosis. However, the complex nature of MSI data poses computational challenges in its analysis. The complexity of the data arises from its large size, high dimensionality, and spectral non-linearity. Preprocessing, including peak picking, has been used to reduce raw data complexity, however peak picking is sensitive to parameter selection that, perhaps prematurely, shapes the downstream analysis for tissue classification and ensuing biological interpretation. RESULTS: We propose a deep learning model, massNet, that provides the desired qualities of scalability, non-linearity, and speed in MSI data analysis. This deep learning model was used, without prior preprocessing and peak picking, to classify MSI data from a mouse brain harboring a patient-derived tumor. The massNet architecture established automatically learning of predictive features, and automated methods were incorporated to identify peaks with potential for tumor delineation. The model's performance was assessed using cross-validation, and the results demonstrate higher accuracy and a substantial gain in speed compared to the established classical machine learning method, support vector machine. AVAILABILITY AND IMPLEMENTATION: https://github.com/wabdelmoula/massNet. AVAILABILITY OF DATA: The data underlying this article are available in the NIH Common Fund's National Metabolomics Data Repository (NMDR) Metabolomics Workbench under project id (PR001292) with http://dx.doi.org/10.21228/M8Q70T. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Mass spectrometry imaging (MSI) provides rich biochemical information in a label-free manner and therefore holds promise to substantially impact current practice in disease diagnosis. However, the complex nature of MSI data poses computational challenges in its analysis. The complexity of the data arises from its large size, high dimensionality, and spectral non-linearity. Preprocessing, including peak picking, has been used to reduce raw data complexity, however peak picking is sensitive to parameter selection that, perhaps prematurely, shapes the downstream analysis for tissue classification and ensuing biological interpretation. RESULTS: We propose a deep learning model, massNet, that provides the desired qualities of scalability, non-linearity, and speed in MSI data analysis. This deep learning model was used, without prior preprocessing and peak picking, to classify MSI data from a mouse brain harboring a patient-derived tumor. The massNet architecture established automatically learning of predictive features, and automated methods were incorporated to identify peaks with potential for tumor delineation. The model's performance was assessed using cross-validation, and the results demonstrate higher accuracy and a substantial gain in speed compared to the established classical machine learning method, support vector machine. AVAILABILITY AND IMPLEMENTATION: https://github.com/wabdelmoula/massNet. AVAILABILITY OF DATA: The data underlying this article are available in the NIH Common Fund's National Metabolomics Data Repository (NMDR) Metabolomics Workbench under project id (PR001292) with http://dx.doi.org/10.21228/M8Q70T. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Pierre Chaurand; Sarah A Schwartz; Dean Billheimer; Baogang J Xu; Anna Crecelius; Richard M Caprioli Journal: Anal Chem Date: 2004-02-15 Impact factor: 6.986
Authors: Liam A McDonnell; Alexandra van Remoortere; Nico de Velde; René J M van Zeijl; André M Deelder Journal: J Am Soc Mass Spectrom Date: 2010-08-21 Impact factor: 3.109
Authors: Theodore Alexandrov; Michael Becker; Sren-Oliver Deininger; Günther Ernst; Liane Wehder; Markus Grasmair; Ferdinand von Eggeling; Herbert Thiele; Peter Maass Journal: J Proteome Res Date: 2010-11-15 Impact factor: 4.466
Authors: Kirill A Veselkov; Reza Mirnezami; Nicole Strittmatter; Robert D Goldin; James Kinross; Abigail V M Speller; Tigran Abramov; Emrys A Jones; Ara Darzi; Elaine Holmes; Jeremy K Nicholson; Zoltan Takats Journal: Proc Natl Acad Sci U S A Date: 2014-01-07 Impact factor: 11.205
Authors: Daniel P Donnelly; Catherine M Rawlins; Caroline J DeHart; Luca Fornelli; Luis F Schachner; Ziqing Lin; Jennifer L Lippens; Krishna C Aluri; Richa Sarin; Bifan Chen; Carter Lantz; Wonhyeuk Jung; Kendall R Johnson; Antonius Koller; Jeremy J Wolff; Iain D G Campuzano; Jared R Auclair; Alexander R Ivanov; Julian P Whitelegge; Ljiljana Paša-Tolić; Julia Chamot-Rooke; Paul O Danis; Lloyd M Smith; Yury O Tsybin; Joseph A Loo; Ying Ge; Neil L Kelleher; Jeffrey N Agar Journal: Nat Methods Date: 2019-06-27 Impact factor: 28.547
Authors: David Pertzborn; Christoph Arolt; Günther Ernst; Oliver J Lechtenfeld; Jan Kaesler; Daniela Pelzel; Orlando Guntinas-Lichius; Ferdinand von Eggeling; Franziska Hoffmann Journal: Cancers (Basel) Date: 2022-09-05 Impact factor: 6.575