| Literature DB >> 30395171 |
Johannes Leuschner1, Maximilian Schmidt1, Pascal Fernsel1, Delf Lachmund1, Tobias Boskamp1, Peter Maass1.
Abstract
MOTIVATION: Non-negative matrix factorization (NMF) is a common tool for obtaining low-rank approximations of non-negative data matrices and has been widely used in machine learning, e.g. for supporting feature extraction in high-dimensional classification tasks. In its classical form, NMF is an unsupervised method, i.e. the class labels of the training data are not used when computing the NMF. However, incorporating the classification labels into the NMF algorithms allows to specifically guide them toward the extraction of data patterns relevant for discriminating the respective classes. This approach is particularly suited for the analysis of mass spectrometry imaging (MSI) data in clinical applications, such as tumor typing and classification, which are among the most challenging tasks in pathology. Thus, we investigate algorithms for extracting tumor-specific spectral patterns from MSI data by NMF methods.Entities:
Mesh:
Year: 2019 PMID: 30395171 PMCID: PMC6546133 DOI: 10.1093/bioinformatics/bty909
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Schematic MALDI MSI workflow. Tissue sections (a) are subjected to sample preparation including deparaffination, antigen retrieval, on-tissue tryptic digestion and matrix application (b). Prepared tissue sections are inserted into the MALDI MSI instrument (c) and mass spectra (d) are acquired. When fixing single m/z values, the intensities in the measurement area can be visualized as m/z images (e), reflecting the molecular distribution of peptides with corresponding masses
Fig. 2.Standard process of NMF-based data classification
Fig. 3.Left: Performance of selected classification schemes for different numbers of features and using 8-fold CV. Right: Performance variation for selected classification schemes for different numbers of features. Each vertical bar represents the minimum and maximum balanced accuracy achieved in the individual CV folds
Fig. 4.Classification performance achieved with selected supervised and unsupervised NMF models in the 2-fold CV scenario
Fig. 5.Comparison of the number of active weights for Flog and FR_log. Weights greater than 10% of the maximum absolute weight value are considered active
Fig. 6.Discriminatory patterns learned by the method Flog on data A and B