Genevera I Allen (1), Mirjana Maletić-Savatić
(1) Department of Pediatrics-Neurology, Baylor College of Medicine, Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston, TX 77030, USA. gallen@rice.edu
Abstract
MOTIVATION: Nuclear magnetic resonance (NMR) spectroscopy has been used to study mixtures of metabolites in biological samples. This technology produces a spectrum for each sample depicting the chemical shifts at which an unknown number of latent metabolites resonate. The interpretation of these data with common multivariate exploratory methods such as principal components analysis (PCA) is limited by the high dimensionality of the data, the non-negativity of the underlying spectra and dependencies at adjacent chemical shifts.

RESULTS: We develop a novel modification of PCA that is appropriate for the analysis of NMR data, entitled Sparse Non-Negative Generalized PCA. This method yields interpretable principal components and loading vectors that select important features and directly account for both the non-negativity of the underlying spectra and dependencies at adjacent chemical shifts. Through the reanalysis of experimental NMR data on five purified neural cell types, we demonstrate the utility of our methods for dimension reduction, pattern recognition, sample exploration and feature selection. Our methods lead to the identification of novel metabolites that reflect the differences between these cell types.

AVAILABILITY: www.stat.rice.edu/~gallen/software.html

CONTACT: gallen@rice.edu

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.