Literature DB >> 27547730

PCA as a practical indicator of OPLS-DA model reliability.

Bradley Worley1, Robert Powers1.   

Abstract

BACKGROUND: Principal Component Analysis (PCA) and Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) are powerful statistical modeling tools that provide insights into separations between experimental groups based on high-dimensional spectral measurements from NMR, MS or other analytical instrumentation. However, when used without validation, these tools may lead investigators to statistically unreliable conclusions. This danger is especially real for Partial Least Squares (PLS) and OPLS, which aggressively force separations between experimental groups. As a result, OPLS-DA is often used as an alternative method when PCA fails to expose group separation, but this practice is highly dangerous. Without rigorous validation, OPLS-DA can easily yield statistically unreliable group separation.
METHODS: A Monte Carlo analysis of PCA group separations and OPLS-DA cross-validation metrics was performed on NMR datasets with statistically significant separations in scores-space. A linearly increasing amount of Gaussian noise was added to each data matrix followed by the construction and validation of PCA and OPLS-DA models.
RESULTS: With increasing added noise, the PCA scores-space distance between groups rapidly decreased and the OPLS-DA cross-validation statistics simultaneously deteriorated. A decrease in correlation between the estimated loadings (added noise) and the true (original) loadings was also observed. While the validity of the OPLS-DA model diminished with increasing added noise, the group separation in scores-space remained basically unaffected.
CONCLUSION: Supported by the results of Monte Carlo analyses of PCA group separations and OPLS-DA cross-validation metrics, we provide practical guidelines and cross-validatory recommendations for reliable inference from PCA and OPLS-DA models.

Entities:  

Keywords:  Chemometrics; Metabolomics; OPLS; PCA; PLS

Year:  2016        PMID: 27547730      PMCID: PMC4990351          DOI: 10.2174/2213235X04666160613122429

Source DB:  PubMed          Journal:  Curr Metabolomics        ISSN: 2213-235X


  8 in total

1.  Statistical total correlation spectroscopy: an exploratory approach for latent biomarker identification from metabolic 1H NMR data sets.

Authors:  Olivier Cloarec; Marc-Emmanuel Dumas; Andrew Craig; Richard H Barton; Johan Trygg; Jane Hudson; Christine Blancher; Dominique Gauguier; John C Lindon; Elaine Holmes; Jeremy Nicholson
Journal:  Anal Chem       Date:  2005-03-01       Impact factor: 6.986

2.  Raman spectroscopy of blood for species identification.

Authors:  Gregory McLaughlin; Kyle C Doty; Igor K Lednev
Journal:  Anal Chem       Date:  2014-11-12       Impact factor: 6.986

3.  Multivariate Analysis in Metabolomics.

Authors:  Bradley Worley; Robert Powers
Journal:  Curr Metabolomics       Date:  2013

4.  Utilities for quantifying separation in PCA/PLS-DA scores plots.

Authors:  Bradley Worley; Steven Halouska; Robert Powers
Journal:  Anal Biochem       Date:  2012-10-15       Impact factor: 3.365

Review 5.  Human urinary carcinogen metabolites: biomarkers for investigating tobacco and cancer.

Authors:  Stephen S Hecht
Journal:  Carcinogenesis       Date:  2002-06       Impact factor: 4.944

6.  Centering, scaling, and transformations: improving the biological information content of metabolomics data.

Authors:  Robert A van den Berg; Huub C J Hoefsloot; Johan A Westerhuis; Age K Smilde; Mariët J van der Werf
Journal:  BMC Genomics       Date:  2006-06-08       Impact factor: 3.969

7.  Metabolite content profiling of bottlenose dolphin exhaled breath.

Authors:  Alexander A Aksenov; Laura Yeates; Alberto Pasamontes; Craig Siebe; Yuriy Zrodnikov; Jason Simmons; Mitchell M McCartney; Jean-Pierre Deplanque; Randall S Wells; Cristina E Davis
Journal:  Anal Chem       Date:  2014-10-17       Impact factor: 6.986

8.  MVAPACK: a complete data handling package for NMR metabolomics.

Authors:  Bradley Worley; Robert Powers
Journal:  ACS Chem Biol       Date:  2014-03-07       Impact factor: 5.100

  8 in total
  60 in total

Review 1.  The Role of the Gut Microbiome in Predicting Response to Diet and the Development of Precision Nutrition Models-Part I: Overview of Current Methods.

Authors:  Riley L Hughes; Maria L Marco; James P Hughes; Nancy L Keim; Mary E Kable
Journal:  Adv Nutr       Date:  2019-11-01       Impact factor: 8.701

Review 2.  Recent Advances in NMR-Based Metabolomics.

Authors:  G A Nagana Gowda; Daniel Raftery
Journal:  Anal Chem       Date:  2016-12-02       Impact factor: 6.986

3.  NMR Metabolomics Protocols for Drug Discovery.

Authors:  Fatema Bhinderwala; Robert Powers
Journal:  Methods Mol Biol       Date:  2019

4.  Metabolomic study of polyamines in rat urine following intraperitoneal injection of γ-hydroxybutyric acid.

Authors:  Hyeon-Seong Lee; Chan Seo; Young-A Kim; Meejung Park; Boyeon Choi; Moongi Ji; Sooyeun Lee; Man-Jeong Paik
Journal:  Metabolomics       Date:  2019-04-02       Impact factor: 4.290

Review 5.  Beyond the paradigm: Combining mass spectrometry and nuclear magnetic resonance for metabolomics.

Authors:  Darrell D Marshall; Robert Powers
Journal:  Prog Nucl Magn Reson Spectrosc       Date:  2017-01-11       Impact factor: 9.795

6.  Investigation of Yacon Leaves (Smallanthus sonchifolius) for α-Glucosidase Inhibitors Using Metabolomics and In Silico Approach.

Authors:  Zuhelmi Aziz; Nancy Dewi Yuliana; Partomuan Simanjuntak; Mohamad Rafi; Esti Mulatsari; Syamsudin Abdillah
Journal:  Plant Foods Hum Nutr       Date:  2021-10-19       Impact factor: 3.921

7.  The use of ecological analytical tools as an unconventional approach for untargeted metabolomics data analysis: the case of Cecropia obtusifolia and its adaptive responses to nitrate starvation.

Authors:  Jorge David Cadena-Zamudio; Juan Luis Monribot-Villanueva; Claudia-Anahí Pérez-Torres; Fulgencio Alatorre-Cobos; Beatriz Jiménez-Moraila; José A Guerrero-Analco; Enrique Ibarra-Laclette
Journal:  Funct Integr Genomics       Date:  2022-10-06       Impact factor: 3.674

8.  Evaluation of Multivariate Classification Models for Analyzing NMR Metabolomics Data.

Authors:  Thao Vu; Parker Siemek; Fatema Bhinderwala; Yuhang Xu; Robert Powers
Journal:  J Proteome Res       Date:  2019-08-22       Impact factor: 4.466

9.  Improved Discrimination of Disease States Using Proteomics Data with the Updated Aristotle Classifier.

Authors:  David Hua; Heather Desaire
Journal:  J Proteome Res       Date:  2021-04-28       Impact factor: 4.466

10.  Metabolic Profiling of Praziquantel-mediated Prevention of Opisthorchis viverrini-induced Cholangiocyte Transformation in the Hamster Model of Cholangiocarcinoma.

Authors:  Pattama Prommajun; Jutarop Phetcharaburanin; Nisana Namwat; Poramate Klanrit; Prakasit Sa-Ngiamwibool; Malinee Thanee; Hasaya Dokduang; Yingpinyapat Kittirat; Jia V Li; Watcharin Loilome
Journal:  Cancer Genomics Proteomics       Date:  2021 Jan-Feb       Impact factor: 4.069

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.