Literature DB >> 26002472

A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding.

Piotr S Gromski1, Howbeer Muhamadali1, David I Ellis1, Yun Xu1, Elon Correa1, Michael L Turner2, Royston Goodacre3.   

Abstract

The predominance of partial least squares-discriminant analysis (PLS-DA) used to analyze metabolomics datasets (indeed, it is the most well-known tool to perform classification and regression in metabolomics), can be said to have led to the point that not all researchers are fully aware of alternative multivariate classification algorithms. This may in part be due to the widespread availability of PLS-DA in most of the well-known statistical software packages, where its implementation is very easy if the default settings are used. In addition, one of the perceived advantages of PLS-DA is that it has the ability to analyze highly collinear and noisy data. Furthermore, the calibration model is known to provide a variety of useful statistics, such as prediction accuracy as well as scores and loadings plots. However, this method may provide misleading results, largely due to a lack of suitable statistical validation, when used by non-experts who are not aware of its potential limitations when used in conjunction with metabolomics. This tutorial review aims to provide an introductory overview to several straightforward statistical methods such as principal component-discriminant function analysis (PC-DFA), support vector machines (SVM) and random forests (RF), which could very easily be used either to augment PLS or as alternative supervised learning methods to PLS-DA. These methods can be said to be particularly appropriate for the analysis of large, highly-complex data sets which are common output(s) in metabolomics studies where the numbers of variables often far exceed the number of samples. In addition, these alternative techniques may be useful tools for generating parsimonious models through feature selection and data reduction, as well as providing more propitious results. We sincerely hope that the general reader is left with little doubt that there are several promising and readily available alternatives to PLS-DA, to analyze large and highly complex data sets.
Copyright © 2015 Elsevier B.V. All rights reserved.

Keywords:  Chemometrics; Metabolomics; Partial least squares-discriminant analysis; Principal component-discriminant function analysis; Random forests; Support vector machines

Mesh:

Year:  2015        PMID: 26002472     DOI: 10.1016/j.aca.2015.02.012

Source DB:  PubMed          Journal:  Anal Chim Acta        ISSN: 0003-2670            Impact factor:   6.558


  166 in total

1.  1H NMR-based metabonomics for infertility diagnosis in men with varicocele.

Authors:  Filipe Tenorio Lira Neto; Ronmilson Alves Marques; Alexandre de Freitas Cavalcanti Filho; Leslie Clifford Noronha Araujo; Salvador Vilar Correia Lima; Licarion Pinto; Ricardo Oliveira Silva
Journal:  J Assist Reprod Genet       Date:  2020-07-26       Impact factor: 3.412

2.  Metabolomics technology and bioinformatics for precision medicine.

Authors:  Rajeev K Azad; Vladimir Shulaev
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

3.  Metabolomic characterization of hypertension and dyslipidemia.

Authors:  Chaofu Ke; Xiaohong Zhu; Yuxia Zhang; Yueping Shen
Journal:  Metabolomics       Date:  2018-08-31       Impact factor: 4.290

4.  Nitrogen deprivation in Fusarium oxysporum promotes mycotoxin production via intermediates in the Krebs cycle and unreported methylmalonyl-CoA mutase activity.

Authors:  A V Karpe; M S Dunn; M C Taylor; T Nguyen; C Ong; T Karla; S Rockman; D J Beale
Journal:  Metabolomics       Date:  2018-12-11       Impact factor: 4.290

5.  Predictors of ccf-mtDNA reactivity to acute psychological stress identified using machine learning classifiers: A proof-of-concept.

Authors:  Caroline Trumpff; Anna L Marsland; Richard P Sloan; Brett A Kaufman; Martin Picard
Journal:  Psychoneuroendocrinology       Date:  2019-05-07       Impact factor: 4.905

6.  Omics, big data and machine learning as tools to propel understanding of biological mechanisms and to discover novel diagnostics and therapeutics.

Authors:  Nikolaos Perakakis; Alireza Yazdani; George E Karniadakis; Christos Mantzoros
Journal:  Metabolism       Date:  2018-08-08       Impact factor: 8.694

7.  Critical review of reporting of the data analysis step in metabolomics.

Authors:  E C Considine; G Thomas; A L Boulesteix; A S Khashan; L C Kenny
Journal:  Metabolomics       Date:  2017-12-01       Impact factor: 4.290

8.  Urine metabolomic analysis for monitoring internal load in professional football players.

Authors:  Guillermo Quintas; Xavier Reche; Juan Daniel Sanjuan-Herráez; Helena Martínez; Marta Herrero; Xavier Valle; Marc Masa; Gil Rodas
Journal:  Metabolomics       Date:  2020-03-28       Impact factor: 4.290

9.  Tracing Hematopoietic Progenitor Cell Neutrophilic Differentiation via Raman Spectroscopy.

Authors:  Ji Sun Choi; Yelena Ilin; Mary L Kraft; Brendan A C Harley
Journal:  Bioconjug Chem       Date:  2018-09-06       Impact factor: 4.774

10.  Metabolomics in the prevention and management of asthma.

Authors:  Zhaozhong Zhu; Carlos A Camargo; Kohei Hasegawa
Journal:  Expert Rev Respir Med       Date:  2019-10-09       Impact factor: 3.772

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.