Literature DB >> 20566707

Enhancement of plant metabolite fingerprinting by machine learning.

Ian M Scott1, Cornelia P Vermeer, Maria Liakata, Delia I Corol, Jane L Ward, Wanchang Lin, Helen E Johnson, Lynne Whitehead, Baldeep Kular, John M Baker, Sean Walsh, Anuja Dave, Tony R Larson, Ian A Graham, Trevor L Wang, Ross D King, John Draper, Michael H Beale.   

Abstract

Metabolite fingerprinting of Arabidopsis (Arabidopsis thaliana) mutants with known or predicted metabolic lesions was performed by (1)H-nuclear magnetic resonance, Fourier transform infrared, and flow injection electrospray-mass spectrometry. Fingerprinting enabled processing of five times more plants than conventional chromatographic profiling and was competitive for discriminating mutants, other than those affected in only low-abundance metabolites. Despite their rapidity and complexity, fingerprints yielded metabolomic insights (e.g. that effects of single lesions were usually not confined to individual pathways). Among fingerprint techniques, (1)H-nuclear magnetic resonance discriminated the most mutant phenotypes from the wild type and Fourier transform infrared discriminated the fewest. To maximize information from fingerprints, data analysis was crucial. One-third of distinctive phenotypes might have been overlooked had data models been confined to principal component analysis score plots. Among several methods tested, machine learning (ML) algorithms, namely support vector machine or random forest (RF) classifiers, were unsurpassed for phenotype discrimination. Support vector machines were often the best performing classifiers, but RFs yielded some particularly informative measures. First, RFs estimated margins between mutant phenotypes, whose relations could then be visualized by Sammon mapping or hierarchical clustering. Second, RFs provided importance scores for the features within fingerprints that discriminated mutants. These scores correlated with analysis of variance F values (as did Kruskal-Wallis tests, true- and false-positive measures, mutual information, and the Relief feature selection algorithm). ML classifiers, as models trained on one data set to predict another, were ideal for focused metabolomic queries, such as the distinctiveness and consistency of mutant phenotypes. Accessible software for use of ML in plant physiology is highlighted.

Entities:  

Mesh:

Year:  2010        PMID: 20566707      PMCID: PMC2923910          DOI: 10.1104/pp.109.150524

Source DB:  PubMed          Journal:  Plant Physiol        ISSN: 0032-0889            Impact factor:   8.340


  57 in total

1.  Growth stage-based phenotypic analysis of Arabidopsis: a model for high throughput functional genomics in plants.

Authors:  D C Boyes; A M Zayed; R Ascenzi; A J McCaskill; N E Hoffman; K R Davis; J Görlach
Journal:  Plant Cell       Date:  2001-07       Impact factor: 11.277

2.  Predictive metabolic engineering: a goal for systems biology.

Authors:  Lee J Sweetlove; Robert L Last; Alisdair R Fernie
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

3.  Preprocessing, classification modeling and feature selection using flow injection electrospray mass spectrometry metabolite fingerprint data.

Authors:  David P Enot; Wanchang Lin; Manfred Beckmann; David Parker; David P Overy; John Draper
Journal:  Nat Protoc       Date:  2008       Impact factor: 13.491

4.  Cuticular lipid composition, surface structure, and gene expression in Arabidopsis stem epidermis.

Authors:  Mi Chung Suh; A Lacey Samuels; Reinhard Jetter; Ljerka Kunst; Mike Pollard; John Ohlrogge; Fred Beisson
Journal:  Plant Physiol       Date:  2005-11-18       Impact factor: 8.340

5.  Leaf vitamin C contents modulate plant defense transcripts and regulate genes that control development through hormone signaling.

Authors:  Gabriela M Pastori; Guy Kiddle; John Antoniw; Stephanie Bernard; Sonja Veljovic-Jovanovic; Paul J Verrier; Graham Noctor; Christine H Foyer
Journal:  Plant Cell       Date:  2003-04       Impact factor: 11.277

Review 6.  Chloroformates in gas chromatography as general purpose derivatizing agents.

Authors:  P Husek
Journal:  J Chromatogr B Biomed Sci Appl       Date:  1998-10-09

7.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes.

Authors:  Oliver Thimm; Oliver Bläsing; Yves Gibon; Axel Nagel; Svenja Meyer; Peter Krüger; Joachim Selbig; Lukas A Müller; Seung Y Rhee; Mark Stitt
Journal:  Plant J       Date:  2004-03       Impact factor: 6.417

Review 8.  An assessment of recently published gene expression data analyses: reporting experimental design and statistical factors.

Authors:  Peyman Jafari; Francisco Azuaje
Journal:  BMC Med Inform Decis Mak       Date:  2006-06-21       Impact factor: 2.796

9.  Automics: an integrated platform for NMR-based metabonomics spectral processing and data analysis.

Authors:  Tao Wang; Kang Shao; Qinying Chu; Yanfei Ren; Yiming Mu; Lijia Qu; Jie He; Changwen Jin; Bin Xia
Journal:  BMC Bioinformatics       Date:  2009-03-16       Impact factor: 3.169

Review 10.  Machine learning and its applications to biology.

Authors:  Adi L Tarca; Vincent J Carey; Xue-wen Chen; Roberto Romero; Sorin Drăghici
Journal:  PLoS Comput Biol       Date:  2007-06       Impact factor: 4.475

View more
  6 in total

1.  Serum Metabonomics of Mild Acute Pancreatitis.

Authors:  Hongmin Xu; Lei Zhang; Huan Kang; Jiandong Zhang; Jie Liu; Shuye Liu
Journal:  J Clin Lab Anal       Date:  2016-05-12       Impact factor: 2.352

2.  Plantmetabolomics.org: mass spectrometry-based Arabidopsis metabolomics--database and tools update.

Authors:  Preeti Bais; Stephanie M Moon-Quanbeck; Basil J Nikolau; Julie A Dickerson
Journal:  Nucleic Acids Res       Date:  2011-11-10       Impact factor: 16.971

3.  Predicting future biomass yield in Miscanthus using the carbohydrate metabolic profile as a biomarker.

Authors:  Anne L Maddison; Anyela Camargo-Rodriguez; Ian M Scott; Charlotte M Jones; Dafydd M O Elias; Sarah Hawkins; Alice Massey; John Clifton-Brown; Niall P McNamara; Iain S Donnison; Sarah J Purdy
Journal:  Glob Change Biol Bioenergy       Date:  2017-01-21       Impact factor: 4.745

4.  Natural variation in wild tomato trichomes; selecting metabolites that contribute to insect resistance using a random forest approach.

Authors:  Ruy W J Kortbeek; Marc D Galland; Aleksandra Muras; Frans M van der Kloet; Bart André; Maurice Heilijgers; Sacha A F T van Hijum; Michel A Haring; Robert C Schuurink; Petra M Bleeker
Journal:  BMC Plant Biol       Date:  2021-07-02       Impact factor: 4.215

Review 5.  You Are What You Eat: Application of Metabolomics Approaches to Advance Nutrition Research.

Authors:  Abdul-Hamid M Emwas; Nahla Al-Rifai; Kacper Szczepski; Shuruq Alsuhaymi; Saleh Rayyan; Hanan Almahasheer; Mariusz Jaremko; Lorraine Brennan; Joanna Izabela Lachowicz
Journal:  Foods       Date:  2021-05-31

Review 6.  Metabolomics-Guided Elucidation of Plant Abiotic Stress Responses in the 4IR Era: An Overview.

Authors:  Morena M Tinte; Kekeletso H Chele; Justin J J van der Hooft; Fidele Tugizimana
Journal:  Metabolites       Date:  2021-07-08
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.