Literature DB >> 32379974

Applications of Machine Learning to In Silico Quantification of Chemicals without Analytical Standards.

Dimitri Panagopoulos Abrahamsson1, June-Soo Park2, Randolph R Singh3, Marina Sirota4,5, Tracey J Woodruff1.   

Abstract

Non-targeted analysis provides a comprehensive approach to analyze environmental and biological samples for nearly all chemicals present. One of the main shortcomings of current analytical methods and workflows is that they are unable to provide any quantitative information constituting an important obstacle in understanding environmental fate and human exposure. Herein, we present an in silico quantification method using mahine-learning for chemicals analyzed using electrospray ionization (ESI). We considered three data sets from different instrumental setups: (i) capillary electrophoresis electrospray ionization-mass spectrometry (CE-MS) in positive ionization mode (ESI+), (ii) liquid chromatography quadrupole time-of-flight mass spectrometry (LC-QTOF/MS) in ESI+ and (iii) LC-QTOF/MS in negative ionization mode (ESI-). We developed and applied two different machine-learning algorithms: a random forest (RF) and an artificial neural network (ANN) to predict the relative response factors (RRFs) of different chemicals based on their physicochemical properties. Chemical concentrations can then be calculated by dividing the measured abundance of a chemical, as peak area or peak height, by its corresponding RRF. We evaluated our models and tested their predictive power using 5-fold cross-validation (CV) and y randomization. Both the RF and the ANN models showed great promise in predicting RRFs. However, the accuracy of the predictions was dependent on the data set composition and the experimental setup. For the CE-MS ESI+ data set, the best model predicted measured RRFs with a mean absolute error (MAE) of 0.19 log units and a cross-validation coefficient of determination (Q2) of 0.84 for the testing set. For the LC-QTOF/MS ESI+ data set, the best model predicted measured RRFs with an MAE of 0.32 and a Q2 of 0.40. For the LC-QTOF/MS ESI- data set, the best model predicted measured RRFs with a MAE of 0.50 and a Q2 of 0.20. Our findings suggest that machine-learning algorithms can be used for predicting concentrations of nontargeted chemicals with reasonable uncertainties, especially in ESI+, while the application on ESI- remains a more challenging problem.

Entities:  

Mesh:

Year:  2020        PMID: 32379974      PMCID: PMC7328371          DOI: 10.1021/acs.jcim.9b01096

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  25 in total

1.  Transferability of the electrospray ionization efficiency scale between different instruments.

Authors:  Jaanus Liigand; Anneli Kruve; Piia Liigand; Asko Laaniste; Marion Girod; Rodolphe Antoine; Ivo Leito
Journal:  J Am Soc Mass Spectrom       Date:  2015-08-06       Impact factor: 3.109

2.  y-Randomization and its variants in QSPR/QSAR.

Authors:  Christoph Rücker; Gerta Rücker; Markus Meringer
Journal:  J Chem Inf Model       Date:  2007-09-20       Impact factor: 4.956

3.  Bayesian machine learning for quantum molecular dynamics.

Authors:  R V Krems
Journal:  Phys Chem Chem Phys       Date:  2019-06-26       Impact factor: 3.676

4.  Electrospray ionization efficiency scale of organic compounds.

Authors:  Merit Oss; Anneli Kruve; Koit Herodes; Ivo Leito
Journal:  Anal Chem       Date:  2010-04-01       Impact factor: 6.986

5.  Suspect Screening Analysis of Chemicals in Consumer Products.

Authors:  Katherine A Phillips; Alice Yau; Kristin A Favela; Kristin K Isaacs; Andrew McEachran; Christopher Grulke; Ann M Richard; Antony J Williams; Jon R Sobus; Russell S Thomas; John F Wambaugh
Journal:  Environ Sci Technol       Date:  2018-02-26       Impact factor: 9.028

6.  Household Dust as a Repository of Chemical Accumulation: New Insights from a Comprehensive High-Resolution Mass Spectrometric Study.

Authors:  Christoph Moschet; Tarun Anumol; Bonny M Lew; Deborah H Bennett; Thomas M Young
Journal:  Environ Sci Technol       Date:  2018-02-13       Impact factor: 9.028

Review 7.  Machine learning in chemoinformatics and drug discovery.

Authors:  Yu-Chen Lo; Stefano E Rensi; Wen Torng; Russ B Altman
Journal:  Drug Discov Today       Date:  2018-05-08       Impact factor: 7.851

8.  Virtual quantification of metabolites by capillary electrophoresis-electrospray ionization-mass spectrometry: predicting ionization efficiency without chemical standards.

Authors:  Kenneth R Chalcraft; Richard Lee; Casandra Mills; Philip Britz-McKibbin
Journal:  Anal Chem       Date:  2009-04-01       Impact factor: 6.986

9.  ChemmineR: a compound mining framework for R.

Authors:  Yiqun Cao; Anna Charisi; Li-Chang Cheng; Tao Jiang; Thomas Girke
Journal:  Bioinformatics       Date:  2008-07-02       Impact factor: 6.937

10.  Mordred: a molecular descriptor calculator.

Authors:  Hirotomo Moriwaki; Yu-Shi Tian; Norihito Kawashita; Tatsuya Takagi
Journal:  J Cheminform       Date:  2018-02-06       Impact factor: 5.514

View more
  5 in total

1.  Quantitative non-targeted analysis: Bridging the gap between contaminant discovery and risk characterization.

Authors:  James P McCord; Louis C Groff; Jon R Sobus
Journal:  Environ Int       Date:  2021-12-02       Impact factor: 9.621

2.  Uncertainty estimation strategies for quantitative non-targeted analysis.

Authors:  Louis C Groff; Jarod N Grossman; Anneli Kruve; Jeffrey M Minucci; Charles N Lowe; James P McCord; Dustin F Kapraun; Katherine A Phillips; S Thomas Purucker; Alex Chao; Caroline L Ring; Antony J Williams; Jon R Sobus
Journal:  Anal Bioanal Chem       Date:  2022-06-14       Impact factor: 4.478

3.  First Novel Workflow for Semiquantification of Emerging Contaminants in Environmental Samples Analyzed by Gas Chromatography-Atmospheric Pressure Chemical Ionization-Quadrupole Time of Flight-Mass Spectrometry.

Authors:  Reza Aalizadeh; Varvara Nikolopoulou; Nikiforos A Alygizakis; Nikolaos S Thomaidis
Journal:  Anal Chem       Date:  2022-06-27       Impact factor: 8.008

4.  Machine Learning for Absolute Quantification of Unidentified Compounds in Non-Targeted LC/HRMS.

Authors:  Emma Palm; Anneli Kruve
Journal:  Molecules       Date:  2022-02-02       Impact factor: 4.411

5.  Approaches for assessing performance of high-resolution mass spectrometry-based non-targeted analysis methods.

Authors:  Christine M Fisher; Katherine T Peter; Seth R Newton; Andrew J Schaub; Jon R Sobus
Journal:  Anal Bioanal Chem       Date:  2022-07-07       Impact factor: 4.478

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.