Literature DB >> 30625565

Machine learning combined with non-targeted LC-HRMS analysis for a risk warning system of chemical hazards in drinking water: A proof of concept.

Saer Samanipour1, Sarit Kaserzon2, Soumini Vijayasarathy2, Hui Jiang2, Phil Choi2, Malcolm J Reid3, Jochen F Mueller2, Kevin V Thomas4.   

Abstract

Guaranteeing clean drinking water to the global population is becoming more challenging, because of the cases of water scarcity across the globe, growing population, and increased chemical footprint of this population. Existing targeted strategies for hazard monitoring in drinking water are not adequate to handle such diverse and multidimensional stressors. In the current study, we have developed, validated, and tested a machine learning algorithm based on the data produced via non-targeted liquid chromatography coupled with high resolution mass spectrometry (LC-HRMS) for the identification of potential chemical hazards in drinking water. The machine learning algorithm consisted of a composite statistical model including an unsupervised component (i.e. principal component analysis PCA) and a supervised one (i.e. partial least square discrimination analysis PLS-DA). This model was trained using a training set of 20 drinking water samples previously tested via conventional suspect screening. The developed model was validated using a validation set of 20 drinking water samples of which 4 were spiked with 15 labeled standards at four different concentration levels. The model successfully detected all of the added analytes in the four spiked samples without producing any cases of false detection. The same validation set was processed via conventional trend analysis in order to cross validate the composite model. The results of cross validation showed that even though the conventional trend analysis approach produced a false positive detection rate of ≤5% the composite model outperformed that approach by producing zero cases of false detection. Additionally, the validated model went through an additional test with 42 extra drinking water samples from the same source for an unbiased examination of the model. Finally, the potentials and limitations of this approach were further discussed.
Copyright © 2018 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Drinking water; LC-HRMS; Machine learning; Non-target; Statistical modeling

Mesh:

Substances:

Year:  2018        PMID: 30625565     DOI: 10.1016/j.talanta.2018.11.039

Source DB:  PubMed          Journal:  Talanta        ISSN: 0039-9140            Impact factor:   6.057


  1 in total

1.  From Centroided to Profile Mode: Machine Learning for Prediction of Peak Width in HRMS Data.

Authors:  Saer Samanipour; Phil Choi; Jake W O'Brien; Bob W J Pirok; Malcolm J Reid; Kevin V Thomas
Journal:  Anal Chem       Date:  2021-11-29       Impact factor: 6.986

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.