Literature DB >> 30830388

Comparing normalization methods and the impact of noise.

Thao Vu1, Eli Riekeberg2, Yumou Qiu1, Robert Powers3,4.   

Abstract

INTRODUCTION: Failure to properly account for normal systematic variations in OMICS datasets may result in misleading biological conclusions. Accordingly, normalization is a necessary step in the proper preprocessing of OMICS datasets. In this regards, an optimal normalization method will effectively reduce unwanted biases and increase the accuracy of downstream quantitative analyses. But, it is currently unclear which normalization method is best since each algorithm addresses systematic noise in different ways.
OBJECTIVE: Determine an optimal choice of a normalization method for the preprocessing of metabolomics datasets.
METHODS: Nine MVAPACK normalization algorithms were compared with simulated and experimental NMR spectra modified with added Gaussian noise and random dilution factors. Methods were evaluated based on an ability to recover the intensities of the true spectral peaks and the reproducibility of true classifying features from orthogonal projections to latent structures-discriminant analysis model (OPLS-DA).
RESULTS: Most normalization methods (except histogram matching) performed equally well at modest levels of signal variance. Only probabilistic quotient (PQ) and constant sum (CS) maintained the highest level of peak recovery (> 67%) and correlation with true loadings (> 0.6) at maximal noise.
CONCLUSION: PQ and CS performed the best at recovering peak intensities and reproducing the true classifying features for an OPLS-DA model regardless of spectral noise level. Our findings suggest that performance is largely determined by the level of noise in the dataset, while the effect of dilution factors was negligible. A minimal allowable noise level of 20% was also identified for a valid NMR metabolomics dataset.

Entities:  

Keywords:  Metabolomics; NMR; Noise; Normalization; Preprocessing chemometrics

Mesh:

Substances:

Year:  2018        PMID: 30830388      PMCID: PMC6638559          DOI: 10.1007/s11306-018-1400-6

Source DB:  PubMed          Journal:  Metabolomics        ISSN: 1573-3882            Impact factor:   4.290


  4 in total

1.  PRPS-ST: A protocol-agnostic self-training method for gene expression-based classification of blood cancers.

Authors:  Aixiang Jiang; Laura K Hilton; Jeffrey Tang; Christopher K Rushton; Bruno M Grande; David W Scott; Ryan D Morin
Journal:  Blood Cancer Discov       Date:  2020-09-10

2.  Metabolic changes associated with adaptive resistance to daptomycin in Streptococcus mitis-oralis.

Authors:  Allison Parrett; Joseph M Reed; Stewart G Gardner; Nagendra N Mishra; Arnold S Bayer; Robert Powers; Greg A Somerville
Journal:  BMC Microbiol       Date:  2020-06-15       Impact factor: 3.605

3.  Recurrent Topics in Mass Spectrometry-Based Metabolomics and Lipidomics-Standardization, Coverage, and Throughput.

Authors:  Evelyn Rampler; Yasin El Abiead; Harald Schoeny; Mate Rusz; Felina Hildebrand; Veronika Fitz; Gunda Koellensperger
Journal:  Anal Chem       Date:  2020-11-28       Impact factor: 6.986

4.  A Comprehensive Evaluation of Metabolomics Data Preprocessing Methods for Deep Learning.

Authors:  Krzysztof Jan Abram; Douglas McCloskey
Journal:  Metabolites       Date:  2022-02-24
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.