| Literature DB >> 28525573 |
Bo Li1, Jing Tang1, Qingxia Yang1,2, Shuang Li1, Xuejiao Cui1, Yinghong Li1, Yuzong Chen3, Weiwei Xue1, Xiaofeng Li1, Feng Zhu1,2.
Abstract
Diverse forms of unwanted signal variations in mass spectrometry-based metabolomics data adversely affect the accuracies of metabolic profiling. A variety of normalization methods have been developed for addressing this problem. However, their performances vary greatly and depend heavily on the nature of the studied data. Moreover, given the complexity of the actual data, it is not feasible to assess the performance of methods by single criterion. We therefore developed NOREVA to enable performance evaluation of various normalization methods from multiple perspectives. NOREVA integrated five well-established criteria (each with a distinct underlying theory) to ensure more comprehensive evaluation than any single criterion. It provided the most complete set of the available normalization methods, with unique features of removing overall unwanted variations based on quality control metabolites and allowing quality control samples based correction sequentially followed by data normalization. The originality of NOREVA and the reliability of its algorithms were extensively validated by case studies on five benchmark datasets. In sum, NOREVA is distinguished for its capability of identifying the well performed normalization method by taking multiple criteria into consideration and can be an indispensable complement to other available tools. NOREVA can be freely accessed at http://server.idrb.cqu.edu.cn/noreva/.Entities:
Mesh:
Year: 2017 PMID: 28525573 PMCID: PMC5570188 DOI: 10.1093/nar/gkx449
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The general workflow of NOREVA. (A) Uploading of mass spectrometry (MS)-based metabolomics data with or without IS, QCM and quality control sample (QCS); (B) Data pre-processing by QC-RLSC and imputation of missing signals; (C) Data normalization based on the studied methods; (D) Performance evaluation by multiple criteria.
Evaluation results of four criteria on benchmark dataset MTBLS79 (selected measure under each criterion was shown in bracket)
| Criterion ( | Criterion ( | Criterion ( | Criterion ( | |
|---|---|---|---|---|
| (PMAD) | (distribution of | (consistency) | (AUC) | |
| Auto scaling | 0.8360 | Good | 14.6500 | 0.8344 |
| Contrast | 0.7797 | Fair | 9.7500 | 0.6250 |
| Cubic splines | 0.1393 | Excellent | 13.7500 | 0.8322 |
| Cyclic loess | 0.3188 | Good | 15.6500 | 0.8356 |
| EigenMS | 0.1799 | Good | 16.4000 | 0.8010 |
| Level scaling | 0.2890 | Good | 15.1000 | 0.8345 |
| Linear baseline | 0.6035 | Fair | 6.3000 | 0.7072 |
| Log-transform | 0.1349 | Good | 14.7500 | 0.8168 |
| Mean | 0.3100 | Good | 14.7500 | 0.8213 |
| Median | 0.3100 | Good | 14.5500 | 0.8177 |
| MSTUS | 0.0064 | Good | 14.3500 | 0.8405 |
| Pareto scaling | 0.5320 | Good | 14.9500 | 0.8344 |
| Power scaling | 0.1660 | Good | 14.9500 | 0.8314 |
| PQN | 0.3260 | Good | 13.7000 | 0.8309 |
| Quantile | 0.2989 | Excellent | 13.8000 | 0.8119 |
| Range scaling | 0.1573 | Good | 15.3500 | 0.8344 |
| Total sum | 2.4336 | Fair | 14.7000 | 0.7538 |
| Vast scaling | 2.7200 | Good | 15.0000 | 0.8344 |
| VSN | 0.5626 | Excellent | 13.7500 | 0.8373 |
The way calculating those measures was described in ‘Materials and Methods’ section and ‘Supplementary Methods’ section. Besides of quantitative measures, qualitative ones such as distribution of P-value were also evaluated and three performance levels were provided (Excellent, Good and Fair). Qualitative measures were evaluated by visual inspection and examples illustrating how those three performance levels were assigned were shown in Supplementary Figure S1.
Figure 2.Difference between logFCs of the normalized results across various methods and those of the spike-in metabolites (used here as the gold standard). Only one method (Contrast) led to unbiased logFC estimates and thus effectively preserved the true biological variations.
Figure 3.Evaluation of QC-RLSC's effect on signal drifts correction. (A and B) performance evaluation based on the intensity of an example metabolite M72T126. In contrast to M72T126΄s intensity before QCS-based correction (A), the results after correction were greatly corrected (B) by lining QC samples (blue dots) in a more straight line. (C and D) The first two principal components of the dataset MTBLS146 before and after QCS-based correction. Signal variations between two analytical batches were clearly evident (C) and were effectively suppressed by signal correction (D).
Figure 4.Performance assessment of (A) three IS-based normalization methods and (B) one QCM-based normalization method on removing unwanted variations by the RLA plots before and after normalization. Parameters used in this case study for the RUV-random method were set as k = 3 and λ = 0.03.