Literature DB >> 34185426

Susceptibility to false discovery in biomarker research using liquid chromatography-high resolution mass spectrometry based untargeted metabolomics profiling.

Pengwei Zhang1,2, Irene L Ang2, Melody M T Lam3, Rui Wei2, Kate M K Lei2, Xingwang Zhou4, Henry H N Lam5, Qing-Yu He1, Terence C W Poon2.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34185426      PMCID: PMC8236120          DOI: 10.1002/ctm2.469

Source DB:  PubMed          Journal:  Clin Transl Med        ISSN: 2001-1326


× No keyword cloud information.
Dear Editor, Our study demonstrates that biomarker research using liquid chromatography (LC)‐high resolution (HR) mass spectrometry (MS) based untargeted metabolomics profiling is susceptible to the discovery of false positive biomarkers. LC‐MS, especially LC‐HRMS, is popularly used to discover putative biomarkers through comparing untargeted metabolomic profiles between a patient group and a control group. , This approach is susceptible to various pre‐analytical, analytical, and post‐analytical biases. Moreover, isotopes, adducts, in‐source fragment products of some metabolites, artifacts, and contaminants could be wrongly considered as unique metabolomic features. , To what extent the putative metabolomic biomarkers could be false remains unknown. We attempted to identify putative biomarkers for differentiating two artificial groups of plasma samples (12 samples in each group) with well‐defined differences in their metabolome contents (Figure 1A, Table 1; Tables S1 and S2). By design, a maximum of 22 putative biomarkers are true, and the rest of the putative biomarkers must be false.
FIGURE 1

The experimental workflow of the present study (A), MS and MS/MS spectra of representative relevant false positive biomarkers (B – G), and the observed change‐folds of true biomarkers and false positive biomarkers (H, I). (A) When using real patient samples for biomarker discovery, it is not possible to tell which putative biomarkers are false, and it is also not possible to avoid pre‐analytical biases. To overcome these two problems, two groups of plasma samples (12 samples in each group) mimicking those collected from a disease group and a non‐disease group were created by spiking separately with two different sets of 11 metabolite standards into a preparation of pooled human plasma. Each of the 24 plasma samples (i.e., 12 from the disease group and 12 from the non‐disease group) was treated as a unique patient sample and subjected to separate metabolite extraction and LC‐HRMS based untargeted metabolomics profiling. XCMS was employed for feature extraction, grouping and retention time alignment. CAMERA and MS‐FLO were used to annotate and remove the redundant peaks. At a false discovery rate of ≤ 5%, metabolomic features with statistically significant differential intensities were regarded as putative biomarkers. By the design, a maximum of 22 putative biomarkers could be true, and the rest of the putative biomarkers were false. Six relevant false positive biomarkers which were identified as the in‐source fragmentation products (B, C) and adducts (D,E) of a true biomarker (e.g., Tryptophan, Trp). The retention times of the in‐source fragmentation products (B) and adducts (D) was the same as that of the original metabolite. The m/z values of the in‐source fragmentation products (B) identical to the m/z value of one of the peaks in the MS/MS spectrum of the original metabolite (C). The MS/MS fragmentation patterns of the adducts (D) and the principal ion of original metabolite shared common fragment ions (E). An example of a relevant false positive biomarker which was identified as in‐source complex of a true biomarker (Valine) and an endogenous metabolite (Taurine) (F). The extracted ion chromatogram (10 ppm tolerance) of valine (upper panel), taurine (middle panel) and in‐source complex of valine and taurine (lower panel) (F), and the MS/MS fragmentation spectra showed fragments of the relevant false positive biomarker [Taurine+Valine+2Na+H‐2H]+ (G). The box‐plots (H) of the absolute log2 fold‐change values of the true biomarkers, relevant false positive biomarkers and irrelevant false positive biomarkers and their comparisons (Mann Whitney test, two‐tailed P‐values). ROC curve (I) for differentiation the true biomarkers and relevant false positive biomarkers from the irrelevant false positive biomarkers using various cutoffs of the absolute value of log2 fold‐change. For each cutoff, the true acceptance rate was the proportion of true biomarkers and relevant false positive biomarkers having their absolute values of log2 fold‐change higher than the cutoff, whereas the false acceptance rate was the proportion of irrelevant false positive biomarkers having their values higher than the cutoff

TABLE 1

Information of the two sets of metabolite standards spiked into the human pooled plasma

Metabolite standardSetReported concentration in human plasma (μM)Estimated mean concentration (μM) in human plasmaSpiked amount (μM)Supplemental References
L‐LeucineA123 ± 25, 140, 66 to 170125250[17–19]
CreatineA44 ± 28, 22.5 to 6255110[20, 21]
L‐HistidineA82 ± 10, 78.8 ± 18.980160[17, 22]
L‐PhenylalanineA57 ± 9, 70 ± 10.98, 63.0 ± 11.765130[17, 23, 24]
D‐(+)‐GlucoseA4700 to 6100, 5770500010000[17, 25]
L‐Aspartic AcidA54 ± 4, 3 ± 1, 19.3 ± 4.8, 30 ± 142040[17, 22, 26, 27]
L‐ValineA233 ± 43, 212.0 ± 61230460[17, 28]
L‐AlanineA333 ± 74330660[17]
L‐SerineA114 ± 19, 258 ± 27, 104 ± 26, 159.8 ± 26.6140280[17, 26–28]
L‐CarnitineA25.4 to 54.1, 43.6 ± 9.9, 50 ± 304590[23, 29, 30]
L‐GlutamineA586 ± 84, 5905901180[17, 31]
CreatinineB35 to 122, 74.12 ± 10.9175150[32, 33]
L‐ProlineB168 ± 60, 339 ± 51, 163.6240480[17, 26, 34]
BetaineB20 to 144, 57.0 ± 15.475150[17, 35, 36]
L‐GlycineB230 ± 52, 255.4 ± 65.9230460[17, 31]
L‐ArginineB110 ± 21.4, 99.0 ± 22.8, 82.2 to 140.9110220[23, 37, 38]
L‐ThreonineB140 ± 33, 127.9 ± 28.2140280[17, 39]
L‐TryptophanB67 to 72, 44 ± 750100[26, 40]
L‐LysineB188 ± 32, 150 ± 2.1, 183 ± 39180360[17, 41, 42]
L‐Glutamic AcidB24 ± 15, 87.0 ± 37.1, 72.0 ± 10.185170[17, 22, 43]
L‐AsparagineB41 ± 10, 40.04080[17, 44]
HypoxanthineB8.14 ± 2.86, 11.02 ± 3.671020[45, 46]
The experimental workflow of the present study (A), MS and MS/MS spectra of representative relevant false positive biomarkers (B – G), and the observed change‐folds of true biomarkers and false positive biomarkers (H, I). (A) When using real patient samples for biomarker discovery, it is not possible to tell which putative biomarkers are false, and it is also not possible to avoid pre‐analytical biases. To overcome these two problems, two groups of plasma samples (12 samples in each group) mimicking those collected from a disease group and a non‐disease group were created by spiking separately with two different sets of 11 metabolite standards into a preparation of pooled human plasma. Each of the 24 plasma samples (i.e., 12 from the disease group and 12 from the non‐disease group) was treated as a unique patient sample and subjected to separate metabolite extraction and LC‐HRMS based untargeted metabolomics profiling. XCMS was employed for feature extraction, grouping and retention time alignment. CAMERA and MS‐FLO were used to annotate and remove the redundant peaks. At a false discovery rate of ≤ 5%, metabolomic features with statistically significant differential intensities were regarded as putative biomarkers. By the design, a maximum of 22 putative biomarkers could be true, and the rest of the putative biomarkers were false. Six relevant false positive biomarkers which were identified as the in‐source fragmentation products (B, C) and adducts (D,E) of a true biomarker (e.g., Tryptophan, Trp). The retention times of the in‐source fragmentation products (B) and adducts (D) was the same as that of the original metabolite. The m/z values of the in‐source fragmentation products (B) identical to the m/z value of one of the peaks in the MS/MS spectrum of the original metabolite (C). The MS/MS fragmentation patterns of the adducts (D) and the principal ion of original metabolite shared common fragment ions (E). An example of a relevant false positive biomarker which was identified as in‐source complex of a true biomarker (Valine) and an endogenous metabolite (Taurine) (F). The extracted ion chromatogram (10 ppm tolerance) of valine (upper panel), taurine (middle panel) and in‐source complex of valine and taurine (lower panel) (F), and the MS/MS fragmentation spectra showed fragments of the relevant false positive biomarker [Taurine+Valine+2Na+H‐2H]+ (G). The box‐plots (H) of the absolute log2 fold‐change values of the true biomarkers, relevant false positive biomarkers and irrelevant false positive biomarkers and their comparisons (Mann Whitney test, two‐tailed P‐values). ROC curve (I) for differentiation the true biomarkers and relevant false positive biomarkers from the irrelevant false positive biomarkers using various cutoffs of the absolute value of log2 fold‐change. For each cutoff, the true acceptance rate was the proportion of true biomarkers and relevant false positive biomarkers having their absolute values of log2 fold‐change higher than the cutoff, whereas the false acceptance rate was the proportion of irrelevant false positive biomarkers having their values higher than the cutoff Information of the two sets of metabolite standards spiked into the human pooled plasma The number of metabolomic features depended on the signal‐to‐noise ratio threshold (snthresh) used for feature extraction (Table 2). A snthresh of 5 had been widely used (Table S3). Using a snthresh of 5 and a false discovery rate (FDR) cutoff of 5% for data mining, 22 true biomarkers (i.e., true positives) and 165 false positive biomarkers were observed (Table 3; Table S4). Therefore, the actual FDR (i.e., the false positive rate) was 88% instead of 5% (Table 2). Increasing the snthresh and reducing the sample size (e.g., n = 6 for each group) could decrease the number of false positive biomarkers. However, the actual FDR remained > 60% (Table 2; Table S5). We also performed a negative control experiment using two groups of plasma samples (n = 12 for each group) having identical metabolome contents. No differential metabolomic features were found (Supplemental Table S6). This indicated that our experimental methods did not suffer from any obvious pre‐analytical biases and analytical biases.
TABLE 2

Summary of the biomarker discovery results (FDR = 5%) obtained by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12)

snthresh (signal‐to‐noise ratio threshold) for metabolomic feature extractionTotal number of metabolomic features a (M)Number of differential metabolomic features, i.e., putative biomarkers (PB)Number of true positives i.e., true biomarkers identified as putative biomarkers (TP)Number of false negatives (FN = 22 – TP)Number of false positives, i.e., putative biomarkers which were not the true biomarkers (FP = PB – TP)Percentage of metabolomic features which were false positives (FP / M)False Negative Rate for the true biomarkers (FNR = FN / 22)Actual False Discovery Rate, i.e., False Positive Rate for the putative biomarkers (FPR = FP / PB)
20402732115212.9%4.5%71.2%
10578942207212.5%0%76.5%
580518722016520.5%0%88.2%

The metabolomic features were cleaned up using CAMERA and MS‐FLO. Features absent in 80% of the samples were discarded. Then features corresponding to the features with CV > 30% in the QC samples were discarded. Moreover, features contributed by the impurities in the metabolomic standards were tracked and excluded from the calculations.

TABLE 3

Summary of 22 true biomarkers among the 187 putative biomarkers discovered by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12). The metabolomic features were extracted using a snthresh value = 5

Normalized signal intensity (mean ± SD)
IdentityMetabolite standards, Set A or Set BRetention timeObserved m/zTheoretical m/zMass error (ppm)DiseasedNon‐diseasedBH adjusted P‐valueFold‐change (Diseased /Non‐diseased)
HypoxanthineB2.8137.04590137.045780.918.9E+07±1.1E+071.9E+08±1.1E+071.8E‐080.5
CreatinineB3.4114.06622114.066190.286.7E+08±7.4E+071.9E+09±7.4E+071.7E‐170.4
L‐TryptophanB5.3205.09714205.09715−0.073.1E+07±2.0E+069.8E+07±2.0E+061.6E‐150.3
L‐PhenylalanineA5.4166.08625166.08626−0.031.6E+08±7.0E+065.3E+07±7.0E+062.3E‐203.0
L‐LeucineA6.0132.10191132.10191−0.011.7E+08±1.5E+075.9E+07±1.5E+073.3E‐152.9
L‐Valine [M+2Na‐H]+ A7.4162.04997162.05014−1.081.7E+07±2.0E+067.6E+06±2.0E+061.2E‐112.3
BetaineB7.7118.08623118.08626−0.243.9E+08±2.9E+072.0E+09±2.9E+072.7E‐190.2
L‐ProlineB7.8116.07076116.070611.298.7E+07±5.5E+063.1E+08±5.5E+061.0E‐130.3
D‐Glucose [M+Na]+ A7.5203.05258203.05261−0.133.8E+07±1.4E+071.4E+07±1.4E+074.8E‐052.7
CarnitineA8.5162.11239162.11247−0.522.4E+09±2.3E+089.1E+08±2.3E+081.9E‐142.7
L‐AlanineA8.690.0549690.054960.043.7E+07±3.0E+061.7E+07±3.0E+064.3E‐142.2
CreatineA8.7132.07674132.07675−0.052.7E+08±1.3E+071.1E+08±1.3E+073.4E‐172.4
L‐ThreonineB8.9120.06552120.065520.035.7E+06±9.1E+052.1E+07±9.1E+055.7E‐120.3
L‐GlycineB9.176.0393076.03931−0.193.0E+06±1.4E+057.4E+06±1.4E+057.4E‐170.4
L‐SerineA9.4106.04984106.04987−0.259.1E+06±1.2E+063.4E+06±1.2E+061.1E‐112.6
L‐GlutamineA9.3147.07636147.07642−0.436.7E+07±1.2E+073.2E+07±1.2E+077.1E‐082.1
L‐Glutamic acidB9.4148.06034148.06043−0.582.0E+07±2.4E+062.8E+07±2.4E+066.4E‐040.7
L‐AsparagineB9.5133.06075133.06077−0.191.3E+06±2.8E+055.2E+06±2.8E+051.1E‐090.2
L‐Aspartic acid [M+2Na‐H]+ A10.0178.00862178.00868−0.301.4E+06±5.6E+055.1E+05±5.6E+055.8E‐042.7
L‐ArginineB10.4175.11884175.11895−0.621.2E+08±1.4E+073.8E+08±1.4E+071.3E‐160.3
L‐HistidineA10.4156.07675156.076750.007.6E+07±3.6E+062.8E+07±3.6E+062.4E‐192.7
L‐LysineB10.5147.11273147.11281−0.702.4E+07±2.4E+067.2E+07±2.4E+062.1E‐140.3
Summary of the biomarker discovery results (FDR = 5%) obtained by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12) The metabolomic features were cleaned up using CAMERA and MS‐FLO. Features absent in 80% of the samples were discarded. Then features corresponding to the features with CV > 30% in the QC samples were discarded. Moreover, features contributed by the impurities in the metabolomic standards were tracked and excluded from the calculations. Summary of 22 true biomarkers among the 187 putative biomarkers discovered by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12). The metabolomic features were extracted using a snthresh value = 5 We revealed identities of 93% (154) of the false positive biomarkers (Table 4; Table S4). Eighty‐five percent (141) of the false positive biomarkers were contributed by in‐source fragmentation products (29), in‐source complexes (23), adducts (79), and isotopes (10) of the true biomarkers (Figure 1B–1G), whereas 8% (13) were contributed by metabolites irrelevant to the true biomarkers (Figure S1). We named these two types of false positive biomarkers as “relevant false positive biomarkers” and “irrelevant false positive biomarkers,” respectively. Those adducts, in‐source complexes, etc. should be considered as false positives because their signal intensities cannot be accurately measured in biological specimens.
TABLE 4

Origins of 165 false positive biomarkers obtained using a signal‐to‐noise ratio threshold value of 5

OriginsClassificationNumber of false positive biomarkers (number of them having retention times overlapping with those of spiked metabolite standards)
Spiked metabolite standardsIn source fragmentation products29 (29)
Adducts79 (79)
Isotopes10 (10)
In‐source complex23 (23)
Subtotal 141 (141)
Irrelevant metabolitesPrincipal ion11 (7)
In source fragmentation products0 (0)
Adducts2 (2)
Isotopes0 (0)
In‐source complex0 (0)
Subtotal 13 (9)
Unidentified 11 (9)
Total 165 (159)
Origins of 165 false positive biomarkers obtained using a signal‐to‐noise ratio threshold value of 5 Metabolomic features corresponding to 88 (62%) relevant false positive biomarkers were observed in the LC‐HRMS profiles of the respective mixtures of the pure metabolites (i.e., Set A and Set B, Table S7). This confirmed they were originated from the true biomarkers. Although CAMERA and MS‐FLO were used to annotate and remove redundant peaks, redundant features still existed in the final list of metabolomic features (Table S4). Similar annotation mistakes could be found in the previous studies. , It was not possible to differentiate the relevant false positive biomarkers from the true biomarkers according to signal intensity change (Figure 1H). However, they could be identified through manual inspection of the MS and MS/MS spectra. A true biomarker and its relevant false positive biomarkers shared nearly the same retention time and common MS/MS fragmentation products (Figures 1B–1G). For the irrelevant false positive biomarkers, one possible cause for their presence was matrix effect. In electrospray ionization MS, matrix effect is characterized as an alteration of signal response by co‐eluting substances. Nine (69%) of 13 irrelevant false positive biomarkers had their retention times overlapping with the spiked metabolites (Table 4). Since the negative control experiment did not find any differential metabolomic features, our data suggested that increasing the plasma levels of 22 metabolites (11 in each study group) by about two‐folds were sufficient to alter the “matrix” significantly. Fortunately, fold‐changes of the irrelevant false positive biomarkers were significantly lower than those of the true biomarkers and the relevant false positive biomarkers (Figure 1H). A fold‐change cutoff of 1.5 could reject 11 of 13 irrelevant false positive biomarkers in the present study, thus suggesting that any differential metabolomic features with a fold‐change < 1.5 have a high chance of being false positive. Unlike real biomarker discovery studies, in the present study “true” biomarkers and other metabolites were identical among the plasma samples within each study group. Such “homogeneous” design allowed us to identify artifacts unambiguously using a small sample size. With respect to the complexity of metabolomes among patient specimens in real life, the false positive rate of putative metabolomic biomarkers in a real study could be higher or lower than the value reported by us. The amount of false positive biomarkers should depend on various factors, such as number, quantity, and properties of true biomarkers, complex interactions between the true biomarkers and background metabolites in individual specimens, detection sensitivity of the MS platform, and the study sample size. We showed that larger sample size (n = 12 vs. n = 6 for each group) resulted in more false positive biomarkers because of higher statistical power. In general, about 50 cases or more for each study group are needed for reliable biomarker discovery. We speculate that with sufficient statistical power, similar types of false positive biomarkers could be identified. To our knowledge, this study is the first to demonstrate that the use of LC‐HRMS based metabolomic profiling for metabolite biomarker discovery are susceptible to false discovery. Without knowing the identities, it is risky to treat any statistically differential features as putative biomarkers. Relevant false positive biomarkers could be identified through careful manual inspection of the retention time, MS, and MS/MS information. Informatics tools which can accurately and effectively annotate the LC‐HRMS features need to be developed. Irrelevant false positive biomarkers could be minimized by using an appropriate fold‐change cutoff (Figure 1I). Other approaches, such as multicenter study, validation using an independent patient cohort and orthogonal chemical analysis, should also help eliminate the false positive biomarkers.

CONFLICT OF INTEREST

The authors declare no conflict of interest.

AVAILABILITY OF DATA AND MATERIAL

All supplementary tables and figures are provided with the manuscript. The LC‐HRMS raw data files of the present study is available at the NIH Common Fund's National Metabolomics Data Repository (NMDR) website, the Metabolomics Workbench, https://www.metabolomicsworkbench.org, where it has been assigned Project ID PR000757. The data can be accessed directly via it's Project DOI: 10.21228/M8TQ2B. SUPPORTING INFORMATION Click here for additional data file.
  10 in total

Review 1.  Matrix effects: the Achilles heel of quantitative high-performance liquid chromatography-electrospray-tandem mass spectrometry.

Authors:  Paul J Taylor
Journal:  Clin Biochem       Date:  2005-04       Impact factor: 3.281

Review 2.  A review of validated biomarkers obtained through metabolomics.

Authors:  Ángeles López-López; Ángeles López-Gonzálvez; Tomás Clive Barker-Tejeda; Coral Barbas
Journal:  Expert Rev Mol Diagn       Date:  2018-06-06       Impact factor: 5.225

3.  Systems-Level Annotation of a Metabolomics Data Set Reduces 25 000 Features to Fewer than 1000 Unique Metabolites.

Authors:  Nathaniel G Mahieu; Gary J Patti
Journal:  Anal Chem       Date:  2017-09-15       Impact factor: 6.986

4.  CROP: correlation-based reduction of feature multiplicities in untargeted metabolomic data.

Authors:  Štěpán Kouřil; Julie de Sousa; Jan Václavík; David Friedecký; Tomáš Adam
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

Review 5.  Implementation of liquid chromatography-high resolution mass spectrometry methods for untargeted metabolomic analyses of biological samples: A tutorial.

Authors:  Julian Pezzatti; Julien Boccard; Santiago Codesido; Yoric Gagnebin; Abhinav Joshi; Didier Picard; Víctor González-Ruiz; Serge Rudaz
Journal:  Anal Chim Acta       Date:  2020-01-02       Impact factor: 6.558

Review 6.  Metabolite Measurement: Pitfalls to Avoid and Practices to Follow.

Authors:  Wenyun Lu; Xiaoyang Su; Matthias S Klein; Ian A Lewis; Oliver Fiehn; Joshua D Rabinowitz
Journal:  Annu Rev Biochem       Date:  2017-06-20       Impact factor: 23.643

7.  Avoiding misannotation of in-source fragmentation products as cellular metabolites in liquid chromatography-mass spectrometry-based metabolomics.

Authors:  Yi-Fan Xu; Wenyun Lu; Joshua D Rabinowitz
Journal:  Anal Chem       Date:  2015-01-27       Impact factor: 6.986

8.  Translational biomarker discovery in clinical metabolomics: an introductory tutorial.

Authors:  Jianguo Xia; David I Broadhurst; Michael Wilson; David S Wishart
Journal:  Metabolomics       Date:  2012-12-04       Impact factor: 4.290

9.  CliqueMS: a computational tool for annotating in-source metabolite ions from LC-MS untargeted metabolomics data based on a coelution similarity network.

Authors:  Oriol Senan; Antoni Aguilar-Mogas; Miriam Navarro; Jordi Capellades; Luke Noon; Deborah Burks; Oscar Yanes; Roger Guimerà; Marta Sales-Pardo
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

  10 in total
  1 in total

Review 1.  Strategies for structure elucidation of small molecules based on LC-MS/MS data from complex biological samples.

Authors:  Zhitao Tian; Fangzhou Liu; Dongqin Li; Alisdair R Fernie; Wei Chen
Journal:  Comput Struct Biotechnol J       Date:  2022-09-07       Impact factor: 6.155

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.