| Literature DB >> 34185426 |
Pengwei Zhang1,2, Irene L Ang2, Melody M T Lam3, Rui Wei2, Kate M K Lei2, Xingwang Zhou4, Henry H N Lam5, Qing-Yu He1, Terence C W Poon2.
Abstract
Entities:
Mesh:
Substances:
Year: 2021 PMID: 34185426 PMCID: PMC8236120 DOI: 10.1002/ctm2.469
Source DB: PubMed Journal: Clin Transl Med ISSN: 2001-1326
FIGURE 1The experimental workflow of the present study (A), MS and MS/MS spectra of representative relevant false positive biomarkers (B – G), and the observed change‐folds of true biomarkers and false positive biomarkers (H, I). (A) When using real patient samples for biomarker discovery, it is not possible to tell which putative biomarkers are false, and it is also not possible to avoid pre‐analytical biases. To overcome these two problems, two groups of plasma samples (12 samples in each group) mimicking those collected from a disease group and a non‐disease group were created by spiking separately with two different sets of 11 metabolite standards into a preparation of pooled human plasma. Each of the 24 plasma samples (i.e., 12 from the disease group and 12 from the non‐disease group) was treated as a unique patient sample and subjected to separate metabolite extraction and LC‐HRMS based untargeted metabolomics profiling. XCMS was employed for feature extraction, grouping and retention time alignment. CAMERA and MS‐FLO were used to annotate and remove the redundant peaks. At a false discovery rate of ≤ 5%, metabolomic features with statistically significant differential intensities were regarded as putative biomarkers. By the design, a maximum of 22 putative biomarkers could be true, and the rest of the putative biomarkers were false. Six relevant false positive biomarkers which were identified as the in‐source fragmentation products (B, C) and adducts (D,E) of a true biomarker (e.g., Tryptophan, Trp). The retention times of the in‐source fragmentation products (B) and adducts (D) was the same as that of the original metabolite. The m/z values of the in‐source fragmentation products (B) identical to the m/z value of one of the peaks in the MS/MS spectrum of the original metabolite (C). The MS/MS fragmentation patterns of the adducts (D) and the principal ion of original metabolite shared common fragment ions (E). An example of a relevant false positive biomarker which was identified as in‐source complex of a true biomarker (Valine) and an endogenous metabolite (Taurine) (F). The extracted ion chromatogram (10 ppm tolerance) of valine (upper panel), taurine (middle panel) and in‐source complex of valine and taurine (lower panel) (F), and the MS/MS fragmentation spectra showed fragments of the relevant false positive biomarker [Taurine+Valine+2Na+H‐2H]+ (G). The box‐plots (H) of the absolute log2 fold‐change values of the true biomarkers, relevant false positive biomarkers and irrelevant false positive biomarkers and their comparisons (Mann Whitney test, two‐tailed P‐values). ROC curve (I) for differentiation the true biomarkers and relevant false positive biomarkers from the irrelevant false positive biomarkers using various cutoffs of the absolute value of log2 fold‐change. For each cutoff, the true acceptance rate was the proportion of true biomarkers and relevant false positive biomarkers having their absolute values of log2 fold‐change higher than the cutoff, whereas the false acceptance rate was the proportion of irrelevant false positive biomarkers having their values higher than the cutoff
Information of the two sets of metabolite standards spiked into the human pooled plasma
| Metabolite standard | Set | Reported concentration in human plasma (μM) | Estimated mean concentration (μM) in human plasma | Spiked amount (μM) | Supplemental References |
|---|---|---|---|---|---|
| L‐Leucine | A | 123 ± 25, 140, 66 to 170 | 125 | 250 | [17–19] |
| Creatine | A | 44 ± 28, 22.5 to 62 | 55 | 110 | [20, 21] |
| L‐Histidine | A | 82 ± 10, 78.8 ± 18.9 | 80 | 160 | [17, 22] |
| L‐Phenylalanine | A | 57 ± 9, 70 ± 10.98, 63.0 ± 11.7 | 65 | 130 | [17, 23, 24] |
| D‐(+)‐Glucose | A | 4700 to 6100, 5770 | 5000 | 10000 | [17, 25] |
| L‐Aspartic Acid | A | 54 ± 4, 3 ± 1, 19.3 ± 4.8, 30 ± 14 | 20 | 40 | [17, 22, 26, 27] |
| L‐Valine | A | 233 ± 43, 212.0 ± 61 | 230 | 460 | [17, 28] |
| L‐Alanine | A | 333 ± 74 | 330 | 660 | [17] |
| L‐Serine | A | 114 ± 19, 258 ± 27, 104 ± 26, 159.8 ± 26.6 | 140 | 280 | [17, 26–28] |
| L‐Carnitine | A | 25.4 to 54.1, 43.6 ± 9.9, 50 ± 30 | 45 | 90 | [23, 29, 30] |
| L‐Glutamine | A | 586 ± 84, 590 | 590 | 1180 | [17, 31] |
| Creatinine | B | 35 to 122, 74.12 ± 10.91 | 75 | 150 | [32, 33] |
| L‐Proline | B | 168 ± 60, 339 ± 51, 163.6 | 240 | 480 | [17, 26, 34] |
| Betaine | B | 20 to 144, 57.0 ± 15.4 | 75 | 150 | [17, 35, 36] |
| L‐Glycine | B | 230 ± 52, 255.4 ± 65.9 | 230 | 460 | [17, 31] |
| L‐Arginine | B | 110 ± 21.4, 99.0 ± 22.8, 82.2 to 140.9 | 110 | 220 | [23, 37, 38] |
| L‐Threonine | B | 140 ± 33, 127.9 ± 28.2 | 140 | 280 | [17, 39] |
| L‐Tryptophan | B | 67 to 72, 44 ± 7 | 50 | 100 | [26, 40] |
| L‐Lysine | B | 188 ± 32, 150 ± 2.1, 183 ± 39 | 180 | 360 | [17, 41, 42] |
| L‐Glutamic Acid | B | 24 ± 15, 87.0 ± 37.1, 72.0 ± 10.1 | 85 | 170 | [17, 22, 43] |
| L‐Asparagine | B | 41 ± 10, 40.0 | 40 | 80 | [17, 44] |
| Hypoxanthine | B | 8.14 ± 2.86, 11.02 ± 3.67 | 10 | 20 | [45, 46] |
Summary of the biomarker discovery results (FDR = 5%) obtained by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12)
|
| Total number of metabolomic features | Number of differential metabolomic features, i.e., putative biomarkers (PB) | Number of true positives i.e., true biomarkers identified as putative biomarkers (TP) | Number of false negatives (FN = 22 – TP) | Number of false positives, i.e., putative biomarkers which were not the true biomarkers (FP = PB – TP) | Percentage of metabolomic features which were false positives (FP / M) | False Negative Rate for the true biomarkers (FNR = FN / 22) | Actual False Discovery Rate, i.e., False Positive Rate for the putative biomarkers (FPR = FP / PB) |
|---|---|---|---|---|---|---|---|---|
| 20 | 402 | 73 | 21 | 1 | 52 | 12.9% | 4.5% | 71.2% |
| 10 | 578 | 94 | 22 | 0 | 72 | 12.5% | 0% | 76.5% |
| 5 | 805 | 187 | 22 | 0 | 165 | 20.5% | 0% | 88.2% |
The metabolomic features were cleaned up using CAMERA and MS‐FLO. Features absent in 80% of the samples were discarded. Then features corresponding to the features with CV > 30% in the QC samples were discarded. Moreover, features contributed by the impurities in the metabolomic standards were tracked and excluded from the calculations.
Summary of 22 true biomarkers among the 187 putative biomarkers discovered by comparing the metabolomic profiles of plasma samples mimicking those collected from diseased subjects (n = 12) and non‐diseased subjects (n = 12). The metabolomic features were extracted using a snthresh value = 5
| Normalized signal intensity (mean ± SD) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Identity | Metabolite standards, Set A or Set B | Retention time | Observed m/z | Theoretical m/z | Mass error (ppm) | Diseased | Non‐diseased | BH adjusted | Fold‐change (Diseased /Non‐diseased) |
| Hypoxanthine | B | 2.8 | 137.04590 | 137.04578 | 0.91 | 8.9E+07±1.1E+07 | 1.9E+08±1.1E+07 | 1.8E‐08 | 0.5 |
| Creatinine | B | 3.4 | 114.06622 | 114.06619 | 0.28 | 6.7E+08±7.4E+07 | 1.9E+09±7.4E+07 | 1.7E‐17 | 0.4 |
| L‐Tryptophan | B | 5.3 | 205.09714 | 205.09715 | −0.07 | 3.1E+07±2.0E+06 | 9.8E+07±2.0E+06 | 1.6E‐15 | 0.3 |
| L‐Phenylalanine | A | 5.4 | 166.08625 | 166.08626 | −0.03 | 1.6E+08±7.0E+06 | 5.3E+07±7.0E+06 | 2.3E‐20 | 3.0 |
| L‐Leucine | A | 6.0 | 132.10191 | 132.10191 | −0.01 | 1.7E+08±1.5E+07 | 5.9E+07±1.5E+07 | 3.3E‐15 | 2.9 |
| L‐Valine [M+2Na‐H]+ | A | 7.4 | 162.04997 | 162.05014 | −1.08 | 1.7E+07±2.0E+06 | 7.6E+06±2.0E+06 | 1.2E‐11 | 2.3 |
| Betaine | B | 7.7 | 118.08623 | 118.08626 | −0.24 | 3.9E+08±2.9E+07 | 2.0E+09±2.9E+07 | 2.7E‐19 | 0.2 |
| L‐Proline | B | 7.8 | 116.07076 | 116.07061 | 1.29 | 8.7E+07±5.5E+06 | 3.1E+08±5.5E+06 | 1.0E‐13 | 0.3 |
| D‐Glucose [M+Na]+ | A | 7.5 | 203.05258 | 203.05261 | −0.13 | 3.8E+07±1.4E+07 | 1.4E+07±1.4E+07 | 4.8E‐05 | 2.7 |
| Carnitine | A | 8.5 | 162.11239 | 162.11247 | −0.52 | 2.4E+09±2.3E+08 | 9.1E+08±2.3E+08 | 1.9E‐14 | 2.7 |
| L‐Alanine | A | 8.6 | 90.05496 | 90.05496 | 0.04 | 3.7E+07±3.0E+06 | 1.7E+07±3.0E+06 | 4.3E‐14 | 2.2 |
| Creatine | A | 8.7 | 132.07674 | 132.07675 | −0.05 | 2.7E+08±1.3E+07 | 1.1E+08±1.3E+07 | 3.4E‐17 | 2.4 |
| L‐Threonine | B | 8.9 | 120.06552 | 120.06552 | 0.03 | 5.7E+06±9.1E+05 | 2.1E+07±9.1E+05 | 5.7E‐12 | 0.3 |
| L‐Glycine | B | 9.1 | 76.03930 | 76.03931 | −0.19 | 3.0E+06±1.4E+05 | 7.4E+06±1.4E+05 | 7.4E‐17 | 0.4 |
| L‐Serine | A | 9.4 | 106.04984 | 106.04987 | −0.25 | 9.1E+06±1.2E+06 | 3.4E+06±1.2E+06 | 1.1E‐11 | 2.6 |
| L‐Glutamine | A | 9.3 | 147.07636 | 147.07642 | −0.43 | 6.7E+07±1.2E+07 | 3.2E+07±1.2E+07 | 7.1E‐08 | 2.1 |
| L‐Glutamic acid | B | 9.4 | 148.06034 | 148.06043 | −0.58 | 2.0E+07±2.4E+06 | 2.8E+07±2.4E+06 | 6.4E‐04 | 0.7 |
| L‐Asparagine | B | 9.5 | 133.06075 | 133.06077 | −0.19 | 1.3E+06±2.8E+05 | 5.2E+06±2.8E+05 | 1.1E‐09 | 0.2 |
| L‐Aspartic acid [M+2Na‐H]+ | A | 10.0 | 178.00862 | 178.00868 | −0.30 | 1.4E+06±5.6E+05 | 5.1E+05±5.6E+05 | 5.8E‐04 | 2.7 |
| L‐Arginine | B | 10.4 | 175.11884 | 175.11895 | −0.62 | 1.2E+08±1.4E+07 | 3.8E+08±1.4E+07 | 1.3E‐16 | 0.3 |
| L‐Histidine | A | 10.4 | 156.07675 | 156.07675 | 0.00 | 7.6E+07±3.6E+06 | 2.8E+07±3.6E+06 | 2.4E‐19 | 2.7 |
| L‐Lysine | B | 10.5 | 147.11273 | 147.11281 | −0.70 | 2.4E+07±2.4E+06 | 7.2E+07±2.4E+06 | 2.1E‐14 | 0.3 |
Origins of 165 false positive biomarkers obtained using a signal‐to‐noise ratio threshold value of 5
| Origins | Classification | Number of false positive biomarkers (number of them having retention times overlapping with those of spiked metabolite standards) |
|---|---|---|
| Spiked metabolite standards | In source fragmentation products | 29 (29) |
| Adducts | 79 (79) | |
| Isotopes | 10 (10) | |
| In‐source complex | 23 (23) | |
|
|
| |
| Irrelevant metabolites | Principal ion | 11 (7) |
| In source fragmentation products | 0 (0) | |
| Adducts | 2 (2) | |
| Isotopes | 0 (0) | |
| In‐source complex | 0 (0) | |
|
|
| |
| Unidentified |
| |
|
|
|