| Literature DB >> 20957145 |
Judith R Denery1, Ashlee A K Nunes, Mark S Hixon, Tobin J Dickerson, Kim D Janda.
Abstract
BACKGROUND: Development of robust, sensitive, and reproducible diagnostic tests for understanding the epidemiology of neglected tropical diseases is an integral aspect of the success of worldwide control and elimination programs. In the treatment of onchocerciasis, clinical diagnostics that can function in an elimination scenario are non-existent and desperately needed. Due to its sensitivity and quantitative reproducibility, liquid chromatography-mass spectrometry (LC-MS) based metabolomics is a powerful approach to this problem. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20957145 PMCID: PMC2950146 DOI: 10.1371/journal.pntd.0000834
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Summary of samples analyzed in this study.
| Country of origin | Matrix | Number/description | Clinical pathology | Nodulectomy results | Average mf per mg |
| Cameroon | Plasma | 16 | Palpable nodule(s) | All nodules contained live worms | — |
| 18 uninfected controls | — | NA | 0 | ||
| 1 calcified worm | Palpable nodule(s) | 1 dead, calcified worm | — | ||
| 2 ambiguous lipoma | Palpable nodule(s) | Fat deposit | — | ||
| 1 indeterminant infection status | — | — | — | ||
| Ghana | Serum | 15 | Palpable nodule(s) | — | 12 |
| 10 | Acute papular onchodermatitis | — | + | ||
| Liberia | Serum | 10 | — | — | 527 |
| Guatemala | Serum | 21 | Palpable nodule(s) | 24% of nodules sampled all worms dead | — |
| 17 uninfected controls | — | NA | — | ||
| 6 | — | NA | — | ||
| 6 | — | NA | — | ||
| 5 | — | NA | — | ||
| India | Plasma | 4 | — | NA | — |
| USA | Plasma | 3 | — | NA | — |
| Serum | 3 | — | NA | — |
— indicates no measurement made.
no mf were measured in samples collected from four different anatomical locations from the 18 control individuals included in this analysis.
Average number of mf measured per patient in original study [9] using two skin snips [49].
Samples are known to be mf+, but records are not available.
Average number of mf collected from six different anatomical locations from the 10 patients included in this analysis.
Figure 1Schematic diagram of the LC-MS based metabolomic workflow.
Wherein the multi-region serum and plasma samples are extracted and analyzed within a single sequence on the ESI-TOF in positive mode. Mass spectral data is preprocessed with XCMS software and multivariate statistical analysis and machine learning classification algorithms are used to distinguish patterns in the data and provide a binary output to the classification of samples. ROC curves are used to quantify the relationship between sensitivity and specificity for a given test. Ultimately, this information can be used in an iterative fashion to interrogate larger datasets and provide necessary diagnostic information to better characterize the disease status of clinical samples from a variety of geographic regions.
Characteristics of the 14 candidate biomarkers.
| Compound Classification | p-value | RT (min) | XCMS average | Fold change | Molecular formula | MS/MS major fragments (% abundance) | FTMS accurate mass |
| Fatty acid/Sterol lipid | 6.32×10−13 | 45.7 | 521.4197 | −3.36 | C32H56O5 | 111.0451(39.15)503.4123(29.7) | 521.4190 |
| Fatty acid/Sterol lipid | 2.06×10−11 | 45.7 | 469.3872 | −3.71 | C28H52O5 | 415.357(100)291.2331(48.57) | 469.3888 |
| Sterol lipid | 2.16×10−11 | 41.4 | 425.3611 | −3.55 | C26H48O4 | 389.3432(100.0)139.1107(12.8) | 425.3625 |
| Protein | 3.59×10−10 | 31.6 | 979.9368 | −3.99 | - | ||
| Protein | 6.53×10−10 | 31.5 | 986.2677 | −5.65 | - | ||
| Hexacosenoic acid | 4.01×10−10 | 50.7 | 395.3867 | −2.77 | C26H50O2 | 71.0859(100.0)57.0709(81.99) | 395.3804 |
| Pentacosenoic acid | 7.75×10−10 | 49.1 | 381.3710 | −2.43 | C25H48O2 | 71.0858(100.0)57.0719(58.75) | 381.3728 |
| Fatty alcohol/aldehyde | 2.39×10−8 | 48.5 | 241.2505 | 1.54 | C16H32O | 55.0551(77.61)83.0877(46.81) | — |
| Fatty acid | 1.40×10−9 | 39.0 | 367.2840 | −2.28 | C22H38O4 | 331.2649(100.0)79.0547(33.89) | 367.2828 |
| Hydroxy-octadecenoic acid | 1.52×10−9 | 46.1 | 299.2581 | −2.54 | C18H34O3 | 95.0851(100)71.0863(82.07) | 299.2592 |
| Phosphorylated sphingolipid | 4.83×10−9 | 30.0 | 352.2256 | −1.55 | C16H34NO5P | 236.2366(100.0)184.0694(25.99) | 352.2247 |
| Sterol lipid | 1.44×10−8 | 45.5 | 447.3470 | −2.19 | C28H46O4 | 429.3376(43.06)411.3276(32.91) | 447.3470 |
| Protein | 2.05×10−8 | 33.3 | 966.5938 | −3.09 | - | ||
| Protein | 5.24×10−8 | 31.7 | 1086.2922 | −2.76 | - |
Fragments collected under a collision-induced dissociation energy of 20 eV.
FTMS accurate mass was not obtained for this compound. This formula is based on TOF-MS mass accuracy.
Statistical values such as p-value and fold change were determined by XCMS analysis of the O. volvulus + and O. volvulus − mass spectral data files. Retention time (RT), and mass to charge value (m/z), fold change and the direction of overall ion intensity change, represents the average value across all files. Molecular formula and compound class identifier as determined by MS/MS and FTMS analysis is provided.
Figure 2PCA factor score plots of MS peak intensity values for the 14 candidate onchocerciasis biomarkers.
Mass feature intensity values were extracted through ESI-TOF+/XCMS analysis of (A) African blood serum and plasma samples from 55 O. volvulus infected individuals compared against 18 healthy controls. (B) A sample set including 76 O. volvulus infected individuals compared against 56 O. volvulus negative controls (including healthy and those infected with other tropical diseases). (C) An extraction of only those data points representing the 21 Guatemala O. volvulus infected individuals compared against 18 healthy controls. Individual data points are symbolized using the following code for country of origin and disease status: “blue diamond” = Cameroon Ov−, “blue circle” = Guatemala Ov−, “blue astrisk” = Scripps Ov−, “green circle” = Leishmaniasis Ov−, “green square” = Chagas Ov−, “green triangle” = LF Ov−, “pink diamond” = Cameroon Ov+, “pink circle” = Guatemala Ov+, “pink square” = Ghana Ov+ (1986, 1991 and 2003 samples), “pink triangle” = Liberia Ov+, “orange diamond” = Cameroon Ov?.
Summary of the diagnostic accuracy of the machine learning algorithm analysis.
| Algorithm | Classifier type | Entire sample set | Africa samples | ||||
| Sensitivity | Specificity | ROC area | Sensitivity | Specificity | ROC area | ||
| BayesNet | Bayesian network | 84.8 | 87 | 0.929 | 97.3 | 99.1 | 1 |
| NaiveBayes | Bayesian network | 88.6 | 88.3 | 0.930 | 94.5 | 98.2 | 1 |
| Logistic | Logistic regression | 85.6 | 85.2 | 0.898 | 98.6 | 99.6 | 0.999 |
| IB1 | Nearest neighbor | 87.1 | 83.9 | 0.855 | 98.6 | 95.8 | 0.972 |
| OneR | Minimum error attribute | 76.5 | 76.6 | 0.766 | 89.0 | 85.2 | 0.871 |
| Multilayer perceptron | Backpropagation classification | 87.9 | 85.9 | 0.921 | 98.6 | 95.8 | 1 |
| FLR | Fuzzy lattice reasoning | 81.1 | 83.7 | 0.824 | 98.6 | 99.6 | 0.991 |
| Functional trees | Classification tree | 84.1 | 83.6 | 0.861 | 100 | 100 | 1 |
| Random forest | Classification tree | 88.6 | 89.3 | 0.954 | 97.3 | 95.4 | 0.997 |
The mass spectral ion intensities of the top 14 candidate onchocerciasis biomarkers from onchocerciasis infected and uninfected samples were compared between the multi-region sample set and the African blood samples. All results were obtained using a 10 fold cross validation analysis.