| Literature DB >> 30065970 |
Daniel P Howsmon1,2, Troy Vargason2,3, Robert A Rubin4, Leanna Delhey5,6, Marie Tippett5,6, Shannon Rose5,6, Sirish C Bennuri5,6, John C Slattery6, Stepan Melnyk6, S Jill James6, Richard E Frye7, Juergen Hahn1,2,3.
Abstract
Autism spectrum disorder (ASD) is a developmental disorder which is currently only diagnosed through behavioral testing. Impaired folate-dependent one carbon metabolism (FOCM) and transsulfuration (TS) pathways have been implicated in ASD, and recently a study involving multivariate analysis based upon Fisher Discriminant Analysis returned very promising results for predicting an ASD diagnosis. This article takes another step toward the goal of developing a biochemical diagnostic for ASD by comparing five classification algorithms on existing data of FOCM/TS metabolites, and also validating the classification results with new data from an ASD cohort. The comparison results indicate a high sensitivity and specificity for the original data set and up to a 88% correct classification of the ASD cohort at an expected 5% misclassification rate for typically-developing controls. These results form the foundation for the development of a biochemical test for ASD which promises to aid diagnosis of ASD and provide biochemical understanding of the disease, applicable to at least a subset of the ASD population.Entities:
Keywords: autism spectrum disorder; biomarkers; multivariate statistical analysis
Year: 2018 PMID: 30065970 PMCID: PMC6063877 DOI: 10.1002/btm2.10095
Source DB: PubMed Journal: Bioeng Transl Med ISSN: 2380-6761
Variable identifiers (ID) and names
| ID | Variable name | ID | Variable name |
|---|---|---|---|
|
| Methionine |
| GSSG |
|
| SAM |
| fGSH/GSSG |
|
| SAH |
| tGSH/GSSG |
|
| SAM/SAH |
| 3‐ClT |
|
| Adenosine |
| 3‐NT |
|
| Homocysteine |
| Tyrosine |
|
| tCysteine |
| Tryptophane |
|
| Glu‐Cys |
| fCystine |
|
| Cys‐Gly |
| fCysteine |
|
| tGSH |
| fCystine/fCysteine |
|
| fGSH |
| % oxidized glutathione |
Figure 1Comparison of fitted PDFs for (a) univariate, (b) PCA‐all, (c) FDA‐all, and (d) LR‐all
Figure 2Final CART tree fitted to the training data
Cross‐validation comparison of univariate and projection‐based multivariate classifiers
| Fitted | Cross‐validated confusion matrix | ||||||
|---|---|---|---|---|---|---|---|
| Classifier | Variables | C‐statistic |
| TP | FP | FN | TN |
| Univariate |
| 0.9159 | 0.01 | 23 | 1 | 60 | 75 |
| 0.05 | 47 | 3 | 36 | 73 | |||
| 0.10 | 65 | 6 | 18 | 70 | |||
| 0.20 | 71 | 13 | 12 | 63 | |||
| PCA‐all | All | 0.9706 | 0.01 | 56 | 1 | 27 | 75 |
| 0.05 | 77 | 3 | 6 | 73 | |||
| 0.10 | 80 | 6 | 3 | 70 | |||
| 0.20 | 80 | 16 | 3 | 60 | |||
| FDA‐all | All | 0.9915 | 0.01 | 50 | 1 | 33 | 75 |
| 0.05 | 81 | 5 | 2 | 71 | |||
| 0.10 | 81 | 13 | 2 | 63 | |||
| 0.20 | 82 | 19 | 1 | 57 | |||
| FDA‐sub |
| 0.9711 | 0.01 | 38 | 1 | 45 | 75 |
| 0.05 | 78 | 3 | 5 | 73 | |||
| 0.10 | 81 | 8 | 2 | 68 | |||
| 0.20 | 81 | 16 | 2 | 60 | |||
| LR‐all | All | 0.9972 | 0.01 | 73 | 6 | 10 | 70 |
| 0.05 | 78 | 8 | 5 | 68 | |||
| 0.10 | 79 | 11 | 4 | 65 | |||
| 0.20 | 81 | 19 | 2 | 57 | |||
| LR‐sub |
| 0.9757 | 0.01 | 21 | 1 | 62 | 75 |
| 0.05 | 79 | 6 | 4 | 70 | |||
| 0.10 | 81 | 8 | 2 | 68 | |||
| 0.20 | 81 | 15 | 2 | 61 | |||
The threshold is the percent membership in the TD class. TP refers to True Positive, FP to False Positive, FN to False Negative, and TN to True Negative.
Figure 3Predictions of the ASD validation data from the (a) PCA‐all, (b) FDA‐sub, and (c) LR‐sub models. Note that the validation data, “VAL,” only consists of data from a set of children with an ASD diagnosis and therefore significant overlap between VAL and ASD is expected and desired
Validation performance for the final biomarker models
| Classifier | Variables |
| TP | FN |
|---|---|---|---|---|
| Univariate |
| 0.05 | 68 | 86 |
| 0.10 | 112 | 42 | ||
| PCA‐all | All | 0.05 | 128 | 26 |
| 0.10 | 149 | 5 | ||
| FDA‐sub |
| 0.05 | 135 | 19 |
| 0.10 | 150 | 4 | ||
| LR‐sub |
| 0.05 | 122 | 32 |
| 0.10 | 139 | 15 | ||
| CART |
| – | 116 | 38 |
TP refers to True Positive and FN to False Negative. False Positive and True Negative are not needed for the validation set as it only consists of data from a set of children with an ASD diagnosis.