| Literature DB >> 30261000 |
Andrea S Martinez-Vernon1, James A Covington2, Ramesh P Arasaradnam3,4,5, Siavash Esfahani2, Nicola O'Connell3, Ioannis Kyrou6,7,8, Richard S Savage5,9.
Abstract
MOTIVATION: The measurement of disease biomarkers in easily-obtained bodily fluids has opened the door to a new type of non-invasive medical diagnostics. New technologies are being developed and fine-tuned in order to make this possibility a reality. One such technology is Field Asymmetric Ion Mobility Spectrometry (FAIMS), which allows the measurement of volatile organic compounds (VOCs) in biological samples such as urine. These VOCs are known to contain a range of information on the relevant person's metabolism and can in principle be used for disease diagnostic purposes. Key to the effective use of such data are well-developed data processing pipelines, which are necessary to extract the most useful data from the complex underlying biological structure.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30261000 PMCID: PMC6160042 DOI: 10.1371/journal.pone.0204425
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographics of the patients from this study.
| Diabetic patients | Volunteers | Overall | |
|---|---|---|---|
| Male | 43 | 17 | 60 |
| Female | 29 | 26 | 55 |
| Average age | 57 | 45 | 53 |
| Average alcohol usage | 2 | 5 | 3 |
| Average BMI (s.d.) | 38.3 (10.5) | 28.1 (5.96) | 34.5 (10.3) |
| Total | 72 | 43 | 115 |
We note that due to the challenges of constructing such a pilot study (our controls are healthy volunteers), there is some degree of demographic mismatch between the disease and control groups. The distributions of age/sex/alcohol use/body mass index (BMI) each overlap between the control and disease groups, but there are statistically significant differences in age and BMI. We can quantify the effect of age/sex/alcohol use/BMI as potential confounding biomarkers, using the ROC curve AUC statistic. To do this we treat each demographic covariate in turn as a single biomarker, which can then be used directly to compute the ROC curve (and hence AUC) due only to that potentially confounding factor. This results in the following AUC scores: Age AUC = 0.73 (0.62–0.83). Gender AUC = 0.6 (0.51–0.69). BMI AUC = 0.79 (0.71–0.87). Alcohol AUC = 0.67 (0.58–0.76). While the BMI result in particular is not ideal (and weakens the claims we are able to make with regards to diabetes diagnosis in this paper), we nevertheless note that the best–performing pipeline scores AUC = 0.85 and therefore offers some evidence that FAIMS data are worth pursuing for diabetes diagnosis. BMI is measured as kg/m2 and alcohol usage refers to the number of units of alcohol (measured as 10ml or 8g of pure alcohol) consumed per week.
Fig 1The general workflow of classifying FAIMS data into diseased or non-diseased classes.
The steps that were explored are indicated as dark blue boxes. Variations or specification of some steps are displayed at the sides. The order in which the steps and approaches were investigated differs from the order shown in the diagram. Consult the main text for a description of the order. Briefly, the pipeline was compared when using the data of different sample “runs” either individually or in ensembles. Different forms of discrete wavelet transforms (DWT) were considered, as well as a feature exclusion step based on the feature variance. Within the cross–validation cycle, we evaluated three different feature selection methods (filter, wrapper and embedded), as well as a post–filter selection principal component analysis (PCA) step and the inclusion of the demographic data as features. Finally, we also explored ensemble steps at the classifier model probability level. See main text for details and the order in which the pipeline was explored.
Fig 2The recommended pipeline for classifying FAIMS data into diseased or non-diseased classes resulting from this study.
We found that “run” 2 data with a 2D wavelet transform were the better performing steps prior to the feature selection. The filter method with an nKeep parameter value of 2 perform best and with minimal algorithm run time. The addition of the demographic data as features to the wavelet transform FAIMS data resulted in a higher AUC score, although it was not found to be a statistically significant finding. However, these data might prove informative in a larger-scale pilot analysis. Overall, no classifier model was found to out–compete the others and we therefore suggest to use all five, until further research determines a “clear winner”. See main text for details and discussion about our findings.
Fig 3Data visualisation.
(a) Heat map of FAIMS data for a diabetic patient. (b) Linearised data without wavelet transform. (c) Data with one–dimensional (1D) discrete wavelet transform (DWT). (d-f) show the equivalent plots for a member of the control group (volunteer).
Model performance comparison with the use of different runs.
| Run 1 | Run 2 | Run 3 | |
|---|---|---|---|
| AUC | 0.739 | 0.805 | |
| –CIs | (0.648–0.83) | (0.747–0.9) | (0.722–0.89) |
| Sensitivity | 0.528 | 0.625 | 0.833 |
| –CIs | (0.353–0.593) | (0.264–0.497) | (0.0892–0.273) |
| Specificity | 0.93 | 0.953 | 0.674 |
| –CIs | (0.0146–0.191) | (0.00568–0.158) | (0.191–0.485) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Model performance comparison of use of raw FAIMS data and wavelet-transformed FAIMS data.
| no DWT | 1D DWT | |
|---|---|---|
| AUC | 0.682 | |
| –CIs | (0.584–0.78) | (0.736–0.89) |
| Sensitivity | 0.444 | 0.569 |
| –CIs | (0.434–0.673) | (0.314–0.553) |
| Specificity | 0.907 | 0.977 |
| –CIs | (0.0259–0.221) | (0.000589–0.123) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Model performance comparison using different of 2D wavelet transforms.
| 2D DWT | 2D DWT (256 x 256) | 2D DWT (128 x 128) | |
|---|---|---|---|
| AUC | 0.824 | 0.824 | |
| –CIs | (0.747–0.9) | (0.746–0.9) | (0.746–0.9) |
| Sensitivity | 0.625 | 0.639 | 0.597 |
| –CIs | (0.264–0.497) | (0.251–0.483) | (0.289–0.525) |
| Specificity | 0.953 | 0.93 | 0.977 |
| –CIs | (0.00568–0.158) | (0.0146–0.191) | (0.000589–0.123) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Cropped matrix dimensions. Baseline matrix dimensions is 512 x 512.
Fig 4Classification model performance for each model across a range of nKeep values.
Error bars show the 95% confidence intervals. Neural Network cannot be used with more than 11 features.
Model performance comparison of PCA implementation.
| no PCA | PCA | |
|---|---|---|
| AUC | 0.8 | |
| –CIs | (0.747–0.9) | (0.717–0.88) |
| Sensitivity | 0.625 | 0.694 |
| –CIs | (0.264–0.497) | (0.202–0.425) |
| Specificity | 0.953 | 0.814 |
| –CIs | (0.00568–0.158) | (0.0839–0.334) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Feature selection method comparisons.
| FILTER | EMBEDDED | |
|---|---|---|
| n Features | 2 | 87983 |
| AUC | 0.824 | |
| –CIs | (0.747–0.9) | (0.746–0.9) |
| Sensitivity | 0.625 | 0.694 |
| –CIs | (0.264–0.497) | (0.202–0.425) |
| Specificity | 0.953 | 0.884 |
| –CIs | (0.00568–0.158) | (0.0389–0.251) |
| 8.81 min | 54.87 min |
* Only SLR model run
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Feature selection method comparison.
| WRAPPER | ||||||
|---|---|---|---|---|---|---|
| 100 | 250 | 500 | 1000 | 2000 | 3000 | |
| AUC | 0.739 | 0.765 | 0.751 | 0.762 | 0.756 | 0.703 |
| –CIs | (0.645–0.83) | (0.672–0.86) | (0.652–0.85) | (0.672–0.85) | (0.666–0.85) | (0.603–0.8) |
| Sensitivity | 0.681 | 0.722 | 0.764 | 0.625 | 0.694 | 0.764 |
| –CIs | (0.214–0.44) | (0.179–0.396) | (0.144–0.351) | (0.264–0.497) | (0.202–0.425) | (0.144–0.351) |
| Specificity | 0.767 | 0.767 | 0.721 | 0.814 | 0.744 | 0.628 |
| –CIs | (0.118–0.386) | (0.118–0.386) | (0.153–0.437) | (0.0839–0.334) | (0.135–0.412) | (0.23–0.533) |
| 100.34 min | 9.64 min | 511.39 min | 635.98 min | 438.41 min | 709.06 min | |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Model performance comparison run subtraction.
| 2D DWT | Run 3—Run 1 | Run 1– Run 3 | |
|---|---|---|---|
| AUC | 0.606 | 0.605 | |
| –CIs | (0.747–0.9) | (0.498–0.71) | (0.497–0.71) |
| Sensitivity | 0.625 | 0.583 | 0.625 |
| –CIs | (0.264–0.497) | (0.302–0.539) | (0.264–0.497) |
| Specificity | 0.953 | 0.651 | 0.605 |
| –CIs | (0.00568–0.158) | (0.21–0.509) | (0.25–0.556) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Model performance comparison- noise reduction approaches.
| 2D DWT | Run ensemble | Probability ensemble | |
|---|---|---|---|
| AUC | 0.808 | ||
| –CIs | (0.747–0.9) | (0.73–0.89) | (0.752–0.9) |
| Sensitivity | 0.625 | 0.708 | 0.653 |
| –CIs | (0.264–0.497) | (0.19–0.411) | (0.239–0.469) |
| Specificity | 0.953 | 0.814 | 0.907 |
| –CIs | (0.00568–0.158) | (0.0839–0.334) | (0.0259–0.221) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.
Model performance comparison when using the demographic (demo) variables as features or when using these in addition to the two FAIMS features selected by the filter method.
| 2D DWT | Demo variables | Demo and FAIMS | |
|---|---|---|---|
| AUC | 0.825 | 0.87 | |
| –CIs | (0.747–0.9) | (0.8–0.94) | (0.839–0.95) |
| Sensitivity | 0.625 | 0.764 | 0.778 |
| –CIs | (0.264–0.497) | (0.144–0.351) | (0.133–0.336) |
| Specificity | 0.953 | 0.907 | 0.884 |
| –CIs | (0.00568–0.158) | (0.0259–0.221) | (0.0389–0.251) |
Model performance (confident intervals; CIs) are reported for the Sparse Logistic Regression algorithm.