| Literature DB >> 24957028 |
Christopher Phillips1, Neil Mac Parthaláin2, Yasir Syed3, Davide Deganello4, Timothy Claypole5, Keir Lewis3.
Abstract
Exhaled volatile organic compounds (VOCs) are of interest for their potential to diagnose disease non-invasively. However, most breath VOC studies have analyzed single breath samples from an individual and assumed them to be wholly consistent representative of the person. This provided the motivation for an investigation of the variability of breath profiles when three breath samples are taken over a short time period (two minute intervals between samples) for 118 stable patients with Chronic Obstructive Pulmonary Disease (COPD) and 63 healthy controls and analyzed by gas chromatography and mass spectroscopy (GC/MS). The extent of the variation in VOC levels differed between COPD and healthy subjects and the patterns of variation differed for isoprene versus the bulk of other VOCs. In addition, machine learning approaches were applied to the breath data to establish whether these samples differed in their ability to discriminate COPD from healthy states and whether aggregation of multiple samples, into single data sets, could offer improved discrimination. The three breath samples gave similar classification accuracy to one another when evaluated separately (66.5% to 68.3% subjects classified correctly depending on the breath repetition used). Combining multiple breath samples into single data sets gave better discrimination (73.4% subjects classified correctly). Although accuracy is not sufficient for COPD diagnosis in a clinical setting, enhanced sampling and analysis may improve accuracy further. Variability in samples, and short-term effects of practice or exertion, need to be considered in any breath testing program to improve reliability and optimize discrimination.Entities:
Year: 2014 PMID: 24957028 PMCID: PMC4101508 DOI: 10.3390/metabo4020300
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Chronic Obstructive Pulmonary Disease (COPD) and control groups in the study.
| Variable (Mean ± SD) | COPD ( | Controls ( | |
|---|---|---|---|
| Age (years) | 67.0 ± 8.4 | 67.4 ± 9.7 | |
| Male | 61% | 47% | |
| Smoking Status | -never | 0 | 39 |
| -ex | 78 | 18 | |
| -current | 40 | 6 | |
| Body Mass Index (kg/m2) | 25.6 ± 4.5 | 27.0 ± 4.4 | |
| Predicted % FEV1 | 49.6 ± 18 | 98 ± 16 | |
| Oxygen saturation % | 95.0 ± 2.4 | 95.8 ± 2.3 | |
Figure 1Variation in mean isoprene levels in repeat breath tests for both COPD and control groups (abundance normalized according to batch benzaldehyde). Error bars show standard errors.
Figure 2Variation in mean total volatile organic compound (TVOC) minus isoprene levels in repeat breath tests for both COPD and control groups (abundance normalized according to batch benzaldehyde). Error bars show standard errors.
VOC levels in second and third breath tests compared with first in both COPD and control groups—median, geometric means and coefficient of variation across each group are shown.
| VOC | COPD | Control | |||
|---|---|---|---|---|---|
| 2/1 | 3/1 | 2/1 | 3/1 | ||
| Median | Isoprene | 0.930 | 0.750 | 0.755 | 0.678 |
| Total-isoprene | 1.143 | 0.936 | 1.196 | 0.705 | |
| Benzene | 1.071 | 1.061 | 0.982 | 1.002 | |
| Toluene | 1.067 | 1.027 | 0.979 | 0.992 | |
| Benzaldehyde | 1.151 | 1.036 | 1.036 | 0.978 | |
| Hexanal | 0.973 | 0.843 | 0.967 | 0.860 | |
| Nonadecane | 0.995 | 0.980 | 1.013 | 0.989 | |
| Geometric mean | Isoprene | 0.901 | 0.752 | 0.781 | 0.641 |
| Total-isoprene | 1.093 | 0.933 | 1.043 | 0.836 | |
| Benzene | 1.124 | 1.058 | 0.979 | 1.006 | |
| Toluene | 1.080 | 1.023 | 0.961 | 0.978 | |
| Benzaldehyde | 1.153 | 0.983 | 0.950 | 0.969 | |
| Hexanal | 1.036 | 0.929 | 0.962 | 0.827 | |
| Nonadecane | 1.002 | 0.975 | 0.956 | 0.897 | |
| CV% | Isoprene | 23 | 26 | ||
| Total-isoprene | 40 | 40 | |||
| Benzene | 21 | 23 | |||
| Toluene | 18 | 18 | |||
| Benzaldehyde | 26 | 24 | |||
| Hexanal | 21 | 21 | |||
| Nonadecane | 28 | 27 | |||
Classification accuracy when using machine learning techniques on individual and combined breath samples.
| Classifier | 1 only | 2 only | 3 only | 1 and 2 | 1 and 3 | 2 and 3 | 1 + 2 + 3 | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| J48 | 66.91(12.30) | 65.28(12.45) | 67.75(9.98) | 73.66(7.52) | 72.30(7.88) | 70.46(6.17) | 74.13(9.84) | ||||
| JRIP | 70.69(9.85) | 69.99(9.76) | 66.64(9.88) | 72.11(6.99) | 70.85(7.29) | 69.93(6.44) | 73.28(9.93) | ||||
| PART | 67.18(10.47) | 65.58(10.49) | 65.24(11.55) | 73.28(6.90) | 72.38(6.78) | 72.01(6.66) | 72.74(9.80) | ||||
| Mean | 68.26 | 66.95 | 66.54 | 73.02 | 71.84 | 70.80 | 73.38 | ||||
|
| |||||||||||
| J48 | 0.67(0.14) | 0.65(0.14) | 0.66(0.13) | 0.72(0.10) | 0.72(0.10) | 0.69(0.09) | 0.65(0.15) | ||||
| JRIP | 0.66(0.12) | 0.65(0.11) | 0.63(0.11) | 0.69(0.09) | 0.68(0.09) | 0.65(0.08) | 0.70(0.12) | ||||
| PART | 0.67(0.14) | 0.61(0.12) | 0.63(0.14) | 0.73(0.09) | 0.70(0.10) | 0.71(0.08) | 0.65(0.16) | ||||
| Mean | 0.67 | 0.64 | 0.64 | 0.71 | 0.70 | 0.68 | 0.67 | ||||
|
| |||||||||||
| J48 | 0.71(0.15) | 0.69(0.16) | 0.75(0.14) | 0.80(0.08) | 0.79(0.09) | 0.78(0.08) | 0.69(0.19) | ||||
| JRIP | 0.81(0.12) | 0.81(0.12) | 0.76(0.16) | 0.81(0.09) | 0.79(0.09) | 0.81(0.10) | 0.75(0.15) | ||||
| PART | 0.73(0.14) | 0.73(0.14) | 0.73(0.14) | 0.79(0.08) | 0.80(0.09) | 0.80(0.09) | 0.66(0.19) | ||||
| Mean | 0.75 | 0.74 | 0.75 | 0.80 | 0.79 | 0.80 | 0.70 | ||||
|
| |||||||||||
| J48 | 0.58(0.22) | 0.58(0.21) | 0.54(0.20) | 0.62(0.15) | 0.59(0.15) | 0.55(0.12) | 0.59(0.22) | ||||
| JRIP | 0.51(0.22) | 0.48(0.19) | 0.50(0.25) | 0.56(0.19) | 0.55(0.17) | 0.50(0.18) | 0.64(0.21) | ||||
| PART | 0.57(0.19) | 0.52(0.21) | 0.51(0.21) | 0.63(0.15) | 0.58(0.14) | 0.57(0.14) | 0.62(0.23) | ||||
| Mean | 0.55 | 0.52 | 0.52 | 0.60 | 0.57 | 0.54 | 0.62 | ||||
Figure 3Classification accuracy when using machine learning techniques on individual and combined breath samples (error bars show standard deviations).