| Literature DB >> 35832450 |
Ekaterina Smirnova1, Christopher Mallow2, John Muschelli3, Yuan Shao4, Jeffrey Thiboutot5, Andres Lam4, Ana M Rule4, Ciprian Crainiceanu3, Lonny Yarmus5.
Abstract
Background: Lung cancer remains the leading cause of cancer deaths accounting for almost 25% of all cancer deaths. Breath-based volatile organic compounds (VOCs) have been studied in lung cancer but previous studies have numerous limitations. We conducted a prospective matched case to control study of the ability of preidentified VOC performance in the diagnosis of stage 1 lung cancer (S1LC).Entities:
Keywords: Lung cancer; breath tests; mass spectrometry; volatile organic compounds (VOCs)
Year: 2022 PMID: 35832450 PMCID: PMC9271440 DOI: 10.21037/tlcr-21-953
Source DB: PubMed Journal: Transl Lung Cancer Res ISSN: 2218-6751
Figure 1Flow diagram of the patient selection for the breath volatile organic carbon study. S1LC, stage 1 lung cancer.
Demographic and clinical characteristics of the study participants
| Characteristics | Case (N=88) | Matched control (N=88, type 1) | Housemate Control (N=49, type 2) | P value |
|---|---|---|---|---|
| Age (years), mean (SD) | 67.85 (9.28) | 67.91 (9.81) | 63.13 (14.15) | 0.025 |
| BMI (kg/m2), mean (SD) | 27.61 (6.11) | 28.68 (5.60) | 28.35 (4.81) | 0.446 |
| Smoking history, n (%) | <0.001 | |||
| Never | 16 (18.2) | 16 (18.2) | 31 (63.3) | |
| Current | 14 (15.9) | 14 (15.9) | 3 (6.1) | |
| Former | 58 (65.9) | 58 (65.9) | 15 (30.6) | |
| Race, n (%) | 0.273 | |||
| White | 63 (71.6) | 62 (70.5) | 38 (77.6) | |
| Black | 17 (19.3) | 19 (21.6) | 3 (6.1) | |
| Asian/Pacific Islander | 6 (6.8) | 6 (6.8) | 6 (12.2) | |
| Other | 2 (2.3) | 1 (1.1) | 2 (4.1) | |
| Female sex, n (%) | 52 (59.1) | 52 (59.1) | 24 (49.0) | 0.450 |
| No history of family cancer, n (%) | 56 (63.6) | 85 (96.6) | 39 (79.6) | <0.001 |
| No kidney disease, n (%) | 72 (81.8) | 80 (90.9) | 47 (95.9) | 0.030 |
| No diabetes, n (%) | 70 (79.5) | 70 (79.5) | 41 (83.7) | 0.813 |
| No liver disease, n (%) | 86 (97.7) | 82 (93.2) | 47 (95.9) | 0.340 |
| No alcohol use, n (%) | 37 (42.0) | 29 (33.0) | 20 (40.8) | 0.423 |
P value column corresponds to the chi-squared test for the null hypothesis of equality of means in the three groups.
Figure 2Boxplots of concentrations for VOCs with concentrations with less than 10% data below LOD for (A) training data; and (B) test data. Boxplots are separated by cases (red), type 1 matched controls (dark green), and type 2 housemate controls (light green). The x-axis provides the compounds and the y-axis labels are displayed on the original scale even though the data were log10 transformed. VOCs, volatile organic compounds; LOD, limit of detection.
Results for unpaired t-tests comparing the mean of the log concentration between the cases and controls (type 1 and type 2 combined) for the training, test, and combined data
| VOC | Training data | Test data | Combined data (training +test) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| N cases | N controls | P value | N cases | N controls | P value | N cases | N controls | P value | |||
| 2-Pentanone | 28 | 48 | 0.568 | 55 | 77 | 0.105 | 83 | 125 | 0.699 | ||
| Acetoin | 28 | 49 | 0.091 | 57 | 84 | 0.001 | 85 | 133 | <0.001 | ||
| Dodecane | 29 | 51 | 0.268 | 56 | 85 | 0.462 | 85 | 136 | 0.762 | ||
| Heptanal | 23 | 45 | 0.084 | 56 | 82 | 0.837 | 79 | 127 | 0.237 | ||
VOC, volatile organic compound.
Prediction performance measured as AUC in the training and test data when predicting S1LC cases based on individual binary predictors defined as “above or below LOD” for each VOC
| VOC | Univariate AUC | |
|---|---|---|
| Training | Test | |
| 3-3-dimethyl-pentane | 0.504 | 0.494 |
| 2-Butanone | 0.504 | 0.503 |
| 2-Pentanone | 0.504 | 0.474 |
| Toluene | 0.524 | 0.500 |
| 3-methyl-1-butanol | 0.561 | 0.518 |
| Acetoin | 0.514 | 0.497 |
| 2-hexanol | 0.555 | 0.486 |
| Hexanal | 0.528 | 0.500 |
| Ethylbenzene | 0.504 | 0.511 |
| Heptanal | 0.558 | 0.494 |
| Cyclohexanone | 0.547 | 0.488 |
| p-Cymene | 0.630 | 0.580 |
| Dodecane | 0.517 | 0.511 |
VOCs are ordered by name, not by any measure. AUC, area under the curve; S1LC, stage 1 lung cancer; LOD, limit of detection; VOC, volatile organic compound.
Prediction performance of VOC log concentration measured by the AUC in univariate and forward selection models
| VOC | Univariate model | Forward selection cumulative model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Training | Test | Training | Test | ||||||||
| Univariate AUC | N | Univariate AUC | N | Cumulative AUC | N | Cumulative AUC | N | ||||
| Acetoin | 0.649 | 77 | 0.650 | 141 | 0.649 | 77 | 0.650 | 141 | |||
| Heptanal | 0.610 | 68 | 0.511 | 138 | 0.669 | 64 | 0.559 | 137 | |||
| 2-Pentanone | 0.502 | 76 | 0.590 | 132 | 0.689 | 63 | 0.601 | 128 | |||
| Dodecane | 0.574 | 80 | 0.541 | 141 | 0.686 | 63 | 0.592 | 127 | |||
Models are developed on the training data and evaluated on the test data set. The forward selection model is cumulative; for example, the row labeled 2-Pentanone indicates that 2-Pentanone was the third variable added to the model and the corresponding AUC refers to the model that includes Acetoin, Heptanal, and 2-Pentanone. VOC, volatile organic compound; AUC, area under the curve.
Estimated sensitivity (proportion of correctly identified S1LC cases), specificity (proportion of correctly identified controls), and accuracy (proportion of correctly classified cases and controls) for three Acetoin concentration thresholds using test data and combined data (training and test)
| Threshold | Test data | All data | |||||
|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Accuracy | Sensitivity | Specificity | Accuracy | ||
| 10% (0.026 mg/L) | 0.649 | 0.583 | 0.610 | 0.518 | 0.699 | 0.628 | |
| 25% (0.044 mg/L) | 0.754 | 0.429 | 0.560 | 0.671 | 0.541 | 0.592 | |
| 50% (0.098 mg/L) | 0.930 | 0.286 | 0.546 | 0.871 | 0.368 | 0.564 | |
S1LC, stage 1 lung cancer.
| if Acetointest <10threshold (from training data) | participant is classified as S1LC case; |
| if Acetointest ≥10threshold (from training data) | participant is classified as control. |