| Literature DB >> 29339749 |
Mavra Nasir1, Heather D Bean2, Agnieszka Smolinska3, Christiaan A Rees1, Edith T Zemanick4, Jane E Hill5,6.
Abstract
Respiratory infections caused by Pseudomonas aeruginosa and Staphylococcus aureus are the leading cause of morbidity and mortality in cystic fibrosis (CF) patients. The authors aimed to identify volatile biomarkers from bronchoalveolar lavage (BAL) samples that can guide breath biomarker development for pathogen identification. BAL samples (n = 154) from CF patients were analyzed using two-dimensional gas chromatography time-of-flight mass spectrometry. Random Forest was used to select suites of volatiles for identifying P. aeruginosa-positive and S. aureus-positive samples using multiple infection scenarios and validated using test sets. Using nine volatile molecules, we differentiated P. aeruginosa-positive (n = 7) from P. aeruginosa-negative (n = 53) samples with an area under the receiver operating characteristic curve (AUROC) of 0.86 (95% CI 0.71-1.00) and with positive and negative predictive values of 0.67 (95% CI 0.38-0.75) and 0.92 (95% CI 0.88-1.00), respectively. We were also able to discriminate S. aureus-positive (n = 15) from S. aureus-negative (n = 45) samples with an AUROC of 0.88 (95% CI 0.79-1.00) using eight volatiles and with positive and negative predictive values of 0.86 (95% CI 0.61-0.96) and 0.70 (95% CI 0.61-0.75), respectively. Prospective validation of identified biomarkers as screening tools in patient breath may lead to clinical application.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29339749 PMCID: PMC5770459 DOI: 10.1038/s41598-017-18491-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Microbiological results from BAL fluid samples of patients with cystic fibrosis (n = 121; samples with no cultured microorganisms (NCM) excluded for clarity). (A) Colour matrix of the microbiological profiles observed among the samples; (red) bacteria (excluding nontuberculous mycobacteria (NTM)), (yellow) fungi, (green) NTM. (B) Bar plot of the most prevalent microorganisms in the samples (for organisms present in ≥1% samples).
Demographic and clinical characteristics of subjects in the study groups.
| Study groups¥ | p-value | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| NCM |
| ||||||||||
| n | 19 | 40 | 32 | 114 | 93 | — | — | — | — | — | — |
| Age | 17.6 (±8.6) | 13.3 (±5.2) | 9.8 (±6.9) | 10.5 (±6.2) | 11.4 (±7.5) | 0.74 | 0.25 | 0.12 | 0.72 | ||
| Gender (M/F) | 9/10 | 25/15 | 15/17 | 55/59 | 39/54 | 1.00 | 0.74 | 0.51 | 0.66 | 1.00 | 0.19 |
| Genotype ( | 6/7 | 15/14 | 13/7 | 66/41 | 53/34 | 0.64 | 0.82 | 0.51 | 1.00 | 0.60 | 0.72 |
| BMI, kg/m2 | 20.0 (±2.8) | 18.2 (±3.7) | 17.6 (±2.7) | 17.9 (±3.2) | 18.2 (±2.9) | 0.82 | 0.25 | 0.12 | 0.94 | ||
| Comorbidities (Y/N) Pancreatic insufficiency CF-related diabetes | 17/2 | 33/7 | 21/11 | 110/4 | 90/3 | 0.19 | 0.74 | 0.25 | 0.95 | 0.60 | 0.10 |
| 4/15 | 6/34 | 7/25 | 18/96 | 14/79 | 1.00 | 0.82 | 0.77 | 0.95 | 0.93 | 1.00 | |
| FVC % predicted Age ≥6 years | 82.0 (±16.7) n = 18 | 94.6 (±15.6) n = 35 | 94.2 (±19.7) n = 22 | 91.5 (±18.8) n = 87 | 88.5 (±20.1) n = 69 | 0.07 | 0.82 | 0.51 | 0.12 | 0.08 | 0.72 |
| FEV1% predicted Age ≥6 years | 74.1 (±20.5) n = 18 | 87.1 (±17.9) n = 35 | 85.5 (±22.3) n = 22 | 83.4 (±20.7) n = 87 | 78.4 (±21.7) n = 69 | 0.19 | 0.82 | 0.50 | 0.13 | 0.16 | 0.22 |
¥Samples can belong to more than one study group.
Data shown as mean (standard deviation) except where indicated.
p-value <0.05*, 0.01** considered statistically significant after Benjamini-Hochberg correction.
M, male; F, female, Y, yes; N, no; FVC, forced vital capacity; FEV1, forced expiratory volume in 1 s.
Figure 2Putative chemical class identifications of volatile molecules present in BAL fluid samples of Pa+, Sa+, and NCM groups.
Model error rate, sensitivity, specificity, PPV, NPV, AUROC (95% CI) for volatile molecules from BAL fluid as diagnostic for Pa+ versus NCM, Sa+ versus NCM, Pa+/Sa+ versus NCM, Pa+ versus Sa+, Sa+ versus Pa+, Pa+ versus Pa− and Sa+ versus Sa−.
| n | Model error rate | Sensitivity | Specificity | PPV | NPV | AUROC | |
|---|---|---|---|---|---|---|---|
| Training set | 30 | 0.30 | 0.35 (0.05–0.76) | 0.70 (0.40–0.90) | 0.30 (0.10–0.62) | 0.72 (0.55–0.88) | 0.70 (0.45–0.91) |
| Test set | 23 | 0.40 | 0.29 (0.04–0.71) | 0.63 (0.35–0.85) | 0.25 (0.08–0.56) | 0.66 (0.52–0.79) | 0.67 (0.39–0.89) |
|
| |||||||
| Training set | 43 | 0.32 | 0.58 (0.30–0.80) | 0.60 (0.27–0.85) | 0.62 (0.45–0.81) | 0.51 (0.33–0.66) | 0.87 (0.77–1.00) |
| Test set | 30 | 0.46 | 0.53 (0.27–0.77) | 0.54 (0.25–0.81) | 0.60 (0.42–0.76) | 0.45 (0.30–0.64) | 0.81 (0.74–0.99) |
|
| |||||||
| Training set | 53 | 0.30 | 0.82 (0.66–0.95) | 0.72 (0.39–0.90) | 0.86 (0.71–0.95) | 0.72 (0.46–0.89) | 0.80 (0.69–0.97) |
| Test set | 38 | 0.40 | 0.79 (0.63–0.90) | 0.60 (0.33–0.82) | 0.81 (0.70–0.89) | 0.56 (0.38–0.72) | 0.70 (0.62–0.89) |
|
| |||||||
| Training set | 36 | 0.25 | 0.70 (0.35–0.93) | 0.92 (0.63–1.00) | 0.88 (0.51–0.98) | 0.80 (0.61–0.91) | 0.84 (0.71–0.98) |
| Test set | 23 | 0.30 | 0.73 (0.46–0.87) | 0.81 (0.60–0.94) | 0.67 (0.43–0.85) | 0.72 (0.59–0.82) | 0.79 (0.67–0.95) |
|
| |||||||
| Training set | 36 | 0.25 | 0.92 (0.63–1.00) | 0.70 (0.35–0.93) | 0.80 (0.61–0.91) | 0.88 (0.51–0.98) | 0.84 (0.71–0.98) |
| Test set | 23 | 0.30 | 0.81 (0.60–0.94) | 0.73 (0.46–0.87) | 0.72 (0.59–0.82) | 0.67 (0.43–0.85) | 0.79 (0.67–0.95) |
|
| |||||||
| Training set | 73 | 0.20 | 0.77 (0.60–0.90) | 0.91 (0.48–0.94) | 0.71 (0.37–0.80) | 0.95 (0.91–1.00) | 0.90 (0.82–1.00) |
| Test set | 60 | 0.25 | 0.75 (0.63–1.00) | 0.88 (0.48–0.90) | 0.67 (0.38–0.75) | 0.92 (0.88–1.00) | 0.86 (0.71–1.00) |
| Training set | 73 | 0.30 | 0.85 (0.80–0.99) | 0.56 (0.36–0.79) | 0.90 (0.81–0.95) | 0.73 (0.65–0.80) | 0.89 (0.80–1.00) |
| Test set | 60 | 0.37 | 0.80 (0.78–0.92) | 0.52 (0.31–0.72) | 0.86 (0.61–0.96) | 0.70 (0.61–0.75) | 0.88 (0.79–1.00) |
*Volatile molecules are the same for both models.
Figure 3Receiver operating characteristic (ROC) curves on test set samples of (A) Pa+ (n = 8) versus NCM (n = 15) using a panel of six volatile molecules, (B) Sa+ (n = 15) versus NCM (n = 15) using seven volatile molecules, (C) Pa+/Sa+ (n = 23) versus NCM (n = 15) using nine volatile molecules, (D) Pa+ (n = 7) versus Pa− (n = 53) using nine volatile molecules, (E) Sa+ (n = 15) versus Sa− (n = 45) using eight volatile molecules and (F) Pa+ (n = 8) versus Sa+ (n = 15) using 11 volatile molecules. (Dotted line indicates random classification).
Figure 4Three-dimensional principal component scores plot on test set samples of Pa+ (diamond, n = 8) versus Sa+ (plus, n = 15) with complete microbiology profiles.
Putative identification of 38 discriminatory volatile molecules for all six models.
| Peak # | Putative identification | Putative molecular formula | CAS# | RI | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Unknown | Unknown | — | — | ↑ | — | — | — | — | — |
| 2 | Ketone | C6H12O2 | — | — | — | ↓ | — | — | — | — |
| 3 |
|
|
|
|
|
|
| — |
|
|
| 4 | Ether | C4H6O | — | — | — | ↑ | — | — | — | 635 |
| 5 |
|
|
|
|
|
|
|
|
|
|
| 6 |
|
|
|
|
|
|
|
|
|
|
| 7 | Aromatic | C6H8O | ↓ | ↑ | ↑ | — | — | ↑ | — | 734 |
| 8 | Ether | C6H12O2 | — | — | — | ↑ | — | — | — | 744 |
| 9 | Ketone | C6H12O | — | — | — | ↑ | — | — | — | 743 |
| 10 | Sulfur Containing | C20H42O2S | — | — | — | ↑ | — | — | — | 788 |
| 11 | Hydrocarbon | C15H30 | ↑* | — | ↑ | — | — | — | — | 795 |
| 12 | Ketone | C9H16O3 | — | — | ↑ | — | — | — | — | 826 |
| 13 | Alcohol | C11H22O2 | — | — | ↑ | — | — | — | — | 841 |
| 14 | Ester | C5H8O4 | — | ↑* | — | ↓ | — | ↑ | — | — |
| 15 | Hydrocarbon | C10H20 | ↑* | — | ↑* | — | — | — | — | — |
| 16 | Nitrogen Containing | C12H25NO2 | — | — | — | ↓ | — | — | — | — |
| 17 | Carboxylic Acid | C4H6O4 | — | — | ↑* | — | — | — | — | — |
| 18 | Ketone | C12H14O | — | — | ↓ | — | — | — | — | — |
| 19 | Alcohol | C4H10O3 | — | — | — | ↑ | — | — | — | — |
| 20 | Hydrocarbon | C4H8 | ↑ | — | — | — | ↑** | — | — | — |
| 21 | Ester | C4H8O2 | ↑ | — | — | — | — | — | — | — |
| 22 | Hydrocarbon | C8H18 | — | ↑ | — | — | — | — | — | 734 |
| 23 | Alcohol | C5H12O | — | ↑** | — | — | — | — | — | 768 |
| 24 | Alcohol | C16H14O | — | ↓** | — | — | — | — | — | 789 |
| 25 | Hydrocarbon | C9H20 | — | ↑** | — | — | — | — | — | — |
| 26 |
|
| — | — | — | — |
| — |
|
|
| 27 |
|
| — | — | — | — |
| — |
|
|
| 28 |
|
| — | — | — | — |
| — |
|
|
| 29 | Ester | C6H12O2 | — | — | — | — | ↑* | — | — | 784 |
| 30 | Hydrocarbon | C4H10 | — | — | — | — | ↑* | — | — | — |
| 31 | Aromatic | C8H10 | — | — | — | — | ↑* | — | — | — |
| 32 | Nitrogen containing | C10H15N | — | — | — | — | ↓** | — | — | — |
| 33 | Ether | C4H6O | — | — | — | — | — | ↑ | — | — |
| 34 | Hydrocarbon | C7H16 | — | — | — | — | — | ↑ | — | — |
| 35 | Ketone | C6H12O | — | — | — | — | — | ↑ | — | — |
| 36 | Ketone | C8H16O | — | — | — | — | — | ↑ | — | — |
| 37 | Aromatic | C14H22 | — | — | — | — | — | ↑ | — | — |
| 38 | Ketone | C7H14O | — | — | — | — | — | ↑** | — | — |
Volatile molecules in bold have been previously reported in the literature in P. aeruginosa and/or S. aureus volatile metabolomics studies. RI; experimentally determined retention-index, Up; ↑, Down; ↓, based on mean peak area. p-value <0.05*, 0.01** considered statistically significant after Benjamini-Hochberg correction.
Previously reported P. aeruginosa– and S. aureus–-associated volatile molecules.
| Volatile molecule | KEGG pathway ID | Pathway name | Reference | ||
|---|---|---|---|---|---|
|
|
| Both | |||
| 2-butanone | 00460 | Cyanoamino acid metabolism |
[ |
[ |
[ |
| 01110 | Biosynthesis of secondary metabolites | ||||
| 2-methyl-2-butanol | — | — |
[ | — | — |
| 1-butanol | 00650 | Butanoate metabolism |
[ | — |
[ |
| 01120 | Microbial metabolism in diverse environments | ||||
| 01220 | Degradation of aromatic compounds | ||||
| 2-butanol | — | — |
[ | — |
[ |
| Ethyl acetate | — | — |
[ | — |
[ |
| 3-methyl-2-butanone | — | — |
[ | — | — |