| Literature DB >> 31349744 |
Eitan Halper-Stromberg1,2, Lucas Gillenwater3, Charmion Cruickshank-Quinn4, Wanda Kay O'Neal5, Nichole Reisdorph6, Irina Petrache1,3, Yonghua Zhuang7, Wassim W Labaki8, Jeffrey L Curtis8, James Wells9, Stephen Rennard10,11, Katherine A Pratte3, Prescott Woodruff12, Kathleen A Stringer13, Katerina Kechris14, Russell P Bowler15,16.
Abstract
Smoking causes chronic obstructive pulmonary disease (COPD). Though recent studies identified a COPD metabolomic signature in blood, no large studies examine the metabolome in bronchoalveolar lavage (BAL) fluid, a more direct representation of lung cell metabolism. We performed untargeted liquid chromatography-mass spectrometry (LC-MS) on BAL and matched plasma from 115 subjects from the SPIROMICS cohort. Regression was performed with COPD phenotypes as the outcome and metabolites as the predictor, adjusted for clinical covariates and false discovery rate. Weighted gene co-expression network analysis (WGCNA) grouped metabolites into modules which were then associated with phenotypes. K-means clustering grouped similar subjects. We detected 7939 and 10,561 compounds in BAL and paired plasma samples, respectively. FEV1/FVC (Forced Expiratory Volume in One Second/Forced Vital Capacity) ratio, emphysema, FEV1 % predicted, and COPD exacerbations associated with 1230, 792, eight, and one BAL compounds, respectively. Only two plasma compounds associated with a COPD phenotype (emphysema). Three BAL co-expression modules associated with FEV1/FVC and emphysema. K-means BAL metabolomic signature clustering identified two groups, one with more airway obstruction (34% of subjects, median FEV1/FVC 0.67), one with less (66% of subjects, median FEV1/FVC 0.77; p < 2 × 10-4). Associations between metabolites and COPD phenotypes are more robustly represented in BAL compared to plasma.Entities:
Keywords: BAL; BALF; COPD; LC–MS; bronchoalveolar lavage; emphysema; mass spectrometry; metabolomics; plasma
Year: 2019 PMID: 31349744 PMCID: PMC6724137 DOI: 10.3390/metabo9080157
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Cohort characteristics.
| Variable | Non-Smokers | Smoking Controls | COPD | |
|---|---|---|---|---|
| n | 12 | 56 | 47 | |
| Sex, % men | 33 | 45 | 62 | 0.104 |
| Race,% White | 50 | 73 | 87 | 7.48 × 10−3 * |
| Race, % Black | 25 | 21 | 6 | 7.48 × 10−3 * |
| Race, % Asian | 17 | 2 | 4 | 7.48 × 10−3 * |
| Race, % other | 8 | 4 | 2 | 7.48 × 10−3 * |
| Age, yr | 56 (50–60) | 58 (50–66) | 64 (58–68) | 7.95 × 10−4 * |
| Current smokers, % | 0 | 36 | 36 | 2.68 × 10−2 * |
| Pack–years | 0 (0–0) | 34 (26–44) | 42 (34–60) | 3.95 × 10−11 * |
| Body mass index | 26.21 (5.46) | 28.78 (4.47) | 28.9 (5.27) | 0.198 |
| Chronic bronchitis, % | 0 (0) | 7 (26) | 15 (36) | 0.294 |
| Exacerbations/yr | 0.08 (0.29) | 0.12 (0.43) | 0.39(0.68) | 0.117 |
| Emphysema, % | 0.15 (0.06–1.22) | 0.16 (0.05–0.4) | 1.05 (0.32–2.5) | 2.90 × 10−3 * |
| FEV1 % | 99.29 (7.31) | 100.23 (13.1) | 78.97 (19.92) | 3.87 × 10−8 * |
| FEV1/FVC | 81 (77–87) | 78 (75–81) | 61 (55–67) | 5.31 × 10−24 * |
Data are presented as median (interquartile range) or mean ± SD. * p-value < 0.05. Emphysema, %: % Emphysema voxels (<−950 Hounsfield units) in lung CT (Computed Tomography) image. Exacerbations/yr: # of exacerbations in last year. Chronic bronchitis: Daily productive cough for at least 3 months in the previous 2 consecutive years. % FEV1: Postbronchodilator % predicted forced expiratory volume in one second. FEV1/FVC: Ratio of forced expiratory volume in one second to forced vital capacity. COPD: chronic obstructive pulmonary disease.
Figure 1Compilation of Venn diagrams for bronchoalveolar lavage (BAL) and plasma compounds. All compounds (A), only annotated compounds (B), or annotated compounds with identifiers in HMDB (C), or KEGG (D) databases.
Figure 2BAL and Plasma comparison using distribution of Pearson’s correlation between BAL and plasma. All compounds (left), only annotated compounds (middle), or annotated compounds with KEGG identifiers (right). Mean correlation and t-test p-values are mean = 0.015, P = 1.7 × 10−4 (A), mean = 0.021, P = 4.7 × 10−15 (B) and mean = 1.1 × 10×, P = 0.94 (C). The set of all compounds passing the <20% missingness preprocessing filter and annotated with HMDB identifiers was equivalent to the corresponding KEGG set, and no separate distribution for HMDB is shown.
Significantly associated compounds with clinical, cell-type, and COPD sub-phenotype variables.
| Variable | BAL | Plasma |
|---|---|---|
| Sex | 1 | 240 |
| Current Smoker | 249 | 7 |
| Age | 0 | 177 |
| Menopause | 0 | 0 |
| Neutrophil Count | 665 | 0 |
| Lymphocyte Count | 5 | 0 |
| Eosinophil Count | 0 | 4 |
| BAL Neutrophil Count | 0 | 4 |
| BAL Lymphocyte Count | 1 | 0 |
| BAL Eosinophil Count | 0 | 7 |
| BAL Monocyte Count | 1 | 0 |
| BAL Macrophage Count | 1 | 1 |
| Hemoglobin | 0 | 63 |
| Hematocrit | 0 | 80 |
| FEV1/FVC | 1230 | 0 |
| Emphysema, % | 791 | 2 |
| Chronic Bronchitis | 0 | 0 |
| Exacerbations/yr | 1 | 0 |
| FEV1 % | 8 | 0 |
Cells are populated with numbers obtained after testing all compounds, 7939 from BAL and 10,561 from plasma. Compound measures with >20% missingness in the raw data were modeled using tobit regression. Compound measures ≤20% missingness in the raw data were modeled using either beta, logistic, negative binomial, or linear regression (Table S2). Compounds were significant at p-value adjusted FDR (False Discovery Rate) <0.05 Emphysema, %: % Emphysema voxels (<−950 Hounsfield units) in lung CT image; FEV1 %: Postbronchodilator % predicted forced expiratory volume in one second; FEV1/FVC: Ratio of forced expiratory volume in one second to forced vital capacity; Exacerbations/yr: # of exacerbations in last year.
Compounds in BAL most significantly associated with FEV1/FVC with corresponding plasma results.
| Compound | FDR BAL | Estimate BAL | SE BAL | FDR Plasma | Estimate Plasma | SE Plasma |
|---|---|---|---|---|---|---|
| PS (37:3) | 7.6 × 10−5 | 0.45 | 0.089 | 1 | 0.0015 | 0.094 |
| Lophocerine | 7.6 × 10−5 | 0.42 | 0.084 | 1 | −0.0034 | 0.066 |
| p-cresol | 7.6 × 10−5 | 0.4 | 0.08 | 0.98 | −0.036 | 0.14 |
| PE (38:3) | 7.6 × 10−5 | 0.38 | 0.075 | 0.93 | 0.086 | 0.094 |
| PC (40:6) | 7.6 × 10−5 | 0.35 | 0.069 | 0.11 | 0.14 | 0.033 |
| PC (40:6) (isomer) | 7.6 × 10−5 | 0.34 | 0.063 | 0.68 | −0.16 | 0.079 |
| Ceramide (d18:1/16:0) * | 7.6 × 10−5 | −0.29 | 0.054 | 0.89 | 0.092 | 0.086 |
| PC (32:1) ** | 7.6 × 10−5 | 0.28 | 0.054 | 0.96 | −0.048 | 0.082 |
| Glycocholic acid * | 7.6 × 10−5 | 0.27 | 0.052 | 0.96 | 0.023 | 0.035 |
| MGDG (36:5) | 7.6 × 10−5 | 0.27 | 0.055 | 0.89 | 21 | 20 |
| S-(Phenylacetothiohydroximoyl)-L-cysteine | 7.6 × 10−5 | 0.26 | 0.051 | 0.78 | −0.13 | 0.09 |
| SM (d18:1/24:1) ** | 7.6 × 10−5 | 0.26 | 0.051 | |||
| PE (35:1) | 7.6 × 10−5 | 0.26 | 0.05 | 0.96 | −0.036 | 0.075 |
| N-palmitoyl glycine | 7.6 × 10−5 | 0.25 | 0.05 | 0.92 | 17 | 20 |
| L-Threonylcarbamoyladenylate | 7.6 × 10−5 | 0.25 | 0.049 | 0.55 | −0.078 | 0.033 |
| Decaprenyl phosphate | 7.6 × 10−5 | 0.24 | 0.047 | 0.99 | −2.9 | 11 |
| Mycalamide B | 7.6 × 10−5 | 0.23 | 0.044 | 0.97 | −0.0099 | 0.027 |
| PC (36:4) * | 7.6 × 10−5 | 0.23 | 0.046 | 0.44 | 36 | 14 |
| PE (36:3) | 7.6 × 10−5 | 0.22 | 0.045 | 0.96 | 0.019 | 0.042 |
| PC (34:2) ** | 7.6 × 10−5 | 0.22 | 0.044 | 0.95 | 5.9 | 8.5 |
| Homocysteine * | 7.6 × 10−5 | 0.22 | 0.046 | 0.89 | 1.6 | 1.4 |
| SQMG (16:1) | 7.6 × 10−5 | 0.21 | 0.042 | 0.55 | −26 | 12 |
| PE (34:2) * | 7.6 × 10−5 | 0.2 | 0.039 | 0.98 | −0.019 | 0.081 |
| CL (70:0) | 9.2 × 10−5 | 0.27 | 0.056 | 0.98 | −0.015 | 0.071 |
| CL (72:7) | 9.4 × 10−5 | 0.40 | 0.082 | 1 | 0.001 | 0.11 |
Top 25 compounds for BAL FEV1/FVC association after sorting of the FDR p-value and estimate. * indicates an accurate mass and retention time match, ** indicates an accurate mass and MSMS library match. SE: Standard Error; FDR: False discovery rate based on Benjamini–Hochberg; CL: cardiolipin; SM: Sphingomyelin; PC: Phosphatidylcholine; PE: Phosphatidylethanolamine; PS: Phosphatidylserine; SQMG: Sulfoquinovosyl monoacylglycerol; MGDG: Monogalactosyldiacylglycerol.
Figure 3Enrichment of compound classes in BAL compounds associated with FEV1/FVC and % Emphysema). (A) Odds ratio with 95% confidence intervals for compounds in a given category to appear among the FDR corrected FEV1/FVC associated compounds versus appearing among non-associated compounds, using Fisher’s exact test. Regular expression searches identified compounds of different categories with matching compounds checked manually for accurate categorization. (B) Same as A (top left) but for more specific amino acid containing compounds. Categories shown in B include amino acid containing compounds for amino acids with >10 compounds detected experiment-wide for BAL. In B, a compound need only contain the compound listed to be included. (C) Same as A for % Emphysema (D) Same as B for % Emphysema
Figure 4Strength of correlation between weighted gene co-expression network analysis (WGCNA) modules and clinical variables and outcomes. Heatmap of module/clinical variable correlation. Plasma (A) or BAL (B). Module colors correspond to dendrogram in Figure S2. Cell text is Pearson correlation (p-value) between the first principal component representing the module and the corresponding variable. Plasma WGCNA modules without any correlation, p-values <0.01, are excluded (12 excluded, 19 displayed) for greater visual clarity. Full plasma WGCNA module to clinical variable correlations are shown in Figure S5.