| Literature DB >> 32218378 |
Emily Mastej1, Lucas Gillenwater2, Yonghua Zhuang3, Katherine A Pratte2, Russell P Bowler2,4, Katerina Kechris3.
Abstract
Chronic obstructive pulmonary disease (COPD) is a disease in which airflow obstruction in the lung makes it difficult for patients to breathe. Although COPD occurs predominantly in smokers, there are still deficits in our understanding of the additional risk factors in smokers. To gain a deeper understanding of the COPD molecular signatures, we used Sparse Multiple Canonical Correlation Network (SmCCNet), a recently developed tool that uses sparse multiple canonical correlation analysis, to integrate proteomic and metabolomic data from the blood of 1008 participants of the COPDGene study to identify novel protein-metabolite networks associated with lung function and emphysema. Our aim was to integrate -omic data through SmCCNet to build interpretable networks that could assist in the discovery of novel biomarkers that may have been overlooked in alternative biomarker discovery methods. We found a protein-metabolite network consisting of 13 proteins and 7 metabolites which had a -0.34 correlation (p-value = 2.5 × 10-28) to lung function. We also found a network of 13 proteins and 10 metabolites that had a -0.27 correlation (p-value = 2.6 × 10-17) to percent emphysema. Protein-metabolite networks can provide additional information on the progression of COPD that complements single biomarker or single -omic analyses.Entities:
Keywords: COPD; SmCCNet; metabolomics; multi-omic networks; proteomics
Year: 2020 PMID: 32218378 PMCID: PMC7241079 DOI: 10.3390/metabo10040124
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Cohort characteristics.
| Clinical Variables | Control | COPD | PRISm | Missing Spirometry | Whole Cohort |
|---|---|---|---|---|---|
| Sex, % women | 53.3 | 42.7 | 63 | 50 | 49.1 |
| Race, % white | 88.9 | 94.8 | 91.3 | 100 | 92.1 |
| Former Smoker, % | 73.7 | 77.6 | 70.7 | 66.7 | 75.2 |
| Age (yr) | 64.6 (58.3–71.5) | 71.1 (64.9–76.6) | 67.3 (60.9–72.9) | 72 (68.2–75.3) | 68 (61–74.6) |
| Body mass index (kg/m2) | 28.4 (25.3–32.1) | 27.4 (23.7–31.7) | 30.8 (27.4–37.4) | 27.8 (25.2–30.2) | 28.1 (24.7–32.2) |
| Heart disease comorbidity (%) | 30.3 | 50.2 | 38 | 50 | 40.7 |
| FEV1% predicted | 106.8 (11.7) | 58(23.5) | 70 (7.7) | NA | 77.4 (26.6) * |
| Percent emphysema | 2.2 (2.6) | 12.8 (12.3) | 1.5 (2.6) | 9.2 (11.7) | 7.1 (10.1) ** |
Data is presented as the median (interquartile range) for age and body mass index and mean (standard deviation) for FEV1% predicted and percent emphysema. The whole cohort is the combination of former and current smokers. PRISm: Preserved Ratio Impaired Spirometry defines individuals with a reduced FEV1 but with a preserved FEV1/FVC where FVC is forced vital capacity. GOLD: the global Obstructive Lung Disease system for grading COPD severity: GOLD 1 is early COPD, GOLD 2 is moderate COPD, GOLD 3 is severe COPD, GOLD 4 is very severe COPD, and GOLD 0 is an individual without COPD. Heart disease definition can be found in the Methods section. FEV1%: percent predicted forced expiratory volume in one second. Percent emphysema: percent of lung voxels less than −950 Hounsfield Units on inspiratory CT scans. * In total, 12 subjects were removed because they did not have FEV1 values. ** In total, 60 subjects were removed because they did not have percent emphysema values (12 control, 44 COPD, and 4 PRISm).
Figure 1Identified network after applying sparse multiple canonical correlation network (SmCCNet) to adjusted proteomic and metabolomic data and FEV1%. Proteins are blue nodes and metabolites are red nodes. Grey edges represent a negative correlation between the nodes. Purple edges represent a positive correlation between the nodes. Edge thickness corresponds to the relationships between the nodes based on the canonical weights. Node size corresponds with connectivity. Abbreviations: retinol-binding protein (RBP), repulsive guidance molecule A (RGMA). * Indicates a compound that has not been confirmed based on a standard, but Metabolon is confident in its identity.
Individual network node correlations to FEV1%.
| -Omics Type | Network Node | Correlation to FEV1 (%) |
|---|---|---|
|
| Troponin T | −0.254 |
| Protein S100-A4 | 0.187 | |
| Alpha-(1,3)-fucosyltransferase 5 | −0.175 | |
| Carbonic anhydrase 6 | 0.171 | |
| RGMA | 0.152 | |
| Epidermal growth factor receptor | 0.151 | |
| Hemojuvelin | 0.146 | |
| C-reactive protein | −0.144 | |
| Macrophage mannose receptor 1 | −0.144 | |
| Kallistatin | 0.141 | |
| Angiopoietin-2 | −0.140 | |
| RBP | 0.138 | |
| Complement component C9 | −0.135 | |
|
| Phosphocholine | 0.250 |
| Ergothioneine | 0.220 | |
| 5-hydroxyhexanoate | −0.213 | |
| Palmitoleoylcarnitine (C16:1) | −0.205 | |
| Myristoleoylcarnitine (C14:1) | −0.200 | |
| Cis-4-decenoylcarnitine (C10:1) | −0.199 | |
| (N(1) + N(8))-acetylspermidine | −0.184 |
Pearson correlations between FEV1% and individual metabolites and proteins in the identified network associated with FEV1%.
Figure 2Identified network after applying SmCCNet to adjusted proteomic and metabolomic data and percent emphysema. Proteins are blue nodes and metabolites are red nodes. Grey edges represent a negative correlation between the nodes. Purple edges represent a positive correlation between the nodes. Edge thickness corresponds to the relationships between the nodes based on the canonical weights. Node size corresponds with connectivity. Abbreviations: insulin-like growth factor-binding protein 2 (IGFBP-2), quinone oxidoreductase-like protein 1 (QORL1).
Individual network node correlations to percent emphysema.
| -Omics Type | Network Node | Correlation to Percent Emphysema |
|---|---|---|
| Proteins | Troponin T | 0.197 |
| Leptin | −0.169 | |
| Glucagon | −0.163 | |
| Growth hormone receptor | −0.161 | |
| Proto-oncogene tyrosine-protein kinase receptor Ret | −0.146 | |
| Chordin-like protein 1 | 0.143 | |
| Hemojuvelin | −0.142 | |
| Sex hormone-binding globulin | 0.139 | |
| Aminoacylase-1 | −0.138 | |
| Adiponectin | 0.137 | |
| Apolipoprotein E | −0.130 | |
| IGFBP-2 | 0.119 | |
| QORL1 | −0.106 | |
| Metabolites | 1-stearoyl-2-linoleoyl-GPI (18:0/18:2) | −0.209 |
| androsterone glucuronide | −0.206 | |
| 1-stearoyl-2-docosahexaenoyl-GPE (18:0/22:6) | −0.200 | |
| 1-palmitoyl-2-docosahexaenoyl-GPE (16:0/22:6) | −0.188 | |
| 1-palmitoyl-2-linoleoyl-GPI (16:0/18:2) | −0.187 | |
| 1-ribosyl-imidazoleacetate | −0.173 | |
| Valine | −0.168 | |
| palmitoyl-linoleoyl-glycerol (16:0/18:2) [ | −0.166 | |
| 1-stearoyl-2-arachidonoyl-GPI (18:0/20:4) | −0.161 | |
| Glutamate | −0.151 |
Pearson correlations between percent emphysema and individual metabolites and proteins in identified network associated with percent emphysema.
Figure 3Trends of network first principal component (PC1) with disease severity. Trends within the network associated with FEV1% were analyzed by dividing subjects into control, moderate COPD (Gold = 1 or 2), and severe COPD (GOLD = 3 or 4) groups. There were significance differences between all three groups (p-value < 0.005), with higher PC1 in the severe COPD group versus the moderate COPD group versus the control (A). Subjects were also divided by emphysema severity to analyze trends within the network associated with percent emphysema (B). Only subjects that had more than 20% emphysema had a significantly different PC1 than the other three groups (p-value = 0.00066).