| Literature DB >> 33987620 |
Lucas A Gillenwater1, Katherine A Pratte1, Brian D Hobbs2,3, Michael H Cho2,3, Yonghua Zhuang4, Eitan Halper-Stromberg5, Charmion Cruickshank-Quinn6, Nichole Reisdorph7, Irina Petrache1,8, Wassim W Labaki9, Wanda K O'Neal10, Victor E Ortega11, Dean P Jones12, Karan Uppal12, Sean Jacobson1, Gregory Michelotti13, Christine H Wendt14, Katerina J Kechris4, Russell P Bowler1,8.
Abstract
Background: Small studies have recently suggested that there are specific plasma metabolic signatures in chronic obstructive pulmonary disease (COPD), but there have been no large comprehensive study of metabolomic signatures in COPD that also integrate genetic variants. Materials andEntities:
Keywords: chronic obstructive pulmonary disease; integrated omics; metabolomic quantitative trait analysis; metabolomics; network analysis
Year: 2020 PMID: 33987620 PMCID: PMC8109053 DOI: 10.1089/nsm.2020.0009
Source DB: PubMed Journal: Netw Syst Med ISSN: 2690-5949
Demographics of Discovery cohort
| Total | PRISm | Control | COPD | Missing | p | |
|---|---|---|---|---|---|---|
| No. of participants (%) | 957 | 85 (8.9) | 390 (40.8) | 468 (48.9) | 14 (1.4) | |
| Age[ | 68.3 (8.4) | 66.7 (7.3) | 65.9 (8.5) | 70.5 (8.0) | 70.7 (5.8) | <0.0001 |
| Male sex (%) | 490 (51.2) | 31 (36.5) | 184 (47.2) | 268 (57.3) | 7 (50.0) | 0.0002 |
| BMI (%) | 29.1 (6.2) | 32.6 (7.7) | 29.3 (5.6) | 28.2 (6.1) | 27.9 (5.3) | <0.0001 |
| Current smoker (%) | 204 (21.3) | 24 (28.2) | 78 (20.0) | 98 (20.9) | 4 (28.6) | 0.1914 |
| Smoking pack-years[ | 46.0 (24.9) | 48.6 (24.3) | 36.1 (19.5) | 53.6 (26.2) | 51.7 (22.2) | <0.001 |
| FEV1pp_utah[ | 76.6 (26.5) | 70.2 (7.4) | 99.2 (11.5) | 58.9 (23.5) | NA | NA |
| FEV1/FVC[ | 0.7 (0.2) | 0.8 (0.0) | 0.8 (0.0) | 0.5 (0.1) | NA | NA |
| Percent emphysema[ | 7.3 (10.2) | 1.6 (2.5) | 2.2 (2.6) | 12.9 (12.2) | 9.2 (11.7) | <0.0001 |
Chi-square tests were used to test for differences between groups in binary variables. One-way ANOVA tests were performed to test for differences between groups in continuous variables.
PRISm, Preserved Ratio Impaired Spirometry[23]; COPD is defined by GOLD score ≥1; missing, 14 subjects were deemed ineligible for spirometry and thus did not have a defined GOLD status. These subjects were still included in analyses with other COPD phenotypes and the met-QTL analysis.
BMI, body mass index (kg/m2); FEV1/FVC, post-bronchodilator forced expiratory volume at one second/forced vital capacity; FEV1pp, FEV1 percent predicted.
Mean and standard deviations provided.
COPD, chronic obstructive pulmonary disease.
Demographics of replication cohorts
| COPDGene—Emory | SPIROMICS—Metabolon | SPIROMICS—UC | p | |
|---|---|---|---|---|
| No. of participants | 271 | 445 | 76 | NA |
| Age[ | 67.3 (8.4) | 65.3 (8) | 61.6 (8) | <0.001 |
| Male sex (%) | 127 (46.9) | 244 (54.8) | 40 (52.6) | 0.1164 |
| BMI (%) | 28.7 (5.8) | 28.1 (4.9) | 28.3 (4.8) | 0.3675 |
| Current smoker (%) | 57 (21.0) | 116 (26.4) | 19 (25.7) | 0.2665 |
| Smoking pack-years[ | 43.9 (23.3) | 47.8 (31.6) | 43.2 (24) | 0.1362 |
| FEV1pp_utah[ | 77.1 (25.3) | 79.4 (23.5) | 89.3 (20.8) | 0.0004 |
| FEV1/FVC[ | 0.7 (0.1) | 0.6 (0.1) | 0.7 (0.1) | 0.0015 |
| Percent emphysema[ | 6.9 (9.7) | 6.1 (8.2) | 4 (4.8) | 0.038 |
Chi-square tests were used to test for differences between groups in binary variables. One-way ANOVA tests were performed to test for differences between groups in continuous variables.
Mean and standard deviation provided unless otherwise noted.
FIG. 1.Metabolite associations with COPD. (A) Heat map of signed p-values. Metabolites are organized by super pathway. Red intensity indicates positive direction of effect, while blue intensity indicates negative. Only select metabolites most significantly associated are labeled. (B) Scatter plot of 1-stearoyl-2-linoleoyl-GPI (18:0/18:2) abundance by FEV1/FVC ratio. (C) Scatter plot of phosphocholine abundance by FEV1pp. (D) Bee swarm of ergothioneine abundance by GOLD stage. Ergothioneine was one of the topmost associated metabolites with all airflow obstruction phenotypes. Metabolites are color coded by Super Pathway designation. Metabolite abundances are inverse normal transformed. *Indicates compounds that have not been officially confirmed based on a standard, but Metabolon is confident in its identity. COPD, chronic obstructive pulmonary disease; FEV1, forced expiratory volume at one second; FEV1pp, FEV1 percent predicted; FVC, forced vital capacity; GPI, glycophosphatidylinositol.
FIG. 2.Genome-wide associations between SNPs and metabolites. (A) Discovery mWAS Manhattan plot showing −log10 p-values from mWAS tests. The blue and red dashed lines indicate false discovery rate and Bonferroni significance, respectively. Loci in which >20% of the metabolite variance is explained by a single SNP are labeled by nearest gene and metabolites affected. Metabolite text colors coded by Super Pathway. (B–D) Bee swarms of inverse normal transformed metabolite abundances by genotype for metabolites of different subpathways with the greatest variance explained by one SNP. Overlayed box plots represent the median and interquartile range of transformed metabolite abundance. (E) Bar plot of the percent variation for metabolites explained by clinical (green), top mQTL SNP (blue), the second independent mQTL SNP (orange), and any more independent mQTL SNPs (red). The gray indicates unknown variance. Clinical factors include age, sex, BMI, smoking status, smoking pack-years, and clinical center. BMI, body mass index; mWAS, metabolome-wide association study; SNPs, single-nucleotide polymorphisms.
Metabolite single-nucleotide polymorphisms explaining >15% of the variation in blood
| Metabolite | Super pathway | Subpathway | Variance explained | rsID | Consequence | Closest gene | Effect allele | Other allele | EAF | p |
|---|---|---|---|---|---|---|---|---|---|---|
| Amino acid | Lysine metabolism | 0.50 | rs2147896 | Missense | A | G | 0.64 | 2.68×10−146 | ||
| Amino acid | Lysine metabolism | 0.48 | rs2147896 | Missense | A | G | 0.64 | 1.98×10−135 | ||
| X - 12112 | Unannotated | Unannotated | 0.44 | rs7905265 | Downstream gene | C | G | 0.64 | 6.00×10−119 | |
| Amino acid | Lysine metabolism | 0.38 | rs10182082 | Intron | G | C | 0.22 | 6.07×10−38 | ||
| 0.15 | rs7905265 | Downstream gene | C | G | 0.64 | 1.86×10−97 | ||||
| Amino acid | Lysine metabolism | 0.34 | rs7905265 | Downstream gene | C | G | 0.64 | 5.04×10−88 | ||
| 0.25 | rs10182082 | Intron | G | C | 0.22 | 6.51×10−65 | ||||
| Ethylmalonate | Amino acid | Leucine, isoleucine and valine metabolism | 0.30 | rs1799958 | Missense | A | G | 0.25 | 5.87×10−79 | |
| 0.01 | rs6490297 | Intergenic | C | T | 0.27 | 9.09×10−28 | ||||
| Amino acid | Urea cycle; arginine and proline metabolism | 0.27 | rs10168931 | Intron | A | G | 0.23 | 5.63×10−70 | ||
| Amino acid | Urea cycle; arginine and proline metabolism | 0.26 | rs10168931 | Intron | G | A | 0.77 | 2.02×10−64 | ||
| Amino acid | Histidine metabolism | 0.25 | rs10206899 | Intron | C | T | 0.23 | 1.76×10−66 | ||
| Butyrylcarnitine (C4) | Lipid | Fatty acid metabolism (also BCAA metabolism) | 0.23 | rs1799958 | Missense | A | G | 0.25 | 1.75×10−59 | |
| Lipid | Primary bile acid metabolism | 0.22 | rs4149056 | Missense | G | T | 0.16 | 4.51×10−39 | ||
| Xenobiotics | Bacterial/Fungal | 0.21 | rs2147896 | Missense | A | G | 0.64 | 6.75×10−54 | ||
| Deoxycholic acid glucuronide | Lipid | Secondary bile acid metabolism | 0.19 | rs34594059 | Intron | C | C | 0.36 | 5.79×10−45 | |
| X - 19141 | Unannotated | Unannotated | 0.17 | rs34436963 | 3′ UTR | G | A | 0.64 | 2.12×10−43 | |
| 0.06 | rs1165196 | Missense | G | A | 0.45 | 3.08×10−17 | ||||
| 1-Palmitoyl-GPI[ | Lipid | Lysophospholipid | 0.16 | rs102275 | Intron | T | C | 0.64 | 2.38×10−41 | |
| 5alpha-androstan-3alpha,17alpha-diol monosulfate | Lipid | Androgenic steroid | 0.16 | rs1495741 | Intergenic | G | A | 0.23 | 4.41×10 −41 | |
| 0.01 | rs1041983 | Synonymous | C | T | 0.67 | 6.28×10−15 | ||||
| Glycochenodeoxycholate 3-sulfate | Lipid | Primary bile metabolism | 0.16 | rs4149056 | Missense | T | T | 0.16 | 4.51×10−39 | |
| 0.01 | rs11045913 | Downstream gene | G | A | 0.56 | 3.07×10−15 | ||||
| Decadienedioic acid (C10:2-DC)[ | Lipid | Fatty acid, dicarboxylate | 0.14 | rs11621061 | Intron | C | T | 0.76 | 1.89×10−28 | |
| 0.04 | rs58231493 | Upstream gene | C | T | 0.56 | 8.80×10−32 | ||||
| 3-Amino-2-piperidone | Amino acid | Urea cycle; arginine and proline metabolism | 0.13 | rs37369 | Missense | T | C | 0.08 | 6.06×10−33 | |
| 0.04 | rs16899974 | Missense | A | C | 0.22 | 9.31×10−17 |
Metabolite, metabolite annotation; super pathway, metabolite class annotation; subpathway, within class pathway annotation; variance explained, metabolite variance explained by genotype variation in mQTL SNP.
Total variance explained by mQTLs—metabolite variance explained by variation in all mQTL SNPs.
p-Value, p-value of regression test with SNP.
Indicates compounds that have not been officially confirmed based on a standard, but Metabolon is confident in its identity.
Indicates a compound for which a standard is not available, but Metabolon is confident in its identity or the information provided.
(#) or [#] indicates a compound that is a structural isomer of another compound in the Metabolon spectral library.
Ancestral allele, major allele; closest gene, closest gene to SNP as mapped in VEP; consequence, VEP annotation of variant; EAF, effect allele frequency in Discovery; effect allele, allele with positive association to metabolite; rsID, reference SNP ID number assigned by NCBI.
BCAA, branched chain amino acid; GPI, glycophosphatidylinositol; SNP, single-nucleotide polymorphism; VEP, Variant Effect Predictor.
FIG. 3.mQTL SNP enrichment analyses. (A) A bar plot showing the percentage by variant annotation of mQTL SNPs (black) and all SNPs tested (black). *Variant annotations significantly enriched in mQTLs. (B) A circular bar plot showing the percentage of each subpathway with at least one independent mQTL. The bars are colored by super pathway and labeled by subpathway, with the total number of metabolites in the subpathway with an mQTL in parentheses. FAM, fatty acid metabolism; Met, metabolism.
FIG. 4.Scatter plot of metabolite variance explained by lead SNP in Discovery and SPIROMICS—Metabolon cohorts. We calculated the variance explained (r[2]) for each lead mQTL SNP and metabolite by cohort. Metabolite colors represent super pathway annotation and are labeled if variance explained by genotype was >0.15 in either cohort.
mQTL replications
| SNPs | CHR | Closest gene(s) | Discovery | COPDGene—Emory[ | SPIROMICS—Metabolon | |
|---|---|---|---|---|---|---|
| rs2147896, rs7905265 | 10 | 506.130672493465_285.4155141 | ||||
| rs174533, rs102275, rs10224, rs1404372161, rs174567, rs1535 | 11 | 1222.85903685705_175.407391, 1589.12673654366_174.9453186, 782.570331871481_178.6310904, 1588.12181759391_175.0049822, 1210.34978948251_175.6062675, 872.538808473057_175.3542714, 831.571639521416_172.2185806, 942.54035577298_179.744523, 1617.16142714192_173.1058804, 875.556039922196_181.877006, 1618.17099209145_172.9865179, 1196.33733718313_175.4123396, 1591.13917344304_174.7156649, 1208.8420355579_175.0746202, 1209.35026653248_175.0413014, 1222.35055488247_174.4656717, 874.553705272313_181.378412, | PE(20:3(8Z11Z14Z)18:1(11Z)); PC(15:020:4(8Z11Z14Z17Z)); 1621.20035693998_171.6141601, 1590.14303149284_177.2413577, 783.572309421383_174.840046, 811.603130219842_173.9750369, 1223.36030483198_172.6915558 | 1-stearoyl-2-linoleoyl-GPI (18:0/18:2), | ||
| rs409170, rs1126464 | 16 | |||||
| rs887829, rs4148324 | 2 | X - 24849, X - 16946, X - 21448, X - 11522, Succinimide, | 167.072710946364_51.47896251, 191.072722146363_54.49274426, 180.080778195961_51.36922788, 283.10777804461_53.32993737, 299.139042143047_56.91349852, 586.274358886281_65.12241616, 2-(3-Carboxy-3-(methylammonio)propyl)- | |||
Bold text indicates replication across cohorts.
COPDGene—Emory metabolites not annotated by xMSAnnotator are reported as mass-to-charge ratio and retention time.
Indicates compounds that have not been officially confirmed based on a standard, but Metabolon is confident in its identity.
(#) or [#] indicates a compound that is a structural isomer of another compound in the Metabolon spectral library.
FIG. 5.Phenotype-driven modules. Cytoscape network representation of metabolite modules significantly associated one or more COPD phenotypes. Circular nodes are nonsignificant, while triangular nodes were significant in univariate analysis. “V” nodes mQTL SNPs. The color corresponds to the phenotype with which the metabolites in the module are associated; red indicates airflow obstruction phenotypes (FEV1/FVC or FEV1pp), purple indicates percent emphysema, green indicates both spirometric phenotypes and percent emphysema, and blue are nonsignificant metabolites.
Module variance explained by genetic and environmental variables
| Phenotype | Metabolite | Module ID | Module beta | Adjusted score | Most significant independent SNP | Consequence | Closest gene | Module first PC variance explained by genetic variants | Module first PC variance explained by covariates | Metabolite variance explained by genetic variants | Metabolite variance explained by covariates |
|---|---|---|---|---|---|---|---|---|---|---|---|
| FEV1pp | 3-Amino-2-piperidone | 2 | −0.005 | 0.006 | rs37369 | Missense variant | 0.8 | 10.8 | 18.9 | 4 | |
| FEV1pp | (S)-a-Amino-omega-caprolactam | 2 | −0.005 | 0.006 | NA | NA | 0.8 | 10.8 | NA | NA | |
| FEV1pp | Tetradecadienedioate (C14:2-DC)[ | 19 | −0.006 | 0.001 | rs11626972 | Upstream gene variant | 13.2 | 3 | 5.5 | 2 | |
| FEV1pp | Decadienedioic acid (C10:2-DC)** | 19 | −0.006 | 0.001 | rs58231493 | Upstream gene variant | 13.2 | 3 | 18.6 | 3.2 | |
| FEV1/FVC | 1-Methylnicotinamide | 3 | 0.734 | 0.029 | rs1495741 | Intergenic variant | 0.2 | 2.8 | 5.8 | 7 | |
| FEV1/FVC | Nicotinamide | 3 | 0.734 | 0.029 | NA | NA | 0.2 | 2.8 | NA | NA | |
| FEV1/FVC | 3 | 0.734 | 0.029 | NA | NA | 0.2 | 2.8 | NA | NA | ||
| FEV1/FVC | 2-Aminobutyrate | 4 | 1.031 | 0.002 | rs10168931 | Intron variant | 0.1 | 1.8 | 8.5 | 2.3 | |
| FEV1/FVC | Gamma-glutamyl-2-aminobutyrate | 4 | 1.031 | 0.002 | NA | NA | 0.1 | 1.8 | NA | NA | |
| FEV1/FVC | Aspartate | 10 | 1.299 | 0 | rs750062 | Upstream gene variant | 0.3 | 17 | 6.6 | 6.5 | |
| FEV1/FVC | Glutamate | 10 | 1.299 | 0 | NA | NA | 0.3 | 17 | NA | NA | |
| FEV1/FVC | Linoleoyl-arachidonoyl-glycerol (18:2/20:4) [2][ | 18 | 0.916 | 0.015 | NA | NA | 3.8 | 4.6 | NA | NA | |
| FEV1/FVC | Oleoyl-arachidonoyl-glycerol (18:1/20:4) [2][ | 18 | 0.916 | 0.015 | rs174567 | Intron variant | 3.8 | 4.6 | 5 | 3.1 |
ModuleID, module ID within phenotype; module beta, beta estimate of change in modules based on 1 unit increase in phenotype; adjusted score, score (p-value) after multiple testing correction; most significant independent SNP, SNP most significantly associated with metabolite. NA, no SNPs significantly associated; consequence, VEP annotation of variant; closest gene, closest gene to SNP as mapped in VEP; module first PC variance explained by genetic variants, adjusted r2 of linear regression model with the first PC of the module and independent mQTL SNPs. Module first PC variance explained by covariates, adjusted r2 of linear regression model with the first PC of the module and covariates (see Materials and Methods section). Metabolite variance explained by genetic variants, adjusted r2 of linear regression model with metabolite and independent mQTL SNPs. Metabolite variance explained by covariates, adjusted r2 of linear regression model with metabolite and covariates (see Materials and Methods section).
Indicates compounds that have not been officially confirmed based on a standard, but Metabolon is confident in its identity; **indicates a compound for which a standard is not available, but Metabolon is confident in its identity or the information provided; (#) or [#] indicates a compound that is a structural isomer of another compound in the Metabolon spectral library.
PC, principal component.