| Literature DB >> 31644575 |
Scott C Ritchie1,2,3, Johannes Kettunen4,5,6, Marta Brozynska1,2, Artika P Nath1,2, Aki S Havulinna5,7, Satu Männistö5, Markus Perola5,7, Veikko Salomaa5, Mika Ala-Korpela4,6,8,9,10,11, Gad Abraham1,2,3,12, Peter Würtz13,14, Michael Inouye1,2,3,12,15.
Abstract
BACKGROUND: GlycA is a nuclear magnetic resonance (NMR) spectroscopy biomarker that predicts risk of disease from myriad causes. It is heterogeneous; arising from five circulating glycoproteins with dynamic concentrations: alpha-1 antitrypsin (AAT), alpha-1-acid glycoprotein (AGP), haptoglobin (HP), transferrin (TF), and alpha-1-antichymotrypsin (AACT). The contributions of each glycoprotein to the disease and mortality risks predicted by GlycA remain unknown.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31644575 PMCID: PMC6808431 DOI: 10.1371/journal.pone.0223692
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Relationship between GlycA and its constituent glycoproteins.
A) Heatmap of the Pearson correlation between GlycA, AAT, AGP, HP, and TF in the 626 DILGOM07 participants with matched NMR metabolite measurements and glycoprotein assay data after log transformation and standardisation. Rows and columns have been ordered in decreasing order of correlation coefficient with GlycA. B) Heatmap of the log transformed and standardised concentrations of GlycA and each glycoprotein. Columns correspond to DILGOM07 participants, which have been hierarchically clustered (average linkage) based on their Euclidean distance calculated on their GlycA and glycoprotein measurements. Rows are ordered as in panel A.
Fig 2Comparison of imputation models to glycoprotein immunoassays in the 626 DILGOM07 participants with matched glycoprotein assay and metabolite quantification by NMR metabolomics.
A) Comparison of the imputed glycoprotein levels (y-axes) to the immunoassayed glycoprotein levels (x-axes) after log transformation and standardisation. The r2 value indicates the proportion of variance in the assayed glycoprotein explained by the respective imputation models. B) Boxplots of the Spearman correlation between the imputed and observed concentrations observed in the 10-fold cross validation procedure used for model training. Red triangles show the Spearman correlation between the predicted and observed concentrations in panel A (detailed in S2 Table).
Cohort characteristics.
| DILGOM07 (Model training dataset) | DILGOM07 | FINRISK97 | |
|---|---|---|---|
| 2007 | 2007 | 1997 | |
| 626 | 4,540 | 7,321 | |
| 328 (53%) | 2,387 (53%) | 3,644 (50%) | |
| 53 (25–74) | 52 (25–74) | 48 (25–74) | |
| 8 years | 8 years | 8 years | |
| 26.80 ± 4.66 | 27.2 ± 4.8 | 26.6 ± 4.5 | |
| 1.30 ± 0.18 | 1.30 ± 0.20 | 1.41 ± 0.25 | |
| 1.19 ± 0.20 (N = 615) | 1.19 ± 0.20 (N = 626) | - | |
| 789 ± 203 (N = 615) | 793 ± 205 (N = 626) | - | |
| 1.09 ± 0.49 (N = 614) | 1.10 ± 0.50 (N = 622) | - | |
| 2.65 ± 0.38 (N = 615) | 2.66 ± 0.38 (N = 626) | - | |
| 1.18 ± 0.11 (N = 615) | 1.16 ± 0.09 (N = 4,496) | 1.29 ± 0.11 (N = 7,246) | |
| 779 ± 145 (N = 615) | 786 ± 142 (N = 4,474) | 832 ± 178 (N = 7,151) | |
| 1.04 ± 0.40 (N = 614) | 1.00 ± 0.33 (N = 4,491) | 1.14 ± 0.46 (N = 7,194) | |
| 2.63 ± 0.10 (N = 615) | - | - | |
Data are reported as the mean ± standard deviation (s.d.) unless otherwise indicated.
Fig 3Glycoprotein associated risks of disease and mortality.
Comparison of Cox proportional hazard ratios (triangles) for the first diagnosis occurrence (hospitalisation or mortality) conferred per standard deviation increase of AAT, HP, AGP, or GlycA in inverse-variance weighted fixed effects meta-analysis of DILGOM07 and FINRISK97. Bars around each hazard ratio indicate the 95% confidence interval. Diagnosis data were analysed for a total of 351 outcomes with >20 events in both DILGOM07 and FINRISK97 over a matched 8-year follow-up period. Models were fit using age as the time scale and adjusting for sex, smoking status, BMI, blood pressure, alcohol consumption, prevalent disease prior to baseline (Methods), and previously identified biomarkers for 5-year risk of all-cause mortality (citrate, albumin, and VLDL particle size). Only outcomes with a significant and replicable association with at least one of AAT, HP, or AGP are shown (Storey-Tibshirani FDR adjusted P-value < 0.05/3, adjusting for the three glycoproteins, in DILGOM07, FINRISK97, and meta-analysis). Associations which were significant and replicable are shown with solid hazard ratios and 95% confidence intervals. The alphanumeric codes in the square brackets indicate the ICD10 code or ICD10 disease group for each diagnosis. The number of events in DILGOM07 and FINRISK97 are shown to the left of each hazard ratio for each outcome. Different numbers of events for the same outcome between biomarkers arise from differences in the number of samples for which each glycoprotein was successfully imputed (Methods). Hazard ratios fit separately in DILGOM07 and FINRISK97 along with comparison to the hazard ratios calculated from the immunoassayed AAT, HP, and AGP measurements can be found in S2 Fig. Hazard ratios for all tested outcomes are detailed in S3 Table.
Fig 4Comparison of biomarkers across all outcomes in meta-analysis of DILGOM07 and FINRISK97.
A) Quantile-Quantile plots of distributions of hazard ratio estimate P-values (y-axis) compared to distribution of expected P-values under the null hypothesis that the corresponding biomarker is not associated with any outcome (x-axis). Hazard ratio estimate P-values are shown after adjustment for multiple testing using the Storey-Tibshirani FDR method. The dashed line indicates the location where p-values would fall if the observed distribution was identical to the null distribution. Points above the red dashed line indicated hazard ratios with FDR adjusted P < 0.05 in the meta-analysis, while points above the blue dashed line indicate hazard ratios with FDR adjusted P < 0.05/3 in the meta-analysis. B) Density plots comparing each biomarker’s distribution of hazard ratio standard errors across all outcomes. C) Density plots comparing each biomarker’s distribution of hazard ratios across all outcomes.
Highlighted gene sets significantly enriched for genes associated with AAT.
| Collection | Gene set | Size | NES | FDR |
|---|---|---|---|---|
| Hallmark | Reactive oxygen species pathway | 43 | 2.21 | 0.002 |
| Hallmark | TNFa signaling via NFkB | 194 | 1.92 | 0.01 |
| Hallmark | PI3K/AKT/mTOR signaling | 101 | 1.88 | 0.02 |
| Hallmark | IL6/JAK/STAT3 signaling | 81 | 1.94 | 0.02 |
| Hallmark | Apoptosis | 154 | 1.85 | 0.02 |
| KEGG | Toll-like receptor signaling pathway | 96 | 2.05 | 0.04 |
| Reactome | Toll receptor cascades | 102 | 1.98 | 0.04 |
| GO:BP | Cytokine production involved in immune response | 17 | 2.24 | 0.006 |
| GO:BP | T cell differentiation involved in immune response | 28 | 1.96 | 0.03 |
| GO:BP | Antimicrobial humoral response | 43 | 1.97 | 0.03 |
| GO:BP | Defense response to fungus | 35 | 1.93 | 0.04 |
| GO:BP | Regulation of innate immune response | 325 | 1.86 | 0.05 |
| GO:BP | Phagocytosis engulfment | 17 | 1.87 | 0.05 |
| GO:BP | Antigen processing and presentation of peptide antigen via MHC class I | 86 | 1.86 | 0.05 |
| GO:BP | Negative regulation of viral process | 84 | 1.85 | 0.05 |
| GO:BP | Negative regulation of immune response | 113 | 1.85 | 0.05 |
A selection of the gene sets that were significantly enriched for AAT-associated differential expression (Methods). See S5 Table for a full listing of all 139 gene sets significantly enriched for AAT associated genes. Gene sets shown here were selected to highlight the association between elevated AAT and increase expression of diverse immune response pathways. A gene set was considered significantly enriched for AAT associated genes if its Benjamini-Hochberg FDR adjusted permutation test P-value for enrichment was < 0.05 (FDR correction performed within each gene set collection separately). The tested gene set collections included Hallmark pathways, KEGG pathways, Reactome pathways, GO biological process (GO:BP) terms, GO molecular function (GO:MF) terms, and GO cellular compartments (GO:CC). Size: number of genes on the Illumina HT-12 array annotated for the corresponding gene set. NES: enrichment score normalized by gene set size in a permutation procedure (Methods).