| Literature DB >> 25147954 |
Stefan Enroth1, Asa Johansson2, Sofia Bosdotter Enroth3, Ulf Gyllensten1.
Abstract
Ideal biomarkers used for disease diagnosis should display deviating levels in affected individuals only and be robust to factors unrelated to the disease. Here we show the impact of genetic, clinical and lifestyle factors on circulating levels of 92 protein biomarkers for cancer and inflammation, using a population-based cohort of 1,005 individuals. For 75% of the biomarkers, the levels are significantly heritable and genome-wide association studies identifies 16 novel loci and replicate 2 previously known loci with strong effects on one or several of the biomarkers with P-values down to 4.4 × 10(-58). Integrative analysis attributes as much as 56.3% of the observed variance to non-disease factors. We propose that information on the biomarker-specific profile of major genetic, clinical and lifestyle factors should be used to establish personalized clinical cutoffs, and that this would increase the sensitivity of using biomarkers for prediction of clinical end points.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25147954 PMCID: PMC4143927 DOI: 10.1038/ncomms5684
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Characteristics of the PEA measurements.
(a) Intensities of PEA values and proportion of proteins and individuals above detection limit. In the heatmap, individuals are in columns and proteins are in rows. Heatmap colours represent ddCq-values ranging from low (blue) to high (yellow) with measurements below detection limit coded white. (b) Significant covariates in relation to each protein. Covariates are listed from the upper right part of the circle (12 o'clock to 4) and connections illustrate significant (P-value <0.05, Bonferroni adjusted) contributions to PEA variance. (c) PEA to PEA correlations, coloured connections represent a correlation coefficient (R2) greater than 0.5. The width of the connection reflects the magnitude of the squared correlation coefficients. All correlations coefficients (R) were positive.
List of significant covariates.
| Age | 24 | Up: ADM, CTSD, CXCL9, CXCL10, CXCL11, ErbB4, FAS, Flt3L, GDF-15, WFDC2, HGF, IL-8, IL-2-RA, KLK6, hK11, MCP-1, OPG, PSA, TF, TNF-R1, TNF-R2 U-PAR, VEGF-ADown: E-selectin |
| Sex (female) | 4 | Up: GH, FABP4Down: E-selectin, TR-AP |
| Systolic blood pressure | 22 | Up: ADM, CTSD CXCL9, CXCL10, ErbB4, FAS, Flt3L, hK11, GDF-15, WFDC2, HGF, IL-8, TGF-beta-1, MCP-1, OPG, PRSS8, PSA, TF, TNF-R1, TNF-R2, U-PAR, VEGF-A |
| Length | 11 | Down: ADM, BAFF, CXCL9, FABP4, Flt3L, FR-alpha, GDF-15, GH, WFDC2, OPG, U-PAR |
| Weight | 21 | Up: ADM, CTSD, CSF-1, CXCL10, CPI-B, E-selectin, ErbB2, FABP4, FAS, GDF-15, HGF, IL-1ra, IL-6, MCP-1, OPG, PRSS8, TNF-R1, TNF-R2, U-PAR, VEGF-ADown: GH |
| Waist | 3 | Up: GDF-15, WFDC2, OPG |
| TLS (147 ppl) | 4 | Down: FABP4, GH IL-6, IL-2RA |
| KA06 (720 ppl) | 17 | Up: CCL21, HGF, MCP-1, MK, PECAM-1Down: CASP-3, CD69, CXCL11, EGF, IL-1ra, IL-2-RA, hK11, MIC-A, OPG, TNFSF14, TNF-R1, TNF-R2 |
| Smoking (130 ppl) | 2 | Up: WFDC2Down: IL-12 |
| A10AD (3 ppl) | 1 | Up: TGF-alpha |
| C03DA (2 ppl) | 4 | Up: CXCL13, MIC-A, U-PAR, TNF-R2 |
| C08CA (54 ppl) | 1 | Up: IL-6 |
| N05CD (2 ppl) | 1 | Up: PRL |
| N05CF (6 ppl) | 2 | Up: FAS, IL-1ra |
| R03BA (26 ppl) | 2 | Down: Basigin, HGF-receptor |
| R03AC (13 ppl) | 1 | Down: VEGF-D |
| R03DC (3 ppl) | 1 | Down: VEGF-D |
| ABO blood group | 3 | E-selectin, PECAM-1, TIE2 |
TLS, traditional lifestyle.
Direction of correlations was calculated using PEA-values without any additional covariate correction having been carried out. A10AD (insulins and analogues for injection, intermediate-acting combined with fast-acting), C03DA (aldosterone antagonists), C08CA (dihydropyridine derivatives), N05CD (benzodiazepine derivatives), N05CF (benzodiazepine-related drugs), R03BA (glucocorticoids), R03AC (selective β-2-adrenoreceptor agonists), R03DC (leukotriene receptor antagonists).
GWAS results.
| IL-6RA | 0.66 | 653 | 22 | 1.1 × 10−40 | 27.3 | 0.50 | 317 | 10 | 1.1 × 10−20 | 27.4 | 0.68 | 22 | 4.4 × 10−58 | 26.6 | 1.06 |
| CXCL10 | 0.47 | 641 | 24 | 1.4 × 10−31 | 21.3 | 0.39 | 311 | 1 | 2.3 × 10−06 | 7.2 | 0.46 | 16 | 6.8 × 10−37 | 16.9 | 0.94 |
| CCL24 | 0.71 | 653 | 17 | 2.3 × 10−28 | 18.7 | 0.79 | 317 | 4 | 5.3 × 10−11 | 13.6 | 0.78 | 30 | 2.0 × 10−36 | 16.4 | 1.09 |
| MIC-A | 0.40 | 363 | 25 | 2.1 × 10−17 | 19.8 | 0.54 | 174 | 14 | 3.7 × 10−04 | 7.3 | 0.52 | 19 | 5.3 × 10−16 | 12.2 | 1.25 |
| CD40-L | 0.27 | 640 | 14 | 1.3 × 10−16 | 10.7 | 0.39 | 315 | 13 | 2.1 × 10−10 | 12.8 | 0.33 | 22 | 1.1 × 10−25 | 11.5 | 1.13 |
| CXCL5 | 0.41 | 653 | 8 | 7.7 × 10−16 | 9.9 | 0.62 | 317 | 5 | 1.5 × 10−10 | 12.9 | 0.51 | 8 | 4.3 × 10−26 | 11.5 | 1.06 |
| hK11 | 0.33 | 628 | 5 | 3.7 × 10−15 | 9.8 | 0.08 | 310 | 5 | 4.3 × 10−04 | 3.4 | 0.24 | 15 | 5.3 × 10−18 | 7.9 | 0.98 |
| Ep-CAM | 0.54 | 653 | 18 | 2.1 × 10−14 | 5.2 | 0.50 | 317 | 8 | 1.4 × 10−09 | 11.6 | 0.57 | 24 | 6.7 × 10−16 | 6.7 | 0.97 |
| IL-17RB | 0.51 | 415 | 1 | 4.7 × 10−13 | 12.6 | 0.30 | 199 | 1 | 1.1 × 10−07 | 14.2 | 0.48 | 2 | 1.7 × 10−18 | 12.5 | 0.96 |
| IL-12B | 0.38 | 652 | 1 | 4.7 × 10−12 | 7.3 | 0.64 | 317 | 1 | 1.3 × 10−06 | 7.4 | 0.43 | 4 | 9.0 × 10−17 | 7.1 | 0.93 |
| VEGF-D | 0.29 | 652 | 21 | 8.7 × 10−10 | 5.8 | 0.39 | 317 | 21 | 6.9 × 10−08 | 8.8 | 0.36 | 36 | 1.1 × 10−15 | 6.6 | 1.01 |
| E-selectin | 0.44 | 639 | 4 | 8.9 × 10−10 | 5.7 | 0.70 | 306 | 4 | 1.3 × 10−08 | 10.6 | 0.56 | 6 | 5.0 × 10−16 | 7.0 | 1.01 |
| MIA | 0.25 | 648 | 4 | 1.6 × 10−09 | 5.6 | 0.55 | 316 | 4 | 2.0 × 10−09 | 11.4 | 0.37 | 7 | 6.8 × 10−17 | 7.2 | 1.05 |
| MPO | 0.43 | 646 | 1 | 3.8 × 10−09 | 0.22 | 317 | 0 | 9.0 × 10−01 | 0.39 | 0 | 6.1 × 10−07 | 1.05 | |||
| CCL19 | 0.32 | 653 | 10 | 1.3 × 10−08 | 5.0 | 0.33 | 317 | 10 | 2.0 × 10−06 | 7.1 | 0.32 | 15 | 2.6 × 10−13 | 5.5 | 0.92 |
GWAS, genome-wide association study; SNP, single-nulecotide polymorphism; Var Expl, variance explained.
P-values were calculated from 1df Wald statistics χ2-values.
*Heritability estimate.
†Fraction of variance explained in the adjusted and transformed phenotype by the top-ranking SNP (SNP with lowest P-value in the combined analysis).
‡Estimation of the inflation factor for the resulting distribution of P-values.
Figure 2Manhattan plots of GWAS results.
(a) IL-6RA (b) CXCL5 (c) CCL24 and (d) E-selectin. X axis labels refer to human chromosomes listed 1–22 and X. P-values were calculated from 1df Wald statistics χ2 values using 971 individuals.
Location and annotation of top GWAS hits.
| IL-6RA | rs4129267 | 4.39 × 10−58 | 0.84 (0.052) | T (C) | 1:154426264 | Intronic | |
| CXCL10 | rs11548618 | 6.78 × 10−37 | 1.80 (0.14) | A (G) | 4:76943947 | Nonsynonymous | |
| CCL24 | rs6946822 | 2.02 × 10−36 | −0.62 (0.049) | T (C) | 7:75479448 | Intergenic, 36 kb upstream | |
| rs11465293 | 7.95 × 10−13 | −0.63 (0.088) | A (G) | 7:75442723 | Nonsynonymous | ||
| MIC-A | rs3869132 | 5.33 × 10−16 | −0.73 (0.090) | A (G) | 6:31410948 | Intergenic, 28 kb downstream | |
| rs2263316 | 1.02 × 10−08 | 0.48 (0.083) | G (A) | 6:31421297 | Intergenic, 38 kb downstream | ||
| CD40-L | rs148594123 | 1.07 × 10−25 | −0.96 (0.091) | A (G) | X:135741443 | Nonsynonymous | |
| CXCL5 | rs425535 | 4.27 × 10−26 | −0.86 (0.081) | T (T) | 4:74863997 | Synonymous | |
| rs2472649 | 3.57 × 10−21 | −0.70 (0.074) | A (A) | 4:74857708 | Intergenic, 4 kb downstream | ||
| rs2393967 | 4.54 × 10−08 | 0.30 (0.052) | C (A) | 10:65133156 | Intronic | ||
| hK11 | rs117268623 | 5.25 × 10−18 | −1.48 (0.17) | T (C) | 19:51527970 | Nonsynonymous | |
| Ep-CAM | rs201314303 | 6.74 × 10−16 | −2.48 (0.31) | G (C) | 2:47612302 | Intronic | |
| rs56398830 | 1.26 × 10−15 | −0.94 (0.12) | A (G) | 13:103701690 | Nonsynonymous | ||
| IL-17RB | rs6801605 | 1.75 × 10−18 | −0.56 (0.064) | A (G) | 3:53876218 | Intronic | |
| 4 kb upstream | |||||||
| IL-12 | rs10045431 | 8.99 × 10−17 | 0.47 (0.057) | A (A) | 5:158814533 | Intergenic, 57 kb upstream | |
| VEGF-D | rs188779336 | 1.11 × 10−15 | −1.58 (0.20) | G (C) | X:15308292 | Intronic | |
| rs146086561 | 1.81 × 10−15 | −1.57 (0.20) | T (C) | X:15365438 | Nonsynonymous | ||
| E-selectin | rs507666 | 5.01 × 10−16 | −0.55 (0.067) | A (G) | 9:136149399 | Intronic | |
| MIA | rs2230694 | 6.83 × 10−17 | 0.66 (0.079) | G (A) | 19:41263403 | Synonymous | |
| rs2607426 | 1.13 × 10−16 | 0.65 (0.079) | G (A) | 19:41274713 | Intergenic, 6k upstream | ||
| rs2233154 | 1.13 × 10−16 | 0.65 (0.079) | T (C) | 19:41281346 | UTR5 | ||
| rs2233159 | 1.07 × 10−16 | −0.65 (0.079) | C (C) | 19:41283365 | UTR3 | ||
| CCL19 | rs7775228 | 2.63 × 10−13 | 0.51 (0.069) | C (T) | 6:32658079 | Intergenic, 24 kb upstream |
GWAS, genome-wide association study; SNP, single-nulecotide polymorphism.
P-values were calculated from 1df Wald statistics χ2-values.
*In hg19 coordinates.
†Independent genome-wide significant loci as per conditional analysis on top-snp, reported P-values after conditioning on the top-hit.
‡rs11465293 was not discovered in the unconditional discovery-replication analysis and was subsequently rerun in a conditional discovery-replication analysis resulting in the P-values 3.9 × 10−9 and 6.7 × 10−5 for the discovery and replication cohorts, respectively.
§Imputed.
||When the top SNP(s) was imputed, the top ranking genotyped SNP was also included in the table.
¶In perfect LD (R2=1) with top-snp in our cohort.
Figure 3Covariates and protein biomarkers.
(a) Variance explained by each of the covariates for the set of 77 biomarkers with measurable variability with the 11 most important covariates coloured. The combined effect of the remaining covariates is shown in grey, assuming independence in effect between covariates. (b) The percent of the variance explained by the full set of covariates studied for the 77 proteins, using a combined model. (c) Abundance of CXCL10, expressed as ddCq-values, in relation to age when stratified by genotype at rs11548618; AA (grey), AB (red) and BB (blue). Shadowed areas represent the 95% confidence interval in a linear model predicting ddCq from age. (d) Fitted normal distribution densities based on mean and standard deviation in ddCq-values for CXCL10, split by the rs11548618 genotype. (e) Fitted normal distribution densities based on mean and standard deviation in ddCq-values for CCL24 split by the rs6946822 genotype. (f) Fitted normal distribution densities based on mean and standard deviation in ddCq-values for IL-6 split by use of hypertension medications. Only groups where there are at least 10 individuals are shown. C07AB: β-blocking agents, selective. C08CA: dihydropyridine derivatives. C09AA: ACE inhibitors, plain. (d–f) Interquartile ranges indicated with coloured boxes above the curves.