| Literature DB >> 32641083 |
Robert F Hillary1, Daniel Trejo-Banos2, Athanasios Kousathanas2, Daniel L McCartney1, Sarah E Harris3,4, Anna J Stevenson1, Marion Patxot2, Sven Erik Ojavee2, Qian Zhang5, David C Liewald3, Craig W Ritchie6, Kathryn L Evans1, Elliot M Tucker-Drob7,8, Naomi R Wray5, Allan F McRae5, Peter M Visscher5, Ian J Deary3,4, Matthew R Robinson9, Riccardo E Marioni10.
Abstract
BACKGROUND: The molecular factors which control circulating levels of inflammatory proteins are not well understood. Furthermore, association studies between molecular probes and human traits are often performed by linear model-based methods which may fail to account for complex structure and interrelationships within molecular datasets.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32641083 PMCID: PMC7346642 DOI: 10.1186/s13073-020-00754-1
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Genetic architecture of inflammatory protein biomarkers in the Lothian Birth Cohort 1936. a Chromosomal locations of pQTLs concordant between Bayesian penalised and ordinary least squares regression models for genome-wide association studies (n = 13 pQTLs). The x-axis represents the chromosomal location of concordantly identified cis and trans SNPs associated with the levels of Olink® inflammatory proteins. The y-axis represents the position of the gene encoding the associated protein. The sole conditionally significant concordant trans association is annotated. Cis (red circles); trans (blue circles). b Absolute effect size (per standard deviation of difference in protein level per effect allele) of pQTLs versus minor allele frequency. Cis (red circles); trans (blue circles). c Classification of 13 pQTLs by function as defined by functional enrichment analysis in FUMA. d Variance in protein levels explained by pQTLs (estimates from Bayesian penalised regression are displayed)
Fig. 2Variance in circulating inflammatory protein levels explained by common genetic variation. a In this panel, the variance explained (r2) by consensus SNPs (n = 13 SNP, 1 per protein) in the ordinary least squares regression model was compared against the variance explained by the same SNP set identified in the Bayesian penalised regression approach. b The proportion of variance explained in Olink® inflammatory protein levels by common genetic variants genotyped in the LBC1936 participants is shown. Only those proteins which had significant pQTL associations in both the ordinary least squares and Bayesian methods are presented (n = 13). Additionally, the proportion of variance explained attributable to medium effects (prior: variance of 1% explained) and large effects (prior: variance of 10% explained) are demonstrated in purple and green, respectively. Error bars represent 95% credible intervals
Fig. 3Effect of genetic variation on inflammatory protein levels. a Box plot of MCP2 levels as a function of genotype (rs3138036, effect allele: G, other allele: A, beta = − 1.20, se = 0.06). b Box plot of MCP4 levels as a function of genotype (rs14075, effect allele: G, other allele: A, beta = − 0.62, se = 0.05). Centre line of boxplot: median, bounds of box: first and third quartiles
Fig. 4Genomic locations of CpG sites associated with differential inflammatory protein levels. The x-axis represents the chromosomal location of CpG sites associated with the levels of Olink® inflammation biomarkers. The y-axis represents the position of the gene encoding the associated protein. The level of concordance across three models used to perform epigenome-wide association studies is represented by different shape patterns. Those CpG sites (n = 3) which were identified by linear (ordinary least squares), mixed model and Bayesian penalised regression models, and passed a Bonferroni-corrected significance threshold are represented by diamonds and annotated. Three proteins (CXCL9, CXCL10 and CXCL11) were associated with differential methylation levels at the cg07839457 site in the NLRC5 transcription factor locus. Additionally, two proteins (CCL11 and TGF-alpha) were associated with the smoking-associated cg05575921 site in the AHRR locus. Cis (red); trans (blue)
Fig. 5Variance in circulating inflammatory protein levels explained by DNA methylation. a In this panel, the variance explained in circulating protein levels by complete methylation data from sites present on the Infinium 450 K methylation array was examined. A comparison between variance explained (h2) by a mixed model approach (OSCA) and a Bayesian penalised regression approach (BayesR+) is shown. b The proportion of variance explained in Olink® inflammatory protein levels by DNA methylation, as estimated by BayesR+ is shown. Only those proteins (n = 3) which had significant CpG associations in ordinary least squares, mixed model and Bayesian methods are presented. Additionally, the proportion of variance explained attributable to small effects (prior: variance of 0.1% explained), medium effects (prior: variance of 1.0% explained) and large effects (prior: variance of 10% explained) are demonstrated in blue, gold and dark orange, respectively. Error bars represent 95% credible intervals