| Literature DB >> 35604899 |
Janne Pott1,2, Tarcyane Garcia1, Stefanie M Hauck3, Agnese Petrera3, Kerstin Wirkner1,2, Markus Loeffler1,2, Holger Kirsten1,2, Annette Peters3,4,5, Markus Scholz1,2.
Abstract
BACKGROUND: Although gene-expression (GE) and protein levels are typically strongly genetically regulated, their correlation is known to be low. Here we investigate this phenomenon by focusing on the genetic background of this correlation in order to understand the similarities and differences in the genetic regulation of these omics layers. METHODS ANDEntities:
Mesh:
Year: 2022 PMID: 35604899 PMCID: PMC9126407 DOI: 10.1371/journal.pone.0268815
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Basic sample description.
| Variable | Overall | Female | Male | P-value |
|---|---|---|---|---|
| (n = 2,014) | (n = 974) | (n = 1,040) | ||
| Age (years) | 62.5 (11.5) | 62.0 (11.3) | 62.9 (11.6) |
|
| BMI (kg/m2) | 27.7 (4.5) | 27.2 (4.8) | 28.1 (4.1) |
|
| Current smoker | 320 (16.7%) | 147 (16.2%) | 173 (17.3%) | 5.49E-01 |
| Hypertension | 1166 (58.7%) | 496 (51.9%) | 670 (65.1%) |
|
| Type 2 diabetes | 479 (23.8%) | 214 (22.0%) | 265 (25.5%) | 7.25E-02 |
| Statin therapy | 319 (15.9%) | 124 (12.8%) | 195 (18.8%) |
|
| TC (mmol/l) | 5.70 (1.06) | 5.87 (1.06) | 5.53 (1.04) |
|
| LDL-C (mmol/l) | 3.58 (0.95) | 3.59 (0.96) | 3.57 (0.95) | 5.88E-01 |
| HDL-C (mmol/l) | 1.62 (0.46) | 1.81 (0.46) | 1.44 (0.39) |
|
For continuous parameters, the unit is given in parenthesis, and arithmetic mean and standard deviation values are shown. For binary variables, total number and percentages are provided. Differences between sexes were tested with a chi-squared test for all binary parameters, and with Mann-Whitney U test for all continuous parameters. Abbreviations: BMI, body mass index; TC, total cholesterol; LDL-C, low-density lipoprotein; HDL-C, high-density lipoprotein.
a anamnestic, medication or determined by HbA1c>6.5%;
b ATC-code beginning with C10.
Fig 1Circular plot of cis-associations.
Log-transformed p-values for cis-pQTLs and eQTLs are shown in the green respectively blue circle. We obtained the statistics for eQTLs from GTEx and present the results of the tissue with the strongest eQTL per gene (see S3 Table in S2 File). For plotting, the y-axis was restricted to -log (p) = 20, i.e. all larger -log (p) values were set to 20. The red circles mark the classical genome-wide significance threshold (p = 5x10-8). Gene names are added for loci not yet described for blood protein levels, and are colored with respect to the novelty level (blue: not described for any other traits, black: reported for other traits except for blood protein levels).
Comparison of effect direction of cis-pQTLs from our GWAS and cis-eQTLs from GTEx.
| Locus Information | pQTL | eQTL GTEx whole blood and (sec.) best tissue | ||||
|---|---|---|---|---|---|---|
| Protein (ratio) | pQTL effect allele / EAF | beta | p-value | beta | p-value | Tissue |
| IL6R | rs4129267 | 0.421 | 2.96x10-323 | -0.087 | 5.40x10-09 | Artery Tib. |
| (17/18) | T / 0.380 | -0.194 | 8.41x10-17 | |||
| CCL15 | rs41436444 | 0.939 | 4.56x10-292 | - | - | Lung |
| (20/21) | CAGGGCAG / 0.080 | -0.635 | 1.65x10-17 | |||
| CCL16 | rs10445391 | -0.852 | 8.01x10-155 | - | - | Thyroid |
| (3/4) | G / 0.072 | 0.220 | 2.27x10-04 | |||
|
| rs721917 | -0.475 | 1.37x10-95 | 0.113 | 3.39x10-03 | Artery Tib. |
| (42/42) | G / 0.406 | 0.661 | 1.90x10-43 | |||
|
| rs7214248 | 0.193 | 4.29x10-61 | -0.072 | 3.80x10-04 | Artery Tib. |
| (24/26) | A / 0.346 | -0.221 | 1.02x10-17 | |||
|
| rs897811 | -0.163 | 1.70x10-20 | 0.501 | 5.78x10-19 | Thyroid |
| (35/37) | C / 0.116 | 0.376 | 9.71x10-10 | |||
| TIMP4 | rs392394 | 0.140 | 4.54x10-17 | -0.024 | 4.36x10-01 | Artery Tib. |
| (16/17) | A / 0.782 | -0.239 | 7.95x10-10 | |||
| TNFRSF11B | rs11300005 | 0.073 | 2.70x10-11 | - | - | Eso. Mus. |
| (2/2) | C / 0.493 | -0.133 | 1.32x10-02 | |||
| AXL | rs3786556 | -0.446 | 9.39x10-16 | -0.002 | 9.68x10-01 | Artery Tib. |
| (28/28) | T / 0.184 | 0.227 | 2.13x10-13 | |||
| CPA1 | rs35454128 | 0.197 | 3.79x10-08 | 0.117 | 8.18x10-02 | Adipose Sub. |
| (7/9) | C / 0.120 | -0.237 | 1.95x10-03 | |||
| CASP3 | rs6845294 | 0.153 | 1.09x10-07 | 0.026 | 1.54x10-01 | Cells fibro. |
| (14/15) | A / 0.687 | -0.248 | 5.35x10-17 | |||
| CDH5 | rs16956504 | 0.105 | 1.10x10-07 | 0.007 | 8.93x10-01 | Pituitary |
| (3/3) | C / 0.110 | -0.161 | 2.14x10-02 | |||
|
| rs145042193 | -0.064 | 8.22x10-07 | 0.104 | 2.47x10-03 | Cells fibro. |
| (25/25) | T / 0.208 | 0.362 | 4.92x10-17 | |||
We show results of the 13 genes for which discordant effect directions between pQTL and most of the respective eQTLs (more than 75% of all significant eQTLs across tissues) were observed. We also report eQTLs of whole blood and the best-associated tissue in GTEx. The effect allele and its frequency is given below of the respective SNP ID. For four genes we could replicate the different effect direction in our LIFE data (marked in bold, see S2 Table in S2 File for more details).
Fig 2Scatter plot of effect estimates of the 63 lead pQTL-SNPs on gene expression and protein levels.
As the focus was on the direction only, we did not normalize the effect estimates. Only SNPs with a significant eQTL in at least one tissue are displayed (p< = 0.05 in GTEx). Results of the 13 genes showing discordant pQTL and eQTL directions in more than 75% of eQTL tissues are labeled (see also Table 2 and S5 Table in S2 File).
Proteins showing predominantly negative correlation between tissue-specific GE and blood PE.
| Protein | #tissues (neg/tot) | Effect | P-value | Tissue |
|---|---|---|---|---|
| IL6R | 25/29 | -2.177 | 7.11x10-303 | Artery Tib. |
| -2.965 | 8.21x10-197 | WB | ||
| SFTPD | 33/40 | -6.003 | 7.69x10-77 | Colon Trans. |
| BLMH | 12/15 | -0.985 | 1.48x10-58 | Artery Tib. |
| -1.052 | 8.11x10-15 | WB | ||
| TIMP4 | 29/36 | -0.371 | 8.00x10-17 | Brain Ant. Cingulate Cortex |
| CASP3 | 25/29 | 0.865 | 1.51x10-7 | Brain Putamen |
| -109.229 | 2.78x10-7 | WB | ||
| CXCL16 | 41/41 | -0.242 | 6.70x10-7 | Kidney Cortex |
| -0.067 | 1.54x10-2 | WB | ||
| TREML2 | 13/14 | -40.610 | 5.85x10-6 | Thyroid |
| -0.876 | 3.58x10-4 | WB |
MetaXcan results are shown for the best-associated tissue with negative effect estimate and whole blood (WB), if a prediction model was available. Neg = Number of significant negative correlations across tissues. Tot = Total number of significant associations.
Fig 3Overlap of genes with negative correlation to protein levels according to different analysis strategies.
We present the genes with negative protein correlations for more than 75% of the significant tissues according to MetaXcan (MX), negative causal effect estimates according to Mendelian Randomization (MR), and opposite effect direction of eQTLs and pQTLs (see also Tables 2–4).
Proteins with predominantly negative causal links of GE and PE.
| Protein | #tissues (neg/tot) | Causal effect estimate | P-value MR | Association effect estimate | P-value MetaXcan | Tissue |
|---|---|---|---|---|---|---|
| IL6R | 5/5 | -0.713 | 5.57x10-27 | 0.126 | 4.63x10-06 | Testis |
| -3.008 | 1.35x10-16 | -2.965 | 8.21x10-197 | WB | ||
| SFTPD | 22/25 | -0.347 | 2.64x10-17 | -0.050 | 5.68x10-04 | Heart AA |
| WB | ||||||
| -0.749 | 6.43x10-06 | |||||
| BLMH | 2/2 | -0.873 | 3.07x10-15 | -0.985 | 1.48x10-58 | Artery Tib. |
| CCL15 | 15/16 | -0.459 | 1.02x10-14 | Nerve Tib. | ||
| PI3 | 1/1 | -0.731 | 1.02x10-08 | WB | ||
| AXL | 6/6 | -0.243 | 6.03x10-06 | Artery Tib. | ||
| CCL16 | 15/15 | -0.181 | 1.28x10-05 | Liver | ||
| CASP3 | 5/6 | -0.440 | 1.89x10-05 | -0.173 | 1.57x10-06 | Cells fibro. |
| 0.914 | 1.83x10-05 | 0.865 | 1.51x10-07 | WB | ||
| TFRC | 1/1 | -0.404 | 7.94x10-05 | Lung | ||
| CXCL16 | 17/17 | -0.225 | 1.69x10-04 | -0.213 | 1.68x10-05 | Thyroid |
| -0.097 | 2.97x10-03 | -0.067 | 1.54x10-02 | WB | ||
| CDH5 | 1/1 | -0.148 | 1.92x10-02 | Thyroid | ||
| CD163 | 1/1 | -0.129 | 3.66x10-02 | Testis |
MR results are shown for the best tissue with negative effect estimate and whole blood, if available. We added respective MetaXcan results for comparison. Neg = Number of significant negative causal estimate across tissue. Tot = Total number of significant MR tests.
Fig 4Graphical overview of causal networks form gene expression (GE) over protein expression (PE) on the outcome coronary artery disease (CAD).
Orange arrows indicate a negative causal effect of protein level on CAD (IL6R and TFPI) or of gene expression on protein levels, blue arrows denote positive links. The settings are indicated by c (combined), m (males) and f (females). Tissues in which gene expression showed a significant indirect effect on CAD are listed next to their gene. Bold tissues indicate also a significant direct effect.