| Literature DB >> 28129359 |
Antonio F Di Narzo1,2, Shannon E Telesco3, Carrie Brodmerkel3, Carmen Argmann1,2, Lauren A Peters2,4, Katherine Li3, Brian Kidd1,2, Joel Dudley1,2, Judy Cho1,2, Eric E Schadt1,2, Andrew Kasarskis1,2, Radu Dobrin3, Ke Hao1,2.
Abstract
To date, no large scale, systematic description of the blood serum proteome has been performed in inflammatory bowel disease (IBD) patients. By using microarray technology, a more complete description of the blood proteome of IBD patients is feasible. It may help to achieve a better understanding of the disease. We analyzed blood serum profiles of 1128 proteins in IBD patients of European descent (84 Crohn's Disease (CD) subjects and 88 Ulcerative Colitis (UC) subjects) as well as 15 healthy control subjects, and linked protein variability to patient age (all cohorts) and genetic components (genotype data generated from CD patients). We discovered new, previously unreported aging-associated proteomic traits (such as serum Albumin level), confirmed previously reported results from different tissues (i.e., upregulation of APOE with aging), and found loss of regulation of MMP7 in CD patients. In carrying out a genome wide genotype-protein association study (proteomic Quantitative Trait Loci, pQTL) within the CD patients, we identified 41 distinct proteomic traits influenced by cis pQTLs (underlying SNPs are referred to as pSNPs). Significant overlaps between pQTLs and cis eQTLs corresponding to the same gene were observed and in some cases the QTL were related to inflammatory disease susceptibility. Importantly, we discovered that serum protein levels of MST1 (Macrophage Stimulating 1) were regulated by SNP rs3197999 (p = 5.96E-10, FDR<5%), an accepted GWAS locus for IBD. Filling the knowledge gap of molecular mechanisms between GWAS hits and disease susceptibility requires systematically dissecting the impact of the locus at the cell, mRNA expression, and protein levels. The technology and analysis tools that are now available for large-scale molecular studies can elucidate how alterations in the proteome driven by genetic polymorphisms cause or provide protection against disease. Herein, we demonstrated this directly by integrating proteomic and pQTLs with existing GWAS, mRNA expression, and eQTL datasets to provide insights into the biological processes underlying IBD and pinpoint causal genetic variants along with their downstream molecular consequences.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28129359 PMCID: PMC5271178 DOI: 10.1371/journal.pgen.1006565
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Heatmap of top significant age-related proteins.
Included in the heatmap are all microarray probes with pvalue ≤ 1E-4 in at least one cohort. The UC, CD and NC cells are color-coded according to the Wald t-test of the age coefficient of the protein levels in each cohort, with the t test further reported in each cell. The last 2 columns report previously known association of gene mRNA levels with age within different tissues. Shades of green indicate increase of protein or mRNA level with aging; shades of red indicate decrease of protein or mRNA level with aging; the white color indicates lack of evidence in either direction.
Overlap of proteomic aging signature between study samples and previous report.
| Cohort | Size of Aging Signature | Overlap Genes | Overlap odds ratio | Overlap pvalue |
|---|---|---|---|---|
| UC | 221 | CCDC80, CHRDL1, CST3, FSTL3, ROR1, THBS4 | 6.29 | 0.0055 |
| CD | 108 | CCDC80, FSTL3, IGFBP6, THBS4 | 6.48 | 0.0106 |
| NC | 43 | CHRDL1 | 2.84 | 0.3231 |
* Number of proteomic traits with nominal pvalue≤0.05 of positive association with aging.
**Genes overlapping with aging signature reported by Menni et al [4]; all 10 genes in Menni et al [4] were reported as having a positive association with aging.
Fig 2Increase of Albumin levels with aging in CD and UC.
Scatterplot of the Albumin protein level vs patients age, separately for UC patients (left panel) and CD patients (baseline data only is displayed, right panel). Age in years on the horizontal axis; mean-centered and adjusted log2-protein expression on the vertical axis (adjusted for sex and plateID).
Fig 3Variation of APOE levels with aging in CD, UC and NC subjects.
Forest plot of the estimated log2-FC of APOE protein levels (probe SL000276) per 10 years increase in age, with 95% confidence intervals, as obtained from the differential expression analysis performed separately in Crohn’s Disease (CD), Ulcerative Colitis (UC) and Normal Controls (NC) subjects. Estimated log2-FCs and 95% Confidence Intervals are further reported on the right.
Newly discovered protein-QTLs (10% FDR): protein probe and lead significant SNP.
| SL008063 | PPAC | P24666 | ACP1 | rs17713879 | 2 | 0.25 | A/G | increase | 2.46E-10 |
| SL008039 | AK1A1 | P14550 | AKR1A1 | rs6688727 | 1 | 46.00 | A/G | increase | 2.92E-06 |
| SL000003 | Angiogenin | P03950 | ANG | rs72659950 | 14 | 21.12 | A/G | increase | 4.79E-06 |
| SL010375 | ASAH2 | Q9NR71 | ASAH2 | rs137889787 | 10 | 51.99 | A/G | decrease | 1.18E-09 |
| SL008644 | BST1 | Q10588 | BST1 | rs73241195 | 4 | 15.66 | C/G | 6.19E-13 | |
| SL003190 | MIP-5 | Q16663 | CCL15 | rs854685 | 17 | 34.32 | A/G | increase | 3.44E-07 |
| SL003197 | TECK | O15444 | CCL25 | rs184230513 | 19 | 8.12 | A/G | increase | 3.66E-10 |
| SL000516 | MCP-3 | P80098 | CCL7 | rs1133763 | 17 | 32.65 | A/C | increase | 8.76E-10 |
| SL005215 | Siglec-3 | P20138 | CD33 | rs12459419 | 19 | 51.73 | A/G | decrease | 4.02E-11 |
| SL006029 | Chitotriosidase-1 | Q13231 | CHIT1 | rs2244783 | 1 | 203.19 | A/G | decrease | 1.19E-11 |
| SL006694 | CNDP1 | Q96KN2 | CNDP1 | rs4329999 | 18 | 72.23 | A/G | decrease | 4.09E-06 |
| SL014048 | CONA1 | Q86Y22 | COL23A1 | rs2133281 | 5 | 177.89 | A/G | decrease | 8.98E-07 |
| SL004015 | TAFI | Q96IY4 | CPB2 | rs9526134 | 13 | 46.62 | A/C | increase | 4.69E-06 |
| SL011808 | CPNE1 | Q99829 | CPNE1 | rs143646210 | 20 | 34.20 | A/G | decrease | 1.60E-07 |
| SL010456 | CYTN | P01037 | CST1 | rs4260306 | 20 | 23.73 | A/G | decrease | 7.33E-16 |
| SL004673 | Cathepsin S | P25774 | CTSS | rs72702561 | 1 | 150.69 | A/G | increase | 2.37E-06 |
| SL006550 | ECM1 | Q16610 | ECM1 | rs77257036 | 1 | 150.48 | A/G | decrease | 4.01E-08 |
| SL000408 | Epo | P01588 | EPO | rs73170784 | 7 | 100.36 | A/G | increase | 1.22E-06 |
| SL007729 | ARTS1 | Q9NZ08 | ERAP1 | rs13178387 | 5 | 96.14 | A/C | decrease | 4.63E-09 |
| SL012469 | GPC5 | P78333 | GPC5 | rs1332090 | 13 | 92.45 | A/G | increase | 7.00E-08 |
| SL006512 | HGFA | Q04756 | HGFAC | rs2857985 | 4 | 3.48 | A/G | increase | 6.71E-11 |
| SL000437 | Haptoglobin Mixed Type | P00738 | HP | rs12925078 | 16 | 72.07 | A/G | decrease | 2.84E-06 |
| SL002922 | sICAM-1 | P05362 | ICAM1 | rs901886 | 19 | 10.40 | A/G | increase | 1.66E-15 |
| SL010368 | IDUA | P35475 | IDUA | rs11248061 | 4 | 0.98 | A/C | increase | 9.54E-06 |
| SL001943 | IL-6 sRa | P08887 | IL6R | rs12133641 | 1 | 154.43 | A/G | decrease | 3.57E-11 |
| SL003184 | sLeptin R | P48357 | LEPR | rs61781312 | 1 | 66.09 | A/G | increase | 6.78E-30 |
| SL005190 | ILT-2 | Q8NHL6 | LILRB1 | rs10427127 | 19 | 55.14 | A/G | increase | 5.17E-14 |
| SL005191 | ILT-4 | Q8N423 | LILRB2 | rs386056 | 19 | 54.78 | A/G | decrease | 1.39E-06 |
| SL000510 | Lysozyme | P61626 | LYZ | rs1384 | 12 | 69.75 | A/G | decrease | 1.09E-07 |
| SL004516 | MBL | P11226 | MBL2 | rs10824792 | 10 | 54.53 | A/G | increase | 1.57E-09 |
| SL005202 | MSP | P26927 | MST1 | rs9836291 | 3 | 49.70 | A/G | decrease | 1.79E-11 |
| SL008611 | ASAHL | Q02083 | NAAA | rs3796484 | 4 | 76.87 | A/T | 7.86E-06 | |
| SL005212 | Prolactin Receptor | P16471 | PRLR | rs34623800 | 5 | 35.01 | A/G | decrease | 3.55E-06 |
| SL000572 | SAA | P0DJI8 | SAA1 | rs11024616 | 11 | 18.33 | C/G | 1.12E-06 | |
| SL010470 | Semaphorin 3E | O15041 | SEMA3E | rs2191530 | 7 | 83.15 | A/G | increase | 6.65E-07 |
| SL014292 | SIG14 | Q08ET2 | SIGLEC14 | rs73052804 | 19 | 52.16 | A/G | decrease | 2.20E-08 |
| SL005219 | Siglec-9 | Q9Y336 | SIGLEC9 | rs2075803 | 19 | 51.63 | A/G | increase | 4.92E-27 |
| SL016928 | SLAF7 | Q9NQ25 | SLAMF7 | rs2295616 | 1 | 160.71 | A/G | decrease | 2.20E-06 |
| SL003043 | TIMP-3 | P35625 | TIMP3 | rs5754272 | 22 | 33.17 | A/G | increase | 1.77E-09 |
| SL003198 | Tenascin | P24821 | TNC | rs11787994 | 9 | 117.75 | A/C | increase | 5.80E-06 |
| SL004782 | TSG-6 | P98066 | TNFAIP6 | rs73004003 | 2 | 152.17 | A/G | increase | 3.81E-06 |
1) Lead pSNP;
2) Hg19;
3) effective allele/alternative allele;
4) estimated effect of the effective allele on the protein expression level
Overlaps between blood serum protein-QTLs and previously published eQTLs from several tissues (10% FDR).
| Tissue | observed overlap | expected overlap | fold enrichment | Fisher test pvalue |
|---|---|---|---|---|
| Blood[ | 14 | 14.2 | 0.99 | 1.000 |
| Cerebellum | 9 | 7.4 | 1.22 | 0.535 |
| Prefrontal | 8 | 7.9 | 1.01 | 1.000 |
| Visual Cortex | 4 | 4.9 | 0.82 | 0.809 |
| Liver[ | 20 | 8.6 | 2.33 | 0.000 |
| Omental fat [ | 16 | 14.2 | 1.13 | 0.616 |
| Subcutaneous fat [ | 15 | 12.2 | 1.23 | 0.383 |
1) number of shared gene symbols;
2) expected number of shared gene symbols under the assumption of independence between the two studies; a list of 991 gene symbols which were tested for cis-eQTLs in both the present study and any of the eQTL studies was used as a background;
3) eQTLs from the Harvard Brain collection (www.brainbank.mclean.org).
Fig 4Enrichment for GWAS signals in blood protein-QTLs of CD patients.
Expected uniform -log10(relative rank) of the protein-SNPs (nominal pvalue ≤ 1E-5) within the full GWAS SNPs list on the horizontal axis; observed–log10(relative rank) on the vertical axis. CD: Crohn’s Disease; UC: Ulcerative Colitis; BMI: Body Mass Index; SCZ: Schizofrenia; Stroke: Ischemic Stroke; T2D: Type-2 Diabetes. References for all the studies are reported in the Methods section.
Fig 5Locuszoom plot of MST1, CD and UC association pvalues around the MST1 gene.
It is worth noting that our proteomics platform has 4 probes in this chromosomal region, targeting 4 different proteins: IMPDH2 (probe SL010928), MST1 (probe SL005202), MST1R (probe SL004637) and MAPKAPK3 (probe SL004765). Of these, MST1 is most significantly associated with the IBD GWAS SNP in this locus (S3 Fig), and the association pattern was highly consistent with the CD and UC GWAS peaks (Fig 5).