Literature DB >> 30445421

High-Throughput Identification of the Plasma Proteomic Signature of Inflammatory Bowel Disease.

Antonio F Di Narzo1,2, Carrie Brodmerkel3, Shannon E Telesco3, Carmen Argmann1,2, Lauren A Peters1,2,4, Katherine Li3, Brian Kidd1,2, Joel Dudley1,2, Judy Cho1,2, Eric E Schadt1,2,4, Andrew Kasarskis1,2, Radu Dobrin3, Ke Hao1,2,5.   

Abstract

BACKGROUND: The molecular aetiology of inflammatory bowel disease [IBD] and its two subtypes, ulcerative colitis [UC] and Crohn's disease [CD], have been carefully investigated at genome and transcriptome levels. Recent advances in high-throughput proteome quantification has enabled comprehensive large-scale plasma proteomics studies of IBD.
METHODS: The study used two cohorts: [1] The CERTIFI-cohort: 42 samples from the CERTIFI trial of anti-TNFα-refractory CD patients; [2] the PROgECT-UNITI-HCs cohort: 46 UC samples of the PROgECT study, 84 CD samples of the UNITI I and UNITI II studies, and 72 healthy controls recruited in Mount Sinai Hospital, New York, USA. The plasma proteome for these two cohorts was quantified using high-throughput platforms.
RESULTS: For the PROgECT-UNITI-HCs cohort, we measured a total of 1310 proteins. Of these, 493 proteins showed different plasma levels in IBD patients to the plasma levels in controls at 10% false discovery rate [FDR], among which 11 proteins had a fold change greater than 2. The proteins upregulated in IBD were associated with immunity functionality, whereas the proteins downregulated in IBD were associated with nutrition and metabolism. The proteomic profiles were very similar between UC and CD. In the CERTIFI cohort, 1014 proteins were measured, and it was found that the plasma protein level had little correlation with the blood or intestine transcriptomes.
CONCLUSIONS: We report the largest proteomics study to date on IBD and controls. A large proportion of plasma proteins are altered in IBD, which provides insights into the disease aetiology and indicates a potential for biomarker discovery. © European Crohn’s and Colitis Organisation (ECCO) 2018.

Entities:  

Keywords:  Proteomics; differential expression analysis; inflammatory bowel disease; proteomic quantitative trait loci

Mesh:

Substances:

Year:  2019        PMID: 30445421      PMCID: PMC6441306          DOI: 10.1093/ecco-jcc/jjy190

Source DB:  PubMed          Journal:  J Crohns Colitis        ISSN: 1873-9946            Impact factor:   9.071


1. Introduction

It is estimated that 568 per 100000 people in the USA have been diagnosed with inflammatory bowel disease [IBD], among which 249 have ulcerative colitis [UC] and 319 have Crohn’s disease [CD].[1] The investigation of molecular phenotypes [e.g. transcriptome and proteome] and how they relate to disease susceptibility and progression can help close the gap in understanding between variation in the human genome that is associated with disease and the biological processes that lead to disease. The integration of genetic studies [e.g. genome-wide association studies, GWAS] and transcriptomic studies has proven particularly fruitful. Transcriptome profiling [e.g. microarray or RNA sequencing] has identified changes in the mRNA of many genes in the blood and intestine that have been associated with IBD[2,3]; however, the transcriptomic differences between colonic or blood UC and CD have been reported as minimal.[3] Genome-wide association studies and genome sequencing have been used to identify genetic loci, including eQTLs and pQTLs [expressional and proteomics quantitative trait loci, respectively], that have proven to be powerful intermediate phenotypes in revealing the molecular aetiology underlying IBD genetic association loci.[4-8] Circulating protein levels are critical measures of disease presence and activity. Only recently have researchers begun employing high-throughput screening technologies to measure circulating plasma protein levels in large human populations.[9-13] Although extensive study of the transcriptional signatures of IBD in relevant tissues has been conducted, research on proteomic alterations in CD and UC is still in its infancy,[14-18] and to date, there has been no comprehensive plasma proteomics study of IBD using a large sample size [e.g. N > 200]. Herein, we have attempted to close this gap. In this study, we employed high-throughput proteomics technology [SOMAscan panel, Materials and Methods] to assess variation in the levels of plasma proteins of three groups of individuals of well-characterized UC and CD, and non-IBD controls,[19,20] and the controls were matched to IBD patients in terms of age, sex, and ethnicity. We examined the correlation between plasma protein levels and mRNA measured in blood and intestine. Further, we systematically explored the distinct pair-wise plasma proteomic profiles for UC vs control, CD vs control and UC vs CD.

2. Materials and methods

Figure 1 summarizes the work flow of data collection and downstream analysis. Details are reported in the following sections.
Figure 1.

Study design and workflow. [A] Data generation from the CERTIFI cohort; [B] data generation from the PROgECT-UNITI-HCs cohort; [C] differential protein analysis workflow.

Study design and workflow. [A] Data generation from the CERTIFI cohort; [B] data generation from the PROgECT-UNITI-HCs cohort; [C] differential protein analysis workflow.

2.1. Study cohorts

All subjects used in this study were of Caucasian ancestry. The CERTIFI cohort [Figure 1] assessed were CD subjects from the CERTIFI study [ClinicalTrials.gov number, NCT00771667]. The range of disease duration was 6 months – 41 years, and the median was 11.1 years; the range of CD activity index was 223–449, with median = 331.5 [Table S1]. The transcriptome data of this cohort is publicly available at GEO [Accession number GSE100833].[21] In the PROgECT-UNITI-HCs cohort [Figure 1], samples were selected from three different studies. [1] A total of 46 UC samples were collected as part of the PROgECT study [ClinitalTrials.gov number, NCT01988961].[22] The UC subjects had a Mayo score of between 7 and 11 [median = 9], had a disease duration of between 0 and 11 years [median = 2.4 months], and were receiving corticosteroids [n = 16] and immunomodulators [n = 7], Azathioprine and 6-MP [n = 7], and 5-ASA [n = 38]. [2] A total of 84 CD samples were collected as part of UNITI I and UNITI II [ClinicalTrials.gov numbers NCT01369329 and NCT01369342, respectively].[20] The CD subjects had a CD activity index of between 208 and 460 [median = 286], had involvement of the colon only [n = 11], colon and ileum [n = 50], and ileum only [n = 23], had a disease duration of between 5 months and 40 years [median = 7.3 years], and were under standard of care. [3] The 72 HC samples were recruited at Mount Sinai Medical Center in New York City, NY, USA. The CD, UC, and HCs in the ‘matched IBD and HC cohort’ were matched in terms of age and sex. Most importantly, plasma collections were conducted using identical protocols for the UC, CD, and control subjects, and proteome profiling was performed at the same time and under the same laboratory conditions. Importantly, the CERTIFI cohort and the PROgECT-UNITI-HCs cohort were profiled using different versions of SOMAscan panels.

2.2. Proteomics assays

Plasma protein levels were measured using a SOMAmer-based capture array, ‘SOMAscan’[10,23] [web site: http://www.somalogic.com/Products-Services/SOMAscan]. On the CERTIFI cohort, 1014 proteins were measured, and on the PROgECT-UNITI-HCs cohort, a total of 1310 proteins were measured. The SOMAscan technology uses chemically modified nucleotides to convert a protein signal to a nucleotide signal that is measured as relative fluorescence units using a custom DNA microarray. Quality Control was based on assessment of batch effect, unsupervised clustering of the expression profiles, and Principal Components Analysis [PCA]. No sample or probe failed the Quality Control. Further, we fully randomized the sample across the Somalogics assay plates to eliminate possible confounding by batch effects. In data analyses, we adjusted for plate as a covariate.

2.3. Plasma CRP level

In the clinical lab [independent of the Somalogic technology], we measured plasma CRP levels in the majority of the PROgECT-UNITI-HCs cohort: 84 CD subjects, 42 UC subjects and 71 normal HCs. The clinical lab measurements were compared with the Somalogic values, where Pearson’s correlation was 0.817 [Figure S5].

2.4. Differential protein analysis

Differential proteins [DPs] refer to the proteins that showed different plasma levels in two groups of subjects. The analysis was performed on the PROgECT-UNITI-HCs cohort with linear regression models, using the log-2 transformed protein level as the outcome variable [y] and disease status plus other covariates as regressors. Specifically, we analyzed one protein at a time. The following regression model was fitted: y ~ Age + Sex + PlateID + disease_CD + disease_UC. Significance of the ‘CD+UC’ contrast (average of the CD vs controls and UC vs controls effects) and the ‘CD-UC’ contrast (difference between the CD vs controls and the UC vs controls effects) was assessed with a t test, using the ‘glht’ function from the ‘multcomp’ R package. Since 1310 proteins were examined, we accounted for multiple testing using the false discovery rate [FDR]. The FDR was estimated by repeating the analysis for all probes 1000 times, each time randomly permuting the subject labels. The FDR for an observed nominal p value ‘p0’ was then computed as:

2.5. Gene set enrichment analysis

Differential proteins [e.g identified comparing UC vs control] entered gene sets enrichment analysis to explore their common functions. Three tools for gene sets enrichment analysis were used. [1] The Molecular Signatures Database [MSigDB]. Differential proteins were compared with MSigDB[24] using the GSEA software v2.2.0 and the MSigDB v5.1 [http://software.broadinstitute.org/gsea]. Because multiple gene sets were evaluated, the FDR was quantified separately for each contrast by running 1000 permutations. [2] MetaCore Suite [portal.genego.com]. The 1310 proteins measured by SOMAscan were used as the background for the enrichment analysis. The FDR was also calculated by the MetaCore software. [3] Enrichment for disease-related genes were conducted usingthe NHGRI-EBI GWAS catalog.[25] Specifically, for each disease we tested for the enrichment of proteins upregulated in IBD, and proteins downregulated in IBD separately.

2.6. Genomic co-localization of protein level and IBD risk

Co-localization analysis is conducted to determine whether the plasma protein level and IBD risk are controlled by the same genetic variant[s], using COLOC version 2.3–6 in R.[26] Our analysis focused on the IBD proteomic signature proteins that were also influenced by pQTL in the CD-only cohort.[13] The aim of co-localization analysis is to find molecules that sit on the causal pathway of IBD, and the strategy is to identify the protein[s] that is controlled by the same genetic variant[s] that leads[lead] to IBD. Co-localization analysis uses two types of information [1] IBD GWAS, which use a large sample size [n ~ 60000] to convincingly link genetic variants to IBD incidence[27]; [2] pQTLs, which quantify the association between genetic variants and proteomic traits. Herein these two types of information come from [1] UKBB IBD GWAS,[27] for which the results are publicly available; and [2] pQTLs quantified for the CERTIFI cohort.[13] The analysis was conducted using the COLOC package version 2.3–6 in R,[26] in which default priors of the software were used. COLOC analyzed one protein’s level and IBD risk at a time. In each analysis, we denoted Trait 1 as the level of the given protein; and Trait 2 as the IBD risk. In total, five hypotheses were evaluated. H0: No association with either Trait 1 or Trait 2; H1: Association with Trait 1, not with Trait 2; H2: Association with Trait 2, not with Trait 1; H3: Association with Trait 1 and Trait 2, with multiple single-nucleotide polymorphisms [SNPs]; H4: Association with Trait 1 and Trait 2, one shared SNP. When we observed a large posterior probability of H4 [i.e.. PP.H4 > 80%], we concluded a single genetic variant controls both IBD risk and the protein’s level in the plasma. When we observed high posterior probability of Hypothesis 3 and 4 [i.e. PP.H3+PP.H4 > 80%], the data multiple genetic variants may control both IBD risk and the protein’s level in the plasma.

2.7. FACS immunophenotyping profiles and FACS-protein association testing

FACS data was collected from the HCs of the PROgECT-UNITI-HCs cohort [Supplementary Information]. Association between 12 blood immunophenotyping traits levels and 1310 plasma protein levels were tested using a linear regression, including the inverse-normal transformed protein level as the outcome, and the immunophenotyping trait as a covariate. Additional model covariates were age, sex, proteomics plate, and FACS batch.

3. Results

3.1. The plasma proteome was only weakly correlated with the whole blood or intestine transcriptome

We first assessed whether plasma proteomics provided non-redundant biological insights compared with blood and intestinal transcriptomes. Whole blood transcriptomes, intestine tissue transcriptomes, and plasma proteomes were profiled for the CERTIFI cohort [42 CD subjects of the anti-TNF–refractory CERTIFI trial, Table S1; GEO accession number: GSE100833]. In total, 1014 plasma proteins were assessed by SOMAscan panel. The whole transcriptome [52810 probesets] was assessed in blood and intestinal tissue using an Affymetrix HU133 microarray. The plasma proteomics profile was only weakly correlated with the whole blood [median ρ = 0.03] or the intestinal transcriptome [median ρ = 0.01] [Figure S1A and B]. Although the median correlation [ρ] was still positive [Figure S1A, Figure S1B, and Table S2], such a weak correlation indicated that the plasma proteomics extracted information orthogonal to the transcriptomes of disease-relevant tissues in CD patients, and indicated further protein signature studies were necessary. We also conducted Co-inertia analysis [CIA],[28] which only indicated a detectable correlation between plasma protein and whole blood mRNA [p = 0.054], but not between plasma protein and intestinal mRNA [Table S3]. The limited sample size for the intestinal data made it difficult to interpret the CIA test statistics. We conducted permutation [by randomizing the sample IDs in the SOMAscan dataset] to derive the empirical p value for the CIA test. The empirical p value was 0.054 for the correlation between the plasma proteome and the whole blood mRNA. For the intestine, CIA empirical p values were not significant [e.g. p value = 0.262 for the sigmoid colon; p value = 0.817 for the transverse colon]. Since the plasma proteome is secreted from multiple organs and cell types, and the blood and intestinal transcriptome are derived from intracellular sources, perhaps it is not surprising that the plasma protein – mRNA correlation is weak. Moreover, the correlation between the blood and intestinal mRNAs was also very weak [median ρ = 0.02, Figure S1C].

3.2. Comparison of plasma proteomic profiles among UC, CD, and control subjects

We tested 1310 probes for differential protein expression in the PROgECT-UNITI-HCs cohort: UC patients of the PROgECT trial [n = 46], CD patients of UNITI I and UNITI trials [n = 84], and HCs [HCs, n = 72]. The HC samples were matched to UC and CD patients in terms of age and sex [Table S1]. At 10% FDR, N = 353 proteins were differentially expressed between UC and HCs, among which N = 12 proteins had a fold change of ≥2 [Table S4]. These 353 proteins were termed ‘UC differential proteins’ [UC-DPs]. Comparing CD vs controls, N = 518 proteins were differentially expressed at 10% FDR, among which N = 9 proteins had a fold change of ≥2 [Table S4]. These 518 proteins were termed ‘CD-DPs’. Importantly, the UC- and CD-DPs were very similar, with 283 proteins in common [Figure 2A, overlap odds ratio = 12.4, Fisher test p value = 1.5E-75]. Furthermore, the UC- and CD-DPs had very similar fold changes ([Figure 2B] and significance level [Figure 2C], indicating the two IBD subtypes [i.e. UC and CD] were similar in term of alterations in plasma proteomics from healthy states). Next, we compared CD [UNITI I and II] with UC [PROgECT] patients. At 10% FDR, 16 proteins were upregulated and 15 downregulated, respectively [Figure 3 and Table S4]. Further, we pooled UNITI I and II and PROgECT as a single group [henceforth, simply ‘IBD’] and compared it with HCs [Table 1]. At 10% FDR, we found 219 probes upregulated and 274 downregulated, respectively, comparing IBD vs controls [Table 1]. Eleven proteins had a fold change of ≥2, including CRP, 6-Phosphogluconate dehydrogenase, Ferritin, Glucagon, Haptoglobin, PCI, SAA, Haemoglobin, LEAP-1, Cyclophilin F, and H2A3, for which the FDRs were all below 0.1%. These combined 493 up- and downregulated proteins [37.6% of all proteins measured by SOMAscan] were termed ‘IBD-DPs’.
Figure 2.

Comparison of CD and UC proteomic signatures. [A] Venn Diagram of the overlap of the 10% false discovery rate [FDR] signature probes lists. [B] Scatterplot of the estimated log2-fold change [FC] of CD vs HCs [horizontal axis] and UC vs HCs [vertical axis]; dotted lines mark the ±2 effect sizes thresholds. Each circle represents a somalogic probe. All tested probes are reported. [C] Scatterplot of the estimated t- test of the CD vs HCs [horizontal axis] and UC vs HCs [vertical axis]. Dotted lines mark the ±2 significance thresholds. Each circle represents somalogic probe. All tested probes are reported.

Figure 3.

Volcano plots of differential protein expression analysis. [A] CD vs UC signature. [B] IBD vs HCs signature; dotted blue line marks the empirical 10% false discovery rate [FDR] significance threshold, obtained by permuting the input dataset and repeating the analysis 1000 times. The top five significant proteins are labelled.

Table 1.

Plasma proteomic signatures of IBD and subtypes.

SignatureSample size# significant probes [10% FDR]Effect size: |log2[FC]|
Up Down Median 95th percentile
IBD vs HCs130 vs 722192740.0920.487
CD vs UC84 vs 4616150.0540.223

IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease; FDR: false discovery rate; FC: fold change.

Plasma proteomic signatures of IBD and subtypes. IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease; FDR: false discovery rate; FC: fold change. Comparison of CD and UC proteomic signatures. [A] Venn Diagram of the overlap of the 10% false discovery rate [FDR] signature probes lists. [B] Scatterplot of the estimated log2-fold change [FC] of CD vs HCs [horizontal axis] and UC vs HCs [vertical axis]; dotted lines mark the ±2 effect sizes thresholds. Each circle represents a somalogic probe. All tested probes are reported. [C] Scatterplot of the estimated t- test of the CD vs HCs [horizontal axis] and UC vs HCs [vertical axis]. Dotted lines mark the ±2 significance thresholds. Each circle represents somalogic probe. All tested probes are reported. Volcano plots of differential protein expression analysis. [A] CD vs UC signature. [B] IBD vs HCs signature; dotted blue line marks the empirical 10% false discovery rate [FDR] significance threshold, obtained by permuting the input dataset and repeating the analysis 1000 times. The top five significant proteins are labelled.

3.3. Functional annotation of differential proteins

We characterized the annotation of the IBD-DPs by testing for enrichment of gene sets from three databases: MSigDB,[24] MetaCore [portal.genego.com], and the NHGRI-EBI catalog.[25] The full results are reported in Table S5 [MSigDB], Table S6 [MetaCore], and Table S7 [NHGRI-EBI GWAS catalog]. At 5% FDR, the ‘chemokine and cytokine activities’ gene set was enriched in the proteins upregulated in IBD vs control [Table S5], as were ‘biomarkers of CD’, ‘other immune disorders’, and ‘myocardial infarction and ischemia’ [Table S6]. The upregulated signature also enriched for IBD, CD, UC, and Coeliac disease GWAS loci and for IL-6-induced signalling and Th-17–derived cytokines [Table S6], potentially highlighting the anti-TNF–related pathways in disease. The upregulated signature was under enriched for GWAS loci of height, BMI, and obesity [Figure 4 and Table S7]. In contrast, the plasma proteins downregulated in IBD subjects were enriched for hyperlipidemia biomarkers [Table S5], as well as Obesity-, Height- and BMI–associated genetic loci. Proteins downregulated in IBD patients were further enriched for ‘c-Kit ligand signalling pathway during haemopoiesis’and ‘complement pathway disruption in thrombotic microangiopathy’.
Figure 4.

Enrichment for genetic risk loci. Log-2 odds ratios of overlap between IBD signature (10% false discovery rate [FDR]) proteins and genes/diseases from the NHGRI-EBI catalog of genome-wide association studies. The IBD signature was further split into a list of IBD-increased and IBD-decreased proteins, respectively.

Enrichment for genetic risk loci. Log-2 odds ratios of overlap between IBD signature (10% false discovery rate [FDR]) proteins and genes/diseases from the NHGRI-EBI catalog of genome-wide association studies. The IBD signature was further split into a list of IBD-increased and IBD-decreased proteins, respectively.

3.4. Weighted gene correlation network analysis

Weighted gene correlationi network analysis [WGCNA] was used as a systems biology method to describe the correlation patterns among the proteins measured in the PROgECT-UNITI-HCs cohort.[29,30] Networks were built in an unsupervised manner for PROgECT, UNITI, HCs and combined cohorts, resulting in three disease group–specific co-expression networks and one combined co-expression network [Table S8]. For example, in the combined network, we found seven modules of highly correlated proteins [Table S8, the modules are named by colors], with 485 proteins not fitting into a module therefore being placed into the ‘Grey’ module. The functional annotations of each module, using the Kyoto Encyclopedia of Genes and Genomes [KEGG] database,[31] are summarized in Table S9. The first principle components [i.e. eigen-peptide][30] of the combined co-expression network, Pooled_blue, Pooled_brown and Pooled_green, were significantly associated with IBD GWAS, with a Wilcoxon test p value of 7.86E-09, 1.03E-03, and 1.03E-5, respectively [Table S10]. We explored the function annotations of these three modules using the Kyoto Encyclopedia of Genes and Genomes [KEGG] database.[31] All three modules were closely related to the immune functions [Table S9], and they were significantly enriched for differential proteins between CD vs CL and UC vs CL [Table S11], indicating that these modules were disease associated. Further, the differential expression–tagged modules were also identified for PROgECT, UNITI, and HCs co-expression networks as they were also significantly enriched for CD vs CL and UC vs CL differential proteins [Table S11].

3.5. Genetic co-localization of plasma protein level and IBD risk

Fifteen proteins in the IBD-DPs were previously reported that were influenced by nearby genetic variants [i.e. proteomic quantitative trait loci, pQTL].[13] The genetic variants in association with the protein levels were termed pSNPs. Co-localization analysis examined whether the IBD, UC, and CD risk was controlled by the same genetic variants that influence protein levels. We evaluated the causality of how genetic variants modulate disease variance in IBD through protein level alteration. Integrating the pQTLs that we previously reported on the CERTIFI trial of anti-TNFα–refractory CD patients[13] and the most recent GWAS on IBD,[27] we found two proteins [LYZ and MSP] co-localized with IBD risk [Table S11]. Macrophage-stimulating protein [MSP, aka MST1] showed a large posterior probability of H4 [i.e. >0.99, Table S12], indicating that the same genetic variant controls both IBD risk and plasma MSP levels. Genetic co-localization analysis on LYZ [lysozyme] resulted in a large posterior of H3+H4 probability [i.e. >0.8, Table S12], suggesting that multiple variants at the LYZ locus control IBD susceptibility and the plasma LYZ levels.

3.6. Contrast of CD vs UC signature

Comparing CD [UNITI I & II] vs UC [PROgECT] subjects, 31 proteins showed significantly different plasma levels [FDR≤10%, Table 2]. We manually annotated the signature proteins to broad biological categories. Of the 31 proteins, 17 were related to immune response. Further, by querying the STRING database,[32] we found 10 out of the 31 proteins formed a tight network of interacting proteins [Figure S2]: CXCL1, CXCL2/3, GCC, GZMB, IFNA7, IL23R, IL8, PYY, RPS3, and TYK2, known for their roles in immune response. Although several of the proteins [TYK2, CXCL1, EGF, REG4, GZMB, and IL8] had different plasma levels in UC vs CD subjects, they were also in CD- and/or UC-DPs [Table S4].
Table 2.

Differentially expressed proteins: UC vs CD [FDR≤10%]*.

Probe IDProteinUniProtGene symbolLog2-FCSE t-value p value
SL009341BASIP35613 BSG 0.090.0283.124.10E-03
SL008382CYTDP28325 CST5 0.450.1323.441.43E-03
SL016566DRAK2O94768 STK17B 0.080.0273.084.65E-03
SL000084EGFP01133 EGF –0.320.104–3.074.87E-03
SL000420FerritinP02794 P02792 FTH1 FTL 0.750.2073.607.93E-04
SL004337FGF-19O95750 FGF19 –0.420.123–3.381.74E-03
SL002086Ficolin-3O75636 FCN3 0.500.0965.219.70E-07
SL004271GFAPP14136 GFAP –0.330.103–3.213.13E-03
SL000433GlucagonP01275 GCG 0.520.1463.569.17E-04
SL004068Granzyme BP10144 GZMB –0.510.131–3.922.48E-04
SL003173Gro-aP09341 CXCL1 –0.280.065–4.373.96E-05
SL017610Gro-b/gP19876 P19875 CXCL2/3 –0.360.087–4.198.54E-05
SL019978IFNA7P01567 IFNA7 –0.230.073–3.104.45E-03
SL005177IL-1F6Q9UHA7 IL36A 0.430.1313.302.26E-03
SL005185IL-23 RQ5VWK5 IL23R 0.280.0893.163.60E-03
SL000039IL-8P10145 CXCL8 –0.620.148–4.217.93E-05
SL004536LEAP-1P81172 HAMP 0.750.2383.143.94E-03
SL013872LRP1BQ9NZR2 LRP1B –0.210.066–3.143.87E-03
SL000645MMP-10P09238 MMP10 –0.360.105–3.391.66E-03
SL000522MMP-12P39900 MMP12 –0.550.139–3.991.90E-04
SL007237MP2K4P45985 MAP2K4 0.330.1103.055.23E-03
SL008933PARK7Q99497 PARK7 –0.240.076–3.124.19E-03
SL016555PDE11Q9HCR9 PDE11A 0.090.0283.213.08E-03
SL011404PDE4DQ08499 PDE4D 0.100.0323.153.78E-03
SL011405PDE5AO76074 PDE5A 0.110.0353.055.20E-03
SL011509PYYP10082 PYY 0.500.1044.835.45E-06
SL012561REG4Q9BYZ8 REG4 –0.350.096–3.656.65E-04
SL008059RS3P23396 RPS3 –0.230.066–3.441.44E-03
SL010384Testican-1Q08629 SPOCK1 –0.220.063–3.559.78E-04
SL007181TYK2P29597 TYK2 0.170.0424.011.73E-04
SL018946UB2G2P60604 UBE2G2 0.150.0453.421.50E-03

UC: ulcerative colitis; CD: Crohns’ disease; SE: standard error; FDR: false discovery rate; FC: fold change. *The sign of the t-value represents the direction, and the positive value indicates the protein level is higher in CD than inUC; the negative value indicates vice versa.

Differentially expressed proteins: UC vs CD [FDR≤10%]*. UC: ulcerative colitis; CD: Crohns’ disease; SE: standard error; FDR: false discovery rate; FC: fold change. *The sign of the t-value represents the direction, and the positive value indicates the protein level is higher in CD than inUC; the negative value indicates vice versa. Other downregulated proteins in CD vs UC include RS3 [40S ribosomal protein S3], involved in apoptosis in response to oxidative stress[33,34]; PARK7, involved in NADPH function,[35] GFAP[36,37] and MMP12. MMP12 was linked to dendritic cells as part of a tissue remodelling network that contributes to granuloma formation in a gene network that programs phagocytosis in leprosy.[38-40] Proteins upregulated in CD vs UC, included FCN3,[41-43] involved in the complement pathway; PYY,[44] involved in serotonin receptor signalling, IL36A[45-47]; a potent regulator of dendritic cells and T cells and highly correlated with IL1 and IL17; MAP2K4,[48] involved in IL1 signalling to P38; UBE262,[49,50] which has a role in ER stress recovery; PDE5A with a putative role in anti-oxidant status, and BASI, a protein that associates with NOD2.[51,52]

4. Discussion

High-throughput technology for protein quantitation in large sample sizes has only recently become tractable. We conducted a systematic plasma proteomics study on UC subjects [from the PROgECT trial], CD subjects [from UNITI I and II], and matched HCs, as well as paired measures of protein and mRNA levels in a CD cohort [from the CERTIFI trial]. Two key observations were made. [1] CD and UC plasma proteomics profiles were significantly different from those of the HCs, and the up- and downregulated proteins revealed distinct biological functions, whereas the contrast between CD and UC proteomics was minimal. [2] The blood and intestine transcriptomes and plasma proteomics had very modest correlation and reflected different biological insights. Up- and downregulated proteins in IBD subjects belonged to different functional categories. Proteins upregulated in IBD versus controls were significantly enriched in chemokine and cytokine activity, immune disease biomarkers, and autoimmune disease GWAS signals [Tables S5–S7, Figure 4]. In contrast, the proteins downregulated in IBD were enriched for nutrition and metabolism [e.g. height, BMI, and obesity GWAS gene], potentially related to impaired gastrointestinal function in IBD.[53-55] Comparing IBD [UC subjects from the PROgECT trial and CD subjects from the UNITI I and II trials] vs HCs, we identified proteins of large fold changes [i.e. FC≥2]. Not unexpected, CRP, an inflammatory marker upregulated in IBD, was among the most upregulated in IBD compared with in the HCs [Table 3]. We further observed a large increase in PGD concentration (6-Phosphogluconate dehydrogenase; log2 fold change [FC] = 2.07), an enzyme in the pentose pathway that produces NADPH and is increased with the oxidative stress that is associated with CD.[56,57] The substantial fold changes enabled us to classify IBD vs HCs using select blood protein markers. As a proof of concept, in Figure S3, we display the joint distribution of PGD and CRP in our cohort. PGD and CRP levels were moderately correlated [Pearson correlation = 0.211, 95% CI: 0.075–0.339]. Together, PGD and CRP separated IBD patients from controls, validating a known blood biomarker for IBD.
Table 3.

Top IBD differentially expressed proteins [FDR≤10% and |log2FC| > 1].

Probe IDTargetGene symbolLog2-FCSE p valueCD vs UC
SL0002476-Phosphogluconate dehydrogenase PGD 2.070.1936.66E-16NS
SL000051CRP CRP 1.180.1596.45E-12NS
SL005793Cyclophilin F PPIF –1.030.1465.51E-11NS
SL000420Ferritin FTH1 FTL –1.20.1794.37E-10higher in CD
SL000433Glucagon GCG 1.060.1262.02E-14higher in CD
SL019979H2A3 HIST3H2A –1.330.1569.21E-15NS
SL000437Haptoglobin, Mixed Type HP 1.060.2248.28E-06NS
SL000836Haemoglobin HBA1 HBB –1.130.2411.09E-05NS
SL004536LEAP-1 HAMP –1.490.2061.99E-11higher in CD
SL000550PCI SERPINA5 1.670.2312.22E-11NS
SL000572SAA SAA1 2.120.3249.15E-10NS

IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease; FDR: false discovery rate; FC: fold change.; NS: non-significant.

Top IBD differentially expressed proteins [FDR≤10% and |log2FC| > 1]. IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease; FDR: false discovery rate; FC: fold change.; NS: non-significant. We found downregulation of REG4 [SOMAscan probeID: SL012561] in CD compared with UC. REG4 has been shown to be involved in the regeneration of damaged gastrointestinal mucosa in UC patients, where it has higher expression compared with HC subjects.[58] REG4 has been further suggested as a biomarker for the discrimination of CD from UC patients, as it showed differential expression in the inflamed and uninflamed biopsies of six CD patients when compared with six UC patients.[59,60] This study also detected higher REG4 plasma levels in CD vs control and UC vs control, supporting the strategy of applying high-throughput proteomics technology to larger patient populations to identify plasma protein biomarkers for diagnostics. Although UC and CD are two IBD subtypes with key clinical differences,[2,61] they displayed similar plasma proteomic profiles, comparable with those of previous findings at the mRNA level.[3] At 10% FDR, 31 proteins [2.4% of the proteins measured in this study] showed significantly different levels in CD vs UC subjects [Table 2], and of these 17 were related to the immune response. Our observation supports the clinical description that UC and CD are two subsets of the same disease. The common aetiology explained the highly similar proteomics profile. The UC and CD subjects we used in the study represented heterogeneous phenotypes within each cohort, in terms of disease duration and disease severity [Table S1], which is a limitation. We tested the association between plasma protein levels and disease duration or disease severity [Figure S4]. The Spearman correlation coefficient was centered at zero [Figure S4A and C], indicating disease duration or severity did not systematically shift the protein levels upward or downward. The p-values of Spearman correlation largely follow a diagonal line in the quantile–quantile plots [Figure S4B and D], indicating that the association between the plasma proteome and disease duration or disease severity was not more significant than random chance. In other words, the differential proteins identified in this study were not likely attributable to heterogeneity in the samples’ clinical characters. There are previous proteomics studies on IBD,[14-18] summarized in Table 4. However, these studies used small sample sizes and reported limited proteins or mass spectrometry peaks as different in IBD vs control, or as different between subtypes of IBD. Herein, we report a proteomics study on IBD of the largest sample size to date. The improved statistical power enabled us to identify 493 proteins with significantly different plasma levels in IBD vs controls, substantially more than the number of proteins reported by previous studies [Table 4]. Our findings showed certain degrees of consistency with previous reports, even though the assay platform and clinical states of the study subjects were different.
Table 4.

Comparison of differential proteins of present study with published IBD proteome studies.

Comparison groups and reference*Bio-sampleSample size by groupPublished differential proteinsPublished differential proteinsmeasured in the current studyPublished differential proteinsvalidated in the current study
IBD vs HCs[62]serumCD: 30 UC: 30 Inflammatory controls: 30 HCs: 30PF4, MRP8, FIBA, HPA2PF4
IBD vs HCs[63]bloodCD: 22 UC: 48 Colorectal Cancer: 5 Infectious colitis: 6; HCs: 13HNP1, NHP2, HNP3
CD vs UC[64]bloodCD: 13 UC: 17 HCs: 17ABHD14B, ACTB, ANXA6, CA2, PPIA, PRDX2, S100A9ANXA6, CA2, PPIA, S100A9CA2, PPIA
CD vs HCs[65]bloodHCs: 48 CD: 15APOA4, APOE, C3, C3F, FETUA, FIBA, FPAAPOE
IBD vs HCs[66]serumCD: 48 UC: 62APOC1, ITIH4, PF4ITIH4, PF4ITIH4
IBD vs HCs[67]intestineCD: 3 UC: 4 Inflammatory polyps: 2 Normal colon: 3TTBK2, SYNE2, SUCLG2, POSTN, ANXA2, EPX, LAP3, RDX, S100A8, MPO, DEFA1B, PRG2, LCP1, PSME1POSTN, ANXA2, PSME1, MPOPOSTN, ANXA2, PSME1
IBD vs HCs[68]serumUC: 27 CD: 56 HCs: 14 RA controls: 12SPP24, alpha-1-microglobulin
CD w/wo stricturing[69]serumCD: 18 UC: 9A2M, LDHB, CAPNS1, MBL2, APOB, CD5L, ALB, FLNA, SCGB1A1, BRDT, CTSD, PSMA7, CP, OIT3, F2, TNXAA2M, APOB, CAPNS1, CD5L, CTSD, F2, LDHB, MBL2A2M, LDHB, APOB
IBD vs HCs[70]§serumHCs: 48 CD: 15 UC: 26
CD vs UC[71]§colonCD: 27 UC: 24
CD vs UC[72]§colonCD: 26 UC: 36

* IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease.

§Study only reported mass spectrometry peaks but not proteins.

Comparison of differential proteins of present study with published IBD proteome studies. * IBD: inflammatory bowel disease; HCs: healthy controls; UC: ulcerative colitis; CD: Crohns’ disease. §Study only reported mass spectrometry peaks but not proteins. We showed that the plasma protein levels in IBD subjects were weakly positively correlated with the mRNA level in two IBD relevant tissues [blood and intestine]. The explanations are two-fold. [1] The plasma proteome is secreted from multiple organs and cell types, and the liver is a dominant source.[13] In 72 HCs of the PROgECT-UNITI-HCs cohort, we quantified immune cell compositions [FACS, Materials and Methods]. The plasma proteome was not associated with immune cell compositions [Table S13]. [2] It has been recognized that levels of mRNAs and the levels of the proteins they encode only correlate modestly.[73,74] For many genes, the functional products are the proteins rather than transcripts. Complex regulatory mechanisms are used by biological systems to modulate protein levels in response to environmental exposure or disease conditions. The regulatory mechanisms include translational regulation [e.g. small RNAs and regulatory proteins], ribosome density, and protein half-life.[73] Nevertheless, our results indicate that research into the IBD transcriptome and proteome reveal different aspects of the disease and are complementary.

Conflict of Interest

The authors have no conflict of interest to disclose.

Funding

The work was partially supported by NIH grants U24DK062429 and 1R01DK092235-02. Dr Ke Hao is partially supported by the National Natural Science Foundation of China [Grant Nos 21477087 and 91643201] and by the Ministry of Science and Technology of China [Grant No. 2016YFC0206507]. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  71 in total

1.  Dimeric Ube2g2 simultaneously engages donor and acceptor ubiquitins to form Lys48-linked ubiquitin chains.

Authors:  Weixiao Liu; Yongliang Shang; Yan Zeng; Chao Liu; Yanchang Li; Linhui Zhai; Pan Wang; Jizhong Lou; Ping Xu; Yihong Ye; Wei Li
Journal:  EMBO J       Date:  2013-12-23       Impact factor: 11.598

2.  C/EBPβ is a transcriptional key regulator of IL-36α in murine macrophages.

Authors:  Andreas Nerlich; Nanthapon Ruangkiattikul; Kristin Laarmann; Nina Janze; Oliver Dittrich-Breiholz; Michael Kracht; Ralph Goethe
Journal:  Biochim Biophys Acta       Date:  2015-06-08

3.  Proinflammatory cytokines induce neurotrophic factor expression in enteric glia: a key to the regulation of epithelial apoptosis in Crohn's disease.

Authors:  Georg B T von Boyen; Martin Steinkamp; Irmlind Geerling; Max Reinshagen; Karl H Schäfer; Guido Adler; Joachim Kirsch
Journal:  Inflamm Bowel Dis       Date:  2006-05       Impact factor: 5.325

4.  Cell-type deconvolution with immune pathways identifies gene networks of host defense and immunopathology in leprosy.

Authors:  Megan S Inkeles; Rosane Mb Teles; Delila Pouldar; Priscila R Andrade; Cressida A Madigan; David Lopez; Mike Ambrose; Mahdad Noursadeghi; Euzenir N Sarno; Thomas H Rea; Maria T Ochoa; M Luisa Iruela-Arispe; William R Swindell; Tom Hm Ottenhoff; Annemieke Geluk; Barry R Bloom; Matteo Pellegrini; Robert L Modlin
Journal:  JCI Insight       Date:  2016-09-22

5.  Distinct expression of interleukin (IL)-36α, β and γ, their antagonist IL-36Ra and IL-38 in psoriasis, rheumatoid arthritis and Crohn's disease.

Authors:  M-A Boutet; G Bart; M Penhoat; J Amiaud; B Brulin; C Charrier; F Morel; J-C Lecron; M Rolli-Derkinderen; A Bourreille; S Vigne; C Gabay; G Palmer; B Le Goff; F Blanchard
Journal:  Clin Exp Immunol       Date:  2016-02-22       Impact factor: 4.330

6.  Proteolytic cleavage and loss of function of biologic agents that neutralize tumor necrosis factor in the mucosa of patients with inflammatory bowel disease.

Authors:  Paolo Biancheri; Randall J Brezski; Antonio Di Sabatino; Allison R Greenplate; Keri L Soring; Gino R Corazza; Klaartje B Kok; Laura Rovedatti; Anna Vossenkämper; Nadja Ahmad; Susanne A Snoek; Severine Vermeire; Paul Rutgeerts; Robert E Jordan; Thomas T MacDonald
Journal:  Gastroenterology       Date:  2015-07-11       Impact factor: 22.682

7.  Epidermal CD147 expression plays a key role in IL-22-induced psoriatic dermatitis.

Authors:  Cong Peng; ShengXi Zhang; Li Lei; Xu Zhang; Xuekun Jia; Zhongling Luo; Xiaoyan Huang; Yanhong Kuang; Weiqi Zeng; Juan Su; Xiang Chen
Journal:  Sci Rep       Date:  2017-03-08       Impact factor: 4.379

8.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease.

Authors:  Luke Jostins; Stephan Ripke; Rinse K Weersma; Richard H Duerr; Dermot P McGovern; Ken Y Hui; James C Lee; L Philip Schumm; Yashoda Sharma; Carl A Anderson; Jonah Essers; Mitja Mitrovic; Kaida Ning; Isabelle Cleynen; Emilie Theatre; Sarah L Spain; Soumya Raychaudhuri; Philippe Goyette; Zhi Wei; Clara Abraham; Jean-Paul Achkar; Tariq Ahmad; Leila Amininejad; Ashwin N Ananthakrishnan; Vibeke Andersen; Jane M Andrews; Leonard Baidoo; Tobias Balschun; Peter A Bampton; Alain Bitton; Gabrielle Boucher; Stephan Brand; Carsten Büning; Ariella Cohain; Sven Cichon; Mauro D'Amato; Dirk De Jong; Kathy L Devaney; Marla Dubinsky; Cathryn Edwards; David Ellinghaus; Lynnette R Ferguson; Denis Franchimont; Karin Fransen; Richard Gearry; Michel Georges; Christian Gieger; Jürgen Glas; Talin Haritunians; Ailsa Hart; Chris Hawkey; Matija Hedl; Xinli Hu; Tom H Karlsen; Limas Kupcinskas; Subra Kugathasan; Anna Latiano; Debby Laukens; Ian C Lawrance; Charlie W Lees; Edouard Louis; Gillian Mahy; John Mansfield; Angharad R Morgan; Craig Mowat; William Newman; Orazio Palmieri; Cyriel Y Ponsioen; Uros Potocnik; Natalie J Prescott; Miguel Regueiro; Jerome I Rotter; Richard K Russell; Jeremy D Sanderson; Miquel Sans; Jack Satsangi; Stefan Schreiber; Lisa A Simms; Jurgita Sventoraityte; Stephan R Targan; Kent D Taylor; Mark Tremelling; Hein W Verspaget; Martine De Vos; Cisca Wijmenga; David C Wilson; Juliane Winkelmann; Ramnik J Xavier; Sebastian Zeissig; Bin Zhang; Clarence K Zhang; Hongyu Zhao; Mark S Silverberg; Vito Annese; Hakon Hakonarson; Steven R Brant; Graham Radford-Smith; Christopher G Mathew; John D Rioux; Eric E Schadt; Mark J Daly; Andre Franke; Miles Parkes; Severine Vermeire; Jeffrey C Barrett; Judy H Cho
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

9.  Protein co-expression network analysis (ProCoNA).

Authors:  David L Gibbs; Arie Baratt; Ralph S Baric; Yoshihiro Kawaoka; Richard D Smith; Eric S Orwoll; Michael G Katze; Shannon K McWeeney
Journal:  J Clin Bioinforma       Date:  2013-06-01

10.  Inherited determinants of Crohn's disease and ulcerative colitis phenotypes: a genetic association study.

Authors:  Isabelle Cleynen; Gabrielle Boucher; Luke Jostins; L Philip Schumm; Sebastian Zeissig; Tariq Ahmad; Vibeke Andersen; Jane M Andrews; Vito Annese; Stephan Brand; Steven R Brant; Judy H Cho; Mark J Daly; Marla Dubinsky; Richard H Duerr; Lynnette R Ferguson; Andre Franke; Richard B Gearry; Philippe Goyette; Hakon Hakonarson; Jonas Halfvarson; Johannes R Hov; Hailang Huang; Nicholas A Kennedy; Limas Kupcinskas; Ian C Lawrance; James C Lee; Jack Satsangi; Stephan Schreiber; Emilie Théâtre; Andrea E van der Meulen-de Jong; Rinse K Weersma; David C Wilson; Miles Parkes; Severine Vermeire; John D Rioux; John Mansfield; Mark S Silverberg; Graham Radford-Smith; Dermot P B McGovern; Jeffrey C Barrett; Charlie W Lees
Journal:  Lancet       Date:  2015-10-18       Impact factor: 202.731

View more
  5 in total

1.  Broadly effective metabolic and immune recovery with C5 inhibition in CHAPLE disease.

Authors:  Ahmet Ozen; Nurhan Kasap; Ivan Vujkovic-Cvijin; Richard Apps; Foo Cheung; Elif Karakoc-Aydiner; Bilge Akkelle; Sinan Sari; Engin Tutar; Figen Ozcay; Dilara Kocacik Uygun; Ali Islek; Gamze Akgun; Merve Selcuk; Oya Balci Sezer; Yu Zhang; Gunsel Kutluk; Erdem Topal; Ersin Sayar; Cigdem Celikel; Roderick H J Houwen; Aysen Bingol; Ismail Ogulur; Sevgi Bilgic Eltan; Andrew L Snow; Camille Lake; Giovanna Fantoni; Camille Alba; Brian Sellers; Samuel D Chauvin; Clifton L Dalgard; Olivier Harari; Yan G Ni; Ming-Dauh Wang; Kishor Devalaraja-Narashimha; Poorani Subramanian; Rabia Ergelen; Reha Artan; Sukru Nail Guner; Buket Dalgic; John Tsang; Yasmine Belkaid; Deniz Ertem; Safa Baris; Michael J Lenardo
Journal:  Nat Immunol       Date:  2021-01-04       Impact factor: 25.606

2.  Identification of Immune-Related Gene Signature and Prediction of CeRNA Network in Active Ulcerative Colitis.

Authors:  Mengmeng Xu; Ying Kong; Nannan Chen; Wenlong Peng; Ruidong Zi; Manman Jiang; Jinfeng Zhu; Yuting Wang; Jicheng Yue; Jinrong Lv; Yuanyuan Zeng; Y Eugene Chin
Journal:  Front Immunol       Date:  2022-03-22       Impact factor: 7.561

Review 3.  PARK7/DJ-1 as a Therapeutic Target in Gut-Brain Axis Diseases.

Authors:  Domonkos Pap; Apor Veres-Székely; Beáta Szebeni; Ádám Vannay
Journal:  Int J Mol Sci       Date:  2022-06-14       Impact factor: 6.208

4.  Immunomodulatory role of Parkinson's disease 7 in inflammatory bowel disease.

Authors:  Rita Lippai; Apor Veres-Székely; Erna Sziksz; Yoichiro Iwakura; Domonkos Pap; Réka Rokonay; Beáta Szebeni; Gábor Lotz; Nóra J Béres; Áron Cseh; Attila J Szabó; Ádám Vannay
Journal:  Sci Rep       Date:  2021-07-16       Impact factor: 4.379

Review 5.  Integrating omics for a better understanding of Inflammatory Bowel Disease: a step towards personalized medicine.

Authors:  Manoj Kumar; Mathieu Garand; Souhaila Al Khodor
Journal:  J Transl Med       Date:  2019-12-13       Impact factor: 8.440

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.