Marco Gasparetto1, Felicity Payne1, Komal Nayak2, Judith Kraiczy2, Claire Glemas1, Yosef Philip-McKenzie2, Alexander Ross3, Rachel D Edgar4, Daniel R Zerbino4, Camilla Salvestrini5, Franco Torrente5, Nicholas T Ventham6, Rahul Kalla7, Jack Satsangi8, Peter Sarkies9, Robert Heuschkel10, Matthias Zilbauer11. 1. Department of Paediatrics, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom; Department of Paediatric Gastroenterology, Hepatology and Nutrition, Cambridge University Hospitals, Addenbrooke's, Cambridge, United Kingdom. 2. Department of Paediatrics, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom. 3. Department of Paediatrics, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom; Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom. 4. European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom. 5. Department of Paediatric Gastroenterology, Hepatology and Nutrition, Cambridge University Hospitals, Addenbrooke's, Cambridge, United Kingdom. 6. Academic Coloproctology, Western General Hospital, Edinburgh, Edinburgh, United Kingdom. 7. Medical Research Council Centre for Inflammation Research, Queens Medical Research Institute, University of Edinburgh, Edinburgh, United Kingdom. 8. Translational Gastroenterology Unit, University of Oxford, Oxford, United Kingdom. 9. Medical Research Council London Institute of Medical Sciences, United Kingdom; Institute of Clinical Sciences, Imperial College London, United Kingdom. 10. Department of Paediatric Gastroenterology, Hepatology and Nutrition, Cambridge University Hospitals, Addenbrooke's, Cambridge, United Kingdom. Electronic address: robert.heuschkel@addenbrookes.nhs.uk. 11. Department of Paediatrics, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom; Department of Paediatric Gastroenterology, Hepatology and Nutrition, Cambridge University Hospitals, Addenbrooke's, Cambridge, United Kingdom; Wellcome Trust-Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, United Kingdom. Electronic address: mz304@cam.ac.uk.
Abstract
BACKGROUND & AIMS: Gene expression patterns of CD8+ T cells have been reported to correlate with clinical outcomes of adults with inflammatory bowel diseases (IBD). We aimed to validate these findings in independent patient cohorts. METHODS: We obtained peripheral blood samples from 112 children with a new diagnosis of IBD (71 with Crohn's disease and 41 with ulcerative colitis) and 19 children without IBD (controls) and recorded medical information on disease activity and outcomes. CD8+ T cells were isolated from blood samples by magnetic bead sorting at the point of diagnosis and during the course of disease. Genome-wide transcription (n = 192) and DNA methylation (n = 66) profiles were generated using Affymetrix and Illumina arrays, respectively. Publicly available transcriptomes and DNA methylomes of CD8+ T cells from 3 adult patient cohorts with and without IBD were included in data analyses. RESULTS: Previously reported CD8+ T-cell prognostic expression and exhaustion signatures were only found in the original adult IBD patient cohort. These signatures could not be detected in either a pediatric or a second adult IBD cohort. In contrast, an association between CD8+ T-cell gene expression with age and sex was detected across all 3 cohorts. CD8+ gene transcription was clearly associated with IBD in the 2 cohorts that included non-IBD controls. Lastly, DNA methylation profiles of CD8+ T cells from children with Crohn's disease correlated with age but not with disease outcome. CONCLUSIONS: We were unable to validate previously reported findings of an association between CD8+ T-cell gene transcription and disease outcome in IBD. Our findings reveal the challenges of developing prognostic biomarkers for patients with IBD and the importance of their validation in large, independent cohorts before clinical application.
BACKGROUND & AIMS: Gene expression patterns of CD8+ T cells have been reported to correlate with clinical outcomes of adults with inflammatory bowel diseases (IBD). We aimed to validate these findings in independent patient cohorts. METHODS: We obtained peripheral blood samples from 112 children with a new diagnosis of IBD (71 with Crohn's disease and 41 with ulcerative colitis) and 19 children without IBD (controls) and recorded medical information on disease activity and outcomes. CD8+ T cells were isolated from blood samples by magnetic bead sorting at the point of diagnosis and during the course of disease. Genome-wide transcription (n = 192) and DNA methylation (n = 66) profiles were generated using Affymetrix and Illumina arrays, respectively. Publicly available transcriptomes and DNA methylomes of CD8+ T cells from 3 adult patient cohorts with and without IBD were included in data analyses. RESULTS: Previously reported CD8+ T-cell prognostic expression and exhaustion signatures were only found in the original adult IBD patient cohort. These signatures could not be detected in either a pediatric or a second adult IBD cohort. In contrast, an association between CD8+ T-cell gene expression with age and sex was detected across all 3 cohorts. CD8+ gene transcription was clearly associated with IBD in the 2 cohorts that included non-IBD controls. Lastly, DNA methylation profiles of CD8+ T cells from children with Crohn's disease correlated with age but not with disease outcome. CONCLUSIONS: We were unable to validate previously reported findings of an association between CD8+ T-cell gene transcription and disease outcome in IBD. Our findings reveal the challenges of developing prognostic biomarkers for patients with IBD and the importance of their validation in large, independent cohorts before clinical application.
Gene expression patterns of CD8+ T cells was reported to correlate with outcomes of adults with inflammatory bowel diseases (IBD). T-cell exhaustion has been associated with IBD progression.
New Findings
Transcription and DNA methylation patterns of CD8+ T cells vary with patient age, sex, and IBD vs non-IBD but do not correlate with outcome.
Limitations
This was a study of 112 pediatric IBD patients and 19 controls; larger studies are needed.
Impact
The findings reveal the challenges of developing tests that determine prognoses of patients with IBD. All proposed prognostic markers must be validated in large, independent cohorts before they can be used in the clinic.Inflammatory bowel diseases (IBD) such as Crohn’s disease (CD) and ulcerative colitis (UC) are complex conditions that vary vastly in their phenotype and disease course. Among the most prominent factors to impact on disease presentation and behavior is the age of onset, with distinct phenotypic differences reported particularly in children diagnosed younger than 6 years.1, 2, 3 In older children, who comprise the vast majority of childhood-onset IBD, phenotype and disease behavior are considered to be more similar to adult-onset IBD. Nevertheless, our knowledge of factors contributing to phenotypic differences related to age of disease onset in IBD, such as affected cell types and specific molecular mechanisms involved, remains limited. CD8+ T cells have been implicated in the pathogenesis of several immune-mediated diseases, including IBD.
,
An important finding linking CD8+ T-cell biology to the pathogenesis of immune-mediated diseases was the discovery of a prognostic transcriptional signature in adult patients diagnosed with several conditions, including systemic lupus erythematosus, antineutrophil cytoplasmic antibody–associated vasculitis, as well as IBD.
,
Specifically, expression of distinct signature genes was found to divide patients into 2 groups that differed in their disease behavior. Moreover, T-cell exhaustion was proposed as an underlying mechanism as patients displaying an “exhausted” expression signature were shown to have a milder disease course. In addition to gene transcription, increasing evidence points towards a major role for epigenetic mechanisms, such as DNA methylation, in regulating fundamental aspects of CD8+ T-cell function, including proliferation, activation, and T-cell exhaustion.
,Given that existing evidence in this area has been exclusively derived from studies performed in adult IBD populations combined with a large body of evidence supporting age-related differences in T-cell function,
,
we set out to investigate CD8+ T-cell biology in childhood-onset IBD. We prospectively recruited a cohort of 131 children newly diagnosed (treatment-naïve) with IBD (n = 112) and non-IBD controls (n = 19, Table 1
) and isolated CD8+ T-cells from peripheral blood samples. All patients were followed-up for a minimum of 18 months from diagnosis and detailed clinical phenotype as well as outcome and treatment data were documented (Supplementary Table 1). In addition, longitudinal samples were obtained from a subset of patients at 3-month intervals (n = 62 samples). In total, we generated 193 CD8+ T-cell transcriptomes and 66 DNA methylomes and set out to analyze these datasets with a view to identify variation amongst patients and potential correlation with clinical outcome. Importantly, the use of publicly available and previously published datasets, including a further independent adult IBD cohort, allowed us to test for the presence of age-of-onset and cohort-specific differences in CD8+ T-cell–derived molecular profiles.
Table 1
Summary of Pediatric Patients, Samples, and Molecular Profiles
Variable
CDa
UCb
Control
No. of patients
71
41
19
Age, y, median (range)
13 (6–17)
13 (5–16)
13 (4–16)
Sex, male, n (%)
46 (70)
45 (51)
8 (42)
Disease distribution, n (%)c
CD–L1: 7 L1+ L4: 3
10 (14)
—
—
CD–L2: 13 L2+L4: 10
23 (32)
—
—
CD–L3: 23 L3+L4: 14
33 (52)
—
—
CD–L4
1 (1)
UC–E1
—
5 (12)
—
UC–E2
—
7 (17)
—
UC–E3
—
6 (15)
—
UC–E4
—
23 (56)
—
CD8 purified samples, n
At diagnosisd
71 (67)
41 (40)
19 (19)
Longitudinald
36 (36)
26 (26)
—
With methylation datad
66 (66)
—
—
107 samples from 71 patients.
67 samples from 41 patients.
Disease location for CD and UC at diagnosis as per Paris Classification for pediatric IBD.
Number of samples after quality control in parentheses.
Summary of Pediatric Patients, Samples, and Molecular Profiles107 samples from 71 patients.67 samples from 41 patients.Disease location for CD and UC at diagnosis as per Paris Classification for pediatric IBD.Number of samples after quality control in parentheses.
Methods
Ethical Approval
Ethical approval was obtained from the local research committee (REC 12/EE/0482 and REC 17/EE/0265) and patients were prospectively recruited after informed consent. All investigations were carried out according to the Declaration of Helsinki and Good Clinical Practice Guidelines.
Patient Recruitment and Clinical Data Recording
A total of 131 children aged between 4 and 17 years (median 13 years) were recruited prospectively between March 2013 and March 2016 at Cambridge University Hospitals National Health Service Foundation Trust in the Department of Paediatric Gastroenterology, Hepatology and Nutrition. Diagnosis of Inflammatory Bowel Disease was made according to current international guidelines (revised Porto criteria) and included measurement of serum and stool inflammatory markers, upper and lower gastrointestinal endoscopies, and radiologic and histological examination. Any patient with gastrointestinal and/or extraintestinal diseases other than IBD was excluded from the study. In total, 71 patients were diagnosed with CD, 41 with UC, and 19 were classed as non-IBD, healthy controls. The latter were defined as patients who underwent endoscopic examination as part of their routine clinical care and were found to have normal macroscopic and histological appearance for their intestinal mucosa and complete resolution of any gastrointestinal symptoms.All patients were followed up for a minimum of 18 months in the Cambridge pediatric gastroenterology unit, and detailed clinical data was prospectively recorded using the hospital’s patient electronic database (EPIC). This included phenotypic parameters at diagnosis (eg, presence of diarrhea, rectal bleeding, weight loss, extraintestinal manifestations, and perianal disease); disease activity scores: Pediatric Crohn’s Disease Activity Index and Pediatric Ulcerative Colitis Activity Index; and information on disease course and outcomes. The latter covered number of treatment escalations, treatment history, response to treatment, requirement for surgical intervention, treatment with biologics and the number of unplanned inpatient admission days (Supplementary Table 1). In order to account for the fact that disease outcome is not restricted to a single measure, we designed a severity score that considered the number of treatment escalations, escalation to treatment with biologics, perianal disease, and surgery, as well as unplanned/urgent inpatient admissions (Supplementary Table 2). The score was calculated at 18 months from diagnosis and patients were categorized into mild, moderate, and severe groups after blinded scoring by at least 2 consultant pediatric gastroenterologists.
Supplementary Table 2
Disease Outcome Severity Score
CD
UC
No. of treatment escalations
No. of treatment escalations
0–1: 0
0–1: 0
2: 1
2: 1
≥3: 2
3: 2
Biologics
Biologics
No: 0
No: 0
Yes: 2
Yes: 2
CD-related surgery
UC-related surgery (colectomy)
No: 0
No: 0
Yes: 2
Yes: 2
Perianal disease
Steroid-free remission at 3 mo from diagnosis (PUCAI score <10)
A peripheral blood sample (volume varied according to age from 10 to 25 mL) for the purification of CD8+ T cells was taken on the same day as diagnostic endoscopy was performed. All children were treatment-naïve at this time point. Additional longitudinal samples (n = 62) were taken from a subset of children at 3-month intervals post diagnosis. Disease status was recorded as either “in remission” or “active disease” based on their clinical and biochemical disease activity scores Pediatric Crohn’s Disease Activity Index and Pediatric Ulcerative Colitis Activity Index and the treatment history recorded (Supplementary Table 1). All samples were processed immediately and CD8+ T cells isolated using magnetic bead sorting as detailed below.
Magnetic Bead Sorting
A peripheral blood sample of 10 mL (age 4–10 years) or 25 mL (age 10–17 years) was obtained and CD8+ T cells extracted using magnetic bead sorting. Briefly, peripheral blood mononuclear cells were isolated by density centrifugation over Ficoll (Histopaque 1077; GE Healthcare, Chicago, IL) and CD8+ T cells were separated by magnetic cell sorting using anti CD8 microbeads (Miltenyi Biotech, Bergisch Gladbach, Germany) as described by the manufacturer. Separation was performed on an AutoMACs Pro Separator (Miltenyi Biotech). Cell purity was regularly assessed on a subset of random samples using flow cytometry (see Supplementary Methods). With a mean of 84% (72.5%–93.8 %), purity of our samples was found to be similar or higher compared with other published datasets.
DNA and RNA Extraction
DNA and RNA were extracted simultaneously from isolated CD8+ T-cell samples using AllPrep MiniKit (Qiagen, Hilden, Germany), according to the manufacturer’s instructions. DNA and RNA quality were assessed using an Agilent Bioanalyser 2100 (Agilent, Santa Clara, CA) and quantified by spectrophotometry using a NanoDrop ND-1000 spectrophotometer (ThermoFisher Scientific, Waltham, MA). DNA was bisulfite-converted using Zymo DNA methylation Gold kit (Zymo Research, Irvine, CA).
Genome Wide Transcriptional and DNA Methylation Profiling
Whole genome transcript analysis was performed on 200 ng of total RNA using the Affymetrix Human Gene ST version 2.0 Array (Affymetrix, Santa Clara, CA). Genome-wide DNA methylation was profiled using the Illumina EPIC platform (Illumina, Cambridge, UK). All microarray data have been deposited in ArrayExpress, accession numbers: E-MTAB-7923 (expression data) and E-MTAB-8925 (methylation data).
Publicly Available Datasets
Gene expression microarray data from 2 previously published adult patient cohorts were included. For adult cohort 1 (GSE87650), a total of 52 CD8+ T-cell–specific genome-wide transcriptomes (profiled on Illumina HumanHT-12, version 4) were available (n = 14 healthy controls, n = 19 CD, n = 19 UC). A summary of this patient cohort is provided in Supplementary Table 3. Adult cohort 2 included a total of 67 CD8+ T-cell–specific transcriptomes (profiled on Affymetrix Human Gene ST, versions 1.0 and 1.1) with samples derived from patients diagnosed with CD (n = 35) and UC (n = 32) (E-MTAB-331). Clinical phenotype data available for the latter cohort included age, sex, and disease severity classed as either severe (IBD1) or mild (IBD2). We obtained publicly available CD8+ T-cell–derived genome-wide DNA methylation profiles (K450 Illumina DNA methylation arrays) from 2 previously published adult patient cohorts. Adult cohort 1 (GSE87640) contained a total of 56 CD8+ T-cell samples—18 CD, 19 UC, and 19 healthy control individuals (age range, 18–63 years). Adult cohort 3 included samples obtained from a younger (age range, 22–34 years, n = 50) and older (age range, 73–84 years, n = 50) group of healthy adults (GSE59065).
Supplementary Table 3
Adult Cohort 1: Summary of Patients Passing Quality Control, Samples, and Molecular Profiles
Variable
CD
UC
Control
No. of patients
19
19
14
Age, y, median (range)
30 (18–63)
33 (22–62)
35 (20–58)
Sex, male, n (%)
11 (58)
13 (68)
10 (71)
Smoking status, n (%)
Current smoker
8 (42)
1 (5)
2 (14)
Ex smoker
4 (21)
9 (47)
8 (57)
Never smoked
7 (37)
8 (42)
2 (14)
Unknown
0
1 (5)
2 (14)
Disease distribution,a n (%)
CD–L1
3 (16)
—
—
CD–L2
7 (37)
—
—
CD–L3: 5 L3+L4: 1
6 (32)
—
—
UC–E2
—
6 (32)
UC–E3
—
4 (21)
—
UC–E4
—
4 (21)
—
Unknown
3 (16)
5 (26)
—
Disease location for CD and UC at diagnosis.
Data analyses were performed separately in each dataset following the same analytical approach as outlined below, with adjustments made to account for differences in array type. Datasets derived from different studies were never combined or analyzed together.
Bioinformatic Analyses
A detailed description of bioinformatic methods used in this publication is provided in the Supplementary Methods section. Briefly, all analyses were performed in R, version 3.5.2. Pre-processing of Affymetrix gene expression array data included normalization of raw signal intensity data using the variance stabilization and calibration with robust multi-array average method as part of the affy package (version 1.56.0). Preprocessing of Illumina gene expression array data was performed using the lumi package (version 2.34.0) and normalized using robust spline normalization of log-transformed raw data. Quality-control assessment of all datasets was performed independently, using arrayQualityMetrics (version 3.34.0). Samples failing quality control were removed, and batch correction was performed using the “ComBat” function as part of the sva package (version 3.26.0). Our study design accounted for expected technical variation, including batch, by ensuring a balanced distribution of cases (ie, UC and CD) and controls between batches, as well as the inclusion of technical replicates. Successful batch correction was confirmed on technical replicates as well as principal variance component analyses. The latter was also used to demonstrate retention of biologic signals, such as sex, diagnosis, and age. Analyses were also performed on samples within individual batches and confirmed the results of combined batches. Data was annotated using the hugene20sttranscriptcluster.db, hugene10sttranscriptcluster.db or lumiHumanIDMapping package, dependent on array version. A total of 67 CD, 40 UC, 19 control, and 62 follow-up pediatric patient samples were retained for downstream analysis. Weighted gene co-expression network analysis (WGCNA) analyses were performed on normalized and batch-corrected datasets using the WGCNA package (version 1.63) and resulting modules were correlated with clinical phenotypes as described previously.DNA methylation data was processed using the minfi package (version 1.28.0) and included functional normalization and quality-control assessment as described previously. Published datasets included in this study were subjected to the same analyses. Epigenetic age and T-cell abundances were calculated using an established method developed by Horvath.
Results
Variation of CD8+ T-Cell Gene Transcription Shows Association With Disease, Age, Systemic Inflammation, and Sex
Transcriptional plasticity of CD8+ T cells occurs during systemic inflammation and distinct differences have been reported in patients diagnosed with chronic inflammatory conditions, including IBD.
,
In order to determine the degree of variation in CD8+ T-cell gene transcription within our sample cohort, we first performed principal component (PC) analyses and tested the correlation between observed variance and phenotype at diagnosis. For these analyses, we included samples obtained from children at the point of diagnosis (treatment naïve, UC n = 40, CD n = 67), as well as non-IBD controls (n = 19). Variation in CD8+ T-cell gene transcription was found to be significantly associated with diagnosis (ie, difference between IBD and non-IBD controls; Figure 1
A) and age (Supplementary Figure 1
A), both contributing to the largest proportion of variance observed in PC1. Although a major overlap between IBD and control patient–derived samples was observed with 89% (ie, 17 of 19) of control samples clustering closely together, a proportion of IBD samples separated in PC1 (Figure 1
B). Additional associations were identified for disease distribution, disease subtype (ie, UC vs CD), as well as albumin and a number of serum inflammatory markers, such as platelet count, erythrocyte sedimentation rate, and C-reactive protein. Furthermore, sex was found to be strongly associated with variance observed in PCs 8–10 (Figure 1
A, Supplementary Figure 1
B). Next, we performed similar analyses on previously published CD8+ T-cell transcriptomes obtained from an adult patient cohort, including both IBD patients (n = 38) and healthy controls (n = 14, adult cohort 1, Supplementary Table 2). As shown in Figure 1
C and D, variation in CD8+ T-cell–specific gene transcription in this adult patient cohort was also associated with diagnosis, age, and sex. Similar to the pediatric samples, the majority of adult control samples clustered closely together (78% of control samples) with a subset of IBD patient–derived CD8+ T-cell transcriptomes separated out (Figure 1
D). An association between CD8+ T-cell transcription with age and sex was also confirmed in a second adult IBD patient cohort (adult cohort 2, Supplementary Figure 1
C).
Figure 1
Genome-wide transcriptional profiles of CD8+ T cells obtained from children newly diagnosed with CD (n = 67), UC (n = 40), and healthy controls (n = 19), as well as from a previously published adult patient cohort (CD n = 19, UC n = 19, non-IBD controls n = 14, adult cohort 1). (A) Observed variance within pediatric CD8+ T-cell transcriptomes (top panel) in the first 10 PCs. Heatmap displaying correlation between observed transcriptional variance and phenotype as well as selected serum markers at diagnosis (bottom panel). (B) PC analysis plot of pediatric CD8+ T-cell transcriptomes illustrating close clustering of samples derived from non-IBD controls (red circle containing 17 of 19 control samples). (C) Variance in adult CD8+ T-cell transcriptomes also shows association with diagnosis, sex, and age. (D) PC analysis plot of adult CD8+ T-cell transcriptomes showing a similar distribution with close clustering of most non-IBD samples (red circle containing 11 of 14 control samples). P values were generated with a Kendall correlation for continuous variables, or an analysis of variance for categorical.
Supplementary Figure 1
(A, B) Genome-wide transcriptional profiles of CD8+ T cells obtained from children newly diagnosed with CD (n = 67), UC (n = 40), and healthy controls (n = 19). PC analysis plots were generated after pre-processing and batch correction. (A) PC1 vs PC2 with samples labeled by age: 10 years or younger (n = 26, red) or older than 10 years (n = 100, blue). (B) Showing PC1 vs PC9 with samples labeled by sex: male (n = 74, blue), female (n = 52, red). (C) Assessment of variance within CD8+ T-cell transcriptomes generated from adult patients diagnosed with CD (adult cohort 2, n = 35) in each PC (top panel). Heatmap displaying correlation between observed transcriptional variance with sex and age (bottom panel). (D) PC analysis plot of samples obtained from pediatric patients at diagnosis and during follow-up, illustrating overlap between the majority of samples derived from patients in clinical remission (blue circle, 30 of 32) and non-IBD controls (red circle, 17 of 19).
Genome-wide transcriptional profiles of CD8+ T cells obtained from children newly diagnosed with CD (n = 67), UC (n = 40), and healthy controls (n = 19), as well as from a previously published adult patient cohort (CD n = 19, UC n = 19, non-IBD controls n = 14, adult cohort 1). (A) Observed variance within pediatric CD8+ T-cell transcriptomes (top panel) in the first 10 PCs. Heatmap displaying correlation between observed transcriptional variance and phenotype as well as selected serum markers at diagnosis (bottom panel). (B) PC analysis plot of pediatric CD8+ T-cell transcriptomes illustrating close clustering of samples derived from non-IBD controls (red circle containing 17 of 19 control samples). (C) Variance in adult CD8+ T-cell transcriptomes also shows association with diagnosis, sex, and age. (D) PC analysis plot of adult CD8+ T-cell transcriptomes showing a similar distribution with close clustering of most non-IBD samples (red circle containing 11 of 14 control samples). P values were generated with a Kendall correlation for continuous variables, or an analysis of variance for categorical.In order to investigate potential transcriptional changes over time and in response to treatment, we obtained longitudinal blood samples (n = 62) and isolated CD8+ T cells from a subset of patients at various stages post diagnosis, including during early remission (3 months post induction), sustained remission (6 months post induction), and first and second relapse (Table 1). Although we did not observe major differences in CD8+ gene expression based on specific treatment received (data not shown), samples obtained from patients in clinical remission appeared to cluster more closely to non-IBD controls (Supplementary Figure 1
D), suggesting an impact of systemic inflammation on CD8+ T-cell transcription.In summary, variation observed in CD8+ T-cell transcriptional profiles obtained from children and adults with IBD and controls showed significant association with diagnosis, age, systemic inflammation, and sex.
Testing for the Presence of a Previously Reported Prognostic CD8+ T-Cell–Specific Expression Signature in Adult and Pediatric Inflammatory Bowel Disease Patient Cohorts
A CD8+ T-cell–specific gene transcription signature has been shown to correlate with disease outcome in adult patients with several immune-mediated diseases, including antineutrophil cytoplasmic antibody–associated vasculitis, systemic lupus erythematosus, as well as adult patients diagnosed with CD and UC.
,
We analyzed 2 previously published adult (adult cohort 1 and 2) and our pediatric IBD patient–derived sample cohorts for the presence of the previously reported CD8+ T-cell–specific, prognostic transcriptional signature.Using the same analytical approach (Supplementary Methods), we confirmed the presence of a CD8+ T-cell expression signature in the adult CD and UC patient cohort in which the signature was first discovered (adult cohort 2). As shown in Figure 2
A and Supplementary Figure 2
A, unsupervised clustering according to expression of signature genes separated CD and UC patient–derived CD8+ T-cell samples into 2 distinct groups that were reported to vary in disease outcome (IBD1 = severe; IBD2 = mild) based on number of treatment escalations. A major overlap between the CD and UC cohort–derived expression signatures was also confirmed (Supplementary Figure 2
B). We next applied the same expression signature to our pediatric dataset, selecting genes of the core adult IBD signature and subjecting the resulting expression profiles to unsupervised clustering (Supplementary Methods). As shown in Figure 2
B, CD8+ T-cell expression of signature genes in pediatric IBD samples obtained at the point of diagnosis (treatment-naïve, active disease, n = 107) did not lead to any significant clustering. Given the major similarities in disease phenotype, behavior and pathogenesis between adolescent (12–16 years) and adult onset IBD, we performed additional analyses to test for the presence of the previously reported prognostic transcriptional signature in older children (ie, children diagnosed aged 12–16 years). As shown in Figure 2
C, omitting younger IBD patients from the analyses yielded similar results, with no clear clustering observed. Next, we included a second adult IBD patient cohort (adult cohort 1) in our analyses and applied the previously reported expression signature. Similar to pediatric patient–derived CD8+ T-cell transcriptomes, no clear clustering based on the expression of signature genes was observed in a total of 38 adult IBD patients (Figure 2
D). No clustering could be identified when performing analyses after separating pediatric samples into CD and UC applying disease subtype–specific signatures (Supplementary Figure 2
C and D), and when combining all samples, including CD, UC, non-IBD controls, as well as longitudinal samples (n = 188, Supplementary Figure 2
E).
Figure 2
Disease prognostic transcriptional signature in adult and pediatric CD8+ T-cell samples. Genes forming the previously reported transcriptional signature were selected from genome-wide transcriptomes and subjected to hierarchical clustering. (A) Heatmap and hierarchical clustering of genes forming the IBD prognostic expression signature in CD8+ T cells obtained from adult patients diagnosed with CD (n = 35, adult cohort 2). (B, C) Heatmap and hierarchical clustering tree for genes forming the core adult IBD prognostic expression signature applied to CD8+ T-cell transcriptomes of (B) all children newly diagnosed with IBD (n = 107), and (C) children diagnosed aged 12 to 16 years (n = 74). (D) Prognostic signature applied to a second adult IBD patient cohort (adult cohort 1, n = 38). Clustering was tested for statistical significance using M3C.
Supplementary Figure 2
Application of previously reported disease prognostic transcriptional signature to adult and pediatric CD8+ T-cell samples. Genes forming the previously reported transcriptional signature were selected from genome-wide transcriptomes and subjected to hierarchical clustering. (A) Heatmap and hierarchical clustering of genes forming the UC-specific prognostic expression signature in CD8+ T cells obtained from adult patients diagnosed with UC (n = 32, original adult cohort 2). (B) Venn diagram illustrating overlap of transcriptional signatures identified in adult patients diagnosed with CD (n = 35) and UC (n = 32) forming the core IBD signature. (C–E) Heatmap and hierarchical clustering tree for genes forming adult IBD prognostic expression signature applied to CD8+ T-cell transcriptomes of (C) all children newly diagnosed with CD (n = 67, adult CD-specific signature applied), (D) all children newly diagnosed with UC (n = 40, adult UC-specific signature applied), and (E) all pediatric patient derived CD8+ T-cell transcriptomes, including non-IBD controls and longitudinal samples (n = 188, adult core IBD signature applied).
Disease prognostic transcriptional signature in adult and pediatric CD8+ T-cell samples. Genes forming the previously reported transcriptional signature were selected from genome-wide transcriptomes and subjected to hierarchical clustering. (A) Heatmap and hierarchical clustering of genes forming the IBD prognostic expression signature in CD8+ T cells obtained from adult patients diagnosed with CD (n = 35, adult cohort 2). (B, C) Heatmap and hierarchical clustering tree for genes forming the core adult IBD prognostic expression signature applied to CD8+ T-cell transcriptomes of (B) all children newly diagnosed with IBD (n = 107), and (C) children diagnosed aged 12 to 16 years (n = 74). (D) Prognostic signature applied to a second adult IBD patient cohort (adult cohort 1, n = 38). Clustering was tested for statistical significance using M3C.Together, our extensive analyses were unable to confirm the presence of the previously reported prognostic CD8+ T-cell–derived transcriptional signature in our large pediatric or second adult IBD cohort.
CD8+ T-Cell Transcription in Childhood Inflammatory Bowel Disease Lacks Correlation With Disease Outcome
In the absence of the previously reported adult expression signature, we next tested our entire pediatric dataset for the presence of a potential childhood-specific prognostic signature by correlating expression data with clinical outcome. Among other strategies we performed WGCNA, allowing the identification of gene groups (called eigengenes or modules) that correlate with clinical outcome measures as described previously (Supplementary Methods). Applying WGCNA to adult CD patient–derived expression data (adult cohort 2) revealed several strong gene expression modules, some of which significantly correlated with reported patient outcome status (ie, IBD1/IBD2, Figure 3
Ai and Supplementary Table 4). A major overlap of the genes forming these modules with the previously reported prognostic expression signature was observed (Figure 3
Aii). Moreover, expression of genes defining the top module was found to cluster patients according to outcome status, confirming the validity of this analytical approach (Figure 3
Aiii). In contrast to results obtained from the adult CD dataset, none of the gene modules resulting from the analysis of the pediatric CD dataset correlated significantly with any of the clinical outcome parameters tested, including number of treatment escalations, requirement for treatment with biologics, surgery, or with a summary disease outcome score designed to account for the fact that disease outcome is not defined by a single measurement (see Supplementary Methods and Supplementary Table 2 for details; Figure 3
Bi). The validity of our approach in the pediatric sample cohort was confirmed by the identification of module–trait relationships for sex and age, although the latter only reached statistical significance in UC patient–derived samples (Figure 3
Bii, Supplementary Table 4 and Supplementary Figure 3
A). Lastly, we preformed WGCNA analyses on CD8+ T-cell transcriptomes obtained from the older cohort of children diagnosed with CD or UC (aged 12–16 years). When omitting samples derived from children younger than 12 years, none of the resulting gene modules correlated significantly with any of the disease outcome measures we tested (Supplementary Figure 3
B and C).
Figure 3
WGCNA of CD8+ T-cell transcriptomes derived from adult-onset (adult cohort 2) (A) and childhood-onset (B) CD. (Ai) Module–trait relationship heatmap displaying gene modules and their correlation with age, sex, and outcome class (severe vs mild). Numbers indicate degree of correlation between module and trait (top) and corrected P value evaluating statistical significance of association (in parentheses). (Aii) Overlap between genes forming the adult IBD prognostic expression signature derived from Figure 1A (gray) and top correlated modules (blue and turquoise). (Aiii) Hierarchical clustering and heatmap of genes forming the blue module. (Bi) Top 10 module–trait relationship heatmap of CD8+ T cells derived from children newly diagnosed with CD (n = 60). (Bii) Hierarchical clustering and heatmap based on genes forming the gene module associated with sex (dark red module). Samples of patients commenced on treatment with biologics at diagnosis were excluded from these analyses.
Supplementary Figure 3
Application of WGCNA to CD8+ T-cell transcriptional profiles derived from pediatric patients diagnosed with (A) UC (all ages, n = 39), (B) CD (age of disease onset 12–16 years, n = 37) and (C) UC (age of disease onset 12–16 years, n = 25). Displayed are the top 10 gene modules and their association with clinical phenotype (age at diagnosis, sex, number of treatment escalations, treatment escalation to biologics, surgery, and summary severity score). Statistically significant correlations were observed for sex (P < .05) in all cohorts and with age when including pediatric UC samples of all age groups (A). Numbers indicate degree of correlation between module and trait (top) and corrected P value evaluating statistical significance of association (in parentheses).
WGCNA of CD8+ T-cell transcriptomes derived from adult-onset (adult cohort 2) (A) and childhood-onset (B) CD. (Ai) Module–trait relationship heatmap displaying gene modules and their correlation with age, sex, and outcome class (severe vs mild). Numbers indicate degree of correlation between module and trait (top) and corrected P value evaluating statistical significance of association (in parentheses). (Aii) Overlap between genes forming the adult IBD prognostic expression signature derived from Figure 1A (gray) and top correlated modules (blue and turquoise). (Aiii) Hierarchical clustering and heatmap of genes forming the blue module. (Bi) Top 10 module–trait relationship heatmap of CD8+ T cells derived from children newly diagnosed with CD (n = 60). (Bii) Hierarchical clustering and heatmap based on genes forming the gene module associated with sex (dark red module). Samples of patients commenced on treatment with biologics at diagnosis were excluded from these analyses.Taken together, our results demonstrate a lack of correlation between CD8+ T-cell transcription and disease outcome parameters in children newly diagnosed with IBD.
Expression of Genes Associated With CD8+ T-Cell Exhaustion Show Limited Variation in Pediatric Patient Samples
T-cell exhaustion has been reported as an underlying mechanism contributing to the observed differences in adult onset IBD disease behavior. To investigate these findings in our pediatric patient cohort, we performed unsupervised clustering analyses based on genes associated with T-cell exhaustion as reported previously. Although expression of exhaustion-related genes clustered the original adult CD patient cohort–derived transcriptomes into distinct subgroups (adult cohort 2, Figure 4
A), we did not observe any significant clustering in CD8+ T-cell transcriptomes of children newly diagnosed with CD, UC, all IBD, or when combining all pediatric samples (n = 188, Figure 4
B and Supplementary Figure 4
A–C). Furthermore, no clear clustering based on the expression of T-cell exhaustion–related genes was observed in CD8+ T-cell transcriptomes of a second adult IBD cohort (adult cohort 1, Supplementary Figure 4
D).
Figure 4
Transcriptional variation of genes associated with T-cell exhaustion in CD8+ T cells. Genes forming a transcriptional T-cell exhaustion signature were selected from genome-wide transcriptomes and subjected to hierarchical clustering. Heatmap and hierarchical clustering trees are displayed for samples obtained from adults (adult cohort 2, A, n = 35) and children (B, n = 67) diagnosed with CD.
Supplementary Figure 4
Transcriptional variation of genes associated with T-cell exhaustion in CD8+ T cells. Genes forming a transcriptional T-cell exhaustion signature (Wherry et al) were selected from genome-wide transcriptomes and subjected to hierarchical clustering. Heatmap and hierarchical clustering trees are displayed for samples obtained from (A) children newly diagnosed with UC (n = 40), (B) all pediatric patient derived CD8+ T-cell transcriptomes (n = 188, including controls and longitudinal follow-up samples) and (C) adult IBD patients (adult cohort 1, n = 38). No significant clusters were identified and patients with mild (ie, 0–1 treatment escalations in gray), or moderate to severe (ie, 2 or more treatment escalations in black) outcome are distributed equally in pediatric patients. The y-axis indicates expression of genes according to reference signature (red = up-regulated, blue = down-regulated).
Transcriptional variation of genes associated with T-cell exhaustion in CD8+ T cells. Genes forming a transcriptional T-cell exhaustion signature were selected from genome-wide transcriptomes and subjected to hierarchical clustering. Heatmap and hierarchical clustering trees are displayed for samples obtained from adults (adult cohort 2, A, n = 35) and children (B, n = 67) diagnosed with CD.Together these findings suggest that T-cell exhaustion associated gene transcription in IBD patient–derived CD8+ T–cells does not vary sufficiently to detect a reproducible clustering of patient samples in the cohorts we tested.
CD8+ T-Cell DNA Methylation Shows Association With Age But Not With Disease Outcome in Children Newly Diagnosed With Crohn’s Disease
DNA methylation has been shown to regulate key aspects of CD8+ T-cell function, including cellular differentiation and exhaustion. Moreover, epigenetically mediated alterations in T-cell function have been implicated in disease pathogenesis of several immune-mediated conditions.
,
,
We performed genome-wide DNA methylation profiling of CD8+ T cells derived from children newly diagnosed with CD (n = 66). Variation observed among these samples showed significant associations with disease distribution, sex, age, and inflammatory markers (Figure 5
A). Similar to CD8+ T-cell transcriptomes, we did not detect any significant correlation between DNA methylation and clinical outcome using various analytical approaches, including differential methylation and variance decomposition analyses (Figure 5
B and data not shown). In order to investigate potential age-related changes in CD8+ T-cell DNA methylation, we included publicly available genome wide methylomes obtained from 2 adult cohorts,
,
one of which included samples obtained from adult IBD patients (adult cohort 1). Firstly, we applied an “epigenetic clock” algorithm to these datasets and confirmed a significant correlation between the predicted age based on DNA methylation and chronological age (P < .005; Figure 5
C). As DNA methylation has been used as a highly accurate measure to predict cellular composition, we next estimated the proportion of naïve and memory CD8+ T cells in our dataset as well as the published adult datasets. As shown in Figure 5
D, in both adult datasets we observed a significant inverse correlation for the estimated proportion of naïve CD8+ T cells with age, as well as an age-dependent increase in the proportion of memory CD8+ T cells in adult cohort 3 (P < .005).
Figure 5
Genome-wide DNA methylation profiles derived from CD8+ T cells. (A) Variation observed in CD8+ T-cell DNA methylomes obtained from children newly diagnosed with CD (n = 66). Degree of variance in the first 10 PCs (top panel) and association with phenotype/serum parameters are indicated (heatmap, bottom panel). P values were generated with a Spearman correlation for continuous variables or an analysis of variance for categorical. (B) PC analysis plot of childhood CD patient–derived CD8+ T-cell DNA methylomes. Samples labeled according to disease outcome (mild, moderate, severe). (C) Epigenetic clock applied to CD8+ T-cell DNA methylomes derived from children newly diagnosed with CD, as well as 2 publicly available datasets (adult cohorts 1 and 3). Adult cohort 1 includes samples derived from adult patients diagnosed with IBD and healthy controls. (D) Estimated proportion of naïve (left) and memory (right) CD8+ T cells based on DNA methylomes demonstrating correlation with age in adult cohorts. Adult cohort 1 (n = 56), adult cohort 3 (n = 98).
Genome-wide DNA methylation profiles derived from CD8+ T cells. (A) Variation observed in CD8+ T-cell DNA methylomes obtained from children newly diagnosed with CD (n = 66). Degree of variance in the first 10 PCs (top panel) and association with phenotype/serum parameters are indicated (heatmap, bottom panel). P values were generated with a Spearman correlation for continuous variables or an analysis of variance for categorical. (B) PC analysis plot of childhood CD patient–derived CD8+ T-cell DNA methylomes. Samples labeled according to disease outcome (mild, moderate, severe). (C) Epigenetic clock applied to CD8+ T-cell DNA methylomes derived from children newly diagnosed with CD, as well as 2 publicly available datasets (adult cohorts 1 and 3). Adult cohort 1 includes samples derived from adult patients diagnosed with IBD and healthy controls. (D) Estimated proportion of naïve (left) and memory (right) CD8+ T cells based on DNA methylomes demonstrating correlation with age in adult cohorts. Adult cohort 1 (n = 56), adult cohort 3 (n = 98).Together, these results further support the lack of CD8+ T-cell–derived molecular profiles with disease outcome in children newly diagnosed with CD and highlight age as a likely factor impacting on CD8+ T-cell function and accounting for distinct differences between patient cohorts.
Discussion
Accurate disease prognostic biomarkers in IBD remain an essential requirement for effective personalized treatment by predicting clinical outcomes in individual patients. However, the search for parameters that reliably predict disease behavior at diagnosis remains challenging, with most studies identifying markers of short-term disease activity rather than longer-term clinical outcomes. An innovative and elegant approach of combining purification of individual leukocyte subsets with genome-wide transcriptional profiling led to the discovery of a prognostic expression signature in CD8+ T cells and, more recently, the development of a whole blood–based clinical biomarker in adult IBD patients. Interestingly, the reported prognostic expression signature was also found to be present in healthy adult control cohorts, strongly suggesting differences in CD8+ T-cell function are unrelated to chronic inflammation. Indeed, T-cell exhaustion was later identified to contribute to the observed differences. Specifically, these findings suggested that individuals displaying exhausted CD8+ T-cell expression phenotypes experience a milder immune-mediated disease course, while being more susceptible to infectious diseases.Here, we investigated CD8+ T-cell transcription in a large, prospectively recruited cohort of children, newly diagnosed with IBD and compared our dataset with 2 previously published adult IBD cohorts. Our results identified major differences between cohorts. Specifically, although we were able reproduce findings of the previously reported CD8+ T-cell transcriptional signature in the original patient cohort (adult cohort 2), we could not replicate these results in either our pediatric or a second adult IBD patient cohort (adult cohort 1)–derived genome-wide profiles. Moreover, we were unable to find any correlation between CD8+ T-cell gene transcription or DNA methylation and clinical outcomes in our pediatric patient cohort. This was despite including a wide range of clinically meaningful pediatric outcome measures, those used in previous adult studies (eg, number of treatment escalations), as well as a new summary outcome score, in which we combined key aspects of disease severity.The differences observed in CD8+ T-cell derived transcriptional profiles of 2 adult IBD patient cohorts in regard to the expression of genes forming the previously reported prognostic- or T-cell exhaustion signature are unexplained, as samples only formed distinct clusters in the original adult IBD cohort (ie, adult cohort 2). Furthermore, the clear clustering pattern of these signatures was also absent in CD8+ T-cell–derived transcriptomes of older children (ie, diagnosed at age 12–16 years) in whom disease pathogenesis and behavior is considered very similar to adult-onset IBD. Although the lack of clustering in 2 of 3 cohorts cannot completely exclude the presence of a subtler transcriptional signature, our results highlight the importance of validating molecular predictors of such clinical importance in independent cohorts. In this regard, the lack of consensus on defining disease outcome in IBD, as described recently by Dulai and colleagues, makes the comparability of such studies difficult, and further adds to the challenges of developing a reliable, clinically relevant biological predictor. Hence, development of a validated clinical outcome score in IBD patients of all ages should be considered essential for the validation of existing and future biomarkers.In contrast to disease outcome, our analyses identified significant and reproducible associations between CD8+ T-cell gene transcription with sex and age across all 3 sample cohorts. Although the former is unlikely to be of major relevance to IBD pathogenesis, age associated changes in T-cell function might contribute to phenotypic differences between childhood- and adult-onset IBD. These findings are in keeping with a large body of existing evidence suggesting a major effect of human aging on immunocompetence. Specifically, CD8+ T-cell populations have been reported to be particularly affected by aging, as their ability to generate primary CD8+ T-cell responses to newly encountered antigens decreases over time. This effect can be partly attributed to a decrease in the number of naïve CD8+ T cells with advancing age, as well as an increase in T-cell exhaustion.
,
Our data are in keeping with these reports, as we demonstrate a reduction in the estimated proportion of naïve T cells, as well as an increase in memory T cells with age. As expected, age-related DNA methylation changes were most apparent in those datasets from patient cohorts that covered a wide age range.
,
Beyond IBD, our findings are also relevant to age- and sex-related susceptibility to viral infections (eg, severe acute respiratory syndrome coronavirus 2).
,A limitation of our study was that we were unable to combine adult and pediatric datasets due to differences in array types, as well as the lack of matching, publicly available, clinical metadata. However, our study illustrates the power of comparing results obtained from the analyses of independent sample cohorts in order to validate biological and clinically relevant molecular signatures.In summary, our study reveals an association between CD8+ T-cell gene transcription and DNA methylation with age in 3 independent patient cohorts, suggesting a potential contribution of this mechanism towards age-associated differences in IBD phenotypes. In contrast, the presence of a previously reported prognostic expression signature was confirmed in only in 1 of 3 sample cohorts tested. In the absence of any significant association between CD8+ T-cell gene transcription and DNA methylation and clinical outcomes in childhood-onset IBD, our results do not currently support the use of these molecular profiles as prognostic biomarkers in children. Importantly, our results highlight the need to validate clinical biomarkers in independent patient cohorts using reliable, clinically relevant outcome measures across all age groups.
Authors: Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth Journal: Nucleic Acids Res Date: 2015-01-20 Impact factor: 16.971
Authors: E John Wherry; Sang-Jun Ha; Susan M Kaech; W Nicholas Haining; Surojit Sarkar; Vandana Kalia; Shruti Subramaniam; Joseph N Blattman; Daniel L Barber; Rafi Ahmed Journal: Immunity Date: 2007-10-18 Impact factor: 31.745
Authors: Nicholas T Funderburg; Samantha R Stubblefield Park; Hannah C Sung; Gareth Hardy; Brian Clagett; James Ignatz-Hoover; Clifford V Harding; Pingfu Fu; Jeffry A Katz; Michael M Lederman; Alan D Levine Journal: Immunology Date: 2013-09 Impact factor: 7.397
Authors: Basavaraj Kerur; Eric I Benchimol; Karoline Fiedler; Marisa Stahl; Jeffrey Hyams; Michael Stephens; Ying Lu; Marian Pfefferkorn; Raza Alkhouri; Jennifer Strople; Judith Kelsen; Leah Siebold; Alka Goyal; Joel R Rosh; Neal LeLeiko; Johan Van Limbergen; Anthony L Guerrerio; Ross Maltz; Lina Karam; Eileen Crowley; Anne Griffiths; Melvin B Heyman; Mark Deneau; Keith Benkov; Joshua Noe; Dedrick Mouton; Helen Pappa; Joseph A Galanko; Scott Snapper; Aleixo M Muise; Michael D Kappelman Journal: Inflamm Bowel Dis Date: 2021-02-16 Impact factor: 5.325
Authors: Daniele Biasci; James C Lee; Nurulamin M Noor; Diana R Pombal; Monica Hou; Nina Lewis; Tariq Ahmad; Ailsa Hart; Miles Parkes; Eoin F McKinney; Paul A Lyons; Kenneth G C Smith Journal: Gut Date: 2019-04-27 Impact factor: 23.059
Authors: Ruth Pidsley; Elena Zotenko; Timothy J Peters; Mitchell G Lawrence; Gail P Risbridger; Peter Molloy; Susan Van Djik; Beverly Muhlhausler; Clare Stirzaker; Susan J Clark Journal: Genome Biol Date: 2016-10-07 Impact factor: 13.583
Authors: Carmen Argmann; Minami Tokuyama; Ryan C Ungaro; Ruiqi Huang; Ruixue Hou; Sakteesh Gurunathan; Roman Kosoy; Antonio Di'Narzo; Wenhui Wang; Bojan Losic; Haritz Irizar; Lauren Peters; Aleksandar Stojmirovic; Gabrielle Wei; Phillip H Comella; Mark Curran; Carrie Brodmerkel; Joshua R Friedman; Ke Hao; Eric E Schadt; Jun Zhu; Judy Cho; Noam Harpaz; Marla C Dubinsky; Bruce E Sands; Andrew Kasarskis; Saurabh Mehandru; Jean-Frederic Colombel; Mayte Suárez-Fariñas Journal: Gastroenterology Date: 2021-09-02 Impact factor: 22.682
Authors: Johanne Brooks-Warburton; James Ashton; Anjan Dhar; Tony Tham; Patrick B Allen; Sami Hoque; Laurence B Lovat; Shaji Sebastian Journal: Frontline Gastroenterol Date: 2021-12-10