| Literature DB >> 34098982 |
Ilkyu Park1,2, Jaeeun Jung2, Sugi Lee1,2, Kunhyang Park3, Jea-Woon Ryu4, Mi-Young Son5, Hyun-Soo Cho6, Dae-Soo Kim7,8.
Abstract
BACKGROUND: Inflammatory bowel disease (IBD) is a chronic and idiopathic inflammatory disorder of the gastrointestinal tract and comprises ulcerative colitis (UC) and Crohn's disease (CD). Crohn's disease can affect any part of the gastrointestinal tract, but mainly the terminal ileum and colon. In the present study, we aimed to characterize terminal-ileal CD (ICD) and colonic CD (CCD) at the molecular level, which might enable a more optimized approach for the clinical care and scientific research of CD.Entities:
Keywords: Colonic CD; Crohn’s disease; Logistic regression; Paediatric patients; Terminal-ileal CD; Transcriptomic profile
Year: 2021 PMID: 34098982 PMCID: PMC8185924 DOI: 10.1186/s12967-021-02909-z
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1Principal component analysis (PCA) of newly diagnosed patients with paediatric CD. Unsupervised PCA analysis revealed differences according to diagnosis and gut segment; colon control (royal blue) vs. ileum control (spring green) and colon CD (navy blue) vs. ileum CD (dark green). CD Crohn’s disease
Fig. 2Overview of the study design. Schematic representation of this research. CD Crohn’s disease, UC ulcerative colitis, AUC area under curve
Fig. 3A Venn diagram illustrating differentially expressed genes in each tissue type. CCGs [n = 240] and ICGs [n = 310] were obtained in mucosal biopsies samples. Shaded area indicates CD common DEGs [n = 471] regardless of tissue type; CD Crohn’s disease
Fig. 4Samples from treatment-naïve paediatric patients with CD show molecular differences. Hierarchical clustering and heatmaps of RNA-Seq data of CCGs and ICGs with six different expression types. Samples are color-coded in the top bar according to the sample types and diagnosis [CD, control] and genes are color-coded in the left sidebar according to their expression types. a A heatmap of CCGs. b A heatmap of ICGs. CD Crohn’s disease
Summary of number of CCD and ICD-specific genes with different expression types
Depending on the tissue type of α and β, CCD and ICD-specific genes were separately classified: when α = colon tissue type and β = ileum tissue type, the result would be CCD genes; when α = ileum tissue type and β = colon tissue type, the result would be ICD genes. To get significantly differentially expressed genes, genes with p-value of 0.05 and confidence interval 95% were set to be cutoff for CDα vs CDβ, NMα vs NMβ, and CDα vs NMα. To specifically expressed, gene were also selected from CDβ vs. NMβ, (p-value > 0.1). For each pattern graph, red, blue, and black line represents percentile of 75th, 25th, and median, respectively
CD: Crohn’s disease; NM: Normal (control) samples; CCD: colonic CD; NMC; normal colon; NMI: normal ileum; ICD: terminal-ileal CD; ≒: no differences (p-value > 0.1)
Most significant KEGG pathways with altered expression in terms of disease
| Name of pathway | Count | Fold enrichment | FDR | |
|---|---|---|---|---|
| CCGs | ||||
| Oxidative phosphorylation | 22 | 3.18E−19 | 12.11 | 2.62E−14 |
| *Parkinson’s disease | 21 | 2.63E−17 | 10.82 | 1.81E−12 |
| Alzheimer’s disease | 22 | 4.60E−17 | 9.58 | 3.75E−12 |
| *Non-alcoholic fatty liver disease (NAFLD) | 21 | 1.12E−14 | 8.00 | 6.41E−10 |
| Huntington’s disease | 19 | 2.72E−14 | 9.21 | 1.05E−09 |
| Metabolic pathways | 41 | 7.24E−12 | 2.52 | 3.04E−06 |
| ICGs | ||||
| Vitamin digestion and absorption | 8 | 1.22E−7 | 18.53 | 1.52E−04 |
| Fat digestion and absorption | 41 | 3.57E−4 | 1.71 | 0.4450 |
| Protein digestion and absorption | 6 | 8.80E−4 | 7.84 | 1.0931 |
| Metabolic pathways | 8 | 1.63E−3 | 4.63 | 1.9248 |
| Chemical carcinogenesis | 7 | 4.53E−3 | 4.46 | 5.4888 |
Indicates the associations with CD were previously reported. Count: number of genes; p-value: modified Fisher Exact p-value, EASE Score, the smaller, the more enriched; Fold Enrichment: − log (p-value); CCGs: colonic CD genes; ICGs: terminal-ileal CD genes
Fig. 5Gene ontology (GO) enrichment analysis of CCGs and ICGs. The x-axes represent the rich factor, which is the ratio of differentially expressed targeted gene numbers in the process to all the annotated genes located in the process. The higher rich factor indicates the higher level of enrichment. The dot size represents the number of target genes in the GO. a Scatter chart displaying Gene ontology of CCD genes, b Scatter chart displaying Gene ontology of ICD genes. CCGs colonic CD genes, ICGs terminal-ileal CD genes
CD colon specific genes relates to mitochondria dysfunction (n = 20)
| Gene symbol | Expression type | |||
|---|---|---|---|---|
| CCD vs. NMC | CCD vs. ICD | NMC vs. NMI | ||
| COX5B | B | 1.17E−04 | 4.98E−04 | 5.45E−08 |
| COX6A1 | B | 1.78E−04 | 5.37E−05 | 3.46E−07 |
| COX8A | B | 1.96E−04 | 1.44E−06 | 1.56E−08 |
| CYC1 | B | 5.94E−06 | 4.76E−06 | 2.02E−10 |
| NDUFA11 | B | 9.13E−04 | 2.12E−04 | 1.61E−07 |
| NDUFA9 | B | 4.54E−04 | 6.79E−05 | 2.45E−06 |
| NDUFB10 | B | 2.25E−04 | 7.07E−03 | 2.17E−06 |
| NDUFB2 | B | 4.34E−04 | 2.80E−08 | 4.22E−08 |
| NDUFB7 | B | 1.78E−04 | 5.24E−05 | 6.12E−07 |
| NDUFB9 | B | 1.78E−04 | 5.75E−04 | 9.90E−07 |
| NDUFS2 | B | 3.56E−05 | 3.10E−03 | 1.85E−07 |
| NDUFS3 | B | 4.77E−04 | 7.60E−04 | 6.09E−06 |
| NDUFS7 | B | 8.24E−04 | 1.39E−03 | 2.21E−07 |
| NDUFS8 | B | 1.04E−03 | 3.11E−04 | 5.45E−07 |
| NDUFV1 | B | 3.49E−08 | 9.42E−07 | 3.41E−13 |
| UQCRC1 | B | 3.38E−04 | 8.44E−04 | 2.05E−06 |
| UQCRH | B | 1.19E−05 | 1.21E−03 | 7.42E−08 |
| UQCRHL | B | 2.30E−05 | 7.71E−07 | 8.57E−10 |
| UQCR10 | B | 5.06E−04 | 6.70E−03 | 1.53E−06 |
| UQCRFS1 | B | 3.33E−04 | 1.14E−03 | 1.29E−06 |
CCD colonic CD, NMC normal colon, NMI normal ileum, ICD terminal-ileal CD
Fig. 6Protein–protein interactions among selected colonic CD genes (CCGs) involved in mitochondrial dysfunction. a The STRING graph shows 20 genes of mitochondrial dysfunction with interactions. Edges represent protein–protein associations and line width shows the strength of interaction. b Expression pattern of differentially expressed genes of type B. All 20 genes were belong to expression type B. CCGs colonic CD genes
Fig. 7Final candidate genes that discriminate colonic CD from ileal CD and CD from UC. The boxplot of expression levels of ANP32E in the colon CD, Ileum CD, and UC samples of paediatric patients. Boxplots shows differences between three groups. Three group comparisons for each analysis are performed by Kruskal–Wallis test shown in p-values. CD Crohn’s disease, UC ulcerative colitis
Logistic regression analysis outcome from paediatric patients and adult patients
| Patients | Number of genes | Accuracy | AUC | PPV | NPV |
|---|---|---|---|---|---|
| Paediatric | 33 | 0.85 | 0.83 | 0.80 | 1 |
| Paediatric validation | 33 | 0.93 | 0.92 | 0.90 | 0.94 |
| Adult validation | 33 | 0.88 | 0.72 | 0.50 | 0.93 |
PPV = TP/TP + FP, NPV = TN/TN + FN
AUC area under the curve, PPV positive predictive value, NPV negative predictive value