| Literature DB >> 35058554 |
Florian Uellendahl-Werth1, Carlo Maj2, Oleg Borisov2, Simonas Juzenas1,3, Eike Matthias Wacker1, Isabella Friis Jørgensen4, Tim Alexander Steiert1, Saptarshi Bej5, Peter Krawitz2, Per Hoffmann6,7, Christoph Schramm8, Olaf Wolkenhauer5,9,10, Karina Banasik4, Søren Brunak4, Stefan Schreiber1,11, Tom Hemming Karlsen12, Franziska Degenhardt13, Markus Nöthen6,7, Andre Franke1, Trine Folseraas12, David Ellinghaus14,15.
Abstract
Genetic correlations and an increased incidence of psychiatric disorders in inflammatory-bowel disease have been reported, but shared molecular mechanisms are unknown. We performed cross-tissue and multiple-gene conditioned transcriptome-wide association studies for 23 tissues of the gut-brain-axis using genome-wide association studies data sets (total 180,592 patients) for Crohn's disease, ulcerative colitis, primary sclerosing cholangitis, schizophrenia, bipolar disorder, major depressive disorder and attention-deficit/hyperactivity disorder. We identified NR5A2, SATB2, and PPP3CA (encoding a target for calcineurin inhibitors in refractory ulcerative colitis) as shared susceptibility genes with transcriptome-wide significance both for Crohn's disease, ulcerative colitis and schizophrenia, largely explaining fine-mapped association signals at nearby genome-wide association study susceptibility loci. Analysis of bulk and single-cell RNA-sequencing data showed that PPP3CA expression was strongest in neurons and in enteroendocrine and Paneth-like cells of the ileum, colon, and rectum, indicating a possible link to the gut-brain-axis. PPP3CA together with three further suggestive loci can be linked to calcineurin-related signaling pathways such as NFAT activation or Wnt.Entities:
Mesh:
Year: 2022 PMID: 35058554 PMCID: PMC8776955 DOI: 10.1038/s42003-022-03031-6
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1Selection of 23 tissue and cell types to perform transcriptome-wide association analyses (TWAS) for gut-brain axis (GBA) diseases.
We selected 23 tissues and cell types previously found to contribute to the heritability of psychiatric or immune-mediated disease (Methods). Cross-tissue TWAS training and imputation models were used using transcriptome-wide and genome-wide reference data from 4043 reference samples from 23 tissues and cell types (Supplementary Data 2) provided by consortium projects GTEx, STARNET, and BLUEPRINT (Methods): gastrointestinal and liver tissues (n = 9), brain tissues (n = 10), whole blood (n = 1), and immune cell types (CD14+ monocytes, CD16+ neutrophils, CD4+ T-cells; n = 3). Different colors represent biological samples from different tissue classes. The size of the circles corresponds to the number of reference samples of the respective tissue.
Transcriptome-wide significant (Pconditional < 3.20 × 10−6) genes from multiple-gene-conditioned fine-mapping analysis at loci with multiple-gene-disease association signals, for psychiatric and immune phenotypes in 23 gut-brain-axis (GBA) tissues.
| CD | UC | PSC | SCZ | MDD | BD | ADHD | |
|---|---|---|---|---|---|---|---|
| TWAS discovery | 18,431/33,658 | 14,191/33,658 | 3.377/33,658 | 36,989/113,075 | 135,458/344,901 | 9412/127,760 | 20,183/35,191 |
| 215 [80] | 152 [50] | 150 [22] | 89 [53] | 9 [5] | 5 [5] | 10 [6] | |
| 242 (240) | 171 (163) | 136 (61) | 108 (85) | 9 (5) | 14 (14) | 11 (11) | |
| 164 (164) | 109 (109) | 51 (50) | 52 (49) | 2 (2) | 7 (7) | 3 (3) | |
| 78 (76) | 62 (54) | 85 (11) | 56 (36) | 7 (3) | 7 (7) | 8 (8) | |
| 132 (132) | 89 (88) | 81 (41) | 56 (42) | 4 (3) | 3 (3) | 5 (5) | |
| 99 (99) | 67 (67) | 57 (32) | 38 (28) | 3 (2) | 2 (2) | 4 (4) |
For the selection of 23 tissues, see Fig. 1, Supplementary Data 2 and Methods. 164 (78), 109 (62), 51 (85), 52 (56), 2 (7), 7 (7), 3 (8) TWAS susceptibility genes are within (outside) the boundaries of established GWAS susceptibility loci (Methods, Supplementary Data 7) for CD, UC, PSC, SCZ, MDD, BD, and ADHD, respectively. Approximately half of all gene-disease associations are found in at least two tissues of the GBA (for more details see Supplementary Data 6).
TWAS discovery ncases/ncontrols, number of cases and controls of GWAS data sets (Supplementary Data 1); n unique genes [loci] across 23 tissues, number of transcriptome-wide significant genes (PGBJ conditional < 3.20 × 10−6; Fig. 2a) from cross-tissue and multiple-gene-conditioned fine-mapping analysis at loci with multiple-gene-disease association signals (analysis GBJconditional; see Methods). The number of associated loci of size ±1 Mb in square brackets corresponds to the number of red dots in Fig. 2a. GBJconditional corrected for nearby genes in the range of ±1 Mb around for leading transcriptome-wide significant genes (red dots in Fig. 2a) across all 23 tissues. n unique genes in total from single tissues, number of transcriptome-wide significant genes (Pconditional < 3.20 × 10−6; Supplementary Data 3) from multiple-gene-conditioned single-tissue analysis (i.e., Single-tissueconditional, before applying the generalized Berk–Jones (GBJ) test to quantify total gene-disease associations across all 23 tissues).
CD Crohn’s disease, UC ulcerative colitis, PSC primary sclerosing cholangitis, SCZ schizophrenia, MDD major depressive disorder, BIP bipolar disease (BIP), ADHD attention-deficit/hyperactivity disorder.
*significant gene-disease associations in at least two of 23 tissues.
**significant gene-disease associations in at least two tissues including the brain and non-brain tissues (excluding cross-tissue associations for immune and gastro/liver tissues only, respectively, see Fig. 1); in curved brackets: number of gene-disease associations outside the extended major histocompatibility complex (MHC; chr6:25-34 Mb).
Fig. 2Manhattan plots showing transcriptome-wide significant (Pconditional < 3.20 × 10−6 from GBJconditional) gene discoveries for psychiatric and immune phenotypes in 23 gut-brain-axis (GBA) tissues, and proportion of shared predicted transcriptome-wide disease-associated gene expression (ρE) for pairs of immune-mediated and psychiatric diseases averaged across 23 tissues of the GBA.
a Gray dotted line, transcriptome-wide significance threshold; red dots, lead gene-disease associations at TWAS loci (for regions of size ±1 Mb) and conditioned on all genes within ±1 Mb regions. CD Crohn’s disease, UC ulcerative colitis, PSC primary sclerosing cholangitis, SCZ schizophrenia, MDD major depressive disorder, BIP bipolar disease, ADHD attention-deficit/hyperactivity disorder. b Spearman’s trait correlation (ρ) values at the transcriptome-wide level of gene expression for each disease pair and averaged across 23 tissues of the GBA show that effects of genetic risk variants shared between psychiatric and immune phenotypes are largely mediated by gene expression. Strongest positive correlations across psychiatric and immune phenotypes were observed for pairs UC and BP (E = 0.15), CD and BP (E = 0.14), UC and SCZ (E = 0.13), and CD and SCZ (E = 0.11), which is in line with genome-wide genetic correlations (G) for pairs UC and BP (G = 0.23), CD and BP (G = 0.22), UC and SCZ (G = 0.14), and CD and SCZ (G = 0.12) reported by Tylee et al.[11]. The dendrogram reflects the corresponding hierarchical single linkage clustering across diseases. Genetic trait correlations for single-tissue results (analysis Single-tissueconditional) are given in Supplementary Data 9, 10. For the selection of 23 tissues, see Fig. 1, Supplementary Data 2, and Methods. Results from unconditioned cross-tissue TWAS analysis (GBJmarginal; before multiple-gene-conditioned fine-mapping analysis) are shown in Supplementary Fig. 2.
Transcriptome-wide significant susceptibility genes (NR5A2, SATB2, PPP3CA) shared between schizophrenia and inflammatory bowel diseases expressed in the gut-brain axis (GBA) tissues.
| Gene | Gene-start-end (in kb) | ( | Primary tissue(s) of strongest gene-disease association | Diseases primary tissue(s) | Secondary tissue(s) of gene-disease association | Overlap with GWAS locus | Distance to GWAS locus | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SCZ | CD | UC | SCZ | CD | UC | ||||||||
| chr1:199,996 - 200,146 | (13.4, 18.2) | Hypothalamus | SCZ, CD, UC | SCZ:-; CD/UC: Colon transversum | 4.86 | 5.82 | 7.7 | 1.63 × 10−6 | 3.66 × 10−8 | 1.97 × 10−13 | No overlap | SCZ:104 kb; CD/UC:654 kb | |
| chr2:200,134 - 200,320 | (6.9, 24.8) | Frontal Cortex BA9 | SCZ, UC | SCZ: -; UC: Colon sigmoideum | 4.75 | -0.55 | 8.46 | 1.29 × 10−6 | 0.908 | 2.27 × 10−13 | No overlap | SCZ:67 kb; UC:425 kb | |
| chr4:101,947 - 102,267 | (5.0, 10.9) | Putamen basal ganglia | SCZ, CD | SCZ: - CD: Colon transversum | -4.87 | -6.79 | -2.96 | 1.57 × 10−6 | 1.28 × 10−10 | 5.54 × 10−3 | No overlap | SCZ:280 kb; CD:384 kb | |
Gene overlap analysis of TWAS results from multiple-gene-conditioned fine-mapping analysis (Single-tissueconditional; Supplementary Data 12, with detailed results in Supplementary Data 3) identified three common TWAS susceptibility genes (NR5A2, PPP3CA, SATB2) that meet the transcriptome-wide significance threshold (Pconditional < 3.20 × 10−6 for each disease separately; for diseases listed in the “Diseases primary tissue(s)” column). These three genes are shared between psychiatric and immune phenotypes in the same tissues and across the brain and non-brain tissues (Fig. 3). Increased predicted expression (indicated by a positive Z-score) of NR5A2 and SATB2 is associated with increased risk SCZ and CD as well as SCZ and UC. Decreased predicted expression (indicated by a negative Z-score) of PPP3CA is associated with increased risk for SCZ and CD. Genes inside the extended major histocompatibility complex (MHC; chr6:25-34 Mb) were excluded from gene overlap analysis.
Gene, susceptibility gene shared between psychiatric and immune disease(s); Gene-start-end, transcription start-end positions (including UTRs) in kilobases (genome build GRCh37/hg19); (hmed-min2, hmed-max2), estimated minimum/maximum gene expression heritability in percentage attributable to SNPs in the vicinity of each gene (Methods) and averaged across 23 different tissues (Fig. 1); Primary tissue(s), primary tissue(s) identified from multiple-gene-conditioned fine-mapping analysis (Single-tissueconditional, Methods), in which the transcriptome-wide significant gene-disease association signal occurred; Diseases primary tissue(s), diseases showing a transcriptome-wide association signal in the primary tissue(s) in Single-tissueconditional analysis; CD, Crohn’s disease; UC, ulcerative colitis; SCZ, schizophrenia; Secondary tissue(s), further tissues with suggestive evidence (Pconditional < 1 × 10−4) for gene-disease association; Z statistic, Z-score from analysis Single-tissueconditional and/or GBJconditional for primary tissue(s), the plus/minus sign indicates increased/decreased predicted expression of these genes to be associated with increased disease risk, TWAS Pconditional, P value from multiple-gene-conditioned fine-mapping analyses Single-tissueconditional and/or GBJconditional for primary tissue(s), P values’ significance threshold was 3.20 × 10−6; Overlap with GWAS locus, testing an overlap with locus boundaries of established GWAS loci from the latest fine-mapping GWAS studies for SCZ[31] and CD/UC[32,33]. Distance to GWAS locus, closest GWAS loci from literature for SCZ[31], CD, and UC[33] within ± 1 Mb region around coordinates from column “Gene-start-end” (in kb; genome build GRCh37/hg19): SCZ: chr1:200,250-200,422 (1q32.1) with candidate genes LINC00862, ZNF281. CD/UC: chr1:200,803-201,092 (1q32.1) with candidate genes CAMSAP2, RPL34P6, MRO3HP3, KIF21B, GPR25, CACNA1S, C1orf106, ASCL5. UC: chr2:199,447-199,709 (2q33.1). SCZ: chr2:200,387-200,633 (2q33.1) with candidate genes FTCDNL1, LOC101927641. SCZ: chr2:200,536-201,310 (2q33.1) with candidate genes C2orf47, C2orf69, FTCDNL1, SPATS2L, TYW5. SCZ: chr4:102,547-103,389 (4q24) with candidate genes BANK1, SLC39A8. CD: chr4:102,651-103,144 (4q24) with candidate gene BANK1.
Fig. 3The transcriptome-wide significant gene-disease association signals of NR5A2 (1q32.1), SATB2 (2q33.1), and PPP3CA (4q24) for SCZ, CD, and UC largely explain fine-mapped GWAS association signals from nearby GWAS susceptibility loci.
a The gene-disease association signal of NR5A2 is driven by the nearby but nonoverlapping established SCZ and CD/UC GWAS signals located 104 and 654 kilobases (kb) upstream of NR5A2, respectively (Table 2). Increased genetically regulated expression of NR5A2 in the hypothalamus and colon transversum was associated with increased risk for schizophrenia (SCZ), Crohn’s disease (CD), and ulcerative colitis (UC). b The association signal of SATB2 is driven by the nearby but nonoverlapping established SCZ and UC GWAS signals located 67 and 424 kilobases (kb) downstream and upstream of SATB2, respectively. Increased genetically regulated expression of SATB2 in frontal cortex BA9 and colon sigmoideum is associated with increased risk for SCZ and UC. Although 95% credible sets of variants most likely to be causal (red circles; upper part) differ between SCZ and UC, the distribution pattern of TWAS gene eQTLs of SATB2 (gray asterisks [upper part] and green transparent circles [middle part]) leads to the shared gene-disease association for SATB2 in both SCZ and UC. c The gene-disease association signal of PPP3CA is driven by the nearby (nonoverlapping) established SCZ and CD GWAS signals located 280 and 383 kilobases (kb) upstream of SATB2, respectively. Decreased genetically regulated expression of PPP3CA in putamen basal ganglia and colon transversum is associated with increased risk for SCZ and CD. Raw data of this figure can be found in Supplementary Data 16. Explanation of figure elements: Subplots include original consortium GWAS SNP summary statistics of size ±1 Mb (gray circles; upper part) around the gene, with the 95% credible sets of variants most likely to be causal at each locus (red circles; upper part) as defined by GWAS fine-mapping studies[31,33]. SNP summary stats were conditioned (black circles; upper part) on TWAS gene eQTLs (black asterisks [upper part] correspond to the green transparent circles [middle part], with the relative absolute weight of the eQTLs visualized by the size of green transparent circles) to examine if the original GWAS SNP statistics (gray circles [upper part]; gray asterisks correspond to the green transparent circles [middle part]) can be explained by genetically regulated expression of the gene (green rectangles depict unconditioned/conditioned TWAS P values from Single-tissueconditional analysis; lower part). GWAS SNP summary statistics of 95% credible sets of variants after joint GWAS/TWAS fine-mapping analysis are depicted as light green circles (upper part). Gene-start and gene end positions are marked by dumbbell-shaped black bars (middle part).
Fig. 4Analysis of developmental and single-cell RNA-seq data suggest a cell-specific pleiotropic role of NR5A2, SATB2, and PPP3CA in SCZ, CD, and UC and showed that PPP3CA expression was strongest in neurons and enteroendocrine and Paneth-like cells of the ileum, colon, and rectum, indicating a possible link to the GBA.
Visualizations of bulk expression of the candidate genes can be found in Supplementary Fig. 42. a Brain expression data from humans of different ages[38] confirm GTeX and BLUEPRINT tissue data and show that NR5A2 is almost absent in brain samples. SATB2 is strongly upregulated in the fetal forebrain in midfetal development. This suggests a developmental function that may predispose to disease when dysregulated. PPP3CA expression is strongest in the forebrain and increases with age. Expression in the liver is given for reference. PC, post conception. Data on expression in the developmental gut[39] show that NR5A2 is strongly expressed in the ileum and less so in the colon, whereas SATB2 is more abundant in the colon than in the ileum. This is consistent with the observation that SATB2 is significant for UC but not for CD (Table 2). PPP3CA expression is not evident from bulk RNA-Seq data of intestinal tissue. b NR5A2 is expressed in ileal progenitor, stem cells, and transient amplifying (TA) cells, less strongly in colon and rectum, and not in the brain, consistent with bulk RNA-seq results. SATB2 is strongly expressed in all cell types of the colon and rectum. PPP3CA, despite weak expression in the ileum or colon, is enriched in enteroendocrine and Paneth-like cells of the colon and rectum, supporting a role in immunomodulation. In the brain, PPP3CA is enriched in neurons. Overall, these expression data indicate that separate functions are likely in the gut and brain-based on specific expression patterns. NR5A2 expression was not found in the brain, but literature results make brain-specific function highly likely (Supplementary Note 3). Raw data of this figure can be found in Supplementary Data 17. (exPFC glutamatergic neurons from the PFC, exCA1/3 pyramidal neurons from the Hip CA region, GABA GABAergic interneurons, exDG granule neurons from the Hip dentate gyrus region, ASC astrocytes, NSC neuronal stem cells, MG microglia, ODC oligodendrocytes, OPC oligodendrocyte precursor cells, NSC neuronal stem cells, SMC smooth muscle cells, END endothelial cells).
Fig. 5Calcineurin-dependent NFAT signaling as shared signaling pathway for IBD and SCZ.
a 18 gene-disease associations with suggestive significance (PGBJ conditional < 0.05; i.e., after multiple-gene-conditioned fine-mapping analysis) are enriched in the “calcineurin-dependent NFAT signaling in leukocytes” pathway (comprising 55 genes; Methods), showing genetically altered gene expression for SCZ, CD, or UC compared to healthy controls. PPP3CA encodes the catalytic subunit calcineurin A; FKBP1A (also known as FKBP12) encodes a cis-trans prolyl isomerase that binds the immunosuppressant FK506 (tacrolimus). b The calcineurin inhibitor tacrolimus, used against various inflammatory diseases and as an immunosuppressant in organ-transplanted PSC-IBD patients, prevents NFAT signaling by binding to FKBP1A and causes inhibition of calcineurin. The reduced genetic expression of PPP3CA that we identified in intestinal and brain tissues causes an increased risk of SCZ and CD (Table 2). Raw data of this figure can be found in Supplementary Data 18.