Literature DB >> 35479407

Integrated analysis of microbe-host interactions in Crohn's disease reveals potential mechanisms of microbial proteins on host gene expression.

Padhmanand Sudhakar1, Tahila Andrighetti2, Sare Verstockt1, Clara Caenepeel1,3, Marc Ferrante1,3, João Sabino1,3, Bram Verstockt1,3, Severine Vermeire1,3.   

Abstract

Inflammatory responses of the intestinal epithelial barrier in patients with Crohn's disease (CD), a chronic inflammatory bowel disease (IBD), are associated with gut microbial alterations. At a community level, there is scarce mechanistic evidence on the effects of gut microbial alterations on host mucosal barrier responses. We used a computational microbe-host interaction prediction framework based on network diffusion and systems biology to integrate publicly available paired gut microbial and intestinal gene expression datasets. The ileal signaling network potentially modulated by the microbiota was enriched with immune-related pathways such as those associated with IL-4, IL-2, IL-13, NFkB, and toll-like receptors. We identified bacterial proteins eliciting post-translational modifications on host receptors, resulting in the de-repression of pro-inflammatory cytokines via critical hub proteins such as NFkB. The signaling networks were over-represented with CD associated genes and CD drug targets. Using datasets generated from our validation cohorts, we confirmed some of the results.
© 2022 The Authors.

Entities:  

Keywords:  Gastroenterology; Microbiome; Molecular genetics

Year:  2022        PMID: 35479407      PMCID: PMC9035720          DOI: 10.1016/j.isci.2022.103963

Source DB:  PubMed          Journal:  iScience        ISSN: 2589-0042


Introduction

Crohn’s disease (CD) is a sub-type of inflammatory bowel disease (IBD), characterized by intestinal microbial dysbiosis (DeCruz et al., 2015; Lloyd-Price et al., 2019; Pascal et al., 2017; Schaubeck et al., 2016). Although a complex set of factors including host genetics, lifestyle, diet, and exposure to antibiotics contribute to the development of CD (Ananthakrishnan et al., 2018), the microbiome-host axis is considered as a fulcrum influencing the manifestation of CD symptoms (Ananthakrishnan et al., 2017; Erickson et al., 2012; Gevers et al., 2014; Halfvarson et al., 2017; Jostins et al., 2012; Pérez-Brocal et al., 2015). This is reflected by alterations in the gut microbial compositions of IBD patients at various taxonomic levels, including enhanced abundances of particular species and epithelial barrier responses (Erickson et al., 2012; Lloyd-Price et al., 2019). Some of these responses result in hyperinflammation (Craven et al., 2012; Shaw et al., 2016) and degradation of the mucus layer which makes the epithelial barrier more vulnerable (Johansson et al., 2014; Roberton and Corfield, 1999; Wu et al., 2011, 2014; Yu, 2015; Yu et al., 2014) to microbial components. These observations are also supported by changes in the expression of genes and/or proteins involved in these processes in CD patients exhibiting microbial dysbiosis (Lloyd-Price et al., 2019). Although several lines of evidence point to the importance of microbiome-host interactions in CD, the underlying molecular mechanisms that mediate the cross-talk between the microbiome and the host are largely unknown. This could be due to (1) the lack of comprehensive datasets with coherent/parallel sampling and high resolution profiling of the gut microbiome and intestinal gene expression from the same samples/subjects; (2) practical difficulties associated with the retrieval of enough biomaterial from luminal samples/biopsies to be able to measure multiple read-outs; (3) the inability of in vitro systems such as organoid cultures, cell-lines, and microfluidic systems to truly reproduce in-vivo conditions and the ensuing microbiome-host interactions; and (4) high costs and practical limitations associated with high-throughput technologies to perform live profiling of microbiome-host interactions. Computational approaches provide an alternative option to investigate and explore the role of microbiome-host interactions (Guven-Maiorov et al., 2017). Various methods have been applied in the last decade to infer and predict the interactions between microbial and host proteins (Cui et al., 2016; Krishnadev and Srinivasan, 2011; Kshirsagar et al., 2012; Nourani et al., 2015) (Huang and Brumell, 2014; Schweppe et al., 2015; Sudhakar et al., 2019) (Bose et al., 2019). However, most of these tools are either confined solely to the upstream interactions between microbial and host proteins or document only experimentally verified protein-protein interactions corresponding to pathogens (Durmuş Tekir et al., 2013; Kumar and Nanduri, 2010; Li et al., 2015; Vialas and Gil, 2015). Other existing tools in the computational domain of microbiome-host interactions are limited to the analysis of metabolic networks (Agren et al., 2013; Arkin et al., 2018; Heirendt et al., 2019; Levy et al., 2015). Nevertheless, a common trait in most of the existing microbiome-host interaction analysis tools is that they do not consider the cumulative effects of inter-kingdom protein-protein interactions on downstream host processes. Recent tools such as MicrobioLink (Andrighetti et al., 2020), a computational pipeline investigating the functional effects of microbiome-host interactions by linking microbial-host protein interactions to the expression of host proteins using network propagation principles, enable the inference of inter-kingdom cross-talks. Motivated by the availability of synchronous high-resolution datasets profiling the gut microbial alterations and gene expression measurements in CD patients via the IBDMDB/HMP2 project (Lloyd-Price et al., 2019), we applied a modified workflow (Figure 1) of the MicrobioLink pipeline to the IBDMDB/HMP2 dataset to infer the cumulative effects of an altered CD gut microbiome on intestinal gene expression in CD patients. Topological and functional analysis of the resulting networks aided in the identification of the prominent hubs and signaling pathways mediating the effect of the altered microbiome on differentially expressed genes in the ileum and rectum of CD patients. Using an independent cohort composed of CD patients, we validated some of our findings. We also investigated the disease relevance and druggability of the hubs as well as the gene expression signatures modulated by the altered CD microbiome.
Figure 1

Graphical description of the workflow used to infer the plausible effect of the altered CD gut microbial proteins on host intestinal gene expression

Graphical description of the workflow used to infer the plausible effect of the altered CD gut microbial proteins on host intestinal gene expression

Results

Bacterial proteins altered in CD can potentially bind to human proteins

By combining information about enhanced abundances and transcriptional activity of bacterial species in CD as reported in the HMP2 IBDMDB dataset with other independent studies reporting the same trend, we identified eight bacterial species (Figure 2A, Table S1) relevant to CD. This approach based on multiple lines of evidence (enhanced abundance, transcriptional activity from one same study and additional confirmation of enhanced abundance from another study) imparts greater reliability in making inferences from noisy microbiome datasets (Figure 2A). The eight identified species include Adhesive invasive Escherichia coli (AIEC), Bacteroides fragilis, Clostridium clostridioforme, Clostridium hathewayi, Clostridium symbiosum, Klebsiella pneumoniae, Ruminococcus gnavus, and Veillonella parvula (Figure 2A, Table S1). By restricting the microbe-host protein-protein interaction prediction (see STAR Methods section for more details) to the homologs (in the eight CD relevant species) of enhanced microbial genes and transcripts (Table S2) identified in CD patients in the HMP2 IBDMDB cohort, we inferred 739 orthologous groups with the potential ability to bind to human proteins (Figure 2B, Table S3). Limiting the microbe-host protein-protein interaction prediction to microbial genes and transcripts significantly enhanced in CD proteins further ensures the specificity of the microbial proteins with respect to their role in CD pathogenesis. The 739 orthologous groups thus identified make up about 3.7% of the total number of orthologous groups identified by clustering all the proteins from the eight bacterial species (Figure 2B). About 73% (539) of the 739 orthologous groups had protein members from either one or two species only which could indicate that a large proportion of protein-clusters are specific to particular species. In contrast, only ten orthologous groups (1.3%) contained proteins from all the eight species, suggesting that generic effects at the level of microbe-host PPIs are minimal compared to species specific effects (Figure 2B). Functional analysis (hypergeometric test; FDR ≤ 0.05) of the 739 orthologous groups with potential binding capabilities identified several metabolic functions in addition to processes involved in post-translational modifications.
Figure 2

Overview of the stepwise results leading to the inference of the site-specific signaling networks

(A) Schematic displaying the procedure based on multiple evidences to identify bacterial species relevant to (based on enhanced abundance and transcriptional activity) CD.

(B) Figure displaying the relationship between the fraction of orthologous groups of proteins (from the 8 CD relevant species) which are capable of binding to human proteins and the number of species with protein membership in the orthologous groups.

(C) Hairball representation of the signaling network capturing the potential effect of the microbial proteins on ileal gene expression in CD patients.

(D) In-degree distribution of the ileal signaling network.

(E and F) The segregation of the ileal signaling network into sequential layers based on their position in the signaling chains mediating the effect of the bacterial proteins on ileal gene expression. The size of the nodes in E denotes the clustering co-efficient, whereas in F it stands for the betweenness centrality. The top three ranked nodes according to the clustering co-efficient are indicated in E while the top three transcription factors (TFs) ranked by the betweenness centrality are shown in (F).

Overview of the stepwise results leading to the inference of the site-specific signaling networks (A) Schematic displaying the procedure based on multiple evidences to identify bacterial species relevant to (based on enhanced abundance and transcriptional activity) CD. (B) Figure displaying the relationship between the fraction of orthologous groups of proteins (from the 8 CD relevant species) which are capable of binding to human proteins and the number of species with protein membership in the orthologous groups. (C) Hairball representation of the signaling network capturing the potential effect of the microbial proteins on ileal gene expression in CD patients. (D) In-degree distribution of the ileal signaling network. (E and F) The segregation of the ileal signaling network into sequential layers based on their position in the signaling chains mediating the effect of the bacterial proteins on ileal gene expression. The size of the nodes in E denotes the clustering co-efficient, whereas in F it stands for the betweenness centrality. The top three ranked nodes according to the clustering co-efficient are indicated in E while the top three transcription factors (TFs) ranked by the betweenness centrality are shown in (F).

Network diffusion highlights the potential modulation of host gene expression in CD by bacterial proteins

Even though we identified bacterial proteins capable of binding to human proteins, it does not necessarily imply that these interactions have an influence on disease development or progression. To fill this gap, we harnessed the power of network diffusion to investigate if and how the inferred bacterial-host PPIs can have downstream effects as measured by gene expression from the host. Using this approach, our analysis revealed some of the potential mechanisms and key regulators that could mediate the effect on host gene expression of the altered bacterial proteins in CD. The signaling network (Figure 2C), resembling a scale-free network of 901 nodes and 1600 interactions (Figure 2D), captures the potential effect of the altered bacterial proteins on ileal gene expression in CD patients. The prominent components as inferred by network topological metrics such as betweenness centrality (the extent to which a node is a bridge between two other nodes) and clustering co-efficient (the extent to which the neighbors of a node are connected to each other) (Figure 2F, Table S4) point to proteins known to play important functional roles in intestinal inflammation, immune interactions/signaling and often with expression/activity modulations in CD patients (Tables 1 and S4). Top ranked nodes by betweenness centrality include critical transcription factors STAT3 (UniProt: P40763), TP53 (UniProt: P04637), ESR1 (UniProt: P03372), STAT1 (UniProt: P42224), SPI1 (UniProt: P17947), SMAD3 (UniProt: P84022), and MYC (UniProt: P01106) (Table 1) in addition to signaling proteins represented by SOCS3 (UniProt: O14543), SRC (UniProt: P12931), NCOA1 (UniProt: Q15788), CTNNB1 (UniProt: P35222), and PRKACA (UniProt: P17612) among others (Tables 1 and 4).
Table 1

The functions (in intestinal inflammation and IBD/CD) of the top 10 proteins (as inferred by the metric of betweenness centrality) in the ileal signaling network

NodeProtein nameUniProt biological process keywordsRole in intestinal inflammationKnown role in CD/IBD
STAT3Signal transducer and activator of transcription 3Host-virus interaction, Transcription, Transcription regulationSTAT3 phosphorylation protects against intestinal inflammation(Li et al., 2017)a, STAT3 modulates intestinal inflammatory response(Kim et al., 2016)a, Inactivation of IL-6/STAT3 cascade contributes to the attenuation of chronic intestinal inflammation(Atreya et al., 2000)STAT3 Activation confined to actively inflamed colons in CD patients(Musso et al., 2005), increased total STAT3 in CD(Mudter et al., 2005), increased pSTAT3 correlated with histological degree of inflammation(Mudter et al., 2005)
TP53Cellular tumor antigen p53Apoptosis, Biological rhythms, Cell cycle, Host-virus interaction, Necrosis, Transcription, Transcription regulationPromotes intestinal inflammation(Cooks et al., 2013; Spehlmann et al., 2013)aMost frequent mutation in IBD-associated dysplastic lesions and in cancers(Alpert et al., 2019; Wanders et al., 2020), Nested or diffuse TP53 in noncancer IBD(Laurent et al., 2011)
ESR1Estrogen receptorTranscription, Transcription regulationPositive regulator of intestinal inflammation(Goodman et al., 2020)aModulates mucosal inflammation in IBD(Cook et al., 2014)a
SOCS3Suppressor of cytokine signaling 3Growth regulation, Ubl conjugation pathwayPositive regulator of intestinal inflammation(Koay et al., 2014; Suzuki et al., 2001)Modulator of intestinal inflammation in CD (25,997,679), Elevated in CD subgroups with active inflammation at both ileum and colon(Savić Mlakar et al., 2018)
SRCProto-oncogene tyrosine-protein kinase SrcCell adhesion, Cell cycle, Host-virus interaction, ImmunityInvolved in linking inflammation to intestinal epithelial regeneration(Taniguchi et al., 2015)a
STAT1Signal transducer and activator of transcription 1-alpha/betaAntiviral defense, Host-virus interaction, Transcription, Transcription regulationTumor suppressor molecule in inflammation-associated carcinogenesis(Leon-Cabrera et al., 2018)Enhanced STAT1 expression in lamina-propria T-cells and colonic biopsies of CD patients(Taniguchi et al., 2015; Wu et al., 2007)
SPI1Transcription factor PU.1Transcription, Transcription regulationModulates chronic intestinal inflammation(Kekuda et al., 2008)a and shigellosis(Ta et al., 2017)aUpregulated in patients with Crohn’|'s ileitis(Fagnart et al., 1985)
SMAD3Mothers against decapentaplegic homolog 3Host-virus interaction, Transcription, Transcription regulationSMAD3 phosphorylation reduced in CD(Monteleone et al., 2001)a
MYCMyc proto-oncogene proteinTranscription, Transcription regulationMYC dependent deregulation of Wnt signaling promotes carcinogenesis(Shah et al., 2015)High frequency of MYC amplification in IBD associated intestinal adenocarcinomas(Hartman et al., 2018), Reduced expression in IECs of IBD patients(Alexander et al., 1996)
NCOA1Nuclear receptor coactivator 1Transcription, Transcription regulation

Evidence from mice experiments.

The functions (in intestinal inflammation and IBD/CD) of the top 10 proteins (as inferred by the metric of betweenness centrality) in the ileal signaling network Evidence from mice experiments. The human signaling network (i.e., excluding the bacterial proteins) capturing the effect of altered bacterial proteins on ileal gene expression in CD patients was over-represented with Reactome pathways such as signaling mediated by interleukins (IL-4 (UniProt: P24394), IL-2 (UniProt: P60568), and IL-13 (UniProt: P35225)), receptor tyrosine-kinases, multiple toll-like receptors including TLR4 (UniProt: O00206), MyD88 (UniProt: Q99836), NFkB (UniProt: P19838) (TRAF6 mediated induction of NFkB upon TLR7/8 or 9 activation), in addition to FCERI mediated MAPK activation (Figures 3A, 3B and Table S5). Meanwhile, functional over-representation analysis confined to genes differentially expressed (all upregulated - Figure 3C) in the ileum concurrently point to interleukin (IL-4,10,13) signaling and immune system functions among other generic functions such as gene expression and transcriptional control (Figures 3C and S1). Using an independent cohort of CD patients, we also validated the microbiome-induced gene expression in the ileum of CD patients in our test cohort. We observed that 78% (92/117) of the DEGs in our test cohort were also differentially expressed in the validation cohort (R2 = 0.907) (Figure 3D, Table S6).
Figure 3

Functional analysis of the microbiota-modulated ileal signaling network

(A) Top 10 Reactome pathways enriched (adj.p-value ≤ 0.1) in the ileal signaling network (B) and overlap between them.

(C) Expression profiles of the genes belonging to the top 10 Reactome pathways enriched (adj.p-value ≤ 0.1) among the differentially expressed genes in the ileal signaling network.

(D) Overlap between the ileal DE-Gs potentially modulated by the altered microbiota in CD patients and DE-Gs measured in an independent cohort of CD patients.

(E) Modules (tightly connected networks) detected by MCODE within the ileal signaling network.

(F) Over-represented Reactome signaling pathways (adj.p-value ≤ 0.1) in the network modules inferred from the ileal signaling network.

(G) Bacterial proteins with post-translational modification functions and inferred to influence the differential expression of genes in the ileum. EC: Adhesive invasive E. coli (AIEC); BF: Bacteroides fragilis; CC: Clostridium clostridioforme; CH: Clostridium hathewayi; CS: Clostridium symbiosum; KP: Klebsiella pneumoniae; RG: Ruminococcus gnavus; VP: Veillonella parvula. ∗ - Number of bacterial species in which a protein member from the ortholog group was detected. Singleton proteins or ortholog groups with just membership from a single species have not been shown in this figure.

Functional analysis of the microbiota-modulated ileal signaling network (A) Top 10 Reactome pathways enriched (adj.p-value ≤ 0.1) in the ileal signaling network (B) and overlap between them. (C) Expression profiles of the genes belonging to the top 10 Reactome pathways enriched (adj.p-value ≤ 0.1) among the differentially expressed genes in the ileal signaling network. (D) Overlap between the ileal DE-Gs potentially modulated by the altered microbiota in CD patients and DE-Gs measured in an independent cohort of CD patients. (E) Modules (tightly connected networks) detected by MCODE within the ileal signaling network. (F) Over-represented Reactome signaling pathways (adj.p-value ≤ 0.1) in the network modules inferred from the ileal signaling network. (G) Bacterial proteins with post-translational modification functions and inferred to influence the differential expression of genes in the ileum. EC: Adhesive invasive E. coli (AIEC); BF: Bacteroides fragilis; CC: Clostridium clostridioforme; CH: Clostridium hathewayi; CS: Clostridium symbiosum; KP: Klebsiella pneumoniae; RG: Ruminococcus gnavus; VP: Veillonella parvula. ∗ - Number of bacterial species in which a protein member from the ortholog group was detected. Singleton proteins or ortholog groups with just membership from a single species have not been shown in this figure. The ileal signaling network with both bacterial and host proteins could be broken down into three distinct modules based on the overall connectivity with other proteins in the network (Figure 3E) with common and specific functions (Figure 3F, Table S7).While modules 1 and 2 shared several over-represented pathways such as TRIF-mediated TLR4 signaling, IL-4/IL-13 signaling among many others, several module specific pathways could also be identified. For example, module 1 was specific to the enrichment of interferon and PI3K/AKT signaling while FCERI (UniProt: P12319) and TGF-beta receptor signaling were confined to module 2. Module 3 was, however, peculiarly distinct in terms of its composition by harboring genes corresponding to exclusively enriched pathways such as response to infection by parasites, FCGR-3A mediated phagocytosis, cell-cell communication, BRAF/RAF1 UniProt: P15056) signaling, ERBB2 (UniProt: P04626) signaling among others (Figure 3F, Table S7). As potential drivers of the ileal gene expression changes described above, several bacterial proteins (which could be clustered into groups based on sequence homology) could be identified to have an influence as shown in Figure 3G. These include proteins with acetyltransferase, chaperone, peptidase, peptidyl-prolyl cis-trans isomerase and proteolytic functions among others. For example, the orthologous group OMA01692, representative of a peptidase S8 domain containing group of proteins present in 4 (C. clostridioforme, C. hathewayi, C. symbiosum, R. gnavus) of the 8 CD relevant bacterial species, potentially influences the expression of 43% of the differentially expressed genes in the ileum of CD patients.

Regulatory control which could be exerted by bacterial influences can potentially induce site-specific inflammatory responses at the level of gene expression in CD

We also observed that the bacterial species identified as being relevant in CD tend to modulate ileal gene expression through prominent TFs such as SPI1, STAT1 and NFKB1 (UniProt: P19838) as identified by the network topology analysis (Figure 4A, Table S8). Based on our results, we inferred that the activity of these TFs could be potentially influenced by multiple species with most of the TFs among the top 20 betweenness centrality ranked nodes capable of being modulated by at least seven of the eight CD relevant species. SPI1, STAT1 and NFKB1 individually regulate the expression of 23%, 20 and 17% of the DE-Gs and cumulatively around 52% of the non-redundant DE-Gs in the ileal signaling network (Figure S3). More than two-thirds (71%) of the DE-Gs modulated by the top TFs are also involved in immune responses and inflammation. NFKB1, for example, is indirectly modulated upstream by the S8 domain containing peptidase cleaving MAP3K7 (UniProt: O43318) by binding to the PCSK cleavage motif CLV_PCSK_PC1ET2_1 (Figures 4B and 4C). Inactivation of MAP3K7 leads to the activation of NFKB1 which subsequently results in the upregulation of several genes with functions related to inflammation (IL6 (UniProt: P05231), IFNG (UniProt: P01579), etc.), recruitment and adhesion of immune cells (CCL2 (UniProt: P13500), CXCL10 (UniProt: P02778), etc.), remodeling of the extracellular matrix (MMP13 (UniProt: P45452), COL3A1 (UniProt: P02461), etc.) and enzymes involved in metabolizing inflammatory pathway precursors (PTGS2). This is one of several examples wherein the influence of bacterial proteins results in the upregulation of genes involved in pathogenesis of CD. With the exception of four genes (CCL2, COL1A1 (UniProt: P02452), COL3A1, MMP2 (UniProt: P08253)), all the other genes under the transcriptional control of three TFs including NFKB1 (which mediate the modulation by the S8 domain containing peptidase) are differentially expressed in the validation cohort (R2 = 0.803) (Figure 4D). When all the ileal DE-Gs under the control of the top TFs among the top 20 betweenness centrality ranked nodes were considered together, we observed a consistent trend between the test and validation cohorts (R2 = 0.908).
Figure 4

Examples of regulatory and mechanistic interactions mediated by microbial proteins followed by the comparison of the test and validation cohorts

(A) Heatmap representation of the transcriptional factors (among the top 20 nodes in the ileal signaling network ranked by the betweenness centrality) regulating the DE-Gs in the ileal signaling network. The number of targets for each of the TFs and the involvement of the DE-Gs in immune responses are depicted. The FC TC and FC VC columns stand for the expression level (in log2 scale) of the DE-Gs from the test and validation cohorts respectively.

(B and C) Inferred molecular mode of action of the subtilisin-domain containing type S8 peptidase (encoded by four different species with enhanced abundances in CD) on ileal gene expression in CD patients. The potential cleavage of MAP3K7 by the peptidase in turn enhances the activity of transcription factors (SMAD3, NFKB1, NFKBIA) which subsequently results in the upregulation of genes which are involved in inflammation, recruitment and adhesion of immune cells, remodeling of extracellular matrix and encoding enzymes involved in the production of inflammatory precursors.

(D) Comparison between the test (TC) and validation cohorts (VC) of the log2 normalized expression fold changes (1.5 ≤ log2FC ≤−1.5, FDR ≤ 0.05) of the transcriptional regulatory targets of SMAD3, NFKB1, NFKBIAwhich are modulated by the subtilisin-domain containing type S8 peptidase.

(E) Heatmap representation of the transcriptional factors (among the top 20 nodes in the ileal and rectal signaling networks ranked by the betweenness centrality) regulating the DE-Gs in the ileal and rectal signaling networks. TFs with less than 10 targets each among the ileal and rectal DE-Gs are not shown. The number of targets specific to ileum/rectum and common to both ileum and rectum for each of the TFs is depicted. The FC I TC and FC I VC columns stand for the expression level (in log2 scale) of the ileal DE-Gs from the test and validation cohorts respectively. FC R TC stands for the expression level of the DE-Gs in the rectal signaling network.

(F) Summary of the over-represented gene ontological biological processes and Reactome pathways in the microbiome-modulated ileal and rectal signaling network.

Examples of regulatory and mechanistic interactions mediated by microbial proteins followed by the comparison of the test and validation cohorts (A) Heatmap representation of the transcriptional factors (among the top 20 nodes in the ileal signaling network ranked by the betweenness centrality) regulating the DE-Gs in the ileal signaling network. The number of targets for each of the TFs and the involvement of the DE-Gs in immune responses are depicted. The FC TC and FC VC columns stand for the expression level (in log2 scale) of the DE-Gs from the test and validation cohorts respectively. (B and C) Inferred molecular mode of action of the subtilisin-domain containing type S8 peptidase (encoded by four different species with enhanced abundances in CD) on ileal gene expression in CD patients. The potential cleavage of MAP3K7 by the peptidase in turn enhances the activity of transcription factors (SMAD3, NFKB1, NFKBIA) which subsequently results in the upregulation of genes which are involved in inflammation, recruitment and adhesion of immune cells, remodeling of extracellular matrix and encoding enzymes involved in the production of inflammatory precursors. (D) Comparison between the test (TC) and validation cohorts (VC) of the log2 normalized expression fold changes (1.5 ≤ log2FC ≤−1.5, FDR ≤ 0.05) of the transcriptional regulatory targets of SMAD3, NFKB1, NFKBIAwhich are modulated by the subtilisin-domain containing type S8 peptidase. (E) Heatmap representation of the transcriptional factors (among the top 20 nodes in the ileal and rectal signaling networks ranked by the betweenness centrality) regulating the DE-Gs in the ileal and rectal signaling networks. TFs with less than 10 targets each among the ileal and rectal DE-Gs are not shown. The number of targets specific to ileum/rectum and common to both ileum and rectum for each of the TFs is depicted. The FC I TC and FC I VC columns stand for the expression level (in log2 scale) of the ileal DE-Gs from the test and validation cohorts respectively. FC R TC stands for the expression level of the DE-Gs in the rectal signaling network. (F) Summary of the over-represented gene ontological biological processes and Reactome pathways in the microbiome-modulated ileal and rectal signaling network. We also compared the regulatory footprints across the ileal and rectal signaling networks (Figures S4A and S4B) and found several commonalities and differences (Figure 4E, Table S8) at the level of regulation of DE-Gs. Of the 87 DE-Gs regulated by a TF from among the set of top 20 nodes (based on betweenness centrality) of the ileal and rectal signaling networks, 54% (47/87) were common to both the ileum and rectum. In other words, ileum and rectum not only harbor common gene signatures in CD but could also be controlled by a set of shared regulators that could be driving the synchronous gene expression. With an analysis of regulators with at least 10 target DE-Gs in either ileum or rectum, we observed different levels of specificity (ranging from 35 to 61%) not only quantitatively but also qualitatively. For example, the DEG targets (COL1A1, MMP13, MMP2, PLAUR (UniProt: Q03405), CCL2, SOD2 (UniProt: P04179), TFPI2 (UniProt: P48307), TIMP1 (UniProt: P01033)) of NFKB1 exclusive to ileum are associated with extracellular matrix remodeling and organization. Conversely, the targets of NFKB1 common to both ileum and rectum are predominantly associated with immune and inflammatory functions. Furthermore, we also identified pathways and biological processes among the genes in the potential microbiota-mediated signaling networks in a site-specific manner (Figure 4F, Table S5).While processes associated with anti-bacterial defense, cellular fate and cellular adhesion among others were over-represented in the ileal signaling network, the rectal network was characterized by the enrichment of metabolic processes. At the level of Reactome signaling pathways, ileum-specific ones include those related to collagen formation, assembly and degradation as well as platelet adhesion to exposed collagen structures.

Disease relevance and druggability of the potential microbiota-modulated signaling networks

To reveal the relevance of the inferred signaling networks modulated by the bacterial proteins, we checked if the proteins in the networks are associated with CD as reported in the OpenTargets database (Carvalho-Silva et al., 2019). About 22.76% (125/549; enrichment p-value: 1.38E-15) and 23.42% (115/491; enrichment p-value: 2.24E-15) of the host proteins in the ileal and rectal signaling networks respectively were annotated as being associated with CD (Table S9) based on at least two sources of evidence (see Table S12 for the full filtered list of CD associated OpenTargets proteins). Among the OpenTargets annotated CD proteins in the ileal and rectal networks, 46/125 (37%) and 40/115 (35%) respectively are derived from evidence sources with a genetic basis although information on host genotypes was not integrated in the current study. In addition, by using drug target information from OpenTarget, some of the proteins in the ileal and rectal signaling networks are known targets of therapeutic molecules used to treat CD (Table 2). Cumulatively, 24 proteins from the ileal and rectal signaling networks are existing targets of CD drugs. These include prominent cytokines such as IL23B (specific to the rectal network), IL6 and CXCL10 and chemokine receptors (CSF2RA (UniProt: P15509), CSF3R (UniProt: Q99062)) (common to both networks). The therapeutically targeted proteins also cover other functional categories such as those associated with JAK-STAT signaling (JAK1 (UniProt: P23458), JAK2 (UniProt: Q60674), JAK3 (UniProt: P52333), TYK2 (UniProt: P29597)), matrix organization and remodelling (MMP1 (UniProt: P03956), MMP7 (UniProt: P09237), MMP13, ITGA4 (UniProt: P13612)), T-cell proliferation and activation (CD80 (UniProt: P33681)) and enzymes involved in the production of inflammatory precursors (PTGS2).
Table 2

Ileal and rectal signaling network proteins which are also targeted by drugs in CD

Protein symbolProtein name# active/completed clinical trialsDrug(s)
BMXCytoplasmic tyrosine-protein kinase BMX1PF-06651600
BTKTyrosine-protein kinase BTK1PF-06651600
CD80bT-lymphocyte activation antigen CD80
CSF2RAGranulocyte-macrophage colony-stimulating factor receptor subunit alpha2Sargramostim
CSF3RGranulocyte colony-stimulating factor receptor3Filgrastim
CXCL10C-X-C motif chemokine 101Eldelumab
ESR2Estrogen receptor beta1Prinaberel
GHRGrowth hormone receptor1Somatropin
IL2RBInterleukin-2 receptor subunit beta1Aldesleukin
IL6Interleukin-62PF-04236921
IL23AbInterleukin-23 subunit alpha12Brazikumab, Risankizumab, Guselkumab
ITGA4Integrin alpha-430Vedolizumab, Natalizumab, Abrilumab
JAK1Tyrosine-protein kinase JAK115Upadacitinib, Tofacitinib, Filgotinib
JAK2Tyrosine-protein kinase JAK29Upadacitinib, Tofacitinib
JAK3Tyrosine-protein kinase JAK310Upadacitinib, Tofacitinib, PF-06651600
MMP1Interstitial collagenase2Doxycycline
MMP7Matrilysin2Doxycycline
MMP13aCollagenase 32Doxycycline
NR3C1Glucocorticoid receptor13Budesonide, Methylprednisolone, Prednisolone, Prednisone
PPARGPeroxisome proliferator-activated receptor gamma7Mesalamine, Pioglitazone
PTGS2Prostaglandin G/H synthase 27Mesalamine
RBX1E3 ubiquitin-protein ligase4Thalidomide, Lenalidomide
TYK2Non-receptor tyrosine-protein kinase9Upadacitinib, Tofacitinib
VDRVitamin D receptor10Ergocalciferol, Cholecalciferol, Calcitriol

Protein present only in ileal signaling network.

Protein present only in rectal signaling network.

Ileal and rectal signaling network proteins which are also targeted by drugs in CD Protein present only in ileal signaling network. Protein present only in rectal signaling network. In addition to the above demonstrated druggability of the potential microbiota modulated signaling network components (irrespective of status of differential gene expression), we also investigated if the DE-Gs themselves could be potentially targeted directly or indirectly by existing drugs. We inferred this by matching gene expression signatures of various drugs in the LINCS database to the gene expression signatures elicited in the ileum (Figures 5A and 5B, Table S10) and rectum (Figures S6A and S6B, Table S10) of CD patients potentially by the altered microbiota. Drugs which could induce a reversal of the expression patterns of the DE-Gs in the ileum or rectum include among others gefitinib, canertinib, dasatinib, trimpiramine, enalapril, and wortmannin. As an example, wortmannin influences the expression of genes involved in various phenotypes associated with inflammatory response, susceptibility to bacterial infection, wound healing and macrophage physiology/chemotaxis (Figure 5C).
Figure 5

Expression signature based matching of drugs to targets in the ileal signaling network

(A) Search scores representing the extent to which a drug can induce/reverse the gene expression signatures of the DE-Gs in the ileal signaling network. Only the top 15 perturbation/drugs are shown.

(B) The expression profiles of the ileal network DE-Gs in response to various perturbation/drugs. The horizontal red-bars along the rows indicate the extent to which the corresponding ileal DEG is differentially expressed in CD patients. The red-label bars along the columns indicate the search score which is representative of the extent to which the perturbation/drug elicited gene expression can induce/reverse the signature profile of the DEG. The red and blue squares of the matrix indicate the upregulation and downregulation of the DEG in response to the corresponding perturbation/drug.

(C) Enriched (adj P-value ≤ 0.05) ontology terms (from the Mammalian Phenotype) of the ileal network DE-Gs which are also overlapping with the Wortmannin elicited gene expression signature.

Expression signature based matching of drugs to targets in the ileal signaling network (A) Search scores representing the extent to which a drug can induce/reverse the gene expression signatures of the DE-Gs in the ileal signaling network. Only the top 15 perturbation/drugs are shown. (B) The expression profiles of the ileal network DE-Gs in response to various perturbation/drugs. The horizontal red-bars along the rows indicate the extent to which the corresponding ileal DEG is differentially expressed in CD patients. The red-label bars along the columns indicate the search score which is representative of the extent to which the perturbation/drug elicited gene expression can induce/reverse the signature profile of the DEG. The red and blue squares of the matrix indicate the upregulation and downregulation of the DEG in response to the corresponding perturbation/drug. (C) Enriched (adj P-value ≤ 0.05) ontology terms (from the Mammalian Phenotype) of the ileal network DE-Gs which are also overlapping with the Wortmannin elicited gene expression signature.

Discussion

Multiple studies have demonstrated that dysbiosis and alterations of the gut microbiome are associated with phenotypic responses including inflammation of the gut mucosal barrier in CD patients. However, at a meta-level, there is a gap in terms of the molecular mechanisms that mediate the effect of the altered gut microbiome on host responses. In this study, we used a computational pipeline (harnessing the power of microbe-host protein-protein interaction predictions, biological networks and network diffusion concepts) to integrate heterogeneous-omic datasets including whole genome shotgun sequencing of the fecal microbiome and transcriptomic read-outs from ileal and rectal biopsies of CD patients. We identified site-specific signaling networks for ileum and rectum, which may explain how the altered CD gut microbiome could modulate host gene expression. To start with, we identified eight bacterial species (AIEC, B. fragilis, C. clostridioforme, C. hathewayi, C. symbiosum, K. pneumoniae, R. gnavus and V. parvula) associated with the gut microbiome of CD patients compared to non-IBD controls. Although strain level resolution was not possible, species level resolution is expected to provide interesting insights, particularly given that (1) the inferences are based on three different evidence types and (2) only a subset of relevant genes/proteins with increased activities in CD patients are considered for the microbe-host interaction analysis. All of the eight species are either known pathogens or support the growth and survival of pathogens. They are mainly confined to the gut although non-gut niches have also been documented for some of them (Elsayed and Zhang, 2004; Knapp et al., 2017). Conversely, pathogens such as K. pneumoniaewhich are predominantly found in non-gut niches like the respiratory tract have also been identified as being relevant to CD. This is understandable since K. pneumoniae is known to be a pathogen which has mastered the art of evading host immune defense mechanisms in different immune cells as well as a wide range of organ systems (Bengoechea and Sa Pessoa, 2019). However, despite the fact that depletion of commensals could have an impact on intestinal homeostasis and IBD(Koboziev et al., 2014), since the goal of the study was to infer the functional effects of the microbiome on the host, we considered, for our global analysis, only microbial species and genes/transcripts with enhanced abundances and activities respectively. Next, we identified bacterial protein groups involved in modulating the activity of human proteins in CD. The small fraction (3.7%) of such protein groups as a proportion of all possible protein clusters could possibly be attributed to various underlying reasons such as (1) the prevalence of mechanisms along other-omic layers such as metabolism in mediating microbe-host interactions (Metwaly et al., 2020) (Franzosa et al., 2019) (2) the ability of singular bacterial proteins or a small number of them to render noticeable phenotypic changes on the host (Carroll and Maharshak, 2013; Cohavy et al., 2000; Dalwadi et al., 2001) and (3) the lack of structural annotation (especially the lack of domain annotations) given that the interaction prediction between the microbial and host proteins is heavily reliant on such annotations. Nevertheless, based on their functional annotations, we identified several relevant host-interacting bacterial proteins capable of inducing post-translational effects which play a role in the pathogenesis of many chronic diseases including IBD (Caruso et al., 2019; Gordon et al., 2020; Schweppe et al., 2015; Zhang et al., 2018, 2020a). The analytical pipeline uncovered the corresponding site-specific (ileum/rectum) signaling networks that captured the effects of the CD relevant bacterial proteins on differentially expressed host genes. Using an independent validation cohort of CD patients carrying the pathogenic B2 enterotype, we could verify our findings as exemplified by overall consensus (78%, R2 = 0.908) between the ileal DE-Gs in the test and validation cohorts. Ranking the proteins using topologically relevant parameters such as betweenness centrality (indicative of the number of shortest paths passing through the protein) (Kincaid and Phillips, 2011) resulted in the identification of central TFs and post-translational signaling proteins. The prevalence of TFs among the BC-ranked nodes is understandable given that upstream signals modulate TF activity which thus subsequently impacts the expression of their immediate targets downstream (Karin and Smeal, 1992). Thus, TFs act as signal amplifiers by modulating gene expression and are involved in complex regulatory circuits which are required for quick and adaptive responses in biological systems (Gao et al., 2018). Some of the top ranked TFs such as STAT3, STAT1, TP53, ESR1, STAT1, SPI1, SMAD3 and MYC are involved in modulating intestinal inflammation and/or known to be playing a role in CD/IBD. STAT3 and STAT1 for example are of interest due to the multiple mechanisms by which STATs impact IBD pathogenesis. STAT1 is differentially expressed in ileal biopsies of CD patients and its deficiency enhances predisposition to gut inflammation (Leon-Cabrera et al., 2018). Adding to the more complex role of STAT3 in CD is its transient/dynamic involvement independent of genetic aberrations (Ferguson et al., 2010; Willson et al., 2012).STAT3 is involved not only in mediating inflammatory responses to infections by pathogens (Backert et al., 2014) and generation of inflammatory Th17 cells (Durant et al., 2010; Liu et al., 2008) but also in modulating the return of the epithelium to steady-states following inflammatory insults (Goldsmith et al., 2011). Synonymous with our observation of STAT3 as a major regulatory hub is the identification as a signaling hub of SOCS3 - a partner in crime with STAT3 in mediating intestinal inflammation. SMAD3, another hub, is involved in the TGF-beta driven inflammatory responses in CD (Monteleone et al., 2001), mucosal immunity and T-cell activation (Yang et al., 1999). In addition to the hubs, many of over-represented signaling pathways in the microbiome-modulated ileal and rectal signaling networks were also identified in independent experimental studies. For instance, TLR3 (UniProt: O15455)/TLR4 associated signaling pathways which were over-represented in the ileal network, have been demonstrated to be active in CD patients (Cario and Podolsky, 2000). TLR4 is also over-expressed in ileal IECs of CD patients (Silva et al., 2008) and upregulated during intestinal inflammation (Hausmann et al., 2002). This is intriguing since TLR pathways, especially those associated with TLR4 are generally activated by bacterial lipopolysaccharides (LPS) in disease states and are also involved in mediating host response to microbial effects (van Bergenhenegouwen et al., 2016; Cerovic et al., 2009; Kløve et al., 2020). However, our results suggest novel modes of TLR activation mediated by bacterial proteins in the context of CD. Concomitant with observations from previous studies about the role of NFKB1 in mediating intestinal inflammatory responses in healthy states (Ramakrishnan et al., 2019) and CD (Han et al., 2017; Schreiber et al., 1998; Segain et al., 2000; Sorrentino, 2017), our results indicated that (1) NFKB1 was among the central regulators in the ileal signaling network and (2) the expression of many inflammatory cytokines was upregulated via NFKB1 by bacterial proteins including those with proteolytic functions as demonstrated by the S8 domain containing peptidase example. Although many stimuli such as butyrate are known to inhibit inflammatory responses via NFKB1 (Segain et al., 2000), the exact mechanisms have remained elusive, particularly in whether and how bacterial proteins can influence inflammatory responses via NFKB1. Under homeostatic conditions, NFKB1 under the influence of active upstream repressors such as MAP3K7 tends to have reduced activity and thereby results in the suppression of pro-inflammatory molecules. However, bacterial proteins such as the S8 domain containing peptidase present in four of the eight CD relevant species (C. clostridioforme, C. hathewayi, C. symbiosum, R. gnavus) potentially inactivates MAP3K7 by binding to the PCSK cleavage motif CLV_PCSK_PC1ET2_1 and thereby inactivating MAP3K7. This results in downstream effects culminating in the de-repression of TFs including NFKB1which is then able to activate the expression of pro-inflammatory cytokines such as IL6, IFNG, CCL2 among others. Interestingly, the S8 domain containing peptidase in absent in all but one proteome (Faecalibacterium sp. CAG:1138; protein uniprot id - R5FTW4) within Faecalibacterium sp and absent in all of the proteomes in Akkarmensia sp. Thus, bacterial proteins with enhanced abundances and activity in CD and mostly absent in commensals such as Faecalibacterium sp. and Akkarmensia sp which are depleted in CD, can “switch-off” mechanisms that under homeostatic conditions would have silenced inflammatory cytokines. That being said, the above described mechanism of action could only be one among many other microbiome-mediated mechanisms modulated by other bacterial proteins and several other players such as small RNAs and metabolites (Valter et al., 2020; Wlodarska et al., 2015). By comparing the ileal and rectal signaling networks mediating the possible microbial influences on ileal and rectal gene expression, we identified site specific commonalities and differences in the gene expression as well as the underlying regulatory programs. At the level of genes potentially modulated by the microbiota, 49 and 31% of the rectal and ileal DE-Gs were unique to the corresponding sites, possibly indicating specific effects of the CD altered microbiota on ileum and rectum. This is partially in line with observations from previous studies which report global differences in the transcriptional activity of genes in IBD patients between the two sites (Bogaert et al., 2010; Zhang et al., 2020b). However, these studies did not consider the plausible microbial influences and hence could be capturing a range of unknown variables on ileal and rectal gene expression. Central regulators such as SPI1, NFKB1 and STAT1 displayed different levels of specificity thus pointing to site-specific regulatory programs mediating the possible effects of the microbiota. Although this is a novel finding with therapeutic implications, it needs further independent validation. By integrating different levels of information such as collection of genes associated with CD pathogenesis, targets of known CD drugs as well as gene expression signatures in response to drugs, we interpreted the disease relevance and druggability of the ileal and rectal networks. Enrichment of our networks with CD associated genes and drug targets not only assign biological significance to the inferred networks but also impart translational value. Eventhough we did not integrate mutation information from CD patients, only a third of the OpenTargets annotated CD proteins in the ileal and rectal networks are known risk loci for CD. Furthermore, expanding this to the entirety of the two signaling networks, only 6.5% on average are known IBD susceptibility loci, a subset of which include prominent components of the inflammatory pathways (NFKBIZ, STAT3, STAT4 (UniProt: Q14765), STAT5A (UniProt: P42229), STAT5B (UniProt: P51692), JAK2), chemokines (CCL2, CXCL1 (UniProt: P09341), CXCL3 (UniProt: P19876), CXCL5 (UniProt: P42830)), interleukin receptors (IL1RL1 (UniProt: Q01638), IL23R (UniProt: Q5VWK5), IL7R (UniProt: P16871)) and tight junctions (CDH1 (UniProt: Q9UM11), ITGA4, HCK (UniProt: P08631), MUC1 (UniProt: P15941)) (Table S14). This suggests that extrinsic factors such as potential microbial influences could affect the transient activity of the proteins in a mutation independent manner. Although further studies are required to ascertain this observation, it is concordant with the conclusions of many genetic studies that there is missing heritability (Chen et al., 2014; Jostins et al., 2012). From a therapeutic viewpoint, functionally and topologically relevant nodes in our networks are known targets of approved IBD drugs such as tofacitinib (JAK1, JAK2, JAK3, TYK2) and vedolizumab (ITGA4).ITGA4 interacts with vascular cell adhesion protein 1, a key protein involved in cellular adhesion and communication, and plays a major role in inflammation by recruiting memory and effector T cells to inflamed intestinal tissues(Elices et al., 1990; Reinhardt et al., 1997; Rüegg et al., 1992). In total, 24 proteins from the ileal and rectal networks could be identified cumulatively as drug targets of 27 different CD therapeutics as inferred from completed or ongoing clinical trials (Tables 2 and S11). From a translational perspective, though, topological relevant proteins in the signaling networks which are not known CD drug targets could be very handy in designing novel therapeutics since they are important mediators of plausible microbial influences on CD pathogenesis. Despite the above-mentioned findings, our study is limited by some of the assumptions and existing bottlenecks in terms of data availability as listed below: (1) The lack of cell-type specific read-outs such as gene expression. This is relevant due to the recent studies reporting differences in cell-type compositions and activities between IBD patients and non-IBD controls (Martin et al., 2019; Smillie et al., 2019) (2) non-availability of strain-level resolution of the microbiome in both the test and validation cohorts, especially given that differences in pathogenicity are attributed to strain identities within the same species (Fang et al., 2018; Kalan et al., 2019; Zhang and Zhao, 2016) (3) lack of microbial read-outs from the intestinal mucosal samples (4) interaction prediction between microbial and host proteins are limited to predictions due to lack of large-scale bacterial-host PPIs (5) the effects of other regulatory layers such as miRNAs (Su et al., 2010) or long non-coding RNAs (Chen et al., 2013; Zacharopoulou et al., 2017) have not been considered during the network diffusion process (6) microbial components have been limited to proteins and excluded other molecules such as small RNAs, and metabolites that have been known to modulate host responses (Ahmadi Badi et al., 2020; Huang et al., 2019; Valter et al., 2020; Wlodarska et al., 2015) (7) studies have shown vast differences in composition and abundance between the rectal/colonic and ileal microbiomes (Zoetendal et al., 2008). Hence, the lack of site-specific read-outs from the ileal and rectal compartments is another drawback of the study. To conclude, using a combinatorial computational workflow enabling the integration of high-resolution microbial datasets representing the altered gut microbiome with the gene expression from ileal and rectal biopsies of CD patients, we inferred novel mechanisms which could mediate the plausible microbial effects on host responses in CD. By performing customized analyses on the inferred signaling networks representing the signal transduction between the CD altered microbial proteins and gene expression, we identified topologically relevant proteins with functional significance as exemplified by their involvement in CD pathogenesis. Using specific examples, we pointed out potential mechanisms of action by which microbial proteins with enriched abundances or enhanced transcriptional activity can activate inflammatory responses in CD. Using gene expression data from a paired independent validation cohort of CD patients, we could validate the microbiome-modulated gene expression signatures in the ileum of CD patients. By comparing the inferred ileal and rectal signaling networks, we identified site specific differences in terms of individual genes as well as the regulatory rewiring occurring between the two sites. We also highlight functionally relevant proteins that are already known drug targets or those which could be targeted de novodue to their topological and functional relevance. Taken together along with the validation using an independent cohort of CD patients, our results provide deeper insights into site-specific mechanisms used by the microbiome in CD patients and provide opportunities for development of more tailored therapies.

STAR★Methods

Key resources table

Resource availability

Lead contact

Further information and requests for reagents should be directed to Dr. Padhmanand Sudhakar (padhmanand.sudhakar@kuleuven.be).

Materials availability

This study did not generate new unique reagents.

Experimental model and subject details

CD patients and non-IBD control subjects

For the validation cohort, patients were selected based on (a) their identification as having active CD as assessed by the endoscopic scores and (b) having paired fecal microbiome profiling datasets. Non-IBD controls were similarly chosen (paired ileal gene expression and fecal microbiome profiling) from the Flemish Gut Flora Project. The ethics committee of the University Hospitals Leuven approved the study (IRB approvals, B322201213950/S53684 and B322201627472/S57662). All individuals gave written informed consent.

Method details

Identifying bacterial species and genes/transcripts with enhanced abundances/transcriptional activity in CD

Bacterial species relevant for CD were selected (Table S1) based on multiple lines of evidence. Firstly, from the IBD Multi-omics Database within the Human Integrated Microbiome project (Lloyd-Price et al., 2019), bacterial species with alterations between CD (n = 67) and non-IBD subjects (n = 27) at both the levels of abundance and transcriptional activity, species with enhanced abundances displaying a t-statistic of ≥ 2 were selected. From the same study, bacterial species contributing to alterations in the community composition as measured by the dysbiotic index between CD and non-IBD subjects (at both the levels of abundance and transcriptional activity) displaying enhanced abundances displaying a t-statistic of ≥ 2 and Q-value ≤ 0.05 were selected. The bacterial species identified above were subjected to a literature search to check if they have been observed in at least one other study to have increased abundance levels in CD patients. This was done to improve the reliability of our inferences about the bacterial species inferred as being important in the context of CD (Table S1). Genes with enhanced abundances/transcriptional activity in CD were determined from the metagenomic and metatranscriptomic profiles. Significant metagenomic Enzyme Classification (EC) profiles, EC transcription, metagenomic KEGG Orthology (KO) profiles, KO transcription, protein abundances, EC-level protein abundances, and KO-level protein abundances with enhanced abundances in CD from the HMP2 study(Lloyd-Price et al., 2019) were identified by setting a threshold for the t-statistic at ≥ 2 and Q-value ≤ 0.05 (Table S2). The KO-number (KEGG orthology number) of the genes/transcripts with significant abundances were retrieved from KEGG (Kanehisa et al., 2017). De-novoprediction of K-numbers for the chosen proteomes were performed using BlastKOALA (Kanehisa et al., 2016).

Orthology analysis

The annotated proteomes corresponding to the identified bacterial species were downloaded from Uniprot (The UniProt Consortium, 2018). Where available, the reference proteomes were retrieved. The stand-alone version of the ortholog group mapping software OMA (Altenhoff et al., 2019) was used to identify orthologous groups. Default settings were used to carry out the ortholog analysis. Orthology tests against multiple members of Faecalibacterium sp. and Akkarmensia sp were performed using the NCBI blastp with default parameter settings (E-value ≤ 0.05).

Microbe-host protein-protein interaction prediction

The microbe-host interaction prediction based on the inferred interactions and their downstream effects was carried out along the lines of the general framework laid out in the manuscript by Andrighetti et al. (2020). To maintain the context specificity (i.e. prevalence in CD), interactions corresponding to bacterial genes/proteins with enhanced abundances in CD (as against the non-IBD controls) (Table S2) were considered for further analysis. Interaction prediction between microbial and host proteins were performed by using the domain-domain and domain-motif interaction prediction principle applied in the manuscript by Andrighetti et al. (2020) and used elsewhere in previous studies (Sudhakar et al., 2019). Briefly, the interaction prediction assumes that protein-pairs with known interacting domain-motif or domain-domain pairs also interact. With the domain-motif method, in order to capture the desired direction of signal transduction (i.e., the modulation of host proteins by the bacterial proteins), we imposed the condition that the interacting domains need to be harbored by the microbial proteins and the motifs on the host proteins. Pfam domain annotation for the microbial proteins was retrieved from Uniprot (The UniProt Consortium, 2018) whereas information about the occurrence of motifs and the gold standard list of domain-motif interactions were obtained from the Eukaryotic Linear Motif database (Dinkel et al., 2014; Gouw et al., 2018). The resulting set of predicted interactions between microbial and host proteins were then subjected to a disordered region based structural filtering procedure as described by Andrighetti et al. (2020). Disordered region predictions were performed by using IUpred2A (Mészáros et al., 2018). In short, only protein-protein interactions corresponding to motifs that lie within disordered regions of the host proteins were considered.

Sub-cellular localization prediction of proteins

Where available, the reference proteomes corresponding to the bacterial species were downloaded from Uniprot (The UniProt Consortium, 2018).Since we did not find evidence for the majority of inferred bacterial species to be intracellular (with the exception of Adherent-Invasive E. coli(AIEC) to host cells, we considered them to be extracellular. Host proteins were also selected based on their localization information. For compiling the extracellular list, selection was done by retaining only proteins located in “plasma membrane” and “extracellular region” according to the databases LocDB (Rastogi and Rost, 2011) and Human Protein Atlas (HPA) (Thul and Lindskog, 2018). Proteins corresponding to all the remaining localization terms were considered as intracellular.

Gene expression in CD patients

The set of differentially expressed genes (DE-Gs) between CD and non-IBD subjects were retrieved from Lloyd-Price et al. (2019). Since biopsies were taken from two different sites (rectum and ileum), we obtained independent lists of differentially expressed genes for each of the two sites.

Inferring effects of microbe-host predictions on host gene expression

To infer the most probable signaling chains which capture the functional effects of the bacterial proteins on host processes, we used the network diffusion algorithm TieDie (Paull et al., 2013). TieDie not only discounts for topological biases in networks such as hubs but also mines out logically consistent paths between a given set of start and stop nodes. Host proteins which are predicted to be bound by the bacterial proteins were used as the start nodes. As for the stop nodes, the transcription factors (TFs) of the differentially expressed genes in CD patients were used. TFs of the DE-Gs were retrieved from the set of directed and signed transcriptional regulatory interactions from DoRothEA (Garcia-Alonso et al., 2019). For each of the TFs in the set of stop nodes, weights were calculated using the following formula (Equation 1). The equation considers both the fold change of the DEG as well as the sign (i.e stimulatory or inhibitory) of the relationship between the TF and the DEG. The set of directed and signed protein-protein interactions from OmniPath(Türei et al., 2016) were used as a source of interactions for inferring the signaling chains between the start and the stop nodes.

Retrieval of biologically plausible chains

Given that there exist upper limits for the diameter of real-world biological networks, we set out path length limitations in order to capture the most relevant signaling chains inferring from the network diffusion methodology described above. In the case of signaling chains mediated by the set of extracellular bacterial proteins, we set a path length of four (i.e., the bacterial protein followed by the host receptor, an intermediary host protein, the TF and the DEG). As for the set of intracellular bacterial proteins (all from AIEC), we set a path length of three (i.e., the bacterial protein followed by the host receptor, the TF and the DEG). Paths of the desired lengths (as described above) were extracted using the command line version of PathLinker (Ritz et al., 2016). Network diffusion and chain extractions were performed separately for every bacterial species and the two sets of DE-Gs corresponding to both the ileal and rectal biopsies. Only chains not corresponding to interactions between bacterial and host proteins mediated by structural complex formation events were prioritized in order to retain the chains specific to post-translational modifications elicited by the bacterial proteins on the host proteins. Furthermore, chains corresponding to post-translational modifications elicited by bacterial histidine kinases (HKs) were discarded since the phosphorylation domains of HKs are predominantly embedded in the cytoplasmic-membrane boundary of bacterial cells (thus rendering the HKs incapable of inducing phosphorylation events on host proteins).

Functional enrichment analysis

The R packages ReactomePA (Yu and He, 2016) and clusterProfiler (Yu et al., 2012) were used to perform the enrichment analysis. An adjusted P-value cut-off of ≤ 0.1 was used to infer the statistically significant enrichment terms. For the enrichment analysis of the signaling networks, all the nodes in the directed PPI network were set as the background. For the corresponding analysis of the DE-Gs in the ileal signaling network, the entire set of DE-Gs were used as the background. For the identification of site-specific enrichment terms, the enrichment was performed with different gene sets separated by site and gene categories. All human genes (receptors, intermediary proteins, transcription factors (TFs) and DE-Gs) were considered for the site-specific enrichment analysis.

Validation cohort

A cohort of paired 16S read-outs from stool samples (n = 20) and gene expression data (n = 20) from ileal biopsies in CD patients, cross-sectionally collected at the University Hospitals Leuven was available for independent validation. The ethics committee of the University Hospitals Leuven approved the study (IRB approvals, B322201213950/S53684 and B322201627472/S57662). All individuals gave written informed consent.

Microbial load measurement by flow cytometry

Cell counting for all samples, was performed in duplicate. Briefly, 0.2 g frozen (−80°C) aliquots were dissolved in physiological solution to a total volume of 100 mL (8.5 g/L NaCl; VWR International, Germany). Subsequently, the slurry was diluted 1,000 times. Samples were filtered using a sterile syringe filter (pore size of 5 μμm; Sartorius Stedim Biotech GmbH, Germany). Next, 1 mL of the microbial cell suspension obtained was stained with 1 μL SYBR Green I (1:100 dilution in DMSO; shaded 15 min incubation at 37°C; 10,000 concentrate, Thermo Fisher Scientific, Massachusetts, USA). The flow cytometry analysis was performed using a C6 Accuri flow cytometer (BD Biosciences, New Jersey, USA) based on Prest et al. (Doherty et al., 2018) Fluorescence events were monitored using the FL1 533/30 nm and FL3 >670 nm optical detectors. In addition, also forward and sideward-scattered light was collected. The BD Accuri CFlow software was used to gate and separate the microbial fluorescence events on the FL1/FL3 density plot from the faecal sample background. A threshold value of 2000 was applied on the FL1 channel. The gated fluorescence events were evaluated on the forward/sideward density plot, as to exclude remaining background events. Instrument and gating settings were kept identical for all samples (fixed staining/gating strategy (Doherty et al., 2018)). Based on the exact weight of the aliquots analysed, cell counts were converted to microbial loads per gram of faecal material. Faecal Moisture content was determined as the percentage of mass loss after lyophilization from 0.2 g frozen aliquots of non-homogenized faecal material (−80°C).

DNA extraction and sequencing data pre-processing

Faecal DNA extraction and microbiota profiling was performed as described previously (Gevers et al., 2014). Briefly, DNA was extracted from faecal material using the MoBio PowerMicrobiome RNA isolation kit with the addition of 10 min incubation at 90°C after the initial vortex step. The V4 region of the 16S rRNA gene was amplified with primer pair 515F/806R (Caporaso et al., 2011). Sequencing was performed on the Illumina MiSeq platform (San Diego, California, USA), to generate paired-end reads of 250 bases in length in each direction. After de-multiplexing, fastq sequences were merged using FLASH (Magoč and Salzberg, 2011) software with default parameters. Successfully combined reads were filtered based on quality (>90% of nucleotides with quality score of 30 or higher for every read) using Fastx tool kit (http://hannonlab.cshl.edu/fastx_toolkit/). Chimeras were removed with UCHIME (Edgar et al., 2011). Faecal samples were processed altering the protocol above to dual-index barcoding. Pre-processing was performed using the DADA2 (Callahan et al., 2016) pipeline v1.6.0.

Relative microbiome profiling (RMP)

For the relative microbiome matrix, each sample was downsized to 10,000 reads by random selection of reads. Samples with less than 10,000 reads were excluded (two samples). The taxonomy of reads was assigned using RDP classifier 2.12 (Wang et al., 2007).

Quantitative microbiome profiling (QMP)

The quantitative microbiome profiling matrix was built as described by Vandeputte and colleagues (Vandeputte et al., 2017). In short, samples were downsized to even sampling depth, defined as the ratio between sampling size (16S rRNA gene copy number corrected sequencing depth) and microbial load (average total cell count per gram of frozen faecal material). 16S rRNA gene copy numbers were retrieved from the ribosomal RNA operon copy numberdatabaserrnDB(Stoddard et al., 2015). The copy number corrected sequencing depth of each sample was rarefied to the level necessary to equate the minimum observed sampling depth in the cohort. Samples with resulting rarefied read counts <150 were excluded from QMP analyses. Rarefied genus abundances were converted into numbers of cells per gram.

Relative microbiome sequence variant profiling

Characterization of the microbiota profiles below genus-level was performed using the DADA2 (Naftali et al., 2016) pipeline sequence variants, with taxonomy assignment by RDP classifier v 2.12 (Wang et al., 2007). A phylogenetic tree for the sequence variants was reconstructed using the dada2 sequence variant sequences, by maximum likelihood reconstruction (GTR model) using PhyML (Guindon et al., 2010). Sequence variants were grouped into species clusters by collapsing sequence variants with less than 0.005 branch length distance in the sequence variants tree. Competition between species clusters with increasing inflammation levels was evaluated by assessing the correlation between calprotectin levels and species clusters dominance, i.e. their relative proportion of the total genus abundance per sample, excluding samples for which the genus total abundance sum was zero.

Quantification and statistical analysis

Statistical analyses were performed in R using packages vegan (Oksanen, 2021), phyloseq (McMurdie and Holmes, 2013), FSA (Derek, 2021), coin (Hothorn, 2021), DirichletMultinomial (Morgan, 2021), and fitdistrplus (Delignette-Muller, 2021). All statistical tests used were two-sided. All p values were corrected for multiple testing using the Benjamini-Hochberg method (reported as FDR) when multiple tests were performed on lists (n > 1) of features (e.g. taxa-metadata or taxa-taxa associations), also when performing multiple pairwise group (n > 2) comparisons (e.g. Kruskal-Wallis test with post-hoc Dunn test). Cramer's phi (√χ2/N) was used as measure of effect size for chi square tests (likelihood ratio test) and r = Z/√N for the Wilcoxon signed rank test.

Faecal microbiome derived features and visualization

Observed genus richness was calculated on the RMP matrix (downsized to 10,000 reads as described above) using phyloseq (McMurdie and Holmes, 2013). Enterotyping (or community typing) based on the Dirichlet Multinomial Mixtures (DMM) approach was performed in R as described by Holmes et al. (Holmes et al., 2012). Enterotyping was performed on a combined genus-abundance matrix including IBD and healthy controls samples compiled with the samples originating from the Flemish Gut Flora Project. Microbiome inter-individual variation was visualized by principal coordinates analysis (PCoA) using Bray-Curtis dissimilarity on the genus-level abundance matrix.

Microbiome and physiological features associations

Taxa unclassified at genus level or present in less than 20% of samples were excluded from the statistical analyses. Pearson or spearman correlations were used respectively for linear or rank-order correlations between continuous variables. Mann–Whitney U was used to test median differences of continuous variables between two different groups. For more than two groups, Kruskal-Wallis test with post-hoc Dunn test were used. Statistical differences in the proportions of categorical variables (enterotypes) between patient groups were evaluated using pairwise Chi-square tests.

Transcriptomic analysis

Only CD patients (n = 20) with a Bacteroides 2 enterotype were included for the differential gene expression analysis from their corresponding ileal biopsies (Table S13). A non-IBD group was used as the control (n = 15). RNA extraction and sequencing was performed as reported in (Verstockt et al., 2019). Raw RNA-sequencing data were aligned to the reference genome using Hisat2 version 2.1.0 (Kim et al., 2015) and absolute counts generated using HTSeq (Anders et al., 2015). Counts were then normalized for library size using the DESeq2 package (Love et al., 2014), and only protein-coding genes (Ensemble hg 19 reference build) having an average of at least 10 normalised read counts were considered for further analysis (n = 14,263). Differential expression analysis with multiple testing correction (Benjamini-Hochberg method) was then performed, and correction for age and gender was taken into account. Genes with a log2 fold change of ≥ 1.5 and ≤ −1.5 and an adjusted P-value of ≤ 0.05 were considered to be differentially expressed. The raw datasets have been deposited at ArrayExpress (accession number: E-MTAB-10395).

Drug and disease relevance of the networks

Proteins (# = 5484) known to be associated with CD based were retrieved from the OpenTargets (Carvalho-Silva et al., 2019) database (Table S12). Only those proteins (# = 720) with at least two different sources of evidence were considered for further analysis. To identify the possible drugs or perturbations which can mimic or reverse the signatures of the DE-Gs (in the CD patients) in the microbiota-induced ileal and rectal networks, the DEG signatures were matched against the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 dataset (Duan et al., 2016). The LINCS L1000 dataset contains more than a million gene expression profiles of human cell lines treated with chemical perturbations or drugs.
REAGENT or RESOURCESOURCEIDENTIFIER
Biological samples

Ileal tissue biopsies of Crohn’s disease patients and non-IBD control subjectsUZ Leuven/KU Leuven biobankhttps://www.uzleuven.be/en/uz-kuleuven-biobank
Fecal samples of Crohn’s disease patients and non-IBD control subjectsUZ Leuven/KU Leuven biobankhttps://www.uzleuven.be/en/uz-kuleuven-biobank

Chemicals, peptides, and recombinant proteins

SYBR Green IThermo Fisher Scientific, Massachusetts, USACat#S7563

Critical commercial assays

PowerMicrobiome RNA isolation kitMo BioCat#26000-50

Deposited data

Raw fastq filesThis paperE-MTAB-10395

Software and algorithms

BlastKOALA(Kanehisa et al., 2016)https://www.kegg.jp/blastkoala/
MicrobioLink(Andrighetti et al., 2020)https://github.com/korcsmarosgroup/HMIpipeline
IUpred2A(Mészáros et al., 2018)http://iupred2a.elte.hu./
TieDie(Paull et al., 2013)https://github.com/epaull/TieDIE
PathLinker(Ritz et al., 2016)https://github.com/Murali-group/PathLinker
ReactomePA(Yu and He, 2016)http://bioconductor.org/packages/release/bioc/html/ReactomePA.html
clusterProfiler(Yu et al., 2012)http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html
FLASH(Magoč and Salzberg, 2011)http://www.cbcb.umd.edu/software/flash
UCHIME(Edgar et al., 2011)http://drive5.com/usearch/manual/uchime_algo.html
Fastx tool kitHannon Labhttp://hannonlab.cshl.edu/fastx_toolkit/
DADA2 pipeline v1.6.0(Naftali et al., 2016)https://benjjneb.github.io/dada2/tutorial.html
RDP classifier 2.12(Wang et al., 2007)https://bioweb.pasteur.fr/packages/pack@rdp_classifier@2.12
rrnDB(Stoddard et al., 2015)https://rrndb.umms.med.umich.edu/
PhyML(Guindon et al., 2010)http://www.atgc-montpellier.fr/phyml/
Vegan(Oksanen, 2021)https://cran.r-project.org/web/packages/vegan/index.html
Phyloseq(McMurdie and Holmes, 2013)https://joey711.github.io/phyloseq/
FSA(Derek, 2021)https://cran.r-project.org/web/packages/FSA/index.html
Coin(Hothorn, 2021)https://cran.r-project.org/web/packages/coin/index.html
DirichletMultinomial(Morgan, 2021)http://bioconductor.org/packages/release//bioc/html/DirichletMultinomial.html
fitdistrplus(Delignette-Muller, 2021)https://cran.r-project.org/web/packages/fitdistrplus/index.html
Hisat2 version 2.1.0(Kim et al., 2015)http://daehwankimlab.github.io/hisat2/manual/
HTSeq(Anders et al., 2015)https://htseq.readthedocs.io/en/master/
DESeq2 package(Love et al., 2014)https://bioconductor.org/packages/release/bioc/html/DESeq2.html
BD Accuri CFlow softwareBDbioscienceshttps://www.bdbiosciences.com/us/instruments/research/cell-analyzers/bd-accuri/software/c6-plus-analysis-software-for-pc-or-mac/p/661083

Other

IBDMDB(Lloyd-Price et al., 2019)https://ibdmdb.org/tunnel/public/summary.html
Uniprot(The UniProt Consortium, 2018)https://www.uniprot.org/
DoRothEA(Garcia-Alonso et al., 2019)https://github.com/saezlab/dorothea-py
OmniPath(Türei et al., 2016)https://omnipathdb.org/
LocDB(Rastogi and Rost, 2011)https://www.rostlab.org/services/locDB/
Human Protein Atlas (HPA)(Thul and Lindskog, 2018)https://www.proteinatlas.org/
OpenTargets(Carvalho-Silva et al., 2019)https://www.targetvalidation.org/
LINCS L1000 dataset(Duan et al., 2016)https://lincsproject.org/LINCS/tools/workflows/find-the-best-place-to-obtain-the-lincs-l1000-data
IBD risk loci(Jostins et al., 2012)PMID: 23128233
  144 in total

Review 1.  Small RNAs - Big Players in Plant-Microbe Interactions.

Authors:  Chien-Yu Huang; Huan Wang; Po Hu; Rachael Hamby; Hailing Jin
Journal:  Cell Host Microbe       Date:  2019-08-14       Impact factor: 21.023

2.  Prediction of protein-protein interactions between human host and a pathogen and its application to three pathogenic bacteria.

Authors:  O Krishnadev; N Srinivasan
Journal:  Int J Biol Macromol       Date:  2011-02-16       Impact factor: 6.953

3.  A gp130-Src-YAP module links inflammation to epithelial regeneration.

Authors:  Koji Taniguchi; Li-Wha Wu; Sergei I Grivennikov; Petrus R de Jong; Ian Lian; Fa-Xing Yu; Kepeng Wang; Samuel B Ho; Brigid S Boland; John T Chang; William J Sandborn; Gary Hardiman; Eyal Raz; Yoshihiko Maehara; Akihiko Yoshimura; Jessica Zucman-Rossi; Kun-Liang Guan; Michael Karin
Journal:  Nature       Date:  2015-02-25       Impact factor: 49.962

4.  Cannabinoid-induced autophagy regulates suppressor of cytokine signaling-3 in intestinal epithelium.

Authors:  Luan C Koay; Rachael J Rigby; Karen L Wright
Journal:  Am J Physiol Gastrointest Liver Physiol       Date:  2014-05-15       Impact factor: 4.052

5.  LocDB: experimental annotations of localization for Homo sapiens and Arabidopsis thaliana.

Authors:  Shruti Rastogi; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2010-11-11       Impact factor: 16.971

6.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

7.  Dysbiosis, inflammation, and response to treatment: a longitudinal study of pediatric subjects with newly diagnosed inflammatory bowel disease.

Authors:  Kelly A Shaw; Madeline Bertha; Tatyana Hofmekler; Pankaj Chopra; Tommi Vatanen; Abhiram Srivatsa; Jarod Prince; Archana Kumar; Cary Sauer; Michael E Zwick; Glen A Satten; Aleksandar D Kostic; Jennifer G Mulle; Ramnik J Xavier; Subra Kugathasan
Journal:  Genome Med       Date:  2016-07-13       Impact factor: 11.117

8.  Fecal Microbiota Signatures Are Associated with Response to Ustekinumab Therapy among Crohn's Disease Patients.

Authors:  Matthew K Doherty; Tao Ding; Charlie Koumpouras; Shannon E Telesco; Calixte Monast; Anuk Das; Carrie Brodmerkel; Patrick D Schloss
Journal:  mBio       Date:  2018-03-13       Impact factor: 7.867

9.  Intestinal non-canonical NFκB signaling shapes the local and systemic immune response.

Authors:  Sadeesh K Ramakrishnan; Huabing Zhang; Xiaoya Ma; Inkyung Jung; Andrew J Schwartz; Daniel Triner; Samantha N Devenport; Nupur K Das; Xiang Xue; Melody Y Zeng; Yinling Hu; Richard M Mortensen; Joel K Greenson; Marilia Cascalho; Christiane E Wobus; Justin A Colacino; Gabriel Nunez; Liangyou Rui; Yatrik M Shah
Journal:  Nat Commun       Date:  2019-02-08       Impact factor: 14.919

10.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease.

Authors:  Luke Jostins; Stephan Ripke; Rinse K Weersma; Richard H Duerr; Dermot P McGovern; Ken Y Hui; James C Lee; L Philip Schumm; Yashoda Sharma; Carl A Anderson; Jonah Essers; Mitja Mitrovic; Kaida Ning; Isabelle Cleynen; Emilie Theatre; Sarah L Spain; Soumya Raychaudhuri; Philippe Goyette; Zhi Wei; Clara Abraham; Jean-Paul Achkar; Tariq Ahmad; Leila Amininejad; Ashwin N Ananthakrishnan; Vibeke Andersen; Jane M Andrews; Leonard Baidoo; Tobias Balschun; Peter A Bampton; Alain Bitton; Gabrielle Boucher; Stephan Brand; Carsten Büning; Ariella Cohain; Sven Cichon; Mauro D'Amato; Dirk De Jong; Kathy L Devaney; Marla Dubinsky; Cathryn Edwards; David Ellinghaus; Lynnette R Ferguson; Denis Franchimont; Karin Fransen; Richard Gearry; Michel Georges; Christian Gieger; Jürgen Glas; Talin Haritunians; Ailsa Hart; Chris Hawkey; Matija Hedl; Xinli Hu; Tom H Karlsen; Limas Kupcinskas; Subra Kugathasan; Anna Latiano; Debby Laukens; Ian C Lawrance; Charlie W Lees; Edouard Louis; Gillian Mahy; John Mansfield; Angharad R Morgan; Craig Mowat; William Newman; Orazio Palmieri; Cyriel Y Ponsioen; Uros Potocnik; Natalie J Prescott; Miguel Regueiro; Jerome I Rotter; Richard K Russell; Jeremy D Sanderson; Miquel Sans; Jack Satsangi; Stefan Schreiber; Lisa A Simms; Jurgita Sventoraityte; Stephan R Targan; Kent D Taylor; Mark Tremelling; Hein W Verspaget; Martine De Vos; Cisca Wijmenga; David C Wilson; Juliane Winkelmann; Ramnik J Xavier; Sebastian Zeissig; Bin Zhang; Clarence K Zhang; Hongyu Zhao; Mark S Silverberg; Vito Annese; Hakon Hakonarson; Steven R Brant; Graham Radford-Smith; Christopher G Mathew; John D Rioux; Eric E Schadt; Mark J Daly; Andre Franke; Miles Parkes; Severine Vermeire; Jeffrey C Barrett; Judy H Cho
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  1 in total

1.  Tailoring Multi-omics to Inflammatory Bowel Diseases: All for One and One for All.

Authors:  Padhmanand Sudhakar; Dahham Alsoud; Judith Wellens; Sare Verstockt; Kaline Arnauts; Bram Verstockt; Severine Vermeire
Journal:  J Crohns Colitis       Date:  2022-08-30       Impact factor: 10.020

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.