| Literature DB >> 32650786 |
Mathieu Garand1, Manoj Kumar2, Susie Shih Yin Huang2, Souhaila Al Khodor3.
Abstract
BACKGROUND AND AIMS: The task of identifying a representative and yet manageable target gene list for assessing the pathogenesis of complicated and multifaceted diseases is challenging. Using Inflammatory Bowel Disease (IBD) as an example, we conceived a bioinformatic approach to identify novel genes associated with the various disease subtypes, in combination with known clinical control genes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32650786 PMCID: PMC7350750 DOI: 10.1186/s12967-020-02408-7
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1Workflow of constructing a gene panel that putatively contains subtype signatures of Inflammatory Bowel Disease (IBD). a network of the predicted protein–protein interactions (STRING) inferred from the genes associated with immune responses and IBD highlights the complexity and difficulty of making logical interpretation (network image in top-left corner). In order to simplify the process, we devised six gene lists from different sources. Details on the methods used to retrieve the genes, the number of genes in the lists, and the terms used onwards are shown (Gene Lists). Then, we submitted the lists to Venn analysis which resulted in 21 intersections (Venn Analysis). The common (i.e. IBD core genes) and the unique lists for CD, UC, and IBDU were submitted to Literature Lab to obtain pathways and diseases association scores; each gene is ranked based on its contribution weight to a score (Extract Unique Groups–Literature Lab). In the network analysis, the top-ranking genes (those with > 5% contribution to the association) informed which nodes to expand to the primary and secondary nodes. We used both GeneMania and STRING (shown here) to obtain those networks. In the network shown, the edges depict the known protein interactions based on knowledge from curated databases (blue edges), experimentally determined (pink edges), and co-expression data (black edges). Genes without shared pathways are shown as independent nodes. The amalgamation of the gene selections from all the common and subtypes-specific genes amounts to 142 putative target genes
Fig. 2Literature-based assessment of Medical Subject Headings (MeSH) terms, pathways, and Gene Ontology (GO) annotation associated with IBD subtype-specific gene lists. a Representation of MeSH terms associated with each IBD disease subtypes identified using our data mining approach. Briefly, genes representing IBD in general, and those uniquely related to each subtype, were identified in our Venn analysis. Each list of genes was submitted to cluster analysis using Literature Lab PLUS; clusters (outside ring labelled C-1, 2, 3, etc.) and sub-clusters (large dot on lines connected to a cluster) of MeSH terms for each disease group are shown. IBD, Inflammatory Bowel Diseases; CD Crohn’s Diseases: UC Ulcerative colitis: IBD-U IBD unclassified. b Visualization of the associated pathways in each disease subtype and with the IBD core genes. Among the pathways common to IBD and all subtypes, immune processes are highly represented. c Network focused on the key factors and major pathways involved in IBD(s) pathogenesis. The top 50 common associated genes between IBD-CD, and UC were used to generate a network map. Blue connecting lines represent pathways (i.e. the 2 genes connected share a pathway) and red lines represent physical interaction. Genes without connections are not shown in the image. Colored ellipses highlight key immune pathways: pro-inflammatory (green), IL17 (purple), and anti-inflammatory (red). d Gene set annotations with GO. GO terms significantly associated with each disease subtype were identified using GSAn and are indicated by a “X”