| Literature DB >> 27089880 |
Hong Liu1, Jessica Liu2, Michelle Toups3, Timothy Soos2, Christopher Arendt2.
Abstract
BACKGROUND: The immune system is multifaceted, structured by diverse components that interconnect using multilayered dynamic cellular processes. Genomic technologies provide a means for investigating, at the molecular level, the adaptations of the immune system in host defense and its dysregulation in pathological conditions. A critical aspect of intersecting and investigating complex datasets is determining how to best integrate genomic data from diverse platforms and heterogeneous sample populations to capture immunological signatures in health and disease. RESULT: We focus on gene signatures, representing highly enriched genes of immune cell subsets from both diseased and healthy tissues. From these, we construct a series of biomaps that illustrate the molecular linkages between cell subsets from different lineages, the connectivity between different immunological diseases, and the enrichment of cell subset signatures in diseased tissues. Finally, we overlay the downstream genes of drug targets with disease gene signatures to display the potential therapeutic applications for these approaches.Entities:
Keywords: Enrichment analysis; Gene set; Gene signature; Immune cell; Immune disease; Network-based analysis
Mesh:
Year: 2016 PMID: 27089880 PMCID: PMC4836068 DOI: 10.1186/s12859-016-1012-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Similarity matrix of immune cell type gene signatures from human and mouse. Seventy-eight immune cell type gene signatures (20 human and 58 mouse) were paired against each other. Similarity was calculated by Fisher’s exact test of overlapping genes for each pair. Gene signatures were positioned according to their common cell lineage. Color represents the –log (P value of Fisher’s exact test), with red color indicating high similarity, and blue color indicating less/no similarity. Solid line black boxes group the gene signatures from the same lineage in either human or mouse, while dotted line black boxes group those from the same lineage between human and mouse. HSC hematopoietic stem cell, GN granulocyte, MO monocyte
Over-represented CD molecules and cytokine receptors in immune cell type gene signatures
| Cell type | Symbol | Entrez gene name | # of gene set |
|---|---|---|---|
| HSPC | CD34 | CD34 molecule | 4 out of 5 |
| GN | CD300LB | CD300 molecule-like family member b | 6 out of 6 |
| GN | CD300LF | CD300 molecule-like family member f | 6 out of 6 |
| GN | CD33 | CD33 molecule | 6 out of 6 |
| GN | CXCR2 | chemokine (C-X-C motif) receptor 2 | 6 out of 6 |
| GN | CCR1 | chemokine (C-C motif) receptor 1 | 5 out of 6 |
| GN | CD101 | CD101 molecule | 5 out of 6 |
| GN | CD300LD | CD300 molecule-like family member d | 4 out of 6 |
| GN | CXCR1 | chemokine (C-X-C motif) receptor 1 | 4 out of 6 |
| GN | IFNLR1 | interferon, lambda receptor 1 | 4 out of 6 |
| GN | IL13RA1 | interleukin 13 receptor, alpha 1 | 4 out of 6 |
| GN | CD14 | CD14 molecule | 3 out of 6 |
| GN | CD44 | CD44 molecule (Indian blood group) | 3 out of 6 |
| GN | TNFRSF1A | tumor necrosis factor receptor superfamily, member 1A | 3 out of 6 |
| MO | CD300A | CD300a molecule | 7 out of 8 |
| MO | TNFRSF1B | tumor necrosis factor receptor superfamily, member 1B | 7 out of 8 |
| MO | IL10RA | interleukin 10 receptor, alpha | 7 out of 8 |
| MO | IL17RA | interleukin 17 receptor A | 6 out of 8 |
| MO | TNFRSF1A | tumor necrosis factor receptor superfamily, member 1A | 5 out of 8 |
| MO | IL13RA1 | interleukin 13 receptor, alpha 1 | 5 out of 8 |
| MO | CD68 | CD68 molecule | 5 out of 8 |
| MO | TNFRSF21 | tumor necrosis factor receptor superfamily, member 21 | 5 out of 8 |
| MO | CX3CR1 | chemokine (C-X3-C motif) receptor 1 | 5 out of 8 |
| MO | CD244 | CD244 molecule, natural killer cell receptor 2B4 | 4 out of 8 |
| DC | CD74 | CD74 molecule, major histocompatibility complex, class II invariant chain | 9 out of 9 |
| DC | IL10RA | interleukin 10 receptor, alpha | 8 out of 9 |
| DC | XCR1 | chemokine (C motif) receptor | 5 out of 9 |
| B | CD19 | CD19 molecule | 9 out of 10 |
| B | CD37 | CD37 molecule | 9 out of 10 |
| B | CD38 | CD38 molecule | 9 out of 10 |
| B | CD79A | CD79a molecule, immunoglobulin-associated alpha | 9 out of 10 |
| B | CD79B | CD79b molecule, immunoglobulin-associated beta | 9 out of 10 |
| B | CD22 | CD22 molecule | 8 out of 10 |
| B | CD55 | CD55 molecule, decay accelerating factor for complement (Cromer blood group) | 8 out of 10 |
| B | CD72 | CD72 molecule | 8 out of 10 |
| B | CD74 | CD74 molecule, major histocompatibility complex, class II invariant chain | 8 out of 10 |
| B | CXCR5 | chemokine (C-X-C motif) receptor 5 | 8 out of 10 |
| B | TNFRSF13B | tumor necrosis factor receptor superfamily, member 13B | 8 out of 10 |
| B | TNFRSF13C | tumor necrosis factor receptor superfamily, member 13C | 8 out of 10 |
| B | CD180 | CD180 molecule | 7 out of 10 |
| B | CCR6 | chemokine (C-C motif) receptor 6 | 6 out of 10 |
| B | CD200 | CD200 molecule | 5 out of 10 |
| B | IL21R | interleukin 21 receptor | 5 out of 10 |
| B | IL9R | interleukin 9 receptor | 5 out of 10 |
| NK | IL2RB | interleukin 2 receptor, beta | 10 out of 10 |
| NK | IL12RB2 | interleukin 12 receptor, beta 2 | 9 out of 10 |
| NK | CCR5 | chemokine (C-C motif) receptor 5 (gene/pseudogene) | 6 out of 10 |
| NK | CD244 | CD244 molecule, natural killer cell receptor 2B4 | 6 out of 10 |
| NK | CD97 | CD97 molecule | 6 out of 10 |
| NK | IL12RB1 | interleukin 12 receptor, beta 1 | 6 out of 10 |
| NK | IL18RAP | interleukin 18 receptor accessory protein | 6 out of 10 |
| NK | CMKLR1 | chemokine-like receptor 1 | 5 out of 10 |
| NK | IL18R1 | interleukin 18 receptor 1 | 5 out of 10 |
| T | IL27RA | interleukin 27 receptor, alpha | 9 out of 10 |
| T | CD6 | CD6 molecule | 8 out of 10 |
| T | CD2 | CD2 molecule | 7 out of 10 |
| T | CD5 | CD5 molecule | 7 out of 10 |
| T | IL21R | interleukin 21 receptor | 7 out of 10 |
| T | CD4 | CD4 molecule | 6 out of 10 |
| T | CD28 | CD28 molecule | 5 out of 10 |
| T | CD3D | CD3d molecule, delta (CD3-TCR complex) | 5 out of 10 |
| T | CD3E | CD3e molecule, epsilon (CD3-TCR complex) | 5 out of 10 |
| T | TNFRSF25 | tumor necrosis factor receptor superfamily, member 25 | 5 out of 10 |
Immune disease gene signatures
| Disease category | # of gene set | # of study | # of signature disease gene |
|---|---|---|---|
| COPD | 20 | 13 | 169 |
| Asthma | 20 | 10 | 11 |
| Dermatitis | 12 | 8 | 51 |
| Psoriasis | 7 | 5 | 161 |
| IBD | 19 | 8 | 100 |
| Lupus | 25 | 17 | 154 |
| Arthritis | 20 | 11 | 55 |
| Sclerosis | 13 | 7 | 1 |
| T1D | 19 | 8 | 15 |
“# of Gene set” indicates the number of gene sets for each disease category; “# of Study” indicates the number of independent studies that were analyzed to generate the gene sets; “# of Signature Disease Gene” indicates the number of genes existing in more than five gene sets from at least two different studies. COPD chronic obstructive pulmonary disease, IBD inflammatory bowel disease, T1D type 1 diabetes
Fig. 2Similarity matrix of immune disease gene signatures. One hundred fifty-five Immune disease gene signatures were paired against each other. Similarity was calculated by Fisher’s exact test of overlapping genes for each pair. Gene signatures from the same disease category were positioned together. Color represents the –log (P value of Fisher’s exact test), with red color indicating high similarity, and blue color indicating less/no similarity. Black boxes group the gene signatures that represent the same disease category
Fig. 3Over-represented genes in immune disease modules. Twenty-two upregulated genes, found to be common to more than three signature disease gene lists, are illustrated. S100A9 is common to 5 diseases while CCL2 is common among 4 different diseases
Fig. 4Topological localization of pair of disease genes. Signature disease genes representing two different disease categories and with direct links on the interactome were plotted. (I) Topological locations of dermatitis and psoriasis genes. Orange color: genes unique to dermatitis; green color: genes unique to psoriasis; yellow color: genes shared by dermatitis and psoriasis. (II) Topological locations of COPD and asthma genes. Blue color: genes unique to COPD; rose color: genes unique to asthma
Network-based separation of signature disease genes
Network-based separation analysis was used to calculate the separation score for each pair of signature disease genes. Red color highlights the separation score of a pair with itself. Pink color highlights the negative separation score.
Fig. 5The interactions of signature disease genes. (I) Signature disease genes representing all nine disease categories with direct links on the interactome were plotted. The size of the gene name label is proportional to the number of interactions (degrees). Different colors represent different disease categories, while yellow represents genes shared by multiple diseases. The size of the yellow circle is proportional to the number of diseases to which that common gene belongs. (II) Signature disease genes representing three disease categories and with direct links on interactome were plotted. The size of the gene name is proportional to the number of interactions. The thick lines depict the direct IL1B interactions
Immune disease gene signatures and genetic linked genes
| Disease category | # of GWAS gene | # of DEG | # of overlapped gene |
|---|---|---|---|
| COPD | 113 | 3137 | 11 |
| Asthma | 195 | 1783 | 13 |
| Dermatitis | 53 | 2314 | 5 |
| Psoriasis | 56 | 1144 | 9* |
| IBD | 525 | 4095 | 116 |
| Lupus | 181 | 3696 | 33 |
| Arthritis | 301 | 5227 | 83 |
| Sclerosis | 343 | 2041 | 28 |
| T1D | 166 | 3144 | 20 |
“# of GWAS Gene” indicates the number of genes reported in GWAS Catalog that are linked to the related disease category; “# of DEG” indicates the number of Differentially Expressed Genes present in any related disease gene sets; “# of Overlapped Gene” indicates the number of genes existing in both “GWAS Gene” and “DEG”. * indicates that the overlapping is significant based on Chi-square test
Fig. 6Similarity matrix of immune cell type signatures vs. immune disease signatures. Two-hundred-eighty-seven human and mouse immune cell type signatures were paired against 155 immune disease signatures. Similarity was calculated by Fisher’s exact test of overlapping genes for each pair. Cell type signatures were positioned according to their common cell lineage, and disease signatures were positioned according to their disease category. Color represents the –log (P value of Fisher’s exact test), with red color indicating high similarity, and blue color indicating less/no similarity
Fig. 7Clustering of immune drug target gene sets vs. immune disease signatures. One-hundred-twenty-six immune drug target gene sets were paired against 155 immune disease signatures. Hierarchical clustering was performed based on the similarity that was calculated by Fisher’s exact test of the overlapping genes for each pair. (I) Heatmap shows the clustering of drug target gene sets (columns) vs. the disease signatures (rows). A, B lists the diseases that are represented by disease signatures showing similarity with drug target gene sets indicated by C1 and C2. Bottom bar chart indicates the number of linked diseases for each drug target shown in the same order as the above heatmap. (II) C1, C2 tables list the details of drug targets and their linked diseases for drug target gene set clusters shown in (I)
Fig. 8Topological localization of drug target genes and signature disease genes. (I) Network-based separation scores of IL17A target genes and signature disease genes of different disease categories. (II) Network connection of IL17A target genes and IBD signature disease genes. Line represents the direct link between the two groups. Green color: IL17A target gene; blue color: IBD signature disease gene; orange color: common gene. Green color with red outline: target genes have direct connection with more than six signature disease genes