| Literature DB >> 25309909 |
Trevor Clancy1, Eivind Hovig2.
Abstract
Recently, the Immunological Genome Project (ImmGen) completed the first phase of the goal to understand the molecular circuitry underlying the immune cell lineage in mice. That milestone resulted in the creation of the most comprehensive collection of gene expression profiles in the immune cell lineage in any model organism of human disease. There is now a requisite to examine this resource using bioinformatics integration with other molecular information, with the aim of gaining deeper insights into the underlying processes that characterize this immune cell lineage. We present here a bioinformatics approach to study differential protein interaction mechanisms across the entire immune cell lineage, achieved using affinity propagation applied to a protein interaction network similarity matrix. We demonstrate that the integration of protein interaction networks with the most comprehensive database of gene expression profiles of the immune cells can be used to generate hypotheses into the underlying mechanisms governing the differentiation and the differential functional activity across the immune cell lineage. This approach may not only serve as a hypothesis engine to derive understanding of differentiation and mechanisms across the immune cell lineage, but also help identify possible immune lineage specific and common lineage mechanism in the cells protein networks.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25309909 PMCID: PMC4189771 DOI: 10.1155/2014/363408
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Known transcriptional activators of the immune cell lineages.
| Immune cell type | Known transcriptional activators |
|---|---|
| B cells | POU2AF1, PAX5, EBF1, SPIB, SFPI1, FOXP1 |
| Dendritic cells | RELB, CIITA, AHR, SPIB, SFPI1 |
| Granulocytes | CEBPB, NFE2, SFPI1, FOXO3, CEBPE, FLI1 |
| Hematopoietic stem cells | HLF, LMO2, MYC, MYCN, GATA2, MEIS1, E2F6 |
| Macrophage | CEBPA, CEBPB, SFPI1 |
| Monocytes | CEBPB, SFPI1 |
| Natural killer cells | EOMES, TBX21, SMAD3, GATA3 |
| Natural killer t cells | GATA3, ZBTB16 |
| abT cells | TCF7, BCL11B, GATA3, IKZF2, RORC, SMAD7, TOX |
| gdT Cells | GATA3, SOX13, ID3 |
Figure 1Workflow of affinity propagation on the PNSM and the differential network analysis. The known activator genes which drive the differentiation of the main immune cell types (see Table 1) were used to query the list of their target genes as computed from the OntoGeNet algorithm on the ImmGen resource. The protein network neighborhood of each of the 7965 genes assigned to an ImmGen network module of was integrated with their target lineage information, computed from an integrated set of validated protein interaction network databases. Then, using the Simpson similarity index, their PNSM was computed. Affinity propagation or “message passaging” was then applied on the PNSM, to capture features of the immune lineage network. The resulting exemplars computed from the affinity propagation allows for differential functions to be captured through the lineage tree.
Figure 2Affinity propagation clustering of the protein interaction association similarity matrix of OntoGeNet target module genes. (a) The PNSM is illustrated for both the entire list of target genes computed from the OntoGeNet algorithm on the ImmGen resources. The degree of red color in the heatmap corresponds with the strength of similarity in the protein network for each gene pair. The exemplars as computed from affinity propagation are illustrated in the annotated color bars and the resulting hierarchical clustering (see Table 1 for list of the protein network exemplars). (b) Exemplifies the effect of application of the affinity propagation workflow applied to protein interaction networks, on hubs only. The hub analysis highlights the possible most influential interaction mechanisms activated by the gene regulatory networks (OntoGenet), which govern the immune cell lineage. (c) A differential functional analysis using the Gene Ontology Biological Process (GO-BP) tree is illustrated for two of the computed exemplars. The trajectory of functional significance of GO-PB terms from the two exemplar's genes from Figure 2(b) (indicated by the arrows) is illustrated through the GO-BP tree. GO-BP terms significant for the gene list within the exemplars are highlighted in a red color.
Figure 3Integrated protein interaction networks perspective on the gene regulation networks driving immune cell lineages. The immune cell lineage network is depicted as a bipartite network with multiple edges representation. Each edge represents a protein network exemplar. The multiple edges connect the different node types and reflect the regulator activity superimposed on the multiple protein network exemplars activated by immune lineage regulators. Each relationship is a representation of the gene regulation modules from the ImmGen resource connecting with the known regulators of immune cell lineages. Each edge in the network represents a relationship between an immune cell line lineage type (see legend in Figure 3(1)) and one of the known activating factors regulating the differentiation of that lineage (see Table 1 for list of the known activators used). An edge is drawn in the network if there is connection between a regulator gene (triangle node) and a course module (groups of commonly expressed genes) calculated from OntoGenet (circle nodes). The number of lines between a regulator and a module is a measure of how many “protein network exemplars,” as calculated from the affinity propagation (see main text), are associated to the regulatory module (and therefore a possible measure of the diversity of signaling networks activated in driving the lineage of the immune cell type).
Figure 4Bipartite network representation of immune cell lineages and protein network exemplars. In this bipartite network representation, the protein network exemplars are represented as circles (the names and genes in these exemplar groups are listed in Table 1), and immune cell types represented as square diamonds. A color gradient of degree of the node in the bipartite network ranges from green (lowest) to yellow (intermediate) and red (highest) is represented. Lineage specific exemplars are clearly illustrated in addition to the increasing range of common protein network exemplars. (These two types of patterns are listed in Table 2 and 3, resp.). The protein network exemplars are ordered, 1−10, according to their connectivity to the ten immune cell types in the bipartite network.
Lineage specific protein network exemplars.
| Exemplar ID | Genes assigned to the protein network exemplar | Functional annotation |
| Immune cell type |
|---|---|---|---|---|
| 102 | EPOR, PTPN1, STAT5B | Jak-STAT signaling pathway (KEGG) | 5.20 | Hematopoetic stem cells |
|
| ||||
| 20 | CCNT1, EIF2B1, MYC | Hematopoetic stem cells | ||
|
| ||||
| 86 | EZR, NGFRAP1, NTF3 | Neurotrophin signaling pathway | 4.50 | Granylocytes |
|
| ||||
| 49 | GRB7, TIA1 | Hematopoetic stem cells | ||
|
| ||||
| 104 | ARRB1 BGN PTS | Macrophages | ||
|
| ||||
| 96 | PLCB2, POLA1, VIM | Granylocytes | ||
|
| ||||
| 98 | POT1A, TERF1 | Telomere maintenance via telomerase (GO-BP) | 5.90 | Hematopoetic stem cells |
|
| ||||
| 7 | AR, ATRX, CTCF, SMC1A, SMC3 | Cell cycle (KEGG) | 4.40 | Dendritic cells |
|
| ||||
| 119 | CRE, PRPF40A, SMC2 | Nucleoplasm (GO-CC) | 8.00 | Dendritic cells |
|
| ||||
| 61 | CLDN11, CNOT6L, ITGA5, ITGB1, SPARC | Cell adhesion molecules (CAMs) (KEGG) | 7.80 | Macrophage |
|
| ||||
| 10 | BICC1, CREBBP, CSK, KHDRBS1, PRMT1, RBM39 | Control of Gene Expression by Vitamin D Receptor (KEGG) | 7.00 | Ggranylocyte |
|
| ||||
| 97 | EIF3L, POLR1B, POLR1E | RNA polymerase (KEGG) | 4.70 | Hematopoetic stem cells |
|
| ||||
| 41 | BRCA2, CEBPD, FANCD2 | Cell cycle process (GO-BP) | 5.70 | Macrophages |
|
| ||||
| 70 | ADRB2, DLL1, MAGI3 | Plasma membrane (GO-CC) | 5.40 | abTcells |
|
| ||||
| 88 | CHORDC1, IGBP1, NR3C1, PPP5C | Transition metal Ion binding (GO-MF) | 3.90 | Hematopoetic stem cells |
|
| ||||
| 75 | MEOX1, MEOX2, TLE4 | Transcription factor activity | 3.40 | Granylocytes |
|
| ||||
| 42 | FBXW7, NOTCH1, NPM1 STAT4 | Notch signaling pathway | 1.20 | Hematopoetic stem cells |
|
| ||||
| 36 | AEBP2, EED, MORC3, SETX, UHRF1 | Nucleoplasm (GO-CC) | 2.30 | Dendritic cells |
|
| ||||
| 38 | EIF3A, EIF3B, EIF3I, EIF4E | Translational initiation (GO-BP) | 2.00 | Hematopoetic stem cells |
Common protein network exemplars.
| Exemplar ID | Genes assigned to the protein network exemplar | Functional annotation |
|
|---|---|---|---|
| 127 | TANK, TNFRSF11A, TNFRSF4, TNFRSF9, TRAFD1, ZBP1 | Cytokine-cytokine receptor interaction (KEGG) | 3.50 |
|
| |||
| 32 | ATP6V1A, BAIAP2, CAMK2D, CLU, DLG4, DYNLL1, EPS8, FZD1, FZD4, GRIA3, HCK, INADL, KCNJ10, NSF, OPRD1, PACSIN1, PACSIN2, PGK1, PHB2, PPP3CA, RGS12, SEMA4B, SEMA4C, SLC9A3R1, STXBP1, SYT1 | Wnt signaling pathway (KEGG) | 1.60 |
|
| |||
| 44 | CD2AP, DAPK1, EFNA2, EPHA4, FGR, FYN, PKD2, RAVER1, RGS1, SH2D1A, SH3BP1, SLAMF1, VAV1, VAV2, ZAP70 | T cell receptor signaling pathway (KEGG) | 26.7 |
|
| |||
| 93 | HNRNPA1, LGALS3, NCOA3, PIAS1, RNF19A | Transcription cofactor activity (GO_BP) | 5.30 |
|
| |||
| 29 | CLNK, FYB, LYN, PECAM1, SKAP2 | Leukocyte activation (GO_BP) | 6.30 |
|
| |||
| 62 | CSF2RB, EIF2AK2, GHR, GMCL1, GNB2L1, HES1, IFNGR1, IL6ST, JAK1, JAK2, JAK3, LMO4, NCL, PAG1, SLC40A1 | Jak-STAT signaling pathway (KEGG) | 2.10 |
|
| |||
| 139 | BRCA1, CIITA, GTF2H2, KDM5D, PDLIM4, POLR1A, POLR2B, TRIP4, TRPS1, ZFP111, ZFP292 | Zinc ion binding (GO_BP) | 5.30 |
|
| |||
| 135 | RIPK2, TAX1BP1, TIFA, TNFAIP3, TNIP2 | Apoptosis (GO_BP) | 6.70 |
|
| |||
| 112 | ANXA2, BMYC, GAB2, PRKCE, PRMT5, S100A10 | Fc epsilon RI signaling pathway (KEGG) | 1.40 |
|
| |||
| 140 | CCNG1, CCNG2, E2F1, GSTA4, NR4A3, SWAP70, TRIM32, TRP53 | Cell cycle (GO_BP) | 1.60 |
|
| |||
| 124 | HSPA5, HSPA9, HSPD1, PDXK, RAPGEF4, STIP1, YWHAE | Adenyl ribonucleotide binding (GO_MF) | 6.60 |
|
| |||
| 79 | ALDOA, CD40, IL1R1, IL1RAP, IL1RL1, IRAK3, IRAK4, IRF4, IRF5, LRRFIP1, MYD88, TIRAP, TLR4, TNFRSF13B, TUBA1A | Cytokine-mediated signaling pathway (GO_BP) | 2.50 |
|
| |||
| 126 | BCAR1, BLK, CD22, CD247, CLEC7A, ERBB2, IL15RA, ITGB3, RANBP2, SYK, TUBA4A, WIPF1 | Cell surface receptor linked signal transduction (GO_BP) | 4.10 |
|
| |||
| 21 | CCL3, CCL4, CCR1, CCR3, CCR5 | Chemokine signaling pathway (GO_BP) | 9.80 |
|
| |||
| 63 | APOE, CASK, CNN3, CTTN, HSP90B1, IPO11, KCNMA1, LDHA, MYO5A, NDEL1, NUDC, PRDX2, RAB6B, ROCK2, SH3BP4, TPM1, TPT1, TRF, TUBB5 | Cellular homeostasis (GO_BP) | 9.20 |
|
| |||
| 80 | MYO1C, PCDH15, RICTOR, RRN3, VPS35 | Cytoskeleton organization (GO_BP) | 9.30 |
|
| |||
| 92 | GFR, PDGFRA, PDGFRB, PTEN, SLC9A3R2 | Transmembrane receptor protein tyrosine kinase signaling pathway (GO_BP) | 2.80 |
|
| |||
| 142 | CSNK1E, NXN, PLCG2, RAD51, VANGL1, VANGL2 | Wnt signaling pathway (KEGG) | 3.90 |
|
| |||
| 48 | GRB10, IGF1R, MAP3K5 | Insulin-like growth factor receptor signaling pathway (KEGG) | 1.30 |
|
| |||
| 50 | HCST, IL2RB, KLRK1, TYROBP | Integral to membrane (GO_BP) | 9.50 |
|
| |||
| 52 | ANP32A, DACH1, FOSB, HDAC2, HDAC9, L3MBTL2, MTA1, MTA3, RBBP7, REST, SP3, TCF7L2, WDR5, ZDHHC13, ZFPM1 | Regulation of transcription (GO_BP) | 4.00 |
|
| |||
| 116 | F2RL2, MAP3K2, SMAD1, SMAD5, TOB1, ZEB2 | Cell surface receptor linked signal transduction (GO_BP) | 4.60 |
|
| |||
| 59 | CASP1, CASP3, CASP8, CEBPB, GZMB, IL1B | Regulation of apoptosis (GO_BP) | 1.30 |
|
| |||
| 46 | E2F2, GATA1, GATA3, GFI1B, LMO2, NFYA | Transcription (GO_BP) | 1.30 |
|
| |||
| 94 | AXL, CD19, EPHA2, IL4RA, INSR, IRS2, KRAS, NEDD9, NME2, PDCD4, PIK3AP1, PIK3CA, PIK3CB, PIK3CD, PIK3R1, PLCG1, PTK2B, RALGDS, RASSF5, SIRPA, SOCS6, TEK, TLR2 | Cell surface receptor linked signal transduction (GO_BP) | 2.10 |